← Back to country index
Synthestat · Portugal · population QA

PT 1:1 population synthesis QA cycle

Country-specific layer for synthetic people in households, dwellings, real building stock where available, hidden-population overlays, and work/school assignment evidence.

Board: synthestat-population-qa · Tenant: synthestat · Country: PT · Overall status: ready

Ideal-country quality criteria: impossible 1:1 benchmark

This is the common gold-standard benchmark for an ideal country. It is intentionally impossible to fully satisfy: complete success would mean a 1:1 replica of the real population where every person, household, dwelling, attribute, and assignment is exactly represented. The QA page uses it as an asymptote and gap taxonomy, not as a release promise.

Apply this same rubric to this country’s latest run, then report which needs are measured, constrained, modelled, unavailable, or blocked.

NeedUnachievable idealQA evidence we require insteadWhy perfection cannot be achieved
Complete de jure resident coverageEvery real resident represented exactly once in the right country, municipality, small area, household, and dwelling.Synthetic person count equals official population at all enforced geographies; no unexplained duplicate, missing, or out-of-universe people.A true 1:1 resident list is a confidential population register and changes continuously; Synthestat can only match official aggregates and declared source universes.
Complete attribute truthEach synthetic person has the same age, sex, household role, education, occupation, industry, origin, health proxy, income proxy, and lifecycle state as the corresponding real person.Published marginal and cross-tab constraints pass within HARD/FIRM/SOFT tolerances; modelled fields carry uncertainty and measured/constrained/modelled provenance.Official releases do not expose a complete individual joint distribution, and many attributes are survey-derived, lagged, suppressed, or unavailable at fine geography.
Perfect household and family structureEvery household contains the exact real members and relationships, including multi-generation, partnership, child, shared, institutional, and edge-case arrangements.Household totals, household-type distributions, age/sex/role consistency, fertility/child constraints, and structural invariants pass with explicit residuals.Household membership is sensitive microdata; public sources usually expose only aggregate household/family tables and partial cross-tabs.
Exact dwelling and building groundingEvery household is assigned to its real dwelling and building with exact occupancy, vacancy, dwelling type, floor area, tenure, and address-level geography.Dwelling/building capacity checks pass; vacancy/second-home/institutional dwellings are represented or explicitly unavailable; building links have source provenance.Many countries lack open address-level registers; dwelling occupancy is confidential and time-varying.
Complete de facto and hidden-population overlaysHomeless, undocumented, refugees, students away from home, seasonal, institutional, tourists, and daytime populations are all represented with exact location and timing.Overlay layers use interval estimates, source-specific quality flags, and never silently modify de jure HARD constraints.Hidden populations are partly unobserved by definition; ethical/privacy constraints forbid exact person-level labels.
Exact school, workplace, facility, and mobility assignmentEvery person is assigned to the real school, workplace, care provider, commute, and daily activity chain they use.Assignment layers use official registers/OD flows where available; modelled assignments are flagged and validated only against aggregate flows/capacities.Operational assignments are usually protected registers or dynamic behavioural data; Phase 1 must not imply they are known.
Full joint-distribution realismThe full multivariate joint distribution is identical to reality across all attributes, households, geography, and rare subgroups.High-priority marginals/cross-tabs pass; sparse zones and prior-dominated attributes are clearly marked with quality tiers and credible intervals.The joint distribution is non-identifiable from published marginals; IPF/BN/hierarchical pooling choose plausible distributions, not truth.
Zero uncertainty and zero lagAll values are current today and known without error.Every output records reference period, retrieval timestamp, lag, confidence, uncertainty bounds, and degradation decisions.Official statistics are lagged, revised, sampled, suppressed, and harmonized after collection.
Privacy-safe yet maximally detailed releaseThe system releases maximum useful detail while creating zero re-identification risk.Release mode, k-anonymity/cell safeguards, perturbation/aggregation policy, and sensitive-field treatment are explicit.Fine-area synthetic microdata can still create structurally unique records; synthetic does not mean anonymous.
Perfect reproducibility and auditabilityAny user can trace every output record to exact source snapshots, transformations, constraints, relaxations, seeds, and code versions.Run manifests, source provenance, checksums, frozen extracts, seeds, versioned crosswalks, validation reports, and relaxation logs are complete.This is approachable but never final: source portals, classifications, geography, and code keep changing, so audits must be continuously renewed.

Population output status

PeopleTarget populationNational coverageAbsolute shortfallHouseholdsDwellingsHouses/buildingsMax marginal deviationHARD statusRun
pt_scope_declared_non_national_20260519_1834Z

Deviation is the maximum absolute relative error across collected HARD/FIRM/SOFT marginal constraints in the latest review bundle. GUIDE/INFORMATIONAL priors are excluded. National target/coverage are read from build_manifest.json when available and override any visual impression of completion.

ready
1
done
9

Datasets and distributions

Lists come from the latest run bundle: source_provenance.json, distribution_diagnostics.json, and build_manifest.json.

Summary

Datasets used0
Distributions available0
Constraints/distributions used in synthesis0
Constraint types
Dataset variants
Finest-geography status

Source gaps

  • No source gaps listed.

Datasets used

Dataset/source ID
None listed yet.

Best source by distribution family

Distribution familyDataset/source ID
None listed yet.

Available distributions / priors in registry

SpecLabelTypeGeoStatusVariantConfidenceData URI
None listed yet.

Constraints/distributions used in synthesis manifest

Constraint or distribution ID
None listed yet.

Current country tasks

IDTitleAssigneeStatusCreatedLatest summary
t_f11aa4baPT downloader/research follow-up: freeze national source payloads for future national bundlesynth-researcherready2026-05-19 20:36:43 CEST
t_357b5258PT model fix: build valid national-scale bundle or declare narrowed scopesynth-modelerdone2026-05-19 20:33:07 CESTResolved PT model-fix task by explicitly declaring narrowed non-national scope instead of relabelling the 8-person fixture as national. Added guardrail declarations under /home/synthestat/output/PT and a scope package under /home/synthestat/output/runs/PT/pt_scope_declared_non_national_20260519_1834
t_43ced4d4PT review: national-scale population artifact gap closuresynth-reviewerdone2026-05-19 20:15:42 CESTReviewed PT national-scale gap-closure output and returned BLOCKED_INVALID_OUTPUT: the inspected artefact remains an 8-person/8-household seeded two-zone fixture, not a national Portugal population or national core. Wrote the full QA report to /home/synthestat/workspace/manager_handoffs/reviewer/202
t_ecda7621PT national-scale population artifact gap: replace seeded PASS fixture or block with exact source/model requirementssynth-modelerdone2026-05-19 20:15:35 CESTCompleted PT national-scale readiness/gap handoff: blocked relabelling the 8-person/8-household seeded PT PASS fixture as a national artifact, documented official Eurostat/INE target gaps, and updated modeller latest plus manager updates.
t_f22e64adPT final human review/delivery note after PASSsynth-reviewerdone2026-05-19 20:00:28 CESTPrepared PT final human-facing delivery/readiness notes after the reviewer PASS. The notes explicitly limit the PASS to the internal seeded two-zone slice, cite artifact paths and metrics, and preserve unavailable/modelled layer, uncertainty/provenance, privacy, and broader-claim caveats.
t_6b5d1390PT cycle-1 remediation re-review: pt_population_review_cycle1_d828fc79_seed420987synth-reviewerdone2026-05-19 18:53:50 CESTPASS: re-reviewed PT remediation bundle pt_population_review_cycle1_d828fc79_seed420987 and wrote reviewer report at /home/synthestat/workspace/manager_handoffs/reviewer/2026-05-19_165631Z_PT_population_review_cycle1_rereview.md. Verified the four requested fixes and found no remaining critical bloc
t_bc46c93fPT cycle-1 review remediation: dwelling references, household age semantics, provenance metadatasynth-modelerdone2026-05-19 18:45:06 CESTBuilt and verified PT cycle-1 remediation bundle pt_population_review_cycle1_d828fc79_seed420987 at /home/synthestat/output/runs/PT/pt_population_review_cycle1_d828fc79_seed420987. The bundle addresses the reviewer’s blocking fixes: occupied dwelling household references are populated, HH_SINGLE_Y/M
t_bfaf5795PT population synthesis QA cycle 1 bundle reviewsynth-reviewerdone2026-05-19 18:35:25 CESTNEEDS_MODEL_FIX for PT run pt_population_review_cycle1_dd672afa_seed420987. Bundle is present/readable and HARD/FIRM summaries pass, but QA found internal artefact/diagnostic inconsistency in dwelling household references, unsupported HH_SINGLE_E/M/Y age semantics, and incomplete provenance metadata
t_ebf9abe4PT population synthesis QA cycle 1 review bundlesynth-modelerdone2026-05-19 18:35:12 CESTBuilt and verified the PT cycle-1 population review bundle at /home/synthestat/output/runs/PT/pt_population_review_cycle1_dd672afa_seed420987. Bundle is contract-complete for internal research review, with 8 persons, 8 households, 8 dwellings, 8 dwelling-building assignment rows, HARD constraints pa
t_03e70139orchestrate PT population synthesis QA loopsynth-managerdone2026-05-19 18:34:08 CESTRouted PT population synthesis QA cycle 1: created synth-modeler bundle-build task t_ebf9abe4 and dependent synth-reviewer bundle-review task t_bfaf5795. Updated /home/synthestat/workspace/manager_handoffs/manager_updates.md with the bottleneck summary, routing decision, blockers, and next branch po

Process

Manager kickoff

synth-manager creates and controls the country loop.

Model build

synth-modeler generates the review bundle: people, households, dwellings/buildings or unavailable markers, overlays, assignments, manifests, residuals, diagnostics, uncertainty, provenance.

Reviewer gate

synth-reviewer audits constraints, marginals, household/family realism, hidden populations, dwelling/building grounding, work/school assignment, uncertainty, provenance, and privacy.

Branch

PASS finalizes; NEEDS_MODEL_FIX routes back to modeler; NEEDS_MORE_SOURCES routes to marginal/distribution researchers then downloader; exhausted evidence/model plateau stops for human decision.

Quality gates and stop conditions