← Back to country index
Synthestat · Lithuania · population QA

LT 1:1 population synthesis QA cycle

Country-specific layer for synthetic people in households, dwellings, real building stock where available, hidden-population overlays, and work/school assignment evidence.

Board: synthestat-population-qa · Tenant: synthestat · Country: LT · Overall status: running

Ideal-country quality criteria: impossible 1:1 benchmark

This is the common gold-standard benchmark for an ideal country. It is intentionally impossible to fully satisfy: complete success would mean a 1:1 replica of the real population where every person, household, dwelling, attribute, and assignment is exactly represented. The QA page uses it as an asymptote and gap taxonomy, not as a release promise.

Apply this same rubric to this country’s latest run, then report which needs are measured, constrained, modelled, unavailable, or blocked.

NeedUnachievable idealQA evidence we require insteadWhy perfection cannot be achieved
Complete de jure resident coverageEvery real resident represented exactly once in the right country, municipality, small area, household, and dwelling.Synthetic person count equals official population at all enforced geographies; no unexplained duplicate, missing, or out-of-universe people.A true 1:1 resident list is a confidential population register and changes continuously; Synthestat can only match official aggregates and declared source universes.
Complete attribute truthEach synthetic person has the same age, sex, household role, education, occupation, industry, origin, health proxy, income proxy, and lifecycle state as the corresponding real person.Published marginal and cross-tab constraints pass within HARD/FIRM/SOFT tolerances; modelled fields carry uncertainty and measured/constrained/modelled provenance.Official releases do not expose a complete individual joint distribution, and many attributes are survey-derived, lagged, suppressed, or unavailable at fine geography.
Perfect household and family structureEvery household contains the exact real members and relationships, including multi-generation, partnership, child, shared, institutional, and edge-case arrangements.Household totals, household-type distributions, age/sex/role consistency, fertility/child constraints, and structural invariants pass with explicit residuals.Household membership is sensitive microdata; public sources usually expose only aggregate household/family tables and partial cross-tabs.
Exact dwelling and building groundingEvery household is assigned to its real dwelling and building with exact occupancy, vacancy, dwelling type, floor area, tenure, and address-level geography.Dwelling/building capacity checks pass; vacancy/second-home/institutional dwellings are represented or explicitly unavailable; building links have source provenance.Many countries lack open address-level registers; dwelling occupancy is confidential and time-varying.
Complete de facto and hidden-population overlaysHomeless, undocumented, refugees, students away from home, seasonal, institutional, tourists, and daytime populations are all represented with exact location and timing.Overlay layers use interval estimates, source-specific quality flags, and never silently modify de jure HARD constraints.Hidden populations are partly unobserved by definition; ethical/privacy constraints forbid exact person-level labels.
Exact school, workplace, facility, and mobility assignmentEvery person is assigned to the real school, workplace, care provider, commute, and daily activity chain they use.Assignment layers use official registers/OD flows where available; modelled assignments are flagged and validated only against aggregate flows/capacities.Operational assignments are usually protected registers or dynamic behavioural data; Phase 1 must not imply they are known.
Full joint-distribution realismThe full multivariate joint distribution is identical to reality across all attributes, households, geography, and rare subgroups.High-priority marginals/cross-tabs pass; sparse zones and prior-dominated attributes are clearly marked with quality tiers and credible intervals.The joint distribution is non-identifiable from published marginals; IPF/BN/hierarchical pooling choose plausible distributions, not truth.
Zero uncertainty and zero lagAll values are current today and known without error.Every output records reference period, retrieval timestamp, lag, confidence, uncertainty bounds, and degradation decisions.Official statistics are lagged, revised, sampled, suppressed, and harmonized after collection.
Privacy-safe yet maximally detailed releaseThe system releases maximum useful detail while creating zero re-identification risk.Release mode, k-anonymity/cell safeguards, perturbation/aggregation policy, and sensitive-field treatment are explicit.Fine-area synthetic microdata can still create structurally unique records; synthetic does not mean anonymous.
Perfect reproducibility and auditabilityAny user can trace every output record to exact source snapshots, transformations, constraints, relaxations, seeds, and code versions.Run manifests, source provenance, checksums, frozen extracts, seeds, versioned crosswalks, validation reports, and relaxation logs are complete.This is approachable but never final: source portals, classifications, geography, and code keep changing, so audits must be continuously renewed.

Population output status

PeopleTarget populationNational coverageAbsolute shortfallHouseholdsDwellingsHouses/buildingsMax marginal deviationHARD statusRun
2,810,7611,215,3601,215,360pass_exactlt_population_review_cycle4_888b7387_seed420987

Deviation is the maximum absolute relative error across collected HARD/FIRM/SOFT marginal constraints in the latest review bundle. GUIDE/INFORMATIONAL priors are excluded. National target/coverage are read from build_manifest.json when available and override any visual impression of completion.

running
1
todo
2
done
14

Datasets and distributions

Lists come from the latest run bundle: source_provenance.json, distribution_diagnostics.json, and build_manifest.json.

Summary

Datasets used0
Distributions available38
Constraints/distributions used in synthesis3
Constraint types
Dataset variants
Finest-geography status

Source gaps

  • No source gaps listed.

Datasets used

Dataset/source ID
None listed yet.

Best source by distribution family

Distribution familyDataset/source ID
None listed yet.

Available distributions / priors in registry

SpecLabelTypeGeoStatusVariantConfidenceData URI
C01_education_occupation_couplingEducation-occupation coupling strengthGUIDEnationalmodelledcomparable_country0.615data/literature/seeded_occupation_priors.yaml
C02_assortative_mating_educationAssortative mating by educationGUIDEmunicipalitymodelledcomparable_country0.625data/literature/seeded_occupation_priors.yaml
C03_assortative_mating_ageAssortative mating by ageGUIDEmunicipalitymodelledcomparable_country0.695data/literature/seeded_occupation_priors.yaml
C04_assortative_mating_originAssortative mating by originGUIDEmunicipalitymodelledcomparable_country0.635data/literature/seeded_occupation_priors.yaml
C05_spatial_sorting_educationSpatial sorting by educationGUIDEnationalmodelledcomparable_country0.715data/literature/seeded_occupation_priors.yaml
C06_spatial_sorting_incomeSpatial sorting by incomeGUIDEnationalmodelledcomparable_country0.715data/literature/seeded_occupation_priors.yaml
C07_spatial_sorting_originSpatial sorting by originGUIDEnationalmodelledcomparable_country0.735data/literature/seeded_occupation_priors.yaml
C08_intergenerational_income_elasticityIntergenerational income elasticityGUIDEnationalmodelledcomparable_country0.595data/literature/seeded_occupation_priors.yaml
C09_intergenerational_occupation_transmissionIntergenerational occupation transmissionGUIDEnationalmodelledcomparable_country0.595data/literature/seeded_occupation_priors.yaml
C10_commuting_mode_distanceCommuting mode × distance × occupation × regionGUIDEmunicipalitymodelledcomparable_country0.655data/literature/seeded_occupation_priors.yaml
C11_health_age_sex_educationHealth × age × sex × educationGUIDEnationalmodelledcomparable_country0.635data/literature/seeded_occupation_priors.yaml
D01_age_sex_nuts3Age × sex at NUTS-3HARDNUTS-3constrainedrobust0.74docs/wiki/compiled/D01_age_sex_nuts3.md
D01_census_age_sex_nuts3Census age × sex at NUTS-3HARDNUTS-3constrainedrobust0.74docs/wiki/compiled/D01_census_age_sex_nuts3.md
D02_marital_nuts3Marital status × age × sex at NUTS-3FIRMNUTS-3constrainedrobust0.73docs/wiki/compiled/D02_marital_nuts3.md
D03_origin_age_sexOrigin group × age × sexFIRMNUTS-3constrainedrobust0.73docs/wiki/compiled/D03_origin_age_sex.md
D04_religion_age_sex_regionReligion × age × sex × regionGUIDENUTS-3constrainedrobust0.71docs/wiki/compiled/D04_religion_age_sex_region.md
D05_census_education_nuts3Census education at NUTS-3FIRMNUTS-3constrainedrobust0.73docs/wiki/compiled/D05_census_education_nuts3.md
D05_education_nuts2Education at NUTS-2FIRMNUTS-2constrainedcurrent0.7docs/wiki/compiled/D05_education_nuts2.md
D06_employment_age_sex_educationEmployment status × age × sex × educationFIRMunknownconstrainedrobust0.73docs/wiki/compiled/D06_employment_age_sex_education.md
D07_occupation_isco3Occupation ISCO-3 distributionSOFTunknownconstrainedrobust0.71docs/wiki/compiled/D07_occupation_isco3.md
D08_occupation_educationOccupation × educationSOFTunknownconstrainedrobust0.71docs/wiki/compiled/D08_occupation_education.md
D09_industry_nace2Industry NACE-2 distributionSOFTunknownconstrainedrobust0.71docs/wiki/compiled/D09_industry_nace2.md
D10_income_education_occupationIncome × education × occupationSOFTunknownconstrainedrobust0.71docs/wiki/compiled/D10_income_education_occupation.md
D11_income_household_type_regionIncome × household type × regionSOFTNUTS-3constrainedrobust0.71docs/wiki/compiled/D11_income_household_type_region.md
D12_household_type_size_regionHousehold type × size × regionFIRMNUTS-3constrainedrobust0.73docs/wiki/compiled/D12_household_type_size_region.md
D13_children_mother_age_educationChildren × mother age × educationSOFTNUTS-3constrainedrobust0.71docs/wiki/compiled/D13_children_mother_age_education.md
D14_partner_age_gap_homogamyPartner age gap × homogamySOFTNUTS-3constrainedrobust0.71docs/wiki/compiled/D14_partner_age_gap_homogamy.md
D15_coresidence_structureCo-residence structureSOFTNUTS-3constrainedrobust0.71docs/wiki/compiled/D15_coresidence_structure.md
D16_household_income_type_regionHousehold income × type × regionSOFTNUTS-3constrainedrobust0.71docs/wiki/compiled/D16_household_income_type_region.md
D17_education_mobilityEducation mobilityGUIDEunknownconstrainedrobust0.71docs/wiki/compiled/D17_education_mobility.md
D18_occupation_given_educationOccupation | educationSOFTunknownconstrainedrobust0.71docs/wiki/compiled/D18_occupation_given_education.md
D19_employment_given_demographicsEmployment | demographicsSOFTunknownconstrainedrobust0.71docs/wiki/compiled/D19_employment_given_demographics.md
D20_birth_intervalsBirth intervalsGUIDEunknownconstrainedrobust0.71docs/wiki/compiled/D20_birth_intervals.md
D21_age_first_birthAge at first birth × education × cohortGUIDEunknownconstrainedrobust0.71docs/wiki/compiled/D21_age_first_birth.md
D22_age_leaving_homeAge leaving homeGUIDEunknownconstrainedrobust0.71docs/wiki/compiled/D22_age_leaving_home.md
D23_divorce_duration_children_educationDivorce × duration × children × educationGUIDENUTS-3constrainedrobust0.71docs/wiki/compiled/D23_divorce_duration_children_education.md
D24_age_marriage_sex_educationAge at marriage × sex × educationGUIDENUTS-3constrainedrobust0.71docs/wiki/compiled/D24_age_marriage_sex_education.md
D25_internal_migrationInternal migrationFIRMunknownconstrainedrobust0.73docs/wiki/compiled/D25_internal_migration.md

Constraints/distributions used in synthesis manifest

Constraint or distribution ID
Eurostat:cens_21resh_r2
Eurostat:cens_21hhcs_r3
Eurostat:cens_21empo_r2

Current country tasks

IDTitleAssigneeStatusCreatedLatest summary
t_d3b17db4LT population QA downloader: freeze exact official payloads for national-scale rerunsynth-downloaderrunning2026-05-19 20:17:42 CEST
t_bc67be5cLT population QA reviewer: inspect national-scale non-fixture rerunsynth-reviewertodo2026-05-19 20:17:44 CEST
t_ee0b60c7LT population QA modeler: national-scale non-fixture synthesis rerunsynth-modelertodo2026-05-19 20:17:43 CEST
t_78a5c4a3LT cycle 4 model fix: enforce household-family age/role/type realismsynth-modelerdone2026-05-19 20:26:45 CESTBuilt LT cycle-4 population review bundle at /home/synthestat/output/runs/LT/lt_population_review_cycle4_888b7387_seed420987. It preserves 2,810,761 persons / 1,215,360 households / 1,215,360 dwellings, 76/76 HARD residual rows pass_exact, complete explicit unavailable markers, and now passes househ
t_292b6371LT cycle 3 reviewer inspection: national/NUTS2 constrained model fix bundlesynth-reviewerdone2026-05-19 20:22:16 CESTReviewed LT cycle-3 bundle and returned NEEDS_MODEL_FIX. Cycle-2 artefact blockers are materially addressed (contract complete, no 8-person fixture, 76/76 HARD residual rows pass_exact, variation present, unavailable layers explicit), but household-family realism fails at national scale with 487,000
t_1bfbaf35LT population QA distribution closure: joint priors for non-fixture synthesissynth-distributions-researcherdone2026-05-19 20:17:40 CESTCompleted LT distribution evidence closure: verdict DISTRIBUTION_READY_FOR_MODEL_FIX for national/NUTS2-constrained non-fixture synthesis, with unsupported fine joints kept modelled/unavailable. Wrote LT findings, extraction specs, refreshed distributions/latest.md, and appended a manager update.
t_405d5f40LT population QA source closure: exact national marginals for non-fixture synthesissynth-marginals-researcherdone2026-05-19 20:17:39 CESTCompleted LT national marginal source closure and wrote the required handoffs under /home/synthestat/workspace/manager_handoffs/marginals. Status is SOURCE_READY_FOR_MODEL_FIX: official controls close 2,810,761 de jure residents, 1,215,360 private households, dwelling targets, non-private/institutio
t_4408d0caLT cycle 3 model fix: integrate frozen official LT sources beyond seeded fixturesynth-modelerdone2026-05-19 20:09:27 CESTBuilt LT cycle-3 model-fix review bundle at /home/synthestat/output/runs/LT/lt_population_review_cycle3_888b7387_seed420987, replacing the prior 8-person fixture with a national/NUTS2-constrained internal candidate: 2,810,761 persons, 1,215,360 households, and 1,215,360 synthetic dwellings. Frozen E
t_bfeae4a3LT cycle 2 review after source-upgraded model rerunsynth-reviewerdone2026-05-19 19:59:54 CESTReviewed LT cycle-2 bundle and wrote the QA report at /home/synthestat/workspace/manager_handoffs/reviewer/2026-05-19_181900Z_LT_population_review_cycle2.md. Verdict NEEDS_MODEL_FIX: the bundle is contract-complete and honest about unavailable layers, but remains an 8-person seeded fixture, not a cr
t_cd2a944eLT cycle 2 model rerun after source/download follow-upssynth-modelerdone2026-05-19 19:59:39 CESTBuilt and verified the LT cycle-2 population review bundle at /home/synthestat/output/runs/LT/lt_population_review_cycle2_837bfa75_seed420987 after rerunning the LT seeded source/building/country/hierarchical/population/validation path. The bundle is contract-complete with explicit unavailable hidde
t_4cf4377eLT downloader schema profile: RC/VDA/NTR official source candidates from t_faad3b28synth-downloaderdone2026-05-19 18:56:45 CESTCompleted LT official-source downloader schema profiling for the approved RC/VDA/NTR/VMI/Migration/RPPC candidates. Wrote an auditable manifest with small API samples, cheap row counts, bounded VDA/RC cache downloads, checksums, and explicit Sodra/hidden-population degradation flags; no modelling or
t_fbb4483cLT downloader: freeze Eurostat distribution evidence pulls and OSP native table discoverysynth-downloaderdone2026-05-19 18:52:29 CESTFroze the approved Lithuania Eurostat distribution evidence queue: all 26 datasets returned HTTP 200 and are stored with raw JSON-stat payloads, headers, URLs, checksums, retrieval timestamps, dimensions/geographies/periods, and flag/status metadata under /home/synthestat/data/ingestion/lt/official_
t_fd1d7d8fLT cycle-1 distribution gaps: household microstructure and joint priors beyond comparable-country fallbackssynth-distributions-researcherdone2026-05-19 18:43:41 CESTCompleted LT distribution-gap research and wrote findings/extraction/latest handoffs under /home/synthestat/workspace/manager_handoffs/distributions/. Found actionable Lithuania-specific Eurostat Census/LFS/SILC/SES/fertility sources for household microstructure, occupation-origin-workplace, educati
t_faad3b28LT cycle-1 source gaps: live national marginals, hidden overlays, building/work-school evidencesynth-marginals-researcherdone2026-05-19 18:43:33 CESTCompleted LT marginal-source research freeze for cycle-1 gaps. Wrote official source handoffs covering RC population-register marginals, VDA census/geospatial/housing packages, RC/Datalab NTR building evidence, VMI/Sodra/LFS/EU-SILC income-employment sources, and Migration/RPPC overlays; created dow
t_c2ade41eLT population synthesis QA cycle 1 — review bundle and verdictsynth-reviewerdone2026-05-19 18:35:23 CESTReviewed LT cycle-1 population bundle and wrote reviewer memo at /home/synthestat/workspace/manager_handoffs/reviewer/2026-05-19_164321Z_LT_population_review_cycle1.md. Verdict NEEDS_MORE_SOURCES: bundle is contract-complete with exact HARD/FIRM fit, but source/distribution gaps block PASS beyond th
t_bef90222LT population synthesis QA cycle 1 — build review bundlesynth-modelerdone2026-05-19 18:35:09 CESTBuilt and verified the LT cycle-1 population review bundle at /home/synthestat/output/runs/LT/lt_population_review_cycle1_813e5acd_seed420987. The bundle is contract-complete for internal review, HARD residuals pass exactly, and hidden-population/work-school layers are explicit unavailable markers r
t_2bb29733orchestrate LT population synthesis QA loopsynth-managerdone2026-05-19 18:34:10 CESTRouted LT population synthesis QA cycle 1: created synth-modeler bundle-build task t_bef90222 and dependent synth-reviewer bundle-review task t_c2ade41e. Updated /home/synthestat/workspace/manager_handoffs/manager_updates.md with bottleneck, routing decisions, blockers, and next branch instructions.

Process

Manager kickoff

synth-manager creates and controls the country loop.

Model build

synth-modeler generates the review bundle: people, households, dwellings/buildings or unavailable markers, overlays, assignments, manifests, residuals, diagnostics, uncertainty, provenance.

Reviewer gate

synth-reviewer audits constraints, marginals, household/family realism, hidden populations, dwelling/building grounding, work/school assignment, uncertainty, provenance, and privacy.

Branch

PASS finalizes; NEEDS_MODEL_FIX routes back to modeler; NEEDS_MORE_SOURCES routes to marginal/distribution researchers then downloader; exhausted evidence/model plateau stops for human decision.

Quality gates and stop conditions