model_notes.md
# SE population review bundle cycle 2
Run: `se_population_review_cycle2_3a9d999a_seed420987`
This bundle replaces the prior 8-person, two-zone fixture with an explicitly labelled stratified multi-DeSO candidate for Sweden. It is not a full-national run. It covers 42 selected DeSO zones (the median and the largest positive-population DeSO in each county) and emits 91030 synthetic persons, 43739 households, and 43739 dwelling shell records.
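The selection rule (median and largest positive-population DeSO per county) can be sketched as below. This is a minimal illustration, not the bundle's own code: the function name `select_zones`, the toy zone codes and populations, the use of the first two characters of a DeSO code as the county key, and the lower-median tie-break are all assumptions.

```python
from collections import defaultdict

def select_zones(zone_population):
    """Pick the median and largest positive-population zone per county.

    zone_population maps a DeSO code to its total population.
    Assumption: the first two characters of the code identify the county.
    """
    by_county = defaultdict(list)
    for code, pop in zone_population.items():
        if pop > 0:  # zones with zero population are not eligible
            by_county[code[:2]].append((pop, code))

    selected = []
    for county in sorted(by_county):
        ranked = sorted(by_county[county])       # ascending by population, then code
        median = ranked[(len(ranked) - 1) // 2]  # lower median (assumed tie-break)
        largest = ranked[-1]
        for _, code in (median, largest):
            if code not in selected:             # median and largest can coincide
                selected.append(code)
    return selected

# Hypothetical codes and populations, not SCB figures.
toy = {
    "0180C0001": 900, "0180C0002": 1500, "0180C0003": 2400,
    "0380B0001": 0, "0380B0002": 1100, "0380B0003": 1300,
}
print(select_zones(toy))
```

Under this rule a county contributes at most two zones, which is consistent with the 42 zones listed in `build_manifest.json` if each of the 21 counties contributes a distinct median and largest zone.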
## Constraint and uncertainty position
- HARD: 2025 SCB DeSO age-sex counts from frozen `population_age_sex_deso` are exactly replicated for every selected zone and sex.
- FIRM/SOFT/GUIDE: household size/composition, origin, education, labour, income, occupation/industry and dwelling shells are source-informed modelled fields with per-row provenance/fallback metadata.
- Occupation and industry at fine geography are not measured here; they are broad fallback/modelled values only.
- Hidden-population overlays, building assignments, and work/school assignments are emitted as unavailable JSON files rather than silently inferred.
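A minimal sketch of what the HARD age-sex constraint implies: one synthetic person is emitted per counted resident, so re-tabulating the synthetic rows reproduces every control cell. The function name `expand_persons`, the row layout, the age-band labels, and the toy counts are assumptions for illustration; the frozen `population_age_sex_deso` table may use different categories.

```python
from collections import Counter

def expand_persons(age_sex_counts):
    """Emit one synthetic person per counted resident.

    age_sex_counts maps (zone, age, sex) to a non-negative integer count.
    """
    persons = []
    for (zone, age, sex), n in sorted(age_sex_counts.items()):
        persons.extend({"zone": zone, "age": age, "sex": sex} for _ in range(n))
    return persons

# Hypothetical control cells, not SCB figures.
controls = {
    ("0180C0001", "0-4", "men"): 3,
    ("0180C0001", "0-4", "women"): 2,
    ("0180C0001", "5-9", "men"): 0,
}
persons = expand_persons(controls)

# Re-tabulate the synthetic rows and confirm every cell matches its control.
tabulated = Counter((p["zone"], p["age"], p["sex"]) for p in persons)
assert all(tabulated[cell] == n for cell, n in controls.items())
print(len(persons))
```

Because the expansion is deterministic integer replication, the check holds without any tolerance; the modelled FIRM/SOFT/GUIDE fields would be attached to these rows afterwards.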
## Official target comparison
See `constraint_residuals.json`: the official target population for the selected zones is 91030, the synthetic person count is 91030, and the residual is 0.
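The comparison above can be sketched as a small residual check. The schema of `constraint_residuals.json` is not shown here, so the function name `population_residual`, the output keys, the sign convention (synthetic minus target), and the toy per-zone counts are assumptions.

```python
def population_residual(target_by_zone, synthetic_by_zone):
    """Compare synthetic person counts with official targets, zone by zone."""
    zones = sorted(set(target_by_zone) | set(synthetic_by_zone))
    per_zone = {
        z: synthetic_by_zone.get(z, 0) - target_by_zone.get(z, 0) for z in zones
    }
    return {
        "target_total": sum(target_by_zone.values()),
        "synthetic_total": sum(synthetic_by_zone.values()),
        "residual": sum(per_zone.values()),
        # A zero total can hide offsetting zone errors, so report the worst zone too.
        "max_abs_zone_residual": max((abs(r) for r in per_zone.values()), default=0),
    }

# Hypothetical counts for two of the selected zone codes.
target = {"0180C1890": 100, "0180C6390": 200}
exact = population_residual(target, {"0180C1890": 100, "0180C6390": 200})
offsetting = population_residual(target, {"0180C1890": 110, "0180C6390": 190})
print(exact["residual"], exact["max_abs_zone_residual"])
print(offsetting["residual"], offsetting["max_abs_zone_residual"])
```

The second call shows why a total residual of 0 is only meaningful alongside a per-zone check: the totals agree while individual zones are off by 10.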
## Review recommendation
Review this bundle as an `EVIDENCE_EXHAUSTED_HUMAN_REVIEW`, non-toy stratified multi-DeSO candidate, not as a production full-national Sweden run. The next material improvement requires official residential building/dwelling/address anchors and richer household/family roster evidence at fine geography.
build_manifest.json
{
"assignment_scope": {
"dwelling_building": "unavailable official building assignment; modelled dwelling shells only",
"facility": "unavailable",
"school": "unavailable",
"work": "unavailable"
},
"candidate_scope": "stratified_multi_DeSO_not_full_national",
"classification_crosswalk_versions": {
"education": "ISCED-2011 broad model mapping; not row-measured",
"industry": "NACE Rev.2/SNI 1-digit fallback only",
"occupation": "ISCO-08 1-digit fallback only"
},
"constraints_relaxed": [
{
"reason": "No exact public household roster/family graph by selected DeSO frozen; modelled from national prior",
"residual": null,
"source": "household composition at DeSO",
"tolerance": "not claimed as HARD"
},
{
"reason": "No official residential building anchor frozen",
"residual": null,
"source": "building assignment",
"tolerance": "unavailable"
}
],
"constraints_used": [
"D01_population_age_sex_deso_HARD_exact_for_selected_zones",
"national_household_size_prior_FIRM_MODELLED",
"DeSO/municipality/national fallback metadata per row"
],
"contract_files": [
"synthetic_persons.parquet",
"synthetic_households.parquet",
"synthetic_dwellings.parquet",
"synthetic_building_assignments.unavailable.json",
"hidden_population_overlays.unavailable.json",
"work_school_assignments.unavailable.json",
"build_manifest.json",
"constraint_residuals.json",
"distribution_diagnostics.json",
"household_diagnostics.json",
"dwelling_building_diagnostics.json",
"assignment_diagnostics.json",
"geography_quality_tiers.json",
"uncertainty_summary.json",
"source_provenance.json",
"model_notes.md"
],
"country": "SE",
"counts": {
"dwellings": 43739,
"households": 43739,
"persons": 91030
},
"created_at": "2026-05-19T18:58:01Z",
"geography_version": "SCB DeSO/RegSO 2025 from TAB6574 frozen source; selected stratified DeSO subset",
"git_commit": "a5ad12d74bcf64a2c256e1fe83d99cc700e02bba-dirty",
"hidden_population_scope": {
"evidence_status": "source closure found no hard/firm DeSO overlay evidence",
"excluded": [
"homelessness",
"irregular migrants",
"seasonal workers",
"student dormitories",
"institutional residents"
],
"included": [],
"status": "unavailable"
},
"known_limitations": [
"Not a full-national 1:1 Sweden run; explicit stratified multi-DeSO candidate with exact age-sex controls for selected zones.",
"Person attributes beyond age/sex/location are modelled and flagged; occupation/industry are fallback broad categories.",
"No hidden-population overlays or real building assignments emitted."
],
"project_root": "/home/synthestat",
"random_seed": 420987,
"release_mode": "review_candidate_cycle2_non_toy",
"run_id": "se_population_review_cycle2_3a9d999a_seed420987",
"selected_zone_count": 42,
"selected_zones": [
"0180C1890",
"0180C6390",
"0380B6011",
"0330C1020",
"0484C1240",
"0484C1500",
"0582A0030",
"0580B5010",
"0685C1040",
"0680C1072",
"0763A0030",
"0780C1230",
"0881C1030",
"0880C1190",
"0980A0180",
"0980C1060",
"1081A0010",
"1080A0030",
"1292B2020",
"1263B2020",
"1384C1310",
"1381B2010",
"1480C1770",
"1480C2970",
"1782C1020",
"1780C1400",
"1861A0030",
"1880C1610",
"1980C1080",
"1980A0020",
"2080C1130",
"2080C1030",
"2121A0010",
"2180C1390",
"2284C1190",
"2281C1310",
"2380C1240",
"2380C1040",
"2480C1520",
"2480C1160",
"2560A0020",
"2582C1080"
],
"source_catalogue_version": "SE_cycle2_p0_source_freeze_20260519T180750Z",
"zones_degraded": []
}
dwelling_building_diagnostics.json
{
"building_assignment_status": "unavailable",
"country": "SE",
"degraded_reason": "No official residential building/dwelling/address extract frozen; dwellings are shell records only.",
"dwelling_count": 43739,
"run_id": "se_population_review_cycle2_3a9d999a_seed420987"
}
distribution_diagnostics.json
{
"confidence_or_quality_notes": [
"P0 official SCB age-sex DeSO controls used as HARD for selected zones.",
"Household size distribution uses national SCB household-size table; household membership is modelled.",
"Occupation/industry at fine geography are fallback/prior-dominated and flagged per row."
],
"country": "SE",
"household_size_prior": {
"counts": {
"1": 2077552,
"2": 1489277,
"3": 558121,
"4": 560515,
"5": 195741,
"6": 56185
},
"reference_year": 2024,
"source_id": "SE_SCB_households_by_size"
},
"records_frozen": 20,
"run_id": "se_population_review_cycle2_3a9d999a_seed420987",
"selected_zones": [
"0180C1890",
"0180C6390",
"0380B6011",
"0330C1020",
"0484C1240",
"0484C1500",
"0582A0030",
"0580B5010",
"0685C1040",
"0680C1072",
"0763A0030",
"0780C1230",
"0881C1030",
"0880C1190",
"0980A0180",
"0980C1060",
"1081A0010",
"1080A0030",
"1292B2020",
"1263B2020",
"1384C1310",
"1381B2010",
"1480C1770",
"1480C2970",
"1782C1020",
"1780C1400",
"1861A0030",
"1880C1610",
"1980C1080",
"1980A0020",
"2080C1130",
"2080C1030",
"2121A0010",
"2180C1390",
"2284C1190",
"2281C1310",
"2380C1240",
"2380C1040",
"2480C1520",
"2480C1160",
"2560A0020",
"2582C1080"
],
"source_freeze_id": "SE_cycle2_p0_source_freeze_20260519T180750Z"
}
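As a rough illustration of how the `household_size_prior` counts above could be turned into a sampling prior, the sketch below normalises them into shares and compares the implied mean household size with the bundle's persons-per-household ratio. The helper names are hypothetical, and treating the key `6` as exactly six persons is an assumption; it may be an open-ended class in the source table.

```python
# Counts copied from household_size_prior (JSON string keys written as ints here).
counts = {1: 2077552, 2: 1489277, 3: 558121, 4: 560515, 5: 195741, 6: 56185}

def size_shares(counts):
    """Normalise household counts by size into shares that sum to 1."""
    total = sum(counts.values())
    return {size: n / total for size, n in counts.items()}

def mean_size(counts):
    """Mean persons per household, treating each key as an exact size."""
    return sum(size * n for size, n in counts.items()) / sum(counts.values())

shares = size_shares(counts)
prior_mean = mean_size(counts)
bundle_mean = 91030 / 43739  # persons / households from build_manifest.json

for size, share in shares.items():
    print(size, round(share, 4))
print(round(prior_mean, 3), round(bundle_mean, 3))
```

A large gap between the two means would suggest that household membership in the selected zones departs from the national prior; a small gap says nothing about composition, which remains modelled.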
uncertainty_summary.json
{
"country": "SE",
"methods": {
"age_sex": "exact integer replication from official frozen SCB DeSO table",
"dwellings_buildings": "dwelling shells only; real building anchor unavailable",
"households": "national household-size prior with structural plausibility only",
"other_person_attributes": "deterministic source-informed priors; wide categorical uncertainty not fit to individual truth"
},
"run_id": "se_population_review_cycle2_3a9d999a_seed420987",
"summary": [
"Age-sex counts for selected DeSO zones have zero-width residuals against frozen SCB 2025 controls.",
"Household composition, education, labour, origin, income, dwelling shells and all assignments are modelled with explicit fallback metadata.",
"Hidden population overlays are unavailable rather than silently folded into private households."
]
}
source_provenance.json
{
"country": "SE",
"per_row_provenance_columns": [
"provenance_source_ids",
"constraint_geography_level",
"fallback_level",
"uncertainty_method",
"uncertainty_low",
"uncertainty_high"
],
"records": [
{
"artifact_id": "population_age_sex_deso",
"bulk_url": "https://www.statistikdatabasen.scb.se/Resources/PX/bulk/ssd/en/TAB6574_en.zip",
"bytes": 65520296,
"content_length_header": "65520296",
"content_type": "cached",
"evidence_tier": "HARD",
"geography_level": "DeSO/RegSO 2025 + municipality/county/national rows",
"http_status": "cached",
"lang": "en",
"license_access_notes": "Public SCB/Statistics Sweden PxWeb/bulk API; attribution and SCB terms should be preserved.",
"metadata_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/metadata/population_age_sex_deso__metadata.json",
"metadata_url": "https://statistikdatabasen.scb.se/api/v2/tables/TAB6574?lang=en",
"parsing_warnings": [],
"processed_csv_bytes": 831916400,
"processed_csv_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/processed/population_age_sex_deso.csv",
"processed_csv_sha256": "9319d17b4aaee24a310738a14945c92f5f474ee525c0877446784c1b1be56a43",
"quality_flags": [],
"raw_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/raw/population_age_sex_deso__TAB6574_en.zip",
"reference_period": "2010-2025; latest 2025",
"retrieval_timestamp": "2026-05-19T18:24:04.181215+00:00",
"role": "Core population totals and age-sex marginals",
"row_count_observed": 16573248,
"scb_label": "Population per region by age and sex. Year 2010-2025",
"scb_updated": "2026-03-24T07:00:00Z",
"sha256": "04c29c3a4ba79d46be1f8fe6c40f9869cef3804869662b375a4651c46a59fb1a",
"source_id": "SE_SCB_population_age_sex_deso",
"source_status": "official",
"source_system": "SCB PxWeb public bulk API",
"status": "frozen",
"table_id": "TAB6574",
"variables": [
"region",
"age",
"sex",
"observations",
"year"
]
},
{
"artifact_id": "population_marital_status_deso",
"bulk_url": "https://www.statistikdatabasen.scb.se/Resources/PX/bulk/ssd/en/TAB6570_en.zip",
"bytes": 19150268,
"content_length_header": "19150268",
"content_type": "cached",
"evidence_tier": "HARD",
"geography_level": "DeSO/RegSO 2025 + municipality/county/national rows",
"http_status": "cached",
"lang": "en",
"license_access_notes": "Public SCB/Statistics Sweden PxWeb/bulk API; attribution and SCB terms should be preserved.",
"metadata_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/metadata/population_marital_status_deso__metadata.json",
"metadata_url": "https://statistikdatabasen.scb.se/api/v2/tables/TAB6570?lang=en",
"parsing_warnings": [],
"processed_csv_bytes": 222589539,
"processed_csv_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/processed/population_marital_status_deso.csv",
"processed_csv_sha256": "fa2766246ab402fe0a36f2f5b9f7761a3ee221b00a4310217b77a2e6fc1b3ea5",
"quality_flags": [],
"raw_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/raw/population_marital_status_deso__TAB6570_en.zip",
"reference_period": "2010-2025; latest 2025",
"retrieval_timestamp": "2026-05-19T18:24:13.724737+00:00",
"role": "Marital/civil-status marginal by sex",
"row_count_observed": 4603680,
"scb_label": "Population per region by marital status and sex. Year 2010-2025",
"scb_updated": "2026-03-24T07:00:00Z",
"sha256": "d9f55c8f29794395fb5af474601a35ed2b9af31a2a8fbb36534870d204a32b48",
"source_id": "SE_SCB_population_marital_status_deso",
"source_status": "official",
"source_system": "SCB PxWeb public bulk API",
"status": "frozen",
"table_id": "TAB6570",
"variables": [
"region",
"marital status",
"sex",
"observations",
"year"
]
},
{
"artifact_id": "population_background_deso",
"bulk_url": "https://www.statistikdatabasen.scb.se/Resources/PX/bulk/ssd/en/TAB6571_en.zip",
"bytes": 12142729,
"content_length_header": "12142729",
"content_type": "cached",
"evidence_tier": "HARD",
"geography_level": "DeSO/RegSO 2025 + municipality/county/national rows",
"http_status": "cached",
"lang": "en",
"license_access_notes": "Public SCB/Statistics Sweden PxWeb/bulk API; attribution and SCB terms should be preserved.",
"metadata_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/metadata/population_background_deso__metadata.json",
"metadata_url": "https://statistikdatabasen.scb.se/api/v2/tables/TAB6571?lang=en",
"parsing_warnings": [],
"processed_csv_bytes": 149158193,
"processed_csv_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/processed/population_background_deso.csv",
"processed_csv_sha256": "ec58fbe701f508fad333eed536d816a7f9c546eea01e69f366a1d3c15a9aa7a1",
"quality_flags": [],
"raw_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/raw/population_background_deso__TAB6571_en.zip",
"reference_period": "2010-2025; latest 2025",
"retrieval_timestamp": "2026-05-19T18:24:16.727656+00:00",
"role": "Swedish/foreign background marginal by sex",
"row_count_observed": 2762208,
"scb_label": "Population per region by foreign or Swedish background and sex. Year 2010-2025",
"scb_updated": "2026-03-24T07:00:00Z",
"sha256": "d998c76d3c8ae89fb710d1566b72fc3b254e9a58e900facd9c026977957768af",
"source_id": "SE_SCB_population_background_deso",
"source_status": "official",
"source_system": "SCB PxWeb public bulk API",
"status": "frozen",
"table_id": "TAB6571",
"variables": [
"region",
"foreign/Swedish background",
"sex",
"observations",
"year"
]
},
{
"artifact_id": "population_citizenship_deso",
"bulk_url": "https://www.statistikdatabasen.scb.se/Resources/PX/bulk/ssd/en/TAB6569_en.zip",
"bytes": 15951682,
"content_length_header": "15951682",
"content_type": "cached",
"evidence_tier": "HARD",
"geography_level": "DeSO/RegSO 2025 + municipality/county/national rows",
"http_status": "cached",
"lang": "en",
"license_access_notes": "Public SCB/Statistics Sweden PxWeb/bulk API; attribution and SCB terms should be preserved.",
"metadata_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/metadata/population_citizenship_deso__metadata.json",
"metadata_url": "https://statistikdatabasen.scb.se/api/v2/tables/TAB6569?lang=en",
"parsing_warnings": [],
"processed_csv_bytes": 232315916,
"processed_csv_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/processed/population_citizenship_deso.csv",
"processed_csv_sha256": "1b0552967e5198133be030fa102a6b320cdb37d0f429aac5b27b1ad608660f24",
"quality_flags": [],
"raw_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/raw/population_citizenship_deso__TAB6569_en.zip",
"reference_period": "2010-2025; latest 2025",
"retrieval_timestamp": "2026-05-19T18:24:18.495190+00:00",
"role": "Citizenship marginal by sex",
"row_count_observed": 3682944,
"scb_label": "Population per region by citizenship and sex. Year 2010-2025",
"scb_updated": "2026-03-24T07:00:00Z",
"sha256": "3f2a68d89aa74ab7194c490f83e4d411e8f76c203146a0788d0c6381d8e519f2",
"source_id": "SE_SCB_population_citizenship_deso",
"source_status": "official",
"source_system": "SCB PxWeb public bulk API",
"status": "frozen",
"table_id": "TAB6569",
"variables": [
"region",
"citizenship",
"sex",
"observations",
"year"
]
},
{
"artifact_id": "population_birth_region_deso",
"bulk_url": "https://www.statistikdatabasen.scb.se/Resources/PX/bulk/ssd/en/TAB6572_en.zip",
"bytes": 16175355,
"content_length_header": null,
"content_type": null,
"evidence_tier": "HARD",
"geography_level": "DeSO/RegSO 2025 + municipality/county/national rows",
"http_status": "downloaded_by_curl",
"lang": "en",
"license_access_notes": "Public SCB/Statistics Sweden PxWeb/bulk API; attribution and SCB terms should be preserved.",
"metadata_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/metadata/population_birth_region_deso__metadata.json",
"metadata_url": "https://statistikdatabasen.scb.se/api/v2/tables/TAB6572?lang=en",
"parsing_warnings": [],
"processed_csv_bytes": 220674507,
"processed_csv_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/processed/population_birth_region_deso.csv",
"processed_csv_sha256": "7c5949b68270fad451bf1615a4514f7d7368cd9e3fb362f425ea6be9c8188e98",
"quality_flags": [],
"raw_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/raw/population_birth_region_deso__TAB6572_en.zip",
"reference_period": "2010-2025; latest 2025",
"retrieval_timestamp": "2026-05-19T18:24:21.285571+00:00",
"role": "Region-of-birth marginal by sex",
"row_count_observed": 3682944,
"scb_label": "Population per region by region of birth and sex. Year 2010-2025",
"scb_updated": "2026-03-24T07:00:00Z",
"sha256": "b5682d6a965cb9ec8c7ff7cda2af91e20593e9dd05366eb79bd8e4cf47c6062c",
"source_id": "SE_SCB_population_birth_region_deso",
"source_status": "official",
"source_system": "SCB PxWeb public bulk API",
"status": "frozen",
"table_id": "TAB6572",
"variables": [
"region",
"region of birth",
"sex",
"observations",
"year"
]
},
{
"artifact_id": "population_birth_country_muni_2024",
"bulk_url": "https://www.statistikdatabasen.scb.se/Resources/PX/bulk/ssd/en/TAB6030_en.zip",
"bytes": 15573520,
"content_length_header": null,
"content_type": null,
"evidence_tier": "FIRM",
"geography_level": "municipality/county/national",
"http_status": "downloaded_by_curl",
"lang": "en",
"license_access_notes": "Public SCB/Statistics Sweden PxWeb/bulk API; attribution and SCB terms should be preserved.",
"metadata_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/metadata/population_birth_country_muni_2024__metadata.json",
"metadata_url": "https://statistikdatabasen.scb.se/api/v2/tables/TAB6030?lang=en",
"parsing_warnings": [],
"processed_csv_bytes": 203022538,
"processed_csv_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/processed/population_birth_country_muni_2024.csv",
"processed_csv_sha256": "f7f126c756d0189af3f6e8109836b8bfe569b4198e3092c41ebd86a59ef43218",
"quality_flags": [],
"raw_path": "/home/synthestat/data/ingestion/SE/source_freeze/20260519T180750Z/raw/population_birth_country_muni_2024__TAB6030_en.zip",
"reference_period": "2024 selected from annual table",
"retrieval_timestamp": "2026-05-19T18:24:34.871373+00:00",
"role": "Detailed country-of-birth by sex for origin assignment",
"row_count_observed": 4315125,
"scb_label": "Population by region, country of birth and sex. Year 2000-2024",
"scb_updated": "2025-02-21T07:00:00Z",
"sha256": "b466a87357953238a5ca17306666165cf5d6fe45f8b2578a215b456792bd8e68",
"source_id": "SE_SCB_population_birth_country_muni_
… truncated after 12,000 characters …
unavailable.json
{
"hidden_population_overlays": {
"reason": "Marginal-source closure found official hidden-population source families only at institutional/national/soft-guide scope; no DeSO private-household hidden stock can be hard- or firm-constrained.",
"required_evidence": "Homelessness, irregular migrant, dormitory/institutional and seasonal-worker resident-stock estimates by geography with reference-period semantics.",
"status": "unavailable"
},
"synthetic_building_assignments": {
"reason": "No approved live official Lantm\u00e4teriet/SCB/HERE residential building/dwelling/address extract was frozen for cycle 2; do not assign households to real buildings silently.",
"required_evidence": "Official residential building/address/dwelling extract with stable IDs and DeSO/municipality linkage.",
"status": "unavailable"
},
"work_school_assignments": {
"reason": "OD commuters are frozen at municipality scope but individual workplace/school truth and school roster assignment evidence were not frozen.",
"required_evidence": "Workplace/school destination marginals or assignment priors by age/education/municipality with uncertainty.",
"status": "unavailable"
}
}
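A consumer can treat these entries as a guard so that unavailable components are never used by accident. The sketch below is an assumption about how a downstream reader might do that: the function name `require_available` is hypothetical, and the inline JSON keeps only the `status` fields from the file above.

```python
import json

# Trimmed copy of the structure above; reasons omitted for brevity.
UNAVAILABLE_JSON = """
{
  "hidden_population_overlays": {"status": "unavailable"},
  "synthetic_building_assignments": {"status": "unavailable"},
  "work_school_assignments": {"status": "unavailable"}
}
"""

def require_available(payload, name):
    """Raise if the named component is marked unavailable; otherwise do nothing."""
    entry = payload.get(name, {})
    if entry.get("status") == "unavailable":
        reason = entry.get("reason", "no reason recorded")
        raise RuntimeError(f"{name} is unavailable: {reason}")

payload = json.loads(UNAVAILABLE_JSON)
try:
    require_available(payload, "work_school_assignments")
except RuntimeError as err:
    print(err)
```

Failing loudly mirrors the bundle's stated position that these assignments are emitted as unavailable rather than silently inferred.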