Luxembourg Population Synthetic Profiles

Synthetic Data Generation

Tags and Keywords

Luxembourg

Synthetic

Demographics

Population

Europe

Free

About

This listing provides a synthetic dataset of Luxembourg citizens, referred to as SDF, generated to reflect the statistical distributions published in official public sources. It is built with open-source methodologies developed by the Luxembourg National Data Service (LNDS). It serves as an example of a well-structured dataset and lets users tailor parameters such as the sample size. The data incorporates real statistics on the age structure, the population distribution across municipalities, the share of each nationality, and salary levels across geographical regions in Luxembourg.

Columns (14 total)

The data product contains 14 features describing individual profiles, with one profile per row:
  • Social_matricule: A unique identifier for each synthetic individual profile.
  • First_name and Surname: Personal identifiers used in the simulation.
  • Gender: Distributed as 53% Female (F) and 47% Male (M).
  • Age: Ranges from 18 years up to 95 years, with an average age of 47.
  • Date_of_birth: Temporal data correlated with the age structure.
  • Nationality: Distribution includes 61% Luxembourgish, 15% Portuguese, and 18 other nationalities.
  • Municipality and Canton: Geographic attributes detailing residency within Luxembourg. Luxembourg City and Canton Esch are the most common locations.
  • Salary: Annual salary, ranging up to 46,000, with a mean of approximately 4,670.
  • Ethnicity: Distributed with White representing 79% of the population, Black at 11%, and other groups making up the remainder.
  • hair_color and hair_lenght: Additional physical attributes.
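The categorical shares quoted in the list above (for example the Gender split) can be recomputed from the file. The following is a minimal sketch, assuming the CSV header uses the column names exactly as listed here; the helper name column_shares and the sample rows are made up for illustration and are not taken from the real file.

```python
import csv
import io
from collections import Counter


def column_shares(csv_file, column):
    """Return {value: share} for one categorical column of a CSV file object."""
    reader = csv.DictReader(csv_file)
    counts = Counter(row[column] for row in reader)
    total = sum(counts.values())
    return {value: count / total for value, count in counts.items()}


# Made-up illustrative rows, NOT taken from the real dataset.
sample = io.StringIO(
    "Social_matricule,Gender,Nationality\n"
    "1,F,Luxembourgish\n"
    "2,M,Portuguese\n"
    "3,F,Luxembourgish\n"
    "4,M,Luxembourgish\n"
)
shares = column_shares(sample, "Gender")
print(shares)
```

To run it against the downloaded file, pass an opened file instead of the sample, for instance open("synthetic-lux-pop-dataset-1000.csv", newline="", encoding="utf-8"). The UTF-8 encoding is an assumption.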

Distribution

The data product is available as a .csv file named synthetic-lux-pop-dataset-1000.csv, with a file size of 108.54 kB. It contains 1000 valid individual records (rows) and 14 personal features (columns), with one profile per row and one feature per column.
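The stated shape can be checked after download. This is a minimal sketch, assuming the file has a single header row and standard comma separation; the helper name csv_shape is hypothetical and the inline sample is made up.

```python
import csv
import io


def csv_shape(csv_file):
    """Return (data_row_count, column_count) for a CSV file object with a header row."""
    reader = csv.reader(csv_file)
    header = next(reader)
    # csv.reader yields an empty list for blank lines, so skip those.
    row_count = sum(1 for row in reader if row)
    return row_count, len(header)


# Made-up two-row sample, used only to show the call.
demo_shape = csv_shape(io.StringIO("Age,Canton\n30,Luxembourg\n41,Mersch\n"))
print(demo_shape)
```

If the listing is accurate, calling csv_shape on the opened synthetic-lux-pop-dataset-1000.csv should return 1000 rows and 14 columns.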

Usage

This data is ideal for simulations and modelling exercises focused on European demographics. Potential applications include developing and testing data pipelines, demonstrating synthetic data generation techniques, and conducting socio-economic analyses of population structures. It is highly suitable for educational purposes related to statistics and data handling.
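As one example of the socio-economic analyses mentioned above, salaries can be averaged per canton. This is a minimal sketch, assuming the Salary column parses as a plain number and the Canton column holds one canton name per row; the helper name mean_salary_by_canton and the sample values are made up for illustration.

```python
import csv
import io
from collections import defaultdict
from statistics import mean


def mean_salary_by_canton(csv_file):
    """Return {canton: mean salary} from a CSV file object."""
    salaries = defaultdict(list)
    for row in csv.DictReader(csv_file):
        salaries[row["Canton"]].append(float(row["Salary"]))
    return {canton: mean(values) for canton, values in salaries.items()}


# Made-up illustrative rows, NOT taken from the real dataset.
sample_salaries = io.StringIO(
    "Canton,Salary\n"
    "Luxembourg,5000\n"
    "Luxembourg,4000\n"
    "Esch-sur-Alzette,3000\n"
)
by_canton = mean_salary_by_canton(sample_salaries)
print(by_canton)
```

The same grouping pattern works for Municipality or Nationality by changing the key column.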

Coverage

The data focuses entirely on simulated citizens of Luxembourg. It covers population distributions across the 90 municipalities and 12 cantons. The modelled age range is 18 to 95 years. The data is synthetic and designed to comply with privacy regulations by ensuring a near-zero risk of identifying any real person. This dataset is static and has an expected update frequency of "Never."
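The coverage figures above translate directly into sanity checks on a downloaded copy. This is a minimal sketch, assuming the columns Age, Canton and Municipality exist under those names and that Age is an integer; the helper name coverage_problems and the sample rows are made up, and the default limits simply restate the numbers in this listing.

```python
import csv
import io


def coverage_problems(csv_file, min_age=18, max_age=95,
                      max_cantons=12, max_municipalities=90):
    """Return a list of messages for rows or totals outside the stated coverage."""
    problems = []
    cantons, municipalities = set(), set()
    # Data rows start on line 2 because line 1 is the header.
    for line_no, row in enumerate(csv.DictReader(csv_file), start=2):
        age = int(row["Age"])
        if not min_age <= age <= max_age:
            problems.append(f"line {line_no}: age {age} outside {min_age}-{max_age}")
        cantons.add(row["Canton"])
        municipalities.add(row["Municipality"])
    if len(cantons) > max_cantons:
        problems.append(f"{len(cantons)} distinct cantons, expected at most {max_cantons}")
    if len(municipalities) > max_municipalities:
        problems.append(
            f"{len(municipalities)} distinct municipalities, expected at most {max_municipalities}"
        )
    return problems


# Made-up illustrative rows, NOT taken from the real dataset.
sample_rows = io.StringIO(
    "Age,Canton,Municipality\n"
    "30,Luxembourg,Luxembourg\n"
    "17,Mersch,Mersch\n"
)
problems = coverage_problems(sample_rows)
print(problems)
```

An empty list means every row falls inside the stated age range and the distinct canton and municipality counts do not exceed the stated totals.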

License

CC0: Public Domain

Who Can Use It

Intended users include data scientists requiring realistic, anonymised data for model training; academic researchers studying population statistics and privacy preservation; software developers needing test data that reflects realistic demographic variances; and intermediate analysts focusing on European socio-economic trends.

Dataset Name Suggestions

  • Luxembourg Population Synthetic Profiles
  • SDF Citizens of Luxembourg (1000 Records)
  • Luxembourg Demographic Structure Model
  • Statistical Simulation Data for Luxembourg

Attributes

Listing Stats

LISTED

07/10/2025

REGION

GLOBAL

UDQS QUALITY (Universal Data Quality Score)

5 / 5

VERSION

1.0