Dutch Proficiency Exam Scores Dataset
Education & Learning Analytics
Free
About
This dataset describes the Dutch proficiency and linguistic background of adult language learners in the Netherlands. Drawing on results collected over several decades, it supports study of the relationship between language proficiency and individual factors: native language, country of origin, age at arrival in the Netherlands, length of residence, days of formal Dutch as a second language education, gender, and the language family of the native language. It also includes scores from a Dutch proficiency exam covering speaking performance, lexical understanding, morphological knowledge, and the acquisition of new sounds and features. Together, these variables make it possible to examine which factors are associated with success in adult language learning.
Columns
- L1: Native language of the participant. (String)
- C: Country of origin of the participant. (String)
- L1L2: Linguistic similarity between the native language and the target language. (Integer)
- AaA: Age at arrival in the Netherlands. (Integer)
- LoR: Length of residence in the Netherlands. (Integer)
- Edu.day: Formal education days in the target language. (Integer)
- Sex: Gender of the participant. (String)
- Family: Language family of the participant's native language (e.g., Indo-European, Afro-Asiatic). (String)
- ISO639.3: ISO 639-3 codes for the target language. (String)
- Enroll: Duration enrolled in language courses. (Integer)
- Speaking: Speaking proficiency test score on the State Examination of Dutch as a Second Language. (Integer)
- morph: Morphological score related to knowledge of structures within words. (Integer)
- lex: Lexicon score indicating understanding of written words. (Integer)
- new_feat: Feature score reflecting ability to acquire new sounds/grammatical structures. (Integer)
- new_sounds: Sound symbols score evaluating pronunciation. (Integer)
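As a quick sanity check, the column names documented here can be compared against the header row of the CSV file. The following is a minimal Python sketch, not part of the dataset's own tooling. The helper name `compare_header` and the in-memory sample header are hypothetical and serve only as illustration.

```python
import csv
import io

# Column names as documented in this listing (15 names).
DOCUMENTED_COLUMNS = [
    "L1", "C", "L1L2", "AaA", "LoR", "Edu.day", "Sex", "Family",
    "ISO639.3", "Enroll", "Speaking", "morph", "lex", "new_feat", "new_sounds",
]


def compare_header(fileobj):
    """Return (missing, unexpected) column names relative to the documented list."""
    header = next(csv.reader(fileobj))
    missing = [name for name in DOCUMENTED_COLUMNS if name not in header]
    unexpected = [name for name in header if name not in DOCUMENTED_COLUMNS]
    return missing, unexpected


# Hypothetical header standing in for the real file; not actual dataset content.
sample = io.StringIO("id,L1,C,AaA,Speaking\n")
missing, unexpected = compare_header(sample)
print("missing:", missing)
print("unexpected:", unexpected)
```

To run the check on the real file, pass an open file handle instead of the sample, for example `open("stex.csv", newline="", encoding="utf-8")`. The UTF-8 encoding is an assumption and may need adjusting.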
Distribution
The dataset is provided as a CSV file (stex.csv), with a size of 5.79 MB. It contains 16 columns and approximately 50,200 records for most columns, providing a substantial collection of data points.
Usage
This dataset is ideal for:
- Identifying correlations between linguistic similarity, length of residence in the Netherlands, and language proficiency.
- Exploring differences in language proficiency across gender and language family groups.
- Creating targeted educational programmes for adult language learners based on their native languages, ages of arrival, and other characteristics.
- Understanding how individuals learn new languages as adults in the Netherlands.
- Analysing pronunciation and grammatical ability.
- Running descriptive and inferential analyses, including means, medians, frequency tables, pivot tables, visualisations, correlation tests, chi-square tests, ANOVA, regression modelling, and clustering.
Coverage
The dataset covers adult language learners in the Netherlands, with results collected over several decades. Its demographic scope includes participants' native languages (e.g., Arabic 12%, German 10%), countries of origin (e.g., Germany 10%, Morocco 7%), linguistic similarity (e.g., GermanEnglish 9%), age at arrival (mean 26.5), length of residence (mean 3.92), formal education days (mean 3.13), gender (66% Female, 34% Male), and language families (e.g., Indo-European 68%, Afro-Asiatic 15%).
License
CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
Who Can Use It
- Researchers in linguistics and second language acquisition, to study factors influencing adult language learning.
- Educators and policy makers, to develop and refine language education programmes for adult immigrants and residents.
- Data scientists and analysts, to uncover patterns, trends, and correlations within adult language proficiency data.
- Organisations concerned with integration and language support for international populations in the Netherlands.
Dataset Name Suggestions
- Adult Dutch Language Learner Profiles
- Netherlands Second Language Proficiency Data
- Dutch Adult Language Acquisition Factors
- Adult Language Learning Outcomes Netherlands
- Dutch Proficiency Exam Scores Dataset
Attributes
Original Data Source: Dutch Proficiency Exam Scores Dataset