Opendatabay APP

Dutch Proficiency Exam Scores Dataset

Education & Learning Analytics

Tags and Keywords

Language

Dutch

Proficiency

Learning

Adults

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Dutch Proficiency Exam Scores Dataset Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset offers rich information on the proficiency and linguistic characteristics of adult language learners in the Netherlands. Drawing from results collected over several decades, it is a valuable resource for studying the relationship between language proficiency and various individual factors. These factors include native language, country of origin, age at arrival in the Netherlands, length of residence, days spent on formal Dutch as a second language education, gender identity, and family status. The dataset also includes scores from a Dutch proficiency exam, covering speaking performance, lexical understanding, morphological knowledge, and the acquisition of new sounds/features. This data provides a unique opportunity to uncover previously unseen correlations related to language learning success globally.

Columns

  • L1: Native language of the participant. (String)
  • C: Country of origin of the participant. (String)
  • L1L2: Linguistic similarity between the native language and the target language. (Integer)
  • AaA: Age at arrival in the Netherlands. (Integer)
  • LoR: Length of residence in the Netherlands. (Integer)
  • Edu.day: Formal education days in the target language. (Integer)
  • Sex: Gender of the participant. (String)
  • Family: Family status of the participant. (String)
  • ISO639.3: ISO 639-3 codes for the target language. (String)
  • Enroll: Duration enrolled in language courses. (Integer)
  • Speaking: Speaking proficiency test score on the State Examination of Dutch as a Second Language. (Integer)
  • morph: Morphological score related to knowledge structures within words. (Integer)
  • lex: Lexicon score indicating understanding of written words. (Integer)
  • new_feat: Feature score reflecting ability to acquire new sounds/grammatical structures. (Integer)
  • new_sounds: Sound symbols score evaluating pronunciation. (Integer)

Distribution

The dataset is provided in a CSV file format (stex.csv), with a size of 5.79 MB. It contains 16 columns and approximately 50,200 records for most columns, providing a substantial collection of data points.

Usage

This dataset is ideal for:
  • Identifying correlations between linguistic similarity, length of residence in the Netherlands, and language proficiency.
  • Exploring differences in language proficiency amongst various gender and family statuses.
  • Creating targeted educational programmes for adult language learners based on their native languages, ages of arrival, and other characteristics.
  • Understanding how individuals learn new languages as adults in the Netherlands.
  • Analysing pronunciation and grammatical ability.
  • Conducting powerful analyses using descriptive statistics such as mean, median, frequency tables, correlation tests, pivot tables, visualisations, clustering algorithms, chi-square tests, ANOVA testing, and regression modelling.

Coverage

The dataset focuses on adult language learners in the Netherlands, with results collected over several decades. It includes a diverse demographic scope, detailing participants' native languages (e.g., Arabic 12%, German 10%), countries of origin (e.g., Germany 10%, Morocco 7%), linguistic similarity (e.g., GermanEnglish 9%), age at arrival (mean 26.5), length of residence (mean 3.92), formal education days (mean 3.13), gender (66% Female, 34% Male), and family statuses (e.g., Indo-European 68%, Afro-Asiatic 15%).

License

CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication

Who Can Use It

  • Researchers in linguistics and second language acquisition, to study factors influencing adult language learning.
  • Educators and policy makers, to develop and refine language education programmes for adult immigrants and residents.
  • Data scientists and analysts, to uncover patterns, trends, and correlations within adult language proficiency data.
  • Organisations concerned with integration and language support for international populations in the Netherlands.

Dataset Name Suggestions

  • Adult Dutch Language Learner Profiles
  • Netherlands Second Language Proficiency Data
  • Dutch Adult Language Acquisition Factors
  • Adult Language Learning Outcomes Netherlands
  • Dutch Proficiency Exam Scores Dataset

Attributes

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

22/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in CSV Format