Korean Linguistic Phrases Reference
Data Science and Analytics
Free
About
This collection contains 6,548 distinct Korean proverbs (속담) and fixed idioms (관용구), gathered from authoritative dictionary sources. It supports detailed research into Korean linguistic patterns and cultural context, and is useful for developers building natural language processing tools or educational applications for the Korean language.
Columns
- Description (문장): The full text of the Korean idiom or proverb. All 6,548 entries are unique in this field.
- Meaning (풀이, 의미): The detailed definition or explanation associated with the phrase.
- Source (출처): Indicates the primary dictionary reference from which the entry was sourced. Major contributors include the 표준국어대사전 (60%) and 고려대 한국어대사전 (30%).
- Type: Classifies the entry as either a proverb (속담), approximately 68% of the data, or a fixed idiom (관용구), the remaining 32%.
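The column layout above can be sanity-checked with a short script. The following is a minimal sketch using only the Python standard library. It assumes the CSV header uses the English labels `Description`, `Meaning`, `Source`, and `Type`; the actual header names in the file are not stated here and may differ. The sample rows are illustrative and are not taken from the dataset.

```python
import csv
import io
from collections import Counter

# Assumed header names; the real file may label its columns differently.
COLUMNS = ["Description", "Meaning", "Source", "Type"]


def summarise(csv_file):
    """Return (row count, unique phrase count, share of each Type value)."""
    reader = csv.DictReader(csv_file)
    missing = [c for c in COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing columns: {missing}")

    phrases = []
    types = Counter()
    for row in reader:
        phrases.append(row["Description"])
        types[row["Type"]] += 1

    total = len(phrases)
    shares = {t: n / total for t, n in types.items()} if total else {}
    return total, len(set(phrases)), shares


# Tiny in-memory stand-in for the real file (illustrative rows and glosses).
SAMPLE = (
    "Description,Meaning,Source,Type\n"
    "가는 말이 고와야 오는 말이 곱다,Kind words invite kind words,표준국어대사전,속담\n"
    "소 잃고 외양간 고친다,Mending the barn after losing the cow,표준국어대사전,속담\n"
    "발이 넓다,To have many acquaintances,고려대 한국어대사전,관용구\n"
)

total, unique, shares = summarise(io.StringIO(SAMPLE))
print(total, unique, shares)

# Against the real file (UTF-8 encoding is an assumption):
# with open("idioms.csv", encoding="utf-8", newline="") as f:
#     print(summarise(f))
```

If the listing is accurate, running this on the full file should report 6,548 rows, 6,548 unique phrases, and Type shares close to 68% and 32%.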
Distribution
The dataset is structured as a CSV file, named idioms.csv, with a file size of 1.2 MB. It consists of four columns and 6,548 validated records. The data is considered static, as the expected update frequency is listed as 'Never'.
Usage
Ideal applications for this data include enhancing the accuracy of Korean language translation tools, training advanced NLP models to recognise cultural nuances in text, developing specialised Korean language learning software, and supporting academic research in linguistics, lexicography, and cultural studies.
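As one illustration of the translation and learning use cases, the sketch below flags known phrases that appear verbatim in a sentence and returns their meanings. It is a hypothetical helper, not part of the dataset, and it assumes the phrase-to-meaning mapping has already been built from the Description and Meaning columns. Exact substring matching is a deliberate simplification: Korean idioms are often inflected in running text, so a real tool would likely need morphological analysis to catch conjugated forms.

```python
def find_phrases(text, phrase_meanings):
    """Return (phrase, meaning) pairs whose phrase occurs verbatim in text.

    Longer phrases are listed first so that a caller can prefer the most
    specific match when phrases overlap.
    """
    hits = [(p, m) for p, m in phrase_meanings.items() if p in text]
    return sorted(hits, key=lambda pair: len(pair[0]), reverse=True)


# Illustrative entries; the glosses are not taken from the dataset.
LEXICON = {
    "발이 넓다": "To have many acquaintances",
    "소 잃고 외양간 고친다": "Mending the barn after losing the cow",
}

matches = find_phrases("그 사람은 정말 발이 넓다.", LEXICON)
print(matches)
```

A translation pipeline could use such matches to substitute an idiomatic rendering instead of translating the phrase word by word.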
Coverage
The material focuses exclusively on the Korean language and its usage. The data represents established proverbs and idioms sourced from major Korean dictionaries. The scope is cultural and linguistic, with no specific geographic or temporal limits beyond the established record of these phrases in Korean literature and reference materials.
License
Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)
Who Can Use It
- Linguists and Academics: For detailed analysis of Korean syntax, semantics, and phrase construction.
- NLP Developers: For training models on the structured data and improving contextual understanding in AI applications.
- Korean Language Students: As an authoritative reference to learn and master advanced, culturally significant phrases.
- Lexicographers: For building or verifying content in digital and print dictionaries.
Dataset Name Suggestions
- Korean Linguistic Phrases Reference
- 6500+ Korean Idioms and Proverbs Data
- Korean 속담 and 관용구 Dataset
Attributes
Original Data Source: Korean Linguistic Phrases Reference
