Hebrew Music Lyrics Database
Entertainment & Media Consumption
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides a substantial collection of approximately 15,000 Israeli song lyrics. It has been compiled by scraping content from the
shironet.mako.co.il
website and features lyrics from 167 distinct singers. A key characteristic of this dataset is that it exclusively contains Hebrew characters, with non-Hebrew words and characters having been removed during processing. Each entry in the dataset includes the song, artist, and a link to its original source. It is ideal for linguistic analysis, natural language processing, and understanding trends within Israeli music.Columns
The dataset is structured with the following columns:
- artist: The Hebrew name of the artist performing the song.
- songs: A list representing the words within the song lyrics.
- song: The title of the song.
- artist_key: A unique identifier assigned to each artist.
- url: The original web link to the song's lyrics.
- words count: The total number of words recognised in the song lyrics.
- unique words count: The count of distinct words recognised in the song lyrics.
Distribution
The data is typically provided as a single file, usually in CSV format. It contains around 15,000 records, representing individual Israeli songs. While specific row counts for each song are not provided, various metric distributions, such as word counts (ranging from 0 to over 700 words per song) and unique word counts, indicate the textual depth of the entries. This suggests the dataset is well-suited for statistical analysis of lyrical content.
Usage
This dataset is ideally suited for:
- Natural Language Processing (NLP) tasks, including text analysis, sentiment analysis, and topic modelling of Hebrew lyrics.
- Linguistic research into the Hebrew language, vocabulary usage, and stylistic differences among artists.
- Music industry analysis to identify popular themes, artists, or lyrical structures in Israeli music.
- Developing AI and machine learning models that require a large corpus of Hebrew text data.
- Cultural studies focusing on Israeli identity and expression through song lyrics.
Coverage
The dataset's coverage is primarily geographic to Israel, as it contains Israeli songs performed by artists with Hebrew names. The content is exclusively in Hebrew characters. It includes songs from 167 different singers, providing a broad scope of artists within the Israeli music scene. There is no specific time range mentioned for the song releases, but the source of the data suggests it reflects content available on the
shironet.mako.co.il
website. The region for the dataset listing is stated as GLOBAL.License
CC-BY-SA:
Who Can Use It
This dataset is valuable for:
- Researchers and Academics: For linguistic studies, cultural analysis, and computational linguistics projects involving Hebrew.
- Data Scientists and Machine Learning Engineers: For training NLP models, text classification, and recommendation systems related to music.
- Music Analysts and Enthusiasts: For exploring lyrical trends, artist discographies, and the evolution of Israeli music.
- Developers: Those building applications that require a robust source of Hebrew text or song lyrics.
Dataset Name Suggestions
- Israeli Song Lyrics Corpus
- Hebrew Music Lyrics Database
- Shironet Hebrew Song Data
- Israeli Artists Lyric Archive
- Hebrew Song Text Analysis Set
Attributes
Original Data Source: Hebrew songs lyrics