Opendatabay APP

Youtube Videos Dataset (~3400 videos)

Social Media and Networking

Tags and Keywords

Arts and Entertainment

Tabular

Classification

Intermediate

NLP

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Youtube Videos Dataset (~3400 videos) Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Context πŸ“ƒ I wanted to practice text classification using NLP techniques, so I thought why not practice it by generating the data myself! This way, I brushed up on my scraping techniques using Selenium, collected the data, cleaned it, and then started working on it. You can take a peek at my work Github Repository For This Dataset and Trained Models/ Results
Content πŸ“° The total number of videos scraped was 3600. I scraped the following things from each video:
link title description category Video ID Category for which the video was scraped Description of the video Category for which the video was scraped. I queried the videos for 4 categories:
Travel Vlogs 🧳 Food πŸ₯‘ Art and Music 🎨 🎻 History πŸ“œ
Acknowledgements πŸ™ I could have used a ready made API, but just for the fun of it, I scraped the data from Youtube using Selenium.
Inspiration πŸ¦‹ The data is not clean (for your enjoyment of cleaning the data!), has some missing values, and is imbalanced. Practice text classification on this dataset, you will have to learn different techniques for eg:- How to handle imbalanced classes..? While working on this dataset, you will learn a lot of different things and also get an opportunity to apply on this dataset.

Listing Stats

VIEWS

1

DOWNLOADS

0

LISTED

08/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free