CNN News Dataset
Data Science and Analytics
£190
About
Use the CNN News dataset to access structured information about CNN articles, including headlines, authors, topics, publication dates, and multimedia elements such as videos and images. Popular use cases include analyzing journalistic trends, tracking content dissemination, and studying the evolution of news topics over time.
The CNN News dataset offers a comprehensive collection of metadata and content attributes for articles published by CNN, making it a valuable resource for understanding modern journalism and media trends. Each entry includes essential fields such as article ID, URL, authorship details, headline, assigned topics for categorization, publication date, and a last-updated timestamp indicating the most recent modification. The content field provides the full text of the article, complemented by the embedded videos and images associated with it.
The dataset also links related articles, offering additional context or perspectives on related topics, and includes keywords that highlight the primary themes and subjects of each piece. Ideal for researchers, media analysts, and journalism professionals, this dataset supports studies on news dissemination, audience engagement, and the dynamics of digital reporting. By leveraging the CNN News dataset, users can explore the evolution of news content, analyze media practices, and uncover trends shaping the digital news ecosystem.
Dataset Features
Below is a list of the different columns in the dataset along with a brief description of each:
- id: Unique identifier for each article
- url: Web address of the article
- author: Writer or contributor of the article
- headline: Title of the news article
- topics: Subject categories or themes
- publication_date: When the article was first published
- updated_last: Last modification date
- content: Main body text of the article
- videos: Video content associated with the article
- images: Visual media included in the article
- related_articles: Links to connected stories
- keyword: Key terms for categorization
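As a minimal sketch of how the columns above might be read, the following uses Python's standard `csv` module and checks the header against the 12 listed column names. The sample row, the `read_articles` helper, and the example URL are hypothetical placeholders; the actual encoding, quoting, and how multi-valued fields (topics, videos, images, related_articles) are serialized are not stated in this listing and should be checked against a delivered file.

```python
import csv
import io

# The 12 column names as listed in the dataset description.
EXPECTED_COLUMNS = [
    "id", "url", "author", "headline", "topics", "publication_date",
    "updated_last", "content", "videos", "images", "related_articles",
    "keyword",
]


def read_articles(handle):
    """Yield one dict per article row after validating the header."""
    reader = csv.DictReader(handle)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    for row in reader:
        yield row


# Hypothetical one-row sample standing in for the real CSV file.
SAMPLE = (
    ",".join(EXPECTED_COLUMNS) + "\n"
    "a1,https://example.com/a1,Jane Doe,Sample headline,politics,"
    "2024-01-15,2024-01-16,Body text,,,,election\n"
)

rows = list(read_articles(io.StringIO(SAMPLE)))
print(rows[0]["headline"], rows[0]["publication_date"])
```

For a real delivery, the same function could be passed a file opened with `open(path, newline="", encoding="utf-8")`; the UTF-8 encoding is an assumption. Because the generator yields rows one at a time, the full file does not need to fit in memory.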
Distribution
- Data Volume: 12 Columns and 712.8K Rows
- Format: CSV
Usage
This dataset is valuable for:
- Content Analysis: Studying news reporting patterns and editorial focus
- Media Research: Analyzing CNN's coverage and reporting style
- NLP Applications: Training models for news classification and content analysis
- Multimedia Analysis: Studying the integration of text, images, and videos in digital news
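The content-analysis use case above could start from something like the following sketch, which counts how often each topic appears per publication month. It assumes that `publication_date` is an ISO 8601 string and that `topics` holds several values joined by a `|` separator; neither format is documented in this listing, so both the parsing and the hypothetical `topic_sep` parameter would need adjusting to the delivered data.

```python
from collections import Counter
from datetime import datetime


def topic_counts_by_month(rows, topic_sep="|"):
    """Count (YYYY-MM, topic) pairs across article rows."""
    counts = Counter()
    for row in rows:
        # Assumption: ISO 8601 dates such as "2024-01-15".
        month = datetime.fromisoformat(row["publication_date"]).strftime("%Y-%m")
        for topic in row["topics"].split(topic_sep):
            topic = topic.strip().lower()
            if topic:
                counts[(month, topic)] += 1
    return counts


# Hypothetical rows; real rows would come from the CSV file.
sample_rows = [
    {"publication_date": "2024-01-15", "topics": "politics|economy"},
    {"publication_date": "2024-01-20", "topics": "politics"},
    {"publication_date": "2024-02-02", "topics": "health"},
]

counts = topic_counts_by_month(sample_rows)
for (month, topic), n in sorted(counts.items()):
    print(month, topic, n)
```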
Coverage
- Geographic Coverage: Global
License
CUSTOM
Please review the respective licenses below:
- Data Provider's License
Who Can Use It
- Media Researchers: For studying digital journalism patterns
- Data Scientists: For text analysis and content classification
- Journalism Students: For studying professional news writing and structure
- Content Strategists: For understanding digital news presentation
Suggested Dataset Names
- CNN Media Vault
- Chrono CNN
- Insight CNN Dataset
- News Weave-CNN
- CNN Press Graph
Pricing
Based on delivery frequency
Up to approximately $0.0025 per record. Minimum order: $250.
Approximately 295K new records are added each month.
Approximately 726K records are updated each month.
Get the complete dataset each delivery, including all records.
Retrieve only the data you need with the flexibility to set Smart Updates.
- Monthly
New snapshot each month, 12 snapshots/year
Paid monthly
- Quarterly
New snapshot each quarter, 4 snapshots/year
Paid quarterly
- Bi-annual
New snapshot every 6 months, 2 snapshots/year
Paid twice a year
- One-time purchase
Single snapshot, delivered once
Paid once
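To make the per-record pricing concrete, the sketch below estimates an upper bound on the cost of a single delivery from the listed figures (up to about $0.0025 per record, $250 minimum order). It assumes the minimum applies to each delivery and that every record is billed at the cap; the listing does not confirm either, and actual quotes may be lower. The function name `max_delivery_cost` is illustrative only.

```python
from decimal import Decimal

# Figures taken from the listing; the per-record price is an upper bound.
PRICE_PER_RECORD_USD = Decimal("0.0025")
MIN_ORDER_USD = Decimal("250")


def max_delivery_cost(records):
    """Upper-bound cost in USD for one delivery of `records` rows.

    Assumption: the minimum order applies per delivery.
    """
    return max(Decimal(records) * PRICE_PER_RECORD_USD, MIN_ORDER_USD)


# Number of records at which the per-record cap reaches the minimum order.
break_even_records = MIN_ORDER_USD / PRICE_PER_RECORD_USD

full_snapshot = max_delivery_cost(712_800)  # row count from "712.8K Rows"
small_order = max_delivery_cost(50_000)     # falls under the minimum order
print(full_snapshot, small_order, break_even_records)
```

Under these assumptions, a full snapshot of roughly 712,800 rows would cost at most about $1,782, and any order below 100,000 records would be billed at the $250 minimum.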