Academic Influence and Online Attention Data
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This collection provides detailed metadata for scientific research articles, focusing heavily on publications in the fields of physics and computational science. The resource is designed to serve as a practice dataset for altmetrics analysis and bibliometric studies, offering crucial metrics such as citation counts, digital object identifiers (DOI), and various measures of online attention and social media engagement. This data enables users to explore the digital footprint and scholarly impact of contemporary scientific literature.
Columns
The dataset contains 16 attributes derived from the research articles:
- Title: The formal title of the published article.
- Abstract: A brief descriptive summary of the publication’s content. Note that 15 entries are missing an abstract.
- DOI: The Digital Object Identifier, which acts as a permanent, unique alphanumeric string for accessing the content online.
- Citations: The total number of times the article has been cited in subsequent academic works.
- Accesses: The recorded frequency of views or accesses to the publication.
- Online Attention: A general measure of the digital engagement the article has garnered.
- Published Datetime: The date and time the publication was originally released. The time range spans from 22 September 2005 to 25 April 2024.
- Tweeters: The numerical count of mentions on X (formerly Twitter).
- Blogs: The number of unique blog posts mentioning the article.
- Facebook Pages: The number of mentions or shares on Facebook pages.
- News Outlets: The count of times the publication has been featured in news media.
- Redditors: The number of discussions or mentions found on Reddit.
- Video Uploaders: The count of videos related to or mentioning the publication.
- Wikipedia Page: An integer indicating mentions on or connections to Wikipedia pages.
- Mendeley: The number of times the publication has been saved or referenced using the Mendeley tool.
- Topic: Labels assigned to categorise the subject matter, such as Physics or Engineering.
Distribution
The source data is available as a CSV file titled
latest_research_articles.csv
, with an approximate file size of 6.48 MB. The structure consists of 4,184 distinct records. The articles cover a time period stretching nearly two decades, with the earliest publication date recorded in September 2005 and the most recent in April 2024.Usage
This data is perfectly suited for academic research, particularly in the fields of library science and data science. Typical uses include:
- Performing detailed altmetrics analysis to study how research impact is reflected outside of traditional citation counts.
- Training machine learning models to predict future citation success or online engagement based on article metadata.
- Conducting longitudinal studies on publishing trends and shifts in research focus within Physics and Engineering over time.
- Investigating the correlation between different measures of attention, such as tweets versus blog posts.
Coverage
The dataset spans a temporal window from September 2005 through April 2024. The primary topical scope is dominated by Physics articles (approximately 70%) and Engineering articles (approximately 24%), along with minor representation from other scientific disciplines. The source does not specify geographic or demographic restrictions, suggesting the coverage is of globally available scholarly publications.
License
CC0: Public Domain
Who Can Use It
- Researchers: To study the dynamics of academic influence and public engagement with science.
- Data Scientists/Analysts: For practising data cleaning, feature engineering, and predictive modelling using real-world bibliometric data.
- Librarians and Institutional Strategists: To monitor the visibility and digital performance of scholarly output.
Dataset Name Suggestions
- Global Research Article Altmetrics
- Physics and Computational Science Publication Metrics
- Academic Influence and Online Attention Data
- Latest Scientific Research Citations
Attributes
Original Data Source: Academic Influence and Online Attention Data