Global Top Scientists Citation Metrics

Education & Learning Analytics

Tags and Keywords

  • Scientists
  • Citations
  • Research
  • Academia
  • Bibliometrics

Free

About

This dataset lists the 217,097 most cited scientists, based on data available up to and including 2023. It provides a standardised database of author citation metrics, annotated by scientific field, offering valuable insights into global scientific influence. The data is derived from methodology described in publications by John P. A. Ioannidis, Jeroen Baas, Richard Klavans, and Kevin W. Boyack.

Columns

  • authfull: The full name of the author.
  • inst_name: The name of the author's primary institution.
  • cntry: The country of the author's primary institution.
  • np6021: The number of papers published from 1960 to 2021.
  • lastyr: The last year represented in the author's career data or when their data was last updated.
  • rank (ns): The global rank of the scientist with self-citations excluded. Throughout this list, (ns) marks a metric computed without self-citations, following the convention of the Ioannidis et al. author databases.
  • nc9621 (ns): The number of citations received from 1996 to 2021, excluding self-citations.
  • h21 (ns): The h-index of the scientist up to 2021, excluding self-citations.
  • hm21 (ns): The hm-index of the scientist up to 2021, excluding self-citations.
  • nps (ns): The number of publications where the scientist is the sole author, in the self-citation-excluded set of metrics.
  • ncs (ns): The number of citations received for publications where the scientist is the sole author, excluding self-citations.
  • cpsf (ns): The citations per first or single author paper, excluding self-citations.
  • ncsf (ns): The number of citations received for publications where the scientist is the first or single author, excluding self-citations.
  • npsfl (ns): The number of publications where the scientist is the first, last, or sole author, in the self-citation-excluded set of metrics.
  • ncsfl (ns): The number of citations received for publications where the scientist is the first, last, or sole author, excluding self-citations.
  • c (ns): The composite citation score, computed with self-citations excluded.
  • npciting (ns): The number of papers citing the scientist's work, excluding self-citing papers.
  • cprat (ns): The citation ratio, computed with self-citations excluded.
  • np6021 cited9621 (ns): The number of papers published from 1960 to 2021 that were cited at least once between 1996 and 2021, excluding self-citations.
  • self%: The percentage of self-citations.
  • rank: The overall rank of the scientist.
  • nc9621: The total number of citations received from 1996 to 2021.
  • h21: The h-index of the scientist up to 2021.
  • hm21: The hm-index of the scientist up to 2021.
  • nps: The number of publications where the scientist is the sole author.
  • ncs: The number of citations received for publications where the scientist is the sole author.
  • cpsf: The citations per first or single author paper.
  • ncsf: The number of citations received for publications where the scientist is the first or single author.
  • npsfl: The number of publications where the scientist is the first, last, or sole author.
  • ncsfl: The number of citations received for publications where the scientist is the first, last, or sole author.
  • c: The composite citation score on which the overall rank is based.
  • npciting: The total number of distinct papers citing the scientist's work.
  • cprat: The ratio of total citations to distinct citing papers.
  • np6021_d: A difference metric related to np6021 publications.
  • nc9621_d: A difference metric related to nc9621 citations.
  • sm-subfield-1: The primary scientific subfield associated with the author.
  • sm-subfield-1-frac: The fraction of the author's publications within their primary subfield.
  • sm-subfield-2: The secondary scientific subfield associated with the author.
  • sm-subfield-2-frac: The fraction of the author's publications within their secondary subfield.
  • sm-field: The broad scientific field associated with the author (e.g., Clinical Medicine, Physics & Astronomy).
  • sm-field-frac: The fraction of the author's publications within their broad scientific field.
  • rank sm-subfield-1: The rank of the scientist within their primary subfield.
  • rank sm-subfield-1 (ns): The rank of the scientist within their primary subfield, with self-citations excluded.
  • sm-subfield-1 count: The total count of authors within the scientist's primary subfield.
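
As a minimal sketch of how these columns might be read, the example below parses records with Python's standard csv module. It assumes the CSV header names match the column names listed above exactly, which is not verified here. The sample rows, their values, and the helper name load_scientists are made up for illustration.

```python
import csv
import io

# Hypothetical sample with a subset of the columns; all values are invented.
SAMPLE_CSV = """authfull,inst_name,cntry,rank,h21,self%
"Doe, Jane",Example University,usa,1,120,5.2
"Roe, Richard",Sample Institute,gbr,2,95,
"""

def load_scientists(handle):
    """Parse records, converting selected numeric columns where present."""
    records = []
    for row in csv.DictReader(handle):
        # Empty strings are treated as missing values.
        row["rank"] = int(row["rank"]) if row["rank"] else None
        row["h21"] = float(row["h21"]) if row["h21"] else None
        row["self%"] = float(row["self%"]) if row["self%"] else None
        records.append(row)
    return records

records = load_scientists(io.StringIO(SAMPLE_CSV))
print(records[0]["authfull"], records[0]["rank"])
```

For the real file, the same function could be given a handle from open("Top_scientists_2023.csv", newline="", encoding="utf-8"); the encoding is an assumption, since the listing does not state it.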

Distribution

The dataset is distributed as a single CSV file (Top_scientists_2023.csv) containing 217,097 records, one per scientist. Each record has 46 columns, 44 of which are described in the Columns list above. The size of the 2023 file is not stated; the previous 2021 version was approximately 80.43 MB, which suggests a similar size. Most columns contain 195,000 valid entries, roughly 90% of the 217,097 records, so a typical column is missing about one value in ten.
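
The completeness figure can be checked per column once the file is downloaded. The sketch below counts non-empty values for each column using only the standard library. The sample data and the function name completeness are hypothetical, and treating empty or whitespace-only strings as missing is an assumption about how gaps are encoded in the CSV.

```python
import csv
import io
from collections import Counter

# Hypothetical sample; the second and third rows each have one empty cell.
SAMPLE_CSV = """authfull,inst_name,cntry
"Doe, Jane",Example University,usa
"Roe, Richard",,gbr
"Poe, Alex",Sample Institute,
"""

def completeness(handle):
    """Return {column: fraction of rows holding a non-empty value}."""
    reader = csv.DictReader(handle)
    filled = Counter()
    total = 0
    for row in reader:
        total += 1
        for column, value in row.items():
            # Skip overflow fields (column is None) and short rows (value is None).
            if column is not None and isinstance(value, str) and value.strip():
                filled[column] += 1
    if total == 0:
        return {}
    return {column: filled[column] / total for column in reader.fieldnames}

result = completeness(io.StringIO(SAMPLE_CSV))
for column, fraction in result.items():
    print(f"{column}: {fraction:.1%}")
```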

Usage

This dataset is ideal for bibliometric research, for performing trend analysis in scientific publishing, for identifying influential researchers and institutions, and for academic performance benchmarking. It can also support studies on scientific collaboration patterns and career trajectories within various fields.
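
As one illustration of identifying influential researchers and institutions, the sketch below counts listed scientists per institution and picks the best-ranked scientist in each broad field. The rows are invented and shaped like csv.DictReader output, the function names are hypothetical, and the code assumes that a lower rank value means a higher position.

```python
from collections import Counter

# Hypothetical rows, shaped like the output of csv.DictReader.
rows = [
    {"authfull": "Doe, Jane", "inst_name": "Example University",
     "sm-field": "Physics & Astronomy", "rank": "3"},
    {"authfull": "Roe, Richard", "inst_name": "Example University",
     "sm-field": "Clinical Medicine", "rank": "1"},
    {"authfull": "Poe, Alex", "inst_name": "Sample Institute",
     "sm-field": "Clinical Medicine", "rank": "2"},
]

def top_institutions(rows, n=10):
    """Return the n institutions with the most listed scientists."""
    counts = Counter(row["inst_name"] for row in rows if row["inst_name"])
    return counts.most_common(n)

def best_ranked_by_field(rows):
    """Return {field: author name} for the lowest rank value in each field."""
    best = {}
    for row in rows:
        field = row["sm-field"]
        if field not in best or int(row["rank"]) < int(best[field]["rank"]):
            best[field] = row
    return {field: row["authfull"] for field, row in best.items()}

print(top_institutions(rows, n=1))
print(best_ranked_by_field(rows))
```

Counting listed scientists is only one possible proxy for institutional influence; summing or averaging a citation metric per institution would follow the same pattern.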

Coverage

The dataset offers global coverage, featuring scientists from 163 unique countries. It reflects data available up to and including 2023, with career publication records extending as far back as 1788. The citation metrics nc9621, h21, and hm21, however, run only up to 2021. Geographically, the United States accounts for 40% of the listed scientists and the United Kingdom for 9%.
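
The country figures above can be reproduced from the cntry column. The sketch below computes each country's share of the listed scientists. The rows and country codes are invented, and the function name country_shares is hypothetical; the actual encoding of country values in the file is not stated in this listing.

```python
from collections import Counter

# Hypothetical rows; country values are invented placeholders.
rows = [
    {"authfull": "Doe, Jane", "cntry": "usa"},
    {"authfull": "Roe, Richard", "cntry": "usa"},
    {"authfull": "Poe, Alex", "cntry": "gbr"},
    {"authfull": "Moe, Sam", "cntry": "deu"},
    {"authfull": "Loe, Kim", "cntry": "jpn"},
]

def country_shares(rows):
    """Return {country: percentage of rows}, ordered from largest share down."""
    counts = Counter(row["cntry"] for row in rows if row["cntry"])
    total = sum(counts.values())
    return {country: 100 * n / total for country, n in counts.most_common()}

shares = country_shares(rows)
print(len(shares), "unique countries")
for country, share in shares.items():
    print(f"{country}: {share:.0f}%")
```

Rows with an empty cntry value are left out of both the counts and the total, so the shares describe only scientists with a known country.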

License

Attribution-NonCommercial-ShareAlike 3.0 IGO (CC BY-NC-SA 3.0 IGO)

Who Can Use It

  • Researchers: For in-depth analysis of scientific output, impact, and evolving trends across diverse fields and geographical regions.
  • Academic Institutions: For benchmarking their faculty's research performance against international and national peers.
  • Data Scientists: For developing predictive models or creating visualisations related to scientific influence and productivity.
  • Policy Makers: To inform strategic funding decisions and assess national and global research strengths and priorities.

Dataset Name Suggestions

  • Global Top Scientists Citation Metrics 2023
  • Highly Cited Researchers Database
  • Science-Wide Author Citation Indicators
  • 2023 Most Influential Scientists
  • Academic Impact Ranking Dataset

Listing Stats

  • Views: 0
  • Downloads: 0
  • Listed: 31/08/2025
  • Region: Global
  • Universal Data Quality Score (UDQS): 5 / 5
  • Version: 1.0
