Global Top Scientists Citation Metrics
Education & Learning Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset lists the 217,097 most cited scientists, based on data available up to and including 2023. It provides a standardised database of author citation metrics, annotated by scientific field, offering valuable insights into global scientific influence. The data is derived from methodology described in publications by John P. A. Ioannidis, Jeroen Baas, Richard Klavans, and Kevin W. Boyack.
Columns
authfull
: The full name of the author.inst_name
: The name of the author's primary institution.cntry
: The country of the author's primary institution.np6021
: A metric related to publications.lastyr
: The last year represented in the author's career data or when their data was last updated.rank (ns)
: The normalised rank of the scientist globally.nc9621 (ns)
: The normalised number of citations accumulated up to 2021.h21 (ns)
: The normalised h-index of the scientist up to 2021.hm21 (ns)
: The normalised hm-index of the scientist up to 2021.nps (ns)
: The normalised number of publications where the scientist is the sole author.ncs (ns)
: The normalised number of citations received for publications where the scientist is the sole author.cpsf (ns)
: The normalised citations per first or single author paper.ncsf (ns)
: The normalised number of citations received for publications where the scientist is the first or single author.npsfl (ns)
: The normalised number of publications where the scientist is the first, last, or sole author.ncsfl (ns)
: The normalised number of citations received for publications where the scientist is the first, last, or sole author.c (ns)
: A normalised score, potentially indicating career stage or overall impact.npciting (ns)
: The normalised number of papers citing the scientist's work.cprat (ns)
: The normalised citation ratio.np6021 cited9621 (ns)
: The normalised number of publications cited since 1996 for authors with at least 60 publications up to 2021.self%
: The percentage of self-citations.rank
: The overall rank of the scientist.nc9621
: The total number of citations accumulated up to 2021.h21
: The h-index of the scientist up to 2021.hm21
: The hm-index of the scientist up to 2021.nps
: The number of publications where the scientist is the sole author.ncs
: The number of citations received for publications where the scientist is the sole author.cpsf
: The citations per first or single author paper.ncsf
: The number of citations received for publications where the scientist is the first or single author.npsfl
: The number of publications where the scientist is the first, last, or sole author.ncsfl
: The number of citations received for publications where the scientist is the first, last, or sole author.c
: A score, potentially indicating career stage or overall impact.npciting
: The total number of papers citing the scientist's work.cprat
: The citation ratio.np6021_d
: A difference metric related tonp6021
publications.nc9621_d
: A difference metric related tonc9621
citations.sm-subfield-1
: The primary scientific subfield associated with the author.sm-subfield-1-frac
: The fraction of the author's publications within their primary subfield.sm-subfield-2
: The secondary scientific subfield associated with the author.sm-subfield-2-frac
: The fraction of the author's publications within their secondary subfield.sm-field
: The broad scientific field associated with the author (e.g., Clinical Medicine, Physics & Astronomy).sm-field-frac
: The fraction of the author's publications within their broad scientific field.rank sm-subfield-1
: The rank of the scientist within their primary subfield.rank sm-subfield-1 (ns)
: The normalised rank of the scientist within their primary subfield.sm-subfield-1 count
: The total count of authors within the scientist's primary subfield.
Distribution
The dataset is presented in a CSV file format (Top_scientists_2023.csv). It contains 217,097 records, each detailing a scientist's profile. There are 46 columns available for each record, providing a rich array of information. The previous 2021 version of this dataset was approximately 80.43 MB, suggesting a similar file size for the 2023 data. Most columns contain 195,000 valid entries, indicating high data completeness with minimal missing values.
Usage
This dataset is ideal for bibliometric research, for performing trend analysis in scientific publishing, for identifying influential researchers and institutions, and for academic performance benchmarking. It can also support studies on scientific collaboration patterns and career trajectories within various fields.
Coverage
The dataset offers global coverage, featuring scientists from 163 unique countries. The data includes information current up to and including 2023, with career publication records extending as far back as 1788. Specific citation metrics like
nc9621
, h21
, and hm21
are updated up to 2021. Geographically, the United States accounts for 40% of the listed scientists, while the United Kingdom represents 9% of the entries.License
Attribution-NonCommercial-ShareAlike 3.0 IGO (CC BY-NC-SA 3.0 IGO)
Who Can Use It
- Researchers: For in-depth analysis of scientific output, impact, and evolving trends across diverse fields and geographical regions.
- Academic Institutions: For benchmarking their faculty's research performance against international and national peers.
- Data Scientists: For developing predictive models or creating visualisations related to scientific influence and productivity.
- Policy Makers: To inform strategic funding decisions and assess national and global research strengths and priorities.
Dataset Name Suggestions
- Global Top Scientists Citation Metrics 2023
- Highly Cited Researchers Database
- Science-Wide Author Citation Indicators
- 2023 Most Influential Scientists
- Academic Impact Ranking Dataset
Attributes
Original Data Source: Global Top Scientists Citation Metrics