Global Phylogenetic Tree Structure

Data Science and Analytics

Tags and Keywords

Phylogeny

Evolution

Species

Taxonomy

Genetics

Free

About

The Phylogenetic Tree of Life on Earth illustrates the evolutionary relationships and kinship among biological species. This dataset provides a structured representation of the world's biodiversity, detailing 35,960 species and the links between them. Because it is stored as nodes and links, it loads naturally into a graph database. The content was developed through a seventeen-year collaboration among biologists worldwide, and every submission was peer reviewed before being incorporated into the tree.

Columns

The product is delivered across two main files:
Species File (treeoflife_nodes.csv)
  • node_id: A numerical identifier unique to the species or taxon within the tree structure.
  • node_name: The biological name of the species or group, or 'none' if the name is currently unknown.
  • child_nodes: The number of direct child nodes of this node.
  • leaf_node: A binary indicator of whether the node is a terminal leaf in the phylogenetic tree.
  • tolorg_link: Indicates whether a descriptive page for this species exists on the tolweb.org website.
  • extinct: Designates the living status of the species (0 for living, 1 for extinct species).
  • confidence: Specifies the certainty of the species' placement in the tree structure (0=confident, 1=problematic, 2=unspecified position).
  • phylesis: Defines the monophyly status (0=monophyletic, 1=uncertain monophyly, 2=not monophyletic).
Tree Links File (treeoflife_links.csv)
  • source_node_id: The identifier for the ancestor or source node in an evolutionary relationship.
  • target_node_id: The identifier for the descendant or target node in an evolutionary relationship.
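
As a rough illustration of how the two files fit together, the sketch below reads both layouts with Python's standard `csv` module and builds a parent-to-children map. It assumes each file has a header row using the column names listed above and that every column except node_name holds integers; neither assumption is confirmed by the listing. The sample rows and the helper names `load_nodes` and `load_children` are made up for the example; with the real data, open treeoflife_nodes.csv and treeoflife_links.csv in place of the in-memory strings.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows in the documented column layout (not real data).
NODES_CSV = """node_id,node_name,child_nodes,leaf_node,tolorg_link,extinct,confidence,phylesis
1,Root,2,0,1,0,0,0
2,Clade A,0,1,1,0,0,0
3,Clade B,1,0,1,0,0,0
4,none,0,1,0,1,1,0
"""
LINKS_CSV = """source_node_id,target_node_id
1,2
1,3
3,4
"""

def load_nodes(handle):
    """Return {node_id: row}, casting every column except node_name to int."""
    nodes = {}
    for row in csv.DictReader(handle):
        node = {k: (v if k == "node_name" else int(v)) for k, v in row.items()}
        nodes[node["node_id"]] = node
    return nodes

def load_children(handle):
    """Return {source_node_id: [target_node_id, ...]} from the links file."""
    children = defaultdict(list)
    for row in csv.DictReader(handle):
        children[int(row["source_node_id"])].append(int(row["target_node_id"]))
    return children

nodes = load_nodes(io.StringIO(NODES_CSV))
children = load_children(io.StringIO(LINKS_CSV))

# Cross-check the documented child_nodes column against the links file.
mismatches = [
    node_id for node_id, node in nodes.items()
    if node["child_nodes"] != len(children.get(node_id, []))
]
```

If the column semantics are as described, `mismatches` should come out empty on the real files; a non-empty list would point to rows worth inspecting before loading the data into a graph database.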

Distribution

The data is hierarchical: a branching tree of 35,960 species nodes that maps directly onto a graph database. The organisation follows standard taxonomic ranks, from the most specific to the most general: species, genus, family, order, class, phylum, kingdom, domain, and finally life. The data is delivered in CSV format. The tree links file is approximately 410.58 kB and contains more than 78,000 records.
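
The specific-to-general ordering can be followed programmatically by walking from any node up to the root. The sketch below is one way to do that, assuming each node appears as a target_node_id at most once (a strict tree); if the real links file lists a target more than once, the reverse map keeps only the last source seen. The link pairs and the `lineage` helper are hypothetical.

```python
# Hypothetical (source_node_id, target_node_id) pairs standing in for
# rows of treeoflife_links.csv.
links = [(1, 2), (1, 3), (3, 4), (3, 5)]

# Reverse map: child -> parent. Assumes a single parent per node.
parent = {target: source for source, target in links}

def lineage(node_id):
    """Return the path from node_id up to the root, most specific first."""
    path = [node_id]
    seen = {node_id}
    while path[-1] in parent:
        ancestor = parent[path[-1]]
        if ancestor in seen:
            # Guard against malformed input that would loop forever.
            raise ValueError(f"cycle detected at node {ancestor}")
        seen.add(ancestor)
        path.append(ancestor)
    return path

print(lineage(5))  # [5, 3, 1]
```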

Usage

This dataset is suited to building and analysing phylogenetic graphs and to visualising species relationships. It applies to studies in evolutionary biology, taxonomy, and genetics. Specific use cases include mapping species divergence, identifying patterns of evolutionary change, and exploring classification confidence across different biological groups.
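
One of the use cases above, exploring classification confidence across groups, can be sketched as a breadth-first walk that tallies the confidence codes inside a subtree. The data, the `LABELS` mapping (taken from the confidence column description), and the `confidence_profile` helper are illustrative assumptions rather than part of the dataset.

```python
from collections import Counter, deque

# Hypothetical data: node_id -> confidence code, and parent -> children.
confidence = {1: 0, 2: 0, 3: 1, 4: 2, 5: 0}
children = {1: [2, 3], 3: [4, 5]}

# Code meanings as given for the confidence column.
LABELS = {0: "confident", 1: "problematic", 2: "unspecified position"}

def confidence_profile(root_id):
    """Count confidence labels for root_id and all of its descendants."""
    counts = Counter()
    queue = deque([root_id])
    while queue:
        node_id = queue.popleft()
        counts[LABELS[confidence[node_id]]] += 1
        queue.extend(children.get(node_id, []))
    return counts

whole_tree = confidence_profile(1)
subtree = confidence_profile(3)
```

Comparing `whole_tree` with `subtree` shows whether uncertain placements concentrate in one group; the same walk works for the extinct or phylesis columns by swapping the lookup table.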

Coverage

The scope covers the known Phylogenetic Tree of Life on Earth, spanning all biological kingdoms and domains. The data includes both currently living species and those designated as extinct. The content was derived from scientific collaboration completed up to 2007 and is not scheduled for future updates.

License

Attribution 3.0 Unported (CC BY 3.0)

Who Can Use It

  • Evolutionary Biologists: To study diversification rates, evolutionary history, and species kinship.
  • Data Scientists/Graph Analysts: To test complex network algorithms and build large-scale relational models.
  • Educators and Students: For teaching fundamental principles of taxonomy, phylogeny, and biological classification.

Dataset Name Suggestions

  • Global Phylogenetic Tree Structure
  • Earth Species Evolutionary Kinship Map
  • Tree of Life Taxonomy and Linkages

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

29/11/2025

REGION

GLOBAL

QUALITY (Universal Data Quality Score, UDQS)

5 / 5

VERSION

1.0
