Opendatabay APP

North American Bird Vocalisation Data

Data Science and Analytics

Tags and Keywords

Birds

United

States

Pollution

Biology

Nature

Trusted By
Trusted by company1Trusted by company2Trusted by company3
North American Bird Vocalisation Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Bird Songs Recordings from United States offers detailed metadata captured about avian acoustic records sourced from Xeno-Canto. This resource focuses exclusively on information related to birds found within the United States. You can use this data to explore the remarkable landscape of bird recording, facilitating analysis of species migration and species evolution patterns. Furthermore, based on frequency of recordings in different locales, users can investigate the impact of various factors like air pollution, as birds often serve as an early biological indicator of environmental quality. The dataset supports the discovery of interesting facts regarding bird distribution and characteristics within the rich diversity of North America regions.

Columns

The dataset is structured across 26 fields, providing fine-grained details for each record:
  • id: A unique identification number assigned to the specific recording.
  • gen: The recorded bird's genus (e.g., Setophaga).
  • sp: The recorded bird's species.
  • ssp: Subspecies classification, which is often missing for many records.
  • en: The common English name of the species (e.g., Song Sparrow).
  • rec: The name of the individual who created the recording (e.g., Paul Marvin).
  • cnt: The country where the recording occurred, which is strictly "United States".
  • loc: Specific location details for the recording site (e.g., Portal, Arizona).
  • lat: The latitude coordinate of the recording location.
  • lng: The longitude coordinate of the recording location.
  • alt: The altitude, measured in metres, where the recording was made.
  • type: The category of vocalisation recorded, such as "song" or "call".
  • url: A web link to the corresponding recording page on the source platform.
  • file: A direct download link for the associated audio file.
  • file-name: The title of the audio file (e.g., TuftedTitmouse.mp3).
  • sono: Links to various sizes of the sonogram image associated with the recording.
  • lic: The specific license URL governing the use of the content.
  • q: The quality rating assigned to the recording (e.g., A or B).
  • length: The duration or period of the recording.
  • time: The time of day the recording took place.
  • date: The calendar date the recording was made.
  • uploaded: The date the recording metadata was uploaded to the platform.
  • also: Notes detailing other species potentially present in the recording (e.g., Agelaius phoeniceus).
  • rmk: General remarks or annotations regarding the recording (e.g., Recording amplified).
  • bird-seen: A binary indicator stating if the bird was visually confirmed at the time of recording (yes/no).
  • playback-used: A field indicating if external playback was employed during the recording process (yes/no/unknown).

Distribution

The primary dataset file is titled birds_united_states.csv. This resource is approximately 44.01 MB in size. The data contains 26 columns and includes over 53,300 unique recording records.

Usage

This data product is highly valuable for several analytical and scientific purposes:
  • Bioacoustics Research: Studying bird vocalisation patterns and analysing recording quality based on the 'type' and 'q' fields.
  • Ecological Monitoring: Assessing species migration, distribution, and population trends based on geographic coordinates and dates.
  • Environmental Studies: Investigating the links between bird species evolution or characteristics and external factors, such as evaluating the influence of air pollution.
  • Geospatial Analysis: Mapping species occurrences across different locations, latitudes, longitudes, and altitudes within the United States.

Coverage

The dataset's geographic focus is strictly limited to bird recordings originating from the United States. The latitude coordinates range from -14.3 to 71.4, and longitude values span from -177 to 177. Recording metadata was uploaded between November 20, 2008, and August 1, 2020. The records cover 4,665 unique dates and 8,601 distinct recording locations.

License

CC0: Public Domain

Who Can Use It

Intended users include:
  • Biologists and Ecologists: For studying avian behaviour, population dynamics, and distribution.
  • Data Scientists and Machine Learning Engineers: To develop models for automated bird recognition or classification using the acoustic metadata.
  • Environmental Scientists: For assessing biological indicators of environmental quality, particularly pollution impacts.
  • General Researchers: Anyone seeking detailed, geographically tagged metadata concerning North American avian populations.

Dataset Name Suggestions

  • US Avian Acoustic Metadata
  • Xeno-Canto US Bird Recordings
  • North American Bird Vocalisation Data
  • United States Wildlife Sound Records

Attributes

Listing Stats

VIEWS

4

DOWNLOADS

0

LISTED

12/12/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in CSV Format