Opendatabay APP

Venture Capital & Startup Landscape Data

Education & Learning Analytics

Tags and Keywords

Startups

Companies

Us

Trends

Business

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Venture Capital & Startup Landscape Data Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

This dataset offers insights into the trends of US startup companies from 2008 onwards, providing valuable information for analysing the dynamic startup ecosystem. Its purpose is to facilitate the study of startup growth over time, track industry-specific company numbers, compare companies and industries, and pinpoint geographical areas with a high concentration of startups for recruitment. It can also be used to map the development cycle of a startup from its initial idea to a successful exit.

Columns

The dataset includes information from two main files: company.csv and ycombinator.csv.
From company.csv:
  • company_name: The name of the startup company. (String)
  • link: A direct link to the company's official website. (String)
  • short_description: A brief overview or summary of the company. (String)
  • founded: The year in which the company was established. (Integer)
  • team_size: The total number of individuals working within the company's team. (Integer)
  • location: The specific city where the company is based. (String)
  • country: The country in which the company is located. (String)
  • no_founders: The count of individuals who founded the company. (Integer)
  • no_company_socials: The number of social media accounts associated with the company. (Integer)
  • no_tags: The number of descriptive tags linked to the company. (Integer)
From ycombinator.csv:
  • company_name: The name of the startup company. (String)
  • link: A direct link to the company's official website. (String)
  • short_description: A brief overview or summary of the company. (String)
  • no_tags: The number of descriptive tags linked to the company. (Integer)
  • no_company_socials: The number of social media accounts associated with the company. (Integer)
  • founded: The year in which the company was established. (Integer)
  • team_size: The total number of individuals working within the company's team. (Integer)
  • location: The specific city where the company is based. (String)

Distribution

The data is typically provided in CSV (Comma Separated Values) format, with sample files available separately on the platform. The company.csv file is approximately 634.51 kB and features 12 columns. While exact row counts for the entire dataset are not specified, column validity checks suggest around 1000 records are included for many key fields. The dataset is structured into two distinct files, company.csv and ycombinator.csv, allowing for varied analyses.

Usage

This dataset is ideal for a range of applications, including:
  • Studying the evolving trends of startups over an extended period.
  • Tracking the number of startups in various industries and comparing performance across companies and sectors.
  • Identifying specific geographical regions with a high concentration of startup activity, particularly useful for recruitment strategies.
  • Mapping the typical development cycle of a startup, from its initial concept to a successful exit.
  • Conducting market research and competitive analysis within the startup landscape.

Coverage

The dataset primarily focuses on US startup companies, with data availability extending from 2008 onwards. Geographically, the data highlights locations within the USA, with notable concentrations in cities such as San Francisco (32%) and New York (9%). The country distribution shows 38% of companies are solely in the USA, and 28% are "USA; Remote", indicating a strong North American emphasis. Information on data availability for specific groups or years within this timeframe is available through the founded column and detailed column statistics. It focuses on company attributes rather than human demographics.

License

CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication

Who Can Use It

This dataset is particularly useful for:
  • Business Analysts: For market trend analysis and competitor benchmarking.
  • Recruitment Professionals: To identify and target high-density startup regions and talent pools.
  • Entrepreneurs: For understanding market opportunities and the competitive landscape.
  • Academics and Researchers: To study economic development, innovation, and startup dynamics.
  • Investors: To gain insights into startup growth, founder characteristics, and team sizes.

Dataset Name Suggestions

  • US Startup Ecosystem Analytics
  • North American Startup Trends (2008+)
  • Startup Company Profiles: USA & Y Combinator
  • Venture Capital & Startup Landscape Data
  • Post-2008 US Startup Data

Attributes

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

31/08/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free

Download Dataset in ZIP Format