Venture Capital & Startup Landscape Data
Education & Learning Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset offers insights into the trends of US startup companies from 2008 onwards, providing valuable information for analysing the dynamic startup ecosystem. Its purpose is to facilitate the study of startup growth over time, track industry-specific company numbers, compare companies and industries, and pinpoint geographical areas with a high concentration of startups for recruitment. It can also be used to map the development cycle of a startup from its initial idea to a successful exit.
Columns
The dataset includes information from two main files:
company.csv
and ycombinator.csv
.From
company.csv
:company_name
: The name of the startup company. (String)link
: A direct link to the company's official website. (String)short_description
: A brief overview or summary of the company. (String)founded
: The year in which the company was established. (Integer)team_size
: The total number of individuals working within the company's team. (Integer)location
: The specific city where the company is based. (String)country
: The country in which the company is located. (String)no_founders
: The count of individuals who founded the company. (Integer)no_company_socials
: The number of social media accounts associated with the company. (Integer)no_tags
: The number of descriptive tags linked to the company. (Integer)
From
ycombinator.csv
:company_name
: The name of the startup company. (String)link
: A direct link to the company's official website. (String)short_description
: A brief overview or summary of the company. (String)no_tags
: The number of descriptive tags linked to the company. (Integer)no_company_socials
: The number of social media accounts associated with the company. (Integer)founded
: The year in which the company was established. (Integer)team_size
: The total number of individuals working within the company's team. (Integer)location
: The specific city where the company is based. (String)
Distribution
The data is typically provided in CSV (Comma Separated Values) format, with sample files available separately on the platform. The
company.csv
file is approximately 634.51 kB and features 12 columns. While exact row counts for the entire dataset are not specified, column validity checks suggest around 1000 records are included for many key fields. The dataset is structured into two distinct files, company.csv
and ycombinator.csv
, allowing for varied analyses.Usage
This dataset is ideal for a range of applications, including:
- Studying the evolving trends of startups over an extended period.
- Tracking the number of startups in various industries and comparing performance across companies and sectors.
- Identifying specific geographical regions with a high concentration of startup activity, particularly useful for recruitment strategies.
- Mapping the typical development cycle of a startup, from its initial concept to a successful exit.
- Conducting market research and competitive analysis within the startup landscape.
Coverage
The dataset primarily focuses on US startup companies, with data availability extending from 2008 onwards. Geographically, the data highlights locations within the USA, with notable concentrations in cities such as San Francisco (32%) and New York (9%). The country distribution shows 38% of companies are solely in the USA, and 28% are "USA; Remote", indicating a strong North American emphasis. Information on data availability for specific groups or years within this timeframe is available through the
founded
column and detailed column statistics. It focuses on company attributes rather than human demographics.License
CC0 1.0 Universal (CC0 1.0) - Public Domain Dedication
Who Can Use It
This dataset is particularly useful for:
- Business Analysts: For market trend analysis and competitor benchmarking.
- Recruitment Professionals: To identify and target high-density startup regions and talent pools.
- Entrepreneurs: For understanding market opportunities and the competitive landscape.
- Academics and Researchers: To study economic development, innovation, and startup dynamics.
- Investors: To gain insights into startup growth, founder characteristics, and team sizes.
Dataset Name Suggestions
- US Startup Ecosystem Analytics
- North American Startup Trends (2008+)
- Startup Company Profiles: USA & Y Combinator
- Venture Capital & Startup Landscape Data
- Post-2008 US Startup Data
Attributes
Original Data Source: Venture Capital & Startup Landscape Data