Opendatabay APP

AmbitionBox Scraped Business Insights

Data Science and Analytics

Tags and Keywords

Business

Recruitment

India

Ratings

Corporate

Trusted By
Trusted by company1Trusted by company2Trusted by company3
AmbitionBox Scraped Business Insights Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

Captured via web scraping techniques, this data archive provides a detailed overview of the corporate landscape in India, specifically curated to facilitate the development of machine learning job recommendation models. The collection encompasses key variables regarding approximately 46,000 companies, offering insights into industry presence, workforce scale, and public perception through ratings and reviews. This resource serves as a foundational element for analyzing employment trends and business characteristics within the Indian market.

Columns

  • Index: The numerical index representing the row number for the record.
  • company_name: The specific name of the organisation (e.g., TCS, Accenture).
  • rating: A numerical score on a scale of 1 to 5 representing the company's rating (Mean value: 3.96).
  • company_reviews: The total count of reviews submitted for the company (e.g., 19.8k Reviews).
  • company_age: The age of the company expressed in years since its establishment (e.g., 29 years old).
  • number_of_employees: A categorical range indicating the size of the workforce (e.g., 1 Lakh+ Employees (India)).

Distribution

  • Format: CSV (Companies_Information.csv)
  • Size: 3.55 MB
  • Rows: Approximately 46,034 records
  • Columns: 6 columns
  • Structure: The dataset maintains a 100% validity rate with zero mismatched or missing values across all columns.

Usage

  • Job Recommendation Systems: Training algorithms to suggest workplaces based on company profiles and ratings.
  • Feature Engineering: Developing predictive models for business analytics.
  • Market Analysis: Assessing the distribution of large-scale employers versus smaller entities in India.
  • Sentiment Analysis: Evaluating corporate reputation based on aggregated ratings and review volumes.

Coverage

  • Geographic Scope: India.
  • Demographic Scope: Covers a wide range of companies, with a significant portion (55%) identified as having over 1 Lakh employees.
  • Entity Types: Includes major corporations like TCS and Accenture alongside roughly 40,000 other entities.

License

CC0: Public Domain

Who Can Use It

  • Data Scientists: For building and testing recommender systems and classification models.
  • Business Analysts: For conducting competitive research and sector analysis.
  • Recruitment Specialists: For understanding market standards regarding company age and size.
  • Researchers: For academic studies on the Indian corporate ecosystem.

Dataset Name Suggestions

  • Indian Companies Ratings and Workforce Data
  • 46k Indian Corporate Profiles for Machine Learning
  • AmbitionBox Scraped Business Insights
  • India Job Market and Company Statistics

Attributes

Listing Stats

VIEWS

3

DOWNLOADS

0

LISTED

07/12/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Loading...

Free

Download Dataset in CSV Format