Global Roller Coaster Directory
NLP / Natural Language Processing
Free
About
This dataset provides detailed information on over 1,000 roller coasters, covering their design, operational characteristics, and historical context. The data was originally scraped from Wikipedia and is useful for anyone interested in the mechanics, history, and geographical distribution of roller coasters.
Columns
- coaster_name: The name of the roller coaster.
- Length: The length of the coaster, expressed in feet or metres (raw text).
- Speed: The speed of the coaster, typically in miles per hour or kilometres per hour (raw text).
- Location: The specific location or amusement park where the roller coaster is situated.
- Status: The current operating status of the roller coaster (e.g., Operating).
- Opening date: The original opening date of the roller coaster (raw text).
- Type: The primary material used for the coaster's construction (e.g., Steel, Wood).
- Manufacturer: The company that manufactured the roller coaster.
- Height restriction: Any height limitations for riders.
- Model: The specific model of the roller coaster.
- Height: The height of the coaster (raw text field).
- Inversions: The number of inversions the roller coaster features (raw text field).
- Lift/launch system: Details on how the coaster gains its initial momentum (e.g., Chain lift hill).
- Cost: The approximate cost of building the roller coaster.
- Trains: Information regarding the number of trains and their passenger capacity.
- Park section: The specific section of the park where the coaster is located.
- Duration: The typical duration of the ride.
- Capacity: The hourly rider capacity of the roller coaster.
- G-force: The maximum G-force experienced on the coaster (raw text).
- Designer: The individual or team responsible for designing the coaster.
- Max vertical angle: The steepest vertical angle of the coaster's drop.
- Drop: The height of the coaster's main drop (raw text).
- Soft opening date: The primary soft opening date for the coaster.
- Fast Lane available: Indicates if a Fast Lane (or similar express pass) is available.
- Replaced: Details if the coaster replaced a previous attraction.
- Track layout: The specific type of track layout (e.g., Twister).
- Fastrack available: Indicates if Fastrack (or similar express pass) is available.
- Soft opening date.1: A second soft opening date column, if applicable. The .1 suffix suggests a duplicated column name in the scraped source.
- Closing date: The date the coaster ceased operations, if applicable.
- Opened: Raw text field for the opening date.
- Replaced by: Indicates if the coaster was replaced by another attraction.
- Website: The official website for the roller coaster, if available.
- Flash Pass Available: Indicates if a Flash Pass (or similar express pass) is available.
- Must transfer from wheelchair: Indicates if riders in wheelchairs must transfer to the ride vehicle.
- Theme: The specific theme or concept of the roller coaster.
- Single rider line available: Indicates if a single rider queue is available.
- Restraint Style: The raw text describing the type of restraints used.
- Flash Pass available: A second Flash Pass availability field; its name differs from Flash Pass Available only in capitalisation.
- Acceleration: The acceleration characteristics of the coaster (raw text).
- Restraints: A more general description of the restraint types.
- Name: A secondary name for the roller coaster.
- year_introduced: The year the roller coaster was first introduced.
- latitude: The latitude coordinate of the coaster's location.
- longitude: The longitude coordinate of the coaster's location.
- Type_Main: The main categorised type of material for the coaster (e.g., Steel, Wood).
- opening_date_clean: A cleaned version of the opening date.
- speed1: The primary speed value (raw text).
- speed2: The secondary speed value (raw text).
- speed1_value: The numerical value of the primary speed.
- speed1_unit: The unit of measurement for the primary speed (e.g., mph, km/h).
- speed_mph: The cleaned speed value converted to miles per hour.
- height_value: The numerical value of the raw height.
- height_unit: The unit of measurement for the raw height (e.g., ft, m).
- height_ft: The cleaned height value converted to feet.
- Inversions_clean: The cleaned numerical value for the number of inversions.
- Gforce_clean: The cleaned numerical G-force value in Gs.
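The derived columns (speed1_value, speed1_unit, speed_mph) suggest a parse-then-convert step over the raw text fields. The Python sketch below shows one way such cleaning could work. The raw format it expects (a number followed by mph or km/h) and the helper name parse_speed are assumptions for illustration, not the method actually used to build the dataset.

```python
import re

# Assumed raw format: a number followed by a unit, e.g. "50 mph (80 km/h)".
_SPEED_RE = re.compile(r"(\d+(?:\.\d+)?)\s*(mph|km/h)", re.IGNORECASE)
KMH_PER_MPH = 1.609344  # 1 mile is exactly 1.609344 km


def parse_speed(raw):
    """Return (value, unit, mph) for the first speed found, or None."""
    match = _SPEED_RE.search(raw or "")
    if match is None:
        return None
    value = float(match.group(1))
    unit = match.group(2).lower()
    mph = value if unit == "mph" else value / KMH_PER_MPH
    return value, unit, round(mph, 1)


print(parse_speed("50 mph (80 km/h)"))  # first value is already in mph
print(parse_speed("100 km/h"))          # converted from km/h
print(parse_speed("unknown"))           # no recognisable speed
```

The same pattern (extract a value, extract a unit, convert to one target unit) would apply to the height columns, with feet and metres in place of mph and km/h.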
Distribution
This dataset is provided as a single CSV file (coaster_db.csv) of approximately 457.43 KB. It contains 1,087 records and 56 columns describing the attributes of each ride.
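As a starting point, the file can be read with Python's standard csv module. The sketch below is a minimal loader; the function name load_coasters is hypothetical, the UTF-8 encoding mentioned in the comment is an assumption, and the in-memory sample uses made-up rows with only a few of the documented column names.

```python
import csv
import io


def load_coasters(source):
    """Read coaster rows from an open text file into a list of dicts."""
    reader = csv.DictReader(source)
    rows = list(reader)
    return reader.fieldnames or [], rows


# Tiny in-memory stand-in with made-up rows. For the real file, pass
# open("coaster_db.csv", newline="", encoding="utf-8") instead
# (the encoding is an assumption).
sample = io.StringIO(
    "coaster_name,Location,year_introduced,speed_mph\n"
    "Example A,Example Park,1999,60.0\n"
    "Example B,Other Park,2005,\n"
)
columns, rows = load_coasters(sample)
print(len(columns), "columns,", len(rows), "records")
```

Run against the full file, the two counts should match the column and record totals stated above. Empty cells come back as empty strings, so numeric fields need explicit conversion before analysis.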
Usage
This dataset is ideal for:
- Analysing trends in roller coaster design, such as increasing speeds, heights, or number of inversions over time.
- Mapping roller coaster locations globally to identify regional concentrations or distribution patterns.
- Researching the historical evolution of amusement ride technology and engineering.
- Developing applications or visualisations related to theme park attractions.
- Understanding the operational lifespans and common features of various roller coaster models.
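The trend analysis mentioned above could begin with a simple aggregation. The following sketch averages speed_mph per decade of year_introduced. The function name is hypothetical, and the sample rows are made up for illustration rather than taken from the dataset.

```python
import math
from collections import defaultdict
from statistics import mean


def mean_speed_by_decade(rows):
    """Average speed_mph per decade of year_introduced, skipping blanks."""
    by_decade = defaultdict(list)
    for row in rows:
        try:
            year = int(float(row["year_introduced"]))
            speed = float(row["speed_mph"])
        except (KeyError, TypeError, ValueError, OverflowError):
            continue  # missing or non-numeric values are ignored
        if math.isnan(speed):
            continue
        by_decade[year // 10 * 10].append(speed)
    return {d: round(mean(v), 1) for d, v in sorted(by_decade.items())}


# Made-up rows for illustration only; not values from the dataset.
rows = [
    {"year_introduced": "1995", "speed_mph": "50"},
    {"year_introduced": "1999", "speed_mph": "60"},
    {"year_introduced": "2003", "speed_mph": "80"},
    {"year_introduced": "2010", "speed_mph": ""},
]
print(mean_speed_by_decade(rows))
```

Swapping speed_mph for height_ft or Inversions_clean would give the corresponding height or inversion trend.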
Coverage
The dataset is global in geographic scope, with latitude and longitude available for a significant portion of the records. Specific parks, such as Cedar Point, are named. The data spans 1884 to 2022, covering well over a century of roller coaster history, with opening and closing dates included where available. Although the dataset contains no demographic information, fields such as height restrictions and wheelchair transfer requirements touch on accessibility.
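Because coordinates are present for only part of the records, it can help to measure coverage before mapping. The sketch below counts rows with usable latitude and longitude values and reports the span of year_introduced. The function name and the sample rows are hypothetical.

```python
def coverage_summary(rows):
    """Count rows with usable coordinates and report the year span."""
    with_coords = 0
    years = []
    for row in rows:
        try:
            lat = float(row.get("latitude") or "nan")
            lon = float(row.get("longitude") or "nan")
        except ValueError:
            lat = lon = float("nan")
        # Range checks are False for NaN, so blank cells are excluded.
        if -90 <= lat <= 90 and -180 <= lon <= 180:
            with_coords += 1
        try:
            years.append(int(float(row.get("year_introduced") or "nan")))
        except (ValueError, OverflowError):
            pass  # blank or non-numeric year
    span = (min(years), max(years)) if years else None
    return {"rows": len(rows), "with_coords": with_coords, "year_span": span}


# Made-up rows for illustration only; not values from the dataset.
rows = [
    {"latitude": "41.5", "longitude": "-82.7", "year_introduced": "1950"},
    {"latitude": "", "longitude": "", "year_introduced": "2001"},
    {"latitude": "n/a", "longitude": "10", "year_introduced": ""},
]
print(coverage_summary(rows))
```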
License
CC0: Public Domain
Who Can Use It
- Theme Park Enthusiasts: To deepen their knowledge of specific rides or explore the characteristics of roller coasters worldwide.
- Data Analysts and Scientists: For performing statistical analysis, identifying correlations between features, or building predictive models related to roller coaster characteristics.
- Engineering Students and Researchers: To study structural design, mechanics, and innovation in amusement ride engineering.
- Travel and Tourism Professionals: For market analysis, identifying popular attractions, or developing travel guides.
- Educational Institutions: As a practical dataset for teaching data analysis, mapping, or historical trends.
Dataset Name Suggestions
- Global Roller Coaster Directory
- Ultimate Roller Coaster Metrics
- Worldwide Thrill Ride Data
- Amusement Ride Statistics
- Historic Roller Coaster Archive
Attributes
Original Data Source: Global Roller Coaster Directory