National Housing Market Dataset
Stock & Market Data
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset provides a vast collection of housing listings for sale across the United States, primarily sourced from Craigslist. It aims to overcome the difficulty of consolidating disparate private housing options, enabling detailed experimental analysis on the USA's housing market as a whole, rather than focusing solely on isolated urban areas. It is designed to offer a broad perspective on property availability and characteristics nationwide.
Columns
- id: The unique identifier for each listing.
- url: The direct URL to the original listing.
- region: The specific Craigslist region where the listing was posted.
- region_url: The URL for the Craigslist region.
- price: The monthly rental price for the property.
- type: The type of housing (e.g., apartment, house).
- sqfeet: The total square footage of the property.
- beds: The number of bedrooms in the property.
- baths: The number of bathrooms in the property.
- cats_allowed: A boolean indicator (1 for yes, 0 for no) if cats are permitted.
- dogs_allowed: A boolean indicator (1 for yes, 0 for no) if dogs are permitted.
- smoking_allowed: A boolean indicator (1 for yes, 0 for no) if smoking is allowed.
- wheelchair_access: A boolean indicator (1 for yes, 0 for no) for wheelchair accessibility.
- electric_vehicle_charge: A boolean indicator (1 for yes, 0 for no) if an electric vehicle charger is available.
- comes_furnished: A boolean indicator (1 for yes, 0 for no) if the property comes furnished.
- laundry_options: Details on available laundry facilities.
- parking_options: Details on available parking facilities.
- image_url: The URL for an image associated with the listing.
- description: The textual description provided by the poster.
- lat: The geographical latitude of the listing.
- long: The geographical longitude of the listing.
- state: The state in the US where the listing is located.
Distribution
The dataset is provided as a CSV file named
housing.csv
, with a size of 558.44 MB. It contains 22 columns and approximately 385,000 records, detailing various housing options. Data for latitude and longitude fields are present for around 383,000 records, while laundry and parking options have fewer valid entries, approximately 306,000 and 244,000 respectively.Usage
This dataset is ideal for:
- Experimental analysis of housing markets across the United States.
- Investigating state-level housing trends rather than isolated urban markets.
- Developing real estate pricing models based on various property attributes.
- Studying the availability of amenities like pet-friendliness, parking, and laundry options nationwide.
- Analysing geographical distribution of different housing types and sizes.
Coverage
The dataset covers housing listings throughout the United States, providing geographic coordinates (latitude and longitude) and state information for a broad spatial scope. The data is scraped and updated every few months, with an expected quarterly update frequency, ensuring relatively current information on housing availability. No specific historical time range is detailed for the collected listings, but the collection process is ongoing. The dataset includes listings from 51 unique states and 404 Craigslist regions.
License
CC0: Public Domain
Who Can Use It
- Researchers and academics: For studying real estate economics, urban development, and housing policies.
- Data scientists and analysts: For building predictive models, identifying market trends, and creating visualisations of housing data.
- Real estate professionals: For market research, competitive analysis, and identifying investment opportunities.
- Urban planners: For understanding housing distribution, density, and amenity availability across different regions.
Dataset Name Suggestions
- USA Craigslist Housing Listings
- United States Property Data
- National Housing Market Dataset
- Craigslist US Homes for Sale
- American Housing Listings
Attributes
Original Data Source: National Housing Market Dataset