NLP Chatbot Intent Dataset
Data Science and Analytics
Tags and Keywords
Trusted By




"No reviews yet"
Free
About
This dataset is designed for training and evaluating natural language processing models in intent classification tasks. It contains 1000 synthetic chatbot user inputs, each labelled with its corresponding user intent. It is ideal for developing classifiers to detect user intent and building end-to-end chatbot systems, allowing for experimentation with various NLP techniques like BERT or Naive Bayes.
Columns
user_input
: A short message sent by a chatbot user.intent
: The identified user intent, such asorder_status
orcancellation
.
Distribution
This dataset comprises 1000 synthetic chatbot user inputs. The data is typically available in CSV format. Each user input is clearly labelled with its intent. For example, some intents like 'payment_update' account for 12% of the messages, and 'business_hours' for 22%, with 'Other' making up a larger proportion (77% or 66% in samples).
Usage
This dataset is suitable for:
- Training classifiers to identify user intent.
- Developing end-to-end chatbot systems.
- Experimenting with various natural language processing algorithms, including BERT, Logistic Regression, and Naive Bayes.
- Enhancing the natural language understanding capabilities of chatbot systems.
Coverage
The dataset covers a global region and is provided in English.
License
CC BY-SA
Who Can Use It
This dataset is intended for data scientists, analysts, and developers working on natural language processing projects. It is particularly useful for those looking to:
- Train and evaluate models for intent classification.
- Build and improve chatbot systems.
- Conduct research or develop applications in the field of natural language understanding.
Dataset Name Suggestions
- Chatbot Intent Classification Dataset
- Synthetic Chatbot User Intents
- NLP Chatbot Intent Data
- User Intent Classification for Chatbots
Attributes
Original Data Source: Chatbot Intent Classification Dataset