Opendatabay APP

Stanford Question Answering Dataset (SQuAD)

Education & Learning Analytics

Tags and Keywords

Education

Text

NLP

Text Mining

Text Pre-Processing

Trusted By
Trusted by company1Trusted by company2Trusted by company3
Stanford Question Answering Dataset (SQuAD) Dataset on Opendatabay data marketplace

"No reviews yet"

Free

About

SQuAD is a reading comprehension dataset consisting of questions posed by crowdworkers on a set of Wikipedia articles. The answers to the questions are span of text, or segments, from the corresponding reading passages. The data fields in this dataset are the same across all splits
How to use the dataset The SQuAD dataset is a reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to every question is a segment of text, or span, from the corresponding reading passage. The data fields are the same among all splits
Columns:context,question,answers
To use this dataset, simply download one of the split files (train.csv or validation.csv) and load it into your preferred data analysis tool. Each row in the file corresponds to a single question-answer pair. The context column contains the full text of the corresponding Wikipedia article, while the question and answers columns contain the question posed by the crowdworker and its corresponding answer(s)
Research Ideas Learning to answer multiple choice questions by extracting text spans from source materials Developing Reading Comprehension models that can answer open-ended questions about passages of text Building systems that can generate large training datasets for Reading Comprehension models by creating synthetic questions from existing passages

License

CC0

Listing Stats

VIEWS

0

DOWNLOADS

0

LISTED

17/06/2025

REGION

GLOBAL

Universal Data Quality Score Logo UDQSQUALITY

5 / 5

VERSION

1.0

Free