QBLink: A Dataset for Sequential Open-Domain Question Answering
dc.contributor.advisor | Boyd-Graber, Jordan | |
dc.contributor.author | Ghoneim, Ahmed Elgohary | |
dc.contributor.author | Zhao, Chen | |
dc.date.accessioned | 2021-08-01T16:59:59Z | |
dc.date.available | 2021-08-01T16:59:59Z | |
dc.date.issued | 2018-11-03 | |
dc.description.abstract | We introduce QBLink, a new dataset of about 18,000 question sequences, each sequence consists of three naturally occurring human-authored questions (totaling around 56,000 unique questions). The sequences themselves are also naturally occurring (i.e., we do not artificially combine individually-authored questions to form sequences), which allows us to focus more on the important connections between questions that should be incorporated to improve the end-to-end question answering accuracy. QBLink is based on the bonus questions of Quiz Bowl tournaments. Unlike previous work that only uses the starter (or tossup) questions, bonus questions are not interruptable (players always hear the complete question) and have greater variability in difficulty. Bonus questions start with a lead-in text, which sets the stage for the rest of the question, followed by a sequence of related questions. | en_US |
dc.description.sponsorship | Jordan Boyd-Graber is supported by NSF Grant IIS-1652666. Ahmed Elgohary was supported by an IBM PhD fellowship. | en_US |
dc.identifier | https://doi.org/10.13016/t92u-mpwn | |
dc.identifier.citation | Ahmed Elgohary, Chen Zhao, and Jordan Boyd-Graber. 2018. Dataset and baselines for sequential open- domain question answering. In Proceedings of Em- pirical Methods in Natural Language Processing. | en_US |
dc.identifier.uri | http://hdl.handle.net/1903/27594 | |
dc.language.iso | en | en_US |
dc.relation.isAvailableAt | College of Computer, Mathematical & Natural Sciences | en_us |
dc.relation.isAvailableAt | Computer Science | en_us |
dc.relation.isAvailableAt | Digital Repository at the University of Maryland | en_us |
dc.relation.isAvailableAt | University of Maryland (College Park, MD) | en_us |
dc.subject | Question Answering | en_US |
dc.subject | Dialog Systems | en_US |
dc.subject | QuizBowl | en_US |
dc.subject | QBLink | en_US |
dc.title | QBLink: A Dataset for Sequential Open-Domain Question Answering | en_US |
dc.type | Dataset | en_US |
Files
Original bundle
1 - 3 of 3
No Thumbnail Available
- Name:
- QBLink-train.json
- Size:
- 24.75 MB
- Format:
- Unknown data format
- Description:
- Training Split of QBLink
No Thumbnail Available
- Name:
- QBLink-dev.json
- Size:
- 1.98 MB
- Format:
- Unknown data format
- Description:
- Development Split of QBLink
No Thumbnail Available
- Name:
- QBLink-test.json
- Size:
- 3.41 MB
- Format:
- Unknown data format
- Description:
- Test Split of QBLink