QBLink: A Dataset for Sequential Open-Domain Question Answering

dc.contributor.advisorBoyd-Graber, Jordan
dc.contributor.authorGhoneim, Ahmed Elgohary
dc.contributor.authorZhao, Chen
dc.date.accessioned2021-08-01T16:59:59Z
dc.date.available2021-08-01T16:59:59Z
dc.date.issued2018-11-03
dc.description.abstractWe introduce QBLink, a new dataset of about 18,000 question sequences, each sequence consists of three naturally occurring human-authored questions (totaling around 56,000 unique questions). The sequences themselves are also naturally occurring (i.e., we do not artificially combine individually-authored questions to form sequences), which allows us to focus more on the important connections between questions that should be incorporated to improve the end-to-end question answering accuracy. QBLink is based on the bonus questions of Quiz Bowl tournaments. Unlike previous work that only uses the starter (or tossup) questions, bonus questions are not interruptable (players always hear the complete question) and have greater variability in difficulty. Bonus questions start with a lead-in text, which sets the stage for the rest of the question, followed by a sequence of related questions.en_US
dc.description.sponsorshipJordan Boyd-Graber is supported by NSF Grant IIS-1652666. Ahmed Elgohary was supported by an IBM PhD fellowship.en_US
dc.identifierhttps://doi.org/10.13016/t92u-mpwn
dc.identifier.citationAhmed Elgohary, Chen Zhao, and Jordan Boyd-Graber. 2018. Dataset and baselines for sequential open- domain question answering. In Proceedings of Em- pirical Methods in Natural Language Processing.en_US
dc.identifier.urihttp://hdl.handle.net/1903/27594
dc.language.isoenen_US
dc.relation.isAvailableAtCollege of Computer, Mathematical & Natural Sciencesen_us
dc.relation.isAvailableAtComputer Scienceen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectQuestion Answeringen_US
dc.subjectDialog Systemsen_US
dc.subjectQuizBowlen_US
dc.subjectQBLinken_US
dc.titleQBLink: A Dataset for Sequential Open-Domain Question Answeringen_US
dc.typeDataseten_US

Files

Original bundle
Now showing 1 - 3 of 3
No Thumbnail Available
Name:
QBLink-train.json
Size:
24.75 MB
Format:
Unknown data format
Description:
Training Split of QBLink
No Thumbnail Available
Name:
QBLink-dev.json
Size:
1.98 MB
Format:
Unknown data format
Description:
Development Split of QBLink
No Thumbnail Available
Name:
QBLink-test.json
Size:
3.41 MB
Format:
Unknown data format
Description:
Test Split of QBLink
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.57 KB
Format:
Item-specific license agreed upon to submission
Description: