CANARD: A dataset for Question-in-Context Rewriting

No Thumbnail Available

Files (3.11 MB)
No. of downloads: 39

Related Publication Link



Related Publication Citation

Ahmed Elgohary, Denis Peskov, and Jordan Boyd-Graber. Can You Unpack That? Learning to Rewrite Questions-in-Context. Empirical Methods in Natural Language Processing, 2019.


In conversational question answering multiple questions in an information-seeking dialogs which requires models to link questions together to resolve the conversational dependencies between them: each question needs to be under- stood in the conversation context. For example, the question “What was he like in that episode?” cannot be understood without knowing what “he” and “that episode” refer to, which can be resolved using the conversation context. CANARD is a dataset of 40,000 questions asked in conversational contexts paired with their gold context-independent (stand-alone) rewrite.