Library Award for Undergraduate Research
Permanent URI for this communityhttp://hdl.handle.net/1903/11321
Browse
1 results
Search Results
Item Evaluating Evaluation Metrics for Ancient Chinese to English Machine Translation(2024) Bennett, Eric; Schonebaum, AndrewEvaluation metrics are an important driver of progress in Machine Translation (MT), but they have been primarily validated on high-resource modern languages. In this paper, we conduct an empirical evaluation of metrics commonly used to evaluate MT from Ancient Chinese into English. Using LLMs, we construct a contrastive test set, pairing high-quality MT and purposefully flawed MT of the same Pre-Qin texts. We then evaluate the ability of each metric to discriminate between accurate and flawed translations.