Evaluating Evaluation Metrics for Ancient Chinese to English Machine Translation

dc.contributor.advisorSchonebaum, Andrew
dc.contributor.authorBennett, Eric
dc.date.accessioned2025-04-02T15:55:54Z
dc.date.issued2024
dc.description.abstractEvaluation metrics are an important driver of progress in Machine Translation (MT), but they have been primarily validated on high-resource modern languages. In this paper, we conduct an empirical evaluation of metrics commonly used to evaluate MT from Ancient Chinese into English. Using LLMs, we construct a contrastive test set, pairing high-quality MT and purposefully flawed MT of the same Pre-Qin texts. We then evaluate the ability of each metric to discriminate between accurate and flawed translations.
dc.identifierhttps://doi.org/10.13016/hluh-dgoh
dc.identifier.urihttp://hdl.handle.net/1903/33827
dc.relation.isAvailableAtDigital Repository at the University of Maryland
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md)
dc.subjectMachine Translation
dc.subjectAncient Chinese
dc.subjectNatural Language Processing
dc.subjectMachine Learning
dc.subjectArtificial Intelligence
dc.titleEvaluating Evaluation Metrics for Ancient Chinese to English Machine Translation
dc.typeOther

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
Bennett_Essay.pdf
Size:
52.75 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
Bennett_ResearchPaper.pdf
Size:
274.53 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
Bennett_Bibliography.pdf
Size:
115.63 KB
Format:
Adobe Portable Document Format