Abstract

Machine translation can be evaluated using precision, recall, and the F-measure. These standard measures have significantly higher correlation with human judgments than recently proposed alternatives. More importantly, the standard measures have an intuitive interpretation, which can facilitate insights into how MT systems might be improved. The relevant software is publicly available.

Keywords

Machine translationRecallComputer sciencePrecision and recallInterpretation (philosophy)Measure (data warehouse)Translation (biology)Artificial intelligenceSoftwareNatural language processingMachine learningData miningProgramming languageCognitive psychologyPsychology

Affiliated Institutions

Related Publications

Publication Info

Year
2003
Type
article
Volume
2
Pages
61-63
Citations
226
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

226
OpenAlex

Cite This

I. Dan Melamed, Ryan Green, Joseph Turian (2003). Precision and recall of machine translation. , 2 , 61-63. https://doi.org/10.3115/1073483.1073504

Identifiers

DOI
10.3115/1073483.1073504