Precision and recall of machine translation

I. Dan Melamed; Ryan Green; Joseph Turian

doi:10.3115/1073483.1073504

Abstract

Machine translation can be evaluated using precision, recall, and the F-measure. These standard measures have significantly higher correlation with human judgments than recently proposed alternatives. More importantly, the standard measures have an intuitive interpretation, which can facilitate insights into how MT systems might be improved. The relevant software is publicly available.

Keywords

Machine translationRecallComputer sciencePrecision and recallInterpretation (philosophy)Measure (data warehouse)Translation (biology)Artificial intelligenceSoftwareNatural language processingMachine learningData miningProgramming languageCognitive psychologyPsychology

Affiliated Institutions

New York University US

Related Publications

Evaluation: from precision, recall and F-measure to ROC, informedness,\n markedness and correlation

David Powers

Commonly used evaluation measures including Recall, Precision, F-Measure and\nRand Accuracy are biased and should not be used without clear understanding of\nthe biases, and cor...

2020 arXiv (Cornell University) 1514 citations

Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation

David Powers

Commonly used evaluation measures including Recall, Precision, F-Measure and Rand Accuracy are biased and should not be used without clear understanding of the biases, and corre...

2020 arXiv (Cornell University) 4425 citations

SemEval-2012 Task 6: A Pilot on Semantic Textual Similarity

Eneko Agirre , Daniel Cer , Mona Diab +1 more

Semantic Textual Similarity (STS) measures the degree of semantic equivalence between two texts. This paper presents the results of the STS pilot task in Semeval. The training d...

2012 679 citations

Towards Learning Terminological Concept Systems from Multilingual Natural Language Text

Yinhan Liu , Myle Ott , Naman Goyal +7 more

Terminological Concept Systems (TCS) provide a means of organizing, structuring and representing domain-specific multilingual information and are important to ensure terminologi...

2021 Leibniz-Zentrum für Informatik (Schlo... 16995 citations

Cross-lingual Language Model Pretraining

Guillaume Lample , Alexis Conneau

Recent studies have demonstrated the efficiency of generative pretraining for English natural language understanding. In this work, we extend this approach to multiple languages...

2019 arXiv (Cornell University) 1617 citations

Publication Info

Year: 2003
Type: article
Volume: 2
Pages: 61-63
Citations: 226
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Precision and recall of machine translation

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

226

OpenAlex

Cite This

APA Style

                            
                                    I. Dan Melamed, 
                                
                                    Ryan Green, 
                                
                                    Joseph Turian
                                
                            (2003). 
                            Precision and recall of machine translation. 
                            
                            , 2
                            
                            , 61-63.
                            https://doi.org/10.3115/1073483.1073504

Identifiers

DOI: 10.3115/1073483.1073504