ROUGE: A Package for Automatic Evaluation of Summaries

Abstract

ROUGE stands for Recall-Oriented Understudy for Gisting Evaluation. It includes measures to automatically determine the quality of a summary by comparing it to other (ideal) summaries created by humans. The measures count the number of overlapping units such as n-gram, word sequences, and word pairs between the computer-generated summary to be evaluated and the ideal summaries created by humans. This paper introduces four different ROUGE measures: ROUGE-N, ROUGE-L, ROUGE-W, and ROUGE-S included in the ROUGE summarization evaluation package and their evaluations. Three of them have been used in the Document Understanding Conference (DUC) 2004, a large-scale summarization evaluation sponsored by NIST. 1

Keywords

ROUGEAutomatic summarizationNISTComputer scienceNatural language processingIdeal (ethics)Word (group theory)Artificial intelligenceInformation retrievalMathematics

Related Publications

An overlapping-feature-based phonological model incorporating linguistic constraints: Applications to speech recognition

Jiping Sun , Li Deng

Modeling phonological units of speech is a critical issue in speech recognition. In this paper, our recent development of an overlapping-feature-based phonological model that re...

2002 The Journal of the Acoustical Society... 71 citations

Publication Info

Year: 2004
Type: article
Pages: 74-81
Citations: 8287
Access: Closed

External Links

Citation Metrics

8287

OpenAlex

Cite This

APA Style

                            
                                    Chin-Yew Lin
                                
                            (2004). 
                            ROUGE: A Package for Automatic Evaluation of Summaries. 
                            
                            , 74-81.