Abstract

Recognizing lines of unconstrained handwritten text is a challenging task. The difficulty of segmenting cursive or overlapping characters, combined with the need to exploit surrounding context, has led to low recognition rates for even the best current recognizers. Most recent progress in the field has been made either through improved preprocessing or through advances in language modeling. Relatively little work has been done on the basic recognition algorithms. Indeed, most systems rely on the same hidden Markov models that have been used for decades in speech and handwriting recognition, despite their well-known shortcomings. This paper proposes an alternative approach based on a novel type of recurrent neural network, specifically designed for sequence labeling tasks where the data is hard to segment and contains long-range bidirectional interdependencies. In experiments on two large unconstrained handwriting databases, our approach achieves word recognition accuracies of 79.7 percent on online data and 74.1 percent on offline data, significantly outperforming a state-of-the-art HMM-based system. In addition, we demonstrate the network's robustness to lexicon size, measure the individual influence of its hidden layers, and analyze its use of context. Last, we provide an in-depth discussion of the differences between the network and HMMs, suggesting reasons for the network's superior performance.

Keywords

Computer scienceHidden Markov modelArtificial intelligenceHandwriting recognitionRobustness (evolution)Speech recognitionConnectionismArtificial neural networkPattern recognition (psychology)Intelligent character recognitionPreprocessorContext (archaeology)Feature extractionCharacter recognition

MeSH Terms

AlgorithmsElectronic Data ProcessingHandwritingImage EnhancementImage InterpretationComputer-AssistedInformation Storage and RetrievalModelsStatisticalPattern RecognitionAutomatedReadingReproducibility of ResultsSensitivity and SpecificitySubtraction Technique

Affiliated Institutions

Related Publications

Publication Info

Year
2009
Type
article
Volume
31
Issue
5
Pages
855-868
Citations
1961
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1961
OpenAlex
115
Influential
1529
CrossRef

Cite This

Alexander Graves, Marcus Liwicki, S. George Fernandez et al. (2009). A Novel Connectionist System for Unconstrained Handwriting Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence , 31 (5) , 855-868. https://doi.org/10.1109/tpami.2008.137

Identifiers

DOI
10.1109/tpami.2008.137
PMID
19299860

Data Quality

Data completeness: 90%