Foundations of statistical natural language processing

Abstract

Statistical approaches to processing natural language text have become dominant in recent years. This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear. The book contains all the theory and algorithms needed for building NLP tools. It provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations. The book covers collocation finding, word sense disambiguation, probabilistic parsing, information retrieval, and other applications.

Keywords

Computer scienceNatural language processingArtificial intelligenceCollocation (remote sensing)ParsingProbabilistic logicNatural languageConstruct (python library)Word (group theory)Natural (archaeology)Question answeringStatistical modelLinguisticsProgramming languageMachine learning

Affiliated Institutions

Related Publications

Deep Contextualized Word Representations

Matthew E. Peters , Mark E Neumann , Mohit Iyyer +4 more

We introduce a new type of deep contextualized word representation that models both (1) complex characteristics of word use (e.g., syntax and semantics), and (2) how these uses ...

2018 Proceedings of the 2018 Conference of... 1786 citations

Natural Language Processing with Python

Steven Bird , Ewan Klein , Edward Loper

This book offers a highly accessible introduction to natural language processing, the field that supports a variety of language technologies, from predictive text and email filt...

2009 CERN Document Server (European Organi... 3449 citations

BERT Rediscovers the Classical NLP Pipeline

Ian Tenney , Dipanjan Das , Ellie Pavlick

Pre-trained text encoders have rapidly advanced the state of the art on many NLP tasks. We focus on one such model, BERT, and aim to quantify where linguistic information is cap...

2019 1214 citations

Supervised Learning of Universal Sentence Representations from Natural\n Language Inference Data

Alexis Conneau , Douwe Kiela , Holger Schwenk +2 more

Many modern NLP systems rely on word embeddings, previously trained in an\nunsupervised manner on large corpora, as base features. Efforts to obtain\nembeddings for larger chunk...

2017 arXiv (Cornell University) 2038 citations

Publicly Available Clinical

Emily Alsentzer , John R. Murphy , William Boag +4 more

Contextual word embedding models such as ELMo and BERT have dramatically improved performance for many natural language processing (NLP) tasks in recent months. However, these m...

2019 Proceedings of the 2nd Clinical Natur... 1422 citations

Publication Info

Year: 1999
Type: book
Citations: 9969
Access: Closed

External Links

Citation Metrics

9969

OpenAlex

Cite This

APA Style

                            
                                    Christopher D. Manning, 
                                
                                    Hinrich Schütze
                                
                            (1999). 
                            Foundations of statistical natural language processing. 
                            
                            .