Abstract

Abstract The 2010 i2b2/VA Workshop on Natural Language Processing Challenges for Clinical Records presented three tasks: a concept extraction task focused on the extraction of medical concepts from patient reports; an assertion classification task focused on assigning assertion types for medical problem concepts; and a relation classification task focused on assigning relation types that hold between medical problems, tests, and treatments. i2b2 and the VA provided an annotated reference standard corpus for the three tasks. Using this reference standard, 22 systems were developed for concept extraction, 21 for assertion classification, and 16 for relation classification. These systems showed that machine learning approaches could be augmented with rule-based systems to determine concepts, assertions, and relations. Depending on the task, the rule-based systems can either provide input for machine learning or post-process the output of machine learning. Ensembles of classifiers, information from unlabeled data, and external knowledge sources can help when the training data are inadequate.

Keywords

AssertionComputer scienceTask (project management)Artificial intelligenceRelation (database)Natural language processingInformation extractionProcess (computing)Machine learningInformation retrievalData miningProgramming language

Affiliated Institutions

Related Publications

The teachable language comprehender

The Teachable Language Comprehender (TLC) is a program designed to be capable of being taught to “comprehend” English text. When text which the program has not seen before is in...

1969 Communications of the ACM 483 citations

Publication Info

Year
2011
Type
article
Volume
18
Issue
5
Pages
552-556
Citations
1234
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1234
OpenAlex

Cite This

Özlem Uzuner, Brett R. South, Shuying Shen et al. (2011). 2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text. Journal of the American Medical Informatics Association , 18 (5) , 552-556. https://doi.org/10.1136/amiajnl-2011-000203

Identifiers

DOI
10.1136/amiajnl-2011-000203