Abstract

levelevidenceabouttheuseofwordsinlocalcontextstrievaltaskandimprovingretrievaleectiveness.Passagebyimportanttextexcerpts,therebysimplifyingtheretrieval,passage-retrievaltechniquesarebecomingincreasinglypopular.LargertextscanthenbereplacedWiththewidespreaduseoffull-textinformationre- textpassages,astudyoftextpassagesisalsoimportant traversal,andtextsummarization. toformulatespecicationsforinformationretrieval,text themesisthenusedtocharacterizetextstructure,and exhibitstheresultsofsimilaritymeasurementsbetween knowledgeoftexttypeandtextstructureinturnaects pairsoftexts,ortextexcerpts.Typically,eachtext,or manytexthandlingoperations,includingretrieval,text textexcerptisrepresentedbyavectorofweightedterms readingandtraversal,andtextsummarization. ments,themes,informationretrieval,passageretrieval,oftheformDi=(di1;di2;:::;:::;dit)wheredikrepre- Thestructureofindividualtexts,orsetsofrelatedtexts, canbestudiedbyusingatextrelationshipmapthat Withtheadventoffull-textdocumentprocessing,the textsummarization. sentsanimportanceweightfortermTkattachedtodoctentrepresentationpurposesmaybewordsorphrases interestinmanipulatingtextpassagesratherthanonly wordusageinlocaltextenvironmentsisoftenhelpfulin full-textitemshascontinuedtogrow.Retrievinglarge improvingretrievaleectiveness,becausethemeaningof textsinanswertouserqueriestendstobeinecient thetermsintheindividualdocumentsandthedocument texts.Inaddition,passage-levelevidenceaccountingfor ambiguoustermsbecomesclearwhenthelocalcontext becausetheuseristhenforcedtocopewithlargemasses collectionasawhole.[7] umentDi.Thetermsattachedtodocumentsforconderivedfromthedocumenttextsbyanautomaticin- bytakingintoaccounttheoccurrencecharacteristicsof dexingprocedure,andthetermweightsarecomputed oftext,andineectivebecauserelevanttextpassages oftenprovidebetteranswersthancompletedocument

Keywords

HypertextComputer scienceCitationLibrary scienceInformation retrievalWorld Wide Web

Affiliated Institutions

Related Publications

Chemical Reaction Engineering

ADVERTISEMENT RETURN TO ISSUEPREVCommentaryNEXTChemical Reaction EngineeringOctave LevenspielView Author Information Chemical Engineering Department Oregon State University Corv...

1999 Industrial & Engineering Chemistry Re... 9922 citations

Publication Info

Year
1996
Type
article
Pages
53-65
Citations
176
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

176
OpenAlex
9
Influential
79
CrossRef

Cite This

Gerard Salton, Amit Singhal, Chris Buckley et al. (1996). Automatic text decomposition using text segments and text themes. Proceedings of the the seventh ACM conference on Hypertext - HYPERTEXT '96 , 53-65. https://doi.org/10.1145/234828.234834

Identifiers

DOI
10.1145/234828.234834

Data Quality

Data completeness: 81%