Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Abstract

This article surveys and organizes research works in a new paradigm in natural language processing, which we dub “prompt-based learning.” Unlike traditional supervised learning, which trains a model to take in an input x and predict an output y as P ( y|x ), prompt-based learning is based on language models that model the probability of text directly. To use these models to perform prediction tasks, the original input x is modified using a template into a textual string prompt x′ that has some unfilled slots, and then the language model is used to probabilistically fill the unfilled information to obtain a final string x̂ , from which the final output y can be derived. This framework is powerful and attractive for a number of reasons: It allows the language model to be pre-trained on massive amounts of raw text, and by defining a new prompting function the model is able to perform few-shot or even zero-shot learning, adapting to new scenarios with few or no labeled data. In this article, we introduce the basics of this promising paradigm, describe a unified set of mathematical notations that can cover a wide variety of existing work, and organize existing work along several dimensions, e.g., the choice of pre-trained language models, prompts, and tuning strategies. To make the field more accessible to interested beginners, we not only make a systematic review of existing works and a highly structured typology of prompt-based concepts but also release other resources, e.g., a website NLPedia–Pretrain including constantly updated survey and paperlist.

Keywords

Computer scienceVariety (cybernetics)Set (abstract data type)NotationCover (algebra)Artificial intelligenceNatural languageField (mathematics)Function (biology)Language modelString (physics)Natural language processingNatural language understandingProgramming languageLinguistics

Affiliated Institutions

Related Publications

Learning to Prompt for Vision-Language Models

Kaiyang Zhou , Jingkang Yang , Chen Change Loy +1 more

Large pre-trained vision-language models like CLIP have shown great potential in learning representations that are transferable across a wide range of downstream tasks. Differen...

2022 International Journal of Computer Vision 2040 citations

Conditional Prompt Learning for Vision-Language Models

Kaiyang Zhou , Jingkang Yang , Chen Change Loy +1 more

With the rise of powerful pre-trained vision-language models like CLIP, it becomes essential to investigate ways to adapt these models to downstream datasets. A recently propose...

2022 2022 IEEE/CVF Conference on Computer ... 1169 citations

Large language models encode clinical knowledge

Karan Singhal , Shekoofeh Azizi , Tao Tu +29 more

Abstract Large language models (LLMs) have demonstrated impressive capabilities, but the bar for clinical applications is high. Attempts to assess the clinical knowledge of mode...

2023 Nature 2248 citations

Deep Contextualized Word Representations

Matthew E. Peters , Mark E Neumann , Mohit Iyyer +4 more

We introduce a new type of deep contextualized word representation that models both (1) complex characteristics of word use (e.g., syntax and semantics), and (2) how these uses ...

2018 Proceedings of the 2018 Conference of... 1786 citations

Exploring the Limits of Transfer Learning with a Unified Text-to-Text\n Transformer

Colin Raffel , Noam Shazeer , Adam Roberts +6 more

Transfer learning, where a model is first pre-trained on a data-rich task\nbefore being fine-tuned on a downstream task, has emerged as a powerful\ntechnique in natural language...

2019 arXiv (Cornell University) 8299 citations

Publication Info

Year: 2022
Type: review
Volume: 55
Issue: 9
Pages: 1-35
Citations: 3108
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

3108

OpenAlex

Cite This

APA Style

                            
                                
                                    Pengfei Liu, 
                                
                                    Weizhe Yuan, 
                                
                                    Jinlan Fu
                                
                                et al.
                            
                            (2022). 
                            Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing. 
                            ACM Computing Surveys
                            , 55
                            (9)
                            , 1-35.
                            https://doi.org/10.1145/3560815
                        

Identifiers

DOI: 10.1145/3560815