Domain adaptation for large-scale sentiment classification: A deep learning approach

Xavier Glorot; Antoine Bordes; Yoshua Bengio

Abstract

The exponential increase in the availability of online reviews and recommendations makes sentiment classification an interesting topic in academic and industrial research. Reviews can span so many different domains that it is difficult to gather annotated training data for all of them. Hence, this paper studies the problem of domain adaptation for sentiment classifiers, hereby a system is trained on labeled reviews from one source domain but is meant to be deployed on another. We propose a deep learning approach which learns to extract a meaningful representation for each review in an unsupervised fashion. Sentiment classifiers trained with this high-level feature representation clearly outperform state-of-the-art methods on a benchmark composed of reviews of 4 types of Amazon products. Furthermore, this method scales well and allowed us to successfully perform domain adaptation on a larger industrial-strength dataset of 22 domains. 1.

Keywords

Domain adaptationComputer scienceBenchmark (surveying)Artificial intelligenceAdaptation (eye)Domain (mathematical analysis)Sentiment analysisMachine learningRepresentation (politics)Feature learningDeep learningFeature (linguistics)Scale (ratio)Classifier (UML)

Affiliated Institutions

Related Publications

A Multi-View Deep Learning Approach for Cross Domain User Modeling in Recommendation Systems

Ali Elkahky , Yang Song , Xiaodong He

Recent online services rely heavily on automatic personalization to recommend relevant content to a large number of users. This requires systems to scale promptly to accommodate...

2015 710 citations

Deep Domain Confusion: Maximizing for Domain Invariance

Eric Tzeng , Judy Hoffman , Ning Zhang +2 more

Recent reports suggest that a generic supervised deep CNN model trained on a large-scale dataset reduces, but does not remove, dataset bias on a standard benchmark. Fine-tuning ...

2014 arXiv (Cornell University) 2347 citations

Biased Representation Learning for Domain Adaptation

Fei Huang , Alexander Yates

Representation learning is a promising technique for discovering features that allow supervised classifiers to generalize from a source domain dataset to arbitrary new domains. ...

2012 23 citations

Deep Contextualized Word Representations

Matthew E. Peters , Mark E Neumann , Mohit Iyyer +4 more

We introduce a new type of deep contextualized word representation that models both (1) complex characteristics of word use (e.g., syntax and semantics), and (2) how these uses ...

2018 Proceedings of the 2018 Conference of... 1786 citations

Analysis of Representations for Domain Adaptation

Shai Ben-David , John Blitzer , Koby Crammer +1 more

Discriminative learning methods for classification perform well when training and test data are drawn from the same distribution. In many situations, though, we have labeled tra...

2007 The MIT Press eBooks 1963 citations

Publication Info

Year: 2012
Type: preprint
Citations: 1563
Access: Closed

External Links

Citation Metrics

1563

OpenAlex

Cite This

APA Style

                            
                                    Xavier Glorot, 
                                
                                    Antoine Bordes, 
                                
                                    Yoshua Bengio
                                
                            (2012). 
                            Domain adaptation for large-scale sentiment classification: A deep learning approach. 
                            
                            .