From opinion lexicons to sentiment classification of tweets and vice versa: a transfer learning approach
Bravo-Marquez, F., Frank, E., & Pfahringer, B. (2016). From opinion lexicons to sentiment classification of tweets and vice versa: a transfer learning approach. In Proceeding of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence, Omaha, Nebraska, USA, 13-16 October, 2016.(pp. 145–152). Los Alamitos, CA, USA: IEEE Computer Society. http://doi.org/10.1109/WI.2016.29
Permanent Research Commons link: https://hdl.handle.net/10289/10782
Message-level and word-level polarity classification are two popular tasks in Twitter sentiment analysis. They have been commonly addressed by training supervised models from labelled data. The main limitation of these models is the high cost of data annotation. Transferring existing labels from a related problem domain is one possible solution for this problem. In this paper, we propose a simple model for transferring sentiment labels from words to tweets and vice versa by representing both tweets and words using feature vectors residing in the same feature space. Tweets are represented by standard NLP features such as unigrams and part-of-speech tags. Words are represented by averaging the vectors of the tweets in which they occur. We evaluate our approach in two transfer learning problems: 1) training a tweet-level polarity classifier from a polarity lexicon, and 2) inducing a polarity lexicon from a collection of polarity-annotated tweets. Our results show that the proposed approach can successfully classify words and tweets after transfer.
IEEE Computer Society
This is an author’s accepted version of an article published in the Proceeding of the 2016 IEEE/WIC/ACM International Conference on Web Intelligence. © 2016 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.