dc.contributor.author | Bravo-Marquez, Felipe | en_NZ |
dc.contributor.author | Frank, Eibe | en_NZ |
dc.contributor.author | Pfahringer, Bernhard | en_NZ |
dc.coverage.spatial | Santiago, Chile | en_NZ |
dc.date.accessioned | 2015-08-27T00:53:18Z | |
dc.date.available | 2015 | en_NZ |
dc.date.available | 2015-08-27T00:53:18Z | |
dc.date.issued | 2015 | en_NZ |
dc.identifier.citation | Bravo-Márquez, F., Frank, E., & Pfahringer, B. (2015). From unlabelled tweets to Twitter-specific opinion words. In Proc 38th International ACM SIGIR Conference on Research and Development, Santiago de Chile, 9-13 August 2015 (pp. 743–746). New York, USA: ACM. http://doi.org/10.1145/2766462.2767770 | en |
dc.identifier.uri | https://hdl.handle.net/10289/9567 | |
dc.description.abstract | In this article, we propose a word-level classification model for automatically generating a Twitter-specific opinion lexicon from a corpus of unlabelled tweets. The tweets from the corpus are represented by two vectors: a bag-of-words vector and a semantic vector based on word-clusters. We propose a distributional representation for words by treating them as the centroids of the tweet vectors in which they appear. The lexicon generation is conducted by training a word-level classifier using these centroids to form the instance space and a seed lexicon to label the training instances. Experimental results show that the two types of tweet vectors complement each other in a statistically significant manner and that our generated lexicon produces significant improvements for tweet-level polarity classification. | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | |
dc.publisher | ACM | en_NZ |
dc.rights | © 2015 Copyright is held by the author(s). Publication rights licensed to ACM | |
dc.source | SIGIR '15 | en_NZ |
dc.subject | Lexicon Generation | |
dc.subject | computer science | |
dc.subject | Sentiment Analysis | |
dc.subject | Twitter | |
dc.subject | Machine learning | |
dc.title | From unlabelled tweets to Twitter-specific opinion words | en_NZ |
dc.type | Conference Contribution | |
dc.identifier.doi | 10.1145/2766462.2767770 | en_NZ |
dc.relation.isPartOf | Proc 38th International ACM SIGIR Conference on Research and Development | en_NZ |
pubs.begin-page | 743 | |
pubs.elements-id | 128082 | |
pubs.end-page | 746 | |
pubs.finish-date | 2015-08-13 | en_NZ |
pubs.place-of-publication | New York, USA | |
pubs.start-date | 2015-08-09 | en_NZ |