dc.contributor.advisor: Gulla, Jon Atle [nb_NO]
dc.contributor.author: Wei, Wei [nb_NO]
dc.date.accessioned: 2014-12-19T13:39:53Z
dc.date.available: 2014-12-19T13:39:53Z
dc.date.created: 2013-10-06 [nb_NO]
dc.date.issued: 2013 [nb_NO]
dc.identifier: 653857 [nb_NO]
dc.identifier.isbn: 978-82-471-4636-1 [nb_NO]
dc.identifier.isbn: 978-82-471-4637-8 [nb_NO]
dc.identifier.uri: http://hdl.handle.net/11250/253248
dc.description.abstract: With the continuous growth of the Internet, an ever-increasing amount of information becomes available on the World Wide Web (WWW). Information on the WWW has grown so explosively that search engines using traditional keyword-based search strategies can hardly meet people’s needs to retrieve knowledge from massive online text data. The motivation for this thesis comes from the great demand for discovering implicit knowledge and rich semantics in online documents. This thesis focuses on analyzing online business news, a representative of objective information, and online customer reviews, a representative of subjective information. For online business news, a topic-driven impact analysis model is proposed that quantifies the impact of the topic of a news article. With the proposed topic-driven impact analysis model, an explorative visual analysis system called ImpactWheel is developed to help users better navigate and understand topic-specific impact relationships among companies by mining the rich information source of online business news. For online customer reviews, both overall document sentiment classification and attribute-based sentiment analysis are performed. For overall document sentiment classification, taking advantage of the high frequency of Co-occurring Term (CoT) patterns in customer reviews, a frequency-based algorithm is proposed to generate complex features that benefit sentiment classifiers. To search for effective features and ignore useless ones produced by the frequency-based complex feature generation algorithm, an Effective Feature Search (EFS) framework is proposed, which makes a novel connection between feature candidate generation and a Stochastic Local Search process. For attribute-based sentiment analysis, the concept of the Sentiment Ontology Tree (SOT) is proposed, which organizes a product’s domain-specific knowledge as well as sentiments in a tree-like ontology structure. With the concept of SOT, a Hierarchical Learning via Sentiment Ontology Tree (HL-SOT) approach is proposed to solve the sentiment analysis tasks in a hierarchical classification process. To enhance the classification performance and computational efficiency of the HL-SOT approach, which encodes texts using a globally unified index term space, a Localized Feature Selection (LFS) framework is developed, which generates a customized index term space for each node of the SOT. Because the HL-SOT approach is estimated by an RLS estimator, which is not well suited to finding the maximum class separation, and because statistical linear classifiers have been shown to be fallible at classifying sentiment, a more pragmatic Hybrid Hierarchical Classification Process (HHCP) is proposed. The HHCP approach employs a linear classifier that is capable of maximizing the class separation while minimizing the within-class variance for attribute detection, and turns to a rule-based solution for sentiment orientation. [nb_NO]
dc.language: eng [nb_NO]
dc.publisher: Norges teknisk-naturvitenskapelige universitet [nb_NO]
dc.relation.ispartofseries: Doctoral Theses at NTNU, 1503-8181; 2013:256 [nb_NO]
dc.relation.haspart: Wei, Wei; Cao, Nan; Gulla, Jon Atle; Qu, Huamin. ImpactWheel. Proceedings of The 2011 IEEE/WIC/ACM International Conferences on Web Intelligence: 465-474, 2011. http://dx.doi.org/10.1109/WI-IAT.2011.108 [nb_NO]
dc.relation.haspart: Wei, Wei; Gulla, Jon Atle; Fu, Zhang. Enhancing Negation-Aware Sentiment Classification on Product Reviews via Multi-Unigram Feature Generation. Proceedings of 6th International Conference on Intelligent Computing: 380-391, 2010. http://dx.doi.org/978-3-642-14921-4 [nb_NO]
dc.relation.haspart: Wei, Wei; Mengshoel, Ole J.; Gulla, Jon Atle. Stochastic Search for Effective Features for Sentiment Classification. [nb_NO]
dc.relation.haspart: Wei, Wei; Gulla, Jon Atle. Sentiment Learning on Product Reviews via Sentiment Ontology Tree. Association for Computational Linguistics (ACL). Annual Meeting Conference Proceedings: 404-413, 2010. [nb_NO]
dc.relation.haspart: Wei, Wei. Analyzing Text Data for Opinion Mining. Lecture Notes in Computer Science. (ISSN 0302-9743). 6716: 330-335, 2011. http://dx.doi.org/10.1007/978-3-642-22327-3_49 [nb_NO]
dc.relation.haspart: Wei, Wei; Gulla, Jon Atle. Enhancing the HL-SOT Approach to Sentiment Analysis via a Localized Feature Selection Framework. Proceedings of 5th International Joint Conference on Natural Language Processing: 327-335, 2011. [nb_NO]
dc.relation.haspart: Wei, Wei; Gulla, Jon Atle. Sentiment Analysis in a Hybrid Hierarchical Classification Process. Proceedings of the Seventh International Conference on Digital Information Management (ICDIM): 47-55, 2012. http://dx.doi.org/10.1109/ICDIM.2012.6360120 [nb_NO]
dc.title: Mining Online Text Data for Sentiment and News Impact Analysis [nb_NO]
dc.type: Doctoral thesis [nb_NO]
dc.source.pagenumber: 200 [nb_NO]
dc.contributor.department: Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskap [nb_NO]
dc.description.degree: PhD i informasjonsteknologi [nb_NO]
dc.description.degree: PhD in Information Technology [en_GB]