基于CRFs的微博評論情感分類的研究
[Abstract]:There are many ways of information transmission in information society. Weibo is a convenient way to communicate information, and the transmission of information has gone deep into every corner of our life. With tens of thousands of users on the Weibo platform, and often on Weibo, personal insights into the discussion of something or a hot topic are posted. Therefore, the analysis of a large number of corpus retained on the Weibo platform can find that the general emotion, emotion and value orientation of most people can provide the basis for the decision makers concerned about the related problems to analyze the problems. First of all, this paper summarizes the related research of emotional analysis of the existing corpus. Then, several commonly used affective classification models are compared, including similarity-based methods, Bayesian classifiers, support vector machines, and so on. Based on the analysis of the advantages and disadvantages of each model, finally, a widely accepted emotion classification method, conditional Random Field (CRFs);, is adopted at the end of the paper. Secondly, the Chinese sentences in the text are marked at the level of word granularity, and the experimental corpus is trained by the conditional random field model to form a training model. The trained model is used to judge the emotional tendency of the comment information. Finally, a classification mechanism of emotion strength is proposed, which makes emotional analysis not only confined to positive, neutral and negative cases, but also quantifies the original three aspects of the experimental results, so as to pass the quantized results. Rank the strength of the emotion. Through the analysis of the corpus by using CRFs, this paper shows that CRFs has a good classification effect for affective sentences, and the experimental results basically verify the feasibility of the mechanism of emotional intensity classification proposed by the author. Quantitative results can provide data support for decision makers. However, there are still some problems that need to be improved, such as the incomplete corpus and so on, which will be further improved in the future.
【學(xué)位授予單位】:東北師范大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2014
【分類號】:TP393.092
【參考文獻(xiàn)】
相關(guān)期刊論文 前10條
1 郭雯,葛朝陽,吳曉波;基于客戶認(rèn)知價(jià)值的CRM戰(zhàn)略[J];商業(yè)研究;2003年08期
2 馮奇峰,李言;客戶的認(rèn)知投入與保持投入模型研究[J];計(jì)算機(jī)集成制造系統(tǒng);2005年09期
3 何鳳英;;基于語義理解的中文博文傾向性分析[J];計(jì)算機(jī)應(yīng)用;2011年08期
4 張玉芳;莫凌琳;熊忠陽;耿曉斐;;基于條件隨機(jī)場的科研論文信息分層抽取[J];計(jì)算機(jī)應(yīng)用研究;2009年10期
5 唐慧豐;譚松波;程學(xué)旗;;基于監(jiān)督學(xué)習(xí)的中文情感分類技術(shù)比較研究[J];中文信息學(xué)報(bào);2007年06期
6 徐軍;丁宇新;王曉龍;;使用機(jī)器學(xué)習(xí)方法進(jìn)行新聞的情感自動(dòng)分類[J];中文信息學(xué)報(bào);2007年06期
7 劉康;趙軍;;基于層疊CRFs模型的句子褒貶度分析研究[J];中文信息學(xué)報(bào);2008年01期
8 金昌虎;;在線WOM內(nèi)容和效果的關(guān)系:產(chǎn)品知識和相關(guān)的影響[J];沈陽大學(xué)學(xué)報(bào);2006年01期
9 郭國慶;楊學(xué)成;張楊;;口碑傳播對消費(fèi)者態(tài)度的影響:一個(gè)理論模型[J];管理評論;2007年03期
10 孫艷;周學(xué)廣;付偉;;無監(jiān)督的主題情感混合模型研究[J];西安交通大學(xué)學(xué)報(bào);2013年01期
本文編號:2456587
本文鏈接:http://www.wukwdryxk.cn/guanlilunwen/ydhl/2456587.html