主題事件挖掘及動態(tài)演化分析研究
[Abstract]:Thematic event mining and evolutionary analysis is a structured form of events that people are interested in, extracting key information from events, such as time, place, and character, and sorting out and analyzing the relationship and development situation between events, so that the participants can understand events more clearly and quickly. Mining mainly includes time series analysis, information retrieval, automatic abstracting, topic detection and tracking, event detection, burst detection, anomaly detection and so on. Early basic work needs data acquisition, that is, to obtain related data of events and to carry out structured or semi-structured. This paper will study from sentence to text, and then to a number of chapters. The object of processing is subject oriented events. The main task is to understand the theme events deeply, that is, thematic event extraction and event analysis oriented to multiple documents. Event extraction includes sentence or phrase oriented event information identification, including time, ground point, character, shallow semantic analysis, etc.; document oriented events. Information recognition mainly includes time, key actions, locations, characters, and information fusion of subject events oriented to multiple documents. Event analysis includes dynamic evolution analysis of subtopics, character influence analysis and anomaly detection. This paper covers four key points for thematic event mining and focuses on different research issues. 1) study the information extraction and timing characteristics of subject events. The simple sentence based event argument does not reflect the occurrence of thematic events. This study takes thematic events as the research object, and the action meaning meta events are the necessary single positions for the theme events, including the event extraction within the sentence scope, and the text in the text. In this paper, a time recognition model for thematic events is proposed in this paper, which transforms the time recognition of the sentence or phrase into the time recognition for the text, thus identifying the time of the subject event fragment. The model uses the reference time dynamic selection mechanism to standardize the time expression. There is a certain correspondence between the event elements and the elements of the verbs dominated by the verb, so in this study, the event extraction and the shallow semantic analysis are combined to correspond the event elements to the semantic role tagging, and the performance of the time recognition of the subject pieces, which are based on the pure keyword or the static reference time mechanism, is improved. (2) based on the momentum. This study will combine the elements of the event and the idea of sudden detection to study the influence of the characters in the course of the development of the whole event. The physical model is used to define and construct the dynamic character of the characters' influence, combining the social elements of the characters, not only by the rate of arrival. By using stock analysis indicators to characterize and analyze the momentum characteristics of people's influence, the combination of several Moving Average Convergence Divergence (MACD) technical indicators is used to avoid a high index and no sudden situation. In order to analyze the factors in the event and the participation of these elements in the development process of the theme events. (3) study the application of dynamic incremental strategy in the subtopic evolution analysis of the theme events. Dynamic tracking of knowledge topics. These topics may be independent topics, or may not be the description of the same event. This study is based on the characteristics of the subtopic evolution as a dynamic data stream, combined with the Single-Pass clustering method, both ideas and dynamic increments, for the detection and tracking of subtopics. According to the timing and dynamics of subtopics, the algorithm is analyzed in terms of threshold selection, similarity smoothness and time factors. (4) the problem of anomaly detection in the synergistic effect of statistical theory and fuzzy set theory is studied. Anomaly detection is also a kind of time series analysis, which takes into account the data The time sequence and dynamics of flow. Outliers are data which are significantly different from other data. Some outliers can be considered noise, and some are key information. For example, the exception point in the event development often reveals the critical period or turning point of the event. Anomaly detection technology usually requires a large number of tagged data. The statistical distribution characteristics of the data are unknown, and many parameters are needed, the control limit is difficult to determine and the fuzziness of the data itself. In this paper, based on the theory of statistical process control, this paper defines the concept of abnormal points and abnormality. According to the characteristics of the anomaly point itself, the combination of the fuzzy theory and the statistical method is combined. The technique performs the anomaly detection in the event. This method can not require any annotation data and is independent of the distribution. The parameters are determined by the enhanced fuzzification process and the optimization model.
【學(xué)位授予單位】:哈爾濱工業(yè)大學(xué)
【學(xué)位級別】:博士
【學(xué)位授予年份】:2016
【分類號】:TP391.1
【相似文獻(xiàn)】
相關(guān)期刊論文 前10條
1 周道林;;論過程感知信息系統(tǒng)中過程的動態(tài)演化[J];信息系統(tǒng)工程;2012年09期
2 趙倫;肖鏃;;計算機(jī)軟件動態(tài)演化技術(shù)概述[J];計算機(jī)光盤軟件與應(yīng)用;2013年13期
3 鄧?yán)?吳健;胡正國;;一種嵌入式系統(tǒng)的動態(tài)演化方法[J];計算機(jī)應(yīng)用;2007年11期
4 王愛萍;閭國年;黃家柱;鄭新奇;林冰仙;;面向動態(tài)演化的城鎮(zhèn)地價評估系統(tǒng)[J];計算機(jī)工程;2008年14期
5 李玉龍;李長云;;軟件動態(tài)演化技術(shù)[J];計算機(jī)技術(shù)與發(fā)展;2008年09期
6 陳春華;馬勤;陳雅莉;;基于動態(tài)演化的水文數(shù)據(jù)轉(zhuǎn)貯技術(shù)研究與應(yīng)用[J];水文;2011年S1期
7 賈向陽;應(yīng)時;張韜;余曉峰;;一個支持業(yè)務(wù)過程動態(tài)演化的可反射框架[J];計算機(jī)工程;2006年10期
8 陳洪龍;李仁發(fā);;面向服務(wù)對象的動態(tài)演化機(jī)制[J];計算機(jī)應(yīng)用;2010年07期
9 賴格英,于革;古氣候動力模擬動態(tài)演化的可視化研究與實現(xiàn)[J];計算機(jī)應(yīng)用;2004年06期
10 沈思;鄭昌興;;基于動態(tài)演化模式的詞表組織設(shè)計與實現(xiàn)[J];計算機(jī)光盤軟件與應(yīng)用;2013年24期
相關(guān)會議論文 前5條
1 趙會群;孫晶;魏瑩;王文文;;服務(wù)體系結(jié)構(gòu)的動態(tài)演化方法研究[A];CCF NCSC 2011——第二屆中國計算機(jī)學(xué)會服務(wù)計算學(xué)術(shù)會議論文集[C];2011年
2 王弟海;龔六堂;;持續(xù)性不平等的原因及其動態(tài)演化綜述[A];經(jīng)濟(jì)學(xué)(季刊)第7卷第2期[C];2008年
3 張涵信;沈孟育;;基于動態(tài)演化的最優(yōu)化方法[A];近代空氣動力學(xué)研討會論文集[C];2005年
4 鄭江淮;張曉云;;從國際代工到國際研發(fā):價值鏈攀升的動態(tài)演化[A];社會主義經(jīng)濟(jì)理論研究集萃——從經(jīng)濟(jì)大國走向經(jīng)濟(jì)強(qiáng)國的戰(zhàn)略思維(2011)[C];2011年
5 高瑩瑩;何楓;沈孟育;;非定常動態(tài)演化伴隨方法在翼型氣動設(shè)計中的應(yīng)用[A];北京力學(xué)會第19屆學(xué)術(shù)年會論文集[C];2013年
相關(guān)博士學(xué)位論文 前6條
1 李風(fēng)環(huán);主題事件挖掘及動態(tài)演化分析研究[D];哈爾濱工業(yè)大學(xué);2016年
2 苗又山;大規(guī)模動態(tài)演化圖的存儲與分析系統(tǒng)研究[D];中國科學(xué)技術(shù)大學(xué);2015年
3 姚毅;中國城鄉(xiāng)貧困動態(tài)演化的理論與實證研究[D];西南財經(jīng)大學(xué);2010年
4 陳洪龍;面向?qū)ο蟆獦?gòu)件的軟件動態(tài)演化技術(shù)研究[D];湖南大學(xué);2011年
5 謝仲文;一種需求驅(qū)動、以體系結(jié)構(gòu)為視圖的面向軟件動態(tài)演化的模型與方法[D];云南大學(xué);2012年
6 趙旭劍;中文新聞話題動態(tài)演化及其關(guān)鍵技術(shù)研究[D];中國科學(xué)技術(shù)大學(xué);2012年
相關(guān)碩士學(xué)位論文 前10條
1 王華;軟件動態(tài)演化良性化建模與評估方法研究[D];湖南工業(yè)大學(xué);2015年
2 魏秋彥;環(huán)境變化對軟件動態(tài)演化的作用機(jī)理研究[D];湖南工業(yè)大學(xué);2015年
3 薛彤;微博輿情動態(tài)演化特性及多主體仿真研究[D];南京航空航天大學(xué);2015年
4 蔣旭東;面向動態(tài)演化的軟件行為相關(guān)性問題分析研究[D];云南大學(xué);2016年
5 江晶;基于競爭優(yōu)勢動態(tài)演化的高新技術(shù)企業(yè)可持續(xù)發(fā)展研究[D];武漢理工大學(xué);2007年
6 楊軼波;我國大學(xué)衍生企業(yè)的動態(tài)演化分析[D];上海交通大學(xué);2010年
7 張丹;基于SCA的動態(tài)演化模型SO-DSAM的研究與應(yīng)用[D];西北大學(xué);2011年
8 曾惠芳;基于高階挖掘的動態(tài)演化規(guī)律研究[D];暨南大學(xué);2011年
9 蘇衛(wèi)華;復(fù)雜網(wǎng)絡(luò)社區(qū)發(fā)現(xiàn)及其動態(tài)演化研究[D];太原理工大學(xué);2010年
10 仇書禮;面向服務(wù)的構(gòu)件動態(tài)演化方法及其實現(xiàn)[D];哈爾濱工業(yè)大學(xué);2011年
,本文編號:2144370
本文鏈接:http://www.wukwdryxk.cn/shoufeilunwen/xxkjbs/2144370.html