a国产,中文字幕久久波多野结衣AV,欧美粗大猛烈老熟妇,女人av天堂

當前位置:主頁 > 科技論文 > 自動化論文 >

基于循環(huán)神經網絡的中文人名識別的研究

發(fā)布時間:2018-05-20 10:19

  本文選題:中文人名識別 + 詞向量 ; 參考:《大連理工大學》2016年碩士論文


【摘要】:中文人名識別任務是中文信息處理領域中的基礎任務,其性能的好壞將直接影響到其他任務的性能。中文人名的隨意性使其在未登錄詞中占有較大的比重,解決未登錄詞識別問題首先要解決人名識別問題。因此,解決中文人名識別問題具有重要的意義,F有基于統(tǒng)計的中文人名識別方法存在特征選取復雜和人工干預等問題,針對這些問題,本文提出了一種基于循環(huán)神經網絡(Recurrent Neural Networks)的中文人名識別方法,該方法僅采用詞向量作為模型的特征且無需人工干預,有效降低了特征選取的復雜性和人工干預對實驗造成的影響。此外,詞向量可以通過大量未標注的中文數據訓練獲得,然后將蘊含豐富語義信息的詞向量作為循環(huán)神經網絡模型的輸入,可以使模型學習到更多的信息,提升模型的性能。本文將模型分為兩個階段:模型構建階段和后處理階段。在模型構建階段,我們將重點放在詞向量的優(yōu)化策略上。針對詞向量的優(yōu)化問題,本文提出了三種策略:(1)將word2vec訓練得到的詞向量替換循環(huán)神經網絡模型的隨機初始詞向量(2)對詞向量訓練語料進行數詞泛化操作(3)改進word2vec模型,將特征信息融入詞向量實驗結果表明,通過詞向量的優(yōu)化操作,中文人名識別模型的F值提高了2.23%。在后處理階段,通過上下文規(guī)則對候選人名進行過濾;采用基于篇章的全局擴散操作召回在某一位置由于信息不足識別不出而在其他位置能夠被識別的人名;使用基于篇章的局部擴散操作識別篇章信息中有名無姓或者有姓無名的人名。實驗結果表明,通過規(guī)則過濾和擴散操作,中文人名識別模型的F值提高了4.74%。
[Abstract]:The task of Chinese name recognition is the basic task in the field of Chinese information processing, and its performance will directly affect the performance of other tasks. The randomness of Chinese names makes them occupy a large proportion in unrecorded words. To solve the problem of unrecorded words recognition, we must first solve the problem of personal name recognition. Therefore, it is of great significance to solve the problem of Chinese name recognition. The existing Chinese name recognition methods based on statistics have the problems of complex feature selection and artificial intervention. In view of these problems, this paper proposes a Chinese name recognition method based on cyclic neural network (Recurrent Neural Network). This method only uses word vector as the feature of the model and does not need human intervention, which effectively reduces the complexity of feature selection and the influence of artificial intervention on the experiment. In addition, the word vector can be obtained through a large number of unlabeled Chinese data training, and then the word vector with rich semantic information can be used as the input of the cyclic neural network model, so that the model can learn more information and improve the performance of the model. This paper divides the model into two stages: model construction stage and post-processing phase. In the stage of model construction, we focus on the optimization strategy of word vector. To solve the problem of word vector optimization, this paper proposes three strategies: 1) the word vector is replaced by the random initial word vector of the neural network model, which is trained by word2vec, and the random initial word vector is used to generalize the word vector training corpus. (3) the word2vec model is improved. The experimental results show that the F value of the Chinese name recognition model is increased by 2.233 by the optimization of the word vector. In the post-processing stage, the candidate's name is filtered by contextual rules, and the text based global diffusion operation is used to recall the names of people who can be recognized in other places because of the lack of information. A text-based local diffusion operation is used to identify a person with no or no name in the text information. The experimental results show that the F value of the Chinese name recognition model is increased by 4.74 by regular filtering and diffusion operation.
【學位授予單位】:大連理工大學
【學位級別】:碩士
【學位授予年份】:2016
【分類號】:TP391.1;TP183

【參考文獻】

相關期刊論文 前10條

1 王s,

本文編號:1914230


資料下載
論文發(fā)表

本文鏈接:http://www.wukwdryxk.cn/kejilunwen/zidonghuakongzhilunwen/1914230.html


Copyright(c)文論論文網All Rights Reserved | 網站地圖 |

版權申明:資料由用戶f64c7***提供,本站僅收錄摘要或目錄,作者需要刪除請E-mail郵箱bigeng88@qq.com
搡老熟女国产| 67pao国产成视频| 麻豆chinese新婚xxx| 国产精品香蕉在线观看网| 亚洲精品xxx| 人妻无码ΑV中文字幕久久| 国产二区交换配乱婬| 婷婷大香蕉| 第一次处破女18分钟高清| 亚洲图片| 中文字幕av无码不卡免费| 中文字幕乱码一区二区三区免费 | 色婷婷av一区二区三区仙踪林| 99国产在线国语精品| 天天躁日日躁狠狠躁AV中文| 91久久综合亚洲鲁鲁五月天| 亚洲人成网站999久久久综合| 97se亚洲国产综合自在线尤物| 亚洲一区色| 中文字幕AV不卡电影网| 日韩精品一区二区三区中文 | 中文一区| 亚洲男女内射在线播放| 忘忧草日本在线播放www| 一本一道久久a久久精品蜜桃 | 亚洲色偷偷色噜噜狠狠99网| 亚洲欧美自偷自拍视频图片| 国产人成无码视频在线| 草草影院国产| 国产精品自在欧美一区| 亚洲精品无码专区在线播放| 51ⅴ精品国产91久久久久久| 99精品一区二区三区无码吞精| 无码熟妇人妻av| 一本加勒比HEZYO中文无码| 人妻夜夜爽天天天爽欧美色院 | 精品一区国产| 国产精品白浆| 亚洲熟妇无码久久精品| 欧美牲交a欧美牲交AⅤ| 久久精品国产亚洲AV麻豆不卡|