Research and Implementation of a One-Cloud, Multi-Screen Travel Search and Price-Comparison System
Keywords: vertical search; Sphinx; preprocessing; LCS binding; search price comparison. Source: China Jiliang University, 2014 master's thesis. Type: degree thesis
[Abstract]: With the rapid growth of China's economy and rising national income, more and more people travel for leisure. As the Internet has developed, a growing number of travelers look for travel information online and retrieve what they need according to their own requirements. However, existing websites often offer incomplete content of uneven quality, and prices differ widely between them, so they do not meet users' needs for travel information. With the explosive growth of online information, it is also hard for users to find the travel information they want in a short time. To address this, this thesis takes vertical search as its foundation and builds a Sphinx engine system on the Linux platform. It studies in depth the crawling, preprocessing, binding, and clustering of data; the architecture of vertical search and of the Sphinx-based vertical search engine; matching algorithms for Chinese word segmentation; real-time index updating; and a mobile client. On this basis it designs and implements a search and price-comparison system based on vertical search.
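The abstract does not say which matching algorithm the thesis uses for Chinese word segmentation. As an illustration only, the sketch below shows forward maximum matching, a classic dictionary-based approach. The function name `forward_max_match`, the sample vocabulary, and the single-character fallback are assumptions made for this example and are not taken from the thesis.

```python
def forward_max_match(text, vocab, max_len=None):
    """Segment text by greedily taking the longest dictionary word at each position.

    Illustrative sketch: the thesis abstract does not specify its matching
    algorithm, so this is a generic forward maximum matching segmenter.
    """
    if max_len is None:
        # Longest word in the dictionary bounds the candidate window.
        max_len = max((len(word) for word in vocab), default=1)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until a word matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted so the scan cannot stall.
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical travel vocabulary used only for this example.
sample_vocab = {"杭州", "西湖", "酒店", "杭州西湖"}
segments = forward_max_match("杭州西湖酒店", sample_vocab)
```

Greedy longest-first matching is simple and fast, but it can mis-segment ambiguous strings, which is one reason production systems often combine it with statistical methods.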
Information crawled from the web contains a large amount of redundant and duplicate data, which slows search and lowers its accuracy. After analyzing and comparing the vertical search architecture and the positioning method of the traditional LCS algorithm, this thesis optimizes the LCS algorithm and applies it to data binding. This removes much of the redundant and duplicate information and thereby substantially improves search speed and accuracy, which verifies the efficiency and feasibility of the algorithm.
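To make the idea of LCS-based binding concrete, here is a minimal sketch. It assumes that LCS stands for longest common subsequence and uses the textbook dynamic-programming algorithm, not the optimized variant developed in the thesis, whose details the abstract does not give. The names `lcs_length`, `lcs_similarity`, `bind_records`, and the 0.8 threshold are hypothetical choices for illustration.

```python
def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence (textbook DP, two rows)."""
    prev = [0] * (len(b) + 1)
    for ch_a in a:
        curr = [0]
        for j, ch_b in enumerate(b, start=1):
            if ch_a == ch_b:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def lcs_similarity(a: str, b: str) -> float:
    """LCS length normalized by the longer string; 1.0 means identical."""
    if not a and not b:
        return 1.0
    return lcs_length(a, b) / max(len(a), len(b))


def bind_records(names, threshold=0.8):
    """Group names whose similarity to a group's first member meets the threshold.

    The threshold is an assumed value, not one reported in the thesis.
    """
    groups = []
    for name in names:
        for group in groups:
            if lcs_similarity(name, group[0]) >= threshold:
                group.append(name)
                break
        else:
            # No existing group is similar enough, so start a new one.
            groups.append([name])
    return groups


# Two spellings of the same hotel from different sites, plus an unrelated one.
listings = [
    "West Lake Hotel Hangzhou",
    "West Lake Hotel, Hangzhou",
    "Grand Canal Inn",
]
bound = bind_records(listings)
```

Once listings that refer to the same hotel or scenic spot are bound into one group, their prices can be shown side by side and the duplicate entries dropped from the index.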
Drawing on an in-depth study of the vertical search architecture, the theory behind the Sphinx engine system, and the system architecture of search-based price comparison, and combining other resources available on the web, the thesis develops a travel search and price-comparison system with Sphinx as its engine. Experiments verify the feasibility of this system.
Because data processing is computationally intensive, and in order to maintain system security and keep the system running smoothly and stably, the database and the Sphinx engine system run on Linux. Finally, a travel search and price-comparison system was developed in the MyEclipse environment as a web application. The system comprises a travel customization module, a scenic spot and hotel search system, a price-comparison module, and a map display module. It provides text descriptions, map display, picture browsing, price comparison, and ticket customization, and it also offers a mobile client. Experiments verify the practicality and feasibility of the system.
[Degree-granting institution]: China Jiliang University
[Degree level]: Master's
[Year degree granted]: 2014
[Classification number]: TP393.092