面向移動設(shè)備的WEB頁面分塊算法研究
發(fā)布時間:2018-10-16 09:06
【摘要】: 隨著移動通信的發(fā)展,人們迫切希望能方便地利用手持移動設(shè)備訪問Web網(wǎng)站。由于移動設(shè)備的小屏幕和低帶寬的缺點,使得這一難題一直沒有得到很好的解決。Web網(wǎng)頁是為桌面計算機而設(shè)計的,桌面計算機的屏幕大,翻頁和滾屏十分方便。因而Web頁面的內(nèi)容很豐富。但移動設(shè)備,例如手機,具有內(nèi)存小、上網(wǎng)帶寬低和屏幕小的先天缺點,不可能把一個完整的Web頁面直接映射到移動設(shè)備屏幕上,因而如何在小屏幕設(shè)備上方便地瀏覽Web網(wǎng)頁一直是研究的熱點。 分析在Web信息分塊領(lǐng)域中已有的研究工作,其中包括純HTML文檔的轉(zhuǎn)換以及自然語言的總結(jié)等等。大部分算法都是單純的利用HTML標簽之間的關(guān)系來對Web文檔進行解析,這些分塊算法對固定格式以及規(guī)范標簽用法的頁面來說,分塊效率以及正確率會比較高,但是,現(xiàn)在的大部分Web頁面都是復雜,不規(guī)則以及含有太多噪聲的,例如導航鏈接以及廣告鏈接等等,這時這樣的分塊算法的效果就不是很理想,在此基礎(chǔ)上設(shè)計出兩種適合于移動設(shè)備小屏幕的Web頁面分塊算法,一種是基于DOM TREE的頁面分塊算法,將DOM TREE的葉子結(jié)點看做是不可分割的最小語義塊,然后再根據(jù)結(jié)點之間的語義關(guān)系進行迭代,通過語義塊的提取,分割條的檢測以及權(quán)重的設(shè)置,找到各個頁面分塊,再根據(jù)內(nèi)容結(jié)構(gòu)的重構(gòu)形成適合于手持終端顯示的頁面。另一種算法利用Web網(wǎng)頁上對象的位置信息對信息塊進行逐層聚類,生成一棵網(wǎng)頁分塊樹,再根據(jù)移動設(shè)備屏幕的特點把網(wǎng)頁分塊樹轉(zhuǎn)換成適合小屏幕瀏覽的頁面。 設(shè)計并實現(xiàn)了面向移動設(shè)備的Web頁面分塊算法的小型實驗系統(tǒng),系統(tǒng)能有效地將Web頁面轉(zhuǎn)換成適合小屏幕瀏覽的頁面。
[Abstract]:With the development of mobile communication, people are eager to use handheld mobile devices to visit Web website. Because of the shortcomings of small screen and low bandwidth of mobile devices, this problem has not been solved well. Web web pages are designed for desktop computers, which have large screens and are convenient to flip and scroll. As a result, Web pages are rich in content. But mobile devices, such as mobile phones, have the inherent disadvantages of small memory, low internet bandwidth, and small screens, making it impossible to map a complete Web page directly to a mobile device screen. Therefore, how to easily browse Web pages on small screen devices has been a hot topic. This paper analyzes the existing research work in the field of Web information partitioning, including the transformation of pure HTML documents and the summary of natural languages, and so on. Most algorithms simply use the relationship between HTML tags to parse Web documents. These block algorithms are more efficient and accurate for pages with fixed format and standard tag usage, but, Most of today's Web pages are complex, irregular and noisy, such as navigation links and ad links, and so on. On this basis, two Web page partitioning algorithms suitable for small screen of mobile devices are designed. One is page partitioning algorithm based on DOM TREE, which regards the leaf node of DOM TREE as the indivisible minimum semantic block. Then iterated according to the semantic relationship between nodes, through the extraction of semantic blocks, the detection of segmentation bar and the setting of weights, each page is found into blocks, and then according to the reconstruction of content structure, a page suitable for display by handheld terminals is formed. Another algorithm uses the location information of the objects on the Web web page to cluster the information block layer by layer to generate a web page block tree, and then according to the characteristics of mobile device screen, the web page block tree is converted into a page suitable for small screen browsing. A small experimental system of Web page partitioning algorithm for mobile devices is designed and implemented. The system can effectively transform Web pages into pages suitable for small screen browsing.
【學位授予單位】:華中科技大學
【學位級別】:碩士
【學位授予年份】:2007
【分類號】:TP393.092
本文編號:2273883
[Abstract]:With the development of mobile communication, people are eager to use handheld mobile devices to visit Web website. Because of the shortcomings of small screen and low bandwidth of mobile devices, this problem has not been solved well. Web web pages are designed for desktop computers, which have large screens and are convenient to flip and scroll. As a result, Web pages are rich in content. But mobile devices, such as mobile phones, have the inherent disadvantages of small memory, low internet bandwidth, and small screens, making it impossible to map a complete Web page directly to a mobile device screen. Therefore, how to easily browse Web pages on small screen devices has been a hot topic. This paper analyzes the existing research work in the field of Web information partitioning, including the transformation of pure HTML documents and the summary of natural languages, and so on. Most algorithms simply use the relationship between HTML tags to parse Web documents. These block algorithms are more efficient and accurate for pages with fixed format and standard tag usage, but, Most of today's Web pages are complex, irregular and noisy, such as navigation links and ad links, and so on. On this basis, two Web page partitioning algorithms suitable for small screen of mobile devices are designed. One is page partitioning algorithm based on DOM TREE, which regards the leaf node of DOM TREE as the indivisible minimum semantic block. Then iterated according to the semantic relationship between nodes, through the extraction of semantic blocks, the detection of segmentation bar and the setting of weights, each page is found into blocks, and then according to the reconstruction of content structure, a page suitable for display by handheld terminals is formed. Another algorithm uses the location information of the objects on the Web web page to cluster the information block layer by layer to generate a web page block tree, and then according to the characteristics of mobile device screen, the web page block tree is converted into a page suitable for small screen browsing. A small experimental system of Web page partitioning algorithm for mobile devices is designed and implemented. The system can effectively transform Web pages into pages suitable for small screen browsing.
【學位授予單位】:華中科技大學
【學位級別】:碩士
【學位授予年份】:2007
【分類號】:TP393.092
【引證文獻】
相關(guān)碩士學位論文 前1條
1 于長富;基于頁面分塊的Web檔案構(gòu)建技術(shù)研究[D];東北大學;2009年
,本文編號:2273883
本文鏈接:http://www.wukwdryxk.cn/wenyilunwen/guanggaoshejilunwen/2273883.html
最近更新
教材專著