面向協(xié)同標(biāo)記質(zhì)量的用戶激勵機(jī)制研究
發(fā)布時間:2018-01-31 10:31
本文關(guān)鍵詞: 協(xié)同標(biāo)記 標(biāo)簽質(zhì)量 激勵機(jī)制 出處:《山東大學(xué)》2014年碩士論文 論文類型:學(xué)位論文
【摘要】:協(xié)同標(biāo)記系統(tǒng)是利用眾包機(jī)制實(shí)現(xiàn)網(wǎng)絡(luò)資源管理的代表性應(yīng)用,是展現(xiàn)群體智慧的平臺。它允許眾多用戶對網(wǎng)絡(luò)資源自由標(biāo)記,由此產(chǎn)生的標(biāo)簽數(shù)據(jù)在海量Web資源的搜索、挖掘和推薦中發(fā)揮重要作用。然而,由于用戶標(biāo)記的自由性,在實(shí)際應(yīng)用中,標(biāo)簽數(shù)據(jù)存在不相關(guān)、拼寫錯誤、同義詞、一詞多義等問題,降低了系統(tǒng)資源的標(biāo)記質(zhì)量,成為制約標(biāo)簽應(yīng)用的重要原因。因此,提升系統(tǒng)資源標(biāo)記質(zhì)量成為當(dāng)前協(xié)同標(biāo)記領(lǐng)域研究的熱點(diǎn)。 針對系統(tǒng)資源標(biāo)記質(zhì)量較低的問題,現(xiàn)有工作主要包括標(biāo)簽推薦、基于語義的標(biāo)記以及資源足量標(biāo)記方法。然而,標(biāo)簽推薦的方法容易限制用戶的思維,不利于群體智慧的搜集;基于語義的標(biāo)記方法實(shí)現(xiàn)過程較為復(fù)雜,一定程度上增加了用戶標(biāo)記負(fù)擔(dān)。資源足量標(biāo)記法是用戶為資源添加足夠多標(biāo)記的簡單自然方法,資源獲得足夠數(shù)量的標(biāo)記后,標(biāo)記狀態(tài)趨于穩(wěn)定,穩(wěn)定狀態(tài)的標(biāo)簽信息能夠準(zhǔn)確描述被標(biāo)記資源。但是,實(shí)際中存在少數(shù)資源被過度標(biāo)記而大部分資源標(biāo)記不足的不平衡現(xiàn)象。激勵機(jī)制通過激勵用戶對標(biāo)記不足資源進(jìn)行標(biāo)記能夠改善資源標(biāo)記不平衡現(xiàn)象。但是現(xiàn)有激勵機(jī)制沒有衡量不同用戶的標(biāo)記質(zhì)量,不能區(qū)分具有不同標(biāo)記行為的用戶。針對現(xiàn)有機(jī)制缺乏對用戶標(biāo)記質(zhì)量度量的問題,本文提出了基于用戶標(biāo)記質(zhì)量的動態(tài)激勵機(jī)制PQIM (Post-Quality based dynamic Incentive Mechanism)和實(shí)施方案。具體內(nèi)容如下: 提出用戶標(biāo)記質(zhì)量度量方法。引入資源相對穩(wěn)定標(biāo)簽集合的概念,從資源對應(yīng)標(biāo)簽頻率的高低和種類的多少兩方面給出資源相對穩(wěn)定標(biāo)簽集合的度量標(biāo)準(zhǔn),并給出分段點(diǎn)的概念。資源收到的標(biāo)記數(shù)量接近分段點(diǎn)時,標(biāo)簽集合中的高頻標(biāo)簽和頻次排名較高的標(biāo)簽都趨于穩(wěn)定。在分段點(diǎn)之后,對新來的用戶標(biāo)記,對比資源已收到的相對穩(wěn)定的標(biāo)簽集合,分別從其所含標(biāo)簽的覆蓋率,頻率,標(biāo)記自身大小和其對被標(biāo)記資源穩(wěn)定性的影響上,設(shè)計了適用于資源標(biāo)記初期的基于密集分布和精確密集分布的標(biāo)記質(zhì)量度量方法,以及適用于后期的基于標(biāo)簽覆蓋率和標(biāo)記穩(wěn)定性的標(biāo)記質(zhì)量度量方法。 提出基于用戶標(biāo)記質(zhì)量的動態(tài)激勵機(jī)制。根據(jù)用戶標(biāo)記時間和標(biāo)記質(zhì)量兩方面因素設(shè)定激勵規(guī)則,用戶對資源標(biāo)記的越早,標(biāo)記質(zhì)量越高,獎勵越多。具體實(shí)施中,將用戶標(biāo)記時間與資源的標(biāo)記狀態(tài)關(guān)聯(lián),并結(jié)合本文提出的用戶標(biāo)記質(zhì)量度量方法,設(shè)計基于用戶標(biāo)記質(zhì)量的動態(tài)激勵函數(shù)。設(shè)定用戶所獲獎勵與資源的標(biāo)記狀態(tài)負(fù)相關(guān),與用戶的標(biāo)記質(zhì)量正相關(guān)。最后,從博弈論的角度分析PQIM的有效性。 設(shè)計PQIM系統(tǒng)框架和實(shí)施算法,并采用真實(shí)的數(shù)據(jù)集驗(yàn)證PQIM有效性。一方面分析系統(tǒng)標(biāo)記質(zhì)量與PQIM機(jī)制的關(guān)系。采用最高獎勵優(yōu)先(HA)策略模擬PQIM機(jī)制激勵分配過程,并與現(xiàn)有機(jī)制中的優(yōu)勢策略對比。實(shí)驗(yàn)結(jié)果表明,本文方法不僅能夠在確定的預(yù)算下使系統(tǒng)的標(biāo)記質(zhì)量更優(yōu),而且能夠縮短系統(tǒng)達(dá)到預(yù)期標(biāo)記質(zhì)量的時間。另一方面針對用戶效益,本文依據(jù)歷史數(shù)據(jù)進(jìn)行分析,選擇歷史數(shù)據(jù)中活躍的用戶,分析其在PQIM機(jī)制下不同標(biāo)記時間段的效益分布。同時,對比具有不同標(biāo)記質(zhì)量的用戶效益,實(shí)驗(yàn)結(jié)果顯示,在本文機(jī)制下,用戶更傾向于以較高的質(zhì)量更早對資源添加標(biāo)記。
[Abstract]:Collaborative tagging system is typical applications of cyber source management using Crowdsourcing mechanism, is to show the group intelligence platform. It allows many users to mark cyber source free, resulting in massive Web tag data resource search, play an important role in mining and recommendation. However, due to the freedom of the user mark, in in practical application, there is no relevant data, label spelling errors, synonyms, polysemy and other issues, reduce the system resources marking quality, has become an important reason for restricting the label application. Therefore, improving the system resource marking quality has become a hot topic in the research on Collaborative marker field.
According to the system resources marking quality problems of low, existing work mainly includes the semantic markup tag recommendation, and adequate resources marking method based on tag recommendation method. However, easy to limit the user's thinking, is not conducive to collective intelligence collection; marking method of semantic realization process based on more complex, to a certain extent, increased the burden on the user mark adequate resources. Mark method is a simple and natural method for user resources to add enough marks, resources to obtain sufficient number of markers, marker state tends to be stable, steady state label information can accurately describe labeled resources. However, there is imbalance of minority resources are over mark but most of the resources in practice. The incentive insufficiency the mechanism of marking can improve the imbalance of resource mark insufficiency resources by motivating users. But the existing incentive mechanism Different users do not measure the marking quality, can not distinguish between markers with different behavior of users. Aiming at the lack of mechanism to measure the user mark quality problem, this paper proposes PQIM dynamic incentive mechanism based on the quality of user mark (Post-Quality based dynamic Incentive Mechanism) and the implementation of the program. The specific contents are as follows:
The user mark quality measurement method. By introducing the concept of resources relatively stable set of tags, metrics from a given level of resources and types of resources corresponding to the number of two tag frequency relatively stable label sets, and give the piecewise point concept. Mark number resources received close to the segmentation point, high frequency tag and a set of tags the frequency of higher ranked labels are stable. In the piecewise point, for users to tag new, comparison of resources has received a relatively stable set of tags, respectively from the containing label coverage, frequency, marking its size and its influence on the stability of labeled resources, designed for resources early marker measurement methods and accurate dense dense markers based on the quality metrics, label coverage and quality based on marker marker stability and method for later.
The dynamic incentive mechanism based on user mark quality. According to the two factors of the user mark time and marking quality incentive to set rules, users of the resources labeled earlier, higher quality marks more rewards. The specific implementation, will mark the state of the associated user mark of time and resources, and combined with the quality of the user mark measurement method, design of dynamic excitation function based on the quality of user mark. Set the user awards and resources of the state marked negative correlation, positive correlation with the user's label quality. Finally, the validity analysis of PQIM from the angle of game theory.
The framework and implementation of algorithm design of PQIM system, and validation of PQIM using real data. The relationship between hand marking quality analysis system and PQIM system. The highest award priority (HA) strategy simulation PQIM incentive allocation process, comparative advantage strategy and the existing mechanism. The experimental results show that this method not only can in determining the budget system to mark better quality, and can shorten the system to achieve the desired mark quality time. On the other hand for the user benefit, on the basis of historical data analysis, selection of active users of the history data, analysis of the distribution of benefits in the mechanism of PQIM under different time markers. At the same time, compared with different marking quality user benefits, the experimental results show that in this mechanism, users prefer to add tags to resources to higher quality earlier.
【學(xué)位授予單位】:山東大學(xué)
【學(xué)位級別】:碩士
【學(xué)位授予年份】:2014
【分類號】:TP393.07;TP391.3
【參考文獻(xiàn)】
相關(guān)期刊論文 前1條
1 涂金龍;涂風(fēng)華;;一種綜合標(biāo)簽和時間因素的個性化推薦方法[J];計算機(jī)應(yīng)用研究;2013年04期
,本文編號:1478856
本文鏈接:http://www.wukwdryxk.cn/guanlilunwen/ydhl/1478856.html
最近更新
教材專著