基于小样本置信区间的众包答案决策方法

Truth Inference Based on Confidence Interval of Small Samples in Crowdsourcing

下载PDF

导出

摘要众包工人的水平良莠不齐,质量控制是众包面临的挑战之一。目前的研究大多通过评估工人质量来保证最终答案的有效性,但是常常忽略众包任务中普遍存在的长尾现象。因此,综合考虑不同任务类型、长尾现象的特点以及工人完成任务的情况,提出构造小样本置信区间来估计工人质量,以解决工人完成任务数量普遍较少情况下的答案决策问题。首先依据黄金标准答案策略对工人质量进行预评估,根据工人质量分布分别对数值型任务和单项选择型任务采用不同的真值初始化方法;然后构造小样本置信区间以准确评估工人质量;最后进行任务答案决策并迭代更新工人质量。为了验证提出方法的有效性,实验在5个真实数据集上进行,与现有方法相比,所提方法能很好地解决长尾现象。特别是在工人完成任务数量普遍较少的情况下,提出的方法在单项选择型任务数据集中的平均准确率高达93%,相比现有方法的最好表现高出16%,且在数值型任务数据集中的MAE值和RMSE值均低于现有方法。 Crowdsourcing is an increasingly important area of computer applications,because it can address problems that difficult for computer to handle alone.For the openness of crowdsourcing,quality control becomes one of the important challenges.In order to ensure the effectiveness of truth inference,current researches leverage answers of trustful workers to infer truths by evalua-ting worker quality generally.However,most existing methods ignore the long-tail phenomena in crowdsourcing,and there is a lack of researches on the truth inference when the number of tasks completed by workers is generally small.Considering the chara cteristics of different task types,long-tail phenomenon and worker answers,this paper constructs the confidence interval of small samples to solve truth inference when the number of tasks completed by workers are generally small.Firstly,worker quality is pre-estimated according to the gold standard answer strategy,and different truth initialization methods are adopted according to the result of pre-estimated.Then,the confidence interval of small samples is constructed to evaluate worker quality accurately.Finally,task truths are inferred and worker quality is updated iteratively.In order to verify the effectiveness of the proposed me-thod,5 real datasets are selected to conduct experiments.Compared with the existing methods,the proposed method can solve the problem of the long tail phenomenon effectively,especially the number of tasks completed by each worker is generally small.The average accuracy of the proposed method for the single-choice tasks is as high as 93%,and higher than 16%of the best perfor-mance of the existing methods.Meanwhile,the values of MAE and RMSE of the proposed method for the numerical tasks are lower than that of the existing methods.

作者张光园王宁 ZHANG Guang-yuan;WANG Ning(School of Computer and Information Technology,Beijing Jiaotong University,Beijing 100044,China)

机构地区北京交通大学计算机与信息技术学院

出处《计算机科学》 CSCD 北大核心 2020年第10期26-31,共6页 Computer Science

基金国家重点研发计划项目(2018YFC0809800)。

关键词众包长尾现象小样本置信区间工人质量估计答案决策 Crowdsourcing Long-tail phenomenon Small sample confidence interval Worker quality estimation Truth inference

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1于千城,於志文,王柱.对抗样本训练图分类器进行模型推理质量评估[J].计算机工程与应用,2020,56(17):142-149. 被引量：2
2余勇.从高校预评估中论体育教学档案管理的重要性探讨[J].文学少年,2020(18):0198-0198.
3李金鹏,鲁志强.高校体育训练中的防伤害策略探析[J].当代体育,2019(24):85-86.
4廖坤艳.被遗忘的文明桶[J].班主任,2020(9):63-63.
5无.新疆公平竞争审查工作将启用第三方评估[J].中国价格监管与反垄断,2020(6):19-19.
6罗月童,吴帅,尹光源,汪涛.数值型关联分析中连续属性的探索式分区方法[J].计算机辅助设计与图形学学报,2020,32(10):1606-1616.
7徐雪松,曾智,邵红燕,杨胜杰,李想.基于个体-协同触发强化学习的多机器人行为决策方法[J].仪器仪表学报,2020(5):66-75. 被引量：11
8杨洋,赵晓冬.第三方监管机制下PPP项目的三边匹配决策模型[J].数学的实践与认识,2020,50(18):20-29. 被引量：2
9张健,江建飞.一种平衡牵引变压器短路阻抗计算与分析[J].变压器,2020,57(9):1-4. 被引量：10
10蔡映云.除了诊断,还需评估才能制定和修改治疗方案[J].上海医药,2020,41(18):3-6.

计算机科学

2020年第10期

浏览历史

内容加载中请稍等...

基于小样本置信区间的众包答案决策方法

相关作者

相关机构

相关主题

浏览历史