期刊文献+

COSSETS+: Crowdsourced Missing Value Imputation Optimized byKnowledge Base

COSSET+: Crowdsourced Missing Value Imputation Optimized by Knowledge Base
原文传递
导出
摘要 Missing value imputation with crowdsourcing is a novel method in data cleaning to capture missing values that could hardly be filled with automatic approaches. However, the time cost and overhead in crowdsourcing are high. Therefore, we have to reduce cost and guarantee the accuracy of crowdsourced imputation. To achieve the optimization goal, we present COSSET+, a crowdsourced framework optimized by knowledge base. We combine the advantages of both knowledge-based filter and crowdsourcing platform to capture missing values. Since the amount of crowd values will affect the cost of COSSET+, we aim to select partial missing values to be crowdsourced. We prove that the crowd value selection problem is an NP-hard problem and develop an approximation algorithm for this problem. Extensive experimental results demonstrate the efficiency and effectiveness of the proposed approaches. Missing value imputation with crowdsourcing is a novel method in data cleaning to capture missing values that could hardly be filled with automatic approaches. However, the time cost and overhead in crowdsourcing are high. Therefore, we have to reduce cost and guarantee the accuracy of crowdsourced imputation. To achieve the optimization goal, we present COSSET+, a crowdsourced framework optimized by knowledge base. We combine the advantages of both knowledge-based filter and crowdsourcing platform to capture missing values. Since the amount of crowd values will affect the cost of COSSET+, we aim to select partial missing values to be crowdsourced. We prove that the crowd value selection problem is an NP-hard problem and develop an approximation algorithm for this problem. Extensive experimental results demonstrate the efficiency and effectiveness of the proposed approaches.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2017年第5期845-857,共13页 计算机科学技术学报(英文版)
关键词 crowdsourcing missing value IMPUTATION knowledge base OPTIMIZATION crowdsourcing missing value imputation knowledge base optimization
  • 相关文献

参考文献1

二级参考文献8

  • 1Lakshminarayan K,Harp S A,Samad T.Imputation of missing data in industrial databases[J].Applied Intelligence,1999,11:259-275.
  • 2Li K H.Imputation using Markov chains[J].Journal of Statisticalt Comput Simul,1988,30:57-79.
  • 3Little R J,Rubin D B.Statistical analysis with missing data[M].[S.l] :John Wiley and Sons,1987.
  • 4Gustavo E A,Batista P A,Monard M C.An analysis of four missing data treatment methods for supervised learning[J].Applied Artificial Intelligence,2003,17(5/6):519-533.
  • 5Huang C,Lee H.A grey-based nearegt neighbor approach for missing attribute value prediction[J].Applied Artificial Intelligence,2004,20(3):239-252.
  • 6Hruschka E R,Jr,Ebecken N F F.Missing values prediction with K2[J].Intelligent Data Allalysis,2002,6(6):557-566.
  • 7Petersen B,Winther O,Hansen L K.On the slow convergence of EM and VBEM in low-noise linear models[J].Neural Computation,2005,17(9):1921-1926.
  • 8Stanimirova I,Duszykowski M,Walczak B.Dealing with missing values and outliers in principle component analysis[J].Tanlonta,2007,72:172-178.

共引文献20

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部