期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Truth Discovery on Inconsistent Relational Data
1
作者 Jizhou Sun Jianzhong Li +1 位作者 Hong Gao Hongzhi Wang 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2018年第3期288-302,共15页
In this era of big data, data are often collected from multiple sources that have different reliabilities, and there is inevitable conflict with respect to the various information obtained when it relates to the the s... In this era of big data, data are often collected from multiple sources that have different reliabilities, and there is inevitable conflict with respect to the various information obtained when it relates to the the same object.One important task is to identify the most trustworthy value out of all the conflicting claims, and this is known as truth discovery. Existing truth discovery methods simultaneously identify the most trustworthy information and source reliability degrees and are based on the idea that more reliable sources often provide more trustworthy information,and vice versa. However, there are often semantic constrains defined upon relational database, which can be violated by a single data source. To remove violations, an important task is to repair data to satisfy the constrains,and this is known as data cleaning. The two problems above may coexist, but considering them together can provide some benefits, and to the authors knowledge, this has not yet been the focus of any research. In this paper, therefore, a schema-decomposing based method is proposed to simultaneously discover the truth and to clean the data, with the aim of improving accuracy. Experimental results using real world data sets of notebooks and mobile phones, as well as simulated data sets, demonstrate the effectiveness and efficiency of our proposed method. 展开更多
关键词 inconsistent data truth discovery data cleaning
原文传递
Integrate inconsistent and heterogeneous data based on user feedback
2
作者 Lihua Lu Hengzhen Zhang Xiao-Zhi Gao 《International Journal of Intelligent Computing and Cybernetics》 EI 2015年第2期187-203,共17页
Purpose–Data integration is to combine data residing at different sources and to provide the users with a unified interface of these data.An important issue on data integration is the existence of conflicts among the... Purpose–Data integration is to combine data residing at different sources and to provide the users with a unified interface of these data.An important issue on data integration is the existence of conflicts among the different data sources.Data sources may conflict with each other at data level,which is defined as data inconsistency.The purpose of this paper is to aim at this problem and propose a solution for data inconsistency in data integration.Design/methodology/approach–A relational data model extended with data source quality criteria is first defined.Then based on the proposed data model,a data inconsistency solution strategy is provided.To accomplish the strategy,fuzzy multi-attribute decision-making(MADM)approach based on data source quality criteria is applied to obtain the results.Finally,users feedbacks strategies are proposed to optimize the result of fuzzy MADM approach as the final data inconsistent solution.Findings–To evaluate the proposed method,the data obtained from the sensors are extracted.Some experiments are designed and performed to explain the effectiveness of the proposed strategy.The results substantiate that the solution has a better performance than the other methods on correctness,time cost and stability indicators.Practical implications–Since the inconsistent data collected from the sensors are pervasive,the proposed method can solve this problem and correct the wrong choice to some extent.Originality/value–In this paper,for the first time the authors study the effect of users feedbacks on integration results aiming at the inconsistent data. 展开更多
关键词 Decision making data fusion data inconsistency data integration User feedback
原文传递
A Solution of Data Inconsistencies in Data Integration——Designed for Pervasive Computing Environment 被引量:1
3
作者 王欣 黄林鹏 +2 位作者 章义 徐小辉 陈俊清 《Journal of Computer Science & Technology》 SCIE EI CSCD 2010年第3期499-508,共10页
New challenges including how to share information on heterogeneous devices appear in data-intensive pervasive computing environments. Data integration is a practical approach to these applications. Dealing with incons... New challenges including how to share information on heterogeneous devices appear in data-intensive pervasive computing environments. Data integration is a practical approach to these applications. Dealing with inconsistencies is one of the important problems in data integration. In this paper we motivate the problem of data inconsistency solution for data integration in pervasive environments. We define data qualit~ criteria and expense quality criteria for data sources to solve data inconsistency. In our solution, firstly, data sources needing high expense to obtain data from them are discarded by using expense quality criteria and utility function. Since it is difficult to obtain the actual quality of data sources in pervasive computing environment, we introduce fuzzy multi-attribute group decision making approach to selecting the appropriate data sources. The experimental results show that our solution has ideal effectiveness. 展开更多
关键词 pervasive computing data integration data inconsistency group decision making history credibility
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部