期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Improve Data Quality by Processing Null Values and Semantic Dependencies
1
作者 Houda Zaidi faouzi boufarès Yann Pollet 《Journal of Computer and Communications》 2016年第5期78-85,共8页
Today, the quantity of data continues to increase, furthermore, the data are heterogeneous, from multiple sources (structured, semi-structured and unstructured) and with different levels of quality. Therefore, it is v... Today, the quantity of data continues to increase, furthermore, the data are heterogeneous, from multiple sources (structured, semi-structured and unstructured) and with different levels of quality. Therefore, it is very likely to manipulate data without knowledge about their structures and their semantics. In fact, the meta-data may be insufficient or totally absent. Data Anomalies may be due to the poverty of their semantic descriptions, or even the absence of their description. In this paper, we propose an approach to better understand the semantics and the structure of the data. Our approach helps to correct automatically the intra-column anomalies and the inter-col- umns ones. We aim to improve the quality of data by processing the null values and the semantic dependencies between columns. 展开更多
关键词 Data Quality Big Data Contextual Semantics Semantic Dependencies Functional Dependencies Null Values Data Cleaning
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部