3,284 articles found
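An Information Fusion Framework for Construction Industry E-Commerce Based on BIM and Linked Data (cited by: 4)
1
Authors: 李忠富, 何丹丹, 姜韶华. 《工程管理学报》, 2017, No. 5, pp. 7-12 (6 pages)
Information on construction industry e-commerce platforms is massive, heterogeneous, and scattered, which has hindered the development of e-commerce in the construction industry. Building on an analysis of the information characteristics of existing e-commerce platforms and a study of how BIM and Linked Data are applied to information integration, the paper proposes an information fusion framework for construction industry e-commerce based on BIM and Linked Data. The framework uses BIM as the product information model and uses Linked Data to link supplier, customer, delivery, transaction, and procurement contract information to the BIM product model. This forms a fused information environment and offers an effective approach to information fusion in construction industry e-commerce.
Keywords: construction industry e-commerce; BIM; Linked Data; information integration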
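Research on Public Security Monitoring System Technology Based on Linked Data (cited by: 1)
2
Authors: 李占羽, 吴玥, 李丹宁. 《贵州科学》, 2011, No. 2, pp. 18-21, 35 (5 pages)
Informatization poses unprecedented challenges to the traditional intelligence systems of public security agencies, and it is a major part of upgrading those systems. A police Semantic Web intelligence system provides strong support for extracting and sharing public security intelligence. Using a Semantic Web built on Linked Data technology, the paper analyzes the public security monitoring system for key persons and proposes a workflow for the monitoring system.
Keywords: public security intelligence; Semantic Web based on Linked Data technology; personnel monitoring system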
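Description and Publication of Geographic Framework Data Based on Geo Linked Data
3
Authors: 蒋波涛, 高惠君. 《浙江测绘》, 2012, No. 3, pp. 21-24 (4 pages)
As Web-based geographic information sharing platforms become the mainstream form of information service in the GIS field, deep sharing and integration of geographic information from different data sources has become an important problem. Geospatial Semantic Web technology, through structured data description and publication techniques, offers a way to semantically integrate heterogeneous geospatial data. By presenting the process of converting fundamental geographic framework data into Geo Linked Data, the paper describes the workflows and technologies involved in the entire publication and application process.
Keywords: geospatial Semantic Web; RDF; Geo Linked Data; structured data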
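A Method for Building Topic Models of Linked Data Datasets (cited by: 1)
4
Authors: 刘海池, 王挺, 唐晋韬, 宁洪, 魏登萍, 刘培磊. 《东北师大学报(自然科学版)》 (CAS, CSCD, 北大核心), 2017, No. 2, pp. 77-83 (7 pages)
The paper proposes a method for building topic models of Linked Data datasets. First, the RDF statement triples in a dataset are converted into subject-predicate-object sentences, turning the Linked Data dataset into a text document. Then the LDA algorithm is used to build a topic model over the text documents of all datasets, yielding a topic vector for each dataset; this vector is the feature that describes the topics of the dataset's content. The topic features are then applied in experiments on recommending link targets for Linked Data datasets: the cosine similarity of dataset topic vectors replaces the similarity computation module in a memory-based collaborative filtering recommendation algorithm. The results show that recommendation quality improves substantially over the original collaborative filtering algorithm.
Keywords: Linked Data; dataset; topic model; LDA; recommender system; collaborative filtering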
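The similarity step described in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy topic vectors, and the scoring rule (summing neighbour similarities per candidate target) are assumptions made only for illustration, and the topic vectors are taken as given rather than produced by LDA.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length topic vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # an all-zero vector has no direction; treat as dissimilar
    return dot / (norm_u * norm_v)


def score_link_targets(source, topic_vectors, known_links):
    """Rank candidate link targets for `source` (hypothetical helper).

    Each other dataset "votes" for the targets it already links to, and the
    vote is weighted by the cosine similarity of its topic vector to the
    source's topic vector. Targets the source already links to are skipped.
    """
    already_linked = known_links.get(source, set())
    scores = {}
    for other, targets in known_links.items():
        if other == source:
            continue
        sim = cosine_similarity(topic_vectors[source], topic_vectors[other])
        for target in targets:
            if target == source or target in already_linked:
                continue
            scores[target] = scores.get(target, 0.0) + sim
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy data (assumed): three datasets with 3-topic vectors and known links.
topic_vectors = {
    "A": [0.9, 0.1, 0.0],
    "B": [0.8, 0.2, 0.0],
    "C": [0.0, 0.1, 0.9],
}
known_links = {"A": {"X"}, "B": {"X", "Y"}, "C": {"Z"}}

ranked = score_link_targets("A", topic_vectors, known_links)
print(ranked)
```

In this toy setup dataset B is topically close to A, so B's target Y ranks above Z, which is only linked from the topically distant dataset C.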
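A Cloud Platform for Discovering Associations between Traditional Chinese Medicine and Western Medicine Based on Open Linked Data
5
Authors: 顾珮嵚, 吴朝晖, 陈华钧, 陈曦. 《中国数字医学》, 2014, No. 5, pp. 88-92 (5 pages)
The paper presents BioTCM Cloud, a cloud platform for discovering associations between traditional Chinese medicine (TCM) and Western medicine. The platform is built on a large volume of open Linked Data and is motivated by the need for cross-domain knowledge integration. To cope with massive linked data, a distributed semantic reasoning framework based on MapReduce is proposed to handle knowledge reasoning with domain rules. Taking Chinese herbal medicine as a case study, distributed semantic reasoning can establish associations between TCM and Western medicine, promoting communication and data sharing between the two.
Keywords: open linked data; biomedical informatics; distributed computing; TCM informatization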
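BioPW+: A Linked Data-Based Visualization System for Biological Pathway Data (cited by: 1)
6
Authors: 刘源, 王鑫, 甘瀛, 杨朝洲, 李维熙. 《计算机科学》 (CSCD, 北大核心), 2019, No. 2, pp. 18-23 (6 pages)
Since the Linked Data project was proposed, a large amount of open linked data has been published on the Semantic Web, including many biological pathway datasets. To help biologists use these open datasets effectively, the paper studies Linked Data-based visualization of biological pathway data, proposes a pathway visualization model and display layout scheme, and uses dynamic identifier mapping to support browsing of multi-source pathway data. The result is BioPW+, a Linked Data-based query and visualization system for biological pathway data. The system applies Semantic Web technology: it uses SPARQL queries to locate basic pathway information, then retrieves detailed information on pathway elements through the Open PHACTS platform, and finally presents the pathway data in a Web interface using force-directed and Sankey diagram layouts with a range of interactive operations. Compared with existing pathway tools that rely on a single specific database, BioPW+ is based on Linked Data and can display pathway data from multiple datasets at once, together with related biochemical data, which greatly saves time and improves data completeness.
Keywords: Linked Data; biological pathway; visualization; Semantic Web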
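A Practical Guide to Empowering Bibliographic Data with Linked Open Data: A Review of Linked Open Data Enabled Bibliographical Data (LODE-BD) 3.0
7
Authors: 安璐, 邵琦. 《科技情报研究》, 2022, No. 2, pp. 95-98 (4 pages)
[Purpose/Significance] To evaluate the scholarly contribution of the book Linked Open Data Enabled Bibliographical Data (LODE-BD) 3.0 to empowering bibliographic data with linked open data, and to help readers acquire skills in applying linked open data. [Method/Process] The review explains the purpose behind compiling this linked open data application guide, interprets the practical recommendations of LODE-BD, and considers how to represent bibliographic data as linked open data so as to help users gain open access to relevant bibliographic resources and interconnect those resources. [Result/Conclusion] The book is a mature operational guide to choosing appropriate encoding strategies for producing bibliographic data enabled by linked open data, with substantial theoretical value, methodological guidance, and practical significance.
Keywords: linked open data; bibliographic data; metadata; book review; practical guide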
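LinkMF: A Collaborative Filtering Recommendation Algorithm Incorporating Linked Data
8
Authors: 黄山山, 马军, 郭磊, 王帅强. 《中文信息学报》 (CSCD, 北大核心), 2016, No. 1, pp. 85-92 (8 pages)
Collaborative filtering (CF) is one of the most widely used recommendation algorithms in recommender systems, but data sparsity and cold start are its two main challenges. Because Linked Data integrates rich, structured features about entities, it can serve as an additional information source to mitigate both. In this paper, we propose for the first time to improve CF recommendation by incorporating Linked Data, and introduce LinkMF, a new CF model based on matrix factorization (MF) that uses Linked Data to alleviate data sparsity and cold start while maintaining recommendation accuracy. First, we extract feature representations of items from Linked Data and model the items; next, we propose a new similarity measure to compute item similarity; finally, we use item similarity to constrain and guide the MF factorization process that produces recommendations. Extensive experiments on the MovieLens and YAGO benchmark datasets show that LinkMF outperforms several existing CF methods and is particularly effective at alleviating data sparsity and cold start.
Keywords: recommender system; matrix factorization; Linked Data; data sparsity; cold start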
FAIR + FIT: Guiding Principles and Functional Metrics for Linked Open Data (LOD) KOS Products (cited by: 1)
9
Authors: Marcia Lei Zeng, Julaine Clunis. 《Journal of Data and Information Science》 (CSCD), 2020, No. 1, pp. 93-118 (26 pages)
Purpose: To develop a set of metrics and identify criteria for assessing the functionality of LOD KOS products while providing common guiding principles that can be used by LOD KOS producers and users to maximize the functions and usages of LOD KOS products. Design/methodology/approach: Data collection and analysis were conducted at three time periods in 2015–16, 2017 and 2019. The sample data used in the comprehensive data analysis comprises all datasets tagged as types of KOS in the Datahub and extracted through their respective SPARQL endpoints. A comparative study of the LOD KOS collected from terminology services Linked Open Vocabularies (LOV) and BioPortal was also performed. Findings: The study proposes a set of Functional, Impactful and Transformable (FIT) metrics for LOD KOS as value vocabularies. The FAIR principles, with additional recommendations, are presented for LOD KOS as open data. Research limitations: The metrics need to be further tested and aligned with the best practices and international standards of both open data and various types of KOS. Practical implications: Assessments performed with FAIR and FIT metrics support the creation and delivery of user-friendly, discoverable and interoperable LOD KOS datasets which can be used for innovative applications, act as a knowledge base, become a foundation of semantic analysis and entity extractions and enhance research in science and the humanities. Originality/value: Our research provides best practice guidelines for LOD KOS as value vocabularies.
Keywords: Knowledge Organization Systems; Linked Open Data; FAIR; FIT; Semantic Web
Linked Data Based Framework for Tourism Decision Support System: Case Study of Chinese Tourists in Switzerland
10
Authors: Zhan Liu, Anne Le Calvé, Fabian Cretton, Nicole Glassey Balet, Maria Sokhn, Nicolas Délétroz. 《Journal of Computer and Communications》, 2015, No. 5, pp. 118-126 (9 pages)
Switzerland is one of the most desirable European destinations for Chinese tourists; therefore, a better understanding of Chinese tourists is essential for successful business practices. In China, the largest and leading social media platform, Sina Weibo (a hybrid of Twitter and Facebook), has more than 600 million users. Weibo's great market penetration suggests that tourism operators and markets need to understand how to build effective and sustainable communications on Chinese social media platforms. In order to offer a better decision support platform to tourism destination managers as well as Chinese tourists, we propose a framework using linked data on Sina Weibo. Linked Data is a term referring to using the Internet to connect related data. We show how it can be used and how an ontology can be designed to include the users' context (e.g., GPS locations). Our framework provides a good theoretical foundation for further understanding Chinese tourists' expectations, experiences, behaviors and new trends in Switzerland.
Keywords: Linked Data; Semantic Web; Decision Support System; Natural Language Processing; Behaviors Analysis; Social Networks; Chinese Tourist; Switzerland; New Trends; Sina Weibo
The Method of Data Access for Nuclear Instrument Based on Linked List
11
Authors: Zhi Liu, Rui Li, Hong Huang, Yi Cheng, Xiaoping Yu. 《Journal of Computer and Communications》, 2016, No. 7, pp. 1-6 (6 pages)
A new method of data access is introduced that can effectively resolve the problem of high-speed, real-time reading of nuclear instrument data in a small storage space. This method applies the "linked list" data storage mode to a Micro Control Unit (MCU) system and realizes pointer access to nuclear data in the small storage space of the MCU. Experimental results show that this method can solve some problems of the traditional data storage method, with the advantages of simple program design, stable performance, accurate data, strong repeatability, savings in storage space and so on.
Keywords: Linked List; Data Storage Method; Nuclear Instrument
Linked Data-based Slide Repository: The Episodic Slide Retrieval Using the Episodic Keyword Networks
12
Authors: Tomohiro Iwasa, Yudai Kato, Shun Shiramatsu, Tadachika Ozono, Toramatsu Shintani. 《Journal of Control Science and Engineering》, 2016, No. 1, pp. 36-49 (14 pages)
This paper focuses on developing a system that allows presentation authors to effectively retrieve presentation slides for reuse from a large volume of existing presentation materials. We assume episodic memories of the authors can be used as contextual keywords in query expressions to efficiently dig out the expected slides for reuse, rather than using only keyword queries based on parts of slide descriptions. As a system, a new slide repository is proposed, composed of slide material collections, slide content data and pieces of information from authors' episodic memories related to each slide and presentation, together with a slide retrieval application enabling authors to use the episodic memories as part of queries. The result of our experiment shows that queries using episodic memories can give more discoverability than keyword-based queries. Additionally, an improvement model for slide retrieval is discussed that aims at further slide-finding efficiency by expanding the episodic memories model in the repository to take in links to author- and slide-related data and events that have been posted on private and social media sites.
Keywords: Slide Retrieval; Linked Data-based Slide Repository; Episodic Keyword Networks; Linked Data; episodic memories; social media; life event
Studies on Novel Anti-jamming Technique of Unmanned Aerial Vehicle Data Link (cited by: 7)
13
Authors: 黄文准, 王永生, 叶向阳. 《Chinese Journal of Aeronautics》 (SCIE, EI, CAS, CSCD), 2008, No. 2, pp. 141-148 (8 pages)
Based on M-ary spread spectrum (M-ary-SS), direct sequence spread spectrum (DS-SS), and orthogonal frequency division multiplexing (OFDM), a novel anti-jamming scheme, named orthogonal code time division multi-subchannels spread spectrum modulation (OC-TDMSCSSM), is proposed to enhance the anti-jamming ability of the unmanned aerial vehicle (UAV) data link. The anti-jamming system with its mathematical model is presented first, and then the signal formats of the transmitter and receiver are derived. The receiver's bit error rate (BER) is demonstrated and anti-jamming performance analysis is carried out in an additive white Gaussian noise (AWGN) channel. Theoretical research and simulation results show that the anti-jamming performance of the proposed scheme is better than that of the hybrid direct sequence frequency hopping spread spectrum (DS/FH SS) system. The jamming margin of the OC-TDMSCSSM system is 5 dB higher than that of the DS/FH SS system under the condition of a Rician channel and full-band jamming, and 6 dB higher under the condition of a Rician channel and partial-band jamming.
Keywords: UAV data link; OC-TDMSCSSM; SystemView simulation; orthogonal code; anti-jamming
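The International System of Policies and Regulations for Scientific Data Sharing: The Institutional Foundation of Linked Science (cited by: 25)
14
Authors: 唐义, 张晓蒙, 郑燃. 《图书情报知识》 (CSSCI, 北大核心), 2013, No. 3, pp. 67-73 (7 pages)
With the development of linked data and related technologies, the mode of scientific research is shifting from e-Science to Linked Science. Linked Science is an approach to interconnecting research assets that promotes transparent, reproducible, and interdisciplinary research. One of its prerequisites is the sharing of scientific resources and data. To promote scientific data sharing, actors at different levels internationally have formulated relevant policies and regulations. These actors include international organizations and national governments, research funding agencies, and journal publishers and research institutions, whose policies and regulations form the macro, meso, and micro levels, respectively, of the policy and regulation system for scientific data sharing. This system lays the institutional foundation for Linked Science.
Keywords: scientific data sharing; policy and regulation system; Linked Science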
Simulation of Dynamic Electromagnetic Interference Environment for Unmanned Aerial Vehicle Data Link (cited by: 10)
15
Authors: 郭淑霞, 董中要, 胡占涛, 胡楚峰. 《China Communications》 (SCIE, CSCD), 2013, No. 7, pp. 19-28 (10 pages)
In order to test the anti-interference ability of an Unmanned Aerial Vehicle (UAV) data link in a complex electromagnetic environment, a method for simulating the dynamic electromagnetic interference of an indoor wireless environment is proposed. This method can estimate the relational degree between the actual face of a UAV data link in an interface environment and the simulation scenarios in an anechoic chamber by using the Grey Relational Analysis (GRA) theory. The dynamic drive of the microwave instrument produces a real-time corresponding interference signal and realises scene mapping. The experimental results show that the maximal correlation between the interference signal in the real scene and the angular domain of the radiation antenna in the anechoic chamber is 0.9593. Further, the relational degree of the Signal-to-Interference Ratio (SIR) of the UAV at its reception terminal indoors and in the anechoic chamber is 0.9968, and the time of instrument drive is only approximately 10 μs. All of the above illustrates that this method can achieve a simulation close to a real field dynamic electromagnetic interference signal of an indoor UAV data link.
Keywords: UAV data link; dynamic electromagnetic interference; GRA; relational degree; scene mapping; instrument driver
Blockchain-based data transmission control for Tactical Data Link (cited by: 5)
16
Authors: Wei Feng, Yafeng Li, Xuetao Yang, Zheng Yan, Liang Chen. 《Digital Communications and Networks》 (SCIE, CSCD), 2021, No. 3, pp. 285-294 (10 pages)
Tactical Data Link (TDL) is a communication system that utilizes a particular message format and a protocol to transmit data via wireless channels in an instant, automatic, and secure way. So far, TDL has shown its excellence in military applications. Current TDL adopts a distributed architecture to enhance anti-destruction capacity. However, it still faces a problem of data inconsistency and thus cannot well support cooperation across multiple military domains. To tackle this problem, we propose to leverage blockchain to build an automatic and adaptive data transmission control scheme for TDL. It achieves automatic data transmission and realizes information consistency among different TDL entities. Besides, applying smart contracts based on blockchain further enables adjusting data transmission policies automatically. Security analysis and experimental results based on simulations illustrate the effectiveness and efficiency of our proposed scheme.
Keywords: Blockchain; Tactical data link; Consensus; Data transmission control
Cognitive Congestion Control for Data Portals with Variable Link Capacity (cited by: 2)
17
Authors: Ershad Sharifahmadian, Shahram Latifi. 《International Journal of Communications, Network and System Sciences》, 2012, No. 8, pp. 481-489 (9 pages)
Network congestion, one of the challenging tasks in communication networks, leads to queuing delays, packet loss, or the blocking of new connections. In this study, a data portal is considered as an application-based network, and a cognitive method is proposed to deal with congestion in this kind of network. Unlike previous methods for congestion control, the proposed method is an effective approach for congestion control when the link capacity and information inquiries are unknown or variable. Using sufficient training samples and the current value of the network parameters, available bandwidth is adjusted to distribute the bandwidth among the active flows. The proposed cognitive method was tested under such situations as unexpected variations in link capacity and oscillatory behavior of the bandwidth. Based on simulation results, the proposed method is capable of adjusting the available bandwidth by tuning the queue length, and provides a stable queue in the network.
Keywords: Available Bandwidth; Cognitive System; Data Portal; Network Congestion; Queue Length; Variable Link Capacity
Model Based Data Transmission: Analysis of Link Budget Requirement Reduction (cited by: 1)
18
Authors: Jeremy Straub. 《Communications and Network》, 2012, No. 4, pp. 278-287 (10 pages)
Communications capability can be a significant constraint on the utility of a spacecraft. While conventionally enhanced through the use of a larger transmitting or receiving antenna or through augmenting transmission power, communications capability can also be enhanced via incorporating more data in every unit of transmission. Model Based Transmission Reduction (MBTR) increases the mission utility of spacecraft via sending higher-level messages which rely on preshared (or, in some cases, co-transmitted) data. Because of this a priori knowledge, the amount of information contained in an MBTR message significantly exceeds the amount of information in a conventional message. MBTR has multiple levels of operation; the lowest, Model Based Data Transmission (MBDT), utilizes a pre-shared lower-resolution data frame, which is augmented in areas of significant discrepancy with data from the higher-resolution source. MBDT is examined in detail herein, and several approaches to minimizing the required bandwidth for conveying data required to conform to a minimum level of accuracy are considered. Also considered are ways of minimizing transmission requirements when both a model and change data required to attain a desired minimum discrepancy threshold must be transmitted. These possible solutions are compared to alternate transmission techniques including several forms of image compression.
Keywords: Spacecraft Communications; Data Compression; Satellite Communications; Link Budget Reduction; Image Format
Privacy Protection for Big Data Linking using the Identity Correlation Approach (cited by: 1)
19
Authors: Kevin McCormack, Mary Smyth. 《Journal of Statistical Science and Application》, 2017, No. 3, pp. 81-90 (10 pages)
Privacy protection for big data linking is discussed here in relation to the big data linking project of the Central Statistics Office (CSO), Ireland, titled the 'Structure of Earnings Survey - Administrative Data Project' (SESADP). The result of the project was the creation of datasets and statistical outputs for the years 2011 to 2014 to meet Eurostat's annual earnings statistics requirements and the Structure of Earnings Survey (SES) Regulation. Record linking across the Census and various public sector datasets enabled the necessary information to be acquired to meet the Eurostat earnings requirements. However, the risk of statistical disclosure (i.e. identifying an individual on the dataset) is high unless privacy and confidentiality safeguards are built into the data matching process. This paper looks at the three methods of linking records on big datasets employed on the SESADP, and how to anonymise the data to protect the identity of the individuals, where potentially disclosive variables exist.
Keywords: Big Data Linking; Data Matching; Data Privacy; Data Confidentiality; Identity Correlation Approach; Data Disclosure; Data Mining
A Mathematical Solution to String Matching for Big Data Linking (cited by: 1)
20
Authors: Kevin McCormack, Mary Smyth. 《Journal of Statistical Science and Application》, 2017, No. 2, pp. 39-55 (17 pages)
This paper describes how data records can be matched across large datasets using a technique called the Identity Correlation Approach (ICA). The ICA technique is then compared with a string matching exercise. Both the string matching exercise and the ICA technique were employed for a big data project carried out by the CSO. The project was called the SESADP (Structure of Earnings Survey Administrative Data Project) and involved linking the Irish Census dataset 2011 to a large Public Sector Dataset. The ICA technique provides a mathematical tool to link the datasets, and the matching rate for an exact match can be calculated before the matching process begins. Based on the number of variables and the size of the population, the matching rate is calculated in the ICA approach from the MRUI (Matching Rate for Unique Identifier) formula, and false positives are eliminated. No string matching is used in the ICA; therefore names are not required on the dataset, making the data more secure and ensuring confidentiality. The SESADP Project was highly successful using the ICA technique. A comparison of the results using a string matching exercise for the SESADP and the ICA is discussed here.
Keywords: Big Data; Data Linking; Identity Correlation Approach; String Matching; Public Sector Datasets; Data Privacy