华南地质是区域地质学研究的一个热点,针对该研究对象的文献总体呈现增长趋势。本文以“华南地质”和“Geology of South China”为关键词,分别在CNKI(中国知网)和Web of Science数据库进行检索,得到977篇(1977-2020年)和1078篇文献(199...华南地质是区域地质学研究的一个热点,针对该研究对象的文献总体呈现增长趋势。本文以“华南地质”和“Geology of South China”为关键词,分别在CNKI(中国知网)和Web of Science数据库进行检索,得到977篇(1977-2020年)和1078篇文献(1999-2022年),利用CiteSpace软件对这些文献的产出国、作者、研究机构、关键词等方面进行知识图谱可视化分析,以刻画该研究对象的研究现状和发展趋势。CiteSpace是大数据社区发现的重要工具之一,可以对科学文献中的信息进行提取、重构,形成知识图谱,通过对节点和网络进行分析和可视化,发现其中的联系和规律。将CiteSpace应用于华南地质研究对象分析形成的知识图谱显示:中国对于华南地质研究最为深入,发文数量最多,国家之间缺少交流合作;华南地质研究的热点随着时代变化,研究程度呈现由浅到深的特点;作者之间初步形成合作网络,团队内部的作者合作紧密,但团队之间的合作较弱。建议不同研究机构加强合作,对各方的资源进行整合以推动华南地质研究迈向更高水平,并加强可视化技术在研究热点分析方面的应用。展开更多
Privacy protection for big data linking is discussed here in relation to the Central Statistics Office (CSO), Ireland's, big data linking project titled the 'Structure of Earnings Survey - Administrative Data Proj...Privacy protection for big data linking is discussed here in relation to the Central Statistics Office (CSO), Ireland's, big data linking project titled the 'Structure of Earnings Survey - Administrative Data Project' (SESADP). The result of the project was the creation of datasets and statistical outputs for the years 2011 to 2014 to meet Eurostat's annual earnings statistics requirements and the Structure of Earnings Survey (SES) Regulation. Record linking across the Census and various public sector datasets enabled the necessary information to be acquired to meet the Eurostat earnings requirements. However, the risk of statistical disclosure (i.e. identifying an individual on the dataset) is high unless privacy and confidentiality safe-guards are built into the data matching process. This paper looks at the three methods of linking records on big datasets employed on the SESADP, and how to anonymise the data to protect the identity of the individuals, where potentially disclosive variables exist.展开更多
文摘华南地质是区域地质学研究的一个热点,针对该研究对象的文献总体呈现增长趋势。本文以“华南地质”和“Geology of South China”为关键词,分别在CNKI(中国知网)和Web of Science数据库进行检索,得到977篇(1977-2020年)和1078篇文献(1999-2022年),利用CiteSpace软件对这些文献的产出国、作者、研究机构、关键词等方面进行知识图谱可视化分析,以刻画该研究对象的研究现状和发展趋势。CiteSpace是大数据社区发现的重要工具之一,可以对科学文献中的信息进行提取、重构,形成知识图谱,通过对节点和网络进行分析和可视化,发现其中的联系和规律。将CiteSpace应用于华南地质研究对象分析形成的知识图谱显示:中国对于华南地质研究最为深入,发文数量最多,国家之间缺少交流合作;华南地质研究的热点随着时代变化,研究程度呈现由浅到深的特点;作者之间初步形成合作网络,团队内部的作者合作紧密,但团队之间的合作较弱。建议不同研究机构加强合作,对各方的资源进行整合以推动华南地质研究迈向更高水平,并加强可视化技术在研究热点分析方面的应用。
文摘Privacy protection for big data linking is discussed here in relation to the Central Statistics Office (CSO), Ireland's, big data linking project titled the 'Structure of Earnings Survey - Administrative Data Project' (SESADP). The result of the project was the creation of datasets and statistical outputs for the years 2011 to 2014 to meet Eurostat's annual earnings statistics requirements and the Structure of Earnings Survey (SES) Regulation. Record linking across the Census and various public sector datasets enabled the necessary information to be acquired to meet the Eurostat earnings requirements. However, the risk of statistical disclosure (i.e. identifying an individual on the dataset) is high unless privacy and confidentiality safe-guards are built into the data matching process. This paper looks at the three methods of linking records on big datasets employed on the SESADP, and how to anonymise the data to protect the identity of the individuals, where potentially disclosive variables exist.