期刊文献+
共找到119篇文章
< 1 2 6 >
每页显示 20 50 100
A Novel Named Entity Recognition Scheme for Steel E-Commerce Platforms Using a Lite BERT 被引量:1
1
作者 Maojian Chen Xiong Luo +2 位作者 Hailun Shen Ziyang Huang Qiaojuan Peng 《Computer Modeling in Engineering & Sciences》 SCIE EI 2021年第10期47-63,共17页
In the era of big data,E-commerce plays an increasingly important role,and steel E-commerce certainly occupies a positive position.However,it is very difficult to choose satisfactory steel raw materials from diverse s... In the era of big data,E-commerce plays an increasingly important role,and steel E-commerce certainly occupies a positive position.However,it is very difficult to choose satisfactory steel raw materials from diverse steel commodities online on steel E-commerce platforms in the purchase of staffs.In order to improve the efficiency of purchasers searching for commodities on the steel E-commerce platforms,we propose a novel deep learning-based loss function for named entity recognition(NER).Considering the impacts of small sample and imbalanced data,in our NER scheme,the focal loss,the label smoothing,and the cross entropy are incorporated into a lite bidirectional encoder representations from transformers(BERT)model to avoid the over-fitting.Moreover,through the analysis of different classic annotation techniques used to tag data,an ideal one is chosen for the training model in our proposed scheme.Experiments are conducted on Chinese steel E-commerce datasets.The experimental results show that the training time of a lite BERT(ALBERT)-based method is much shorter than that of BERT-based models,while achieving the similar computational performance in terms of metrics precision,recall,and F1 with BERT-based models.Meanwhile,our proposed approach performs much better than that of combining Word2Vec,bidirectional long short-term memory(Bi-LSTM),and conditional random field(CRF)models,in consideration of training time and F1. 展开更多
关键词 Named entity recognition bidirectional encoder representations from transformers steel e-commerce platform annotation technique
下载PDF
Redundancy Elimination in Multi-signature Based Parallel Entity Resolution
2
作者 燕彩蓉 阮文洁 +1 位作者 徐淑华 黄永锋 《Journal of Donghua University(English Edition)》 EI CAS 2017年第4期556-562,共7页
The multi-signature method can improve the accuracy of entity resolution. However,it will bring the redundant computation problem in the parallel processing framework. In this paper,a multisignature based parallel ent... The multi-signature method can improve the accuracy of entity resolution. However,it will bring the redundant computation problem in the parallel processing framework. In this paper,a multisignature based parallel entity resolution method called multi-sig-er is proposed. The method was implemented in MapReduce-based framework which first tagged multiple signatures for each input object and utilized these signatures to generate key-value pairs,then shuffled the pairs to the reduce tasks that are responsible for similarity computation. To improve the performance,two strategies were adopted. One is for pruning the candidate pairs brought by the blocking technique and the other is for eliminating the redundancy according to the transitive property. Both strategies reduce the number of similarity computation without affecting the resolution accuracy. Experimental results on real-world datasets show that the method tends to handle large datasets rather than small datasets,and it is more suitable for complex similarity computation as compared to simple similarity matching. 展开更多
关键词 entity resolution MAPREDUCE blocking technique redundancy elimination
下载PDF
Mixed Attributes Two-Stage-Clustering Entity Resolution
3
作者 LEI Gang 《通讯和计算机(中英文版)》 2015年第6期297-302,共6页
关键词 混合属性 聚类 解析 实体 双级 度量方法 记录信息 比较实验
下载PDF
E-commerce development in rural and remote areas of BRICS countries 被引量:12
4
作者 Karine HAJI 《Journal of Integrative Agriculture》 SCIE CAS CSCD 2021年第4期979-997,共19页
E-commerce plays an essential role in modern trade today.It is expected that e-commerce volume amounted to 29 trillion USD in the world in 2017,and would grow with the spread of the Internet and information and commun... E-commerce plays an essential role in modern trade today.It is expected that e-commerce volume amounted to 29 trillion USD in the world in 2017,and would grow with the spread of the Internet and information and communication technologies(ICTs).Brazil,Russia,India,China and South Africa(BRICS),together with many others,consider e-commerce a means to facilitate rapid,inclusive and sustainable economic growth,improving the living standards and alleviating poverty.This article examines areas for potential cooperation by BRICS countries in e-commerce development across rural and remote areas to fight poverty.It analyses the current state of e-commerce development in rural and remote areas in each of the BRICS countries,including cases of public and private initiatives to support it.The article also defines the opportunities which e-commerce brings to people living in rural and remote areas.Moreover,it evaluates the existing challenges and risks.The article concludes that despite the rapid e-commerce development in BRICS countries,and significant opportunities created,there are still issues of disproportionate e-commerce in varied regions and the lack of BRICS cooperation in this sphere.Based on a comparative and normative in-depth,systematic analysis,the article develops a set of recommendations for deepening BRICS countries'cooperation in the following areas:infrastructure in rural and remote regions;education;consumer protection;online dispute resolution;coordinated policy in the international scene,including representation of BRICS countries in international indexes,such as the Organization of Economic Co-operation and Development(OECD)Digital Services Trade Restrictiveness Index(STRI). 展开更多
关键词 e-commerce BRICS poverty alleviation international cooperation remote and rural areas ICT infrastructure educational cooperation online dispute resolution consumer protection online Digital STRI
下载PDF
基于半监督学习的域适应实体解析算法
5
作者 戴超凡 丁华华 《计算机科学》 CSCD 北大核心 2024年第9期214-222,共9页
实体解析旨在查找两个数据实体是否引用同一实体,是许多自然语言处理任务中的一项基本任务。现有的基于深度学习的实体解析解决方案通常需要大量的标注数据,即使利用预训练的语言模型进行训练,仍然需要数千个标签才能达到令人满意的准... 实体解析旨在查找两个数据实体是否引用同一实体,是许多自然语言处理任务中的一项基本任务。现有的基于深度学习的实体解析解决方案通常需要大量的标注数据,即使利用预训练的语言模型进行训练,仍然需要数千个标签才能达到令人满意的准确性。现实场景中,这些标注数据并不容易获得。针对上述问题,提出了一个基于半监督学习的域适应实体解析模型。首先,在源域上训练一个分类器,然后利用域适应减小源域和目标域的分布差异,同时用数据增强后的目标域软伪标签加入源域迭代训练,从而实现从源域到目标域的知识迁移。在13个来自相同或不同领域的数据集上对所提模型进行了对比实验和消融实验,实验结果表明,与无监督基线模型相比,所提模型在多个数据集上的F1值平均提升了2.84%,9.16%和7.1%;与有监督基线模型相比,所提模型只需要20%~40%的标签就可以达到与有监督学习相当的性能。消融实验进一步证明了所提模型的有效性,其总体上可以获得更好的实体解析结果(相关代码已开源1))。 展开更多
关键词 实体解析 域适应 伪标签 预训练语言模型 数据增强
下载PDF
多元纠纷解决机制视域中行政道歉的实体法律规范研究
6
作者 王晨 《陕西行政学院学报》 2024年第2期77-84,共8页
作为多元纠纷解决机制的重要组成部分,行政道歉能够帮助实现“满足当事人多元需求、促进纠纷实质性化解”的目标。为了更好地实现行政道歉的功能,需要从致歉主体、对象、条件、内容以及后续法律责任等方面予以实体规范。致歉主体分为致... 作为多元纠纷解决机制的重要组成部分,行政道歉能够帮助实现“满足当事人多元需求、促进纠纷实质性化解”的目标。为了更好地实现行政道歉的功能,需要从致歉主体、对象、条件、内容以及后续法律责任等方面予以实体规范。致歉主体分为致歉组织和直接责任人等具体致歉人,致歉对象不限于社会公众和特定行政相对人,致歉条件涵盖“失政”“失德”行为,致歉内容包括分析错误、自责、补救等,后续责任追究成为行政道歉完成的最终标志。 展开更多
关键词 行政道歉 多元纠纷解决机制 实体规范
下载PDF
工业互联网安全知识图谱构建研究综述 被引量:1
7
作者 常钰 王钢 +2 位作者 朱鹏 孔令飞 何京恒 《计算机科学与探索》 CSCD 北大核心 2024年第2期279-300,共22页
工业互联网安全知识图谱能够在丰富安全概念语义关系、提高安全知识库质量和增强安全态势可视化分析能力等方面发挥重要作用,已经成为认知、溯源和防护针对新能源工业控制系统攻击的关键。但是,与通用领域知识图谱构建相比,工业互联网... 工业互联网安全知识图谱能够在丰富安全概念语义关系、提高安全知识库质量和增强安全态势可视化分析能力等方面发挥重要作用,已经成为认知、溯源和防护针对新能源工业控制系统攻击的关键。但是,与通用领域知识图谱构建相比,工业互联网安全知识图谱构建的各个环节仍然存在许多问题,影响了其实际应用效果。介绍了工业互联网安全知识图谱的概念、意义和其与通用知识图谱的区别;概括了工业互联网安全知识图谱本体构建的相关工作及其作用;重点研究了在工业互联网安全背景下,构建知识图谱的三个关键环节,即命名实体识别、关系抽取和共指消解的相关工作。对于每个环节,详细报告了该环节在领域背景下的发展历史和研究现状,深入分析了该环节面临的领域特有挑战,如非连续实体识别问题、候选词提取问题和缺乏领域高质量数据集等,并针对特有挑战展望了该环节未来的研究方向,为进一步提升工业互联网安全知识图谱的质量和实用性,从而更有效地应对新兴威胁和攻击提供借鉴和启示。 展开更多
关键词 工业互联网安全 知识图谱 命名实体识别 关系抽取 共指消解
下载PDF
基于众包的新闻数据实体解析系统
8
作者 田梦璐 方明 +1 位作者 刘琳 朱凯龙 《现代计算机》 2024年第17期103-107,共5页
随着大数据时代的来临,新闻数据的处理和解析成为了一个重要的研究领域。鉴于新闻实体更新速度快等特点,提出了一种基于众包的新闻数据实体解析系统,旨在通过众包模式,结合人工智能技术,实现对新闻数据中实体的高效、准确解析。首先介... 随着大数据时代的来临,新闻数据的处理和解析成为了一个重要的研究领域。鉴于新闻实体更新速度快等特点,提出了一种基于众包的新闻数据实体解析系统,旨在通过众包模式,结合人工智能技术,实现对新闻数据中实体的高效、准确解析。首先介绍了众包和实体解析的相关概念,然后详细阐述了实现方法以及系统的架构设计。实验结果表明,该系统能够有效地提高新闻数据实体解析的准确性和效率。 展开更多
关键词 众包 实体解析 新闻 质量控制
下载PDF
Cross-Modal Entity Resolution for Image and Text Integrating Global and Fine-Grained Joint Attention Mechanism
9
作者 曾志贤 曹建军 +2 位作者 翁年凤 袁震 余旭 《Journal of Shanghai Jiaotong university(Science)》 EI 2023年第6期728-737,共10页
In order to solve the problem that the existing cross-modal entity resolution methods easily ignore the high-level semantic informational correlations between cross-modal data,we propose a novel cross-modal entity res... In order to solve the problem that the existing cross-modal entity resolution methods easily ignore the high-level semantic informational correlations between cross-modal data,we propose a novel cross-modal entity resolution for image and text integrating global and fine-grained joint attention mechanism method.First,we map the cross-modal data to a common embedding space utilizing a feature extraction network.Then,we integrate global joint attention mechanism and fine-grained joint attention mechanism,making the model have the ability to learn the global semantic characteristics and the local fine-grained semantic characteristics of the cross-modal data,which is used to fully exploit the cross-modal semantic correlation and boost the performance of cross-modal entity resolution.Moreover,experiments on Flickr-30K and MS-COCO datasets show that the overall performance of R@sum outperforms by 4.30%and 4.54%compared with 5 state-of-the-art methods,respectively,which can fully demonstrate the superiority of our proposed method. 展开更多
关键词 cross-modal entity resolution joint attention mechanism deep learning feature extraction semantic correlation
原文传递
民航领域突发事件的实体链接方法
10
作者 冯兴杰 彭洲 +1 位作者 张成豪 冯小荣 《计算机应用研究》 CSCD 北大核心 2023年第4期1052-1058,1064,共8页
实体链接的相关研究主要集中于医疗、生物和新闻领域,但在民航领域的研究较少。因此针对民航领域实体链接任务进行了研究,发现在实体链接中存在实体变体多、歧义少等问题。为解决上述问题,提出了一种基于语义推断的实体链接框架以及一... 实体链接的相关研究主要集中于医疗、生物和新闻领域,但在民航领域的研究较少。因此针对民航领域实体链接任务进行了研究,发现在实体链接中存在实体变体多、歧义少等问题。为解决上述问题,提出了一种基于语义推断的实体链接框架以及一种用于增强框架鲁棒性的负采样策略。在民航领域数据集上进行了对比实验,结果表明所提框架链接效果优于现有基准框架,并通过消融实验,验证了负采样策略的有效性。在负采样策略的作用下,该实体链接框架的Acc@top1高达0.875。 展开更多
关键词 民航突发事件 实体链接 实体统一 实体消歧 数据增强
下载PDF
跨模态数据实体分辨研究综述 被引量:2
11
作者 曹建军 聂子博 +2 位作者 郑奇斌 吕国俊 曾志贤 《软件学报》 EI CSCD 北大核心 2023年第12期5822-5847,共26页
实体分辨广泛地存在于数据质量控制、信息检索、数据集成等数据任务中.传统的实体分辨主要面向关系型数据,而随着大数据技术的发展,文本、图像等模态不同的数据大量涌现催生了跨模态数据应用需求,将跨模态数据实体分辨提升为大数据处理... 实体分辨广泛地存在于数据质量控制、信息检索、数据集成等数据任务中.传统的实体分辨主要面向关系型数据,而随着大数据技术的发展,文本、图像等模态不同的数据大量涌现催生了跨模态数据应用需求,将跨模态数据实体分辨提升为大数据处理和分析的基础问题之一.对跨模态实体分辨问题的研究进展进行回顾,首先介绍问题的定义、评价指标;然后,以模态内关系的保持和模态间关系的建立为主线,对现有研究进行总结和梳理;并且,通过在多个公开数据集上对常用方法进行测试,对出现差异的原因和进行分析;最后,总结当前研究仍然存在的问题,并依据这些问题给出未来可能的研究方向. 展开更多
关键词 实体分辨 跨模态数据处理 深度学习 相似性度量
下载PDF
基于上下文共指实体依赖的文档级关系抽取
12
作者 夏正新 苏翀 刘勇 《数据采集与处理》 CSCD 北大核心 2023年第5期1226-1234,共9页
文档级关系提取(Document relationship extraction,DRE)旨在多条句子中识别实体间的关系,而实体可能对应于跨越句子边界的多次提及,其中代词实体提及是因句子之间连接而普遍存在的语法现象,也是影响句子推理的一个重要因素。然而,以往... 文档级关系提取(Document relationship extraction,DRE)旨在多条句子中识别实体间的关系,而实体可能对应于跨越句子边界的多次提及,其中代词实体提及是因句子之间连接而普遍存在的语法现象,也是影响句子推理的一个重要因素。然而,以往的研究大多侧重于普通实体提及之间的关系,却很少关注代词实体提及的共指和关系捕获。本文提出了基于上下文共指实体依赖(Contextual coreference entity dependency,CCED)的文档级关系抽取模型,即通过融合普通实体和代词实体表示来构建共指实体依赖关系的上下文图结构,并在图上进行实体对间的全局交互推理,从而对实体关系的相互依赖进行建模。分别在公共数据集DocRED、DialogRE和MPDD上对CCED模型进行评估,结果显示在DocRED数据集上,与表现最好的基线模型DocuNet-BERT相比,CCED模型在测试集上的Ign F_(1)性能提高0.55%,F_(1)性能提高0.35%。在DialogRE和MPDD数据集上,与表现最好的基线模型COLN相比,CCED模型在DialogRE测试集上的F_(1)性能提高1.02%,在MPDD测试集上的ACC性能提高1.19%。实验结果验证了新模型对于文档级关系抽取的有效性。 展开更多
关键词 关系提取 实体提及 共指消解 图推理 上下文图结构
下载PDF
公安调解职能的现实困境与优化向度 被引量:2
13
作者 马泽红 《辽宁警察学院学报》 2023年第5期36-42,共7页
公安调解是我国公安工作的优良传统,也是矛盾纠纷化解机制的重要组成部分和有效手段,在基层公安工作中发挥着重要作用。公安调解能充分发挥公安机关的先天优势,但也面临公安调解独立性弱化与权限缺位、调解人员素质参差不齐、调解标准... 公安调解是我国公安工作的优良传统,也是矛盾纠纷化解机制的重要组成部分和有效手段,在基层公安工作中发挥着重要作用。公安调解能充分发挥公安机关的先天优势,但也面临公安调解独立性弱化与权限缺位、调解人员素质参差不齐、调解标准化流程未系统建立、实质化解纠纷矛盾困难重重等实际问题。为了使公安调解工作能够健康发展,结合公安工作实际情况,应当从完善法律制度设计压实主体责任、制定标准化调解流程落实监管责任以及加强专业化人才培养实质化解争议等方面探索可落地可执行的具体调解完善方案,以期实现阶段性巩固公安调解理论与长远性拓展公安调解实践的融合。 展开更多
关键词 社会治理 主体责任 协调监管 实质化解
下载PDF
EntityManager: Managing Dirty Data Based on Entity Resolution 被引量:2
14
作者 Xue-Li Liu Hong-Zhi Wang +1 位作者 Jian-Zhong Li Hong Gao 《Journal of Computer Science & Technology》 SCIE EI CSCD 2017年第3期644-662,共19页
Data quality is important in many data-driven applications, such as decision making, data analysis, and data mining. Recent studies focus on data cleaning techniques by deleting or repairing the dirty data, which may ... Data quality is important in many data-driven applications, such as decision making, data analysis, and data mining. Recent studies focus on data cleaning techniques by deleting or repairing the dirty data, which may cause information loss and bring new inconsistencies. To avoid these problems, we propose EntityManager, a general system to manage dirty data without data cleaning. This system takes real-world entity as the basic storage unit and retrieves query results according to the quality requirement of users. The system is able to handle all kinds of inconsistencies recognized by entity resolution. We elaborate the EntityManager system, covering its architecture, data model, and query processing techniques. To process queries efficiently, our system adopts novel indices, similarity operator and query optimization techniques. Finally, we verify the efficiency and effectiveness of this system and present future research challenges. 展开更多
关键词 dirty data entity resolution uncertain attribute query processing query optimization
原文传递
A genetic algorithm based entity resolution approach with active learning 被引量:1
15
作者 Chenchen SUN Derong SHEN +2 位作者 Yue KOU Tiezheng NIE Ge YU 《Frontiers of Computer Science》 SCIE EI CSCD 2017年第1期147-159,共13页
Entity resolution is a key aspect in data quality and data integration, identifying which records correspond to the same real world entity in data sources. Many existing ap- proaches require manually designed match ru... Entity resolution is a key aspect in data quality and data integration, identifying which records correspond to the same real world entity in data sources. Many existing ap- proaches require manually designed match rules to solve the problem, which always needs domain knowledge and is time consuming. We propose a novel genetic algorithm based en- tity resolution approach via active learning. It is able to learn effective match rules by logically combining several different attributes' comparisons with proper thresholds. We use ac- tive learning to reduce manually labeled data and speed up the learning process. The extensive evaluation shows that the proposed approach outperforms the sate-of-the-art entity res- olution approaches in accuracy. 展开更多
关键词 entity resolution genetic algorithm active learning data quality data integration
原文传递
Modeling Topic-Based Human Expertise for Crowd Entity Resolution 被引量:1
16
作者 Sai-Sai Gong Wei Hu +1 位作者 Wei-Yi Ge Yu-Zhong Qu 《Journal of Computer Science & Technology》 SCIE EI CSCD 2018年第6期1204-1218,共15页
Entity resolution (ER) aims to identify whether two entities in an ER task refer to the same real-world thing.Crowd ER uses humans, in addition to machine algorithms, to obtain the truths of ER tasks. However, inacc... Entity resolution (ER) aims to identify whether two entities in an ER task refer to the same real-world thing.Crowd ER uses humans, in addition to machine algorithms, to obtain the truths of ER tasks. However, inaccurate orerroneous results are likely to be generated when humans give unreliable judgments. Previous studies have found thatcorrectly estimating human accuracy or expertise in crowd ER is crucial to truth inference. However, a large number ofthem assume that humans have consistent expertise over all the tasks, and ignore the fact that humans may have variedexpertise on different topics (e.g., music versus sport). In this paper, we deal with crowd ER in the Semantic Web area.We identify multiple topics of ER tasks and model human expertise on different topics. Furthermore, we leverage similartask clustering to enhance the topic modeling and expertise estimation. We propose a probabilistic graphical model thatcomputes ER task similarity, estimates human expertise, and infers the task truths in a unified framework. Our evaluationresults on real-world and synthetic datasets show that, compared with several state-of-the-art approaches, our proposedmodel achieves higher accuracy on the task truth inference and is more consistent with the human real expertise. 展开更多
关键词 entity resolution crowdsourcing HUMAN EXPERTISE TOPIC MODELING task SIMILARITY
原文传递
A Survey on Blocking Technology of Entity Resolution 被引量:1
17
作者 Bo-Han Li Yi Liu +2 位作者 An-Man Zhang Wen-Huan Wang Shuo Wan 《Journal of Computer Science & Technology》 SCIE EI CSCD 2020年第4期769-793,共25页
Entity resolution(ER)is a significant task in data integration,which aims to detect all entity profiles that correspond to the same real-world entity.Due to its inherently quadratic complexity,blocking was proposed to... Entity resolution(ER)is a significant task in data integration,which aims to detect all entity profiles that correspond to the same real-world entity.Due to its inherently quadratic complexity,blocking was proposed to ameliorate ER,and it offers an approximate solution which clusters similar entity profiles into blocks so that it suffices to perform pair-wise comparisons inside each block in order to reduce the computational cost of ER.This paper presents a comprehensive survey on existing blocking technologies.We summarize and analyze all classic blocking methods with emphasis on different blocking construction and optimization techniques.We find that traditional blocking ER methods which depend on the fixed schema may not work in the context of highly heterogeneous information spaces.How to use schema information flexibly is of great significance to efficiently process data with the new features of this era.Machine learning is an important tool for ER,but end-to-end and efficient machine learning methods still need to be explored.We also sum up and provide the most promising trend for future work from the directions of real-time blocking ER,incremental blocking ER,deep learning with ER,etc. 展开更多
关键词 BLOCKING CONSTRUCTION BLOCKING optimization data LINKAGE entity resolution
原文传递
基于主题异构图嵌入的Token粒度实体解析方法
18
作者 初慧琳 申德荣 +2 位作者 窦文周 聂铁铮 寇月 《小型微型计算机系统》 CSCD 北大核心 2023年第7期1398-1404,共7页
实体解析是数据集成、数据挖掘等技术中不可或缺的步骤,其具体任务是查找引用自同一真实世界的实体的数据记录.现有的方法多数是通过计算实体记录的属性相似度来评估是否为同一实体,由于该方法需要预先对齐记录属性,无法适应属性中toke... 实体解析是数据集成、数据挖掘等技术中不可或缺的步骤,其具体任务是查找引用自同一真实世界的实体的数据记录.现有的方法多数是通过计算实体记录的属性相似度来评估是否为同一实体,由于该方法需要预先对齐记录属性,无法适应属性中token误放的情形,也不能有效利用跨属性中tokens的语义和结构信息,影响实体识别准确性.本文提出了一种采用主题异构图嵌入的token粒度的实体解析方法(THGE-ER).在token、属性和记录基础上,利用LDA模型为实体记录添加一个主题层级,并构建了一个由token、属性、记录和主题4类节点组成的主题异构图;采用区分节点类型的异构图嵌入表示方法,并将节点间的语义和结构信息嵌入到token层级的嵌入向量中;进一步结合多层次注意力机制,完成最终的实体解析决策.经过大量的实验证明,本文提出的方法表现出了良好的性能. 展开更多
关键词 实体解析 LDA文档主题模型 异构图 多层注意力机制
下载PDF
基于域分离网络的实体解析迁移方法
19
作者 孙琛琛 许雷 +1 位作者 申德荣 聂铁铮 《湖南大学学报(自然科学版)》 EI CAS CSCD 北大核心 2023年第2期86-94,共9页
实体解析致力于识别多条记录是否描述真实世界相同实体,这是数据清洗和数据集成中的关键问题.近年来,基于深度学习的实体解析广受欢迎,它们需要大量标注数据才能达到较优的效果.然而,在现实场景中,大量高质量标注数据不容易获得.本文提... 实体解析致力于识别多条记录是否描述真实世界相同实体,这是数据清洗和数据集成中的关键问题.近年来,基于深度学习的实体解析广受欢迎,它们需要大量标注数据才能达到较优的效果.然而,在现实场景中,大量高质量标注数据不容易获得.本文提出了一个基于深度迁移学习的实体解析模型,通过域分离网络提取源域和目标域的公共特征,并利用公共特征得到实体解析结果,从而实现从源域到目标域的迁移.实验结果表明,在多个数据集上,本文提出的方法比之前最好的方法在F1度量上最大提高了40%左右.实验证明本文的方法具有更好的表现,并且训练时间更短. 展开更多
关键词 实体解析 域分离网络 变分自编码器 数据集成 迁移学习
下载PDF
基于实体间关系的数据空间实体解析技术
20
作者 祁祥威 《现代计算机》 2023年第15期80-82,共3页
针对数据空间中大量异质数据没有统一的语义,无法进行基于属性值相似度的实体解析任务的问题,提出了从实体间关系进行实体解析的简单方法。通过决策结点和决策关系构建连接图,并通过连通分量算法进行冗余结点的删除和属性的继承。通过... 针对数据空间中大量异质数据没有统一的语义,无法进行基于属性值相似度的实体解析任务的问题,提出了从实体间关系进行实体解析的简单方法。通过决策结点和决策关系构建连接图,并通过连通分量算法进行冗余结点的删除和属性的继承。通过构建的小规模数据集进行了算法的验证。 展开更多
关键词 实体解析 数据空间 实体关系模型 数据清洗
下载PDF
上一页 1 2 6 下一页 到第
使用帮助 返回顶部