期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
A Top-down Method of Extraction Entity Relationship Triples and Obtaining Annotated Data
1
作者 Zhiqiang Hu Zheng Ma +6 位作者 Jun Shi Zhipeng Li Xun Shao Yangzhao Yang Yong Liao Zhenyuan Gao Jie Zhang 《Journal of Quantum Computing》 2022年第1期13-22,共10页
The extraction of entity relationship triples is very important to build a knowledge graph(KG),meanwhile,various entity relationship extraction algorithms are mostly based on data-driven,especially for the current pop... The extraction of entity relationship triples is very important to build a knowledge graph(KG),meanwhile,various entity relationship extraction algorithms are mostly based on data-driven,especially for the current popular deep learning algorithms.Therefore,obtaining a large number of accurate triples is the key to build a good KG as well as train a good entity relationship extraction algorithm.Because of business requirements,this KG’s application field is determined and the experts’opinions also must be satisfied.Considering these factors we adopt the top-down method which refers to determining the data schema firstly,then filling the specific data according to the schema.The design of data schema is the top-level design of KG,and determining the data schema according to the characteristics of KG is equivalent to determining the scope of data’s collection and the mode of data’s organization.This method is generally suitable for the construction of domain KG.This article proposes a fast and efficient method to extract the topdown type KG’s triples in social media with the help of structured data in the information box on the right side of the related encyclopedia webpage.At the same time,based on the obtained triples,a data labeling method is proposed to obtain sufficiently high-quality training data,using in various Natural Language Processing(NLP)information extraction algorithms’training. 展开更多
关键词 Entity relationship triples knowledge graph TOP-DOWN social media data labeling
下载PDF
Classification framework and semantic labeling for Big Earth Data
2
作者 Juanle Wang Kun Bu +4 位作者 Dongmei Yan Jingyue Wang Bowen Duan Min Zhang Guojin He 《Big Earth Data》 EI CSCD 2023年第3期886-903,共18页
Big Earth Data refers to the multidimensional integration and association of scientific data,including geography,resources,environment,ecology,and biology.An effective data classification system and label management s... Big Earth Data refers to the multidimensional integration and association of scientific data,including geography,resources,environment,ecology,and biology.An effective data classification system and label management strategy are important foundations for long-term management of data resources.The objective of this study was to construct a classification system and realize multidimensional semantic data label management for the Big Earth Data Science Engineering Program(CASEarth).This study constructed two sets of classification and coding systems that realize classification by mapping each other;namely,the geosphere-level and Sustainable Development Goals(SDGs)indicator classifications.This technique was based on natural language processing technology and solved problems with subject-word segmentation,weight calculation,and dynamic matching.A prototype system for classification and label management was constructed based on existing CASEarth datasets of more than 1,100.Furthermore,we expect our study to provide the methodology and technical support for useroriented classification and label management services for Big Earth Data. 展开更多
关键词 Big Earth data CASEarth scientific engineering data classification data labeling data management
原文传递
A Threshold-Control Generative Adversarial Network Method for Intelligent Fault Diagnosis 被引量:2
3
作者 Xinyu Li Sican Cao +1 位作者 Liang Gao Long Wen 《Complex System Modeling and Simulation》 2021年第1期55-64,共10页
Fault diagnosis plays the increasingly vital role to guarantee the machine reliability in the industrial enterprise.Among all the solutions,deep learning(DL)methods have achieved more popularity for their feature extr... Fault diagnosis plays the increasingly vital role to guarantee the machine reliability in the industrial enterprise.Among all the solutions,deep learning(DL)methods have achieved more popularity for their feature extraction ability from the raw historical data.However,the performance of DL relies on the huge amount of labeled data,as it is costly to obtain in the real world as the labeling process for data is usually tagged by hand.To obtain the good performance with limited labeled data,this research proposes a threshold-control generative adversarial network(TCGAN)method.Firstly,the 1D vibration signals are processed to be converted into 2D images,which are used as the input of TCGAN.Secondly,TCGAN would generate pseudo data which have the similar distribution with the limited labeled data.With pseudo data generation,the training dataset can be enlarged and the increase on the labeled data could further promote the performance of TCGAN on fault diagnosis.Thirdly,to mitigate the instability of the generated data,a threshold-control is presented to adjust the relationship between discriminator and generator dynamically and automatically.The proposed TCGAN is validated on the datasets from Case Western Reserve University and Self-Priming Centrifugal Pump.The prediction accuracies with limited labeled data have reached to 99.96%and 99.898%,which are even better than other methods tested under the whole labeled datasets. 展开更多
关键词 generative adversarial network limited labeled data DISCRIMINATOR fault diagnosis
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部