期刊文献+
共找到1,277篇文章
< 1 2 64 >
每页显示 20 50 100
Physics-informed neural network-based petroleum reservoir simulation with sparse data using domain decomposition
1
作者 Jiang-Xia Han Liang Xue +4 位作者 Yun-Sheng Wei Ya-Dong Qi Jun-Lei Wang Yue-Tian Liu Yu-Qi Zhang 《Petroleum Science》 SCIE EI CAS CSCD 2023年第6期3450-3460,共11页
Recent advances in deep learning have expanded new possibilities for fluid flow simulation in petroleum reservoirs.However,the predominant approach in existing research is to train neural networks using high-fidelity ... Recent advances in deep learning have expanded new possibilities for fluid flow simulation in petroleum reservoirs.However,the predominant approach in existing research is to train neural networks using high-fidelity numerical simulation data.This presents a significant challenge because the sole source of authentic wellbore production data for training is sparse.In response to this challenge,this work introduces a novel architecture called physics-informed neural network based on domain decomposition(PINN-DD),aiming to effectively utilize the sparse production data of wells for reservoir simulation with large-scale systems.To harness the capabilities of physics-informed neural networks(PINNs)in handling small-scale spatial-temporal domain while addressing the challenges of large-scale systems with sparse labeled data,the computational domain is divided into two distinct sub-domains:the well-containing and the well-free sub-domain.Moreover,the two sub-domains and the interface are rigorously constrained by the governing equations,data matching,and boundary conditions.The accuracy of the proposed method is evaluated on two problems,and its performance is compared against state-of-the-art PINNs through numerical analysis as a benchmark.The results demonstrate the superiority of PINN-DD in handling large-scale reservoir simulation with limited data and show its potential to outperform conventional PINNs in such scenarios. 展开更多
关键词 Physical-informed neural networks Fluid flow simulation Sparse data domain decomposition
下载PDF
Enhancing Relational Triple Extraction in Specific Domains:Semantic Enhancement and Synergy of Large Language Models and Small Pre-Trained Language Models
2
作者 Jiakai Li Jianpeng Hu Geng Zhang 《Computers, Materials & Continua》 SCIE EI 2024年第5期2481-2503,共23页
In the process of constructing domain-specific knowledge graphs,the task of relational triple extraction plays a critical role in transforming unstructured text into structured information.Existing relational triple e... In the process of constructing domain-specific knowledge graphs,the task of relational triple extraction plays a critical role in transforming unstructured text into structured information.Existing relational triple extraction models facemultiple challenges when processing domain-specific data,including insufficient utilization of semantic interaction information between entities and relations,difficulties in handling challenging samples,and the scarcity of domain-specific datasets.To address these issues,our study introduces three innovative components:Relation semantic enhancement,data augmentation,and a voting strategy,all designed to significantly improve the model’s performance in tackling domain-specific relational triple extraction tasks.We first propose an innovative attention interaction module.This method significantly enhances the semantic interaction capabilities between entities and relations by integrating semantic information fromrelation labels.Second,we propose a voting strategy that effectively combines the strengths of large languagemodels(LLMs)and fine-tuned small pre-trained language models(SLMs)to reevaluate challenging samples,thereby improving the model’s adaptability in specific domains.Additionally,we explore the use of LLMs for data augmentation,aiming to generate domain-specific datasets to alleviate the scarcity of domain data.Experiments conducted on three domain-specific datasets demonstrate that our model outperforms existing comparative models in several aspects,with F1 scores exceeding the State of the Art models by 2%,1.6%,and 0.6%,respectively,validating the effectiveness and generalizability of our approach. 展开更多
关键词 Relational triple extraction semantic interaction large language models data augmentation specific domains
下载PDF
Incorporating Domain Knowledge into Data Mining Process:An Ontology Based Framework 被引量:5
3
作者 PAN Ding SHEN Jun-yi ZHOU Mu-xin 《Wuhan University Journal of Natural Sciences》 EI CAS 2006年第1期165-169,共5页
With the explosive growth of data available, there is an urgent need to develop continuous data mining which reduces manual interaction evidently. A novel model for data mining is proposed in evolving environment. Fir... With the explosive growth of data available, there is an urgent need to develop continuous data mining which reduces manual interaction evidently. A novel model for data mining is proposed in evolving environment. First, some valid mining task schedules are generated, and then au tonomous and local mining are executed periodically, finally, previous results are merged and refined. The framework based on the model creates a communication mechanism to in corporate domain knowledge into continuous process through ontology service. The local and merge mining are transparent to the end user and heterogeneous data ,source by ontology. Experiments suggest that the framework should be useful in guiding the continuous mining process. 展开更多
关键词 continuous data mining domain knowledge ONTOLOGY FRAMEWORK
下载PDF
Direction-of-arrival estimation based on direct data domain (D3) method 被引量:2
4
作者 Chen Hui Huang Benxiong +1 位作者 Wang Yongliang Hou Yaoqiong 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2009年第3期512-518,共7页
A direction-of-arrival (DOA) estimation algorithm based on direct data domain (D3) approach is presented. This method can accuracy estimate DOA using one snapshot modified data, called the temporal and spatial two... A direction-of-arrival (DOA) estimation algorithm based on direct data domain (D3) approach is presented. This method can accuracy estimate DOA using one snapshot modified data, called the temporal and spatial two-dimensional vector reconstruction (TSR) method. The key idea is to apply the D3 approach which can extract the signal of given frequency but null out other frequency signals in temporal domain. Then the spatial vector reconstruction processing is used to estimate the angle of the spatial coherent signal source based on extract signal data. Compared with the common temporal and spatial processing approach, the TSR method has a lower computational load, higher real-time performance, robustness and angular accuracy of DOA. The proposed algorithm can be directly applied to the phased array radar of coherent pulses. Simulation results demonstrate the performance of the proposed technique. 展开更多
关键词 direction-of-arrival estimation space-time two-dimensional DOA direct data domain de-correlation.
下载PDF
Direct data domain approach to space-time adaptive processing 被引量:2
5
作者 Wen Xiaoqin Han Chongzhao 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2006年第1期59-64,共6页
In non-homogeneous environment, traditional space-time adaptive processing doesn't effectively suppress interference and detect target, because the secondary data don' t exactly reflect the statistical characteristi... In non-homogeneous environment, traditional space-time adaptive processing doesn't effectively suppress interference and detect target, because the secondary data don' t exactly reflect the statistical characteristic of the range cell under test. A ravel methodology utilizing the direct data domain approach to space-time adaptive processing ( STAP ) in airbome radar non-homogeneous environments is presented. The deterministic least squares adaptive signal processing technique operates on a "snapshot-by-snapshot" basis to dethrone the adaptive adaptive weights for nulling interferences and estimating signal of interest (SOI). Furthermore, this approach eliminates the requirement for estimating the covariance through the data of neighboring range cell, which eliminates calculating the inverse of covariance, and can be implemented to operate in real-time. Simulation results illustrate the efficiency of interference suppression in non-homogeneous environment. 展开更多
关键词 space-time adaptive processing direct data domain interference suppression.
下载PDF
Analyzing Cross-domain Transportation Big Data of New York City with Semi-supervised and Active Learning 被引量:4
6
作者 Huiyu Sun Suzanne McIntosh 《Computers, Materials & Continua》 SCIE EI 2018年第10期1-9,共9页
The majority of big data analytics applied to transportation datasets suffer from being too domain-specific,that is,they draw conclusions for a dataset based on analytics on the same dataset.This makes models trained ... The majority of big data analytics applied to transportation datasets suffer from being too domain-specific,that is,they draw conclusions for a dataset based on analytics on the same dataset.This makes models trained from one domain(e.g.taxi data)applies badly to a different domain(e.g.Uber data).To achieve accurate analyses on a new domain,substantial amounts of data must be available,which limits practical applications.To remedy this,we propose to use semi-supervised and active learning of big data to accomplish the domain adaptation task:Selectively choosing a small amount of datapoints from a new domain while achieving comparable performances to using all the datapoints.We choose the New York City(NYC)transportation data of taxi and Uber as our dataset,simulating different domains with 90%as the source data domain for training and the remaining 10%as the target data domain for evaluation.We propose semi-supervised and active learning strategies and apply it to the source domain for selecting datapoints.Experimental results show that our adaptation achieves a comparable performance of using all datapoints while using only a fraction of them,substantially reducing the amount of data required.Our approach has two major advantages:It can make accurate analytics and predictions when big datasets are not available,and even if big datasets are available,our approach chooses the most informative datapoints out of the dataset,making the process much more efficient without having to process huge amounts of data. 展开更多
关键词 Big data taxi and uber domain adaptation active learning semi-supervised learning
下载PDF
A ROBUST PHASE-ONLY DIRECT DATA DOMAIN ALGORITHM BASED ON GENERALIZED RAYLEIGH QUOTIENT OPTIMIZATION USING HYBRID GENETIC ALGORITHM 被引量:2
7
作者 Shao Wei Qian Zuping Yuan Feng 《Journal of Electronics(China)》 2007年第4期560-566,共7页
A robust phase-only Direct Data Domain Least Squares (D3LS) algorithm based on gen- eralized Rayleigh quotient optimization using hybrid Genetic Algorithm (GA) is presented in this letter. The optimization efficiency ... A robust phase-only Direct Data Domain Least Squares (D3LS) algorithm based on gen- eralized Rayleigh quotient optimization using hybrid Genetic Algorithm (GA) is presented in this letter. The optimization efficiency and computational speed are improved via the hybrid GA com- posed of standard GA and Nelder-Mead simplex algorithms. First, the objective function, with a form of generalized Rayleigh quotient, is derived via the standard D3LS algorithm. It is then taken as a fitness function and the unknown phases of all adaptive weights are taken as decision variables. Then, the nonlinear optimization is performed via the hybrid GA to obtain the optimized solution of phase-only adaptive weights. As a phase-only adaptive algorithm, the proposed algorithm is sim- pler than conventional algorithms when it comes to hardware implementation. Moreover, it proc- esses only a single snapshot data as opposed to forming sample covariance matrix and operating matrix inversion. Simulation results show that the proposed algorithm has a good signal recovery and interferences nulling performance, which are superior to that of the phase-only D3LS algorithm based on standard GA. 展开更多
关键词 Generalized Rayleigh quotient Hybrid genetic algorithm Phase-only optimization Direct data domain Least Squares (D^3LS) algorithm Nelder-Mead simplex algorithm
下载PDF
Wavelet domain adaptive filtering algorithm for removing the seamless pipe noise contained in the magnetic flux leakage data 被引量:1
8
作者 韩文花 Que Peiwen 《High Technology Letters》 EI CAS 2006年第2期170-174,共5页
With the widespread application and fast development of gas and oil pipeline network in China, the pipeline inspection technology has been used more extensively. The magnetic flux leakage (MFL) method has establishe... With the widespread application and fast development of gas and oil pipeline network in China, the pipeline inspection technology has been used more extensively. The magnetic flux leakage (MFL) method has established itself as the most widely used in-line inspection technique for the evaluation of gas and oil pipelines. The MFL data obtained from seamless pipeline inspection is usually contaminated by the seamless pipe noise (SPN). SPN can in some cases completely mask MFL signals from certain type of defects, and therefore considerably reduces the detectability of the defect signals. In this paper, a new de-noising algorithm called wavelet domain adaptive filtering is proposed for removing the SPN contained in the MFL data. The new algorithm results from combining the wavelet transform with the adaptive filtering technique. Results from application of the proposed algorithm to the MFL data from field tests show that the proposed algorithm has good performance and considerably improves the detectability of the defect signals in the MFL data. 展开更多
关键词 pipeline inspection magnetic flux leakage data discrete wavelet transform wavelet domain adaptive filtering seamless pipe noise
下载PDF
Cost-Aware Multi-Domain Virtual Data Center Embedding 被引量:1
9
作者 Xiao Ma Zhongbao Zhang Sen Su 《China Communications》 SCIE CSCD 2018年第12期190-207,共18页
Virtual data center is a new form of cloud computing concept applied to data center. As one of the most important challenges, virtual data center embedding problem has attracted much attention from researchers. In dat... Virtual data center is a new form of cloud computing concept applied to data center. As one of the most important challenges, virtual data center embedding problem has attracted much attention from researchers. In data centers, energy issue is very important for the reality that data center energy consumption has increased by dozens of times in the last decade. In this paper, we are concerned about the cost-aware multi-domain virtual data center embedding problem. In order to solve this problem, this paper first addresses the energy consumption model. The model includes the energy consumption model of the virtual machine node and the virtual switch node, to quantify the energy consumption in the virtual data center embedding process. Based on the energy consumption model above, this paper presents a heuristic algorithm for cost-aware multi-domain virtual data center embedding. The algorithm consists of two steps: inter-domain embedding and intra-domain embedding. Inter-domain virtual data center embedding refers to dividing virtual data center requests into several slices to select the appropriate single data center. Intra-domain virtual data center refers to embedding virtual data center requests in each data center. We first propose an inter-domain virtual data center embedding algorithm based on label propagation to select the appropriate single data center. We then propose a cost-aware virtual data center embedding algorithm to perform the intra-domain data center embedding. Extensive simulation results show that our proposed algorithm in this paper can effectively reduce the energy consumption while ensuring the success ratio of embedding. 展开更多
关键词 virtual data CENTER EMBEDDING MULTI-domain cost-aware LABEL PROPAGATION
下载PDF
A Model-free Approach to Fault Detection of Continuous-time Systems Based on Time Domain Data
10
作者 Steven X. Ding 《International Journal of Automation and computing》 EI 2007年第2期189-194,共6页
In this paper, a model-free approach is presented to design an observer-based fault detection system of linear continuoustime systems based on input and output data in the time domain. The core of the approach is to d... In this paper, a model-free approach is presented to design an observer-based fault detection system of linear continuoustime systems based on input and output data in the time domain. The core of the approach is to directly identify parameters of the observer-based residual generator based on a numerically reliable data equation obtained by filtering and sampling the input and output signals. 展开更多
关键词 Fault detection linear continuous time-invariant systems time domain data subspace methods observer-based residual generator
下载PDF
Quality Assessment of Training Data with Uncertain Labels for Classification of Subjective Domains
11
作者 Ying Dai 《Journal of Computer and Communications》 2017年第7期152-168,共17页
In order to improve the performance of classifiers in subjective domains, this paper defines a metric to measure the quality of the subjectively labelled training data (QoSTD) by means of K-means clustering. Then, the... In order to improve the performance of classifiers in subjective domains, this paper defines a metric to measure the quality of the subjectively labelled training data (QoSTD) by means of K-means clustering. Then, the QoSTD is used as a weight of the predicted class scores to adjust the likelihoods of instances. Moreover, two measurements are defined to assess the performance of the classifiers trained by the subjective labelled data. The binary classifiers of Traditional Chinese Medicine (TCM) Zhengs are trained and retrained by the real-world data set, utilizing the support vector machine (SVM) and the discrimination analysis (DA) models, so as to verify the effectiveness of the proposed method. The experimental results show that the consistency of likelihoods of instances with the corresponding observations is increased notable for the classes, especially in the cases with the relatively low QoSTD training data set. The experimental results also indicate the solution how to eliminate the miss-labelled instances from the training data set to re-train the classifiers in the subjective domains. 展开更多
关键词 Quality Assessment SUBJECTIVE domain Multimodal Sensor data LABEL Noise LIKELIHOOD ADJUSTING TCM ZHENG
下载PDF
Identification of Categorical Registration Data of Domain Names in Data Warehouse Construction Task
12
作者 Rasim Alguliev Rena Gasimova 《Intelligent Control and Automation》 2013年第2期227-234,共8页
This work is dedicated to formation of data warehouse for processing of a large volume of registration data of domain names. Data cleaning is applied in order to increase the effectiveness of decision making support. ... This work is dedicated to formation of data warehouse for processing of a large volume of registration data of domain names. Data cleaning is applied in order to increase the effectiveness of decision making support. Data cleaning is ap- plied in warehouses for detection and deletion of errors, discrepancy in data in order to improve their quality. For this purpose, fuzzy record comparison algorithms are for clearing of registration data of domain names reviewed in this work. Also, identification method of domain names registration data for data warehouse formation is proposed. Deci- sion making algorithms for identification of registration data are implemented in DRRacket and Python. 展开更多
关键词 domain domain NAME System Registrar Registrant Category data data WAREHOUSE data CLEARING Fuzzy Search Algorithms Damerau-Levenstein Distance Decision Tree
下载PDF
Domain-Oriented Data-Driven Data Mining Based on Rough Sets 被引量:1
13
作者 Guoyin Wang 《南昌工程学院学报》 CAS 2006年第2期46-46,共1页
Data mining (also known as Knowledge Discovery in Databases - KDD) is defined as the nontrivial extraction of implicit, previously unknown, and potentially useful information from data. The aims and objectives of data... Data mining (also known as Knowledge Discovery in Databases - KDD) is defined as the nontrivial extraction of implicit, previously unknown, and potentially useful information from data. The aims and objectives of data mining are to discover knowledge of interest to user needs.Data mining is really a useful tool in many domains such as marketing, decision making, etc. However, some basic issues of data mining are ignored. What is data mining? What is the product of a data mining process? What are we doing in a data mining process? Is there any rule we should obey in a data mining process? In order to discover patterns and knowledge really interesting and actionable to the real world Zhang et al proposed a domain-driven human-machine-cooperated data mining process.Zhao and Yao proposed an interactive user-driven classification method using the granule network. In our work, we find that data mining is a kind of knowledge transforming process to transform knowledge from data format into symbol format. Thus, no new knowledge could be generated (born) in a data mining process. In a data mining process, knowledge is just transformed from data format, which is not understandable for human, into symbol format,which is understandable for human and easy to be used.It is similar to the process of translating a book from Chinese into English.In this translating process,the knowledge itself in the book should remain unchanged. What will be changed is the format of the knowledge only. That is, the knowledge in the English book should be kept the same as the knowledge in the Chinese one.Otherwise, there must be some mistakes in the translating proces, that is, we are transforming knowledge from one format into another format while not producing new knowledge in a data mining process. The knowledge is originally stored in data (data is a representation format of knowledge). Unfortunately, we can not read, understand, or use it, since we can not understand data. With this understanding of data mining, we proposed a data-driven knowledge acquisition method based on rough sets. It also improved the performance of classical knowledge acquisition methods. In fact, we also find that the domain-driven data mining and user-driven data mining do not conflict with our data-driven data mining. They could be integrated into domain-oriented data-driven data mining. It is just like the views of data base. Users with different views could look at different partial data of a data base. Thus, users with different tasks or objectives wish, or could discover different knowledge (partial knowledge) from the same data base. However, all these partial knowledge should be originally existed in the data base. So, a domain-oriented data-driven data mining method would help us to extract the knowledge which is really existed in a data base, and really interesting and actionable to the real world. 展开更多
关键词 data mining data-DRIVEN USER-DRIVEN domain-driven KDD Machine Learning Knowledge Acquisition rough sets
下载PDF
Lu-Hf Isotopic Data of the Mbé-Sassa-Mbersi Tonalite (Central Cameroon Domain): Indicator of ca. 1.0 Ga Juvenile Tonian Magmatism in the Region
14
作者 Alliance Nicaise Saha-Fouotsa Rigobert Tchameni +5 位作者 Olivier Vanderhaeghe Armin Zeh Periclex Martial Fosso Tchunte Aurélien Eglinger Emmanuel Negue Nomo Pierre Barbey 《Journal of Geoscience and Environment Protection》 2021年第9期1-19,共19页
The Pan-African/Brasiliano orogenic belts are a part of numerous Neoproterozoic orogenic belts that belong to the long-lived orgenic cycle that distancing phase started at the Tonian around 1.0 Ga. The Tonian magmatis... The Pan-African/Brasiliano orogenic belts are a part of numerous Neoproterozoic orogenic belts that belong to the long-lived orgenic cycle that distancing phase started at the Tonian around 1.0 Ga. The Tonian magmatism fairly documented in the Neoproterozoic belts of Borborema Province (NE-Brazil), seemed so far inexistent in the Central African Orogenic Belt (CAOB) although these two belts <span style="font-family:Verdana;">were</span><span style="font-family:Verdana;"> geologically correlated. Through the Lu-Hf geochronological analysis on zircon of tonalite, the present work, coupled with the previous data, suggests the existence of a Tonian age magmatism in the Central Cameroon Domain of the CAOB although the latter is much reworked. The Nguesseck tonalite outcrops in the northern part of the Mbé</span><span style="font-family:Verdana;">-</span><span style="font-family:Verdana;">Sassa-Mbersi region, in the northern edge of Central Cameroon Domain of the CAOB and in the Tcholliré</span><span style="font-family:Verdana;">-</span><span style="font-family:;" "=""><span style="font-family:Verdana;">Banyo shear zone (TBSZ). The Lu-Hf data obtain on the zircon grains of this tonalite reveal juvenile Hf</span><sub><span style="font-family:Verdana;">TDM</span></sub><span style="font-family:Verdana;"> age of ca. 1.0 Ga. This age, combined to the previous geochemical data suggest</span></span><span style="font-family:Verdana;">s</span><span style="font-family:Verdana;"> that the protholites of this tonalite would have extracted from the source during the distancing phase (rifting and oceanization) of the Pan-african/Brasiliano orogeny at the early Neoproterozoic.</span> 展开更多
关键词 Mbé-Sassa-Mbersi Central Cameroon domain Central African Orogenic Belt Tonian magmatism Lu-Hf Isotopic data
下载PDF
Domain-oriented data-driven data mining:a new understanding for data mining
15
作者 WANG Guo-yin WANG Yan 《重庆邮电大学学报(自然科学版)》 2008年第3期266-271,共6页
Recent advances in computing,communications,digital storage technologies,and high-throughput data-acquisition technologies,make it possible to gather and store incredible volumes of data.It creates unprecedented oppor... Recent advances in computing,communications,digital storage technologies,and high-throughput data-acquisition technologies,make it possible to gather and store incredible volumes of data.It creates unprecedented opportunities for large-scale knowledge discovery from database.Data mining is an emerging area of computational intelligence that offers new theories,techniques,and tools for processing large volumes of data,such as data analysis,decision making,etc.There are many researchers working on designing efficient data mining techniques,methods,and algorithms.Unfortunately,most data mining researchers pay much attention to technique problems for developing data mining models and methods,while little to basic issues of data mining.In this paper,we will propose a new understanding for data mining,that is,domain-oriented data-driven data mining(3DM)model.Some data-driven data mining algorithms developed in our Lab are also presented to show its validity. 展开更多
关键词 粗糙集 或然率 数据处理 计算方法
下载PDF
Building a Productive Domain-Specific Cloud for Big Data Processing and Analytics Service
16
作者 Yuzhong Yan Mahsa Hanifi +1 位作者 Liqi Yi Lei Huang 《Journal of Computer and Communications》 2015年第5期107-117,共11页
Cloud Computing as a disruptive technology, provides a dynamic, elastic and promising computing climate to tackle the challenges of big data processing and analytics. Hadoop and MapReduce are the widely used open sour... Cloud Computing as a disruptive technology, provides a dynamic, elastic and promising computing climate to tackle the challenges of big data processing and analytics. Hadoop and MapReduce are the widely used open source frameworks in Cloud Computing for storing and processing big data in the scalable fashion. Spark is the latest parallel computing engine working together with Hadoop that exceeds MapReduce performance via its in-memory computing and high level programming features. In this paper, we present our design and implementation of a productive, domain-specific big data analytics cloud platform on top of Hadoop and Spark. To increase user’s productivity, we created a variety of data processing templates to simplify the programming efforts. We have conducted experiments for its productivity and performance with a few basic but representative data processing algorithms in the petroleum industry. Geophysicists can use the platform to productively design and implement scalable seismic data processing algorithms without handling the details of data management and the complexity of parallelism. The Cloud platform generates a complete data processing application based on user’s kernel program and simple configurations, allocates resources and executes it in parallel on top of Spark and Hadoop. 展开更多
关键词 BUILDING a Productive domain-Specific CLOUD for BIG data PROCESSING and ANALYTICS SERVICE
下载PDF
Incremental Learning Based on Data Translation and Knowledge Distillation
17
作者 Tan Cheng Jielong Wang 《International Journal of Intelligence Science》 2023年第2期33-47,共15页
Recently, deep convolutional neural networks (DCNNs) have achieved remarkable results in image classification tasks. Despite convolutional networks’ great successes, their training process relies on a large amount of... Recently, deep convolutional neural networks (DCNNs) have achieved remarkable results in image classification tasks. Despite convolutional networks’ great successes, their training process relies on a large amount of data prepared in advance, which is often challenging in real-world applications, such as streaming data and concept drift. For this reason, incremental learning (continual learning) has attracted increasing attention from scholars. However, incremental learning is associated with the challenge of catastrophic forgetting: the performance on previous tasks drastically degrades after learning a new task. In this paper, we propose a new strategy to alleviate catastrophic forgetting when neural networks are trained in continual domains. Specifically, two components are applied: data translation based on transfer learning and knowledge distillation. The former translates a portion of new data to reconstruct the partial data distribution of the old domain. The latter uses an old model as a teacher to guide a new model. The experimental results on three datasets have shown that our work can effectively alleviate catastrophic forgetting by a combination of the two methods aforementioned. 展开更多
关键词 Incremental domain Learning data Translation Knowledge Distillation Cat-astrophic Forgetting
下载PDF
跨域数据管理 被引量:1
18
作者 杜小勇 李彤 +3 位作者 卢卫 范举 张峰 柴云鹏 《计算机科学》 CSCD 北大核心 2024年第1期4-12,共9页
随着数据成为新的生产要素和数字中国顶层战略的推进,跨域数据共享和流通对于实现数据要素价值最大化变得至关重要。国家通过布局全国一体化大数据中心体系、启动“东数西算”工程等一系列举措,为数据要素的跨域应用提供了基础设施。然... 随着数据成为新的生产要素和数字中国顶层战略的推进,跨域数据共享和流通对于实现数据要素价值最大化变得至关重要。国家通过布局全国一体化大数据中心体系、启动“东数西算”工程等一系列举措,为数据要素的跨域应用提供了基础设施。然而,传统的数据管理局限于单一域内,无法满足跨域场景下的数据管理需求。跨域数据管理面临通信层面的跨空间域挑战、数据建模层面的异构模型融合问题,以及数据访问层面的跨信任域挑战。从跨空间域、跨管辖域和跨信任域3个视角出发,探讨了跨域数据管理的内涵、研究挑战及关键技术,并展望了其未来发展趋势。 展开更多
关键词 数据管理 跨空间域 跨管辖域 跨信任域
下载PDF
数据产权法律构造论 被引量:18
19
作者 冯晓青 《政法论丛》 CSSCI 北大核心 2024年第1期120-136,共17页
在数字经济环境中,数据已成为一种新型生产要素,需要更好地挖掘和开发其经济价值。数据不同于传统有体物以及知识产权客体的特征和属性使其在法律制度构建和运行上也具有特殊性。数据产权制度是数据基础制度的核心内容,其应涵盖保护数... 在数字经济环境中,数据已成为一种新型生产要素,需要更好地挖掘和开发其经济价值。数据不同于传统有体物以及知识产权客体的特征和属性使其在法律制度构建和运行上也具有特殊性。数据产权制度是数据基础制度的核心内容,其应涵盖保护数据主体合法权益、协调和平衡数据利益关系、促进数据有序流动和分享、促进数据要素利益公平分配等内容。数据产权的法律构造旨在建立数据资源持有权、数据加工使用权和数据产品经营权的数据动态流转和价值实现机制,最终服务于数字经济发展目标。 展开更多
关键词 数据产权 法律构造 数据资源持有权 数据加工使用权 数据产品经营权 公共领域
下载PDF
基于知识标注平台的水利枢纽工程知识图谱构建及应用
20
作者 张军珲 昝红英 +2 位作者 欧佳乐 阎子悦 张坤丽 《计算机科学》 CSCD 北大核心 2024年第11期255-264,共10页
大量水利异构数据的产生,为领域知识图谱的构建及应用提供了场景,但也导致了水利知识图谱构建过程的差异。针对现有水利知识图谱构建流程复杂的问题,提出了一套有效的基于知识标注平台的水利知识图谱构建流程。以小浪底水利枢纽工程知... 大量水利异构数据的产生,为领域知识图谱的构建及应用提供了场景,但也导致了水利知识图谱构建过程的差异。针对现有水利知识图谱构建流程复杂的问题,提出了一套有效的基于知识标注平台的水利知识图谱构建流程。以小浪底水利枢纽工程知识的智能应用为例,使用该枢纽的工程数据,应用提出的流程在水利领域构建水利枢纽工程知识图谱(Water Conservancy Hub Project Knowledge Graph,WCHP-KG)。首先以小浪底水利枢纽工程为中心,依据行业术语标准和现有词汇表,制定了概念分类和关系描述体系,形成了WCHP-KG的模式层。通过BiLSTM-CRF和序列标注模型,在水利专家的指导下,使用知识标注平台对非结构化文本进行了半自动标注和人工校对,实现了知识融合,进而构建了WCHP-KG的数据层。结果表明WCHP-KG涵盖了43种水利实体以及110种实体关系。经过实践验证,构建的WCHP-KG为小浪底水利枢纽工程的相关应用提供了有力的结构化知识基础,为工程决策和管理提供了可靠的参考依据,进而证明了所提构建流程的有效性。未来将进一步扩展WCHP-KG和完善水利知识图谱的构建流程,以适应更多的应用场景和领域需求。 展开更多
关键词 异构数据 领域知识图谱 知识图谱构建 水利枢纽 知识标注平台
下载PDF
上一页 1 2 64 下一页 到第
使用帮助 返回顶部