Nowadays,cloud computing is used more and more widely,more and more people prefer to using cloud server to store data.So,how to encrypt the data efficiently is an important problem.The search efficiency of existed sea...Nowadays,cloud computing is used more and more widely,more and more people prefer to using cloud server to store data.So,how to encrypt the data efficiently is an important problem.The search efficiency of existed search schemes decreases as the index increases.For solving this problem,we build the two-level index.Simultaneously,for improving the semantic information,the central word expansion is combined.The purpose of privacy-preserving content-aware search by using the two-level index(CKESS)is that the first matching is performed by using the extended central words,then calculate the similarity between the trapdoor and the secondary index,finally return the results in turn.Through experiments and analysis,it is proved that our proposed schemes can resist multiple threat models and the schemes are secure and efficient.展开更多
With the advent of the era of big data,knowledge engineering has received extensive attention.How to extract useful knowledge from massive data is the key to big data analysis.Knowledge graph technology is an importan...With the advent of the era of big data,knowledge engineering has received extensive attention.How to extract useful knowledge from massive data is the key to big data analysis.Knowledge graph technology is an important part of artificial intelligence,which provides a method to extract structured knowledge from massive texts and images,and has broad application prospects.The knowledge base with semantic processing capability and open interconnection ability can be used to generate application value in intelligent information services such as intelligent search,intelligent question answering and personalized recommendation.Although knowledge graph has been applied to various systems,the basic theory and application technology still need further research.On the basis of comprehensively expounding the definition and architecture of knowledge graph,this paper reviews the key technologies of knowledge graph construction,including the research progress of four core technologies such as knowledge extraction technology,knowledge representation technology,knowledge fusion technology and knowledge reasoning technology,as well as some typical applications.Finally,the future development direction and challenges of the knowledge graph are prospected.展开更多
Searchable encryption provides an effective way for data security and privacy in cloud storage.Users can retrieve encrypted data in the cloud under the premise of protecting their own data security and privacy.However...Searchable encryption provides an effective way for data security and privacy in cloud storage.Users can retrieve encrypted data in the cloud under the premise of protecting their own data security and privacy.However,most of the current content-based retrieval schemes do not contain enough semantic information of the article and cannot fully reflect the semantic information of the text.In this paper,we propose two secure and semantic retrieval schemes based on BERT(bidirectional encoder representations from transformers)named SSRB-1,SSRB-2.By training the documents with BERT,the keyword vector is generated to contain more semantic information of the documents,which improves the accuracy of retrieval and makes the retrieval result more consistent with the user’s intention.Finally,through testing on real data sets,it is shown that both of our solutions are feasible and effective.展开更多
Keyword query has attracted much research attention due to its simplicity and wide applications. The inherent ambiguity of keyword query is prone to unsatisfied query results. Moreover some existing techniques on Web ...Keyword query has attracted much research attention due to its simplicity and wide applications. The inherent ambiguity of keyword query is prone to unsatisfied query results. Moreover some existing techniques on Web query, keyword query in relational databases and XML databases cannot be completely applied to keyword query in dataspaces. So we propose KeymanticES, a novel keyword-based semantic entity search mechanism in dataspaces which combines both keyword query and semantic query features. And we focus on query intent disambiguation problem and propose a novel three-step approach to resolve it. Extensive experimental results show the effectiveness and correctness of our proposed approach.展开更多
Current search engines in most geospatial data portals tend to induce users to focus on one single-data characteristic dimension(e.g.popularity and release date).This approach largely fails to take account of users’m...Current search engines in most geospatial data portals tend to induce users to focus on one single-data characteristic dimension(e.g.popularity and release date).This approach largely fails to take account of users’multidimensional preferences for geospatial data,and hence may likely result in a less than optimal user experience in discovering the most applicable dataset.This study reports a machine learning framework to address the ranking challenge,the fundamental obstacle in geospatial data discovery,by(1)identifying a number of ranking features of geospatial data to represent users’multidimensional preferences by considering semantics,user behavior,spatial similarity,and static dataset metadata attributes;(2)applying a machine learning method to automatically learn a ranking function;and(3)proposing a system architecture to combine existing search-oriented open source software,semantic knowledge base,ranking feature extraction,and machine learning algorithm.Results show that the machine learning approach outperforms other methods,in terms of both precision at K and normalized discounted cumulative gain.As an early attempt of utilizing machine learning to improve the search ranking in the geospatial domain,we expect this work to set an example for further research and open the door towards intelligent geospatial data discovery.展开更多
Software intelligent development has become one of the most important research trends in software engineering. In this paper, we put forward two key concepts -- intelligent development environment (IntelliDE) and so...Software intelligent development has become one of the most important research trends in software engineering. In this paper, we put forward two key concepts -- intelligent development environment (IntelliDE) and software knowledge graph -- for the first time. IntelliDE is an ecosystem in which software big data are aggregated, mined and analyzed to provide intelligent assistance in the life cycle of software development. We present its architecture and discuss its key research issues and challenges. Software knowledge graph is a software knowledge representation and management framework, which plays an important role in IntelliDE. We study its concept and introduce some concrete details and examples to show how it could be constructed and leveraged.展开更多
This paper introduces a new concept,distributed geospatial information processing(DGIP),which refers to the process of geospatial information residing on computers geographically dispersed and connected through comput...This paper introduces a new concept,distributed geospatial information processing(DGIP),which refers to the process of geospatial information residing on computers geographically dispersed and connected through computer networks,and the contribution of DGIP to Digital Earth(DE).The DGIP plays a critical role in integrating the widely distributed geospatial resources to support the DE envisioned to utilise a wide variety of information.This paper addresses this role from three different aspects:1)sharing Earth data,information,and services through geospatial interoperability supported by standardisation of contents and interfaces;2)sharing computing and software resources through a GeoCyberinfrastructure supported by DGIP middleware;and 3)sharing knowledge within and across domains through ontology and semantic searches.Observing the long-term process for the research and development of an operational DE,we discuss and expect some practical contributions of the DGIP to the DE.展开更多
Description logics (DLs) are widely employed in recent semantic web application systems. However, classical description logics are limited when dealing with imprecise concepts and roles, thus providing the motivatio...Description logics (DLs) are widely employed in recent semantic web application systems. However, classical description logics are limited when dealing with imprecise concepts and roles, thus providing the motivation for this work. In this paper, we present a type-2 fuzzy attributive concept language with complements (ALC) and provide its knowledge representation and reasoning algorithms. We also propose type-2 fuzzy web ontology language (OWL) to build a fuzzy ontology based on type- 2 fuzzy ALC and analyze the soundness, completeness, and complexity of the reasoning algorithms. Compared to type-1 fuzzy ALC, type-2 fuzzy ALC can describe imprecise knowledge more meticulously by using the membership degree interval. We implement a semantic search engine based on type-2 fuzzy ALC and carry out experiments on real data to test its performance. The results show that the type-2 fuzzy ALC can improve the precision and increase the number of relevant hits for imprecise information searches.展开更多
Peer-to-peer(P2P) networks are widely used due to their good scalability and robustness.This paper considers the characteristics of data sources which use some taxonomy hierarchies to classify and organize their data ...Peer-to-peer(P2P) networks are widely used due to their good scalability and robustness.This paper considers the characteristics of data sources which use some taxonomy hierarchies to classify and organize their data objects,combines P2P techniques,and proposes a P2P based taxonomy data ring model(TDRM).The model makes full use of the semantic information contained in taxonomy hierarchies,places the data objects having similar semantics together,and organizes them into one dimensional ring structure.Super-peers dynamically join the ring according to the requirement.The routing connections among super-peers are created,which are similar to Chord ring,thus a semantics based structured super-peer network is formed.Experiments show that the model has good scalability and search efficiency.展开更多
基金This work is supported by the National Natural Science Foundation of China under grant U1836110,U1836208,U1536206,61602253,61672294by the National Key R&D Program of China under grant 2018YFB1003205+5 种基金by China Postdoctoral Science Foundation(2017M610574)by the Jiangsu Basic Research Programs-Natural Science Foundation under grant numbers BK20181407by the Priority Academic Program Development of Jiangsu Higher Education Institutions(PAPD)fundby the Major Program of the National Social Science Fund of China(17ZDA092)Qing Lan Projectby the Collaborative Innovation Center of Atmospheric Environment and Equipment Technology(CICAEET)fund,China.
文摘Nowadays,cloud computing is used more and more widely,more and more people prefer to using cloud server to store data.So,how to encrypt the data efficiently is an important problem.The search efficiency of existed search schemes decreases as the index increases.For solving this problem,we build the two-level index.Simultaneously,for improving the semantic information,the central word expansion is combined.The purpose of privacy-preserving content-aware search by using the two-level index(CKESS)is that the first matching is performed by using the extended central words,then calculate the similarity between the trapdoor and the secondary index,finally return the results in turn.Through experiments and analysis,it is proved that our proposed schemes can resist multiple threat models and the schemes are secure and efficient.
基金This research work is implemented at the 2011 Collaborative Innovation Center for Development and Utilization of Finance and Economics Big Data Property,Universities of Hunan ProvinceHunan Provincial Key Laboratory of Big Data Science and Technology,Finance and Economics+3 种基金Key Laboratory of Information Technology and Security,Hunan Provincial Higher Education.This research is funded by the Open Foundation for the University Innovation Platform in the Hunan Province,grant number 18K103Open project,Grant Numbers 20181901CRP03,20181901CRP04,20181901CRP05Hunan Provincial Education Science 13th Five-Year Plan(Grant No.XJK016BXX001)Social Science Foundation of Hunan Province(Grant No.17YBA049).
文摘With the advent of the era of big data,knowledge engineering has received extensive attention.How to extract useful knowledge from massive data is the key to big data analysis.Knowledge graph technology is an important part of artificial intelligence,which provides a method to extract structured knowledge from massive texts and images,and has broad application prospects.The knowledge base with semantic processing capability and open interconnection ability can be used to generate application value in intelligent information services such as intelligent search,intelligent question answering and personalized recommendation.Although knowledge graph has been applied to various systems,the basic theory and application technology still need further research.On the basis of comprehensively expounding the definition and architecture of knowledge graph,this paper reviews the key technologies of knowledge graph construction,including the research progress of four core technologies such as knowledge extraction technology,knowledge representation technology,knowledge fusion technology and knowledge reasoning technology,as well as some typical applications.Finally,the future development direction and challenges of the knowledge graph are prospected.
基金This work was supported by the National Natural Science Foundation of China(Grant Nos.U1836110 and U1836208)by the Jiangsu Basic Research Programs-Natural Science Foundation under grant numbers BK20200039.
文摘Searchable encryption provides an effective way for data security and privacy in cloud storage.Users can retrieve encrypted data in the cloud under the premise of protecting their own data security and privacy.However,most of the current content-based retrieval schemes do not contain enough semantic information of the article and cannot fully reflect the semantic information of the text.In this paper,we propose two secure and semantic retrieval schemes based on BERT(bidirectional encoder representations from transformers)named SSRB-1,SSRB-2.By training the documents with BERT,the keyword vector is generated to contain more semantic information of the documents,which improves the accuracy of retrieval and makes the retrieval result more consistent with the user’s intention.Finally,through testing on real data sets,it is shown that both of our solutions are feasible and effective.
基金supported by the National Basic Research 973 Program of China under Grant No. 2012CB316201the National Natural Science Foundation of China under Grant Nos. 60973021, 61033007, 61003060the Fundamental Research Funds for the Central Universities of China under Grant No. N100704001
文摘Keyword query has attracted much research attention due to its simplicity and wide applications. The inherent ambiguity of keyword query is prone to unsatisfied query results. Moreover some existing techniques on Web query, keyword query in relational databases and XML databases cannot be completely applied to keyword query in dataspaces. So we propose KeymanticES, a novel keyword-based semantic entity search mechanism in dataspaces which combines both keyword query and semantic query features. And we focus on query intent disambiguation problem and propose a novel three-step approach to resolve it. Extensive experimental results show the effectiveness and correctness of our proposed approach.
基金NSF I/UCRC:[Grant Number IIP-1338925]NSF EarthCube:[Grant Number ICER-1540998]NASA AIST Program:[Grant Number NNX15AM85G].
文摘Current search engines in most geospatial data portals tend to induce users to focus on one single-data characteristic dimension(e.g.popularity and release date).This approach largely fails to take account of users’multidimensional preferences for geospatial data,and hence may likely result in a less than optimal user experience in discovering the most applicable dataset.This study reports a machine learning framework to address the ranking challenge,the fundamental obstacle in geospatial data discovery,by(1)identifying a number of ranking features of geospatial data to represent users’multidimensional preferences by considering semantics,user behavior,spatial similarity,and static dataset metadata attributes;(2)applying a machine learning method to automatically learn a ranking function;and(3)proposing a system architecture to combine existing search-oriented open source software,semantic knowledge base,ranking feature extraction,and machine learning algorithm.Results show that the machine learning approach outperforms other methods,in terms of both precision at K and normalized discounted cumulative gain.As an early attempt of utilizing machine learning to improve the search ranking in the geospatial domain,we expect this work to set an example for further research and open the door towards intelligent geospatial data discovery.
文摘Software intelligent development has become one of the most important research trends in software engineering. In this paper, we put forward two key concepts -- intelligent development environment (IntelliDE) and software knowledge graph -- for the first time. IntelliDE is an ecosystem in which software big data are aggregated, mined and analyzed to provide intelligent assistance in the life cycle of software development. We present its architecture and discuss its key research issues and challenges. Software knowledge graph is a software knowledge representation and management framework, which plays an important role in IntelliDE. We study its concept and introduce some concrete details and examples to show how it could be constructed and leveraged.
基金supported by a Chinese 973 project(2006CB701306)a NASA Geosciences Interoperability project(NNX07AD99G),and FGDC 2005 CAP award(05HQAG0115).
文摘This paper introduces a new concept,distributed geospatial information processing(DGIP),which refers to the process of geospatial information residing on computers geographically dispersed and connected through computer networks,and the contribution of DGIP to Digital Earth(DE).The DGIP plays a critical role in integrating the widely distributed geospatial resources to support the DE envisioned to utilise a wide variety of information.This paper addresses this role from three different aspects:1)sharing Earth data,information,and services through geospatial interoperability supported by standardisation of contents and interfaces;2)sharing computing and software resources through a GeoCyberinfrastructure supported by DGIP middleware;and 3)sharing knowledge within and across domains through ontology and semantic searches.Observing the long-term process for the research and development of an operational DE,we discuss and expect some practical contributions of the DGIP to the DE.
基金Acknowledgements This work was supported by the National Natural Science Foundation of China (Grant Nos. 60873225, 60873083, and 70771043), the National High Technology Research and Development Program of China (2007AA01Z403), the Natural Science Foundation of Hubei Province (2009CDB298), the Natural Science Foundation of Hubei Province for Distinguished Young Scholars (2008CDB351), the Wuhan Youth Science and Technology Chenguang Program (200950431171), the Open Foundation of State Key Laboratory of Software Engineering (SKLSE20080718), the Innovation Fund of Huazhong University of Science and Technology (2010MS068, Q2009021).
文摘Description logics (DLs) are widely employed in recent semantic web application systems. However, classical description logics are limited when dealing with imprecise concepts and roles, thus providing the motivation for this work. In this paper, we present a type-2 fuzzy attributive concept language with complements (ALC) and provide its knowledge representation and reasoning algorithms. We also propose type-2 fuzzy web ontology language (OWL) to build a fuzzy ontology based on type- 2 fuzzy ALC and analyze the soundness, completeness, and complexity of the reasoning algorithms. Compared to type-1 fuzzy ALC, type-2 fuzzy ALC can describe imprecise knowledge more meticulously by using the membership degree interval. We implement a semantic search engine based on type-2 fuzzy ALC and carry out experiments on real data to test its performance. The results show that the type-2 fuzzy ALC can improve the precision and increase the number of relevant hits for imprecise information searches.
基金the National Natural Science Foundation of China(Nos.61073063 and 61100028)the Fundamental Research Funds for the Central Universities(No. N090404012)the Public Science and Technology Research Funds Projects of Ocean(No.201105033)
文摘Peer-to-peer(P2P) networks are widely used due to their good scalability and robustness.This paper considers the characteristics of data sources which use some taxonomy hierarchies to classify and organize their data objects,combines P2P techniques,and proposes a P2P based taxonomy data ring model(TDRM).The model makes full use of the semantic information contained in taxonomy hierarchies,places the data objects having similar semantics together,and organizes them into one dimensional ring structure.Super-peers dynamically join the ring according to the requirement.The routing connections among super-peers are created,which are similar to Chord ring,thus a semantics based structured super-peer network is formed.Experiments show that the model has good scalability and search efficiency.