Funding: Supported by the National Basic Research 973 Program of China under Grant No. 2012CB316201, the National Natural Science Foundation of China under Grant Nos. 60973021, 61033007, and 61003060, and the Fundamental Research Funds for the Central Universities of China under Grant No. N100704001.
Abstract: Keyword query has attracted much research attention because of its simplicity and wide applicability. However, the inherent ambiguity of keyword queries often leads to unsatisfactory results. Moreover, existing techniques for Web queries and for keyword queries over relational and XML databases cannot be fully applied to keyword queries in dataspaces. We therefore propose KeymanticES, a novel keyword-based semantic entity search mechanism for dataspaces that combines features of keyword query and semantic query. We focus on the query intent disambiguation problem and propose a novel three-step approach to resolve it. Extensive experimental results show the effectiveness and correctness of the proposed approach.
Funding: This work was partially supported by the National Natural Science Foundation of China (Grant Nos. 60833003 and 61070051) and the National Basic Research Program of China (Grant No. 2010CB731402).
Abstract: This paper introduces the design of the infrastructure for Chinese Web (CWI), a prototype system aimed at forum data analysis. CWI takes a best-effort approach: 1) it tries its best to extract or annotate semantics over the web data; 2) it provides flexible schemes for users to transform the web data into eXtensible Markup Language (XML) forms with richer semantic annotations that are friendlier to further analytical tasks; 3) a distributed graph repository, called DISGR, is used as the backend for managing the web data. The paper introduces the design issues, reports on the progress of the implementation, and discusses the research issues under study.