3 articles found
TWE‐WSD: An effective topical word embedding based word sense disambiguation (cited by: 1)
1
Authors: Lianyin Jia, Jilin Tang, Mengjuan Li, Jinguo You, Jiaman Ding, Yinong Chen. CAAI Transactions on Intelligence Technology (EI), 2021, Issue 1, pp. 72-79 (8 pages)
Word embedding has been widely used in word sense disambiguation (WSD) and many other tasks in recent years because it represents the semantics of words well. However, existing word embedding methods mostly represent each word as a single vector, without considering the homonymy and polysemy of the word; thus, their performance is limited. To address this problem, an effective topical word embedding (TWE)‐based WSD method, named TWE‐WSD, is proposed, which integrates Latent Dirichlet Allocation (LDA) and word embedding. Instead of generating a single word vector (WV) for each word, TWE‐WSD generates a topical WV for each word under each topic. Effective integration strategies are designed to obtain high-quality contextual vectors. Extensive experiments on the SemEval‐2013 and SemEval‐2015 English all‐words tasks showed that TWE‐WSD outperforms other state‐of‐the‐art WSD methods, especially on nouns.
Keywords: embedding; word; WSD
GPU Based Hash Segmentation Index for Fast T-overlap Query
2
Authors: Lianyin Jia, Yongbin Zhang, Mengjuan Li, Jiaman Ding, Jinguo You. 《国际计算机前沿大会会议论文集》, 2017, Issue 1, pp. 10-12 (3 pages)
T-overlap query is the basis of set similarity query and has been applied in many important fields. Most existing approaches employ a pruning-and-verification framework and thus suffer from low efficiency. Modern GPUs offer much higher parallelism and memory bandwidth than CPUs and can be used to accelerate T-overlap queries. In this paper, we use hash segmentation to divide inverted lists into segments, then design an efficient GPU inverted index, called GHSII, based on hash segmentation. Based on GHSII, a new segmentation-parallel T-overlap algorithm, GSPS, is proposed. GSPS scans the lists one segment at a time and uses shared memory to reduce the number of accesses to device memory. Furthermore, an optimized algorithm called GSPS-TLLO, which uses a heuristic query order, is proposed to solve the problem of load imbalance. Experiments are carried out on two real datasets, and the results show that GSPS-TLLO outperforms the state-of-the-art GPU parallel T-overlap algorithms.
Keywords: T-overlap query; hash segmentation; segment at a time; shared memory; GHSII; GSPS-TLLO
ETI: an efficient index for set similarity queries (cited by: 2)
3
Authors: Lianyin JIA, Jianqing XI, Mengjuan LI, Yong LIU, Decheng MIAO. Frontiers of Computer Science (SCIE, EI, CSCD), 2012, Issue 6, pp. 700-712 (13 pages)
Set queries are an important topic and have attracted a lot of attention. Earlier research mainly concentrated on set containment queries. In this paper we focus on the T-Overlap query, which is the foundation of the set similarity query. To address this issue, unlike traditional algorithms that are based on an inverted index, we design a new paradigm based on the prefix tree (trie), called the expanded trie index (ETI), which expands the trie node structure by adding some new properties. Based on ETI, we convert the T-Overlap problem to finding query nodes whose query depth equals T, and propose a new algorithm called T-Similarity to solve T-Overlap efficiently. Then we carry out a three-step framework to extend T-Overlap to other similarity predicates. Extensive experiments are carried out to compare T-Similarity with other inverted-index-based algorithms in terms of query cardinality, overlap threshold, dataset size, number of distinct elements, and so on. Results show that T-Similarity outperforms the state-of-the-art algorithms in many aspects.
Keywords: expanded trie index (ETI); set similarity query; T-Overlap; T-Similarity algorithm; T-SimilarityExact algorithm
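For readers unfamiliar with the T-Overlap query mentioned in the two abstracts above, the following is a minimal sketch of the problem itself: given a query set and a threshold T, return every stored set that shares at least T elements with the query. It uses a plain inverted index with per-set counting, the kind of traditional baseline the ETI abstract contrasts itself with. It is not the ETI, T-Similarity, GHSII, or GSPS method, whose details are not given in this listing. The function names `build_inverted_index` and `t_overlap` and the sample data are assumptions made for illustration.

```python
from collections import defaultdict


def build_inverted_index(sets):
    """Map each element to the ids of the sets that contain it."""
    index = defaultdict(list)
    for set_id, elements in enumerate(sets):
        for element in set(elements):
            index[element].append(set_id)
    return index


def t_overlap(index, query, t):
    """Return ids of sets sharing at least t elements with query.

    Assumes t >= 1: sets with no common element never appear in any
    inverted list, so they are never counted.
    """
    counts = defaultdict(int)
    for element in set(query):
        # Each inverted list contributes one hit per set containing the element.
        for set_id in index.get(element, ()):
            counts[set_id] += 1
    return sorted(set_id for set_id, hits in counts.items() if hits >= t)


# Hypothetical sample data, not taken from either paper.
sets = [{"a", "b", "c"}, {"b", "c", "d"}, {"d", "e"}]
index = build_inverted_index(sets)
print(t_overlap(index, {"b", "c", "d"}, 2))
```

The cost of this baseline grows with the total length of the inverted lists touched by the query, which is the scanning work that index designs such as those described above aim to reduce.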