期刊文献+
共找到4篇文章
< 1 >
每页显示 20 50 100
Novel Representations of Word Embedding Based on the Zolu Function
1
作者 Jihua Lu Youcheng Zhang 《Journal of Beijing Institute of Technology》 EI CAS 2020年第4期526-530,共5页
Two learning models,Zolu-continuous bags of words(ZL-CBOW)and Zolu-skip-grams(ZL-SG),based on the Zolu function are proposed.The slope of Relu in word2vec has been changed by the Zolu function.The proposed models can ... Two learning models,Zolu-continuous bags of words(ZL-CBOW)and Zolu-skip-grams(ZL-SG),based on the Zolu function are proposed.The slope of Relu in word2vec has been changed by the Zolu function.The proposed models can process extremely large data sets as well as word2vec without increasing the complexity.Also,the models outperform several word embedding methods both in word similarity and syntactic accuracy.The method of ZL-CBOW outperforms CBOW in accuracy by 8.43%on the training set of capital-world,and by 1.24%on the training set of plural-verbs.Moreover,experimental simulations on word similarity and syntactic accuracy show that ZL-CBOW and ZL-SG are superior to LL-CBOW and LL-SG,respectively. 展开更多
关键词 Zolu function word embedding continuous bags of words word similarity accuracy
下载PDF
基于连续词包模型的一种改进的文本主题聚类算法
2
作者 秦泽浩 《电脑知识与技术》 2018年第6Z期226-228,共3页
本文针对知乎网上问答文章的特点和信息处理方式,分析了使用连续词包模型对这种文本进行主题聚类的一般方式和步骤。包括文本预处理、文本处理的模型选择和聚类分析算法的设计。在本文预处理阶段,讨论了对于中文的分词和去噪等;在文本... 本文针对知乎网上问答文章的特点和信息处理方式,分析了使用连续词包模型对这种文本进行主题聚类的一般方式和步骤。包括文本预处理、文本处理的模型选择和聚类分析算法的设计。在本文预处理阶段,讨论了对于中文的分词和去噪等;在文本处理的模型选择阶段,本文着重讨论了N-gram语言模型;在文本聚类阶段,分析并描述了一种文本聚类算法。通过上述讨论分析确定了本文最终应用的方法。 展开更多
关键词 连续词包(continuous Bag of Words) 文本主题聚类算法 改进K-MEANS
下载PDF
Enhancing Arabic Cyberbullying Detection with End-to-End Transformer Model
3
作者 Mohamed A.Mahdi Suliman Mohamed Fati +2 位作者 Mohamed A.G.Hazber Shahanawaj Ahamad Sawsan A.Saad 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第11期1651-1671,共21页
Cyberbullying,a critical concern for digital safety,necessitates effective linguistic analysis tools that can navigate the complexities of language use in online spaces.To tackle this challenge,our study introduces a ... Cyberbullying,a critical concern for digital safety,necessitates effective linguistic analysis tools that can navigate the complexities of language use in online spaces.To tackle this challenge,our study introduces a new approach employing Bidirectional Encoder Representations from the Transformers(BERT)base model(cased),originally pretrained in English.This model is uniquely adapted to recognize the intricate nuances of Arabic online communication,a key aspect often overlooked in conventional cyberbullying detection methods.Our model is an end-to-end solution that has been fine-tuned on a diverse dataset of Arabic social media(SM)tweets showing a notable increase in detection accuracy and sensitivity compared to existing methods.Experimental results on a diverse Arabic dataset collected from the‘X platform’demonstrate a notable increase in detection accuracy and sensitivity compared to existing methods.E-BERT shows a substantial improvement in performance,evidenced by an accuracy of 98.45%,precision of 99.17%,recall of 99.10%,and an F1 score of 99.14%.The proposed E-BERT not only addresses a critical gap in cyberbullying detection in Arabic online forums but also sets a precedent for applying cross-lingual pretrained models in regional language applications,offering a scalable and effective framework for enhancing online safety across Arabic-speaking communities. 展开更多
关键词 Cyberbullying offensive detection Bidirectional Encoder Representations from the Transformers(BERT) continuous bag of words Social Media natural language processing
下载PDF
Improved Dota2 Lineup Recommendation Model Based on a Bidirectional LSTM 被引量:7
4
作者 Lei Zhang Chenbo Xu +3 位作者 Yihua Gao Yi Han Xiaojiang Du Zhihong Tian 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2020年第6期712-720,共9页
In recent years,e-sports has rapidly developed,and the industry has produced large amounts of data with specifications,and these data are easily to be obtained.Due to the above characteristics,data mining and deep lea... In recent years,e-sports has rapidly developed,and the industry has produced large amounts of data with specifications,and these data are easily to be obtained.Due to the above characteristics,data mining and deep learning methods can be used to guide players and develop appropriate strategies to win games.As one of the world’s most famous e-sports events,Dota2 has a large audience base and a good game system.A victory in a game is often associated with a hero’s match,and players are often unable to pick the best lineup to compete.To solve this problem,in this paper,we present an improved bidirectional Long Short-Term Memory(LSTM)neural network model for Dota2 lineup recommendations.The model uses the Continuous Bag Of Words(CBOW)model in the Word2 vec model to generate hero vectors.The CBOW model can predict the context of a word in a sentence.Accordingly,a word is transformed into a hero,a sentence into a lineup,and a word vector into a hero vector,the model applied in this article recommends the last hero according to the first four heroes selected first,thereby solving a series of recommendation problems. 展开更多
关键词 Word2vec mutiplayer online battle arena games continuous Bag Of Words(CBOW)model Long Short-Term Memory(LSTM)
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部