期刊文献+
共找到9篇文章
< 1 >
每页显示 20 50 100
An Efficient Long Short-Term Memory Model for Digital Cross-Language Summarization
1
作者 Y.C.A.Padmanabha Reddy Shyam Sunder Reddy Kasireddy +2 位作者 Nageswara Rao Sirisala Ramu Kuchipudi Purnachand Kollapudi 《Computers, Materials & Continua》 SCIE EI 2023年第3期6389-6409,共21页
The rise of social networking enables the development of multilingual Internet-accessible digital documents in several languages.The digital document needs to be evaluated physically through the Cross-Language Text Su... The rise of social networking enables the development of multilingual Internet-accessible digital documents in several languages.The digital document needs to be evaluated physically through the Cross-Language Text Summarization(CLTS)involved in the disparate and generation of the source documents.Cross-language document processing is involved in the generation of documents from disparate language sources toward targeted documents.The digital documents need to be processed with the contextual semantic data with the decoding scheme.This paper presented a multilingual crosslanguage processing of the documents with the abstractive and summarising of the documents.The proposed model is represented as the Hidden Markov Model LSTM Reinforcement Learning(HMMlstmRL).First,the developed model uses the Hidden Markov model for the computation of keywords in the cross-language words for the clustering.In the second stage,bi-directional long-short-term memory networks are used for key word extraction in the cross-language process.Finally,the proposed HMMlstmRL uses the voting concept in reinforcement learning for the identification and extraction of the keywords.The performance of the proposed HMMlstmRL is 2%better than that of the conventional bi-direction LSTM model. 展开更多
关键词 Text summarization reinforcement learning hidden markov model cross-language MULTILINGUAL
下载PDF
Cross-Language Transfer Learning-based Lhasa-Tibetan Speech Recognition
2
作者 Zhijie Wang Yue Zhao +3 位作者 Licheng Wu Xiaojun Bi Zhuoma Dawa Qiang Ji 《Computers, Materials & Continua》 SCIE EI 2022年第10期629-639,共11页
As one of Chinese minority languages,Tibetan speech recognition technology was not researched upon as extensively as Chinese and English were until recently.This,along with the relatively small Tibetan corpus,has resu... As one of Chinese minority languages,Tibetan speech recognition technology was not researched upon as extensively as Chinese and English were until recently.This,along with the relatively small Tibetan corpus,has resulted in an unsatisfying performance of Tibetan speech recognition based on an end-to-end model.This paper aims to achieve an accurate Tibetan speech recognition using a small amount of Tibetan training data.We demonstrate effective methods of Tibetan end-to-end speech recognition via cross-language transfer learning from three aspects:modeling unit selection,transfer learning method,and source language selection.Experimental results show that the Chinese-Tibetan multi-language learning method using multilanguage character set as the modeling unit yields the best performance on Tibetan Character Error Rate(CER)at 27.3%,which is reduced by 26.1%compared to the language-specific model.And our method also achieves the 2.2%higher accuracy using less amount of data compared with the method using Tibetan multi-dialect transfer learning under the same model structure and data set. 展开更多
关键词 cross-language transfer learning low-resource language modeling unit Tibetan speech recognition
下载PDF
Query Expansion Using Wikipedia and a Concept Base in Cross-language Information Retrieval
3
作者 Pham Huy Anh Yukawa Takashi 《Computer Technology and Application》 2013年第10期522-531,共10页
The present paper describes the use of online free language resources for translating and expanding queries in CLIR (cross-language information retrieval). In a previous study, we proposed method queries that were t... The present paper describes the use of online free language resources for translating and expanding queries in CLIR (cross-language information retrieval). In a previous study, we proposed method queries that were translated by two machine translation systems on the Language Gridem. The queries were then expanded using an online dictionary to translate compound words or word phrases. A concept base was used to compare back translation words with the original query in order to delete mistranslated words. In order to evaluate the proposed method, we constructed a CLIR system and used the science documents of the NTCIR1 dataset. The proposed method achieved high precision. However~ proper nouns (names of people and places) appear infrequently in science documents. In information retrieval, proper nouns present unique problems. Since proper nouns are usually unknown words, they are difficult to find in monolingual dictionaries, not to mention bilingual dictionaries. Furthermore, the initial query of the user is not always the best description of the desired information. In order to solve this problem, and to create a better query representation, query expansion is often proposed as a solution. Wikipedia was used to translate compound words or word phrases. It was also used to expand queries together with a concept base. The NTCIRI and NTCIR 6 datasets were used to evaluate the proposed method. In the proposed method, the CLIR system was implemented with a high rate of precision. The proposed syst had a higher ranking than the NTCIRI and NTCIR6 participation systems. 展开更多
关键词 cross-language inlbrmation retrieval CLIR language resources concept base language grid Wikipedia.
下载PDF
Research on Cross-Language Retrieval Using Bilingual Word Vectors in Different Languages
4
作者 Yulong Li Dong Zhou 《国际计算机前沿大会会议论文集》 2019年第1期462-465,共4页
Bilingual word vectors have been exploited a lot in cross-language information retrieval research. However, most of the research is currently focused on similar language pairs. There are very few studies exploring the... Bilingual word vectors have been exploited a lot in cross-language information retrieval research. However, most of the research is currently focused on similar language pairs. There are very few studies exploring the impact of using bilingual word vectors for cross-language information retrieval in long-distance language pairs. In this paper, it systematically analyzes the retrieval performance of various European languages (English, German, Italian, French, Finnish, Dutch) as well as Asian languages (Chinese, Japanese) in the adhoc task of CLEF 2002–2003 campaign. Genetic proximity was used to visually represent the relationships between languages and compare their crosslingual retrieval performance in various settings. The results show that the differences in language vocabulary would dramatically affect the retrieval performance. At the same time, the term by term translation retrieval method performs slightly better than the simple vector addition retrieval methods. It proves that the translation-based retrieval model can still maintain its advantage under the new semantic scheme. 展开更多
关键词 cross-language information retrieval BILINGUAL word EMBEDDING Genetic PROXIMITY Language PAIRS
下载PDF
HCRVD: A Vulnerability Detection System Based on CST-PDG Hierarchical Code Representation Learning
5
作者 Zhihui Song Jinchen Xu +1 位作者 Kewei Li Zheng Shan 《Computers, Materials & Continua》 SCIE EI 2024年第6期4573-4601,共29页
Prior studies have demonstrated that deep learning-based approaches can enhance the performance of source code vulnerability detection by training neural networks to learn vulnerability patterns in code representation... Prior studies have demonstrated that deep learning-based approaches can enhance the performance of source code vulnerability detection by training neural networks to learn vulnerability patterns in code representations.However,due to limitations in code representation and neural network design,the validity and practicality of the model still need to be improved.Additionally,due to differences in programming languages,most methods lack cross-language detection generality.To address these issues,in this paper,we analyze the shortcomings of previous code representations and neural networks.We propose a novel hierarchical code representation that combines Concrete Syntax Trees(CST)with Program Dependence Graphs(PDG).Furthermore,we introduce a Tree-Graph-Gated-Attention(TGGA)network based on gated recurrent units and attention mechanisms to build a Hierarchical Code Representation learning-based Vulnerability Detection(HCRVD)system.This system enables cross-language vulnerability detection at the function-level.The experiments show that HCRVD surpasses many competitors in vulnerability detection capabilities.It benefits from the hierarchical code representation learning method,and outperforms baseline in cross-language vulnerability detection by 9.772%and 11.819%in the C/C++and Java datasets,respectively.Moreover,HCRVD has certain ability to detect vulnerabilities in unknown programming languages and is useful in real open-source projects.HCRVD shows good validity,generality and practicality. 展开更多
关键词 Vulnerability detection deep learning CST-PDG code representation tree-graph-gated-attention network cross-language
下载PDF
Enhancing Cross-Lingual Image Description: A Multimodal Approach for Semantic Relevance and Stylistic Alignment
6
作者 Emran Al-Buraihy Dan Wang 《Computers, Materials & Continua》 SCIE EI 2024年第6期3913-3938,共26页
Cross-lingual image description,the task of generating image captions in a target language from images and descriptions in a source language,is addressed in this study through a novel approach that combines neural net... Cross-lingual image description,the task of generating image captions in a target language from images and descriptions in a source language,is addressed in this study through a novel approach that combines neural network models and semantic matching techniques.Experiments conducted on the Flickr8k and AraImg2k benchmark datasets,featuring images and descriptions in English and Arabic,showcase remarkable performance improvements over state-of-the-art methods.Our model,equipped with the Image&Cross-Language Semantic Matching module and the Target Language Domain Evaluation module,significantly enhances the semantic relevance of generated image descriptions.For English-to-Arabic and Arabic-to-English cross-language image descriptions,our approach achieves a CIDEr score for English and Arabic of 87.9%and 81.7%,respectively,emphasizing the substantial contributions of our methodology.Comparative analyses with previous works further affirm the superior performance of our approach,and visual results underscore that our model generates image captions that are both semantically accurate and stylistically consistent with the target language.In summary,this study advances the field of cross-lingual image description,offering an effective solution for generating image captions across languages,with the potential to impact multilingual communication and accessibility.Future research directions include expanding to more languages and incorporating diverse visual and textual data sources. 展开更多
关键词 cross-language image description multimodal deep learning semantic matching reward mechanisms
下载PDF
A Brief Analysis of Syntactical Features of English Virology Texts
7
作者 ZHANG Zhengqing JIA Xiaoqing 《Psychology Research》 2022年第10期858-861,共4页
Ever since late 2019,the COVID pandemic has given the world a great deal of pain and financial loss.Virologists around the world are working hard to eradicate it.Vaccines and treatment methods have been found,which ca... Ever since late 2019,the COVID pandemic has given the world a great deal of pain and financial loss.Virologists around the world are working hard to eradicate it.Vaccines and treatment methods have been found,which cannot be accomplished without the joint efforts of the world virologist community.Naturally,facilitating global communication would help advance the research.This paper analyzes syntactical features of English virology texts and finds that:In these texts,verbs and postpositive attributes are used frequently,complicated logic needs careful analysis,and personification is often used.Having some knowledge of these sentence features may contribute to better communication in the virology community. 展开更多
关键词 syntactical features VIROLOGY cross-language communication
下载PDF
A Brief Analysis of Lexical Features of English Virology Texts
8
作者 ZHANG Zhengqing JIA Xiaoqing 《Psychology Research》 2022年第8期656-659,共4页
COVID has already been taken as a global pandemic,the culprit of which is the virus called SARS-CoV-2.Virologists around the world have been working hard at trying to find remedies and protection methods for the world... COVID has already been taken as a global pandemic,the culprit of which is the virus called SARS-CoV-2.Virologists around the world have been working hard at trying to find remedies and protection methods for the world.This paper is a brief analysis of lexical features of English virology texts,which will show that technical words,semi-technical words,and acronyms are the three prominent lexical features of English virology tests.It may be helpful in guiding cross-language communication in the virologists’community,which the authors hope would facilitate global research on COVID. 展开更多
关键词 lexical features VIROLOGY cross-language communication
下载PDF
Generating a multilingual taxonomy based on multilingual terminology clustering
9
作者 Chengzhi ZHANG 《Chinese Journal of Library and Information Science》 2011年第2期27-40,共14页
Taxonomy denotes the hierarchical structure of a knowledge organization system. It has important applications in knowledge navigation, semantic annotation and semantic search. It is a useful instrument to study the mu... Taxonomy denotes the hierarchical structure of a knowledge organization system. It has important applications in knowledge navigation, semantic annotation and semantic search. It is a useful instrument to study the multilingual taxonomy generated automatically under the dynamic information environment in which massive amounts of information are processed and found. Multilingual taxonomy is the core component of the multilingual thesaurus or ontology. This paper presents two methods of bilingual generated taxonomy: Cross-language terminology clustering and mixed-language based terminology clustering. According to our experimental results of terminology clustering related to four specific subject domains, we found that if the parallel corpus is used to cluster multilingual terminologies, the method of using mixed-language based terminology clustering outperforms that of using the cross-language terminology clustering. 展开更多
关键词 Multilingual taxonomy multilingual terminology clustering cross-language terminology clustering parallel corpus mixed language
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部