期刊文献+
共找到7篇文章
< 1 >
每页显示 20 50 100
利用单语数据改进神经机器翻译压缩模型的翻译质量 被引量:9
1
作者 李响 刘洋 +1 位作者 陈伟 刘群 《中文信息学报》 CSCD 北大核心 2019年第7期46-55,共10页
该文提出利用一个大型且精度高的神经机器翻译模型(教师模型)从单语数据中提取隐性双语知识,从而改进小型且精度低的神经机器翻译模型(学生模型)的翻译质量。该文首先提出了'伪双语数据'的教学方法,利用教师模型翻译单语数据获... 该文提出利用一个大型且精度高的神经机器翻译模型(教师模型)从单语数据中提取隐性双语知识,从而改进小型且精度低的神经机器翻译模型(学生模型)的翻译质量。该文首先提出了'伪双语数据'的教学方法,利用教师模型翻译单语数据获得的合成双语数据改进学生模型,然后提出了'负对数似然—知识蒸馏联合优化'教学方法,除了利用合成双语数据,还利用教师模型获得的目标语言词语概率分布作为知识,从而在知识蒸馏框架下提高学生模型的翻译质量。实验证明,在中英和德英翻译任务上,使用该方法训练的学生模型不仅在领域内测试集上显著超过了基线学生模型,而且在领域外测试集上的泛化性能也得到了提高。 展开更多
关键词 神经机器翻译 知识蒸馏 单语数据
下载PDF
神经机器翻译中英语单词及其大小写联合预测模型 被引量:11
2
作者 张楠 李响 +1 位作者 靳晓宁 陈伟 《中文信息学报》 CSCD 北大核心 2019年第3期52-58,共7页
英文中单词有大小写之分,如果使用不规范,会降低语句的可读性,甚至造成语义上的根本变化。当前的机器翻译处理流程一般先翻译生成小写的英文译文,再采用独立的大小写恢复工具进行还原,这种方式步骤繁琐且没有考虑上下文信息。另一种方... 英文中单词有大小写之分,如果使用不规范,会降低语句的可读性,甚至造成语义上的根本变化。当前的机器翻译处理流程一般先翻译生成小写的英文译文,再采用独立的大小写恢复工具进行还原,这种方式步骤繁琐且没有考虑上下文信息。另一种方式是抽取包含大小写的词表,但这种方式扩大了词表,增加了模型参数。该文提出了一种在神经机器翻译训练中联合预测英语单词及其大小写属性的方法,在同一个解码器输出层分别预测单词及其大小写属性,预测大小写时充分考虑源端语料和目标端语料上下文信息。该方法不仅减小了词表的大小和模型参数,译文的质量也得到提升。在WMT 2017汉英新闻翻译任务测试集上,相比基线方法,该方法在大小写敏感和大小写不敏感两个评价指标上分别提高0.97BLEU和1.01BLEU,改善了神经机器翻译模型的性能。 展开更多
关键词 机器翻译 大小写恢复 联合预测
下载PDF
开放Web API时代正在逐步走近
3
作者 崔江涛 《程序员》 2008年第4期44-44,8,共1页
API的思想并不是一种新的创举或者一个新的概念,实际在应用程序早有这种概念,并且获得很好的应用。现在有人开创性的把它作为网络应用的一种尝试并获得了成功,抽象地看它的特性里有分享与开放的精神。事实总是胜于雄辩,让我们看看... API的思想并不是一种新的创举或者一个新的概念,实际在应用程序早有这种概念,并且获得很好的应用。现在有人开创性的把它作为网络应用的一种尝试并获得了成功,抽象地看它的特性里有分享与开放的精神。事实总是胜于雄辩,让我们看看这些先行者的网站如何做的: 展开更多
关键词 API WEB 应用程序 网络应用 网站
下载PDF
Knowledge Graph Construction and Applications for Web Search and Beyond 被引量:3
4
作者 Peilu Wang Hao Jiang +1 位作者 Jingfang Xu Qi Zhang 《Data Intelligence》 2019年第4期333-349,共17页
Knowledge graph(KG)has played an important role in enhancing the performance of many intelligent systems.In this paper,we introduce the solution of building a large-scale multi-source knowledge graph from scratch in S... Knowledge graph(KG)has played an important role in enhancing the performance of many intelligent systems.In this paper,we introduce the solution of building a large-scale multi-source knowledge graph from scratch in Sogou Inc.,including its architecture,technical implementation and applications.Unlike previous works that build knowledge graph with graph databases,we build the knowledge graph on top of SogouQdb,a distributed search engine developed by Sogou Web Search Department,which can be easily scaled to support petabytes of data.As a supplement to the search engine,we also introduce a series of models to support inference and graph based querying.Currently,the data of Sogou knowledge graph that are collected from 136 different websites and constantly updated consist of 54 million entities and over 600 million entity links.We also introduce three applications of knowledge graph in Sogou Inc.:entity detection and linking,knowledge based question answering and knowledge based dialog system.These applications have been used in Web search products to help user acquire information more efficiently. 展开更多
关键词 Knowledge graph Search engine Question answering
原文传递
Spotlight: Hot Target Discovery and Localization with Crowdsourced Photos 被引量:1
5
作者 Jiaxi Gu Jiliang Wang +3 位作者 Lan Zhang Zhiwen Yu Xiaozhe Xin Yunhao Liu 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2020年第1期68-80,共13页
Camera-equipped mobile devices are encouraging people to take more photos and the development and growth of social networks is making it increasingly popular to share photos online. When objects appear in overlapping ... Camera-equipped mobile devices are encouraging people to take more photos and the development and growth of social networks is making it increasingly popular to share photos online. When objects appear in overlapping Fields Of View(FOV), this means that they are drawing much attention and thus indicates their popularity. Successfully discovering and locating these objects can be very useful for many applications, such as criminal investigations, event summaries, and crowdsourcing-based Geographical Information Systems(GIS).Existing methods require either prior knowledge of the environment or intentional photographing. In this paper, we propose a seamless approach called 'Spotlight', which performs passive localization using crowdsourced photos.Using a graph-based model, we combine object images across multiple camera views. Within each set of combined object images, a photographing map is built on which object localization is performed using plane geometry. We evaluate the system’s localization accuracy using photos taken in various scenarios, with the results showing our approach to be effective for passive object localization and to achieve a high level of accuracy. 展开更多
关键词 crowdsourcing LOCALIZATION MULTIMEDIA mobile COMPUTING
原文传递
kLDM:Inferring Multiple Metagenomic Association Networks Based on the Variation of Environmental Factors
6
作者 Yuqing Yang Xin Wang +3 位作者 Kaikun Xie Congmin Zhu Ning Chen Ting Chen 《Genomics, Proteomics & Bioinformatics》 SCIE CAS CSCD 2021年第5期834-847,共14页
Identification of significant biological relationships or patterns is central to many metagenomic studies.Methods that estimate association networks have been proposed for this purpose;however,they assume that associa... Identification of significant biological relationships or patterns is central to many metagenomic studies.Methods that estimate association networks have been proposed for this purpose;however,they assume that associations are static,neglecting the fact that relationships in a microbial ecosystem may vary with changes in environmental factors(EFs),which can result in inaccurate estimations.Therefore,in this study,we propose a computational model,called the k-Lognormal-Dirichlet-Multinomial(kLDM)model,which estimates multiple association networks that correspond to specific environmental conditions,and simultaneously infers microbe-microbe and EF-microbe associations for each network.The effectiveness of the kLDM model was demonstrated on synthetic data,a colorectal cancer(CRC)dataset,the Tara Oceans dataset,and the American Gut Project dataset.The results revealed that the widely-used Spearman’s rank correlation coefficient method performed much worse than the other methods,indicating the importance of separating samples by environmental conditions.Cancer fecal samples were then compared with cancer-free samples,and the estimation achieved by kLDM exhibited fewer associations among microbes but stronger associations between specific bacteria,especially five CRC-associated operational taxonomic units,indicating gut microbe translocation in cancer patients.Some EF-dependent associations were then found within a marine eukaryotic community.Finally,the gut microbial heterogeneity of inflammatory bowel disease patients was detected.These results demonstrate that kLDM can elucidate the complex associations within microbial ecosystems.The kLDM program,R,and Python scripts,together with all experimental datasets,are accessible at https://github.com/tinglab/kLDM.git. 展开更多
关键词 METAGENOMICS Association inference Environmental condition Bayesian model Clustering
原文传递
Memory Access Optimization of Molecular Dynamics Simulation Software Crystal-MD on Sunway Taihu Light
7
作者 Jianjiang Li Jie Lin +2 位作者 Panpan Du Kai Zhang Jie Wu 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2021年第3期296-308,共13页
The radiation damage effect of key structural materials is one of the main research subjects of the numerical reactor.From the perspective of experimental safety and feasibility,Molecular Dynamics(MD)in the materials ... The radiation damage effect of key structural materials is one of the main research subjects of the numerical reactor.From the perspective of experimental safety and feasibility,Molecular Dynamics(MD)in the materials field is an ideal method for simulating the radiation damage of structural materials.The Crystal-MD represents a massive parallel MD simulation software based on the key material characteristics of reactors.Compared with the Large-scale Atomic/Molecurlar Massively Parallel Simulator(LAMMPS)and ITAP Molecular Dynamics(IMD)software,the Crystal-MD reduces the memory required for software operation to a certain extent,but it is very time-consuming.Moreover,the calculation results of the Crystal-MD have large deviations,and there are also some problems,such as memory limitation and frequent communication during its migration and optimization.In this paper,in order to solve the above problems,the memory access mode of the Crystal-MD software is studied.Based on the memory access mode,a memory access optimization strategy is proposed for a unique architecture of China’s supercomputer Sunway Taihu Light.The proposed optimization strategy is verified by the experiments,and experimental results show that the running speed of the Crystal-MD is increased significantly by using the proposed optimization strategy. 展开更多
关键词 molecular dynamics simulation Crystal-MD Sunway Taihu Light memory access optimization
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部