When the Transformer proposed by Google in 2017,it was first used for machine translation tasks and achieved the state of the art at that time.Although the current neural machine translation model can generate high qu...When the Transformer proposed by Google in 2017,it was first used for machine translation tasks and achieved the state of the art at that time.Although the current neural machine translation model can generate high quality translation results,there are still mistranslations and omissions in the translation of key information of long sentences.On the other hand,the most important part in traditional translation tasks is the translation of key information.In the translation results,as long as the key information is translated accurately and completely,even if other parts of the results are translated incorrect,the final translation results’quality can still be guaranteed.In order to solve the problem of mistranslation and missed translation effectively,and improve the accuracy and completeness of long sentence translation in machine translation,this paper proposes a key information fused neural machine translation model based on Transformer.The model proposed in this paper extracts the keywords of the source language text separately as the input of the encoder.After the same encoding as the source language text,it is fused with the output of the source language text encoded by the encoder,then the key information is processed and input into the decoder.With incorporating keyword information from the source language sentence,the model’s performance in the task of translating long sentences is very reliable.In order to verify the effectiveness of the method of fusion of key information proposed in this paper,a series of experiments were carried out on the verification set.The experimental results show that the Bilingual Evaluation Understudy(BLEU)score of the model proposed in this paper on theWorkshop on Machine Translation(WMT)2017 test dataset is higher than the BLEU score of Transformer proposed by Google on the WMT2017 test dataset.The experimental results show the advantages of the model proposed in this paper.展开更多
Machine Translation are increasingly welcomed and used during recent years with the commonly application of Internet and the acceleration of the integration of world economy. To know about the history and development ...Machine Translation are increasingly welcomed and used during recent years with the commonly application of Internet and the acceleration of the integration of world economy. To know about the history and development process of Machine Translation during 1930s-1970 s could help researchers gain new insights through restudying old material.展开更多
In view of the lack of research on the information model of tufting carpet machine in China,an information modeling method based on Object Linking and Embedding for Process Control Unified Architecture(OPC UA)framewor...In view of the lack of research on the information model of tufting carpet machine in China,an information modeling method based on Object Linking and Embedding for Process Control Unified Architecture(OPC UA)framework was proposed to solve the problem of“information island”caused by the differentiated data interface between heterogeneous equipment and system in tufting carpet machine workshop.This paper established an information model of tufting carpet machine based on analyzing the system architecture,workshop equipment composition and information flow of the workshop,combined with the OPC UA information modeling specification.Subsequently,the OPC UA protocol is used to instantiate and map the information model,and the OPC UA server is developed.Finally,the practicability of tufting carpet machine information model under the OPC UA framework and the feasibility of realizing the information interconnection of heterogeneous devices in the tufting carpet machine digital workshop are verified.On this basis,the cloud and remote access to the underlying device data are realized.The application of this information model and information integration scheme in actual production explores and practices the application of OPC UA technology in the digital workshop of tufting carpet machine.展开更多
The development o f the network technology, and especially the web search engine, has brought great changes to the field of the English translation. Translators can acquire the background information of the translated...The development o f the network technology, and especially the web search engine, has brought great changes to the field of the English translation. Translators can acquire the background information of the translated texts by using the web search engine correctly, inquire about the correct translation methods of the rare professional terms, apply the fixed sentence patterns, and check the correctness of the translation, so as to improve the translation speed and quality.展开更多
In bilingual translation,attention-based Neural Machine Translation(NMT)models are used to achieve synchrony between input and output sequences and the notion of alignment.NMT model has obtained state-of-the-art perfo...In bilingual translation,attention-based Neural Machine Translation(NMT)models are used to achieve synchrony between input and output sequences and the notion of alignment.NMT model has obtained state-of-the-art performance for several language pairs.However,there has been little work exploring useful architectures for Urdu-to-English machine translation.We conducted extensive Urdu-to-English translation experiments using Long short-term memory(LSTM)/Bidirectional recurrent neural networks(Bi-RNN)/Statistical recurrent unit(SRU)/Gated recurrent unit(GRU)/Convolutional neural network(CNN)and Transformer.Experimental results show that Bi-RNN and LSTM with attention mechanism trained iteratively,with a scalable data set,make precise predictions on unseen data.The trained models yielded competitive results by achieving 62.6%and 61%accuracy and 49.67 and 47.14 BLEU scores,respectively.From a qualitative perspective,the translation of the test sets was examined manually,and it was observed that trained models tend to produce repetitive output more frequently.The attention score produced by Bi-RNN and LSTM produced clear alignment,while GRU showed incorrect translation for words,poor alignment and lack of a clear structure.Therefore,we considered refining the attention-based models by defining an additional attention-based dropout layer.Attention dropout fixes alignment errors and minimizes translation errors at the word level.After empirical demonstration and comparison with their counterparts,we found improvement in the quality of the resulting translation system and a decrease in the perplexity and over-translation score.The ability of the proposed model was evaluated using Arabic-English and Persian-English datasets as well.We empirically concluded that adding an attention-based dropout layer helps improve GRU,SRU,and Transformer translation and is considerably more efficient in translation quality and speed.展开更多
Scientific literature often contains abbreviated terms in English for brief.Machine translation(MT)systems can help to share knowledge in different languages among researchers.Current MT systems may translate the same...Scientific literature often contains abbreviated terms in English for brief.Machine translation(MT)systems can help to share knowledge in different languages among researchers.Current MT systems may translate the same abbreviated term in different sentences into different target terms.MT systems translate the abbreviated term in two ways:one is to use translation of the full name,the other is to use the abbreviated term directly.Abbreviated terms may be ambiguous and polysemous,and MT systems do not have an explicit strategy to decide which way to use without context information.To get the consistent translation for abbreviated terms in scientific literature,this paper proposes a translation model for abbreviated terms that integrates context information to get consistent translation of abbreviated terms.The context information includes the positions of abbreviated term and domain attributes of scientific literature.The first abbreviated term is translated in full name while the latter ones of the same abbreviated term will show the abbreviated form in the translation text.Experiments of translation from Chinese to English show the effectiveness of the proposed translation model.展开更多
Lexicalized reordering models are very important components of phrasebased translation systems.By examining the reordering relationships between adjacent phrases,conventional methods learn these models from the word a...Lexicalized reordering models are very important components of phrasebased translation systems.By examining the reordering relationships between adjacent phrases,conventional methods learn these models from the word aligned bilingual corpus,while ignoring the effect of the number of adjacent bilingual phrases.In this paper,we propose a method to take the number of adjacent phrases into account for better estimation of reordering models.Instead of just checking whether there is one phrase adjacent to a given phrase,our method firstly uses a compact structure named reordering graph to represent all phrase segmentations of a parallel sentence,then the effect of the adjacent phrase number can be quantified in a forward-backward fashion,and finally incorporated into the estimation of reordering models.Experimental results on the NIST Chinese-English and WMT French-Spanish data sets show that our approach significantly outperforms the baseline method.展开更多
To cope with the market demand dynamically,enterprise needs to obtain the production status of work in process real-timely,but the information of machining progress has feature of uncertainty and can not reflect the s...To cope with the market demand dynamically,enterprise needs to obtain the production status of work in process real-timely,but the information of machining progress has feature of uncertainty and can not reflect the status of production field effectively.In this work,to overcome the ineffectiveness of computer numerical control(CNC) machining progress information extraction and its application restriction in practice because of heterogeneous system of CNC machine,based on information fusion by analyzing multi-sources information,estimating CNC machining status and predicting the machining progress through tracking tool coordinates,a CNC machining progress monitoring method is presented.The multi-sources heterogeneous information includes machining path,real-time spindle power information,manual input data and tool position.On the method of obtaining this multi-sources heterogeneous information,the method which helps explore numerical control(NC) program,monitor spindle power of CNC,collect human-computer interaction(HCI) information,obtain real-time tool coordinates and express the knowledge concerned in this field is analyzed; The decision rule of CNC machining status in the way of fusing multi-sources information in manufacturing process is summarized,as well as the machining progress tracking method in accordance with real-time tool coordinates and machining path is presented.Finally,the method discussed is proved feasible by the verification of machining progress tracking through simulation experiment.The proposed research realizes the effective integration of CNC machining progress information,and enables enterprises an efficient way to share CNC information and configure CNC resources optimally.展开更多
Similar to many fields of sciences,recent deep learning advances have been applied extensively in geosciences for both small-and large-scale problems.However,the necessity of using large training data and the’black ...Similar to many fields of sciences,recent deep learning advances have been applied extensively in geosciences for both small-and large-scale problems.However,the necessity of using large training data and the’black box’nature of learning have limited them in practice and difficult to interpret.Furthermore,including the governing equations and physical facts in such methods is also another challenge,which entails either ignoring the physics or simplifying them using unrealistic data.To address such issues,physics informed machine learning methods have been developed which can integrate the governing physics law into the learning process.In this work,a 1-dimensional(1 D)time-dependent seismic wave equation is considered and solved using two methods,namely Gaussian process(GP)and physics informed neural networks.We show that these meshless methods are trained by smaller amount of data and can predict the solution of the equation with even high accuracy.They are also capable of inverting any parameter involved in the governing equation such as wave velocity in our case.Results show that the GP can predict the solution of the seismic wave equation with a lower level of error,while our developed neural network is more accurate for velocity(P-and S-wave)and density inversion.展开更多
Authors of that paper proposed a prototype machine translator system to translate scientific English sentences into Ara- bic sentences. This system is based on natural language processing and machine learning. This pr...Authors of that paper proposed a prototype machine translator system to translate scientific English sentences into Ara- bic sentences. This system is based on natural language processing and machine learning. This proposed system is ap- plied in statistical field, which is very important on a mathematical sub field in Math department. The system is ana- lyzed, designed and developed. Author tested the proposed system on some statistical statements. It proves its validity as a prototype system.展开更多
Thanks to the strong representation capability of pre-trained language models,supervised machine translation models have achieved outstanding performance.However,the performances of these models drop sharply when the ...Thanks to the strong representation capability of pre-trained language models,supervised machine translation models have achieved outstanding performance.However,the performances of these models drop sharply when the scale of the parallel training corpus is limited.Considering the pre-trained language model has a strong ability for monolingual representation,it is the key challenge for machine translation to construct the in-depth relationship between the source and target language by injecting the lexical and syntactic information into pre-trained language models.To alleviate the dependence on the parallel corpus,we propose a Linguistics Knowledge-Driven MultiTask(LKMT)approach to inject part-of-speech and syntactic knowledge into pre-trained models,thus enhancing the machine translation performance.On the one hand,we integrate part-of-speech and dependency labels into the embedding layer and exploit large-scale monolingual corpus to update all parameters of pre-trained language models,thus ensuring the updated language model contains potential lexical and syntactic information.On the other hand,we leverage an extra self-attention layer to explicitly inject linguistic knowledge into the pre-trained language model-enhanced machine translation model.Experiments on the benchmark dataset show that our proposed LKMT approach improves the Urdu-English translation accuracy by 1.97 points and the English-Urdu translation accuracy by 2.42 points,highlighting the effectiveness of our LKMT framework.Detailed ablation experiments confirm the positive impact of part-of-speech and dependency parsing on machine translation.展开更多
Temporal relation computation is one of the tasks of the extraction of temporal arguments from event, and it is also the ultimate goal of temporal information processing. However, temporal relation computation based o...Temporal relation computation is one of the tasks of the extraction of temporal arguments from event, and it is also the ultimate goal of temporal information processing. However, temporal relation computation based on machine learning requires a lot of hand-marked work, and exploring more features from discourse. A method of two-stage machine learning based on temporal relation computation (TSMLTRC) is proposed in this paper for the shortcomings of current temporal relation computation between two events. The first stage is to get the main temporal attributes of event based on classification learning. The second stage is to compute the event temporal relation in the discourse through employing the result of the first stage as the basic features, and also employing some new linguistic characteristics. Experiments show that, compared with the artificial golden rule, the computational efficiency in the first stage is much higher, and the F1-Score of event temporal relation which is computed through combining multi-features may be increased at 85.8% in the second stage.展开更多
Automated manufacturing system is characterized by flexibility. It aims at producing a variety of products with virtually no time loses to change over from one part to the next. In this paper, the Machining Process Si...Automated manufacturing system is characterized by flexibility. It aims at producing a variety of products with virtually no time loses to change over from one part to the next. In this paper, the Machining Process Simulator GMPS is introduced, which can be used as a supported environment for machining process. It can be executed off-line or on-line in manufacturing systems in order to predict the collisions of tool with machined workpieces, fixtures or pallets. First, the functional model of GMPS is described, then adopted critical techniques in the simulator are introduced. Finally, an application of GMPS in CIMS ERC of China is presented.展开更多
The technological breakthroughs in generative artificial intelligence,represented by ChatGPT,have brought about significant social changes as well as new problems and challenges.Generative artificial intelligence has ...The technological breakthroughs in generative artificial intelligence,represented by ChatGPT,have brought about significant social changes as well as new problems and challenges.Generative artificial intelligence has inherent flaws such as language imbalance,algorithmic black box,and algorithmic bias,and at the same time,it has external risks such as algorithmic comfort zone,data pollution,algorithmic infringement,and inaccurate output.These problems lead to the difficulty in legislation for the governance of generative artificial intelligence.Taking the data contamination incident in Google Translate as an example,this article proposes that in the process of constructing machine translation ethics,the responsibility mechanism of generative artificial intelligence should be constructed around three elements:data processing,algorithmic optimisation,and ethical alignment.展开更多
Most State-Of-The-Art(SOTA) Neural Machine Translation(NMT) systems today achieve outstanding results based only on large parallel corpora.The large-scale parallel corpora for high-resource languages is easily obtaina...Most State-Of-The-Art(SOTA) Neural Machine Translation(NMT) systems today achieve outstanding results based only on large parallel corpora.The large-scale parallel corpora for high-resource languages is easily obtainable.However,the translation quality of NMT for morphologically rich languages is still unsatisfactory,mainly because of the data sparsity problem encountered in Low-Resource Languages(LRLs).In the low-resource NMT paradigm,Transfer Learning(TL) has been developed into one of the most efficient methods.It is difficult to train the model on high-resource languages to include the information in both parent and child models,as well as the initially trained model that only contains the lexicon features and word embeddings of the parent model instead of the child languages feature.In this work,we aim to address this issue by proposing the language-independent Hybrid Transfer Learning(HTL) method for LRLs by sharing lexicon embedding between parent and child languages without leveraging back translation or manually injecting noises.First,we train the High-Resource Languages(HRLs) as the parent model with its vocabularies.Then,we combine the parent and child language pairs using the oversampling method to train the hybrid model initialized by the previously parent model.Finally,we fine-tune the morphologically rich child model using a hybrid model.Besides,we explore some exciting discoveries on the original TL approach.Experimental results show that our model consistently outperforms five SOTA methods in two languages Azerbaijani(Az) and Uzbek(Uz).Meanwhile,our approach is practical and significantly better,achieving improvements of up to 4:94 and 4:84 BLEU points for low-resource child languages Az ! Zh and Uz ! Zh,respectively.展开更多
The pivot language approach for statistical machine translation(SMT) is a good method to break the resource bottleneck for certain language pairs. However, in the implementation of conventional approaches, pivotside c...The pivot language approach for statistical machine translation(SMT) is a good method to break the resource bottleneck for certain language pairs. However, in the implementation of conventional approaches, pivotside context information is far from fully utilized, resulting in erroneous estimations of translation probabilities. In this study, we propose two topic-aware pivot language approaches to use different levels of pivot-side context. The first method takes advantage of document-level context by assuming that the bridged phrase pairs should be similar in the document-level topic distributions. The second method focuses on the effect of local context. Central to this approach are that the phrase sense can be reflected by local context in the form of probabilistic topics, and that bridged phrase pairs should be compatible in the latent sense distributions. Then, we build an interpolated model bringing the above methods together to further enhance the system performance. Experimental results on French-Spanish and French-German translations using English as the pivot language demonstrate the effectiveness of topic-based context in pivot-based SMT.展开更多
Aiming at the problem that virtual machine information cannot be extracted incompletely, we extend the typical information extraction model of virtual machine and propose a perception mechanism in virtualization syste...Aiming at the problem that virtual machine information cannot be extracted incompletely, we extend the typical information extraction model of virtual machine and propose a perception mechanism in virtualization system based on storage covert channel to overcome the affection of the semantic gap. Taking advantage of undetectability of the covert channel, a secure channel is established between Guest and virtual machine monitor to pass data directly. The Guest machine can pass the control information of malicious process to virtual machine monitor by using the VMCALL instruction and shared memory. By parsing critical information in process control structure, virtual machine monitor can terminate the malicious processes. The test results show that the proposed mechanism can clear the user-level malicious programs in the virtual machine effectively and covertly. Meanwhile, its performance overhead is about the same as that of other mainstream monitoring mode.展开更多
This study presents a novel and innovative approach to auto-matically translating Arabic Sign Language(ATSL)into spoken Arabic.The proposed solution utilizes a deep learning-based classification approach and the trans...This study presents a novel and innovative approach to auto-matically translating Arabic Sign Language(ATSL)into spoken Arabic.The proposed solution utilizes a deep learning-based classification approach and the transfer learning technique to retrain 12 image recognition models.The image-based translation method maps sign language gestures to corre-sponding letters or words using distance measures and classification as a machine learning technique.The results show that the proposed model is more accurate and faster than traditional image-based models in classifying Arabic-language signs,with a translation accuracy of 93.7%.This research makes a significant contribution to the field of ATSL.It offers a practical solution for improving communication for individuals with special needs,such as the deaf and mute community.This work demonstrates the potential of deep learning techniques in translating sign language into natural language and highlights the importance of ATSL in facilitating communication for individuals with disabilities.展开更多
基金Major Science and Technology Project of Sichuan Province[No.2022YFG0315,2022YFG0174]Sichuan Gas Turbine Research Institute stability support project of China Aero Engine Group Co.,Ltd.[No.GJCZ-2019-71].
文摘When the Transformer proposed by Google in 2017,it was first used for machine translation tasks and achieved the state of the art at that time.Although the current neural machine translation model can generate high quality translation results,there are still mistranslations and omissions in the translation of key information of long sentences.On the other hand,the most important part in traditional translation tasks is the translation of key information.In the translation results,as long as the key information is translated accurately and completely,even if other parts of the results are translated incorrect,the final translation results’quality can still be guaranteed.In order to solve the problem of mistranslation and missed translation effectively,and improve the accuracy and completeness of long sentence translation in machine translation,this paper proposes a key information fused neural machine translation model based on Transformer.The model proposed in this paper extracts the keywords of the source language text separately as the input of the encoder.After the same encoding as the source language text,it is fused with the output of the source language text encoded by the encoder,then the key information is processed and input into the decoder.With incorporating keyword information from the source language sentence,the model’s performance in the task of translating long sentences is very reliable.In order to verify the effectiveness of the method of fusion of key information proposed in this paper,a series of experiments were carried out on the verification set.The experimental results show that the Bilingual Evaluation Understudy(BLEU)score of the model proposed in this paper on theWorkshop on Machine Translation(WMT)2017 test dataset is higher than the BLEU score of Transformer proposed by Google on the WMT2017 test dataset.The experimental results show the advantages of the model proposed in this paper.
文摘Machine Translation are increasingly welcomed and used during recent years with the commonly application of Internet and the acceleration of the integration of world economy. To know about the history and development process of Machine Translation during 1930s-1970 s could help researchers gain new insights through restudying old material.
文摘In view of the lack of research on the information model of tufting carpet machine in China,an information modeling method based on Object Linking and Embedding for Process Control Unified Architecture(OPC UA)framework was proposed to solve the problem of“information island”caused by the differentiated data interface between heterogeneous equipment and system in tufting carpet machine workshop.This paper established an information model of tufting carpet machine based on analyzing the system architecture,workshop equipment composition and information flow of the workshop,combined with the OPC UA information modeling specification.Subsequently,the OPC UA protocol is used to instantiate and map the information model,and the OPC UA server is developed.Finally,the practicability of tufting carpet machine information model under the OPC UA framework and the feasibility of realizing the information interconnection of heterogeneous devices in the tufting carpet machine digital workshop are verified.On this basis,the cloud and remote access to the underlying device data are realized.The application of this information model and information integration scheme in actual production explores and practices the application of OPC UA technology in the digital workshop of tufting carpet machine.
文摘The development o f the network technology, and especially the web search engine, has brought great changes to the field of the English translation. Translators can acquire the background information of the translated texts by using the web search engine correctly, inquire about the correct translation methods of the rare professional terms, apply the fixed sentence patterns, and check the correctness of the translation, so as to improve the translation speed and quality.
基金This work was supported by the Institute for Big Data Analytics and Artificial Intelligence(IBDAAI),Universiti Teknologi Mara,Shah Alam,Selangor.Malaysia.
文摘In bilingual translation,attention-based Neural Machine Translation(NMT)models are used to achieve synchrony between input and output sequences and the notion of alignment.NMT model has obtained state-of-the-art performance for several language pairs.However,there has been little work exploring useful architectures for Urdu-to-English machine translation.We conducted extensive Urdu-to-English translation experiments using Long short-term memory(LSTM)/Bidirectional recurrent neural networks(Bi-RNN)/Statistical recurrent unit(SRU)/Gated recurrent unit(GRU)/Convolutional neural network(CNN)and Transformer.Experimental results show that Bi-RNN and LSTM with attention mechanism trained iteratively,with a scalable data set,make precise predictions on unseen data.The trained models yielded competitive results by achieving 62.6%and 61%accuracy and 49.67 and 47.14 BLEU scores,respectively.From a qualitative perspective,the translation of the test sets was examined manually,and it was observed that trained models tend to produce repetitive output more frequently.The attention score produced by Bi-RNN and LSTM produced clear alignment,while GRU showed incorrect translation for words,poor alignment and lack of a clear structure.Therefore,we considered refining the attention-based models by defining an additional attention-based dropout layer.Attention dropout fixes alignment errors and minimizes translation errors at the word level.After empirical demonstration and comparison with their counterparts,we found improvement in the quality of the resulting translation system and a decrease in the perplexity and over-translation score.The ability of the proposed model was evaluated using Arabic-English and Persian-English datasets as well.We empirically concluded that adding an attention-based dropout layer helps improve GRU,SRU,and Transformer translation and is considerably more efficient in translation quality and speed.
基金the National Key Research and Development Program of China(No.2019YFA0707201)ISTIC Research Foundation Project(No.ZD2020-10)。
文摘Scientific literature often contains abbreviated terms in English for brief.Machine translation(MT)systems can help to share knowledge in different languages among researchers.Current MT systems may translate the same abbreviated term in different sentences into different target terms.MT systems translate the abbreviated term in two ways:one is to use translation of the full name,the other is to use the abbreviated term directly.Abbreviated terms may be ambiguous and polysemous,and MT systems do not have an explicit strategy to decide which way to use without context information.To get the consistent translation for abbreviated terms in scientific literature,this paper proposes a translation model for abbreviated terms that integrates context information to get consistent translation of abbreviated terms.The context information includes the positions of abbreviated term and domain attributes of scientific literature.The first abbreviated term is translated in full name while the latter ones of the same abbreviated term will show the abbreviated form in the translation text.Experiments of translation from Chinese to English show the effectiveness of the proposed translation model.
基金supported by the National Natural Science Foundation of China(No.61303082) the Research Fund for the Doctoral Program of Higher Education of China(No.20120121120046)
文摘Lexicalized reordering models are very important components of phrasebased translation systems.By examining the reordering relationships between adjacent phrases,conventional methods learn these models from the word aligned bilingual corpus,while ignoring the effect of the number of adjacent bilingual phrases.In this paper,we propose a method to take the number of adjacent phrases into account for better estimation of reordering models.Instead of just checking whether there is one phrase adjacent to a given phrase,our method firstly uses a compact structure named reordering graph to represent all phrase segmentations of a parallel sentence,then the effect of the adjacent phrase number can be quantified in a forward-backward fashion,and finally incorporated into the estimation of reordering models.Experimental results on the NIST Chinese-English and WMT French-Spanish data sets show that our approach significantly outperforms the baseline method.
基金supported by National Natural Science Foundation of China (Grant No. 50775228)Municipality Key Scientific & Technological Program of Chongqing, China (Grant No. CSTC2007AA2013)+1 种基金Fundamental Research Funds for the Central Universities of China (Grant No. CDJXS11111136)Program for New Century Excellent Talents in University of Ministry of Education of China
文摘To cope with the market demand dynamically,enterprise needs to obtain the production status of work in process real-timely,but the information of machining progress has feature of uncertainty and can not reflect the status of production field effectively.In this work,to overcome the ineffectiveness of computer numerical control(CNC) machining progress information extraction and its application restriction in practice because of heterogeneous system of CNC machine,based on information fusion by analyzing multi-sources information,estimating CNC machining status and predicting the machining progress through tracking tool coordinates,a CNC machining progress monitoring method is presented.The multi-sources heterogeneous information includes machining path,real-time spindle power information,manual input data and tool position.On the method of obtaining this multi-sources heterogeneous information,the method which helps explore numerical control(NC) program,monitor spindle power of CNC,collect human-computer interaction(HCI) information,obtain real-time tool coordinates and express the knowledge concerned in this field is analyzed; The decision rule of CNC machining status in the way of fusing multi-sources information in manufacturing process is summarized,as well as the machining progress tracking method in accordance with real-time tool coordinates and machining path is presented.Finally,the method discussed is proved feasible by the verification of machining progress tracking through simulation experiment.The proposed research realizes the effective integration of CNC machining progress information,and enables enterprises an efficient way to share CNC information and configure CNC resources optimally.
文摘Similar to many fields of sciences,recent deep learning advances have been applied extensively in geosciences for both small-and large-scale problems.However,the necessity of using large training data and the’black box’nature of learning have limited them in practice and difficult to interpret.Furthermore,including the governing equations and physical facts in such methods is also another challenge,which entails either ignoring the physics or simplifying them using unrealistic data.To address such issues,physics informed machine learning methods have been developed which can integrate the governing physics law into the learning process.In this work,a 1-dimensional(1 D)time-dependent seismic wave equation is considered and solved using two methods,namely Gaussian process(GP)and physics informed neural networks.We show that these meshless methods are trained by smaller amount of data and can predict the solution of the equation with even high accuracy.They are also capable of inverting any parameter involved in the governing equation such as wave velocity in our case.Results show that the GP can predict the solution of the seismic wave equation with a lower level of error,while our developed neural network is more accurate for velocity(P-and S-wave)and density inversion.
文摘Authors of that paper proposed a prototype machine translator system to translate scientific English sentences into Ara- bic sentences. This system is based on natural language processing and machine learning. This proposed system is ap- plied in statistical field, which is very important on a mathematical sub field in Math department. The system is ana- lyzed, designed and developed. Author tested the proposed system on some statistical statements. It proves its validity as a prototype system.
基金supported by the National Natural Science Foundation of China under Grant(61732005,61972186)Yunnan Provincial Major Science and Technology Special Plan Projects(Nos.202103AA080015,202203AA080004).
文摘Thanks to the strong representation capability of pre-trained language models,supervised machine translation models have achieved outstanding performance.However,the performances of these models drop sharply when the scale of the parallel training corpus is limited.Considering the pre-trained language model has a strong ability for monolingual representation,it is the key challenge for machine translation to construct the in-depth relationship between the source and target language by injecting the lexical and syntactic information into pre-trained language models.To alleviate the dependence on the parallel corpus,we propose a Linguistics Knowledge-Driven MultiTask(LKMT)approach to inject part-of-speech and syntactic knowledge into pre-trained models,thus enhancing the machine translation performance.On the one hand,we integrate part-of-speech and dependency labels into the embedding layer and exploit large-scale monolingual corpus to update all parameters of pre-trained language models,thus ensuring the updated language model contains potential lexical and syntactic information.On the other hand,we leverage an extra self-attention layer to explicitly inject linguistic knowledge into the pre-trained language model-enhanced machine translation model.Experiments on the benchmark dataset show that our proposed LKMT approach improves the Urdu-English translation accuracy by 1.97 points and the English-Urdu translation accuracy by 2.42 points,highlighting the effectiveness of our LKMT framework.Detailed ablation experiments confirm the positive impact of part-of-speech and dependency parsing on machine translation.
基金Project supported the National Natural Science Foundation of China(Grant No.60975033)the Basic Scientific Research Project of International Centre for Bamboo Rattan(Grant No.1632009006)the Shanghai Leading Academic Discipline Project(Grant No.J50103)
文摘Temporal relation computation is one of the tasks of the extraction of temporal arguments from event, and it is also the ultimate goal of temporal information processing. However, temporal relation computation based on machine learning requires a lot of hand-marked work, and exploring more features from discourse. A method of two-stage machine learning based on temporal relation computation (TSMLTRC) is proposed in this paper for the shortcomings of current temporal relation computation between two events. The first stage is to get the main temporal attributes of event based on classification learning. The second stage is to compute the event temporal relation in the discourse through employing the result of the first stage as the basic features, and also employing some new linguistic characteristics. Experiments show that, compared with the artificial golden rule, the computational efficiency in the first stage is much higher, and the F1-Score of event temporal relation which is computed through combining multi-features may be increased at 85.8% in the second stage.
文摘Automated manufacturing system is characterized by flexibility. It aims at producing a variety of products with virtually no time loses to change over from one part to the next. In this paper, the Machining Process Simulator GMPS is introduced, which can be used as a supported environment for machining process. It can be executed off-line or on-line in manufacturing systems in order to predict the collisions of tool with machined workpieces, fixtures or pallets. First, the functional model of GMPS is described, then adopted critical techniques in the simulator are introduced. Finally, an application of GMPS in CIMS ERC of China is presented.
基金supported by Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies(Grant No.2022B1212010005)XJTLU Research Development Funding(Grant No.RDF-22-01-053).
文摘The technological breakthroughs in generative artificial intelligence,represented by ChatGPT,have brought about significant social changes as well as new problems and challenges.Generative artificial intelligence has inherent flaws such as language imbalance,algorithmic black box,and algorithmic bias,and at the same time,it has external risks such as algorithmic comfort zone,data pollution,algorithmic infringement,and inaccurate output.These problems lead to the difficulty in legislation for the governance of generative artificial intelligence.Taking the data contamination incident in Google Translate as an example,this article proposes that in the process of constructing machine translation ethics,the responsibility mechanism of generative artificial intelligence should be constructed around three elements:data processing,algorithmic optimisation,and ethical alignment.
基金supported by the National Key R&D Program of China (No. 2017YFB0202204)the National Natural Science Foundation of China (Nos. 61925601, 61761166008, and 61772302)+1 种基金Beijing Advanced Innovation Center for Language Resources (No. TYR17002)the NExT ++ project which supported by the National Research Foundation, Prime Ministers Office, Singapore under its IRC@Singapore Funding Initiative。
文摘Most State-Of-The-Art(SOTA) Neural Machine Translation(NMT) systems today achieve outstanding results based only on large parallel corpora.The large-scale parallel corpora for high-resource languages is easily obtainable.However,the translation quality of NMT for morphologically rich languages is still unsatisfactory,mainly because of the data sparsity problem encountered in Low-Resource Languages(LRLs).In the low-resource NMT paradigm,Transfer Learning(TL) has been developed into one of the most efficient methods.It is difficult to train the model on high-resource languages to include the information in both parent and child models,as well as the initially trained model that only contains the lexicon features and word embeddings of the parent model instead of the child languages feature.In this work,we aim to address this issue by proposing the language-independent Hybrid Transfer Learning(HTL) method for LRLs by sharing lexicon embedding between parent and child languages without leveraging back translation or manually injecting noises.First,we train the High-Resource Languages(HRLs) as the parent model with its vocabularies.Then,we combine the parent and child language pairs using the oversampling method to train the hybrid model initialized by the previously parent model.Finally,we fine-tune the morphologically rich child model using a hybrid model.Besides,we explore some exciting discoveries on the original TL approach.Experimental results show that our model consistently outperforms five SOTA methods in two languages Azerbaijani(Az) and Uzbek(Uz).Meanwhile,our approach is practical and significantly better,achieving improvements of up to 4:94 and 4:84 BLEU points for low-resource child languages Az ! Zh and Uz ! Zh,respectively.
基金Project supported by the National High-Tech R&D Program of China(No.2012BAH14F03)the National Natural Science Foundation of China(Nos.61005052 and 61303082)+2 种基金the Re-search Fund for the Doctoral Program of Higher Education of China(No.20120121120046)the Natural Science Foundation of Fujian Province of China(No.2011J01360)the Funda-mental Research Funds for the Central Universities,China(No.2010121068)
文摘The pivot language approach for statistical machine translation(SMT) is a good method to break the resource bottleneck for certain language pairs. However, in the implementation of conventional approaches, pivotside context information is far from fully utilized, resulting in erroneous estimations of translation probabilities. In this study, we propose two topic-aware pivot language approaches to use different levels of pivot-side context. The first method takes advantage of document-level context by assuming that the bridged phrase pairs should be similar in the document-level topic distributions. The second method focuses on the effect of local context. Central to this approach are that the phrase sense can be reflected by local context in the form of probabilistic topics, and that bridged phrase pairs should be compatible in the latent sense distributions. Then, we build an interpolated model bringing the above methods together to further enhance the system performance. Experimental results on French-Spanish and French-German translations using English as the pivot language demonstrate the effectiveness of topic-based context in pivot-based SMT.
基金Supported by the National High Technology Research and Development Program of China (863 Program) (2009AA012200)Henan Province Science and Technology Funding Projects ( SP09JH11158)
文摘Aiming at the problem that virtual machine information cannot be extracted incompletely, we extend the typical information extraction model of virtual machine and propose a perception mechanism in virtualization system based on storage covert channel to overcome the affection of the semantic gap. Taking advantage of undetectability of the covert channel, a secure channel is established between Guest and virtual machine monitor to pass data directly. The Guest machine can pass the control information of malicious process to virtual machine monitor by using the VMCALL instruction and shared memory. By parsing critical information in process control structure, virtual machine monitor can terminate the malicious processes. The test results show that the proposed mechanism can clear the user-level malicious programs in the virtual machine effectively and covertly. Meanwhile, its performance overhead is about the same as that of other mainstream monitoring mode.
文摘This study presents a novel and innovative approach to auto-matically translating Arabic Sign Language(ATSL)into spoken Arabic.The proposed solution utilizes a deep learning-based classification approach and the transfer learning technique to retrain 12 image recognition models.The image-based translation method maps sign language gestures to corre-sponding letters or words using distance measures and classification as a machine learning technique.The results show that the proposed model is more accurate and faster than traditional image-based models in classifying Arabic-language signs,with a translation accuracy of 93.7%.This research makes a significant contribution to the field of ATSL.It offers a practical solution for improving communication for individuals with special needs,such as the deaf and mute community.This work demonstrates the potential of deep learning techniques in translating sign language into natural language and highlights the importance of ATSL in facilitating communication for individuals with disabilities.