The research aims to improve the performance of image recognition methods based on a description in the form of a set of keypoint descriptors.The main focus is on increasing the speed of establishing the relevance of ...The research aims to improve the performance of image recognition methods based on a description in the form of a set of keypoint descriptors.The main focus is on increasing the speed of establishing the relevance of object and etalon descriptions while maintaining the required level of classification efficiency.The class to be recognized is represented by an infinite set of images obtained from the etalon by applying arbitrary geometric transformations.It is proposed to reduce the descriptions for the etalon database by selecting the most significant descriptor components according to the information content criterion.The informativeness of an etalon descriptor is estimated by the difference of the closest distances to its own and other descriptions.The developed method determines the relevance of the full description of the recognized object with the reduced description of the etalons.Several practical models of the classifier with different options for establishing the correspondence between object descriptors and etalons are considered.The results of the experimental modeling of the proposed methods for a database including images of museum jewelry are presented.The test sample is formed as a set of images from the etalon database and out of the database with the application of geometric transformations of scale and rotation in the field of view.The practical problems of determining the threshold for the number of votes,based on which a classification decision is made,have been researched.Modeling has revealed the practical possibility of tenfold reducing descriptions with full preservation of classification accuracy.Reducing the descriptions by twenty times in the experiment leads to slightly decreased accuracy.The speed of the analysis increases in proportion to the degree of reduction.The use of reduction by the informativeness criterion confirmed the possibility of obtaining the most significant subset of features for classification,which guarantees a decent level of accuracy.展开更多
现有的多模态间歇过程软测量未考虑过程数据的批次差异及过渡模态的复杂时变特性,影响了间歇过程模态识别的合理性及质量变量在线软测量的准确性。提出了一种基于双边界支持向量数据描述-相关向量回归(double boundary support vector d...现有的多模态间歇过程软测量未考虑过程数据的批次差异及过渡模态的复杂时变特性,影响了间歇过程模态识别的合理性及质量变量在线软测量的准确性。提出了一种基于双边界支持向量数据描述-相关向量回归(double boundary support vector data description-relevance vector regression,DBSVDD-RVR)的间歇过程质量变量在线软测量方法。依据间歇过程离线模态划分获得的各稳定及过渡模态历史数据,建立DBSVDD在线模态识别模型,并引入滑动窗,构建间歇过程在线模态识别策略,利用DBSVDD模型实现在线测量数据的模态识别;在此基础上,构建了基于超球体距离的数据相似度计算方法,选择过渡模态在线数据的相似建模数据集,建立过渡模态的即时学习RVR软测量模型,并依据历史数据建立各稳定模态的RVR软测量模型,实现间歇过程质量变量的在线软测量。青霉素发酵过程的实验结果表明,所提方法有效地提高了间歇过程模态识别的合理性和质量变量在线软测量的准确性。展开更多
DD4hep serves as a generic detector description toolkit recommended for offline software development in next-generation high-energy physics(HEP)experiments.Conversely,Filmbox(FBX)stands out as a widely used 3D modelin...DD4hep serves as a generic detector description toolkit recommended for offline software development in next-generation high-energy physics(HEP)experiments.Conversely,Filmbox(FBX)stands out as a widely used 3D modeling file format within the 3D software industry.In this paper,we introduce a novel method that can automatically convert complex HEP detector geometries from DD4hep description into 3D models in the FBX format.The feasibility of this method was dem-onstrated by its application to the DD4hep description of the Compact Linear Collider detector and several sub-detectors of the super Tau-Charm facility and circular electron-positron collider experiments.The automatic DD4hep–FBX detector conversion interface provides convenience for further development of applications,such as detector design,simulation,visualization,data monitoring,and outreach,in HEP experiments.展开更多
Image description task is the intersection of computer vision and natural language processing,and it has important prospects,including helping computers understand images and obtaining information for the visually imp...Image description task is the intersection of computer vision and natural language processing,and it has important prospects,including helping computers understand images and obtaining information for the visually impaired.This study presents an innovative approach employing deep reinforcement learning to enhance the accuracy of natural language descriptions of images.Our method focuses on refining the reward function in deep reinforcement learning,facilitating the generation of precise descriptions by aligning visual and textual features more closely.Our approach comprises three key architectures.Firstly,it utilizes Residual Network 101(ResNet-101)and Faster Region-based Convolutional Neural Network(Faster R-CNN)to extract average and local image features,respectively,followed by the implementation of a dual attention mechanism for intricate feature fusion.Secondly,the Transformer model is engaged to derive contextual semantic features from textual data.Finally,the generation of descriptive text is executed through a two-layer long short-term memory network(LSTM),directed by the value and reward functions.Compared with the image description method that relies on deep learning,the score of Bilingual Evaluation Understudy(BLEU-1)is 0.762,which is 1.6%higher,and the score of BLEU-4 is 0.299.Consensus-based Image Description Evaluation(CIDEr)scored 0.998,Recall-Oriented Understudy for Gisting Evaluation(ROUGE)scored 0.552,the latter improved by 0.36%.These results not only attest to the viability of our approach but also highlight its superiority in the realm of image description.Future research can explore the integration of our method with other artificial intelligence(AI)domains,such as emotional AI,to create more nuanced and context-aware systems.展开更多
Hot deformation is a commonly employed processing technique to enhance the ductility and workability of Mg alloy.However,the hot deformation of Mg alloy is highly sensitive to factors such as temperature,strain rate,a...Hot deformation is a commonly employed processing technique to enhance the ductility and workability of Mg alloy.However,the hot deformation of Mg alloy is highly sensitive to factors such as temperature,strain rate,and strain,leading to complex flow behavior and an exceptionally narrow processing window for Mg alloy.To overcome the shortcomings of the conventional Arrhenius-type(AT)model,this study developed machine learning-based Arrhenius-type(ML-AT)models by combining the genetic algorithm(GA),particle swarm optimization(PSO),and artificial neural network(ANN).Results indicated that when describing the flow behavior of the AQ80 alloy,the PSO-ANN-AT model demonstrates the most prominent prediction accuracy and generalization ability among all ML-AT and AT models.Moreover,an activation energy-processing(AEP)map was established using the reconstructed flow stress and activation energy fields based on the PSO-ANN-AT model.Experimental validations revealed that this AEP map exhibits superior predictive capability for microstructure evolution compared to the one established by the traditional interpolation methods,ultimately contributing to the precise determination of the optimum processing window.These findings provide fresh insights into the accurate constitutive description and workability characterization of Mg alloy during hot deformation.展开更多
Cross-lingual image description,the task of generating image captions in a target language from images and descriptions in a source language,is addressed in this study through a novel approach that combines neural net...Cross-lingual image description,the task of generating image captions in a target language from images and descriptions in a source language,is addressed in this study through a novel approach that combines neural network models and semantic matching techniques.Experiments conducted on the Flickr8k and AraImg2k benchmark datasets,featuring images and descriptions in English and Arabic,showcase remarkable performance improvements over state-of-the-art methods.Our model,equipped with the Image&Cross-Language Semantic Matching module and the Target Language Domain Evaluation module,significantly enhances the semantic relevance of generated image descriptions.For English-to-Arabic and Arabic-to-English cross-language image descriptions,our approach achieves a CIDEr score for English and Arabic of 87.9%and 81.7%,respectively,emphasizing the substantial contributions of our methodology.Comparative analyses with previous works further affirm the superior performance of our approach,and visual results underscore that our model generates image captions that are both semantically accurate and stylistically consistent with the target language.In summary,this study advances the field of cross-lingual image description,offering an effective solution for generating image captions across languages,with the potential to impact multilingual communication and accessibility.Future research directions include expanding to more languages and incorporating diverse visual and textual data sources.展开更多
The Dirac equation γ<sub>μ</sub>(δ<sub>μ</sub>-eA<sub>μ</sub>)Ψ=mc<sup>2</sup>Ψ describes the bound states of the electron under the action of external potentials...The Dirac equation γ<sub>μ</sub>(δ<sub>μ</sub>-eA<sub>μ</sub>)Ψ=mc<sup>2</sup>Ψ describes the bound states of the electron under the action of external potentials, A<sub>μ</sub>. We assumed that the fundamental form of the Dirac equation γ<sub>μ</sub>(δ<sub>μ</sub>-S<sub>μ</sub>)Ψ=0 should describe the stable particles (the electron, the proton and the dark-matter-particle (dmp)) bound to themselves under the action of their own potentials S<sub>μ</sub>. The new equation reveals that self energy is consequence of self action, it also reveals that the spin angular momentum is consequence of the dynamic structure of the stable particles. The quantitative results are the determination of their relative masses as well as the determination of the electromagnetic coupling constant.展开更多
为解决传统航空发动机异常检测方法准确率和泛化性能较低的问题,提出一种混合核最大相关熵的深度支持向量数据描述(mixed kernel maximum correntropy criterion-deep support vector data description,MKMCC-DSVDD)方法。首先,采用合...为解决传统航空发动机异常检测方法准确率和泛化性能较低的问题,提出一种混合核最大相关熵的深度支持向量数据描述(mixed kernel maximum correntropy criterion-deep support vector data description,MKMCC-DSVDD)方法。首先,采用合成少数类过采样技术扩充异常样本规模,提高对非均衡样本的泛化性能;其次,建立基于混合核改进的最大相关熵损失函数,可以在无须数据分布假设的前提下提升准确率;最后,构建基于MKMCC-DSVDD的航空发动机异常检测方法。在航空发动机气路系统和滑油系统异常检测实验中,所提方法平均曲线下的面积(area under curve,AUC)达到98.53%,表明其具有较高的实用性和泛化性能。展开更多
基金This research was funded by Prince Sattam bin Abdulaziz University(Project Number PSAU/2023/01/25387).
文摘The research aims to improve the performance of image recognition methods based on a description in the form of a set of keypoint descriptors.The main focus is on increasing the speed of establishing the relevance of object and etalon descriptions while maintaining the required level of classification efficiency.The class to be recognized is represented by an infinite set of images obtained from the etalon by applying arbitrary geometric transformations.It is proposed to reduce the descriptions for the etalon database by selecting the most significant descriptor components according to the information content criterion.The informativeness of an etalon descriptor is estimated by the difference of the closest distances to its own and other descriptions.The developed method determines the relevance of the full description of the recognized object with the reduced description of the etalons.Several practical models of the classifier with different options for establishing the correspondence between object descriptors and etalons are considered.The results of the experimental modeling of the proposed methods for a database including images of museum jewelry are presented.The test sample is formed as a set of images from the etalon database and out of the database with the application of geometric transformations of scale and rotation in the field of view.The practical problems of determining the threshold for the number of votes,based on which a classification decision is made,have been researched.Modeling has revealed the practical possibility of tenfold reducing descriptions with full preservation of classification accuracy.Reducing the descriptions by twenty times in the experiment leads to slightly decreased accuracy.The speed of the analysis increases in proportion to the degree of reduction.The use of reduction by the informativeness criterion confirmed the possibility of obtaining the most significant subset of features for classification,which guarantees a decent level of accuracy.
文摘现有的多模态间歇过程软测量未考虑过程数据的批次差异及过渡模态的复杂时变特性,影响了间歇过程模态识别的合理性及质量变量在线软测量的准确性。提出了一种基于双边界支持向量数据描述-相关向量回归(double boundary support vector data description-relevance vector regression,DBSVDD-RVR)的间歇过程质量变量在线软测量方法。依据间歇过程离线模态划分获得的各稳定及过渡模态历史数据,建立DBSVDD在线模态识别模型,并引入滑动窗,构建间歇过程在线模态识别策略,利用DBSVDD模型实现在线测量数据的模态识别;在此基础上,构建了基于超球体距离的数据相似度计算方法,选择过渡模态在线数据的相似建模数据集,建立过渡模态的即时学习RVR软测量模型,并依据历史数据建立各稳定模态的RVR软测量模型,实现间歇过程质量变量的在线软测量。青霉素发酵过程的实验结果表明,所提方法有效地提高了间歇过程模态识别的合理性和质量变量在线软测量的准确性。
基金supported by the National Natural Science Foundation of China(Nos.12175321,11975021,11675275,and U1932101)National Key Research and Development Program of China(Nos.2023YFA1606000 and 2020YFA0406400)+2 种基金State Key Laboratory of Nuclear Physics and Technology,Peking University(Nos.NPT2020KFY04 and NPT2020KFY05)Strategic Priority Research Program of the Chinese Academy of Sciences(No.XDA10010900)National College Students Science and Technology Innovation Project,and Undergraduate Base Scientific Research Project of Sun Yat-sen University。
文摘DD4hep serves as a generic detector description toolkit recommended for offline software development in next-generation high-energy physics(HEP)experiments.Conversely,Filmbox(FBX)stands out as a widely used 3D modeling file format within the 3D software industry.In this paper,we introduce a novel method that can automatically convert complex HEP detector geometries from DD4hep description into 3D models in the FBX format.The feasibility of this method was dem-onstrated by its application to the DD4hep description of the Compact Linear Collider detector and several sub-detectors of the super Tau-Charm facility and circular electron-positron collider experiments.The automatic DD4hep–FBX detector conversion interface provides convenience for further development of applications,such as detector design,simulation,visualization,data monitoring,and outreach,in HEP experiments.
基金This research was funded by the Natural Science Foundation of Gansu Province with Approval Numbers 20JR10RA334 and 21JR7RA570Funding is provided for the 2021 Longyuan Youth Innovation and Entrepreneurship Talent Project with Approval Number 2021LQGR20+1 种基金the University Level Innovation Project with Approval NumbersGZF2020XZD18jbzxyb2018-01 of Gansu University of Political Science and Law.
文摘Image description task is the intersection of computer vision and natural language processing,and it has important prospects,including helping computers understand images and obtaining information for the visually impaired.This study presents an innovative approach employing deep reinforcement learning to enhance the accuracy of natural language descriptions of images.Our method focuses on refining the reward function in deep reinforcement learning,facilitating the generation of precise descriptions by aligning visual and textual features more closely.Our approach comprises three key architectures.Firstly,it utilizes Residual Network 101(ResNet-101)and Faster Region-based Convolutional Neural Network(Faster R-CNN)to extract average and local image features,respectively,followed by the implementation of a dual attention mechanism for intricate feature fusion.Secondly,the Transformer model is engaged to derive contextual semantic features from textual data.Finally,the generation of descriptive text is executed through a two-layer long short-term memory network(LSTM),directed by the value and reward functions.Compared with the image description method that relies on deep learning,the score of Bilingual Evaluation Understudy(BLEU-1)is 0.762,which is 1.6%higher,and the score of BLEU-4 is 0.299.Consensus-based Image Description Evaluation(CIDEr)scored 0.998,Recall-Oriented Understudy for Gisting Evaluation(ROUGE)scored 0.552,the latter improved by 0.36%.These results not only attest to the viability of our approach but also highlight its superiority in the realm of image description.Future research can explore the integration of our method with other artificial intelligence(AI)domains,such as emotional AI,to create more nuanced and context-aware systems.
基金supported by the National Natural Science Foundation of China(Grant Nos.52305361,51775194,52090043)China Postdoctoral Science Foundation(2023M741245)the National Key Research and Development Program of China(2022YFB3706903).
文摘Hot deformation is a commonly employed processing technique to enhance the ductility and workability of Mg alloy.However,the hot deformation of Mg alloy is highly sensitive to factors such as temperature,strain rate,and strain,leading to complex flow behavior and an exceptionally narrow processing window for Mg alloy.To overcome the shortcomings of the conventional Arrhenius-type(AT)model,this study developed machine learning-based Arrhenius-type(ML-AT)models by combining the genetic algorithm(GA),particle swarm optimization(PSO),and artificial neural network(ANN).Results indicated that when describing the flow behavior of the AQ80 alloy,the PSO-ANN-AT model demonstrates the most prominent prediction accuracy and generalization ability among all ML-AT and AT models.Moreover,an activation energy-processing(AEP)map was established using the reconstructed flow stress and activation energy fields based on the PSO-ANN-AT model.Experimental validations revealed that this AEP map exhibits superior predictive capability for microstructure evolution compared to the one established by the traditional interpolation methods,ultimately contributing to the precise determination of the optimum processing window.These findings provide fresh insights into the accurate constitutive description and workability characterization of Mg alloy during hot deformation.
文摘Cross-lingual image description,the task of generating image captions in a target language from images and descriptions in a source language,is addressed in this study through a novel approach that combines neural network models and semantic matching techniques.Experiments conducted on the Flickr8k and AraImg2k benchmark datasets,featuring images and descriptions in English and Arabic,showcase remarkable performance improvements over state-of-the-art methods.Our model,equipped with the Image&Cross-Language Semantic Matching module and the Target Language Domain Evaluation module,significantly enhances the semantic relevance of generated image descriptions.For English-to-Arabic and Arabic-to-English cross-language image descriptions,our approach achieves a CIDEr score for English and Arabic of 87.9%and 81.7%,respectively,emphasizing the substantial contributions of our methodology.Comparative analyses with previous works further affirm the superior performance of our approach,and visual results underscore that our model generates image captions that are both semantically accurate and stylistically consistent with the target language.In summary,this study advances the field of cross-lingual image description,offering an effective solution for generating image captions across languages,with the potential to impact multilingual communication and accessibility.Future research directions include expanding to more languages and incorporating diverse visual and textual data sources.
文摘The Dirac equation γ<sub>μ</sub>(δ<sub>μ</sub>-eA<sub>μ</sub>)Ψ=mc<sup>2</sup>Ψ describes the bound states of the electron under the action of external potentials, A<sub>μ</sub>. We assumed that the fundamental form of the Dirac equation γ<sub>μ</sub>(δ<sub>μ</sub>-S<sub>μ</sub>)Ψ=0 should describe the stable particles (the electron, the proton and the dark-matter-particle (dmp)) bound to themselves under the action of their own potentials S<sub>μ</sub>. The new equation reveals that self energy is consequence of self action, it also reveals that the spin angular momentum is consequence of the dynamic structure of the stable particles. The quantitative results are the determination of their relative masses as well as the determination of the electromagnetic coupling constant.
文摘由于电网企业不断加快数字化转型,利用北斗定位技术将自动获取区域内光伏计量装置经纬度这一关键技术参数。文章充分利用分布式光伏集群内光伏发电装机位置空间相关性,提出一种在弱监督下基于图滤波与支持向量数据描述(support vector data description,SVDD)的分布式光伏集群发电异常检测方法。首先建立分布式光伏集群发电图数据结构模型,通过加权邻接矩阵描述分布式光伏发电点空间耦合性,其次构造图高通滤波器将时域参数转化为频域参数,然后通过SVDD算法优化图滤波结果,进一步挖掘图高通滤波器阈值与输出功率数据之间的关系。结果表明,采用图滤波器和SVDD算法模型方法在分布式光伏发电异常检测精度上有显著提高。
文摘为解决传统航空发动机异常检测方法准确率和泛化性能较低的问题,提出一种混合核最大相关熵的深度支持向量数据描述(mixed kernel maximum correntropy criterion-deep support vector data description,MKMCC-DSVDD)方法。首先,采用合成少数类过采样技术扩充异常样本规模,提高对非均衡样本的泛化性能;其次,建立基于混合核改进的最大相关熵损失函数,可以在无须数据分布假设的前提下提升准确率;最后,构建基于MKMCC-DSVDD的航空发动机异常检测方法。在航空发动机气路系统和滑油系统异常检测实验中,所提方法平均曲线下的面积(area under curve,AUC)达到98.53%,表明其具有较高的实用性和泛化性能。