The research aims to improve the performance of image recognition methods based on a description in the form of a set of keypoint descriptors.The main focus is on increasing the speed of establishing the relevance of ...The research aims to improve the performance of image recognition methods based on a description in the form of a set of keypoint descriptors.The main focus is on increasing the speed of establishing the relevance of object and etalon descriptions while maintaining the required level of classification efficiency.The class to be recognized is represented by an infinite set of images obtained from the etalon by applying arbitrary geometric transformations.It is proposed to reduce the descriptions for the etalon database by selecting the most significant descriptor components according to the information content criterion.The informativeness of an etalon descriptor is estimated by the difference of the closest distances to its own and other descriptions.The developed method determines the relevance of the full description of the recognized object with the reduced description of the etalons.Several practical models of the classifier with different options for establishing the correspondence between object descriptors and etalons are considered.The results of the experimental modeling of the proposed methods for a database including images of museum jewelry are presented.The test sample is formed as a set of images from the etalon database and out of the database with the application of geometric transformations of scale and rotation in the field of view.The practical problems of determining the threshold for the number of votes,based on which a classification decision is made,have been researched.Modeling has revealed the practical possibility of tenfold reducing descriptions with full preservation of classification accuracy.Reducing the descriptions by twenty times in the experiment leads to slightly decreased accuracy.The speed of the analysis increases in proportion to the degree of reduction.The use of reduction by the informativeness criterion confirmed the possibility of obtaining the most significant subset of features for classification,which guarantees a decent level of accuracy.展开更多
DD4hep serves as a generic detector description toolkit recommended for offline software development in next-generation high-energy physics(HEP)experiments.Conversely,Filmbox(FBX)stands out as a widely used 3D modelin...DD4hep serves as a generic detector description toolkit recommended for offline software development in next-generation high-energy physics(HEP)experiments.Conversely,Filmbox(FBX)stands out as a widely used 3D modeling file format within the 3D software industry.In this paper,we introduce a novel method that can automatically convert complex HEP detector geometries from DD4hep description into 3D models in the FBX format.The feasibility of this method was dem-onstrated by its application to the DD4hep description of the Compact Linear Collider detector and several sub-detectors of the super Tau-Charm facility and circular electron-positron collider experiments.The automatic DD4hep–FBX detector conversion interface provides convenience for further development of applications,such as detector design,simulation,visualization,data monitoring,and outreach,in HEP experiments.展开更多
Cross-lingual image description,the task of generating image captions in a target language from images and descriptions in a source language,is addressed in this study through a novel approach that combines neural net...Cross-lingual image description,the task of generating image captions in a target language from images and descriptions in a source language,is addressed in this study through a novel approach that combines neural network models and semantic matching techniques.Experiments conducted on the Flickr8k and AraImg2k benchmark datasets,featuring images and descriptions in English and Arabic,showcase remarkable performance improvements over state-of-the-art methods.Our model,equipped with the Image&Cross-Language Semantic Matching module and the Target Language Domain Evaluation module,significantly enhances the semantic relevance of generated image descriptions.For English-to-Arabic and Arabic-to-English cross-language image descriptions,our approach achieves a CIDEr score for English and Arabic of 87.9%and 81.7%,respectively,emphasizing the substantial contributions of our methodology.Comparative analyses with previous works further affirm the superior performance of our approach,and visual results underscore that our model generates image captions that are both semantically accurate and stylistically consistent with the target language.In summary,this study advances the field of cross-lingual image description,offering an effective solution for generating image captions across languages,with the potential to impact multilingual communication and accessibility.Future research directions include expanding to more languages and incorporating diverse visual and textual data sources.展开更多
Image description task is the intersection of computer vision and natural language processing,and it has important prospects,including helping computers understand images and obtaining information for the visually imp...Image description task is the intersection of computer vision and natural language processing,and it has important prospects,including helping computers understand images and obtaining information for the visually impaired.This study presents an innovative approach employing deep reinforcement learning to enhance the accuracy of natural language descriptions of images.Our method focuses on refining the reward function in deep reinforcement learning,facilitating the generation of precise descriptions by aligning visual and textual features more closely.Our approach comprises three key architectures.Firstly,it utilizes Residual Network 101(ResNet-101)and Faster Region-based Convolutional Neural Network(Faster R-CNN)to extract average and local image features,respectively,followed by the implementation of a dual attention mechanism for intricate feature fusion.Secondly,the Transformer model is engaged to derive contextual semantic features from textual data.Finally,the generation of descriptive text is executed through a two-layer long short-term memory network(LSTM),directed by the value and reward functions.Compared with the image description method that relies on deep learning,the score of Bilingual Evaluation Understudy(BLEU-1)is 0.762,which is 1.6%higher,and the score of BLEU-4 is 0.299.Consensus-based Image Description Evaluation(CIDEr)scored 0.998,Recall-Oriented Understudy for Gisting Evaluation(ROUGE)scored 0.552,the latter improved by 0.36%.These results not only attest to the viability of our approach but also highlight its superiority in the realm of image description.Future research can explore the integration of our method with other artificial intelligence(AI)domains,such as emotional AI,to create more nuanced and context-aware systems.展开更多
The Dirac equation γ<sub>μ</sub>(δ<sub>μ</sub>-eA<sub>μ</sub>)Ψ=mc<sup>2</sup>Ψ describes the bound states of the electron under the action of external potentials...The Dirac equation γ<sub>μ</sub>(δ<sub>μ</sub>-eA<sub>μ</sub>)Ψ=mc<sup>2</sup>Ψ describes the bound states of the electron under the action of external potentials, A<sub>μ</sub>. We assumed that the fundamental form of the Dirac equation γ<sub>μ</sub>(δ<sub>μ</sub>-S<sub>μ</sub>)Ψ=0 should describe the stable particles (the electron, the proton and the dark-matter-particle (dmp)) bound to themselves under the action of their own potentials S<sub>μ</sub>. The new equation reveals that self energy is consequence of self action, it also reveals that the spin angular momentum is consequence of the dynamic structure of the stable particles. The quantitative results are the determination of their relative masses as well as the determination of the electromagnetic coupling constant.展开更多
How to think a unique and determinative turn in analytic philosophy of mind?To answer this question this article first presents an attempt to render clear that analytic phenomenology,by contrast with conceptions of ph...How to think a unique and determinative turn in analytic philosophy of mind?To answer this question this article first presents an attempt to render clear that analytic phenomenology,by contrast with conceptions of phenomenology of the XXth century,beneficially dispenses with several methodological and conceptual assumptions that were assumed to be compulsory,as phenomenological reduction,a notion of synthesis,and a philosophical notion of the a priori.It then presents some eventual difficulties to the achievement of a phenomenological turn within analytic philosophy,which are,the neglect of historicity,abstractionism,the acknowledgement of the place of language in our lives,and solipsism.It finally presents several demands that concern the felicity of contemporary analytic phenomenologies,namely,anti-abstractionism,fallibilism,attention to polyadic relations,and the integration of ecological and decolonial concerns of our cultures.展开更多
Mark Twain is one of the most famous writers of the nineteenth century,his works have a large number of descriptions of dreams,in Mark Twain’s short story My Platonic Sweetheart,the author describes a dream that cons...Mark Twain is one of the most famous writers of the nineteenth century,his works have a large number of descriptions of dreams,in Mark Twain’s short story My Platonic Sweetheart,the author describes a dream that constantly repeats itself in his life.The dream description in the novel is not only part of the narrative structure of the article,but also expresses the theme of the article,through the close reading of the text,taking dream description as the starting point,the author of this thesis analyzes the dream description in My Platonic Sweetheart,exploring the thematic role of dream description in the novel,and analyzing what the author wants to express and how the author expresses his spiritual pursuit through dream description.展开更多
Audio description(AD),unlike interlingual translation and interpretation,is subject to unique constraints as a spoken text.Facilitated by AD,educational videos on COVID-19 anti-virus measures are made accessible to th...Audio description(AD),unlike interlingual translation and interpretation,is subject to unique constraints as a spoken text.Facilitated by AD,educational videos on COVID-19 anti-virus measures are made accessible to the visually disadvantaged.In this study,a corpus of AD of COVID-19 educational videos is developed,named“Audio Description Corpus of COVID-19 Educational Videos”(ADCCEV).Drawing on the model of Textual and Linguistic Audio Description Matrix(TLADM),this paper aims to identify the linguistic and textual idiosyncrasies of AD themed on COVID-19 response released by the New Zealand Government.This study finds that linguistically,the AD script uses a mix of complete sentences and phrases,the majority being in Present Simple tense.Present participles and the“with”structure are used for brevity.Vocabulary is diverse,with simpler words for animated explainers.Third-person pronouns are common in educational videos.Color words are a salient feature of AD,where“yellow”denotes urgency,and“red”indicates importance,negativity,and hostility.On textual idiosyncrasies,coherence is achieved through intermodal components that align with the video’s mood and style.AD style varies depending on the video’s purpose,from informative to narrative or expressive.展开更多
基金This research was funded by Prince Sattam bin Abdulaziz University(Project Number PSAU/2023/01/25387).
文摘The research aims to improve the performance of image recognition methods based on a description in the form of a set of keypoint descriptors.The main focus is on increasing the speed of establishing the relevance of object and etalon descriptions while maintaining the required level of classification efficiency.The class to be recognized is represented by an infinite set of images obtained from the etalon by applying arbitrary geometric transformations.It is proposed to reduce the descriptions for the etalon database by selecting the most significant descriptor components according to the information content criterion.The informativeness of an etalon descriptor is estimated by the difference of the closest distances to its own and other descriptions.The developed method determines the relevance of the full description of the recognized object with the reduced description of the etalons.Several practical models of the classifier with different options for establishing the correspondence between object descriptors and etalons are considered.The results of the experimental modeling of the proposed methods for a database including images of museum jewelry are presented.The test sample is formed as a set of images from the etalon database and out of the database with the application of geometric transformations of scale and rotation in the field of view.The practical problems of determining the threshold for the number of votes,based on which a classification decision is made,have been researched.Modeling has revealed the practical possibility of tenfold reducing descriptions with full preservation of classification accuracy.Reducing the descriptions by twenty times in the experiment leads to slightly decreased accuracy.The speed of the analysis increases in proportion to the degree of reduction.The use of reduction by the informativeness criterion confirmed the possibility of obtaining the most significant subset of features for classification,which guarantees a decent level of accuracy.
基金supported by the National Natural Science Foundation of China(Nos.12175321,11975021,11675275,and U1932101)National Key Research and Development Program of China(Nos.2023YFA1606000 and 2020YFA0406400)+2 种基金State Key Laboratory of Nuclear Physics and Technology,Peking University(Nos.NPT2020KFY04 and NPT2020KFY05)Strategic Priority Research Program of the Chinese Academy of Sciences(No.XDA10010900)National College Students Science and Technology Innovation Project,and Undergraduate Base Scientific Research Project of Sun Yat-sen University。
文摘DD4hep serves as a generic detector description toolkit recommended for offline software development in next-generation high-energy physics(HEP)experiments.Conversely,Filmbox(FBX)stands out as a widely used 3D modeling file format within the 3D software industry.In this paper,we introduce a novel method that can automatically convert complex HEP detector geometries from DD4hep description into 3D models in the FBX format.The feasibility of this method was dem-onstrated by its application to the DD4hep description of the Compact Linear Collider detector and several sub-detectors of the super Tau-Charm facility and circular electron-positron collider experiments.The automatic DD4hep–FBX detector conversion interface provides convenience for further development of applications,such as detector design,simulation,visualization,data monitoring,and outreach,in HEP experiments.
文摘Cross-lingual image description,the task of generating image captions in a target language from images and descriptions in a source language,is addressed in this study through a novel approach that combines neural network models and semantic matching techniques.Experiments conducted on the Flickr8k and AraImg2k benchmark datasets,featuring images and descriptions in English and Arabic,showcase remarkable performance improvements over state-of-the-art methods.Our model,equipped with the Image&Cross-Language Semantic Matching module and the Target Language Domain Evaluation module,significantly enhances the semantic relevance of generated image descriptions.For English-to-Arabic and Arabic-to-English cross-language image descriptions,our approach achieves a CIDEr score for English and Arabic of 87.9%and 81.7%,respectively,emphasizing the substantial contributions of our methodology.Comparative analyses with previous works further affirm the superior performance of our approach,and visual results underscore that our model generates image captions that are both semantically accurate and stylistically consistent with the target language.In summary,this study advances the field of cross-lingual image description,offering an effective solution for generating image captions across languages,with the potential to impact multilingual communication and accessibility.Future research directions include expanding to more languages and incorporating diverse visual and textual data sources.
基金This research was funded by the Natural Science Foundation of Gansu Province with Approval Numbers 20JR10RA334 and 21JR7RA570Funding is provided for the 2021 Longyuan Youth Innovation and Entrepreneurship Talent Project with Approval Number 2021LQGR20+1 种基金the University Level Innovation Project with Approval NumbersGZF2020XZD18jbzxyb2018-01 of Gansu University of Political Science and Law.
文摘Image description task is the intersection of computer vision and natural language processing,and it has important prospects,including helping computers understand images and obtaining information for the visually impaired.This study presents an innovative approach employing deep reinforcement learning to enhance the accuracy of natural language descriptions of images.Our method focuses on refining the reward function in deep reinforcement learning,facilitating the generation of precise descriptions by aligning visual and textual features more closely.Our approach comprises three key architectures.Firstly,it utilizes Residual Network 101(ResNet-101)and Faster Region-based Convolutional Neural Network(Faster R-CNN)to extract average and local image features,respectively,followed by the implementation of a dual attention mechanism for intricate feature fusion.Secondly,the Transformer model is engaged to derive contextual semantic features from textual data.Finally,the generation of descriptive text is executed through a two-layer long short-term memory network(LSTM),directed by the value and reward functions.Compared with the image description method that relies on deep learning,the score of Bilingual Evaluation Understudy(BLEU-1)is 0.762,which is 1.6%higher,and the score of BLEU-4 is 0.299.Consensus-based Image Description Evaluation(CIDEr)scored 0.998,Recall-Oriented Understudy for Gisting Evaluation(ROUGE)scored 0.552,the latter improved by 0.36%.These results not only attest to the viability of our approach but also highlight its superiority in the realm of image description.Future research can explore the integration of our method with other artificial intelligence(AI)domains,such as emotional AI,to create more nuanced and context-aware systems.
文摘The Dirac equation γ<sub>μ</sub>(δ<sub>μ</sub>-eA<sub>μ</sub>)Ψ=mc<sup>2</sup>Ψ describes the bound states of the electron under the action of external potentials, A<sub>μ</sub>. We assumed that the fundamental form of the Dirac equation γ<sub>μ</sub>(δ<sub>μ</sub>-S<sub>μ</sub>)Ψ=0 should describe the stable particles (the electron, the proton and the dark-matter-particle (dmp)) bound to themselves under the action of their own potentials S<sub>μ</sub>. The new equation reveals that self energy is consequence of self action, it also reveals that the spin angular momentum is consequence of the dynamic structure of the stable particles. The quantitative results are the determination of their relative masses as well as the determination of the electromagnetic coupling constant.
文摘How to think a unique and determinative turn in analytic philosophy of mind?To answer this question this article first presents an attempt to render clear that analytic phenomenology,by contrast with conceptions of phenomenology of the XXth century,beneficially dispenses with several methodological and conceptual assumptions that were assumed to be compulsory,as phenomenological reduction,a notion of synthesis,and a philosophical notion of the a priori.It then presents some eventual difficulties to the achievement of a phenomenological turn within analytic philosophy,which are,the neglect of historicity,abstractionism,the acknowledgement of the place of language in our lives,and solipsism.It finally presents several demands that concern the felicity of contemporary analytic phenomenologies,namely,anti-abstractionism,fallibilism,attention to polyadic relations,and the integration of ecological and decolonial concerns of our cultures.
文摘Mark Twain is one of the most famous writers of the nineteenth century,his works have a large number of descriptions of dreams,in Mark Twain’s short story My Platonic Sweetheart,the author describes a dream that constantly repeats itself in his life.The dream description in the novel is not only part of the narrative structure of the article,but also expresses the theme of the article,through the close reading of the text,taking dream description as the starting point,the author of this thesis analyzes the dream description in My Platonic Sweetheart,exploring the thematic role of dream description in the novel,and analyzing what the author wants to express and how the author expresses his spiritual pursuit through dream description.
文摘Audio description(AD),unlike interlingual translation and interpretation,is subject to unique constraints as a spoken text.Facilitated by AD,educational videos on COVID-19 anti-virus measures are made accessible to the visually disadvantaged.In this study,a corpus of AD of COVID-19 educational videos is developed,named“Audio Description Corpus of COVID-19 Educational Videos”(ADCCEV).Drawing on the model of Textual and Linguistic Audio Description Matrix(TLADM),this paper aims to identify the linguistic and textual idiosyncrasies of AD themed on COVID-19 response released by the New Zealand Government.This study finds that linguistically,the AD script uses a mix of complete sentences and phrases,the majority being in Present Simple tense.Present participles and the“with”structure are used for brevity.Vocabulary is diverse,with simpler words for animated explainers.Third-person pronouns are common in educational videos.Color words are a salient feature of AD,where“yellow”denotes urgency,and“red”indicates importance,negativity,and hostility.On textual idiosyncrasies,coherence is achieved through intermodal components that align with the video’s mood and style.AD style varies depending on the video’s purpose,from informative to narrative or expressive.