This paper presents a two-phase genetic algorithm (TPGA) based on the multi- parent genetic algorithm (MPGA). Through analysis we find MPGA will lead the population' s evol vement to diversity or convergence accor...This paper presents a two-phase genetic algorithm (TPGA) based on the multi- parent genetic algorithm (MPGA). Through analysis we find MPGA will lead the population' s evol vement to diversity or convergence according to the population size and the crossover size, so we make it run in different forms during the global and local optimization phases and then forms TPGA. The experiment results show that TPGA is very efficient for the optimization of low-dimension multi-modal functions, usually we can obtain all the global optimal solutions.展开更多
In this paper, a new algorithm for solving multi-modal function optimization problems-two-level subspace evolutionary algorithm is proposed. In the first level, the improved GT algorithm is used to do global recombina...In this paper, a new algorithm for solving multi-modal function optimization problems-two-level subspace evolutionary algorithm is proposed. In the first level, the improved GT algorithm is used to do global recombination search so that the whole population can be separated into several niches according to the position of solutions; then, in the second level, the niche evolutionary strategy is used for local search in the subspaces gotten in the first level till solutions of the problem are found. The new algorithm has been tested on some hard problems and some good results are obtained.展开更多
A novel immune genetic algorithm with the elitist selection and elitist crossover was proposed, which is called the immune genetic algorithm with the elitism (IGAE). In IGAE, the new methods for computing antibody s...A novel immune genetic algorithm with the elitist selection and elitist crossover was proposed, which is called the immune genetic algorithm with the elitism (IGAE). In IGAE, the new methods for computing antibody similarity, expected reproduction probability, and clonal selection probability were given. IGAE has three features. The first is that the similarities of two antibodies in structure and quality are all defined in the form of percentage, which helps to describe the similarity of two antibodies more accurately and to reduce the computational burden effectively. The second is that with the elitist selection and elitist crossover strategy IGAE is able to find the globally optimal solution of a given problem. The third is that the formula of expected reproduction probability of antibody can be adjusted through a parameter r, which helps to balance the population diversity and the convergence speed of IGAE so that IGAE can find the globally optimal solution of a given problem more rapidly. Two different complex multi-modal functions were selected to test the validity of IGAE. The experimental results show that IGAE can find the globally maximum/minimum values of the two functions rapidly. The experimental results also confirm that IGAE is of better performance in convergence speed, solution variation behavior, and computational efficiency compared with the canonical genetic algorithm with the elitism and the immune genetic algorithm with the information entropy and elitism.展开更多
Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and ...Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and recognition performance of the system can be enhanced through judiciously leveraging the correlation among multimodal features.Nevertheless,two issues persist in multi-modal feature fusion recognition:Firstly,the enhancement of recognition performance in fusion recognition has not comprehensively considered the inter-modality correlations among distinct modalities.Secondly,during modal fusion,improper weight selection diminishes the salience of crucial modal features,thereby diminishing the overall recognition performance.To address these two issues,we introduce an enhanced DenseNet multimodal recognition network founded on feature-level fusion.The information from the three modalities is fused akin to RGB,and the input network augments the correlation between modes through channel correlation.Within the enhanced DenseNet network,the Efficient Channel Attention Network(ECA-Net)dynamically adjusts the weight of each channel to amplify the salience of crucial information in each modal feature.Depthwise separable convolution markedly reduces the training parameters and further enhances the feature correlation.Experimental evaluations were conducted on four multimodal databases,comprising six unimodal databases,including multispectral palmprint and palm vein databases from the Chinese Academy of Sciences.The Equal Error Rates(EER)values were 0.0149%,0.0150%,0.0099%,and 0.0050%,correspondingly.In comparison to other network methods for palmprint,palm vein,and finger vein fusion recognition,this approach substantially enhances recognition performance,rendering it suitable for high-security environments with practical applicability.The experiments in this article utilized amodest sample database comprising 200 individuals.The subsequent phase involves preparing for the extension of the method to larger databases.展开更多
Multi-modal fusion technology gradually become a fundamental task in many fields,such as autonomous driving,smart healthcare,sentiment analysis,and human-computer interaction.It is rapidly becoming the dominant resear...Multi-modal fusion technology gradually become a fundamental task in many fields,such as autonomous driving,smart healthcare,sentiment analysis,and human-computer interaction.It is rapidly becoming the dominant research due to its powerful perception and judgment capabilities.Under complex scenes,multi-modal fusion technology utilizes the complementary characteristics of multiple data streams to fuse different data types and achieve more accurate predictions.However,achieving outstanding performance is challenging because of equipment performance limitations,missing information,and data noise.This paper comprehensively reviews existing methods based onmulti-modal fusion techniques and completes a detailed and in-depth analysis.According to the data fusion stage,multi-modal fusion has four primary methods:early fusion,deep fusion,late fusion,and hybrid fusion.The paper surveys the three majormulti-modal fusion technologies that can significantly enhance the effect of data fusion and further explore the applications of multi-modal fusion technology in various fields.Finally,it discusses the challenges and explores potential research opportunities.Multi-modal tasks still need intensive study because of data heterogeneity and quality.Preserving complementary information and eliminating redundant information between modalities is critical in multi-modal technology.Invalid data fusion methods may introduce extra noise and lead to worse results.This paper provides a comprehensive and detailed summary in response to these challenges.展开更多
Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of po...Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of possible future trajectories can be consid-erable(multi-modal).Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpret-ability.Moreover,the metrics used in current benchmarks do not evaluate all aspects of the problem,such as the diversity and admissibility of the output.The authors aim to advance towards the design of trustworthy motion prediction systems,based on some of the re-quirements for the design of Trustworthy Artificial Intelligence.The focus is on evaluation criteria,robustness,and interpretability of outputs.First,the evaluation metrics are comprehensively analysed,the main gaps of current benchmarks are identified,and a new holistic evaluation framework is proposed.Then,a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system.To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework,an intent prediction layer that can be attached to multi-modal motion prediction models is proposed.The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions.The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autono-mous vehicles,advancing the field towards greater safety and reliability.展开更多
Mill vibration is a common problem in rolling production,which directly affects the thickness accuracy of the strip and may even lead to strip fracture accidents in serious cases.The existing vibration prediction mode...Mill vibration is a common problem in rolling production,which directly affects the thickness accuracy of the strip and may even lead to strip fracture accidents in serious cases.The existing vibration prediction models do not consider the features contained in the data,resulting in limited improvement of model accuracy.To address these challenges,this paper proposes a multi-dimensional multi-modal cold rolling vibration time series prediction model(MDMMVPM)based on the deep fusion of multi-level networks.In the model,the long-term and short-term modal features of multi-dimensional data are considered,and the appropriate prediction algorithms are selected for different data features.Based on the established prediction model,the effects of tension and rolling force on mill vibration are analyzed.Taking the 5th stand of a cold mill in a steel mill as the research object,the innovative model is applied to predict the mill vibration for the first time.The experimental results show that the correlation coefficient(R^(2))of the model proposed in this paper is 92.5%,and the root-mean-square error(RMSE)is 0.0011,which significantly improves the modeling accuracy compared with the existing models.The proposed model is also suitable for the hot rolling process,which provides a new method for the prediction of strip rolling vibration.展开更多
Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the intro...Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the introduction of a large amount of information from other modalities reduces the effectiveness of representation learning and makes knowledge graph inference less effective.To address the issue,an inference method based on Media Convergence and Rule-guided Joint Inference model(MCRJI)has been pro-posed.The authors not only converge multi-media features of entities but also introduce logic rules to improve the accuracy and interpretability of link prediction.First,a multi-headed self-attention approach is used to obtain the attention of different media features of entities during semantic synthesis.Second,logic rules of different lengths are mined from knowledge graph to learn new entity representations.Finally,knowledge graph inference is performed based on representing entities that converge multi-media features.Numerous experimental results show that MCRJI outperforms other advanced baselines in using multi-media features and knowledge graph inference,demonstrating that MCRJI provides an excellent approach for knowledge graph inference with converged multi-media features.展开更多
Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent...Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent personal assistants within the context of visual,auditory,and somatosensory interactions with drivers were discussed.Their impact on the driver’s psychological state through various modes such as visual imagery,voice interaction,and gesture interaction were explored.The study also introduced innovative designs for in-vehicle intelligent personal assistants,incorporating design principles such as driver-centricity,prioritizing passenger safety,and utilizing timely feedback as a criterion.Additionally,the study employed design methods like driver behavior research and driving situation analysis to enhance the emotional connection between drivers and their vehicles,ultimately improving driver satisfaction and trust.展开更多
Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the...Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the research and applications of natural language processing across different modalities,our goal is to accurately extract frame-level semantic information from videos and ultimately transmit high-quality videos.Specifically,we propose a deep learning-basedMulti-ModalMutual Enhancement Video Semantic Communication system,called M3E-VSC.Built upon a VectorQuantized Generative AdversarialNetwork(VQGAN),our systemaims to leverage mutual enhancement among different modalities by using text as the main carrier of transmission.With it,the semantic information can be extracted fromkey-frame images and audio of the video and performdifferential value to ensure that the extracted text conveys accurate semantic information with fewer bits,thus improving the capacity of the system.Furthermore,a multi-frame semantic detection module is designed to facilitate semantic transitions during video generation.Simulation results demonstrate that our proposed model maintains high robustness in complex noise environments,particularly in low signal-to-noise ratio conditions,significantly improving the accuracy and speed of semantic transmission in video communication by approximately 50 percent.展开更多
Multi-modal 3D object detection has achieved remarkable progress,but it is often limited in practical industrial production because of its high cost and low efficiency.The multi-view camera-based method provides a fea...Multi-modal 3D object detection has achieved remarkable progress,but it is often limited in practical industrial production because of its high cost and low efficiency.The multi-view camera-based method provides a feasible solution due to its low cost.However,camera data lacks geometric depth,and only using camera data to obtain high accuracy is challenging.This paper proposes a multi-modal Bird-Eye-View(BEV)distillation framework(MMDistill)to make a trade-off between them.MMDistill is a carefully crafted two-stage distillation framework based on teacher and student models for learning cross-modal knowledge and generating multi-modal features.It can improve the performance of unimodal detectors without introducing additional costs during inference.Specifically,our method can effectively solve the cross-gap caused by the heterogeneity between data.Furthermore,we further propose a Light Detection and Ranging(LiDAR)-guided geometric compensation module,which can assist the student model in obtaining effective geometric features and reduce the gap between different modalities.Our proposed method generally requires fewer computational resources and faster inference speed than traditional multi-modal models.This advancement enables multi-modal technology to be applied more widely in practical scenarios.Through experiments,we validate the effectiveness and superiority of MMDistill on the nuScenes dataset,achieving an improvement of 4.1%mean Average Precision(mAP)and 4.6%NuScenes Detection Score(NDS)over the baseline detector.In addition,we also present detailed ablation studies to validate our method.展开更多
Traumatic brain injury involves complex pathophysiological mechanisms,among which oxidative stress significantly contributes to the occurrence of secondary injury.In this study,we evaluated hypidone hydrochloride(YL-0...Traumatic brain injury involves complex pathophysiological mechanisms,among which oxidative stress significantly contributes to the occurrence of secondary injury.In this study,we evaluated hypidone hydrochloride(YL-0919),a self-developed antidepressant with selective sigma-1 receptor agonist properties,and its associated mechanisms and targets in traumatic brain injury.Behavioral experiments to assess functional deficits were followed by assessment of neuronal damage through histological analyses and examination of blood-brain barrier permeability and brain edema.Next,we investigated the antioxidative effects of YL-0919 by assessing the levels of traditional markers of oxidative stress in vivo in mice and in vitro in HT22 cells.Finally,the targeted action of YL-0919 was verified by employing a sigma-1 receptor antagonist(BD-1047).Our findings demonstrated that YL-0919 markedly improved deficits in motor function and spatial cognition on day 3 post traumatic brain injury,while also decreasing neuronal mortality and reversing blood-brain barrier disruption and brain edema.Furthermore,YL-0919 effectively combated oxidative stress both in vivo and in vitro.The protective effects of YL-0919 were partially inhibited by BD-1047.These results indicated that YL-0919 relieved impairments in motor and spatial cognition by restraining oxidative stress,a neuroprotective effect that was partially reversed by the sigma-1 receptor antagonist BD-1047.YL-0919 may have potential as a new treatment for traumatic brain injury.展开更多
The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-genera...The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable.展开更多
Very long chain-saturated and-polyunsaturated fatty acids(VLC-SFA and VLC-PUFA, respectively) are a functionally important class of fatty acids containing 28 carbons or more in their acyl chain. They are synthesized b...Very long chain-saturated and-polyunsaturated fatty acids(VLC-SFA and VLC-PUFA, respectively) are a functionally important class of fatty acids containing 28 carbons or more in their acyl chain. They are synthesized by the elongation of very long fatty acids-4(ELOVL4) enzyme, expressed mainly in the brain, retina, skin, testes, and meibomian gland, where these fatty acids are found(Agbaga et al., 2008). Further, these organs exhibit tissuespecific VLC-PUFA and VLC-SFA biosynthesis and incorporation into complex lipids for specific functions. In the brain, skin, and Meibomian glands, the ELOVL4 mainly makes VLC-SFA, which are incorporated into complex sphingolipids. In the retina, the ELOVL4 makes VLC-PUFA that are incorporated into phosphatidylcholine, that are critical for visual function, while in testes and sperm, the VLC-PUFA are incorporated into sphingolipids that are critical for fertility(Yeboah et al., 2021).展开更多
Mutations in the protocadherin-19(PCDH19)gene(Xq22.1)cause the X-linked syndrome known as developmental and epileptic encephalopathy 9(DEE9,OMIM#300088)(Dibbens et al.,2008).DEE9 is characterized by early-onset cluste...Mutations in the protocadherin-19(PCDH19)gene(Xq22.1)cause the X-linked syndrome known as developmental and epileptic encephalopathy 9(DEE9,OMIM#300088)(Dibbens et al.,2008).DEE9 is characterized by early-onset clustering epilepsy associated with intellectual disability ranging from mild to profound,autism spectrum disorder,and other neuropsychiatric features including schizophrenia,anxiety,attentiondeficit/hyperactivity,and obsessive or aggressive behaviors.While seizures may become less frequent in adolescence,psychiatric comorbidities persist and often worsen with age(Dibbens et al.,2008;Kolc et al.,2020).展开更多
Spatial memory is crucial for survival within external surroundings and wild environments.The hippocampus,a critical hub for spatial learning and memory formation,has received extensive investigations on how neuromodu...Spatial memory is crucial for survival within external surroundings and wild environments.The hippocampus,a critical hub for spatial learning and memory formation,has received extensive investigations on how neuromodulators shape its functions(Teixeira et al.,2018;Zhang et al.,2024).However,the landscape of neuromodulations in the hippocampal system remains poorly understood because most studies focus on classical monoamine neuromodulators,such as acetylcholine,serotonin,dopamine,and noradrenaline.The neuropeptides,comprising the most abundant neuromodulators in the central nervous system,play a pivotal role in neural information processing in the hippocampal system.Cholecystokinin(CCK),one of the most abundant neuropeptides,has been implicated in regulating various physiological and neurobiological statuses(Chen et al.,2019).CCK-A receptor(CCK-AR)and CCK-B receptors(CCK-BR)are two key receptors mediating the biological functions of CCK,both of which belong to class-A sevenfold transmembrane G protein-coupled receptors(Nishimura et al.,2015).CCK-AR preferentially reacts to sulfated CCK,whereas CCK-BR binds both CCK and gastrin with similar affinities(Ding et al.,2022).The expression patterns of CCK-AR and CCK-BR are distinct,implying that CCK has various functions in target regions.For instance,CCK-AR is widely expressed in the GI and brain subregions and is hence implicated in the control of digestive function and satiety regulation.Conversely,CCK-BR is abundantly and widely distributed in the central nervous system,which majorly regulates anxiety,learning,and memory(Ding et al.,2022).However,the roles of endogenous CCK and CCK receptors in regulating hippocampal function at electrophysiological and behavioral levels have received less attention.展开更多
There is an error in the name of the cell line in the abstract of the published paper“MicroRNA-502-3p regulates GABAergic synapse function in hippocampal neurons”published on pages 2698-2707,Issue 12,Volume 19 of Ne...There is an error in the name of the cell line in the abstract of the published paper“MicroRNA-502-3p regulates GABAergic synapse function in hippocampal neurons”published on pages 2698-2707,Issue 12,Volume 19 of Neural Regeneration Research(Sharma et al.,2024),because of oversight during final proof checking.The correct description should be“human-GABA receptor A-α1/β2/γ2L human embryonic kidney(HEK)recombinant cell line.”The authors apologize for any inconvenience this correction may cause for readers and editors of Neural Regeneration Research.展开更多
Schwann cell transplantation is considered one of the most promising cell-based therapy to repair injured spinal cord due to its unique growth-promoting and myelin-forming properties.A the Food and Drug Administration...Schwann cell transplantation is considered one of the most promising cell-based therapy to repair injured spinal cord due to its unique growth-promoting and myelin-forming properties.A the Food and Drug Administration-approved Phase I clinical trial has been conducted to evaluate the safety of transplanted human autologous Schwann cells to treat patients with spinal cord injury.A major challenge for Schwann cell transplantation is that grafted Schwann cells are confined within the lesion cavity,and they do not migrate into the host environment due to the inhibitory barrier formed by injury-induced glial scar,thus limiting axonal reentry into the host spinal cord.Here we introduce a combinatorial strategy by suppressing the inhibitory extracellular environment with injection of lentivirus-mediated transfection of chondroitinase ABC gene at the rostral and caudal borders of the lesion site and simultaneously leveraging the repair capacity of transplanted Schwann cells in adult rats following a mid-thoracic contusive spinal cord injury.We report that when the glial scar was degraded by chondroitinase ABC at the rostral and caudal lesion borders,Schwann cells migrated for considerable distances in both rostral and caudal directions.Such Schwann cell migration led to enhanced axonal regrowth,including the serotonergic and dopaminergic axons originating from supraspinal regions,and promoted recovery of locomotor and urinary bladder functions.Importantly,the Schwann cell survival and axonal regrowth persisted up to 6 months after the injury,even when treatment was delayed for 3 months to mimic chronic spinal cord injury.These findings collectively show promising evidence for a combinatorial strategy with chondroitinase ABC and Schwann cells in promoting remodeling and recovery of function following spinal cord injury.展开更多
There is a strong evidence supporting the hypothesis of synaptic dysfunction as a major contributor to neural circuit and network disruption underlying emotional and mood dysregulation in psychiatric disorders(Simmons...There is a strong evidence supporting the hypothesis of synaptic dysfunction as a major contributor to neural circuit and network disruption underlying emotional and mood dysregulation in psychiatric disorders(Simmons et al.,2024).Diverse sets of distinct molecular signaling pathways converge on the synapse to regulate synaptogenesis,synaptic function,and synaptic plasticity in brain regions and circuits through complex interactions organized by numerous multivalent protein scaffolds,including the family of proteins known as A-kinase anchoring proteins(AKAPs).展开更多
Distinct brain remodeling has been found after different nerve reconstruction strategies,including motor representation of the affected limb.However,differences among reconstruction strategies at the brain network lev...Distinct brain remodeling has been found after different nerve reconstruction strategies,including motor representation of the affected limb.However,differences among reconstruction strategies at the brain network level have not been elucidated.This study aimed to explore intranetwork changes related to altered peripheral neural pathways after different nerve reconstruction surgeries,including nerve repair,endto-end nerve transfer,and end-to-side nerve transfer.Sprague–Dawley rats underwent complete left brachial plexus transection and were divided into four equal groups of eight:no nerve repair,grafted nerve repair,phrenic nerve end-to-end transfer,and end-to-side transfer with a graft sutured to the anterior upper trunk.Resting-state brain functional magnetic resonance imaging was obtained 7 months after surgery.The independent component analysis algorithm was utilized to identify group-level network components of interest and extract resting-state functional connectivity values of each voxel within the component.Alterations in intra-network resting-state functional connectivity were compared among the groups.Target muscle reinnervation was assessed by behavioral observation(elbow flexion)and electromyography.The results showed that alterations in the sensorimotor and interoception networks were mostly related to changes in the peripheral neural pathway.Nerve repair was related to enhanced connectivity within the sensorimotor network,while end-to-side nerve transfer might be more beneficial for restoring control over the affected limb by the original motor representation.The thalamic-cortical pathway was enhanced within the interoception network after nerve repair and end-to-end nerve transfer.Brain areas related to cognition and emotion were enhanced after end-to-side nerve transfer.Our study revealed important brain networks related to different nerve reconstructions.These networks may be potential targets for enhancing motor recovery.展开更多
基金Supported by the National Natural Science Foundation of China (70071042,60073043,60133010)
文摘This paper presents a two-phase genetic algorithm (TPGA) based on the multi- parent genetic algorithm (MPGA). Through analysis we find MPGA will lead the population' s evol vement to diversity or convergence according to the population size and the crossover size, so we make it run in different forms during the global and local optimization phases and then forms TPGA. The experiment results show that TPGA is very efficient for the optimization of low-dimension multi-modal functions, usually we can obtain all the global optimal solutions.
基金Supported by the National Natural Science Foundation of China (70071042,60073043,60133010)
文摘In this paper, a new algorithm for solving multi-modal function optimization problems-two-level subspace evolutionary algorithm is proposed. In the first level, the improved GT algorithm is used to do global recombination search so that the whole population can be separated into several niches according to the position of solutions; then, in the second level, the niche evolutionary strategy is used for local search in the subspaces gotten in the first level till solutions of the problem are found. The new algorithm has been tested on some hard problems and some good results are obtained.
基金Project(50275150) supported by the National Natural Science Foundation of ChinaProjects(20040533035, 20070533131) supported by the National Research Foundation for the Doctoral Program of Higher Education of China
文摘A novel immune genetic algorithm with the elitist selection and elitist crossover was proposed, which is called the immune genetic algorithm with the elitism (IGAE). In IGAE, the new methods for computing antibody similarity, expected reproduction probability, and clonal selection probability were given. IGAE has three features. The first is that the similarities of two antibodies in structure and quality are all defined in the form of percentage, which helps to describe the similarity of two antibodies more accurately and to reduce the computational burden effectively. The second is that with the elitist selection and elitist crossover strategy IGAE is able to find the globally optimal solution of a given problem. The third is that the formula of expected reproduction probability of antibody can be adjusted through a parameter r, which helps to balance the population diversity and the convergence speed of IGAE so that IGAE can find the globally optimal solution of a given problem more rapidly. Two different complex multi-modal functions were selected to test the validity of IGAE. The experimental results show that IGAE can find the globally maximum/minimum values of the two functions rapidly. The experimental results also confirm that IGAE is of better performance in convergence speed, solution variation behavior, and computational efficiency compared with the canonical genetic algorithm with the elitism and the immune genetic algorithm with the information entropy and elitism.
基金funded by the National Natural Science Foundation of China(61991413)the China Postdoctoral Science Foundation(2019M651142)+1 种基金the Natural Science Foundation of Liaoning Province(2021-KF-12-07)the Natural Science Foundations of Liaoning Province(2023-MS-322).
文摘Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and recognition performance of the system can be enhanced through judiciously leveraging the correlation among multimodal features.Nevertheless,two issues persist in multi-modal feature fusion recognition:Firstly,the enhancement of recognition performance in fusion recognition has not comprehensively considered the inter-modality correlations among distinct modalities.Secondly,during modal fusion,improper weight selection diminishes the salience of crucial modal features,thereby diminishing the overall recognition performance.To address these two issues,we introduce an enhanced DenseNet multimodal recognition network founded on feature-level fusion.The information from the three modalities is fused akin to RGB,and the input network augments the correlation between modes through channel correlation.Within the enhanced DenseNet network,the Efficient Channel Attention Network(ECA-Net)dynamically adjusts the weight of each channel to amplify the salience of crucial information in each modal feature.Depthwise separable convolution markedly reduces the training parameters and further enhances the feature correlation.Experimental evaluations were conducted on four multimodal databases,comprising six unimodal databases,including multispectral palmprint and palm vein databases from the Chinese Academy of Sciences.The Equal Error Rates(EER)values were 0.0149%,0.0150%,0.0099%,and 0.0050%,correspondingly.In comparison to other network methods for palmprint,palm vein,and finger vein fusion recognition,this approach substantially enhances recognition performance,rendering it suitable for high-security environments with practical applicability.The experiments in this article utilized amodest sample database comprising 200 individuals.The subsequent phase involves preparing for the extension of the method to larger databases.
基金supported by the Natural Science Foundation of Liaoning Province(Grant No.2023-MSBA-070)the National Natural Science Foundation of China(Grant No.62302086).
文摘Multi-modal fusion technology gradually become a fundamental task in many fields,such as autonomous driving,smart healthcare,sentiment analysis,and human-computer interaction.It is rapidly becoming the dominant research due to its powerful perception and judgment capabilities.Under complex scenes,multi-modal fusion technology utilizes the complementary characteristics of multiple data streams to fuse different data types and achieve more accurate predictions.However,achieving outstanding performance is challenging because of equipment performance limitations,missing information,and data noise.This paper comprehensively reviews existing methods based onmulti-modal fusion techniques and completes a detailed and in-depth analysis.According to the data fusion stage,multi-modal fusion has four primary methods:early fusion,deep fusion,late fusion,and hybrid fusion.The paper surveys the three majormulti-modal fusion technologies that can significantly enhance the effect of data fusion and further explore the applications of multi-modal fusion technology in various fields.Finally,it discusses the challenges and explores potential research opportunities.Multi-modal tasks still need intensive study because of data heterogeneity and quality.Preserving complementary information and eliminating redundant information between modalities is critical in multi-modal technology.Invalid data fusion methods may introduce extra noise and lead to worse results.This paper provides a comprehensive and detailed summary in response to these challenges.
基金European Commission,Joint Research Center,Grant/Award Number:HUMAINTMinisterio de Ciencia e Innovación,Grant/Award Number:PID2020‐114924RB‐I00Comunidad de Madrid,Grant/Award Number:S2018/EMT‐4362 SEGVAUTO 4.0‐CM。
文摘Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of possible future trajectories can be consid-erable(multi-modal).Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpret-ability.Moreover,the metrics used in current benchmarks do not evaluate all aspects of the problem,such as the diversity and admissibility of the output.The authors aim to advance towards the design of trustworthy motion prediction systems,based on some of the re-quirements for the design of Trustworthy Artificial Intelligence.The focus is on evaluation criteria,robustness,and interpretability of outputs.First,the evaluation metrics are comprehensively analysed,the main gaps of current benchmarks are identified,and a new holistic evaluation framework is proposed.Then,a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system.To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework,an intent prediction layer that can be attached to multi-modal motion prediction models is proposed.The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions.The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autono-mous vehicles,advancing the field towards greater safety and reliability.
基金Project(2023JH26-10100002)supported by the Liaoning Science and Technology Major Project,ChinaProjects(U21A20117,52074085)supported by the National Natural Science Foundation of China+1 种基金Project(2022JH2/101300008)supported by the Liaoning Applied Basic Research Program Project,ChinaProject(22567612H)supported by the Hebei Provincial Key Laboratory Performance Subsidy Project,China。
文摘Mill vibration is a common problem in rolling production,which directly affects the thickness accuracy of the strip and may even lead to strip fracture accidents in serious cases.The existing vibration prediction models do not consider the features contained in the data,resulting in limited improvement of model accuracy.To address these challenges,this paper proposes a multi-dimensional multi-modal cold rolling vibration time series prediction model(MDMMVPM)based on the deep fusion of multi-level networks.In the model,the long-term and short-term modal features of multi-dimensional data are considered,and the appropriate prediction algorithms are selected for different data features.Based on the established prediction model,the effects of tension and rolling force on mill vibration are analyzed.Taking the 5th stand of a cold mill in a steel mill as the research object,the innovative model is applied to predict the mill vibration for the first time.The experimental results show that the correlation coefficient(R^(2))of the model proposed in this paper is 92.5%,and the root-mean-square error(RMSE)is 0.0011,which significantly improves the modeling accuracy compared with the existing models.The proposed model is also suitable for the hot rolling process,which provides a new method for the prediction of strip rolling vibration.
基金National College Students’Training Programs of Innovation and Entrepreneurship,Grant/Award Number:S202210022060the CACMS Innovation Fund,Grant/Award Number:CI2021A00512the National Nature Science Foundation of China under Grant,Grant/Award Number:62206021。
文摘Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the introduction of a large amount of information from other modalities reduces the effectiveness of representation learning and makes knowledge graph inference less effective.To address the issue,an inference method based on Media Convergence and Rule-guided Joint Inference model(MCRJI)has been pro-posed.The authors not only converge multi-media features of entities but also introduce logic rules to improve the accuracy and interpretability of link prediction.First,a multi-headed self-attention approach is used to obtain the attention of different media features of entities during semantic synthesis.Second,logic rules of different lengths are mined from knowledge graph to learn new entity representations.Finally,knowledge graph inference is performed based on representing entities that converge multi-media features.Numerous experimental results show that MCRJI outperforms other advanced baselines in using multi-media features and knowledge graph inference,demonstrating that MCRJI provides an excellent approach for knowledge graph inference with converged multi-media features.
文摘Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent personal assistants within the context of visual,auditory,and somatosensory interactions with drivers were discussed.Their impact on the driver’s psychological state through various modes such as visual imagery,voice interaction,and gesture interaction were explored.The study also introduced innovative designs for in-vehicle intelligent personal assistants,incorporating design principles such as driver-centricity,prioritizing passenger safety,and utilizing timely feedback as a criterion.Additionally,the study employed design methods like driver behavior research and driving situation analysis to enhance the emotional connection between drivers and their vehicles,ultimately improving driver satisfaction and trust.
基金supported by the National Key Research and Development Project under Grant 2020YFB1807602Key Program of Marine Economy Development Special Foundation of Department of Natural Resources of Guangdong Province(GDNRC[2023]24)the National Natural Science Foundation of China under Grant 62271267.
文摘Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the research and applications of natural language processing across different modalities,our goal is to accurately extract frame-level semantic information from videos and ultimately transmit high-quality videos.Specifically,we propose a deep learning-basedMulti-ModalMutual Enhancement Video Semantic Communication system,called M3E-VSC.Built upon a VectorQuantized Generative AdversarialNetwork(VQGAN),our systemaims to leverage mutual enhancement among different modalities by using text as the main carrier of transmission.With it,the semantic information can be extracted fromkey-frame images and audio of the video and performdifferential value to ensure that the extracted text conveys accurate semantic information with fewer bits,thus improving the capacity of the system.Furthermore,a multi-frame semantic detection module is designed to facilitate semantic transitions during video generation.Simulation results demonstrate that our proposed model maintains high robustness in complex noise environments,particularly in low signal-to-noise ratio conditions,significantly improving the accuracy and speed of semantic transmission in video communication by approximately 50 percent.
基金supported by the National Natural Science Foundation of China(GrantNo.62302086)the Natural Science Foundation of Liaoning Province(Grant No.2023-MSBA-070)the Fundamental Research Funds for the Central Universities(Grant No.N2317005).
文摘Multi-modal 3D object detection has achieved remarkable progress,but it is often limited in practical industrial production because of its high cost and low efficiency.The multi-view camera-based method provides a feasible solution due to its low cost.However,camera data lacks geometric depth,and only using camera data to obtain high accuracy is challenging.This paper proposes a multi-modal Bird-Eye-View(BEV)distillation framework(MMDistill)to make a trade-off between them.MMDistill is a carefully crafted two-stage distillation framework based on teacher and student models for learning cross-modal knowledge and generating multi-modal features.It can improve the performance of unimodal detectors without introducing additional costs during inference.Specifically,our method can effectively solve the cross-gap caused by the heterogeneity between data.Furthermore,we further propose a Light Detection and Ranging(LiDAR)-guided geometric compensation module,which can assist the student model in obtaining effective geometric features and reduce the gap between different modalities.Our proposed method generally requires fewer computational resources and faster inference speed than traditional multi-modal models.This advancement enables multi-modal technology to be applied more widely in practical scenarios.Through experiments,we validate the effectiveness and superiority of MMDistill on the nuScenes dataset,achieving an improvement of 4.1%mean Average Precision(mAP)and 4.6%NuScenes Detection Score(NDS)over the baseline detector.In addition,we also present detailed ablation studies to validate our method.
基金supported by the National Natural Science Foundation of China,Nos.82204360(to HM)and 82270411(to GW)National Science and Technology Innovation 2030 Major Program,No.2021ZD0200900(to YL)。
文摘Traumatic brain injury involves complex pathophysiological mechanisms,among which oxidative stress significantly contributes to the occurrence of secondary injury.In this study,we evaluated hypidone hydrochloride(YL-0919),a self-developed antidepressant with selective sigma-1 receptor agonist properties,and its associated mechanisms and targets in traumatic brain injury.Behavioral experiments to assess functional deficits were followed by assessment of neuronal damage through histological analyses and examination of blood-brain barrier permeability and brain edema.Next,we investigated the antioxidative effects of YL-0919 by assessing the levels of traditional markers of oxidative stress in vivo in mice and in vitro in HT22 cells.Finally,the targeted action of YL-0919 was verified by employing a sigma-1 receptor antagonist(BD-1047).Our findings demonstrated that YL-0919 markedly improved deficits in motor function and spatial cognition on day 3 post traumatic brain injury,while also decreasing neuronal mortality and reversing blood-brain barrier disruption and brain edema.Furthermore,YL-0919 effectively combated oxidative stress both in vivo and in vitro.The protective effects of YL-0919 were partially inhibited by BD-1047.These results indicated that YL-0919 relieved impairments in motor and spatial cognition by restraining oxidative stress,a neuroprotective effect that was partially reversed by the sigma-1 receptor antagonist BD-1047.YL-0919 may have potential as a new treatment for traumatic brain injury.
基金the National Natural Science Foundation of China(No.61976080)the Academic Degrees&Graduate Education Reform Project of Henan Province(No.2021SJGLX195Y)+1 种基金the Teaching Reform Research and Practice Project of Henan Undergraduate Universities(No.2022SYJXLX008)the Key Project on Research and Practice of Henan University Graduate Education and Teaching Reform(No.YJSJG2023XJ006)。
文摘The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable.
基金supported by NEI/NIH R01 EY030513NIAMS/NIH R21-AR076035Multi-PI Team Science grant from Presbyterian Health Foundation。
文摘Very long chain-saturated and-polyunsaturated fatty acids(VLC-SFA and VLC-PUFA, respectively) are a functionally important class of fatty acids containing 28 carbons or more in their acyl chain. They are synthesized by the elongation of very long fatty acids-4(ELOVL4) enzyme, expressed mainly in the brain, retina, skin, testes, and meibomian gland, where these fatty acids are found(Agbaga et al., 2008). Further, these organs exhibit tissuespecific VLC-PUFA and VLC-SFA biosynthesis and incorporation into complex lipids for specific functions. In the brain, skin, and Meibomian glands, the ELOVL4 mainly makes VLC-SFA, which are incorporated into complex sphingolipids. In the retina, the ELOVL4 makes VLC-PUFA that are incorporated into phosphatidylcholine, that are critical for visual function, while in testes and sperm, the VLC-PUFA are incorporated into sphingolipids that are critical for fertility(Yeboah et al., 2021).
基金supported by a grant from Telethon Foundation(grant No.GGP20056 to SB)The generation of Pcdh19 floxed mouse model was funded by Cariplo Foundation(grant No.2014-0972 to SB)。
文摘Mutations in the protocadherin-19(PCDH19)gene(Xq22.1)cause the X-linked syndrome known as developmental and epileptic encephalopathy 9(DEE9,OMIM#300088)(Dibbens et al.,2008).DEE9 is characterized by early-onset clustering epilepsy associated with intellectual disability ranging from mild to profound,autism spectrum disorder,and other neuropsychiatric features including schizophrenia,anxiety,attentiondeficit/hyperactivity,and obsessive or aggressive behaviors.While seizures may become less frequent in adolescence,psychiatric comorbidities persist and often worsen with age(Dibbens et al.,2008;Kolc et al.,2020).
文摘Spatial memory is crucial for survival within external surroundings and wild environments.The hippocampus,a critical hub for spatial learning and memory formation,has received extensive investigations on how neuromodulators shape its functions(Teixeira et al.,2018;Zhang et al.,2024).However,the landscape of neuromodulations in the hippocampal system remains poorly understood because most studies focus on classical monoamine neuromodulators,such as acetylcholine,serotonin,dopamine,and noradrenaline.The neuropeptides,comprising the most abundant neuromodulators in the central nervous system,play a pivotal role in neural information processing in the hippocampal system.Cholecystokinin(CCK),one of the most abundant neuropeptides,has been implicated in regulating various physiological and neurobiological statuses(Chen et al.,2019).CCK-A receptor(CCK-AR)and CCK-B receptors(CCK-BR)are two key receptors mediating the biological functions of CCK,both of which belong to class-A sevenfold transmembrane G protein-coupled receptors(Nishimura et al.,2015).CCK-AR preferentially reacts to sulfated CCK,whereas CCK-BR binds both CCK and gastrin with similar affinities(Ding et al.,2022).The expression patterns of CCK-AR and CCK-BR are distinct,implying that CCK has various functions in target regions.For instance,CCK-AR is widely expressed in the GI and brain subregions and is hence implicated in the control of digestive function and satiety regulation.Conversely,CCK-BR is abundantly and widely distributed in the central nervous system,which majorly regulates anxiety,learning,and memory(Ding et al.,2022).However,the roles of endogenous CCK and CCK receptors in regulating hippocampal function at electrophysiological and behavioral levels have received less attention.
文摘There is an error in the name of the cell line in the abstract of the published paper“MicroRNA-502-3p regulates GABAergic synapse function in hippocampal neurons”published on pages 2698-2707,Issue 12,Volume 19 of Neural Regeneration Research(Sharma et al.,2024),because of oversight during final proof checking.The correct description should be“human-GABA receptor A-α1/β2/γ2L human embryonic kidney(HEK)recombinant cell line.”The authors apologize for any inconvenience this correction may cause for readers and editors of Neural Regeneration Research.
基金supported in part by NIH R01 NS100531,R01 NS103481NIH R21NS130241(to LD)+3 种基金Merit Review Award I01 BX002356,I01 BX003705 from the U.S.Department of Veterans AffairsIndiana Spinal Cord and Brain Injury Research Foundation(No.19919)Mari Hulman George Endowment Funds(to XMX)Indiana Spinal Cord&Brain Injury Research Fund from ISDH(to NKL and LD)。
文摘Schwann cell transplantation is considered one of the most promising cell-based therapy to repair injured spinal cord due to its unique growth-promoting and myelin-forming properties.A the Food and Drug Administration-approved Phase I clinical trial has been conducted to evaluate the safety of transplanted human autologous Schwann cells to treat patients with spinal cord injury.A major challenge for Schwann cell transplantation is that grafted Schwann cells are confined within the lesion cavity,and they do not migrate into the host environment due to the inhibitory barrier formed by injury-induced glial scar,thus limiting axonal reentry into the host spinal cord.Here we introduce a combinatorial strategy by suppressing the inhibitory extracellular environment with injection of lentivirus-mediated transfection of chondroitinase ABC gene at the rostral and caudal borders of the lesion site and simultaneously leveraging the repair capacity of transplanted Schwann cells in adult rats following a mid-thoracic contusive spinal cord injury.We report that when the glial scar was degraded by chondroitinase ABC at the rostral and caudal lesion borders,Schwann cells migrated for considerable distances in both rostral and caudal directions.Such Schwann cell migration led to enhanced axonal regrowth,including the serotonergic and dopaminergic axons originating from supraspinal regions,and promoted recovery of locomotor and urinary bladder functions.Importantly,the Schwann cell survival and axonal regrowth persisted up to 6 months after the injury,even when treatment was delayed for 3 months to mimic chronic spinal cord injury.These findings collectively show promising evidence for a combinatorial strategy with chondroitinase ABC and Schwann cells in promoting remodeling and recovery of function following spinal cord injury.
基金supported by the National Institute of Mental Health (NIH/NIMH)the National Institute of Neurological Disorders and Stroke(NIH/NINDS):Grants#R21 MH132136 to FSN and R01 MH123700 and R01 NS040701 to MLD
文摘There is a strong evidence supporting the hypothesis of synaptic dysfunction as a major contributor to neural circuit and network disruption underlying emotional and mood dysregulation in psychiatric disorders(Simmons et al.,2024).Diverse sets of distinct molecular signaling pathways converge on the synapse to regulate synaptogenesis,synaptic function,and synaptic plasticity in brain regions and circuits through complex interactions organized by numerous multivalent protein scaffolds,including the family of proteins known as A-kinase anchoring proteins(AKAPs).
基金supported by the National Natural Science Foundation of China,Nos.81871836(to MZ),82172554(to XH),and 81802249(to XH),81902301(to JW)the National Key R&D Program of China,Nos.2018YFC2001600(to JX)and 2018YFC2001604(to JX)+3 种基金Shanghai Rising Star Program,No.19QA1409000(to MZ)Shanghai Municipal Commission of Health and Family Planning,No.2018YQ02(to MZ)Shanghai Youth Top Talent Development PlanShanghai“Rising Stars of Medical Talent”Youth Development Program,No.RY411.19.01.10(to XH)。
文摘Distinct brain remodeling has been found after different nerve reconstruction strategies,including motor representation of the affected limb.However,differences among reconstruction strategies at the brain network level have not been elucidated.This study aimed to explore intranetwork changes related to altered peripheral neural pathways after different nerve reconstruction surgeries,including nerve repair,endto-end nerve transfer,and end-to-side nerve transfer.Sprague–Dawley rats underwent complete left brachial plexus transection and were divided into four equal groups of eight:no nerve repair,grafted nerve repair,phrenic nerve end-to-end transfer,and end-to-side transfer with a graft sutured to the anterior upper trunk.Resting-state brain functional magnetic resonance imaging was obtained 7 months after surgery.The independent component analysis algorithm was utilized to identify group-level network components of interest and extract resting-state functional connectivity values of each voxel within the component.Alterations in intra-network resting-state functional connectivity were compared among the groups.Target muscle reinnervation was assessed by behavioral observation(elbow flexion)and electromyography.The results showed that alterations in the sensorimotor and interoception networks were mostly related to changes in the peripheral neural pathway.Nerve repair was related to enhanced connectivity within the sensorimotor network,while end-to-side nerve transfer might be more beneficial for restoring control over the affected limb by the original motor representation.The thalamic-cortical pathway was enhanced within the interoception network after nerve repair and end-to-end nerve transfer.Brain areas related to cognition and emotion were enhanced after end-to-side nerve transfer.Our study revealed important brain networks related to different nerve reconstructions.These networks may be potential targets for enhancing motor recovery.