The all-wheel drive(AWD)hybrid system is a research focus on high-performance new energy vehicles that can meet the demands of dynamic performance and passing ability.Simultaneous optimization of the power and economy...The all-wheel drive(AWD)hybrid system is a research focus on high-performance new energy vehicles that can meet the demands of dynamic performance and passing ability.Simultaneous optimization of the power and economy of hybrid vehicles becomes an issue.A unique multi-mode coupling(MMC)AWD hybrid system is presented to realize the distributed and centralized driving of the front and rear axles to achieve vectored distribution and full utilization of the system power between the axles of vehicles.Based on the parameters of the benchmarking model of a hybrid vehicle,the best model-predictive control-based energy management strategy is proposed.First,the drive system model was built after the analysis of the MMC-AWD’s drive modes.Next,three fundamental strategies were established to address power distribution adjustment and battery SOC maintenance when the SOC changed,which was followed by the design of a road driving force observer.Then,the energy consumption rate in the average time domain was processed before designing the minimum fuel consumption controller based on the equivalent fuel consumption coefficient.Finally,the advantage of the MMC-AWD was confirmed by comparison with the dynamic performance and economy of the BYD Song PLUS DMI-AWD.The findings indicate that,in comparison to the comparative hybrid system at road adhesion coefficients of 0.8 and 0.6,the MMC-AWD’s capacity to accelerate increases by 5.26%and 7.92%,respectively.When the road adhesion coefficient is 0.8,0.6,and 0.4,the maximum climbing ability increases by 14.22%,12.88%,and 4.55%,respectively.As a result,the dynamic performance is greatly enhanced,and the fuel savings rate per 100 km of mileage reaches 12.06%,which is also very economical.The proposed control strategies for the new hybrid AWD vehicle can optimize the power and economy simultaneously.展开更多
In recent years,how to efficiently and accurately identify multi-model fake news has become more challenging.First,multi-model data provides more evidence but not all are equally important.Secondly,social structure in...In recent years,how to efficiently and accurately identify multi-model fake news has become more challenging.First,multi-model data provides more evidence but not all are equally important.Secondly,social structure information has proven to be effective in fake news detection and how to combine it while reducing the noise information is critical.Unfortunately,existing approaches fail to handle these problems.This paper proposes a multi-model fake news detection framework based on Tex-modal Dominance and fusing Multiple Multi-model Cues(TD-MMC),which utilizes three valuable multi-model clues:text-model importance,text-image complementary,and text-image inconsistency.TD-MMC is dominated by textural content and assisted by image information while using social network information to enhance text representation.To reduce the irrelevant social structure’s information interference,we use a unidirectional cross-modal attention mechanism to selectively learn the social structure’s features.A cross-modal attention mechanism is adopted to obtain text-image cross-modal features while retaining textual features to reduce the loss of important information.In addition,TD-MMC employs a new multi-model loss to improve the model’s generalization ability.Extensive experiments have been conducted on two public real-world English and Chinese datasets,and the results show that our proposed model outperforms the state-of-the-art methods on classification evaluation metrics.展开更多
In mobile machinery,hydro-mechanical pumps are increasingly replaced by electronically controlled pumps to improve the automation level,but diversified control functions(e.g.,power limitation and pressure cut-off)are ...In mobile machinery,hydro-mechanical pumps are increasingly replaced by electronically controlled pumps to improve the automation level,but diversified control functions(e.g.,power limitation and pressure cut-off)are integrated into the electronic controller only from the pump level,leading to the potential instability of the overall system.To solve this problem,a multi-mode electrohydraulic load sensing(MELS)control scheme is proposed especially considering the switching stability from the system level,which includes four working modes of flow control,load sensing,power limitation,and pressure control.Depending on the actual working requirements,the switching rules for the different modes and the switching direction(i.e.,the modes can be switched bilaterally or unilaterally)are defined.The priority of different modes is also defined,from high to low:pressure control,power limitation,load sensing,and flow control.When multiple switching rules are satisfied at the same time,the system switches to the control mode with the highest priority.In addition,the switching stability between flow control and pressure control modes is analyzed,and the controller parameters that guarantee the switching stability are obtained.A comparative study is carried out based on a test rig with a 2-ton hydraulic excavator.The results show that the MELS controller can achieve the control functions of proper flow supplement,power limitation,and pressure cut-off,which has good stability performance when switching between different control modes.This research proposes the MELS control method that realizes the stability of multi-mode switching of the hydraulic system of mobile machinery under different working conditions.展开更多
With the development of Global Navigation Satellite Systems(GNSS),geodetic GNSS receivers have been utilized to monitor sea levels using GNSS-Interferometry Reflectometry(GNSS-IR)technology.The multi-mode,multi-freque...With the development of Global Navigation Satellite Systems(GNSS),geodetic GNSS receivers have been utilized to monitor sea levels using GNSS-Interferometry Reflectometry(GNSS-IR)technology.The multi-mode,multi-frequency signals of GPS,GLONASS,Galileo,and Beidou can be used for GNSS-IR sea level retrieval,but combining these retrievals remains problematic.To address this issue,a GNSS-IR sea level retrieval combination system has been developed,which begins by analyzing error sources in GNSS-IR sea level retrieval and establishing and solving the GNSS-IR retrieval equation.This paper focuses on two key points:time window selection and equation stability.The stability of the retrieval combination equations is determined by the condition number of the coefficient matrix within the time window.The impact of ill-conditioned coefficient matrices on the retrieval results is demonstrated using an extreme case of SNR data with only ascending or descending trajectories.After determining the time window and removing ill-conditioned equations,the multi-mode,multi-frequency GNSS-IR retrieval is performed.Results from three International GNSS Service(IGS)stations show that the combination method produces high-precision,high-resolution,and high-reliability sea level retrieval combination sequences.展开更多
To address the difficulties in fusing multi-mode sensor data for complex industrial machinery, an adaptive deep coupling convolutional auto-encoder (ADCCAE) fusion method was proposed. First, the multi-mode features e...To address the difficulties in fusing multi-mode sensor data for complex industrial machinery, an adaptive deep coupling convolutional auto-encoder (ADCCAE) fusion method was proposed. First, the multi-mode features extracted synchronously by the CCAE were stacked and fed to the multi-channel convolution layers for fusion. Then, the fused data was passed to all connection layers for compression and fed to the Softmax module for classification. Finally, the coupling loss function coefficients and the network parameters were optimized through an adaptive approach using the gray wolf optimization (GWO) algorithm. Experimental comparisons showed that the proposed ADCCAE fusion model was superior to existing models for multi-mode data fusion.展开更多
Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and ...Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and recognition performance of the system can be enhanced through judiciously leveraging the correlation among multimodal features.Nevertheless,two issues persist in multi-modal feature fusion recognition:Firstly,the enhancement of recognition performance in fusion recognition has not comprehensively considered the inter-modality correlations among distinct modalities.Secondly,during modal fusion,improper weight selection diminishes the salience of crucial modal features,thereby diminishing the overall recognition performance.To address these two issues,we introduce an enhanced DenseNet multimodal recognition network founded on feature-level fusion.The information from the three modalities is fused akin to RGB,and the input network augments the correlation between modes through channel correlation.Within the enhanced DenseNet network,the Efficient Channel Attention Network(ECA-Net)dynamically adjusts the weight of each channel to amplify the salience of crucial information in each modal feature.Depthwise separable convolution markedly reduces the training parameters and further enhances the feature correlation.Experimental evaluations were conducted on four multimodal databases,comprising six unimodal databases,including multispectral palmprint and palm vein databases from the Chinese Academy of Sciences.The Equal Error Rates(EER)values were 0.0149%,0.0150%,0.0099%,and 0.0050%,correspondingly.In comparison to other network methods for palmprint,palm vein,and finger vein fusion recognition,this approach substantially enhances recognition performance,rendering it suitable for high-security environments with practical applicability.The experiments in this article utilized amodest sample database comprising 200 individuals.The subsequent phase involves preparing for the extension of the method to larger databases.展开更多
Multi-modal fusion technology gradually become a fundamental task in many fields,such as autonomous driving,smart healthcare,sentiment analysis,and human-computer interaction.It is rapidly becoming the dominant resear...Multi-modal fusion technology gradually become a fundamental task in many fields,such as autonomous driving,smart healthcare,sentiment analysis,and human-computer interaction.It is rapidly becoming the dominant research due to its powerful perception and judgment capabilities.Under complex scenes,multi-modal fusion technology utilizes the complementary characteristics of multiple data streams to fuse different data types and achieve more accurate predictions.However,achieving outstanding performance is challenging because of equipment performance limitations,missing information,and data noise.This paper comprehensively reviews existing methods based onmulti-modal fusion techniques and completes a detailed and in-depth analysis.According to the data fusion stage,multi-modal fusion has four primary methods:early fusion,deep fusion,late fusion,and hybrid fusion.The paper surveys the three majormulti-modal fusion technologies that can significantly enhance the effect of data fusion and further explore the applications of multi-modal fusion technology in various fields.Finally,it discusses the challenges and explores potential research opportunities.Multi-modal tasks still need intensive study because of data heterogeneity and quality.Preserving complementary information and eliminating redundant information between modalities is critical in multi-modal technology.Invalid data fusion methods may introduce extra noise and lead to worse results.This paper provides a comprehensive and detailed summary in response to these challenges.展开更多
High speed photography technique is potentially the most effective way to measure the motion parameter of warhead fragment benefiting from its advantages of high accuracy,high resolution and high efficiency.However,it...High speed photography technique is potentially the most effective way to measure the motion parameter of warhead fragment benefiting from its advantages of high accuracy,high resolution and high efficiency.However,it faces challenge in dense objects tracking and 3D trajectories reconstruction due to the characteristics of small size and dense distribution of fragment swarm.To address these challenges,this work presents a warhead fragments motion trajectories tracking and spatio-temporal distribution reconstruction method based on high-speed stereo photography.Firstly,background difference algorithm is utilized to extract the center and area of each fragment in the image sequence.Subsequently,a multi-object tracking(MOT)algorithm using Kalman filtering and Hungarian optimal assignment is developed to realize real-time and robust trajectories tracking of fragment swarm.To reconstruct 3D motion trajectories,a global stereo trajectories matching strategy is presented,which takes advantages of epipolar constraint and continuity constraint to correctly retrieve stereo correspondence followed by 3D trajectories refinement using polynomial fitting.Finally,the simulation and experimental results demonstrate that the proposed method can accurately track the motion trajectories and reconstruct the spatio-temporal distribution of 1.0×10^(3)fragments in a field of view(FOV)of 3.2 m×2.5 m,and the accuracy of the velocity estimation can achieve 98.6%.展开更多
The warhead of a ballistic missile may precess due to lateral moments during release. The resulting micro-Doppler effect is determined by parameters such as the target's motion state and size. A three-dimensional ...The warhead of a ballistic missile may precess due to lateral moments during release. The resulting micro-Doppler effect is determined by parameters such as the target's motion state and size. A three-dimensional reconstruction method for the precession warhead via the micro-Doppler analysis and inverse Radon transform(IRT) is proposed in this paper. The precession parameters are extracted by the micro-Doppler analysis from three radars, and the IRT is used to estimate the size of targe. The scatterers of the target can be reconstructed based on the above parameters. Simulation experimental results illustrate the effectiveness of the proposed method in this paper.展开更多
Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of po...Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of possible future trajectories can be consid-erable(multi-modal).Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpret-ability.Moreover,the metrics used in current benchmarks do not evaluate all aspects of the problem,such as the diversity and admissibility of the output.The authors aim to advance towards the design of trustworthy motion prediction systems,based on some of the re-quirements for the design of Trustworthy Artificial Intelligence.The focus is on evaluation criteria,robustness,and interpretability of outputs.First,the evaluation metrics are comprehensively analysed,the main gaps of current benchmarks are identified,and a new holistic evaluation framework is proposed.Then,a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system.To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework,an intent prediction layer that can be attached to multi-modal motion prediction models is proposed.The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions.The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autono-mous vehicles,advancing the field towards greater safety and reliability.展开更多
The double casing warhead with sandwiched charge is a novel fragmentation warhead that can produce two groups of fragments with different velocity,and the previous work has presented a calculation formula to determine...The double casing warhead with sandwiched charge is a novel fragmentation warhead that can produce two groups of fragments with different velocity,and the previous work has presented a calculation formula to determine the maximum fragment velocity.The current work builds on the published formula to further develop a formula for calculating the axial distribution characteristics of the fragment velocity.For this type of warhead,the simulation of the dispersion characteristics of the detonation products at different positions shows that the detonation products at the ends have a much larger axial velocity than those in the middle,and the detonation products have a greater axial dispersion velocity when they are closer to the central axis.The loading process and the fragment velocity vary with the axial position for both casing layers,and the total velocity of the fragments is the vector sum of the radial velocity and the axial velocity.At the same axial position,the acceleration time of the inner casing is greater than that of the outer casing.For the same casing,the fragments generated at the ends have a longer acceleration time than the fragments from the middle.The proposed formula is validated with the X-ray radiography results of the four warheads previously tested experimentally and the 3D smoothedparticle hydrodynamics numerical simulation results of several series of new warheads with different configurations.The formula can accurately and reliably calculate the fragment velocity when the lengthto-diameter ratio of the charge is greater than 1.5 and the thickness of the casing is less than 20%its inner radius.This work thus provides a key reference for the theoretical analysis and the design of warheads with multiple casings.展开更多
Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the intro...Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the introduction of a large amount of information from other modalities reduces the effectiveness of representation learning and makes knowledge graph inference less effective.To address the issue,an inference method based on Media Convergence and Rule-guided Joint Inference model(MCRJI)has been pro-posed.The authors not only converge multi-media features of entities but also introduce logic rules to improve the accuracy and interpretability of link prediction.First,a multi-headed self-attention approach is used to obtain the attention of different media features of entities during semantic synthesis.Second,logic rules of different lengths are mined from knowledge graph to learn new entity representations.Finally,knowledge graph inference is performed based on representing entities that converge multi-media features.Numerous experimental results show that MCRJI outperforms other advanced baselines in using multi-media features and knowledge graph inference,demonstrating that MCRJI provides an excellent approach for knowledge graph inference with converged multi-media features.展开更多
Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent...Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent personal assistants within the context of visual,auditory,and somatosensory interactions with drivers were discussed.Their impact on the driver’s psychological state through various modes such as visual imagery,voice interaction,and gesture interaction were explored.The study also introduced innovative designs for in-vehicle intelligent personal assistants,incorporating design principles such as driver-centricity,prioritizing passenger safety,and utilizing timely feedback as a criterion.Additionally,the study employed design methods like driver behavior research and driving situation analysis to enhance the emotional connection between drivers and their vehicles,ultimately improving driver satisfaction and trust.展开更多
Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the...Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the research and applications of natural language processing across different modalities,our goal is to accurately extract frame-level semantic information from videos and ultimately transmit high-quality videos.Specifically,we propose a deep learning-basedMulti-ModalMutual Enhancement Video Semantic Communication system,called M3E-VSC.Built upon a VectorQuantized Generative AdversarialNetwork(VQGAN),our systemaims to leverage mutual enhancement among different modalities by using text as the main carrier of transmission.With it,the semantic information can be extracted fromkey-frame images and audio of the video and performdifferential value to ensure that the extracted text conveys accurate semantic information with fewer bits,thus improving the capacity of the system.Furthermore,a multi-frame semantic detection module is designed to facilitate semantic transitions during video generation.Simulation results demonstrate that our proposed model maintains high robustness in complex noise environments,particularly in low signal-to-noise ratio conditions,significantly improving the accuracy and speed of semantic transmission in video communication by approximately 50 percent.展开更多
The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-genera...The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable.展开更多
This paper considers the problem of time varying congestion pricing to determine optimal time-varying tolls at peak periods for a queuing network with the interactions between buses and private cars.Through the combin...This paper considers the problem of time varying congestion pricing to determine optimal time-varying tolls at peak periods for a queuing network with the interactions between buses and private cars.Through the combined applications of the space-time expanded network(STEN) and the conventional network equilibrium modeling techniques,a multi-class,multi-mode and multi-criteria traffic network equilibrium model is developed.Travelers of different classes have distinctive value of times(VOTs),and travelers from the same class perceive their travel disutility or generalized costs on a route according to different weights of travel time and travel costs.Moreover,the symmetric cost function model is extended to deal with the interactions between buses and private cars.It is found that there exists a uniform(anonymous) link toll pattern which can drive a multi-class,multi-mode and multi-criteria user equilibrium flow pattern to a system optimum when the system's objective function is measured in terms of money.It is also found that the marginal cost pricing models with a symmetric travel cost function do not reflect the interactions between traffic flows of different road sections,and the obtained congestion pricing toll is smaller than the real value.展开更多
New advanced numerical computer model enabling accurate simulation of fragmentation parameters of large Length over Diameter(L/D)explosively driven metal shells has been developed and validated.The newly developed lar...New advanced numerical computer model enabling accurate simulation of fragmentation parameters of large Length over Diameter(L/D)explosively driven metal shells has been developed and validated.The newly developed large L/D multi-region model links three-dimensional axisymmetric high strain high strain-rate hydrocode analyses with the conventional set of Picatinny Arsenal FRAGmentation(PAFRAG)simulation routines.The standard PAFRAG modeling technique is based on the Mott's theory of break-up of idealized cylindrical"ring-bombs",in which the length of the average fragment is a function of the radius and velocity of the shell at the moment of break-up,and the mechanical properties of the metal.In the newly developed multi-region model,each of the shell region,the break-up is assumed to occur instantaneously,whereas the entire shell is modeled to fragment at multiple times,according to the number of the regions considered.According to PAFRAG methodology,the required input for both the natural and the controlled fragmentation models including the geometry and the velocity of the shell at moment of break-up had been provided from the hydrocode analyses and validated with available experimental data.The newly developed large L/D multi-region PAFRAG model has been shown to accurately reproduce available experimental fragmentation data.展开更多
For the characterization of the behaviors of a metal material in events like expanding warheads, it is necessary to know its strength and ductility at high strain rates, around 104e105/s. The flyer plate impact testin...For the characterization of the behaviors of a metal material in events like expanding warheads, it is necessary to know its strength and ductility at high strain rates, around 104e105/s. The flyer plate impact testing produces the uniform stress and strain rates but the testing is expensive. The Taylor test is relatively inexpensive but produces non-uniform stress and strain fields, and the results are not so easily inferred for material modeling. In the split-Hopkinson bar(SHB), which may be used in compression, tension and torsion testing, the strain rates never exceeds 103/s. In the present work, we use the expanding ring test where the strain rate is 104e105/s. A streak camera is used to examine the expanding ring velocity, and a water tank is used to collect the fragments. The experimental results are compared with the numerical simulations using the hydrocodes AUTODYN, IMPETUS Afea and a regularized smooth particle(RSPH) software. The number of fragments increases with the increase in the expansion velocity of the rings. The number of fragments is similar to the experimental results. The RSPH software shows much the same results as the AUTODYN where the Lagrangian solver is used for the ring. The IMPETUS Afea solver shows a somewhat different fragmentation characteristic due to the node splitting algorithm that induces pronounced tensile splitting.展开更多
A new multi-mode resistivity imaging sonde, with toroidal coils as source, can conduct three resistivity measurements: azimuthal resistivity, lateral resistivity, and bit resistivity measurements. Thus, the logging ti...A new multi-mode resistivity imaging sonde, with toroidal coils as source, can conduct three resistivity measurements: azimuthal resistivity, lateral resistivity, and bit resistivity measurements. Thus, the logging time and cost are greatly saved. The toroidal coils are simplified as an extended voltage dipole and the response equations are derived for a homogenous formation. Based on 3D FEM, the depth of investigation(DOI), vertical resolution, circumferential azimuthal capacity, borehole diameter, mud resistivity, thickness of target formation, and the resistivity of the surrounding formation and mud invasion are simulated. The results suggest that the three measurement modes of the new sonde are different in vertical resolutions and DOIs. The circumferential detection ability of the azimuth button depends on the contrast between the anomaly and formation resistivity and the open angle of the anomaly. Whether the borehole is truncated at the bit or not has a great influence on the simulation results. The borehole and mud invasion affect the apparent resistivity in all modes, but the effects of resistivity of surrounding formation and thickness of the target formation are only corrected for lateral resistivity measurement.展开更多
基金Supported by Hebei Provincial Natural Science Foundation of China(Grant Nos.E2020203174,E2020203078)S&T Program of Hebei Province of China(Grant No.226Z2202G)Science Research Project of Hebei Provincial Education Department of China(Grant No.ZD2022029).
文摘The all-wheel drive(AWD)hybrid system is a research focus on high-performance new energy vehicles that can meet the demands of dynamic performance and passing ability.Simultaneous optimization of the power and economy of hybrid vehicles becomes an issue.A unique multi-mode coupling(MMC)AWD hybrid system is presented to realize the distributed and centralized driving of the front and rear axles to achieve vectored distribution and full utilization of the system power between the axles of vehicles.Based on the parameters of the benchmarking model of a hybrid vehicle,the best model-predictive control-based energy management strategy is proposed.First,the drive system model was built after the analysis of the MMC-AWD’s drive modes.Next,three fundamental strategies were established to address power distribution adjustment and battery SOC maintenance when the SOC changed,which was followed by the design of a road driving force observer.Then,the energy consumption rate in the average time domain was processed before designing the minimum fuel consumption controller based on the equivalent fuel consumption coefficient.Finally,the advantage of the MMC-AWD was confirmed by comparison with the dynamic performance and economy of the BYD Song PLUS DMI-AWD.The findings indicate that,in comparison to the comparative hybrid system at road adhesion coefficients of 0.8 and 0.6,the MMC-AWD’s capacity to accelerate increases by 5.26%and 7.92%,respectively.When the road adhesion coefficient is 0.8,0.6,and 0.4,the maximum climbing ability increases by 14.22%,12.88%,and 4.55%,respectively.As a result,the dynamic performance is greatly enhanced,and the fuel savings rate per 100 km of mileage reaches 12.06%,which is also very economical.The proposed control strategies for the new hybrid AWD vehicle can optimize the power and economy simultaneously.
基金This research was funded by the General Project of Philosophy and Social Science of Heilongjiang Province,Grant Number:20SHB080.
文摘In recent years,how to efficiently and accurately identify multi-model fake news has become more challenging.First,multi-model data provides more evidence but not all are equally important.Secondly,social structure information has proven to be effective in fake news detection and how to combine it while reducing the noise information is critical.Unfortunately,existing approaches fail to handle these problems.This paper proposes a multi-model fake news detection framework based on Tex-modal Dominance and fusing Multiple Multi-model Cues(TD-MMC),which utilizes three valuable multi-model clues:text-model importance,text-image complementary,and text-image inconsistency.TD-MMC is dominated by textural content and assisted by image information while using social network information to enhance text representation.To reduce the irrelevant social structure’s information interference,we use a unidirectional cross-modal attention mechanism to selectively learn the social structure’s features.A cross-modal attention mechanism is adopted to obtain text-image cross-modal features while retaining textual features to reduce the loss of important information.In addition,TD-MMC employs a new multi-model loss to improve the model’s generalization ability.Extensive experiments have been conducted on two public real-world English and Chinese datasets,and the results show that our proposed model outperforms the state-of-the-art methods on classification evaluation metrics.
基金National Key Research and Development Program of China(Grant No.2020YFB2009702)National Natural Science Foundation of China(Grant Nos.52075055,U21A20124 and 52111530069)Chongqing Natural Science Foundation of China(Grant No.cstc2020jcyj-msxmX0780)。
文摘In mobile machinery,hydro-mechanical pumps are increasingly replaced by electronically controlled pumps to improve the automation level,but diversified control functions(e.g.,power limitation and pressure cut-off)are integrated into the electronic controller only from the pump level,leading to the potential instability of the overall system.To solve this problem,a multi-mode electrohydraulic load sensing(MELS)control scheme is proposed especially considering the switching stability from the system level,which includes four working modes of flow control,load sensing,power limitation,and pressure control.Depending on the actual working requirements,the switching rules for the different modes and the switching direction(i.e.,the modes can be switched bilaterally or unilaterally)are defined.The priority of different modes is also defined,from high to low:pressure control,power limitation,load sensing,and flow control.When multiple switching rules are satisfied at the same time,the system switches to the control mode with the highest priority.In addition,the switching stability between flow control and pressure control modes is analyzed,and the controller parameters that guarantee the switching stability are obtained.A comparative study is carried out based on a test rig with a 2-ton hydraulic excavator.The results show that the MELS controller can achieve the control functions of proper flow supplement,power limitation,and pressure cut-off,which has good stability performance when switching between different control modes.This research proposes the MELS control method that realizes the stability of multi-mode switching of the hydraulic system of mobile machinery under different working conditions.
基金National Natural Science Foundation of China(No.42004018)。
文摘With the development of Global Navigation Satellite Systems(GNSS),geodetic GNSS receivers have been utilized to monitor sea levels using GNSS-Interferometry Reflectometry(GNSS-IR)technology.The multi-mode,multi-frequency signals of GPS,GLONASS,Galileo,and Beidou can be used for GNSS-IR sea level retrieval,but combining these retrievals remains problematic.To address this issue,a GNSS-IR sea level retrieval combination system has been developed,which begins by analyzing error sources in GNSS-IR sea level retrieval and establishing and solving the GNSS-IR retrieval equation.This paper focuses on two key points:time window selection and equation stability.The stability of the retrieval combination equations is determined by the condition number of the coefficient matrix within the time window.The impact of ill-conditioned coefficient matrices on the retrieval results is demonstrated using an extreme case of SNR data with only ascending or descending trajectories.After determining the time window and removing ill-conditioned equations,the multi-mode,multi-frequency GNSS-IR retrieval is performed.Results from three International GNSS Service(IGS)stations show that the combination method produces high-precision,high-resolution,and high-reliability sea level retrieval combination sequences.
文摘To address the difficulties in fusing multi-mode sensor data for complex industrial machinery, an adaptive deep coupling convolutional auto-encoder (ADCCAE) fusion method was proposed. First, the multi-mode features extracted synchronously by the CCAE were stacked and fed to the multi-channel convolution layers for fusion. Then, the fused data was passed to all connection layers for compression and fed to the Softmax module for classification. Finally, the coupling loss function coefficients and the network parameters were optimized through an adaptive approach using the gray wolf optimization (GWO) algorithm. Experimental comparisons showed that the proposed ADCCAE fusion model was superior to existing models for multi-mode data fusion.
基金funded by the National Natural Science Foundation of China(61991413)the China Postdoctoral Science Foundation(2019M651142)+1 种基金the Natural Science Foundation of Liaoning Province(2021-KF-12-07)the Natural Science Foundations of Liaoning Province(2023-MS-322).
文摘Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and recognition performance of the system can be enhanced through judiciously leveraging the correlation among multimodal features.Nevertheless,two issues persist in multi-modal feature fusion recognition:Firstly,the enhancement of recognition performance in fusion recognition has not comprehensively considered the inter-modality correlations among distinct modalities.Secondly,during modal fusion,improper weight selection diminishes the salience of crucial modal features,thereby diminishing the overall recognition performance.To address these two issues,we introduce an enhanced DenseNet multimodal recognition network founded on feature-level fusion.The information from the three modalities is fused akin to RGB,and the input network augments the correlation between modes through channel correlation.Within the enhanced DenseNet network,the Efficient Channel Attention Network(ECA-Net)dynamically adjusts the weight of each channel to amplify the salience of crucial information in each modal feature.Depthwise separable convolution markedly reduces the training parameters and further enhances the feature correlation.Experimental evaluations were conducted on four multimodal databases,comprising six unimodal databases,including multispectral palmprint and palm vein databases from the Chinese Academy of Sciences.The Equal Error Rates(EER)values were 0.0149%,0.0150%,0.0099%,and 0.0050%,correspondingly.In comparison to other network methods for palmprint,palm vein,and finger vein fusion recognition,this approach substantially enhances recognition performance,rendering it suitable for high-security environments with practical applicability.The experiments in this article utilized amodest sample database comprising 200 individuals.The subsequent phase involves preparing for the extension of the method to larger databases.
基金supported by the Natural Science Foundation of Liaoning Province(Grant No.2023-MSBA-070)the National Natural Science Foundation of China(Grant No.62302086).
文摘Multi-modal fusion technology gradually become a fundamental task in many fields,such as autonomous driving,smart healthcare,sentiment analysis,and human-computer interaction.It is rapidly becoming the dominant research due to its powerful perception and judgment capabilities.Under complex scenes,multi-modal fusion technology utilizes the complementary characteristics of multiple data streams to fuse different data types and achieve more accurate predictions.However,achieving outstanding performance is challenging because of equipment performance limitations,missing information,and data noise.This paper comprehensively reviews existing methods based onmulti-modal fusion techniques and completes a detailed and in-depth analysis.According to the data fusion stage,multi-modal fusion has four primary methods:early fusion,deep fusion,late fusion,and hybrid fusion.The paper surveys the three majormulti-modal fusion technologies that can significantly enhance the effect of data fusion and further explore the applications of multi-modal fusion technology in various fields.Finally,it discusses the challenges and explores potential research opportunities.Multi-modal tasks still need intensive study because of data heterogeneity and quality.Preserving complementary information and eliminating redundant information between modalities is critical in multi-modal technology.Invalid data fusion methods may introduce extra noise and lead to worse results.This paper provides a comprehensive and detailed summary in response to these challenges.
基金Key Basic Research Project of Strengthening the Foundations Plan of China (Grant No.2019-JCJQ-ZD-360-12)National Defense Basic Scientific Research Program of China (Grant No.JCKY2021208B011)to provide fund for conducting experiments。
文摘High speed photography technique is potentially the most effective way to measure the motion parameter of warhead fragment benefiting from its advantages of high accuracy,high resolution and high efficiency.However,it faces challenge in dense objects tracking and 3D trajectories reconstruction due to the characteristics of small size and dense distribution of fragment swarm.To address these challenges,this work presents a warhead fragments motion trajectories tracking and spatio-temporal distribution reconstruction method based on high-speed stereo photography.Firstly,background difference algorithm is utilized to extract the center and area of each fragment in the image sequence.Subsequently,a multi-object tracking(MOT)algorithm using Kalman filtering and Hungarian optimal assignment is developed to realize real-time and robust trajectories tracking of fragment swarm.To reconstruct 3D motion trajectories,a global stereo trajectories matching strategy is presented,which takes advantages of epipolar constraint and continuity constraint to correctly retrieve stereo correspondence followed by 3D trajectories refinement using polynomial fitting.Finally,the simulation and experimental results demonstrate that the proposed method can accurately track the motion trajectories and reconstruct the spatio-temporal distribution of 1.0×10^(3)fragments in a field of view(FOV)of 3.2 m×2.5 m,and the accuracy of the velocity estimation can achieve 98.6%.
基金supported by the National Natural Science Foundation of China (61871146)the Fundamental Research Funds for the Central Universities (FRFCU5710093720)。
文摘The warhead of a ballistic missile may precess due to lateral moments during release. The resulting micro-Doppler effect is determined by parameters such as the target's motion state and size. A three-dimensional reconstruction method for the precession warhead via the micro-Doppler analysis and inverse Radon transform(IRT) is proposed in this paper. The precession parameters are extracted by the micro-Doppler analysis from three radars, and the IRT is used to estimate the size of targe. The scatterers of the target can be reconstructed based on the above parameters. Simulation experimental results illustrate the effectiveness of the proposed method in this paper.
基金European Commission,Joint Research Center,Grant/Award Number:HUMAINTMinisterio de Ciencia e Innovación,Grant/Award Number:PID2020‐114924RB‐I00Comunidad de Madrid,Grant/Award Number:S2018/EMT‐4362 SEGVAUTO 4.0‐CM。
文摘Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of possible future trajectories can be consid-erable(multi-modal).Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpret-ability.Moreover,the metrics used in current benchmarks do not evaluate all aspects of the problem,such as the diversity and admissibility of the output.The authors aim to advance towards the design of trustworthy motion prediction systems,based on some of the re-quirements for the design of Trustworthy Artificial Intelligence.The focus is on evaluation criteria,robustness,and interpretability of outputs.First,the evaluation metrics are comprehensively analysed,the main gaps of current benchmarks are identified,and a new holistic evaluation framework is proposed.Then,a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system.To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework,an intent prediction layer that can be attached to multi-modal motion prediction models is proposed.The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions.The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autono-mous vehicles,advancing the field towards greater safety and reliability.
基金supported by the National Natural Science Foundation of China(Grant No.11872121)。
文摘The double casing warhead with sandwiched charge is a novel fragmentation warhead that can produce two groups of fragments with different velocity,and the previous work has presented a calculation formula to determine the maximum fragment velocity.The current work builds on the published formula to further develop a formula for calculating the axial distribution characteristics of the fragment velocity.For this type of warhead,the simulation of the dispersion characteristics of the detonation products at different positions shows that the detonation products at the ends have a much larger axial velocity than those in the middle,and the detonation products have a greater axial dispersion velocity when they are closer to the central axis.The loading process and the fragment velocity vary with the axial position for both casing layers,and the total velocity of the fragments is the vector sum of the radial velocity and the axial velocity.At the same axial position,the acceleration time of the inner casing is greater than that of the outer casing.For the same casing,the fragments generated at the ends have a longer acceleration time than the fragments from the middle.The proposed formula is validated with the X-ray radiography results of the four warheads previously tested experimentally and the 3D smoothedparticle hydrodynamics numerical simulation results of several series of new warheads with different configurations.The formula can accurately and reliably calculate the fragment velocity when the lengthto-diameter ratio of the charge is greater than 1.5 and the thickness of the casing is less than 20%its inner radius.This work thus provides a key reference for the theoretical analysis and the design of warheads with multiple casings.
基金National College Students’Training Programs of Innovation and Entrepreneurship,Grant/Award Number:S202210022060the CACMS Innovation Fund,Grant/Award Number:CI2021A00512the National Nature Science Foundation of China under Grant,Grant/Award Number:62206021。
文摘Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the introduction of a large amount of information from other modalities reduces the effectiveness of representation learning and makes knowledge graph inference less effective.To address the issue,an inference method based on Media Convergence and Rule-guided Joint Inference model(MCRJI)has been pro-posed.The authors not only converge multi-media features of entities but also introduce logic rules to improve the accuracy and interpretability of link prediction.First,a multi-headed self-attention approach is used to obtain the attention of different media features of entities during semantic synthesis.Second,logic rules of different lengths are mined from knowledge graph to learn new entity representations.Finally,knowledge graph inference is performed based on representing entities that converge multi-media features.Numerous experimental results show that MCRJI outperforms other advanced baselines in using multi-media features and knowledge graph inference,demonstrating that MCRJI provides an excellent approach for knowledge graph inference with converged multi-media features.
文摘Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent personal assistants within the context of visual,auditory,and somatosensory interactions with drivers were discussed.Their impact on the driver’s psychological state through various modes such as visual imagery,voice interaction,and gesture interaction were explored.The study also introduced innovative designs for in-vehicle intelligent personal assistants,incorporating design principles such as driver-centricity,prioritizing passenger safety,and utilizing timely feedback as a criterion.Additionally,the study employed design methods like driver behavior research and driving situation analysis to enhance the emotional connection between drivers and their vehicles,ultimately improving driver satisfaction and trust.
基金supported by the National Key Research and Development Project under Grant 2020YFB1807602Key Program of Marine Economy Development Special Foundation of Department of Natural Resources of Guangdong Province(GDNRC[2023]24)the National Natural Science Foundation of China under Grant 62271267.
文摘Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the research and applications of natural language processing across different modalities,our goal is to accurately extract frame-level semantic information from videos and ultimately transmit high-quality videos.Specifically,we propose a deep learning-basedMulti-ModalMutual Enhancement Video Semantic Communication system,called M3E-VSC.Built upon a VectorQuantized Generative AdversarialNetwork(VQGAN),our systemaims to leverage mutual enhancement among different modalities by using text as the main carrier of transmission.With it,the semantic information can be extracted fromkey-frame images and audio of the video and performdifferential value to ensure that the extracted text conveys accurate semantic information with fewer bits,thus improving the capacity of the system.Furthermore,a multi-frame semantic detection module is designed to facilitate semantic transitions during video generation.Simulation results demonstrate that our proposed model maintains high robustness in complex noise environments,particularly in low signal-to-noise ratio conditions,significantly improving the accuracy and speed of semantic transmission in video communication by approximately 50 percent.
基金the National Natural Science Foundation of China(No.61976080)the Academic Degrees&Graduate Education Reform Project of Henan Province(No.2021SJGLX195Y)+1 种基金the Teaching Reform Research and Practice Project of Henan Undergraduate Universities(No.2022SYJXLX008)the Key Project on Research and Practice of Henan University Graduate Education and Teaching Reform(No.YJSJG2023XJ006)。
文摘The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable.
基金The National High Technology Research and Development Program of China (863 Program) (No. 2007AA11Z202)the National Key Technology R & D Program of China during the 11th Five-Year Plan Period(No. 2006BAJ18B03)the Fundamental Research Funds for the Central Universities (No. DUT10RC(3) 112)
文摘This paper considers the problem of time varying congestion pricing to determine optimal time-varying tolls at peak periods for a queuing network with the interactions between buses and private cars.Through the combined applications of the space-time expanded network(STEN) and the conventional network equilibrium modeling techniques,a multi-class,multi-mode and multi-criteria traffic network equilibrium model is developed.Travelers of different classes have distinctive value of times(VOTs),and travelers from the same class perceive their travel disutility or generalized costs on a route according to different weights of travel time and travel costs.Moreover,the symmetric cost function model is extended to deal with the interactions between buses and private cars.It is found that there exists a uniform(anonymous) link toll pattern which can drive a multi-class,multi-mode and multi-criteria user equilibrium flow pattern to a system optimum when the system's objective function is measured in terms of money.It is also found that the marginal cost pricing models with a symmetric travel cost function do not reflect the interactions between traffic flows of different road sections,and the obtained congestion pricing toll is smaller than the real value.
文摘New advanced numerical computer model enabling accurate simulation of fragmentation parameters of large Length over Diameter(L/D)explosively driven metal shells has been developed and validated.The newly developed large L/D multi-region model links three-dimensional axisymmetric high strain high strain-rate hydrocode analyses with the conventional set of Picatinny Arsenal FRAGmentation(PAFRAG)simulation routines.The standard PAFRAG modeling technique is based on the Mott's theory of break-up of idealized cylindrical"ring-bombs",in which the length of the average fragment is a function of the radius and velocity of the shell at the moment of break-up,and the mechanical properties of the metal.In the newly developed multi-region model,each of the shell region,the break-up is assumed to occur instantaneously,whereas the entire shell is modeled to fragment at multiple times,according to the number of the regions considered.According to PAFRAG methodology,the required input for both the natural and the controlled fragmentation models including the geometry and the velocity of the shell at moment of break-up had been provided from the hydrocode analyses and validated with available experimental data.The newly developed large L/D multi-region PAFRAG model has been shown to accurately reproduce available experimental fragmentation data.
文摘For the characterization of the behaviors of a metal material in events like expanding warheads, it is necessary to know its strength and ductility at high strain rates, around 104e105/s. The flyer plate impact testing produces the uniform stress and strain rates but the testing is expensive. The Taylor test is relatively inexpensive but produces non-uniform stress and strain fields, and the results are not so easily inferred for material modeling. In the split-Hopkinson bar(SHB), which may be used in compression, tension and torsion testing, the strain rates never exceeds 103/s. In the present work, we use the expanding ring test where the strain rate is 104e105/s. A streak camera is used to examine the expanding ring velocity, and a water tank is used to collect the fragments. The experimental results are compared with the numerical simulations using the hydrocodes AUTODYN, IMPETUS Afea and a regularized smooth particle(RSPH) software. The number of fragments increases with the increase in the expansion velocity of the rings. The number of fragments is similar to the experimental results. The RSPH software shows much the same results as the AUTODYN where the Lagrangian solver is used for the ring. The IMPETUS Afea solver shows a somewhat different fragmentation characteristic due to the node splitting algorithm that induces pronounced tensile splitting.
基金sponsored by Study on High-Precision Logging While Drilling Imaging Technology of Low-Permeability Reservoirs(No.2016ZX05021-002)
文摘A new multi-mode resistivity imaging sonde, with toroidal coils as source, can conduct three resistivity measurements: azimuthal resistivity, lateral resistivity, and bit resistivity measurements. Thus, the logging time and cost are greatly saved. The toroidal coils are simplified as an extended voltage dipole and the response equations are derived for a homogenous formation. Based on 3D FEM, the depth of investigation(DOI), vertical resolution, circumferential azimuthal capacity, borehole diameter, mud resistivity, thickness of target formation, and the resistivity of the surrounding formation and mud invasion are simulated. The results suggest that the three measurement modes of the new sonde are different in vertical resolutions and DOIs. The circumferential detection ability of the azimuth button depends on the contrast between the anomaly and formation resistivity and the open angle of the anomaly. Whether the borehole is truncated at the bit or not has a great influence on the simulation results. The borehole and mud invasion affect the apparent resistivity in all modes, but the effects of resistivity of surrounding formation and thickness of the target formation are only corrected for lateral resistivity measurement.