The fatigue damage caused by flow-induced vibration(FIV)is one of the major concerns for multiple cylindrical structures in many engineering applications.The FIV suppression is of great importance for the security of ...The fatigue damage caused by flow-induced vibration(FIV)is one of the major concerns for multiple cylindrical structures in many engineering applications.The FIV suppression is of great importance for the security of many cylindrical structures.Many active and passive control methods have been employed for the vibration suppression of an isolated cylinder undergoing vortex-induced vibrations(VIV).The FIV suppression methods are mainly extended to the multiple cylinders from the vibration control of the isolated cylinder.Due to the mutual interference between the multiple cylinders,the FIV mechanism is more complex than the VIV mechanism,which makes a great challenge for the FIV suppression.Some efforts have been devoted to vibration suppression of multiple cylinder systems undergoing FIV over the past two decades.The control methods,such as helical strakes,splitter plates,control rods and flexible sheets,are not always effective,depending on many influence factors,such as the spacing ratio,the arrangement geometrical shape,the flow velocity and the parameters of the vibration control devices.The FIV response,hydrodynamic features and wake patterns of the multiple cylinders equipped with vibration control devices are reviewed and summarized.The FIV suppression efficiency of the vibration control methods are analyzed and compared considering different influence factors.Further research on the FIV suppression of multiple cylinders is suggested to provide insight for the development of FIV control methods and promote engineering applications of FIV control methods.展开更多
The snap-through behaviors and nonlinear vibrations are investigated for a bistable composite laminated cantilever shell subjected to transversal foundation excitation based on experimental and theoretical approaches....The snap-through behaviors and nonlinear vibrations are investigated for a bistable composite laminated cantilever shell subjected to transversal foundation excitation based on experimental and theoretical approaches.An improved experimental specimen is designed in order to satisfy the cantilever support boundary condition,which is composed of an asymmetric region and a symmetric region.The symmetric region of the experimental specimen is entirely clamped,which is rigidly connected to an electromagnetic shaker,while the asymmetric region remains free of constraint.Different motion paths are realized for the bistable cantilever shell by changing the input signal levels of the electromagnetic shaker,and the displacement responses of the shell are collected by the laser displacement sensors.The numerical simulation is conducted based on the established theoretical model of the bistable composite laminated cantilever shell,and an off-axis three-dimensional dynamic snap-through domain is obtained.The numerical solutions are in good agreement with the experimental results.The nonlinear stiffness characteristics,dynamic snap-through domain,and chaos and bifurcation behaviors of the shell are quantitatively analyzed.Due to the asymmetry of the boundary condition and the shell,the upper stable-state of the shell exhibits an obvious soft spring stiffness characteristic,and the lower stable-state shows a linear stiffness characteristic of the shell.展开更多
This study is focused on the effect of vibration induced by moving trains in tunnels on the surrounding ground and structures.A three-dimensional finite element model is established for a one-track railway tunnel and ...This study is focused on the effect of vibration induced by moving trains in tunnels on the surrounding ground and structures.A three-dimensional finite element model is established for a one-track railway tunnel and an adjacent twelve-storey building frame by using commercial software Midas GTS-NX(2019)and Midas Gen.This study considered the moving load effect of a complete train,which varies with space as well as with time.The effect of factors such as train speed,overburden pressure on the tunnel and variation in soil properties are studied in the time domain.As a result,the variations in horizontal and vertical acceleration for two different sites,i.e.,the free ground surface(without structure)and the area containing the structure,are compared.Also,the displacement pattern of the raft foundation is plotted for different train velocities.At lower speeds,the heaving phenomenon is negligible,but as the speed increases,both the heaving and differential settlement increase in the foundation.This study demonstrates that the effect of moving train vibrations should be considered in the design of new nearby structures and proper ground improvement should be considered for existing structures.展开更多
Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and ...Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and recognition performance of the system can be enhanced through judiciously leveraging the correlation among multimodal features.Nevertheless,two issues persist in multi-modal feature fusion recognition:Firstly,the enhancement of recognition performance in fusion recognition has not comprehensively considered the inter-modality correlations among distinct modalities.Secondly,during modal fusion,improper weight selection diminishes the salience of crucial modal features,thereby diminishing the overall recognition performance.To address these two issues,we introduce an enhanced DenseNet multimodal recognition network founded on feature-level fusion.The information from the three modalities is fused akin to RGB,and the input network augments the correlation between modes through channel correlation.Within the enhanced DenseNet network,the Efficient Channel Attention Network(ECA-Net)dynamically adjusts the weight of each channel to amplify the salience of crucial information in each modal feature.Depthwise separable convolution markedly reduces the training parameters and further enhances the feature correlation.Experimental evaluations were conducted on four multimodal databases,comprising six unimodal databases,including multispectral palmprint and palm vein databases from the Chinese Academy of Sciences.The Equal Error Rates(EER)values were 0.0149%,0.0150%,0.0099%,and 0.0050%,correspondingly.In comparison to other network methods for palmprint,palm vein,and finger vein fusion recognition,this approach substantially enhances recognition performance,rendering it suitable for high-security environments with practical applicability.The experiments in this article utilized amodest sample database comprising 200 individuals.The subsequent phase involves preparing for the extension of the method to larger databases.展开更多
The purpose of this study is to investigate the suppression effect of a nonlinear energy sink(NES)on the wind-vortex-induced pipe vibration and explore the influence of damping,stiffness,and NES installation position ...The purpose of this study is to investigate the suppression effect of a nonlinear energy sink(NES)on the wind-vortex-induced pipe vibration and explore the influence of damping,stiffness,and NES installation position on the suppression effect.In this work,the wind-vortex-induced vibration of an elastic pipe of a deepwater jacket was studied,and vibrations were suppressed by using an NES.A van der Pol wake oscillator was used to simulate vortex-induced force,and the dynamic equation of the pipe considering the NES was established.The Galerkin method was applied to discretize the motion equation,and the vortex-induced vibration(VIV)of the pipe at reduced wind speeds was numerically analyzed.The novelty of this research is that particle swarm optimization was used to optimize the parameters of the NES to improve vibration suppression.The influence of the installation position,nonlinear stiffness,and damping parameters of the NES on vibration suppression was analyzed.Results showed that the optimized parameter combinations of the NES can effectively reduce wind-vortex-induced pipe vibration.The installation position of the NES had a significant effect on vibration suppression,and the midpoint of the pipe was the optimal NES installation position.An increase in stiffness or a 10% decrease in damping may cause vibration suppression failure.The results of this study provide some guidance for VIV suppression in deepwater jacket pipes.展开更多
Multi-modal fusion technology gradually become a fundamental task in many fields,such as autonomous driving,smart healthcare,sentiment analysis,and human-computer interaction.It is rapidly becoming the dominant resear...Multi-modal fusion technology gradually become a fundamental task in many fields,such as autonomous driving,smart healthcare,sentiment analysis,and human-computer interaction.It is rapidly becoming the dominant research due to its powerful perception and judgment capabilities.Under complex scenes,multi-modal fusion technology utilizes the complementary characteristics of multiple data streams to fuse different data types and achieve more accurate predictions.However,achieving outstanding performance is challenging because of equipment performance limitations,missing information,and data noise.This paper comprehensively reviews existing methods based onmulti-modal fusion techniques and completes a detailed and in-depth analysis.According to the data fusion stage,multi-modal fusion has four primary methods:early fusion,deep fusion,late fusion,and hybrid fusion.The paper surveys the three majormulti-modal fusion technologies that can significantly enhance the effect of data fusion and further explore the applications of multi-modal fusion technology in various fields.Finally,it discusses the challenges and explores potential research opportunities.Multi-modal tasks still need intensive study because of data heterogeneity and quality.Preserving complementary information and eliminating redundant information between modalities is critical in multi-modal technology.Invalid data fusion methods may introduce extra noise and lead to worse results.This paper provides a comprehensive and detailed summary in response to these challenges.展开更多
Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of po...Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of possible future trajectories can be consid-erable(multi-modal).Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpret-ability.Moreover,the metrics used in current benchmarks do not evaluate all aspects of the problem,such as the diversity and admissibility of the output.The authors aim to advance towards the design of trustworthy motion prediction systems,based on some of the re-quirements for the design of Trustworthy Artificial Intelligence.The focus is on evaluation criteria,robustness,and interpretability of outputs.First,the evaluation metrics are comprehensively analysed,the main gaps of current benchmarks are identified,and a new holistic evaluation framework is proposed.Then,a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system.To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework,an intent prediction layer that can be attached to multi-modal motion prediction models is proposed.The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions.The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autono-mous vehicles,advancing the field towards greater safety and reliability.展开更多
Functionally graded materials(FGMs)are a novel class of composite materials that have attracted significant attention in the field of engineering due to their unique mechanical properties.This study aims to explore th...Functionally graded materials(FGMs)are a novel class of composite materials that have attracted significant attention in the field of engineering due to their unique mechanical properties.This study aims to explore the dynamic behaviors of an FGM stepped beam with different boundary conditions based on an efficient solving method.Under the assumptions of the Euler-Bernoulli beam theory,the governing differential equations of an individual FGM beam are derived with Hamilton’s principle and decoupled via the separation-of-variable approach.Then,the free and forced vibrations of the FGM stepped beam are solved with the transfer matrix method(TMM).Two models,i.e.,a three-level FGM stepped beam and a five-level FGM stepped beam,are considered,and their natural frequencies and mode shapes are presented.To demonstrate the validity of the method in this paper,the simulation results by ABAQUS are also given.On this basis,the detailed parametric analyses on the frequencies and dynamic responses of the three-level FGM stepped beam are carried out.The results show the accuracy and efficiency of the TMM.展开更多
Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the intro...Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the introduction of a large amount of information from other modalities reduces the effectiveness of representation learning and makes knowledge graph inference less effective.To address the issue,an inference method based on Media Convergence and Rule-guided Joint Inference model(MCRJI)has been pro-posed.The authors not only converge multi-media features of entities but also introduce logic rules to improve the accuracy and interpretability of link prediction.First,a multi-headed self-attention approach is used to obtain the attention of different media features of entities during semantic synthesis.Second,logic rules of different lengths are mined from knowledge graph to learn new entity representations.Finally,knowledge graph inference is performed based on representing entities that converge multi-media features.Numerous experimental results show that MCRJI outperforms other advanced baselines in using multi-media features and knowledge graph inference,demonstrating that MCRJI provides an excellent approach for knowledge graph inference with converged multi-media features.展开更多
Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent...Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent personal assistants within the context of visual,auditory,and somatosensory interactions with drivers were discussed.Their impact on the driver’s psychological state through various modes such as visual imagery,voice interaction,and gesture interaction were explored.The study also introduced innovative designs for in-vehicle intelligent personal assistants,incorporating design principles such as driver-centricity,prioritizing passenger safety,and utilizing timely feedback as a criterion.Additionally,the study employed design methods like driver behavior research and driving situation analysis to enhance the emotional connection between drivers and their vehicles,ultimately improving driver satisfaction and trust.展开更多
Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the...Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the research and applications of natural language processing across different modalities,our goal is to accurately extract frame-level semantic information from videos and ultimately transmit high-quality videos.Specifically,we propose a deep learning-basedMulti-ModalMutual Enhancement Video Semantic Communication system,called M3E-VSC.Built upon a VectorQuantized Generative AdversarialNetwork(VQGAN),our systemaims to leverage mutual enhancement among different modalities by using text as the main carrier of transmission.With it,the semantic information can be extracted fromkey-frame images and audio of the video and performdifferential value to ensure that the extracted text conveys accurate semantic information with fewer bits,thus improving the capacity of the system.Furthermore,a multi-frame semantic detection module is designed to facilitate semantic transitions during video generation.Simulation results demonstrate that our proposed model maintains high robustness in complex noise environments,particularly in low signal-to-noise ratio conditions,significantly improving the accuracy and speed of semantic transmission in video communication by approximately 50 percent.展开更多
A transition to clean hydrogen energy will not be possible until the issues related to its production, transportation,storage, etc., are adequately resolved. Currently, however, it is possible to use methane-hydrogen ...A transition to clean hydrogen energy will not be possible until the issues related to its production, transportation,storage, etc., are adequately resolved. Currently, however, it is possible to use methane-hydrogen mixtures.Natural gas can be transported using a pipeline system with the required pressure being maintained by gascompression stations. This method, however, is affected by some problems too. Compressors emergency stopscan be induced by vibrations because in some cases, mechanical methods are not able to reduce the vibrationamplitude. As an example, it is known that a gas-dynamic flow effect in labyrinth seals can lead to increasedvibrations. This paper presents the numerical simulation of rotor oscillations taking into account a gas-dynamicload. The influence of a transported mixture on the oscillatory process is investigated. Mixtures consisting ofmethane and hydrogen in various proportions and an air mixture are considered. The results are discussed forvarious operating pressures and include the rotor motion trajectories and oscillation frequency spectra obtainednumerically. It is shown that the gas mixture composition has a significant effect on the oscillations and theiroccurrence. Hydrogen as a working fluid reduces the vibration amplitude. Operating a compressor with hydrogenleads to a decrease in the resonant frequency, bringing it closer to the operating one. However, the operatingpressure at which maximum oscillations are observed depends slightly on the gas mixture composition.展开更多
One hallmark of glasses is the existence of excess vibrational modes at low frequenciesωbeyond Debye’s prediction.Numerous studies suggest that understanding low-frequency excess vibrations could help gain insight i...One hallmark of glasses is the existence of excess vibrational modes at low frequenciesωbeyond Debye’s prediction.Numerous studies suggest that understanding low-frequency excess vibrations could help gain insight into the anomalous mechanical and thermodynamic properties of glasses.However,there is still intensive debate as to the frequency dependence of the population of low-frequency excess vibrations.In particular,excess modes could hybridize with phonon-like modes and the density of hybridized excess modes has been reported to follow D_(exc)(ω)~ω^(2)in 2D glasses with an inverse power law potential.Yet,the universality of the quadratic scaling remains unknown,since recent work suggested that interaction potentials could influence the scaling of the vibrational spectrum.Here,we extend the universality of the quadratic scaling for hybridized excess modes in 2D to glasses with potentials ranging from the purely repulsive soft-core interaction to the hard-core one with both repulsion and attraction as well as to glasses with significant differences in density or interparticle repulsion.Moreover,we observe that the number of hybridized excess modes exhibits a decrease in glasses with higher density or steeper interparticle repulsion,which is accompanied by a suppression of the strength of the sound attenuation.Our results indicate that the density bears some resemblance to the repulsive steepness of the interaction in influencing low-frequency properties.展开更多
Various nonlinear phenomena such as bifurcations and chaos in the responses of carbon nanotubes(CNTs)are recognized as being major contributors to the inaccuracy and instability of nanoscale mechanical systems.Therefo...Various nonlinear phenomena such as bifurcations and chaos in the responses of carbon nanotubes(CNTs)are recognized as being major contributors to the inaccuracy and instability of nanoscale mechanical systems.Therefore,the main purpose of this paper is to predict the nonlinear dynamic behavior of a CNT conveying viscousfluid and supported on a nonlinear elastic foundation.The proposed model is based on nonlocal Euler–Bernoulli beam theory.The Galerkin method and perturbation analysis are used to discretize the partial differential equation of motion and obtain the frequency-response equation,respectively.A detailed parametric study is reported into how the nonlocal parameter,foundation coefficients,fluid viscosity,and amplitude and frequency of the external force influence the nonlinear dynamics of the system.Subharmonic,quasi-periodic,and chaotic behaviors and hardening nonlinearity are revealed by means of the vibration time histories,frequency-response curves,bifurcation diagrams,phase portraits,power spectra,and Poincarémaps.Also,the results show that it is possible to eliminate irregular motion in the whole range of external force amplitude by selecting appropriate parameters.展开更多
The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-genera...The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable.展开更多
Maintaining the integrity and longevity of structures is essential in many industries,such as aerospace,nuclear,and petroleum.To achieve the cost-effectiveness of large-scale systems in petroleum drilling,a strong emp...Maintaining the integrity and longevity of structures is essential in many industries,such as aerospace,nuclear,and petroleum.To achieve the cost-effectiveness of large-scale systems in petroleum drilling,a strong emphasis on structural durability and monitoring is required.This study focuses on the mechanical vibrations that occur in rotary drilling systems,which have a substantial impact on the structural integrity of drilling equipment.The study specifically investigates axial,torsional,and lateral vibrations,which might lead to negative consequences such as bit-bounce,chaotic whirling,and high-frequency stick-slip.These events not only hinder the efficiency of drilling but also lead to exhaustion and harm to the system’s components since they are difficult to be detected and controlled in real time.The study investigates the dynamic interactions of these vibrations,specifically in their high-frequency modes,usingfield data obtained from measurement while drilling.Thefindings have demonstrated the effect of strong coupling between the high-frequency modes of these vibrations on drilling sys-tem performance.The obtained results highlight the importance of considering the interconnected impacts of these vibrations when designing and implementing robust control systems.Therefore,integrating these compo-nents can increase the durability of drill bits and drill strings,as well as improve the ability to monitor and detect damage.Moreover,by exploiting thesefindings,the assessment of structural resilience in rotary drilling systems can be enhanced.Furthermore,the study demonstrates the capacity of structural health monitoring to improve the quality,dependability,and efficiency of rotary drilling systems in the petroleum industry.展开更多
The sixth generation(6G)of mobile communication system is witnessing a new paradigm shift,i.e.,integrated sensing-communication system.A comprehensive dataset is a prerequisite for 6G integrated sensing-communication ...The sixth generation(6G)of mobile communication system is witnessing a new paradigm shift,i.e.,integrated sensing-communication system.A comprehensive dataset is a prerequisite for 6G integrated sensing-communication research.This paper develops a novel simulation dataset,named M3SC,for mixed multi-modal(MMM)sensing-communication integration,and the generation framework of the M3SC dataset is further given.To obtain multimodal sensory data in physical space and communication data in electromagnetic space,we utilize Air-Sim and WaveFarer to collect multi-modal sensory data and exploit Wireless InSite to collect communication data.Furthermore,the in-depth integration and precise alignment of AirSim,WaveFarer,andWireless InSite are achieved.The M3SC dataset covers various weather conditions,multiplex frequency bands,and different times of the day.Currently,the M3SC dataset contains 1500 snapshots,including 80 RGB images,160 depth maps,80 LiDAR point clouds,256 sets of mmWave waveforms with 8 radar point clouds,and 72 channel impulse response(CIR)matrices per snapshot,thus totaling 120,000 RGB images,240,000 depth maps,120,000 LiDAR point clouds,384,000 sets of mmWave waveforms with 12,000 radar point clouds,and 108,000 CIR matrices.The data processing result presents the multi-modal sensory information and communication channel statistical properties.Finally,the MMM sensing-communication application,which can be supported by the M3SC dataset,is discussed.展开更多
Excavation with tunnel boring machine(TBM)can generate vibrations,causing damages to neighbouring buildings and disturbing the residents or the equipment.This problem is particularly challenging in urban areas,where T...Excavation with tunnel boring machine(TBM)can generate vibrations,causing damages to neighbouring buildings and disturbing the residents or the equipment.This problem is particularly challenging in urban areas,where TBMs are increasingly large in diameter and shallow in depth.In response to this problem,four experimental campaigns were carried out in different geotechnical contexts in France.The vibration measurements were acquired on the surface and inside the TBMs.These measurements are also complemented by few data in the literature.An original methodology of signal processing is pro-posed to characterize the amplitude of the particle velocities,as well as the frequency content of the signals to highlight the most energetic bands.The levels of vibrations are also compared with the thresholds existing in various European regulations concerning the impact on neighbouring structures and the disturbance to local residents.展开更多
Deep learning based methods have been successfully applied to semantic segmentation of optical remote sensing images.However,as more and more remote sensing data is available,it is a new challenge to comprehensively u...Deep learning based methods have been successfully applied to semantic segmentation of optical remote sensing images.However,as more and more remote sensing data is available,it is a new challenge to comprehensively utilize multi-modal remote sensing data to break through the performance bottleneck of single-modal interpretation.In addition,semantic segmentation and height estimation in remote sensing data are two tasks with strong correlation,but existing methods usually study individual tasks separately,which leads to high computational resource overhead.To this end,we propose a Multi-Task learning framework for Multi-Modal remote sensing images(MM_MT).Specifically,we design a Cross-Modal Feature Fusion(CMFF)method,which aggregates complementary information of different modalities to improve the accuracy of semantic segmentation and height estimation.Besides,a dual-stream multi-task learning method is introduced for Joint Semantic Segmentation and Height Estimation(JSSHE),extracting common features in a shared network to save time and resources,and then learning task-specific features in two task branches.Experimental results on the public multi-modal remote sensing image dataset Potsdam show that compared to training two tasks independently,multi-task learning saves 20%of training time and achieves competitive performance with mIoU of 83.02%for semantic segmentation and accuracy of 95.26%for height estimation.展开更多
Power Shell has been widely deployed in fileless malware and advanced persistent threat(APT)attacks due to its high stealthiness and live-off-theland technique.However,existing works mainly focus on deobfuscation and ...Power Shell has been widely deployed in fileless malware and advanced persistent threat(APT)attacks due to its high stealthiness and live-off-theland technique.However,existing works mainly focus on deobfuscation and malicious detection,lacking the malicious Power Shell families classification and behavior analysis.Moreover,the state-of-the-art methods fail to capture fine-grained features and semantic relationships,resulting in low robustness and accuracy.To this end,we propose Power Detector,a novel malicious Power Shell script detector based on multimodal semantic fusion and deep learning.Specifically,we design four feature extraction methods to extract key features from character,token,abstract syntax tree(AST),and semantic knowledge graph.Then,we intelligently design four embeddings(i.e.,Char2Vec,Token2Vec,AST2Vec,and Rela2Vec) and construct a multi-modal fusion algorithm to concatenate feature vectors from different views.Finally,we propose a combined model based on transformer and CNN-Bi LSTM to implement Power Shell family detection.Our experiments with five types of Power Shell attacks show that PowerDetector can accurately detect various obfuscated and stealth PowerShell scripts,with a 0.9402 precision,a 0.9358 recall,and a 0.9374 F1-score.Furthermore,through singlemodal and multi-modal comparison experiments,we demonstrate that PowerDetector’s multi-modal embedding and deep learning model can achieve better accuracy and even identify more unknown attacks.展开更多
基金financially supported by the National Natural Science Foundation of China(Grant Nos.U2106223,51979193,52301352)。
文摘The fatigue damage caused by flow-induced vibration(FIV)is one of the major concerns for multiple cylindrical structures in many engineering applications.The FIV suppression is of great importance for the security of many cylindrical structures.Many active and passive control methods have been employed for the vibration suppression of an isolated cylinder undergoing vortex-induced vibrations(VIV).The FIV suppression methods are mainly extended to the multiple cylinders from the vibration control of the isolated cylinder.Due to the mutual interference between the multiple cylinders,the FIV mechanism is more complex than the VIV mechanism,which makes a great challenge for the FIV suppression.Some efforts have been devoted to vibration suppression of multiple cylinder systems undergoing FIV over the past two decades.The control methods,such as helical strakes,splitter plates,control rods and flexible sheets,are not always effective,depending on many influence factors,such as the spacing ratio,the arrangement geometrical shape,the flow velocity and the parameters of the vibration control devices.The FIV response,hydrodynamic features and wake patterns of the multiple cylinders equipped with vibration control devices are reviewed and summarized.The FIV suppression efficiency of the vibration control methods are analyzed and compared considering different influence factors.Further research on the FIV suppression of multiple cylinders is suggested to provide insight for the development of FIV control methods and promote engineering applications of FIV control methods.
基金Project supported by the National Natural Science Foundation of China(Nos.11832002 and 12072201)。
文摘The snap-through behaviors and nonlinear vibrations are investigated for a bistable composite laminated cantilever shell subjected to transversal foundation excitation based on experimental and theoretical approaches.An improved experimental specimen is designed in order to satisfy the cantilever support boundary condition,which is composed of an asymmetric region and a symmetric region.The symmetric region of the experimental specimen is entirely clamped,which is rigidly connected to an electromagnetic shaker,while the asymmetric region remains free of constraint.Different motion paths are realized for the bistable cantilever shell by changing the input signal levels of the electromagnetic shaker,and the displacement responses of the shell are collected by the laser displacement sensors.The numerical simulation is conducted based on the established theoretical model of the bistable composite laminated cantilever shell,and an off-axis three-dimensional dynamic snap-through domain is obtained.The numerical solutions are in good agreement with the experimental results.The nonlinear stiffness characteristics,dynamic snap-through domain,and chaos and bifurcation behaviors of the shell are quantitatively analyzed.Due to the asymmetry of the boundary condition and the shell,the upper stable-state of the shell exhibits an obvious soft spring stiffness characteristic,and the lower stable-state shows a linear stiffness characteristic of the shell.
文摘This study is focused on the effect of vibration induced by moving trains in tunnels on the surrounding ground and structures.A three-dimensional finite element model is established for a one-track railway tunnel and an adjacent twelve-storey building frame by using commercial software Midas GTS-NX(2019)and Midas Gen.This study considered the moving load effect of a complete train,which varies with space as well as with time.The effect of factors such as train speed,overburden pressure on the tunnel and variation in soil properties are studied in the time domain.As a result,the variations in horizontal and vertical acceleration for two different sites,i.e.,the free ground surface(without structure)and the area containing the structure,are compared.Also,the displacement pattern of the raft foundation is plotted for different train velocities.At lower speeds,the heaving phenomenon is negligible,but as the speed increases,both the heaving and differential settlement increase in the foundation.This study demonstrates that the effect of moving train vibrations should be considered in the design of new nearby structures and proper ground improvement should be considered for existing structures.
基金funded by the National Natural Science Foundation of China(61991413)the China Postdoctoral Science Foundation(2019M651142)+1 种基金the Natural Science Foundation of Liaoning Province(2021-KF-12-07)the Natural Science Foundations of Liaoning Province(2023-MS-322).
文摘Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and recognition performance of the system can be enhanced through judiciously leveraging the correlation among multimodal features.Nevertheless,two issues persist in multi-modal feature fusion recognition:Firstly,the enhancement of recognition performance in fusion recognition has not comprehensively considered the inter-modality correlations among distinct modalities.Secondly,during modal fusion,improper weight selection diminishes the salience of crucial modal features,thereby diminishing the overall recognition performance.To address these two issues,we introduce an enhanced DenseNet multimodal recognition network founded on feature-level fusion.The information from the three modalities is fused akin to RGB,and the input network augments the correlation between modes through channel correlation.Within the enhanced DenseNet network,the Efficient Channel Attention Network(ECA-Net)dynamically adjusts the weight of each channel to amplify the salience of crucial information in each modal feature.Depthwise separable convolution markedly reduces the training parameters and further enhances the feature correlation.Experimental evaluations were conducted on four multimodal databases,comprising six unimodal databases,including multispectral palmprint and palm vein databases from the Chinese Academy of Sciences.The Equal Error Rates(EER)values were 0.0149%,0.0150%,0.0099%,and 0.0050%,correspondingly.In comparison to other network methods for palmprint,palm vein,and finger vein fusion recognition,this approach substantially enhances recognition performance,rendering it suitable for high-security environments with practical applicability.The experiments in this article utilized amodest sample database comprising 200 individuals.The subsequent phase involves preparing for the extension of the method to larger databases.
基金supported by the Tianjin Municipal Transportation Commission Project(No.2018-b2).
文摘The purpose of this study is to investigate the suppression effect of a nonlinear energy sink(NES)on the wind-vortex-induced pipe vibration and explore the influence of damping,stiffness,and NES installation position on the suppression effect.In this work,the wind-vortex-induced vibration of an elastic pipe of a deepwater jacket was studied,and vibrations were suppressed by using an NES.A van der Pol wake oscillator was used to simulate vortex-induced force,and the dynamic equation of the pipe considering the NES was established.The Galerkin method was applied to discretize the motion equation,and the vortex-induced vibration(VIV)of the pipe at reduced wind speeds was numerically analyzed.The novelty of this research is that particle swarm optimization was used to optimize the parameters of the NES to improve vibration suppression.The influence of the installation position,nonlinear stiffness,and damping parameters of the NES on vibration suppression was analyzed.Results showed that the optimized parameter combinations of the NES can effectively reduce wind-vortex-induced pipe vibration.The installation position of the NES had a significant effect on vibration suppression,and the midpoint of the pipe was the optimal NES installation position.An increase in stiffness or a 10% decrease in damping may cause vibration suppression failure.The results of this study provide some guidance for VIV suppression in deepwater jacket pipes.
基金supported by the Natural Science Foundation of Liaoning Province(Grant No.2023-MSBA-070)the National Natural Science Foundation of China(Grant No.62302086).
文摘Multi-modal fusion technology gradually become a fundamental task in many fields,such as autonomous driving,smart healthcare,sentiment analysis,and human-computer interaction.It is rapidly becoming the dominant research due to its powerful perception and judgment capabilities.Under complex scenes,multi-modal fusion technology utilizes the complementary characteristics of multiple data streams to fuse different data types and achieve more accurate predictions.However,achieving outstanding performance is challenging because of equipment performance limitations,missing information,and data noise.This paper comprehensively reviews existing methods based onmulti-modal fusion techniques and completes a detailed and in-depth analysis.According to the data fusion stage,multi-modal fusion has four primary methods:early fusion,deep fusion,late fusion,and hybrid fusion.The paper surveys the three majormulti-modal fusion technologies that can significantly enhance the effect of data fusion and further explore the applications of multi-modal fusion technology in various fields.Finally,it discusses the challenges and explores potential research opportunities.Multi-modal tasks still need intensive study because of data heterogeneity and quality.Preserving complementary information and eliminating redundant information between modalities is critical in multi-modal technology.Invalid data fusion methods may introduce extra noise and lead to worse results.This paper provides a comprehensive and detailed summary in response to these challenges.
基金European Commission,Joint Research Center,Grant/Award Number:HUMAINTMinisterio de Ciencia e Innovación,Grant/Award Number:PID2020‐114924RB‐I00Comunidad de Madrid,Grant/Award Number:S2018/EMT‐4362 SEGVAUTO 4.0‐CM。
文摘Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning.This task is very complex,as the behaviour of road agents depends on many factors and the number of possible future trajectories can be consid-erable(multi-modal).Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpret-ability.Moreover,the metrics used in current benchmarks do not evaluate all aspects of the problem,such as the diversity and admissibility of the output.The authors aim to advance towards the design of trustworthy motion prediction systems,based on some of the re-quirements for the design of Trustworthy Artificial Intelligence.The focus is on evaluation criteria,robustness,and interpretability of outputs.First,the evaluation metrics are comprehensively analysed,the main gaps of current benchmarks are identified,and a new holistic evaluation framework is proposed.Then,a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system.To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework,an intent prediction layer that can be attached to multi-modal motion prediction models is proposed.The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions.The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autono-mous vehicles,advancing the field towards greater safety and reliability.
基金the National Natural Science Foundation of China(Nos.12302007,12372006,and 12202109)the Specific Research Project of Guangxi for Research Bases and Talents(No.AD23026051)。
文摘Functionally graded materials(FGMs)are a novel class of composite materials that have attracted significant attention in the field of engineering due to their unique mechanical properties.This study aims to explore the dynamic behaviors of an FGM stepped beam with different boundary conditions based on an efficient solving method.Under the assumptions of the Euler-Bernoulli beam theory,the governing differential equations of an individual FGM beam are derived with Hamilton’s principle and decoupled via the separation-of-variable approach.Then,the free and forced vibrations of the FGM stepped beam are solved with the transfer matrix method(TMM).Two models,i.e.,a three-level FGM stepped beam and a five-level FGM stepped beam,are considered,and their natural frequencies and mode shapes are presented.To demonstrate the validity of the method in this paper,the simulation results by ABAQUS are also given.On this basis,the detailed parametric analyses on the frequencies and dynamic responses of the three-level FGM stepped beam are carried out.The results show the accuracy and efficiency of the TMM.
基金National College Students’Training Programs of Innovation and Entrepreneurship,Grant/Award Number:S202210022060the CACMS Innovation Fund,Grant/Award Number:CI2021A00512the National Nature Science Foundation of China under Grant,Grant/Award Number:62206021。
文摘Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the introduction of a large amount of information from other modalities reduces the effectiveness of representation learning and makes knowledge graph inference less effective.To address the issue,an inference method based on Media Convergence and Rule-guided Joint Inference model(MCRJI)has been pro-posed.The authors not only converge multi-media features of entities but also introduce logic rules to improve the accuracy and interpretability of link prediction.First,a multi-headed self-attention approach is used to obtain the attention of different media features of entities during semantic synthesis.Second,logic rules of different lengths are mined from knowledge graph to learn new entity representations.Finally,knowledge graph inference is performed based on representing entities that converge multi-media features.Numerous experimental results show that MCRJI outperforms other advanced baselines in using multi-media features and knowledge graph inference,demonstrating that MCRJI provides an excellent approach for knowledge graph inference with converged multi-media features.
文摘Intelligent personal assistants play a pivotal role in in-vehicle systems,significantly enhancing life efficiency,driving safety,and decision-making support.In this study,the multi-modal design elements of intelligent personal assistants within the context of visual,auditory,and somatosensory interactions with drivers were discussed.Their impact on the driver’s psychological state through various modes such as visual imagery,voice interaction,and gesture interaction were explored.The study also introduced innovative designs for in-vehicle intelligent personal assistants,incorporating design principles such as driver-centricity,prioritizing passenger safety,and utilizing timely feedback as a criterion.Additionally,the study employed design methods like driver behavior research and driving situation analysis to enhance the emotional connection between drivers and their vehicles,ultimately improving driver satisfaction and trust.
基金supported by the National Key Research and Development Project under Grant 2020YFB1807602Key Program of Marine Economy Development Special Foundation of Department of Natural Resources of Guangdong Province(GDNRC[2023]24)the National Natural Science Foundation of China under Grant 62271267.
文摘Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the research and applications of natural language processing across different modalities,our goal is to accurately extract frame-level semantic information from videos and ultimately transmit high-quality videos.Specifically,we propose a deep learning-basedMulti-ModalMutual Enhancement Video Semantic Communication system,called M3E-VSC.Built upon a VectorQuantized Generative AdversarialNetwork(VQGAN),our systemaims to leverage mutual enhancement among different modalities by using text as the main carrier of transmission.With it,the semantic information can be extracted fromkey-frame images and audio of the video and performdifferential value to ensure that the extracted text conveys accurate semantic information with fewer bits,thus improving the capacity of the system.Furthermore,a multi-frame semantic detection module is designed to facilitate semantic transitions during video generation.Simulation results demonstrate that our proposed model maintains high robustness in complex noise environments,particularly in low signal-to-noise ratio conditions,significantly improving the accuracy and speed of semantic transmission in video communication by approximately 50 percent.
基金the Russian Ministry of Education and Science,Project FSNM-2023-0004“Hydrogen Energy.Materials and Technology for Storage,Transportation and Use of Hydrogen and Hydrogen-Containing Mixtures”.
文摘A transition to clean hydrogen energy will not be possible until the issues related to its production, transportation,storage, etc., are adequately resolved. Currently, however, it is possible to use methane-hydrogen mixtures.Natural gas can be transported using a pipeline system with the required pressure being maintained by gascompression stations. This method, however, is affected by some problems too. Compressors emergency stopscan be induced by vibrations because in some cases, mechanical methods are not able to reduce the vibrationamplitude. As an example, it is known that a gas-dynamic flow effect in labyrinth seals can lead to increasedvibrations. This paper presents the numerical simulation of rotor oscillations taking into account a gas-dynamicload. The influence of a transported mixture on the oscillatory process is investigated. Mixtures consisting ofmethane and hydrogen in various proportions and an air mixture are considered. The results are discussed forvarious operating pressures and include the rotor motion trajectories and oscillation frequency spectra obtainednumerically. It is shown that the gas mixture composition has a significant effect on the oscillations and theiroccurrence. Hydrogen as a working fluid reduces the vibration amplitude. Operating a compressor with hydrogenleads to a decrease in the resonant frequency, bringing it closer to the operating one. However, the operatingpressure at which maximum oscillations are observed depends slightly on the gas mixture composition.
基金Project supported by the National Natural Science Foundation of China(Grant Nos.12374202 and 12004001)Anhui Projects(Grant Nos.2022AH020009,S020218016,and Z010118169)+1 种基金Hefei City(Grant No.Z020132009)Anhui University(start-up fund)。
文摘One hallmark of glasses is the existence of excess vibrational modes at low frequenciesωbeyond Debye’s prediction.Numerous studies suggest that understanding low-frequency excess vibrations could help gain insight into the anomalous mechanical and thermodynamic properties of glasses.However,there is still intensive debate as to the frequency dependence of the population of low-frequency excess vibrations.In particular,excess modes could hybridize with phonon-like modes and the density of hybridized excess modes has been reported to follow D_(exc)(ω)~ω^(2)in 2D glasses with an inverse power law potential.Yet,the universality of the quadratic scaling remains unknown,since recent work suggested that interaction potentials could influence the scaling of the vibrational spectrum.Here,we extend the universality of the quadratic scaling for hybridized excess modes in 2D to glasses with potentials ranging from the purely repulsive soft-core interaction to the hard-core one with both repulsion and attraction as well as to glasses with significant differences in density or interparticle repulsion.Moreover,we observe that the number of hybridized excess modes exhibits a decrease in glasses with higher density or steeper interparticle repulsion,which is accompanied by a suppression of the strength of the sound attenuation.Our results indicate that the density bears some resemblance to the repulsive steepness of the interaction in influencing low-frequency properties.
文摘Various nonlinear phenomena such as bifurcations and chaos in the responses of carbon nanotubes(CNTs)are recognized as being major contributors to the inaccuracy and instability of nanoscale mechanical systems.Therefore,the main purpose of this paper is to predict the nonlinear dynamic behavior of a CNT conveying viscousfluid and supported on a nonlinear elastic foundation.The proposed model is based on nonlocal Euler–Bernoulli beam theory.The Galerkin method and perturbation analysis are used to discretize the partial differential equation of motion and obtain the frequency-response equation,respectively.A detailed parametric study is reported into how the nonlocal parameter,foundation coefficients,fluid viscosity,and amplitude and frequency of the external force influence the nonlinear dynamics of the system.Subharmonic,quasi-periodic,and chaotic behaviors and hardening nonlinearity are revealed by means of the vibration time histories,frequency-response curves,bifurcation diagrams,phase portraits,power spectra,and Poincarémaps.Also,the results show that it is possible to eliminate irregular motion in the whole range of external force amplitude by selecting appropriate parameters.
基金the National Natural Science Foundation of China(No.61976080)the Academic Degrees&Graduate Education Reform Project of Henan Province(No.2021SJGLX195Y)+1 种基金the Teaching Reform Research and Practice Project of Henan Undergraduate Universities(No.2022SYJXLX008)the Key Project on Research and Practice of Henan University Graduate Education and Teaching Reform(No.YJSJG2023XJ006)。
文摘The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable.
文摘Maintaining the integrity and longevity of structures is essential in many industries,such as aerospace,nuclear,and petroleum.To achieve the cost-effectiveness of large-scale systems in petroleum drilling,a strong emphasis on structural durability and monitoring is required.This study focuses on the mechanical vibrations that occur in rotary drilling systems,which have a substantial impact on the structural integrity of drilling equipment.The study specifically investigates axial,torsional,and lateral vibrations,which might lead to negative consequences such as bit-bounce,chaotic whirling,and high-frequency stick-slip.These events not only hinder the efficiency of drilling but also lead to exhaustion and harm to the system’s components since they are difficult to be detected and controlled in real time.The study investigates the dynamic interactions of these vibrations,specifically in their high-frequency modes,usingfield data obtained from measurement while drilling.Thefindings have demonstrated the effect of strong coupling between the high-frequency modes of these vibrations on drilling sys-tem performance.The obtained results highlight the importance of considering the interconnected impacts of these vibrations when designing and implementing robust control systems.Therefore,integrating these compo-nents can increase the durability of drill bits and drill strings,as well as improve the ability to monitor and detect damage.Moreover,by exploiting thesefindings,the assessment of structural resilience in rotary drilling systems can be enhanced.Furthermore,the study demonstrates the capacity of structural health monitoring to improve the quality,dependability,and efficiency of rotary drilling systems in the petroleum industry.
基金This work was supported in part by the Ministry National Key Research and Development Project(Grant No.2020AAA0108101)the National Natural Science Foundation of China(Grants No.62125101,62341101,62001018,and 62301011)+1 种基金Shandong Natural Science Foundation(Grant No.ZR2023YQ058)the New Cornerstone Science Foundation through the XPLORER PRIZE.The authors would like to thank Mengyuan Lu and Zengrui Han for their help in the construction of electromagnetic space in Wireless InSite simulation platform and Weibo Wen,Qi Duan,and Yong Yu for their help in the construction of phys ical space in AirSim simulation platform.
文摘The sixth generation(6G)of mobile communication system is witnessing a new paradigm shift,i.e.,integrated sensing-communication system.A comprehensive dataset is a prerequisite for 6G integrated sensing-communication research.This paper develops a novel simulation dataset,named M3SC,for mixed multi-modal(MMM)sensing-communication integration,and the generation framework of the M3SC dataset is further given.To obtain multimodal sensory data in physical space and communication data in electromagnetic space,we utilize Air-Sim and WaveFarer to collect multi-modal sensory data and exploit Wireless InSite to collect communication data.Furthermore,the in-depth integration and precise alignment of AirSim,WaveFarer,andWireless InSite are achieved.The M3SC dataset covers various weather conditions,multiplex frequency bands,and different times of the day.Currently,the M3SC dataset contains 1500 snapshots,including 80 RGB images,160 depth maps,80 LiDAR point clouds,256 sets of mmWave waveforms with 8 radar point clouds,and 72 channel impulse response(CIR)matrices per snapshot,thus totaling 120,000 RGB images,240,000 depth maps,120,000 LiDAR point clouds,384,000 sets of mmWave waveforms with 12,000 radar point clouds,and 108,000 CIR matrices.The data processing result presents the multi-modal sensory information and communication channel statistical properties.Finally,the MMM sensing-communication application,which can be supported by the M3SC dataset,is discussed.
文摘Excavation with tunnel boring machine(TBM)can generate vibrations,causing damages to neighbouring buildings and disturbing the residents or the equipment.This problem is particularly challenging in urban areas,where TBMs are increasingly large in diameter and shallow in depth.In response to this problem,four experimental campaigns were carried out in different geotechnical contexts in France.The vibration measurements were acquired on the surface and inside the TBMs.These measurements are also complemented by few data in the literature.An original methodology of signal processing is pro-posed to characterize the amplitude of the particle velocities,as well as the frequency content of the signals to highlight the most energetic bands.The levels of vibrations are also compared with the thresholds existing in various European regulations concerning the impact on neighbouring structures and the disturbance to local residents.
基金National Key R&D Program of China(No.2022ZD0118401).
文摘Deep learning based methods have been successfully applied to semantic segmentation of optical remote sensing images.However,as more and more remote sensing data is available,it is a new challenge to comprehensively utilize multi-modal remote sensing data to break through the performance bottleneck of single-modal interpretation.In addition,semantic segmentation and height estimation in remote sensing data are two tasks with strong correlation,but existing methods usually study individual tasks separately,which leads to high computational resource overhead.To this end,we propose a Multi-Task learning framework for Multi-Modal remote sensing images(MM_MT).Specifically,we design a Cross-Modal Feature Fusion(CMFF)method,which aggregates complementary information of different modalities to improve the accuracy of semantic segmentation and height estimation.Besides,a dual-stream multi-task learning method is introduced for Joint Semantic Segmentation and Height Estimation(JSSHE),extracting common features in a shared network to save time and resources,and then learning task-specific features in two task branches.Experimental results on the public multi-modal remote sensing image dataset Potsdam show that compared to training two tasks independently,multi-task learning saves 20%of training time and achieves competitive performance with mIoU of 83.02%for semantic segmentation and accuracy of 95.26%for height estimation.
基金This work was supported by National Natural Science Foundation of China(No.62172308,No.U1626107,No.61972297,No.62172144,and No.62062019).
文摘Power Shell has been widely deployed in fileless malware and advanced persistent threat(APT)attacks due to its high stealthiness and live-off-theland technique.However,existing works mainly focus on deobfuscation and malicious detection,lacking the malicious Power Shell families classification and behavior analysis.Moreover,the state-of-the-art methods fail to capture fine-grained features and semantic relationships,resulting in low robustness and accuracy.To this end,we propose Power Detector,a novel malicious Power Shell script detector based on multimodal semantic fusion and deep learning.Specifically,we design four feature extraction methods to extract key features from character,token,abstract syntax tree(AST),and semantic knowledge graph.Then,we intelligently design four embeddings(i.e.,Char2Vec,Token2Vec,AST2Vec,and Rela2Vec) and construct a multi-modal fusion algorithm to concatenate feature vectors from different views.Finally,we propose a combined model based on transformer and CNN-Bi LSTM to implement Power Shell family detection.Our experiments with five types of Power Shell attacks show that PowerDetector can accurately detect various obfuscated and stealth PowerShell scripts,with a 0.9402 precision,a 0.9358 recall,and a 0.9374 F1-score.Furthermore,through singlemodal and multi-modal comparison experiments,we demonstrate that PowerDetector’s multi-modal embedding and deep learning model can achieve better accuracy and even identify more unknown attacks.