Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capability, and judiciously leveraging the correlation among multimodal features can improve both the robustness and the recognition performance of the system. Nevertheless, two issues persist in multi-modal feature fusion recognition. First, existing fusion recognition methods have not comprehensively considered the correlations among distinct modalities. Second, during modal fusion, improper weight selection diminishes the salience of crucial modal features and thereby degrades overall recognition performance. To address these two issues, we introduce an enhanced DenseNet multimodal recognition network founded on feature-level fusion. The information from the three modalities is fused in a manner akin to the channels of an RGB image, and the network input strengthens the correlation between modalities through channel correlation. Within the enhanced DenseNet, the Efficient Channel Attention Network (ECA-Net) dynamically adjusts the weight of each channel to amplify the salience of crucial information in each modal feature. Depthwise separable convolution markedly reduces the number of training parameters and further enhances the feature correlation. Experimental evaluations were conducted on four multimodal databases, comprising six unimodal databases, including the multispectral palmprint and palm vein databases from the Chinese Academy of Sciences. The Equal Error Rate (EER) values were 0.0149%, 0.0150%, 0.0099%, and 0.0050%, respectively. Compared with other network methods for palmprint, palm vein, and finger vein fusion recognition, this approach substantially improves recognition performance, making it suitable for high-security environments. The experiments used a modest sample database comprising 200 individuals; the next phase involves extending the method to larger databases.
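The parameter saving that the abstract above attributes to depthwise separable convolution can be illustrated with simple arithmetic. The following is a minimal sketch, not the authors' network: the function names and the layer sizes (3 x 3 kernel, 64 input channels, 128 output channels) are assumptions chosen only for illustration, and bias terms are ignored.

```python
def standard_conv_params(kernel: int, c_in: int, c_out: int) -> int:
    """Weights in a standard 2D convolution (bias ignored)."""
    return kernel * kernel * c_in * c_out


def depthwise_separable_params(kernel: int, c_in: int, c_out: int) -> int:
    """Weights in a depthwise convolution (one k x k filter per input channel)
    followed by a 1 x 1 pointwise convolution that mixes channels (bias ignored)."""
    depthwise = kernel * kernel * c_in
    pointwise = c_in * c_out
    return depthwise + pointwise


if __name__ == "__main__":
    k, c_in, c_out = 3, 64, 128  # hypothetical layer sizes, not from the paper
    std = standard_conv_params(k, c_in, c_out)
    sep = depthwise_separable_params(k, c_in, c_out)
    print(f"standard: {std}, separable: {sep}, ratio: {sep / std:.3f}")
```

For these assumed sizes the separable layer needs roughly 12% of the weights of the standard layer; in general the ratio is 1/c_out + 1/k^2.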
Multi-modal fusion technology has gradually become a fundamental task in many fields, such as autonomous driving, smart healthcare, sentiment analysis, and human-computer interaction. It is rapidly becoming a dominant research direction owing to its powerful perception and judgment capabilities. In complex scenes, multi-modal fusion technology exploits the complementary characteristics of multiple data streams to fuse different data types and achieve more accurate predictions. However, achieving outstanding performance is challenging because of equipment performance limitations, missing information, and data noise. This paper comprehensively reviews existing methods based on multi-modal fusion techniques and provides a detailed, in-depth analysis. According to the data fusion stage, multi-modal fusion has four primary methods: early fusion, deep fusion, late fusion, and hybrid fusion. The paper surveys the three major multi-modal fusion technologies that can significantly enhance the effect of data fusion and further explores the applications of multi-modal fusion technology in various fields. Finally, it discusses the challenges and explores potential research opportunities. Multi-modal tasks still need intensive study because of data heterogeneity and quality. Preserving complementary information and eliminating redundant information between modalities is critical in multi-modal technology. Ineffective data fusion methods may introduce extra noise and lead to worse results. This paper provides a comprehensive and detailed summary in response to these challenges.
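Two of the fusion stages named in the review above, early fusion and late fusion, can be contrasted in a few lines. This is a minimal sketch under stated assumptions: the function names, the use of plain concatenation for early fusion, and the weighted average for late fusion are illustrative choices, not methods taken from the paper.

```python
from typing import List, Sequence

Vector = List[float]


def early_fusion(features: Sequence[Vector]) -> Vector:
    """Early (feature-level) fusion: concatenate per-modality feature vectors
    so that a single downstream model sees all modalities at once."""
    fused: Vector = []
    for vec in features:
        fused.extend(vec)
    return fused


def late_fusion(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Late (decision-level) fusion: combine the prediction scores produced
    by separate per-modality models with a weighted average."""
    if len(scores) != len(weights):
        raise ValueError("scores and weights must have the same length")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    return sum(s * w for s, w in zip(scores, weights)) / total


if __name__ == "__main__":
    camera, lidar = [0.2, 0.7], [1.5]  # hypothetical feature vectors
    print(early_fusion([camera, lidar]))
    print(late_fusion(scores=[0.9, 0.5], weights=[1.0, 1.0]))
```

Early fusion lets the model learn cross-modal interactions but requires aligned inputs; late fusion tolerates a missing modality more easily because each model runs independently.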
Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning. This task is very complex, as the behaviour of road agents depends on many factors and the number of possible future trajectories can be considerable (multi-modal). Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpretability. Moreover, the metrics used in current benchmarks do not evaluate all aspects of the problem, such as the diversity and admissibility of the output. The authors aim to advance towards the design of trustworthy motion prediction systems, based on some of the requirements for the design of Trustworthy Artificial Intelligence. The focus is on evaluation criteria, robustness, and interpretability of outputs. First, the evaluation metrics are comprehensively analysed, the main gaps of current benchmarks are identified, and a new holistic evaluation framework is proposed. Then, a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system. To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework, an intent prediction layer that can be attached to multi-modal motion prediction models is proposed. The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions. The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autonomous vehicles, advancing the field towards greater safety and reliability.
Media convergence works by processing information from different modalities and applying it to different domains. It is difficult for a conventional knowledge graph to utilise multi-media features, because introducing a large amount of information from other modalities reduces the effectiveness of representation learning and makes knowledge graph inference less effective. To address this issue, an inference method based on the Media Convergence and Rule-guided Joint Inference model (MCRJI) is proposed. The authors not only converge the multi-media features of entities but also introduce logic rules to improve the accuracy and interpretability of link prediction. First, a multi-headed self-attention approach is used to obtain the attention of different media features of entities during semantic synthesis. Second, logic rules of different lengths are mined from the knowledge graph to learn new entity representations. Finally, knowledge graph inference is performed on entity representations that converge multi-media features. Extensive experimental results show that MCRJI outperforms other advanced baselines in using multi-media features and in knowledge graph inference, demonstrating that MCRJI provides an excellent approach for knowledge graph inference with converged multi-media features.
Intelligent personal assistants play a pivotal role in in-vehicle systems, significantly enhancing life efficiency, driving safety, and decision-making support. In this study, the multi-modal design elements of intelligent personal assistants were discussed in the context of visual, auditory, and somatosensory interactions with drivers. Their impact on the driver's psychological state through modes such as visual imagery, voice interaction, and gesture interaction was explored. The study also introduced innovative designs for in-vehicle intelligent personal assistants, incorporating design principles such as driver-centricity, prioritizing passenger safety, and using timely feedback as a criterion. Additionally, the study employed design methods such as driver behavior research and driving situation analysis to enhance the emotional connection between drivers and their vehicles, ultimately improving driver satisfaction and trust.
Recently, there have been significant advancements in the study of semantic communication in single-modal scenarios. However, the ability to process information in multi-modal environments remains limited. Inspired by the research and applications of natural language processing across different modalities, our goal is to accurately extract frame-level semantic information from videos and ultimately transmit high-quality videos. Specifically, we propose a deep learning-based Multi-Modal Mutual Enhancement Video Semantic Communication system, called M3E-VSC. Built upon a Vector Quantized Generative Adversarial Network (VQGAN), our system aims to leverage mutual enhancement among different modalities by using text as the main carrier of transmission. With it, semantic information is extracted from the key-frame images and audio of the video, and a differential-value operation is performed to ensure that the extracted text conveys accurate semantic information with fewer bits, thus improving the capacity of the system. Furthermore, a multi-frame semantic detection module is designed to facilitate semantic transitions during video generation. Simulation results demonstrate that the proposed model maintains high robustness in complex noise environments, particularly in low signal-to-noise ratio conditions, improving the accuracy and speed of semantic transmission in video communication by approximately 50 percent.
Automatic control technology is the basis for improving road construction robots. Based on the characteristics and functions of the construction equipment, this study treats input perception in terms of positioning acquisition and real-world monitoring. Positioning uses RTK-GNSS positional perception technology: positions are projected using the Gauss-Krueger projection method and then converted to Cartesian coordinates according to the characteristics of the drawing. The steering control system is the core of the electric-drive unmanned module. Based on an analysis of the composition of the steering system of unmanned engineering vehicles, models of key steering components such as the direction sensor, torque sensor, and drive motor are established, a joint simulation model of the unmanned engineering vehicle is built, and the steering controller is designed using the PID method. The simulation results show that the control method can meet the construction path demand for automatic steering. Path planning first defines the construction area with preset values, then corrects the steering angle during driving with the PID algorithm, and thereby realizes construction-based path planning. The results show that the method can keep the error within 10 cm on straight paths and within 20 cm on curves. With the collaboration of the various modules, the automatic construction simulation results for this robot show that the designed path and control method are effective.
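The PID steering correction described in the abstract above can be sketched as a discrete-time controller. This is only an illustration under loud assumptions: the class name, the gains, the time step, and the toy plant in which the heading rate equals the steering command are all hypothetical and are not taken from the paper's simulation model.

```python
from typing import Optional


class PID:
    """Discrete PID controller: u = kp*e + ki*integral(e) + kd*de/dt."""

    def __init__(self, kp: float, ki: float, kd: float, dt: float) -> None:
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_error: Optional[float] = None

    def step(self, error: float) -> float:
        """Return the control command for the current tracking error."""
        self.integral += error * self.dt
        # No derivative term on the first call, to avoid a start-up kick.
        if self.prev_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def track_heading(pid: PID, target: float, heading: float = 0.0, steps: int = 50) -> float:
    """Toy plant (assumption): the heading changes at a rate equal to the command."""
    for _ in range(steps):
        command = pid.step(target - heading)
        heading += command * pid.dt
    return heading


if __name__ == "__main__":
    controller = PID(kp=2.0, ki=0.0, kd=0.0, dt=0.1)  # hypothetical gains
    print(track_heading(controller, target=1.0))
```

With these proportional-only gains the heading error shrinks by a factor of 0.8 per step in the toy plant; a real steering loop would need gains tuned against the vehicle model.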
The construction of extraterrestrial bases has become a new goal in the active exploration of deep space. Among the construction techniques, in situ resource-based construction is one of the most promising because of its good sustainability and acceptable economic cost, triggering the development of various types of extraterrestrial construction materials. A comprehensive survey and comparison of materials from the perspective of performance was conducted to provide suggestions for material selection and optimization. Thirteen types of typical construction materials are discussed in terms of their reliability and applicability in extreme extraterrestrial environments. Mechanical, thermal and optical, and radiation-shielding properties are considered. The influencing factors and optimization methods for these properties are analyzed. From the perspective of material properties, the existing challenges lie in the comprehensive, long-term, and realistic characterization of regolith-based construction materials. Correspondingly, the suggested future directions include applying high-throughput characterization methods, running accelerated durability tests, and conducting extraterrestrial experiments.
Identifying workers' construction activities or behaviors can enable managers to better monitor labor efficiency and construction progress. However, current activity analysis methods for construction workers rely solely on manual observation and recording, which consumes considerable time and incurs high labor costs. Researchers have focused on monitoring the on-site construction activities of workers. However, when multiple workers are working together, current research cannot accurately and automatically identify the construction activity. This research proposes a deep learning framework for the automated analysis of the construction activities of multiple workers. In this framework, multiple deep neural network models are designed and used to complete worker key point extraction, worker tracking, and worker construction activity analysis. The framework was tested at an actual construction site, where activity recognition for multiple workers was performed, indicating the feasibility of the framework for the automated monitoring of work efficiency.
Food security is a strategic priority for a country's economic development. In China, high-standard farmland construction (HSFC) is an important initiative to stabilize grain production and increase grain production capacity. Based on panel data from 31 sample provinces, autonomous regions, and municipalities in China from 2005 to 2017, this study explored the impact of HSFC on grain yield using the difference-in-differences (DID) method. The results showed that HSFC significantly increased total grain production, a finding that is robust to various checks. HSFC increased grain yield through three potential mechanisms. First, it could increase the grain replanting index. Second, it could effectively reduce yield loss due to droughts and floods. Last, HSFC could strengthen the cultivated land by renovating low- and medium-yielding fields. Heterogeneity analysis found that HSFC farmland showed a significant increase in grain yield only in the main grain-producing areas and balanced areas. In addition, HSFC significantly increased the yields of rice, wheat, and maize while leading to a reduction in soybean yields. The findings suggest that the government should continue to promote HSFC, improve construction standards, and strictly control the “non-agriculturalization” and “non-coordination” of farmland to increase grain production further. At the same time, market mechanisms should be used to incentivize soybean farming, improve returns, and stabilize soybean yields.
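The logic behind the difference-in-differences (DID) method used in the study above can be shown with the canonical two-group, two-period estimator. This is a simplified sketch, not the study's panel regression specification: the function name is hypothetical and the yield figures are made-up numbers used only to exercise the formula.

```python
from statistics import mean
from typing import Sequence


def did_estimate(
    treated_pre: Sequence[float],
    treated_post: Sequence[float],
    control_pre: Sequence[float],
    control_post: Sequence[float],
) -> float:
    """Two-group, two-period DID: the change in the treated group minus the
    change in the control group. Under the parallel-trends assumption, the
    control group's change stands in for what the treated group would have
    experienced without treatment."""
    treated_change = mean(treated_post) - mean(treated_pre)
    control_change = mean(control_post) - mean(control_pre)
    return treated_change - control_change


if __name__ == "__main__":
    # Made-up yields (t/ha) for illustration only; not data from the study.
    effect = did_estimate(
        treated_pre=[5.0, 5.2],
        treated_post=[6.0, 6.2],
        control_pre=[4.0, 4.2],
        control_post=[4.5, 4.7],
    )
    print(f"estimated treatment effect: {effect:.2f}")
```

Panel studies typically estimate the same contrast by regression with unit and time fixed effects, which also allows control variables and staggered treatment timing.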
The continuous progress of urbanization has driven the continuous development and innovation of landscape planning and design. Focusing on modern construction art as an important design method, this study analyzed its concepts and characteristics and explored in depth its application in landscape planning and design. The results indicated that modern construction art had a significant impact on landscape spatial planning and layout, spatial design forms, and spatial ornaments. The use of modern construction art concepts could make landscape design more scientific, artistic, and humane, creating higher-quality leisure and entertainment venues for audiences.
Introduction: Work-related accidents are frequent and serious in the construction sector. The aim of the study was to determine the frequency of, and factors associated with, occupational accidents on the construction site of a referral hospital in Benin. Methods: A cross-sectional study was carried out. The sample size was calculated using the Schwartz formula adjusted for the number of workers on site and was 129 workers. Random sampling was used. The dependent variable was work-related accidents. The other variables were socio-demographic and occupational characteristics. Data were collected through a questionnaire survey. Medians and proportions were calculated. An association was sought using Chi-square and Fisher tests against a p-value threshold. Results: A total of 132 workers were included. Their median age was 30 years, with an interquartile range of [27 - 38]; men were the most represented, 126 (95.45%); 101 (76.52%) had a level of education of high school or above; and the majority, 85 (64.39%), had permanent status. Seniority of more than 5 years was observed in 92 (69.7%). Workers working more than 8 hours of overtime per week numbered 57 (43.18%). Exposure to vibrating objects concerned 49 (37.12%). In terms of psychosocial constraints, 82.58% had high psychological demands; 79.53% low decision-making latitude; 50.76% low social support. The frequency of work-related accidents was 6.82%, and the only associated factor was the type of worker (p = 0.016). In addition, there were 10.2% accidents among workers handling vibrating objects versus 4.98% among those not using them. With regard to psychosocial constraints, the following frequencies were recorded: 6.42% among those with high psychological demand versus 8.7% among those with low psychological demand; 7.62% among those with low decision-making latitude versus 3.7% among those with high decision-making latitude; 8.96% among those with low social support versus 4.62% among those with high support. Conclusion: Work-related accidents on construction sites must be avoided by all possible means, including the management of psychosocial constraints.
Structure-soil interface friction characteristics are important for investigating the interaction between engineering structures and soils, especially for offshore structures. The interface friction behavior between marine clay and structural materials with different roughness was studied in this paper using 3D optical scanning tests, a modified direct shear device, and numerical simulation. Relationships between the surface roughness of structures, water content, and interface friction angle were obtained from model tests. Increasing the water content decreased the interface friction angle. For interfaces with different roughness, the interface friction angle becomes smaller than that of the soil when the water content exceeds a certain value. The roughness of the interface and the water content of the soil are mutually coupled in their influence on the coefficient of friction (COF). This paper proposed a Finite Element Method (FEM) approach to simulate the interface direct shear tests of structures with different roughness. Surface models with different roughness were established from the structure data obtained by 3D scanning. The Coupled Eulerian-Lagrangian (CEL) approach was employed to analyze soils sheared by irregular surfaces. The behavior of interfaces with different roughness under cyclic shear stresses was analyzed by FEM.
Mountain excavation and city construction (MECC) projects being launched in the Loess Plateau in China involve the creation of large-scale artificial land. Understanding the subsurface evolution characteristics of the artificial land is essential, yet challenging. Here, we use an improved fiber-optic monitoring system for its subsurface multi-physical characterization. The system enables us to gather the spatiotemporal distribution of various parameters, including strata deformation, temperature, and moisture. Yan'an New District was selected as a case study to conduct refined in-situ monitoring through a 77 m-deep borehole and a 30 m-long trench. Findings reveal that the ground settlement involves the deformation of both the filling loess and the underlying intact loess. Notably, the filling loess exhibits a stronger creep capability than the underlying intact loess. The deformation along the profile is unevenly distributed and is positively correlated with soil moisture. Water accumulation has been observed at the interface between the filling loess and the underlying intact loess, leading to significant deformation. Moreover, the temperature and moisture in the filling loess have reached a new equilibrium state, with the depths influenced by atmospheric conditions measuring 31 m and 26 m, respectively. The refined investigation allows us to identify critical layers that matter to the sustainable development of newly created urban areas, and provides improved insights into the evolution mechanisms of land creation.
Unlike testing for popular GUI software, the “instruction-category” approach is proposed for testing embedded systems. The approach consists of three steps: refining items, drawing up the instruction-brief and instruction-category, and constructing the test suite. The approach is then applied to test an oven embedded system, and the detailed process is discussed. The results indicate that the “instruction-category” approach can be effectively applied in embedded system testing as a black-box method for conformity testing.
In the context of China's economic development and population aging, the innovation and exploration of the old-age care model has emerged as a new direction for community transformation and development. Given the differing needs and characteristics of individuals across the lifespan, a design approach that incorporates mixed-age integration and mutual-help communities is a viable strategy for enhancing intergenerational exchanges. This entails creating a diverse and open community that is conducive to habitation for individuals of all ages, encompassing the full spectrum of needs, from those of young children to those of the elderly. Such a community must be designed and constructed with the population in mind, from the initial planning and design stages to the operational phase. This encompasses a comprehensive range of services, including food, clothing, housing, transportation, medical care, and recreation.
Unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain. However, advanced existing approaches employ a multi-generator mechanism to model different domain mappings, which results in inefficient training of neural networks and mode collapse, limiting the diversity of the generated images. To address this issue, this paper introduces a multi-modal unsupervised image translation framework that uses a single generator to perform multi-modal image translation. First, a domain code is introduced to explicitly control the different generation tasks. Second, the squeeze-and-excitation (SE) mechanism and a feature attention (FA) module are incorporated. Finally, the model integrates multiple optimization objectives to ensure efficient multi-modal translation. Qualitative and quantitative experiments on multiple unpaired benchmark image translation datasets demonstrate the benefits of the proposed method over existing technologies. Overall, the experimental results show that the proposed method is versatile and scalable.
Taking the changes of construction land in Wan'an County over the years as the research object, the quantity and spatial characteristics of construction land in Wan'an County were analyzed, and the overall situation and regional differences of construction land utilization in the county were revealed. Based on the main influencing factors, such as land use structure, land use intensity, land input intensity, and output benefit, an evaluation indicator system was established to evaluate the economical and intensive use level of construction land in Wan'an County. The results show that the score of the economical and intensive use level of construction land in Wan'an County was 56.92, the lowest among all the districts and counties in Ji'an City. Based on the evaluation results, corresponding strategies for economical and intensive use were put forward, and safeguard measures for their implementation were explored. The purpose is to provide support for the preparation of territorial spatial planning, the delineation of urban development boundaries, and the exploitation of the potential of the construction land stock, in order to improve the utilization efficiency and benefit of construction land in Wan'an County and promote the county's economic growth to the stage of high-quality development.
This research concentrates on the longitudinal vibration of a tapered pipe pile, considering the vertical support of the surrounding soil and construction disturbance. First, the pile-soil system is partitioned into finite segments in the vertical direction, and the Voigt model is applied to simulate the vertical support of the surrounding soil acting on each pile segment. The surrounding soil is divided into finite ring-shaped zones in the radial direction to account for the construction disturbance. Then, the shear complex stiffness at the pile-soil interface is derived by solving the dynamic equilibrium equation for the soil from the outermost to the innermost zone. The displacement impedance at the top of an arbitrary pile segment is obtained by solving the dynamic equilibrium equation for the pile and is combined with the vertical support of the surrounding soil to derive the displacement impedance at the bottom of the upper adjacent segment. Further, the displacement impedance at the pile head is obtained based on the impedance function transfer technique. Finally, the reliability of the proposed solution is verified, followed by a sensitivity analysis of the coupled effect of the pile parameters, construction disturbance, and the vertical support of the surrounding soil on the displacement impedance of the pile.
Lunar habitat construction is crucial for successful lunar exploration missions. Because of the limitations of transportation conditions, extensive global research has been conducted on lunar in situ material processing techniques in recent years. The aim of this paper is to provide a comprehensive review, precise classification, and quantitative evaluation of these techniques, focusing specifically on four main approaches: reaction solidification (RS), sintering/melting (SM), bonding solidification (BS), and confinement formation (CF). Eight key indicators have been identified for the construction of low-cost and high-performance systems to assess the feasibility of these methods: in situ material ratio, curing temperature, curing time, implementation conditions, compressive strength, tensile strength, curing dimensions, and environmental adaptability. The scoring thresholds are determined by comparing the construction requirements with the actual capabilities. Among the evaluated methods, regolith bagging has emerged as a promising option because of its high in situ material ratio, low time requirement, lack of high-temperature requirements, and minimal shortcomings, with only the compressive strength falling below the neutral score. The compressive strength still maintains a value of 2 to 3 MPa. The proposed construction scheme utilizing regolith bags offers numerous advantages, including rapid and large-scale construction, ensured tensile strength, and reduced reliance on equipment and energy. In this study, guidelines for evaluating regolith solidification techniques are provided, and directions for improvement are offered. The proposed lunar habitat design based on regolith bags is a practical reference for future research.
Funding (hand-based multi-modal biometric fusion recognition study): funded by the National Natural Science Foundation of China (61991413), the China Postdoctoral Science Foundation (2019M651142), and the Natural Science Foundation of Liaoning Province (2021-KF-12-07 and 2023-MS-322).
Abstract: Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities. In addition, the robustness and recognition performance of the system can be improved by judiciously leveraging the correlation among multimodal features. Nevertheless, two issues persist in multi-modal feature fusion recognition. First, efforts to improve fusion recognition have not comprehensively considered the correlations among distinct modalities. Second, improper weight selection during modal fusion diminishes the salience of crucial modal features, thereby reducing overall recognition performance. To address these two issues, we introduce an enhanced DenseNet multimodal recognition network founded on feature-level fusion. The information from the three modalities is fused in the manner of RGB channels, and the network input strengthens the correlation between modalities through channel correlation. Within the enhanced DenseNet network, the Efficient Channel Attention Network (ECA-Net) dynamically adjusts the weight of each channel to amplify the salience of crucial information in each modal feature. Depthwise separable convolution markedly reduces the number of training parameters and further enhances feature correlation. Experimental evaluations were conducted on four multimodal databases, comprising six unimodal databases, including multispectral palmprint and palm vein databases from the Chinese Academy of Sciences. The Equal Error Rate (EER) values were 0.0149%, 0.0150%, 0.0099%, and 0.0050%, respectively. Compared with other network methods for palmprint, palm vein, and finger vein fusion recognition, this approach substantially improves recognition performance, making it suitable for practical use in high-security environments. The experiments in this article used a modest sample database comprising 200 individuals. The next phase is to prepare for extending the method to larger databases.
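The RGB-like fusion and channel attention described in the abstract above can be illustrated with a minimal sketch. It assumes three equally sized grayscale images stored as nested lists and uses a fixed, hypothetical 1D kernel where ECA-Net would learn its weights; the function names `fuse_as_channels` and `eca_weights` are illustrative and are not taken from the paper.

```python
import math


def fuse_as_channels(palmprint, palm_vein, finger_vein):
    """Stack three equally sized grayscale images into one H x W x 3 image."""
    if not (len(palmprint) == len(palm_vein) == len(finger_vein)):
        raise ValueError("all three images must have the same height")
    return [
        [[p, v, f] for p, v, f in zip(row_p, row_v, row_f)]
        for row_p, row_v, row_f in zip(palmprint, palm_vein, finger_vein)
    ]


def eca_weights(image, kernel=(0.25, 0.5, 0.25)):
    """ECA-style weights: global average pool, 1D conv across channels, sigmoid.

    The kernel values are a hypothetical stand-in for learned parameters.
    """
    height, width, channels = len(image), len(image[0]), len(image[0][0])
    # Global average pooling: one mean value per channel.
    means = [
        sum(image[i][j][k] for i in range(height) for j in range(width))
        / (height * width)
        for k in range(channels)
    ]
    # Zero-pad so that the 1D convolution keeps the channel count.
    pad = len(kernel) // 2
    padded = [0.0] * pad + means + [0.0] * pad
    conv = [
        sum(kernel[t] * padded[k + t] for t in range(len(kernel)))
        for k in range(channels)
    ]
    # Sigmoid maps each channel score to a weight in (0, 1).
    return [1.0 / (1.0 + math.exp(-z)) for z in conv]


# Tiny 2 x 2 example images with made-up pixel values.
palm = [[0.0, 1.0], [1.0, 0.0]]
vein = [[0.5, 0.5], [0.5, 0.5]]
finger = [[0.2, 0.2], [0.8, 0.8]]
fused = fuse_as_channels(palm, vein, finger)
weights = eca_weights(fused)
```

Each weight would then scale its channel before the fused image passes to later layers; in a real network the kernel is trained together with the rest of the model.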
Funding: Supported by the Natural Science Foundation of Liaoning Province (Grant No. 2023-MSBA-070) and the National Natural Science Foundation of China (Grant No. 62302086).
Abstract: Multi-modal fusion technology has gradually become a fundamental task in many fields, such as autonomous driving, smart healthcare, sentiment analysis, and human-computer interaction. It is rapidly becoming a dominant research direction due to its powerful perception and judgment capabilities. In complex scenes, multi-modal fusion technology exploits the complementary characteristics of multiple data streams to fuse different data types and achieve more accurate predictions. However, achieving outstanding performance is challenging because of equipment performance limitations, missing information, and data noise. This paper comprehensively reviews existing methods based on multi-modal fusion techniques and provides a detailed, in-depth analysis. According to the data fusion stage, multi-modal fusion has four primary methods: early fusion, deep fusion, late fusion, and hybrid fusion. The paper surveys the three major multi-modal fusion technologies that can significantly enhance the effect of data fusion and further explores the applications of multi-modal fusion technology in various fields. Finally, it discusses the challenges and explores potential research opportunities. Multi-modal tasks still need intensive study because of data heterogeneity and quality. Preserving complementary information and eliminating redundant information between modalities is critical in multi-modal technology. Invalid data fusion methods may introduce extra noise and lead to worse results. This paper provides a comprehensive and detailed summary in response to these challenges.
Funding: European Commission, Joint Research Center, Grant/Award Number: HUMAINT; Ministerio de Ciencia e Innovación, Grant/Award Number: PID2020‐114924RB‐I00; Comunidad de Madrid, Grant/Award Number: S2018/EMT‐4362 SEGVAUTO 4.0‐CM.
Abstract: Predicting the motion of other road agents enables autonomous vehicles to perform safe and efficient path planning. This task is very complex, as the behaviour of road agents depends on many factors and the number of possible future trajectories can be considerable (multi-modal). Most prior approaches proposed to address multi-modal motion prediction are based on complex machine learning systems that have limited interpretability. Moreover, the metrics used in current benchmarks do not evaluate all aspects of the problem, such as the diversity and admissibility of the output. The authors aim to advance towards the design of trustworthy motion prediction systems, based on some of the requirements for the design of Trustworthy Artificial Intelligence. The focus is on evaluation criteria, robustness, and interpretability of outputs. First, the evaluation metrics are comprehensively analysed, the main gaps of current benchmarks are identified, and a new holistic evaluation framework is proposed. Then, a method for the assessment of spatial and temporal robustness is introduced by simulating noise in the perception system. To enhance the interpretability of the outputs and generate more balanced results in the proposed evaluation framework, an intent prediction layer that can be attached to multi-modal motion prediction models is proposed. The effectiveness of this approach is assessed through a survey that explores different elements in the visualisation of the multi-modal trajectories and intentions. The proposed approach and findings make a significant contribution to the development of trustworthy motion prediction systems for autonomous vehicles, advancing the field towards greater safety and reliability.
Funding: National College Students' Training Programs of Innovation and Entrepreneurship, Grant/Award Number: S202210022060; the CACMS Innovation Fund, Grant/Award Number: CI2021A00512; the National Natural Science Foundation of China, Grant/Award Number: 62206021.
Abstract: Media convergence works by processing information from different modalities and applying it to different domains. It is difficult for a conventional knowledge graph to utilise multi-media features, because introducing a large amount of information from other modalities reduces the effectiveness of representation learning and makes knowledge graph inference less effective. To address this issue, an inference method based on the Media Convergence and Rule-guided Joint Inference model (MCRJI) has been proposed. The authors not only converge the multi-media features of entities but also introduce logic rules to improve the accuracy and interpretability of link prediction. First, a multi-headed self-attention approach is used to obtain the attention of different media features of entities during semantic synthesis. Second, logic rules of different lengths are mined from the knowledge graph to learn new entity representations. Finally, knowledge graph inference is performed based on entity representations that converge multi-media features. Extensive experimental results show that MCRJI outperforms other advanced baselines in using multi-media features and in knowledge graph inference, demonstrating that MCRJI provides an excellent approach for knowledge graph inference with converged multi-media features.
Abstract: Intelligent personal assistants play a pivotal role in in-vehicle systems, significantly enhancing life efficiency, driving safety, and decision-making support. In this study, the multi-modal design elements of intelligent personal assistants were discussed in the context of visual, auditory, and somatosensory interactions with drivers. Their impact on the driver's psychological state through modes such as visual imagery, voice interaction, and gesture interaction was explored. The study also introduced innovative designs for in-vehicle intelligent personal assistants, incorporating design principles such as driver-centricity, prioritizing passenger safety, and using timely feedback as a criterion. Additionally, the study employed design methods such as driver behavior research and driving situation analysis to enhance the emotional connection between drivers and their vehicles, ultimately improving driver satisfaction and trust.
Funding: Supported by the National Key Research and Development Project under Grant 2020YFB1807602, the Key Program of Marine Economy Development Special Foundation of Department of Natural Resources of Guangdong Province (GDNRC[2023]24), and the National Natural Science Foundation of China under Grant 62271267.
Abstract: Recently, there have been significant advancements in the study of semantic communication in single-modal scenarios. However, the ability to process information in multi-modal environments remains limited. Inspired by the research and applications of natural language processing across different modalities, our goal is to accurately extract frame-level semantic information from videos and ultimately transmit high-quality videos. Specifically, we propose a deep learning-based Multi-Modal Mutual Enhancement Video Semantic Communication system, called M3E-VSC. Built upon a Vector Quantized Generative Adversarial Network (VQGAN), our system aims to leverage mutual enhancement among different modalities by using text as the main carrier of transmission. With it, semantic information can be extracted from the key-frame images and audio of the video, and differential values are taken to ensure that the extracted text conveys accurate semantic information with fewer bits, thus improving the capacity of the system. Furthermore, a multi-frame semantic detection module is designed to facilitate semantic transitions during video generation. Simulation results demonstrate that our proposed model maintains high robustness in complex noise environments, particularly in low signal-to-noise ratio conditions, significantly improving the accuracy and speed of semantic transmission in video communication by approximately 50 percent.
Abstract: Automatic control technology is the basis for improving road construction robots. Based on the characteristics and functions of the construction equipment, this research takes positioning acquisition and real-world monitoring as its perception inputs. Positioning uses RTK-GNSS technology: coordinates on the Earth's surface are projected with the Gauss-Krueger projection method and then converted to Cartesian coordinates according to the characteristics of the construction drawing. The steering control system is the core of the electric-drive unmanned module. Based on an analysis of the composition of the steering system of unmanned engineering vehicles, models of key steering components such as the steering unit, torque sensor, and drive motor are established, a joint simulation model of the unmanned engineering vehicle is built, and the steering controller is designed using the PID method. The simulation results show that the control method can meet the demands of automatic steering along the construction path. Path planning first defines the construction area with preset values and corrects the steering angle during driving with the PID algorithm, thereby realizing construction-based path planning. The results show that the method keeps the error within 10 cm on straight paths and within 20 cm on curves. With the collaboration of the various modules, the automatic construction simulation results for this robot show that the designed path and control method are effective.
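The PID-based steering correction mentioned in the abstract above can be sketched as follows. This is a minimal illustration, not the paper's controller: the gains, the time step, and the toy plant in `simulate_offset` (where the lateral offset shrinks at a rate equal to the command) are all assumptions made for the example.

```python
class PID:
    """Discrete PID controller; the gains used below are illustrative only."""

    def __init__(self, kp, ki, kd):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0
        self.prev_error = None

    def update(self, error, dt):
        """Return the control output for the current error and time step."""
        self.integral += error * dt
        # No derivative term on the first call, since no previous error exists.
        if self.prev_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def simulate_offset(pid, offset=0.5, dt=0.1, steps=100):
    """Toy plant (assumption): the offset decreases at a rate equal to the command."""
    for _ in range(steps):
        command = pid.update(offset, dt)
        offset -= command * dt
    return offset


# Proportional-only run starting 0.5 m off the path.
final_offset = simulate_offset(PID(kp=1.0, ki=0.0, kd=0.0))
```

A real steering loop would replace the toy plant with the vehicle model and feed the controller the measured path error from the positioning system.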
Funding: Supported by the National Key Research and Development Program of China (2023YFB3711300 and 2021YFF0500300) and the Strategic Research and Consulting Project of the Chinese Academy of Engineering (2023-XZ-90 and 2023-JB-09-10).
Abstract: The construction of extraterrestrial bases has become a new goal in the active exploration of deep space. Among the construction techniques, in situ resource-based construction is one of the most promising because of its good sustainability and acceptable economic cost, which has triggered the development of various types of extraterrestrial construction materials. A comprehensive survey and comparison of materials from the perspective of performance was conducted to provide suggestions for material selection and optimization. Thirteen types of typical construction materials are discussed in terms of their reliability and applicability in extreme extraterrestrial environments. Mechanical, thermal and optical, and radiation-shielding properties are considered. The influencing factors and optimization methods for these properties are analyzed. From the perspective of material properties, the existing challenges lie in the comprehensive, long-term, and realistic characterization of regolith-based construction materials. Correspondingly, the suggested future directions include applying high-throughput characterization methods, running accelerated durability tests, and conducting extraterrestrial experiments.
Funding: Supported by the National Natural Science Foundation of China (52130801, U20A20312, 52178271, and 52077213) and the National Key Research and Development Program of China (2021YFF0500903).
Abstract: Identifying workers' construction activities or behaviors can enable managers to better monitor labor efficiency and construction progress. However, current activity analysis methods for construction workers rely solely on manual observation and recording, which consumes considerable time and incurs high labor costs. Researchers have focused on monitoring the on-site construction activities of workers. However, when multiple workers are working together, current research cannot accurately and automatically identify the construction activity. This research proposes a deep learning framework for the automated analysis of the construction activities of multiple workers. In this framework, multiple deep neural network models are designed and used to complete worker key point extraction, worker tracking, and worker construction activity analysis. The designed framework was tested at an actual construction site, and activity recognition for multiple workers was performed, indicating the feasibility of the framework for the automated monitoring of work efficiency.
Funding: Supported by the National Natural Science Foundation of China (41871184), the National Social Science Fund of China (21ZDA056), and the Scientific and Technological Innovation Project of the Chinese Academy of Agricultural Sciences (10-IAED-ZT-01-2023 and 10-IAED-RC-07-2023).
Abstract: Food security is a strategic priority for a country's economic development. In China, high-standard farmland construction (HSFC) is an important initiative to stabilize grain production and increase grain production capacity. Based on panel data from 31 sample provinces, autonomous regions, and municipalities in China from 2005 to 2017, this study explored the impact of HSFC on grain yield using the difference-in-differences (DID) method. The results showed that HSFC significantly increased total grain production, a finding that is robust to various checks. HSFC increased grain yield through three potential mechanisms. First, it could increase the grain replanting index. Second, it could effectively reduce yield loss due to droughts and floods. Last, HSFC could strengthen the cultivated land by renovating low- and medium-yielding fields. Heterogeneity analysis found that HSFC farmland showed a significant increase in grain yield only in the main grain-producing areas and balanced areas. In addition, HSFC significantly increased the yields of rice, wheat, and maize while leading to a reduction in soybean yields. The findings suggest that the government should continue to promote HSFC, improve construction standards, and strictly control the "non-agriculturalization" and "non-coordination" of farmland to increase grain production further. At the same time, market mechanisms should be used to incentivize soybean farming, improve returns, and stabilize soybean yields.
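The difference-in-differences idea used in the abstract above can be shown in its simplest two-group, two-period form. The sketch below is not the study's panel regression; the function name `did_estimate` and all yield figures are hypothetical and invented purely for illustration.

```python
from statistics import mean


def did_estimate(treated_pre, treated_post, control_pre, control_post):
    """Two-by-two DID: change in the treated group minus change in the control group."""
    treated_change = mean(treated_post) - mean(treated_pre)
    control_change = mean(control_post) - mean(control_pre)
    return treated_change - control_change


# Hypothetical grain yields (tonnes per hectare), invented for illustration only.
effect = did_estimate(
    treated_pre=[5.0, 5.2],
    treated_post=[6.0, 6.2],
    control_pre=[4.8, 5.0],
    control_post=[5.2, 5.4],
)
```

Subtracting the control group's change removes the trend shared by both groups, so the remainder is attributed to the treatment under the parallel-trends assumption.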
Funding: Sponsored by the Germplasm Collection and Conservation Project for the Forest and Grass Germplasm Resources in Anhui Province in 2024 (hxkt2024111), the Science and Technology Plan Project of Huangshan (2022KN-02), the Humanities and Social Sciences Research Project of Anhui Higher Education Institutions (SKHS2019B07), and the Key School-level Project of Huangshan University (2022xkjzd004).
Abstract: The continuous progress of urbanization has driven the continuous development and innovation of landscape planning and design. Focusing on modern construction art as an important design method, this study analyzed its concepts and characteristics and explored in depth its application in landscape planning and design. The results indicated that modern construction art had a significant impact on landscape spatial planning and layout, spatial design forms, and spatial ornaments. The use of modern construction art concepts could make landscape design more scientific, artistic, and humane, creating higher-quality leisure and entertainment venues for audiences.
Abstract: Introduction: Work-related accidents are frequent and serious in the construction sector. The aim of the study was to determine the frequency of, and the factors associated with, occupational accidents on the construction site of a referral hospital in Benin.
Methods: A cross-sectional study was carried out. The sample size was calculated using the Schwartz formula adjusted for the number of workers on site and was 129 workers. Random sampling was used. The dependent variable was work-related accidents. The other variables were socio-demographic and occupational characteristics. Data were collected through a questionnaire survey. Medians and proportions were calculated. An association was sought using Chi-square and Fisher tests with a threshold of p
Results: A total of 132 workers were included. Their median age was 30 years, with an interquartile range of [27 - 38]. Men were the most represented, 126 (95.45%); 101 (76.52%) had a level of education of high school or above, and the majority, 85 (64.39%), had permanent status. Seniority of more than 5 years was observed in 92 (69.7%). Workers doing more than 8 hours of overtime per week numbered 57 (43.18%). Exposure to vibrating objects concerned 49 (37.12%). In terms of psychosocial constraints, 82.58% had high psychological demands, 79.53% low decision-making latitude, and 50.76% low social support. The frequency of work-related accidents was 6.82%, and the only associated factor was the type of worker (p = 0.016). There were 10.2% accidents among workers handling vibrating objects versus 4.98% among those not using them. With regard to psychosocial constraints, the accident frequencies were 6.42% among those with high psychological demand versus 8.7% among those with low psychological demand; 7.62% among those with low decision-making latitude versus 3.7% among those with high decision-making latitude; and 8.96% among those with low social support versus 4.62% among those with high support.
Conclusion: Work-related accidents on construction sites must be avoided by all possible means, including the management of psychosocial constraints.
Funding: Supported by the National Natural Science Foundation of China (No. 52171282), the Taishan Scholars Program of Shandong Province, China (No. tsqn202306098), and the Shandong Provincial Key Research and Development Plan, China (No. 2021ZLGX04).
Abstract: Structure-soil interface friction characteristics are important for investigating the interaction between engineering structures and soils, especially for offshore structures. This paper studied the interface friction behavior between marine clay and structural materials of different roughness using 3D optical scanning tests, a modified direct shear device, and numerical simulation. Model tests established the relationships between the surface roughness of structures, water content, and interface friction angle. Increasing the water content decreased the interface friction angle. For interfaces of different roughness, the interface friction angle becomes smaller than that of the soil once the water content exceeds a certain value. The roughness of the interface and the water content of the soil are mutually coupled in their influence on the coefficient of friction (COF). This paper proposed a Finite Element Method (FEM) to simulate the interface direct shear tests of structures with different roughness. Surface models with different roughness were established from the structure data obtained by 3D scanning. The Coupled Eulerian-Lagrangian (CEL) approach was employed to analyze soils sheared by irregular surfaces. The behavior of interfaces with different roughness under cyclic shear stresses was analyzed by FEM.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 4203070 and 41977217) and the Key Research & Development Program of Shaanxi Province (Grant No. 2020ZDLSF06-03).
Abstract: Mountain excavation and city construction (MECC) projects being launched in the Loess Plateau in China involve the creation of large-scale artificial land. Understanding the subsurface evolution characteristics of the artificial land is essential, yet challenging. Here, we use an improved fiber-optic monitoring system for its subsurface multi-physical characterization. The system enables us to gather the spatiotemporal distribution of various parameters, including strata deformation, temperature, and moisture. Yan'an New District was selected as a case study to conduct refined in-situ monitoring through a 77 m-deep borehole and a 30 m-long trench. Findings reveal that the ground settlement involves the deformation of both the filling loess and the underlying intact loess. Notably, the filling loess exhibits a stronger creep capability than the underlying intact loess. The deformation along the profile is unevenly distributed and correlates positively with soil moisture. Water accumulation has been observed at the interface between the filling loess and the underlying intact loess, leading to significant deformation. Moreover, the temperature and moisture in the filling loess have reached a new equilibrium state, with the depths influenced by atmospheric conditions measuring 31 m and 26 m, respectively. The refined investigation allows us to identify critical layers that matter for the sustainable development of newly created urban areas and provides improved insights into the evolution mechanisms of land creation.
Abstract: In contrast to testing for popular GUI software, the "instruction-category" approach is proposed for testing embedded systems. The approach consists of three steps: refining items, drawing up the instruction-brief and instruction-category, and constructing the test suite. The approach is then applied to test an oven embedded system, and the detailed process is discussed in depth. The results indicate that the "instruction-category" approach can be effectively applied in embedded system testing as a black-box method for conformity testing.
Abstract: In the context of China's economic development and population aging, innovation in and exploration of old-age care models has emerged as a new direction for community transformation and development. Given the differing needs and characteristics of individuals across the lifespan, a design approach that incorporates mixed-age integration and mutual-help communities is a viable strategy for enhancing intergenerational exchange. This entails creating a diverse and open community that is suitable for habitation by individuals of all ages and covers the full spectrum of needs, from those of young children to those of the elderly. Such a community must be designed and constructed with the population in mind, from the initial planning and design stages to the operational phase. This encompasses a comprehensive range of services, including food, clothing, housing, transportation, medical care, and recreation.
Funding: The National Natural Science Foundation of China (No. 61976080), the Academic Degrees & Graduate Education Reform Project of Henan Province (No. 2021SJGLX195Y), the Teaching Reform Research and Practice Project of Henan Undergraduate Universities (No. 2022SYJXLX008), and the Key Project on Research and Practice of Henan University Graduate Education and Teaching Reform (No. YJSJG2023XJ006).
Abstract: Unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain. However, the advanced approaches available employ a multi-generator mechanism to model the different domain mappings, which results in inefficient training of neural networks and mode collapse, and in turn limits the diversity of the generated images. To address this issue, this paper introduces a multi-modal unsupervised image translation framework that uses a single generator to perform multi-modal image translation. First, a domain code is introduced to explicitly control the different generation tasks. Second, the squeeze-and-excitation (SE) mechanism and a feature attention (FA) module are incorporated. Finally, the model integrates multiple optimization objectives to ensure efficient multi-modal translation. Qualitative and quantitative experiments on multiple unpaired benchmark image translation datasets demonstrate the benefits of the proposed method over existing technologies. Overall, the experimental results show that the proposed method is versatile and scalable.
Abstract: Taking the changes in construction land in Wan'an County over the years as the research object, this study analyzed the quantity and spatial characteristics of construction land in the county and revealed the overall situation and regional differences in its utilization. An evaluation indicator system covering the main influencing factors, namely land use structure, land use intensity, land input intensity, and output benefit, was established to evaluate the level of economical and intensive use of construction land in Wan'an County. The results show that the county's score for economical and intensive use of construction land was 56.92, the lowest among all the districts and counties in Ji'an City. Based on the evaluation results, corresponding strategies for economical and intensive use were put forward, and safeguard measures for their implementation were explored. The purpose is to support the preparation of territorial spatial planning, the delineation of urban development boundaries, and the exploitation of the potential of existing construction land stock, with the aim of improving the utilization efficiency and benefit of construction land in Wan'an County and moving the county's economic growth toward high-quality development.
Funding: National Natural Science Foundation of China under Grant No. 51808190; the Central Government Guides Local Science and Technology Development Fund Projects under Grant No. XZ202301YD0019C; the Foundation of Key Laboratory of Soft Soils and Geoenvironmental Engineering (Zhejiang University), Ministry of Education, under Grant No. 2022P04; the Central University Basic Research Fund of China under Grant No. B220202017.
Abstract: This research concentrates on the longitudinal vibration of a tapered pipe pile considering the vertical support of the surrounding soil and construction disturbance. First, the pile-soil system is partitioned into finite segments in the vertical direction, and the Voigt model is applied to simulate the vertical support of the surrounding soil acting on each pile segment. The surrounding soil is divided into finite ring-shaped zones in the radial direction to account for the construction disturbance. Then, the shear complex stiffness at the pile-soil interface is derived by solving the dynamic equilibrium equation for the soil from the outermost to the innermost zone. The displacement impedance at the top of an arbitrary pile segment is obtained by solving the dynamic equilibrium equation for the pile and is combined with the vertical support of the surrounding soil to derive the displacement impedance at the bottom of the upper adjacent segment. Further, the displacement impedance at the pile head is obtained using the impedance function transfer technique. Finally, the reliability of the proposed solution is verified, followed by a sensitivity analysis of the coupling effect of the pile parameters, construction disturbance, and the vertical support of the surrounding soil on the displacement impedance of the pile.
Funding: Supported by the National Natural Science Foundation of China (42241109), the Guoqiang Institute, Tsinghua University (2021GQG1001), and the New Cornerstone Science Foundation through the XPLORER PRIZE.
Abstract: Lunar habitat construction is crucial for successful lunar exploration missions. Due to the limitations of transportation conditions, extensive global research has been conducted on lunar in situ material processing techniques in recent years. The aim of this paper is to provide a comprehensive review, precise classification, and quantitative evaluation of these approaches, focusing specifically on four main approaches: reaction solidification (RS), sintering/melting (SM), bonding solidification (BS), and confinement formation (CF). Eight key indicators have been identified for the construction of low-cost and high-performance systems to assess the feasibility of these methods: in situ material ratio, curing temperature, curing time, implementation conditions, compressive strength, tensile strength, curing dimensions, and environmental adaptability. The scoring thresholds are determined by comparing the construction requirements with the actual capabilities. Among the evaluated methods, regolith bagging has emerged as a promising option due to its high in situ material ratio, low time requirement, lack of high-temperature requirements, and minimal shortcomings, with only the compressive strength falling below the neutral score. The compressive strength still maintains a value of 2–3 MPa. The proposed construction scheme utilizing regolith bags offers numerous advantages, including rapid and large-scale construction, ensured tensile strength, and reduced reliance on equipment and energy. In this study, guidelines for evaluating regolith solidification techniques are provided, and directions for improvement are offered. The proposed lunar habitat design based on regolith bags is a practical reference for future research.