期刊文献+
共找到29,249篇文章
< 1 2 250 >
每页显示 20 50 100
A Hand Features Based Fusion Recognition Network with Enhancing Multi-Modal Correlation
1
作者 Wei Wu Yuan Zhang +2 位作者 Yunpeng Li Chuanyang Li YanHao 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第7期537-555,共19页
Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and ... Fusing hand-based features in multi-modal biometric recognition enhances anti-spoofing capabilities.Additionally,it leverages inter-modal correlation to enhance recognition performance.Concurrently,the robustness and recognition performance of the system can be enhanced through judiciously leveraging the correlation among multimodal features.Nevertheless,two issues persist in multi-modal feature fusion recognition:Firstly,the enhancement of recognition performance in fusion recognition has not comprehensively considered the inter-modality correlations among distinct modalities.Secondly,during modal fusion,improper weight selection diminishes the salience of crucial modal features,thereby diminishing the overall recognition performance.To address these two issues,we introduce an enhanced DenseNet multimodal recognition network founded on feature-level fusion.The information from the three modalities is fused akin to RGB,and the input network augments the correlation between modes through channel correlation.Within the enhanced DenseNet network,the Efficient Channel Attention Network(ECA-Net)dynamically adjusts the weight of each channel to amplify the salience of crucial information in each modal feature.Depthwise separable convolution markedly reduces the training parameters and further enhances the feature correlation.Experimental evaluations were conducted on four multimodal databases,comprising six unimodal databases,including multispectral palmprint and palm vein databases from the Chinese Academy of Sciences.The Equal Error Rates(EER)values were 0.0149%,0.0150%,0.0099%,and 0.0050%,correspondingly.In comparison to other network methods for palmprint,palm vein,and finger vein fusion recognition,this approach substantially enhances recognition performance,rendering it suitable for high-security environments with practical applicability.The experiments in this article utilized amodest sample database comprising 200 individuals.The subsequent phase involves preparing for the extension of the method to larger databases. 展开更多
关键词 BIOMETRICS multi-modal CORRELATION deep learning feature-level fusion
下载PDF
An Improved CREAM Model Based on DS Evidence Theory and DEMATEL
2
作者 Zhihui Xu Shuwen Shang +3 位作者 Yuntong Pu Xiaoyan Su Hong Qian Xiaolei Pan 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第3期2597-2617,共21页
Cognitive Reliability and Error Analysis Method(CREAM)is widely used in human reliability analysis(HRA).It defines nine common performance conditions(CPCs),which represent the factors thatmay affect human reliability ... Cognitive Reliability and Error Analysis Method(CREAM)is widely used in human reliability analysis(HRA).It defines nine common performance conditions(CPCs),which represent the factors thatmay affect human reliability and are used to modify the cognitive failure probability(CFP).However,the levels of CPCs are usually determined by domain experts,whichmay be subjective and uncertain.What’smore,the classicCREAMassumes that the CPCs are independent,which is unrealistic.Ignoring the dependence among CPCs will result in repeated calculations of the influence of the CPCs on CFP and lead to unreasonable reliability evaluation.To address the issue of uncertain information modeling and processing,this paper introduces evidence theory to evaluate the CPC levels in specific scenarios.To address the issue of dependence modeling,the Decision-Making Trial and Evaluation Laboratory(DEMATEL)method is used to process the dependence among CPCs and calculate the relative weights of each CPC,thus modifying the multiplier of the CPCs.The detailed process of the proposed method is illustrated in this paper and the CFP estimated by the proposed method is more reasonable. 展开更多
关键词 Human reliability analysis CREAM uncertainty modeling DEPENDENCE Dempster-Shafer evidence theory DEMATEL
下载PDF
Multi-modal knowledge graph inference via media convergence and logic rule
3
作者 Feng Lin Dongmei Li +5 位作者 Wenbin Zhang Dongsheng Shi Yuanzhou Jiao Qianzhong Chen Yiying Lin Wentao Zhu 《CAAI Transactions on Intelligence Technology》 SCIE EI 2024年第1期211-221,共11页
Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the intro... Media convergence works by processing information from different modalities and applying them to different domains.It is difficult for the conventional knowledge graph to utilise multi-media features because the introduction of a large amount of information from other modalities reduces the effectiveness of representation learning and makes knowledge graph inference less effective.To address the issue,an inference method based on Media Convergence and Rule-guided Joint Inference model(MCRJI)has been pro-posed.The authors not only converge multi-media features of entities but also introduce logic rules to improve the accuracy and interpretability of link prediction.First,a multi-headed self-attention approach is used to obtain the attention of different media features of entities during semantic synthesis.Second,logic rules of different lengths are mined from knowledge graph to learn new entity representations.Finally,knowledge graph inference is performed based on representing entities that converge multi-media features.Numerous experimental results show that MCRJI outperforms other advanced baselines in using multi-media features and knowledge graph inference,demonstrating that MCRJI provides an excellent approach for knowledge graph inference with converged multi-media features. 展开更多
关键词 logic rule media convergence multi-modal knowledge graph inference representation learning
下载PDF
Real-World Evidence in Localized Pancreatic: Coping with Uncertainty in Unselected Populations
4
作者 María I. Pamies Ramón Diana Ibarra Anguita +8 位作者 Paula Rodríguez Payá Alvaro Muñoz Abad David Sánchez García María Valero Revert Beatriz Grau Mirete Mariano Martínez Marín Alicia Calero Amaro Lorena Rodríguez Cazalla Javier Gallego Plazas 《Journal of Cancer Therapy》 2024年第4期164-178,共15页
Background: Localized pancreatic cancer, including resectable (R), borderline resectable (BR) and locally advanced unresectable disease (LAU), is considered in clinical guidelines for diverse treatment options based o... Background: Localized pancreatic cancer, including resectable (R), borderline resectable (BR) and locally advanced unresectable disease (LAU), is considered in clinical guidelines for diverse treatment options based on clinical trials in selected populations. Hence, exploring with real world evidence (RWE) clinicians’ preferences for treatment options and their results seems pertinent. Methods: In a set of consecutive patients with localized pancreatic cancer assisted in a third level hospital from January 2013 to December 2022, medical records, symptoms, diagnostic process, distribution between subtypes, and treatment plans, with safety and efficacy results, were assessed. Results: A total of 152 patients with localized disease were included (43.4% R, 21.0% BR, 33.6% LAU). The population characteristics exemplified differences between daily practice and clinical trials. Tumor location and symptoms were as expected. Treatment plan was conditioned by PS or comorbidities in 23.0% of patients. In patients with R disease, surgery followed by different adjuvant chemotherapy (CT) regimes was the antineoplastic treatment of choice (64.8%) with efficacy results (OS 37.5 months;95% CI 18.4 - 56.7), in the range of contemporary standards. The common use of neoadjuvant CT for BR disease (94.4%), with surgery in 50% of them, and its results (OS 30.8 months;95% CI 10.5 - 51.2) reflected current controversies of treatment recommendations and evolution in this scenario. Paliative CT with or without radiotherapy was the standard specific treatment in LAU disease (95.1%) with survival results (PFS: 10.8 months;95% CI 8.8 - 12.7. OS: 20.3 months;95% CI 13.5 - 27.2) that justify the distinct character and the specific study of this entity. Conclusion: RWE for localized pancreatic cancer aroused from the analysis of this population confirms the distinct nature of patients assisted in daily practice, as well as mirrors the complexity of decision making in clinical assumptions in which achieving stronger evidence should be paramount. 展开更多
关键词 Real-World evidence LOCALIZED PANCREATIC CANCER
下载PDF
Generative Multi-Modal Mutual Enhancement Video Semantic Communications
5
作者 Yuanle Chen Haobo Wang +3 位作者 Chunyu Liu Linyi Wang Jiaxin Liu Wei Wu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第6期2985-3009,共25页
Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the... Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the research and applications of natural language processing across different modalities,our goal is to accurately extract frame-level semantic information from videos and ultimately transmit high-quality videos.Specifically,we propose a deep learning-basedMulti-ModalMutual Enhancement Video Semantic Communication system,called M3E-VSC.Built upon a VectorQuantized Generative AdversarialNetwork(VQGAN),our systemaims to leverage mutual enhancement among different modalities by using text as the main carrier of transmission.With it,the semantic information can be extracted fromkey-frame images and audio of the video and performdifferential value to ensure that the extracted text conveys accurate semantic information with fewer bits,thus improving the capacity of the system.Furthermore,a multi-frame semantic detection module is designed to facilitate semantic transitions during video generation.Simulation results demonstrate that our proposed model maintains high robustness in complex noise environments,particularly in low signal-to-noise ratio conditions,significantly improving the accuracy and speed of semantic transmission in video communication by approximately 50 percent. 展开更多
关键词 Generative adversarial networks multi-modal mutual enhancement video semantic transmission deep learning
下载PDF
Levels of evidence and grades of recommendation supporting European society for medical oncology clinical practice guidelines
6
作者 MARKO SKELIN BRUNA PERKOV-STIPIČIN +5 位作者 SANJA VUŠKOVIĆ MARINAŠANDRK PLEHAČEK ANE BAŠIĆ DAVIDŠARČEVIĆ MAJA ILIĆ IVAN KREČAK 《Oncology Research》 SCIE 2024年第5期807-815,共9页
Background:The European Society for Medical Oncology(ESMO)guidelines are among the most comprehensive and widely used clinical practice guidelines(CPGs)globally.However,the level of scientific evidence supporting ESMO... Background:The European Society for Medical Oncology(ESMO)guidelines are among the most comprehensive and widely used clinical practice guidelines(CPGs)globally.However,the level of scientific evidence supporting ESMO CPG recommendations has not been systematically investigated.This study assessed ESMO CPG levels of evidence(LOE)and grades of recommendations(GOR),as well as their trends over time across various cancer settings.Methods:We manually extracted every recommendation with the Infectious Diseases Society of America(IDSA)classification from each CPG.We examined the distribution of LOE and GOR in all available ESMO CPG guidelines across different topics and cancer types.Results:Among the 1,823 recommendations in the current CPG,30%were classified as LOEⅠ,and 43%were classified as GOR A.Overall,there was a slight decrease in LOEⅠ(−2%)and an increase in the proportion of GOR A(+1%)in the current CPG compared to previous versions.The proportion of GOR A recommendations based on higher levels of evidence such as randomized trials(LOEⅠ–Ⅱ)shows a decrease(71%vs.63%,p=0.009)while recommendations based on lower levels of evidence(LOEⅢ–Ⅴ)show an increase(29%vs.37%,p=0.01)between previous and current version.In the current versions,the highest proportion of LOEⅠ(42%)was found in recommendations related to pharmacotherapy,while the highest proportion of GOR A recommendations was found in the areas of pathology(50%)and diagnostic(50%)recommendations.Significant variability in LOEⅠand GOR A recommendations and their changes over time was observed across different cancer types.Conclusion:One-third of the current ESMO CPG recommendations are supported by the highest level of evidence.More well-designed randomized clinical trials are needed to increase the proportion of LOEⅠand GOR A recommendations,ultimately leading to improved outcomes for cancer patients. 展开更多
关键词 ESMO guidelines Clinical practice guidelines Level of evidence Grade of recommendation
下载PDF
An Evidence-Based CoCoSo Framework with Double Hierarchy Linguistic Data for Viable Selection of Hydrogen Storage Methods
7
作者 Raghunathan Krishankumar Dhruva Sundararajan +1 位作者 K.S.Ravichandran Edmundas Kazimieras Zavadskas 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第3期2845-2872,共28页
Hydrogen is the new age alternative energy source to combat energy demand and climate change.Storage of hydrogen is vital for a nation’s growth.Works of literature provide different methods for storing the produced h... Hydrogen is the new age alternative energy source to combat energy demand and climate change.Storage of hydrogen is vital for a nation’s growth.Works of literature provide different methods for storing the produced hydrogen,and the rational selection of a viable method is crucial for promoting sustainability and green practices.Typically,hydrogen storage is associated with diverse sustainable and circular economy(SCE)criteria.As a result,the authors consider the situation a multi-criteria decision-making(MCDM)problem.Studies infer that previous models for hydrogen storage method(HSM)selection(i)do not consider preferences in the natural language form;(ii)weights of experts are not methodically determined;(iii)hesitation of experts during criteria weight assessment is not effectively explored;and(iv)three-stage solution of a suitable selection of HSM is unexplored.Driven by these gaps,in this paper,authors put forward a new integrated framework,which considers double hierarchy linguistic information for rating,criteria importance through inter-criteria correlation(CRITIC)for expert weight calculation,evidence-based Bayesian method for criteria weight estimation,and combined compromise solution(CoCoSo)for ranking HSMs.The applicability of the developed framework is testified by using a case example of HSM selection in India.Sensitivity and comparative analysis reveal the merits and limitations of the developed framework. 展开更多
关键词 Hydrogen storage methods double hierarchy hesitant fuzzy linguistic term set evidence theory CoCoSo method sustainability circular economy
下载PDF
Unsupervised multi-modal image translation based on the squeeze-and-excitation mechanism and feature attention module
8
作者 胡振涛 HU Chonghao +1 位作者 YANG Haoran SHUAI Weiwei 《High Technology Letters》 EI CAS 2024年第1期23-30,共8页
The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-genera... The unsupervised multi-modal image translation is an emerging domain of computer vision whose goal is to transform an image from the source domain into many diverse styles in the target domain.However,the multi-generator mechanism is employed among the advanced approaches available to model different domain mappings,which results in inefficient training of neural networks and pattern collapse,leading to inefficient generation of image diversity.To address this issue,this paper introduces a multi-modal unsupervised image translation framework that uses a generator to perform multi-modal image translation.Specifically,firstly,the domain code is introduced in this paper to explicitly control the different generation tasks.Secondly,this paper brings in the squeeze-and-excitation(SE)mechanism and feature attention(FA)module.Finally,the model integrates multiple optimization objectives to ensure efficient multi-modal translation.This paper performs qualitative and quantitative experiments on multiple non-paired benchmark image translation datasets while demonstrating the benefits of the proposed method over existing technologies.Overall,experimental results have shown that the proposed method is versatile and scalable. 展开更多
关键词 multi-modal image translation generative adversarial network(GAN) squeezeand-excitation(SE)mechanism feature attention(FA)module
下载PDF
Non-alcoholic fatty liver disease in type 2 diabetes:Emerging evidence of benefit of peroxisome proliferator-activated receptors agonists and incretin-based therapies
9
作者 Subhodip Pramanik Partha Pal Sayantan Ray 《World Journal of Methodology》 2024年第2期38-50,共13页
Nonalcoholic fatty liver disease(NAFLD)is a global epidemic,affecting more than half of the people living with type 2 diabetes(T2D).The relationship between NAFLD and T2D is bidirectional and the presence of one perpe... Nonalcoholic fatty liver disease(NAFLD)is a global epidemic,affecting more than half of the people living with type 2 diabetes(T2D).The relationship between NAFLD and T2D is bidirectional and the presence of one perpetuates the other,which significantly increases the hepatic as well as extrahepatic complications.Until recently,there was no approved pharmacological treatment for NAFLD/nonalcoholic steatohepatitits(NASH).However,there is evidence that drugs used for diabetes may have beneficial effects on NAFLD.Insulin sensitizers acting through peroxisome proliferator-activated receptor(PPAR)modulation act on multiple levels of NAFLD pathogenesis.Pioglitazone(PPARγ agonist)and saroglitazar(PPARα/γagonist)are particularly beneficial and recommended by several authoritative bodies for treating NAFLD in T2D,although data on biopsyproven NASH are lacking with the latter.Initial data on elafibanor(PPARα/δ agonist)and Lanifibranor(pan PPAR agonist)are promising.On the other hand,incretin therapies based on glucagon-like peptide-1(GLP-1)receptor agonists(GLP-1RA)and dual-and triple-hormone receptor co-agonists reported impressive weight loss and may have anti-inflammatory and antifibrotic properties.GLP-1 RAs have shown beneficial effects on NAFLD/NASH and more studies on potential direct effects on liver function by dual-and triple-agonists are required.Furthermore,the long-term safety of these therapies in NAFLD needs to be established.Collaborative efforts among healthcare providers such as primary care doctors,hepatologists,and endocrinologists are warranted for selecting patients for the best possible management of NAFLD in T2D. 展开更多
关键词 Non-alcoholic fatty liver disease Type 2 diabetes evidence PPAR agonists Incretin-based therapies
下载PDF
Evidence-Based Nursing Practice of Reducing Immune-Related Skin Toxicity of Tumor Patients Guided by Sensitive Indicators
10
作者 Lingling Tang Qiong Wen 《Journal of Biosciences and Medicines》 2024年第4期210-215,共6页
Purpose research on nursing sensitive indicators in tumor Patients application effect in immune-related skin toxicity management. Method select our hospital April to June, 202360 cases patients with immune therapy set... Purpose research on nursing sensitive indicators in tumor Patients application effect in immune-related skin toxicity management. Method select our hospital April to June, 202360 cases patients with immune therapy settings as the control group. August-October, 2023 60 cases the patients treated with immune therapy were the experimental group. The control group adopted regular nursing methods, while the experimental group sensitive Indicators, evidence-based give preventive care. The social situation, psychological state, physical function, quality of life score, incidence of skin toxicity caused by immune checkpoint inhibitors, moderate and above of the two groups of patients were compared. Incidence of skin toxicity. Result: experience group SAS score, SDS score higher than the control group, the difference was statistically significant (P < 0.05);The incidence of skin toxic reactions caused by immune checkpoint inhibitors and the incidence of moderate and above skin toxic reactions in the experimental group are lower than those in the control group, and the difference is statistically significant (P < 0.05). Conclusion: sensitive indicator guidance evidence-based preventive care can reduce the degree of immune-related skin toxicity, improve the psychological state and quality of life of tumor patients treated with immune therapy and reduce the incidence of adverse reactions, improve nursing quality and patient satisfaction. 展开更多
关键词 Sensitive Indicators Immune-Related Skin Toxicity evidence-Based Practice TUMOR
下载PDF
Clinical Practice of Evidence-Based PDCA Cycle Management Model in Accelerated Recovery of Lung Cancer Patients
11
作者 Lu Kang Juan Yuan +1 位作者 Dandan Liu Bo Deng 《Journal of Cancer Therapy》 2024年第4期130-140,共11页
Objective: To explore the nursing effect of evidence-based PDCA cycle management mode in accelerated rehabilitation of patients undergoing thoracoscopic lung cancer radical surgery. Methods: 104 patients who underwent... Objective: To explore the nursing effect of evidence-based PDCA cycle management mode in accelerated rehabilitation of patients undergoing thoracoscopic lung cancer radical surgery. Methods: 104 patients who underwent thoracoscopic lung cancer radical surgery in our hospital from June 2022 to February 2023 were randomly divided into control group and intervention group, with 52 cases in each group. The control group implemented evidence-based ERAS clinical pathway management, while the intervention group implemented evidence-based PDCA cycle quality management. The postoperative recovery of the two groups of patients was compared. Results: The postoperative recovery of the intervention group was significantly better than that of the control group. The first time to get out of bed, the first time to eat, the duration of chest drainage tube placement, and the length of hospital stay were significantly shorter than those of the control group. The incidence of postoperative chest complications and hospitalization costs were significantly lower than those of the control group, and patient satisfaction was significantly higher than that of the control group (P Conclusion: Evidence-based PDCA cycle quality management mode can effectively improve the implementation quality of accelerated rehabilitation nursing for patients undergoing thoracoscopic lung cancer radical surgery, and it is worthy of clinical promotion. 展开更多
关键词 evidence-BASED PDCA Cycle Thoracoscopic Lung Cancer Radical Surgery Accelerated Rehabilitation
下载PDF
Research on the Application of Evidence-Based Quality Control Circle to Improve the Implementation Rate of Airway Management Measures in Adult Critically Ill Patients
12
作者 Yujiao Yan Jing Wu +4 位作者 Juan Liu Yanting Yuan Lixin Liu Huaxin Ye Juan Ding 《Yangtze Medicine》 2024年第1期8-19,共12页
Objective: To explore the effect of evidence-based quality control circle (QCC) in improving the implementation rate of airway management measures in adult critically ill patients. Methods: Based on the Joanna Briggs ... Objective: To explore the effect of evidence-based quality control circle (QCC) in improving the implementation rate of airway management measures in adult critically ill patients. Methods: Based on the Joanna Briggs Institute (JBI) evidence-based health care model, the best evidence of airway management in adult critically ill patients was obtained and applied to the clinic. Results: The total implementation rate of airway management measures in adult critically ill patients increased from 23.62% before the implementation of quality control circle to 88.82%, and the pulmonary infection rate in critically ill patients decreased from 42.31% to 21.74%, with statistical significance between the two groups (P 0.05). Conclusion: Evidence-based quality control circle activities can standardize the practice standards of airway management in critically ill patients, reduce the occurrence of patients’ airway related complications, and improve clinical outcomes. 展开更多
关键词 Critically Ill Patients Airway Management Be evidence-Based Quality Control Circle Intensive Care Unit (ICU)
下载PDF
M3SC:A Generic Dataset for Mixed Multi-Modal(MMM)Sensing and Communication Integration 被引量:3
13
作者 Xiang Cheng Ziwei Huang +6 位作者 Lu Bai Haotian Zhang Mingran Sun Boxun Liu Sijiang Li Jianan Zhang Minson Lee 《China Communications》 SCIE CSCD 2023年第11期13-29,共17页
The sixth generation(6G)of mobile communication system is witnessing a new paradigm shift,i.e.,integrated sensing-communication system.A comprehensive dataset is a prerequisite for 6G integrated sensing-communication ... The sixth generation(6G)of mobile communication system is witnessing a new paradigm shift,i.e.,integrated sensing-communication system.A comprehensive dataset is a prerequisite for 6G integrated sensing-communication research.This paper develops a novel simulation dataset,named M3SC,for mixed multi-modal(MMM)sensing-communication integration,and the generation framework of the M3SC dataset is further given.To obtain multimodal sensory data in physical space and communication data in electromagnetic space,we utilize Air-Sim and WaveFarer to collect multi-modal sensory data and exploit Wireless InSite to collect communication data.Furthermore,the in-depth integration and precise alignment of AirSim,WaveFarer,andWireless InSite are achieved.The M3SC dataset covers various weather conditions,multiplex frequency bands,and different times of the day.Currently,the M3SC dataset contains 1500 snapshots,including 80 RGB images,160 depth maps,80 LiDAR point clouds,256 sets of mmWave waveforms with 8 radar point clouds,and 72 channel impulse response(CIR)matrices per snapshot,thus totaling 120,000 RGB images,240,000 depth maps,120,000 LiDAR point clouds,384,000 sets of mmWave waveforms with 12,000 radar point clouds,and 108,000 CIR matrices.The data processing result presents the multi-modal sensory information and communication channel statistical properties.Finally,the MMM sensing-communication application,which can be supported by the M3SC dataset,is discussed. 展开更多
关键词 multi-modal sensing RAY-TRACING sensing-communication integration simulation dataset
下载PDF
Multi-task Learning of Semantic Segmentation and Height Estimation for Multi-modal Remote Sensing Images 被引量:1
14
作者 Mengyu WANG Zhiyuan YAN +2 位作者 Yingchao FENG Wenhui DIAO Xian SUN 《Journal of Geodesy and Geoinformation Science》 CSCD 2023年第4期27-39,共13页
Deep learning based methods have been successfully applied to semantic segmentation of optical remote sensing images.However,as more and more remote sensing data is available,it is a new challenge to comprehensively u... Deep learning based methods have been successfully applied to semantic segmentation of optical remote sensing images.However,as more and more remote sensing data is available,it is a new challenge to comprehensively utilize multi-modal remote sensing data to break through the performance bottleneck of single-modal interpretation.In addition,semantic segmentation and height estimation in remote sensing data are two tasks with strong correlation,but existing methods usually study individual tasks separately,which leads to high computational resource overhead.To this end,we propose a Multi-Task learning framework for Multi-Modal remote sensing images(MM_MT).Specifically,we design a Cross-Modal Feature Fusion(CMFF)method,which aggregates complementary information of different modalities to improve the accuracy of semantic segmentation and height estimation.Besides,a dual-stream multi-task learning method is introduced for Joint Semantic Segmentation and Height Estimation(JSSHE),extracting common features in a shared network to save time and resources,and then learning task-specific features in two task branches.Experimental results on the public multi-modal remote sensing image dataset Potsdam show that compared to training two tasks independently,multi-task learning saves 20%of training time and achieves competitive performance with mIoU of 83.02%for semantic segmentation and accuracy of 95.26%for height estimation. 展开更多
关键词 multi-modal MULTI-TASK semantic segmentation height estimation convolutional neural network
下载PDF
A novel adaptive temporal-spatial information fusion model based on Dempster-Shafer evidence theory
15
作者 胡振涛 SU Yujie ZHANG Zihan 《High Technology Letters》 EI CAS 2023年第4期358-364,共7页
In the field of target recognition based on the temporal-spatial information fusion,evidence the-ory has received extensive attention.To achieve accurate and efficient target recognition by the evi-dence theory,an ada... In the field of target recognition based on the temporal-spatial information fusion,evidence the-ory has received extensive attention.To achieve accurate and efficient target recognition by the evi-dence theory,an adaptive temporal-spatial information fusion model is proposed.Firstly,an adaptive evaluation correction mechanism is constructed by the evidence distance and Deng entropy,which realizes the credibility discrimination and adaptive correction of the spatial evidence.Secondly,the credibility decay operator is introduced to obtain the dynamic credibility of temporal evidence.Finally,the sequential combination of temporal-spatial evidences is achieved by Shafer’s discount criterion and Dempster’s combination rule.The simulation results show that the proposed method not only considers the dynamic and sequential characteristics of the temporal-spatial evidences com-bination,but also has a strong conflict information processing capability,which provides a new refer-ence for the field of temporal-spatial information fusion. 展开更多
关键词 temporal-spatial information fusion evidence theory Deng entropy evidence dis-tance credibility decay model
下载PDF
A survey of multi-modal learning theory
16
作者 HUANG Yu HUANG Longbo 《中山大学学报(自然科学版)(中英文)》 CAS CSCD 北大核心 2023年第5期38-49,共12页
Deep multi-modal learning,a rapidly growing field with a wide range of practical applications,aims to effectively utilize and integrate information from multiple sources,known as modalities.Despite its impressive empi... Deep multi-modal learning,a rapidly growing field with a wide range of practical applications,aims to effectively utilize and integrate information from multiple sources,known as modalities.Despite its impressive empirical performance,the theoretical foundations of deep multi-modal learning have yet to be fully explored.In this paper,we will undertake a comprehensive survey of recent developments in multi-modal learning theories,focusing on the fundamental properties that govern this field.Our goal is to provide a thorough collection of current theoretical tools for analyzing multi-modal learning,to clarify their implications for practitioners,and to suggest future directions for the establishment of a solid theoretical foundation for deep multi-modal learning. 展开更多
关键词 multi-modal learning machine learning theory OPTIMIZATION GENERALIZATION
下载PDF
PowerDetector:Malicious PowerShell Script Family Classification Based on Multi-Modal Semantic Fusion and Deep Learning
17
作者 Xiuzhang Yang Guojun Peng +2 位作者 Dongni Zhang Yuhang Gao Chenguang Li 《China Communications》 SCIE CSCD 2023年第11期202-224,共23页
Power Shell has been widely deployed in fileless malware and advanced persistent threat(APT)attacks due to its high stealthiness and live-off-theland technique.However,existing works mainly focus on deobfuscation and ... Power Shell has been widely deployed in fileless malware and advanced persistent threat(APT)attacks due to its high stealthiness and live-off-theland technique.However,existing works mainly focus on deobfuscation and malicious detection,lacking the malicious Power Shell families classification and behavior analysis.Moreover,the state-of-the-art methods fail to capture fine-grained features and semantic relationships,resulting in low robustness and accuracy.To this end,we propose Power Detector,a novel malicious Power Shell script detector based on multimodal semantic fusion and deep learning.Specifically,we design four feature extraction methods to extract key features from character,token,abstract syntax tree(AST),and semantic knowledge graph.Then,we intelligently design four embeddings(i.e.,Char2Vec,Token2Vec,AST2Vec,and Rela2Vec) and construct a multi-modal fusion algorithm to concatenate feature vectors from different views.Finally,we propose a combined model based on transformer and CNN-Bi LSTM to implement Power Shell family detection.Our experiments with five types of Power Shell attacks show that PowerDetector can accurately detect various obfuscated and stealth PowerShell scripts,with a 0.9402 precision,a 0.9358 recall,and a 0.9374 F1-score.Furthermore,through singlemodal and multi-modal comparison experiments,we demonstrate that PowerDetector’s multi-modal embedding and deep learning model can achieve better accuracy and even identify more unknown attacks. 展开更多
关键词 deep learning malicious family detection multi-modal semantic fusion POWERSHELL
下载PDF
Multi-Modal Military Event Extraction Based on Knowledge Fusion
18
作者 Yuyuan Xiang Yangli Jia +1 位作者 Xiangliang Zhang Zhenling Zhang 《Computers, Materials & Continua》 SCIE EI 2023年第10期97-114,共18页
Event extraction stands as a significant endeavor within the realm of information extraction,aspiring to automatically extract structured event information from vast volumes of unstructured text.Extracting event eleme... Event extraction stands as a significant endeavor within the realm of information extraction,aspiring to automatically extract structured event information from vast volumes of unstructured text.Extracting event elements from multi-modal data remains a challenging task due to the presence of a large number of images and overlapping event elements in the data.Although researchers have proposed various methods to accomplish this task,most existing event extraction models cannot address these challenges because they are only applicable to text scenarios.To solve the above issues,this paper proposes a multi-modal event extraction method based on knowledge fusion.Specifically,for event-type recognition,we use a meticulous pipeline approach that integrates multiple pre-trained models.This approach enables a more comprehensive capture of the multidimensional event semantic features present in military texts,thereby enhancing the interconnectedness of information between trigger words and events.For event element extraction,we propose a method for constructing a priori templates that combine event types with corresponding trigger words.This approach facilitates the acquisition of fine-grained input samples containing event trigger words,thus enabling the model to understand the semantic relationships between elements in greater depth.Furthermore,a fusion method for spatial mapping of textual event elements and image elements is proposed to reduce the category number overload and effectively achieve multi-modal knowledge fusion.The experimental results based on the CCKS 2022 dataset show that our method has achieved competitive results,with a comprehensive evaluation value F1-score of 53.4%for the model.These results validate the effectiveness of our method in extracting event elements from multi-modal data. 展开更多
关键词 Event extraction multi-modal knowledge fusion pre-trained models
下载PDF
A multi-modal clustering method for traditonal Chinese medicine clinical data via media convergence
19
作者 Jingna Si Ziwei Tian +6 位作者 Dongmei Li Lei Zhang Lei Yao Wenjuan Jiang Jia Liu Runshun Zhang Xiaoping Zhang 《CAAI Transactions on Intelligence Technology》 SCIE EI 2023年第2期390-400,共11页
Media convergence is a media change led by technological innovation.Applying media convergence technology to the study of clustering in Chinese medicine can significantly exploit the advantages of media fusion.Obtaini... Media convergence is a media change led by technological innovation.Applying media convergence technology to the study of clustering in Chinese medicine can significantly exploit the advantages of media fusion.Obtaining consistent and complementary information among multiple modalities through media convergence can provide technical support for clustering.This article presents an approach based on Media Convergence and Graph convolution Encoder Clustering(MCGEC)for traditonal Chinese medicine(TCM)clinical data.It feeds modal information and graph structure from media information into a multi-modal graph convolution encoder to obtain the media feature representation learnt from multiple modalities.MCGEC captures latent information from various modalities by fusion and optimises the feature representations and network architecture with learnt clustering labels.The experiment is conducted on real-world multimodal TCM clinical data,including information like images and text.MCGEC has improved clustering results compared to the generic single-modal clustering methods and the current more advanced multi-modal clustering methods.MCGEC applied to TCM clinical datasets can achieve better results.Integrating multimedia features into clustering algorithms offers significant benefits compared to single-modal clustering approaches that simply concatenate features from different modalities.It provides practical technical support for multi-modal clustering in the TCM field incorporating multimedia features. 展开更多
关键词 graph convolutional encoder media convergence multi-modal clustering traditional Chinese medicine
下载PDF
DCRL-KG: Distributed Multi-Modal Knowledge Graph Retrieval Platform Based on Collaborative Representation Learning
20
作者 Leilei Li Yansheng Fu +6 位作者 Dongjie Zhu Xiaofang Li Yundong Sun Jianrui Ding Mingrui Wu Ning Cao Russell Higgs 《Intelligent Automation & Soft Computing》 SCIE 2023年第6期3295-3307,共13页
The knowledge graph with relational abundant information has been widely used as the basic data support for the retrieval platforms.Image and text descriptions added to the knowledge graph enrich the node information,... The knowledge graph with relational abundant information has been widely used as the basic data support for the retrieval platforms.Image and text descriptions added to the knowledge graph enrich the node information,which accounts for the advantage of the multi-modal knowledge graph.In the field of cross-modal retrieval platforms,multi-modal knowledge graphs can help to improve retrieval accuracy and efficiency because of the abundant relational infor-mation provided by knowledge graphs.The representation learning method is sig-nificant to the application of multi-modal knowledge graphs.This paper proposes a distributed collaborative vector retrieval platform(DCRL-KG)using the multi-modal knowledge graph VisualSem as the foundation to achieve efficient and high-precision multimodal data retrieval.Firstly,use distributed technology to classify and store the data in the knowledge graph to improve retrieval efficiency.Secondly,this paper uses BabelNet to expand the knowledge graph through multi-ple filtering processes and increase the diversification of information.Finally,this paper builds a variety of retrieval models to achieve the fusion of retrieval results through linear combination methods to achieve high-precision language retrieval and image retrieval.The paper uses sentence retrieval and image retrieval experi-ments to prove that the platform can optimize the storage structure of the multi-modal knowledge graph and have good performance in multi-modal space. 展开更多
关键词 multi-modal retrieval distributed storage knowledge graph
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部