Deep learning based methods have been successfully applied to semantic segmentation of optical remote sensing images.However,as more and more remote sensing data is available,it is a new challenge to comprehensively u...Deep learning based methods have been successfully applied to semantic segmentation of optical remote sensing images.However,as more and more remote sensing data is available,it is a new challenge to comprehensively utilize multi-modal remote sensing data to break through the performance bottleneck of single-modal interpretation.In addition,semantic segmentation and height estimation in remote sensing data are two tasks with strong correlation,but existing methods usually study individual tasks separately,which leads to high computational resource overhead.To this end,we propose a Multi-Task learning framework for Multi-Modal remote sensing images(MM_MT).Specifically,we design a Cross-Modal Feature Fusion(CMFF)method,which aggregates complementary information of different modalities to improve the accuracy of semantic segmentation and height estimation.Besides,a dual-stream multi-task learning method is introduced for Joint Semantic Segmentation and Height Estimation(JSSHE),extracting common features in a shared network to save time and resources,and then learning task-specific features in two task branches.Experimental results on the public multi-modal remote sensing image dataset Potsdam show that compared to training two tasks independently,multi-task learning saves 20%of training time and achieves competitive performance with mIoU of 83.02%for semantic segmentation and accuracy of 95.26%for height estimation.展开更多
Prevailing linguistic steganalysis approaches focus on learning sensitive features to distinguish a particular category of steganographic texts from non-steganographic texts,by performing binary classification.While i...Prevailing linguistic steganalysis approaches focus on learning sensitive features to distinguish a particular category of steganographic texts from non-steganographic texts,by performing binary classification.While it remains an unsolved problem and poses a significant threat to the security of cyberspace when various categories of non-steganographic or steganographic texts coexist.In this paper,we propose a general linguistic steganalysis framework named LS-MTL,which introduces the idea of multi-task learning to deal with the classification of various categories of steganographic and non-steganographic texts.LS-MTL captures sensitive linguistic features from multiple related linguistic steganalysis tasks and can concurrently handle diverse tasks with a constructed model.In the proposed framework,convolutional neural networks(CNNs)are utilized as private base models to extract sensitive features for each steganalysis task.Besides,a shared CNN is built to capture potential interaction information and share linguistic features among all tasks.Finally,LS-MTL incorporates the private and shared sensitive features to identify the detected text as steganographic or non-steganographic.Experimental results demonstrate that the proposed framework LS-MTL outperforms the baseline in the multi-category linguistic steganalysis task,while average Acc,Pre,and Rec are increased by 0.5%,1.4%,and 0.4%,respectively.More ablation experimental results show that LS-MTL with the shared module has robust generalization capability and achieves good detection performance even in the case of spare data.展开更多
In this paper, we proposed a multi-task system that can identify dish types, food ingredients, and cooking methods from food images with deep convolutional neural networks. We built up a dataset of 360 classes of diff...In this paper, we proposed a multi-task system that can identify dish types, food ingredients, and cooking methods from food images with deep convolutional neural networks. We built up a dataset of 360 classes of different foods with at least 500 images for each class. To reduce the noises of the data, which was collected from the Internet, outlier images were detected and eliminated through a one-class SVM trained with deep convolutional features. We simultaneously trained a dish identifier, a cooking method recognizer, and a multi-label ingredient detector. They share a few low-level layers in the deep network architecture. The proposed framework shows higher accuracy than traditional method with handcrafted features, and the cooking method recognizer and ingredient detector can be applied to dishes which are not included in the training dataset to provide reference information for users.展开更多
针对传统课堂考勤中耗时长、效率低等问题,提出了一种基于计算机视觉的考勤系统,利用深度学习进行人脸识别与手机入袋检测,记录学生的到课情况与手机上交情况。为将考勤信息可视化,设计了3种登录模式的综合考勤系统。实验结果表明,该系...针对传统课堂考勤中耗时长、效率低等问题,提出了一种基于计算机视觉的考勤系统,利用深度学习进行人脸识别与手机入袋检测,记录学生的到课情况与手机上交情况。为将考勤信息可视化,设计了3种登录模式的综合考勤系统。实验结果表明,该系统不仅能在毫秒级的时间内完成检测,而且平均准确率(mean Average Precision,mAP)0.5达到0.990,保证了精确率和召回率。展开更多
The increasing share of renewable energy in the electricity grid and progressing changes in power consumption have led to fluctuating,and weather-dependent power flows.To ensure grid stability,grid operators rely on p...The increasing share of renewable energy in the electricity grid and progressing changes in power consumption have led to fluctuating,and weather-dependent power flows.To ensure grid stability,grid operators rely on power forecasts which are crucial for grid calculations and planning.In this paper,a Multi-Task Learning approach is combined with a Graph Neural Network(GNN)to predict vertical power flows at transformers connecting high and extra-high voltage levels.The proposed method accounts for local differences in power flow characteristics by using an Embedding Multi-Task Learning approach.The use of a Bayesian embedding to capture the latent node characteristics allows to share the weights across all transformers in the subsequent node-invariant GNN while still allowing the individual behavioral patterns of the transformers to be distinguished.At the same time,dependencies between transformers are considered by the GNN architecture which can learn relationships between different transformers and thus take into account that power flows in an electricity network are not independent from each other.The effectiveness of the proposed method is demonstrated through evaluation on two real-world data sets provided by two of four German Transmission System Operators,comprising large portions of the operated German transmission grid.The results show that the proposed Multi-Task Graph Neural Network is a suitable representation learner for electricity networks with a clear advantage provided by the preceding embedding layer.It is able to capture interconnections between correlated transformers and indeed improves the performance in power flow prediction compared to standard Neural Networks.A sign test shows that the proposed model reduces the test RMSE on both data sets compared to the benchmark models significantly.展开更多
Face anti-spoofing is a relatively important part of the face recognition system,which has great significance for financial payment and access control systems.Aiming at the problems of unstable face alignment,complex ...Face anti-spoofing is a relatively important part of the face recognition system,which has great significance for financial payment and access control systems.Aiming at the problems of unstable face alignment,complex lighting,and complex structure of face anti-spoofing detection network,a novel method is presented using a combination of convolutional neural network and brightness equalization.Firstly,multi-task convolutional neural network(MTCNN)based on the cascade of three convolutional neural networks(CNNs),P-net,R-net,and O-net are used to achieve accurate positioning of the face,and the detected face bounding box is cropped by a specified multiple,then brightness equalization is adopted to perform brightness compensation on different brightness areas of the face image.Finally,data features are extracted and classification is given by utilizing a 12-layer convolution neural network.Experiments of the proposed algorithm were carried out on CASIA-FASD.The results show that the classification accuracy is relatively high,and the half total error rate(HTER)reaches 1.02%.展开更多
基金National Key R&D Program of China(No.2022ZD0118401).
文摘Deep learning based methods have been successfully applied to semantic segmentation of optical remote sensing images.However,as more and more remote sensing data is available,it is a new challenge to comprehensively utilize multi-modal remote sensing data to break through the performance bottleneck of single-modal interpretation.In addition,semantic segmentation and height estimation in remote sensing data are two tasks with strong correlation,but existing methods usually study individual tasks separately,which leads to high computational resource overhead.To this end,we propose a Multi-Task learning framework for Multi-Modal remote sensing images(MM_MT).Specifically,we design a Cross-Modal Feature Fusion(CMFF)method,which aggregates complementary information of different modalities to improve the accuracy of semantic segmentation and height estimation.Besides,a dual-stream multi-task learning method is introduced for Joint Semantic Segmentation and Height Estimation(JSSHE),extracting common features in a shared network to save time and resources,and then learning task-specific features in two task branches.Experimental results on the public multi-modal remote sensing image dataset Potsdam show that compared to training two tasks independently,multi-task learning saves 20%of training time and achieves competitive performance with mIoU of 83.02%for semantic segmentation and accuracy of 95.26%for height estimation.
基金This paper is partly supported by the National Natural Science Foundation of China unde rGrants 61972057 and 62172059Hunan ProvincialNatural Science Foundation of China underGrant 2022JJ30623 and 2019JJ50287Scientific Research Fund of Hunan Provincial Education Department of China under Grant 21A0211 and 19A265。
文摘Prevailing linguistic steganalysis approaches focus on learning sensitive features to distinguish a particular category of steganographic texts from non-steganographic texts,by performing binary classification.While it remains an unsolved problem and poses a significant threat to the security of cyberspace when various categories of non-steganographic or steganographic texts coexist.In this paper,we propose a general linguistic steganalysis framework named LS-MTL,which introduces the idea of multi-task learning to deal with the classification of various categories of steganographic and non-steganographic texts.LS-MTL captures sensitive linguistic features from multiple related linguistic steganalysis tasks and can concurrently handle diverse tasks with a constructed model.In the proposed framework,convolutional neural networks(CNNs)are utilized as private base models to extract sensitive features for each steganalysis task.Besides,a shared CNN is built to capture potential interaction information and share linguistic features among all tasks.Finally,LS-MTL incorporates the private and shared sensitive features to identify the detected text as steganographic or non-steganographic.Experimental results demonstrate that the proposed framework LS-MTL outperforms the baseline in the multi-category linguistic steganalysis task,while average Acc,Pre,and Rec are increased by 0.5%,1.4%,and 0.4%,respectively.More ablation experimental results show that LS-MTL with the shared module has robust generalization capability and achieves good detection performance even in the case of spare data.
基金This work was supported by the National High Technology Research and Development 863 Program of China under Grant No. 2013AA013903, the National Natural Science Foundation of China under Grant No. 61373069, the Research Grant of Beijing Higher Institution Engineering Research Center, and the Tsinghua University Initiative Scientific Research Program.
文摘In this paper, we proposed a multi-task system that can identify dish types, food ingredients, and cooking methods from food images with deep convolutional neural networks. We built up a dataset of 360 classes of different foods with at least 500 images for each class. To reduce the noises of the data, which was collected from the Internet, outlier images were detected and eliminated through a one-class SVM trained with deep convolutional features. We simultaneously trained a dish identifier, a cooking method recognizer, and a multi-label ingredient detector. They share a few low-level layers in the deep network architecture. The proposed framework shows higher accuracy than traditional method with handcrafted features, and the cooking method recognizer and ingredient detector can be applied to dishes which are not included in the training dataset to provide reference information for users.
文摘针对传统课堂考勤中耗时长、效率低等问题,提出了一种基于计算机视觉的考勤系统,利用深度学习进行人脸识别与手机入袋检测,记录学生的到课情况与手机上交情况。为将考勤信息可视化,设计了3种登录模式的综合考勤系统。实验结果表明,该系统不仅能在毫秒级的时间内完成检测,而且平均准确率(mean Average Precision,mAP)0.5达到0.990,保证了精确率和召回率。
文摘The increasing share of renewable energy in the electricity grid and progressing changes in power consumption have led to fluctuating,and weather-dependent power flows.To ensure grid stability,grid operators rely on power forecasts which are crucial for grid calculations and planning.In this paper,a Multi-Task Learning approach is combined with a Graph Neural Network(GNN)to predict vertical power flows at transformers connecting high and extra-high voltage levels.The proposed method accounts for local differences in power flow characteristics by using an Embedding Multi-Task Learning approach.The use of a Bayesian embedding to capture the latent node characteristics allows to share the weights across all transformers in the subsequent node-invariant GNN while still allowing the individual behavioral patterns of the transformers to be distinguished.At the same time,dependencies between transformers are considered by the GNN architecture which can learn relationships between different transformers and thus take into account that power flows in an electricity network are not independent from each other.The effectiveness of the proposed method is demonstrated through evaluation on two real-world data sets provided by two of four German Transmission System Operators,comprising large portions of the operated German transmission grid.The results show that the proposed Multi-Task Graph Neural Network is a suitable representation learner for electricity networks with a clear advantage provided by the preceding embedding layer.It is able to capture interconnections between correlated transformers and indeed improves the performance in power flow prediction compared to standard Neural Networks.A sign test shows that the proposed model reduces the test RMSE on both data sets compared to the benchmark models significantly.
基金Project(61671204)supported by National Natural Science Foundation of ChinaProject(2016WK2001)supported by Hunan Provincial Key R&D Plan,China。
文摘Face anti-spoofing is a relatively important part of the face recognition system,which has great significance for financial payment and access control systems.Aiming at the problems of unstable face alignment,complex lighting,and complex structure of face anti-spoofing detection network,a novel method is presented using a combination of convolutional neural network and brightness equalization.Firstly,multi-task convolutional neural network(MTCNN)based on the cascade of three convolutional neural networks(CNNs),P-net,R-net,and O-net are used to achieve accurate positioning of the face,and the detected face bounding box is cropped by a specified multiple,then brightness equalization is adopted to perform brightness compensation on different brightness areas of the face image.Finally,data features are extracted and classification is given by utilizing a 12-layer convolution neural network.Experiments of the proposed algorithm were carried out on CASIA-FASD.The results show that the classification accuracy is relatively high,and the half total error rate(HTER)reaches 1.02%.