Active learning(AL)trains a high-precision predictor model from small numbers of labeled data by iteratively annotating the most valuable data sample from an unlabeled data pool with a class label throughout the learn...Active learning(AL)trains a high-precision predictor model from small numbers of labeled data by iteratively annotating the most valuable data sample from an unlabeled data pool with a class label throughout the learning process.However,most current AL methods start with the premise that the labels queried at AL rounds must be free of ambiguity,which may be unrealistic in some real-world applications where only a set of candidate labels can be obtained for selected data.Besides,most of the existing AL algorithms only consider the case of centralized processing,which necessitates gathering together all the unlabeled data in one fusion center for selection.Considering that data are collected/stored at different nodes over a network in many real-world scenarios,distributed processing is chosen here.In this paper,the issue of distributed classification of partially labeled(PL)data obtained by a fully decentralized AL method is focused on,and a distributed active partial label learning(dAPLL)algorithm is proposed.Our proposed algorithm is composed of a fully decentralized sample selection strategy and a distributed partial label learning(PLL)algorithm.During the sample selection process,both the uncertainty and representativeness of the data are measured based on the global cluster centers obtained by a distributed clustering method,and the valuable samples are chosen in turn.Meanwhile,using the disambiguation-free strategy,a series of binary classification problems can be constructed,and the corresponding cost-sensitive classifiers can be cooperatively trained in a distributed manner.The experiment results conducted on several datasets demonstrate that the performance of the dAPLL algorithm is comparable to that of the corresponding centralized method and is superior to the existing active PLL(APLL)method in different parameter configurations.Besides,our proposed algorithm outperforms several current PLL methods using the random selection strategy,especially when only small amounts of data are selected to be assigned with the candidate labels.展开更多
Multimodal sentiment analysis utilizes multimodal data such as text,facial expressions and voice to detect people’s attitudes.With the advent of distributed data collection and annotation,we can easily obtain and sha...Multimodal sentiment analysis utilizes multimodal data such as text,facial expressions and voice to detect people’s attitudes.With the advent of distributed data collection and annotation,we can easily obtain and share such multimodal data.However,due to professional discrepancies among annotators and lax quality control,noisy labels might be introduced.Recent research suggests that deep neural networks(DNNs)will overfit noisy labels,leading to the poor performance of the DNNs.To address this challenging problem,we present a Multimodal Robust Meta Learning framework(MRML)for multimodal sentiment analysis to resist noisy labels and correlate distinct modalities simultaneously.Specifically,we propose a two-layer fusion net to deeply fuse different modalities and improve the quality of the multimodal data features for label correction and network training.Besides,a multiple meta-learner(label corrector)strategy is proposed to enhance the label correction approach and prevent models from overfitting to noisy labels.We conducted experiments on three popular multimodal datasets to verify the superiority of ourmethod by comparing it with four baselines.展开更多
为完善基于先验知识的标记增强方法对于情绪信息的捕捉,提出一种基于语义规则增强的蒙古语情感分布学习方法(semantic rule enhancement based Mongolian emotion distribution learning, SRE-MEDL)。在情感轮和情感词典的基础上,引入...为完善基于先验知识的标记增强方法对于情绪信息的捕捉,提出一种基于语义规则增强的蒙古语情感分布学习方法(semantic rule enhancement based Mongolian emotion distribution learning, SRE-MEDL)。在情感轮和情感词典的基础上,引入程度词典和否定词典,得到各种情感词组合,以此制定相应的语义规则计算情感词权重,将其融入到标记增强中。在情感分布学习中融入从情感分布空间到实例特征空间的反向重构映射来弥补正向映射引起的原始信息丢失问题。对比实验结果显示,在蒙古语和中英文常用数据集上,SRE-MEDL方法在标记增强任务和情感分布学习中的表现均优于现有方法。展开更多
Label distribution learning(LDL)is a new learning paradigm to deal with label ambiguity and many researches have achieved the prominent performances.Compared with traditional supervised learning scenarios,the annotati...Label distribution learning(LDL)is a new learning paradigm to deal with label ambiguity and many researches have achieved the prominent performances.Compared with traditional supervised learning scenarios,the annotation with label distribution is more expensive.Direct use of existing active learning(AL)approaches,which aim to reduce the annotation cost in traditional learning,may lead to the degradation of their performance.To deal with the problem of high annotation cost in LDL,we propose the active label distribution learning via kernel maximum mean discrepancy(ALDL-kMMD)method to tackle this crucial but rarely studied problem.ALDL-kMMD captures the structural information of both data and label,extracts the most representative instances from the unlabeled ones by incorporating the nonlinear model and marginal probability distribution matching.Besides,it is also able to markedly decrease the amount of queried unlabeled instances.Meanwhile,an effective solution is proposed for the original optimization problem of ALDL-kMMD by constructing auxiliary variables.The effectiveness of our method is validated with experiments on the real-world datasets.展开更多
Recently,segmentation-based scene text detection has drawn a wide research interest due to its flexibility in describing scene text instance of arbitrary shapes such as curved texts.However,existing methods usually ne...Recently,segmentation-based scene text detection has drawn a wide research interest due to its flexibility in describing scene text instance of arbitrary shapes such as curved texts.However,existing methods usually need complex post-processing stages to process ambiguous labels,i.e.,the labels of the pixels near the text boundary,which may belong to the text or background.In this paper,we present a framework for segmentation-based scene text detection by learning from ambiguous labels.We use the label distribution learning method to process the label ambiguity of text annotation,which achieves a good performance without using additional post-processing stage.Experiments on benchmark datasets demonstrate that our method produces better results than state-of-the-art methods for segmentation-based scene text detection.展开更多
When utilizing the deep learning models in some real applications,the distribution of the labels in the environment can be used to increase the accuracy.Generally,to compute this distribution,there should be the valid...When utilizing the deep learning models in some real applications,the distribution of the labels in the environment can be used to increase the accuracy.Generally,to compute this distribution,there should be the validation set that is labeled by the ground truths.On the other side,the dependency of ground truths limits the utilization of the distribution in various environments.In this paper,we carried out a novel system for the deep learning-based classification to solve this problem.Firstly,our system only uses one validation set with ground truths to compute some hyper parameters,which is named as one-shot guidance.Secondly,in an environment,our system builds the validation set and labels this by the prediction results,which does not need any guidance by the ground truths.Thirdly,the computed distribution of labels by the validation set selectively cooperates with the probability of labels by the output of models,which is to increase the accuracy of predict results on testing samples.We selected six popular deep learning models on three real datasets for the evaluation.The experimental results show that our system can achieve higher accuracy than state-of-art methods while reducing the dependency of labeled validation set.展开更多
标记分布学习(label distribution learning,LDL)是一种用于解决标记多义性的新颖学习范式。现有的LDL方法大多基于完整数据信息进行设计,然而由于高昂的标注成本以及标注人员水平的局限性,很难获取到完整标注数据信息,且会导致传统LDL...标记分布学习(label distribution learning,LDL)是一种用于解决标记多义性的新颖学习范式。现有的LDL方法大多基于完整数据信息进行设计,然而由于高昂的标注成本以及标注人员水平的局限性,很难获取到完整标注数据信息,且会导致传统LDL算法性能的下降。为此,本文提出了一种新型的结合局部序标记关系的弱监督标记分布学习算法,通过维持尚未缺失标记之间的相对关系,并利用标记相关性来恢复缺失的标记,在数据标注不完整的情况下提升算法性能。在14个数据集上进行了大量的实验来验证算法的有效性。展开更多
In this paper, we tackle the problem of segmenting out a sequence of actions from videos. The videos contain background and actions which are usually composed of ordered sub-actions. We refer the sub-actions and the b...In this paper, we tackle the problem of segmenting out a sequence of actions from videos. The videos contain background and actions which are usually composed of ordered sub-actions. We refer the sub-actions and the background as semantic units. Considering the possible overlap between two adjacent semantic units, we propose a bidirectional sliding window method to generate the label distributions for various segments in the video. The label distribution covers a certain number of semantic unit labels, representing the degree to which each label describes the video segment. The mapping from a video segment to its label distribution is then learned by a Label Distribution Learning (LDL) algorithm. Based on the LDL model, a soft video parsing method with segmental regular grammars is proposed to construct a tree structure for the video. Each leaf of the tree stands for a video clip of background or sub-action. The proposed method shows promising results on the THUMOST4, MSR-II and UCF101 datasets and its computational complexity is much less than the compared state-of-the-art video parsing method.展开更多
Age estimation plays an important role in human-computer interaction system.The lack of large number of facial images with definite age label makes age estimation al-gorithms inefficient.Deep label distribution learni...Age estimation plays an important role in human-computer interaction system.The lack of large number of facial images with definite age label makes age estimation al-gorithms inefficient.Deep label distribution learning(DLDL)which employs convolutional neural networks(CNN)and label distribution learning to learn ambiguity from ground-truth age and adjacent ages,has been proven to outperform current state-of-the-art framework.However,DLDL assumes a rough label distribution which covers all ages for any given age label.In this paper,a more practical label distribution paradigm is proposed:we limit age label distribution that only covers a reasonable number of neighboring ages.In addition,we explore different label distributions to improve the performance of the proposed learning model.We employ CNN and the improved label distribution learning to estimate age.Experimental results show that compared to the DLDL,our method is more effective for facial age recognition.展开更多
Multi-label learning is an effective framework for learning with objects that have multiple semantic labels, and has been successfully applied into many real-world tasks, In contrast with traditional single-label lear...Multi-label learning is an effective framework for learning with objects that have multiple semantic labels, and has been successfully applied into many real-world tasks, In contrast with traditional single-label learning, the cost of la- beling a multi-label example is rather high, thus it becomes an important task to train an effective multi-label learning model with as few labeled examples as possible. Active learning, which actively selects the most valuable data to query their labels, is the most important approach to reduce labeling cost. In this paper, we propose a novel approach MADM for batch mode multi-label active learning. On one hand, MADM exploits representativeness and diversity in both the feature and label space by matching the distribution between labeled and unlabeled data. On the other hand, it tends to query predicted positive instances, which are expected to be more informative than negative ones. Experiments on benchmark datasets demonstrate that the proposed approach can reduce the labeling cost significantly.展开更多
基金supported by the National Natural Science Foundation of China(62201398)Natural Science Foundation of Zhejiang Province(LY21F020001),Science and Technology Plan Project of Wenzhou(ZG2020026).
文摘Active learning(AL)trains a high-precision predictor model from small numbers of labeled data by iteratively annotating the most valuable data sample from an unlabeled data pool with a class label throughout the learning process.However,most current AL methods start with the premise that the labels queried at AL rounds must be free of ambiguity,which may be unrealistic in some real-world applications where only a set of candidate labels can be obtained for selected data.Besides,most of the existing AL algorithms only consider the case of centralized processing,which necessitates gathering together all the unlabeled data in one fusion center for selection.Considering that data are collected/stored at different nodes over a network in many real-world scenarios,distributed processing is chosen here.In this paper,the issue of distributed classification of partially labeled(PL)data obtained by a fully decentralized AL method is focused on,and a distributed active partial label learning(dAPLL)algorithm is proposed.Our proposed algorithm is composed of a fully decentralized sample selection strategy and a distributed partial label learning(PLL)algorithm.During the sample selection process,both the uncertainty and representativeness of the data are measured based on the global cluster centers obtained by a distributed clustering method,and the valuable samples are chosen in turn.Meanwhile,using the disambiguation-free strategy,a series of binary classification problems can be constructed,and the corresponding cost-sensitive classifiers can be cooperatively trained in a distributed manner.The experiment results conducted on several datasets demonstrate that the performance of the dAPLL algorithm is comparable to that of the corresponding centralized method and is superior to the existing active PLL(APLL)method in different parameter configurations.Besides,our proposed algorithm outperforms several current PLL methods using the random selection strategy,especially when only small amounts of data are selected to be assigned with the candidate labels.
基金supported by STI 2030-Major Projects 2021ZD0200400National Natural Science Foundation of China(62276233 and 62072405)Key Research Project of Zhejiang Province(2023C01048).
文摘Multimodal sentiment analysis utilizes multimodal data such as text,facial expressions and voice to detect people’s attitudes.With the advent of distributed data collection and annotation,we can easily obtain and share such multimodal data.However,due to professional discrepancies among annotators and lax quality control,noisy labels might be introduced.Recent research suggests that deep neural networks(DNNs)will overfit noisy labels,leading to the poor performance of the DNNs.To address this challenging problem,we present a Multimodal Robust Meta Learning framework(MRML)for multimodal sentiment analysis to resist noisy labels and correlate distinct modalities simultaneously.Specifically,we propose a two-layer fusion net to deeply fuse different modalities and improve the quality of the multimodal data features for label correction and network training.Besides,a multiple meta-learner(label corrector)strategy is proposed to enhance the label correction approach and prevent models from overfitting to noisy labels.We conducted experiments on three popular multimodal datasets to verify the superiority of ourmethod by comparing it with four baselines.
文摘为完善基于先验知识的标记增强方法对于情绪信息的捕捉,提出一种基于语义规则增强的蒙古语情感分布学习方法(semantic rule enhancement based Mongolian emotion distribution learning, SRE-MEDL)。在情感轮和情感词典的基础上,引入程度词典和否定词典,得到各种情感词组合,以此制定相应的语义规则计算情感词权重,将其融入到标记增强中。在情感分布学习中融入从情感分布空间到实例特征空间的反向重构映射来弥补正向映射引起的原始信息丢失问题。对比实验结果显示,在蒙古语和中英文常用数据集上,SRE-MEDL方法在标记增强任务和情感分布学习中的表现均优于现有方法。
基金partially supported by the National Natural Science Fundation of China(Grant Nos.61922087,61906201 and 62006238)the Science and Technology Innovation Program of Hunan Province(2021RC3070).
文摘Label distribution learning(LDL)is a new learning paradigm to deal with label ambiguity and many researches have achieved the prominent performances.Compared with traditional supervised learning scenarios,the annotation with label distribution is more expensive.Direct use of existing active learning(AL)approaches,which aim to reduce the annotation cost in traditional learning,may lead to the degradation of their performance.To deal with the problem of high annotation cost in LDL,we propose the active label distribution learning via kernel maximum mean discrepancy(ALDL-kMMD)method to tackle this crucial but rarely studied problem.ALDL-kMMD captures the structural information of both data and label,extracts the most representative instances from the unlabeled ones by incorporating the nonlinear model and marginal probability distribution matching.Besides,it is also able to markedly decrease the amount of queried unlabeled instances.Meanwhile,an effective solution is proposed for the original optimization problem of ALDL-kMMD by constructing auxiliary variables.The effectiveness of our method is validated with experiments on the real-world datasets.
基金supported by the National Key R&D Program of China(2018AAA0100104,2018AAA0100100)the National Natural Science Foundation of China(Grant No.61702095)the Natural Science Foundation of Jiangsu Province(BK20211164).
文摘Recently,segmentation-based scene text detection has drawn a wide research interest due to its flexibility in describing scene text instance of arbitrary shapes such as curved texts.However,existing methods usually need complex post-processing stages to process ambiguous labels,i.e.,the labels of the pixels near the text boundary,which may belong to the text or background.In this paper,we present a framework for segmentation-based scene text detection by learning from ambiguous labels.We use the label distribution learning method to process the label ambiguity of text annotation,which achieves a good performance without using additional post-processing stage.Experiments on benchmark datasets demonstrate that our method produces better results than state-of-the-art methods for segmentation-based scene text detection.
基金NationalNatural Science Foundation of China(GrantNos.61802279,6180021345,61702281,and 61702366)Natural Science Foundation of Tianjin(Grant Nos.18JCQNJC70300,19JCTPJC49200,19PTZWHZ00020,and 19JCYBJC15800)+2 种基金Fundamental Research Funds for the Tianjin Universities(Grant No.2019KJ019)the Tianjin Science and Technology Program(Grant No.19PTZWHZ00020)and in part by the State Key Laboratory of ASIC and System(Grant No.2021KF014)Tianjin Educational Commission Scientific Research Program Project(Grant Nos.2020KJ112 and 2018KJ215).
文摘When utilizing the deep learning models in some real applications,the distribution of the labels in the environment can be used to increase the accuracy.Generally,to compute this distribution,there should be the validation set that is labeled by the ground truths.On the other side,the dependency of ground truths limits the utilization of the distribution in various environments.In this paper,we carried out a novel system for the deep learning-based classification to solve this problem.Firstly,our system only uses one validation set with ground truths to compute some hyper parameters,which is named as one-shot guidance.Secondly,in an environment,our system builds the validation set and labels this by the prediction results,which does not need any guidance by the ground truths.Thirdly,the computed distribution of labels by the validation set selectively cooperates with the probability of labels by the output of models,which is to increase the accuracy of predict results on testing samples.We selected six popular deep learning models on three real datasets for the evaluation.The experimental results show that our system can achieve higher accuracy than state-of-art methods while reducing the dependency of labeled validation set.
文摘标记分布学习(label distribution learning,LDL)是一种用于解决标记多义性的新颖学习范式。现有的LDL方法大多基于完整数据信息进行设计,然而由于高昂的标注成本以及标注人员水平的局限性,很难获取到完整标注数据信息,且会导致传统LDL算法性能的下降。为此,本文提出了一种新型的结合局部序标记关系的弱监督标记分布学习算法,通过维持尚未缺失标记之间的相对关系,并利用标记相关性来恢复缺失的标记,在数据标注不完整的情况下提升算法性能。在14个数据集上进行了大量的实验来验证算法的有效性。
基金the National Key Research & Development Plan of China (2017YFB1002801)the National Science Foundation of China (61622203, 61232007)the Jiangsu Natural Science Funds for Distinguished Young Scholar (BK20140022).
文摘In this paper, we tackle the problem of segmenting out a sequence of actions from videos. The videos contain background and actions which are usually composed of ordered sub-actions. We refer the sub-actions and the background as semantic units. Considering the possible overlap between two adjacent semantic units, we propose a bidirectional sliding window method to generate the label distributions for various segments in the video. The label distribution covers a certain number of semantic unit labels, representing the degree to which each label describes the video segment. The mapping from a video segment to its label distribution is then learned by a Label Distribution Learning (LDL) algorithm. Based on the LDL model, a soft video parsing method with segmental regular grammars is proposed to construct a tree structure for the video. Each leaf of the tree stands for a video clip of background or sub-action. The proposed method shows promising results on the THUMOST4, MSR-II and UCF101 datasets and its computational complexity is much less than the compared state-of-the-art video parsing method.
基金the financial support of the China National Natural Science Foundation(61702095)Natural Science Founda-tion(njpj2018209)of Nanjing Tech University Pujiang Institute,Anhui Polytechnic University Scientific Research Foundation(S031702004)+1 种基金Natural Science Foundation of Fujian Province(2018J01806)Scientific Research Pro-gram of Outstanding Talents in Universities of Fujian。
文摘Age estimation plays an important role in human-computer interaction system.The lack of large number of facial images with definite age label makes age estimation al-gorithms inefficient.Deep label distribution learning(DLDL)which employs convolutional neural networks(CNN)and label distribution learning to learn ambiguity from ground-truth age and adjacent ages,has been proven to outperform current state-of-the-art framework.However,DLDL assumes a rough label distribution which covers all ages for any given age label.In this paper,a more practical label distribution paradigm is proposed:we limit age label distribution that only covers a reasonable number of neighboring ages.In addition,we explore different label distributions to improve the performance of the proposed learning model.We employ CNN and the improved label distribution learning to estimate age.Experimental results show that compared to the DLDL,our method is more effective for facial age recognition.
文摘Multi-label learning is an effective framework for learning with objects that have multiple semantic labels, and has been successfully applied into many real-world tasks, In contrast with traditional single-label learning, the cost of la- beling a multi-label example is rather high, thus it becomes an important task to train an effective multi-label learning model with as few labeled examples as possible. Active learning, which actively selects the most valuable data to query their labels, is the most important approach to reduce labeling cost. In this paper, we propose a novel approach MADM for batch mode multi-label active learning. On one hand, MADM exploits representativeness and diversity in both the feature and label space by matching the distribution between labeled and unlabeled data. On the other hand, it tends to query predicted positive instances, which are expected to be more informative than negative ones. Experiments on benchmark datasets demonstrate that the proposed approach can reduce the labeling cost significantly.