9 articles found
Fine-Grained Pornographic Image Recognition with Multi-Instance Learning
1
Authors: Zhiqiang Wu, Bing Xie. Computer Systems Science & Engineering (SCIE, EI), 2023, No. 10, pp. 299-316 (18 pages)
Images have become an essential medium for expressing meaning and disseminating information. Many images are uploaded to the Internet, some of which are pornographic and harm public psychological health. To create a clean and positive Internet environment, network enforcement agencies need an automatic and efficient pornographic image recognition tool. Previous studies on pornographic images mainly rely on convolutional neural networks (CNNs). Because CNNs have many parameters, they depend on a large labeled training dataset, which is laborious to build. To reduce the effect of the dataset on recognition performance, many researchers treat pornographic image recognition as a binary classification task. In practice, when faced with pornographic images of various features, the performance and recognition accuracy of the network model often decrease. In addition, the pornographic content in an image usually lies in several small local regions that make up only a small proportion of the image. A CNN, as a strongly supervised learning method, usually cannot automatically focus on the pornographic area of the image, which reduces recognition accuracy. This paper establishes an image dataset with seven classes by crawling pornographic websites and the Baidu Image Library. A weakly supervised pornographic image recognition method based on multiple instance learning (MIL) is proposed. The Squeeze-and-Excitation (SE) module is introduced into feature extraction to strengthen critical information and weaken the influence of non-key and useless information on the recognition result. To meet the requirements of the pooling layer in MIL, an attention mechanism is introduced to compute a weighted average of the instances. The experimental results show that the proposed method achieves better accuracy and F1 scores than other methods.
Keywords: deep learning; multi-instance learning; pornographic image; multiclassification; residual network
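The attention-based pooling step mentioned in the abstract above (weighting and averaging the instances of a bag) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `attention_pool`, the toy bag, and the hand-picked scores are all hypothetical, and in a trained model the scores would presumably come from a small learned network rather than being supplied by hand.

```python
import math
from typing import List, Sequence


def attention_pool(instances: Sequence[Sequence[float]],
                   scores: Sequence[float]) -> List[float]:
    """Return the attention-weighted average of the instance vectors in a bag.

    `scores` holds one unnormalised attention score per instance.
    (Assumption: a real model would learn these scores.)
    """
    if len(instances) == 0 or len(instances) != len(scores):
        raise ValueError("need a non-empty bag with one score per instance")
    # Softmax over the scores; subtracting the max keeps exp() stable.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Weighted sum over instances, computed per feature dimension.
    dim = len(instances[0])
    return [sum(w * inst[d] for w, inst in zip(weights, instances))
            for d in range(dim)]


# Hypothetical bag of three 2-dimensional instance features.
bag = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
uniform = attention_pool(bag, [0.0, 0.0, 0.0])   # equal scores: plain mean
focused = attention_pool(bag, [0.0, 0.0, 50.0])  # third instance dominates
```

With equal scores the pooling reduces to mean pooling; a very large score on one instance makes the bag representation approach that instance, which is how such a layer can concentrate on a small informative region.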
Formal Modeling and Discovery of Multi-instance Business Processes: A Cloud Resource Management Case Study (cited by: 1)
2
Author: Cong Liu. IEEE/CAA Journal of Automatica Sinica (SCIE, EI, CSCD), 2022, No. 12, pp. 2151-2160 (10 pages)
Process discovery, as one of the most challenging process analysis techniques, aims to uncover business process models from event logs. Many process discovery approaches have been proposed in the past twenty years; however, most of them have difficulty handling multi-instance sub-processes. To address this challenge, we first introduce a multi-instance business process model (MBPM) to support the modeling of processes with multiple sub-process instantiations. The formal semantics of MBPMs are precisely defined using multi-instance Petri nets (MPNs), an extension of Petri nets with distinguishable tokens. Then, a novel process discovery technique is developed to support the discovery of MBPMs from event logs with sub-process multi-instantiation information. In addition, we propose to measure the quality of the discovered MBPMs against the input event logs by transforming an MBPM into a classical Petri net so that existing quality metrics, e.g., fitness and precision, can be used. The proposed discovery approach is implemented as plugins in the ProM toolkit. Based on a cloud resource management case study, we compare our approach with state-of-the-art process discovery techniques. The results demonstrate that our approach outperforms existing approaches in discovering process models with multi-instance sub-processes.
Keywords: cloud resource management process; multi-instance Petri nets (MPNs); multi-instance sub-processes; process discovery; quality evaluation
Multi-Instance Learning from Supervised View (cited by: 11)
3
Author: Zhou Zhihua (周志华). Journal of Computer Science & Technology (SCIE, EI, CSCD), 2006, No. 5, pp. 800-809 (10 pages)
In multi-instance learning, the training set comprises labeled bags that are composed of unlabeled instances, and the task is to predict the labels of unseen bags. This paper studies multi-instance learning from the view of supervised learning. First, by analyzing some representative learning algorithms, this paper shows that multi-instance learners can be derived from supervised learners by shifting their focus from discrimination on the instances to discrimination on the bags. Second, considering that ensemble learning paradigms can effectively enhance supervised learners, this paper proposes to build multi-instance ensembles to solve multi-instance problems. Experiments on a real-world benchmark test show that ensemble learning paradigms can significantly enhance multi-instance learners.
Keywords: machine learning; multi-instance learning; supervised learning; ensemble learning; multi-instance ensemble
MICkNN: Multi-Instance Covering kNN Algorithm (cited by: 6)
4
Authors: Shu Zhao, Chen Rui, Yanping Zhang. Tsinghua Science and Technology (SCIE, EI, CAS), 2013, No. 4, pp. 360-368 (9 pages)
Mining from ambiguous data is very important in data mining. This paper discusses one of the tasks for mining from ambiguous data, known as the multi-instance problem. In the multi-instance problem, each pattern is a labeled bag that consists of a number of unlabeled instances. A bag is negative if all instances in it are negative. A bag is positive if it has at least one positive instance. Because the instances in a positive bag are not labeled, each positive bag is ambiguous. The mining aim is to classify unseen bags. The main idea of existing multi-instance algorithms is to find the true positive instances in positive bags, convert the multi-instance problem into a supervised problem, and obtain the labels of test bags by predicting the labels of unknown instances. In this paper, we aim at mining multi-instance data from another point of view, i.e., excluding the false positive instances in positive bags and predicting the label of an entire unknown bag. We propose an algorithm called Multi-Instance Covering kNN (MICkNN) for mining from multi-instance data. Briefly, the constructive covering algorithm is first used to restructure the original multi-instance data. Then, the kNN algorithm is applied to discriminate the false positive instances. In the test stage, we label the tested bag directly according to the similarity between the unseen bag and the sphere neighbors obtained from the previous two steps. Experimental results demonstrate that the proposed algorithm is competitive with most state-of-the-art multi-instance methods in both classification accuracy and running time.
Keywords: mining ambiguous data; multi-instance classification; constructive covering algorithm; kNN algorithm
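The bag-labeling rule stated in the MICkNN abstract above (a bag is negative if all its instances are negative, positive if at least one is positive) can be written down in a few lines. The sketch below only illustrates that rule and why positive bags are ambiguous; it does not implement the covering or kNN steps of MICkNN, and the names `bag_label` and `consistent_labelings` are hypothetical.

```python
from typing import Sequence


def bag_label(instance_labels: Sequence[bool]) -> bool:
    """Standard multi-instance rule: positive iff any instance is positive."""
    return any(instance_labels)


def consistent_labelings(bag_size: int, bag_is_positive: bool) -> int:
    """Count the hidden instance labelings compatible with a bag label.

    A negative bag admits exactly one labeling (all instances negative).
    A positive bag admits every labeling except the all-negative one,
    which is why its instance labels are ambiguous.
    """
    if bag_size < 1:
        raise ValueError("a bag must contain at least one instance")
    return 2 ** bag_size - 1 if bag_is_positive else 1


positive_bag = bag_label([False, True, False])
negative_bag = bag_label([False, False])
ambiguity = consistent_labelings(3, bag_is_positive=True)
```

The count grows exponentially with bag size, which gives a rough sense of why algorithms either search for the true positive instances or, as this abstract describes, try to exclude the false positives instead.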
Improving iris recognition performance via multi-instance fusion at the score level
5
Authors: Wang Fenghua (王风华), Yao Xianghua (姚向华), Han Jiuqiang (韩九强). Chinese Optics Letters (SCIE, EI, CAS, CSCD), 2008, No. 11, pp. 824-826 (3 pages)
Fusion of multiple instances within a modality to improve biometric verification performance has received considerable attention. In this letter, we present an iris recognition method based on multi-instance fusion, which combines the left and right irises of an individual at the matching score level. When fusing, a novel fusion strategy using the minimax probability machine (MPM) is applied to generate a fused score for the final decision. The experimental results on the CASIA and UBIRIS databases show that the proposed method brings an obvious performance improvement compared with the single-instance method. The comparison among different fusion strategies demonstrates the superiority of the fusion strategy based on MPM.
Keywords: FRR; EER; Improving iris recognition performance via multi-instance fusion at the score level; MPM
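As a rough illustration of what fusing the left and right irises "at the matching score level" means, the sketch below combines two matching scores with a simple weighted-sum rule. This is a deliberately simpler stand-in and not the MPM-based strategy the abstract describes. The function names, the assumption that scores are similarities in [0, 1] with higher meaning a better match, and the threshold value are all hypothetical.

```python
def fuse_scores(left: float, right: float, w_left: float = 0.5) -> float:
    """Weighted-sum fusion of the left-iris and right-iris matching scores.

    Assumption: both scores are similarities in [0, 1], higher is better.
    This sum rule is a baseline, not the MPM strategy from the abstract.
    """
    if not 0.0 <= w_left <= 1.0:
        raise ValueError("w_left must lie in [0, 1]")
    return w_left * left + (1.0 - w_left) * right


def accept(left: float, right: float, threshold: float = 0.6) -> bool:
    """Accept the identity claim when the fused score reaches the threshold."""
    return fuse_scores(left, right) >= threshold


fused = fuse_scores(0.9, 0.5)    # one strong and one weak match
genuine = accept(0.9, 0.5)       # fused score clears the threshold
impostor = accept(0.2, 0.3)      # fused score stays below the threshold
```

The design point is that the decision is made once, on the fused score, rather than separately per eye. A learned fusion rule such as the one in the abstract replaces the fixed weights with a trained decision function.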
Prediction of Protein-Protein Interactions by a Novel Model Based on Domain Information
6
Authors: Dong Lulu (董露露), Xie Fei (谢飞), Zhang Cheng (章程), Li Bin (李斌). Journal of Donghua University (English Edition) (EI, CAS), 2018, No. 2, pp. 163-169 (7 pages)
Domain-based prediction of protein-protein interactions (PPIs) is a problem that has drawn the attention of many researchers in recent years, and it has been studied using many computational approaches from many different perspectives. Existing domain-based methods to predict PPIs typically infer domain interactions from known interacting sets of proteins. However, these methods are costly and complex to implement. In this paper, a simple and effective prediction model is proposed. In this model, an improved multi-instance learning (MIL) algorithm (MilCaA) is designed that does not need to take the domain interactions into consideration to construct MIL bags. Then, the pseudo-amino acid composition (PseAAC) transformation method is used to encode the instances in a multi-instance bag, and principal component analysis (PCA) is used to reduce the feature dimension. Finally, several traditional machine learning and MIL methods are used to verify the proposed model. Experimental results demonstrate that MilCaA performs better than state-of-the-art techniques, including the traditional machine learning methods that are widely used in PPI prediction.
Keywords: domain-based protein-protein interactions (PPIs); multi-instance learning; amino acid composition (AAC); pseudo-amino acid composition (PseAAC)
Data Augmentation Based Event Detection
7
Authors: Ding Xiangwu (丁祥武), Ding Jingjing (丁晶晶), Qin Yanxia (秦彦霞). Journal of Donghua University (English Edition) (CAS), 2021, No. 6, pp. 511-518 (8 pages)
Supervised models for event detection, especially neural models, usually require large-scale human-annotated training data. A data augmentation technique is proposed to improve the performance of event detection by generating paraphrase sentences to enrich the expressions of the original data. Specifically, based on an existing human-annotated event detection dataset, we first automatically build a paraphrase dataset and label it with a designed event annotation alignment algorithm. To alleviate possible wrong labels in the generated paraphrase dataset, a multi-instance learning (MIL) method is adopted for joint training on both the gold human-annotated data and the generated paraphrase dataset. Experimental results on the widely used ACE2005 dataset show the effectiveness of our approach.
Keywords: event detection; data augmentation; back translation; annotation alignment algorithm; multi-instance learning (MIL)
Multi-task MIML learning for pre-course student performance prediction (cited by: 1)
8
Authors: Yuling Ma, Chaoran Cui, Jun Yu, Jie Guo, Gongping Yang, Yilong Yin. Frontiers of Computer Science (SCIE, EI, CSCD), 2020, No. 5, pp. 113-121 (9 pages)
In higher education, the initial studying period of each course plays a crucial role for students and strongly influences their subsequent learning activities. However, given the large number of students in a university course, it has become impossible for teachers to keep track of the performance of individual students. In this circumstance, an academic early warning system is desirable, one that automatically detects students with learning difficulties (i.e., at-risk students) before a course starts. However, previous studies are not well suited to this purpose for two reasons: 1) they have mainly concentrated on e-learning platforms, e.g., massive open online courses (MOOCs), and relied on data about students' online activities, which are rarely available in traditional teaching scenarios; and 2) they have only made performance predictions when a course is in progress or even close to the end. In this paper, for traditional classroom-teaching scenarios, we investigate the task of pre-course student performance prediction, which refers to detecting at-risk students for each course before its commencement. To better represent a student sample and utilize the correlations among courses, we cast the problem as a multi-instance multi-label (MIML) problem. Besides, given the problem of data scarcity, we propose a novel multi-task learning method, i.e., MIML-Circle, to predict the performance of students from different specialties in a unified framework. Extensive experiments are conducted on five real-world datasets, and the results demonstrate the superiority of our approach over state-of-the-art methods.
Keywords: educational data mining; academic early warning system; student performance prediction; multi-instance multi-label learning; multi-task learning
A Semi-Supervised Attention Model for Identifying Authentic Sneakers (cited by: 1)
9
Authors: Yang Yang, Nengjun Zhu, Yifeng Wu, Jian Cao, Dechuan Zhan, Hui Xiong. Big Data Mining and Analytics, 2020, No. 1, pp. 29-40 (12 pages)
To protect consumers and those who manufacture and sell the products they enjoy, it is important to develop convenient tools to help consumers distinguish an authentic product from a counterfeit one. The advancement of deep learning techniques for fine-grained object recognition creates new possibilities for genuine product identification. In this paper, we develop a Semi-Supervised Attention (SSA) model to work in conjunction with a large-scale multiple-source dataset named YSneaker, which consists of sneakers from various brands and their authentication results, to identify authentic sneakers. Specifically, the SSA model has a self-attention structure for different images of a labeled sneaker, and a novel prototypical loss is designed to exploit unlabeled data within the data structure. The model draws on the weighted average of the output feature representations, where the weights are determined by an additional shallow neural network. This allows the SSA model to focus on the most important images of a sneaker for use in identification. A unique feature of the SSA model is its ability to take advantage of unlabeled data, which can help to further minimize the intra-class variation for more discriminative feature embedding. To validate the model, we collect a large number of labeled and unlabeled sneaker images and perform extensive experimental studies. The results show that YSneaker together with the proposed SSA architecture can identify authentic sneakers with a high accuracy rate.
Keywords: sneaker identification; fine-grained classification; multi-instance learning; attention mechanism