In recent years,multimedia annotation problem has been attracting significant research attention in multimedia and computer vision areas,especially for automatic image annotation,whose purpose is to provide an efficie...In recent years,multimedia annotation problem has been attracting significant research attention in multimedia and computer vision areas,especially for automatic image annotation,whose purpose is to provide an efficient and effective searching environment for users to query their images more easily. In this paper,a semi-supervised learning based probabilistic latent semantic analysis( PLSA) model for automatic image annotation is presenred. Since it's often hard to obtain or create labeled images in large quantities while unlabeled ones are easier to collect,a transductive support vector machine( TSVM) is exploited to enhance the quality of the training image data. Then,different image features with different magnitudes will result in different performance for automatic image annotation. To this end,a Gaussian normalization method is utilized to normalize different features extracted from effective image regions segmented by the normalized cuts algorithm so as to reserve the intrinsic content of images as complete as possible. Finally,a PLSA model with asymmetric modalities is constructed based on the expectation maximization( EM) algorithm to predict a candidate set of annotations with confidence scores. Extensive experiments on the general-purpose Corel5k dataset demonstrate that the proposed model can significantly improve performance of traditional PLSA for the task of automatic image annotation.展开更多
A novel image auto-annotation method is presented based on probabilistic latent semantic analysis(PLSA) model and multiple Markov random fields(MRF).A PLSA model with asymmetric modalities is first constructed to esti...A novel image auto-annotation method is presented based on probabilistic latent semantic analysis(PLSA) model and multiple Markov random fields(MRF).A PLSA model with asymmetric modalities is first constructed to estimate the joint probability between images and semantic concepts,then a subgraph is extracted served as the corresponding structure of Markov random fields and inference over it is performed by the iterative conditional modes so as to capture the final annotation for the image.The novelty of our method mainly lies in two aspects:exploiting PLSA to estimate the joint probability between images and semantic concepts as well as multiple MRF to further explore the semantic context among keywords for accurate image annotation.To demonstrate the effectiveness of this approach,an experiment on the Corel5 k dataset is conducted and its results are compared favorably with the current state-of-the-art approaches.展开更多
Automatic image annotation(AIA)has become an important and challenging problem in computer vision due to the existence of semantic gap.In this paper,a novel support vector machine with mixture of kernels(SVM-MK)for au...Automatic image annotation(AIA)has become an important and challenging problem in computer vision due to the existence of semantic gap.In this paper,a novel support vector machine with mixture of kernels(SVM-MK)for automatic image annotation is proposed.On one hand,the combined global and local block-based image features are extracted in order to reflect the intrinsic content of images as complete as possible.On the other hand,SVM-MK is constructed to shoot for better annotating performance.Experimental results on Corel dataset show that the proposed image feature representation method as well as automatic image annotation classifier,SVM-MK,can achieve higher annotating accuracy than SVM with any single kernel and mi-SVM for semantic image annotation.展开更多
This paper presents a new method for refining image annotation by integrating probabilistic la- tent semantic analysis (PLSA) with conditional random field (CRF). First a PLSA model with asymmetric modalities is c...This paper presents a new method for refining image annotation by integrating probabilistic la- tent semantic analysis (PLSA) with conditional random field (CRF). First a PLSA model with asymmetric modalities is constructed to predict a candidate set of annotations with confidence scores, and then model semantic relationship among the candidate annotations by leveraging conditional ran- dom field. In CRF, the confidence scores generated lay the PLSA model and the Fliekr distance be- tween pairwise candidate annotations are considered as local evidences and contextual potentials re- spectively. The novelty of our method mainly lies in two aspects : exploiting PLSA to predict a candi- date set of annotations with confidence scores as well as CRF to further explore the semantic context among candidate annotations for precise image annotation. To demonstrate the effectiveness of the method proposed in this paper, an experiment is conducted on the standard Corel dataset and its re- sults are 'compared favorably with several state-of-the-art approaches.展开更多
In industry,it is becoming common to detect and recognize industrial workpieces using deep learning methods.In this field,the lack of datasets is a big problem,and collecting and annotating datasets in this field is v...In industry,it is becoming common to detect and recognize industrial workpieces using deep learning methods.In this field,the lack of datasets is a big problem,and collecting and annotating datasets in this field is very labor intensive.The researchers need to perform dataset annotation if a dataset is generated by themselves.It is also one of the restrictive factors that the current method based on deep learning cannot expand well.At present,there are very few workpiece datasets for industrial fields,and the existing datasets are generated from ideal workpiece computer aided design(CAD)models,for which few actual workpiece images were collected and utilized.We propose an automatic industrial workpiece dataset generation method and an automatic ground truth annotation method.Included in our methods are three algorithms that we proposed:a point cloud based spatial plane segmentation algorithm to segment the workpieces in the real scene and to obtain the annotation information of the workpieces in the images captured in the real scene;a random multiple workpiece generation algorithm to generate abundant composition datasets with random rotation workpiece angles and positions;and a tangent vector based contour tracking and completion algorithm to get improved contour images.With our procedures,annotation information can be obtained using the algorithms proposed in this paper.Upon completion of the annotation process,a json format file is generated.Faster R-CNN(Faster R-convolutional neural network),SSD(single shot multibox detector)and YOLO(you only look once:unified,real-time object detection)are trained using the datasets proposed in this paper.The experimental results show the effectiveness and integrity of this dataset generation and annotation method.展开更多
Herbal components characterization represents a challenging task because of the co-existing of multiple classes of naturally occurring compounds with wide spans of polarity,molecular mass,and the ubiquitous isomerism....Herbal components characterization represents a challenging task because of the co-existing of multiple classes of naturally occurring compounds with wide spans of polarity,molecular mass,and the ubiquitous isomerism.The root and rhizome of Salvia miltiorrhiza have been utilized as a reputable traditional Chinese medicine Salviae Miltiorrhizae Radix et Rhizoma(Dan-Shen)in the treatment of cardiovascular disease.Herein,a dimensionenhanced ultra-high performance liquid chromatography/ion mobility/quadrupole time-of-flight mass spectrometry approach in combination with intelligent peak annotation workflows was established aimed to rapidly characterize the multicomponents from S.miltiorrhiza.Due to the sufficient optimization,satisfactory chromatography separation was enabled on an HSS T3 column within 33 min using 0.1%formic acid in water(A)and acetonitrile(B)as the mobile phase,while the data-independent HDMS^(E) in both the negative and positive electrospray ionization modes was utilized for the high-coverage MS^(2) data acquisition.Streamlined automatic peak annotation by searching an in-house library(recording 198 known compounds)followed by the subsequent confirming steps(e.g.,comparison with the reference compounds,fragmentation pathways analysis,and retention behavior comparison,etc.),allowed us to identify or tentatively characterize a total of 86 components(including 50 terpenoids,21 phenolic acids,and 15 others)from S.miltiorrhiza.Importantly,three-dimensional structure information,such as the retention time,MS^(1) and MS^(2) data,and collision cross section(CCS),was provided,which can facilitate the more reliable characterization of herbal components.展开更多
基金Supported by the National Program on Key Basic Research Project(No.2013CB329502)the National Natural Science Foundation of China(No.61202212)+1 种基金the Special Research Project of the Educational Department of Shaanxi Province of China(No.15JK1038)the Key Research Project of Baoji University of Arts and Sciences(No.ZK16047)
文摘In recent years,multimedia annotation problem has been attracting significant research attention in multimedia and computer vision areas,especially for automatic image annotation,whose purpose is to provide an efficient and effective searching environment for users to query their images more easily. In this paper,a semi-supervised learning based probabilistic latent semantic analysis( PLSA) model for automatic image annotation is presenred. Since it's often hard to obtain or create labeled images in large quantities while unlabeled ones are easier to collect,a transductive support vector machine( TSVM) is exploited to enhance the quality of the training image data. Then,different image features with different magnitudes will result in different performance for automatic image annotation. To this end,a Gaussian normalization method is utilized to normalize different features extracted from effective image regions segmented by the normalized cuts algorithm so as to reserve the intrinsic content of images as complete as possible. Finally,a PLSA model with asymmetric modalities is constructed based on the expectation maximization( EM) algorithm to predict a candidate set of annotations with confidence scores. Extensive experiments on the general-purpose Corel5k dataset demonstrate that the proposed model can significantly improve performance of traditional PLSA for the task of automatic image annotation.
基金Supported by the National Basic Research Priorities Program(No.2013CB329502)the National High-tech R&D Program of China(No.2012AA011003)+1 种基金National Natural Science Foundation of China(No.61035003,61072085,60933004,60903141)the National Scienceand Technology Support Program of China(No.2012BA107B02)
文摘A novel image auto-annotation method is presented based on probabilistic latent semantic analysis(PLSA) model and multiple Markov random fields(MRF).A PLSA model with asymmetric modalities is first constructed to estimate the joint probability between images and semantic concepts,then a subgraph is extracted served as the corresponding structure of Markov random fields and inference over it is performed by the iterative conditional modes so as to capture the final annotation for the image.The novelty of our method mainly lies in two aspects:exploiting PLSA to estimate the joint probability between images and semantic concepts as well as multiple MRF to further explore the semantic context among keywords for accurate image annotation.To demonstrate the effectiveness of this approach,an experiment on the Corel5 k dataset is conducted and its results are compared favorably with the current state-of-the-art approaches.
基金Supported by the National Basic Research Priorities Programme(No.2007CB311004)the National Natural Science Foundation of China(No.61035003,60933004,60903141,60970088,61072085)
文摘Automatic image annotation(AIA)has become an important and challenging problem in computer vision due to the existence of semantic gap.In this paper,a novel support vector machine with mixture of kernels(SVM-MK)for automatic image annotation is proposed.On one hand,the combined global and local block-based image features are extracted in order to reflect the intrinsic content of images as complete as possible.On the other hand,SVM-MK is constructed to shoot for better annotating performance.Experimental results on Corel dataset show that the proposed image feature representation method as well as automatic image annotation classifier,SVM-MK,can achieve higher annotating accuracy than SVM with any single kernel and mi-SVM for semantic image annotation.
基金Supported by the National Basic Research Priorities Programme(No.2013CB329502)the National High Technology Research and Development Programme of China(No.2012AA011003)+1 种基金the Natural Science Basic Research Plan in Shanxi Province of China(No.2014JQ2-6036)the Science and Technology R&D Program of Baoji City(No.203020013,2013R2-2)
文摘This paper presents a new method for refining image annotation by integrating probabilistic la- tent semantic analysis (PLSA) with conditional random field (CRF). First a PLSA model with asymmetric modalities is constructed to predict a candidate set of annotations with confidence scores, and then model semantic relationship among the candidate annotations by leveraging conditional ran- dom field. In CRF, the confidence scores generated lay the PLSA model and the Fliekr distance be- tween pairwise candidate annotations are considered as local evidences and contextual potentials re- spectively. The novelty of our method mainly lies in two aspects : exploiting PLSA to predict a candi- date set of annotations with confidence scores as well as CRF to further explore the semantic context among candidate annotations for precise image annotation. To demonstrate the effectiveness of the method proposed in this paper, an experiment is conducted on the standard Corel dataset and its re- sults are 'compared favorably with several state-of-the-art approaches.
文摘In industry,it is becoming common to detect and recognize industrial workpieces using deep learning methods.In this field,the lack of datasets is a big problem,and collecting and annotating datasets in this field is very labor intensive.The researchers need to perform dataset annotation if a dataset is generated by themselves.It is also one of the restrictive factors that the current method based on deep learning cannot expand well.At present,there are very few workpiece datasets for industrial fields,and the existing datasets are generated from ideal workpiece computer aided design(CAD)models,for which few actual workpiece images were collected and utilized.We propose an automatic industrial workpiece dataset generation method and an automatic ground truth annotation method.Included in our methods are three algorithms that we proposed:a point cloud based spatial plane segmentation algorithm to segment the workpieces in the real scene and to obtain the annotation information of the workpieces in the images captured in the real scene;a random multiple workpiece generation algorithm to generate abundant composition datasets with random rotation workpiece angles and positions;and a tangent vector based contour tracking and completion algorithm to get improved contour images.With our procedures,annotation information can be obtained using the algorithms proposed in this paper.Upon completion of the annotation process,a json format file is generated.Faster R-CNN(Faster R-convolutional neural network),SSD(single shot multibox detector)and YOLO(you only look once:unified,real-time object detection)are trained using the datasets proposed in this paper.The experimental results show the effectiveness and integrity of this dataset generation and annotation method.
基金This work was financially supported by the National Key Research and Development Program of China(Grant No.2018YFC1704500)Tianjin Committee of Science and Technology of China(Grant No.21ZYJDJC00080)National Natural Science Foundation of China(Grant No.81872996)。
文摘Herbal components characterization represents a challenging task because of the co-existing of multiple classes of naturally occurring compounds with wide spans of polarity,molecular mass,and the ubiquitous isomerism.The root and rhizome of Salvia miltiorrhiza have been utilized as a reputable traditional Chinese medicine Salviae Miltiorrhizae Radix et Rhizoma(Dan-Shen)in the treatment of cardiovascular disease.Herein,a dimensionenhanced ultra-high performance liquid chromatography/ion mobility/quadrupole time-of-flight mass spectrometry approach in combination with intelligent peak annotation workflows was established aimed to rapidly characterize the multicomponents from S.miltiorrhiza.Due to the sufficient optimization,satisfactory chromatography separation was enabled on an HSS T3 column within 33 min using 0.1%formic acid in water(A)and acetonitrile(B)as the mobile phase,while the data-independent HDMS^(E) in both the negative and positive electrospray ionization modes was utilized for the high-coverage MS^(2) data acquisition.Streamlined automatic peak annotation by searching an in-house library(recording 198 known compounds)followed by the subsequent confirming steps(e.g.,comparison with the reference compounds,fragmentation pathways analysis,and retention behavior comparison,etc.),allowed us to identify or tentatively characterize a total of 86 components(including 50 terpenoids,21 phenolic acids,and 15 others)from S.miltiorrhiza.Importantly,three-dimensional structure information,such as the retention time,MS^(1) and MS^(2) data,and collision cross section(CCS),was provided,which can facilitate the more reliable characterization of herbal components.