Obtaining a 3D feature description with high descriptiveness and robustness under complicated nuisances is a significant and challenging task in 3D feature matching.This paper proposes a novel feature description cons...Obtaining a 3D feature description with high descriptiveness and robustness under complicated nuisances is a significant and challenging task in 3D feature matching.This paper proposes a novel feature description consisting of a stable local reference frame(LRF)and a feature descriptor based on local spatial voxels.First,an improved LRF was designed by incorporating distance weights into Z-and X-axis calculations.Subsequently,based on the LRF and voxel segmentation,a feature descriptor based on voxel homogenization was proposed.Moreover,uniform segmentation of cube voxels was performed,considering the eigenvalues of each voxel and its neighboring voxels,thereby enhancing the stability of the description.The performance of the descriptor was strictly tested and evaluated on three public datasets,which exhibited high descriptiveness,robustness,and superior performance compared with other current methods.Furthermore,the descriptor was applied to a 3D registration trial,and the results demonstrated the reliability of our approach.展开更多
The matching of local descriptors represents at this moment a key tool in computer vision, with a wide variety of methods designed for tasks such as image classification, object recognition and tracking, image stitchi...The matching of local descriptors represents at this moment a key tool in computer vision, with a wide variety of methods designed for tasks such as image classification, object recognition and tracking, image stitching, or data mining relying on it. Local feature description techniques are usually developed so as to provide invariance to photometric variations specific to the acquisition of natural images, but are nonetheless used in association with biomedical imaging as well. It has been previously shown that the matching of gradient based descriptors is affected by image modifications specific to Confocal Scanning Laser Microscopy (CSLM). In this paper we extend our previous work in this direction and show how specific acquisition or post-processing methods alleviate or accentuate this problem.展开更多
As an indispensable part of identity authentication,offline writer identification plays a notable role in biology,forensics,and historical document analysis.However,identifying handwriting efficiently,stably,and quick...As an indispensable part of identity authentication,offline writer identification plays a notable role in biology,forensics,and historical document analysis.However,identifying handwriting efficiently,stably,and quickly is still challenging due to the method of extracting and processing handwriting features.In this paper,we propose an efficient system to identify writers through handwritten images,which integrates local and global features from similar handwritten images.The local features are modeled by effective aggregate processing,and global features are extracted through transfer learning.Specifically,the proposed system employs a pre-trained Residual Network to mine the relationship between large image sets and specific handwritten images,while the vector of locally aggregated descriptors with double power normalization is employed in aggregating local and global features.Moreover,handwritten image segmentation,preprocessing,enhancement,optimization of neural network architecture,and normalization for local and global features are exploited,significantly improving system performance.The proposed system is evaluated on Computer Vision Lab(CVL)datasets and the International Conference on Document Analysis and Recognition(ICDAR)2013 datasets.The results show that it represents good generalizability and achieves state-of-the-art performance.Furthermore,the system performs better when training complete handwriting patches with the normalization method.The experimental result indicates that it’s significant to segment handwriting reasonably while dealing with handwriting overlap,which reduces visual burstiness.展开更多
A detailed quantum mechanical analysis of electronic disposition of five aminopyrimidoisoquinolinequinones (APIQs) was performed after extraction of this subset of compounds from a larger data set of APIQs via a repor...A detailed quantum mechanical analysis of electronic disposition of five aminopyrimidoisoquinolinequinones (APIQs) was performed after extraction of this subset of compounds from a larger data set of APIQs via a reported clustering methodology (Elfaki, et al. 2020). Both semi empirical PM3 method and DFT quantum mechanical methods were used to calculate global and local quantum mechanical descriptors (QMDs) to define the electronic environment of these molecules in attempt to rationalize their observed anti-cancer response variability. The biological response is the anticancer activity against human gastric adenocarcenoma (AGS) cell line. The correlation matrix between the calculated global electronic descriptors and biological activity demonstrated that the global dipole moment gives the highest correlation. The local electronic environment was analysed by The Mullikan charges (MC) and Fukui functions for N-5, C-6, C-8 in addition to the N atom of phenylamino side group at C-8. MCs furnished no useful information as each of these atoms had almost identical MC values for all the five compounds with exception of C-6 which gave varied values. Regressing MCs of C-6 against the response traces 60% of the latter variability. As C-6 is an extra annular methyl carbon adjacent to N-5 in isoquinoline residue of APIQ, we reasoned that the chemical reactivities of 4 out of the 5 APIQs might be due to a Chichibabin-type tautomerism implying a possible alkylation aspect in their mechanism of action. The corresponding Fukui functions (f<sup>-</sup>, f<sup>+</sup> and f<sup>0</sup>) showed a considerable consistency with the patterns of chemical reactivity exhibited by this small set of APIQs.展开更多
An advanced edge-based method of feature detection and extraction is developed for object description in digital images. It is useful for the comparison of different images of the same scene in aerial imagery, for des...An advanced edge-based method of feature detection and extraction is developed for object description in digital images. It is useful for the comparison of different images of the same scene in aerial imagery, for describing and recognizing categories, for automatic building extraction and for finding the mutual regions in image matching. The method includes directional filtering and searching for straight edge segments in every direction and scale, taking into account edge gradient signs. Line segments are ordered with respect to their orientation and average gradients in the region in question. These segments are used for the construction of an object descriptor. A hierarchical set of feature descriptors is developed, taking into consideration the proposed straight line segment detector. Comparative performance is evaluated on the noisy model and in real aerial and satellite imagery.展开更多
This paper presents an efficient image feature representation method, namely angle structure descriptor(ASD), which is built based on the angle structures of images. According to the diversity in directions, angle str...This paper presents an efficient image feature representation method, namely angle structure descriptor(ASD), which is built based on the angle structures of images. According to the diversity in directions, angle structures are defined in local blocks. Combining color information in HSV color space, we use angle structures to detect images. The internal correlations between neighboring pixels in angle structures are explored to form a feature vector. With angle structures as bridges, ASD extracts image features by integrating multiple information as a whole, such as color, texture, shape and spatial layout information. In addition, the proposed algorithm is efficient for image retrieval without any clustering implementation or model training. Experimental results demonstrate that ASD outperforms the other related algorithms.展开更多
This paper proposed a novel multi-view interactive behavior recognition method based on local self-similarity descriptors and graph shared multi-task learning. First, we proposed the composite interactive feature repr...This paper proposed a novel multi-view interactive behavior recognition method based on local self-similarity descriptors and graph shared multi-task learning. First, we proposed the composite interactive feature representation which encodes both the spatial distribution of local motion of interest points and their contexts. Furthermore, local self-similarity descriptor represented by temporal-pyramid bag of words(BOW) was applied to decreasing the influence of observation angle change on recognition and retaining the temporal information. For the purpose of exploring latent correlation between different interactive behaviors from different views and retaining specific information of each behaviors, graph shared multi-task learning was used to learn the corresponding interactive behavior recognition model. Experiment results showed the effectiveness of the proposed method in comparison with other state-of-the-art methods on the public databases CASIA, i3Dpose dataset and self-built database for interactive behavior recognition.展开更多
Expression detection plays a vital role to determine the patient’s condition in healthcare systems.It helps the monitoring teams to respond swiftly in case of emergency.Due to the lack of suitable methods,results are...Expression detection plays a vital role to determine the patient’s condition in healthcare systems.It helps the monitoring teams to respond swiftly in case of emergency.Due to the lack of suitable methods,results are often compromised in an unconstrained environment because of pose,scale,occlusion and illumination variations in the image of the face of the patient.A novel patch-based multiple local binary patterns(LBP)feature extraction technique is proposed for analyzing human behavior using facial expression recognition.It consists of three-patch[TPLBP]and four-patch LBPs[FPLBP]based feature engineering respectively.Image representation is encoded from local patch statistics using these descriptors.TPLBP and FPLBP capture information that is encoded to find likenesses between adjacent patches of pixels by using short bit strings contrary to pixel-based methods.Coded images are transformed into the frequency domain using a discrete cosine transform(DCT).Most discriminant features extracted from coded DCT images are combined to generate a feature vector.Support vector machine(SVM),k-nearest neighbor(KNN),and Naïve Bayes(NB)are used for the classification of facial expressions using selected features.Extensive experimentation is performed to analyze human behavior by considering standard extended Cohn Kanade(CK+)and Oulu–CASIA datasets.Results demonstrate that the proposed methodology outperforms the other techniques used for comparison.展开更多
A new algorithm taking the spatial context of local features into account by utilizing contextualized histograms was proposed to recognize facial expression. The contextualized histograms were extracted fromtwo widely...A new algorithm taking the spatial context of local features into account by utilizing contextualized histograms was proposed to recognize facial expression. The contextualized histograms were extracted fromtwo widely used descriptors—the local binary pattern( LBP) and weber local descriptor( WLD). The LBP and WLD feature histograms were extracted separately fromeach facial image,and contextualized histogram was generated as feature vectors to feed the classifier. In addition,the human face was divided into sub-blocks and each sub-block was assigned different weights by their different contributions to the intensity of facial expressions to improve the recognition rate. With the support vector machine(SVM) as classifier,the experimental results on the 2D texture images fromthe 3D-BU FE dataset indicated that contextualized histograms improved facial expression recognition performance when local features were employed.展开更多
To fully describe the structure information of the point cloud when the LIDAR-object distance is long,a joint global and local feature(JGLF)descriptor is constructed.Compared with five typical descriptors,the object r...To fully describe the structure information of the point cloud when the LIDAR-object distance is long,a joint global and local feature(JGLF)descriptor is constructed.Compared with five typical descriptors,the object recognition rate of JGLF is higher when the LIDAR-object distances change.Under the situation that airborne LIDAR is getting close to the object,the particle filtering(PF)algorithm is used as the tracking frame.Particle weight is updated by comparing the difference between JGLFs to track the object.It is verified that the proposed algorithm performs 13.95%more accurately and stably than the basic PF algorithm.展开更多
This paper presents a biologically inspired local image descriptor that combines color and shape features. Compared with previous descriptors, red-cyan cells associated with L, M, and S cones (L for long, M for mediu...This paper presents a biologically inspired local image descriptor that combines color and shape features. Compared with previous descriptors, red-cyan cells associated with L, M, and S cones (L for long, M for medium, and S for short) are used to indicate one of the opponent color channels. Stepping forward from state-of-the-art color feature extraction, we exploit a new approach to compute the color orientation and magnitudes of three opponent color channels, namely, red-green, blue-yellow, and red-cyan, in two-dimensional space. Color orientation is calculated in histograms with magnitude weighting. We linearly concatenate the four-color-opponent-channel histogram and scale-invariant-feamre-transform histogram in the final step. We apply our biologically inspired descriptor to describe the local image feature. Quantitative comparisons with state-of-the-art descriptors demonstrate the significant advantages of maintaining invariance to photometric and geometric changes in image matching, particularly in cases, such as illumination variation and image blurring, where more color contrast information is observed.展开更多
Structure information plays an important role in both object recognition and detection. This paper studies what visual structure is and addresses the problem of struc- ture modeling and representation from two aspects...Structure information plays an important role in both object recognition and detection. This paper studies what visual structure is and addresses the problem of struc- ture modeling and representation from two aspects: visual feature and topology model. Firstly, at feature level, we pro- pose Local Structured Descriptor to capture the object's local structure effectively, and develop the descriptors from shape and texture information, respectively. Secondly, at topology level, we present a local strnctured model with a boosted fea- ture selection and fusion scheme. All experiments are conducted on the challenging PASCAL Visual Object Classes (VOC) datasets from VOC2007 to VOC2010. Experimental results show that our method achieves very competitive performance.展开更多
Currently, global-features-based image copy detection is vulnerable to geometric transformations like cropping, shift, and rotations. To resolve this problem, some algorithms based on local descriptors have been propo...Currently, global-features-based image copy detection is vulnerable to geometric transformations like cropping, shift, and rotations. To resolve this problem, some algorithms based on local descriptors have been proposed. However, the local descriptors, which were originally designed for object recognition, are not suitable for copy detection because they cause the problems of false positives and ambiguities. Instead of relying on the local gradient statistic as many existing descriptors do, we propose a new invariant local descriptor based on local polar-mapping and discrete Fourier transform. Then based on this descriptor, we propose a new framework of copy detection, in which virtual prior attacks and attack weight are employed for training and selecting only a few robust features. This consequently improves the storage and detection efficiency. In addition, it is worth noting that the feature matching takes the locations and orientations of interest points into consideration, which increases the number of matched regions and improves the recall. Experimental results show that the new descriptor is more robust and distinctive, and the proposed copy detection scheme using this descriptor can substantially enhance the accuracy and recall of copy detection and lower the false positives and ambiguities.展开更多
The extraction and description of image features are very important for visual simultaneous localization and mapping(V-SLAM).A rotated boosted efficient binary local image descriptor(BEBLID)SLAM(RB-SLAM)algorithm base...The extraction and description of image features are very important for visual simultaneous localization and mapping(V-SLAM).A rotated boosted efficient binary local image descriptor(BEBLID)SLAM(RB-SLAM)algorithm based on improved oriented fast and rotated brief(ORB)feature description is proposed in this paper,which can solve the problems of low localization accuracy and time efficiency of the current ORB-SLAM3 algorithm.Firstly,it uses the BEBLID to replace the feature point description algorithm of the original ORB to enhance the expressiveness and description efficiency of the image.Secondly,it adds rotational invariance to the BEBLID using the orientation information of the feature points.It also selects the rotationally stable bits in the BEBLID to further enhance the rotational invariance of the BEBLID.Finally,it retrains the binary visual dictionary based on the BEBLID to reduce the cumulative error of V-SLAM and improve the loading speed of the visual dictionary.Experiments show that the dictionary loading efficiency is improved by more than 10 times.The RB-SLAM algorithm improves the trajectory accuracy by 24.75%on the TUM dataset and 26.25%on the EuRoC dataset compared to the ORB-SLAM3 algorithm.展开更多
基金the National Natural Science Foundation of China,No.51705469the Zhengzhou University Youth Talent Enterprise Cooperative Innovation Team Support Program Project(2021,2022).
文摘Obtaining a 3D feature description with high descriptiveness and robustness under complicated nuisances is a significant and challenging task in 3D feature matching.This paper proposes a novel feature description consisting of a stable local reference frame(LRF)and a feature descriptor based on local spatial voxels.First,an improved LRF was designed by incorporating distance weights into Z-and X-axis calculations.Subsequently,based on the LRF and voxel segmentation,a feature descriptor based on voxel homogenization was proposed.Moreover,uniform segmentation of cube voxels was performed,considering the eigenvalues of each voxel and its neighboring voxels,thereby enhancing the stability of the description.The performance of the descriptor was strictly tested and evaluated on three public datasets,which exhibited high descriptiveness,robustness,and superior performance compared with other current methods.Furthermore,the descriptor was applied to a 3D registration trial,and the results demonstrated the reliability of our approach.
基金The UEFISCDIPN-II-PT-PCCA-2011-3.2-1162 Research Grant The CRUS SCIEX NMS-CH Fellowship nr. 12.135
文摘The matching of local descriptors represents at this moment a key tool in computer vision, with a wide variety of methods designed for tasks such as image classification, object recognition and tracking, image stitching, or data mining relying on it. Local feature description techniques are usually developed so as to provide invariance to photometric variations specific to the acquisition of natural images, but are nonetheless used in association with biomedical imaging as well. It has been previously shown that the matching of gradient based descriptors is affected by image modifications specific to Confocal Scanning Laser Microscopy (CSLM). In this paper we extend our previous work in this direction and show how specific acquisition or post-processing methods alleviate or accentuate this problem.
基金supported in part by the Postgraduate Research&Practice Innovation Program of Jiangsu Province under Grant KYCX 20_0758in part by the Science and Technology Research Project of Jiangsu Public Security Department under Grant 2020KX005+1 种基金in part by the General Project of Philosophy and Social Science Research in Colleges and Universities in Jiangsu Province under Grant 2022SJYB0473in part by“Cyberspace Security”Construction Project of Jiangsu Provincial Key Discipline during the“14th Five Year Plan”.
文摘As an indispensable part of identity authentication,offline writer identification plays a notable role in biology,forensics,and historical document analysis.However,identifying handwriting efficiently,stably,and quickly is still challenging due to the method of extracting and processing handwriting features.In this paper,we propose an efficient system to identify writers through handwritten images,which integrates local and global features from similar handwritten images.The local features are modeled by effective aggregate processing,and global features are extracted through transfer learning.Specifically,the proposed system employs a pre-trained Residual Network to mine the relationship between large image sets and specific handwritten images,while the vector of locally aggregated descriptors with double power normalization is employed in aggregating local and global features.Moreover,handwritten image segmentation,preprocessing,enhancement,optimization of neural network architecture,and normalization for local and global features are exploited,significantly improving system performance.The proposed system is evaluated on Computer Vision Lab(CVL)datasets and the International Conference on Document Analysis and Recognition(ICDAR)2013 datasets.The results show that it represents good generalizability and achieves state-of-the-art performance.Furthermore,the system performs better when training complete handwriting patches with the normalization method.The experimental result indicates that it’s significant to segment handwriting reasonably while dealing with handwriting overlap,which reduces visual burstiness.
文摘A detailed quantum mechanical analysis of electronic disposition of five aminopyrimidoisoquinolinequinones (APIQs) was performed after extraction of this subset of compounds from a larger data set of APIQs via a reported clustering methodology (Elfaki, et al. 2020). Both semi empirical PM3 method and DFT quantum mechanical methods were used to calculate global and local quantum mechanical descriptors (QMDs) to define the electronic environment of these molecules in attempt to rationalize their observed anti-cancer response variability. The biological response is the anticancer activity against human gastric adenocarcenoma (AGS) cell line. The correlation matrix between the calculated global electronic descriptors and biological activity demonstrated that the global dipole moment gives the highest correlation. The local electronic environment was analysed by The Mullikan charges (MC) and Fukui functions for N-5, C-6, C-8 in addition to the N atom of phenylamino side group at C-8. MCs furnished no useful information as each of these atoms had almost identical MC values for all the five compounds with exception of C-6 which gave varied values. Regressing MCs of C-6 against the response traces 60% of the latter variability. As C-6 is an extra annular methyl carbon adjacent to N-5 in isoquinoline residue of APIQ, we reasoned that the chemical reactivities of 4 out of the 5 APIQs might be due to a Chichibabin-type tautomerism implying a possible alkylation aspect in their mechanism of action. The corresponding Fukui functions (f<sup>-</sup>, f<sup>+</sup> and f<sup>0</sup>) showed a considerable consistency with the patterns of chemical reactivity exhibited by this small set of APIQs.
文摘An advanced edge-based method of feature detection and extraction is developed for object description in digital images. It is useful for the comparison of different images of the same scene in aerial imagery, for describing and recognizing categories, for automatic building extraction and for finding the mutual regions in image matching. The method includes directional filtering and searching for straight edge segments in every direction and scale, taking into account edge gradient signs. Line segments are ordered with respect to their orientation and average gradients in the region in question. These segments are used for the construction of an object descriptor. A hierarchical set of feature descriptors is developed, taking into consideration the proposed straight line segment detector. Comparative performance is evaluated on the noisy model and in real aerial and satellite imagery.
基金supported by the National Natural Science Foundation of China (No.61170145, 61373081, 61402268, 61401260, 61572298)the Technology and Development Project of Shandong (No.2013GGX10125)+1 种基金the Natural Science Foundation of Shandong China (No.BS2014DX006, ZR2014FM012)the Taishan Scholar Project of Shandong, China
文摘This paper presents an efficient image feature representation method, namely angle structure descriptor(ASD), which is built based on the angle structures of images. According to the diversity in directions, angle structures are defined in local blocks. Combining color information in HSV color space, we use angle structures to detect images. The internal correlations between neighboring pixels in angle structures are explored to form a feature vector. With angle structures as bridges, ASD extracts image features by integrating multiple information as a whole, such as color, texture, shape and spatial layout information. In addition, the proposed algorithm is efficient for image retrieval without any clustering implementation or model training. Experimental results demonstrate that ASD outperforms the other related algorithms.
基金Project(51678075)supported by the National Natural Science Foundation of ChinaProject(2017GK2271)supported by Hunan Provincial Science and Technology Department,China
文摘This paper proposed a novel multi-view interactive behavior recognition method based on local self-similarity descriptors and graph shared multi-task learning. First, we proposed the composite interactive feature representation which encodes both the spatial distribution of local motion of interest points and their contexts. Furthermore, local self-similarity descriptor represented by temporal-pyramid bag of words(BOW) was applied to decreasing the influence of observation angle change on recognition and retaining the temporal information. For the purpose of exploring latent correlation between different interactive behaviors from different views and retaining specific information of each behaviors, graph shared multi-task learning was used to learn the corresponding interactive behavior recognition model. Experiment results showed the effectiveness of the proposed method in comparison with other state-of-the-art methods on the public databases CASIA, i3Dpose dataset and self-built database for interactive behavior recognition.
基金supported in part by the MSIT(Ministry of Science and ICT),Korea,under the ITRC(Information Technology Research Center)support program(IITP2020-2016-0-00312)supervised by the IITP(Institute for Information&communications Technology Planning&Evaluation)and in part by the Faculty Research Fund of Sejong University in 2019.
文摘Expression detection plays a vital role to determine the patient’s condition in healthcare systems.It helps the monitoring teams to respond swiftly in case of emergency.Due to the lack of suitable methods,results are often compromised in an unconstrained environment because of pose,scale,occlusion and illumination variations in the image of the face of the patient.A novel patch-based multiple local binary patterns(LBP)feature extraction technique is proposed for analyzing human behavior using facial expression recognition.It consists of three-patch[TPLBP]and four-patch LBPs[FPLBP]based feature engineering respectively.Image representation is encoded from local patch statistics using these descriptors.TPLBP and FPLBP capture information that is encoded to find likenesses between adjacent patches of pixels by using short bit strings contrary to pixel-based methods.Coded images are transformed into the frequency domain using a discrete cosine transform(DCT).Most discriminant features extracted from coded DCT images are combined to generate a feature vector.Support vector machine(SVM),k-nearest neighbor(KNN),and Naïve Bayes(NB)are used for the classification of facial expressions using selected features.Extensive experimentation is performed to analyze human behavior by considering standard extended Cohn Kanade(CK+)and Oulu–CASIA datasets.Results demonstrate that the proposed methodology outperforms the other techniques used for comparison.
基金Supported by the National Natural Science Foundation of China(60772066)
文摘A new algorithm taking the spatial context of local features into account by utilizing contextualized histograms was proposed to recognize facial expression. The contextualized histograms were extracted fromtwo widely used descriptors—the local binary pattern( LBP) and weber local descriptor( WLD). The LBP and WLD feature histograms were extracted separately fromeach facial image,and contextualized histogram was generated as feature vectors to feed the classifier. In addition,the human face was divided into sub-blocks and each sub-block was assigned different weights by their different contributions to the intensity of facial expressions to improve the recognition rate. With the support vector machine(SVM) as classifier,the experimental results on the 2D texture images fromthe 3D-BU FE dataset indicated that contextualized histograms improved facial expression recognition performance when local features were employed.
基金This work was supported by the National Natural Science Foundation of China(Nos.61271353 and 61871389)Foundation of State Key Laboratory of Pulsed Power Laser Technology(No.SKL2018ZR09)Major Funding Projects of National University of Defense Technology(No.ZK18-01-02).
文摘To fully describe the structure information of the point cloud when the LIDAR-object distance is long,a joint global and local feature(JGLF)descriptor is constructed.Compared with five typical descriptors,the object recognition rate of JGLF is higher when the LIDAR-object distances change.Under the situation that airborne LIDAR is getting close to the object,the particle filtering(PF)algorithm is used as the tracking frame.Particle weight is updated by comparing the difference between JGLFs to track the object.It is verified that the proposed algorithm performs 13.95%more accurately and stably than the basic PF algorithm.
基金Acknowledgment This study was supported by the National Natural Science Foundation of China (grant 61101155) and the Jilin Province Science and Technology Development Program (20101504).
文摘This paper presents a biologically inspired local image descriptor that combines color and shape features. Compared with previous descriptors, red-cyan cells associated with L, M, and S cones (L for long, M for medium, and S for short) are used to indicate one of the opponent color channels. Stepping forward from state-of-the-art color feature extraction, we exploit a new approach to compute the color orientation and magnitudes of three opponent color channels, namely, red-green, blue-yellow, and red-cyan, in two-dimensional space. Color orientation is calculated in histograms with magnitude weighting. We linearly concatenate the four-color-opponent-channel histogram and scale-invariant-feamre-transform histogram in the final step. We apply our biologically inspired descriptor to describe the local image feature. Quantitative comparisons with state-of-the-art descriptors demonstrate the significant advantages of maintaining invariance to photometric and geometric changes in image matching, particularly in cases, such as illumination variation and image blurring, where more color contrast information is observed.
文摘Structure information plays an important role in both object recognition and detection. This paper studies what visual structure is and addresses the problem of struc- ture modeling and representation from two aspects: visual feature and topology model. Firstly, at feature level, we pro- pose Local Structured Descriptor to capture the object's local structure effectively, and develop the descriptors from shape and texture information, respectively. Secondly, at topology level, we present a local strnctured model with a boosted fea- ture selection and fusion scheme. All experiments are conducted on the challenging PASCAL Visual Object Classes (VOC) datasets from VOC2007 to VOC2010. Experimental results show that our method achieves very competitive performance.
基金Supported by the National Natural Science Foundation of China under Grant Nos. 60873226,60803112the National High Technology Research and Development 863 Program of China under Grant No. 2009AA01Z411
文摘Currently, global-features-based image copy detection is vulnerable to geometric transformations like cropping, shift, and rotations. To resolve this problem, some algorithms based on local descriptors have been proposed. However, the local descriptors, which were originally designed for object recognition, are not suitable for copy detection because they cause the problems of false positives and ambiguities. Instead of relying on the local gradient statistic as many existing descriptors do, we propose a new invariant local descriptor based on local polar-mapping and discrete Fourier transform. Then based on this descriptor, we propose a new framework of copy detection, in which virtual prior attacks and attack weight are employed for training and selecting only a few robust features. This consequently improves the storage and detection efficiency. In addition, it is worth noting that the feature matching takes the locations and orientations of interest points into consideration, which increases the number of matched regions and improves the recall. Experimental results show that the new descriptor is more robust and distinctive, and the proposed copy detection scheme using this descriptor can substantially enhance the accuracy and recall of copy detection and lower the false positives and ambiguities.
文摘The extraction and description of image features are very important for visual simultaneous localization and mapping(V-SLAM).A rotated boosted efficient binary local image descriptor(BEBLID)SLAM(RB-SLAM)algorithm based on improved oriented fast and rotated brief(ORB)feature description is proposed in this paper,which can solve the problems of low localization accuracy and time efficiency of the current ORB-SLAM3 algorithm.Firstly,it uses the BEBLID to replace the feature point description algorithm of the original ORB to enhance the expressiveness and description efficiency of the image.Secondly,it adds rotational invariance to the BEBLID using the orientation information of the feature points.It also selects the rotationally stable bits in the BEBLID to further enhance the rotational invariance of the BEBLID.Finally,it retrains the binary visual dictionary based on the BEBLID to reduce the cumulative error of V-SLAM and improve the loading speed of the visual dictionary.Experiments show that the dictionary loading efficiency is improved by more than 10 times.The RB-SLAM algorithm improves the trajectory accuracy by 24.75%on the TUM dataset and 26.25%on the EuRoC dataset compared to the ORB-SLAM3 algorithm.