Abstract: Machine learning potentials are promising in atomistic simulations due to their comparable accuracy to first-principles theory but much lower computational cost. However, the reliability, speed, and transferability of atomistic machine learning potentials depend strongly on the way atomic configurations are represented. A wise choice of descriptors used as input for the machine learning program is the key to a successful machine learning representation. Here we develop a simple and efficient strategy to automatically select an optimal set of linearly independent atomic features out of a large pool of candidates, based on the correlations that are intrinsic to the training data. Through applications to the construction of embedded atom neural network potentials for several benchmark molecules with less redundant, linearly independent embedded density descriptors, we demonstrate the efficiency and accuracy of this new strategy. The proposed algorithm can greatly simplify the initial selection of atomic features and vastly improve the performance of atomistic machine learning potentials.
Abstract: A hierarchical retrieval scheme for an accessory image database is proposed based on the contour and region features of textile industrial accessories. First, the smallest enclosed rectangle[1] feature (degree of accessory coordination) is used to filter the image database and narrow the image search scope. After the accessory contour information and region information are extracted, a fused multi-feature combining the centroid distance Fourier descriptor and the distance distribution histogram is adopted to complete image retrieval accurately. All of these features are invariant under translation, scaling, and rotation. Results from a test on an image database of 1,000 accessory images demonstrate that the method is effective and practical, with high accuracy and fast speed.
Abstract: The matching of local descriptors is currently a key tool in computer vision, with a wide variety of methods designed for tasks such as image classification, object recognition and tracking, image stitching, or data mining relying on it. Local feature description techniques are usually developed to provide invariance to photometric variations specific to the acquisition of natural images, but are nonetheless used in association with biomedical imaging as well. It has been previously shown that the matching of gradient-based descriptors is affected by image modifications specific to Confocal Scanning Laser Microscopy (CSLM). In this paper we extend our previous work in this direction and show how specific acquisition or post-processing methods alleviate or accentuate this problem.
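Abstract: 3D local feature description is an important task in 3D computer vision. Real-world scenes contain nuisances such as noise, occlusion, and clutter, which make accurate and robust 3D local feature description very challenging. To improve the performance of feature description, a local surface variation based statistics histogram (LSVSH) descriptor is proposed. First, a new attribute that does not depend on the local reference axis (LRA), called the curvature attribute, is designed to strengthen the descriptor's robustness to LRA errors. Then, the local space is partitioned along the radial direction, and three angle attributes and one curvature attribute are accumulated in each subspace to generate the LSVSH descriptor, giving a comprehensive and robust description of local surface information. Extensive experiments were conducted on four datasets: B3R, U3M, U3OR, and QuLD. The results show that the area under the recall-precision curve (AUCpr) of LSVSH on the four datasets is 0.95, 0.70, 0.54, and 0.10, respectively, outperforming existing local feature descriptors. The correct registration rate on the U3M dataset and the correct recognition rate on the U3OR dataset reach 70% and 100%, respectively, which verifies the effectiveness of LSVSH in object registration and recognition tasks.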
Abstract: To address the current issues of inaccurate segmentation and the limited applicability of segmentation methods for building facades in point clouds, we propose a facade segmentation algorithm based on optimal dual-scale feature descriptors. First, we select the optimal dual-scale descriptors from a range of feature descriptors. Next, we segment the facade according to the threshold value of the chosen optimal dual-scale descriptors. Finally, we use RANSAC (Random Sample Consensus) to fit the segmented surface and optimize the fitting result. Experimental results show that, compared to commonly used facade segmentation algorithms, the proposed method yields more accurate segmentation results, providing a robust data foundation for subsequent 3D model reconstruction of buildings.
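The final step of this abstract, fitting a surface to the segmented points with RANSAC, can be illustrated with a minimal plane-fitting sketch. This is not the authors' implementation: the function name `fit_plane_ransac`, the distance threshold, the iteration count, and the fixed seed are all assumptions chosen for illustration, and the dual-scale descriptor selection and threshold segmentation steps are not shown.

```python
import random


def fit_plane_ransac(points, threshold=0.01, iterations=200, seed=0):
    """Fit a plane n.x + d = 0 to 3D points with a basic RANSAC loop.

    Hypothetical helper: returns ((normal, d), inliers) for the candidate
    plane that has the most points within `threshold` of it.
    """
    rng = random.Random(seed)
    best_plane, best_inliers = None, []
    for _ in range(iterations):
        # Three random points define a candidate plane.
        p, q, r = rng.sample(points, 3)
        u = [q[i] - p[i] for i in range(3)]
        v = [r[i] - p[i] for i in range(3)]
        # Plane normal is the cross product of two edge vectors.
        n = [u[1] * v[2] - u[2] * v[1],
             u[2] * v[0] - u[0] * v[2],
             u[0] * v[1] - u[1] * v[0]]
        norm = sum(c * c for c in n) ** 0.5
        if norm < 1e-12:
            continue  # collinear sample, no unique plane
        n = [c / norm for c in n]
        d = -sum(n[i] * p[i] for i in range(3))
        # Inliers are points whose distance to the plane is small.
        inliers = [pt for pt in points
                   if abs(sum(n[i] * pt[i] for i in range(3)) + d) <= threshold]
        if len(inliers) > len(best_inliers):
            best_plane, best_inliers = (n, d), inliers
    return best_plane, best_inliers
```

In practice the inlier set returned here would typically be refined with a least-squares fit, which is one plausible reading of "optimize the fitting result" but is not confirmed by the abstract.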
Funding: The National Natural Science Foundation of China (No. 51705469); the Zhengzhou University Youth Talent Enterprise Cooperative Innovation Team Support Program Project (2021, 2022).
Abstract: Obtaining a 3D feature description with high descriptiveness and robustness under complicated nuisances is a significant and challenging task in 3D feature matching. This paper proposes a novel feature description consisting of a stable local reference frame (LRF) and a feature descriptor based on local spatial voxels. First, an improved LRF was designed by incorporating distance weights into the Z- and X-axis calculations. Subsequently, based on the LRF and voxel segmentation, a feature descriptor based on voxel homogenization was proposed. Moreover, uniform segmentation of cube voxels was performed, considering the eigenvalues of each voxel and its neighboring voxels, thereby enhancing the stability of the description. The descriptor was rigorously tested and evaluated on three public datasets, where it exhibited high descriptiveness, robustness, and superior performance compared with other current methods. Furthermore, the descriptor was applied to a 3D registration trial, and the results demonstrated the reliability of the approach.
Funding: Project (60873010) supported by the National Natural Science Foundation of China; Projects (N090504005, N090604012, N090104001) supported by the Fundamental Research Funds for the Central Universities; Project (NCET-05-0288) supported by the Program for New Century Excellent Talents in University.
Abstract: A novel histogram descriptor for global feature extraction and description was presented. Three elementary primitives for a 2×2 pixel grid were defined. The complex primitives were computed by matrix transforms. These primitives and their equivalence classes were used to compute, for an image, a feature image that consisted of the three elementary primitives. A histogram of the transformed image was then used to extract and describe the features. Furthermore, the novel histogram descriptor was compared with the gray histogram and the edge histogram with regard to feature vector dimension and retrieval performance. The experimental results show that the novel histogram not only reduces the effect of noise and illumination change, but also yields a feature vector of lower dimension. In addition, the system using the novel histogram has better retrieval performance.
Funding: Supported in part by the Postgraduate Research & Practice Innovation Program of Jiangsu Province under Grant KYCX 20_0758, in part by the Science and Technology Research Project of Jiangsu Public Security Department under Grant 2020KX005, in part by the General Project of Philosophy and Social Science Research in Colleges and Universities in Jiangsu Province under Grant 2022SJYB0473, and in part by the “Cyberspace Security” Construction Project of Jiangsu Provincial Key Discipline during the “14th Five Year Plan”.
Abstract: As an indispensable part of identity authentication, offline writer identification plays a notable role in biology, forensics, and historical document analysis. However, identifying handwriting efficiently, stably, and quickly is still challenging due to the method of extracting and processing handwriting features. In this paper, we propose an efficient system to identify writers through handwritten images, which integrates local and global features from similar handwritten images. The local features are modeled by effective aggregate processing, and global features are extracted through transfer learning. Specifically, the proposed system employs a pre-trained Residual Network to mine the relationship between large image sets and specific handwritten images, while the vector of locally aggregated descriptors with double power normalization is employed in aggregating local and global features. Moreover, handwritten image segmentation, preprocessing, enhancement, optimization of the neural network architecture, and normalization for local and global features are exploited, significantly improving system performance. The proposed system is evaluated on the Computer Vision Lab (CVL) datasets and the International Conference on Document Analysis and Recognition (ICDAR) 2013 datasets. The results show that it exhibits good generalizability and achieves state-of-the-art performance. Furthermore, the system performs better when trained on complete handwriting patches with the normalization method. The experimental results indicate that it is important to segment handwriting reasonably while dealing with handwriting overlap, which reduces visual burstiness.
Funding: Supported by the National Defense Fundamental Research Project (No. JCKY2020404C004) and the Sichuan Science and Technology Program (No. 22NSFSC0044).
Abstract: This study proposes a novel feature extraction approach for radionuclide identification to increase the precision of identification of the gamma-ray energy spectrum set. For easier utilization of the information contained in the spectra, the vectors of the gamma-ray energy spectra from Euclidean space, which are fingerprints of the different types of radionuclides, were mapped to matrices in the Banach space. Subsequently, to make the spectra in matrix form easier to apply to image-based deep learning frameworks, the matrices of the gamma-ray energy spectra were mapped to images in the RGB color space. A deep convolutional neural network (DCNN) model was constructed and trained on the ImageNet dataset. The mapped gamma-ray energy spectrum images were applied as inputs to the DCNN model, and the corresponding outputs of the convolution layers and fully connected layers were transferred as descriptors of the images to construct a new classification model for radionuclide identification. The transferred image descriptors consist of global and local features, where the activation vectors of fully connected layers are global features, and activations from convolution layers are local features. A series of comparative experiments between the transferred image descriptors, peak information, and features extracted by the histogram of oriented gradients (HOG) and the scale-invariant feature transform (SIFT), using both synthetic and measured data, were applied to 11 classical classifiers. The results demonstrate that although the gamma-ray energy spectrum images are completely unfamiliar to the DCNN model and have not been used in the pre-training process, the transferred image descriptors achieved good classification results. The global features have strong semantic information and achieve an average accuracy of 92.76% and 94.86% on the synthetic dataset and measured dataset, respectively. The results of the statistical comparison of features demonstrate that the proposed approach outperforms the peak-searching-based method, HOG, and SIFT on the synthetic and measured datasets.
Funding: Project (60873010) supported by the National Natural Science Foundation of China; Project supported by the Doctor Startup Foundation of Shenyang University of Technology, China.
Abstract: A modified Fourier descriptor was presented that uses information from a local space more efficiently. After the boundary pixel set of an object was computed, the centroid distance approach was used to compute the shape signature in the local space. Each pair of shape signature value and boundary pixel gray level was used as a point in a feature space. Then, the Fourier transform was used to combine the point information in the feature space so that the shape features could be computed. It is proved theoretically that the shape features from modified Fourier descriptors are invariant to translation, rotation, scaling, and change of start point. Measurements of the retrieval performance of the systems also verify that the shape features from modified Fourier descriptors are more discriminative than those from other Fourier descriptors.
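The invariances claimed above can be illustrated with the standard centroid-distance Fourier descriptor. The sketch below is a simplified stand-in, not the modified descriptor of the abstract: it ignores the boundary pixel gray level, and the function names and the choice of eight coefficients are assumptions.

```python
import cmath
import math


def centroid_distance_signature(boundary):
    """Distance of every boundary point to the centroid of the boundary."""
    cx = sum(x for x, _ in boundary) / len(boundary)
    cy = sum(y for _, y in boundary) / len(boundary)
    return [math.hypot(x - cx, y - cy) for x, y in boundary]


def fourier_descriptor(boundary, n_coeffs=8):
    """Normalized DFT magnitudes of the centroid-distance signature.

    The centroid removes translation, distances remove rotation, dividing
    by the DC term removes scale, and taking magnitudes removes the
    dependence on the start point.
    """
    r = centroid_distance_signature(boundary)
    n = len(r)

    def dft(k):
        # Plain O(n) evaluation of one DFT coefficient.
        return sum(r[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))

    dc = abs(dft(0))
    return [abs(dft(k)) / dc for k in range(1, n_coeffs + 1)]
```

Applying a rotation, uniform scaling, translation, and cyclic shift of the start point to a sampled boundary should leave the returned values unchanged up to floating-point error.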
Funding: Supported by the National Natural Science Foundation of China (No. 60772134, 60902081, 60902052), the 111 Project (No. B08038), and the Fundamental Research Funds for the Central Universities (No. 72105457).
Abstract: A novel moving object segmentation method is proposed in this paper. A modified three-dimensional recursive search (3DRS) algorithm is used to obtain motion information accurately. A motion feature descriptor (MFD) is designed to describe the motion feature of each block in a picture based on motion intensity, motion in occlusion areas, and motion correlation among neighbouring blocks. Then, a fuzzy C-means clustering algorithm (FCM) is applied to those MFDs to segment moving objects. Moreover, a new parameter named gathering degree is used to distinguish foreground moving objects from background motion. Experimental results demonstrate the effectiveness of the proposed method.
Funding: The Key Technologies R&D Program of the National Ministry of Science (2018ZX10734-401); the National Science Foundation Program of China (61901031, 62071048, 62025104, 61971040); the Beijing Nova Program from Beijing Municipal Science & Technology Commission (Z201100006820004).
Abstract: Background: Feature matching technology is vital to establish the association between virtual and real objects in virtual reality and augmented reality systems. Specifically, it provides them with the ability to match a dynamic scene. Many image matching methods, most of which are deep learning-based, have been proposed over the past few decades. However, vessel fracture, stenosis, artifacts, high background noise, and uneven vessel gray-scale make vessel matching in coronary angiography extremely difficult. Traditional matching methods perform poorly in this regard. Methods: In this study, a topological distance-constrained feature descriptor learning model is proposed. This model regards the topology of the vasculature as the connection relationship of the centerline. The topological distance combines the geodesic distance between the input patches and constrains the descriptor network by maximizing the feature difference between connected and unconnected patches to obtain more useful potential feature relationships. Results: Matching patches of different sequences of angiographic images are generated for the experiments. The matching accuracy and stability of the proposed method are superior to those of the existing models. Conclusions: The proposed method solves the problem of matching coronary angiographies by generating a topological distance-constrained feature descriptor.
Funding: National Science Foundation of China (Nos. 41801388, 41901397).
Abstract: Existing multi-source contour matching studies have focused on matching methods that consider topological relations and on similarity measurement based on spatial Euclidean distance, but they do not take the geometric features of contours into account, which may lead to mismatching at map boundaries and in areas with dense contours or extreme terrain changes. In light of this, a matching strategy from coarse to precise based on contour geometric features is put forward. The proposed matching strategy can be described as follows. First, the point sequence is converted to a feature sequence according to a feature descriptive function based on curvature and the angle of the normal vector. Then the level of similarity among multi-source contours is calculated using the longest common subsequence solution. Accordingly, identical contours can be matched based on the calculated results. In the experiments, the reliability and efficiency of the matching method are verified using simulated datasets and real datasets, respectively. The results show that the proposed contour matching strategy has high matching precision and good applicability.
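The two steps described above, converting a point sequence into a feature sequence and scoring similarity with the longest common subsequence, can be sketched as follows. The abstract does not give the feature descriptive function, so the quantized turning angle used here is an assumed stand-in for the curvature and normal-angle features; the function names, the bin count, and the similarity ratio are likewise hypothetical.

```python
import math


def turning_codes(points, n_bins=8):
    """Encode a polyline as quantized turning angles at interior vertices.

    Crude stand-in for a curvature-based feature function: angles that
    fall near a bin edge may flip between neighbouring codes.
    """
    codes = []
    for (x0, y0), (x1, y1), (x2, y2) in zip(points, points[1:], points[2:]):
        a1 = math.atan2(y1 - y0, x1 - x0)
        a2 = math.atan2(y2 - y1, x2 - x1)
        # Wrap the heading change into [-pi, pi).
        turn = (a2 - a1 + math.pi) % (2 * math.pi) - math.pi
        codes.append(int((turn + math.pi) / (2 * math.pi) * n_bins) % n_bins)
    return codes


def lcs_length(a, b):
    """Length of the longest common subsequence, row-by-row dynamic programming."""
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0]
        for j, y in enumerate(b, 1):
            cur.append(prev[j - 1] + 1 if x == y else max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def lcs_similarity(a, b):
    """LCS length divided by the longer sequence length, in [0, 1]."""
    if not a or not b:
        return 0.0
    return lcs_length(a, b) / max(len(a), len(b))
```

Because the codes depend only on differences between consecutive points, two copies of a contour that differ by a translation produce the same feature sequence and a similarity of 1.0.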
Abstract: Fourier descriptors are used as features for 3-D aircraft classification and pose determination from a 2-D image recorded at an arbitrary viewing angle. Using a feature ranking of the Fourier descriptors, a classification procedure based on the fast nearest neighbour rule is proposed that reduces the time needed to match an unknown aircraft by searching only part of the library. Test results on some typical examples indicate that the method is generally applicable and efficient for 3-D aircraft recognition.
Funding: The National Natural Science Foundation of China (No. 60271033)
Abstract: A new active shape model (ASM) is presented for segmenting lung fields from chest radiographs. It is driven by a scale invariant feature transform (SIFT) local descriptor instead of the normalized first-order derivative profiles of the original formulation. The modified SIFT local descriptor, which is more distinctive than general intensity and gradient features, is used to characterize the image features in the vicinity of each pixel at each resolution level during the segmentation optimization procedure. Experimental results on a publicly available database show that the proposed method is more robust and accurate than the original ASM in terms of average overlap percentage and average contour distance when segmenting the lung fields.
Abstract: In protein sequence classification research, it is common to convert a variable-length protein sequence into a fixed-length numerical vector using various descriptors, for instance k-mer composition. Such position-independent descriptors are useful because they can be applied to sequences of any length; however, the positional information of subsequences is discarded even though it might contribute strongly to classification performance. To solve this problem, we divide the original sequence into several segments and then calculate the numerical features for each of them. This partially introduces positional information (for instance, the composition of serine in the anterior and posterior segments of a sequence). Through comprehensive experiments on the number of segments and the length of the overlapping region, we found that our classification approach with sequence segmentation and feature selection is effective in improving performance. We evaluated our approach on three protein classification problems and achieved significant improvement in all cases where the dataset had sufficient amino acids in each sequence. This result shows the potential of using additional segments in protein sequence classification and for other sequence problems in bioinformatics.
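The segmentation idea in this abstract can be sketched as follows, as an illustration only. The sketch assumes equal-length segments, a symmetric overlap given in residues, and k-mer frequencies over the 20 standard amino acids. The function names and these choices are hypothetical, and the feature selection step mentioned in the abstract is not shown.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def kmer_composition(seq, k=1):
    """Frequency of every possible k-mer in seq (position-independent)."""
    kmers = ["".join(p) for p in product(AMINO_ACIDS, repeat=k)]
    n_windows = max(len(seq) - k + 1, 0)
    counts = Counter(seq[i:i + k] for i in range(n_windows))
    total = max(n_windows, 1)  # avoid division by zero for very short input
    return [counts[m] / total for m in kmers]


def split_segments(seq, n_segments, overlap=0):
    """Split seq into n_segments pieces, each widened by `overlap` residues
    on both sides (clipped at the sequence ends)."""
    n = len(seq)
    bounds = [i * n // n_segments for i in range(n_segments + 1)]
    return [
        seq[max(bounds[i] - overlap, 0):min(bounds[i + 1] + overlap, n)]
        for i in range(n_segments)
    ]


def segmented_features(seq, n_segments=2, k=1, overlap=0):
    """Concatenate the k-mer composition of each segment into one vector."""
    features = []
    for segment in split_segments(seq, n_segments, overlap):
        features.extend(kmer_composition(segment, k))
    return features
```

With `n_segments=2` and `k=1`, the vector has 40 entries, so the serine frequency of the anterior half and of the posterior half appear as separate features, which is the kind of partial positional information the abstract describes.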
Funding: Supported by the CAS Project for Young Scientists in Basic Research (YSBR-005), the National Natural Science Foundation of China (No. 22073089 and No. 22033007), the Anhui Initiative in Quantum Information Technologies (AHY090200), and the Fundamental Research Funds for Central Universities (WK2060000017).
Abstract: Machine learning potentials are promising in atomistic simulations because their accuracy is comparable to that of first-principles theory at a much lower computational cost. However, the reliability, speed, and transferability of atomistic machine learning potentials depend strongly on the way atomic configurations are represented. A wise choice of the descriptors used as input for the machine learning program is the key to a successful machine learning representation. Here we develop a simple and efficient strategy to automatically select an optimal set of linearly independent atomic features from a large pool of candidates, based on the correlations that are intrinsic to the training data. Through applications to the construction of embedded atom neural network potentials for several benchmark molecules, using less redundant, linearly independent embedded density descriptors, we demonstrate the efficiency and accuracy of this new strategy. The proposed algorithm can greatly simplify the initial selection of atomic features and vastly improve the performance of atomistic machine learning potentials.
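The abstract does not spell out the selection algorithm. As a rough illustration of correlation-based pruning in general, the sketch below greedily keeps a feature only if its absolute Pearson correlation with every feature kept so far stays below a threshold. This is a simplified stand-in and not the authors' method: pairwise correlation is a weaker criterion than linear independence, and the function names and the 0.95 threshold are assumptions.

```python
import math


def pearson(u, v):
    """Pearson correlation of two equal-length value lists (0.0 if constant)."""
    n = len(u)
    mean_u, mean_v = sum(u) / n, sum(v) / n
    cov = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    var_u = sum((a - mean_u) ** 2 for a in u)
    var_v = sum((b - mean_v) ** 2 for b in v)
    if var_u == 0 or var_v == 0:
        return 0.0
    return cov / math.sqrt(var_u * var_v)


def select_features(columns, threshold=0.95):
    """Greedy filter: return the indices of the feature columns that are kept.

    `columns` holds one list of values per candidate feature, evaluated over
    the training data. Constant columns carry no information and are skipped.
    """
    selected = []
    for idx, col in enumerate(columns):
        if len(set(col)) <= 1:
            continue
        if all(abs(pearson(col, columns[j])) < threshold for j in selected):
            selected.append(idx)
    return selected
```

Because the filter is greedy, the result depends on the order of the candidate columns. A rank-revealing factorisation of the feature matrix would be a closer match to true linear independence.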
Abstract: A hierarchical retrieval scheme for an accessory image database is proposed, based on the contour and region features of textile industrial accessories. First, the smallest enclosed rectangle[1] feature (degree of accessory coordination) is used to filter the image database and narrow the image search scope. After the accessory contour and region information are extracted, a fused multi-feature combining the centroid distance Fourier descriptor and the distance distribution histogram is adopted to complete the image retrieval accurately. All of the above features are invariant under translation, scaling, and rotation. Test results on an image database of 1,000 accessory images demonstrate that the method is effective and practical, with high accuracy and fast speed.
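A centroid distance Fourier descriptor of the kind named in this abstract can be sketched in a few lines. The sketch assumes the contour is given as an ordered list of (x, y) points sampled along the boundary. The function names, the number of coefficients, and the normalisation by the zero-frequency term are common textbook choices and are assumptions here, not details taken from the paper.

```python
import cmath
import math


def centroid_distance_signature(contour):
    """Distance from each contour point to the centroid of the points.

    Measuring from the centroid removes the dependence on translation.
    """
    n = len(contour)
    cx = sum(x for x, _ in contour) / n
    cy = sum(y for _, y in contour) / n
    return [math.hypot(x - cx, y - cy) for x, y in contour]


def fourier_descriptor(contour, n_coeffs=8):
    """Magnitudes of the first n_coeffs DFT coefficients of the signature,
    divided by the zero-frequency magnitude."""
    r = centroid_distance_signature(contour)
    n = len(r)
    mags = []
    for k in range(n_coeffs + 1):
        coeff = sum(r[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        # Keeping only the magnitude discards the phase, so the result does
        # not depend on which contour point is listed first.
        mags.append(abs(coeff))
    # Dividing by the zero-frequency term cancels a uniform scale factor.
    return [m / mags[0] for m in mags[1:]]
```

Rotation leaves the centroid distances unchanged, so together with the two normalisations above the descriptor is unaffected by translation, scaling, and rotation of the sampled contour, which is the property the abstract relies on.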
Funding: The UEFISCDI PN-II-PT-PCCA-2011-3.2-1162 Research Grant; the CRUS SCIEX NMS-CH Fellowship nr. 12.135
Abstract: The matching of local descriptors is currently a key tool in computer vision, and a wide variety of methods for tasks such as image classification, object recognition and tracking, image stitching, and data mining rely on it. Local feature description techniques are usually developed to provide invariance to the photometric variations specific to the acquisition of natural images, but they are nonetheless also used with biomedical imaging. It has previously been shown that the matching of gradient-based descriptors is affected by image modifications specific to Confocal Scanning Laser Microscopy (CSLM). In this paper we extend our previous work in this direction and show how specific acquisition or post-processing methods alleviate or accentuate this problem.
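Abstract: 3D local feature description is an important task in 3D computer vision. Real-world scenes contain nuisances such as noise, occlusion, and clutter, which make accurate and robust 3D local feature description highly challenging. To improve description performance, a local surface variation based statistics histogram (LSVSH) descriptor is proposed. First, a new attribute that does not depend on the local reference axis (LRA), called the curvature attribute, is designed to enhance the descriptor's robustness to LRA errors. Then, the local space is partitioned along the radial direction, and three angle attributes and one curvature attribute are accumulated in each subspace to generate the LSVSH descriptor, giving a comprehensive and robust description of local surface information. Extensive experiments on four datasets (B3R, U3M, U3OR, and QuLD) show that the area under the recall-precision curve (AUCpr) of LSVSH is 0.95, 0.70, 0.54, and 0.10 on the four datasets, respectively, which is better than that of existing local feature descriptors. The correct registration rate on the U3M dataset and the correct recognition rate on the U3OR dataset reach 70% and 100%, respectively, verifying the effectiveness of LSVSH for object registration and recognition tasks.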