In the era of big data, the dimensionality of data is increasing dramatically in many domains. To deal with high dimensionality, online feature selection becomes critical in big data mining. Recently, online selection...In the era of big data, the dimensionality of data is increasing dramatically in many domains. To deal with high dimensionality, online feature selection becomes critical in big data mining. Recently, online selection of dynamic features has received much attention. In situations where features arrive sequentially over time, we need to perform online feature selection upon feature arrivals. Meanwhile, considering grouped features, it is necessary to deal with features arriving by groups. To handle these challenges, some state-of- the-art methods for online feature selection have been proposed. In this paper, we first give a brief review of traditional feature selection approaches. Then we discuss specific problems of online feature selection with feature streams in detail. A comprehensive review of existing online feature selection methods is presented by comparing with each other. Finally, we discuss several open issues in online feature selection.展开更多
Based on the analysis of the unique shapes and writing styles of Uyghur characters,we design a framework for prototype character recognition system and carry out a systematic theoretical and experimental research on i...Based on the analysis of the unique shapes and writing styles of Uyghur characters,we design a framework for prototype character recognition system and carry out a systematic theoretical and experimental research on its modules.In the preprocessing procedure,we use the linear and nonlinear normalization based on dot density method.Both structural and statistical features are extracted due to the fact that there are some very similar characters in Uyghur literature.In clustering analysis,we adopt the dynamic clustering algorithm based on the minimum spanning tree(MST),and use the k-nearest neighbor matching classification as classifier.The testing results of prototype system show that the recognition rates for characters of the four different types(independent,suffix,intermediate,and initial type) are 74.67%,70.42%,63.33%,and 72.02%,respectively;the recognition rates for the case of five candidates for those characters are 94.34%,94.19%,93.15%,and 95.86%,respectively.The ideas and methods used in this paper have some commonality and usefulness for the recognition of other characters that belong to Altaic languages family.展开更多
Multiple object tracking (MOT) poses many difficulties to conventional well-studied single object tracking (SOT) algorithms, such as severe expansion of configuration space, high complexity of motion conditions, a...Multiple object tracking (MOT) poses many difficulties to conventional well-studied single object tracking (SOT) algorithms, such as severe expansion of configuration space, high complexity of motion conditions, and visual ambiguities among nearby targets, among which the visual ambiguity problem is the central challenge. In this paper, we address this problem by embedding adaptive mixture observation models (AMOM) into a mixture tracker which is implemented in Particle Filter framework. In AMOM, the extracted multiple features for appearance description are combined according to their discriminative power between ambiguity prone objects, where the discriminability of features are evaluated by online entropy-based feature selection techniques. The induction of AMOM can help to surmount the incapability of conventional mixture tracker in handling object occlusions, and meanwhile retain its merits of flexibility and high efficiency. The final experiments show significant improvement in MOT scenarios compared with other methods.展开更多
基金This work was supported in part by the National Key Research and Development Program of China (2016YFB 1000901), the Program for Changjiang Scholars and Innovative Research Team in University (PCSIRT) of the Ministry of Education, China (IRT13059), the National Basic Research Program (973 Program) of China (2013CB329604), the Specialized Research Fund for the Doctoral Program of Higher Education (20130111110011), and the National Natural Science Foundation of China (Grant Nos. 61273292, 61229301, 61503112, 61673152).
文摘In the era of big data, the dimensionality of data is increasing dramatically in many domains. To deal with high dimensionality, online feature selection becomes critical in big data mining. Recently, online selection of dynamic features has received much attention. In situations where features arrive sequentially over time, we need to perform online feature selection upon feature arrivals. Meanwhile, considering grouped features, it is necessary to deal with features arriving by groups. To handle these challenges, some state-of- the-art methods for online feature selection have been proposed. In this paper, we first give a brief review of traditional feature selection approaches. Then we discuss specific problems of online feature selection with feature streams in detail. A comprehensive review of existing online feature selection methods is presented by comparing with each other. Finally, we discuss several open issues in online feature selection.
基金Supported by the National Natural Science Foundation of China (61065001)
文摘Based on the analysis of the unique shapes and writing styles of Uyghur characters,we design a framework for prototype character recognition system and carry out a systematic theoretical and experimental research on its modules.In the preprocessing procedure,we use the linear and nonlinear normalization based on dot density method.Both structural and statistical features are extracted due to the fact that there are some very similar characters in Uyghur literature.In clustering analysis,we adopt the dynamic clustering algorithm based on the minimum spanning tree(MST),and use the k-nearest neighbor matching classification as classifier.The testing results of prototype system show that the recognition rates for characters of the four different types(independent,suffix,intermediate,and initial type) are 74.67%,70.42%,63.33%,and 72.02%,respectively;the recognition rates for the case of five candidates for those characters are 94.34%,94.19%,93.15%,and 95.86%,respectively.The ideas and methods used in this paper have some commonality and usefulness for the recognition of other characters that belong to Altaic languages family.
基金Supported by National Natural Science Foundation of China (Grant No.60573167)National High-Tech Research and Development Program of China (Grant No.2006AA01Z118)National Basic Research Program of China (Grant No.2006CB303103)
文摘Multiple object tracking (MOT) poses many difficulties to conventional well-studied single object tracking (SOT) algorithms, such as severe expansion of configuration space, high complexity of motion conditions, and visual ambiguities among nearby targets, among which the visual ambiguity problem is the central challenge. In this paper, we address this problem by embedding adaptive mixture observation models (AMOM) into a mixture tracker which is implemented in Particle Filter framework. In AMOM, the extracted multiple features for appearance description are combined according to their discriminative power between ambiguity prone objects, where the discriminability of features are evaluated by online entropy-based feature selection techniques. The induction of AMOM can help to surmount the incapability of conventional mixture tracker in handling object occlusions, and meanwhile retain its merits of flexibility and high efficiency. The final experiments show significant improvement in MOT scenarios compared with other methods.