On the Matrices of Pairwise Frequencies of Categorical Attributes for Objects Classification

On the Matrices of Pairwise Frequencies of Categorical Attributes for Objects Classification

下载PDF

导出

摘要 This paper proposes two new algorithms for classifying objects with categorical attributes. These algorithms are derived from the assumption that the attributes of different object classes have different probability distributions. One algorithm classifies objects based on the distribution of the attribute frequencies, and the other classifies objects based on the distribution of the pairwise attribute frequencies described using a matrix of pairwise frequencies. Both algorithms are based on the method of invariants, which offers the simplest dependencies for estimating the probabilities of objects in each class by an average frequency of their attributes. The estimated object class corresponds to the maximum probability. This method reflects the sensory process models of animals and is aimed at recognizing an object class by searching for a prototype in information accumulated in the brain. Because these matrices may be sparse, the solution cannot be determined for some objects. For these objects, an analog of the k-nearest neighbors method is provided in which for each attribute value, the class to which the majority of the k-nearest objects in the training sample belong is determined, and the most likely class value is calculated. The efficiencies of these two algorithms were confirmed on five databases. This paper proposes two new algorithms for classifying objects with categorical attributes. These algorithms are derived from the assumption that the attributes of different object classes have different probability distributions. One algorithm classifies objects based on the distribution of the attribute frequencies, and the other classifies objects based on the distribution of the pairwise attribute frequencies described using a matrix of pairwise frequencies. Both algorithms are based on the method of invariants, which offers the simplest dependencies for estimating the probabilities of objects in each class by an average frequency of their attributes. The estimated object class corresponds to the maximum probability. This method reflects the sensory process models of animals and is aimed at recognizing an object class by searching for a prototype in information accumulated in the brain. Because these matrices may be sparse, the solution cannot be determined for some objects. For these objects, an analog of the k-nearest neighbors method is provided in which for each attribute value, the class to which the majority of the k-nearest objects in the training sample belong is determined, and the most likely class value is calculated. The efficiencies of these two algorithms were confirmed on five databases.

作者 Vladimir N. Shats

机构地区 St. Petersburg

出处《Journal of Intelligent Learning Systems and Applications》 2019年第4期65-75,共11页 智能学习系统与应用（英文）

关键词 CATEGORICAL Attributes Classification ALGORITHMS INVARIANTS of Matrix DATA DATA Processing Categorical Attributes Classification Algorithms Invariants of Matrix Data Data Processing

分类号 O17 [理学—基础数学]

引文网络
相关文献

1Jing Liang,Ruoyu Jia,Min Zhu,Henry B L Duh.VisQAC: Visual Analytics for Online Q&A Communities[J].Journal of Beijing Institute of Technology,2019,28(2):305-317.
2Bernhard Mitterauer.Method and Apparatus for Creating Problem-Solving Complexes from Individual Elements[J].Advances in Bioscience and Biotechnology,2014,5(4):311-315.
3Vladimir N. Shats.Error-Free Training via Information Structuring in the Classification Problem[J].Journal of Intelligent Learning Systems and Applications,2018,10(3):81-92. 被引量：1
4El-Sayed Ewis Omran.Improving the Prediction Accuracy of Soil Mapping through Geostatistics[J].International Journal of Geosciences,2012,3(3):574-590. 被引量：1
5Zichen Ma,Ernest Fokoué.A Comparison of Classifiers in Performing Speaker Accent Recognition Using MFCCs[J].Open Journal of Statistics,2014,4(4):258-266.
6Rami Alkhatib,Mohamad Diab,Bassam Moslem,Christophe Corbier,Mohamed El Badaoui.Gait-Ground Reaction Force Sensors Selection Based on ROC Curve Evaluation[J].Journal of Computer and Communications,2015,3(3):13-19.
7Vladimir N. Shats.Classification Based on Invariants of the Data Matrix[J].Journal of Intelligent Learning Systems and Applications,2017,9(3):35-46.
8Naser Safdarian,Nader Jafarnia Dabanloo,Gholamreza Attarodi.A New Pattern Recognition Method for Detection and Localization of Myocardial Infarction Using T-Wave Integral and Total Integral as Extracted Features from One Cycle of ECG Signal[J].Journal of Biomedical Science and Engineering,2014,7(10):818-824. 被引量：5
9Masayasu Atsumi.Attention-Guided Organized Perception and Learning of Object Categories Based on Probabilistic Latent Variable Models[J].Journal of Intelligent Learning Systems and Applications,2013,5(2):123-133.
10Mapathe Ndiaye,Eric Davaud,Daniel Ariztegui,Meissa Fall.A Semi Automated Method for Laminated Sediments Analysis[J].International Journal of Geosciences,2012,3(1):206-210.

Journal of Intelligent Learning Systems and Applications

2019年第4期

浏览历史

内容加载中请稍等...

On the Matrices of Pairwise Frequencies of Categorical Attributes for Objects Classification

相关作者

相关机构

相关主题

浏览历史