Abstract
In this paper, we study techniques based on probability theory and matrix transformations for managing data for processing and analysis. Cluster analysis has a long research history, and over the decades its importance and its interdisciplinary connections with other research directions have become widely recognized. Probability theory and linear algebra serve as powerful tools for analyzing and mining data. The experimental results illustrate the effectiveness of the technique. In future work, we plan to conduct further theoretical analysis of the topic.