This paper presents an efficient prediction model for a good learning environment using Random Forest(RF)classifier.It consists of a series of modules;data preprocessing,data normalization,data split andfinally classi...This paper presents an efficient prediction model for a good learning environment using Random Forest(RF)classifier.It consists of a series of modules;data preprocessing,data normalization,data split andfinally classification or prediction by the RF classifier.The preprocessed data is normalized using minmax normalization often used before modelfitting.As the input data or variables are measured at different scales,it is necessary to normalize them to contribute equally to the modelfitting.Then,the RF classifier is employed for course selection which is an ensemble learning method and k-fold cross-validation(k=10)is used to validate the model.The proposed Prediction Model for Course Selection(PMCS)system is considered a multi-class problem that predicts the course for a particular learner with three complexity levels,namely low,medium and high.It is operated under two modes;locally and globally.The former considers the gender of the learner and the later does not consider the gender of the learner.The database comprises the learner opinions from 75 males and 75 females per category(low,medium and high).Thus the system uses a total of 450 samples to evaluate the performance of the PMCS system.Results show that the system’s performance,while using locally i.e.,gender-wise has slightly higher performance than the global system.The RF classifier with 75 decision trees in the global system provides an average accuracy of 97.6%,whereas in the local system it is 97%(male)and 97.6%(female).The overall performance of the RF classifier with 75 trees is better than 25,50 and 100 decision trees in both local and global systems.展开更多
文摘This paper presents an efficient prediction model for a good learning environment using Random Forest(RF)classifier.It consists of a series of modules;data preprocessing,data normalization,data split andfinally classification or prediction by the RF classifier.The preprocessed data is normalized using minmax normalization often used before modelfitting.As the input data or variables are measured at different scales,it is necessary to normalize them to contribute equally to the modelfitting.Then,the RF classifier is employed for course selection which is an ensemble learning method and k-fold cross-validation(k=10)is used to validate the model.The proposed Prediction Model for Course Selection(PMCS)system is considered a multi-class problem that predicts the course for a particular learner with three complexity levels,namely low,medium and high.It is operated under two modes;locally and globally.The former considers the gender of the learner and the later does not consider the gender of the learner.The database comprises the learner opinions from 75 males and 75 females per category(low,medium and high).Thus the system uses a total of 450 samples to evaluate the performance of the PMCS system.Results show that the system’s performance,while using locally i.e.,gender-wise has slightly higher performance than the global system.The RF classifier with 75 decision trees in the global system provides an average accuracy of 97.6%,whereas in the local system it is 97%(male)and 97.6%(female).The overall performance of the RF classifier with 75 trees is better than 25,50 and 100 decision trees in both local and global systems.