Funding: Supported by the Special Research Foundation for Young Teachers of Sun Yat-sen University (Grant No. 2000-3161101) and the Humanity and Social Science Youth Foundation of the Ministry of Education of China (Grant No. 08JC870013).
Abstract:
Purpose: This study explores the potential of natural language processing (NLP) and machine learning (ML) techniques, and aims to find a feasible strategy and an effective approach to the named entity recognition (NER) task for Web-oriented person-specific information extraction.
Design/methodology/approach: An SVM-based multi-class classification approach, combined with a rich set of features derived from state-of-the-art NLP techniques, is proposed for the NER task. A group of experiments was designed to investigate the influence of various NLP-based features, especially semantic features, on system performance. Optimal parameter settings for the SVM models, including the kernel function, the margin parameter, and the context window size, were also explored experimentally.
Findings: The SVM-based multi-class classification approach proved effective for the NER task. The work shows that NLP-based features, particularly semantic features, are of great importance in data-driven NE recognition. The study indicates that a higher-order kernel function may not be desirable for this specific classification problem in practical applications; the simple linear-kernel SVM model performed better in this case. Moreover, the modified SVM models with an uneven margin parameter are more general and flexible, and were shown to handle the imbalanced data problem better.
Research limitations/implications: The SVM-based approach to the NER problem has been shown to be effective only on limited experimental data. Further research needs to be conducted on large batches of real Web data. In addition, the performance of the NER system needs to be tested when it is incorporated into a complete information extraction (IE) framework.
Originality/value: The specially designed experiments make it feasible to fully explore the characteristics of the data and to obtain optimal parameter settings for the NER task, leading to favorable recall, precision, and F1 scores. The overall system performance (F1 value) for all types of named entities can exceed 88.6%, which meets the requirements of practical application.
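The abstract names a context window size as a tuned parameter and reports recall, precision, and F1, but gives no implementation details. The following is only a minimal sketch of what those two ideas might look like in code. The function names, the `<PAD>` token, the feature-key format, the token-level evaluation, and the example sentence are all assumptions made for illustration; none of them come from the study.

```python
from typing import Dict, Sequence, Tuple

PAD = "<PAD>"  # hypothetical padding token for positions outside the sentence


def window_features(tokens: Sequence[str], index: int, size: int = 2) -> Dict[str, str]:
    """Collect lowercased words within `size` positions of tokens[index].

    Keys encode the relative offset, e.g. "w[-1]" is the previous word.
    """
    features: Dict[str, str] = {}
    for offset in range(-size, size + 1):
        pos = index + offset
        word = tokens[pos].lower() if 0 <= pos < len(tokens) else PAD
        features[f"w[{offset:+d}]"] = word
    return features


def precision_recall_f1(
    gold: Sequence[str], predicted: Sequence[str], label: str
) -> Tuple[float, float, float]:
    """Token-level precision, recall, and F1 for a single entity label."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == label and p == label)
    fp = sum(1 for g, p in pairs if g != label and p == label)
    fn = sum(1 for g, p in pairs if g == label and p != label)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Made-up sentence and labels, for illustration only.
    tokens = ["Dr.", "Jane", "Smith", "joined", "MIT"]
    gold = ["O", "PER", "PER", "O", "ORG"]
    predicted = ["O", "PER", "O", "O", "ORG"]
    print(window_features(tokens, 1, size=1))
    print(precision_recall_f1(gold, predicted, "PER"))
```

In a setup like the one the abstract describes, such per-token feature dictionaries would be vectorized and passed to a multi-class SVM. Widening `size` adds more context at the cost of a larger feature space, which is presumably why the window size was tuned experimentally.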
Funding: Supported by the 2021 Higher Education Teaching Reform Research and Practice Project of SEAC (Grant No. 221057), the 2021 Ministry of Education Industry-University Cooperation Collaborative Education Project (Grant No. 202102646007), and the 2022 Guizhou Province Gold Course Construction Project.
Abstract: This study investigates the application of a support vector machine (SVM)-based model for classifying students' learning abilities in system modeling and simulation courses, with the aim of enhancing personalized education. A small dataset, collected from a pre-course questionnaire, is augmented with integer data to improve model performance. The SVM model achieves an accuracy of 95.3%. This approach not only benefits courses at Guizhou Minzu University but also has potential for broader application in similar programs at other institutions. The research provides a foundation for creating personalized learning paths using AI technologies, such as AI-generated content, large language models, and knowledge graphs, and offers insights for innovative educational practices.