期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Supervised Learning Algorithm on Unstructured Documents for the Classification of Job Offers: Case of Cameroun
1
作者 Fritz Sosso Makembe Roger Atsa Etoundi Hippolyte Tapamo 《Journal of Computer and Communications》 2023年第2期75-88,共14页
Nowadays, in data science, supervised learning algorithms are frequently used to perform text classification. However, African textual data, in general, have been studied very little using these methods. This article ... Nowadays, in data science, supervised learning algorithms are frequently used to perform text classification. However, African textual data, in general, have been studied very little using these methods. This article notes the particularity of the data and measures the level of precision of predictions of naive Bayes algorithms, decision tree, and SVM (Support Vector Machine) on a corpus of computer jobs taken on the internet. This is due to the data imbalance problem in machine learning. However, this problem essentially focuses on the distribution of the number of documents in each class or subclass. Here, we delve deeper into the problem to the word count distribution in a set of documents. The results are compared with those obtained on a set of French IT offers. It appears that the precision of the classification varies between 88% and 90% for French offers against 67%, at most, for Cameroonian offers. The contribution of this study is twofold. Indeed, it clearly shows that, in a similar job category, job offers on the internet in Cameroon are more unstructured compared to those available in France, for example. Moreover, it makes it possible to emit a strong hypothesis according to which sets of texts having a symmetrical distribution of the number of words obtain better results with supervised learning algorithms. 展开更多
关键词 Job Offer underemployment Text Classification Imbalanced Data Symmetric Word Distribution Supervised Learning
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部