User profiles representing users’preferences and interests play an important role in many applications of personalized recommendation.With the rapid growth of social platforms,there is a critical need for efficient s...User profiles representing users’preferences and interests play an important role in many applications of personalized recommendation.With the rapid growth of social platforms,there is a critical need for efficient solutions to learn user profiles from the information they shared on social platforms so as to improve the quality of recommendation services.The problem of user profile learning is significantly challenging due to difficulty in handling data from multiple sources,in different formats and often associated with uncertainty.In this paper,we introduce an integrated approach that combines advanced Machine Learning techniques with evidential reasoning based on Dempster-Shafer theory of evidence for user profiling and recommendation.The developed methods for user profile learning and multi-criteria collaborative filtering are demonstrated with experimental results and analysis that show the effectiveness and practicality of the integrated approach.A proposal for extending multi-criteria recommendation systems by incorporating user profiles learned from different sources of data into the recommendation process so as to provide better recommendation capabilities is also highlighted.展开更多
Time series forecasting research area mainly focuses on developing effective forecasting models toimprove prediction accuracy. An ensemble model composed of autoregressive integrated movingaverage (ARIMA), artificia...Time series forecasting research area mainly focuses on developing effective forecasting models toimprove prediction accuracy. An ensemble model composed of autoregressive integrated movingaverage (ARIMA), artificial neural network (ANN), restricted Boltzmann machines (RBM), anddiscrete wavelet transform (DWT) is presented in this paper. In the proposed model, DWT firstdecomposes time series into approximation and detail. Then Khashei and Bijari's model, which is anensemble model of ARIMA and ANN, is applied to the approximation and detail to extract their bothlinear and nonlinear components and fit the relationship between the components as a function insteadof additive relationship. Furthermore, RBM is used to perform pre-training for generating initialweights and biases based on inputs feature for ANN. Finally, the forecasted approximation and detailare combined to obtain final forecasting. The forecasting capability of the proposed model is testedwith three well-known time series: sunspot, Canadian lynx, exchange rate time series. The predictionperformance is compared to the other six forecasting models. The results indicate that the proposedmodel gives the best performance in all three data sets and all three measures (i.e. MSE, MAE andMAPE).展开更多
It is critically important for companies to screen new product projects before they are launched to the market. So far, many approaches have been developed for tackling the process of screening product innovations. Du...It is critically important for companies to screen new product projects before they are launched to the market. So far, many approaches have been developed for tackling the process of screening product innovations. Due to uncertain, vague and incomplete information as well as dynamically complex process regarding to new product development (NPD), a fuzzy linguistic approach employed linguistic assessments and the fuzzy-set-based computation is reasonable for screening new products. However, such a fuzzy linguistic approach faces with various defects and limitations, such as loss of information, failing in considering the aspects related to human nature on uncertain subjective judgments etc. These defects and limitations lead to a dilemma, i.e., it's very difficult to screen new product projects reasonably and precisely. In this paper, we propose a notion of proportional 3-tuple to represent a linguistic assessment and related ignoring information, and a preference-preserving proportional 3-tuple transformation for the unification of linguistic assessments represented by proportional 3-tuples between two different linguistic term sets. On this basis, a proportional 3-tuple fuzzy linguistic representation model for screening new product projects is developed. It is shown that the proposed model is flexible to handle uncertain, vague and incomplete information related to screening new product projects. It not only allows evaluators to express their subjective judgments with different confidence levels, but is also able to deal with incomplete linguistic assessments. Ultimately, the proposed model also improves the precision and reasonability of the screening result.展开更多
Most of the earlier work on clustering mainly focused on numeric data whoseinherent geometric properties can be exploited to naturally define distance functions between datapoints. However, data mining applications fr...Most of the earlier work on clustering mainly focused on numeric data whoseinherent geometric properties can be exploited to naturally define distance functions between datapoints. However, data mining applications frequently involve many datasets that also consists ofmixed numeric and categorical attributes. In this paper we present a clustering algorithm which isbased on the k-means algorithm. The algorithm clusters objects with numeric and categoricalattributes in a way similar to k-means. The object similarity measure is derived from both numericand categorical attributes. When applied to numeric data, the algorithm is identical to the k-means.The main result of this paper is to provide a method to update the 'cluster centers' of clusteringobjects described by mixed numeric and categorical attributes in the clustering process to minimizethe clustering cost function. The clustering performance of the algorithm is demonstrated with thetwo well known data sets, namely credit approval and abalone databases.展开更多
基金This work is supported by the University of Information Technology-Vietnam National University Ho Chi Minh City under grant No.D1-2023-10.
文摘User profiles representing users’preferences and interests play an important role in many applications of personalized recommendation.With the rapid growth of social platforms,there is a critical need for efficient solutions to learn user profiles from the information they shared on social platforms so as to improve the quality of recommendation services.The problem of user profile learning is significantly challenging due to difficulty in handling data from multiple sources,in different formats and often associated with uncertainty.In this paper,we introduce an integrated approach that combines advanced Machine Learning techniques with evidential reasoning based on Dempster-Shafer theory of evidence for user profiling and recommendation.The developed methods for user profile learning and multi-criteria collaborative filtering are demonstrated with experimental results and analysis that show the effectiveness and practicality of the integrated approach.A proposal for extending multi-criteria recommendation systems by incorporating user profiles learned from different sources of data into the recommendation process so as to provide better recommendation capabilities is also highlighted.
文摘Time series forecasting research area mainly focuses on developing effective forecasting models toimprove prediction accuracy. An ensemble model composed of autoregressive integrated movingaverage (ARIMA), artificial neural network (ANN), restricted Boltzmann machines (RBM), anddiscrete wavelet transform (DWT) is presented in this paper. In the proposed model, DWT firstdecomposes time series into approximation and detail. Then Khashei and Bijari's model, which is anensemble model of ARIMA and ANN, is applied to the approximation and detail to extract their bothlinear and nonlinear components and fit the relationship between the components as a function insteadof additive relationship. Furthermore, RBM is used to perform pre-training for generating initialweights and biases based on inputs feature for ANN. Finally, the forecasted approximation and detailare combined to obtain final forecasting. The forecasting capability of the proposed model is testedwith three well-known time series: sunspot, Canadian lynx, exchange rate time series. The predictionperformance is compared to the other six forecasting models. The results indicate that the proposedmodel gives the best performance in all three data sets and all three measures (i.e. MSE, MAE andMAPE).
文摘It is critically important for companies to screen new product projects before they are launched to the market. So far, many approaches have been developed for tackling the process of screening product innovations. Due to uncertain, vague and incomplete information as well as dynamically complex process regarding to new product development (NPD), a fuzzy linguistic approach employed linguistic assessments and the fuzzy-set-based computation is reasonable for screening new products. However, such a fuzzy linguistic approach faces with various defects and limitations, such as loss of information, failing in considering the aspects related to human nature on uncertain subjective judgments etc. These defects and limitations lead to a dilemma, i.e., it's very difficult to screen new product projects reasonably and precisely. In this paper, we propose a notion of proportional 3-tuple to represent a linguistic assessment and related ignoring information, and a preference-preserving proportional 3-tuple transformation for the unification of linguistic assessments represented by proportional 3-tuples between two different linguistic term sets. On this basis, a proportional 3-tuple fuzzy linguistic representation model for screening new product projects is developed. It is shown that the proposed model is flexible to handle uncertain, vague and incomplete information related to screening new product projects. It not only allows evaluators to express their subjective judgments with different confidence levels, but is also able to deal with incomplete linguistic assessments. Ultimately, the proposed model also improves the precision and reasonability of the screening result.
文摘Most of the earlier work on clustering mainly focused on numeric data whoseinherent geometric properties can be exploited to naturally define distance functions between datapoints. However, data mining applications frequently involve many datasets that also consists ofmixed numeric and categorical attributes. In this paper we present a clustering algorithm which isbased on the k-means algorithm. The algorithm clusters objects with numeric and categoricalattributes in a way similar to k-means. The object similarity measure is derived from both numericand categorical attributes. When applied to numeric data, the algorithm is identical to the k-means.The main result of this paper is to provide a method to update the 'cluster centers' of clusteringobjects described by mixed numeric and categorical attributes in the clustering process to minimizethe clustering cost function. The clustering performance of the algorithm is demonstrated with thetwo well known data sets, namely credit approval and abalone databases.