There is no reasonable scientific basis for selecting the excellent teachers of the school’s courses.To solve the practical problem,we firstly give a series of normalization models for defining the key attributes of ...There is no reasonable scientific basis for selecting the excellent teachers of the school’s courses.To solve the practical problem,we firstly give a series of normalization models for defining the key attributes of teachers’professional foundation,course difficulty coefficient,and comprehensive evaluation of teaching.Then,we define a partial weight function to calculate the key attributes,and obtain the partial recommendation values.Next,we construct a highly sparse Teaching Recommendation Factorization Machines(TRFMs)model,which takes the 5-tuples relation including teacher,course,teachers’professional foundation,course difficulty,teaching evaluation as the feature vector,and take partial recommendation value as the recommendation label.Finally,we design a novel Top-N excellent teacher recommendation algorithm based on TRFMs by course classification on the highly sparse dataset.Experimental results show that the proposed TRFMs and recommendation algorithm can accurately realize the recommendation of excellent teachers on a highly sparse historical teaching dataset.The recommendation accuracy is superior to that of the three-dimensional tensor decomposition model algorithm which also solves sparse datasets.The proposed method can be used as a new recommendation method applied to the teaching arrangements in all kinds of schools,which can effectively improve the teaching quality.展开更多
The proliferation of internet communication channels has increased telecom fraud,causing billions of euros in losses for customers and the industry each year.Fraudsters constantly find new ways to engage in illegal ac...The proliferation of internet communication channels has increased telecom fraud,causing billions of euros in losses for customers and the industry each year.Fraudsters constantly find new ways to engage in illegal activity on the network.To reduce these losses,a new fraud detection approach is required.Telecom fraud detection involves identifying a small number of fraudulent calls from a vast amount of call traffic.Developing an effective strategy to combat fraud has become challenging.Although much effort has been made to detect fraud,most existing methods are designed for batch processing,not real-time detection.To solve this problem,we propose an online fraud detection model using a Neural Factorization Autoencoder(NFA),which analyzes customer calling patterns to detect fraudulent calls.The model employs Neural Factorization Machines(NFM)and an Autoencoder(AE)to model calling patterns and a memory module to adapt to changing customer behaviour.We evaluate our approach on a large dataset of real-world call detail records and compare it with several state-of-the-art methods.Our results show that our approach outperforms the baselines,with an AUC of 91.06%,a TPR of 91.89%,an FPR of 14.76%,and an F1-score of 95.45%.These results demonstrate the effectiveness of our approach in detecting fraud in real-time and suggest that it can be a valuable tool for preventing fraud in telecommunications networks.展开更多
Software-Defined Network(SDN)decouples the control plane of network devices from the data plane.While alleviating the problems presented in traditional network architectures,it also brings potential security risks,par...Software-Defined Network(SDN)decouples the control plane of network devices from the data plane.While alleviating the problems presented in traditional network architectures,it also brings potential security risks,particularly network Denial-of-Service(DoS)attacks.While many research efforts have been devoted to identifying new features for DoS attack detection,detection methods are less accurate in detecting DoS attacks against client hosts due to the high stealth of such attacks.To solve this problem,a new method of DoS attack detection based on Deep Factorization Machine(DeepFM)is proposed in SDN.Firstly,we select the Growth Rate of Max Matched Packets(GRMMP)in SDN as detection feature.Then,the DeepFM algorithm is used to extract features from flow rules and classify them into dense and discrete features to detect DoS attacks.After training,the model can be used to infer whether SDN is under DoS attacks,and a DeepFM-based detection method for DoS attacks against client host is implemented.Simulation results show that our method can effectively detect DoS attacks in SDN.Compared with the K-Nearest Neighbor(K-NN),Artificial Neural Network(ANN)models,Support Vector Machine(SVM)and Random Forest models,our proposed method outperforms in accuracy,precision and F1 values.展开更多
In the live broadcast process,eye movement characteristics can reflect people’s attention to the product.However,the existing interest degree predictive model research does not consider the eye movement characteristi...In the live broadcast process,eye movement characteristics can reflect people’s attention to the product.However,the existing interest degree predictive model research does not consider the eye movement characteristics.In order to obtain the users’interest in the product more effectively,we will consider the key eye movement indicators.We first collect eye movement characteristics based on the self-developed data processing algorithm fast discriminative model prediction for tracking(FDIMP),and then we add data dimensions to the original data set through information filling.In addition,we apply the deep factorization machine(DeepFM)architecture to simultaneously learn the combination of low-level and high-level features.In order to effectively learn important features and emphasize relatively important features,the multi-head attention mechanism is applied in the interest model.The experimental results on the public data set Criteo show that,compared with the original DeepFM algorithm,the area under curve(AUC)value was improved by up to 9.32%.展开更多
Short-term GPS data based taxi pick-up area recommendation can improve the efficiency and reduce the overheads.But how to alleviate sparsity and further enhance accuracy is still challenging.Addressing at these issues...Short-term GPS data based taxi pick-up area recommendation can improve the efficiency and reduce the overheads.But how to alleviate sparsity and further enhance accuracy is still challenging.Addressing at these issues,we propose to fuse spatio-temporal contexts into deep factorization machine(STC_DeepFM)offline for pick-up area recommendation,and within the area to recommend pick-up points online using factorization machine(FM).Firstly,we divide the urban area into several grids with equal size.Spatio-temporal contexts are destilled from pick-up points or points-of-interest(POIs)belonged to the preceding grids.Secondly,the contexts are integrated into deep factorization machine(DeepFM)to mine high-order interaction relationships from grids.And a novel algorithm named STC_DeepFM is presented for offline pick-up area recommendation.Thirdly,we devise the architecture of offline-to-online(O2O)recommendation respectively based on DeepFM and FM model in order to tradeoff the accuracy and efficiency.Some experiments are designed on the DiDi dataset to evaluate step by step the performance of spatio-temporal contexts,different recommendation models,and the O2O architecture.The results show that the proposed STC_DeepFM algorithm exceeds several state-of-the-art methods,and the O2O architecture achieves excellent real-time performance.展开更多
Factorization machine (FM) is an effective model for feature-based recommendation that utilizes inner products to capture second-order feature interactions. However, one of the major drawbacks of FM is that it cannot ...Factorization machine (FM) is an effective model for feature-based recommendation that utilizes inner products to capture second-order feature interactions. However, one of the major drawbacks of FM is that it cannot capture complex high-order interaction signals. A common solution is to change the interaction function, such as stacking deep neural networks on the top level of FM. In this work, we propose an alternative approach to model high-order interaction signals at the embedding level, namely generalized embedding machine (GEM). The embedding used in GEM encodes not only the information from the feature itself but also the information from other correlated features. Under such a situation, the embedding becomes high-order. Then we can incorporate GEM with FM and even its advanced variants to perform feature interactions. More specifically, in this paper, we utilize graph convolution networks (GCN) to generate high-order embeddings. We integrate GEM with several FM-based models and conduct extensive experiments on two real-world datasets. The results demonstrate significant improvement of GEM over the corresponding baselines.展开更多
Due to its highly favorable physical and chemical properties,titanium and titanium alloy are widely used in a variety of industries.Because of the low output of a single batch,plate cold rolling without tension is the...Due to its highly favorable physical and chemical properties,titanium and titanium alloy are widely used in a variety of industries.Because of the low output of a single batch,plate cold rolling without tension is the most common rolling production method for titanium alloy.This method is lack of on-line thickness closed-loop control,with carefully thickness setting models for precision.A set of high-precision thickness setting models are proposed to suit the production method.Because of frequent variations in rolling specification,a model structural for the combination of analytical models and statistical models is adopted to replace the traditional self-learning method.The deformation resistance and friction factor,the primary factors which affect model precision,are considered as the objectives of statistical modeling.Firstly,the coefficient fitting of deformation resistance analytical model based on over-determined equations set is adopted.Additionally,a support vector machine(SVM)is applied to the modeling of the deformation resistance and friction factor.The setting models are applied to a 1450 plate-coiling mill for titanium alloy plate rolling,and then thickness precision is found consistently to be within 3%,exceeding the precision of traditional setting models with a self-learning method based on a large number of stable rolling data.Excellent application performance is obtained.The proposed research provides a set of high-precision thickness setting models which are well adapted to the characteristics of titanium alloy plate cold rolling without tension.展开更多
Click-through-rate(CTR)prediction is a crucial task in recommendation systems.The accuracy of CTR prediction is strongly influenced by the precise extraction of essential data and the modeling strategy chosen.The data...Click-through-rate(CTR)prediction is a crucial task in recommendation systems.The accuracy of CTR prediction is strongly influenced by the precise extraction of essential data and the modeling strategy chosen.The data of the CTR task are often very sparse,and Factorization Machines(FMs)are a class of general predictors working effectively with it.However,the performance of FMs can be limited by the fixed feature representation and the same weight of different features.In this work,we propose an improved Bitwise Feature Importance Factorization Machine(BFIFM)to improve the accuracy.The necessity of learning the degree of effect of the same feature under various situations is learned through the low-order intersection method,and the deep neural network(DNN)in our model is used in parallel to study high-order intersections.According to the final results obtained,the BFIFM model significantly outperforms other state-of-the-art models.展开更多
Performing arts and movies have become commercial products with high profit and great market potential. Previous research works have developed comprehensive models to forecast the demand for movies. However,they did n...Performing arts and movies have become commercial products with high profit and great market potential. Previous research works have developed comprehensive models to forecast the demand for movies. However,they did not pay enough attention to the decision support for performing arts which is a special category unlike movies. For performing arts with high-dimensional categorical attributes and limit samples, determining ticket prices in different levels is still a challenge job faced by the producers and distributors. In terms of these difficulties, factorization machine(FM), which can handle huge sparse categorical attributes, is used in this work first. Adaptive stochastic gradient descent(ASGD) and Markov chain Monte Carlo(MCMC) are both explored to estimate the model parameters of FM. FM with ASGD(FM-ASGD) and FM with MCMC(FM-MCMC) both can achieve a better prediction accuracy, compared with a traditional algorithm. In addition, the multi-output model is proposed to determine the price in multiple price levels simultaneously, which avoids the trouble of the models' repeating training. The results also confirm the prediction accuracy of the multi-output model, compared with those from the general single-output model.展开更多
There is compelling evidence that synergistic drug combinations have become promising strategies for combating complex diseases,and they have evident predominance comparing to traditional one drug-one disease approach...There is compelling evidence that synergistic drug combinations have become promising strategies for combating complex diseases,and they have evident predominance comparing to traditional one drug-one disease approaches.In this paper,we develop a computational method,namely SyFFM,that takes pharmacological data into consideration and applies field-aware factorization machines to analyze and predict potential synergistic drug combinations.Firstly,features of drug pairs are constructed based on associations between drugs and target,and enzymes,and indication areas.Then,the synergistic scores of drug combinations are obtained by implementing field-aware factorization machines on latent vector space of these features.Finally,synergistic combinations can be predicted by introducing a threshold.We applied SyFFM to predict pairwise synergistic combinations and three-drug synergistic combinations,and the performance is good in terms of cross-validation.Besides,more than 90%combinations of the top ranked predictions are proved by literature and the analysis of parameters in model shows that our method can help to investigate and explain synergistic mechanisms underlying combinatorial therapy.展开更多
基金This work was supported by the Planning Subject for the 13th Five-Year Plan of Hunan Provincial Educational Sciences under Grant XJK17BXX006,author D.Y,http://ghkt.hntky.com/.
文摘There is no reasonable scientific basis for selecting the excellent teachers of the school’s courses.To solve the practical problem,we firstly give a series of normalization models for defining the key attributes of teachers’professional foundation,course difficulty coefficient,and comprehensive evaluation of teaching.Then,we define a partial weight function to calculate the key attributes,and obtain the partial recommendation values.Next,we construct a highly sparse Teaching Recommendation Factorization Machines(TRFMs)model,which takes the 5-tuples relation including teacher,course,teachers’professional foundation,course difficulty,teaching evaluation as the feature vector,and take partial recommendation value as the recommendation label.Finally,we design a novel Top-N excellent teacher recommendation algorithm based on TRFMs by course classification on the highly sparse dataset.Experimental results show that the proposed TRFMs and recommendation algorithm can accurately realize the recommendation of excellent teachers on a highly sparse historical teaching dataset.The recommendation accuracy is superior to that of the three-dimensional tensor decomposition model algorithm which also solves sparse datasets.The proposed method can be used as a new recommendation method applied to the teaching arrangements in all kinds of schools,which can effectively improve the teaching quality.
基金This research work has been conducted in cooperation with members of DETSI project supported by BPI France and Pays de Loire and Auvergne Rhone Alpes.
文摘The proliferation of internet communication channels has increased telecom fraud,causing billions of euros in losses for customers and the industry each year.Fraudsters constantly find new ways to engage in illegal activity on the network.To reduce these losses,a new fraud detection approach is required.Telecom fraud detection involves identifying a small number of fraudulent calls from a vast amount of call traffic.Developing an effective strategy to combat fraud has become challenging.Although much effort has been made to detect fraud,most existing methods are designed for batch processing,not real-time detection.To solve this problem,we propose an online fraud detection model using a Neural Factorization Autoencoder(NFA),which analyzes customer calling patterns to detect fraudulent calls.The model employs Neural Factorization Machines(NFM)and an Autoencoder(AE)to model calling patterns and a memory module to adapt to changing customer behaviour.We evaluate our approach on a large dataset of real-world call detail records and compare it with several state-of-the-art methods.Our results show that our approach outperforms the baselines,with an AUC of 91.06%,a TPR of 91.89%,an FPR of 14.76%,and an F1-score of 95.45%.These results demonstrate the effectiveness of our approach in detecting fraud in real-time and suggest that it can be a valuable tool for preventing fraud in telecommunications networks.
基金This work was funded by the Researchers Supporting Project No.(RSP-2021/102)King Saud University,Riyadh,Saudi ArabiaThis work was supported by the Research Project on Teaching Reform of General Colleges and Universities in Hunan Province(Grant No.HNJG-2020-0261),China.
文摘Software-Defined Network(SDN)decouples the control plane of network devices from the data plane.While alleviating the problems presented in traditional network architectures,it also brings potential security risks,particularly network Denial-of-Service(DoS)attacks.While many research efforts have been devoted to identifying new features for DoS attack detection,detection methods are less accurate in detecting DoS attacks against client hosts due to the high stealth of such attacks.To solve this problem,a new method of DoS attack detection based on Deep Factorization Machine(DeepFM)is proposed in SDN.Firstly,we select the Growth Rate of Max Matched Packets(GRMMP)in SDN as detection feature.Then,the DeepFM algorithm is used to extract features from flow rules and classify them into dense and discrete features to detect DoS attacks.After training,the model can be used to infer whether SDN is under DoS attacks,and a DeepFM-based detection method for DoS attacks against client host is implemented.Simulation results show that our method can effectively detect DoS attacks in SDN.Compared with the K-Nearest Neighbor(K-NN),Artificial Neural Network(ANN)models,Support Vector Machine(SVM)and Random Forest models,our proposed method outperforms in accuracy,precision and F1 values.
文摘In the live broadcast process,eye movement characteristics can reflect people’s attention to the product.However,the existing interest degree predictive model research does not consider the eye movement characteristics.In order to obtain the users’interest in the product more effectively,we will consider the key eye movement indicators.We first collect eye movement characteristics based on the self-developed data processing algorithm fast discriminative model prediction for tracking(FDIMP),and then we add data dimensions to the original data set through information filling.In addition,we apply the deep factorization machine(DeepFM)architecture to simultaneously learn the combination of low-level and high-level features.In order to effectively learn important features and emphasize relatively important features,the multi-head attention mechanism is applied in the interest model.The experimental results on the public data set Criteo show that,compared with the original DeepFM algorithm,the area under curve(AUC)value was improved by up to 9.32%.
基金supported by the National Natural Science Foundation of China(41871320,61873316)the Key Project of Hunan Provincial Education Department(19A172)+1 种基金the Scientific Research Fund of Hunan Provincial Education Department(18K060)the Postgraduate Scientific Research Innovation Project of Hunan Province(CX20211000).
文摘Short-term GPS data based taxi pick-up area recommendation can improve the efficiency and reduce the overheads.But how to alleviate sparsity and further enhance accuracy is still challenging.Addressing at these issues,we propose to fuse spatio-temporal contexts into deep factorization machine(STC_DeepFM)offline for pick-up area recommendation,and within the area to recommend pick-up points online using factorization machine(FM).Firstly,we divide the urban area into several grids with equal size.Spatio-temporal contexts are destilled from pick-up points or points-of-interest(POIs)belonged to the preceding grids.Secondly,the contexts are integrated into deep factorization machine(DeepFM)to mine high-order interaction relationships from grids.And a novel algorithm named STC_DeepFM is presented for offline pick-up area recommendation.Thirdly,we devise the architecture of offline-to-online(O2O)recommendation respectively based on DeepFM and FM model in order to tradeoff the accuracy and efficiency.Some experiments are designed on the DiDi dataset to evaluate step by step the performance of spatio-temporal contexts,different recommendation models,and the O2O architecture.The results show that the proposed STC_DeepFM algorithm exceeds several state-of-the-art methods,and the O2O architecture achieves excellent real-time performance.
基金supported by National Natural Science Foundation of China(Nos.62032013 and 61972078)the Fundamental Research Funds for the Central Universities,China(No.N2217004).
文摘Factorization machine (FM) is an effective model for feature-based recommendation that utilizes inner products to capture second-order feature interactions. However, one of the major drawbacks of FM is that it cannot capture complex high-order interaction signals. A common solution is to change the interaction function, such as stacking deep neural networks on the top level of FM. In this work, we propose an alternative approach to model high-order interaction signals at the embedding level, namely generalized embedding machine (GEM). The embedding used in GEM encodes not only the information from the feature itself but also the information from other correlated features. Under such a situation, the embedding becomes high-order. Then we can incorporate GEM with FM and even its advanced variants to perform feature interactions. More specifically, in this paper, we utilize graph convolution networks (GCN) to generate high-order embeddings. We integrate GEM with several FM-based models and conduct extensive experiments on two real-world datasets. The results demonstrate significant improvement of GEM over the corresponding baselines.
基金Supported by National Natural Science Foundation of China(Grant No.51304017)National Key Technology R&D Program of the 12th Five-year Plan of China(Grant Nos.2012BAF04B02,2011BAE23B04)Fundamental Research Funds for Central Universities,China(Grant No.FRF-SD-12-013B)
文摘Due to its highly favorable physical and chemical properties,titanium and titanium alloy are widely used in a variety of industries.Because of the low output of a single batch,plate cold rolling without tension is the most common rolling production method for titanium alloy.This method is lack of on-line thickness closed-loop control,with carefully thickness setting models for precision.A set of high-precision thickness setting models are proposed to suit the production method.Because of frequent variations in rolling specification,a model structural for the combination of analytical models and statistical models is adopted to replace the traditional self-learning method.The deformation resistance and friction factor,the primary factors which affect model precision,are considered as the objectives of statistical modeling.Firstly,the coefficient fitting of deformation resistance analytical model based on over-determined equations set is adopted.Additionally,a support vector machine(SVM)is applied to the modeling of the deformation resistance and friction factor.The setting models are applied to a 1450 plate-coiling mill for titanium alloy plate rolling,and then thickness precision is found consistently to be within 3%,exceeding the precision of traditional setting models with a self-learning method based on a large number of stable rolling data.Excellent application performance is obtained.The proposed research provides a set of high-precision thickness setting models which are well adapted to the characteristics of titanium alloy plate cold rolling without tension.
基金supported by Hainan Province Science and Technology Special Fund,which is Research and Application of Intelligent Recommendation Technology Based on Knowledge Graph and User Portrait (No.ZDYF2020039)。
文摘Click-through-rate(CTR)prediction is a crucial task in recommendation systems.The accuracy of CTR prediction is strongly influenced by the precise extraction of essential data and the modeling strategy chosen.The data of the CTR task are often very sparse,and Factorization Machines(FMs)are a class of general predictors working effectively with it.However,the performance of FMs can be limited by the fixed feature representation and the same weight of different features.In this work,we propose an improved Bitwise Feature Importance Factorization Machine(BFIFM)to improve the accuracy.The necessity of learning the degree of effect of the same feature under various situations is learned through the low-order intersection method,and the deep neural network(DNN)in our model is used in parallel to study high-order intersections.According to the final results obtained,the BFIFM model significantly outperforms other state-of-the-art models.
基金the Fund of the Science and Technology Commission of Shanghai Municipality(No.13511506402)
文摘Performing arts and movies have become commercial products with high profit and great market potential. Previous research works have developed comprehensive models to forecast the demand for movies. However,they did not pay enough attention to the decision support for performing arts which is a special category unlike movies. For performing arts with high-dimensional categorical attributes and limit samples, determining ticket prices in different levels is still a challenge job faced by the producers and distributors. In terms of these difficulties, factorization machine(FM), which can handle huge sparse categorical attributes, is used in this work first. Adaptive stochastic gradient descent(ASGD) and Markov chain Monte Carlo(MCMC) are both explored to estimate the model parameters of FM. FM with ASGD(FM-ASGD) and FM with MCMC(FM-MCMC) both can achieve a better prediction accuracy, compared with a traditional algorithm. In addition, the multi-output model is proposed to determine the price in multiple price levels simultaneously, which avoids the trouble of the models' repeating training. The results also confirm the prediction accuracy of the multi-output model, compared with those from the general single-output model.
基金National Natural Science Foundation of China under Grant No.11631014.
文摘There is compelling evidence that synergistic drug combinations have become promising strategies for combating complex diseases,and they have evident predominance comparing to traditional one drug-one disease approaches.In this paper,we develop a computational method,namely SyFFM,that takes pharmacological data into consideration and applies field-aware factorization machines to analyze and predict potential synergistic drug combinations.Firstly,features of drug pairs are constructed based on associations between drugs and target,and enzymes,and indication areas.Then,the synergistic scores of drug combinations are obtained by implementing field-aware factorization machines on latent vector space of these features.Finally,synergistic combinations can be predicted by introducing a threshold.We applied SyFFM to predict pairwise synergistic combinations and three-drug synergistic combinations,and the performance is good in terms of cross-validation.Besides,more than 90%combinations of the top ranked predictions are proved by literature and the analysis of parameters in model shows that our method can help to investigate and explain synergistic mechanisms underlying combinatorial therapy.