financial services:for example,GPS and Bluetooth inspire location-based services,and search and web technologies motivate online shopping,reviews,and payments.These business services have become more connected than ev...financial services:for example,GPS and Bluetooth inspire location-based services,and search and web technologies motivate online shopping,reviews,and payments.These business services have become more connected than ever,and as a result,financial frauds have become a significant challenge.Therefore,combating financial risks in the big data era requires breaking the borders of traditional data,algorithms,and systems.An increasing number of studies have addressed these challenges and proposed new methods for risk detection,assessment,and forecasting.As a key contribution,we categorize these works in a rational framework:first,we identify the data that can be used to identify risks.We then discuss how big data can be combined with the emerging tools to effectively learn or analyze financial risk.Finally,we highlight the effectiveness of these methods in real-world applications.Furthermore,we stress on the importance of utilizing multi-channel information,graphs,and networks of long-range dependence for the effective identification of financial risks.We conclude our survey with a discussion on the new challenges faced by the financial sector,namely,deep fake technology,adversaries,causal and interpretable inference,privacy protection,and microsimulations.展开更多
The Chinese Software Developer Network(CSDN)is one of the largest information technology communities and service platforms in China.This paper describes the user profiling for CSDN,an evaluation track of SMP Cup 2017....The Chinese Software Developer Network(CSDN)is one of the largest information technology communities and service platforms in China.This paper describes the user profiling for CSDN,an evaluation track of SMP Cup 2017.It contains three tasks:(1)user document keyphrase extraction,(2)user tagging and(3)user growth value prediction.In the first task,we treat keyphrase extraction as a classification problem and train a Gradient-Boosting-Decision-Tree model with comprehensive features.In the second task,to deal with class imbalance and capture the interdependency between classes,we propose a two-stage framework:(1)for each class,we train a binary classifier to model each class against all of the other classes independently;(2)we feed the output of the trained classifiers into a softmax classifier,tagging each user with multiple labels.In the third task,we propose a comprehensive architecture to predict user growth value.Our contributions in this paper are summarized as follows:(1)we extract various types of features to identify the key factors in user value growth;(2)we use the semi-supervised method and the stacking technique to extend labeled data sets and increase the generality of the trained model,resulting in an impressive performance in our experiments.In the competition,we achieved the first place out of 329 teams.展开更多
基金supported by the National Natural Science Foundation of China under Grant Nos.91746301,61772498,61802370,and 61902380.
文摘financial services:for example,GPS and Bluetooth inspire location-based services,and search and web technologies motivate online shopping,reviews,and payments.These business services have become more connected than ever,and as a result,financial frauds have become a significant challenge.Therefore,combating financial risks in the big data era requires breaking the borders of traditional data,algorithms,and systems.An increasing number of studies have addressed these challenges and proposed new methods for risk detection,assessment,and forecasting.As a key contribution,we categorize these works in a rational framework:first,we identify the data that can be used to identify risks.We then discuss how big data can be combined with the emerging tools to effectively learn or analyze financial risk.Finally,we highlight the effectiveness of these methods in real-world applications.Furthermore,we stress on the importance of utilizing multi-channel information,graphs,and networks of long-range dependence for the effective identification of financial risks.We conclude our survey with a discussion on the new challenges faced by the financial sector,namely,deep fake technology,adversaries,causal and interpretable inference,privacy protection,and microsimulations.
基金The work is supported by the National Natural Science Foundation of China(NSFC)under grant numbers 61472400,91746301 and 61802371H.Shen is also funded by K.C.Wong Education Foundation and the Youth Innovation Promotion Association of the Chinese Academy of Sciences.
文摘The Chinese Software Developer Network(CSDN)is one of the largest information technology communities and service platforms in China.This paper describes the user profiling for CSDN,an evaluation track of SMP Cup 2017.It contains three tasks:(1)user document keyphrase extraction,(2)user tagging and(3)user growth value prediction.In the first task,we treat keyphrase extraction as a classification problem and train a Gradient-Boosting-Decision-Tree model with comprehensive features.In the second task,to deal with class imbalance and capture the interdependency between classes,we propose a two-stage framework:(1)for each class,we train a binary classifier to model each class against all of the other classes independently;(2)we feed the output of the trained classifiers into a softmax classifier,tagging each user with multiple labels.In the third task,we propose a comprehensive architecture to predict user growth value.Our contributions in this paper are summarized as follows:(1)we extract various types of features to identify the key factors in user value growth;(2)we use the semi-supervised method and the stacking technique to extend labeled data sets and increase the generality of the trained model,resulting in an impressive performance in our experiments.In the competition,we achieved the first place out of 329 teams.