23 articles found
MMLUP: Multi-Source & Multi-Task Learning for User Profiles in Social Network (cited by: 1)
1
Authors: Dongjie Zhu, Yuhua Wang, Chuiju You, Jinming Qiu, Ning Cao, Chenjing Gong, Guohua Yang, Helen Min Zhou. Source: Computers, Materials & Continua (SCIE, EI), 2019, No. 9, pp. 1105-1115 (11 pages)
With the rapid development of the mobile Internet, users generate massive data in different forms in social networks every day, and these social media data reflect different characteristics of users. Integrating multiple heterogeneous information sources and establishing user profiles from multiple perspectives plays an important role in providing personalized services, marketing, and recommendation systems. In this paper, we propose Multi-source & Multi-task Learning for User Profiles in Social Network, which integrates multiple social data sources and contains a multi-task learning framework to simultaneously predict various attributes of a user. Firstly, we design a dedicated feature extraction model for each of the heterogeneous data sources. Secondly, we design a shared layer to fuse the heterogeneous data sources into a general shared representation for multi-task learning. Thirdly, we design each task's own unique presentation layer for task-specific discriminant output. Finally, we design a weighted loss function to improve the learning efficiency and prediction accuracy of each task. Our experimental results on more than 5000 Sina Weibo users demonstrate that our approach outperforms state-of-the-art baselines for inferring gender, age and region of social media users.
Keywords: user profiles; multi-source; multi-task learning; social network
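The abstract above mentions a weighted loss function across tasks but does not give its form. As a minimal sketch, assuming the simplest possible scheme (a fixed weighted sum of per-task losses), the idea can be illustrated as follows. The function name, the task names, and the weight values are all hypothetical and are not taken from the paper.

```python
def weighted_multitask_loss(task_losses, task_weights):
    """Combine per-task losses into one scalar.

    Assumption: a plain weighted sum with fixed weights. The paper's
    actual weighting scheme is not stated in the abstract.
    """
    if task_losses.keys() != task_weights.keys():
        raise ValueError("each task needs exactly one weight")
    return sum(task_weights[task] * task_losses[task] for task in task_losses)


# Hypothetical per-task losses for the three attributes named in the abstract.
losses = {"gender": 0.5, "age": 1.0, "region": 2.0}
weights = {"gender": 1.0, "age": 0.5, "region": 0.25}
total = weighted_multitask_loss(losses, weights)
```

With these illustrative numbers each task contributes 0.5 to the total, which shows how weights can keep a task with a larger raw loss from dominating training.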
A Machine Learning Approach to User Profiling for Data Annotation of Online Behavior
2
Authors: Moona Kanwal, Najeed A. Khan, Aftab A. Khan. Source: Computers, Materials & Continua (SCIE, EI), 2024, No. 2, pp. 2419-2440 (22 pages)
The user's intent to seek online information has been an active area of research in user profiling. User profiling considers user characteristics, behaviors, activities, and preferences to sketch user intentions, interests, and motivations. Determining user characteristics can help capture implicit and explicit preferences and intentions for effective user-centric and customized content presentation. The user's complete online experience in seeking information is a blend of activities such as searching, verifying, and sharing it on social platforms. However, a combination of multiple behaviors in profiling users has yet to be considered. This research takes a novel approach and explores user intent types based on multidimensional online behavior in information acquisition. It explores information search, verification, and dissemination behavior and identifies diverse types of users based on their online engagement using machine learning. The research proposes a generic user profile template that explains the user characteristics based on the internet experience and uses it as ground truth for data annotation. User feedback is based on online behavior and practices collected by using a survey method. The participants include both males and females from different occupation sectors and different ages. The data collected is subject to feature engineering, and the significant features are presented to unsupervised machine learning methods to identify user intent classes or profiles and their characteristics. Different techniques are evaluated, and the K-Means clustering method successfully generates five user groups observing different user characteristics, with an average silhouette of 0.36 and a distortion score of 1136. Feature averages are computed to identify user intent type characteristics. The user intent classes are then further generalized to create a user intent template with an Inter-Rater Reliability of 75%. This research successfully extracts different user types based on their preferences in online content, platforms, criteria, and frequency. The study also validates the proposed template on user feedback data through an Inter-Rater Agreement process using an external human rater.
Keywords: user intent; cluster; user profile; online search; information sharing; user behavior; search reasons
User Profile & Attitude Analysis Based on Unstructured Social Media and Online Activity
3
Authors: Yuting Tan, Vijay K. Madisetti. Source: Journal of Software Engineering and Applications, 2024, No. 6, pp. 463-473 (11 pages)
As social media and online activity continue to pervade all age groups, they serve as a crucial platform for sharing personal experiences and opinions as well as information about attitudes and preferences for certain interests or purchases. This generates a wealth of behavioral data, which, while invaluable to businesses, researchers, policymakers, and the cybersecurity sector, presents significant challenges due to its unstructured nature. Existing tools for analyzing this data often lack the capability to effectively retrieve and process it comprehensively. This paper addresses the need for an advanced analytical tool that ethically and legally collects and analyzes social media data and online activity logs, constructing detailed and structured user profiles. It reviews current solutions, highlights their limitations, and introduces a new approach, the Advanced Social Analyzer (ASAN), that bridges these gaps. The proposed solution's technical aspects, implementation, and evaluation are discussed, with results compared to existing methodologies. The paper concludes by suggesting future research directions to further enhance the utility and effectiveness of social media data analysis.
Keywords: social media; user behavior analysis; sentiment analysis; data mining; machine learning; user profiling; cybersecurity; behavioral insights; personality prediction
Sensory Evaluation of Freeze-Dried Facial Mask and Analysis of User Profile
4
Authors: Qi Rong, Chu LiLing, Wang Feifei. Source: China Detergent & Cosmetics (CAS), 2024, No. 3, pp. 41-49 (9 pages)
To thoroughly understand the market opportunity for freeze-dried facial masks and gain insight into consumers' usage behavior and needs, the sensory properties of 10 screened commercial freeze-dried facial mask products were evaluated. The test products were grouped according to differences in their sensory attributes using Principal Component Analysis (PCA) and Agglomerative Hierarchical Clustering (AHC), and representative products were selected. Freeze-dried facial mask users then rated their satisfaction with the selected products and took part in a survey of usage behavior and cognition. The consumer data were analyzed with AHC to obtain consumer segments and their profiles. The test results show that the sensory data and the consumer data, the latter collected from consumer tests of the representative products selected by performing PCA and AHC on the sensory data, corroborate each other. Combining sensory data with consumer tests helps to understand the needs of consumer segments and their reasons to buy.
Keywords: freeze-dried facial mask; sensory evaluation; consumer satisfaction evaluation; PCA; AHC; user profile
Discovering User Profiles for Web Personalized Recommendation (cited by: 2)
5
Authors: Ai-Bo Song, Mao-Xian Zhao, Zuo-Peng Liang, Yi-Sheng Dong, Jun-Zhou Luo. Source: Journal of Computer Science & Technology (SCIE, EI, CSCD), 2004, No. 3, pp. 320-328 (9 pages)
With the growing popularity of the World Wide Web, a large volume of user access data has been gathered automatically by Web servers and stored in Web logs. Discovering and understanding user behavior patterns from log files can provide Web personalized recommendation services. In this paper, a novel clustering method for log files is presented, called Clustering large Weblog based on Key Path Model (CWKPM), which is based on the user browsing key path model, to get user behavior profiles. Compared with the previous Boolean model, the key path model considers the major features of users' access to the Web: ordinal, contiguous and duplicate. Moreover, for clustering, it has fewer dimensions. The analysis and experiments show that CWKPM is an efficient and effective approach for clustering large and high-dimension Web logs.
Keywords: web log; user profile; personalization; generalized suffix tree; clustering
User Profile in Smart Elderly Care Community:Findings from Community in Western China
6
Authors: Yan Wei, Xiaowei Liu, Ruilin Hou. Source: Journal of Beijing Institute of Technology (EI, CAS), 2023, No. 2, pp. 156-167 (12 pages)
With the increase in the aging population, the need for elderly care services has diversified, and smart elderly care has become an effective measure to cope with this increasing aging population. Based on the data from the platform "Guan Hu Tong" of RQ Company in a community of Shaanxi Province in western China, this study mined the data of smart elderly care services through the recency, frequency and monetary value (RFM) model and the backpropagation (BP) neural network model, constructed the user profile of the elderly, and predicted users' practical demands. The following conclusions were drawn: the oldest users are important target users of smart elderly care service platforms; elderly women living alone rely more on smart elderly care services; meal delivery and health follow-up services are the most popular among elderly users.
Keywords: smart elderly care; user profile; backpropagation (BP) neural network
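The RFM model named in the abstract above summarizes each user by recency, frequency and monetary value. A minimal sketch of computing these three features from an order history is shown below. The record layout `(order_date, amount)` and the function name are assumptions for illustration, and the study's actual preprocessing and its BP neural network stage are not reproduced here.

```python
from datetime import date


def rfm_features(orders, today):
    """Return (recency_days, frequency, monetary) for one user.

    orders: list of (order_date, amount) tuples. This layout is hypothetical.
    recency: days since the most recent order
    frequency: number of orders
    monetary: total amount spent
    """
    if not orders:
        raise ValueError("need at least one order")
    last_order = max(order_date for order_date, _ in orders)
    recency = (today - last_order).days
    frequency = len(orders)
    monetary = sum(amount for _, amount in orders)
    return recency, frequency, monetary


# Illustrative data only, not taken from the study.
orders = [
    (date(2023, 1, 5), 20.0),
    (date(2023, 1, 20), 35.0),
    (date(2023, 2, 1), 15.0),
]
rfm = rfm_features(orders, today=date(2023, 2, 11))
```

In a pipeline like the one the abstract describes, a feature triple of this kind would typically be computed per user and then passed to a downstream model.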
User-oriented web search based on PLSA
7
Authors: Yu Fang (于芳), Chen Dongling (陈冬玲), Wang Daling (王大玲), Yu Ge (于戈), Bao Yubin (鲍玉斌). Source: Journal of Southeast University (English Edition) (EI, CAS), 2007, No. 3, pp. 347-351 (5 pages)
Current search engines provide query-oriented searches rather than user-oriented ones, and this improper orientation leaves them unable to meet the personalized requirements of users. To solve this problem, a novel method based on probabilistic latent semantic analysis (PLSA) is proposed to convert query-oriented web search to user-oriented web search. First, a user profile represented as a vector of the user's topics of interest is created by analyzing the user's click-through data based on PLSA; then the user's queries are mapped into categories based on the user's preferences; finally, the result list is re-ranked according to the user's interests using the newly proposed method named user-oriented PageRank (UOPR). Experiments on real-life datasets show that the user-oriented search system that adopts PLSA takes user preferences into account and better satisfies a user's personalized information needs.
Keywords: user-oriented search; underlying search intention; probabilistic latent semantic analysis (PLSA); user profile; topics of interest
User Profile System Based on Sentiment Analysis for Mobile Edge Computing (cited by: 1)
8
Authors: Sang-Min Park, Young-Gab Kim. Source: Computers, Materials & Continua (SCIE, EI), 2020, No. 2, pp. 569-590 (22 pages)
Emotions of users do not converge in a single application but are scattered across diverse applications. Mobile devices are the closest media for handling user data, and these devices have the advantage of integrating private user information and emotions spread over different applications. In this paper, we first analyze the user profile on a mobile device by describing the problem of the user sentiment profile system in terms of data granularity, media diversity, and server-side solutions. Fine-grained data requires additional data and structural analysis on mobile devices. Media diversity requires standard parameters to integrate user data from various applications. A server-side solution presents a potential risk when handling individual privacy information. Therefore, in order to overcome these problems, we propose a general-purpose user profile system based on sentiment analysis that extracts individual emotional preferences by comparing the difference between public and individual data based on particular features. The proposed system is built on a sentiment hierarchy, which is created by using unstructured data on mobile devices. It can compensate for the concentration on a single medium, and analyze individual private data on mobile devices without invading privacy.
Keywords: user profile; sentiment analysis; mobile edge computing; social network
PUMTD:Privacy-Preserving User-Profile Matching Protocol in Social Networks
9
Authors: Jianhong Zhang, Haoting Han, Hongwei Su, Zhengtao Jiang, Changgen Peng. Source: China Communications (SCIE, CSCD), 2022, No. 6, pp. 77-90 (14 pages)
User profile matching can establish social relationships between different users in a social network. If the user profile is matched in plaintext, the user's privacy might face a security challenge. Although some existing schemes realize privacy-preserving user profile matching, the resource-limited users or social service providers in these schemes must bear higher computational complexity to ensure the privacy or matching of the data. To overcome these problems, a novel privacy-preserving user profile matching protocol in social networks is proposed using t-out-of-n servers and the Bloom filter technique, in which the computational complexity of a user is reduced by applying the Chinese Remainder Theorem, the matching users can be found with the help of any t matching servers, and the privacy of the user profile is not compromised. Furthermore, if at most t-1 servers are allowed to collude, our scheme can still fulfill user profile privacy and user query privacy. Finally, the performance of the proposed scheme is compared with two other schemes, and the results show that our scheme is superior to them.
Keywords: user profile matching; Chinese remainder theorem; privacy-preserving; query privacy
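The abstract above relies on the Chinese Remainder Theorem (CRT) but does not say how the protocol applies it. The sketch below therefore illustrates only the theorem itself: recovering one integer from its residues modulo pairwise coprime moduli. The function name is hypothetical, and nothing here reproduces the PUMTD protocol.

```python
from math import gcd, prod


def crt(residues, moduli):
    """Solve x = r_i (mod m_i) for pairwise coprime moduli.

    Returns the unique solution x with 0 <= x < prod(moduli).
    Requires Python 3.8+ (math.prod, modular inverse via pow).
    """
    if len(residues) != len(moduli):
        raise ValueError("residues and moduli must have the same length")
    for i in range(len(moduli)):
        for j in range(i + 1, len(moduli)):
            if gcd(moduli[i], moduli[j]) != 1:
                raise ValueError("moduli must be pairwise coprime")
    total_modulus = prod(moduli)
    x = 0
    for r, m in zip(residues, moduli):
        partial = total_modulus // m           # product of all other moduli
        x += r * partial * pow(partial, -1, m)  # pow(..., -1, m) is the inverse mod m
    return x % total_modulus


# Classic textbook instance: x = 2 (mod 3), x = 3 (mod 5), x = 2 (mod 7).
solution = crt([2, 3, 2], [3, 5, 7])
```

One general reason CRT can reduce computation is that several small modular values are packed into, or recovered from, a single integer. Whether PUMTD uses it this way is not stated in the abstract.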
Ranking of Web Pages in a Personalized Search
10
Authors: Mahmoud Abou Ghaly. Source: Journal of Computer and Communications, 2023, No. 2, pp. 89-101 (13 pages)
The basic idea behind a personalized web search is to deliver search results that are tailored to meet user needs, which is one of the growing concepts in web technologies. The personalized web search presented in this paper is based on exploiting implicit feedback on user satisfaction during her web browsing history to construct a user profile storing the web pages the user is highly interested in. A weight is assigned to each page stored in the user's profile; this weight reflects the user's interest in this page. We name this weight the relative rank of the page, since it depends on the user issuing the query. The ranking algorithm provided in this paper is therefore based on the principle that the rank assigned to a page is the sum of two rank values, R_rank and A_rank. A_rank is an absolute rank: it is fixed for all users issuing the same query and depends only on the link structure of the web and on the keywords of the query. Thus, it could be calculated by the PageRank algorithm suggested by Brin and Page in 1998 and used by the Google search engine. R_rank is the relative rank; it is calculated by the methods given in this paper, which depend mainly on recording implicit measures of user satisfaction during her previous browsing history.
Keywords: implicit feedback; personalized search; web page ranking; user profile
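The ranking principle stated in the abstract above (final rank = R_rank + A_rank) can be sketched in a few lines. The function name, the page identifiers, and the score values are illustrative. Treating pages absent from the user's profile as having R_rank 0 is an assumption, since the abstract does not say how such pages are handled.

```python
def rerank(a_rank, r_rank):
    """Order pages by A_rank + R_rank, highest first.

    a_rank: page -> absolute rank (same for all users issuing the query)
    r_rank: page -> relative rank from this user's profile
    Assumption: a page missing from the profile gets R_rank = 0.
    """
    combined = {page: score + r_rank.get(page, 0.0) for page, score in a_rank.items()}
    return sorted(combined, key=lambda page: combined[page], reverse=True)


# Illustrative scores only.
a_rank = {"p1": 0.5, "p2": 0.25, "p3": 0.125}
r_rank = {"p2": 0.5}  # this user has shown strong interest in p2
order = rerank(a_rank, r_rank)
```

Here the user's interest lifts `p2` above `p1` even though `p1` has the higher absolute rank, which is the personalization effect the abstract describes.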
Integrating Machine Learning and Evidential Reasoning for User Profiling and Recommendation
11
Authors: Toan Nguyen Mau, Quang-Hung Le, Duc-Vinh Vo, Duy Doan, Van-Nam Huynh. Source: Journal of Systems Science and Systems Engineering (SCIE, EI, CSCD), 2023, No. 4, pp. 393-412 (20 pages)
User profiles representing users' preferences and interests play an important role in many applications of personalized recommendation. With the rapid growth of social platforms, there is a critical need for efficient solutions to learn user profiles from the information users share on social platforms so as to improve the quality of recommendation services. The problem of user profile learning is significantly challenging due to the difficulty of handling data from multiple sources, in different formats and often associated with uncertainty. In this paper, we introduce an integrated approach that combines advanced machine learning techniques with evidential reasoning based on the Dempster-Shafer theory of evidence for user profiling and recommendation. The developed methods for user profile learning and multi-criteria collaborative filtering are demonstrated with experimental results and analysis that show the effectiveness and practicality of the integrated approach. A proposal for extending multi-criteria recommendation systems by incorporating user profiles learned from different sources of data into the recommendation process, so as to provide better recommendation capabilities, is also highlighted.
Keywords: machine learning; Dempster-Shafer theory of evidence; user profiles; personalized recommendation; preferences
Research on a Real-Time User Profile System Based on Stream Computing (cited by: 9)
12
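Authors: Jiang Hongyu (姜红玉), Wang Peng (汪朋), Feng Lei (封雷). Source: Computer Technology and Development (《计算机技术与发展》), 2020, No. 7, pp. 186-193 (8 pages)
In a big-data environment with massive data, this work addresses the accuracy and timeliness of user profiles by studying real-time user profile systems and proposing a real-time user profile system architecture built on stream computing. It reviews and analyzes the overall structure of user profiling, uses the message-queue middleware Kafka to collect user data of different dimensions in real time, and applies big-data analysis and machine learning to build a relatively accurate, multi-dimensional tag system and model for user profiles. The Flink framework and data mining techniques are applied to process multi-source streaming data in real time, analyze users in depth, and mine their characteristics and needs, thereby producing accurate user profiles and supporting precise personalized information services. The architecture builds comprehensive, high-precision user profiles, and its results are both timely and accurate, so that user needs can be understood quickly and correctly and data can be used to serve users and business development.
Keywords: user profile; stream computing; real-time; Flink; big data; tags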
Joint user profiling with hierarchical attention networks (cited by: 1)
13
Authors: Xiaojian LIU, Yi ZHU, Xindong WU. Source: Frontiers of Computer Science (SCIE, EI, CSCD), 2023, No. 3, pp. 133-143 (11 pages)
User profiling by inferring user personality traits, such as age and gender, plays an increasingly important role in many real-world applications. Most existing methods for user profiling either use only one type of data or ignore handling the noisy information in the data. Moreover, they usually consider this problem from only one perspective. In this paper, we propose a joint user profiling model with hierarchical attention networks (JUHA) to learn informative user representations for user profiling. Our JUHA method does user profiling based on both inner-user and inter-user features. We explore inner-user features from user behaviors (e.g., purchased items and posted blogs), and inter-user features from a user-user graph (where similar users could be connected to each other). JUHA learns basic sentence and bag representations from multiple separate sources of data (user behaviors) as the first round of data preparation. In this module, convolutional neural networks (CNNs) are introduced to capture word and sentence features of age and gender, while the self-attention mechanism is exploited to weaken the noisy data. Following this, we build another bag which contains a user-user graph. Inter-user features are learned from this bag using propagation information between linked users in the graph. To acquire more robust data, inter-user features and other inner-user bag representations are joined into each sentence in the current bag to learn the final bag representation. Subsequently, all of the bag representations are integrated by the self-attention mechanism to learn a comprehensive user representation. Our experimental results demonstrate that our approach outperforms several state-of-the-art methods and improves prediction performance.
Keywords: user profiling; hierarchical attention; joint learning; inner-user feature; inter-user feature
Masquerade Detection Using Support Vector Machine
14
Authors: YANG Min, WANG Li-na, ZHANG Huan-guo, CHEN Wei. Source: Wuhan University Journal of Natural Sciences (EI, CAS), 2005, No. 1, pp. 103-106 (4 pages)
A new method using support vector data description (SVDD) to distinguish legitimate users from masqueraders based on UNIX user command sequences is proposed. Sliding windows are used to get low detection delay. Experiments demonstrate that the detection effect using enriched sequences is better than that of using truncated sequences. As an SVDD profile is composed of a small number of support vectors, our SVDD-based method can achieve computation and storage advantages when the detection performance is similar to that of existing methods.
Keywords: computer security; intrusion detection; masquerade detection; user profiling; support vector machine
ADAPTIVE AND ACTIVE COMPUTING PARADIGM FOR PERSONALIZED INFORMATION SERVICE IN DISTRIBUTED HETEROGENEOUS ENVIRONMENT
15
Authors: Ma Zhaofeng (马兆丰), Feng Boqin (冯博琴). Source: Journal of Pharmaceutical Analysis (SCIE, CAS), 2003, No. 2, pp. 129-133 (5 pages)
Traditional pull-based information services cannot meet the demand of long-term users for getting domain information in a timely and proper manner. To solve this problem, an adaptive and active computing paradigm (AACP) for personalized information service in a heterogeneous environment is proposed to provide user-centered, push-based, high-quality information service in a timely and proper way. Its motivation is generalized as the R^4 Service: the right information at the right time in the right way to the right person. On this basis, a formalized algorithmic framework for adaptive user profile management, incremental information retrieval, information filtering, and an active delivery mechanism is discussed in detail. The AACP paradigm serves users in a push-based, event-driven, interest-related, adaptive and active information service mode, which is useful and promising for long-term users to gain fresh information instead of polling various information sources.
Keywords: adaptive and active computing paradigm; user profiling; incremental information retrieval; information filtering; active information delivery
Profile of People Living with HIV/AIDS in a Large Municipality of Sao Paulo State, Brazil (2012-2013)
16
Authors: Gabriela Tavares Magnabosco, Livia Maria Lopes, Mayara Falico Faria, Maria Eugenia Firmino Brunello, Tiemi Arakawa, Aline Araujo Antunes, Rubia Laine de Paula Andrade, Aline Aparecida Monroe, Tereza Cristina Scatena Villa. Source: Journal of Health Science, 2014, No. 2, pp. 94-101 (8 pages)
HIV/AIDS has brought to light the challenge of incorporating the many influences between living conditions, social characteristics and health services performance into adequate care for PLWHA (people living with HIV/AIDS). The vulnerability of these populations is under the responsibility of specialized care units whose assistance does not always occur according to their real needs and demands. Therefore, this study aimed to analyze the demographic, social and clinical profiles of PLWHA, as well as their follow-up in SS (Specialized Health Services) in Ribeirão Preto, Brazil. It is a descriptive study conducted by applying structured questionnaires to 253 patients with HIV/AIDS in follow-up during the years 2012-2013. Variables were analyzed by descriptive statistics procedures. The findings pointed to gender parity, an aging population, low education and a predominance of economic class C. Regarding clinical characteristics, there was a predominance of asymptomatic individuals, with no clinical manifestations of AIDS or major comorbidities. The main mode of transmission was sexual contact. The results point to the need to adapt the assistance provided to the specificities inherent to PLWHA. Care provision should take an interdisciplinary perspective, targeting recognition of problems and ensuring comprehensive health care adequate to users' needs and demands.
Keywords: AIDS; user profile; evaluation of health services; comprehensive health care
DP-UserPro:differentially private user profile construction and publication
17
Authors: Zheng HUO, Ping HE, Lisha HU, Huanyu ZHAO. Source: Frontiers of Computer Science (SCIE, EI, CSCD), 2021, No. 5, pp. 197-206 (10 pages)
User profiles are widely used in the age of big data. However, generating and releasing user profiles may cause serious privacy leakage, since a large amount of personal data is collected and analyzed. In this paper, we propose a differentially private user profile construction method, DP-UserPro, which is composed of DP-CLIQUE and private top-κ tags selection. DP-CLIQUE is a differentially private high-dimensional data clustering algorithm based on CLIQUE. The multidimensional tag space is divided into cells, and Laplace noise is added to the count value of each cell. Based on breadth-first search, the largest connected dense cells are clustered into a cluster. Then a private top-κ tags selection approach is proposed, based on the score function of each tag, to select the κ most important tags that can represent the characteristics of the cluster. Finally, the privacy and utility of DP-UserPro are theoretically analyzed and experimentally evaluated. Comparison experiments are carried out with the Tag Suppression algorithm on two real datasets to measure the False Negative Rate (FNR) and precision. The results show that DP-UserPro outperforms Tag Suppression by 62.5% in the best case and 14.25% in the worst case on FNR, and that DP-UserPro is about 21.1% better on precision than Tag Suppression on average.
Keywords: user profile; DP-CLIQUE; clustering; differential privacy; recommender system
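One step in the abstract above, adding Laplace noise to the count of each cell, can be illustrated with the standard Laplace mechanism. This is a minimal sketch and not the DP-CLIQUE algorithm: the function names, the cell keys, the sensitivity of 1 and the epsilon value are all assumptions, and the clustering and tag-selection stages are omitted.

```python
import random


def laplace_noise(scale, rng):
    """Sample Laplace(0, scale) as the difference of two i.i.d.
    exponential variables that each have mean `scale`."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_cell_counts(counts, epsilon, sensitivity=1.0, seed=None):
    """Add Laplace noise with scale sensitivity / epsilon to every cell count.

    Assumption: one user changes any single cell count by at most
    `sensitivity` (1 by default). The paper's actual privacy budget
    allocation is not given in the abstract.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    rng = random.Random(seed)
    scale = sensitivity / epsilon
    return {cell: count + laplace_noise(scale, rng) for cell, count in counts.items()}


# Hypothetical 2-D grid cells with raw counts.
counts = {(0, 0): 12, (0, 1): 3, (1, 0): 0, (1, 1): 25}
noisy = noisy_cell_counts(counts, epsilon=0.5, seed=0)
```

A smaller epsilon gives a larger noise scale and therefore stronger privacy at the cost of less accurate counts. The noisy values are real numbers and may be negative, so a downstream density threshold has to account for that.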
Feature weighted clustering for user profiling
18
Authors: Ayse Cufoglu, Mahi Lohi, Colin Everiss. Source: International Journal of Modeling, Simulation, and Scientific Computing (EI), 2017, No. 4, pp. 245-261 (17 pages)
Personalization is the adaptation of services to fit the user's interests, characteristics and needs. The key to effective personalization is user profiling. Apart from traditional collaborative and content-based approaches, a number of classification and clustering algorithms have been used to classify user-related information to create user profiles. However, they are not able to achieve accurate user profiles. In this paper, we present a new clustering algorithm, namely Multi-Dimensional Clustering (MDC), for user profiling. The MDC is a version of the Instance-Based Learner (IBL) algorithm that assigns weights to feature values and considers these weights for the clustering. Three feature weighting methods are proposed for the MDC, and all three have been tested and evaluated. Simulations were conducted using two user profile datasets: the training dataset (10,000 instances) and the test dataset (1000 instances). These datasets reflect each user's personal information, preferences and interests. Additional simulations and comparisons with existing weighted and non-weighted instance-based algorithms were carried out in order to demonstrate the performance of the proposed algorithm. Experimental results using the user profile datasets demonstrate that the proposed algorithm has better clustering accuracy than the other algorithms. This work is based on the doctoral thesis of the corresponding author.
Keywords: classification; clustering; mining methods and algorithms; user profiling and personalization
User Profiling for CSDN:Keyphrase Extraction,User Tagging and User Growth Value Prediction
19
Authors: Guoliang Xing, Hao Gao, Qi Cao, Xinyu Yue, Bingbing Xu, Keting Cen, Huawei Shen. Source: Data Intelligence, 2019, No. 2, pp. 137-159 (23 pages)
The Chinese Software Developer Network (CSDN) is one of the largest information technology communities and service platforms in China. This paper describes the user profiling for CSDN, an evaluation track of SMP Cup 2017. It contains three tasks: (1) user document keyphrase extraction, (2) user tagging and (3) user growth value prediction. In the first task, we treat keyphrase extraction as a classification problem and train a Gradient-Boosting-Decision-Tree model with comprehensive features. In the second task, to deal with class imbalance and capture the interdependency between classes, we propose a two-stage framework: (1) for each class, we train a binary classifier to model each class against all of the other classes independently; (2) we feed the output of the trained classifiers into a softmax classifier, tagging each user with multiple labels. In the third task, we propose a comprehensive architecture to predict user growth value. Our contributions in this paper are summarized as follows: (1) we extract various types of features to identify the key factors in user value growth; (2) we use a semi-supervised method and the stacking technique to extend labeled data sets and increase the generality of the trained model, resulting in an impressive performance in our experiments. In the competition, we achieved first place out of 329 teams.
Keywords: user profiling; keyphrase extraction; user tagging; growth value prediction; word embedding
Identifying User Profile by Incorporating Self-Attention Mechanism based on CSDN Data Set
20
Authors: Junru Lu, Le Chen, Kongming Meng, Fengyi Wang, Jun Xiang, Nuo Chen, Xu Han, Binyang Li. Source: Data Intelligence, 2019, No. 2, pp. 160-175 (16 pages)
With the popularity of social media, there has been an increasing interest in user profiling and its applications. This paper presents our system, named UIR-SIST, for the User Profiling Technology Evaluation Campaign in SMP CUP 2017. UIR-SIST aims to complete three tasks: keyword extraction from blogs, user interest labeling and user growth value prediction. To this end, we first extract keywords from a user's blog, drawing on the blog itself, blogs on the same topic and other blogs published by the same user. Then a unified neural network model is constructed based on a convolutional neural network (CNN) for user interest tagging. Finally, we adopt a stacking model for predicting user growth value. We ultimately placed sixth, with evaluation scores of 0.563, 0.378 and 0.751 on the three tasks, respectively.
Keywords: user profile; convolutional neural network (CNN); self-attention; keyword extraction