The proliferation of intelligent, connected Internet of Things (IoT) devices facilitates data collection. However, task workers may be reluctant to participate in data collection due to privacy concerns, and task requesters may be concerned about the validity of the collected data. Hence, it is vital to evaluate the quality of the data collected by task workers while protecting privacy in spatial crowdsourcing (SC) data collection tasks with IoT. To this end, this paper proposes a privacy-preserving data reliability evaluation scheme for SC in IoT, named PARE. First, we design a data uploading format using blockchain and the Paillier homomorphic cryptosystem, providing immutable and traceable data while addressing privacy concerns. Second, based on the uploaded data, we propose a method to determine the approximate correct value region without knowing the exact value. Finally, we offer a data filtering mechanism based on the Paillier cryptosystem using this value region. The evaluation and analysis results show that PARE outperforms the existing solution in terms of performance and privacy protection.
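As a rough illustration of the property a Paillier-based upload format can rely on, the sketch below shows additive homomorphism: multiplying two ciphertexts yields an encryption of the sum of the plaintexts. It is not PARE's actual data format or filtering mechanism. The function names (`keygen`, `encrypt`, `decrypt`, `add_encrypted`) and the tiny primes are assumptions chosen for readability, and the key size is far too small for real use.

```python
import math
import random

def keygen(p=293, q=433):
    # Toy primes for illustration only (hypothetical values);
    # real deployments need primes of roughly 1024 bits or more.
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p-1, q-1)
    g = n + 1                 # common simplification for the generator
    mu = pow(lam, -1, n)      # modular inverse; valid because g = n + 1
    return (n, g), (lam, mu)

def encrypt(pub, m, rng=random):
    n, g = pub
    n2 = n * n
    # Pick a random r that is invertible modulo n.
    while True:
        r = rng.randrange(1, n)
        if math.gcd(r, n) == 1:
            break
    return (pow(g, m, n2) * pow(r, n, n2)) % n2

def decrypt(pub, priv, c):
    n, _ = pub
    lam, mu = priv
    u = pow(c, lam, n * n)
    # L(u) = (u - 1) / n, then multiply by mu modulo n.
    return ((u - 1) // n * mu) % n

def add_encrypted(pub, c1, c2):
    # Multiplying ciphertexts adds the underlying plaintexts (mod n).
    n, _ = pub
    return (c1 * c2) % (n * n)

pub, priv = keygen()
c_sum = add_encrypted(pub, encrypt(pub, 17), encrypt(pub, 25))
print(decrypt(pub, priv, c_sum))
```

Under these assumptions, a party holding only ciphertexts can aggregate worker readings without seeing any individual value; only the private-key holder can decrypt the total.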
Crowdsourcing technology is widely recognized for its effectiveness in task scheduling and resource allocation. While traditional methods for task allocation can help reduce costs and improve efficiency, they may encounter challenges when dealing with abnormal data flow nodes, leading to decreased allocation accuracy and efficiency. To address these issues, this study proposes a novel two-part invalid detection task allocation framework. In the first step, an anomaly detection model is developed using a dynamic self-attentive GAN to identify anomalous data. Compared to the baseline method, the model achieves an approximately 4% increase in the F1 value on the public dataset. In the second step of the framework, task allocation modeling is performed using a two-part (bipartite) graph matching method. This phase introduces a P-queue KM algorithm that implements a more efficient optimization strategy. The allocation efficiency is improved by approximately 23.83% compared to the baseline method. Empirical results confirm the effectiveness of the proposed framework in detecting abnormal data nodes, enhancing allocation precision, and achieving efficient allocation.
Crowdsourcing holds broad applications in information acquisition and dissemination, yet encounters challenges pertaining to data quality assessment and user reputation management. Reputation mechanisms stand as crucial solutions for appraising and updating participant reputation scores, thereby elevating the quality and dependability of crowdsourced data. However, these mechanisms face several challenges in traditional crowdsourcing systems: 1) platform security lacks robust guarantees and may be susceptible to attacks; 2) there exists a potential for large-scale privacy breaches; and 3) incentive mechanisms relying on reputation scores may encounter issues as reputation updates hinge on task demander evaluations, occasionally lacking a dedicated reputation update module. This paper introduces a reputation update scheme tailored for crowdsourcing, with a focus on proficiently overseeing participant reputations and alleviating the impact of malicious activities on the sensing system. Here, the reputation update scheme is determined by an Empirical Cumulative distribution-based Outlier Detection method (ECOD). Our scheme embraces a blockchain-based crowdsourcing framework utilizing a homomorphic encryption method to ensure data transparency and tamper-resistance. Computation of user reputation scores relies on their behavioral history, actively discouraging undesirable conduct. Additionally, we introduce a dynamic weight incentive mechanism that mirrors alterations in participant reputation, enabling the system to allocate incentives based on user behavior and reputation. Our scheme undergoes evaluation on 11 datasets, revealing substantial enhancements in data credibility for crowdsourcing systems and a reduction in the influence of malicious behavior. This research not only presents a practical solution for crowdsourcing reputation management but also offers valuable insights for future research and applications, holding promise for fostering more reliable and high-quality data collection in crowdsourcing across diverse domains.
With the rapid development of the mobile Internet, spatial crowdsourcing has become more and more popular. Spatial crowdsourcing consists of many different types of applications, such as spatial crowd-sensing services. Spatial crowd-sensing collects and analyzes traffic sensing data from clients like vehicles and traffic lights to construct intelligent traffic prediction models. Besides collecting sensing data, spatial crowdsourcing also includes spatial delivery services like DiDi and Uber. Appropriate task assignment and worker selection dominate the service quality of spatial crowdsourcing applications. Previous research conducted task assignments via traditional matching approaches or simple network models. However, advanced mining methods are lacking to explore the relationship between workers, task publishers, and the spatio-temporal attributes of tasks. Therefore, in this paper, we propose a Deep Double Dueling Spatial-temporal Q Network (D3SQN) to adaptively learn the spatial-temporal relationship between tasks, task publishers, and workers in a dynamic environment to achieve optimal allocation. Specifically, D3SQN is revised through reinforcement learning by adding a spatial-temporal transformer that can estimate the expected state values and action advantages so as to improve the accuracy of task assignments. Extensive experiments are conducted over real data collected from DiDi and ELM, and the simulation results verify the effectiveness of our proposed models.
The widespread use of advanced electronic devices has led to the emergence of spatial crowdsourcing, a method that taps into collective efforts to perform real-world tasks like environmental monitoring and traffic surveillance. Our research focuses on a specific type of spatial crowdsourcing that involves ongoing, collaborative efforts for continuous spatial data acquisition. However, due to limited budgets and workforce availability, the collected data often lacks completeness, posing a data deficiency problem. To address this, we propose a reciprocal framework to optimize task assignments by leveraging the mutual benefits of spatiotemporal subtask execution. We introduce an entropy-based quality metric to capture the combined effects of incomplete data acquisition and interpolation imprecision. Building on this, we explore a quality-aware task assignment method, corresponding to spatiotemporal assignment strategies. Since the assignment problem is NP-hard, we develop a polynomial-time algorithm with a guaranteed approximation ratio. Novel indexing and pruning techniques are proposed to further enhance performance. Extensive experiments conducted on datasets validate the effectiveness of our methods.
A Context-Sensitive Task (CST) is a complex task type in crowdsourcing, such as handwriting recognition, route planning, and audio transcription. Current result inference algorithms perform well on simple crowdsourcing tasks, but cannot obtain high-quality inference results for CSTs. The conventional method for solving CSTs is to divide a CST into multiple independent simple subtasks for crowdsourcing, but this method ignores the context correlation among subtasks and reduces the quality of result inference. To solve this problem, we propose a result inference algorithm based on the Partially ordered set and Tree augmented naive Bayes Infer (P&T-Inf) for CSTs. First, we screen the candidate results of context-sensitive tasks based on the partially ordered set. If there are parallel candidate sets, the conditional mutual information among subtasks containing context information in external knowledge (such as Google n-gram corpus, American Contemporary English corpus, etc.) will be calculated. Combined with the tree augmented naive Bayes (TAN) model, the maximum weighted spanning tree is used to model the dependencies among subtasks in each CST. We collect two crowdsourcing datasets of handwriting recognition tasks and audio transcription tasks from a real crowdsourcing platform. The experimental results show that our approach improves the quality of result inference in CSTs and reduces the time cost compared with the latest methods.
The ubiquity of mobile devices has promoted the prosperity of mobile crowd systems, which recruit crowds to contribute their resources for performing tasks. Yet, because of the resources consumed, the crowds may be reluctant to join and contribute information. Thus, the low participation level of crowds will be a hurdle that prevents the adoption of crowdsourcing. A critical challenge for these systems is how to design a proper mechanism such that the crowds spontaneously act as suppliers to contribute accurate information. Most existing mechanisms ignore the honesty of either the crowds or the requesters. In this paper, considering the honesty of both, we propose a game-based incentive mechanism, namely RTRC, to stimulate the crowds to contribute accurate information and to motivate the requesters to return accurate feedback. In addition, an evolutionary game is designed to model the dynamics of user-strategy selection. Specifically, the replicator dynamic is applied to model the adaptation of strategy interactions, taking into account their time-dependent dynamic nature, and we also derive the evolutionarily stable strategies (ESSs) for users. Finally, empirical results from the simulations show that all the requesters and suppliers will select the honest strategy to maximize their profit.
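To make the replicator dynamic mentioned in this abstract concrete, here is a minimal single-population sketch with two strategies ("honest" and "dishonest"), integrated with Euler steps. It is not the RTRC model itself: the payoff matrix, step size, and function names are hypothetical, and the paper's game involves both requesters and suppliers.

```python
def replicator_step(x, payoff, dt=0.01):
    """One Euler step of the two-strategy replicator dynamic.

    x is the share of the population playing strategy 0 ("honest");
    payoff[i][j] is the payoff to strategy i when meeting strategy j.
    """
    f0 = payoff[0][0] * x + payoff[0][1] * (1 - x)  # fitness of strategy 0
    f1 = payoff[1][0] * x + payoff[1][1] * (1 - x)  # fitness of strategy 1
    avg = x * f0 + (1 - x) * f1                     # population-average fitness
    # dx/dt = x * (f0 - avg): a strategy grows when it beats the average.
    return x + dt * x * (f0 - avg)

def simulate(x0, payoff, steps=5000, dt=0.01):
    x = x0
    for _ in range(steps):
        x = replicator_step(x, payoff, dt)
    return x

# Hypothetical payoffs in which honesty strictly dominates.
honest_dominates = [[3.0, 2.0],
                    [1.0, 0.0]]
print(simulate(0.1, honest_dominates))
```

With payoffs like these, the honest share climbs toward 1 from any interior starting point, which is the kind of outcome an evolutionarily stable honest strategy implies.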
Big data have the characteristics of enormous volume, high velocity, diversity, value-sparsity, and uncertainty, which make learning knowledge from them full of challenges. With the emergence of crowdsourcing, versatile information can be obtained on demand so that the wisdom of crowds is easily involved to facilitate the knowledge learning process. During the past thirteen years, researchers in the AI community have made great efforts to remove the obstacles in the field of learning from crowds. This concentrated survey paper comprehensively reviews the technical progress in crowdsourcing learning from a systematic perspective that includes three dimensions: data, models, and learning processes. In addition to reviewing important existing work, the paper places particular emphasis on providing some promising blueprints for each dimension as well as discussing the lessons learned from our past research work, which will light the way for new researchers and encourage them to pursue new contributions.
With the development of the Internet of Things (IoT), spatio-temporal crowdsourcing (mobile crowdsourcing) has become an emerging paradigm for addressing location-based sensing tasks. However, the delay caused by network transmission has led to low data processing efficiency. Fortunately, edge computing can solve this problem, effectively reduce the delay of data transmission, and improve data processing capacity, so that the crowdsourcing platform can make better decisions faster. Therefore, this paper combines spatio-temporal crowdsourcing and edge computing to study the Multi-Objective Optimization Task Assignment (MOO-TA) problem in the edge computing environment. The proposed online incentive mechanism considers the task difficulty attribute to motivate crowd workers to perform sensing tasks in unpopular areas. In this paper, the Weighted and Multi-Objective Particle Swarm Combination (WAMOPSC) algorithm is proposed to maximize both the platform's and the crowd workers' utility, so as to maximize social welfare. The algorithm combines the traditional Linear Weighted Summation (LWS) algorithm and the Multi-Objective Particle Swarm Optimization (MOPSO) algorithm to find as many Pareto-optimal solutions of the multi-objective optimization task assignment problem as possible for the crowdsourcing platform to choose from. Through comparison experiments on real data sets, the effectiveness and feasibility of the proposed method are evaluated.
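The two ingredients named in this abstract, linear weighted summation and Pareto optimality, can be sketched on a toy set of candidate assignments. This is not the WAMOPSC algorithm and contains no particle swarm search; the candidate utilities, the weight `w`, and the function names are assumptions made for illustration. Each candidate is a pair (platform utility, worker utility), and both are maximized.

```python
def pareto_front(points):
    """Return the points not dominated by any other point (maximizing both objectives)."""
    front = []
    for p in points:
        # q dominates p if q is at least as good on both objectives and differs from p.
        dominated = any(
            q[0] >= p[0] and q[1] >= p[1] and q != p
            for q in points
        )
        if not dominated:
            front.append(p)
    return front

def weighted_sum_best(points, w):
    """Linear weighted summation: pick the point maximizing w*f1 + (1-w)*f2."""
    return max(points, key=lambda p: w * p[0] + (1 - w) * p[1])

# Hypothetical (platform utility, worker utility) pairs.
candidates = [(1, 5), (2, 4), (3, 3), (2, 2), (4, 1), (1, 1)]
print(pareto_front(candidates))
print(weighted_sum_best(candidates, 0.9))
print(weighted_sum_best(candidates, 0.1))
```

A single weight yields one solution, while the Pareto front keeps every non-dominated trade-off, which is why a platform may prefer to be handed the whole front and choose for itself.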
In the era of the Internet of Things (IoT), the crowdsourcing process is driven by data collected by devices that interact with each other and with the physical world. As a part of the IoT ecosystem, task assignment has become an important goal of the research community. Existing task assignment algorithms can be categorized as offline (performs better with datasets but struggles to achieve good real-life results) or online (works well with real-life input but is difficult to optimize regarding in-depth assignments). This paper proposes a Cross-regional Online Task (CROT) assignment problem based on the online assignment model. Given the CROT problem, an Online Task Assignment across Regions based on Prediction (OTARP) algorithm is proposed. OTARP is a two-stage graph-driven bilateral assignment strategy that uses edge cloud and graph embedding to complete task assignments. The first stage uses historical data to make offline predictions, with a graph-driven method for offline bipartite graph matching. The second stage uses a bipartite graph to complete the online task assignment process. This paper proposes accelerating the task assignment process through multiple assignment rounds and optimizing the process by combining offline guidance and online assignment strategies. To encourage crowd workers to complete crowd tasks across regions, an incentive strategy is designed to encourage crowd workers' movement. To avoid the idle problem in the process of crowd worker movement, a drop-by-rider problem is used to help crowd workers accept more crowd tasks, optimize the number of assignments, and increase utility. Finally, through comparison experiments on real datasets, the performance of the proposed algorithm on crowd worker utility value and the matching number is evaluated.
Crowd-sensing is an innovative data acquisition method that combines the perception of mobile devices with the idea of crowdsourcing. It is a new application mode arising from the development of the Internet of Things. The perceptual data that mobile users can provide is limited. Multiple crowdsourcing parties share this limited data, but the cost that each crowdsourcing party can pay is also limited, and enough mobile users are needed to complete the perceptual task so that the wisdom of the crowd can truly come into play. In this process, there is bound to be a game between the crowdsourcing parties and the mobile users. Most existing research considers a crowd-sensing system in which a group of mobile users directly share or compete for the opportunity to perform tasks for crowdsourcing parties and get paid. The behavior of multiple crowdsourcing parties and their bilateral interaction with mobile users have not been studied clearly or in a targeted way. This paper models and analyzes the dynamic evolution process of crowd sensing. Based on the unique characteristics of the crowdsourcing non-cooperative game and the crowdsourcing Nash equilibrium, we develop a perceptual plan for mobile users and use the stability analysis of iterative algorithms to explore a way to better match the capabilities of mobile users and the needs of crowdsourcing parties. Our theoretical analysis and simulation results verify the dynamic evolution model of crowdsourcing in crowd sensing and propose a method to improve the efficiency of crowdsourcing.
In a crowdsourcing experiment, viewers (the "crowd") of a British Broadcasting Corporation (BBC) television show submitted estimates of the number of coins in a tumbler; an antecedent paper (Part 1) showed that these estimates follow a log-normal distribution Λ(m, s²). The coin-estimation experiment is an archetype of a broad class of image analysis and object counting problems suitable for solution by crowdsourcing. The objective of the current paper (Part 2) is to determine the location and scale parameters (m, s) of Λ(m, s²) by both Bayesian and maximum likelihood (ML) methods and to compare the results. One outcome of the analysis is the resolution, by means of Jeffreys' rule, of questions regarding the appropriate Bayesian prior. It is shown that Bayesian and ML analyses lead to the same expression for the location parameter, but different expressions for the scale parameter, which become identical in the limit of an infinite sample size. A second outcome of the analysis concerns use of the sample mean as the measure of information of the crowd in applications where the distribution of responses is not sought or known. In the coin-estimation experiment, the sample mean was found to differ widely from the mean number of coins calculated from Λ(m, s²). This discordance raises critical questions concerning whether, and under what conditions, the sample mean provides a reliable measure of the information of the crowd. This paper resolves that problem by use of the principle of maximum entropy (PME). The PME yields a set of equations for finding the most probable distribution consistent with given prior information and only that information. If there is no solution to the PME equations for a specified sample mean and sample variance, then the sample mean is an unreliable statistic, since no measure can be assigned to its uncertainty. Parts 1 and 2 together demonstrate that the information content of crowdsourcing resides in the distribution of responses (very often log-normal in form), which can be obtained empirically or by appropriate modeling.
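For readers who want to see the maximum likelihood side of this comparison in concrete terms, the sketch below computes the standard ML estimates of the log-normal parameters (the mean and the 1/n-normalized standard deviation of the log-responses) and the distribution mean exp(m + s²/2) they imply. The sample values and function names are hypothetical; the paper's Bayesian estimator and its PME analysis are not reproduced here.

```python
import math

def lognormal_mle(samples):
    """Maximum likelihood estimates (m, s) for log-normally distributed samples."""
    logs = [math.log(x) for x in samples]
    n = len(logs)
    m = sum(logs) / n                             # location: mean of log-values
    s2 = sum((v - m) ** 2 for v in logs) / n      # scale^2: ML uses 1/n, not 1/(n-1)
    return m, math.sqrt(s2)

def lognormal_mean(m, s):
    """Mean of the log-normal distribution with parameters (m, s)."""
    return math.exp(m + s * s / 2)

# Hypothetical crowd estimates with one very large guess in the tail.
estimates = [200, 350, 400, 500, 650, 900, 5000]
m, s = lognormal_mean.__defaults__ or (None, None)  # no defaults; placeholder unused
m, s = lognormal_mle(estimates)
print(m, s, lognormal_mean(m, s), sum(estimates) / len(estimates))
```

Printing the model-implied mean next to the raw sample mean makes it easy to see how far the two can drift apart on skewed responses, which is the discordance the abstract describes.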
Crowdsourcing is an innovative business practice of obtaining needed services, ideas, content, or even funds by soliciting contributions from a large group of people (the 'Crowd'). The potential benefits of utilizing crowdsourcing in product design are well-documented, but little research exists on the barriers and opportunities in adopting crowdsourcing in new product development (NPD) of manufacturing SMEs. In order to answer these questions, a Proof of Market study is carried out on crowdsourcing-based product design under an Innovate UK funded Smart project, which aims at identifying the needs, challenges, and future development opportunities associated with adopting crowdsourcing strategies for NPD. The research findings from this study are reported here and can be used to guide future development of crowdsourcing-based collaborative design methods and tools and to provide some practical references for industry to adopt this new and emerging collaborative design method in their business.
In the context of the continuous development of the Internet, crowdsourcing has received continuous attention as a new cooperation model based on the relationship between enterprises, the public, and society. A reasonably designed recommendation algorithm can recommend a batch of suitable workers for crowdsourcing tasks to improve the final task completion quality. Therefore, this paper proposes a crowdsourcing recommendation framework based on workers' influence (CRBI). This crowdsourcing framework completes the entire process design from task distribution and worker recommendation to result return through processes such as worker behavior analysis, task characteristics construction, and cost optimization. In this paper, a calculation model of workers' influence characteristics based on the ablation method is designed to evaluate the comprehensive performance of workers. At the same time, the CRBI framework incorporates the traditional open-call task selection mode and builds a new task characteristics model by sensing the influence of the requesting worker and its task performance. Finally, accurate worker recommendation and task cost optimization are carried out by calculating model familiarity. In addition, for the task answers submitted by recommended workers, this paper also proposes an aggregation algorithm based on weighted influence to ensure the accuracy of task results. This paper conducts simulation experiments on some public datasets of AMT, and the experimental results show that the proposed CRBI framework has high comprehensive performance. Moreover, CRBI has better usability, is more in line with commercial needs, and can well reflect the wisdom of group intelligence.
Crowdsourcing is widely used in various fields to collect goods and services from large numbers of participants. Evaluating teaching quality by collecting feedback from experts or students after class is not only delayed but also inaccurate. In this paper, we present a crowdsourcing-based framework to evaluate teaching quality in the classroom using a weighted average operator to aggregate information from students' questionnaires described by linguistic 2-tuple terms. Then we define a crowd grade based on similarity degree to distinguish contributions from different students and minimize the impact of abnormal students on the evaluation. The crowd grade is updated at the end of each round of feedback so that the accuracy of the evaluation is maintained. Moreover, a simulated case is shown to illustrate how to apply this framework to assess teaching quality in the classroom. Finally, we developed a prototype and carried out some experiments on a series of real questionnaires and two sets of modified data. The results show that teachers can locate the weak points of teaching and, furthermore, identify the abnormal students to improve the teaching quality. Meanwhile, our approach provides a strong tolerance for abnormal students, making the evaluation more accurate.
In order to solve the problem of location privacy under big data and improve the user positioning experience, a new concept of anonymous crowdsourcing-based WLAN indoor localization is proposed by employing the Micro-Electro-Mechanical System (MEMS) motion sensors as well as the WLAN module in off-the-shelf smartphones. First of all, the crowdsourced motion traces with similar Received Signal Strength (RSS) sequences are assembled into a motion graph. Second, the mobility map is constructed according to trace segmentation and clustering. Third, pixel template matching is adopted to physically label the pre-constructed mobility map. Finally, a robust Extended Kalman Filter (EKF) is designed to perform localization by matching the newly collected RSS measurements against the mobility map. The extensive experimental results show that the proposed approach is capable of constructing a physically labeled mobility map from the sporadically collected crowdsourced motion traces as well as achieving satisfactory localization accuracy in a cost-efficient manner.
The spatial distribution of cropland is an important input to many applications including food security monitoring and economic land use modeling. Global land cover maps derived from remote sensing are one source of cropland information, but they are currently not accurate enough in the cropland domain to meet the needs of the user community. Moreover, when compared with one another, these land cover products show large areas of spatial disagreement, which makes it very difficult to choose which land cover product to use. This paper takes an entirely different approach to mapping cropland, using crowdsourcing of Google Earth imagery via tools in Geo-Wiki. Using sample data generated by a crowdsourcing campaign for the collection of the degree of cultivation and settlement in Ethiopia, a cropland map was created using simple inverse distance weighted interpolation. The map was validated using data from the GOFC-GOLD validation portal and an independent crowdsourced dataset from Geo-Wiki. The results show that the crowdsourced cropland map for Ethiopia has a higher overall accuracy than the individual global land cover products for this country. Such an approach has great potential for mapping cropland in other countries where such data do not currently exist. Not only is the approach inexpensive, but the data can be collected over a very short period of time using an existing network of volunteers.
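The "simple inverse distance weighted interpolation" mentioned in this abstract can be sketched in a few lines. The sample coordinates, the cultivation percentages, the power parameter of 2, and the function name `idw` are assumptions for illustration; the abstract does not state the settings used for the Ethiopia map.

```python
import math

def idw(samples, x, y, power=2.0):
    """Inverse distance weighted estimate at (x, y).

    samples is a list of (sx, sy, value) tuples, e.g. crowdsourced
    percent-cultivation judgments at known locations.
    """
    num = 0.0
    den = 0.0
    for sx, sy, value in samples:
        d = math.hypot(x - sx, y - sy)
        if d == 0.0:
            return value          # query point coincides with a sample
        w = 1.0 / d ** power      # nearer samples get larger weights
        num += w * value
        den += w
    return num / den

# Hypothetical crowdsourced samples: (x, y, percent cultivated).
points = [(0.0, 0.0, 0.0), (2.0, 0.0, 100.0)]
print(idw(points, 1.0, 0.0))   # midway between the two samples
print(idw(points, 0.5, 0.0))   # closer to the uncultivated sample
```

Evaluating `idw` on every cell of a grid gives a continuous surface from sparse volunteer judgments, which can then be thresholded or compared against validation data.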
A centralized trusted execution environment (TEE) has been extensively studied to provide secure and trusted computing. However, a TEE might become a throughput bottleneck if it is used to evaluate data quality when collecting large-scale data in a crowdsourcing system. It may also suffer security problems if compromised by attackers. Here, we propose a scheme, named dTEE, for building a platform that provides distributed trusted computing by leveraging TEEs. The platform is used as an infrastructure of trusted computations for blockchain-based crowdsourcing systems, especially to securely evaluate data quality and manage remuneration: these operations are handled by a TEE group. First, dTEE uses a public blockchain with smart contracts to manage TEEs without reliance on any trusted third parties. Second, to update TEE registration information and rule out zombie TEEs, dTEE uses a reporting mechanism. To attract TEE owners to join in and provide trusted computation services, it uses a fair monetary incentive mechanism. Third, to account for malicious attackers, we design a model with Byzantine fault tolerance, not limited to a crash-failure model. Finally, we conduct an extensive evaluation of our design on a local cluster. The results show that dTEE finishes evaluating 10,000 images within one minute and achieves about 65 tps throughput when evaluating Sudoku solution data with collective signatures, both in a group of 120 TEEs.
Generally, knowledge extraction technology is used to obtain nodes and relationships from unstructured and structured data, and the extracted data are then fused with the original knowledge graph to extend it. Because the concepts and knowledge structures expressed on the Internet suffer from multi-source heterogeneity and low accuracy, it is usually difficult to achieve good results simply by using knowledge extraction technology. Considering that domain knowledge is highly dependent on the relevant expert knowledge, the method in this paper tries to expand the domain knowledge through crowdsourcing. The method splits the domain knowledge system into knowledge subgraphs according to the corresponding concepts, forms subtasks with moderate granularity, and uses crowdsourcing technology for the acquisition and integration of knowledge subgraphs to improve the knowledge system.
Funding (PARE): This work was supported by the National Natural Science Foundation of China under Grant 62233003 and the National Key Research and Development Program of China under Grant 2020YFB1708602.
Funding (two-part invalid detection task allocation framework): National Natural Science Foundation of China (62072392).
Funding: This work is supported by the National Natural Science Foundation of China (Nos. U21A20463, 62172117, 61802383), the Research Project of Pazhou Lab for Excellent Young Scholars (No. PZL2021KF0024), and the Guangzhou Basic and Applied Basic Research Foundation (Nos. 202201010330, 202201020162, 202201020221).
Abstract: Crowdsourcing holds broad applications in information acquisition and dissemination, yet encounters challenges pertaining to data quality assessment and user reputation management. Reputation mechanisms stand as crucial solutions for appraising and updating participant reputation scores, thereby elevating the quality and dependability of crowdsourced data. However, these mechanisms face several challenges in traditional crowdsourcing systems: 1) platform security lacks robust guarantees and may be susceptible to attacks; 2) there exists a potential for large-scale privacy breaches; and 3) incentive mechanisms relying on reputation scores may encounter issues, as reputation updates hinge on task demander evaluations and a dedicated reputation update module is sometimes lacking. This paper introduces a reputation update scheme tailored for crowdsourcing, with a focus on proficiently overseeing participant reputations and alleviating the impact of malicious activities on the sensing system. Here, the reputation update scheme is determined by an Empirical Cumulative distribution-based Outlier Detection method (ECOD). Our scheme embraces a blockchain-based crowdsourcing framework utilizing a homomorphic encryption method to ensure data transparency and tamper-resistance. Computation of user reputation scores relies on their behavioral history, actively discouraging undesirable conduct. Additionally, we introduce a dynamic weight incentive mechanism that mirrors alterations in participant reputation, enabling the system to allocate incentives based on user behavior and reputation. Our scheme undergoes evaluation on 11 datasets, revealing substantial enhancements in data credibility for crowdsourcing systems and a reduction in the influence of malicious behavior. This research not only presents a practical solution for crowdsourcing reputation management but also offers valuable insights for future research and applications, holding promise for fostering more reliable and high-quality data collection in crowdsourcing across diverse domains.
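The idea behind ECOD-style scoring can be sketched in a few lines. This is a simplified illustration, not the scheme from the abstract: it sums negative log tail probabilities from the empirical CDF per dimension and takes the larger of the left-tail and right-tail totals, omitting the skewness-based tail selection of the full ECOD method. The sample submissions are invented.

```python
import math

def ecod_scores(rows):
    """Simplified ECOD-like outlier scores for a list of equal-length rows."""
    n = len(rows)
    dims = len(rows[0])
    columns = [[r[j] for r in rows] for j in range(dims)]
    scores = []
    for x in rows:
        left = right = 0.0
        for j in range(dims):
            # Empirical tail probabilities; both are at least 1/n because
            # the point itself is counted, so log() is always defined.
            f_left = sum(v <= x[j] for v in columns[j]) / n
            f_right = sum(v >= x[j] for v in columns[j]) / n
            left += -math.log(f_left)
            right += -math.log(f_right)
        scores.append(max(left, right))
    return scores

# Four plausible submissions and one far-off submission (illustrative data).
submissions = [[1.0, 2.0], [1.1, 1.9], [0.9, 2.1], [1.0, 2.0], [8.0, 9.0]]
scores = ecod_scores(submissions)
print(scores.index(max(scores)))
```

A reputation update could then lower the score of participants whose submissions receive the highest outlier scores; how that mapping is done is outside this sketch.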
Funding: Supported in part by the Pioneer and Leading Goose R&D Program of Zhejiang Province under Grant 2022C01083 (Dr. Yu Li, https://zjnsf.kjt.zj.gov.cn/) and the Pioneer and Leading Goose R&D Program of Zhejiang Province under Grant 2023C01217 (Dr. Yu Li, https://zjnsf.kjt.zj.gov.cn/).
Abstract: With the rapid development of mobile Internet, spatial crowdsourcing has become more and more popular. Spatial crowdsourcing consists of many different types of applications, such as spatial crowd-sensing services. Spatial crowd-sensing collects and analyzes traffic sensing data from clients like vehicles and traffic lights to construct intelligent traffic prediction models. Besides collecting sensing data, spatial crowdsourcing also includes spatial delivery services like DiDi and Uber. Appropriate task assignment and worker selection dominate the service quality for spatial crowdsourcing applications. Previous research conducted task assignments via traditional matching approaches or using simple network models. However, advanced mining methods are lacking to explore the relationship between workers, task publishers, and the spatio-temporal attributes in tasks. Therefore, in this paper, we propose a Deep Double Dueling Spatial-temporal Q Network (D3SQN) to adaptively learn the spatial-temporal relationship between tasks, task publishers, and workers in a dynamic environment to achieve optimal allocation. Specifically, D3SQN is revised through reinforcement learning by adding a spatial-temporal transformer that can estimate the expected state values and action advantages so as to improve the accuracy of task assignments. Extensive experiments are conducted over real data collected from DiDi and ELM, and the simulation results verify the effectiveness of our proposed models.
Abstract: The widespread use of advanced electronic devices has led to the emergence of spatial crowdsourcing, a method that taps into collective efforts to perform real-world tasks like environmental monitoring and traffic surveillance. Our research focuses on a specific type of spatial crowdsourcing that involves ongoing, collaborative efforts for continuous spatial data acquisition. However, due to limited budgets and workforce availability, the collected data often lacks completeness, posing a data deficiency problem. To address this, we propose a reciprocal framework to optimize task assignments by leveraging the mutual benefits of spatiotemporal subtask execution. We introduce an entropy-based quality metric to capture the combined effects of incomplete data acquisition and interpolation imprecision. Building on this, we explore a quality-aware task assignment method, corresponding to spatiotemporal assignment strategies. Since the assignment problem is NP-hard, we develop a polynomial-time algorithm with a guaranteed approximation ratio. Novel indexing and pruning techniques are proposed to further enhance performance. Extensive experiments conducted on datasets validate the effectiveness of our methods.
Funding: Supported by the National Social Science Fund of China (Grant No. 22BTQ033).
Abstract: Context-Sensitive Task (CST) is a complex task type in crowdsourcing, such as handwriting recognition, route planning, and audio transcription. Current result inference algorithms perform well in simple crowdsourcing tasks, but cannot obtain high-quality inference results for CSTs. The conventional method to solve CSTs is to divide a CST into multiple independent simple subtasks for crowdsourcing, but this method ignores the context correlation among subtasks and reduces the quality of result inference. To solve this problem, we propose a result inference algorithm based on the Partially ordered set and Tree augmented naive Bayes Infer (P&T-Inf) for CSTs. First, we screen the candidate results of context-sensitive tasks based on the partially ordered set. If there are parallel candidate sets, the conditional mutual information among subtasks containing context information in external knowledge (such as Google n-gram corpus, American Contemporary English corpus, etc.) will be calculated. Combined with the tree augmented naive (TAN) Bayes model, the maximum weighted spanning tree is used to model the dependencies among subtasks in each CST. We collect two crowdsourcing datasets of handwriting recognition tasks and audio transcription tasks from a real crowdsourcing platform. The experimental results show that our approach improves the quality of result inference in CSTs and reduces the time cost compared with the latest methods.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 61672408, U1405255, 61502368, 61602537, 61602357, 61672413, U1509214, U1135002); National High Technology Research and Development Program (863 Program) (Grant Nos. 2015AA016007, 2015AA017203); China Postdoctoral Science Foundation Funded Project (Grant No. 2016M592762); Shaanxi Science & Technology Coordination & Innovation Project (Grant No. 2016TZC-G-6-3); Shaanxi Provincial Natural Science Foundation (Grant Nos. 2015JQ6227, 2016JM6005); China 111 Project (Grant No. B16037); Beijing Municipal Social Science Foundation (Grant No. 16XCC023); Fundamental Research Funds for the Central Universities (Grant Nos. JB150308, JB150309, JB161501, JBG161511).
Abstract: The ubiquity of mobile devices has promoted the prosperity of mobile crowd systems, which recruit crowds to contribute their resources for performing tasks. Yet, due to the various resources consumed, the crowds may be reluctant to join and contribute information. Thus, the low participation level of crowds will be a hurdle that prevents the adoption of crowdsourcing. A critical challenge for these systems is how to design a proper mechanism such that the crowds spontaneously act as suppliers to contribute accurate information. Most existing mechanisms ignore the honesty of either the crowds or the requesters. In this paper, considering the honesty of both, we propose a game-based incentive mechanism, namely RTRC, to stimulate the crowds to contribute accurate information and to motivate the requesters to return accurate feedback. In addition, an evolutionary game is designed to model the dynamics of user-strategy selection. Specifically, the replicator dynamic is applied to model the adaptation of strategy interactions, taking into account their time-dependent dynamic nature, and we also derive the evolutionarily stable strategies (ESSs) for users. Finally, empirical results over the simulations show that all the requesters and suppliers will select the honest strategy to maximize their profit.
Funding: Supported by the National Key Research and Development Program of China (2018AAA0102002) and the National Natural Science Foundation of China (62076130, 91846104).
Abstract: Big data have the characteristics of enormous volume, high velocity, diversity, value-sparsity, and uncertainty, which make learning knowledge from them full of challenges. With the emergence of crowdsourcing, versatile information can be obtained on demand so that the wisdom of crowds is easily involved to facilitate the knowledge learning process. During the past thirteen years, researchers in the AI community made great efforts to remove the obstacles in the field of learning from crowds. This concentrated survey paper comprehensively reviews the technical progress in crowdsourcing learning from a systematic perspective that includes three dimensions of data, models, and learning processes. In addition to reviewing existing important work, the paper places a particular emphasis on providing some promising blueprints on each dimension as well as discussing the lessons learned from our past research work, which will light the way for new researchers and encourage them to pursue new contributions.
Funding: Supported in part by the National Natural Science Foundation of China under Grant 61822602, Grant 61772207, Grant 61802331, Grant 61572418, Grant 61602399, Grant 61702439, and Grant 61773331; the China Postdoctoral Science Foundation under Grant 2019T120732 and Grant 2017M622691; the National Science Foundation (NSF) under Grant 1704287, Grant 1252292, and Grant 1741277; and the Natural Science Foundation of Shandong Province under Grant ZR2016FM42.
Abstract: With the development of the Internet of Things (IoT), spatio-temporal crowdsourcing (mobile crowdsourcing) has become an emerging paradigm for addressing location-based sensing tasks. However, the delay caused by network transmission has led to low data processing efficiency. Fortunately, edge computing can solve this problem, effectively reduce the delay of data transmission, and improve data processing capacity, so that the crowdsourcing platform can make better decisions faster. Therefore, this paper combines spatio-temporal crowdsourcing and edge computing to study the Multi-Objective Optimization Task Assignment (MOO-TA) problem in the edge computing environment. The proposed online incentive mechanism considers the task difficulty attribute to motivate crowd workers to perform sensing tasks in unpopular areas. In this paper, the Weighted and Multi-Objective Particle Swarm Combination (WAMOPSC) algorithm is proposed to maximize both the platform’s and crowd workers’ utility, so as to maximize social welfare. The algorithm combines the traditional Linear Weighted Summation (LWS) algorithm and the Multi-Objective Particle Swarm Optimization (MOPSO) algorithm to find as many Pareto-optimal solutions of the multi-objective optimization task assignment problem as possible for the crowdsourcing platform to choose from. Through comparison experiments on real datasets, the effectiveness and feasibility of the proposed method are evaluated.
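Two building blocks named in the abstract above, Pareto optimality and linear weighted summation, can be sketched for a two-objective case. This is not the WAMOPSC algorithm: the candidate solutions, their utility values, and the weights are invented, and the particle swarm search that would generate candidates is left out.

```python
def pareto_front(points):
    """Keep points not dominated by any other (both objectives are maximized)."""
    front = []
    for p in points:
        dominated = any(
            q[0] >= p[0] and q[1] >= p[1] and (q[0] > p[0] or q[1] > p[1])
            for q in points
        )
        if not dominated:
            front.append(p)
    return front

def linear_weighted_sum(points, w_platform, w_worker):
    """Pick the single point that maximizes a weighted sum of the objectives."""
    return max(points, key=lambda p: w_platform * p[0] + w_worker * p[1])

# Each tuple is (platform utility, worker utility) of a candidate assignment.
candidates = [(1, 5), (2, 4), (3, 3), (2, 2), (1, 1)]
front = pareto_front(candidates)
print(front)
print(linear_weighted_sum(front, 0.7, 0.3))
```

The Pareto front leaves the trade-off to the platform, while the weighted sum collapses it to one choice once the weights are fixed.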
Funding: Supported in part by the National Natural Science Foundation of China under Grant 62072392, Grant 61822602, Grant 61772207, Grant 61802331, Grant 61602399, Grant 61702439, Grant 61773331, and Grant 62062034; the China Postdoctoral Science Foundation under Grant 2019T120732 and Grant 2017M622691; the Natural Science Foundation of Shandong Province under Grant ZR2016FM42; the Major Scientific and Technological Innovation Projects of Shandong Province under Grant 2019JZZY020131; and the Key Projects of Shandong Natural Science Foundation under Grant ZR2020KF019.
Abstract: In the era of the Internet of Things (IoT), the crowdsourcing process is driven by data collected by devices that interact with each other and with the physical world. As a part of the IoT ecosystem, task assignment has become an important goal of the research community. Existing task assignment algorithms can be categorized as offline (performs better with datasets but struggles to achieve good real-life results) or online (works well with real-life input but is difficult to optimize regarding in-depth assignments). This paper proposes a Cross-regional Online Task (CROT) assignment problem based on the online assignment model. Given the CROT problem, an Online Task Assignment across Regions based on Prediction (OTARP) algorithm is proposed. OTARP is a two-stage graphics-driven bilateral assignment strategy that uses edge cloud and graph embedding to complete task assignments. The first stage uses historical data to make offline predictions, with a graph-driven method for offline bipartite graph matching. The second stage uses a bipartite graph to complete the online task assignment process. This paper proposes accelerating the task assignment process through multiple assignment rounds and optimizing the process by combining offline guidance and online assignment strategies. To encourage crowd workers to complete crowd tasks across regions, an incentive strategy is designed to encourage crowd workers’ movement. To avoid the idle problem in the process of crowd worker movement, a drop-by-rider problem is used to help crowd workers accept more crowd tasks, optimize the number of assignments, and increase utility. Finally, through comparison experiments on real datasets, the performance of the proposed algorithm on crowd worker utility value and the matching number is evaluated.
Funding: Supported by the National Natural Science Foundation of China (61772196, 61472136); the Postgraduate Scientific Research Innovation Project of Hunan Province (CX20201074); the Key Project of the Hunan Social Science Achievement Review Committee (XSP 19ZD1005); and the General Program of the Hunan Natural Science Foundation (2020JJ4249, 2019JJ40145).
Abstract: Crowd-sensing is an innovative data acquisition method that combines the perception of mobile devices with the idea of crowdsourcing. It is a new application mode under the development of the Internet of Things. The perceptual data that mobile users can provide is limited. Multiple crowdsourcing parties share this limited data, but the cost that a crowdsourcing party can pay is also limited, and enough mobile users are needed to complete the perceptual task so that the wisdom of the crowd can truly come into play. In this process, there is bound to be a game between the crowdsourcing parties and the mobile users. Most existing research considers a single crowd-sensing system in which a group of mobile users directly share or compete for the opportunity to perform the crowdsourcer's tasks and get paid; the behavior of multiple crowdsourcing parties and their bilateral interaction with mobile users have not been studied clearly or in a targeted way. This paper models and analyzes the dynamic evolution process of crowd sensing. Based on the unique characteristics of the crowdsourcing non-cooperative game and the crowdsourcing Nash equilibrium, we develop a perceptual plan for mobile users and use the stability analysis of iterative algorithms to explore a way to better match the capabilities of mobile users and the needs of crowdsourcing parties. Our theoretical analysis and simulation results verify the dynamic evolution model of crowdsourcing in crowd sensing and propose a method to improve the efficiency of crowdsourcing.
Abstract: A crowdsourcing experiment in which viewers (the “crowd”) of a British Broadcasting Corporation (BBC) television show submitted estimates of the number of coins in a tumbler was shown in an antecedent paper (Part 1) to follow a log-normal distribution Λ(m, s²). The coin-estimation experiment is an archetype of a broad class of image analysis and object counting problems suitable for solution by crowdsourcing. The objective of the current paper (Part 2) is to determine the location and scale parameters (m, s) of Λ(m, s²) by both Bayesian and maximum likelihood (ML) methods and to compare the results. One outcome of the analysis is the resolution, by means of Jeffreys’ rule, of questions regarding the appropriate Bayesian prior. It is shown that Bayesian and ML analyses lead to the same expression for the location parameter, but different expressions for the scale parameter, which become identical in the limit of an infinite sample size. A second outcome of the analysis concerns use of the sample mean as the measure of information of the crowd in applications where the distribution of responses is not sought or known. In the coin-estimation experiment, the sample mean was found to differ widely from the mean number of coins calculated from Λ(m, s²). This discordance raises critical questions concerning whether, and under what conditions, the sample mean provides a reliable measure of the information of the crowd. This paper resolves that problem by use of the principle of maximum entropy (PME). The PME yields a set of equations for finding the most probable distribution consistent with given prior information and only that information. If there is no solution to the PME equations for a specified sample mean and sample variance, then the sample mean is an unreliable statistic, since no measure can be assigned to its uncertainty.
Parts 1 and 2 together demonstrate that the information content of crowdsourcing resides in the distribution of responses (very often log-normal in form), which can be obtained empirically or by appropriate modeling.
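The maximum likelihood side of the log-normal analysis described above can be illustrated with the standard textbook estimators: the location m is the mean of the log-responses, and s² is their variance with divisor n. The sketch below uses invented responses, not the BBC data, and does not reproduce the paper's Bayesian expressions.

```python
import math

def lognormal_ml(samples):
    """Standard ML estimates (m, s) for a log-normal sample."""
    logs = [math.log(x) for x in samples]
    n = len(logs)
    m = sum(logs) / n
    s2 = sum((v - m) ** 2 for v in logs) / n  # ML uses divisor n, not n - 1
    return m, math.sqrt(s2)

def lognormal_mean(m, s):
    """Mean of a log-normal distribution with parameters (m, s)."""
    return math.exp(m + s * s / 2)

# Hypothetical crowd estimates chosen so that ln(x) is exactly 1 and 3.
responses = [math.exp(1), math.exp(3)]
m, s = lognormal_ml(responses)
print(m, s, lognormal_mean(m, s), sum(responses) / len(responses))
```

Even in this two-point example the plain sample mean (about 11.4) and the mean implied by the fitted distribution (about 12.2) differ, which is the kind of discrepancy the abstract discusses.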
Funding: Supported by the Innovate UK Smart Project (Grant No. 700484).
Abstract: Crowdsourcing is an innovative business practice of obtaining needed services, ideas, content, or even funds by soliciting contributions from a large group of people (the 'Crowd'). The potential benefits of utilizing crowdsourcing in product design are well-documented, but little research exists on the barriers and opportunities in adopting crowdsourcing in new product development (NPD) of manufacturing SMEs. To address this gap, a Proof of Market study is carried out on crowdsourcing-based product design under an Innovate UK funded Smart project, which aims at identifying the needs, challenges and future development opportunities associated with adopting crowdsourcing strategies for NPD. The research findings from this study are reported here and can be used to guide future development of crowdsourcing-based collaborative design methods and tools and provide some practical references for industry to adopt this new and emerging collaborative design method in their business.
Funding: Ministry of Science and Technology Key Research and Development Project (2018YFB003800); Hunan Provincial Key Laboratory of Finance & Economics Big Data Science and Technology (Hunan University of Finance and Economics) 2017TP1025; and HNNSF 2018JJ2535.
Abstract: In the context of the continuous development of the Internet, crowdsourcing has received continuous attention as a new cooperation model based on the relationship between enterprises, the public, and society. A reasonably designed recommendation algorithm can recommend a batch of suitable workers for crowdsourcing tasks to improve the final task completion quality. Therefore, this paper proposes a crowdsourcing recommendation framework based on workers’ influence (CRBI). This crowdsourcing framework completes the entire process design from task distribution and worker recommendation to result return through processes such as worker behavior analysis, task characteristics construction, and cost optimization. In this paper, a calculation model of workers’ influence characteristics based on the ablation method is designed to evaluate the comprehensive performance of workers. At the same time, the CRBI framework incorporates the traditional open-call task selection mode and builds a new task characteristics model by sensing the influence of the requesting worker and its task performance. In the end, accurate worker recommendation and task cost optimization are carried out by calculating model familiarity. In addition, for recommended workers submitting task answers, this paper also proposes an aggregation algorithm based on weighted influence to ensure the accuracy of task results. This paper conducts simulation experiments on some public datasets of AMT, and the experimental results show that the proposed CRBI framework has a high comprehensive performance. Moreover, CRBI has better usability, is more in line with commercial needs, and can well reflect the wisdom of group intelligence.
Abstract: Crowdsourcing is widely used in various fields to collect goods and services from large numbers of participants. Evaluating teaching quality by collecting feedback from experts or students after class is not only delayed but also inaccurate. In this paper, we present a crowdsourcing-based framework to evaluate teaching quality in the classroom using a weighted average operator to aggregate information from students’ questionnaires described by linguistic 2-tuple terms. Then we define a crowd grade based on similarity degree to distinguish contributions from different students and minimize the impact of abnormal students on the evaluation. The crowd grade is updated at the end of each round of feedback so that the evaluation remains accurate. Moreover, a simulated case is shown to illustrate how to apply this framework to assess teaching quality in the classroom. Finally, we developed a prototype and carried out some experiments on a series of real questionnaires and two sets of modified data. The results show that teachers can locate the weak points of teaching and, furthermore, identify the abnormal students to improve the teaching quality. Meanwhile, our approach provides a strong tolerance for abnormal students, making the evaluation more accurate.
Funding: Supported by the National Natural Science Foundation of China (61771083, 61704015); Program for Changjiang Scholars and Innovative Research Team in University (IRT1299); Special Fund of Chongqing Key Laboratory (CSTC), Fundamental and Frontier Research Project of Chongqing (cstc2017jcyjAX0380, cstc2015jcyjBX0065); University Outstanding Achievement Transformation Project of Chongqing (KJZH17117); Postgraduate Scientific Research and Innovation Project of Chongqing (CYS17221).
Abstract: In order to solve the problem of location privacy under big data and improve the user positioning experience, a new concept of anonymous crowdsourcing-based WLAN indoor localization is proposed by employing the Micro-Electro-Mechanical System (MEMS) motion sensors as well as the WLAN module in off-the-shelf smartphones. First, the crowdsourced motion traces with similar Received Signal Strength (RSS) sequences are assembled into a motion graph. Second, the mobility map is constructed according to trace segmentation and clustering. Third, pixel template matching is adopted to physically label the pre-constructed mobility map. Finally, a robust Extended Kalman Filter (EKF) is designed to perform localization by matching the newly collected RSS measurements against the mobility map. The extensive experimental results show that the proposed approach is capable of constructing a physically labeled mobility map from the sporadically collected crowdsourced motion traces as well as achieving satisfactory localization accuracy in a cost-efficient manner.
Abstract: The spatial distribution of cropland is an important input to many applications including food security monitoring and economic land use modeling. Global land cover maps derived from remote sensing are one source of cropland data, but they are currently not accurate enough in the cropland domain to meet the needs of the user community. Moreover, when compared with one another, these land cover products show large areas of spatial disagreement, which makes it very difficult to choose which land cover product to use. This paper takes an entirely different approach to mapping cropland, using crowdsourcing of Google Earth imagery via tools in Geo-Wiki. Using sample data generated by a crowdsourcing campaign for the collection of the degree of cultivation and settlement in Ethiopia, a cropland map was created using simple inverse distance weighted interpolation. The map was validated using data from the GOFC-GOLD validation portal and an independent crowdsourced dataset from Geo-Wiki. The results show that the crowdsourced cropland map for Ethiopia has a higher overall accuracy than the individual global land cover products for this country. Such an approach has great potential for mapping cropland in other countries where such data do not currently exist. Not only is the approach inexpensive but the data can be collected over a very short period of time using an existing network of volunteers.
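Inverse distance weighted (IDW) interpolation, the technique named in the abstract above, estimates a value at an unsampled location as a weighted average of nearby samples, with weights proportional to 1/d^p. The sketch below is a generic version with invented sample points; the power p = 2 and the use of planar coordinates are assumptions, not details taken from the paper.

```python
def idw(samples, x, y, power=2.0):
    """Inverse distance weighted estimate at (x, y).

    samples is a list of (sx, sy, value) tuples, e.g. crowdsourced
    cultivation percentages at known locations (hypothetical data).
    """
    num = den = 0.0
    for sx, sy, value in samples:
        d2 = (sx - x) ** 2 + (sy - y) ** 2
        if d2 == 0.0:
            return value  # query point coincides with a sample
        w = 1.0 / d2 ** (power / 2)  # equals 1 / distance**power
        num += w * value
        den += w
    return num / den

# Two sample points: 0% cropland at (0, 0) and 100% cropland at (2, 0).
samples = [(0.0, 0.0, 0.0), (2.0, 0.0, 100.0)]
print(idw(samples, 1.0, 0.0))
print(idw(samples, 0.5, 0.0))
```

The midpoint gets the plain average of the two samples, while a query closer to the first sample is pulled strongly toward its value.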
Funding: This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (No. NRF-2019R1A2C1090713).
Abstract: A centralized trusted execution environment (TEE) has been extensively studied to provide secure and trusted computing. However, a TEE might become a throughput bottleneck if it is used to evaluate data quality when collecting large-scale data in a crowdsourcing system. It may also be compromised by attackers, creating security problems. Here, we propose a scheme, named dTEE, for building a platform for providing distributed trusted computing by leveraging TEEs. The platform is used as an infrastructure of trusted computations for blockchain-based crowdsourcing systems, especially to securely evaluate data quality and manage remuneration: these operations are handled by a TEE group. First, dTEE uses a public blockchain with smart contracts to manage TEEs without reliance on any trusted third parties. Second, to update TEE registration information and rule out zombie TEEs, dTEE uses a reporting mechanism. To attract TEE owners to join in and provide trusted computation services, it uses a fair monetary incentive mechanism. Third, to account for malicious attackers, we design a model with Byzantine fault tolerance, not limited to a crash-failure model. Finally, we conduct an extensive evaluation of our design on a local cluster. The results show that dTEE finishes evaluating 10,000 images within one minute and achieves about 65 tps throughput when evaluating Sudoku solution data with collective signatures, both in a group of 120 TEEs.
Abstract: Generally, knowledge extraction technology is used to obtain nodes and relationships from unstructured and structured data, and the extracted data are then fused with the original knowledge graph to extend it. Because the concepts and knowledge structures expressed on the Internet suffer from multi-source heterogeneity and low accuracy, it is usually difficult to achieve a good effect simply by using knowledge extraction technology. Considering that domain knowledge is highly dependent on relevant expert knowledge, this paper tries to expand domain knowledge through crowdsourcing. The method splits the domain knowledge system into knowledge subgraphs according to the corresponding concepts, forms subtasks with moderate granularity, and uses crowdsourcing technology to acquire and integrate the knowledge subgraphs to improve the knowledge system.