期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
Random Forests Algorithm Based Duplicate Detection in On-Site Programming Big Data Environment 被引量:1
1
作者 Qianqian Li Meng Li +1 位作者 Lei Guo Zhen Zhang 《Journal of Information Hiding and Privacy Protection》 2020年第4期199-205,共7页
On-site programming big data refers to the massive data generated in the process of software development with the characteristics of real-time,complexity and high-difficulty for processing.Therefore,data cleaning is e... On-site programming big data refers to the massive data generated in the process of software development with the characteristics of real-time,complexity and high-difficulty for processing.Therefore,data cleaning is essential for on-site programming big data.Duplicate data detection is an important step in data cleaning,which can save storage resources and enhance data consistency.Due to the insufficiency in traditional Sorted Neighborhood Method(SNM)and the difficulty of high-dimensional data detection,an optimized algorithm based on random forests with the dynamic and adaptive window size is proposed.The efficiency of the algorithm can be elevated by improving the method of the key-selection,reducing dimension of data set and using an adaptive variable size sliding window.Experimental results show that the improved SNM algorithm exhibits better performance and achieve higher accuracy. 展开更多
关键词 On-site programming big data duplicate record detection random forests adaptive sliding window
下载PDF
PRECESION: progressive recovery and restoration planning of interdependent services in enterprise data centers 被引量:2
2
作者 Ibrahim El-Shekeil Amitangshu Pal Krishna Kant 《Digital Communications and Networks》 SCIE 2018年第1期39-47,共9页
The primary focus of this paper is to design a progressive restoration plan for an enterprise data center environment following a partial or full disruption. Repairing and restoring disrupted components in an enterpri... The primary focus of this paper is to design a progressive restoration plan for an enterprise data center environment following a partial or full disruption. Repairing and restoring disrupted components in an enterprise data center requires a significant amount of time and human effort. Following a major disruption, the recovery process involves multiple stages, and during each stage, the partially recovered infrastructures can provide limited services to users at some degraded service level. However, how fast and efficiently an enterprise infrastructure can be recovered de- pends on how the recovery mechanism restores the disrupted components, considering the inter-dependencies between services, along with the limitations of expert human operators. The entire problem turns out to be NP- hard and rather complex, and we devise an efficient meta-heuristic to solve the problem. By considering some real-world examples, we show that the proposed meta-heuristic provides very accurate results, and still runs 600-2800 times faster than the optimal solution obtained from a general purpose mathematical solver [1]. 展开更多
关键词 Progressive restoration planning Enterprise data center Genetic algorithm Integer linear program Multi-layer networks
下载PDF
Towards Sensor-free Academic Emotion Prediction in Programming Environment
3
作者 Tao Lin Zhiming Wu +2 位作者 Juan Zheng Shenggen Ju Yu Fu 《计算机教育》 2020年第12期77-84,共8页
he transition from traditional learning to practice-oriented programming learning will bring learners discomfort.The discomfort quickly breeds negative emotions when encountering programming difficulties,which leads t... he transition from traditional learning to practice-oriented programming learning will bring learners discomfort.The discomfort quickly breeds negative emotions when encountering programming difficulties,which leads the learner to lose interest in programming or even give up.Emotion plays a crucial role in learning.Educational psychology research shows that positive emotion can promote learning performance,increase learning interest and cultivate creative thinking.Accurate recognition and interpretation of programming learners’emotions can give them feedback in time,and adjust teaching strategies accurately and individually,which is of considerable significance to improve effects of programming learning and education.The existing methods of sensor-free emotion prediction include emotion prediction based on keyboard dynamic,mouse interaction data and interaction logs,respectively.However,none of the three studies considered the temporal characteristics of emotion,resulting in low recognition accuracy.For the first time,this paper proposes an emotion prediction model based on time series and context information.Then,we establish a Bi-recurrent neural network,obtain the time sequence characteristics of data automatically,and explore the application of deep learning in the field of Academic Emotion prediction.The results show that the classification ability of this model is much better than that of the original LSTM(Long-Short Term Memory),GRU(Gate Recurrent Unit)and RNN(Re-current Neural Network),and this model has better generalization ability. 展开更多
关键词 emotion prediction emotional state programming behavior data Bi-directional Recurrent Neural Network interaction sequence data
下载PDF
Application of hydrometeorological coupled European flood forecasting operational real time system in Yellow River Basin
4
作者 Yi-qi YAN Xin TAO +1 位作者 Bing-quan LI Cinzia MAZZETTI 《Water Science and Engineering》 EI CAS 2009年第4期28-39,共12页
This study evaluated the application of the European flood forecasting operational real time system (EFFORTS) to the Yellow River. An automatic data pre-processing program was developed to provide real-time hydromet... This study evaluated the application of the European flood forecasting operational real time system (EFFORTS) to the Yellow River. An automatic data pre-processing program was developed to provide real-time hydrometeorological data. Various GIS layers were collected and developed to meet the demands of the distributed hydrological model in the EFFORTS. The model parameters were calibrated and validated based on more than ten years of historical hydrometeorological data from the study area. The San-Hua Basin (from the Sanmenxia Reservoir to the Huayuankou Hydrological Station), the most geographically important area of the Yellow River, was chosen as the study area. The analysis indicates that the EFFORTS enhances the work efficiency, extends the flood forecasting lead time, and attains an acceptable level of forecasting accuracy in the San-Hua Basin, with a mean deterministic coefficient at Huayuankou Station, the basin outlet, of 0.90 in calibration and 0.96 in validation. The analysis also shows that the ;simulation accuracy is better for the southern part than for the northern part of the San-Hua Basin. This implies that, along with the characteristics of the basin and the mechanisms of runoff generation of the hydrological model, the hydrometeorological data play an important role in simulation of hydrological behavior. 展开更多
关键词 EFFORTS physically based distributed hydrological model data pre-processing program parameter calibration San-Hua Basin of Yellow River
下载PDF
Roundtable: Research Opportunities and Challenges for Emerging Software Systems
5
作者 张翔宇 张冬梅 +2 位作者 Yves Le Traon 王青 张路 《Journal of Computer Science & Technology》 SCIE EI CSCD 2015年第5期935-941,共7页
For this special section on software systems special section, discuss important issues that will shape several research leaders in software systems, as guest editors for this this field's future directions. The essa... For this special section on software systems special section, discuss important issues that will shape several research leaders in software systems, as guest editors for this this field's future directions. The essays included in this roundtable article cover research opportunities and challenges for emerging software systems such as data processing programs (Xiangyu Zhang) and online services (Dongmei Zhang), with new directions of technologies such as unifications in software testing (Yves Le Traon), data-driven and evidence-based software engineering (Qing Wang), and dynamic analysis of multiple traces (Lu Zhang). Tao Xie, Leading Editor of Special Section on Softwaare Svstem. 展开更多
关键词 data processing program software analytics online service software testing data-driven software engineering evidence-based software engineering
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部