This paper is a research on the characteristics of power big data. According to the characteristics of "large volume", "species diversity", "sparse value density", "fast speed" of the power big data, a predict...This paper is a research on the characteristics of power big data. According to the characteristics of "large volume", "species diversity", "sparse value density", "fast speed" of the power big data, a prediction model of multi-source information fusion for large data is established, the fusion prediction of various parameters of the same object is realized. A combined algorithm of Map Reduce and neural network is used in this paper. Using clustering and nonlinear mapping ability of neural network, it can effectively solve the problem of nonlinear objective function approximation, and neural network is applied to the prediction of fusion. In this paper, neural network model using multi layer feed forward network--BP neural network. Simultaneously, to achieve large-scale data sets in parallel computing, the parallelism and real-time property of the algorithm should be considered, further combined with Reduce Map model, to realize the parallel processing of the algorithm, making it more suitable for the study of the fusion of large data. And finally, through simulation, it verifies the feasibility of the proposed model and algorithm.展开更多
It is very important for the development of electric power big data technology to use the electric power knowledge.A new electric power knowledge theory model is proposed here to solve the problem of normalized modele...It is very important for the development of electric power big data technology to use the electric power knowledge.A new electric power knowledge theory model is proposed here to solve the problem of normalized modeled electric power knowledge for the management and analysis of electric power big data.Current modeling techniques of electric power knowledge are viewed as inadequate because of the complexity and variety of the relationships among electric power system data.Ontology theory and semantic web technologies used in electric power systems and in many other industry domains provide a new kind of knowledge modeling method.Based on this,this paper proposes the structure,elements,basic calculations and multidimensional reasoning method of the new knowledge model.A modeling example of the regulations defined in electric power system operation standard is demonstrated.Different forms of the model and related technologies are also introduced,including electric power system standard modeling,multi-type data management,unstructured data searching,knowledge display and data analysis based on semantic expansion and reduction.Research shows that the new model developed here is powerful and can adapt to various knowledge expression requirements of electric power big data.With the development of electric power big data technology,it is expected that the knowledge model will be improved and will be used in more applications.展开更多
Big data analytics is emerging as one kind of the most important workloads in modern data centers. Hence,it is of great interest to identify the method of achieving the best performance for big data analytics workload...Big data analytics is emerging as one kind of the most important workloads in modern data centers. Hence,it is of great interest to identify the method of achieving the best performance for big data analytics workloads running on state-of-the-art SMT( simultaneous multithreading) processors,which needs comprehensive understanding to workload characteristics. This paper chooses the Spark workloads as the representative big data analytics workloads and performs comprehensive measurements on the POWER8 platform,which supports a wide range of multithreading. The research finds that the thread assignment policy and cache contention have significant impacts on application performance. In order to identify the potential optimization method from the experiment results,this study performs micro-architecture level characterizations by means of hardware performance counters and gives implications accordingly.展开更多
This research paper has provided the methodology and design for implementing the hybrid author recommender system using Azure Data Lake Analytics and Power BI. It offers a recommendation for the top 1000 Authors of co...This research paper has provided the methodology and design for implementing the hybrid author recommender system using Azure Data Lake Analytics and Power BI. It offers a recommendation for the top 1000 Authors of computer science in different fields of study. The technique used in this paper is handling the inadequate Information for citation;it removes the problem of cold start, which is encountered by very many other recommender systems. In this paper, abstracts, the titles, and the Microsoft academic graphs have been used in coming up with the recommendation list for every document, which is used to combine the content-based approaches and the co-citations. Prioritization and the blending of every technique have been allowed by the tuning system parameters, allowing for the authority in results of recommendation versus the paper novelty. In the end, we do observe that there is a direct correlation between the similarity rankings that have been produced by the system and the scores of the participant. The results coming from the associated scrips of analysis and the user survey have been made available through the recommendation system. Managers must gain the required expertise to fully utilize the benefits that come with business intelligence systems [1]. Data mining has become an important tool for managers that provides insights about their daily operations and leverage the information provided by decision support systems to improve customer relationships [2]. Additionally, managers require business intelligence systems that can rank the output in the order of priority. Ranking algorithm can replace the traditional data mining algorithms that will be discussed in-depth in the literature review [3].展开更多
This paper introduces the implementation and data analysis associated with a state-wide power quality monitoring and analysis system in China. Corporation specifications on power quality monitors as well as on communi...This paper introduces the implementation and data analysis associated with a state-wide power quality monitoring and analysis system in China. Corporation specifications on power quality monitors as well as on communication protocols are formulated for data transmission. Big data platform and related technologies are utilized for data storage and computation. Compliance verification analysis and a power quality performance assessment are conducted, and a visualization tool for result presentation is finally presented.展开更多
大数据技术的应用对于电力系统的安全稳定运行和可持续发展具有重要意义,因此了解电力大数据的研究现状及热点尤为必要。使用文献计量方法,从时间、国家、机构、期刊、学科、引文、作者和关键词等方面,分析了1995—2021年Web of Scienc...大数据技术的应用对于电力系统的安全稳定运行和可持续发展具有重要意义,因此了解电力大数据的研究现状及热点尤为必要。使用文献计量方法,从时间、国家、机构、期刊、学科、引文、作者和关键词等方面,分析了1995—2021年Web of Science收录的1100篇电力大数据文献。结果表明:电力大数据研究稳步发展并逐渐成为热点,中国的发文量最多,但国际影响力有待提高;研究热点包括智能电网、负荷预测、电力系统安全和稳定等;电力大数据研究逐渐趋于电力系统安全稳定、智能高效方向。未来需要在电力与经济社会大数据融合、电力大数据多场景应用、电力大数据多主体参与等方面做更多探讨。展开更多
文摘This paper is a research on the characteristics of power big data. According to the characteristics of "large volume", "species diversity", "sparse value density", "fast speed" of the power big data, a prediction model of multi-source information fusion for large data is established, the fusion prediction of various parameters of the same object is realized. A combined algorithm of Map Reduce and neural network is used in this paper. Using clustering and nonlinear mapping ability of neural network, it can effectively solve the problem of nonlinear objective function approximation, and neural network is applied to the prediction of fusion. In this paper, neural network model using multi layer feed forward network--BP neural network. Simultaneously, to achieve large-scale data sets in parallel computing, the parallelism and real-time property of the algorithm should be considered, further combined with Reduce Map model, to realize the parallel processing of the algorithm, making it more suitable for the study of the fusion of large data. And finally, through simulation, it verifies the feasibility of the proposed model and algorithm.
基金supported by Science and Technology Foundation of the State Grid Corporation of China(XT71-14-043).
文摘It is very important for the development of electric power big data technology to use the electric power knowledge.A new electric power knowledge theory model is proposed here to solve the problem of normalized modeled electric power knowledge for the management and analysis of electric power big data.Current modeling techniques of electric power knowledge are viewed as inadequate because of the complexity and variety of the relationships among electric power system data.Ontology theory and semantic web technologies used in electric power systems and in many other industry domains provide a new kind of knowledge modeling method.Based on this,this paper proposes the structure,elements,basic calculations and multidimensional reasoning method of the new knowledge model.A modeling example of the regulations defined in electric power system operation standard is demonstrated.Different forms of the model and related technologies are also introduced,including electric power system standard modeling,multi-type data management,unstructured data searching,knowledge display and data analysis based on semantic expansion and reduction.Research shows that the new model developed here is powerful and can adapt to various knowledge expression requirements of electric power big data.With the development of electric power big data technology,it is expected that the knowledge model will be improved and will be used in more applications.
基金Supported by the National High Technology Research and Development Program of China(No.2015AA015308)the State Key Development Program for Basic Research of China(No.2014CB340402)
文摘Big data analytics is emerging as one kind of the most important workloads in modern data centers. Hence,it is of great interest to identify the method of achieving the best performance for big data analytics workloads running on state-of-the-art SMT( simultaneous multithreading) processors,which needs comprehensive understanding to workload characteristics. This paper chooses the Spark workloads as the representative big data analytics workloads and performs comprehensive measurements on the POWER8 platform,which supports a wide range of multithreading. The research finds that the thread assignment policy and cache contention have significant impacts on application performance. In order to identify the potential optimization method from the experiment results,this study performs micro-architecture level characterizations by means of hardware performance counters and gives implications accordingly.
文摘This research paper has provided the methodology and design for implementing the hybrid author recommender system using Azure Data Lake Analytics and Power BI. It offers a recommendation for the top 1000 Authors of computer science in different fields of study. The technique used in this paper is handling the inadequate Information for citation;it removes the problem of cold start, which is encountered by very many other recommender systems. In this paper, abstracts, the titles, and the Microsoft academic graphs have been used in coming up with the recommendation list for every document, which is used to combine the content-based approaches and the co-citations. Prioritization and the blending of every technique have been allowed by the tuning system parameters, allowing for the authority in results of recommendation versus the paper novelty. In the end, we do observe that there is a direct correlation between the similarity rankings that have been produced by the system and the scores of the participant. The results coming from the associated scrips of analysis and the user survey have been made available through the recommendation system. Managers must gain the required expertise to fully utilize the benefits that come with business intelligence systems [1]. Data mining has become an important tool for managers that provides insights about their daily operations and leverage the information provided by decision support systems to improve customer relationships [2]. Additionally, managers require business intelligence systems that can rank the output in the order of priority. Ranking algorithm can replace the traditional data mining algorithms that will be discussed in-depth in the literature review [3].
基金supported by the State Grid Science and Technology Project (GEIRI-DL-71-17-002)
文摘This paper introduces the implementation and data analysis associated with a state-wide power quality monitoring and analysis system in China. Corporation specifications on power quality monitors as well as on communication protocols are formulated for data transmission. Big data platform and related technologies are utilized for data storage and computation. Compliance verification analysis and a power quality performance assessment are conducted, and a visualization tool for result presentation is finally presented.
文摘大数据技术的应用对于电力系统的安全稳定运行和可持续发展具有重要意义,因此了解电力大数据的研究现状及热点尤为必要。使用文献计量方法,从时间、国家、机构、期刊、学科、引文、作者和关键词等方面,分析了1995—2021年Web of Science收录的1100篇电力大数据文献。结果表明:电力大数据研究稳步发展并逐渐成为热点,中国的发文量最多,但国际影响力有待提高;研究热点包括智能电网、负荷预测、电力系统安全和稳定等;电力大数据研究逐渐趋于电力系统安全稳定、智能高效方向。未来需要在电力与经济社会大数据融合、电力大数据多场景应用、电力大数据多主体参与等方面做更多探讨。