Abstract: This paper presents the methodology and design for implementing a hybrid author recommender system using Azure Data Lake Analytics and Power BI. The system recommends the top 1000 computer science authors across different fields of study. The technique handles inadequate citation information and so avoids the cold-start problem encountered by many other recommender systems. Abstracts, titles, and the Microsoft Academic Graph are used to build a recommendation list for every document, which combines content-based approaches with co-citations. Tunable system parameters allow each technique to be prioritized and blended, trading the authority of the recommendation results against paper novelty. We observe a direct correlation between the similarity rankings produced by the system and the participants' scores. The results of the user survey and the associated analysis scripts have been made available through the recommendation system. Managers must gain the required expertise to fully utilize the benefits of business intelligence systems [1]. Data mining has become an important tool for managers: it provides insights into their daily operations and lets them leverage the information provided by decision support systems to improve customer relationships [2]. Additionally, managers require business intelligence systems that can rank output in order of priority. Ranking algorithms can replace the traditional data mining algorithms, which will be discussed in depth in the literature review [3].
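The blending of content-based and co-citation scores through a tuning parameter can be illustrated with a minimal sketch. The abstract does not give the system's actual formula, so the linear weighting, the parameter name `alpha`, and the function names below are assumptions made for illustration only.

```python
def blend_scores(content_sim, cocitation, alpha=0.5):
    """Linearly blend two per-author score dicts (hypothetical scheme).

    alpha = 1.0 uses only the content-based score,
    alpha = 0.0 uses only the co-citation score.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    authors = set(content_sim) | set(cocitation)
    # An author missing from one source gets 0.0 there, so an author with
    # no citation data can still be ranked on content alone.
    return {
        a: alpha * content_sim.get(a, 0.0) + (1.0 - alpha) * cocitation.get(a, 0.0)
        for a in authors
    }


def top_n(scores, n):
    """Return the n highest-scoring author ids, ties broken by id."""
    return sorted(scores, key=lambda a: (-scores[a], a))[:n]


# Illustrative, made-up scores: author "a" has no co-citation data.
content = {"a": 0.9, "b": 0.2, "c": 0.5}
cocite = {"b": 1.0, "c": 0.5}
ranking_balanced = top_n(blend_scores(content, cocite, alpha=0.5), 3)
ranking_content_only = top_n(blend_scores(content, cocite, alpha=1.0), 3)
```

Raising `alpha` favors authors whose text is similar even when they have few citations, which is one way a blended system could sidestep the cold-start problem described above.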
Funding: Funded by the "Niedersächsisches Vorab" funding line of the Volkswagen Foundation.
Abstract: Since their introduction by James Dixon in 2010, data lakes have received growing attention, driven by the promise of high reusability of the stored data due to schema-on-read semantics. Building on this idea, several additional requirements have been discussed in the literature to improve the general usability of the concept, such as a central metadata catalog including all provenance information, overarching data governance, or integration with (high-performance) processing capabilities. Although the need for a logical and a physical organisation of data lakes to meet those requirements is widely recognized, no concrete guidelines have yet been provided. The most common architecture implementing this conceptual organisation is the zone architecture, where data is assigned to a certain zone depending on its degree of processing. This paper discusses how FAIR Digital Objects can be used in a novel approach to organize a data lake based on data types instead of zones, how they can be used to abstract the physical implementation, and how they enable generic and portable processing capabilities based on a provenance-based approach.
Funding: The Knowledge Innovation Project of CAS, No. KZCX1-Y-02, No. KZCX2-310; the Key Project of the Ninth Five-Year Plan of CAS, No. KZ951-A1-102-01; the National Ninth Five-Year Plan Project, No. 96-b02-01.
Abstract: Poyang Lake is the largest freshwater lake in China. This paper reports a rapid digital survey of the lake's wetland vegetation biomass using Landsat ETM data acquired on April 16, 2000. First, using the false color composite derived from the ETM data as one of the main references, the authors designed a sampling route for field measurement of the biomass and carried it out on April 18–28, 2000. Then, after both the sampling data and the ETM data were geometrically corrected to an Albers equal-area projection, linear relationships were calculated between the sampling data and both ETM band 4 and several transformed data sets derived from the ETM data. The results show that the sampling data correlate best with the band 4 data, with a high correlation coefficient of 0.86, followed by the DVI and NDVI data with 0.83 and 0.80 respectively. Therefore, a linear regression model based on the field data and band 4 data was used to estimate the total biomass of the entire Poyang Lake, and a map of the biomass distribution was then compiled.
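The vegetation indices and the regression step named in this abstract can be sketched as follows. DVI and NDVI are the standard difference and normalized-difference indices of near-infrared and red reflectance (ETM band 4 and band 3); the function names and the toy numbers are assumptions, not the authors' data or code.

```python
from math import sqrt


def dvi(nir, red):
    """Difference vegetation index: NIR minus red."""
    return nir - red


def ndvi(nir, red):
    """Normalized difference vegetation index: (NIR - red) / (NIR + red)."""
    return (nir - red) / (nir + red)


def linear_fit(x, y):
    """Ordinary least-squares line y = slope * x + intercept, plus Pearson r."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / sqrt(sxx * syy)
    return slope, intercept, r


# Made-up predictor values (e.g. band 4) and field biomass samples.
predictor = [1.0, 2.0, 3.0, 4.0]
biomass = [3.0, 5.0, 7.0, 9.0]
slope, intercept, r = linear_fit(predictor, biomass)
```

Fitting the same field samples against band 4, DVI, and NDVI in turn and comparing the resulting `r` values is the kind of comparison the abstract reports (0.86, 0.83, and 0.80).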
Abstract: The relatively rapid recession of glaciers in the Himalayas and the formation of moraine-dammed glacial lakes (MDGLs) in the recent past have increased the risk of glacier lake outburst floods (GLOF) in Nepal, Bhutan, and the mountainous territory of Sikkim in India. As a product of climate change and global warming, this risk has not only raised the level of threat to the habitation and infrastructure of the region, but has also helped upset the balance of the unique ecosystem of a domain that contains several of the highest mountain peaks in the world. This study presents an up-to-date mapping of the MDGLs in the central and eastern Himalayan regions using remote sensing data, with the objective of analysing their surface area variations from 1990 through 2015, disaggregated over six episodes. The study also evaluates the susceptibility of MDGLs to GLOF using least criteria decision analysis (LCDA). Forty-two major MDGLs, each with a lake surface area greater than 0.2 km², were identified in the Himalayan ranges of Nepal, Bhutan, and Sikkim and categorized according to their surface area expansion rates in space and time. The lakes are located at elevations between 3800 m and 6800 m above mean sea level (amsl). With a total surface area of 37.9 km², these MDGLs as a whole were observed to have expanded by an astonishing 43.6% in area over the 25-year period of this study. A factor is introduced to numerically sort the lakes by their relative yearly expansion rates, based on interpretation of their surface area extents from satellite imagery.
Verification of predicted past GLOF events using this factor, against the limited field data reported in the literature, indicates that the present analysis may be considered a sufficiently reliable and rapid technique for assessing the potential bursting susceptibility of the MDGLs. The analysis also indicates that, as of now, eight MDGLs in the region appear to be in highly vulnerable states and have a high chance of causing GLOF events in the near future.
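Sorting lakes by relative yearly expansion rate can be sketched as below. The abstract does not define its factor, so the formula used here (fractional area growth divided by the number of years) is an assumption, and the lake names and areas are invented for illustration.

```python
def yearly_expansion_rate(area_start, area_end, years):
    """Fractional area growth per year (hypothetical definition)."""
    return (area_end - area_start) / area_start / years


def rank_lakes(areas, years):
    """Sort lake names by descending yearly expansion rate.

    areas maps a lake name to a (start_area_km2, end_area_km2) pair.
    """
    return sorted(
        areas,
        key=lambda name: yearly_expansion_rate(*areas[name], years),
        reverse=True,
    )


# A 43.6% expansion over 25 years averages about 1.74% per year.
mean_rate = yearly_expansion_rate(1.0, 1.436, 25)

# Invented example lakes: (area at start, area at end) in km^2.
example = {"A": (0.5, 0.6), "B": (0.3, 0.45), "C": (1.0, 1.05)}
order = rank_lakes(example, 25)
```

Normalizing by the starting area keeps a small, fast-growing lake from being hidden behind a large lake that gained more area in absolute terms.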
Abstract: Lakes are an important component of the Earth's climate system. They play an important role in basin weather forecasting, air quality forecasting, and regional climate research. Accurate driving variables are a basic prerequisite for sound lake model simulation. Based on in-situ observations at the Bifenggang site of the Lake Taihu Eddy Flux Network from 2012 to 2017, this paper investigated temporal variations in temperature, relative humidity, wind speed, and radiation components at different time scales (hourly, seasonal, and interannual). ERA5 reanalysis data were compared with the in-situ observations to quantify the error and evaluate the performance of the reanalysis data. The results show that: 1) On the hourly scale, the ERA5 reanalysis data described air temperature and downward long-wave radiation more accurately. 2) On the seasonal scale, the ERA5 reanalysis data described air temperature and downward long-wave radiation more accurately, whereas the descriptions of wind speed, relative humidity, and downward short-wave radiation show large deviations. 3) On the interannual scale, the ERA5 reanalysis data perform well for temperature, followed by downward long-wave radiation, downward short-wave radiation, and relative humidity.
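Quantifying the error of reanalysis data against in-situ observations usually relies on a few standard statistics. The abstract does not say which metrics were used, so the mean bias, root-mean-square error, and Pearson correlation below are assumed choices, shown on made-up temperature values.

```python
from math import sqrt


def mean_bias(model, obs):
    """Average of (model - observation); positive means overestimation."""
    return sum(m - o for m, o in zip(model, obs)) / len(obs)


def rmse(model, obs):
    """Root-mean-square error between model and observation."""
    return sqrt(sum((m - o) ** 2 for m, o in zip(model, obs)) / len(obs))


def pearson_r(model, obs):
    """Pearson correlation coefficient between the two series."""
    n = len(obs)
    mean_m = sum(model) / n
    mean_o = sum(obs) / n
    cov = sum((m - mean_m) * (o - mean_o) for m, o in zip(model, obs))
    var_m = sum((m - mean_m) ** 2 for m in model)
    var_o = sum((o - mean_o) ** 2 for o in obs)
    return cov / sqrt(var_m * var_o)


# Made-up air temperatures (deg C): reanalysis runs 1 degree warm.
observed = [10.0, 12.0, 14.0, 16.0]
reanalysis = [11.0, 13.0, 15.0, 17.0]
bias = mean_bias(reanalysis, observed)
error = rmse(reanalysis, observed)
corr = pearson_r(reanalysis, observed)
```

In this toy case the correlation is perfect even though every value is biased, which is why bias and RMSE are worth reporting alongside the correlation for each variable and time scale.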
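Abstract: Based on Web of Science and CNKI data sources, a bibliometric analysis of salt lake literature from 1960 to 2022 was carried out using Derwent Data Analyzer, VOSviewer, and InCites, in order to gain a deeper understanding of international and domestic progress and trends in salt lake research. The analysis shows that, over the past 60-plus years, the number of publications on salt lakes first remained stable and then increased. The journal with the most international publications on salt lake research is International Journal of Systematic and Evolutionary Microbiology, and the journal with the most domestic publications is Journal of Salt Lake Research (《盐湖研究》). The United States and China are the core forces in salt lake research, ranking in the top two for total publications, total citations, and number of highly cited papers. The institution with the most publications in the field is the Chinese Academy of Sciences, among whose affiliated units the Qinghai Institute of Salt Lakes accounts for the largest share. Eight of the top 15 authors in the international literature are from China. Analysis of the evolution of international and domestic research hotspots indicates that the separation and extraction of salt lake resources will continue to receive attention, and analysis of highly cited articles shows that in recent years the most attention has gone to the separation and extraction of salt lake resources, especially lithium. Overall, salt lake research is still expanding and extending. In the future, China should continue to improve article quality, the planning of research directions, and impact, so as to strengthen the international influence of domestic salt lake research.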
Abstract: A paleo-lacustrine delta in Kyoto, Japan was reconstructed on the basis of subsurface geological and geomorphological analysis, and paleo-lake level changes were estimated from the structure of the delta. These analyses of the study region, the Oguraike reclaimed land area, provided evidence that Lake Ogura existed until about 60 years ago in southern Kyoto, Japan. The Uji River delta supplied inflow to this lake until ca. 400 years ago, as indicated by an upward-coarsening delta succession about 2 to 4 m thick. Past changes in lake level can be inferred from changes in the altitude of the boundary between the delta front (foreset) and the delta plain, which probably reflects the lake surface elevation. About 400 years ago, the Paleo-Uji River was separated from Lake Ogura because a levee was constructed along the river for building a castle and for constructing a waterway for transportation. As a result of this construction, the lake level, which had been more than 13.0 m in elevation, fell by 1.5 m. In more ancient times, the lake level went through two stages: one in which the elevation was more than 13.5 m, and one in which the elevation was reduced to less than 10 m. These changes in lake level are represented by a flat surface with four steps separated by small cliffs ca. 0.5 to 2 m high (relative elevation), recognized at the southern lakeshore. Observation of strata, together with the archaeological survey north of Lake Ogura, reveals that the lake level decreased ca. 800 to 680 years ago. The lake level was at its highest during two periods, the first from before the 8th century to the end of the 8th century and the second from the 14th century to 400 years ago.