Wired drill pipe(WDP)technology is one of the most promising data acquisition technologies in today s oil and gas industry.For the first time it allows sensors to be positioned along the drill string which enables c...Wired drill pipe(WDP)technology is one of the most promising data acquisition technologies in today s oil and gas industry.For the first time it allows sensors to be positioned along the drill string which enables collecting and transmitting valuable data not only from the bottom hole assembly(BHA),but also along the entire length of the wellbore to the drill floor.The technology has received industry acceptance as a viable alternative to the typical logging while drilling(LWD)method.Recently more and more WDP applications can be found in the challenging drilling environments around the world,leading to many innovations to the industry.Nevertheless most of the data acquired from WDP can be noisy and in some circumstances of very poor quality.Diverse factors contribute to the poor data quality.Most common sources include mis-calibrated sensors,sensor drifting,errors during data transmission,or some abnormal conditions in the well,etc.The challenge of improving the data quality has attracted more and more focus from many researchers during the past decade.This paper has proposed a promising solution to address such challenge by making corrections of the raw WDP data and estimating unmeasurable parameters to reveal downhole behaviors.An advanced data processing method,data validation and reconciliation(DVR)has been employed,which makes use of the redundant data from multiple WDP sensors to filter/remove the noise from the measurements and ensures the coherence of all sensors and models.Moreover it has the ability to distinguish the accurate measurements from the inaccurate ones.In addition,the data with improved quality can be used for estimating some crucial parameters in the drilling process which are unmeasurable in the first place,hence provide better model calibrations for integrated well planning and realtime operations.展开更多
The Lightning Mapping Imager(LMI)equipped on the FY-4 A(Feng Yun-4 A)geostationary satellite achieves lightning positioning through optical imaging and has the advantages of high temporal resolution,high stability,and...The Lightning Mapping Imager(LMI)equipped on the FY-4 A(Feng Yun-4 A)geostationary satellite achieves lightning positioning through optical imaging and has the advantages of high temporal resolution,high stability,and continuous observation.In this study,FY-4 A LMI lightning event,group and flash data from April to August 2018 are selected,and their quality are assessed through qualitative and quantitative comparison with the ground-based Advanced Time of Arrival and Direction system(ADTD)lightning observation network data and the American International Space Station(ISS)lightning imaging sensor(LIS)data.The results show that the spatial distributions of FY-4 A lightning are consistent with those of the ground-based ADTD and ISS LIS.The temporal variation in FY-4 A lightning group frequency is consistent with that of ADTD stroke,which reflects that FY-4 A LMI can capture the lightning occurrence in inland China.Quantitative statistics show that the consistency rate of FY-4 A LMI and ISS LIS events is relatively high but their consistency rate is lower in terms of lightning group and flash data.Compared with the lightning observations by the ISS LIS and the ground-based ADTD,FY-4 A LMI reports fewer lightning events in the Tibetan Plateau.The application of Tibetan Plateau lightning data requires further processing and consideration.展开更多
This paper introduces the implementation and data analysis associated with a state-wide power quality monitoring and analysis system in China. Corporation specifications on power quality monitors as well as on communi...This paper introduces the implementation and data analysis associated with a state-wide power quality monitoring and analysis system in China. Corporation specifications on power quality monitors as well as on communication protocols are formulated for data transmission. Big data platform and related technologies are utilized for data storage and computation. Compliance verification analysis and a power quality performance assessment are conducted, and a visualization tool for result presentation is finally presented.展开更多
This paper presents a methodology to determine three data quality (DQ) risk characteristics: accuracy, comprehensiveness and nonmembership. The methodology provides a set of quantitative models to confirm the informat...This paper presents a methodology to determine three data quality (DQ) risk characteristics: accuracy, comprehensiveness and nonmembership. The methodology provides a set of quantitative models to confirm the information quality risks for the database of the geographical information system (GIS). Four quantitative measures are introduced to examine how the quality risks of source information affect the quality of information outputs produced using the relational algebra operations Selection, Projection, and Cubic Product. It can be used to determine how quality risks associated with diverse data sources affect the derived data. The GIS is the prime source of information on the location of cables, and detection time strongly depends on whether maps indicate the presence of cables in the construction business. Poor data quality in the GIS can contribute to increased risk or higher risk avoidance costs. A case study provides a numerical example of the calculation of the trade-offs between risk and detection costs and provides an example of the calculation of the costs of data quality. We conclude that the model contributes valuable new insight.展开更多
In order to improve the performance of classifiers in subjective domains, this paper defines a metric to measure the quality of the subjectively labelled training data (QoSTD) by means of K-means clustering. Then, the...In order to improve the performance of classifiers in subjective domains, this paper defines a metric to measure the quality of the subjectively labelled training data (QoSTD) by means of K-means clustering. Then, the QoSTD is used as a weight of the predicted class scores to adjust the likelihoods of instances. Moreover, two measurements are defined to assess the performance of the classifiers trained by the subjective labelled data. The binary classifiers of Traditional Chinese Medicine (TCM) Zhengs are trained and retrained by the real-world data set, utilizing the support vector machine (SVM) and the discrimination analysis (DA) models, so as to verify the effectiveness of the proposed method. The experimental results show that the consistency of likelihoods of instances with the corresponding observations is increased notable for the classes, especially in the cases with the relatively low QoSTD training data set. The experimental results also indicate the solution how to eliminate the miss-labelled instances from the training data set to re-train the classifiers in the subjective domains.展开更多
With the advent of the era of big data,the Provenance Method of electronic archives based on knowledge graph under the environment of big data has produced a large number of electronic archives due to the development ...With the advent of the era of big data,the Provenance Method of electronic archives based on knowledge graph under the environment of big data has produced a large number of electronic archives due to the development of science and technology.How to guarantee the credential characteristics of electronic archives in the big data environment has attracted wide attention of the academic community.Provenance is an important technical means to guarantee the certification of electronic archives.In this paper,knowledge graph technology is used to provide the concept provenance of electronic archives in large data environment.It not only enriches the provenance method,but also guarantees the certification of electronic archives in the large data environment.展开更多
从天地图融合数据质量检查出发,依据数据标准,通过结合具体的质检规则,研究了一种基于ArcGIS Data Reviewer模块的自动化、批量化并且可使数据在处理阶段就可进行检查的天地图融合数据检验方法,这种灵活的质检机制大大减少了数据融合过...从天地图融合数据质量检查出发,依据数据标准,通过结合具体的质检规则,研究了一种基于ArcGIS Data Reviewer模块的自动化、批量化并且可使数据在处理阶段就可进行检查的天地图融合数据检验方法,这种灵活的质检机制大大减少了数据融合过程中的人工反复处理,提高了生产单位的作业效率及成果质量,也可为其他项目的质检系统开发提供借鉴。展开更多
Air quality monitoring is effective for timely understanding of the current air quality status of a region or city.Currently,the huge volume of environmental monitoring data,which has reasonable real-time performance,...Air quality monitoring is effective for timely understanding of the current air quality status of a region or city.Currently,the huge volume of environmental monitoring data,which has reasonable real-time performance,provides strong support for in-depth analysis of air pollution characteristics and causes.However,in the era of big data,to meet current demands for fine management of the atmospheric environment,it is important to explore the characteristics and causes of air pollution from multiple aspects for comprehensive and scientific evaluation of air quality.This study reviewed and summarized air quality evaluation methods on the basis of environmental monitoring data statistics during the 13th Five-Year Plan period,and evaluated the level of air pollution in the Beijing-Tianjin-Hebei region and its surrounding areas(i.e.,the“2+26”region)during the period of the three-year action plan to fight air pollution.We suggest that air quality should be comprehensively,deeply,and scientifically evaluated from the aspects of air pollution characteristics,causes,and influences of meteorological conditions and anthropogenic emissions.It is also suggested that a threeyear moving average be introduced as one of the evaluation indexes of long-term change of pollutants.Additionally,both temporal and spatial differences should be considered when removing confounding meteorological factors.展开更多
桥梁健康监测数据的挖掘和分析工作只有在整体数据质量符合基本要求的有效数据基础上进行,才能保障如模态参数识别、损伤识别和状态评估等后续工作的准确性。因此,基于量化改进的探索性分析方法(Exploratory Data Analysis,EDA)和相关...桥梁健康监测数据的挖掘和分析工作只有在整体数据质量符合基本要求的有效数据基础上进行,才能保障如模态参数识别、损伤识别和状态评估等后续工作的准确性。因此,基于量化改进的探索性分析方法(Exploratory Data Analysis,EDA)和相关性分析从数据完整性、准确性和一致性的角度建立了桥梁健康监测静、动态数据的质量评估方法。对某大跨度斜拉桥健康监测系统的静、动态数据进行质量评估,通过对比分析了不同评估质量的温度数据、静挠度数据和不同评估质量的主梁竖向加速度动力信号的模态参数识别的稳定图,验证了所提方法的正确性。结果表明,所提评估方法能够快速有效地判断数据质量的好坏,进而确保桥梁结构的服役性能评估和预测的准确性,有利于提高健康监测数据的可用性和效能。展开更多
Crab paste is a traditional and popular food for people in the coastal area of China.Visual inspection is currently the only method of quality assessment.We developed a new analytical method of a comprehensive survey ...Crab paste is a traditional and popular food for people in the coastal area of China.Visual inspection is currently the only method of quality assessment.We developed a new analytical method of a comprehensive survey of nutrients and quality assessment for crab paste based on a combination of 1 H NMR spectroscopy with multivariate data analysis.The aqueous extract of crab paste was dominated by amino acids,sugars,carboxylic acids,nucleotides and amines(including 19 first-time-reported compounds such as choline,uracil and guanosine).Two grades of crab paste had significant compositional differences in terms of amino acids,lactate,N-acetylglutamate,choline,dimethylamine,uridine,1-methylnicotinamide and 2-pyridienmethanol.These results provided important information on the grade-dependence of crab-paste composition,and demonstrated that NMR-MDA was effective not only for the comprehensive survey of nutrients,but also for quality assessment of crab paste.展开更多
以METOP-A、Suomi-NPP历史资料作为参照,系统分析比较了2008—2020年4颗风云三号卫星的微波温度计(Microwave Temperature Sounder, MWTS)再定标历史资料质量。结果表明,4颗卫星的MWTS探测性能稳步上升,再定标数据集有效消除了遥感仪器...以METOP-A、Suomi-NPP历史资料作为参照,系统分析比较了2008—2020年4颗风云三号卫星的微波温度计(Microwave Temperature Sounder, MWTS)再定标历史资料质量。结果表明,4颗卫星的MWTS探测性能稳步上升,再定标数据集有效消除了遥感仪器在轨期间数据异常跳变、寿命期内遥感仪器辐射响应衰变、不同卫星间的辐射定标差异等因素影响,大幅提升了MWTS历史数据集的准确性和均匀性,使得再定标后的对流层和平流层通道数据与国外同类型仪器数据偏差在±0.1 K范围内。本文还重点分析比较了对流层中高层和平流层低层两个探测通道,结果表明FY-3D MWTS再定标数据和美国NOAA卫星应用研究中心STAR长序列数据集针对中高层大气的表现类似,平均亮温在时间变化和空间分布具有相似的特征,月均全球高空亮温年变化趋势差异最大为0.002 4。因此,2020年之后的FY-3D再定标数据,可以接续STAR长序列数据集,用于中高层大气的温度变化检测与分析。展开更多
基金supported by University of Stavanger, NorwaySINTEF,the Center for Integrated Operations in the Petroleum Industry and the management of National Oilwell Varco Intelli Serv
文摘Wired drill pipe(WDP)technology is one of the most promising data acquisition technologies in today s oil and gas industry.For the first time it allows sensors to be positioned along the drill string which enables collecting and transmitting valuable data not only from the bottom hole assembly(BHA),but also along the entire length of the wellbore to the drill floor.The technology has received industry acceptance as a viable alternative to the typical logging while drilling(LWD)method.Recently more and more WDP applications can be found in the challenging drilling environments around the world,leading to many innovations to the industry.Nevertheless most of the data acquired from WDP can be noisy and in some circumstances of very poor quality.Diverse factors contribute to the poor data quality.Most common sources include mis-calibrated sensors,sensor drifting,errors during data transmission,or some abnormal conditions in the well,etc.The challenge of improving the data quality has attracted more and more focus from many researchers during the past decade.This paper has proposed a promising solution to address such challenge by making corrections of the raw WDP data and estimating unmeasurable parameters to reveal downhole behaviors.An advanced data processing method,data validation and reconciliation(DVR)has been employed,which makes use of the redundant data from multiple WDP sensors to filter/remove the noise from the measurements and ensures the coherence of all sensors and models.Moreover it has the ability to distinguish the accurate measurements from the inaccurate ones.In addition,the data with improved quality can be used for estimating some crucial parameters in the drilling process which are unmeasurable in the first place,hence provide better model calibrations for integrated well planning and realtime operations.
基金National Key R&D Program of China(2018YFC1506603)The Second Tibetan Plateau Scientific Expedition and Research(STEP)Program(2019QZKK0105)。
文摘The Lightning Mapping Imager(LMI)equipped on the FY-4 A(Feng Yun-4 A)geostationary satellite achieves lightning positioning through optical imaging and has the advantages of high temporal resolution,high stability,and continuous observation.In this study,FY-4 A LMI lightning event,group and flash data from April to August 2018 are selected,and their quality are assessed through qualitative and quantitative comparison with the ground-based Advanced Time of Arrival and Direction system(ADTD)lightning observation network data and the American International Space Station(ISS)lightning imaging sensor(LIS)data.The results show that the spatial distributions of FY-4 A lightning are consistent with those of the ground-based ADTD and ISS LIS.The temporal variation in FY-4 A lightning group frequency is consistent with that of ADTD stroke,which reflects that FY-4 A LMI can capture the lightning occurrence in inland China.Quantitative statistics show that the consistency rate of FY-4 A LMI and ISS LIS events is relatively high but their consistency rate is lower in terms of lightning group and flash data.Compared with the lightning observations by the ISS LIS and the ground-based ADTD,FY-4 A LMI reports fewer lightning events in the Tibetan Plateau.The application of Tibetan Plateau lightning data requires further processing and consideration.
基金supported by the State Grid Science and Technology Project (GEIRI-DL-71-17-002)
文摘This paper introduces the implementation and data analysis associated with a state-wide power quality monitoring and analysis system in China. Corporation specifications on power quality monitors as well as on communication protocols are formulated for data transmission. Big data platform and related technologies are utilized for data storage and computation. Compliance verification analysis and a power quality performance assessment are conducted, and a visualization tool for result presentation is finally presented.
基金The National Natural Science Foundation of China (No.70772021,70372004)China Postdoctoral Science Foundation (No.20060400077)
文摘This paper presents a methodology to determine three data quality (DQ) risk characteristics: accuracy, comprehensiveness and nonmembership. The methodology provides a set of quantitative models to confirm the information quality risks for the database of the geographical information system (GIS). Four quantitative measures are introduced to examine how the quality risks of source information affect the quality of information outputs produced using the relational algebra operations Selection, Projection, and Cubic Product. It can be used to determine how quality risks associated with diverse data sources affect the derived data. The GIS is the prime source of information on the location of cables, and detection time strongly depends on whether maps indicate the presence of cables in the construction business. Poor data quality in the GIS can contribute to increased risk or higher risk avoidance costs. A case study provides a numerical example of the calculation of the trade-offs between risk and detection costs and provides an example of the calculation of the costs of data quality. We conclude that the model contributes valuable new insight.
文摘In order to improve the performance of classifiers in subjective domains, this paper defines a metric to measure the quality of the subjectively labelled training data (QoSTD) by means of K-means clustering. Then, the QoSTD is used as a weight of the predicted class scores to adjust the likelihoods of instances. Moreover, two measurements are defined to assess the performance of the classifiers trained by the subjective labelled data. The binary classifiers of Traditional Chinese Medicine (TCM) Zhengs are trained and retrained by the real-world data set, utilizing the support vector machine (SVM) and the discrimination analysis (DA) models, so as to verify the effectiveness of the proposed method. The experimental results show that the consistency of likelihoods of instances with the corresponding observations is increased notable for the classes, especially in the cases with the relatively low QoSTD training data set. The experimental results also indicate the solution how to eliminate the miss-labelled instances from the training data set to re-train the classifiers in the subjective domains.
基金This work is supported by the NSFC[Grant No.61772280]the National Training Programs of Innovation and Entrepreneurship for Undergraduates[Grant Nos.201910300123Y,201810300165]the PAPD Fund from NUIST.
文摘With the advent of the era of big data,the Provenance Method of electronic archives based on knowledge graph under the environment of big data has produced a large number of electronic archives due to the development of science and technology.How to guarantee the credential characteristics of electronic archives in the big data environment has attracted wide attention of the academic community.Provenance is an important technical means to guarantee the certification of electronic archives.In this paper,knowledge graph technology is used to provide the concept provenance of electronic archives in large data environment.It not only enriches the provenance method,but also guarantees the certification of electronic archives in the large data environment.
文摘从天地图融合数据质量检查出发,依据数据标准,通过结合具体的质检规则,研究了一种基于ArcGIS Data Reviewer模块的自动化、批量化并且可使数据在处理阶段就可进行检查的天地图融合数据检验方法,这种灵活的质检机制大大减少了数据融合过程中的人工反复处理,提高了生产单位的作业效率及成果质量,也可为其他项目的质检系统开发提供借鉴。
基金supported by the National Key Research and Development Program of China(No.2019YFC0214800)。
文摘Air quality monitoring is effective for timely understanding of the current air quality status of a region or city.Currently,the huge volume of environmental monitoring data,which has reasonable real-time performance,provides strong support for in-depth analysis of air pollution characteristics and causes.However,in the era of big data,to meet current demands for fine management of the atmospheric environment,it is important to explore the characteristics and causes of air pollution from multiple aspects for comprehensive and scientific evaluation of air quality.This study reviewed and summarized air quality evaluation methods on the basis of environmental monitoring data statistics during the 13th Five-Year Plan period,and evaluated the level of air pollution in the Beijing-Tianjin-Hebei region and its surrounding areas(i.e.,the“2+26”region)during the period of the three-year action plan to fight air pollution.We suggest that air quality should be comprehensively,deeply,and scientifically evaluated from the aspects of air pollution characteristics,causes,and influences of meteorological conditions and anthropogenic emissions.It is also suggested that a threeyear moving average be introduced as one of the evaluation indexes of long-term change of pollutants.Additionally,both temporal and spatial differences should be considered when removing confounding meteorological factors.
文摘桥梁健康监测数据的挖掘和分析工作只有在整体数据质量符合基本要求的有效数据基础上进行,才能保障如模态参数识别、损伤识别和状态评估等后续工作的准确性。因此,基于量化改进的探索性分析方法(Exploratory Data Analysis,EDA)和相关性分析从数据完整性、准确性和一致性的角度建立了桥梁健康监测静、动态数据的质量评估方法。对某大跨度斜拉桥健康监测系统的静、动态数据进行质量评估,通过对比分析了不同评估质量的温度数据、静挠度数据和不同评估质量的主梁竖向加速度动力信号的模态参数识别的稳定图,验证了所提方法的正确性。结果表明,所提评估方法能够快速有效地判断数据质量的好坏,进而确保桥梁结构的服役性能评估和预测的准确性,有利于提高健康监测数据的可用性和效能。
基金supported by the National Natural Science Foundation of China (31100032)University National Oceanographic Public Welfare Project (201205029)+1 种基金K. C. Wong Magna Fund in Ningbo University and Academic Discipline Project of Ningbo University (xk11089)the National Natural Science Foundation of China (20825520and 20921004) for financial supports
文摘Crab paste is a traditional and popular food for people in the coastal area of China.Visual inspection is currently the only method of quality assessment.We developed a new analytical method of a comprehensive survey of nutrients and quality assessment for crab paste based on a combination of 1 H NMR spectroscopy with multivariate data analysis.The aqueous extract of crab paste was dominated by amino acids,sugars,carboxylic acids,nucleotides and amines(including 19 first-time-reported compounds such as choline,uracil and guanosine).Two grades of crab paste had significant compositional differences in terms of amino acids,lactate,N-acetylglutamate,choline,dimethylamine,uridine,1-methylnicotinamide and 2-pyridienmethanol.These results provided important information on the grade-dependence of crab-paste composition,and demonstrated that NMR-MDA was effective not only for the comprehensive survey of nutrients,but also for quality assessment of crab paste.