In this paper we report an analysis of sampling error uncertainties in mean maximum and minimum temperatures (Tmax and Tmin) carried out on monthly,seasonal and annual scales,including an examination of homogenized ...In this paper we report an analysis of sampling error uncertainties in mean maximum and minimum temperatures (Tmax and Tmin) carried out on monthly,seasonal and annual scales,including an examination of homogenized and original data collected at 731 meteorological stations across China for the period 1951-2004.Uncertainties of the gridded data and national average,linear trends and their uncertainties,as well as the homogenization effect on uncertainties are assessed.It is shown that the sampling error variances of homogenized Tmax and Tmin,which are larger in winter than in summer,have a marked northwest-southeast gradient distribution,while the sampling error variances of the original data are found to be larger and irregular.Tmax and Tmin increase in all months of the year in the study period 1951-2004,with the largest warming and uncertainties being 0.400℃ (10 yr)-1 + 0.269℃ (10 yr)-1 and 0.578℃ (10 yr)-1 + 0.211℃ (10 yr)-1 in February,and the least being 0.022℃ (10 yr)-1 + 0.085℃ (10 yr)-1 and 0.104℃ (10 yr)-1 +0.070℃ (10 yr)-1 in August.Homogenization can remove large uncertainties in the original records resulting from various non-natural changes in China.展开更多
The analytical mathematical solutions of gas concentration and fractional gas loss for the diffusion of gas in a cylindrical coal sample were given with detailed mathematical derivations by assuming that the diffusion...The analytical mathematical solutions of gas concentration and fractional gas loss for the diffusion of gas in a cylindrical coal sample were given with detailed mathematical derivations by assuming that the diffusion of gas through the coal matrix is concentration gradient-driven and obeys the Fick’s Second Law of Diffusion.The analytical solutions were approximated in case of small values of time and the error analyses associated with the approximation were also undertaken.The results indicate that the square root relationship of gas release in the early stage of desorption,which is widely used to provide a simple and fast estimation of the lost gas,is the first term of the approximation,and care must be taken in using the square root relationship as a significant error might be introduced with increase in the lost time and decrease in effective diameter of a cylindrical coal sample.展开更多
When precision fanning management zones (MZs) are delineated in an agricultural field for precision nutrient management, unsupervised classification and cluster analysis procedures using remote sensing image analysi...When precision fanning management zones (MZs) are delineated in an agricultural field for precision nutrient management, unsupervised classification and cluster analysis procedures using remote sensing image analysis software are performed. These unsupervised classification and cluster analysis procedures are performed on the basis of the assumption that grouping of data points into naturally occurring clusters reduces within zone variability. The problem is that, there are small patches of different soil types within each management zone that are regarded as insignificant by the farmer, and are assimilated within larger MZs. These will consequently make soils within a management zone to be inhomogeneous. The objective of this study was to determine the probability of soil sampling occurrences on patches assimilated during delineation of MZs after a cluster analysis was performed. The study was conducted on a 5.0 ha (25°05′34.46″ S and 28°18′30.01″ E) and a 24.4 ha (23°59′04.61″ S and 28°52′29.43″ E) fields in the Waterberg District of the Limpopo Province in South Africa. A bare-soil high resolution Quickbird satellite imagery of a conventionally tilled agricultural field was used to develop MZs in the field. Soils were sampled using systematic unaligned sampling on a 35.0 m and 30.0 m grids for the 24.4 ha and 5.0 ha fields, respectively. Probabilities were calculated based on percentage area assimilated during the cluster analysis procedure that was performed using remote sensing image analysis software. The results indicated that in the 24.4 ha field there were 2.5 ha patches of high and medium zones that were assimilated within the low zone, and thus making low zones non-homogeneous. After cluster analysis and assimilation of patches, the low zone in the 24.4 ha field increased by 45.5% (2.5 ha) while the high zone was 16.4% (2.4 ha) smaller in size. In the smaller field of 5.0 ha, the high zone, which was originally 3.20 ha, lost 0.37 ha (11.6%), which was assimilated in either low or medium zone. The study indicates that unequal probability proportional to size sampling could be used to minimize error when sampling across precision farming MZs because typically the low, medium and high MZs are not of equal size and do not contribute equally towards the mean values of soil samples.展开更多
The principle of planc-to-plane perpendicularity measuring with coordinate measuring machine (CMM) is described and the main factors that influence the measuring precision are analyzed. The minimum condition method ...The principle of planc-to-plane perpendicularity measuring with coordinate measuring machine (CMM) is described and the main factors that influence the measuring precision are analyzed. The minimum condition method is adopted to eliminate the fitting error of the datum plane. In order to diminish the length error of the object plane, the tactics of measuring some part of the plane and then scale to the whole plane is employed. With large quantity of measuring experiments on fiat plates, the most appropriate number of points in measuring a plane is determined to reduce the sampling error.展开更多
To determine the feasibility and practicability of interrupt continuous wave (CW) approach proposed for real time simulating radar intermediate frequency(IF) video signal, theoretical analysis and computer simulation...To determine the feasibility and practicability of interrupt continuous wave (CW) approach proposed for real time simulating radar intermediate frequency(IF) video signal, theoretical analysis and computer simulation were used. Phases at two linked points between the end and beginning of adjoined frames are always consistent; the bias Doppler frequency for the time delay of A/D sampling start responds to that for target acceleration. No digital phase compensation is required at continuous points, and the interrupt CW approach has apparently practical values.展开更多
To evaluate a step up approach: Taking macrobiopsies and performing excision biopsies in patients with suspected rectal cancer in which biopsies taken though the flexible endoscope showed benign histology. METHODSPati...To evaluate a step up approach: Taking macrobiopsies and performing excision biopsies in patients with suspected rectal cancer in which biopsies taken though the flexible endoscope showed benign histology. METHODSPatients with a rectal neoplasm who underwent flexible endoscopy and biopsies were included. In case of benign biopsies rigid rectoscopy and macrobiopsies were employed. If this failed to prove malignancy, transanal endoscopic microsurgery (TEM) was used in a final effort to establish a certain preoperative diagnosis. The preoperative results were compared with the findings after surgical excision and follow up to calculate the reliability of this algorithm. RESULTSOne hundred and thirty-two patients were included. One hundred and ten patients with a carcinoma and 22 with an adenoma. Seventy-five of 110 carcinomas were proven malignant after flexible endoscopy. With the addition of rigid endoscopy and taking of macrobiopsies, this number increased to 89. Performing TEM excision biopsies further enlarged the number of proven malignancies to 100. CONCLUSIONThe step-up approach includes taking macrobiopsies through the rigid rectoscope and performing excision biopsies using transanal endoscopic microsurgery in addition to flexible endoscopy. This approach, reduced the number of missed preoperative malignant diagnoses from 32% to 9%.展开更多
Non-sampling errors can generally be divided into three types:sampling frame errors,non-response errors and measurement errors.Missing target units in the sam-pling frame,improper handling of non-responses,and misrepo...Non-sampling errors can generally be divided into three types:sampling frame errors,non-response errors and measurement errors.Missing target units in the sam-pling frame,improper handling of non-responses,and misreporting or underreport-ing of key variables in the questionnaire can all cause deviations in a survey’s results.The widespread application of Computer-Assisted Personal Interviewing(CAPI)systems and the inclusion of administrative records from government sources in sur-veys has strengthened the ability to control non-sampling errors.Taking a national fertility sampling survey as an example,this study summarizes the sources of var-ious non-sampling errors and explains how to harness big data resources such as administrative records to control non-sampling errors throughout the survey.The study analyzes the impact of three types of non-sampling errors on the results of the fertility survey and examines the strategies used to address the problems caused by these non-sampling errors.The findings indicate that non-sampling errors were the main source of total error in the survey,and that the errors found came mainly from sampling frame errors;non-response errors and measurement errors were controlled and had little impact on the survey results.展开更多
Sampling plays an important role in acquiring precise soil information required in modern agricultural production worldwide, which determines both the cost and quality of final soil mapping products. For sampling desi...Sampling plays an important role in acquiring precise soil information required in modern agricultural production worldwide, which determines both the cost and quality of final soil mapping products. For sampling design, it has been proposed possibile to transfer the relationships between kriging variance and sampling grid spacing from an area with existing information to other areas with similar soil-forming environments. However, this approach is challenged in practice because of two problems: i) different population vaxiograms among similar areas and ii) sampling errors in estimated variograms. This study evaluated the effects of these two problems on the transferability of the relationships between kriging variance and sampling grid spacing, by using spatial data simulated with three variograms and soil samples collected from four grasslands in Ireland with similar soil-forming environments. Results showed that the variograms suggested by different samples collected with the same grid spacing in the same or similar areas were different, leading to a range of mean kriging variance (MKV) for each grid spacing. With increasing grid spacing, the variation of MKV for a specific grid spacing increased and deviated more from the MKV generated using the population variograms. As a result, the spatial transferability of the relationships between kriging variance and grid spacing for sampling design was limited.展开更多
One of the obstacles of the efficient association rule mining is theexplosive expansion of data sets since it is costly or impossible to scan large databases, esp., formultiple times. A popular solution to improve the...One of the obstacles of the efficient association rule mining is theexplosive expansion of data sets since it is costly or impossible to scan large databases, esp., formultiple times. A popular solution to improve the speed and scalability of the association rulemining is to do the algorithm on a random sample instead of the entire database. But how toeffectively define and efficiently estimate the degree of error with respect to the outcome of thealgorithm, and how to determine the sample size needed are entangling researches until now. In thispaper, an effective and efficient algorithm is given based on the PAC (Probably Approximate Correct)learning theory to measure and estimate sample error. Then, a new adaptive, on-line, fast samplingstrategy - multi-scaling sampling - is presented inspired by MRA (Multi-Resolution Analysis) andShannon sampling theorem, for quickly obtaining acceptably approximate association rules atappropriate sample size. Both theoretical analysis and empirical study have showed that the Samplingstrategy can achieve a very good speed-accuracy trade-off.展开更多
In the present paper,we provide an error bound for the learning rates of the regularized Shannon sampling learning scheme when the hypothesis space is a reproducing kernel Hilbert space(RKHS) derived by a Mercer kerne...In the present paper,we provide an error bound for the learning rates of the regularized Shannon sampling learning scheme when the hypothesis space is a reproducing kernel Hilbert space(RKHS) derived by a Mercer kernel and a determined net.We show that if the sample is taken according to the determined set,then,the sample error can be bounded by the Mercer matrix with respect to the samples and the determined net.The regularization error may be bounded by the approximation order of the reproducing kernel Hilbert space interpolation operator.The paper is an investigation on a remark provided by Smale and Zhou.展开更多
基金supported by the National Natural Science Foundation of China (Grant No. 41130103)the 973 Program (Grant Nos. 2009CB421406 and 2012CB955401)+1 种基金the US National Oceanographic and Atmospheric Administration (Grant No. EL133E09SE4048)the US National Science Foundation (Grant Nos. AGS-1015926 and AGS-1015957)
文摘In this paper we report an analysis of sampling error uncertainties in mean maximum and minimum temperatures (Tmax and Tmin) carried out on monthly,seasonal and annual scales,including an examination of homogenized and original data collected at 731 meteorological stations across China for the period 1951-2004.Uncertainties of the gridded data and national average,linear trends and their uncertainties,as well as the homogenization effect on uncertainties are assessed.It is shown that the sampling error variances of homogenized Tmax and Tmin,which are larger in winter than in summer,have a marked northwest-southeast gradient distribution,while the sampling error variances of the original data are found to be larger and irregular.Tmax and Tmin increase in all months of the year in the study period 1951-2004,with the largest warming and uncertainties being 0.400℃ (10 yr)-1 + 0.269℃ (10 yr)-1 and 0.578℃ (10 yr)-1 + 0.211℃ (10 yr)-1 in February,and the least being 0.022℃ (10 yr)-1 + 0.085℃ (10 yr)-1 and 0.104℃ (10 yr)-1 +0.070℃ (10 yr)-1 in August.Homogenization can remove large uncertainties in the original records resulting from various non-natural changes in China.
基金provided by the Science and Technology Grant of Huainan City of China (No.2013A4001)the Key Research Grant of Shanxi Province of China (No.201303027-1)
文摘The analytical mathematical solutions of gas concentration and fractional gas loss for the diffusion of gas in a cylindrical coal sample were given with detailed mathematical derivations by assuming that the diffusion of gas through the coal matrix is concentration gradient-driven and obeys the Fick’s Second Law of Diffusion.The analytical solutions were approximated in case of small values of time and the error analyses associated with the approximation were also undertaken.The results indicate that the square root relationship of gas release in the early stage of desorption,which is widely used to provide a simple and fast estimation of the lost gas,is the first term of the approximation,and care must be taken in using the square root relationship as a significant error might be introduced with increase in the lost time and decrease in effective diameter of a cylindrical coal sample.
文摘When precision fanning management zones (MZs) are delineated in an agricultural field for precision nutrient management, unsupervised classification and cluster analysis procedures using remote sensing image analysis software are performed. These unsupervised classification and cluster analysis procedures are performed on the basis of the assumption that grouping of data points into naturally occurring clusters reduces within zone variability. The problem is that, there are small patches of different soil types within each management zone that are regarded as insignificant by the farmer, and are assimilated within larger MZs. These will consequently make soils within a management zone to be inhomogeneous. The objective of this study was to determine the probability of soil sampling occurrences on patches assimilated during delineation of MZs after a cluster analysis was performed. The study was conducted on a 5.0 ha (25°05′34.46″ S and 28°18′30.01″ E) and a 24.4 ha (23°59′04.61″ S and 28°52′29.43″ E) fields in the Waterberg District of the Limpopo Province in South Africa. A bare-soil high resolution Quickbird satellite imagery of a conventionally tilled agricultural field was used to develop MZs in the field. Soils were sampled using systematic unaligned sampling on a 35.0 m and 30.0 m grids for the 24.4 ha and 5.0 ha fields, respectively. Probabilities were calculated based on percentage area assimilated during the cluster analysis procedure that was performed using remote sensing image analysis software. The results indicated that in the 24.4 ha field there were 2.5 ha patches of high and medium zones that were assimilated within the low zone, and thus making low zones non-homogeneous. After cluster analysis and assimilation of patches, the low zone in the 24.4 ha field increased by 45.5% (2.5 ha) while the high zone was 16.4% (2.4 ha) smaller in size. In the smaller field of 5.0 ha, the high zone, which was originally 3.20 ha, lost 0.37 ha (11.6%), which was assimilated in either low or medium zone. The study indicates that unequal probability proportional to size sampling could be used to minimize error when sampling across precision farming MZs because typically the low, medium and high MZs are not of equal size and do not contribute equally towards the mean values of soil samples.
基金sponsored by the Special Research Fund for Young Teachers of Universities in Shanghai under Grant No.gjd-07048
文摘The principle of planc-to-plane perpendicularity measuring with coordinate measuring machine (CMM) is described and the main factors that influence the measuring precision are analyzed. The minimum condition method is adopted to eliminate the fitting error of the datum plane. In order to diminish the length error of the object plane, the tactics of measuring some part of the plane and then scale to the whole plane is employed. With large quantity of measuring experiments on fiat plates, the most appropriate number of points in measuring a plane is determined to reduce the sampling error.
文摘To determine the feasibility and practicability of interrupt continuous wave (CW) approach proposed for real time simulating radar intermediate frequency(IF) video signal, theoretical analysis and computer simulation were used. Phases at two linked points between the end and beginning of adjoined frames are always consistent; the bias Doppler frequency for the time delay of A/D sampling start responds to that for target acceleration. No digital phase compensation is required at continuous points, and the interrupt CW approach has apparently practical values.
文摘To evaluate a step up approach: Taking macrobiopsies and performing excision biopsies in patients with suspected rectal cancer in which biopsies taken though the flexible endoscope showed benign histology. METHODSPatients with a rectal neoplasm who underwent flexible endoscopy and biopsies were included. In case of benign biopsies rigid rectoscopy and macrobiopsies were employed. If this failed to prove malignancy, transanal endoscopic microsurgery (TEM) was used in a final effort to establish a certain preoperative diagnosis. The preoperative results were compared with the findings after surgical excision and follow up to calculate the reliability of this algorithm. RESULTSOne hundred and thirty-two patients were included. One hundred and ten patients with a carcinoma and 22 with an adenoma. Seventy-five of 110 carcinomas were proven malignant after flexible endoscopy. With the addition of rigid endoscopy and taking of macrobiopsies, this number increased to 89. Performing TEM excision biopsies further enlarged the number of proven malignancies to 100. CONCLUSIONThe step-up approach includes taking macrobiopsies through the rigid rectoscope and performing excision biopsies using transanal endoscopic microsurgery in addition to flexible endoscopy. This approach, reduced the number of missed preoperative malignant diagnoses from 32% to 9%.
基金sponsored by the Follow-up Research on Fertility Level and Fertility Intentions with the Help of Big Data(No.21BRK001)a research project funded by the National Social Science Fund of China.
文摘Non-sampling errors can generally be divided into three types:sampling frame errors,non-response errors and measurement errors.Missing target units in the sam-pling frame,improper handling of non-responses,and misreporting or underreport-ing of key variables in the questionnaire can all cause deviations in a survey’s results.The widespread application of Computer-Assisted Personal Interviewing(CAPI)systems and the inclusion of administrative records from government sources in sur-veys has strengthened the ability to control non-sampling errors.Taking a national fertility sampling survey as an example,this study summarizes the sources of var-ious non-sampling errors and explains how to harness big data resources such as administrative records to control non-sampling errors throughout the survey.The study analyzes the impact of three types of non-sampling errors on the results of the fertility survey and examines the strategies used to address the problems caused by these non-sampling errors.The findings indicate that non-sampling errors were the main source of total error in the survey,and that the errors found came mainly from sampling frame errors;non-response errors and measurement errors were controlled and had little impact on the survey results.
基金?nancially supported by the National Natural Science Foundation of China (Nos. 41541006 and 41771246)co-funded by Enterprise Ireland and the European Regional Development Fund (ERDF) under the National Strategic Reference Framework (NSRF) 2007–2013
文摘Sampling plays an important role in acquiring precise soil information required in modern agricultural production worldwide, which determines both the cost and quality of final soil mapping products. For sampling design, it has been proposed possibile to transfer the relationships between kriging variance and sampling grid spacing from an area with existing information to other areas with similar soil-forming environments. However, this approach is challenged in practice because of two problems: i) different population vaxiograms among similar areas and ii) sampling errors in estimated variograms. This study evaluated the effects of these two problems on the transferability of the relationships between kriging variance and sampling grid spacing, by using spatial data simulated with three variograms and soil samples collected from four grasslands in Ireland with similar soil-forming environments. Results showed that the variograms suggested by different samples collected with the same grid spacing in the same or similar areas were different, leading to a range of mean kriging variance (MKV) for each grid spacing. With increasing grid spacing, the variation of MKV for a specific grid spacing increased and deviated more from the MKV generated using the population variograms. As a result, the spatial transferability of the relationships between kriging variance and grid spacing for sampling design was limited.
基金CAS Project of Brain and Mind Science,国家高技术研究发展计划(863计划),国家重点基础研究发展计划(973计划),国家自然科学基金,湖南省自然科学基金
文摘One of the obstacles of the efficient association rule mining is theexplosive expansion of data sets since it is costly or impossible to scan large databases, esp., formultiple times. A popular solution to improve the speed and scalability of the association rulemining is to do the algorithm on a random sample instead of the entire database. But how toeffectively define and efficiently estimate the degree of error with respect to the outcome of thealgorithm, and how to determine the sample size needed are entangling researches until now. In thispaper, an effective and efficient algorithm is given based on the PAC (Probably Approximate Correct)learning theory to measure and estimate sample error. Then, a new adaptive, on-line, fast samplingstrategy - multi-scaling sampling - is presented inspired by MRA (Multi-Resolution Analysis) andShannon sampling theorem, for quickly obtaining acceptably approximate association rules atappropriate sample size. Both theoretical analysis and empirical study have showed that the Samplingstrategy can achieve a very good speed-accuracy trade-off.
基金supported by National Natural Science Foundation of China (Grant No.10871226)Natural Science Foundation of Zhejiang Province (Grant No. Y6100096)
文摘In the present paper,we provide an error bound for the learning rates of the regularized Shannon sampling learning scheme when the hypothesis space is a reproducing kernel Hilbert space(RKHS) derived by a Mercer kernel and a determined net.We show that if the sample is taken according to the determined set,then,the sample error can be bounded by the Mercer matrix with respect to the samples and the determined net.The regularization error may be bounded by the approximation order of the reproducing kernel Hilbert space interpolation operator.The paper is an investigation on a remark provided by Smale and Zhou.