High-resolution and detailed regional soil spatial distribution information is increasingly needed for ecological modeling and land resource management. For areas with no point data, regional soil mapping includes two...High-resolution and detailed regional soil spatial distribution information is increasingly needed for ecological modeling and land resource management. For areas with no point data, regional soil mapping includes two steps: soil sampling and soil mapping. Because sampling over a large area is costly, efficient sampling strategies are required. A multi-grade representative sampling strategy, which designs a small number of representative samples with different representative grades to depict soil spatial variations at different scales,could be a potentially efficient sampling strategy for regional soil mapping. Additionally, a suitable soil mapping approach is needed to map regional soil variations based on a small number of samples. In this study, the multi-grade representative sampling strategy was applied and a fuzzy membership-weighted soil mapping approach was developed to map soil sand percentage and soil organic carbon(SOC) at 0–20 and 20–40 cm depths in a study area of 5 900 km2 in Anhui Province of China. First, geographical sub-areas were delineated using a parent lithology data layer. Next, fuzzy c-means clustering was applied to two climate and four terrain variables in each stratum. The clustering results(environmental cluster chains) were used to locate representative samples. Evaluations based on an independent validation sample set showed that the addition of samples with lower representativeness generally led to a decrease of root mean square error(RMSE). The declining rates of RMSE with the addition of samples slowed down for 20–40 cm depth, but fluctuated for 0–20 cm depth. The predicted SOC maps based on the representative samples exhibited higher accuracy, especially for soil depth 20–40 cm, as compared to those based on legacy soil data. Multi-grade representative sampling could be an effective sampling strategy at a regional scale. This sampling strategy, combined with the fuzzy membership-based mapping approach, could be an optional effective framework for regional soil property mapping. A more detailed and accurate soil parent material map and the addition of environmental variables representing human activities would improve mapping accuracy.展开更多
Sampling plays an important role in acquiring precise soil information required in modern agricultural production worldwide, which determines both the cost and quality of final soil mapping products. For sampling desi...Sampling plays an important role in acquiring precise soil information required in modern agricultural production worldwide, which determines both the cost and quality of final soil mapping products. For sampling design, it has been proposed possibile to transfer the relationships between kriging variance and sampling grid spacing from an area with existing information to other areas with similar soil-forming environments. However, this approach is challenged in practice because of two problems: i) different population vaxiograms among similar areas and ii) sampling errors in estimated variograms. This study evaluated the effects of these two problems on the transferability of the relationships between kriging variance and sampling grid spacing, by using spatial data simulated with three variograms and soil samples collected from four grasslands in Ireland with similar soil-forming environments. Results showed that the variograms suggested by different samples collected with the same grid spacing in the same or similar areas were different, leading to a range of mean kriging variance (MKV) for each grid spacing. With increasing grid spacing, the variation of MKV for a specific grid spacing increased and deviated more from the MKV generated using the population variograms. As a result, the spatial transferability of the relationships between kriging variance and grid spacing for sampling design was limited.展开更多
Soil type maps at the scale of 1︰1 000 000 are used extensively to provide soil spatial distribution information for soil erosion assessment and watershed management models in China.However,the soil property maps pro...Soil type maps at the scale of 1︰1 000 000 are used extensively to provide soil spatial distribution information for soil erosion assessment and watershed management models in China.However,the soil property maps produced through conventional direct linking method usually suffer low accuracy as well as the lack of spatial details within a soil type polygon.This paper presents an effective method to produce detailed soil property map based on representative samples which were extracted from each polygon on the 1︰1000 000 soil type map.The representative sample of each polygon is defined as the location that can represent the largest area within the polygon.The representativeness of a candidate sample is determined by calculating the soil-forming environment condition similarities between the sample and other locations.Once the representative sample of each polygon has been chosen,the property values of the existing typical samples are assigned to the corresponding representative samples with the same soil type.Finally,based on these representative samples,the detailed soil property map could be produced by using existing digital soil mapping methods.The case study in XuanCheng City,Anhui Province of China,demonstrated the proposed method could produce soil property map at a higher level of spatial details and accuracy:1)The soil organic matter(SOM)map produced based on the representative samples can not only depict the detailed spatial distribution of SOM within a soil type polygon but also largely reduce the abrupt change of soil property at the boundaries of two adjacent polygons.2)The Root Mean Squared Error(RMSE)of the SOM map based on the representative samples is1.61,and it is 1.37 for the SOM map produced by using conventional direct linking method.Therefore,the proposed method is an effective approach to produce spatial detailed soil property map with higher accuracy for environment simulation models.展开更多
The Soil Land Inference Model(SoLIM) was primarily proposed by Zhu et al.(Zhu A X, Band L, Vertessy R, Dutton B. 1997. Derivation of soil properties using a soil land inference model(SoLIM). Soil Sci Soc Am J. 61: 523...The Soil Land Inference Model(SoLIM) was primarily proposed by Zhu et al.(Zhu A X, Band L, Vertessy R, Dutton B. 1997. Derivation of soil properties using a soil land inference model(SoLIM). Soil Sci Soc Am J. 61: 523–533.) and was based on the Third Law of Geography. Based on the assumption that the soil property value at a location of interest will be more similar to that of a given soil sample when the environmental condition at the location of interest is more similar to that at the location from which the sample was taken, SoLIM estimates the soil property value of the location of interest using the soil property values of known samples weighted by the similarity between those samples and the location of interest in terms of an attribute domain of environmental conditions. However, the current SoLIM method ignores information about the spatial distances between the location of interest and those of the sample. In this study, we proposed a new method of soil property mapping, So LIM-IDW, which incorporates spatial distance information into the SoLIM method by means of inverse distance weighting(IDW). The proposed method is based on the assumption that the soil property value at a location of interest will be more similar to that of a known sample both when the environmental conditions are more similar and when the distance between the location of interest and the sample location is shorter. Our evaluation experiments on A-horizon soil organic matter mapping in two study areas with independent evaluation samples showed that the proposed SoLIM-IDW method can obtain lower prediction errors than the original SoLIM method, multiple linear regression, geographically weighted regression, and regression-kriging with the same modeling points. Future work mainly includes the determination of optimal power parameter values and the appropriate setting of the parameter under different application contexts.展开更多
Digital soil mapping (DSM) aims to produce detailed maps of soil properties or soil classes to improve agricultural management and soil quality assessment. Optimized sampling design can reduce the substantial costs an...Digital soil mapping (DSM) aims to produce detailed maps of soil properties or soil classes to improve agricultural management and soil quality assessment. Optimized sampling design can reduce the substantial costs and efforts associated with sampling, profile description, and laboratory analysis. The purpose of this study was to compare common sampling designs for DSM, including grid sampling (GS), grid random sampling (GRS), stratified random sampling (StRS), and conditioned Latin hypercube sampling (cLHS). In an agricultural field (11 ha) in Quebec, Canada, a total of unique 118 locations were selected using each of the four sampling designs (45 locations each), and additional 30 sample locations were selected as an independent testing dataset (evaluation dataset). Soil visible near-infrared (Vis-NIR) spectra were collected in situ at the 148 locations (1 m depth), and soil cores were collected from a subset of 32 locations and subdivided at 10-cm depth intervals, totaling 251 samples. The Cubist model was used to elucidate the relationship between Vis-NIR spectra and soil properties (soil organic matter (SOM) and clay), which was then used to predict the soil properties at all 148 sample locations. Digital maps of soil properties at multiple depths for the entire field (148 sample locations) were prepared using a quantile random forest model to obtain complete model maps (CM-maps). Soil properties were also mapped using the samples from each of the 45 locations for each sampling design to obtain sampling design maps (SD-maps). The SD-maps were evaluated using the independent testing dataset (30 sample locations), and the spatial distribution and model uncertainty of each SD-map were compared with those of the corresponding CM-map. The spatial and feature space coverage were compared across the four sampling designs. The results showed that GS resulted in the most even spatial coverage, cLHS resulted in the best coverage of the feature space, and GS and cLHS resulted in similar prediction accuracies and spatial distributions of soil properties. The SOM content was underestimated using GRS, with large errors at 0–50 cm depth, due to some values not being captured by this sampling design, whereas larger errors for the deeper soil layers were produced using StRS. Predictions of SOM and clay contents had higher accuracy for topsoil (0–30 cm) than for deep subsoil (60–100 cm). It was concluded that the soil sampling designs with either good spatial coverage or feature space coverage can provide good accuracy in 3D DSM, but their performances may be different for different soil properties.展开更多
In digital soil mapping(DSM),a fundamental assumption is that the spatial variability of the target variable can be explained by the predictors or environmental covariates.Strategies to adequately sample the predictor...In digital soil mapping(DSM),a fundamental assumption is that the spatial variability of the target variable can be explained by the predictors or environmental covariates.Strategies to adequately sample the predictors have been well documented,with the conditioned Latin hypercube sampling(cLHS)algorithm receiving the most attention in the DSM community.Despite advances in sampling design,a critical gap remains in determining the number of samples required for DSM projects.We propose a simple workflow and function coded in R language to determine the minimum sample size for the cLHS algorithm based on histograms of the predictor variables using the Freedman-Diaconis rule for determining optimal bin width.Data preprocessing was included to correct for multimodal and non-normally distributed data,as these can affect sample size determination from the histogram.Based on a user-selected quantile range(QR)for the sample plan,the densities of the histogram bins at the upper and lower bounds of the QR were used as a scaling factor to determine minimum sample size.This technique was applied to a field-scale set of environmental covariates for a well-sampled agricultural study site near Guelph,Ontario,Canada,and tested across a range of QRs.The results showed increasing minimum sample size with an increase in the QR selected.Minimum sample size increased from 44 to 83 when the QR increased from 50% to 95% and then increased exponentially to 194 for the 99%QR.This technique provides an estimate of minimum sample size that can be used as an input to the cLHS algorithm.展开更多
Sampling design(SD) plays a crucial role in providing reliable input for digital soil mapping(DSM) and increasing its efficiency.Sampling design, with a predetermined sample size and consideration of budget and spatia...Sampling design(SD) plays a crucial role in providing reliable input for digital soil mapping(DSM) and increasing its efficiency.Sampling design, with a predetermined sample size and consideration of budget and spatial variability, is a selection procedure for identifying a set of sample locations spread over a geographical space or with a good feature space coverage. A good feature space coverage ensures accurate estimation of regression parameters, while spatial coverage contributes to effective spatial interpolation.First, we review several statistical and geometric SDs that mainly optimize the sampling pattern in a geographical space and illustrate the strengths and weaknesses of these SDs by considering spatial coverage, simplicity, accuracy, and efficiency. Furthermore, Latin hypercube sampling, which obtains a full representation of multivariate distribution in geographical space, is described in detail for its development, improvement, and application. In addition, we discuss the fuzzy k-means sampling, response surface sampling, and Kennard-Stone sampling, which optimize sampling patterns in a feature space. We then discuss some practical applications that are mainly addressed by the conditioned Latin hypercube sampling with the flexibility and feasibility of adding multiple optimization criteria. We also discuss different methods of validation, an important stage of DSM, and conclude that an independent dataset selected from the probability sampling is superior for its free model assumptions. For future work, we recommend: 1) exploring SDs with both good spatial coverage and feature space coverage; 2) uncovering the real impacts of an SD on the integral DSM procedure;and 3) testing the feasibility and contribution of SDs in three-dimensional(3 D) DSM with variability for multiple layers.展开更多
基金supported by the National Natural Science Foundation of China (Nos. 41471178, 41530749, and 41431177)the State Key Laboratory of Soil and Sustainable Agriculture, China (No. Y052010002)+2 种基金the Natural Science Research Program of Jiangsu, China (No. 14KJA170001)the National Key Technology Innovation Project for Water Pollution Control and Remediation, China (No. 2013ZX07103006)the National Basic Research Program (973 Program) of China (No. 2015CB954102)
文摘High-resolution and detailed regional soil spatial distribution information is increasingly needed for ecological modeling and land resource management. For areas with no point data, regional soil mapping includes two steps: soil sampling and soil mapping. Because sampling over a large area is costly, efficient sampling strategies are required. A multi-grade representative sampling strategy, which designs a small number of representative samples with different representative grades to depict soil spatial variations at different scales,could be a potentially efficient sampling strategy for regional soil mapping. Additionally, a suitable soil mapping approach is needed to map regional soil variations based on a small number of samples. In this study, the multi-grade representative sampling strategy was applied and a fuzzy membership-weighted soil mapping approach was developed to map soil sand percentage and soil organic carbon(SOC) at 0–20 and 20–40 cm depths in a study area of 5 900 km2 in Anhui Province of China. First, geographical sub-areas were delineated using a parent lithology data layer. Next, fuzzy c-means clustering was applied to two climate and four terrain variables in each stratum. The clustering results(environmental cluster chains) were used to locate representative samples. Evaluations based on an independent validation sample set showed that the addition of samples with lower representativeness generally led to a decrease of root mean square error(RMSE). The declining rates of RMSE with the addition of samples slowed down for 20–40 cm depth, but fluctuated for 0–20 cm depth. The predicted SOC maps based on the representative samples exhibited higher accuracy, especially for soil depth 20–40 cm, as compared to those based on legacy soil data. Multi-grade representative sampling could be an effective sampling strategy at a regional scale. This sampling strategy, combined with the fuzzy membership-based mapping approach, could be an optional effective framework for regional soil property mapping. A more detailed and accurate soil parent material map and the addition of environmental variables representing human activities would improve mapping accuracy.
基金?nancially supported by the National Natural Science Foundation of China (Nos. 41541006 and 41771246)co-funded by Enterprise Ireland and the European Regional Development Fund (ERDF) under the National Strategic Reference Framework (NSRF) 2007–2013
文摘Sampling plays an important role in acquiring precise soil information required in modern agricultural production worldwide, which determines both the cost and quality of final soil mapping products. For sampling design, it has been proposed possibile to transfer the relationships between kriging variance and sampling grid spacing from an area with existing information to other areas with similar soil-forming environments. However, this approach is challenged in practice because of two problems: i) different population vaxiograms among similar areas and ii) sampling errors in estimated variograms. This study evaluated the effects of these two problems on the transferability of the relationships between kriging variance and sampling grid spacing, by using spatial data simulated with three variograms and soil samples collected from four grasslands in Ireland with similar soil-forming environments. Results showed that the variograms suggested by different samples collected with the same grid spacing in the same or similar areas were different, leading to a range of mean kriging variance (MKV) for each grid spacing. With increasing grid spacing, the variation of MKV for a specific grid spacing increased and deviated more from the MKV generated using the population variograms. As a result, the spatial transferability of the relationships between kriging variance and grid spacing for sampling design was limited.
基金Under the auspices of Program of International Science & Technology Cooperation,Ministry of Science and Technology of China(No.2010DFB24140)National Natural Science Foundation of China(No.41023010,41001298)National High Technology Research and Development Program of China(No.2011AA120305)
文摘Soil type maps at the scale of 1︰1 000 000 are used extensively to provide soil spatial distribution information for soil erosion assessment and watershed management models in China.However,the soil property maps produced through conventional direct linking method usually suffer low accuracy as well as the lack of spatial details within a soil type polygon.This paper presents an effective method to produce detailed soil property map based on representative samples which were extracted from each polygon on the 1︰1000 000 soil type map.The representative sample of each polygon is defined as the location that can represent the largest area within the polygon.The representativeness of a candidate sample is determined by calculating the soil-forming environment condition similarities between the sample and other locations.Once the representative sample of each polygon has been chosen,the property values of the existing typical samples are assigned to the corresponding representative samples with the same soil type.Finally,based on these representative samples,the detailed soil property map could be produced by using existing digital soil mapping methods.The case study in XuanCheng City,Anhui Province of China,demonstrated the proposed method could produce soil property map at a higher level of spatial details and accuracy:1)The soil organic matter(SOM)map produced based on the representative samples can not only depict the detailed spatial distribution of SOM within a soil type polygon but also largely reduce the abrupt change of soil property at the boundaries of two adjacent polygons.2)The Root Mean Squared Error(RMSE)of the SOM map based on the representative samples is1.61,and it is 1.37 for the SOM map produced by using conventional direct linking method.Therefore,the proposed method is an effective approach to produce spatial detailed soil property map with higher accuracy for environment simulation models.
基金funded by the National Natural Science Foundation of China (Nos.41871300,41422109,and 41431177)the National Basic Research Program of China (No.2015CB954102)+1 种基金the Priority Academic Program Development of Jiangsu Higher Education Institutions,China (No.164320H116)the Outstanding Innovation Team in Colleges and Universities in Jiangsu Province,China the support from the Innovation Project of State Key Laboratory of Resources and Environmental Information System of China (No.O88RA20CYA)。
文摘The Soil Land Inference Model(SoLIM) was primarily proposed by Zhu et al.(Zhu A X, Band L, Vertessy R, Dutton B. 1997. Derivation of soil properties using a soil land inference model(SoLIM). Soil Sci Soc Am J. 61: 523–533.) and was based on the Third Law of Geography. Based on the assumption that the soil property value at a location of interest will be more similar to that of a given soil sample when the environmental condition at the location of interest is more similar to that at the location from which the sample was taken, SoLIM estimates the soil property value of the location of interest using the soil property values of known samples weighted by the similarity between those samples and the location of interest in terms of an attribute domain of environmental conditions. However, the current SoLIM method ignores information about the spatial distances between the location of interest and those of the sample. In this study, we proposed a new method of soil property mapping, So LIM-IDW, which incorporates spatial distance information into the SoLIM method by means of inverse distance weighting(IDW). The proposed method is based on the assumption that the soil property value at a location of interest will be more similar to that of a known sample both when the environmental conditions are more similar and when the distance between the location of interest and the sample location is shorter. Our evaluation experiments on A-horizon soil organic matter mapping in two study areas with independent evaluation samples showed that the proposed SoLIM-IDW method can obtain lower prediction errors than the original SoLIM method, multiple linear regression, geographically weighted regression, and regression-kriging with the same modeling points. Future work mainly includes the determination of optimal power parameter values and the appropriate setting of the parameter under different application contexts.
基金the National Science and Engineering Research Council of Canada(No.RGPIN-2014-04100)for funding this project.
文摘Digital soil mapping (DSM) aims to produce detailed maps of soil properties or soil classes to improve agricultural management and soil quality assessment. Optimized sampling design can reduce the substantial costs and efforts associated with sampling, profile description, and laboratory analysis. The purpose of this study was to compare common sampling designs for DSM, including grid sampling (GS), grid random sampling (GRS), stratified random sampling (StRS), and conditioned Latin hypercube sampling (cLHS). In an agricultural field (11 ha) in Quebec, Canada, a total of unique 118 locations were selected using each of the four sampling designs (45 locations each), and additional 30 sample locations were selected as an independent testing dataset (evaluation dataset). Soil visible near-infrared (Vis-NIR) spectra were collected in situ at the 148 locations (1 m depth), and soil cores were collected from a subset of 32 locations and subdivided at 10-cm depth intervals, totaling 251 samples. The Cubist model was used to elucidate the relationship between Vis-NIR spectra and soil properties (soil organic matter (SOM) and clay), which was then used to predict the soil properties at all 148 sample locations. Digital maps of soil properties at multiple depths for the entire field (148 sample locations) were prepared using a quantile random forest model to obtain complete model maps (CM-maps). Soil properties were also mapped using the samples from each of the 45 locations for each sampling design to obtain sampling design maps (SD-maps). The SD-maps were evaluated using the independent testing dataset (30 sample locations), and the spatial distribution and model uncertainty of each SD-map were compared with those of the corresponding CM-map. The spatial and feature space coverage were compared across the four sampling designs. The results showed that GS resulted in the most even spatial coverage, cLHS resulted in the best coverage of the feature space, and GS and cLHS resulted in similar prediction accuracies and spatial distributions of soil properties. The SOM content was underestimated using GRS, with large errors at 0–50 cm depth, due to some values not being captured by this sampling design, whereas larger errors for the deeper soil layers were produced using StRS. Predictions of SOM and clay contents had higher accuracy for topsoil (0–30 cm) than for deep subsoil (60–100 cm). It was concluded that the soil sampling designs with either good spatial coverage or feature space coverage can provide good accuracy in 3D DSM, but their performances may be different for different soil properties.
基金the Natural Science and Engineering Research Council(NSERC)of Canada,which supported and funded this project through an NSERC Postgraduate Scholarship—Doctoral(PGS-D)。
文摘In digital soil mapping(DSM),a fundamental assumption is that the spatial variability of the target variable can be explained by the predictors or environmental covariates.Strategies to adequately sample the predictors have been well documented,with the conditioned Latin hypercube sampling(cLHS)algorithm receiving the most attention in the DSM community.Despite advances in sampling design,a critical gap remains in determining the number of samples required for DSM projects.We propose a simple workflow and function coded in R language to determine the minimum sample size for the cLHS algorithm based on histograms of the predictor variables using the Freedman-Diaconis rule for determining optimal bin width.Data preprocessing was included to correct for multimodal and non-normally distributed data,as these can affect sample size determination from the histogram.Based on a user-selected quantile range(QR)for the sample plan,the densities of the histogram bins at the upper and lower bounds of the QR were used as a scaling factor to determine minimum sample size.This technique was applied to a field-scale set of environmental covariates for a well-sampled agricultural study site near Guelph,Ontario,Canada,and tested across a range of QRs.The results showed increasing minimum sample size with an increase in the QR selected.Minimum sample size increased from 44 to 83 when the QR increased from 50% to 95% and then increased exponentially to 194 for the 99%QR.This technique provides an estimate of minimum sample size that can be used as an input to the cLHS algorithm.
基金funded by the Natural Science and Engineering Research Council (NSERC) of Canada (No. RGPIN-2014-04100)
文摘Sampling design(SD) plays a crucial role in providing reliable input for digital soil mapping(DSM) and increasing its efficiency.Sampling design, with a predetermined sample size and consideration of budget and spatial variability, is a selection procedure for identifying a set of sample locations spread over a geographical space or with a good feature space coverage. A good feature space coverage ensures accurate estimation of regression parameters, while spatial coverage contributes to effective spatial interpolation.First, we review several statistical and geometric SDs that mainly optimize the sampling pattern in a geographical space and illustrate the strengths and weaknesses of these SDs by considering spatial coverage, simplicity, accuracy, and efficiency. Furthermore, Latin hypercube sampling, which obtains a full representation of multivariate distribution in geographical space, is described in detail for its development, improvement, and application. In addition, we discuss the fuzzy k-means sampling, response surface sampling, and Kennard-Stone sampling, which optimize sampling patterns in a feature space. We then discuss some practical applications that are mainly addressed by the conditioned Latin hypercube sampling with the flexibility and feasibility of adding multiple optimization criteria. We also discuss different methods of validation, an important stage of DSM, and conclude that an independent dataset selected from the probability sampling is superior for its free model assumptions. For future work, we recommend: 1) exploring SDs with both good spatial coverage and feature space coverage; 2) uncovering the real impacts of an SD on the integral DSM procedure;and 3) testing the feasibility and contribution of SDs in three-dimensional(3 D) DSM with variability for multiple layers.