Although big data is publicly available on water quality parameters,virtual simulation has not yet been adequately adapted in environmental chemistry research.Digital twin is different from conventional geospatial mod...Although big data is publicly available on water quality parameters,virtual simulation has not yet been adequately adapted in environmental chemistry research.Digital twin is different from conventional geospatial modeling approaches and is particularly useful when systematic laboratory/field experiment is not realistic(e.g.,climate impact and water-related environmental catastrophe)or difficult to design and monitor in a real time(e.g.,pollutant and nutrient cycles in estuaries,soils,and sediments).Data-driven water research could realize early warning and disaster readiness simulations for diverse environmental scenarios,including drinking water contamination.展开更多
We evaluated the use of spatial sampling and satellite images to identify deforested areas in Wonju, South Korea. The changes in land cover were identified using a grid of sample points overlaid onto medium and high-r...We evaluated the use of spatial sampling and satellite images to identify deforested areas in Wonju, South Korea. The changes in land cover were identified using a grid of sample points overlaid onto medium and high-resolution remote sensing (RS) satellite images. Deforestation identified in this way (hereafter, RSD) was compared to administrative data on deforestation. We also compared high-resolution satellite images (HR-RSD) and actual deforestation based on categories which were Intergovernmental Panel on Climate Change data. RSD generated by medium-resolution satellite images overesti- mated the amount of deforested area by 1.5-2.4 times the actual deforested area, whereas RSD generated by HR- RSD underestimated the amount of deforested area by 0.4-0.9 times the actual area. The highest degree of matching (90 %) was found in HR-RSD with a grid interval of 500 m and the accuracy of HR-RSD was the highest, at 67 %. The results also revealed that the largest cause of deforestation was the establishment of settlements followed by conversion to cropland and grassland. We conclude that for the identification of deforestation using satellite images, HR-RSD with a grid interval of 500 m is most suitable.展开更多
Challenges in Big Data analysis arise due to the way the data are recorded, maintained, processed and stored. We demonstrate that a hierarchical, multivariate, statistical machine learning algorithm, namely Boosted Re...Challenges in Big Data analysis arise due to the way the data are recorded, maintained, processed and stored. We demonstrate that a hierarchical, multivariate, statistical machine learning algorithm, namely Boosted Regression Tree (BRT) can address Big Data challenges to drive decision making. The challenge of this study is lack of interoperability since the data, a collection of GIS shapefiles, remotely sensed imagery, and aggregated and interpolated spatio-temporal information, are stored in monolithic hardware components. For the modelling process, it was necessary to create one common input file. By merging the data sources together, a structured but noisy input file, showing inconsistencies and redundancies, was created. Here, it is shown that BRT can process different data granularities, heterogeneous data and missingness. In particular, BRT has the advantage of dealing with missing data by default by allowing a split on whether or not a value is missing as well as what the value is. Most importantly, the BRT offers a wide range of possibilities regarding the interpretation of results and variable selection is automatically performed by considering how frequently a variable is used to define a split in the tree. A comparison with two similar regression models (Random Forests and Least Absolute Shrinkage and Selection Operator, LASSO) shows that BRT outperforms these in this instance. BRT can also be a starting point for sophisticated hierarchical modelling in real world scenarios. For example, a single or ensemble approach of BRT could be tested with existing models in order to improve results for a wide range of data-driven decisions and applications.展开更多
This paper describes an improved algorithm for fuzzy c-means clustering of remotely sensed data, by which the degree of fuzziness of the resultant classification is de- creased as comparing with that by a conventional...This paper describes an improved algorithm for fuzzy c-means clustering of remotely sensed data, by which the degree of fuzziness of the resultant classification is de- creased as comparing with that by a conventional algorithm: that is, the classification accura- cy is increased. This is achieved by incorporating covariance matrices at the level of individual classes rather than assuming a global one. Empirical results from a fuzzy classification of an Edinburgh suburban land cover confirmed the improved performance of the new algorithm for fuzzy c-means clustering, in particular when fuzziness is also accommodated in the assumed reference data.展开更多
An extended self-organizing map for supervised classification is proposed in this paper.Unlike other traditional SOMs,the model has an input layer,a Kohonen layer,and an output layer.The number of neurons in the input...An extended self-organizing map for supervised classification is proposed in this paper.Unlike other traditional SOMs,the model has an input layer,a Kohonen layer,and an output layer.The number of neurons in the input layer depends on the dimensionality of input patterns.The number of neurons in the output layer equals the number of the desired classes.The number of neurons in the Kohonen layer may be a few to several thousands,which depends on the complexity of classification problems and the classification precision.Each training sample is expressed by a pair of vectors: an input vector and a class codebook vector.When a training sample is input into the model,Kohonen's competitive learning rule is applied to selecting the winning neuron from the Kohonen layer and the weight coefficients connecting all the neurons in the input layer with both the winning neuron and its neighbors in the Kohonen layer are modified to be closer to the input vector,and those connecting all the neurons around the winning neuron within a certain diameter in the Kohonen layer with all the neurons in the output layer are adjusted to be closer to the class codebook vector.If the number of training samples is sufficiently large and the learning epochs iterate enough times,the model will be able to serve as a supervised classifier.The model has been tentatively applied to the supervised classification of multispectral remotely sensed data.The author compared the performances of the extended SOM and BPN in remotely sensed data classification.The investigation manifests that the extended SOM is feasible for supervised classification.展开更多
Communication behavior recognition is an issue with increasingly importance in the antiterrorism and national defense area.However,the sensing data obtained in actual environment is often not sufficient to accurately ...Communication behavior recognition is an issue with increasingly importance in the antiterrorism and national defense area.However,the sensing data obtained in actual environment is often not sufficient to accurately analyze the communication behavior.Traditional means can hardly utilize the scarce and crude spectrum sensing data captured in a real scene.Thus,communication behavior recognition using raw sensing data under smallsample condition has become a new challenge.In this paper,a data enhanced communication behavior recognition(DECBR)scheme is proposed to meet this challenge.Firstly,a preprocessing method is designed to make the raw spectrum data suitable for the proposed scheme.Then,an adaptive convolutional neural network structure is exploited to carry out communication behavior recognition.Moreover,DCGAN is applied to support data enhancement,which realize communication behavior recognition under small-sample condition.Finally,the scheme is verified by experiments under different data size.The results show that the DECBR scheme can greatly improve the accuracy and efficiency of behavior recognition under smallsample condition.展开更多
In this article, the extension to three dimensions (3D) of the blending technique that has been widely used in two dimensions (2D) to calibrate ocean chlorophyll is presented. The results thus obtained revealed a very...In this article, the extension to three dimensions (3D) of the blending technique that has been widely used in two dimensions (2D) to calibrate ocean chlorophyll is presented. The results thus obtained revealed a very high degree of efficiency when predicting observed values of ocean chlorophyll. The mean squared difference between the predicted and observed values of ocean chlorophyll when 3D technique was used fell far below the tolerance level which was set to the difference between satellite and observed in-situ values. The resulting blended field did not only provide better predictions of the in situ observations in areas where bottle samples cannot be obtained but also provided a smooth variation of the distribution of ocean chlorophyll throughout the year. An added advantage is its computational efficiency since data that would have been treated at least four times would be treated only once. With the advent of these results, it is believed that the modelling of the ocean life cycle will become more realistic.展开更多
The salinity of the salt lake is an important factor to evaluate whether it contains some mineral resources or not,the fault buried in the salt lake could control the abundance of the salinity.Therefore,it is of great...The salinity of the salt lake is an important factor to evaluate whether it contains some mineral resources or not,the fault buried in the salt lake could control the abundance of the salinity.Therefore,it is of great geological importance to identify the fault buried in the salt lake.Taking the Gasikule Salt Lake in China for example,the paper established a new method to identify the fault buried in the salt lake based on the multi-source remote sensing data including Landsat TM,SPOT-5 and ASTER data.It includes the acquisition and selection of the multi-source remote sensing data,data preprocessing,lake waterfront extraction,spectrum extraction of brine with different salinity,salinity index construction,salinity separation,analysis of the abnormal salinity and identification of the fault buried in salt lake,temperature inversion of brine and the fault verification.As a result,the study identified an important fault buried in the east of the Gasikule Salt Lake that controls the highest salinity abnormal.Because the level of the salinity is positively correlated to the mineral abundance,the result provides the important reference to identify the water body rich in mineral resources in the salt lake.展开更多
Due to the fact that a memristor with memory properties is an ideal electronic component for implementation of the artificial neural synaptic function,a brand-new tristable locally active memristor model is first prop...Due to the fact that a memristor with memory properties is an ideal electronic component for implementation of the artificial neural synaptic function,a brand-new tristable locally active memristor model is first proposed in this paper.Here,a novel four-dimensional fractional-order memristive cellular neural network(FO-MCNN)model with hidden attractors is constructed to enhance the engineering feasibility of the original CNN model and its performance.Then,its hardware circuit implementation and complicated dynamic properties are investigated on multi-simulation platforms.Subsequently,it is used toward secure communication application scenarios.Taking it as the pseudo-random number generator(PRNG),a new privacy image security scheme is designed based on the adaptive sampling rate compressive sensing(ASR-CS)model.Eventually,the simulation analysis and comparative experiments manifest that the proposed data encryption scheme possesses strong immunity against various security attack models and satisfactory compression performance.展开更多
The integrated approach of various techniques which historically have been used independently is key to successful exploration, development, exploitation and management of the groundwater resources. The integration of...The integrated approach of various techniques which historically have been used independently is key to successful exploration, development, exploitation and management of the groundwater resources. The integration of Remote Sensing (RS), Geographical Information Systems (GIS) and Borehole data has been used in the study area to assess their applicability in groundwater investigation. The area of study lies in the arid and semi-arid lands (ASALs) where principally remote sensing data has been used in extraction of various thematic maps (lithology, lineament, drainage density, and Digital Elevation Model Maps) for groundwater assessment. The GIS platform was used in integrating the RS data and data of productive boreholes. The lineaments generated through remote sensing agree well with structural geology of the area, where high density lineament points overlays the points of intense faulting. Lineaments found in the area correlate well with fault zones, fractures, and lithological contrasts as supported by geological map and structural map. Weathering, faulting and fracturing of the rocks mean a possible increase or a reduction in specific capacities as observed in productive boreholes in sedimentary rocks or igneous/basaltic rocks of the area. Similarly, it is noted that the degree of faulting affects the degree of radius of influence of a borehole in a particular area. These analyses show that groundwater potential within the Sub-County varies spatially with high dependency on geological structures in the basement region and more on geology within the volcanic and younger sediments.展开更多
Compressive sensing is a powerful method for reconstruction of sparsely-sampled data, based on statistical optimization. It can be applied to a range of flow measurement and visualization data, and in this work we sho...Compressive sensing is a powerful method for reconstruction of sparsely-sampled data, based on statistical optimization. It can be applied to a range of flow measurement and visualization data, and in this work we show the usage in groundwater mapping. Due to scarcity of water in many regions of the world, including southwestern United States, monitoring and management of groundwater is of utmost importance. A complete mapping of groundwater is difficult since the monitored sites are far from one another, and thus the data sets are considered extremely “sparse”. To overcome this difficulty in complete mapping of groundwater, compressive sensing is an ideal tool, as it bypasses the classical Nyquist criterion. We show that compressive sensing can effectively be used for reconstructions of groundwater level maps, by validating against data. This approach can have an impact on geographical sensing and information, as effective monitoring and management are enabled without constructing numerous or expensive measurement sites for groundwater.展开更多
With the rapid change in the Arctic sea ice,a large number of sea ice observations have been collected in recent years,and it is expected that an even larger number of such observations will emerge in the coming years...With the rapid change in the Arctic sea ice,a large number of sea ice observations have been collected in recent years,and it is expected that an even larger number of such observations will emerge in the coming years.To make the best use of these observations,in this paper we develop a multi-sensor optimal data merging(MODM)method to merge any number of different sea ice observations.Since such merged data are independent on model forecast,they are valid for model initialization and model validation.Based on the maximum likelihood estimation theory,we prove that any model assimilated with the merged data is equivalent to assimilating the original multi-sensor data.This greatly facilitates sea ice data assimilation,particularly for operational forecast with limited computational resources.We apply the MODM method to merge sea ice concentration(SIC)and sea ice thickness(SIT),respectively,in the Arctic.For SIC merging,the Special Sensor Microwave Imager/Sounder(SSMIS)and Advanced Microwave Scanning Radiometer 2(AMSR2)data are merged together with the Norwegian Ice Service ice chart.This substantially reduces the uncertainties at the ice edge and in the coastal areas.For SIT merging,the daily Soil Moisture and Ocean Salinity(SMOS)data is merged with the weekly-mean merged CryoSat-2 and SMOS(CS2SMOS)data.This generates a new daily CS2SMOS SIT data with better spatial coverage for the whole Arctic.展开更多
文摘Although big data is publicly available on water quality parameters,virtual simulation has not yet been adequately adapted in environmental chemistry research.Digital twin is different from conventional geospatial modeling approaches and is particularly useful when systematic laboratory/field experiment is not realistic(e.g.,climate impact and water-related environmental catastrophe)or difficult to design and monitor in a real time(e.g.,pollutant and nutrient cycles in estuaries,soils,and sediments).Data-driven water research could realize early warning and disaster readiness simulations for diverse environmental scenarios,including drinking water contamination.
文摘We evaluated the use of spatial sampling and satellite images to identify deforested areas in Wonju, South Korea. The changes in land cover were identified using a grid of sample points overlaid onto medium and high-resolution remote sensing (RS) satellite images. Deforestation identified in this way (hereafter, RSD) was compared to administrative data on deforestation. We also compared high-resolution satellite images (HR-RSD) and actual deforestation based on categories which were Intergovernmental Panel on Climate Change data. RSD generated by medium-resolution satellite images overesti- mated the amount of deforested area by 1.5-2.4 times the actual deforested area, whereas RSD generated by HR- RSD underestimated the amount of deforested area by 0.4-0.9 times the actual area. The highest degree of matching (90 %) was found in HR-RSD with a grid interval of 500 m and the accuracy of HR-RSD was the highest, at 67 %. The results also revealed that the largest cause of deforestation was the establishment of settlements followed by conversion to cropland and grassland. We conclude that for the identification of deforestation using satellite images, HR-RSD with a grid interval of 500 m is most suitable.
文摘Challenges in Big Data analysis arise due to the way the data are recorded, maintained, processed and stored. We demonstrate that a hierarchical, multivariate, statistical machine learning algorithm, namely Boosted Regression Tree (BRT) can address Big Data challenges to drive decision making. The challenge of this study is lack of interoperability since the data, a collection of GIS shapefiles, remotely sensed imagery, and aggregated and interpolated spatio-temporal information, are stored in monolithic hardware components. For the modelling process, it was necessary to create one common input file. By merging the data sources together, a structured but noisy input file, showing inconsistencies and redundancies, was created. Here, it is shown that BRT can process different data granularities, heterogeneous data and missingness. In particular, BRT has the advantage of dealing with missing data by default by allowing a split on whether or not a value is missing as well as what the value is. Most importantly, the BRT offers a wide range of possibilities regarding the interpretation of results and variable selection is automatically performed by considering how frequently a variable is used to define a split in the tree. A comparison with two similar regression models (Random Forests and Least Absolute Shrinkage and Selection Operator, LASSO) shows that BRT outperforms these in this instance. BRT can also be a starting point for sophisticated hierarchical modelling in real world scenarios. For example, a single or ensemble approach of BRT could be tested with existing models in order to improve results for a wide range of data-driven decisions and applications.
文摘This paper describes an improved algorithm for fuzzy c-means clustering of remotely sensed data, by which the degree of fuzziness of the resultant classification is de- creased as comparing with that by a conventional algorithm: that is, the classification accura- cy is increased. This is achieved by incorporating covariance matrices at the level of individual classes rather than assuming a global one. Empirical results from a fuzzy classification of an Edinburgh suburban land cover confirmed the improved performance of the new algorithm for fuzzy c-means clustering, in particular when fuzziness is also accommodated in the assumed reference data.
基金Supported by National Natural Science Foundation of China (No. 40872193)
文摘An extended self-organizing map for supervised classification is proposed in this paper.Unlike other traditional SOMs,the model has an input layer,a Kohonen layer,and an output layer.The number of neurons in the input layer depends on the dimensionality of input patterns.The number of neurons in the output layer equals the number of the desired classes.The number of neurons in the Kohonen layer may be a few to several thousands,which depends on the complexity of classification problems and the classification precision.Each training sample is expressed by a pair of vectors: an input vector and a class codebook vector.When a training sample is input into the model,Kohonen's competitive learning rule is applied to selecting the winning neuron from the Kohonen layer and the weight coefficients connecting all the neurons in the input layer with both the winning neuron and its neighbors in the Kohonen layer are modified to be closer to the input vector,and those connecting all the neurons around the winning neuron within a certain diameter in the Kohonen layer with all the neurons in the output layer are adjusted to be closer to the class codebook vector.If the number of training samples is sufficiently large and the learning epochs iterate enough times,the model will be able to serve as a supervised classifier.The model has been tentatively applied to the supervised classification of multispectral remotely sensed data.The author compared the performances of the extended SOM and BPN in remotely sensed data classification.The investigation manifests that the extended SOM is feasible for supervised classification.
基金supported by the National Natural Science Foundation of China(No.61971439 and No.61702543)the Natural Science Foundation of the Jiangsu Province of China(No.BK20191329)+1 种基金the China Postdoctoral Science Foundation Project(No.2019T120987)the Startup Foundation for Introducing Talent of NUIST(No.2020r100).
文摘Communication behavior recognition is an issue with increasingly importance in the antiterrorism and national defense area.However,the sensing data obtained in actual environment is often not sufficient to accurately analyze the communication behavior.Traditional means can hardly utilize the scarce and crude spectrum sensing data captured in a real scene.Thus,communication behavior recognition using raw sensing data under smallsample condition has become a new challenge.In this paper,a data enhanced communication behavior recognition(DECBR)scheme is proposed to meet this challenge.Firstly,a preprocessing method is designed to make the raw spectrum data suitable for the proposed scheme.Then,an adaptive convolutional neural network structure is exploited to carry out communication behavior recognition.Moreover,DCGAN is applied to support data enhancement,which realize communication behavior recognition under small-sample condition.Finally,the scheme is verified by experiments under different data size.The results show that the DECBR scheme can greatly improve the accuracy and efficiency of behavior recognition under smallsample condition.
文摘In this article, the extension to three dimensions (3D) of the blending technique that has been widely used in two dimensions (2D) to calibrate ocean chlorophyll is presented. The results thus obtained revealed a very high degree of efficiency when predicting observed values of ocean chlorophyll. The mean squared difference between the predicted and observed values of ocean chlorophyll when 3D technique was used fell far below the tolerance level which was set to the difference between satellite and observed in-situ values. The resulting blended field did not only provide better predictions of the in situ observations in areas where bottle samples cannot be obtained but also provided a smooth variation of the distribution of ocean chlorophyll throughout the year. An added advantage is its computational efficiency since data that would have been treated at least four times would be treated only once. With the advent of these results, it is believed that the modelling of the ocean life cycle will become more realistic.
基金This work was supported by the National Advance Research Program(Item No.Y1601-1).
文摘The salinity of the salt lake is an important factor to evaluate whether it contains some mineral resources or not,the fault buried in the salt lake could control the abundance of the salinity.Therefore,it is of great geological importance to identify the fault buried in the salt lake.Taking the Gasikule Salt Lake in China for example,the paper established a new method to identify the fault buried in the salt lake based on the multi-source remote sensing data including Landsat TM,SPOT-5 and ASTER data.It includes the acquisition and selection of the multi-source remote sensing data,data preprocessing,lake waterfront extraction,spectrum extraction of brine with different salinity,salinity index construction,salinity separation,analysis of the abnormal salinity and identification of the fault buried in salt lake,temperature inversion of brine and the fault verification.As a result,the study identified an important fault buried in the east of the Gasikule Salt Lake that controls the highest salinity abnormal.Because the level of the salinity is positively correlated to the mineral abundance,the result provides the important reference to identify the water body rich in mineral resources in the salt lake.
文摘Due to the fact that a memristor with memory properties is an ideal electronic component for implementation of the artificial neural synaptic function,a brand-new tristable locally active memristor model is first proposed in this paper.Here,a novel four-dimensional fractional-order memristive cellular neural network(FO-MCNN)model with hidden attractors is constructed to enhance the engineering feasibility of the original CNN model and its performance.Then,its hardware circuit implementation and complicated dynamic properties are investigated on multi-simulation platforms.Subsequently,it is used toward secure communication application scenarios.Taking it as the pseudo-random number generator(PRNG),a new privacy image security scheme is designed based on the adaptive sampling rate compressive sensing(ASR-CS)model.Eventually,the simulation analysis and comparative experiments manifest that the proposed data encryption scheme possesses strong immunity against various security attack models and satisfactory compression performance.
文摘The integrated approach of various techniques which historically have been used independently is key to successful exploration, development, exploitation and management of the groundwater resources. The integration of Remote Sensing (RS), Geographical Information Systems (GIS) and Borehole data has been used in the study area to assess their applicability in groundwater investigation. The area of study lies in the arid and semi-arid lands (ASALs) where principally remote sensing data has been used in extraction of various thematic maps (lithology, lineament, drainage density, and Digital Elevation Model Maps) for groundwater assessment. The GIS platform was used in integrating the RS data and data of productive boreholes. The lineaments generated through remote sensing agree well with structural geology of the area, where high density lineament points overlays the points of intense faulting. Lineaments found in the area correlate well with fault zones, fractures, and lithological contrasts as supported by geological map and structural map. Weathering, faulting and fracturing of the rocks mean a possible increase or a reduction in specific capacities as observed in productive boreholes in sedimentary rocks or igneous/basaltic rocks of the area. Similarly, it is noted that the degree of faulting affects the degree of radius of influence of a borehole in a particular area. These analyses show that groundwater potential within the Sub-County varies spatially with high dependency on geological structures in the basement region and more on geology within the volcanic and younger sediments.
文摘Compressive sensing is a powerful method for reconstruction of sparsely-sampled data, based on statistical optimization. It can be applied to a range of flow measurement and visualization data, and in this work we show the usage in groundwater mapping. Due to scarcity of water in many regions of the world, including southwestern United States, monitoring and management of groundwater is of utmost importance. A complete mapping of groundwater is difficult since the monitored sites are far from one another, and thus the data sets are considered extremely “sparse”. To overcome this difficulty in complete mapping of groundwater, compressive sensing is an ideal tool, as it bypasses the classical Nyquist criterion. We show that compressive sensing can effectively be used for reconstructions of groundwater level maps, by validating against data. This approach can have an impact on geographical sensing and information, as effective monitoring and management are enabled without constructing numerous or expensive measurement sites for groundwater.
基金EUMETSAT,Norwegian Ice Service,University of Bremen,University of Hamburg,and Alfred Wegener Institute are gratefully acknowledged for providing the dataWe thank two anonymous reviewers for their helpful commentsThis study was supported by the Norwegian Research Council through the SPARSE project(Grant no.254765)and CIRFA project(Grant no.237906).
文摘With the rapid change in the Arctic sea ice,a large number of sea ice observations have been collected in recent years,and it is expected that an even larger number of such observations will emerge in the coming years.To make the best use of these observations,in this paper we develop a multi-sensor optimal data merging(MODM)method to merge any number of different sea ice observations.Since such merged data are independent on model forecast,they are valid for model initialization and model validation.Based on the maximum likelihood estimation theory,we prove that any model assimilated with the merged data is equivalent to assimilating the original multi-sensor data.This greatly facilitates sea ice data assimilation,particularly for operational forecast with limited computational resources.We apply the MODM method to merge sea ice concentration(SIC)and sea ice thickness(SIT),respectively,in the Arctic.For SIC merging,the Special Sensor Microwave Imager/Sounder(SSMIS)and Advanced Microwave Scanning Radiometer 2(AMSR2)data are merged together with the Norwegian Ice Service ice chart.This substantially reduces the uncertainties at the ice edge and in the coastal areas.For SIT merging,the daily Soil Moisture and Ocean Salinity(SMOS)data is merged with the weekly-mean merged CryoSat-2 and SMOS(CS2SMOS)data.This generates a new daily CS2SMOS SIT data with better spatial coverage for the whole Arctic.