Most known mineral deposits were discovered by accident using expensive,time-consuming,and knowledgebased methods such as stream sediment geochemical data,diamond drilling,reconnaissance geochemical and geophysical su...Most known mineral deposits were discovered by accident using expensive,time-consuming,and knowledgebased methods such as stream sediment geochemical data,diamond drilling,reconnaissance geochemical and geophysical surveys,and/or remote sensing.Recent years have seen a decrease in the number of newly discovered mineral deposits and a rise in demand for critical raw materials,prompting exploration geologists to seek more efficient and inventive ways for processing various data types at different phases of mineral exploration.Remote sensing is one of the most sought-after tools for early-phase mineral prospecting because of its broad coverage and low cost.Remote sensing images from satellites are publicly available and can be utilised for lithological mapping and mineral exploitation.In this study,we extend an artificial intelligence-based,unsupervised anomaly detection method to identify iron deposit occurrence using Landsat-8 Operational Land Imager(OLI)satellite imagery and machine learning.The novelty in our method includes:(1)knowledge-guided and unsupervised anomaly detection that does not assume any specific anomaly signatures;(2)detection of anomalies occurs only in the variable domain;and(3)a choice of a range of machine learning algorithms to balance between explain-ability and performance.Our new unsupervised method detects anomalies through three successive stages,namely(a)stage Ⅰ–acquisition of satellite imagery,data processing and selection of bands,(b)stage Ⅱ–predictive modelling and anomaly detection,and(c)stage Ⅲ–construction of anomaly maps and analysis.In this study,the new method was tested over the Assen iron deposit in the Transvaal Supergroup(South Africa).It detected both the known areas of the Assen iron deposit and additional deposit occurrence features around the Assen iron mine that were not known.To summarise the anomalies in the area,principal component analysis was used on the reconstruction errors across all modelled bands.Our method enhanced the Assen deposit as an anomaly and attenuated the background,including anthropogenic structural anomalies,which resulted in substantially improved visual contrast and delineation of the iron deposit relative to the background.The results demonstrate the robustness of the proposed unsupervised anomaly detection method,and it could be useful for the delineation of mineral exploration targets.In particular,the method will be useful in areas where no data labels exist regarding the existence or specific spectral signatures of anomalies,such as mineral deposits under greenfield exploration.展开更多
Remote sensing data is a cheap form of surficial geoscientific data,and in terms of veracity,velocity and volume,can sometimes be considered big data.Its spatial and spectral resolution continues to improve over time,...Remote sensing data is a cheap form of surficial geoscientific data,and in terms of veracity,velocity and volume,can sometimes be considered big data.Its spatial and spectral resolution continues to improve over time,and some modern satellites,such as the Copernicus Programme’s Sentinel-2 remote sensing satellites,offer a spatial resolution of 10 m across many of their spectral bands.The abundance and quality of remote sensing data combined with accumulated primary geochemical data has provided an unprecedented opportunity to inferentially invert remote sensing data into geochemical data.The ability to derive geochemical data from remote sensing data would provide a form of secondary big geochemical data,which can be used for numerous downstream activities,particularly where data timeliness,volume and velocity are important.Major benefactors of secondary geochemical data would be environmental monitoring and applications of artificial intelligence and machine learning in geochemistry,which currently entirely relies on manually derived data that is primarily guided by scientific reduction.Furthermore,it permits the usage of well-established data analysis techniques from geochemistry to remote sensing that allows useable insights to be extracted beyond those typically associated with strictly remote sensing data analysis.Currently,no generally applicable and systematic method to derive chemical elemental concentrations from large-scale remote sensing data have been documented in geosciences.In this paper,we demonstrate that fusing geostatistically-augmented geochemical and remote sensing data produces an abundance of data that enables a more generalized machine learning-based geochemical data generation.We use gold grade data from a South African tailing storage facility(TSF)and data from both the Landsat-8 and Sentinel remote sensing satellites.We show that various machine learning algorithms can be used given the abundance of training data.Consequently,we are able to produce a high resolution(10 m grid size)gold concentration map of the TSF,which demonstrates the potential of our method to be used to guide extraction planning,online resource exploration,environmental monitoring and resource estimation.展开更多
Evolution in geoscientific data provides the mineral industry with new opportunities.A direction of geochemical data generation evolution is towards big data to meet the demands of data-driven usage scenarios that rel...Evolution in geoscientific data provides the mineral industry with new opportunities.A direction of geochemical data generation evolution is towards big data to meet the demands of data-driven usage scenarios that rely on data velocity.This direction is more significant where traditional geochemical data are not ideal,which is the case for evaluating unconventional resources,such as tailing storage facilities(TSFs),because they are not static due to sedimentation,compaction and changes associated with hydrospheric and lithospheric processes(e.g.,erosion,saltation and mobility of chemical constituents).In this paper,we generate big secondary geochemical data derived from Sentinel-2 satellite-remote sensing data to showcase the benefits of big geochemical data using TSFs from the Witwatersrand Basin(South Africa).Using spatially fused remote sensing and legacy geochemical data on the Dump 20 TSF,we trained a machine learning model to predict in-situ gold grades.Subsequently,we deployed the model to the Lindum TSF,which is 3 km away,over a period of a few years(2015-2019).We were able to visualize and analyze the temporal variation in the spatial distributions of the gold grade of the Lindum TSF.Additionally,we were able to infer extraction sequencing(to the resolution of the data),acid mine drainage formation and seasonal migration.These findings suggest that dynamic mineral resource models and live geochemical monitoring(e.g.,of elemental mobility and structural changes)are possible without additional physical sampling.展开更多
In this study,we present a machine learning-based method to predict trace element concentrations from major and minor element concentration data using a legacy lithogeochemical database of magmatic rocks from the Karo...In this study,we present a machine learning-based method to predict trace element concentrations from major and minor element concentration data using a legacy lithogeochemical database of magmatic rocks from the Karoo large igneous province(Gondwana Supercontinent).Wedemonstrate that a variety of trace elements,including most of the lanthanides,chalcophile,lithophile,and siderophile elements,can be predicted with excellent accuracy.This finding reveals that there are reliable,high-dimensional elemental associations that can be used to predict trace elements in a range of plutonic and volcanic rocks.Since the major and minor elements are used as predictors,prediction performance can be used as a direct proxy for geochemical anomalies.As such,our proposed method is suitable for prospective exploration by identifying anomalous trace element concentrations.Compared to multivariate compositional data analysis methods,the new method does not rely on assumptions of stoichiometric combinations of elements in the data to discover geochemical anomalies.Because we do not use multivariate compositional data analysis techniques(e.g.principal component analysis and combined use of major,minor and trace elements data),we also show that log-ratio transforms do not increase the performance of the proposed approach and are unnecessary for algorithms that are not spatially aware in the feature space.Therefore,we demonstrate that high-dimensional elemental associations can be modelled in an automated manner through a data-driven approach and without assumptions of stoichiometry within the data.The approach proposed in this study can be used as a replacement method to the multivariate compositional data analysis technique that is used for prospectivity mapping,or be used as a pre-processor to reduce the detection of false geochemical anomalies,particularly where the data is of variable quality.展开更多
In exploration geochemistry,advances in the detection limit,breadth of elements analyze-able,accuracy and precision of analytical instruments have motivated the re-analysis of legacy samples to improve confidence in g...In exploration geochemistry,advances in the detection limit,breadth of elements analyze-able,accuracy and precision of analytical instruments have motivated the re-analysis of legacy samples to improve confidence in geochemical data and gain more insights into potentially mineralized areas.While a re-analysis campaign in a geochemical exploration program modernizes legacy geochemical data by providing more trustworthy and higher-dimensional geochemical data,especially where modern data is considerably different than legacy data,it is an expensive exercise.The risk associated with modernizing such legacy data lies within its uncertainty in return(e.g.,the possibility of new discoveries,in primarily greenfield settings).Without any advanced knowledge of yet unanalyzed elements,the importance of re-analyses remains ambiguous.To address this uncertainty,we apply machine learning to multivariate geochemical data from different regions in Canada(i.e.,the Churchill Province and the Trans-Hudson Orogen)in order to use legacy geochemical data to predict modern and higher dimensional multi-elemental concentrations ahead of planned re-analyses.Our study demonstrates that legacy and modern geochemical data can be repurposed to predict yet unanalyzed elements that will be realized from re-analyses and in a manner that significantly reduces the latency to downstream usage of modern geochemical data(e.g.,prospectivity mapping).Findings from this study serve as a pillar of a framework for exploration geologists to predictively explore and prioritize potentially mineralized districts for further prospects in a timely manner before employing more invasive and expensive techniques.展开更多
Mineral exploration campaigns are financially risky.Several state-of-the-art methods have been developed to mitigate the risk,including predictive modelling of mineral prospectivity using principal component analysis(...Mineral exploration campaigns are financially risky.Several state-of-the-art methods have been developed to mitigate the risk,including predictive modelling of mineral prospectivity using principal component analysis(PCA)and geographic information systems(GIS).The PCA and GIS approach is currently considered acceptable for generating mineral exploration targets.However,some of its limitations are the dependence on sample stoichiometry(e.g.,the existence of minerals),the necessity of log-ratio transformations when dealing with compositional data,and manual interpretation and use of principal components to enhance potential geochemical anomalies for prospectivity mapping.In this study,we generalize the fundamental ideas behind the PCA and GIS approach by developing a new data-driven approach using ML.We showcase a new workflow capable of generating either intermediate evidence layers or final prospectivity maps that depict major regional geochemical anomalies using multi-element geochemical data from Southeastern Churchill Province(Quebec and Labrador),Canada.The region is known for its REEs endowment and the data were gathered for prospectivity mapping.A comparison with the established multivariate hybrid data-and knowledge-based approach revealed that on a roughly comparable basis of the amount of manual effort,our new data-driven procedure can much more accurately identify geochemical anomalies in both univariate and multivariate applications.The results of our prospectivity mapping corroborate with the ground truth or known geological anomalies in the studied region.These findings have potentially wider implications on exploration target generation,where project risks(financial,environmental,political,etc.)and geochemical anomalies must be quantified using robust and effective datadriven approaches.In addition,our methodology is more replicable and objective,as manual geoscientific interpretation is not required during the detection of geochemical anomalies.展开更多
基金Supported by a Department of Science and Innovation(DSI)-National Research Foundation(NRF)Thuthuka Grant(Grant UID:121973)and DSI-NRF CIMERA.
文摘Most known mineral deposits were discovered by accident using expensive,time-consuming,and knowledgebased methods such as stream sediment geochemical data,diamond drilling,reconnaissance geochemical and geophysical surveys,and/or remote sensing.Recent years have seen a decrease in the number of newly discovered mineral deposits and a rise in demand for critical raw materials,prompting exploration geologists to seek more efficient and inventive ways for processing various data types at different phases of mineral exploration.Remote sensing is one of the most sought-after tools for early-phase mineral prospecting because of its broad coverage and low cost.Remote sensing images from satellites are publicly available and can be utilised for lithological mapping and mineral exploitation.In this study,we extend an artificial intelligence-based,unsupervised anomaly detection method to identify iron deposit occurrence using Landsat-8 Operational Land Imager(OLI)satellite imagery and machine learning.The novelty in our method includes:(1)knowledge-guided and unsupervised anomaly detection that does not assume any specific anomaly signatures;(2)detection of anomalies occurs only in the variable domain;and(3)a choice of a range of machine learning algorithms to balance between explain-ability and performance.Our new unsupervised method detects anomalies through three successive stages,namely(a)stage Ⅰ–acquisition of satellite imagery,data processing and selection of bands,(b)stage Ⅱ–predictive modelling and anomaly detection,and(c)stage Ⅲ–construction of anomaly maps and analysis.In this study,the new method was tested over the Assen iron deposit in the Transvaal Supergroup(South Africa).It detected both the known areas of the Assen iron deposit and additional deposit occurrence features around the Assen iron mine that were not known.To summarise the anomalies in the area,principal component analysis was used on the reconstruction errors across all modelled bands.Our method enhanced the Assen deposit as an anomaly and attenuated the background,including anthropogenic structural anomalies,which resulted in substantially improved visual contrast and delineation of the iron deposit relative to the background.The results demonstrate the robustness of the proposed unsupervised anomaly detection method,and it could be useful for the delineation of mineral exploration targets.In particular,the method will be useful in areas where no data labels exist regarding the existence or specific spectral signatures of anomalies,such as mineral deposits under greenfield exploration.
基金provided by the Department of Science and Innovation(DSI)-National Research Foundation(NRF)Thuthuka Grant(Grant UID:121,973)DSI-NRF CIMERA.Yousef Ghorbani acknowledges financial support from the Centre for Advanced Mining and Metallurgy(CAMM),a strategic research environment established at the LuleåUniversity of Technology funded by the Swedish governmentWe also thank Sibanye-Stillwater Ltd.For their funding through the Wits Mining Institute(WMI).
文摘Remote sensing data is a cheap form of surficial geoscientific data,and in terms of veracity,velocity and volume,can sometimes be considered big data.Its spatial and spectral resolution continues to improve over time,and some modern satellites,such as the Copernicus Programme’s Sentinel-2 remote sensing satellites,offer a spatial resolution of 10 m across many of their spectral bands.The abundance and quality of remote sensing data combined with accumulated primary geochemical data has provided an unprecedented opportunity to inferentially invert remote sensing data into geochemical data.The ability to derive geochemical data from remote sensing data would provide a form of secondary big geochemical data,which can be used for numerous downstream activities,particularly where data timeliness,volume and velocity are important.Major benefactors of secondary geochemical data would be environmental monitoring and applications of artificial intelligence and machine learning in geochemistry,which currently entirely relies on manually derived data that is primarily guided by scientific reduction.Furthermore,it permits the usage of well-established data analysis techniques from geochemistry to remote sensing that allows useable insights to be extracted beyond those typically associated with strictly remote sensing data analysis.Currently,no generally applicable and systematic method to derive chemical elemental concentrations from large-scale remote sensing data have been documented in geosciences.In this paper,we demonstrate that fusing geostatistically-augmented geochemical and remote sensing data produces an abundance of data that enables a more generalized machine learning-based geochemical data generation.We use gold grade data from a South African tailing storage facility(TSF)and data from both the Landsat-8 and Sentinel remote sensing satellites.We show that various machine learning algorithms can be used given the abundance of training data.Consequently,we are able to produce a high resolution(10 m grid size)gold concentration map of the TSF,which demonstrates the potential of our method to be used to guide extraction planning,online resource exploration,environmental monitoring and resource estimation.
基金supported by a Department of Science and Innovation(DSI)-National Research Foundation(NRF)Thuthuka Grant(Grant UID:121973)and DSI-NRF CIMERA.
文摘Evolution in geoscientific data provides the mineral industry with new opportunities.A direction of geochemical data generation evolution is towards big data to meet the demands of data-driven usage scenarios that rely on data velocity.This direction is more significant where traditional geochemical data are not ideal,which is the case for evaluating unconventional resources,such as tailing storage facilities(TSFs),because they are not static due to sedimentation,compaction and changes associated with hydrospheric and lithospheric processes(e.g.,erosion,saltation and mobility of chemical constituents).In this paper,we generate big secondary geochemical data derived from Sentinel-2 satellite-remote sensing data to showcase the benefits of big geochemical data using TSFs from the Witwatersrand Basin(South Africa).Using spatially fused remote sensing and legacy geochemical data on the Dump 20 TSF,we trained a machine learning model to predict in-situ gold grades.Subsequently,we deployed the model to the Lindum TSF,which is 3 km away,over a period of a few years(2015-2019).We were able to visualize and analyze the temporal variation in the spatial distributions of the gold grade of the Lindum TSF.Additionally,we were able to infer extraction sequencing(to the resolution of the data),acid mine drainage formation and seasonal migration.These findings suggest that dynamic mineral resource models and live geochemical monitoring(e.g.,of elemental mobility and structural changes)are possible without additional physical sampling.
文摘In this study,we present a machine learning-based method to predict trace element concentrations from major and minor element concentration data using a legacy lithogeochemical database of magmatic rocks from the Karoo large igneous province(Gondwana Supercontinent).Wedemonstrate that a variety of trace elements,including most of the lanthanides,chalcophile,lithophile,and siderophile elements,can be predicted with excellent accuracy.This finding reveals that there are reliable,high-dimensional elemental associations that can be used to predict trace elements in a range of plutonic and volcanic rocks.Since the major and minor elements are used as predictors,prediction performance can be used as a direct proxy for geochemical anomalies.As such,our proposed method is suitable for prospective exploration by identifying anomalous trace element concentrations.Compared to multivariate compositional data analysis methods,the new method does not rely on assumptions of stoichiometric combinations of elements in the data to discover geochemical anomalies.Because we do not use multivariate compositional data analysis techniques(e.g.principal component analysis and combined use of major,minor and trace elements data),we also show that log-ratio transforms do not increase the performance of the proposed approach and are unnecessary for algorithms that are not spatially aware in the feature space.Therefore,we demonstrate that high-dimensional elemental associations can be modelled in an automated manner through a data-driven approach and without assumptions of stoichiometry within the data.The approach proposed in this study can be used as a replacement method to the multivariate compositional data analysis technique that is used for prospectivity mapping,or be used as a pre-processor to reduce the detection of false geochemical anomalies,particularly where the data is of variable quality.
基金Supported by a Department of Science and Innovation(DSI)-National Research Foundation(NRF)Thuthuka Grant(Grant UID:121973),and DSI-NRF CIMERA.
文摘In exploration geochemistry,advances in the detection limit,breadth of elements analyze-able,accuracy and precision of analytical instruments have motivated the re-analysis of legacy samples to improve confidence in geochemical data and gain more insights into potentially mineralized areas.While a re-analysis campaign in a geochemical exploration program modernizes legacy geochemical data by providing more trustworthy and higher-dimensional geochemical data,especially where modern data is considerably different than legacy data,it is an expensive exercise.The risk associated with modernizing such legacy data lies within its uncertainty in return(e.g.,the possibility of new discoveries,in primarily greenfield settings).Without any advanced knowledge of yet unanalyzed elements,the importance of re-analyses remains ambiguous.To address this uncertainty,we apply machine learning to multivariate geochemical data from different regions in Canada(i.e.,the Churchill Province and the Trans-Hudson Orogen)in order to use legacy geochemical data to predict modern and higher dimensional multi-elemental concentrations ahead of planned re-analyses.Our study demonstrates that legacy and modern geochemical data can be repurposed to predict yet unanalyzed elements that will be realized from re-analyses and in a manner that significantly reduces the latency to downstream usage of modern geochemical data(e.g.,prospectivity mapping).Findings from this study serve as a pillar of a framework for exploration geologists to predictively explore and prioritize potentially mineralized districts for further prospects in a timely manner before employing more invasive and expensive techniques.
文摘Mineral exploration campaigns are financially risky.Several state-of-the-art methods have been developed to mitigate the risk,including predictive modelling of mineral prospectivity using principal component analysis(PCA)and geographic information systems(GIS).The PCA and GIS approach is currently considered acceptable for generating mineral exploration targets.However,some of its limitations are the dependence on sample stoichiometry(e.g.,the existence of minerals),the necessity of log-ratio transformations when dealing with compositional data,and manual interpretation and use of principal components to enhance potential geochemical anomalies for prospectivity mapping.In this study,we generalize the fundamental ideas behind the PCA and GIS approach by developing a new data-driven approach using ML.We showcase a new workflow capable of generating either intermediate evidence layers or final prospectivity maps that depict major regional geochemical anomalies using multi-element geochemical data from Southeastern Churchill Province(Quebec and Labrador),Canada.The region is known for its REEs endowment and the data were gathered for prospectivity mapping.A comparison with the established multivariate hybrid data-and knowledge-based approach revealed that on a roughly comparable basis of the amount of manual effort,our new data-driven procedure can much more accurately identify geochemical anomalies in both univariate and multivariate applications.The results of our prospectivity mapping corroborate with the ground truth or known geological anomalies in the studied region.These findings have potentially wider implications on exploration target generation,where project risks(financial,environmental,political,etc.)and geochemical anomalies must be quantified using robust and effective datadriven approaches.In addition,our methodology is more replicable and objective,as manual geoscientific interpretation is not required during the detection of geochemical anomalies.