In this paper,we propose a novel coverless image steganographic scheme based on a generative model.In our scheme,the secret image is first fed to the generative model database,to generate a meaning-normal and independ...In this paper,we propose a novel coverless image steganographic scheme based on a generative model.In our scheme,the secret image is first fed to the generative model database,to generate a meaning-normal and independent image different from the secret image.The generated image is then transmitted to the receiver and fed to the generative model database to generate another image visually the same as the secret image.Thus,we only need to transmit the meaning-normal image which is not related to the secret image,and we can achieve the same effect as the transmission of the secret image.This is the first time to propose the coverless image information steganographic scheme based on generative model,compared with the traditional image steganography.The transmitted image is not embedded with any information of the secret image in this method,therefore,can effectively resist steganalysis tools.Experimental results show that our scheme has high capacity,security and reliability.展开更多
In recent years,an increasing number of studies about quantum machine learning not only provide powerful tools for quantum chemistry and quantum physics but also improve the classical learning algorithm.The hybrid qua...In recent years,an increasing number of studies about quantum machine learning not only provide powerful tools for quantum chemistry and quantum physics but also improve the classical learning algorithm.The hybrid quantum-classical framework,which is constructed by a variational quantum circuit(VQC)and an optimizer,plays a key role in the latest quantum machine learning studies.Nevertheless,in these hybrid-framework-based quantum machine learning models,the VQC is mainly constructed with a fixed structure and this structure causes inflexibility problems.There are also few studies focused on comparing the performance of quantum generative models with different loss functions.In this study,we address the inflexibility problem by adopting the variable-depth VQC model to automatically change the structure of the quantum circuit according to the qBAS score.The basic idea behind the variable-depth VQC is to consider the depth of the quantum circuit as a parameter during the training.Meanwhile,we compared the performance of the variable-depth VQC model based on four widely used statistical distances set as the loss functions,including Kullback-Leibler divergence(KL-divergence),Jensen-Shannon divergence(JS-divergence),total variation distance,and maximum mean discrepancy.Our numerical experiment shows a promising result that the variable-depth VQC model works better than the original VQC in the generative learning tasks.展开更多
The use of deep generative models(DGMs)such as variational autoencoders,autoregressive models,flow-based models,energy-based models,generative adversarial networks,and diffusion models has been advantageous in various...The use of deep generative models(DGMs)such as variational autoencoders,autoregressive models,flow-based models,energy-based models,generative adversarial networks,and diffusion models has been advantageous in various disciplines due to their high data generative skills.Using DGMs has become one of the most trending research topics in Artificial Intelligence in recent years.On the other hand,the research and development endeavors in the civil structural health monitoring(SHM)area have also been very progressive owing to the increasing use of Machine Learning techniques.As such,some of the DGMs have also been used in the civil SHM field lately.This short review communication paper aims to assist researchers in the civil SHM field in understanding the fundamentals of DGMs and,consequently,to help initiate their use for current and possible future engineering applications.On this basis,this study briefly introduces the concept and mechanism of different DGMs in a comparative fashion.While preparing this short review communication,it was observed that some DGMs had not been utilized or exploited fully in the SHM area.Accordingly,some representative studies presented in the civil SHM field that use DGMs are briefly overviewed.The study also presents a short comparative discussion on DGMs,their link to the SHM,and research directions.展开更多
In the assessment of car insurance claims,the claim rate for car insurance presents a highly skewed probability distribution,which is typically modeled using Tweedie distribution.The traditional approach to obtaining ...In the assessment of car insurance claims,the claim rate for car insurance presents a highly skewed probability distribution,which is typically modeled using Tweedie distribution.The traditional approach to obtaining the Tweedie regression model involves training on a centralized dataset,when the data is provided by multiple parties,training a privacy-preserving Tweedie regression model without exchanging raw data becomes a challenge.To address this issue,this study introduces a novel vertical federated learning-based Tweedie regression algorithm for multi-party auto insurance rate setting in data silos.The algorithm can keep sensitive data locally and uses privacy-preserving techniques to achieve intersection operations between the two parties holding the data.After determining which entities are shared,the participants train the model locally using the shared entity data to obtain the local generalized linear model intermediate parameters.The homomorphic encryption algorithms are introduced to interact with and update the model intermediate parameters to collaboratively complete the joint training of the car insurance rate-setting model.Performance tests on two publicly available datasets show that the proposed federated Tweedie regression algorithm can effectively generate Tweedie regression models that leverage the value of data fromboth partieswithout exchanging data.The assessment results of the scheme approach those of the Tweedie regressionmodel learned fromcentralized data,and outperformthe Tweedie regressionmodel learned independently by a single party.展开更多
Over the past decade,the presence of mistletoe(Viscum album ssp.austriacum)in Scots pine stands has increased in many European countries.Understanding the factors that influence the occurrence of mistletoe in stands i...Over the past decade,the presence of mistletoe(Viscum album ssp.austriacum)in Scots pine stands has increased in many European countries.Understanding the factors that influence the occurrence of mistletoe in stands is key to making appropriate forest management decisions to limit damage and prevent the spread of mistletoe in the future.Therefore,the main objective of this study was to determine the probability of mistletoe occurrence in Scots pine stands in relation to stand-related endogenous factors such as age,top height,and stand density,as well as topographic and edaphic factors.We used unmanned aerial vehicle(UAV)imagery from 2,247 stands to detect mistletoe in Scots pine stands,while majority stand and site characteristics were calculated from airborne laser scanning(ALS)data.Information on stand age and site type from the State Forest database were also used.We found that mistletoe infestation in Scots pine stands is influenced by stand and site characteristics.We documented that the densest,tallest,and oldest stands were more susceptible to mistletoe infestation.Site type and specific microsite conditions associated with topography were also important factors driving mistletoe occurrence.In addition,climatic water balance was a significant factor in increasing the probability of mistletoe occurrence,which is important in the context of predicted temperature increases associated with climate change.Our results are important for better understanding patterns of mistletoe infestation and ecosystem functioning under climate change.In an era of climate change and technological development,the use of remote sensing methods to determine the risk of mistletoe infestation can be a very useful tool for managing forest ecosystems to maintain forest sustainability and prevent forest disturbance.展开更多
The inflection point is an important feature of sigmoidal height-diameter(H-D)models.It is often cited as one of the properties favoring sigmoidal model forms.However,there are very few studies analyzing the inflectio...The inflection point is an important feature of sigmoidal height-diameter(H-D)models.It is often cited as one of the properties favoring sigmoidal model forms.However,there are very few studies analyzing the inflection points of H-D models.The goals of this study were to theoretically and empirically examine the behaviors of inflection points of six common H-D models with a regional dataset.The six models were the Wykoff(WYK),Schumacher(SCH),Curtis(CUR),HossfeldⅣ(HOS),von Bertalanffy-Richards(VBR),and Gompertz(GPZ)models.The models were first fitted in their base forms with tree species as random effects and were then expanded to include functional traits and spatial distribution.The distributions of the estimated inflection points were similar between the two-parameter models WYK,SCH,and CUR,but were different between the threeparameter models HOS,VBR,and GPZ.GPZ produced some of the largest inflection points.HOS and VBR produced concave H-D curves without inflection points for 12.7%and 39.7%of the tree species.Evergreen species or decreasing shade tolerance resulted in larger inflection points.The trends in the estimated inflection points of HOS and VBR were entirely opposite across the landscape.Furthermore,HOS could produce concave H-D curves for portions of the landscape.Based on the studied behaviors,the choice between two-parameter models may not matter.We recommend comparing seve ral three-parameter model forms for consistency in estimated inflection points before deciding on one.Believing sigmoidal models to have inflection points does not necessarily mean that they will produce fitted curves with one.Our study highlights the need to integrate analysis of inflection points into modeling H-D relationships.展开更多
Inverse design has long been an efficient and powerful design tool in the aircraft industry.In this paper,a novel inverse design method for supercritical airfoils is proposed based on generative models in deep learnin...Inverse design has long been an efficient and powerful design tool in the aircraft industry.In this paper,a novel inverse design method for supercritical airfoils is proposed based on generative models in deep learning.A Conditional Variational Auto Encoder(CVAE)and an integrated generative network CVAE-GAN that combines the CVAE with the Wasserstein Generative Adversarial Networks(WGAN),are conducted as generative models.They are used to generate target wall Mach distributions for the inverse design that matches specified features,such as locations of suction peak,shock and aft loading.Qualitative and quantitative results show that both adopted generative models can generate diverse and realistic wall Mach number distributions satisfying the given features.The CVAE-GAN model outperforms the CVAE model and achieves better reconstruction accuracies for all the samples in the dataset.Furthermore,a deep neural network for nonlinear mapping is adopted to obtain the airfoil shape corresponding to the target wall Mach number distribution.The performances of the designed deep neural network are fully demonstrated and a smoothness measurement is proposed to quantify small oscillations in the airfoil surface,proving the authenticity and accuracy of the generated airfoil shapes.展开更多
Short-term building energy predictions serve as one of the fundamental tasks in building operation management.While large numbers of studies have explored the value of various supervised machine learning techniques in...Short-term building energy predictions serve as one of the fundamental tasks in building operation management.While large numbers of studies have explored the value of various supervised machine learning techniques in energy predictions,few studies have addressed the potential data shortage problem in developing data-driven models.One promising solution is data augmentation,which aims to enrich existing building data resources for reliable predictive modeling.This study proposes a deep generative modeling-based data augmentation strategy for improving short-term building energy predictions.Two types of conditional variational autoencoders have been designed for synthetic energy data generation using fully connected and one-dimensional convolutional layers respectively.Data experiments have been designed to evaluate the value of data augmentation using actual measurements from 52 buildings.The results indicate that conditional variational autoencoders are capable of generating high-quality synthetic data samples,which in turns helps to enhance the accuracy in short-term building energy predictions.The average performance enhancement ratios in terms of CV-RMSE range between 12%and 18%.Practical guidelines have been obtained to ensure the validity and quality of synthetic building energy data.The research outcomes are valuable for enhancing the robustness and reliability of data-driven models for smart building operation management.展开更多
For reservoirs with complex non-Gaussian geological characteristics,such as carbonate reservoirs or reservoirs with sedimentary facies distribution,it is difficult to implement history matching directly,especially for...For reservoirs with complex non-Gaussian geological characteristics,such as carbonate reservoirs or reservoirs with sedimentary facies distribution,it is difficult to implement history matching directly,especially for the ensemble-based data assimilation methods.In this paper,we propose a multi-source information fused generative adversarial network(MSIGAN)model,which is used for parameterization of the complex geologies.In MSIGAN,various information such as facies distribution,microseismic,and inter-well connectivity,can be integrated to learn the geological features.And two major generative models in deep learning,variational autoencoder(VAE)and generative adversarial network(GAN)are combined in our model.Then the proposed MSIGAN model is integrated into the ensemble smoother with multiple data assimilation(ESMDA)method to conduct history matching.We tested the proposed method on two reservoir models with fluvial facies.The experimental results show that the proposed MSIGAN model can effectively learn the complex geological features,which can promote the accuracy of history matching.展开更多
Natural products(NPs) have long been recognized as a valuable resource for drug discovery, and bringing NP-related features to virtual libraries is believed to be an effective way to increase the coverage of druggab...Natural products(NPs) have long been recognized as a valuable resource for drug discovery, and bringing NP-related features to virtual libraries is believed to be an effective way to increase the coverage of druggable chemical space. Here, deep learning-based molecule generative model, which is a recent technique in de novo molecule design, was applied to generate virtual libraries with NP-like properties. Results demonstrated that the model was effective in generating molecules that highly resemble NPs. Moreover, the model was also found to be capable of generating NP-like molecules that were also easy to synthesize, significantly increasing the practical value of the compound library.展开更多
Most previous studies have mainly focused on the analyses of one entire network(graph) or the giant connected components of networks. In this paper, we investigate the disconnected components(non-giant connected compo...Most previous studies have mainly focused on the analyses of one entire network(graph) or the giant connected components of networks. In this paper, we investigate the disconnected components(non-giant connected component) of some real social networks, and report some interesting discoveries about structural properties of disconnected components. We study three diverse, real networks and compute the significance profile of each component. We discover some similarities in the local structure between the giant connected component and disconnected components in diverse social networks. Then we discuss how to detect network attacks based on the local structure properties of networks. Furthermore, we propose an empirical generative model called i Friends to generate networks that follow our observed patterns.展开更多
This article proposes a model combination method to enhance the discriminability of the generative model. Generative and discriminative models have different optimization objectives and have their own advantages and d...This article proposes a model combination method to enhance the discriminability of the generative model. Generative and discriminative models have different optimization objectives and have their own advantages and drawbacks. The method proposed in this article intends to strike a balance between the two models mentioned above. It extracts the discriminative parameter from the generative model and generates a new model based on a multi-model combination. The weight for combining is determined by the ratio of the inter-variance to the intra-variance of the classes. The higher the ratio is, the greater the weight is, and the more discriminative the model will be. Experiments on speech recognition demonstrate that the performance of the new model outperforms the model trained with the traditional generative method.展开更多
The limited amount of data in the healthcare domain and the necessity of training samples for increased performance of deep learning models is a recurrent challenge,especially in medical imaging.Newborn Solutions aims...The limited amount of data in the healthcare domain and the necessity of training samples for increased performance of deep learning models is a recurrent challenge,especially in medical imaging.Newborn Solutions aims to enhance its non-invasive white blood cell counting device,Neosonics,by creating synthetic in vitro ultrasound images to facilitate a more efficient image generation process.This study addresses the data scarcity issue by designing and evaluating a continuous scalar conditional Generative Adversarial Network(GAN)to augment in vitro peritoneal dialysis ultrasound images,increasing both the volume and variability of training samples.The developed GAN architecture incorporates novel design features:varying kernel sizes in the generator’s transposed convolutional layers and a latent intermediate space,projecting noise and condition values for enhanced image resolution and specificity.The experimental results show that the GAN successfully generated diverse images of high visual quality,closely resembling real ultrasound samples.While visual results were promising,the use of GAN-based data augmentation did not consistently improve the performance of an image regressor in distinguishing features specific to varied white blood cell concentrations.Ultimately,while this continuous scalar conditional GAN model made strides in generating realistic images,further work is needed to achieve consistent gains in regression tasks,aiming for robust model generalization.展开更多
Recently, generative adversarial networks(GANs)have become a research focus of artificial intelligence. Inspired by two-player zero-sum game, GANs comprise a generator and a discriminator, both trained under the adver...Recently, generative adversarial networks(GANs)have become a research focus of artificial intelligence. Inspired by two-player zero-sum game, GANs comprise a generator and a discriminator, both trained under the adversarial learning idea.The goal of GANs is to estimate the potential distribution of real data samples and generate new samples from that distribution.Since their initiation, GANs have been widely studied due to their enormous prospect for applications, including image and vision computing, speech and language processing, etc. In this review paper, we summarize the state of the art of GANs and look into the future. Firstly, we survey GANs' proposal background,theoretic and implementation models, and application fields.Then, we discuss GANs' advantages and disadvantages, and their development trends. In particular, we investigate the relation between GANs and parallel intelligence,with the conclusion that GANs have a great potential in parallel systems research in terms of virtual-real interaction and integration. Clearly, GANs can provide substantial algorithmic support for parallel intelligence.展开更多
Photonic inverse design concerns the problem of finding photonic structures with target optical properties.However,traditional methods based on optimization algorithms are time-consuming and computationally expensive....Photonic inverse design concerns the problem of finding photonic structures with target optical properties.However,traditional methods based on optimization algorithms are time-consuming and computationally expensive.Recently,deep learning-based approaches have been developed to tackle the problem of inverse design efficiently.Although most of these neural network models have demonstrated high accuracy in different inverse design problems,no previous study has examined the potential effects under given constraints in nanomanufacturing.Additionally,the relative strength of different deep learning-based inverse design approaches has not been fully investigated.Here,we benchmark three commonly used deep learning models in inverse design:Tandem networks,Variational Auto-Encoders,and Generative Adversarial Networks.We provide detailed comparisons in terms of their accuracy,diversity,and robustness.We find that tandem networks and Variational Auto-Encoders give the best accuracy,while Generative Adversarial Networks lead to the most diverse predictions.Our findings could serve as a guideline for researchers to select the model that can best suit their design criteria and fabrication considerations.In addition,our code and data are publicly available,which could be used for future inverse design model development and benchmarking.展开更多
Predictive models for assessing the risk of developing lung cancers can help identify high-risk individuals with the aim of recommending further screening and early intervention.To facilitate pre-hospital self-assessm...Predictive models for assessing the risk of developing lung cancers can help identify high-risk individuals with the aim of recommending further screening and early intervention.To facilitate pre-hospital self-assessments,some studies have exploited predictive models trained on non-clinical data(e.g.,smoking status and family history).The performance of these models is limited due to not considering clinical data(e.g.,blood test and medical imaging results).Deep learning has shown the potential in processing complex data that combine both clinical and non-clinical information.However,predicting lung cancers remains difficult due to the severe lack of positive samples among follow-ups.To tackle this problem,this paper presents a generative-discriminative framework for improving the ability of deep learning models to generalize.According to the proposed framework,two nonlinear generative models,one based on the generative adversarial network and another on the variational autoencoder,are used to synthesize auxiliary positive samples for the training set.Then,several discriminative models,including a deep neural network(DNN),are used to assess the lung cancer risk based on a comprehensive list of risk factors.The framework was evaluated on over 55000 subjects questioned between January 2014 and December 2017,with 699 subjects being clinically diagnosed with lung cancer between January 2014 and August 2019.According to the results,the best performing predictive model built using the proposed framework was based on DNN.It achieved an average sensitivity of 76.54%and an area under the curve of 69.24%in distinguishing between the cases of lung cancer and normal cases on test sets.展开更多
Deep learning is capable of greatly promoting the progress of super-resolution imaging technology in terms of imaging and reconstruction speed,imaging resolution,and imagingflux.This paper proposes a deep neural netwo...Deep learning is capable of greatly promoting the progress of super-resolution imaging technology in terms of imaging and reconstruction speed,imaging resolution,and imagingflux.This paper proposes a deep neural network based on a generative adversarial network(GAN).The generator employs a U-Net-based network,which integrates Dense Net for the downsampling component.The proposed method has excellent properties,for example,the network model is trained with several different datasets of biological structures;the trained model can improve the imaging resolution of different microscopy imaging modalities such as confocal imaging and wide-field imaging;and the model demonstrates a generalized ability to improve the resolution of different biological structures even out of the datasets.In addition,experimental results showed that the method improved the resolution of caveolin-coated pits(CCPs)structures from 264 nm to 138 nm,a 1.91-fold increase,and nearly doubled the resolution of DNA molecules imaged while being transported through microfluidic channels.展开更多
Much of the world's biodiversity lies in heterogeneous mountain areas with their diverse environments.As an example,Iranian montane ranges are highly diverse,particularly in the Irano-Turanian phytogeographical re...Much of the world's biodiversity lies in heterogeneous mountain areas with their diverse environments.As an example,Iranian montane ranges are highly diverse,particularly in the Irano-Turanian phytogeographical region.Understanding plant diversity patterns with increasing elevation is of high significance,not least for conservation planning.We studied the pattern of species richness,Shannon diversity,endemic richness,endemics ratio,and richness of life forms along a 3900 m elevational transect in Mount Palvar,overlooking the Lut Desert in Southeast Iran.We also analyzed the effect of environmental variables on species turnover along the vertical gradient.A total of 120 vegetation plots(10 m×10 m)were sampled along the elevational transect containing species and environmental data.To discover plant diversity pattern along the elevational gradient,generalized additive model(GAM)was used.Non-metric multidimensional scaling(NMDS)was applied for illustrating the correlation between species composition and environmental variables.We found hump-shaped pattern for species richness,Shannon diversity,endemic richness,and species richness of different life forms,but a monotonic increasing pattern for ratio of endemic species from low to high elevations.Our study confirms the humped pattern of species richness peaking at intermediate elevations along a complete elevational gradient in a semi-arid mountain.The monotonic increase of endemics ratio with elevation in our area as a case study is consistent with global increase of endemism with elevation.According to our results,temperature and precipitation are two important climatic variables that drive elevational plant diversity,particularly in seasonally dry areas.Our study suggests that effective conservation and management are needed for this low latitude mountain area along with calling for long-term monitoring for species redistribution.展开更多
Multispecies forests have received increased scientific attention,driven by the hypothesis that biodiversity improves ecological resilience.However,a greater species diversity presents challenges for forest management...Multispecies forests have received increased scientific attention,driven by the hypothesis that biodiversity improves ecological resilience.However,a greater species diversity presents challenges for forest management and research.Our study aims to develop basal area growth models for tree species cohorts.The analysis is based on a dataset of 423 permanent plots(2,500 m^(2))located in temperate forests in Durango,Mexico.First,we define tree species cohorts based on individual and neighborhood-based variables using a combination of principal component and cluster analyses.Then,we estimate the basal area increment of each cohort through the generalized additive model to describe the effect of tree size,competition,stand density and site quality.The principal component and cluster analyses assign a total of 37 tree species to eight cohorts that differed primarily with regard to the distribution of tree size and vertical position within the community.The generalized additive models provide satisfactory estimates of tree growth for the species cohorts,explaining between 19 and 53 percent of the total variation of basal area increment,and highlight the following results:i)most cohorts show a"rise-and-fall"effect of tree size on tree growth;ii)surprisingly,the competition index"basal area of larger trees"had showed a positive effect in four of the eight cohorts;iii)stand density had a negative effect on basal area increment,though the effect was minor in medium-and high-density stands,and iv)basal area growth was positively correlated with site quality except for an oak cohort.The developed species cohorts and growth models provide insight into their particular ecological features and growth patterns that may support the development of sustainable management strategies for temperate multispecies forests.展开更多
Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have dev...Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have developed from the logistic regression model,the geographical weighted logistic regression model,the Lasso regression model,the random forest model,and the support vector machine model based on historical forest fire data from 2000 to 2019 in Jilin Province.The models,along with a distribution map are presented in this paper to provide a theoretical basis for forest fire management in this area.Existing studies show that the prediction accuracies of the two machine learning models are higher than those of the three generalized linear regression models.The accuracies of the random forest model,the support vector machine model,geographical weighted logistic regression model,the Lasso regression model,and logistic model were 88.7%,87.7%,86.0%,85.0%and 84.6%,respectively.Weather is the main factor affecting forest fires,while the impacts of topography factors,human and social-economic factors on fire occurrence were similar.展开更多
基金This paper was supported by the National Natural Science Foundation of China(No.U1204606)the Key Programs for Science and Technology Development of Henan Province(No.172102210335)Key Scientific Research Projects in Henan Universities(No.16A520058).
文摘In this paper,we propose a novel coverless image steganographic scheme based on a generative model.In our scheme,the secret image is first fed to the generative model database,to generate a meaning-normal and independent image different from the secret image.The generated image is then transmitted to the receiver and fed to the generative model database to generate another image visually the same as the secret image.Thus,we only need to transmit the meaning-normal image which is not related to the secret image,and we can achieve the same effect as the transmission of the secret image.This is the first time to propose the coverless image information steganographic scheme based on generative model,compared with the traditional image steganography.The transmitted image is not embedded with any information of the secret image in this method,therefore,can effectively resist steganalysis tools.Experimental results show that our scheme has high capacity,security and reliability.
基金This work has received support from the National Key Research&Development Plan of China under Grant No.2018YFA0306703.
文摘In recent years,an increasing number of studies about quantum machine learning not only provide powerful tools for quantum chemistry and quantum physics but also improve the classical learning algorithm.The hybrid quantum-classical framework,which is constructed by a variational quantum circuit(VQC)and an optimizer,plays a key role in the latest quantum machine learning studies.Nevertheless,in these hybrid-framework-based quantum machine learning models,the VQC is mainly constructed with a fixed structure and this structure causes inflexibility problems.There are also few studies focused on comparing the performance of quantum generative models with different loss functions.In this study,we address the inflexibility problem by adopting the variable-depth VQC model to automatically change the structure of the quantum circuit according to the qBAS score.The basic idea behind the variable-depth VQC is to consider the depth of the quantum circuit as a parameter during the training.Meanwhile,we compared the performance of the variable-depth VQC model based on four widely used statistical distances set as the loss functions,including Kullback-Leibler divergence(KL-divergence),Jensen-Shannon divergence(JS-divergence),total variation distance,and maximum mean discrepancy.Our numerical experiment shows a promising result that the variable-depth VQC model works better than the original VQC in the generative learning tasks.
基金the National Aeronautics and Space Administration(NASA)Award No.80NSSC20K0326 for the research activities and particularly for this paper。
文摘The use of deep generative models(DGMs)such as variational autoencoders,autoregressive models,flow-based models,energy-based models,generative adversarial networks,and diffusion models has been advantageous in various disciplines due to their high data generative skills.Using DGMs has become one of the most trending research topics in Artificial Intelligence in recent years.On the other hand,the research and development endeavors in the civil structural health monitoring(SHM)area have also been very progressive owing to the increasing use of Machine Learning techniques.As such,some of the DGMs have also been used in the civil SHM field lately.This short review communication paper aims to assist researchers in the civil SHM field in understanding the fundamentals of DGMs and,consequently,to help initiate their use for current and possible future engineering applications.On this basis,this study briefly introduces the concept and mechanism of different DGMs in a comparative fashion.While preparing this short review communication,it was observed that some DGMs had not been utilized or exploited fully in the SHM area.Accordingly,some representative studies presented in the civil SHM field that use DGMs are briefly overviewed.The study also presents a short comparative discussion on DGMs,their link to the SHM,and research directions.
基金This research was funded by the National Natural Science Foundation of China(No.62272124)the National Key Research and Development Program of China(No.2022YFB2701401)+3 种基金Guizhou Province Science and Technology Plan Project(Grant Nos.Qiankehe Paltform Talent[2020]5017)The Research Project of Guizhou University for Talent Introduction(No.[2020]61)the Cultivation Project of Guizhou University(No.[2019]56)the Open Fund of Key Laboratory of Advanced Manufacturing Technology,Ministry of Education(GZUAMT2021KF[01]).
文摘In the assessment of car insurance claims,the claim rate for car insurance presents a highly skewed probability distribution,which is typically modeled using Tweedie distribution.The traditional approach to obtaining the Tweedie regression model involves training on a centralized dataset,when the data is provided by multiple parties,training a privacy-preserving Tweedie regression model without exchanging raw data becomes a challenge.To address this issue,this study introduces a novel vertical federated learning-based Tweedie regression algorithm for multi-party auto insurance rate setting in data silos.The algorithm can keep sensitive data locally and uses privacy-preserving techniques to achieve intersection operations between the two parties holding the data.After determining which entities are shared,the participants train the model locally using the shared entity data to obtain the local generalized linear model intermediate parameters.The homomorphic encryption algorithms are introduced to interact with and update the model intermediate parameters to collaboratively complete the joint training of the car insurance rate-setting model.Performance tests on two publicly available datasets show that the proposed federated Tweedie regression algorithm can effectively generate Tweedie regression models that leverage the value of data fromboth partieswithout exchanging data.The assessment results of the scheme approach those of the Tweedie regressionmodel learned fromcentralized data,and outperformthe Tweedie regressionmodel learned independently by a single party.
基金funded by National Science Centre,Poland under the project"Assessment of the impact of weather conditions on forest health status and forest disturbances at regional and national scale based on the integration of ground and space-based remote sensing datasets"(project no.2021/41/B/ST10/)Data collection and research was also supported by the project no.EZ.271.3.19.2021"Modele ryzyka zamierania drzewostanow glownych gatunkow lasotworczych Polski"funded by the General Directorate of State Forests in Poland。
文摘Over the past decade,the presence of mistletoe(Viscum album ssp.austriacum)in Scots pine stands has increased in many European countries.Understanding the factors that influence the occurrence of mistletoe in stands is key to making appropriate forest management decisions to limit damage and prevent the spread of mistletoe in the future.Therefore,the main objective of this study was to determine the probability of mistletoe occurrence in Scots pine stands in relation to stand-related endogenous factors such as age,top height,and stand density,as well as topographic and edaphic factors.We used unmanned aerial vehicle(UAV)imagery from 2,247 stands to detect mistletoe in Scots pine stands,while majority stand and site characteristics were calculated from airborne laser scanning(ALS)data.Information on stand age and site type from the State Forest database were also used.We found that mistletoe infestation in Scots pine stands is influenced by stand and site characteristics.We documented that the densest,tallest,and oldest stands were more susceptible to mistletoe infestation.Site type and specific microsite conditions associated with topography were also important factors driving mistletoe occurrence.In addition,climatic water balance was a significant factor in increasing the probability of mistletoe occurrence,which is important in the context of predicted temperature increases associated with climate change.Our results are important for better understanding patterns of mistletoe infestation and ecosystem functioning under climate change.In an era of climate change and technological development,the use of remote sensing methods to determine the risk of mistletoe infestation can be a very useful tool for managing forest ecosystems to maintain forest sustainability and prevent forest disturbance.
文摘The inflection point is an important feature of sigmoidal height-diameter(H-D)models.It is often cited as one of the properties favoring sigmoidal model forms.However,there are very few studies analyzing the inflection points of H-D models.The goals of this study were to theoretically and empirically examine the behaviors of inflection points of six common H-D models with a regional dataset.The six models were the Wykoff(WYK),Schumacher(SCH),Curtis(CUR),HossfeldⅣ(HOS),von Bertalanffy-Richards(VBR),and Gompertz(GPZ)models.The models were first fitted in their base forms with tree species as random effects and were then expanded to include functional traits and spatial distribution.The distributions of the estimated inflection points were similar between the two-parameter models WYK,SCH,and CUR,but were different between the threeparameter models HOS,VBR,and GPZ.GPZ produced some of the largest inflection points.HOS and VBR produced concave H-D curves without inflection points for 12.7%and 39.7%of the tree species.Evergreen species or decreasing shade tolerance resulted in larger inflection points.The trends in the estimated inflection points of HOS and VBR were entirely opposite across the landscape.Furthermore,HOS could produce concave H-D curves for portions of the landscape.Based on the studied behaviors,the choice between two-parameter models may not matter.We recommend comparing seve ral three-parameter model forms for consistency in estimated inflection points before deciding on one.Believing sigmoidal models to have inflection points does not necessarily mean that they will produce fitted curves with one.Our study highlights the need to integrate analysis of inflection points into modeling H-D relationships.
基金co-supported by the National Key Project of China(No.GJXM92579)the National Natural Science Foundation of China(Nos.92052203,61903178 and61906081)。
文摘Inverse design has long been an efficient and powerful design tool in the aircraft industry.In this paper,a novel inverse design method for supercritical airfoils is proposed based on generative models in deep learning.A Conditional Variational Auto Encoder(CVAE)and an integrated generative network CVAE-GAN that combines the CVAE with the Wasserstein Generative Adversarial Networks(WGAN),are conducted as generative models.They are used to generate target wall Mach distributions for the inverse design that matches specified features,such as locations of suction peak,shock and aft loading.Qualitative and quantitative results show that both adopted generative models can generate diverse and realistic wall Mach number distributions satisfying the given features.The CVAE-GAN model outperforms the CVAE model and achieves better reconstruction accuracies for all the samples in the dataset.Furthermore,a deep neural network for nonlinear mapping is adopted to obtain the airfoil shape corresponding to the target wall Mach number distribution.The performances of the designed deep neural network are fully demonstrated and a smoothness measurement is proposed to quantify small oscillations in the airfoil surface,proving the authenticity and accuracy of the generated airfoil shapes.
基金support of this research by the National Natural Science Foundation of China(No.51908365,No.71772125)the Philosophical and Social Science Program of Guangdong Province,China(GD18YGL07).
文摘Short-term building energy predictions serve as one of the fundamental tasks in building operation management.While large numbers of studies have explored the value of various supervised machine learning techniques in energy predictions,few studies have addressed the potential data shortage problem in developing data-driven models.One promising solution is data augmentation,which aims to enrich existing building data resources for reliable predictive modeling.This study proposes a deep generative modeling-based data augmentation strategy for improving short-term building energy predictions.Two types of conditional variational autoencoders have been designed for synthetic energy data generation using fully connected and one-dimensional convolutional layers respectively.Data experiments have been designed to evaluate the value of data augmentation using actual measurements from 52 buildings.The results indicate that conditional variational autoencoders are capable of generating high-quality synthetic data samples,which in turns helps to enhance the accuracy in short-term building energy predictions.The average performance enhancement ratios in terms of CV-RMSE range between 12%and 18%.Practical guidelines have been obtained to ensure the validity and quality of synthetic building energy data.The research outcomes are valuable for enhancing the robustness and reliability of data-driven models for smart building operation management.
基金supported by the National Natural Science Foundation of China under Grant 51722406,52074340,and 51874335the Shandong Provincial Natural Science Foundation under Grant JQ201808+5 种基金The Fundamental Research Funds for the Central Universities under Grant 18CX02097Athe Major Scientific and Technological Projects of CNPC under Grant ZD2019-183-008the Science and Technology Support Plan for Youth Innovation of University in Shandong Province under Grant 2019KJH002the National Research Council of Science and Technology Major Project of China under Grant 2016ZX05025001-006111 Project under Grant B08028Sinopec Science and Technology Project under Grant P20050-1
文摘For reservoirs with complex non-Gaussian geological characteristics,such as carbonate reservoirs or reservoirs with sedimentary facies distribution,it is difficult to implement history matching directly,especially for the ensemble-based data assimilation methods.In this paper,we propose a multi-source information fused generative adversarial network(MSIGAN)model,which is used for parameterization of the complex geologies.In MSIGAN,various information such as facies distribution,microseismic,and inter-well connectivity,can be integrated to learn the geological features.And two major generative models in deep learning,variational autoencoder(VAE)and generative adversarial network(GAN)are combined in our model.Then the proposed MSIGAN model is integrated into the ensemble smoother with multiple data assimilation(ESMDA)method to conduct history matching.We tested the proposed method on two reservoir models with fluvial facies.The experimental results show that the proposed MSIGAN model can effectively learn the complex geological features,which can promote the accuracy of history matching.
基金The National Natural Science Foundation of China(Grant No.81573273,81673279,21572010 and 21772005)National Major Scientific and Technological Special Project for"Significant New Drugs Development"(Grant No.2018ZX09735001-003)
文摘Natural products(NPs) have long been recognized as a valuable resource for drug discovery, and bringing NP-related features to virtual libraries is believed to be an effective way to increase the coverage of druggable chemical space. Here, deep learning-based molecule generative model, which is a recent technique in de novo molecule design, was applied to generate virtual libraries with NP-like properties. Results demonstrated that the model was effective in generating molecules that highly resemble NPs. Moreover, the model was also found to be capable of generating NP-like molecules that were also easy to synthesize, significantly increasing the practical value of the compound library.
基金supported by the National Natural Science Foundation of China (Grant Nos. 61572060, 61190125, 61472024)CERNET Innovation Project 2015 (Grant No. NGII20151004)
文摘Most previous studies have mainly focused on the analyses of one entire network(graph) or the giant connected components of networks. In this paper, we investigate the disconnected components(non-giant connected component) of some real social networks, and report some interesting discoveries about structural properties of disconnected components. We study three diverse, real networks and compute the significance profile of each component. We discover some similarities in the local structure between the giant connected component and disconnected components in diverse social networks. Then we discuss how to detect network attacks based on the local structure properties of networks. Furthermore, we propose an empirical generative model called i Friends to generate networks that follow our observed patterns.
基金supported by the National Natural Science Foundation of China(60705019)the Hi-Tech Research and Development Program of China(2006AA010102,2007AA01Z417)
文摘This article proposes a model combination method to enhance the discriminability of the generative model. Generative and discriminative models have different optimization objectives and have their own advantages and drawbacks. The method proposed in this article intends to strike a balance between the two models mentioned above. It extracts the discriminative parameter from the generative model and generates a new model based on a multi-model combination. The weight for combining is determined by the ratio of the inter-variance to the intra-variance of the classes. The higher the ratio is, the greater the weight is, and the more discriminative the model will be. Experiments on speech recognition demonstrate that the performance of the new model outperforms the model trained with the traditional generative method.
文摘The limited amount of data in the healthcare domain and the necessity of training samples for increased performance of deep learning models is a recurrent challenge,especially in medical imaging.Newborn Solutions aims to enhance its non-invasive white blood cell counting device,Neosonics,by creating synthetic in vitro ultrasound images to facilitate a more efficient image generation process.This study addresses the data scarcity issue by designing and evaluating a continuous scalar conditional Generative Adversarial Network(GAN)to augment in vitro peritoneal dialysis ultrasound images,increasing both the volume and variability of training samples.The developed GAN architecture incorporates novel design features:varying kernel sizes in the generator’s transposed convolutional layers and a latent intermediate space,projecting noise and condition values for enhanced image resolution and specificity.The experimental results show that the GAN successfully generated diverse images of high visual quality,closely resembling real ultrasound samples.While visual results were promising,the use of GAN-based data augmentation did not consistently improve the performance of an image regressor in distinguishing features specific to varied white blood cell concentrations.Ultimately,while this continuous scalar conditional GAN model made strides in generating realistic images,further work is needed to achieve consistent gains in regression tasks,aiming for robust model generalization.
基金supported by the National Natural Science Foundation of China(61533019,71232006,91520301)
文摘Recently, generative adversarial networks(GANs)have become a research focus of artificial intelligence. Inspired by two-player zero-sum game, GANs comprise a generator and a discriminator, both trained under the adversarial learning idea.The goal of GANs is to estimate the potential distribution of real data samples and generate new samples from that distribution.Since their initiation, GANs have been widely studied due to their enormous prospect for applications, including image and vision computing, speech and language processing, etc. In this review paper, we summarize the state of the art of GANs and look into the future. Firstly, we survey GANs' proposal background,theoretic and implementation models, and application fields.Then, we discuss GANs' advantages and disadvantages, and their development trends. In particular, we investigate the relation between GANs and parallel intelligence,with the conclusion that GANs have a great potential in parallel systems research in terms of virtual-real interaction and integration. Clearly, GANs can provide substantial algorithmic support for parallel intelligence.
文摘Photonic inverse design concerns the problem of finding photonic structures with target optical properties.However,traditional methods based on optimization algorithms are time-consuming and computationally expensive.Recently,deep learning-based approaches have been developed to tackle the problem of inverse design efficiently.Although most of these neural network models have demonstrated high accuracy in different inverse design problems,no previous study has examined the potential effects under given constraints in nanomanufacturing.Additionally,the relative strength of different deep learning-based inverse design approaches has not been fully investigated.Here,we benchmark three commonly used deep learning models in inverse design:Tandem networks,Variational Auto-Encoders,and Generative Adversarial Networks.We provide detailed comparisons in terms of their accuracy,diversity,and robustness.We find that tandem networks and Variational Auto-Encoders give the best accuracy,while Generative Adversarial Networks lead to the most diverse predictions.Our findings could serve as a guideline for researchers to select the model that can best suit their design criteria and fabrication considerations.In addition,our code and data are publicly available,which could be used for future inverse design model development and benchmarking.
基金supported in part by Zhejiang Provincial Natural Science Foundation of China(LQ20F030013)Research Foundation of Hwa Mei Hospital,University of Chinese Academy of Sciences(2020HMZD22)+1 种基金Ningbo Public Service Technology Foundation(202002N3181)Medical Scientific Research Foundation of Zhejiang Province(2021431314)。
文摘Predictive models for assessing the risk of developing lung cancers can help identify high-risk individuals with the aim of recommending further screening and early intervention.To facilitate pre-hospital self-assessments,some studies have exploited predictive models trained on non-clinical data(e.g.,smoking status and family history).The performance of these models is limited due to not considering clinical data(e.g.,blood test and medical imaging results).Deep learning has shown the potential in processing complex data that combine both clinical and non-clinical information.However,predicting lung cancers remains difficult due to the severe lack of positive samples among follow-ups.To tackle this problem,this paper presents a generative-discriminative framework for improving the ability of deep learning models to generalize.According to the proposed framework,two nonlinear generative models,one based on the generative adversarial network and another on the variational autoencoder,are used to synthesize auxiliary positive samples for the training set.Then,several discriminative models,including a deep neural network(DNN),are used to assess the lung cancer risk based on a comprehensive list of risk factors.The framework was evaluated on over 55000 subjects questioned between January 2014 and December 2017,with 699 subjects being clinically diagnosed with lung cancer between January 2014 and August 2019.According to the results,the best performing predictive model built using the proposed framework was based on DNN.It achieved an average sensitivity of 76.54%and an area under the curve of 69.24%in distinguishing between the cases of lung cancer and normal cases on test sets.
基金Subjects funded by the National Natural Science Foundation of China(Nos.62275216 and 61775181)the Natural Science Basic Research Programme of Shaanxi Province-Major Basic Research Special Project(Nos.S2018-ZC-TD-0061 and TZ0393)the Special Project for the Development of National Key Scientific Instruments and Equipment No.(51927804).
文摘Deep learning is capable of greatly promoting the progress of super-resolution imaging technology in terms of imaging and reconstruction speed,imaging resolution,and imagingflux.This paper proposes a deep neural network based on a generative adversarial network(GAN).The generator employs a U-Net-based network,which integrates Dense Net for the downsampling component.The proposed method has excellent properties,for example,the network model is trained with several different datasets of biological structures;the trained model can improve the imaging resolution of different microscopy imaging modalities such as confocal imaging and wide-field imaging;and the model demonstrates a generalized ability to improve the resolution of different biological structures even out of the datasets.In addition,experimental results showed that the method improved the resolution of caveolin-coated pits(CCPs)structures from 264 nm to 138 nm,a 1.91-fold increase,and nearly doubled the resolution of DNA molecules imaged while being transported through microfluidic channels.
文摘Much of the world's biodiversity lies in heterogeneous mountain areas with their diverse environments.As an example,Iranian montane ranges are highly diverse,particularly in the Irano-Turanian phytogeographical region.Understanding plant diversity patterns with increasing elevation is of high significance,not least for conservation planning.We studied the pattern of species richness,Shannon diversity,endemic richness,endemics ratio,and richness of life forms along a 3900 m elevational transect in Mount Palvar,overlooking the Lut Desert in Southeast Iran.We also analyzed the effect of environmental variables on species turnover along the vertical gradient.A total of 120 vegetation plots(10 m×10 m)were sampled along the elevational transect containing species and environmental data.To discover plant diversity pattern along the elevational gradient,generalized additive model(GAM)was used.Non-metric multidimensional scaling(NMDS)was applied for illustrating the correlation between species composition and environmental variables.We found hump-shaped pattern for species richness,Shannon diversity,endemic richness,and species richness of different life forms,but a monotonic increasing pattern for ratio of endemic species from low to high elevations.Our study confirms the humped pattern of species richness peaking at intermediate elevations along a complete elevational gradient in a semi-arid mountain.The monotonic increase of endemics ratio with elevation in our area as a case study is consistent with global increase of endemism with elevation.According to our results,temperature and precipitation are two important climatic variables that drive elevational plant diversity,particularly in seasonally dry areas.Our study suggests that effective conservation and management are needed for this low latitude mountain area along with calling for long-term monitoring for species redistribution.
基金The National Forestry Commission of Mexico and The Mexican National Council for Science and Technology(CONAFOR-CONACYT-115900)。
文摘Multispecies forests have received increased scientific attention,driven by the hypothesis that biodiversity improves ecological resilience.However,a greater species diversity presents challenges for forest management and research.Our study aims to develop basal area growth models for tree species cohorts.The analysis is based on a dataset of 423 permanent plots(2,500 m^(2))located in temperate forests in Durango,Mexico.First,we define tree species cohorts based on individual and neighborhood-based variables using a combination of principal component and cluster analyses.Then,we estimate the basal area increment of each cohort through the generalized additive model to describe the effect of tree size,competition,stand density and site quality.The principal component and cluster analyses assign a total of 37 tree species to eight cohorts that differed primarily with regard to the distribution of tree size and vertical position within the community.The generalized additive models provide satisfactory estimates of tree growth for the species cohorts,explaining between 19 and 53 percent of the total variation of basal area increment,and highlight the following results:i)most cohorts show a"rise-and-fall"effect of tree size on tree growth;ii)surprisingly,the competition index"basal area of larger trees"had showed a positive effect in four of the eight cohorts;iii)stand density had a negative effect on basal area increment,though the effect was minor in medium-and high-density stands,and iv)basal area growth was positively correlated with site quality except for an oak cohort.The developed species cohorts and growth models provide insight into their particular ecological features and growth patterns that may support the development of sustainable management strategies for temperate multispecies forests.
基金This research was funded by the National Natural Science Foundation of China(grant no.32271881).
文摘Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have developed from the logistic regression model,the geographical weighted logistic regression model,the Lasso regression model,the random forest model,and the support vector machine model based on historical forest fire data from 2000 to 2019 in Jilin Province.The models,along with a distribution map are presented in this paper to provide a theoretical basis for forest fire management in this area.Existing studies show that the prediction accuracies of the two machine learning models are higher than those of the three generalized linear regression models.The accuracies of the random forest model,the support vector machine model,geographical weighted logistic regression model,the Lasso regression model,and logistic model were 88.7%,87.7%,86.0%,85.0%and 84.6%,respectively.Weather is the main factor affecting forest fires,while the impacts of topography factors,human and social-economic factors on fire occurrence were similar.