In the assessment of car insurance claims,the claim rate for car insurance presents a highly skewed probability distribution,which is typically modeled using Tweedie distribution.The traditional approach to obtaining ...In the assessment of car insurance claims,the claim rate for car insurance presents a highly skewed probability distribution,which is typically modeled using Tweedie distribution.The traditional approach to obtaining the Tweedie regression model involves training on a centralized dataset,when the data is provided by multiple parties,training a privacy-preserving Tweedie regression model without exchanging raw data becomes a challenge.To address this issue,this study introduces a novel vertical federated learning-based Tweedie regression algorithm for multi-party auto insurance rate setting in data silos.The algorithm can keep sensitive data locally and uses privacy-preserving techniques to achieve intersection operations between the two parties holding the data.After determining which entities are shared,the participants train the model locally using the shared entity data to obtain the local generalized linear model intermediate parameters.The homomorphic encryption algorithms are introduced to interact with and update the model intermediate parameters to collaboratively complete the joint training of the car insurance rate-setting model.Performance tests on two publicly available datasets show that the proposed federated Tweedie regression algorithm can effectively generate Tweedie regression models that leverage the value of data fromboth partieswithout exchanging data.The assessment results of the scheme approach those of the Tweedie regressionmodel learned fromcentralized data,and outperformthe Tweedie regressionmodel learned independently by a single party.展开更多
Over the past decade,the presence of mistletoe(Viscum album ssp.austriacum)in Scots pine stands has increased in many European countries.Understanding the factors that influence the occurrence of mistletoe in stands i...Over the past decade,the presence of mistletoe(Viscum album ssp.austriacum)in Scots pine stands has increased in many European countries.Understanding the factors that influence the occurrence of mistletoe in stands is key to making appropriate forest management decisions to limit damage and prevent the spread of mistletoe in the future.Therefore,the main objective of this study was to determine the probability of mistletoe occurrence in Scots pine stands in relation to stand-related endogenous factors such as age,top height,and stand density,as well as topographic and edaphic factors.We used unmanned aerial vehicle(UAV)imagery from 2,247 stands to detect mistletoe in Scots pine stands,while majority stand and site characteristics were calculated from airborne laser scanning(ALS)data.Information on stand age and site type from the State Forest database were also used.We found that mistletoe infestation in Scots pine stands is influenced by stand and site characteristics.We documented that the densest,tallest,and oldest stands were more susceptible to mistletoe infestation.Site type and specific microsite conditions associated with topography were also important factors driving mistletoe occurrence.In addition,climatic water balance was a significant factor in increasing the probability of mistletoe occurrence,which is important in the context of predicted temperature increases associated with climate change.Our results are important for better understanding patterns of mistletoe infestation and ecosystem functioning under climate change.In an era of climate change and technological development,the use of remote sensing methods to determine the risk of mistletoe infestation can be a very useful tool for managing forest ecosystems to maintain forest sustainability and prevent forest disturbance.展开更多
The inflection point is an important feature of sigmoidal height-diameter(H-D)models.It is often cited as one of the properties favoring sigmoidal model forms.However,there are very few studies analyzing the inflectio...The inflection point is an important feature of sigmoidal height-diameter(H-D)models.It is often cited as one of the properties favoring sigmoidal model forms.However,there are very few studies analyzing the inflection points of H-D models.The goals of this study were to theoretically and empirically examine the behaviors of inflection points of six common H-D models with a regional dataset.The six models were the Wykoff(WYK),Schumacher(SCH),Curtis(CUR),HossfeldⅣ(HOS),von Bertalanffy-Richards(VBR),and Gompertz(GPZ)models.The models were first fitted in their base forms with tree species as random effects and were then expanded to include functional traits and spatial distribution.The distributions of the estimated inflection points were similar between the two-parameter models WYK,SCH,and CUR,but were different between the threeparameter models HOS,VBR,and GPZ.GPZ produced some of the largest inflection points.HOS and VBR produced concave H-D curves without inflection points for 12.7%and 39.7%of the tree species.Evergreen species or decreasing shade tolerance resulted in larger inflection points.The trends in the estimated inflection points of HOS and VBR were entirely opposite across the landscape.Furthermore,HOS could produce concave H-D curves for portions of the landscape.Based on the studied behaviors,the choice between two-parameter models may not matter.We recommend comparing seve ral three-parameter model forms for consistency in estimated inflection points before deciding on one.Believing sigmoidal models to have inflection points does not necessarily mean that they will produce fitted curves with one.Our study highlights the need to integrate analysis of inflection points into modeling H-D relationships.展开更多
Deep learning is capable of greatly promoting the progress of super-resolution imaging technology in terms of imaging and reconstruction speed,imaging resolution,and imagingflux.This paper proposes a deep neural netwo...Deep learning is capable of greatly promoting the progress of super-resolution imaging technology in terms of imaging and reconstruction speed,imaging resolution,and imagingflux.This paper proposes a deep neural network based on a generative adversarial network(GAN).The generator employs a U-Net-based network,which integrates Dense Net for the downsampling component.The proposed method has excellent properties,for example,the network model is trained with several different datasets of biological structures;the trained model can improve the imaging resolution of different microscopy imaging modalities such as confocal imaging and wide-field imaging;and the model demonstrates a generalized ability to improve the resolution of different biological structures even out of the datasets.In addition,experimental results showed that the method improved the resolution of caveolin-coated pits(CCPs)structures from 264 nm to 138 nm,a 1.91-fold increase,and nearly doubled the resolution of DNA molecules imaged while being transported through microfluidic channels.展开更多
Much of the world's biodiversity lies in heterogeneous mountain areas with their diverse environments.As an example,Iranian montane ranges are highly diverse,particularly in the Irano-Turanian phytogeographical re...Much of the world's biodiversity lies in heterogeneous mountain areas with their diverse environments.As an example,Iranian montane ranges are highly diverse,particularly in the Irano-Turanian phytogeographical region.Understanding plant diversity patterns with increasing elevation is of high significance,not least for conservation planning.We studied the pattern of species richness,Shannon diversity,endemic richness,endemics ratio,and richness of life forms along a 3900 m elevational transect in Mount Palvar,overlooking the Lut Desert in Southeast Iran.We also analyzed the effect of environmental variables on species turnover along the vertical gradient.A total of 120 vegetation plots(10 m×10 m)were sampled along the elevational transect containing species and environmental data.To discover plant diversity pattern along the elevational gradient,generalized additive model(GAM)was used.Non-metric multidimensional scaling(NMDS)was applied for illustrating the correlation between species composition and environmental variables.We found hump-shaped pattern for species richness,Shannon diversity,endemic richness,and species richness of different life forms,but a monotonic increasing pattern for ratio of endemic species from low to high elevations.Our study confirms the humped pattern of species richness peaking at intermediate elevations along a complete elevational gradient in a semi-arid mountain.The monotonic increase of endemics ratio with elevation in our area as a case study is consistent with global increase of endemism with elevation.According to our results,temperature and precipitation are two important climatic variables that drive elevational plant diversity,particularly in seasonally dry areas.Our study suggests that effective conservation and management are needed for this low latitude mountain area along with calling for long-term monitoring for species redistribution.展开更多
Multispecies forests have received increased scientific attention,driven by the hypothesis that biodiversity improves ecological resilience.However,a greater species diversity presents challenges for forest management...Multispecies forests have received increased scientific attention,driven by the hypothesis that biodiversity improves ecological resilience.However,a greater species diversity presents challenges for forest management and research.Our study aims to develop basal area growth models for tree species cohorts.The analysis is based on a dataset of 423 permanent plots(2,500 m^(2))located in temperate forests in Durango,Mexico.First,we define tree species cohorts based on individual and neighborhood-based variables using a combination of principal component and cluster analyses.Then,we estimate the basal area increment of each cohort through the generalized additive model to describe the effect of tree size,competition,stand density and site quality.The principal component and cluster analyses assign a total of 37 tree species to eight cohorts that differed primarily with regard to the distribution of tree size and vertical position within the community.The generalized additive models provide satisfactory estimates of tree growth for the species cohorts,explaining between 19 and 53 percent of the total variation of basal area increment,and highlight the following results:i)most cohorts show a"rise-and-fall"effect of tree size on tree growth;ii)surprisingly,the competition index"basal area of larger trees"had showed a positive effect in four of the eight cohorts;iii)stand density had a negative effect on basal area increment,though the effect was minor in medium-and high-density stands,and iv)basal area growth was positively correlated with site quality except for an oak cohort.The developed species cohorts and growth models provide insight into their particular ecological features and growth patterns that may support the development of sustainable management strategies for temperate multispecies forests.展开更多
Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have dev...Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have developed from the logistic regression model,the geographical weighted logistic regression model,the Lasso regression model,the random forest model,and the support vector machine model based on historical forest fire data from 2000 to 2019 in Jilin Province.The models,along with a distribution map are presented in this paper to provide a theoretical basis for forest fire management in this area.Existing studies show that the prediction accuracies of the two machine learning models are higher than those of the three generalized linear regression models.The accuracies of the random forest model,the support vector machine model,geographical weighted logistic regression model,the Lasso regression model,and logistic model were 88.7%,87.7%,86.0%,85.0%and 84.6%,respectively.Weather is the main factor affecting forest fires,while the impacts of topography factors,human and social-economic factors on fire occurrence were similar.展开更多
Climate change and increasing anthropogenic activities,such as over-exploitation of groundwater,are exerting unavoidable stress on groundwater resources.This study investigated the spatio-temporal variation of depth t...Climate change and increasing anthropogenic activities,such as over-exploitation of groundwater,are exerting unavoidable stress on groundwater resources.This study investigated the spatio-temporal variation of depth to groundwater level(DGWL)and the impacts of climatic(precipitation,maximum temperature,and minimum temperature)and anthropogenic(gross district product(GDP),population,and net irrigated area(NIA))variables on DGWL during 1994-2020.The study considered DGWL in 113 observation wells and piezometers located in arid western plains(Barmer and Jodhpur districts)and semi-arid eastern plains(Jaipur,Ajmer,Dausa,and Tonk districts)of Rajasthan State,India.Statistical methods were employed to examine the annual and seasonal patterns of DGWL,and the generalized additive model(GAM)was used to determine the impacts of climatic and anthropogenic variables on DGWL.During 1994-2020,except for Barmer District,where the mean annual DGWL was almost constant(around 26.50 m),all other districts exhibited increase in DGWL,with Ajmer District experiencing the most increase.The results also revealed that 36 observation wells and piezometers showed a statistically significant annual increasing trend in DGWL and 34 observation wells and piezometers exhibited a statistically significant decreasing trend in DGWL.Similarly,32 observation wells and piezometers showed an statistically significant increasing trend and 37 observation wells and piezometers showed a statistically significant decreasing trend in winter;33 observation wells and piezometers indicated a statistically significant increasing trend and 34 had a statistically significant decreasing trend in post-monsoon;35 observation wells and piezometers exhibited a statistically significant increasing trend and 32 observation wells and piezometers showed a statistically significant decreasing trend in pre-monsoon;and 36 observation wells and piezometers reflected a statistically significant increasing trend and 30 observation wells and piezometers reflected a statistically significant decreasing trend in monsoon.Interestingly,most of the observation wells and piezometers with increasing trends of DGWL were located in Dausa and Jaipur districts.Furthermore,the GAM analysis revealed that climatic variables,such as precipitation,significantly affected DGWL in Barmer District,and DGWL in all other districts was influenced by anthropogenic variables,including GDP,NIA,and population.As a result,stringent regulations should be implemented to curb excessive groundwater extraction,manage agricultural water demand,initiate proactive aquifer recharge programs,and strengthen sustainable management in these water-scarce regions.展开更多
The limited amount of data in the healthcare domain and the necessity of training samples for increased performance of deep learning models is a recurrent challenge,especially in medical imaging.Newborn Solutions aims...The limited amount of data in the healthcare domain and the necessity of training samples for increased performance of deep learning models is a recurrent challenge,especially in medical imaging.Newborn Solutions aims to enhance its non-invasive white blood cell counting device,Neosonics,by creating synthetic in vitro ultrasound images to facilitate a more efficient image generation process.This study addresses the data scarcity issue by designing and evaluating a continuous scalar conditional Generative Adversarial Network(GAN)to augment in vitro peritoneal dialysis ultrasound images,increasing both the volume and variability of training samples.The developed GAN architecture incorporates novel design features:varying kernel sizes in the generator’s transposed convolutional layers and a latent intermediate space,projecting noise and condition values for enhanced image resolution and specificity.The experimental results show that the GAN successfully generated diverse images of high visual quality,closely resembling real ultrasound samples.While visual results were promising,the use of GAN-based data augmentation did not consistently improve the performance of an image regressor in distinguishing features specific to varied white blood cell concentrations.Ultimately,while this continuous scalar conditional GAN model made strides in generating realistic images,further work is needed to achieve consistent gains in regression tasks,aiming for robust model generalization.展开更多
In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood e...In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood estimation is considered. Three diagnostic statistics are used to detect whether the outliers exist in the data set. Simulation results show that when the sample size is small, the values of diagnostic statistics based on the maximum Lq-likelihood estimation are greater than the values based on the maximum likelihood estimation. As the sample size increases, the difference between the values of the diagnostic statistics based on two estimation methods diminishes gradually. It means that the outliers can be distinguished easier through the maximum Lq-likelihood method than those through the maximum likelihood estimation method.展开更多
In this paper,we propose a novel coverless image steganographic scheme based on a generative model.In our scheme,the secret image is first fed to the generative model database,to generate a meaning-normal and independ...In this paper,we propose a novel coverless image steganographic scheme based on a generative model.In our scheme,the secret image is first fed to the generative model database,to generate a meaning-normal and independent image different from the secret image.The generated image is then transmitted to the receiver and fed to the generative model database to generate another image visually the same as the secret image.Thus,we only need to transmit the meaning-normal image which is not related to the secret image,and we can achieve the same effect as the transmission of the secret image.This is the first time to propose the coverless image information steganographic scheme based on generative model,compared with the traditional image steganography.The transmitted image is not embedded with any information of the secret image in this method,therefore,can effectively resist steganalysis tools.Experimental results show that our scheme has high capacity,security and reliability.展开更多
In this paper, a generalized layered model for radiation transfer in canopy with high vertical resolution is developed. Differing from the two-stream approximate radiation transfer model commonly used in the land surf...In this paper, a generalized layered model for radiation transfer in canopy with high vertical resolution is developed. Differing from the two-stream approximate radiation transfer model commonly used in the land surface models, the generalized model takes into account the effect of complicated canopy morphology and inhomogeneous optical properties of leaves on radiation transfer within the canopy. In the model, the total leaf area index (LAI) of the canopy is divided into many layers. At a given layer, the influences of diffuse radiation angle distributions and leaf angle distributions on radiation transfer within the canopy are considered. The derivation of equations serving the model are described in detail, and these can deal with various diffuse radiation transfers in quite broad categories of canopy with quite inhomogeneons vertical structures and uneven leaves with substantially different optical properties of adaxial and abaxial faces of the leaves. The model is used to simulate the radiation transfer for canopies with horizontal leaves to validate the generalized model. Results from the model are compared with those from the two-stream scheme, and differences between these two models are discussed.展开更多
Habitat suitability index(HSI)models have been widely used to analyze the relationship between species abundance and environmental factors,and ultimately inform management of marine species.The response of species abu...Habitat suitability index(HSI)models have been widely used to analyze the relationship between species abundance and environmental factors,and ultimately inform management of marine species.The response of species abundance to each environmental variable is different and habitat requirements may change over life history stages and seasons.Therefore,it is necessary to determine the optimal combination of environmental variables in HSI modelling.In this study,generalized additive models(GAMs)were used to determine which environmental variables to be included in the HSI models.Significant variables were retained and weighted in the HSI model according to their relative contribution(%)to the total deviation explained by the boosted regression tree(BRT).The HSI models were applied to evaluate the habitat suitability of mantis shrimp Oratosquilla oratoria in the Haizhou Bay and adjacent areas in 2011 and 2013–2017.Ontogenetic and seasonal variations in HSI models of mantis shrimp were also examined.Among the four models(non-optimized model,BRT informed HSI model,GAM informed HSI model,and both BRT and GAM informed HSI model),both BRT and GAM informed HSI model showed the best performance.Four environmental variables(bottom temperature,depth,distance offshore and sediment type)were selected in the HSI models for four groups(spring-juvenile,spring-adult,falljuvenile and fall-adult)of mantis shrimp.The distribution of habitat suitability showed similar patterns between juveniles and adults,but obvious seasonal variations were observed.This study suggests that the process of optimizing environmental variables in HSI models improves the performance of HSI models,and this optimization strategy could be extended to other marine organisms to enhance the understanding of the habitat suitability of target species.展开更多
Physical mechanisms and influencing factors on the effective stress coefficient for rock/soil-like porous materials are investigated, based on which equivalent connectivity index is proposed. The equivalent connectivi...Physical mechanisms and influencing factors on the effective stress coefficient for rock/soil-like porous materials are investigated, based on which equivalent connectivity index is proposed. The equivalent connectivity index, relying on the meso-scale structure of porous material and the property of liquid, denotes the connectivity of pores in Representative Element Area (REA). If the conductivity of the porous material is anisotropic, the equivalent connectivity index is a second order tensor. Based on the basic theories of continuous mechanics and tensor analysis, relationship between area porosity and volumetric porosity of porous materials is deduced. Then a generalized expression, describing the relation between effective stress coefficient tensor and equivalent connectivity tensor of pores, is proposed, and the expression can be applied to isotropic media and also to anisotropic materials. Furthermore, evolution of porosity and equivalent connectivity index of the pore are studied in the strain space, and the method to determine the corresponding functions in expressions above is proposed using genetic algorithm and genetic programming. Two applications show that the results obtained by the method in this paper perfectly agree with the test data. This paper provides an important theoretical support to the coupled hydro-mechanical research.展开更多
This study aims to provide a predictive vegetation mapping approach based on the spectral data, DEM and Generalized Additive Models (GAMs). GAMs were used as a prediction tool to describe the relationship between vege...This study aims to provide a predictive vegetation mapping approach based on the spectral data, DEM and Generalized Additive Models (GAMs). GAMs were used as a prediction tool to describe the relationship between vegetation and environmental variables, as well as spectral variables. Based on the fitted GAMs model, probability map of species occurrence was generated and then vegetation type of each grid was defined according to the probability of species occurrence. Deviance analysis was employed to test the goodness of curve fitting and drop contribution calculation was used to evaluate the contribution of each predictor in the fitted GAMs models. Area under curve (AUC) of Receiver Operating Characteristic (ROC) curve was employed to assess the results maps of probability. The results showed that: 1) AUC values of the fitted GAMs models are very high which proves that integrating spectral data and environmental variables based on the GAMs is a feasible way to map the vegetation. 2) Prediction accuracy varies with plant community, and community with dense cover is better predicted than sparse plant community. 3) Both spectral variables and environmental variables play an important role in mapping the vegetation. However, the contribution of the same predictor in the GAMs models for different plant communities is different. 4) Insufficient resolution of spectral data, environmental data and confounding effects of land use and other variables which are not closely related to the environmental conditions are the major causes of imprecision.展开更多
Based on the theoretical expression of the three-dimension rheologic inclusion model, we analyze in detail the spatio-temporal changes on the ground of the bulk-strain produced by a spherical rheologic inclusion in a ...Based on the theoretical expression of the three-dimension rheologic inclusion model, we analyze in detail the spatio-temporal changes on the ground of the bulk-strain produced by a spherical rheologic inclusion in a semi-infinite rheologic medium. The results show that the spatio-temporal change of bulk-strain produced by the hard inclusion has three stages of different characteristics, which are similar to most of those geodetic deformation curves, but those by a soft inclusion do not. The α-stage is a long stage in which the precursors in both the near source region and the far field develop from the focal region to the periphery. The β-stage indicates a very rapid propagation of the precursors, so that they almost appear everywhere. During the γ-stage, the precursors in the far-field converge from the periphery, and the precursors in the near source region develop outwards. The theoretical results have been used to explain tentatively the stage characteristics of the spatio-temporal change of earthquake precursors.展开更多
For reservoirs with complex non-Gaussian geological characteristics,such as carbonate reservoirs or reservoirs with sedimentary facies distribution,it is difficult to implement history matching directly,especially for...For reservoirs with complex non-Gaussian geological characteristics,such as carbonate reservoirs or reservoirs with sedimentary facies distribution,it is difficult to implement history matching directly,especially for the ensemble-based data assimilation methods.In this paper,we propose a multi-source information fused generative adversarial network(MSIGAN)model,which is used for parameterization of the complex geologies.In MSIGAN,various information such as facies distribution,microseismic,and inter-well connectivity,can be integrated to learn the geological features.And two major generative models in deep learning,variational autoencoder(VAE)and generative adversarial network(GAN)are combined in our model.Then the proposed MSIGAN model is integrated into the ensemble smoother with multiple data assimilation(ESMDA)method to conduct history matching.We tested the proposed method on two reservoir models with fluvial facies.The experimental results show that the proposed MSIGAN model can effectively learn the complex geological features,which can promote the accuracy of history matching.展开更多
Based on analyzing various factors influencing milled surface topography, firstly, a generalized model for milled surface topography is proposed. Secondly, using the principles of transformation matrix and vector oper...Based on analyzing various factors influencing milled surface topography, firstly, a generalized model for milled surface topography is proposed. Secondly, using the principles of transformation matrix and vector operation, the trajectory equation of cutting edge relative to workpiece is derived. Then, a three dimensional topography simulation algorithm is constructed through dividing the workpiece into regular grids. Finally, taking the peripheral milling process as an example, the generalized model is simplified, and the corresponding simulation examples are given. The results indicate that it is very efficient for the generalized model to be used to analyze and simulate the peripherally milled surface topography.展开更多
This research develops a new mathematical modeling method by combining industrial big data and process mechanism analysis under the framework of generalized additive models(GAM)to generate a practical model with gener...This research develops a new mathematical modeling method by combining industrial big data and process mechanism analysis under the framework of generalized additive models(GAM)to generate a practical model with generalization and precision.Specifically,the proposed modeling method includes the following steps.Firstly,the influence factors are screened using mechanism knowledge and data-mining methods.Secondly,the unary GAM without interactions including cleaning the data,building the sub-models,and verifying the sub-models.Subsequently,the interactions between the various factors are explored,and the binary GAM with interactions is constructed.The relationships among the sub-models are analyzed,and the integrated model is built.Finally,based on the proposed modeling method,two prediction models of mechanical property and deformation resistance for hot-rolled strips are established.Industrial actual data verification demonstrates that the new models have good prediction precision,and the mean absolute percentage errors of tensile strength,yield strength and deformation resistance are 2.54%,3.34%and 6.53%,respectively.And experimental results suggest that the proposed method offers a new approach to industrial process modeling.展开更多
The nonlinear stability of the three-layer generalized Phillips model, for which the velocity in each layeris constant and the top and bottom surfaces are either rigid or free, is studied by employing Arnol'd'...The nonlinear stability of the three-layer generalized Phillips model, for which the velocity in each layeris constant and the top and bottom surfaces are either rigid or free, is studied by employing Arnol'd'svariational principle and a prior estimate method. The nonlinear stability criteria are established. For comparison, the linear instability criteria are also obtained by using normal mode method. and the influences ofthe free parameter, β parameter and curvature in vertical profile of the horizontal velocity on the linear instability are discussed by use of the growth rate curves. The comparison between the nonlinear stability criterion and the linear one is made. It is shown that insome cases the two criteria are exactly the same in form, but in other cases, they are different. This phenomenon, which reveals the nonlinear property of the linear instability features. is explained by the explosiveresonant interaction (ERI). When there exists the ERI, i.e., the nonlinear mechanisms play a leading role inthe dynamical system. the nonlinear stability criterion is different from the linear one, on the other hand.when there does not exist the ERI. the nonlinear stability criterion is the same as the linear one in form.展开更多
基金This research was funded by the National Natural Science Foundation of China(No.62272124)the National Key Research and Development Program of China(No.2022YFB2701401)+3 种基金Guizhou Province Science and Technology Plan Project(Grant Nos.Qiankehe Paltform Talent[2020]5017)The Research Project of Guizhou University for Talent Introduction(No.[2020]61)the Cultivation Project of Guizhou University(No.[2019]56)the Open Fund of Key Laboratory of Advanced Manufacturing Technology,Ministry of Education(GZUAMT2021KF[01]).
文摘In the assessment of car insurance claims,the claim rate for car insurance presents a highly skewed probability distribution,which is typically modeled using Tweedie distribution.The traditional approach to obtaining the Tweedie regression model involves training on a centralized dataset,when the data is provided by multiple parties,training a privacy-preserving Tweedie regression model without exchanging raw data becomes a challenge.To address this issue,this study introduces a novel vertical federated learning-based Tweedie regression algorithm for multi-party auto insurance rate setting in data silos.The algorithm can keep sensitive data locally and uses privacy-preserving techniques to achieve intersection operations between the two parties holding the data.After determining which entities are shared,the participants train the model locally using the shared entity data to obtain the local generalized linear model intermediate parameters.The homomorphic encryption algorithms are introduced to interact with and update the model intermediate parameters to collaboratively complete the joint training of the car insurance rate-setting model.Performance tests on two publicly available datasets show that the proposed federated Tweedie regression algorithm can effectively generate Tweedie regression models that leverage the value of data fromboth partieswithout exchanging data.The assessment results of the scheme approach those of the Tweedie regressionmodel learned fromcentralized data,and outperformthe Tweedie regressionmodel learned independently by a single party.
基金funded by National Science Centre,Poland under the project"Assessment of the impact of weather conditions on forest health status and forest disturbances at regional and national scale based on the integration of ground and space-based remote sensing datasets"(project no.2021/41/B/ST10/)Data collection and research was also supported by the project no.EZ.271.3.19.2021"Modele ryzyka zamierania drzewostanow glownych gatunkow lasotworczych Polski"funded by the General Directorate of State Forests in Poland。
文摘Over the past decade,the presence of mistletoe(Viscum album ssp.austriacum)in Scots pine stands has increased in many European countries.Understanding the factors that influence the occurrence of mistletoe in stands is key to making appropriate forest management decisions to limit damage and prevent the spread of mistletoe in the future.Therefore,the main objective of this study was to determine the probability of mistletoe occurrence in Scots pine stands in relation to stand-related endogenous factors such as age,top height,and stand density,as well as topographic and edaphic factors.We used unmanned aerial vehicle(UAV)imagery from 2,247 stands to detect mistletoe in Scots pine stands,while majority stand and site characteristics were calculated from airborne laser scanning(ALS)data.Information on stand age and site type from the State Forest database were also used.We found that mistletoe infestation in Scots pine stands is influenced by stand and site characteristics.We documented that the densest,tallest,and oldest stands were more susceptible to mistletoe infestation.Site type and specific microsite conditions associated with topography were also important factors driving mistletoe occurrence.In addition,climatic water balance was a significant factor in increasing the probability of mistletoe occurrence,which is important in the context of predicted temperature increases associated with climate change.Our results are important for better understanding patterns of mistletoe infestation and ecosystem functioning under climate change.In an era of climate change and technological development,the use of remote sensing methods to determine the risk of mistletoe infestation can be a very useful tool for managing forest ecosystems to maintain forest sustainability and prevent forest disturbance.
文摘The inflection point is an important feature of sigmoidal height-diameter(H-D)models.It is often cited as one of the properties favoring sigmoidal model forms.However,there are very few studies analyzing the inflection points of H-D models.The goals of this study were to theoretically and empirically examine the behaviors of inflection points of six common H-D models with a regional dataset.The six models were the Wykoff(WYK),Schumacher(SCH),Curtis(CUR),HossfeldⅣ(HOS),von Bertalanffy-Richards(VBR),and Gompertz(GPZ)models.The models were first fitted in their base forms with tree species as random effects and were then expanded to include functional traits and spatial distribution.The distributions of the estimated inflection points were similar between the two-parameter models WYK,SCH,and CUR,but were different between the threeparameter models HOS,VBR,and GPZ.GPZ produced some of the largest inflection points.HOS and VBR produced concave H-D curves without inflection points for 12.7%and 39.7%of the tree species.Evergreen species or decreasing shade tolerance resulted in larger inflection points.The trends in the estimated inflection points of HOS and VBR were entirely opposite across the landscape.Furthermore,HOS could produce concave H-D curves for portions of the landscape.Based on the studied behaviors,the choice between two-parameter models may not matter.We recommend comparing seve ral three-parameter model forms for consistency in estimated inflection points before deciding on one.Believing sigmoidal models to have inflection points does not necessarily mean that they will produce fitted curves with one.Our study highlights the need to integrate analysis of inflection points into modeling H-D relationships.
基金Subjects funded by the National Natural Science Foundation of China(Nos.62275216 and 61775181)the Natural Science Basic Research Programme of Shaanxi Province-Major Basic Research Special Project(Nos.S2018-ZC-TD-0061 and TZ0393)the Special Project for the Development of National Key Scientific Instruments and Equipment No.(51927804).
文摘Deep learning is capable of greatly promoting the progress of super-resolution imaging technology in terms of imaging and reconstruction speed,imaging resolution,and imagingflux.This paper proposes a deep neural network based on a generative adversarial network(GAN).The generator employs a U-Net-based network,which integrates Dense Net for the downsampling component.The proposed method has excellent properties,for example,the network model is trained with several different datasets of biological structures;the trained model can improve the imaging resolution of different microscopy imaging modalities such as confocal imaging and wide-field imaging;and the model demonstrates a generalized ability to improve the resolution of different biological structures even out of the datasets.In addition,experimental results showed that the method improved the resolution of caveolin-coated pits(CCPs)structures from 264 nm to 138 nm,a 1.91-fold increase,and nearly doubled the resolution of DNA molecules imaged while being transported through microfluidic channels.
文摘Much of the world's biodiversity lies in heterogeneous mountain areas with their diverse environments.As an example,Iranian montane ranges are highly diverse,particularly in the Irano-Turanian phytogeographical region.Understanding plant diversity patterns with increasing elevation is of high significance,not least for conservation planning.We studied the pattern of species richness,Shannon diversity,endemic richness,endemics ratio,and richness of life forms along a 3900 m elevational transect in Mount Palvar,overlooking the Lut Desert in Southeast Iran.We also analyzed the effect of environmental variables on species turnover along the vertical gradient.A total of 120 vegetation plots(10 m×10 m)were sampled along the elevational transect containing species and environmental data.To discover plant diversity pattern along the elevational gradient,generalized additive model(GAM)was used.Non-metric multidimensional scaling(NMDS)was applied for illustrating the correlation between species composition and environmental variables.We found hump-shaped pattern for species richness,Shannon diversity,endemic richness,and species richness of different life forms,but a monotonic increasing pattern for ratio of endemic species from low to high elevations.Our study confirms the humped pattern of species richness peaking at intermediate elevations along a complete elevational gradient in a semi-arid mountain.The monotonic increase of endemics ratio with elevation in our area as a case study is consistent with global increase of endemism with elevation.According to our results,temperature and precipitation are two important climatic variables that drive elevational plant diversity,particularly in seasonally dry areas.Our study suggests that effective conservation and management are needed for this low latitude mountain area along with calling for long-term monitoring for species redistribution.
基金The National Forestry Commission of Mexico and The Mexican National Council for Science and Technology(CONAFOR-CONACYT-115900)。
文摘Multispecies forests have received increased scientific attention,driven by the hypothesis that biodiversity improves ecological resilience.However,a greater species diversity presents challenges for forest management and research.Our study aims to develop basal area growth models for tree species cohorts.The analysis is based on a dataset of 423 permanent plots(2,500 m^(2))located in temperate forests in Durango,Mexico.First,we define tree species cohorts based on individual and neighborhood-based variables using a combination of principal component and cluster analyses.Then,we estimate the basal area increment of each cohort through the generalized additive model to describe the effect of tree size,competition,stand density and site quality.The principal component and cluster analyses assign a total of 37 tree species to eight cohorts that differed primarily with regard to the distribution of tree size and vertical position within the community.The generalized additive models provide satisfactory estimates of tree growth for the species cohorts,explaining between 19 and 53 percent of the total variation of basal area increment,and highlight the following results:i)most cohorts show a"rise-and-fall"effect of tree size on tree growth;ii)surprisingly,the competition index"basal area of larger trees"had showed a positive effect in four of the eight cohorts;iii)stand density had a negative effect on basal area increment,though the effect was minor in medium-and high-density stands,and iv)basal area growth was positively correlated with site quality except for an oak cohort.The developed species cohorts and growth models provide insight into their particular ecological features and growth patterns that may support the development of sustainable management strategies for temperate multispecies forests.
基金This research was funded by the National Natural Science Foundation of China(grant no.32271881).
文摘Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have developed from the logistic regression model,the geographical weighted logistic regression model,the Lasso regression model,the random forest model,and the support vector machine model based on historical forest fire data from 2000 to 2019 in Jilin Province.The models,along with a distribution map are presented in this paper to provide a theoretical basis for forest fire management in this area.Existing studies show that the prediction accuracies of the two machine learning models are higher than those of the three generalized linear regression models.The accuracies of the random forest model,the support vector machine model,geographical weighted logistic regression model,the Lasso regression model,and logistic model were 88.7%,87.7%,86.0%,85.0%and 84.6%,respectively.Weather is the main factor affecting forest fires,while the impacts of topography factors,human and social-economic factors on fire occurrence were similar.
文摘Climate change and increasing anthropogenic activities,such as over-exploitation of groundwater,are exerting unavoidable stress on groundwater resources.This study investigated the spatio-temporal variation of depth to groundwater level(DGWL)and the impacts of climatic(precipitation,maximum temperature,and minimum temperature)and anthropogenic(gross district product(GDP),population,and net irrigated area(NIA))variables on DGWL during 1994-2020.The study considered DGWL in 113 observation wells and piezometers located in arid western plains(Barmer and Jodhpur districts)and semi-arid eastern plains(Jaipur,Ajmer,Dausa,and Tonk districts)of Rajasthan State,India.Statistical methods were employed to examine the annual and seasonal patterns of DGWL,and the generalized additive model(GAM)was used to determine the impacts of climatic and anthropogenic variables on DGWL.During 1994-2020,except for Barmer District,where the mean annual DGWL was almost constant(around 26.50 m),all other districts exhibited increase in DGWL,with Ajmer District experiencing the most increase.The results also revealed that 36 observation wells and piezometers showed a statistically significant annual increasing trend in DGWL and 34 observation wells and piezometers exhibited a statistically significant decreasing trend in DGWL.Similarly,32 observation wells and piezometers showed an statistically significant increasing trend and 37 observation wells and piezometers showed a statistically significant decreasing trend in winter;33 observation wells and piezometers indicated a statistically significant increasing trend and 34 had a statistically significant decreasing trend in post-monsoon;35 observation wells and piezometers exhibited a statistically significant increasing trend and 32 observation wells and piezometers showed a statistically significant decreasing trend in pre-monsoon;and 36 observation wells and piezometers reflected a statistically significant increasing trend and 30 observation wells and piezometers reflected a statistically significant decreasing trend in monsoon.Interestingly,most of the observation wells and piezometers with increasing trends of DGWL were located in Dausa and Jaipur districts.Furthermore,the GAM analysis revealed that climatic variables,such as precipitation,significantly affected DGWL in Barmer District,and DGWL in all other districts was influenced by anthropogenic variables,including GDP,NIA,and population.As a result,stringent regulations should be implemented to curb excessive groundwater extraction,manage agricultural water demand,initiate proactive aquifer recharge programs,and strengthen sustainable management in these water-scarce regions.
文摘The limited amount of data in the healthcare domain and the necessity of training samples for increased performance of deep learning models is a recurrent challenge,especially in medical imaging.Newborn Solutions aims to enhance its non-invasive white blood cell counting device,Neosonics,by creating synthetic in vitro ultrasound images to facilitate a more efficient image generation process.This study addresses the data scarcity issue by designing and evaluating a continuous scalar conditional Generative Adversarial Network(GAN)to augment in vitro peritoneal dialysis ultrasound images,increasing both the volume and variability of training samples.The developed GAN architecture incorporates novel design features:varying kernel sizes in the generator’s transposed convolutional layers and a latent intermediate space,projecting noise and condition values for enhanced image resolution and specificity.The experimental results show that the GAN successfully generated diverse images of high visual quality,closely resembling real ultrasound samples.While visual results were promising,the use of GAN-based data augmentation did not consistently improve the performance of an image regressor in distinguishing features specific to varied white blood cell concentrations.Ultimately,while this continuous scalar conditional GAN model made strides in generating realistic images,further work is needed to achieve consistent gains in regression tasks,aiming for robust model generalization.
基金The National Natural Science Foundation of China(No.11171065)the Natural Science Foundation of Jiangsu Province(No.BK2011058)
文摘In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood estimation is considered. Three diagnostic statistics are used to detect whether the outliers exist in the data set. Simulation results show that when the sample size is small, the values of diagnostic statistics based on the maximum Lq-likelihood estimation are greater than the values based on the maximum likelihood estimation. As the sample size increases, the difference between the values of the diagnostic statistics based on two estimation methods diminishes gradually. It means that the outliers can be distinguished easier through the maximum Lq-likelihood method than those through the maximum likelihood estimation method.
基金This paper was supported by the National Natural Science Foundation of China(No.U1204606)the Key Programs for Science and Technology Development of Henan Province(No.172102210335)Key Scientific Research Projects in Henan Universities(No.16A520058).
文摘In this paper,we propose a novel coverless image steganographic scheme based on a generative model.In our scheme,the secret image is first fed to the generative model database,to generate a meaning-normal and independent image different from the secret image.The generated image is then transmitted to the receiver and fed to the generative model database to generate another image visually the same as the secret image.Thus,we only need to transmit the meaning-normal image which is not related to the secret image,and we can achieve the same effect as the transmission of the secret image.This is the first time to propose the coverless image information steganographic scheme based on generative model,compared with the traditional image steganography.The transmitted image is not embedded with any information of the secret image in this method,therefore,can effectively resist steganalysis tools.Experimental results show that our scheme has high capacity,security and reliability.
文摘In this paper, a generalized layered model for radiation transfer in canopy with high vertical resolution is developed. Differing from the two-stream approximate radiation transfer model commonly used in the land surface models, the generalized model takes into account the effect of complicated canopy morphology and inhomogeneous optical properties of leaves on radiation transfer within the canopy. In the model, the total leaf area index (LAI) of the canopy is divided into many layers. At a given layer, the influences of diffuse radiation angle distributions and leaf angle distributions on radiation transfer within the canopy are considered. The derivation of equations serving the model are described in detail, and these can deal with various diffuse radiation transfers in quite broad categories of canopy with quite inhomogeneons vertical structures and uneven leaves with substantially different optical properties of adaxial and abaxial faces of the leaves. The model is used to simulate the radiation transfer for canopies with horizontal leaves to validate the generalized model. Results from the model are compared with those from the two-stream scheme, and differences between these two models are discussed.
基金The National Key R&D Program of China under contract No.2017YFE0104400the National Natural Science Foundation of China under contract No.31772852the Marine S&T Fund of Shandong Province for Pilot National Laboratory for Marine Science and Technology(Qingdao)under contract No.2018SDKJ0501-2。
文摘Habitat suitability index(HSI)models have been widely used to analyze the relationship between species abundance and environmental factors,and ultimately inform management of marine species.The response of species abundance to each environmental variable is different and habitat requirements may change over life history stages and seasons.Therefore,it is necessary to determine the optimal combination of environmental variables in HSI modelling.In this study,generalized additive models(GAMs)were used to determine which environmental variables to be included in the HSI models.Significant variables were retained and weighted in the HSI model according to their relative contribution(%)to the total deviation explained by the boosted regression tree(BRT).The HSI models were applied to evaluate the habitat suitability of mantis shrimp Oratosquilla oratoria in the Haizhou Bay and adjacent areas in 2011 and 2013–2017.Ontogenetic and seasonal variations in HSI models of mantis shrimp were also examined.Among the four models(non-optimized model,BRT informed HSI model,GAM informed HSI model,and both BRT and GAM informed HSI model),both BRT and GAM informed HSI model showed the best performance.Four environmental variables(bottom temperature,depth,distance offshore and sediment type)were selected in the HSI models for four groups(spring-juvenile,spring-adult,falljuvenile and fall-adult)of mantis shrimp.The distribution of habitat suitability showed similar patterns between juveniles and adults,but obvious seasonal variations were observed.This study suggests that the process of optimizing environmental variables in HSI models improves the performance of HSI models,and this optimization strategy could be extended to other marine organisms to enhance the understanding of the habitat suitability of target species.
基金supported by the Yalongjiang River Joint Fund by the National Natural Science Foundation of China(NSFC)Ertan Hydropower Development Company,LTD(Nos.50579091 and 50539090)+1 种基金NSFC(No.10772190)Major State Basic Research Project of China(No.2002CB412708)
文摘Physical mechanisms and influencing factors on the effective stress coefficient for rock/soil-like porous materials are investigated, based on which equivalent connectivity index is proposed. The equivalent connectivity index, relying on the meso-scale structure of porous material and the property of liquid, denotes the connectivity of pores in Representative Element Area (REA). If the conductivity of the porous material is anisotropic, the equivalent connectivity index is a second order tensor. Based on the basic theories of continuous mechanics and tensor analysis, relationship between area porosity and volumetric porosity of porous materials is deduced. Then a generalized expression, describing the relation between effective stress coefficient tensor and equivalent connectivity tensor of pores, is proposed, and the expression can be applied to isotropic media and also to anisotropic materials. Furthermore, evolution of porosity and equivalent connectivity index of the pore are studied in the strain space, and the method to determine the corresponding functions in expressions above is proposed using genetic algorithm and genetic programming. Two applications show that the results obtained by the method in this paper perfectly agree with the test data. This paper provides an important theoretical support to the coupled hydro-mechanical research.
基金Under the auspices of National Natural Science Foundation of China(No.41001363)
文摘This study aims to provide a predictive vegetation mapping approach based on the spectral data, DEM and Generalized Additive Models (GAMs). GAMs were used as a prediction tool to describe the relationship between vegetation and environmental variables, as well as spectral variables. Based on the fitted GAMs model, probability map of species occurrence was generated and then vegetation type of each grid was defined according to the probability of species occurrence. Deviance analysis was employed to test the goodness of curve fitting and drop contribution calculation was used to evaluate the contribution of each predictor in the fitted GAMs models. Area under curve (AUC) of Receiver Operating Characteristic (ROC) curve was employed to assess the results maps of probability. The results showed that: 1) AUC values of the fitted GAMs models are very high which proves that integrating spectral data and environmental variables based on the GAMs is a feasible way to map the vegetation. 2) Prediction accuracy varies with plant community, and community with dense cover is better predicted than sparse plant community. 3) Both spectral variables and environmental variables play an important role in mapping the vegetation. However, the contribution of the same predictor in the GAMs models for different plant communities is different. 4) Insufficient resolution of spectral data, environmental data and confounding effects of land use and other variables which are not closely related to the environmental conditions are the major causes of imprecision.
文摘Based on the theoretical expression of the three-dimension rheologic inclusion model, we analyze in detail the spatio-temporal changes on the ground of the bulk-strain produced by a spherical rheologic inclusion in a semi-infinite rheologic medium. The results show that the spatio-temporal change of bulk-strain produced by the hard inclusion has three stages of different characteristics, which are similar to most of those geodetic deformation curves, but those by a soft inclusion do not. The α-stage is a long stage in which the precursors in both the near source region and the far field develop from the focal region to the periphery. The β-stage indicates a very rapid propagation of the precursors, so that they almost appear everywhere. During the γ-stage, the precursors in the far-field converge from the periphery, and the precursors in the near source region develop outwards. The theoretical results have been used to explain tentatively the stage characteristics of the spatio-temporal change of earthquake precursors.
基金supported by the National Natural Science Foundation of China under Grant 51722406,52074340,and 51874335the Shandong Provincial Natural Science Foundation under Grant JQ201808+5 种基金The Fundamental Research Funds for the Central Universities under Grant 18CX02097Athe Major Scientific and Technological Projects of CNPC under Grant ZD2019-183-008the Science and Technology Support Plan for Youth Innovation of University in Shandong Province under Grant 2019KJH002the National Research Council of Science and Technology Major Project of China under Grant 2016ZX05025001-006111 Project under Grant B08028Sinopec Science and Technology Project under Grant P20050-1
文摘For reservoirs with complex non-Gaussian geological characteristics,such as carbonate reservoirs or reservoirs with sedimentary facies distribution,it is difficult to implement history matching directly,especially for the ensemble-based data assimilation methods.In this paper,we propose a multi-source information fused generative adversarial network(MSIGAN)model,which is used for parameterization of the complex geologies.In MSIGAN,various information such as facies distribution,microseismic,and inter-well connectivity,can be integrated to learn the geological features.And two major generative models in deep learning,variational autoencoder(VAE)and generative adversarial network(GAN)are combined in our model.Then the proposed MSIGAN model is integrated into the ensemble smoother with multiple data assimilation(ESMDA)method to conduct history matching.We tested the proposed method on two reservoir models with fluvial facies.The experimental results show that the proposed MSIGAN model can effectively learn the complex geological features,which can promote the accuracy of history matching.
文摘Based on analyzing various factors influencing milled surface topography, firstly, a generalized model for milled surface topography is proposed. Secondly, using the principles of transformation matrix and vector operation, the trajectory equation of cutting edge relative to workpiece is derived. Then, a three dimensional topography simulation algorithm is constructed through dividing the workpiece into regular grids. Finally, taking the peripheral milling process as an example, the generalized model is simplified, and the corresponding simulation examples are given. The results indicate that it is very efficient for the generalized model to be used to analyze and simulate the peripherally milled surface topography.
基金Project(51774219)supported by the National Natural Science Foundation of China
文摘This research develops a new mathematical modeling method by combining industrial big data and process mechanism analysis under the framework of generalized additive models(GAM)to generate a practical model with generalization and precision.Specifically,the proposed modeling method includes the following steps.Firstly,the influence factors are screened using mechanism knowledge and data-mining methods.Secondly,the unary GAM without interactions including cleaning the data,building the sub-models,and verifying the sub-models.Subsequently,the interactions between the various factors are explored,and the binary GAM with interactions is constructed.The relationships among the sub-models are analyzed,and the integrated model is built.Finally,based on the proposed modeling method,two prediction models of mechanical property and deformation resistance for hot-rolled strips are established.Industrial actual data verification demonstrates that the new models have good prediction precision,and the mean absolute percentage errors of tensile strength,yield strength and deformation resistance are 2.54%,3.34%and 6.53%,respectively.And experimental results suggest that the proposed method offers a new approach to industrial process modeling.
文摘The nonlinear stability of the three-layer generalized Phillips model, for which the velocity in each layeris constant and the top and bottom surfaces are either rigid or free, is studied by employing Arnol'd'svariational principle and a prior estimate method. The nonlinear stability criteria are established. For comparison, the linear instability criteria are also obtained by using normal mode method. and the influences ofthe free parameter, β parameter and curvature in vertical profile of the horizontal velocity on the linear instability are discussed by use of the growth rate curves. The comparison between the nonlinear stability criterion and the linear one is made. It is shown that insome cases the two criteria are exactly the same in form, but in other cases, they are different. This phenomenon, which reveals the nonlinear property of the linear instability features. is explained by the explosiveresonant interaction (ERI). When there exists the ERI, i.e., the nonlinear mechanisms play a leading role inthe dynamical system. the nonlinear stability criterion is different from the linear one, on the other hand.when there does not exist the ERI. the nonlinear stability criterion is the same as the linear one in form.