The inflection point is an important feature of sigmoidal height-diameter(H-D)models.It is often cited as one of the properties favoring sigmoidal model forms.However,there are very few studies analyzing the inflectio...The inflection point is an important feature of sigmoidal height-diameter(H-D)models.It is often cited as one of the properties favoring sigmoidal model forms.However,there are very few studies analyzing the inflection points of H-D models.The goals of this study were to theoretically and empirically examine the behaviors of inflection points of six common H-D models with a regional dataset.The six models were the Wykoff(WYK),Schumacher(SCH),Curtis(CUR),HossfeldⅣ(HOS),von Bertalanffy-Richards(VBR),and Gompertz(GPZ)models.The models were first fitted in their base forms with tree species as random effects and were then expanded to include functional traits and spatial distribution.The distributions of the estimated inflection points were similar between the two-parameter models WYK,SCH,and CUR,but were different between the threeparameter models HOS,VBR,and GPZ.GPZ produced some of the largest inflection points.HOS and VBR produced concave H-D curves without inflection points for 12.7%and 39.7%of the tree species.Evergreen species or decreasing shade tolerance resulted in larger inflection points.The trends in the estimated inflection points of HOS and VBR were entirely opposite across the landscape.Furthermore,HOS could produce concave H-D curves for portions of the landscape.Based on the studied behaviors,the choice between two-parameter models may not matter.We recommend comparing seve ral three-parameter model forms for consistency in estimated inflection points before deciding on one.Believing sigmoidal models to have inflection points does not necessarily mean that they will produce fitted curves with one.Our study highlights the need to integrate analysis of inflection points into modeling H-D relationships.展开更多
In the assessment of car insurance claims,the claim rate for car insurance presents a highly skewed probability distribution,which is typically modeled using Tweedie distribution.The traditional approach to obtaining ...In the assessment of car insurance claims,the claim rate for car insurance presents a highly skewed probability distribution,which is typically modeled using Tweedie distribution.The traditional approach to obtaining the Tweedie regression model involves training on a centralized dataset,when the data is provided by multiple parties,training a privacy-preserving Tweedie regression model without exchanging raw data becomes a challenge.To address this issue,this study introduces a novel vertical federated learning-based Tweedie regression algorithm for multi-party auto insurance rate setting in data silos.The algorithm can keep sensitive data locally and uses privacy-preserving techniques to achieve intersection operations between the two parties holding the data.After determining which entities are shared,the participants train the model locally using the shared entity data to obtain the local generalized linear model intermediate parameters.The homomorphic encryption algorithms are introduced to interact with and update the model intermediate parameters to collaboratively complete the joint training of the car insurance rate-setting model.Performance tests on two publicly available datasets show that the proposed federated Tweedie regression algorithm can effectively generate Tweedie regression models that leverage the value of data fromboth partieswithout exchanging data.The assessment results of the scheme approach those of the Tweedie regressionmodel learned fromcentralized data,and outperformthe Tweedie regressionmodel learned independently by a single party.展开更多
Deep learning is capable of greatly promoting the progress of super-resolution imaging technology in terms of imaging and reconstruction speed,imaging resolution,and imagingflux.This paper proposes a deep neural netwo...Deep learning is capable of greatly promoting the progress of super-resolution imaging technology in terms of imaging and reconstruction speed,imaging resolution,and imagingflux.This paper proposes a deep neural network based on a generative adversarial network(GAN).The generator employs a U-Net-based network,which integrates Dense Net for the downsampling component.The proposed method has excellent properties,for example,the network model is trained with several different datasets of biological structures;the trained model can improve the imaging resolution of different microscopy imaging modalities such as confocal imaging and wide-field imaging;and the model demonstrates a generalized ability to improve the resolution of different biological structures even out of the datasets.In addition,experimental results showed that the method improved the resolution of caveolin-coated pits(CCPs)structures from 264 nm to 138 nm,a 1.91-fold increase,and nearly doubled the resolution of DNA molecules imaged while being transported through microfluidic channels.展开更多
Much of the world's biodiversity lies in heterogeneous mountain areas with their diverse environments.As an example,Iranian montane ranges are highly diverse,particularly in the Irano-Turanian phytogeographical re...Much of the world's biodiversity lies in heterogeneous mountain areas with their diverse environments.As an example,Iranian montane ranges are highly diverse,particularly in the Irano-Turanian phytogeographical region.Understanding plant diversity patterns with increasing elevation is of high significance,not least for conservation planning.We studied the pattern of species richness,Shannon diversity,endemic richness,endemics ratio,and richness of life forms along a 3900 m elevational transect in Mount Palvar,overlooking the Lut Desert in Southeast Iran.We also analyzed the effect of environmental variables on species turnover along the vertical gradient.A total of 120 vegetation plots(10 m×10 m)were sampled along the elevational transect containing species and environmental data.To discover plant diversity pattern along the elevational gradient,generalized additive model(GAM)was used.Non-metric multidimensional scaling(NMDS)was applied for illustrating the correlation between species composition and environmental variables.We found hump-shaped pattern for species richness,Shannon diversity,endemic richness,and species richness of different life forms,but a monotonic increasing pattern for ratio of endemic species from low to high elevations.Our study confirms the humped pattern of species richness peaking at intermediate elevations along a complete elevational gradient in a semi-arid mountain.The monotonic increase of endemics ratio with elevation in our area as a case study is consistent with global increase of endemism with elevation.According to our results,temperature and precipitation are two important climatic variables that drive elevational plant diversity,particularly in seasonally dry areas.Our study suggests that effective conservation and management are needed for this low latitude mountain area along with calling for long-term monitoring for species redistribution.展开更多
Over the past decade,the presence of mistletoe(Viscum album ssp.austriacum)in Scots pine stands has increased in many European countries.Understanding the factors that influence the occurrence of mistletoe in stands i...Over the past decade,the presence of mistletoe(Viscum album ssp.austriacum)in Scots pine stands has increased in many European countries.Understanding the factors that influence the occurrence of mistletoe in stands is key to making appropriate forest management decisions to limit damage and prevent the spread of mistletoe in the future.Therefore,the main objective of this study was to determine the probability of mistletoe occurrence in Scots pine stands in relation to stand-related endogenous factors such as age,top height,and stand density,as well as topographic and edaphic factors.We used unmanned aerial vehicle(UAV)imagery from 2,247 stands to detect mistletoe in Scots pine stands,while majority stand and site characteristics were calculated from airborne laser scanning(ALS)data.Information on stand age and site type from the State Forest database were also used.We found that mistletoe infestation in Scots pine stands is influenced by stand and site characteristics.We documented that the densest,tallest,and oldest stands were more susceptible to mistletoe infestation.Site type and specific microsite conditions associated with topography were also important factors driving mistletoe occurrence.In addition,climatic water balance was a significant factor in increasing the probability of mistletoe occurrence,which is important in the context of predicted temperature increases associated with climate change.Our results are important for better understanding patterns of mistletoe infestation and ecosystem functioning under climate change.In an era of climate change and technological development,the use of remote sensing methods to determine the risk of mistletoe infestation can be a very useful tool for managing forest ecosystems to maintain forest sustainability and prevent forest disturbance.展开更多
Multispecies forests have received increased scientific attention,driven by the hypothesis that biodiversity improves ecological resilience.However,a greater species diversity presents challenges for forest management...Multispecies forests have received increased scientific attention,driven by the hypothesis that biodiversity improves ecological resilience.However,a greater species diversity presents challenges for forest management and research.Our study aims to develop basal area growth models for tree species cohorts.The analysis is based on a dataset of 423 permanent plots(2,500 m^(2))located in temperate forests in Durango,Mexico.First,we define tree species cohorts based on individual and neighborhood-based variables using a combination of principal component and cluster analyses.Then,we estimate the basal area increment of each cohort through the generalized additive model to describe the effect of tree size,competition,stand density and site quality.The principal component and cluster analyses assign a total of 37 tree species to eight cohorts that differed primarily with regard to the distribution of tree size and vertical position within the community.The generalized additive models provide satisfactory estimates of tree growth for the species cohorts,explaining between 19 and 53 percent of the total variation of basal area increment,and highlight the following results:i)most cohorts show a"rise-and-fall"effect of tree size on tree growth;ii)surprisingly,the competition index"basal area of larger trees"had showed a positive effect in four of the eight cohorts;iii)stand density had a negative effect on basal area increment,though the effect was minor in medium-and high-density stands,and iv)basal area growth was positively correlated with site quality except for an oak cohort.The developed species cohorts and growth models provide insight into their particular ecological features and growth patterns that may support the development of sustainable management strategies for temperate multispecies forests.展开更多
Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have dev...Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have developed from the logistic regression model,the geographical weighted logistic regression model,the Lasso regression model,the random forest model,and the support vector machine model based on historical forest fire data from 2000 to 2019 in Jilin Province.The models,along with a distribution map are presented in this paper to provide a theoretical basis for forest fire management in this area.Existing studies show that the prediction accuracies of the two machine learning models are higher than those of the three generalized linear regression models.The accuracies of the random forest model,the support vector machine model,geographical weighted logistic regression model,the Lasso regression model,and logistic model were 88.7%,87.7%,86.0%,85.0%and 84.6%,respectively.Weather is the main factor affecting forest fires,while the impacts of topography factors,human and social-economic factors on fire occurrence were similar.展开更多
Climate change and increasing anthropogenic activities,such as over-exploitation of groundwater,are exerting unavoidable stress on groundwater resources.This study investigated the spatio-temporal variation of depth t...Climate change and increasing anthropogenic activities,such as over-exploitation of groundwater,are exerting unavoidable stress on groundwater resources.This study investigated the spatio-temporal variation of depth to groundwater level(DGWL)and the impacts of climatic(precipitation,maximum temperature,and minimum temperature)and anthropogenic(gross district product(GDP),population,and net irrigated area(NIA))variables on DGWL during 1994-2020.The study considered DGWL in 113 observation wells and piezometers located in arid western plains(Barmer and Jodhpur districts)and semi-arid eastern plains(Jaipur,Ajmer,Dausa,and Tonk districts)of Rajasthan State,India.Statistical methods were employed to examine the annual and seasonal patterns of DGWL,and the generalized additive model(GAM)was used to determine the impacts of climatic and anthropogenic variables on DGWL.During 1994-2020,except for Barmer District,where the mean annual DGWL was almost constant(around 26.50 m),all other districts exhibited increase in DGWL,with Ajmer District experiencing the most increase.The results also revealed that 36 observation wells and piezometers showed a statistically significant annual increasing trend in DGWL and 34 observation wells and piezometers exhibited a statistically significant decreasing trend in DGWL.Similarly,32 observation wells and piezometers showed an statistically significant increasing trend and 37 observation wells and piezometers showed a statistically significant decreasing trend in winter;33 observation wells and piezometers indicated a statistically significant increasing trend and 34 had a statistically significant decreasing trend in post-monsoon;35 observation wells and piezometers exhibited a statistically significant increasing trend and 32 observation wells and piezometers showed a statistically significant decreasing trend in pre-monsoon;and 36 observation wells and piezometers reflected a statistically significant increasing trend and 30 observation wells and piezometers reflected a statistically significant decreasing trend in monsoon.Interestingly,most of the observation wells and piezometers with increasing trends of DGWL were located in Dausa and Jaipur districts.Furthermore,the GAM analysis revealed that climatic variables,such as precipitation,significantly affected DGWL in Barmer District,and DGWL in all other districts was influenced by anthropogenic variables,including GDP,NIA,and population.As a result,stringent regulations should be implemented to curb excessive groundwater extraction,manage agricultural water demand,initiate proactive aquifer recharge programs,and strengthen sustainable management in these water-scarce regions.展开更多
The limited amount of data in the healthcare domain and the necessity of training samples for increased performance of deep learning models is a recurrent challenge,especially in medical imaging.Newborn Solutions aims...The limited amount of data in the healthcare domain and the necessity of training samples for increased performance of deep learning models is a recurrent challenge,especially in medical imaging.Newborn Solutions aims to enhance its non-invasive white blood cell counting device,Neosonics,by creating synthetic in vitro ultrasound images to facilitate a more efficient image generation process.This study addresses the data scarcity issue by designing and evaluating a continuous scalar conditional Generative Adversarial Network(GAN)to augment in vitro peritoneal dialysis ultrasound images,increasing both the volume and variability of training samples.The developed GAN architecture incorporates novel design features:varying kernel sizes in the generator’s transposed convolutional layers and a latent intermediate space,projecting noise and condition values for enhanced image resolution and specificity.The experimental results show that the GAN successfully generated diverse images of high visual quality,closely resembling real ultrasound samples.While visual results were promising,the use of GAN-based data augmentation did not consistently improve the performance of an image regressor in distinguishing features specific to varied white blood cell concentrations.Ultimately,while this continuous scalar conditional GAN model made strides in generating realistic images,further work is needed to achieve consistent gains in regression tasks,aiming for robust model generalization.展开更多
In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood e...In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood estimation is considered. Three diagnostic statistics are used to detect whether the outliers exist in the data set. Simulation results show that when the sample size is small, the values of diagnostic statistics based on the maximum Lq-likelihood estimation are greater than the values based on the maximum likelihood estimation. As the sample size increases, the difference between the values of the diagnostic statistics based on two estimation methods diminishes gradually. It means that the outliers can be distinguished easier through the maximum Lq-likelihood method than those through the maximum likelihood estimation method.展开更多
This study aims to provide a predictive vegetation mapping approach based on the spectral data, DEM and Generalized Additive Models (GAMs). GAMs were used as a prediction tool to describe the relationship between vege...This study aims to provide a predictive vegetation mapping approach based on the spectral data, DEM and Generalized Additive Models (GAMs). GAMs were used as a prediction tool to describe the relationship between vegetation and environmental variables, as well as spectral variables. Based on the fitted GAMs model, probability map of species occurrence was generated and then vegetation type of each grid was defined according to the probability of species occurrence. Deviance analysis was employed to test the goodness of curve fitting and drop contribution calculation was used to evaluate the contribution of each predictor in the fitted GAMs models. Area under curve (AUC) of Receiver Operating Characteristic (ROC) curve was employed to assess the results maps of probability. The results showed that: 1) AUC values of the fitted GAMs models are very high which proves that integrating spectral data and environmental variables based on the GAMs is a feasible way to map the vegetation. 2) Prediction accuracy varies with plant community, and community with dense cover is better predicted than sparse plant community. 3) Both spectral variables and environmental variables play an important role in mapping the vegetation. However, the contribution of the same predictor in the GAMs models for different plant communities is different. 4) Insufficient resolution of spectral data, environmental data and confounding effects of land use and other variables which are not closely related to the environmental conditions are the major causes of imprecision.展开更多
Photonic inverse design concerns the problem of finding photonic structures with target optical properties.However,traditional methods based on optimization algorithms are time-consuming and computationally expensive....Photonic inverse design concerns the problem of finding photonic structures with target optical properties.However,traditional methods based on optimization algorithms are time-consuming and computationally expensive.Recently,deep learning-based approaches have been developed to tackle the problem of inverse design efficiently.Although most of these neural network models have demonstrated high accuracy in different inverse design problems,no previous study has examined the potential effects under given constraints in nanomanufacturing.Additionally,the relative strength of different deep learning-based inverse design approaches has not been fully investigated.Here,we benchmark three commonly used deep learning models in inverse design:Tandem networks,Variational Auto-Encoders,and Generative Adversarial Networks.We provide detailed comparisons in terms of their accuracy,diversity,and robustness.We find that tandem networks and Variational Auto-Encoders give the best accuracy,while Generative Adversarial Networks lead to the most diverse predictions.Our findings could serve as a guideline for researchers to select the model that can best suit their design criteria and fabrication considerations.In addition,our code and data are publicly available,which could be used for future inverse design model development and benchmarking.展开更多
Species distribution models are used to aid our understanding of the processes driving the spatial patterns of species’ habitats. This approach has received criticism, however, largely because it neglects landscape m...Species distribution models are used to aid our understanding of the processes driving the spatial patterns of species’ habitats. This approach has received criticism, however, largely because it neglects landscape metrics. We examined the relative impacts of landscape predictors on the accuracy of habitat models by constructing distribution models at regional scales incorporating environmental variables (climate, topography, vegetation, and soil types) and secondary species occurrence data, and using them to predict the occurrence of 36 species in 15 forest fragments where we conducted rapid surveys. We then selected six landscape predictors at the landscape scale and ran general linear models of species presence/absence with either a single scale predictor (the probabilities of occurrence of the distribution models or landscape variables) or multiple scale predictors (distribution models + one landscape variable). Our results indicated that distribution models alone had poor predictive abilities but were improved when landscape predictors were added; the species responses were not, however, similar to the multiple scale predictors. Our study thus highlights the importance of considering landscape metrics to generate more accurate habitat suitability models.展开更多
This paper compares the predictions by two radiative transfer models-the two-stream approximation model and the generalized layered model (developed by the authors) in land surface processes -for different canopies ...This paper compares the predictions by two radiative transfer models-the two-stream approximation model and the generalized layered model (developed by the authors) in land surface processes -for different canopies under direct or diffuse radiation conditions. The comparison indicates that there are significant differences between the two models, especially in the near infrared (NIR) band. Results of canopy reflectance from the two-stream model are larger than those from the generalized model. However, results of canopy absorptance from the two-stream model are larger in some cases and smaller in others compared to those from the generalized model, depending on the cases involved. In the visible (VIS) band, canopy reflectance is smaller and canopy absorptance larger from the two-stream model compared to the generalized model when the Leaf Area Index (LAI) is low and soil reflectance is high. In cases of canopies with vertical leaf angles, the differences of reflectance and absorptance in the VIS and NIR bands between the two models are especially large. Two commonly occurring cases, with which the two-stream model cannot deal accurately, are also investigated. One is for a canopy with different adaxial and abaxial leaf optical properties; and the other is for incident sky diffuse radiation with a non-uniform distribution. Comparison of the generalized model within the same canopy for both uniform and non-uniform incident diffuse radiation inputs shows smaller differences in general. However, there is a measurable difference between these radiation inputs for a canopy with high leaf angle. This indicates that the application of the two-stream model to a canopy with different adaxial and abaxial leaf optical properties will introduce non-negligible errors.展开更多
The catch and effort data of Sillago sihama fishery in Pakistani waters were used to investigate the performance of two closely related stock assessment models: logistic and generalized surplus-production models. Comp...The catch and effort data of Sillago sihama fishery in Pakistani waters were used to investigate the performance of two closely related stock assessment models: logistic and generalized surplus-production models. Compared with the generalized production model, the logistic model produced more reasonable estimates for parameters such as maximum sustainable yield. The Akaike's Information Criterion values estimated at 4.265 and -51.152 respectively by the logistic and generalized models. Simulation analyses of the S. sihama fishery showed that the estimated and observed abundance indices for the logistic model were closer than those for the generalized production model. Standardized residuals were distributed closer for logistic model, but exhibited a slightly increasing trend for the generalized model. Statistical outliers were seen in 1989 and 1993 for the logistic model, and in 1981 and 1999 for the generalized model. Simulated results revealed that the logistic estimates were close to the true value for low CVs (coefficients of variation) but widely dispersed for high CVs. In contrast, the generalized model estimates were loose for all CV levels. The estimated production model curve parameter was not reasonable at all the tested levels of white noise. With the increase in white noise R2 for the catch per unit effort decreased. Therefore, we conclude that the logistic model performs more reasonably than the generalized production model.展开更多
Fault monitoring of bioprocess is important to ensure safety of a reactor and maintain high quality of products. It is difficult to build an accurate mechanistic model for a bioprocess, so fault monitoring based on ri...Fault monitoring of bioprocess is important to ensure safety of a reactor and maintain high quality of products. It is difficult to build an accurate mechanistic model for a bioprocess, so fault monitoring based on rich historical or online database is an effective way. A group of data based on bootstrap method could be resampling stochastically, improving generalization capability of model. In this paper, online fault monitoring of generalized additive models (GAMs) combining with bootstrap is proposed for glutamate fermentation process. GAMs and bootstrap are first used to decide confidence interval based on the online and off-line normal sampled data from glutamate fermentation experiments. Then GAMs are used to online fault monitoring for time, dissolved oxygen, oxygen uptake rate, and carbon dioxide evolution rate. The method can provide accurate fault alarm online and is helpful to provide useful information for removing fault and abnormal phenomena in the fermentation.展开更多
It is necessary to test for varying dispersion in generalized nonlinear models.Wei,et al(1998) developed a likelihood ratio test,a score test and their adjustments to test for varying dispersion in continuous exponent...It is necessary to test for varying dispersion in generalized nonlinear models.Wei,et al(1998) developed a likelihood ratio test,a score test and their adjustments to test for varying dispersion in continuous exponential family nonlinear models.This type of problem in the framework of general discrete exponential family nonlinear models is discussed.Two types of varying dispersion,which are random coefficients model and random effects model,are proposed,and corresponding score test statistics are constructed and expressed in simple,easy to use,matrix formulas.展开更多
For the Z-R relationship in radar-based rainfall estimation, the distribution of corresponding R values for a given Z value (or the corresponding Z value for a given R value) may be highly skewed. However, the traditi...For the Z-R relationship in radar-based rainfall estimation, the distribution of corresponding R values for a given Z value (or the corresponding Z value for a given R value) may be highly skewed. However, the traditional power-law model is physically deduced and fitted under the normal-distribution presumption of radar wave echoes associated with a rain rate value, and it may not be very appropriate. Considering this problem, the authors devised several generalized linear models with different forms and distribution presumptions to represent the Z-R relationship. Radar-reflectivity scans observed by a CINRAD/SC Doppler radar and 5-minute rainfall accumulation recorded by 10 ground gauges were used to fit these models. All data used in this study were collected during some large rainfalls of the period from 2005 to 2007. The radar and all gauges were installed in the catchment of the Yishu River, a branch of the Huaihe River in China. Three models based on normal distribution and a dBZ presumption of gamma distribution were fitted using maximum-likelihood techniques, which were resolved by genetic algorithms. Comparisons of estimated maximized likelihoods based on assumptions of gamma and normal distribution showed that all generalized linear models (GLMs) of presumed gamma distribution were better fitted than GLMs based on normal distribution. In a comparison of maximum-likelihood, the differences between these three models were small. Three error statistics were used to assess the agreement between radar estimated rainfall and gauge rainfall: relative bias (B), root mean square error (RMSE), and correlation coefficient (r). The results showed that no one model was excellent in all criteria. On the whole, the GLM-based models gave smaller relative bias than the traditional power-law model. It is suggested that validations conducted in many previous works should have been made against a specific criterion but overlooked others.展开更多
There are typical ecosystems of littoral wetlands in the Yellow River Delta.In order to study the relationships between Tamarix chinensis and environmental variables and to predict T.chinensis potential distribution i...There are typical ecosystems of littoral wetlands in the Yellow River Delta.In order to study the relationships between Tamarix chinensis and environmental variables and to predict T.chinensis potential distribution in the Yellow River Delta,641 vegetation samples and 964 soil samples were collected in the area in October of 2004,2005,2006 and 2007.The contents of soil organic matter,total phosphorus,salt,and soluble potassium were determined.Then,the analyzed data were interpolated into spatial raster data by Kriging interpolation method.Meanwhile,the digital elevation model,soil type map and landform unit map of the Yellow River Delta were also collected.Generalized Additive Models(GAMs) were employed to build species-environment model and then simulate the potential distribution of T.chinensis.The results indicated that the distribution of T.chinensis was mainly limited by soil salt content,total soil phosphorus content,soluble potassium content,soil type,landform unit,and elevation.The distribution probability of T.chinensis was produced with a lookup table generated by Grasp Module(based on GAMs) in software ArcView GIS 3.2.The AUC(Area Under Curve) value of validation and cross-validation of ROC(Receive Operating Characteristic) were both higher than 0.8,which suggested that the established model had a high precision for predicting species distribution.展开更多
We study the quasi likelihood equation in Generalized Linear Models(GLM) with adaptive design ∑(i=1)^n xi(yi-h(x'iβ))=0, where yi is a q=vector, and xi is a p×q random matrix. Under some assumptions, i...We study the quasi likelihood equation in Generalized Linear Models(GLM) with adaptive design ∑(i=1)^n xi(yi-h(x'iβ))=0, where yi is a q=vector, and xi is a p×q random matrix. Under some assumptions, it is shown that the Quasi- Likelihood equation for the GLM has a solution which is asymptotic normal.展开更多
文摘The inflection point is an important feature of sigmoidal height-diameter(H-D)models.It is often cited as one of the properties favoring sigmoidal model forms.However,there are very few studies analyzing the inflection points of H-D models.The goals of this study were to theoretically and empirically examine the behaviors of inflection points of six common H-D models with a regional dataset.The six models were the Wykoff(WYK),Schumacher(SCH),Curtis(CUR),HossfeldⅣ(HOS),von Bertalanffy-Richards(VBR),and Gompertz(GPZ)models.The models were first fitted in their base forms with tree species as random effects and were then expanded to include functional traits and spatial distribution.The distributions of the estimated inflection points were similar between the two-parameter models WYK,SCH,and CUR,but were different between the threeparameter models HOS,VBR,and GPZ.GPZ produced some of the largest inflection points.HOS and VBR produced concave H-D curves without inflection points for 12.7%and 39.7%of the tree species.Evergreen species or decreasing shade tolerance resulted in larger inflection points.The trends in the estimated inflection points of HOS and VBR were entirely opposite across the landscape.Furthermore,HOS could produce concave H-D curves for portions of the landscape.Based on the studied behaviors,the choice between two-parameter models may not matter.We recommend comparing seve ral three-parameter model forms for consistency in estimated inflection points before deciding on one.Believing sigmoidal models to have inflection points does not necessarily mean that they will produce fitted curves with one.Our study highlights the need to integrate analysis of inflection points into modeling H-D relationships.
基金This research was funded by the National Natural Science Foundation of China(No.62272124)the National Key Research and Development Program of China(No.2022YFB2701401)+3 种基金Guizhou Province Science and Technology Plan Project(Grant Nos.Qiankehe Paltform Talent[2020]5017)The Research Project of Guizhou University for Talent Introduction(No.[2020]61)the Cultivation Project of Guizhou University(No.[2019]56)the Open Fund of Key Laboratory of Advanced Manufacturing Technology,Ministry of Education(GZUAMT2021KF[01]).
文摘In the assessment of car insurance claims,the claim rate for car insurance presents a highly skewed probability distribution,which is typically modeled using Tweedie distribution.The traditional approach to obtaining the Tweedie regression model involves training on a centralized dataset,when the data is provided by multiple parties,training a privacy-preserving Tweedie regression model without exchanging raw data becomes a challenge.To address this issue,this study introduces a novel vertical federated learning-based Tweedie regression algorithm for multi-party auto insurance rate setting in data silos.The algorithm can keep sensitive data locally and uses privacy-preserving techniques to achieve intersection operations between the two parties holding the data.After determining which entities are shared,the participants train the model locally using the shared entity data to obtain the local generalized linear model intermediate parameters.The homomorphic encryption algorithms are introduced to interact with and update the model intermediate parameters to collaboratively complete the joint training of the car insurance rate-setting model.Performance tests on two publicly available datasets show that the proposed federated Tweedie regression algorithm can effectively generate Tweedie regression models that leverage the value of data fromboth partieswithout exchanging data.The assessment results of the scheme approach those of the Tweedie regressionmodel learned fromcentralized data,and outperformthe Tweedie regressionmodel learned independently by a single party.
基金Subjects funded by the National Natural Science Foundation of China(Nos.62275216 and 61775181)the Natural Science Basic Research Programme of Shaanxi Province-Major Basic Research Special Project(Nos.S2018-ZC-TD-0061 and TZ0393)the Special Project for the Development of National Key Scientific Instruments and Equipment No.(51927804).
文摘Deep learning is capable of greatly promoting the progress of super-resolution imaging technology in terms of imaging and reconstruction speed,imaging resolution,and imagingflux.This paper proposes a deep neural network based on a generative adversarial network(GAN).The generator employs a U-Net-based network,which integrates Dense Net for the downsampling component.The proposed method has excellent properties,for example,the network model is trained with several different datasets of biological structures;the trained model can improve the imaging resolution of different microscopy imaging modalities such as confocal imaging and wide-field imaging;and the model demonstrates a generalized ability to improve the resolution of different biological structures even out of the datasets.In addition,experimental results showed that the method improved the resolution of caveolin-coated pits(CCPs)structures from 264 nm to 138 nm,a 1.91-fold increase,and nearly doubled the resolution of DNA molecules imaged while being transported through microfluidic channels.
文摘Much of the world's biodiversity lies in heterogeneous mountain areas with their diverse environments.As an example,Iranian montane ranges are highly diverse,particularly in the Irano-Turanian phytogeographical region.Understanding plant diversity patterns with increasing elevation is of high significance,not least for conservation planning.We studied the pattern of species richness,Shannon diversity,endemic richness,endemics ratio,and richness of life forms along a 3900 m elevational transect in Mount Palvar,overlooking the Lut Desert in Southeast Iran.We also analyzed the effect of environmental variables on species turnover along the vertical gradient.A total of 120 vegetation plots(10 m×10 m)were sampled along the elevational transect containing species and environmental data.To discover plant diversity pattern along the elevational gradient,generalized additive model(GAM)was used.Non-metric multidimensional scaling(NMDS)was applied for illustrating the correlation between species composition and environmental variables.We found hump-shaped pattern for species richness,Shannon diversity,endemic richness,and species richness of different life forms,but a monotonic increasing pattern for ratio of endemic species from low to high elevations.Our study confirms the humped pattern of species richness peaking at intermediate elevations along a complete elevational gradient in a semi-arid mountain.The monotonic increase of endemics ratio with elevation in our area as a case study is consistent with global increase of endemism with elevation.According to our results,temperature and precipitation are two important climatic variables that drive elevational plant diversity,particularly in seasonally dry areas.Our study suggests that effective conservation and management are needed for this low latitude mountain area along with calling for long-term monitoring for species redistribution.
基金funded by National Science Centre,Poland under the project"Assessment of the impact of weather conditions on forest health status and forest disturbances at regional and national scale based on the integration of ground and space-based remote sensing datasets"(project no.2021/41/B/ST10/)Data collection and research was also supported by the project no.EZ.271.3.19.2021"Modele ryzyka zamierania drzewostanow glownych gatunkow lasotworczych Polski"funded by the General Directorate of State Forests in Poland。
文摘Over the past decade,the presence of mistletoe(Viscum album ssp.austriacum)in Scots pine stands has increased in many European countries.Understanding the factors that influence the occurrence of mistletoe in stands is key to making appropriate forest management decisions to limit damage and prevent the spread of mistletoe in the future.Therefore,the main objective of this study was to determine the probability of mistletoe occurrence in Scots pine stands in relation to stand-related endogenous factors such as age,top height,and stand density,as well as topographic and edaphic factors.We used unmanned aerial vehicle(UAV)imagery from 2,247 stands to detect mistletoe in Scots pine stands,while majority stand and site characteristics were calculated from airborne laser scanning(ALS)data.Information on stand age and site type from the State Forest database were also used.We found that mistletoe infestation in Scots pine stands is influenced by stand and site characteristics.We documented that the densest,tallest,and oldest stands were more susceptible to mistletoe infestation.Site type and specific microsite conditions associated with topography were also important factors driving mistletoe occurrence.In addition,climatic water balance was a significant factor in increasing the probability of mistletoe occurrence,which is important in the context of predicted temperature increases associated with climate change.Our results are important for better understanding patterns of mistletoe infestation and ecosystem functioning under climate change.In an era of climate change and technological development,the use of remote sensing methods to determine the risk of mistletoe infestation can be a very useful tool for managing forest ecosystems to maintain forest sustainability and prevent forest disturbance.
基金The National Forestry Commission of Mexico and The Mexican National Council for Science and Technology(CONAFOR-CONACYT-115900)。
文摘Multispecies forests have received increased scientific attention,driven by the hypothesis that biodiversity improves ecological resilience.However,a greater species diversity presents challenges for forest management and research.Our study aims to develop basal area growth models for tree species cohorts.The analysis is based on a dataset of 423 permanent plots(2,500 m^(2))located in temperate forests in Durango,Mexico.First,we define tree species cohorts based on individual and neighborhood-based variables using a combination of principal component and cluster analyses.Then,we estimate the basal area increment of each cohort through the generalized additive model to describe the effect of tree size,competition,stand density and site quality.The principal component and cluster analyses assign a total of 37 tree species to eight cohorts that differed primarily with regard to the distribution of tree size and vertical position within the community.The generalized additive models provide satisfactory estimates of tree growth for the species cohorts,explaining between 19 and 53 percent of the total variation of basal area increment,and highlight the following results:i)most cohorts show a"rise-and-fall"effect of tree size on tree growth;ii)surprisingly,the competition index"basal area of larger trees"had showed a positive effect in four of the eight cohorts;iii)stand density had a negative effect on basal area increment,though the effect was minor in medium-and high-density stands,and iv)basal area growth was positively correlated with site quality except for an oak cohort.The developed species cohorts and growth models provide insight into their particular ecological features and growth patterns that may support the development of sustainable management strategies for temperate multispecies forests.
基金This research was funded by the National Natural Science Foundation of China(grant no.32271881).
文摘Forest fires are natural disasters that can occur suddenly and can be very damaging,burning thousands of square kilometers.Prevention is better than suppression and prediction models of forest fire occurrence have developed from the logistic regression model,the geographical weighted logistic regression model,the Lasso regression model,the random forest model,and the support vector machine model based on historical forest fire data from 2000 to 2019 in Jilin Province.The models,along with a distribution map are presented in this paper to provide a theoretical basis for forest fire management in this area.Existing studies show that the prediction accuracies of the two machine learning models are higher than those of the three generalized linear regression models.The accuracies of the random forest model,the support vector machine model,geographical weighted logistic regression model,the Lasso regression model,and logistic model were 88.7%,87.7%,86.0%,85.0%and 84.6%,respectively.Weather is the main factor affecting forest fires,while the impacts of topography factors,human and social-economic factors on fire occurrence were similar.
文摘Climate change and increasing anthropogenic activities,such as over-exploitation of groundwater,are exerting unavoidable stress on groundwater resources.This study investigated the spatio-temporal variation of depth to groundwater level(DGWL)and the impacts of climatic(precipitation,maximum temperature,and minimum temperature)and anthropogenic(gross district product(GDP),population,and net irrigated area(NIA))variables on DGWL during 1994-2020.The study considered DGWL in 113 observation wells and piezometers located in arid western plains(Barmer and Jodhpur districts)and semi-arid eastern plains(Jaipur,Ajmer,Dausa,and Tonk districts)of Rajasthan State,India.Statistical methods were employed to examine the annual and seasonal patterns of DGWL,and the generalized additive model(GAM)was used to determine the impacts of climatic and anthropogenic variables on DGWL.During 1994-2020,except for Barmer District,where the mean annual DGWL was almost constant(around 26.50 m),all other districts exhibited increase in DGWL,with Ajmer District experiencing the most increase.The results also revealed that 36 observation wells and piezometers showed a statistically significant annual increasing trend in DGWL and 34 observation wells and piezometers exhibited a statistically significant decreasing trend in DGWL.Similarly,32 observation wells and piezometers showed an statistically significant increasing trend and 37 observation wells and piezometers showed a statistically significant decreasing trend in winter;33 observation wells and piezometers indicated a statistically significant increasing trend and 34 had a statistically significant decreasing trend in post-monsoon;35 observation wells and piezometers exhibited a statistically significant increasing trend and 32 observation wells and piezometers showed a statistically significant decreasing trend in pre-monsoon;and 36 observation wells and piezometers reflected a statistically significant increasing trend and 30 observation wells and piezometers reflected a statistically significant decreasing trend in monsoon.Interestingly,most of the observation wells and piezometers with increasing trends of DGWL were located in Dausa and Jaipur districts.Furthermore,the GAM analysis revealed that climatic variables,such as precipitation,significantly affected DGWL in Barmer District,and DGWL in all other districts was influenced by anthropogenic variables,including GDP,NIA,and population.As a result,stringent regulations should be implemented to curb excessive groundwater extraction,manage agricultural water demand,initiate proactive aquifer recharge programs,and strengthen sustainable management in these water-scarce regions.
文摘The limited amount of data in the healthcare domain and the necessity of training samples for increased performance of deep learning models is a recurrent challenge,especially in medical imaging.Newborn Solutions aims to enhance its non-invasive white blood cell counting device,Neosonics,by creating synthetic in vitro ultrasound images to facilitate a more efficient image generation process.This study addresses the data scarcity issue by designing and evaluating a continuous scalar conditional Generative Adversarial Network(GAN)to augment in vitro peritoneal dialysis ultrasound images,increasing both the volume and variability of training samples.The developed GAN architecture incorporates novel design features:varying kernel sizes in the generator’s transposed convolutional layers and a latent intermediate space,projecting noise and condition values for enhanced image resolution and specificity.The experimental results show that the GAN successfully generated diverse images of high visual quality,closely resembling real ultrasound samples.While visual results were promising,the use of GAN-based data augmentation did not consistently improve the performance of an image regressor in distinguishing features specific to varied white blood cell concentrations.Ultimately,while this continuous scalar conditional GAN model made strides in generating realistic images,further work is needed to achieve consistent gains in regression tasks,aiming for robust model generalization.
基金The National Natural Science Foundation of China(No.11171065)the Natural Science Foundation of Jiangsu Province(No.BK2011058)
文摘In order to detect whether the data conforms to the given model, it is necessary to diagnose the data in the statistical way. The diagnostic problem in generalized nonlinear models based on the maximum Lq-likelihood estimation is considered. Three diagnostic statistics are used to detect whether the outliers exist in the data set. Simulation results show that when the sample size is small, the values of diagnostic statistics based on the maximum Lq-likelihood estimation are greater than the values based on the maximum likelihood estimation. As the sample size increases, the difference between the values of the diagnostic statistics based on two estimation methods diminishes gradually. It means that the outliers can be distinguished easier through the maximum Lq-likelihood method than those through the maximum likelihood estimation method.
基金Under the auspices of National Natural Science Foundation of China(No.41001363)
文摘This study aims to provide a predictive vegetation mapping approach based on the spectral data, DEM and Generalized Additive Models (GAMs). GAMs were used as a prediction tool to describe the relationship between vegetation and environmental variables, as well as spectral variables. Based on the fitted GAMs model, probability map of species occurrence was generated and then vegetation type of each grid was defined according to the probability of species occurrence. Deviance analysis was employed to test the goodness of curve fitting and drop contribution calculation was used to evaluate the contribution of each predictor in the fitted GAMs models. Area under curve (AUC) of Receiver Operating Characteristic (ROC) curve was employed to assess the results maps of probability. The results showed that: 1) AUC values of the fitted GAMs models are very high which proves that integrating spectral data and environmental variables based on the GAMs is a feasible way to map the vegetation. 2) Prediction accuracy varies with plant community, and community with dense cover is better predicted than sparse plant community. 3) Both spectral variables and environmental variables play an important role in mapping the vegetation. However, the contribution of the same predictor in the GAMs models for different plant communities is different. 4) Insufficient resolution of spectral data, environmental data and confounding effects of land use and other variables which are not closely related to the environmental conditions are the major causes of imprecision.
文摘Photonic inverse design concerns the problem of finding photonic structures with target optical properties.However,traditional methods based on optimization algorithms are time-consuming and computationally expensive.Recently,deep learning-based approaches have been developed to tackle the problem of inverse design efficiently.Although most of these neural network models have demonstrated high accuracy in different inverse design problems,no previous study has examined the potential effects under given constraints in nanomanufacturing.Additionally,the relative strength of different deep learning-based inverse design approaches has not been fully investigated.Here,we benchmark three commonly used deep learning models in inverse design:Tandem networks,Variational Auto-Encoders,and Generative Adversarial Networks.We provide detailed comparisons in terms of their accuracy,diversity,and robustness.We find that tandem networks and Variational Auto-Encoders give the best accuracy,while Generative Adversarial Networks lead to the most diverse predictions.Our findings could serve as a guideline for researchers to select the model that can best suit their design criteria and fabrication considerations.In addition,our code and data are publicly available,which could be used for future inverse design model development and benchmarking.
基金supported by the Biota Minas Program(Proc.No.APQ 03549-09)FAPEMIG(Proc.No.PCE-00106-12)
文摘Species distribution models are used to aid our understanding of the processes driving the spatial patterns of species’ habitats. This approach has received criticism, however, largely because it neglects landscape metrics. We examined the relative impacts of landscape predictors on the accuracy of habitat models by constructing distribution models at regional scales incorporating environmental variables (climate, topography, vegetation, and soil types) and secondary species occurrence data, and using them to predict the occurrence of 36 species in 15 forest fragments where we conducted rapid surveys. We then selected six landscape predictors at the landscape scale and ran general linear models of species presence/absence with either a single scale predictor (the probabilities of occurrence of the distribution models or landscape variables) or multiple scale predictors (distribution models + one landscape variable). Our results indicated that distribution models alone had poor predictive abilities but were improved when landscape predictors were added; the species responses were not, however, similar to the multiple scale predictors. Our study thus highlights the importance of considering landscape metrics to generate more accurate habitat suitability models.
基金supported by the National Natural Science Foundation of China under Grant Nos.40233034,40605024,40575043,and 40305011.
文摘This paper compares the predictions by two radiative transfer models-the two-stream approximation model and the generalized layered model (developed by the authors) in land surface processes -for different canopies under direct or diffuse radiation conditions. The comparison indicates that there are significant differences between the two models, especially in the near infrared (NIR) band. Results of canopy reflectance from the two-stream model are larger than those from the generalized model. However, results of canopy absorptance from the two-stream model are larger in some cases and smaller in others compared to those from the generalized model, depending on the cases involved. In the visible (VIS) band, canopy reflectance is smaller and canopy absorptance larger from the two-stream model compared to the generalized model when the Leaf Area Index (LAI) is low and soil reflectance is high. In cases of canopies with vertical leaf angles, the differences of reflectance and absorptance in the VIS and NIR bands between the two models are especially large. Two commonly occurring cases, with which the two-stream model cannot deal accurately, are also investigated. One is for a canopy with different adaxial and abaxial leaf optical properties; and the other is for incident sky diffuse radiation with a non-uniform distribution. Comparison of the generalized model within the same canopy for both uniform and non-uniform incident diffuse radiation inputs shows smaller differences in general. However, there is a measurable difference between these radiation inputs for a canopy with high leaf angle. This indicates that the application of the two-stream model to a canopy with different adaxial and abaxial leaf optical properties will introduce non-negligible errors.
基金supported by the special research fund of the Ocean University of China (No.201022001)
文摘The catch and effort data of Sillago sihama fishery in Pakistani waters were used to investigate the performance of two closely related stock assessment models: logistic and generalized surplus-production models. Compared with the generalized production model, the logistic model produced more reasonable estimates for parameters such as maximum sustainable yield. The Akaike's Information Criterion values estimated at 4.265 and -51.152 respectively by the logistic and generalized models. Simulation analyses of the S. sihama fishery showed that the estimated and observed abundance indices for the logistic model were closer than those for the generalized production model. Standardized residuals were distributed closer for logistic model, but exhibited a slightly increasing trend for the generalized model. Statistical outliers were seen in 1989 and 1993 for the logistic model, and in 1981 and 1999 for the generalized model. Simulated results revealed that the logistic estimates were close to the true value for low CVs (coefficients of variation) but widely dispersed for high CVs. In contrast, the generalized model estimates were loose for all CV levels. The estimated production model curve parameter was not reasonable at all the tested levels of white noise. With the increase in white noise R2 for the catch per unit effort decreased. Therefore, we conclude that the logistic model performs more reasonably than the generalized production model.
基金Supported by the National Natural Science Foundation of China (61273131) 111 Project (B12018)+1 种基金 the Innovation Project of Graduate in Jiangsu Province (CXZZ12_0741) the Fundamental Research Funds for the Central Universities (JUDCF12034)
文摘Fault monitoring of bioprocess is important to ensure safety of a reactor and maintain high quality of products. It is difficult to build an accurate mechanistic model for a bioprocess, so fault monitoring based on rich historical or online database is an effective way. A group of data based on bootstrap method could be resampling stochastically, improving generalization capability of model. In this paper, online fault monitoring of generalized additive models (GAMs) combining with bootstrap is proposed for glutamate fermentation process. GAMs and bootstrap are first used to decide confidence interval based on the online and off-line normal sampled data from glutamate fermentation experiments. Then GAMs are used to online fault monitoring for time, dissolved oxygen, oxygen uptake rate, and carbon dioxide evolution rate. The method can provide accurate fault alarm online and is helpful to provide useful information for removing fault and abnormal phenomena in the fermentation.
基金Supported by the National Natural Science Foundations of China( 1 9631 0 4 0 ) and SSFC( o2 BTJ0 0 1 ) .
文摘It is necessary to test for varying dispersion in generalized nonlinear models.Wei,et al(1998) developed a likelihood ratio test,a score test and their adjustments to test for varying dispersion in continuous exponential family nonlinear models.This type of problem in the framework of general discrete exponential family nonlinear models is discussed.Two types of varying dispersion,which are random coefficients model and random effects model,are proposed,and corresponding score test statistics are constructed and expressed in simple,easy to use,matrix formulas.
基金financially supported by the National Natural Science Foundation of China (Grant No. 40971024)the National Basic Research Program of China (Grant No. 2006CB400502)the Special Meteorology Project (GYHY(QX)2007-6-1)
文摘For the Z-R relationship in radar-based rainfall estimation, the distribution of corresponding R values for a given Z value (or the corresponding Z value for a given R value) may be highly skewed. However, the traditional power-law model is physically deduced and fitted under the normal-distribution presumption of radar wave echoes associated with a rain rate value, and it may not be very appropriate. Considering this problem, the authors devised several generalized linear models with different forms and distribution presumptions to represent the Z-R relationship. Radar-reflectivity scans observed by a CINRAD/SC Doppler radar and 5-minute rainfall accumulation recorded by 10 ground gauges were used to fit these models. All data used in this study were collected during some large rainfalls of the period from 2005 to 2007. The radar and all gauges were installed in the catchment of the Yishu River, a branch of the Huaihe River in China. Three models based on normal distribution and a dBZ presumption of gamma distribution were fitted using maximum-likelihood techniques, which were resolved by genetic algorithms. Comparisons of estimated maximized likelihoods based on assumptions of gamma and normal distribution showed that all generalized linear models (GLMs) of presumed gamma distribution were better fitted than GLMs based on normal distribution. In a comparison of maximum-likelihood, the differences between these three models were small. Three error statistics were used to assess the agreement between radar estimated rainfall and gauge rainfall: relative bias (B), root mean square error (RMSE), and correlation coefficient (r). The results showed that no one model was excellent in all criteria. On the whole, the GLM-based models gave smaller relative bias than the traditional power-law model. It is suggested that validations conducted in many previous works should have been made against a specific criterion but overlooked others.
基金Under the auspices of the Project of National Natural Science Foundation of China ( No. 41001363)Autonomous Project of State Key Laboratory of Resources and Environmental Information System,Geo-information Tupu Theory and Virtual Geoscience
文摘There are typical ecosystems of littoral wetlands in the Yellow River Delta.In order to study the relationships between Tamarix chinensis and environmental variables and to predict T.chinensis potential distribution in the Yellow River Delta,641 vegetation samples and 964 soil samples were collected in the area in October of 2004,2005,2006 and 2007.The contents of soil organic matter,total phosphorus,salt,and soluble potassium were determined.Then,the analyzed data were interpolated into spatial raster data by Kriging interpolation method.Meanwhile,the digital elevation model,soil type map and landform unit map of the Yellow River Delta were also collected.Generalized Additive Models(GAMs) were employed to build species-environment model and then simulate the potential distribution of T.chinensis.The results indicated that the distribution of T.chinensis was mainly limited by soil salt content,total soil phosphorus content,soluble potassium content,soil type,landform unit,and elevation.The distribution probability of T.chinensis was produced with a lookup table generated by Grasp Module(based on GAMs) in software ArcView GIS 3.2.The AUC(Area Under Curve) value of validation and cross-validation of ROC(Receive Operating Characteristic) were both higher than 0.8,which suggested that the established model had a high precision for predicting species distribution.
文摘We study the quasi likelihood equation in Generalized Linear Models(GLM) with adaptive design ∑(i=1)^n xi(yi-h(x'iβ))=0, where yi is a q=vector, and xi is a p×q random matrix. Under some assumptions, it is shown that the Quasi- Likelihood equation for the GLM has a solution which is asymptotic normal.