In this article,we highlight a new three-parameter heavy-tailed lifetime distribution that aims to extend the modeling possibilities of the Lomax distribution.It is called the extended Lomax distribution.The considere...In this article,we highlight a new three-parameter heavy-tailed lifetime distribution that aims to extend the modeling possibilities of the Lomax distribution.It is called the extended Lomax distribution.The considered distribution naturally appears as the distribution of a transformation of a random variable following the logweighted power distribution recently introduced for percentage or proportion data analysis purposes.As a result,its cumulative distribution has the same functional basis as that of the Lomax distribution,but with a novel special logarithmic term depending on several parameters.The modulation of this logarithmic term reveals new types of asymetrical shapes,implying a modeling horizon beyond that of the Lomax distribution.In the first part,we examine several of its mathematical properties,such as the shapes of the related probability and hazard rate functions;stochastic comparisons;manageable expansions for various moments;and quantile properties.In particular,based on the quantile functions,various actuarial measures are discussed.In the second part,the distribution’s applicability is investigated with the use of themaximumlikelihood estimationmethod.The behavior of the obtained parameter estimates is validated by a simulation work.Insurance claim data are analyzed.We show that the proposed distribution outperforms eight well-known distributions,including the Lomax distribution and several extended Lomax distributions.In addition,we demonstrate that it gives preferable inferences from these competitor distributions in terms of risk measures.展开更多
Recent studies have pointed out the potential of the odd Fréchet family(or class)of continuous distributions in fitting data of all kinds.In this article,we propose an extension of this family through the so-cal...Recent studies have pointed out the potential of the odd Fréchet family(or class)of continuous distributions in fitting data of all kinds.In this article,we propose an extension of this family through the so-called“Topp-Leone strategy”,aiming to improve its overall flexibility by adding a shape parameter.The main objective is to offer original distributions with modifiable properties,from which adaptive and pliant statistical models can be derived.For the new family,these aspects are illustrated by the means of comprehensive mathematical and numerical results.In particular,we emphasize a special distribution with three parameters based on the exponential distribution.The related model is shown to be skillful to the fitting of various lifetime data,more or less heterogeneous.Among all the possible applications,we consider two data sets of current interest,linked to the COVID-19 pandemic.They concern daily cases confirmed and recovered in Pakistan from March 24 to April 28,2020.As a result of our analyzes,the proposed model has the best fitting results in comparison to serious challengers,including the former odd Fréchet model.展开更多
This study proposes an approach based on machine learning to forecast currency exchange rates by applying sentiment analysis to messages on Twitter(called tweets).A dataset of the exchange rates between the United Sta...This study proposes an approach based on machine learning to forecast currency exchange rates by applying sentiment analysis to messages on Twitter(called tweets).A dataset of the exchange rates between the United States Dollar(USD)and the Pakistani Rupee(PKR)was formed by collecting information from a forex website as well as a collection of tweets from the business community in Pakistan containing finance-related words.The dataset was collected in raw form,and was subjected to natural language processing by way of data preprocessing.Response variable labeling was then applied to the standardized dataset,where the response variables were divided into two classes:“1”indicated an increase in the exchange rate and“−1”indicated a decrease in it.To better represent the dataset,we used linear discriminant analysis and principal component analysis to visualize the data in three-dimensional vector space.Clusters that were obtained using a sampling approach were then used for data optimization.Five machine learning classifiers—the simple logistic classifier,the random forest,bagging,naïve Bayes,and the support vector machine—were applied to the optimized dataset.The results show that the simple logistic classifier yielded the highest accuracy of 82.14%for the USD and the PKR exchange rates forecasting.展开更多
Accelerated life testing has been widely used in product life testing experiments because it can quickly provide information on the lifetime distributions by testing products or materials at higher than basic conditio...Accelerated life testing has been widely used in product life testing experiments because it can quickly provide information on the lifetime distributions by testing products or materials at higher than basic conditional levels of stress,such as pressure,temperature,vibration,voltage,or load to induce early failures.In this paper,a step stress partially accelerated life test(SSPALT)is regarded under the progressive type-II censored data with random removals.The removals from the test are considered to have the binomial distribution.The life times of the testing items are assumed to follow lengthbiased weighted Lomax distribution.The maximum likelihood method is used for estimating the model parameters of length-biased weighted Lomax.The asymptotic confidence interval estimates of the model parameters are evaluated using the Fisher information matrix.The Bayesian estimators cannot be obtained in the explicit form,so the Markov chain Monte Carlo method is employed to address this problem,which ensures both obtaining the Bayesian estimates as well as constructing the credible interval of the involved parameters.The precision of the Bayesian estimates and the maximum likelihood estimates are compared by simulations.In addition,to compare the performance of the considered confidence intervals for different parameter values and sample sizes.The Bootstrap confidence intervals give more accurate results than the approximate confidence intervals since the lengths of the former are less than the lengths of latter,for different sample sizes,observed failures,and censoring schemes,in most cases.Also,the percentile Bootstrap confidence intervals give more accurate results than Bootstrap-t since the lengths of the former are less than the lengths of latter for different sample sizes,observed failures,and censoring schemes,in most cases.Further performance comparison is conducted by the experiments with real data.展开更多
Understanding a phenomenon from observed data requires contextual and efficient statistical models.Such models are based on probability distributions having sufficiently flexible statistical properties to adapt to a m...Understanding a phenomenon from observed data requires contextual and efficient statistical models.Such models are based on probability distributions having sufficiently flexible statistical properties to adapt to a maximum of situations.Modern examples include the distributions of the truncated Fréchet generated family.In this paper,we go even further by introducing a more general family,based on a truncated version of the generalized Fréchet distribution.This generalization involves a new shape parameter modulating to the extreme some central and dispersion parameters,as well as the skewness and weight of the tails.We also investigate the main functions of the new family,stress-strength parameter,diverse functional series expansions,incomplete moments,various entropy measures,theoretical and practical parameters estimation,bivariate extensions through the use of copulas,and the estimation of the model parameters.By considering a special member of the family having the Weibull distribution as the parent,we fit two data sets of interest,one about waiting times and the other about precipitation.Solid statistical criteria attest that the proposed model is superior over other extended Weibull models,including the one derived to the former truncated Fréchet generated family.展开更多
The purpose of this research is the segmentation of lungs computed tomography(CT)scan for the diagnosis of COVID-19 by using machine learning methods.Our dataset contains data from patients who are prone to the epidem...The purpose of this research is the segmentation of lungs computed tomography(CT)scan for the diagnosis of COVID-19 by using machine learning methods.Our dataset contains data from patients who are prone to the epidemic.It contains three types of lungs CT images(Normal,Pneumonia,and COVID-19)collected from two different sources;the first one is the Radiology Department of Nishtar Hospital Multan and Civil Hospital Bahawalpur,Pakistan,and the second one is a publicly free available medical imaging database known as Radiopaedia.For the preprocessing,a novel fuzzy c-mean automated region-growing segmentation approach is deployed to take an automated region of interest(ROIs)and acquire 52 hybrid statistical features for each ROIs.Also,12 optimized statistical features are selected via the chi-square feature reduction technique.For the classification,five machine learning classifiers named as deep learning J4,multilayer perceptron,support vector machine,random forest,and naive Bayes are deployed to optimize the hybrid statistical features dataset.It is observed that the deep learning J4 has promising results(sensitivity and specificity:0.987;accuracy:98.67%)among all the deployed classifiers.As a complementary study,a statistical work is devoted to the use of a new statistical model to fit the main datasets of COVID-19 collected in Pakistan.展开更多
基金funded by the Deanship Scientific Research(DSR),King Abdulaziz University,Jeddah,under the GrantNo.KEP-PhD:21-130-1443.
文摘In this article,we highlight a new three-parameter heavy-tailed lifetime distribution that aims to extend the modeling possibilities of the Lomax distribution.It is called the extended Lomax distribution.The considered distribution naturally appears as the distribution of a transformation of a random variable following the logweighted power distribution recently introduced for percentage or proportion data analysis purposes.As a result,its cumulative distribution has the same functional basis as that of the Lomax distribution,but with a novel special logarithmic term depending on several parameters.The modulation of this logarithmic term reveals new types of asymetrical shapes,implying a modeling horizon beyond that of the Lomax distribution.In the first part,we examine several of its mathematical properties,such as the shapes of the related probability and hazard rate functions;stochastic comparisons;manageable expansions for various moments;and quantile properties.In particular,based on the quantile functions,various actuarial measures are discussed.In the second part,the distribution’s applicability is investigated with the use of themaximumlikelihood estimationmethod.The behavior of the obtained parameter estimates is validated by a simulation work.Insurance claim data are analyzed.We show that the proposed distribution outperforms eight well-known distributions,including the Lomax distribution and several extended Lomax distributions.In addition,we demonstrate that it gives preferable inferences from these competitor distributions in terms of risk measures.
基金This work was funded by the Deanship of Scientific Research(DSR),King AbdulAziz University,Jeddah,under grant No.(G:550-247-1441).
文摘Recent studies have pointed out the potential of the odd Fréchet family(or class)of continuous distributions in fitting data of all kinds.In this article,we propose an extension of this family through the so-called“Topp-Leone strategy”,aiming to improve its overall flexibility by adding a shape parameter.The main objective is to offer original distributions with modifiable properties,from which adaptive and pliant statistical models can be derived.For the new family,these aspects are illustrated by the means of comprehensive mathematical and numerical results.In particular,we emphasize a special distribution with three parameters based on the exponential distribution.The related model is shown to be skillful to the fitting of various lifetime data,more or less heterogeneous.Among all the possible applications,we consider two data sets of current interest,linked to the COVID-19 pandemic.They concern daily cases confirmed and recovered in Pakistan from March 24 to April 28,2020.As a result of our analyzes,the proposed model has the best fitting results in comparison to serious challengers,including the former odd Fréchet model.
文摘This study proposes an approach based on machine learning to forecast currency exchange rates by applying sentiment analysis to messages on Twitter(called tweets).A dataset of the exchange rates between the United States Dollar(USD)and the Pakistani Rupee(PKR)was formed by collecting information from a forex website as well as a collection of tweets from the business community in Pakistan containing finance-related words.The dataset was collected in raw form,and was subjected to natural language processing by way of data preprocessing.Response variable labeling was then applied to the standardized dataset,where the response variables were divided into two classes:“1”indicated an increase in the exchange rate and“−1”indicated a decrease in it.To better represent the dataset,we used linear discriminant analysis and principal component analysis to visualize the data in three-dimensional vector space.Clusters that were obtained using a sampling approach were then used for data optimization.Five machine learning classifiers—the simple logistic classifier,the random forest,bagging,naïve Bayes,and the support vector machine—were applied to the optimized dataset.The results show that the simple logistic classifier yielded the highest accuracy of 82.14%for the USD and the PKR exchange rates forecasting.
基金This work was funded by the Deanship of Scientific Research(DSR),King Abdulaziz University,Jeddah,under Grant No.FP-190-42.
文摘Accelerated life testing has been widely used in product life testing experiments because it can quickly provide information on the lifetime distributions by testing products or materials at higher than basic conditional levels of stress,such as pressure,temperature,vibration,voltage,or load to induce early failures.In this paper,a step stress partially accelerated life test(SSPALT)is regarded under the progressive type-II censored data with random removals.The removals from the test are considered to have the binomial distribution.The life times of the testing items are assumed to follow lengthbiased weighted Lomax distribution.The maximum likelihood method is used for estimating the model parameters of length-biased weighted Lomax.The asymptotic confidence interval estimates of the model parameters are evaluated using the Fisher information matrix.The Bayesian estimators cannot be obtained in the explicit form,so the Markov chain Monte Carlo method is employed to address this problem,which ensures both obtaining the Bayesian estimates as well as constructing the credible interval of the involved parameters.The precision of the Bayesian estimates and the maximum likelihood estimates are compared by simulations.In addition,to compare the performance of the considered confidence intervals for different parameter values and sample sizes.The Bootstrap confidence intervals give more accurate results than the approximate confidence intervals since the lengths of the former are less than the lengths of latter,for different sample sizes,observed failures,and censoring schemes,in most cases.Also,the percentile Bootstrap confidence intervals give more accurate results than Bootstrap-t since the lengths of the former are less than the lengths of latter for different sample sizes,observed failures,and censoring schemes,in most cases.Further performance comparison is conducted by the experiments with real data.
基金funded by the Deanship of Scientific Research(DSR),King AbdulAziz University,Jeddah,under Grant No.G:531-305-1441.
文摘Understanding a phenomenon from observed data requires contextual and efficient statistical models.Such models are based on probability distributions having sufficiently flexible statistical properties to adapt to a maximum of situations.Modern examples include the distributions of the truncated Fréchet generated family.In this paper,we go even further by introducing a more general family,based on a truncated version of the generalized Fréchet distribution.This generalization involves a new shape parameter modulating to the extreme some central and dispersion parameters,as well as the skewness and weight of the tails.We also investigate the main functions of the new family,stress-strength parameter,diverse functional series expansions,incomplete moments,various entropy measures,theoretical and practical parameters estimation,bivariate extensions through the use of copulas,and the estimation of the model parameters.By considering a special member of the family having the Weibull distribution as the parent,we fit two data sets of interest,one about waiting times and the other about precipitation.Solid statistical criteria attest that the proposed model is superior over other extended Weibull models,including the one derived to the former truncated Fréchet generated family.
基金support provided by the Center of Excellence in Theoretical and Computational Science(TaCS-CoE),KMUTT.Moreoverthis research project is supported by Thailand Science Research and Innovation(TSRI)Basic Research Fund:Fiscal year 2021,received by Dr.Poom Kumam,under project number 64A306000005,and sponsors URL:https://www.tsri.or.th/.
文摘The purpose of this research is the segmentation of lungs computed tomography(CT)scan for the diagnosis of COVID-19 by using machine learning methods.Our dataset contains data from patients who are prone to the epidemic.It contains three types of lungs CT images(Normal,Pneumonia,and COVID-19)collected from two different sources;the first one is the Radiology Department of Nishtar Hospital Multan and Civil Hospital Bahawalpur,Pakistan,and the second one is a publicly free available medical imaging database known as Radiopaedia.For the preprocessing,a novel fuzzy c-mean automated region-growing segmentation approach is deployed to take an automated region of interest(ROIs)and acquire 52 hybrid statistical features for each ROIs.Also,12 optimized statistical features are selected via the chi-square feature reduction technique.For the classification,five machine learning classifiers named as deep learning J4,multilayer perceptron,support vector machine,random forest,and naive Bayes are deployed to optimize the hybrid statistical features dataset.It is observed that the deep learning J4 has promising results(sensitivity and specificity:0.987;accuracy:98.67%)among all the deployed classifiers.As a complementary study,a statistical work is devoted to the use of a new statistical model to fit the main datasets of COVID-19 collected in Pakistan.