期刊文献+
共找到14篇文章
< 1 >
每页显示 20 50 100
Maximum Entropy Distribution Function and Uncertainty Evaluation Criteria 被引量:5
1
作者 CHEN Bai-yu KOU Yi +3 位作者 ZHAO Daniel WU Fang WANG Li-ping LIU Gui-lin 《China Ocean Engineering》 SCIE EI CSCD 2021年第2期238-249,共12页
Marine environmental design parameter extrapolation has important applications in marine engineering and coastal disaster prevention.The distribution models used for environmental design parameter usually pass the hyp... Marine environmental design parameter extrapolation has important applications in marine engineering and coastal disaster prevention.The distribution models used for environmental design parameter usually pass the hypothesis tests in statistical analysis,but the calculation results of different distribution models often vary largely.In this paper,based on the information entropy,the overall uncertainty test criteria were studied for commonly used distributions including Gumbel,Weibull,and Pearson-III distribution.An improved method for parameter estimation of the maximum entropy distribution model is proposed on the basis of moment estimation.The study in this paper shows that the number of sample data and the degree of dispersion are proportional to the information entropy,and the overall uncertainty of the maximum entropy distribution model is minimal compared with other models. 展开更多
关键词 maximum entropy distribution model UNCERTAINTY information entropy evaluation criterion
下载PDF
Study on evaluation standard of uncertainty of design wave height calculation model 被引量:1
2
作者 Baiyu CHEN Yi KOU +2 位作者 Fang WU Liping WANG Guilin LIU 《Journal of Oceanology and Limnology》 SCIE CAS CSCD 2021年第4期1188-1197,共10页
The accurate calculation of marine environmental design parameters depends on the probability distribution model,and the calculation results of different distribution models are often different.It is very important to... The accurate calculation of marine environmental design parameters depends on the probability distribution model,and the calculation results of different distribution models are often different.It is very important to determine which distribution model is more stable and reasonable when extrapolating the recurrence level of the studied sea area.In this paper,we constructed an evaluation method of the overall uncertainty of the calculation results and a measurement of the uncertainty of the design parameters derivation model,by incorporating the influence of sample information on the model information entropy,such as sample size,degree of dispersion,and sampling error.Results show that the sample data size and the degree of dispersion are directly proportional to the information entropy.Within the same group of data,the maximum entropy distribution model has the lowest overall uncertainty,while the Gumbel distribution model has the largest overall uncertainty.In other words,the maximum entropy distribution model has good applicability in the accurate calculation of marine environmental design parameters. 展开更多
关键词 UNCERTAINTY information entropy extreme value distribution model
下载PDF
Health Analytics, Economics and Medicine toward a 21st Century Health Care System
3
作者 Anna L. Choi David A. Lai Tze L. Lai 《Health》 CAS 2016年第5期428-443,共16页
After a review of recent developments in precision medicine, population health sciences and innovative clinical trial designs, and in health economics and policy, we show how innovations in health analytics can capita... After a review of recent developments in precision medicine, population health sciences and innovative clinical trial designs, and in health economics and policy, we show how innovations in health analytics can capitalize on the advances in biomedicine and health economics towards developing a data-driven and cost-effective 21<sup>st</sup> century health care system. In particular, we propose a mutually beneficial public-private partnership that combines individual responsibility with community solidarity in building this health care system. 展开更多
关键词 ANALYTICS Big Data Comparative Effectiveness Research Health Insurance Moral Hazards Population Health Sciences
下载PDF
Assessment of typhoon storm surge disaster scale based on expansion model
4
作者 Guilin LIU Xiuxiu NONG +3 位作者 Yi KOU Fang WU Daniel ZHAO Zongbing YU 《Journal of Oceanology and Limnology》 SCIE CAS CSCD 2023年第2期518-531,共14页
The South China Sea suffers strongly from the typhoon storm surge disasters in China,and its northern coastal areas are facing severe risks.Therefore,it is necessary and urgent to establish an assessment system for ra... The South China Sea suffers strongly from the typhoon storm surge disasters in China,and its northern coastal areas are facing severe risks.Therefore,it is necessary and urgent to establish an assessment system for rating typhoon storm surge disaster.We constructed an effective and reliable rating assessment system for typhoon storm surge disaster based on the theories of over-threshold,distribution function family,and composite extreme value.The over-threshold sample was used as the basis of data analysis,the composite extreme value expansion model was used to derive the design water increment,and then the disaster level was delineated based on the return period level.The results of the extreme value model comparison show that the Weibull-Pareto distribution is more suitable than the classical extreme value distribution for fitting the over-threshold samples.The results of the return period projection are relatively stable based on different analysis samples.Taking the 10 typhoon storm surges as examples,they caused landfall in the Guangdong area in the past 10 years.The results of the assessment ranking indicate that the risk levels based on the return period levels obtained from different distributions are generally consistent.When classifying low-risk areas,the classification criteria of the State Oceanic Administration,China(SOA,2012)are more conservative.In the high-risk areas,the results of the assessment ranking based on return period are more consistent with those of the SOA. 展开更多
关键词 risk classification South China Sea typhoon storm surge extreme value expansion over threshold sampling
下载PDF
Uncertainty analysis for the calculation of marine environmental design parameters in the South China Sea
5
作者 Guilin LIU Xinsheng ZHOU +3 位作者 Yi KOU Fang WU Daniel ZHAO Yu XU 《Journal of Oceanology and Limnology》 SCIE CAS CSCD 2023年第2期427-443,共17页
The calculation results of marine environmental design parameters obtained from different data sampling methods,model distributions,and parameter estimation methods often vary greatly.To better analyze the uncertainti... The calculation results of marine environmental design parameters obtained from different data sampling methods,model distributions,and parameter estimation methods often vary greatly.To better analyze the uncertainties in the calculation of marine environmental design parameters,a general model uncertainty assessment method is necessary.We proposed a new multivariate model uncertainty assessment method for the calculation of marine environmental design parameters.The method divides the overall model uncertainty into two categories:aleatory uncertainty and epistemic uncertainty.The aleatory uncertainty of the model is obtained by analyzing the influence of the number and the dispersion degree of samples on the information entropy of the model.The epistemic uncertainty of the model is calculated using the information entropy of the model itself and the prediction error.The advantages of this method are that it does not require many-year-observation data for the marine environmental elements,and the method can be used to analyze any specific factors that cause model uncertainty.Results show that by applying the method to the South China Sea,the aleatory uncertainty of the model increases with the number of samples and then stabilizes.A positive correlation was revealed between the dispersion of the samples and the aleatory uncertainty of the model.Both the distribution of the model and the parameter estimation results of the model have significant effects on the epistemic uncertainty of the model.When the goodness-of-fit of the model is relatively close,the best model can be selected according to the criterion of the lowest overall uncertainty of the models,which can both ensure a better model fit and avoid too much uncertainty in the model calculation results.The presented multivariate model uncertainty assessment method provides a criterion to measure the advantages and disadvantages of the marine environmental design parameter calculation model from the aspect of uncertainty,which is of great significance to analyze the uncertainties in the calculation of marine environmental design parameters and improve the accuracy of the calculation results. 展开更多
关键词 South China Sea marine environmental design parameters model uncertainty information entropy Monte Carlo method
下载PDF
Self-normalized moderate deviations for independent random variables 被引量:2
6
作者 JING BingYi LIANG HanYing ZHOU Wang 《Science China Mathematics》 SCIE 2012年第11期2297-2315,共19页
Let X1,X2,...be a sequence of independent random variables(r.v.s) belonging to the domain of attraction of a normal or stable law.In this paper,we study moderate deviations for the self-normalized sum ∑ni=1 Xi/Vn,p,w... Let X1,X2,...be a sequence of independent random variables(r.v.s) belonging to the domain of attraction of a normal or stable law.In this paper,we study moderate deviations for the self-normalized sum ∑ni=1 Xi/Vn,p,where Vn,p =(∑ni=1 |Xi|p)1/p(p>1).Applications to the self-normalized law of the iterated logarithm,Studentized increments of partial sums,t-statistic,and weighted sum of independent and identically distributed(i.i.d.) r.v.s are considered. 展开更多
关键词 独立变量 随机偏差 随机变量序列 部分和增量 归一化法 加权求和 吸引域 标准化
原文传递
Discussion on “Gaining Effciency via Weight Estimators for Multivariate Failure Time Data” by Fan, Zhou and Chen
7
作者 KUK Anthony 《Science China Mathematics》 SCIE 2009年第6期1129-1130,共2页
The survival analysis literature has always lagged behind the categorical data literature in developing methods to analyze clustered or multivariate data. While estimators based on
关键词 Discussion on Gaining Effciency via Weight Estimators for Multivariate Failure Time Data Zhou and Chen by Fan
原文传递
Balanced augmented jackknife empirical likelihood for two sample U-statistics
8
作者 Conghua Cheng Yiming Liu +1 位作者 Zhi Liu Wang Zhou 《Science China Mathematics》 SCIE CSCD 2018年第6期1129-1138,共10页
In this paper, we investigate the two sample U-statistics by jackknife empirical likelihood(JEL),a versatile nonparametric approach. More precisely, we propose the method of balanced augmented jackknife empirical like... In this paper, we investigate the two sample U-statistics by jackknife empirical likelihood(JEL),a versatile nonparametric approach. More precisely, we propose the method of balanced augmented jackknife empirical likelihood(BAJEL) by adding two artificial points to the original pseudo-value dataset, and we prove that the log likelihood ratio based on the expanded dataset tends to the χ~2 distribution. 展开更多
关键词 折刀 实验 样品 平衡 统计 数据集
原文传递
KTI-RNN:Recognition of Heart Failure from Clinical Notes
9
作者 Dengao Li Huiting Ma +4 位作者 Wenjing Li Baofeng Zhao Jumin Zhao Yi Liu Jian Fu 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2023年第1期117-130,共14页
Although deep learning methods have recently attracted considerable attention in the medical field,analyzing large-scale electronic health record data is still a difficult task.In particular,the accurate recognition o... Although deep learning methods have recently attracted considerable attention in the medical field,analyzing large-scale electronic health record data is still a difficult task.In particular,the accurate recognition of heart failure is a key technology for doctors to make reasonable treatment decisions.This study uses data from the Medical Information Mart for Intensive Care database.Compared with structured data,unstructured data contain abundant patient information.However,this type of data has unsatisfactory characteristics,e.g.,many colloquial vocabularies and sparse content.To solve these problems,we propose the KTI-RNN model for unstructured data recognition.The proposed model overcomes sparse content and obtains good classification results.The term frequency-inverse word frequency(TF-IWF)model is used to extract the keyword set.The latent dirichlet allocation(LDA)model is adopted to extract the topic word set.These models enable the expansion of the medical record text content.Finally,we embed the global attention mechanism and gating mechanism between the bidirectional recurrent neural network(BiRNN)model and the output layer.We call it gated-attention-BiRNN(GA-BiRNN)and use it to identify heart failure from extensive medical texts.Results show that the F 1 score of the proposed KTI-RNN model is 85.57%,and the accuracy rate of the proposed KTI-RNN model is 85.59%. 展开更多
关键词 heart failure diagnosis text classification deep learning
原文传递
On the k-sample Behrens-Fisher problem for high-dimensional data 被引量:3
10
作者 ZHANG JinTing XU JinFeng 《Science China Mathematics》 SCIE 2009年第6期1285-1304,共20页
For several decades,much attention has been paid to the two-sample Behrens-Fisher(BF) problem which tests the equality of the means or mean vectors of two normal populations with unequal variance/covariance structures... For several decades,much attention has been paid to the two-sample Behrens-Fisher(BF) problem which tests the equality of the means or mean vectors of two normal populations with unequal variance/covariance structures.Little work,however,has been done for the k-sample BF problem for high dimensional data which tests the equality of the mean vectors of several high-dimensional normal populations with unequal covariance structures.In this paper we study this challenging problem via extending the famous Scheffe's transformation method,which reduces the k-sample BF problem to a one-sample problem.The induced one-sample problem can be easily tested by the classical Hotelling's T 2 test when the size of the resulting sample is very large relative to its dimensionality.For high dimensional data,however,the dimensionality of the resulting sample is often very large,and even much larger than its sample size,which makes the classical Hotelling's T 2 test not powerful or not even well defined.To overcome this diffculty,we propose and study an L2-norm based test.The asymp-totic powers of the proposed L2-norm based test and Hotelling's T 2 test are derived and theoretically compared.Methods for implementing the L2-norm based test are described.Simulation studies are conducted to compare the L2-norm based test and Hotelling's T 2 test when the latter can be well defined,and to compare the proposed implementation methods for the L2-norm based test otherwise.The methodologies are motivated and illustrated by a real data example. 展开更多
关键词 χ~2-approximation χ~2-type MIXTURES HIGH-DIMENSIONAL data analysis Hotelling’s T^2 TEST k-sample TEST L^2-norm based TEST
原文传递
A regression approach to ROC surface,with applications to Alzheimer's disease
11
作者 LI JiaLiang ZHOU XiaoHua FINE Jason P 《Science China Mathematics》 SCIE 2012年第8期1583-1595,共13页
We consider the estimation of three-dimensional ROC surfaces for continuous tests given covariates.Three way ROC analysis is important in our motivating example where patients with Alzheimer's disease are usually ... We consider the estimation of three-dimensional ROC surfaces for continuous tests given covariates.Three way ROC analysis is important in our motivating example where patients with Alzheimer's disease are usually classified into three categories and should receive different category-specific medical treatment.There has been no discussion on how covariates affect the three way ROC analysis.We propose a regression framework induced from the relationship between test results and covariates.We consider several practical cases and the corresponding inference procedures.Simulations are conducted to validate our methodology.The application on the motivating example illustrates clearly the age and sex effects on the accuracy for Mini-Mental State Examination of Alzheimer's disease. 展开更多
关键词 阿尔茨海默氏症 应用程序 表面 回归 民国 中华 测试框架 协变量
原文传递
Multi-category diagnostic accuracy based on logistic regression
12
作者 Jialiang Li Jason P.Fine Michael JPencina 《Statistical Theory and Related Fields》 2017年第2期143-158,共16页
We provide a detailed review for the statistical analysis of diagnostic accuracy in a multi-category classification task.For qualitative response variables with more than two categories,many traditional accuracy measu... We provide a detailed review for the statistical analysis of diagnostic accuracy in a multi-category classification task.For qualitative response variables with more than two categories,many traditional accuracy measures such as sensitivity,specificity and area under the ROC curve are no longer applicable.In recent literature,new diagnostic accuracy measures are introduced in medical research studies.In this paper,important statistical concepts for multi-category classification accuracy are reviewed and their utilities are demonstrated with real medical examples.We offer problem-based R code to illustrate how to perform these statistical computations step by step.We expect such analysis tools will become more familiar to practitioners and receive broader applications in biostatistics.Our program can be adapted to many classifiers among which logistic regression may be the most popular approach.We thus base our discussion and illustration completely on the logistic regression in this paper. 展开更多
关键词 Hypervolume under the ROC manifold multi-category classification correct classification probability net reclassification improvement integrated discrimination improvement marker evaluation R software
原文传递
On high-dimensional change point problem
13
作者 JIN BaiSuo PAN GuangMing +1 位作者 YANG Qing ZHOU Wang 《Science China Mathematics》 SCIE CSCD 2016年第12期2355-2378,共24页
New statistics are proposed to estimate and test the structural change when the data dimension is comparable to or larger than the sample size. Consistency of the new statistic in estimating the change point position ... New statistics are proposed to estimate and test the structural change when the data dimension is comparable to or larger than the sample size. Consistency of the new statistic in estimating the change point position is established under the alternative hypothesis. The asymptotic distribution of the new statistic in testing the existence of a change point is obtained under the null hypothesis. Some simulation results are presented which show that the numerical performance of our method is satisfactory. The method is illustrated via the analysis of the house price index of US. 展开更多
关键词 变化点 高度维的统计推理 Hotellings T 2 统计数值
原文传递
On-chain analytics for sentiment-driven statistical causality in cryptocurrencies
14
作者 Ioannis Chalkiadakis Anna Zaremba +1 位作者 Gareth W.Peters Michael J.Chantler 《Blockchain(Research and Applications)》 2022年第2期39-63,共25页
This paper establishes a new framework for assessing multimodal statistical causality between cryptocurrency market(cryptomarket)sentiment and cryptocurrency price processes.In order to achieve this,we present an effi... This paper establishes a new framework for assessing multimodal statistical causality between cryptocurrency market(cryptomarket)sentiment and cryptocurrency price processes.In order to achieve this,we present an efficient algorithm for multimodal statistical causality analysis based on Multiple-Output Gaussian Processes.Signals from different information sources(modalities)are jointly modelled as a Multiple-Output Gaussian Process,and then using a novel approach to statistical causality based on Gaussian Processes(GPs),we study linear and non-linear causal effects between the different modalities.We demonstrate the effectiveness of our approach in a machine learning application by studying the relationship between cryptocurrency spot price dynamics and sentiment time-series data specific to the crypto sector,which we conjecture influences retail investor behaviour.The investor sentiment is extracted from cryptomarket news data via methods developed in the area of statistical machine learning known as Natural Language Processing(NLP).To capture sentiment,we present a novel framework for text to time-series embedding,which we then use to construct a sentiment index from publicly available news articles.We conduct a statistical analysis of our sentiment statistical index model and compare it to alternative state-of-the-art sentiment models popular in the NLP literature.In regard to the multimodal causality,the investor sentiment is our primary modality of exploration,in addition to price and a blockchain technologyrelated indicator(hash rate).Analysis shows that our approach is effective in modelling causal structures of variable degree of complexity between heterogeneous data sources and illustrates the impact that certain modelling choices for the different modalities can have on detecting causality.A solid understanding of these factors is necessary to gauge cryptocurrency adoption by retail investors and provide sentiment-and technologybased insights about the cryptocurrency market dynamics. 展开更多
关键词 Cryptocurrencies Statistical causality Blockchain regression Multiple-output Gaussian process Natural language processing Cryptonews sentiment
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部