This paper deals with the statistical modeling of latent topic hierarchies in text corpora. The height of the topic tree is assumed as fixed, while the number of topics on each level as unknown a priori and to be infe...This paper deals with the statistical modeling of latent topic hierarchies in text corpora. The height of the topic tree is assumed as fixed, while the number of topics on each level as unknown a priori and to be inferred from data. Taking a nonpara-metric Bayesian approach to this problem, we propose a new probabilistic generative model based on the nested hierarchical Dirichlet process (nHDP) and present a Markov chain Monte Carlo sampling algorithm for the inference of the topic tree structure as well as the word distribution of each topic and topic distribution of each document. Our theoretical analysis and experiment results show that this model can produce a more compact hierarchical topic structure and captures more fine-grained topic rela-tionships compared to the hierarchical latent Dirichlet allocation model.展开更多
In this study, the variations in surface soil liquid water(SSLW) due to future climate change are explored in the‘Huang-Huai-Hai Plain'(‘3H') region in China with the Common Land Model(CoLM). To evaluate the...In this study, the variations in surface soil liquid water(SSLW) due to future climate change are explored in the‘Huang-Huai-Hai Plain'(‘3H') region in China with the Common Land Model(CoLM). To evaluate the possible maximum response of SSLW to climate change, the combination of the conditional nonlinear optimal perturbation related to the parameter(CNOP-P) approach and projections from 10 general circulation models(GCMs) of the Coupled Model Intercomparison Project5(CMIP5) are used. The CNOP-P-type temperature change scenario, a new type of temperature change scenario, is determined by using the CNOP-P method and constrained by the temperature change projections from the 10 GCMs under a high-emission scenario(the Representative Concentration Pathway 8.5 scenario). Numerical results have shown that the response of SSLW to the CNOP-P-type temperature scenario is stronger than those to the 11 temperature scenarios derived from the 10 GCMs and from their ensemble average in the entire ‘3H' region. In the northern region, SSLW under the CNOP-P-type scenario increases to0.1773 m^3 m^(-3); however, SSLW in the scenarios from the GCMs fluctuates from 0.1671 to 0.1748 m^3 m^(-3). In the southern region,SSLW decreases, and its variation(–0.0070 m^3 m^(-3)) due to the CNOP-P-type scenario is higher than each of the variations(–0.0051 to –0.0026 m^3 m^(-3)) due to the scenarios from the GCMs.展开更多
基金Project (No. 60773180) supported by the National Natural Science Foundation of China
文摘This paper deals with the statistical modeling of latent topic hierarchies in text corpora. The height of the topic tree is assumed as fixed, while the number of topics on each level as unknown a priori and to be inferred from data. Taking a nonpara-metric Bayesian approach to this problem, we propose a new probabilistic generative model based on the nested hierarchical Dirichlet process (nHDP) and present a Markov chain Monte Carlo sampling algorithm for the inference of the topic tree structure as well as the word distribution of each topic and topic distribution of each document. Our theoretical analysis and experiment results show that this model can produce a more compact hierarchical topic structure and captures more fine-grained topic rela-tionships compared to the hierarchical latent Dirichlet allocation model.
基金supported by the National Natural Science Foundation of China(Grant Nos.91437111&41375111&41675104&41230420)
文摘In this study, the variations in surface soil liquid water(SSLW) due to future climate change are explored in the‘Huang-Huai-Hai Plain'(‘3H') region in China with the Common Land Model(CoLM). To evaluate the possible maximum response of SSLW to climate change, the combination of the conditional nonlinear optimal perturbation related to the parameter(CNOP-P) approach and projections from 10 general circulation models(GCMs) of the Coupled Model Intercomparison Project5(CMIP5) are used. The CNOP-P-type temperature change scenario, a new type of temperature change scenario, is determined by using the CNOP-P method and constrained by the temperature change projections from the 10 GCMs under a high-emission scenario(the Representative Concentration Pathway 8.5 scenario). Numerical results have shown that the response of SSLW to the CNOP-P-type temperature scenario is stronger than those to the 11 temperature scenarios derived from the 10 GCMs and from their ensemble average in the entire ‘3H' region. In the northern region, SSLW under the CNOP-P-type scenario increases to0.1773 m^3 m^(-3); however, SSLW in the scenarios from the GCMs fluctuates from 0.1671 to 0.1748 m^3 m^(-3). In the southern region,SSLW decreases, and its variation(–0.0070 m^3 m^(-3)) due to the CNOP-P-type scenario is higher than each of the variations(–0.0051 to –0.0026 m^3 m^(-3)) due to the scenarios from the GCMs.