The study provides one of the first lines of evidence showing linkages between Antarctic phytoplankton abundance and composition in response to ENSO, based on historical reconstruction of sediment biomarkers. In addit...The study provides one of the first lines of evidence showing linkages between Antarctic phytoplankton abundance and composition in response to ENSO, based on historical reconstruction of sediment biomarkers. In addition to sediment biomarkers, field measured and remote sensing data of phytoplankton abundance were also recorded from Prydz Bay, Eastern Antarctica. Com-munity structure of field measured phytoplankton showed significant El Ni?o/La Ni?a-related succession during 1990 to 2002. In general, the number of algae species decreased during El Ni?o and La Ni?a years compared to normal years. Austral summer monthly variation of remotely sensed chlorophyll-a (Chl-a), particulate organic carbon (POC), and sea surface temperature (SST) indicated that ENSO impacted the timing of phytoplankton blooms during 2007 to 2011. Phytoplankton blooms (indicated by Chl-a and POC) preceded the increases in SST during El Ni?o years, and lagged behind the SST increases during La Ni?a years. Stratigraphic record of marine sedimentary lipid (brassicasterol, dinosterol and alkenones) biomarkers inferred that the proportions of different algae (diatoms, dinoflagellates and haptophytes) changed significantly between El Ni?o and La Ni?a events. The relative proportion of diatoms increased, with that of dinoflagellates being decreased during El Ni?o years, while it was reversed during La Ni?a years.展开更多
Researchers in bioinformatics, biostatistics and other related fields seek biomarkers for many purposes, including risk assessment, disease diagnosis and prognosis, which can be formulated as a patient classification....Researchers in bioinformatics, biostatistics and other related fields seek biomarkers for many purposes, including risk assessment, disease diagnosis and prognosis, which can be formulated as a patient classification. In this paper, a new method of using a tree regression to improve logistic classification model is introduced in biomarker data analysis. The numerical results show that the linear logistic model can be significantly improved by a tree regression on the residuals. Although the classification problem of binary responses is discussed in this research, the idea is easy to extend to the classification of multinomial responses.展开更多
The generalized linear model is an indispensable tool for analyzing non-Gaussian response data, with both canonical and non-canonical link functions comprehensively used. When missing values are present, many existing...The generalized linear model is an indispensable tool for analyzing non-Gaussian response data, with both canonical and non-canonical link functions comprehensively used. When missing values are present, many existing methods in the literature heavily depend on an unverifiable assumption of the missing data mechanism, and they fail when the assumption is violated. This paper proposes a missing data mechanism that is as generally applicable as possible, which includes both ignorable and nonignorable missing data cases, as well as both scenarios of missing values in response and covariate.Under this general missing data mechanism, the authors adopt an approximate conditional likelihood method to estimate unknown parameters. The authors rigorously establish the regularity conditions under which the unknown parameters are identifiable under the approximate conditional likelihood approach. For parameters that are identifiable, the authors prove the asymptotic normality of the estimators obtained by maximizing the approximate conditional likelihood. Some simulation studies are conducted to evaluate finite sample performance of the proposed estimators as well as estimators from some existing methods. Finally, the authors present a biomarker analysis in prostate cancer study to illustrate the proposed method.展开更多
基金financially supported by the National Natural Science Foundation of China (NSFC) (40876104, 41306202, 41376193, 41076134 and 41006118)the scientific research fund of Second Institute of Oceanography, SOA (JT1208 and JG1218)+1 种基金Chinese Arctic and Antarctic Administration Foundation (20110208)the special fund for polar environment comprehensive investigation and assessment (CHINARE 2014-04-04, 2014-01-04 and 2014-04-01)
文摘The study provides one of the first lines of evidence showing linkages between Antarctic phytoplankton abundance and composition in response to ENSO, based on historical reconstruction of sediment biomarkers. In addition to sediment biomarkers, field measured and remote sensing data of phytoplankton abundance were also recorded from Prydz Bay, Eastern Antarctica. Com-munity structure of field measured phytoplankton showed significant El Ni?o/La Ni?a-related succession during 1990 to 2002. In general, the number of algae species decreased during El Ni?o and La Ni?a years compared to normal years. Austral summer monthly variation of remotely sensed chlorophyll-a (Chl-a), particulate organic carbon (POC), and sea surface temperature (SST) indicated that ENSO impacted the timing of phytoplankton blooms during 2007 to 2011. Phytoplankton blooms (indicated by Chl-a and POC) preceded the increases in SST during El Ni?o years, and lagged behind the SST increases during La Ni?a years. Stratigraphic record of marine sedimentary lipid (brassicasterol, dinosterol and alkenones) biomarkers inferred that the proportions of different algae (diatoms, dinoflagellates and haptophytes) changed significantly between El Ni?o and La Ni?a events. The relative proportion of diatoms increased, with that of dinoflagellates being decreased during El Ni?o years, while it was reversed during La Ni?a years.
文摘Researchers in bioinformatics, biostatistics and other related fields seek biomarkers for many purposes, including risk assessment, disease diagnosis and prognosis, which can be formulated as a patient classification. In this paper, a new method of using a tree regression to improve logistic classification model is introduced in biomarker data analysis. The numerical results show that the linear logistic model can be significantly improved by a tree regression on the residuals. Although the classification problem of binary responses is discussed in this research, the idea is easy to extend to the classification of multinomial responses.
基金supported by the Chinese 111 Project B14019the US National Science Foundation under Grant Nos.DMS-1305474 and DMS-1612873the US National Institutes of Health Award UL1TR001412
文摘The generalized linear model is an indispensable tool for analyzing non-Gaussian response data, with both canonical and non-canonical link functions comprehensively used. When missing values are present, many existing methods in the literature heavily depend on an unverifiable assumption of the missing data mechanism, and they fail when the assumption is violated. This paper proposes a missing data mechanism that is as generally applicable as possible, which includes both ignorable and nonignorable missing data cases, as well as both scenarios of missing values in response and covariate.Under this general missing data mechanism, the authors adopt an approximate conditional likelihood method to estimate unknown parameters. The authors rigorously establish the regularity conditions under which the unknown parameters are identifiable under the approximate conditional likelihood approach. For parameters that are identifiable, the authors prove the asymptotic normality of the estimators obtained by maximizing the approximate conditional likelihood. Some simulation studies are conducted to evaluate finite sample performance of the proposed estimators as well as estimators from some existing methods. Finally, the authors present a biomarker analysis in prostate cancer study to illustrate the proposed method.