期刊文献+
共找到9篇文章
< 1 >
每页显示 20 50 100
A Study of EM Algorithm as an Imputation Method: A Model-Based Simulation Study with Application to a Synthetic Compositional Data
1
作者 Yisa Adeniyi Abolade Yichuan Zhao 《Open Journal of Modelling and Simulation》 2024年第2期33-42,共10页
Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear mode... Compositional data, such as relative information, is a crucial aspect of machine learning and other related fields. It is typically recorded as closed data or sums to a constant, like 100%. The statistical linear model is the most used technique for identifying hidden relationships between underlying random variables of interest. However, data quality is a significant challenge in machine learning, especially when missing data is present. The linear regression model is a commonly used statistical modeling technique used in various applications to find relationships between variables of interest. When estimating linear regression parameters which are useful for things like future prediction and partial effects analysis of independent variables, maximum likelihood estimation (MLE) is the method of choice. However, many datasets contain missing observations, which can lead to costly and time-consuming data recovery. To address this issue, the expectation-maximization (EM) algorithm has been suggested as a solution for situations including missing data. The EM algorithm repeatedly finds the best estimates of parameters in statistical models that depend on variables or data that have not been observed. This is called maximum likelihood or maximum a posteriori (MAP). Using the present estimate as input, the expectation (E) step constructs a log-likelihood function. Finding the parameters that maximize the anticipated log-likelihood, as determined in the E step, is the job of the maximization (M) phase. This study looked at how well the EM algorithm worked on a made-up compositional dataset with missing observations. It used both the robust least square version and ordinary least square regression techniques. The efficacy of the EM algorithm was compared with two alternative imputation techniques, k-Nearest Neighbor (k-NN) and mean imputation (), in terms of Aitchison distances and covariance. 展开更多
关键词 compositional data Linear Regression Model Least Square Method Robust Least Square Method Synthetic data Aitchison Distance Maximum Likelihood Estimation Expectation-Maximization Algorithm k-Nearest Neighbor and Mean imputation
下载PDF
Using compositional data analysis to explore accumulation of sedentary behavior,physical activity and youth health
2
作者 Simone J.J.M.Verswijveren Karen E.Lamb +11 位作者 Josep A.Martín-Fernández Elisabeth Winkler Rebecca M.Leech Anna Timperio Jo Salmon Robin M.Daly Ester Cerin David W.Dunstan Rohan M.Telford Richard D.Telford Lisa S.Olive Nicola D.Ridgers 《Journal of Sport and Health Science》 SCIE 2022年第2期234-243,共10页
Purpose:The study aimed to describe youth time-use compositions,focusing on time spent in shorter and longer bouts of sedentary behavior and physical activity(PA),and to examine associations of these time-use composit... Purpose:The study aimed to describe youth time-use compositions,focusing on time spent in shorter and longer bouts of sedentary behavior and physical activity(PA),and to examine associations of these time-use compositions with cardiometabolic biomarkers.Methods:Accelerometer and cardiometabolic biomarker data from 2 Australian studies involving youths 7-13 years old were pooled(complete cases with accelerometry and adiposity marker data,n=782).A 9-component time-use composition was formed using compositional data analysis:time in shorter and longer bouts of sedentary behavior;time in shorter and longer bouts of light-,moderate-,or vigorous-intensity PA;and"other time"(i.e.,non-wear/sleep).Shorter and longer bouts of sedentary time were defined as<5 min and>5 min,respectively.Shorter bouts of light-,moderate-,and vigorous-intensity PA were defined as<1 min;longer bouts were defined as≥1 min.Regression models examined associations between overall time-use composition and cardiometabolic biomarkers.Then,associations were derived between ratios of longer activity patterns relative to shorter activity patterns,and of each intensity level relative to the other intensity levels and"other time",and cardiometabolic biomarkers.Results:Confounder-adjusted models showed that the overall time-use composition was associated with adiposity,blood pressure,lipids,and the summary score.Specifically,more time in longer bouts of light-intensity PA relative to shorter bouts of light-intensity PA was significantly associated with greater body mass index z-score(zBMI)(β=1.79;SE=0.68)and waist circumference(β=18.35,SE=4.78).When each activity intensity was considered relative to all higher intensities and"other time",more time in light-and vigorous-intensity PA,and less time in sedentary behavior and moderate-intensity PA,were associated with lower waist circumference.Conclusion:Accumulating PA,particularly light-intensity PA,in frequent short bursts may be more beneficial for limiting adiposity compared to accumulating the same amount of PA at these intensities in longer bouts. 展开更多
关键词 Accumulation patterns Cardiometabolic health CHILDREN compositional data analysis Time-use
下载PDF
Data processing for the Active Particle-induced X-ray Spectrometer and initial scientific results from Chang'e-3 mission 被引量:2
3
作者 Xiao-Hui Fu Chun-Lai Li +11 位作者 Guang-Liang Zhang Yong-Liao Zou Jian-Jun Liu Xin Ren Xu Tan Xiao-Xia Zhang Wei Zuo Wei-Bin Wen Wen-Xi Peng Xing-Zhu Cui Cheng-Mo Zhang Huan-Yu Wang 《Research in Astronomy and Astrophysics》 SCIE CAS CSCD 2014年第12期1595-1606,共12页
The Active Particle-induced X-ray Spectrometer (APXS) is an important payload mounted on the Yutu rover, which is part of the Chang'e-3 mission. The sci- entific objective of APXS is to perform in-situ analysis of ... The Active Particle-induced X-ray Spectrometer (APXS) is an important payload mounted on the Yutu rover, which is part of the Chang'e-3 mission. The sci- entific objective of APXS is to perform in-situ analysis of the chemical composition of lunar soil and rock samples. The radioactive sources, 55Fe and 109Cd, decay and produce a-particles and X-rays. When X-rays and a-particles interact with atoms in the surface material, they knock electrons out of their orbits, which release energy by emitting X-rays that can be measured by a silicon drift detector (SDD). The elements and their concentrations can be determined by analyzing their peak energies and in- tensities. APXS has analyzed both the calibration target and lunar soil once during the first lunar day and again during the second lunar day. The total detection time lasted about 266 min and more than 2000 frames of data records have been acquired. APXS has three operating modes: calibration mode, distance sensing mode and detection mode. In detection mode, work distance can be calculated from the X-ray counting rate collected by SDD. Correction for the effect of temperature has been performed to convert the channel number for each spectrum to X-ray energy. Dead time correction is used to eliminate the systematic error in quantifying the activity of an X-ray pulse in a sample and derive the real count rate. We report APXS data and initial results during the first and second lunar days for the Yutu rover. In this study, we analyze the data from the calibration target and lunar soil on the first lunar day. Seven major elements, including Mg, A1, Si, K, Ca, Ti and Fe, have been identified. Comparing the peak areas and ratios of calibration basalt and lunar soil the landing site was found to be depleted in K, and have lower Mg and A1 but higher Ca, Ti, and Fe. In the future, we will obtain the elemental concentrations of lunar soil at the Chang'e-3 landing site using APXS data. 展开更多
关键词 data analysis -- composition -- Moon
下载PDF
Short-Term Impact of Elemental Sulfur on Cranberry Nutrition and Crop Performance
4
作者 Reza Jamaly Serge-Étienne Parent +1 位作者 Noura Ziadi Léon E. Parent 《Open Journal of Soil Science》 2023年第2期83-96,共14页
Cranberry (Vaccinium macrocarpon Ait.) is an ammophilous plant grown on acid soils (pH 4.0 - 5.5). Elemental sulfur is commonly applied at a recommended rate of 1120 kg S ha<sup>−1</sup> per pH unit to aci... Cranberry (Vaccinium macrocarpon Ait.) is an ammophilous plant grown on acid soils (pH 4.0 - 5.5). Elemental sulfur is commonly applied at a recommended rate of 1120 kg S ha<sup>−1</sup> per pH unit to acidify cranberry soils, potentially impacting the plant mineral nutrition. The general recommendation may not fit all conditions encountered in the field. Our objective was to develop an equation to predict the sulfur requirement to reach pH<sub>water</sub> of 4.2 to tackle nitrification in acidic cranberry soils varying in initial pH values, and to measure the effect of elemental sulfur on the mineral nutrition and the performance of cranberry crops. A 3-yr experiment was designed to test the effect of elemental sulfur on soil and tissue tests and on berry yield and quality. Four S treatments (0, 250, 500 and 1000 kg S ha<sup>−1</sup>) were established on three duplicated sites during two consecutive years. We ran soil, foliar tissue, berry tissue tests, and measured berry yield, size, anthocyanin content (TAcy), Brix, and firmness. Nutrients were expressed as centered log ratios to reflect nutrient interactions. Results were analyzed using a mixed model. Soil Ca decreased while soil Mn and S increased significantly (p ≤ 0.05). Sulfur showed no significant effects on nutrient balances in uprights. The S impacted negatively berry B balance, and positively berry Mn and S balances. A linear regression model relating pH change to S dosage and elapsed time (R<sup>2</sup> = 0.53) showed that to reach pH<sub>water</sub> of 4.2 two years after S application, 250 - 1000 kg S ha<sup>−1</sup> could be applied depending on initial soil pH value. The stratification of surface-applied elemental S in the soil profile should be further examined in relation to plant rooting and nutrient leaching. 展开更多
关键词 SULFUR Berry Yield and Quality compositional data Local Diagnosis Plant Nutrition Soil Acidification
下载PDF
Application of cluster analysis to geochemical compositional data for identifying ore-related geochemical anomalies 被引量:3
5
作者 Shuguang ZHOU Kefa ZHOU +2 位作者 Jinlin WANG Genfang YANG Shanshan WANG 《Frontiers of Earth Science》 SCIE CAS CSCD 2018年第3期491-505,共15页
Cluster analysis is a well-known technique that is used to analyze various types of data. In this study, cluster analysis is applied to geochemical data that describe 1444 stream sediment samples collected in northwes... Cluster analysis is a well-known technique that is used to analyze various types of data. In this study, cluster analysis is applied to geochemical data that describe 1444 stream sediment samples collected in northwestern Xinjiang with a sample spacing of approximately 2 km. Three algorithms (the hierarchical, k-means, and fuzzy cmeans algorithms) and six data transformation methods (the z-score standardization, ZST; the logarithmic transformation, LT; the additive log-ratio transformation, ALT; the centered log-ratio transformation, CLT; the isometric log-ratio transformation, ILT; and no transformation, NT) are compared in terms of their effects on the cluster analysis of the geochemical compositional data. The study shows that, on the one hand, the ZST does not affect the results of columnor variable-based (R-type) cluster analysis, whereas the other methods, including the LT, the ALT, and the CLT, have substantial effects on the results. On the other hand, the results of the row- or observation-based (Q-type) cluster analysis obtained from the geochemical data after applying NT and the ZST are relatively poor. However, we derive some improved results from the geochemical data after applying the CLT, the ILT, the LT, and the ALT. Moreover, the k-means and fuzzy cmeans clustering algorithms are more reliable than the hierarchical algorithm when they are used to cluster the geochemical data. We apply cluster analysis to the geochemical data to explore for Au deposits within the study area, and we obtain a good correlation between the results retrieved by combining the CLT or the ILT with the k-means or fuzzy c-means algorithms and the potential zones of Au mineralization. Therefore, we suggest that the combination of the CLT or the ILT with the k-means or fuzzy c-means algorithms is an effective tool to identify potential zones of mineralization from geochemical data. 展开更多
关键词 cluster analysis compositional data geochemical anomaly mineral exploration
原文传递
Simplicial Indicator Kriging
6
作者 R Tolosana-Delgado V Pawlowsky-Glahn J J Egozcue 《Journal of China University of Geosciences》 SCIE CSCD 2008年第1期65-71,共7页
Indicator kriging (IK) is a spatial interpolation technique devised for estimating a conditional cumulative distribution function at an unsampled location. The result is a discrete approximation, and its correspondi... Indicator kriging (IK) is a spatial interpolation technique devised for estimating a conditional cumulative distribution function at an unsampled location. The result is a discrete approximation, and its corresponding estimated probability density function can be viewed as a composition in the simplex. This fact suggested a compositional approach to IK which, by construction, avoids all its standard drawbacks (negative predictions, not-ordered or larger than one). Here, a simple algorithm to develop the procedure is presented. 展开更多
关键词 Aitchison geometry compositional data disjunctive kriging log-ratio approach simplex.
下载PDF
Onion Response to Added N in Histosols of Contrasting C and N Contents
7
作者 Melissa Quinche Gonzalez Annie Pellerin Léon E. Parent 《American Journal of Plant Sciences》 2016年第3期469-478,共10页
Adjusting the N fertilization to soil potentially mineralizable N in Histosols is required to secure high vegetable yields while mitigating nitrate contamination of surface waters. However, there is still no soil test... Adjusting the N fertilization to soil potentially mineralizable N in Histosols is required to secure high vegetable yields while mitigating nitrate contamination of surface waters. However, there is still no soil test N (STN) relating the response of Histosol-grown onion (Allium cepa L.) to added N. Compositional data analysis can integrate soil C and N composition into a STN index computed as Mahalanobis distance (M<sup>2</sup>) across isometric log ratios (ilr) of diagnosed and reference soil C and N compositions. Our objective was to calibrate onion response to added N against a compositional STN index for Histosols. Reference compositions were computed from high N-mineralizing Histosols reported in the literature. Soil analyses were total C and N, and a residual soil mass (F<sub>v</sub>) was computed as 100%-%C-%N to close the compositional vector to 100%. The C, N, and F<sub>v</sub> proportions were synthesized into two ilrs. We conducted thirteen onion N fertilization trials in Histosols of south-western Quebec showing contrasting C, N, and F<sub>v</sub> proportions. Each crop received four N rates broadcast before seeding or split-applied. We derived two STN classes separating weakly to highly responsive crops about the M<sup>2</sup> value of 5.5. Onion crops grown on soils showing M<sup>2</sup> values >5.5 required more N and yielded less in control treatments compared with soils showing M<sup>2</sup> values 5.5) soils responded significantly (P < 0.10) to 60 and 180 kg N ha<sup>-1</sup>, respectively. Using literature data and the results of this study, we elaborated a provisory N requirement model for Histosol-grown onions in Quebec. 展开更多
关键词 compositional data Analysis META-ANALYSIS ONION C/N Ratio Soil Test N
下载PDF
Linear mixed-effects model for longitudinal complex data with diversified characteristics 被引量:2
8
作者 Zhichao Wang Huiwen Wang +2 位作者 Shanshan Wang Shan Lu Gilbert Saporta 《Journal of Management Science and Engineering》 2020年第2期105-124,共20页
The increasing richness of data encourages a comprehensive understanding of economic and financial activities,where variables of interest may include not only scalar(point-like)indicators,but also functional(curve-lik... The increasing richness of data encourages a comprehensive understanding of economic and financial activities,where variables of interest may include not only scalar(point-like)indicators,but also functional(curve-like)and compositional(pie-like)ones.In many research topics,the variables are also chronologically collected across individuals,which falls into the paradigm of longitudinal analysis.The complicated nature of data,however,increases the difficulty of modeling these variables under the classic longitudinal frame-work.In this study,we investigate the linear mixed-effects model(LMM)for such complex data.Different types of variables arefirst consistently represented using the corresponding basis expansions so that the classic LMM can then be conducted on them,which gener-alizes the theoretical framework of LMM to complex data analysis.A number of simulation studies indicate the feasibility and effectiveness of the proposed model.We further illustrate its practical utility in a real data study on Chinese stock market and show that the proposed method can enhance the performance and interpretability of the regression for complex data with diversified characteristics. 展开更多
关键词 Longitudinal complex data Linear mixed-effects model compositional data analysis Functional data analysis Chinese stock market Online investors'sentiment
原文传递
Research progress of 24-hour movement behaviors in chronic non-communicable disease
9
作者 Rong-Xuan Li Qing-Qing Fan Di Cui 《Life Research》 2024年第3期24-34,共11页
Chronic non-communicable diseases(NCDs)represent a significant impediment to improve life expectancy and remain a focal point in global public health and disease prevention efforts.24-hour movement behaviors,which inc... Chronic non-communicable diseases(NCDs)represent a significant impediment to improve life expectancy and remain a focal point in global public health and disease prevention efforts.24-hour movement behaviors,which include sleep,sedentary behavior(SED),and physical activity,underscore the inherent connections between different daily activities and the comprehensive impact of overall movement patterns on health.Evidence suggested that modifying patterns of 24-hour movement behaviors can aid in preventing and attenuating the progression of NCDs.This study systematically delineated the concept,evolution,analytical methods,and intrinsic associations of 24-hour movement behaviors,emphasizing their pivotal role in the prevention and management of NCDs such as obesity,mental disorders,cardiovascular diseases,diabetes,and renal diseases.Future research endeavors should focus on refining methodologies,broadening study populations,developing research tools,and exploring precise intervention strategies and interdisciplinary approaches to comprehensively enhance the effectiveness of NCDs prevention and management from a temporal perspective.Such efforts are poised to provide substantive guidance and support for public health practices. 展开更多
关键词 chronic non-communicable diseases 24-hour movement behaviors time-use epidemiology isotemporal substitution model compositional data analysis
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部