284 articles found
Effect Modeling of Count Data Using Logistic Regression with Qualitative Predictors
1
Authors: Haeil Ahn. Engineering (SCIRP), 2014, No. 12, pp. 758-772 (15 pages)
We modeled binary count data with categorical predictors, using logistic regression to develop a statistical method. We found that ANOVA-type analyses often performed unsatisfactorily, even when using different transformations. The logistic transformation of fraction data could be an alternative, but it is not desirable in the statistical sense. We concluded that such methods are not appropriate, especially in cases where the fractions were close to 0 or 1. The major purpose of this paper is to demonstrate that logistic regression with an ANOVA-model-like parameterization aids our understanding and provides a somewhat different, but sound, statistical background. We examined a simple real-world example to show that we can efficiently test the significance of regression parameters, look for interactions, estimate related confidence intervals, and calculate the difference between the mean values of the referent and experimental subgroups. This paper demonstrates that precise confidence interval estimates can be obtained using the proposed ANOVA-model-like approach. The method discussed here can be extended to any type of experimental fraction data analysis, particularly for experimental design.
Keywords: logistic regression; logit; logistic response; categorical; binary; count data
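As a rough illustration of the kind of analysis this abstract describes, the sketch below fits a logistic regression with a single two-level categorical predictor (referent vs. experimental subgroup). For this saturated case the maximum-likelihood estimates have a closed form: the intercept is the logit of the referent fraction and the slope is the log odds ratio, with the usual Wald standard error. The function name and the counts are hypothetical and are not taken from the paper.

```python
import math


def two_group_logistic(success_ref, n_ref, success_exp, n_exp, z=1.96):
    """Closed-form logistic fit for one binary predictor (hypothetical helper).

    Returns (beta0, beta1, (lower, upper)) where beta0 is the logit of the
    referent group, beta1 is the log odds ratio of the experimental group,
    and the tuple is a Wald confidence interval for beta1.
    """
    fail_ref = n_ref - success_ref
    fail_exp = n_exp - success_exp
    # Intercept: log odds of success in the referent group.
    beta0 = math.log(success_ref / fail_ref)
    # Slope: difference in log odds between experimental and referent groups.
    beta1 = math.log(success_exp / fail_exp) - beta0
    # Wald standard error of the log odds ratio.
    se = math.sqrt(1 / success_ref + 1 / fail_ref + 1 / success_exp + 1 / fail_exp)
    return beta0, beta1, (beta1 - z * se, beta1 + z * se)


# Illustrative counts only: 20/100 successes (referent), 50/100 (experimental).
beta0, beta1, (lower, upper) = two_group_logistic(20, 100, 50, 100)
print(f"beta0={beta0:.3f}, beta1={beta1:.3f}, 95% CI=({lower:.3f}, {upper:.3f})")
```

The closed form fails when any cell count is zero, which is one reason fractions near 0 or 1 need more care than this sketch provides.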
Robust Estimation of Semiparametric Transformation Model for Panel Count Data (cited by: 2)
2
Authors: FENG Yan, WANG Yijun, +1 more author, WANG Weiwei, CHEN Zhuo. Journal of Systems Science & Complexity (SCIE, EI, CSCD), 2021, No. 6, pp. 2334-2356 (23 pages)
Panel count data are frequently encountered when study subjects are under discrete observations. However, limited literature has been found on variable selection for panel count data. In this paper, without considering the model assumption of the observation process, a more general semiparametric transformation model for panel count data with an informative observation process is developed. A penalized estimation procedure based on the quantile regression function is proposed for simultaneous variable selection and parameter estimation. The consistency and oracle properties of the estimators are established under some mild conditions. Some simulations and an application are reported to evaluate the proposed approach.
Keywords: B-spline function; panel count data; quantile regression; semiparametric transformation model; variable selection
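Empirical Likelihood Inference for the Parameters of Panel Count Data Models
3
Authors: 胡宏昌, 崔恒建. 《数理统计与管理》 (CSSCI, PKU Core), 2014, No. 4, pp. 647-654 (8 pages)
The treatment of panel count data is receiving increasing attention. Sun and Wei [1-2] proposed a regression analysis for panel count data based on a simple semiparametric model and gave estimating equations for the parameters. Building on the idea of empirical likelihood, this paper discusses the construction of confidence regions for the parameters of that panel count data model. In particular, it derives an estimate of the variance matrix of the parameter estimators from the empirical likelihood confidence region alone and proves the 1/n consistency of the estimate. Using the data given by Sun and Wei, the construction of the confidence regions and the results are presented in detail. A graphical comparison shows that the empirical likelihood confidence region outperforms the one constructed from asymptotic normality. We also estimate the variance matrix of the parameter estimators from the resulting empirical likelihood confidence region; the estimate is close to the matrix obtained from conventional asymptotic normality.
Keywords: panel count data; empirical likelihood; confidence region; covariance matrix estimation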
Some Additional Moment Conditions for a Dynamic Count Panel Data Model with Predetermined Explanatory Variables
4
Authors: Yoshitsugu Kitazawa. Open Journal of Statistics, 2013, No. 5, pp. 319-333 (15 pages)
This paper proposes some additional moment conditions for the linear feedback model with predetermined explanatory variables, which was proposed by [1] for the purpose of dealing with count panel data. The newly proposed moment conditions include those associated with equidispersion, the Negbin I-type model, and stationarity. The GMM estimators are constructed incorporating the additional moment conditions. Some Monte Carlo experiments indicate that the GMM estimators incorporating the additional moment conditions perform well compared with those using only the conventional moment conditions proposed by [2,3].
Keywords: count panel data; linear feedback model; moment conditions; GMM; Monte Carlo experiments
Dynamically Computing Approximate Frequency Counts in Sliding Window over Data Stream (cited by: 1)
5
Authors: NIE Guo-liang, LU Zheng-ding. Wuhan University Journal of Natural Sciences (EI, CAS), 2006, No. 1, pp. 283-288 (6 pages)
This paper presents two one-pass algorithms for dynamically computing frequency counts in a sliding window over a data stream, that is, computing the frequency counts that exceed a user-specified threshold ε. The first algorithm constructs sub-windows and periodically deletes expired sub-windows in the sliding window, and each sub-window maintains a summary data structure. The first algorithm outputs at most 1/ε + 1 elements for frequency queries over the most recent N elements. The second algorithm adapts a multi-level method to deal with the data stream. Once the sketch of the most recent N elements has been constructed, the second algorithm can answer frequency queries over the most recent n (n ≤ N) elements. The second algorithm outputs at most 1/ε + 2 elements. The analytical and experimental results show that our algorithms are accurate and effective.
Keywords: data stream; sliding window; approximation algorithms; frequency counts
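To make the query in this abstract concrete, here is a minimal exact baseline for frequency counts over the most recent N elements. It is not either of the paper's algorithms: it stores the whole window, whereas the paper's one-pass methods keep only summaries. The class and method names are hypothetical.

```python
from collections import Counter, deque


class SlidingWindowCounter:
    """Exact frequency counts over the most recent `window_size` items."""

    def __init__(self, window_size):
        self.window_size = window_size
        self.window = deque()
        self.counts = Counter()

    def add(self, item):
        # Admit the new item, then expire the oldest one if the window is full.
        self.window.append(item)
        self.counts[item] += 1
        if len(self.window) > self.window_size:
            expired = self.window.popleft()
            self.counts[expired] -= 1
            if self.counts[expired] == 0:
                del self.counts[expired]

    def frequent(self, epsilon):
        # Items whose count exceeds epsilon times the current window length.
        threshold = epsilon * len(self.window)
        return {item: c for item, c in self.counts.items() if c > threshold}


counter = SlidingWindowCounter(window_size=5)
for symbol in "aabacbbbb":
    counter.add(symbol)
print(counter.frequent(0.5))
```

Memory here grows with N, which is the cost that summary-based approximate algorithms are designed to avoid.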
Accuracy Assessment and Guidelines for Manual Traffic Counts from Pre-Recorded Video Data
6
Authors: Mishuk Majumder, Chester Wilmot. Journal of Transportation Technologies, 2023, No. 4, pp. 497-523 (27 pages)
Traffic count is the fundamental data source for transportation planning, management, design, and effectiveness evaluation. Recording traffic flow and counting from the recorded videos are increasingly used due to convenience, high accuracy, and cost-effectiveness. Manual counting from pre-recorded video footage can be prone to inconsistencies and errors, leading to inaccurate counts. Besides, there are no standard guidelines for collecting video data and conducting manual counts from the recorded videos. This paper aims to comprehensively assess the accuracy of manual counts from pre-recorded videos and introduces guidelines for efficiently collecting video data and conducting manual counts by trained individuals. The accuracy assessment of the manual counts was conducted based on repeated counts, and the guidelines were provided from the experience of conducting a traffic survey on forty strip mall access points in Baton Rouge, Louisiana, USA. The percentage of total error, classification error, and interval error were found to be 1.05 percent, 1.08 percent, and 1.29 percent, respectively. Besides, the percent root mean square errors (RMSE) were found to be 1.13 percent, 1.21 percent, and 1.48 percent, respectively. Guidelines were provided for selecting survey sites, instruments and timeframe, fieldwork, and manual counts for an efficient traffic data collection survey.
Keywords: traffic survey; counting error; transportation planning; total error; collecting video data; classification error; standard guidelines; repeated counts; interval error
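The abstract reports percent errors and percent RMSE without giving formulas. The sketch below shows one common way such measures are computed from repeated counts. The definitions (total error relative to the reference total, RMSE relative to the mean reference count) are assumptions and may differ from the paper's, and the counts are made up.

```python
import math


def percent_total_error(counted, reference):
    """Absolute difference of totals as a percentage of the reference total (assumed definition)."""
    return abs(sum(counted) - sum(reference)) / sum(reference) * 100.0


def percent_rmse(counted, reference):
    """RMSE across intervals as a percentage of the mean reference count (assumed definition)."""
    mse = sum((c - r) ** 2 for c, r in zip(counted, reference)) / len(reference)
    mean_reference = sum(reference) / len(reference)
    return math.sqrt(mse) / mean_reference * 100.0


# Hypothetical interval counts from a first count and a repeated (reference) count.
first_count = [98, 102, 100]
repeat_count = [100, 100, 100]
print(percent_total_error(first_count, repeat_count))
print(round(percent_rmse(first_count, repeat_count), 3))
```

In this toy case the total error is zero even though individual intervals disagree, which shows why an interval-level measure such as RMSE is worth reporting alongside the total.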
Bayesian Computation for the Parameters of a Zero-Inflated Cosine Geometric Distribution with Application to COVID-19 Pandemic Data
7
Authors: Sunisa Junnumtuam, Sa-Aat Niwitpong, Suparat Niwitpong. Computer Modeling in Engineering & Sciences (SCIE, EI), 2023, No. 5, pp. 1229-1254 (26 pages)
A new three-parameter discrete distribution called the zero-inflated cosine geometric (ZICG) distribution is proposed for the first time herein. It can be used to analyze over-dispersed count data with excess zeros. The basic statistical properties of the new distribution, such as the moment generating function, mean, and variance, are presented. Furthermore, confidence intervals are constructed by using the Wald, Bayesian, and highest posterior density (HPD) methods to estimate the true confidence intervals for the parameters of the ZICG distribution. Their efficacies were investigated by using both simulation and real-world data comprising the number of daily COVID-19 positive cases at the Olympic Games in Tokyo 2020. The results show that the HPD interval performed better than the other methods in terms of coverage probability and average length in most cases studied.
Keywords: Bayesian analysis; confidence interval; Gibbs sampling; random-walk Metropolis; zero-inflated count data
Using Statistical Learning to Treat Missing Data: A Case of HIV/TB Co-Infection in Kenya
8
Authors: Joshua O. Mwaro, Linda Chaba, Collins Odhiambo. Journal of Data Analysis and Information Processing, 2020, No. 3, pp. 110-133 (24 pages)
In this study, we investigate the effects of missing data when estimating HIV/TB co-infection. We revisit the concept of missing data and examine three available approaches for dealing with missingness. The main objective is to identify the best method for correcting missing data in the TB/HIV co-infection setting. We employ both empirical data analysis and an extensive simulation study to examine the effects of missing data, the accuracy, sensitivity, specificity, and train and test error for different approaches. The novelty of this work hinges on the use of modern statistical learning algorithms when treating missingness. In the empirical analysis, both HIV data and TB-HIV co-infection data imputations were performed, and the missing values were imputed using different approaches. In the simulation study, sets of 0% (complete case), 10%, 30%, 50% and 80% of the data were drawn randomly and replaced with missing values. Results show that complete cases only had a co-infection rate (95% confidence interval band) of 29% (25%, 33%), the weighted method 27% (23%, 31%), the likelihood-based approach 26% (24%, 28%), and the multiple imputation (MI) approach 21% (20%, 22%). In conclusion, MI remains the best approach for dealing with missing data, and failure to apply it results in overestimation of the HIV/TB co-infection rate by 8%.
Keywords: missing data; HIV/TB co-infection; imputation; missing at random; count data
Challenges Analyzing RNA-Seq Gene Expression Data
9
Authors: Liliana López-Kleine, Cristian González-Prieto. Open Journal of Statistics, 2016, No. 4, pp. 628-636 (9 pages)
The analysis of messenger ribonucleic acid data obtained through sequencing techniques (RNA-sequencing) is very challenging. Once technical difficulties have been sorted out, an important choice has to be made during pre-processing. Two different paths can be chosen: transform RNA-sequencing count data to a continuous variable, or continue to work with count data. For each data type, analysis tools have been developed and seem appropriate at first sight, but a deeper analysis of data distribution and structure is worth discussing. In this review, open questions regarding the nature of RNA-sequencing data are discussed and highlighted, indicating important future research topics in statistics that should be addressed for a better analysis of already available and newly appearing gene expression data. Moreover, a comparative analysis of RNA-seq count and transformed data is presented. This comparison indicates that transforming RNA-seq count data seems appropriate, at least for differential expression detection.
Keywords: RNA-Seq analysis; count data; preprocessing; differential expression; gene co-expression network
Modelling fertility:an application of count regression models
10
Authors: Ranjita Pandey, Charanjit Kaur. Chinese Journal of Population, Resources and Environment, 2015, No. 4, pp. 349-357 (9 pages)
Lifecycle data often occur as counts of vital events and are recorded as integers. The purpose of this article is to model fertility behavior based on religious, educational, economic, and occupational characteristics. The responses of groups classified according to these determinants are examined for significant influence on fertility using the Poisson regression model (PRM) based on the National Family Health Survey-3 dataset. The observed and predicted probabilities under the PRM indicate a modal value of two children for the Poisson-distribution-modeled data. The dominance of two children in the data motivates the authors to adopt the multinomial regression model (MRM) in order to link fertility with various socioeconomic indicators responsible for fertility variation. The choice of explanatory factors is limited by the availability of data. Trends and patterns of preference for birth counts suggest that religion, caste, wealth, female education, and occupation are the dominant factors shaping the observed birth process. Empirical analysis suggests that both models used in the study perform similarly on the sample data. However, fitting the MRM with a birth count of two as the comparison category shows improved Akaike information criterion and consistent Akaike information criterion values. The current work contributes to the existing literature as it attempts to provide more insight into the determinants of Indian fertility using the Poisson model and the MRM.
Keywords: count data; fertility; Poisson model; multinomial regression models
Determinants of Antenatal Health Care Utilization in Egypt (2000-2014) Using Binary and Count Outcomes
11
Authors: Hassan H. M. Zaky, Dina M. Armanious, Mohamed Ali Hussein. Health, 2019, No. 1, pp. 25-39 (15 pages)
Aim: This study seeks to investigate the factors determining the utilization of antenatal care services, the frequency of that use, and the timing of receiving antenatal care among Egyptian women, utilizing nationally representative data from the Egypt Demographic and Health Surveys (EDHS) in 2000 and 2014. Methods: The paper estimates the logistic regression model, the zero-inflated negative binomial model (ZINB), and the negative binomial regression model (NB) to identify the most important determinants of antenatal health care utilization. Results: The findings indicate that the period 2000-2014 experienced a significant increase in the use of antenatal health care services. The use of public sector antenatal care services relative to that of the private sector has been decreasing over time. Moreover, wealth index, women's education, and quality of health services play significant roles in increasing accessibility of antenatal health care services. On the other hand, women's empowerment showed a positive effect in 2000 only. Conclusion: The study highlights the most vulnerable groups that are less likely to have access to antenatal health care services, mainly women who are less educated, poor, and living in rural areas, especially Upper Egypt. This certainly requires a more targeted health strategy with an equity lens.
Keywords: antenatal health care services; binary and count data; negative binomial regression; determinants; Egypt
Comparative Assessment of Zero-Inflated Models with Application to HIV Exposed Infants Data
12
Authors: Faith Nekesa, Collins Odhiambo, Linda Chaba. Open Journal of Statistics, 2019, No. 6, pp. 664-685 (22 pages)
In a typical Kenyan HIV clinical setting, there is a likelihood of registering many zeros during the routine monthly data collection of new HIV infections among HIV exposed infants (HEI). This is attributed to the implementation of the prevention of mother-to-child transmission (PMTCT) policies. However, even though the PMTCT policy is implemented uniformly across all public health facilities, implementation naturally differs from facility to facility due to differences in health systems and infrastructure. This leads to structured zeros among reported positive HEI (where PMTCT implementation is optimum) and non-structured zeros among reported positive HEI (where PMTCT implementation is not optimum). Hence the classical zero-inflated and hurdle models that do not account for the abundance of structured and non-structured zeros in the data can give misleading results. The purpose of this study is to systematically compare the performance of the various zero-inflated models with an application to HIV exposed infants (HEI) in the context of structured and unstructured zeros. We revisit zero-inflated models, hurdle models, and Poisson and negative binomial count models, and conduct the simulations by varying the sample size and the level of zero abundance. Results from the simulation study and the real data analysis of exposed infant diagnosis show the negative binomial emerging as the best performing model when fitting data with both structured and non-structured zeros under various settings.
Keywords: zero-inflated models; HIV exposed infants; structured zeroes; mother-to-child transmission; count data
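The distinction between structured and non-structured zeros can be seen in the standard zero-inflated Poisson (ZIP) model, sketched below as an illustration; it is only one of the model families the paper compares. A zero arises either from the structural component, with probability `pi`, or from the Poisson count process with rate `lam`. The parameter values are arbitrary.

```python
import math


def zip_pmf(k, lam, pi):
    """P(Y = k) for a zero-inflated Poisson with rate lam and structural-zero probability pi."""
    poisson = math.exp(-lam) * lam ** k / math.factorial(k)
    if k == 0:
        # Structural zeros plus zeros produced by the Poisson process itself.
        return pi + (1.0 - pi) * poisson
    return (1.0 - pi) * poisson


def zip_mean(lam, pi):
    """Mean of the zero-inflated Poisson distribution."""
    return (1.0 - pi) * lam


# Arbitrary illustrative parameters: 30% structural zeros, Poisson rate 2.
print(round(zip_pmf(0, lam=2.0, pi=0.3), 4))
print(zip_mean(lam=2.0, pi=0.3))
```

Setting `pi` to 0 recovers the ordinary Poisson model, so the size of `pi` is what a zero-inflated fit uses to absorb the excess zeros.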
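A Study of Forest Tree Mortality Based on Generalized Linear Mixed-Effects Models
13
Authors: 闫明, 陈艳梅, +1 more author, 闫静, 奚为民. 《生态学报》 (CAS, CSCD, PKU Core), 2024, No. 6, pp. 2420-2436 (17 pages)
Using count-model methods and accounting for plot-level random effects, this study builds stand-level mortality models and explores the factors that influence tree mortality, with the aim of providing a reference for monitoring and managing forest resources. Plot data from the continuous forest inventory of East Texas, USA, served as the data source and were randomly sampled at a 4:1 ratio into training and validation sets. Site, stand, and climate factors were used as the independent variables and the number of dead trees as the dependent variable. Count models and mixed-effects models were used to build the models and to analyze the factors affecting the number of dead trees. Goodness of fit was compared using three criteria: the Akaike information criterion (AIC), the Bayesian information criterion (BIC), and -2 times the log-likelihood (-2logL). Predictive performance was assessed using the mean absolute error (MAE) and the root mean square error (RMSE), in order to select the best stand-level mortality model. The results show the following. Among site factors, the number of dead trees had a significant negative relationship with elevation (P<0.01) and a significant positive relationship with slope (P<0.05); that is, mortality decreased as elevation rose and increased as slope increased. Among stand factors, the number of dead trees had significant positive relationships with stand age (P<0.001) and tree basal area (P<0.001), and significant negative relationships with stand quadratic mean diameter (P<0.001) and stand density (P<0.05); that is, mortality increased with stand age and basal area and decreased as quadratic mean diameter and stand density increased. Among climate factors, the number of dead trees had significant negative relationships with SPEI (P<0.05), drought length (P<0.001), mean annual temperature (P<0.001), and mean summer rainfall (P<0.05), and a significant positive relationship with mean summer temperature (P<0.001); that is, mortality increased with drought intensity and mean summer temperature and decreased as drought length, mean annual temperature, and mean summer rainfall increased. Among the basic count models, the zero-inflated negative binomial (ZINB) model fitted best. After plot random effects were added, the fitting accuracy of the mixed-effects models improved clearly. Comparing the results of all models, the ZINB-mixed model was the best stand-level mortality model for the forests of East Texas.
Keywords: tree mortality; count model; mixed-effects model; influencing factors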
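Progress in Data Center Development
14
Authors: 陈焕新, 王宜卿, +5 more authors, 张丽, 樊超, 张忠斌, 张羽, 许陆顺, 荆华乾. 《制冷技术》, 2024, No. S01, pp. 2-19 (18 pages)
Using a literature survey, this paper examines the market size and development trends of China's data centers under strategies such as new infrastructure construction, and compares the development characteristics and policy orientations of data centers in China and abroad. It analyzes recent trends in key indicators of China's data centers, including electricity consumption, computing power scale, and number of racks. The results indicate that China's data center market will keep growing, with the growth rate remaining above 20%. Driven by the "East Data, West Computing" (东数西算) policy, data centers will develop toward efficient, large-scale, and green emerging data centers. Rapid advances in cooling technology will help data center energy efficiency keep improving, bringing power usage effectiveness down to about 1.1.
Keywords: data center; East Data, West Computing; low carbon; energy efficiency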
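Statistical Inference for Semiparametric Models of Panel Count Data with Propensity Score Weighting
15
Authors: 周稳, 李霓. 《应用概率统计》 (CSCD, PKU Core), 2024, No. 6, pp. 863-876 (14 pages)
In recent years, research on panel count data has attracted wide attention from statisticians. This paper considers the effect of a dependent observation process on the recurrent event process and establishes a semiparametric model with propensity score weighting. Introducing the propensity score into the model reduces the confounding bias that the dependent observation process induces in the recurrent event process. In particular, we estimate the parameters by combining inverse probability weighted estimating equations and prove the large-sample asymptotic properties of the estimators. Numerical simulations confirm the finite-sample properties and soundness of the estimators, and the model and method are applied to the analysis of skin cancer data.
Keywords: panel count data; propensity score; recurrent event process; semiparametric model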
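A Medical Data Collection Method Based on Local Differential Privacy
16
Authors: 王金鹏, 李晓会, 贾旭. 《计算机工程与设计》 (PKU Core), 2024, No. 10, pp. 2929-2935 (7 pages)
Existing medical data collection algorithms cannot effectively resist background-knowledge attacks or prevent privacy leakage through untrusted third parties. To address this, a medical data collection method based on local differential privacy is proposed. A two-stage data collection framework based on the Count-Min Sketch and the GRR algorithm is designed. Random sampling is used to avoid splitting the privacy budget, which reduces the communication cost and noise error of data collection. High-frequency and low-frequency symptoms are sampled, perturbed, collected, and counted separately, which reduces the error caused by hash collisions. Theoretical analysis shows that the algorithm satisfies local differential privacy. Experimental results show that the method outperforms the compared methods in frequency estimation accuracy, running time, and communication overhead.
Keywords: medical data collection; local differential privacy; sketch structure; hierarchical collection; untrusted third party; privacy protection; data utility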
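The sketch below illustrates only the GRR (generalized randomized response) building block named in the abstract, in its textbook form: each user reports the true value with probability p = e^ε / (e^ε + d − 1) and otherwise a uniformly chosen other value, and the collector debiases the observed frequencies. The Count-Min Sketch stage, the sampling scheme, and the high/low-frequency split are not reproduced. The function names and the symptom labels are hypothetical.

```python
import math
import random
from collections import Counter


def grr_perturb(value, domain, epsilon, rng):
    """Report the true value with probability p, otherwise a uniformly chosen other value."""
    d = len(domain)
    p = math.exp(epsilon) / (math.exp(epsilon) + d - 1)
    if rng.random() < p:
        return value
    return rng.choice([v for v in domain if v != value])


def grr_estimate(reports, domain, epsilon):
    """Unbiased frequency estimates recovered from the perturbed reports."""
    d = len(domain)
    n = len(reports)
    e = math.exp(epsilon)
    p = e / (e + d - 1)    # probability of keeping the true value
    q = 1.0 / (e + d - 1)  # probability of reporting any one specific other value
    counts = Counter(reports)
    return {v: (counts[v] / n - q) / (p - q) for v in domain}


# Hypothetical symptom domain and a seeded generator for reproducibility.
domain = ["cough", "fever", "rash", "nausea"]
rng = random.Random(0)
truth = ["cough"] * 600 + ["fever"] * 300 + ["rash"] * 100
reports = [grr_perturb(v, domain, epsilon=1.0, rng=rng) for v in truth]
print(grr_estimate(reports, domain, epsilon=1.0))
```

Individual estimates can be negative or exceed 1 at small ε; they are unbiased rather than constrained, so post-processing is often applied in practice.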
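Research on a Computer-Vision-Based Method for Counting Soybean and Maize Seeds
17
Authors: 张洁, 杨诚阳, +3 more authors, 邹佳琪, 鲁兆宏, 谭先明, 杨峰. 《四川农业大学学报》 (CSCD, PKU Core), 2024, No. 5, pp. 1021-1027, 1048 (8 pages)
[Objective] Seed weight is one of the important components of crop yield, but the traditional calculation of hundred-seed or thousand-seed weight is time-consuming and laborious, so a method for rapidly counting crop seeds and calculating their weight is urgently needed. [Methods] With soybean and maize as the study objects, the dataset was first augmented with the Albumentations library to address the complex counting environment, small targets, and high density. The five YOLOv8 sub-models were then compared and the best-performing one, YOLOv8n, was selected; on this basis the CIOU loss function was replaced with Focal-IOU to obtain the improved model. Finally, the improved model was compared with several classic object detection models. [Results] The improved model reached an average precision (mAP50-95) of 88.78% and 86.89% for soybean and maize seed counting, respectively, which is 1.29% and 0.51% higher than the original model, and it performed significantly better than object detection models such as YOLOv5 and SSD. In addition, the mean absolute percentage error (MAPE) of the improved model on the test sets of the two crops was 0.035% and 0.045%, and its frame rate reached 70.17 and 100.41 frames per second, respectively. [Conclusion] The counts produced by the improved model for soybean and maize seeds do not differ significantly from the actual numbers, and its real-time processing is fast. The results can meet the seed-counting needs of hundred-seed and thousand-seed weight calculation in seed testing.
Keywords: soybean; maize; seed counting; YOLOv8; Focal-IOU; data augmentation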
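A Real-Time Denoising Method for Spaceborne Photon-Counting Laser Ranging Lidar
18
Authors: 谭崇涛, 于文博, +4 more authors, 向雨琰, 李少辉, 余婧, 王倩莹, 李松. 《红外与毫米波学报》 (SCIE, EI, CAS, CSCD, PKU Core), 2024, No. 2, pp. 242-253 (12 pages)
Spaceborne photon-counting laser ranging lidar systems offer notable advantages such as high repetition rate and high accuracy, but they also produce large volumes of raw data in which noise makes up too high a proportion. To fit the transmission capacity of the on-board data channel, the raw data volume must be compressed while the recall of signal photons is preserved, so a hardware-centered real-time denoising algorithm is needed. This paper proposes a fast denoising algorithm that combines coarse and fine stages. Coarse denoising first removes part of the noise photons based on the emitted pulse width of the laser, the system noise rate, the target characteristics, and the local density of received photon events. Fine denoising then applies histogram statistics to the retained photon events to determine the signal photon interval and the final signal photons with their time information. The algorithm was validated with Monte Carlo simulation and measured ICESat-2 data. The tests show a recall above 94%, a precision above 93%, a harmonic mean above 94%, and a 10% improvement in running efficiency. The algorithm can denoise photon events rapidly in real time and provides a theoretical basis for real-time denoising in on-board hardware.
Keywords: photon counting; laser ranging; coarse-fine denoising; data density; histogram statistics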
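For readers unfamiliar with the three scores quoted in the abstract, the sketch below computes recall, precision, and their harmonic mean for a set of photon events labeled as signal. This is the standard definition and is assumed, not confirmed, to match the paper's. The function name and event IDs are hypothetical.

```python
def denoising_scores(true_signal, selected):
    """Return (precision, recall, harmonic_mean) for the photons kept as signal."""
    true_signal, selected = set(true_signal), set(selected)
    true_positives = len(true_signal & selected)
    # Precision: share of kept photons that are truly signal.
    precision = true_positives / len(selected) if selected else 0.0
    # Recall: share of true signal photons that were kept.
    recall = true_positives / len(true_signal) if true_signal else 0.0
    total = precision + recall
    harmonic_mean = 2 * precision * recall / total if total else 0.0
    return precision, recall, harmonic_mean


# Hypothetical photon event IDs: four true signal photons, four kept by a denoiser.
print(denoising_scores(true_signal={1, 2, 3, 4}, selected={2, 3, 4, 5}))
```

The harmonic mean penalizes an imbalance between the two scores, which is why it is reported alongside recall and precision.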
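A Distributed Temporal Index for Temporal Aggregation Range Queries
19
Authors: 孟繁珺, 韩斌, +1 more author, 黄树成, 梅向东. 《计算机应用》 (CSCD, PKU Core), 2024, No. 6, pp. 1848-1854 (7 pages)
In the era of big data and cloud computing, querying and analyzing temporal big data faces many important challenges. To address the poor performance of temporal aggregation range queries and their inability to use indexes effectively, a Distributed Temporal Index (DTI) for temporal aggregation range queries is proposed. First, the temporal data are partitioned using a random or round-robin strategy. Second, an intra-partition index construction algorithm based on time bit-array prefixes builds the index while recording partition statistics, including the time span. Third, predicate pushdown selects the data partitions whose time span overlaps the query time interval, and the index is scanned for pre-aggregation. Finally, the pre-aggregated values from the partitions are merged by time and aggregated. Experimental results show that the intra-partition construction algorithm takes similar execution time for data with a time density of 2400 records per unit time and for data with 0.001 records per unit time. Compared with the ParTime algorithm, the aggregation query algorithm of the index reduces the time of every step by at least 22% when querying the first 75% of the timeline, and by at least 11% when executing selective aggregate functions. The index is therefore faster in most temporal aggregation range query tasks, and its intra-partition construction algorithm handles sparse data and executes efficiently.
Keywords: temporal index; temporal data; distributed; temporal aggregation; counting sort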
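Analysis of the Mathematical Algorithm for Measuring the Water Cut and Gas Fraction of Crude Oil Using γ-Rays (cited by: 15)
20
Authors: 白秋果, 景春国, 舒冬梅. 《核电子学与探测技术》 (CAS, CSCD, PKU Core), 2002, No. 3, pp. 225-227 (3 pages)
This paper analyzes in detail the mathematical algorithm for measuring the water cut and gas fraction of crude oil with γ-rays, and how to reduce the influence of the escape effect on system accuracy. The method has been applied successfully in an automatic online metering system for oil-water-gas three-phase media in oilfield production.
Keywords: measurement; crude oil; water cut; gas fraction; mathematical algorithm; γ-ray; transmission counting; data fitting; escape effect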