离群点检测任务是指检测与正常数据在特征属性上存在显著差异的异常数据。大多数基于聚类的离群点检测方法主要从全局角度对数据集中的离群点进行检测,而对局部离群点的检测性能较弱。基于此,本文通过引入快速搜索和发现密度峰值方法改...离群点检测任务是指检测与正常数据在特征属性上存在显著差异的异常数据。大多数基于聚类的离群点检测方法主要从全局角度对数据集中的离群点进行检测,而对局部离群点的检测性能较弱。基于此,本文通过引入快速搜索和发现密度峰值方法改进K-means聚类算法,提出了一种名为KLOD(local outlier detection based on improved K-means and least-squares methods)的局部离群点检测方法,以实现对局部离群点的精确检测。首先,利用快速搜索和发现密度峰值方法计算数据点的局部密度和相对距离,并将二者相乘得到γ值。其次,将γ值降序排序,利用肘部法则选择γ值最大的k个数据点作为K-means聚类算法的初始聚类中心。然后,通过K-means聚类算法将数据集聚类成k个簇,计算数据点在每个维度上的目标函数值并进行升序排列。接着,确定数据点的每个维度的离散程度并选择适当的拟合函数和拟合点,通过最小二乘法对升序排列的每个簇的每1维目标函数值进行函数拟合并求导,以获取变化率。最后,结合信息熵,将每个数据点的每个维度目标函数值乘以相应的变化率进行加权,得到最终的异常得分,并将异常值得分较高的top-n个数据点视为离群点。通过人工数据集和UCI数据集,对KLOD、LOF和KNN方法在准确度上进行仿真实验对比。结果表明KLOD方法相较于KNN和LOF方法具有更高的准确度。本文提出的KLOD方法能够有效改善K-means聚类算法的聚类效果,并且在局部离群点检测方面具有较好的精度和性能。展开更多
The mean shift tracker has difficulty in tracking fast moving targets and suffers from tracking error accumulation problem. To overcome the limitations of the mean shift method, a new approach is proposed by integrati...The mean shift tracker has difficulty in tracking fast moving targets and suffers from tracking error accumulation problem. To overcome the limitations of the mean shift method, a new approach is proposed by integrating the mean shift algorithm and frame-difference methods. The rough position of the moving tar- get is first located by the direct frame-difference algorithm and three-frame-difference algorithm for the immobile camera scenes and mobile camera scenes, respectively. Then, the mean shift algorithm is used to achieve precise tracking of the target. Several tracking experiments show that the proposed method can effectively track first moving targets and overcome the tracking error accumulation problem.展开更多
In this article,we consider a new family of exponential type estimators for estimating the unknown population mean of the study variable.We propose estimators taking advantage of the auxiliary variable information und...In this article,we consider a new family of exponential type estimators for estimating the unknown population mean of the study variable.We propose estimators taking advantage of the auxiliary variable information under the first and second non-response cases separately.The required theoretical comparisons are obtained and the numerical studies are conducted.In conclusion,the results show that the proposed family of estimators is the most efficient estimator with respect to the estimators in literature under the obtained conditions for both cases.展开更多
Significant wave height is an important criterion in designing coastal and offshore structures.Based on the orthogonality principle, the linear mean square estimation method is applied to calculate significant wave he...Significant wave height is an important criterion in designing coastal and offshore structures.Based on the orthogonality principle, the linear mean square estimation method is applied to calculate significant wave height in this paper.Twenty-eight-year time series of wave data collected from three ocean buoys near San Francisco along the California coast are analyzed.It is proved theoretically that the computation error will be reduced by using as many measured data as possible for the calculation of significant wave height.Measured significant wave height at one buoy location is compared with the calculated value based on the data from two other adjacent buoys.The results indicate that the linear mean square estimation method can be well applied to the calculation and prediction of significant wave height in coastal regions.展开更多
文摘离群点检测任务是指检测与正常数据在特征属性上存在显著差异的异常数据。大多数基于聚类的离群点检测方法主要从全局角度对数据集中的离群点进行检测,而对局部离群点的检测性能较弱。基于此,本文通过引入快速搜索和发现密度峰值方法改进K-means聚类算法,提出了一种名为KLOD(local outlier detection based on improved K-means and least-squares methods)的局部离群点检测方法,以实现对局部离群点的精确检测。首先,利用快速搜索和发现密度峰值方法计算数据点的局部密度和相对距离,并将二者相乘得到γ值。其次,将γ值降序排序,利用肘部法则选择γ值最大的k个数据点作为K-means聚类算法的初始聚类中心。然后,通过K-means聚类算法将数据集聚类成k个簇,计算数据点在每个维度上的目标函数值并进行升序排列。接着,确定数据点的每个维度的离散程度并选择适当的拟合函数和拟合点,通过最小二乘法对升序排列的每个簇的每1维目标函数值进行函数拟合并求导,以获取变化率。最后,结合信息熵,将每个数据点的每个维度目标函数值乘以相应的变化率进行加权,得到最终的异常得分,并将异常值得分较高的top-n个数据点视为离群点。通过人工数据集和UCI数据集,对KLOD、LOF和KNN方法在准确度上进行仿真实验对比。结果表明KLOD方法相较于KNN和LOF方法具有更高的准确度。本文提出的KLOD方法能够有效改善K-means聚类算法的聚类效果,并且在局部离群点检测方面具有较好的精度和性能。
基金supported by the Fundamental Research Funds for the Central Universities Project(CDJZR10170010)
文摘The mean shift tracker has difficulty in tracking fast moving targets and suffers from tracking error accumulation problem. To overcome the limitations of the mean shift method, a new approach is proposed by integrating the mean shift algorithm and frame-difference methods. The rough position of the moving tar- get is first located by the direct frame-difference algorithm and three-frame-difference algorithm for the immobile camera scenes and mobile camera scenes, respectively. Then, the mean shift algorithm is used to achieve precise tracking of the target. Several tracking experiments show that the proposed method can effectively track first moving targets and overcome the tracking error accumulation problem.
文摘In this article,we consider a new family of exponential type estimators for estimating the unknown population mean of the study variable.We propose estimators taking advantage of the auxiliary variable information under the first and second non-response cases separately.The required theoretical comparisons are obtained and the numerical studies are conducted.In conclusion,the results show that the proposed family of estimators is the most efficient estimator with respect to the estimators in literature under the obtained conditions for both cases.
基金support for this study was provided by the National Natural Science Foundation of China (No.40776006)Research Fund for the Doctoral Program of Higher Education of China (Grant No.20060423009)the Science and Technology Development Program of Shandong Province (Grant No.2008GGB01099)
文摘Significant wave height is an important criterion in designing coastal and offshore structures.Based on the orthogonality principle, the linear mean square estimation method is applied to calculate significant wave height in this paper.Twenty-eight-year time series of wave data collected from three ocean buoys near San Francisco along the California coast are analyzed.It is proved theoretically that the computation error will be reduced by using as many measured data as possible for the calculation of significant wave height.Measured significant wave height at one buoy location is compared with the calculated value based on the data from two other adjacent buoys.The results indicate that the linear mean square estimation method can be well applied to the calculation and prediction of significant wave height in coastal regions.