期刊文献+

基于正则化与时空约束改进K最近邻算法的农业物联网数据重构 被引量:5

Agricultural internet of things data reconstruction based on K-nearest neighbor reconstruction algorithm improved by regularization penalty and spatio-temporal constraints
下载PDF
导出
摘要 针对农业复杂环境易发的物联网感知数据丢失异常问题,该文提出一种基于正则化惩罚的K最近邻数据重构方法(K nearest neighbor-regularization penalty,KNN-RP),采用岭回归方法对最近邻方法中的最小二乘因子进行正则化,并讨论了惩罚项的范数选取形式。通过对农业物联网感知数据的时空稳定性与相关性分析,确定了时间与空间约束矩阵的定义方式。采用温室数据样本对算法性能进行交叉验证,结果显示该文的KNN-RP性能在点丢失模型下优于KNN、反距离加权KNN算法以及DT算法,而在块丢失模型下优于KNN和反距离加权KNN算法,略低于DT算法,提高了农业物联网的感知数据质量。该研究可为基于物联网数据的农业生产决策提供参考。 The internet of things(IoT)technology has been widely applied in the agriculture production monitoring.Accurate decision-making and environment regulation can be made based on monitoring results.However,data loss in agriculture wireless sensor networks is common due to noise,collision,unreliable link,and unexpected damage,which greatly reduces the quality of data acquisition and then affects the results of decision analysis.In order to solve this problem,this paper proposed a data reconstruction method based on K nearest neighbor with regularization penalty constraints(KNN-RP).Firstly,the ridge regression method was used in order to regularize the least square factor.Secondly,there was a problem that it is difficult to get a unique solution due to the algorithmic error while the data matrix is not full-column rank.This could be improved by introducing a penalty term into the method.The combination of 1-norm and 2-norm could ensure the sparsity of the matrix as well as prevent the loss function from over-fitting.It is suitable for high-dimensional agricultural WSN(wireless sensor network)data reconstruction with high noise.Furthermore,the definition of time and space constraint matrix was determined according to the temporal and spatial stability of perceptual data in agricultural IoT.Finally,the K value was determined by model training to achieve the better reconstruction performance.A cross-validate experiment was done to evaluate the algorithm performance according to the greenhouse data samples.KNN(K nearest neighbor),KNN-inverse and DT(delaunay triangulation)algorithms were chosen for the performance comparison.In the element random loss case,the overall reconstruction error rate of the 4 algorithms increased with the increasing of data loss rate.The KNN and KNN-inverse had higher error rate when the data loss rate above 60%compared with the other 2 algorithms.Besides,the performance of KNN-RP was superior to the DT algorithm in both high and low data loss rates.In the block loss case,the reconstruction error rates of the 4 algorithms were close to the element random loss case,but reconstruction error rates increased faster than the element random loss case while the data loss rate increased.In the block loss case,the overall performance of KNN-RP was better than KNN and KNN-inverse,but lower than that of DT algorithm when the data loss rate was above 60%.The K value had a significant influence on the performance of KNN-RP.The reconstruction error of KNN-RP decreased first and then increased with the increasing of K value.For the stable parameter like temperature,the reconstruction error rate was less affected by K value.On the contrast,the reconstruction error rates of humidity and lightness data were more affected by K value.The reason maybe the humidity and lightness data changed faster than temperature.Considering all 3 parameters,temperature,humidity and lightness,the optimal K value was between 6 and 8.In summary,KNN-RP algorithm could effectively reconstruct the missing errors in the agricultural IoT,especially in element random loss case.The proposed algorithm improves the quality of perceptual data in agricultural IoT monitoring and may provide reference for agricultural production decision-making.
作者 吴华瑞 李庆学 缪祎晟 宋玉玲 Wu Huarui;Li Qingxue;Miao Yisheng;Song Yuling(National Engineering Research Center for Information Technology in Agriculture,Beijing 100097,China;Beijing Research Center for Information Technology in Agriculture,Beijing Academy of Agriculture and Forestry Sciences,Beijing 100097,China;Key Laboratory of Agricultural Internet of Things,Ministry of Agriculture and Rural Affairs,Yangling 712100,China)
出处 《农业工程学报》 EI CAS CSCD 北大核心 2019年第14期183-189,共7页 Transactions of the Chinese Society of Agricultural Engineering
基金 国家自然科学基金项目(61871041,61571051) 北京市自然科学基金项目(4172024,4172026) 农业农村部农业物联网重点实验室开放课题(2018AIOT-06)
关键词 算法 模型 农业物联网 数据重构 聚类回归 algorithms models agricultural internet of things data reconfiguration cluster regression
  • 相关文献

参考文献5

二级参考文献61

  • 1高峰,俞立,张文安,徐青香,于莉洁.基于无线传感器网络的作物水分状况监测系统研究与设计[J].农业工程学报,2009,25(2):107-112. 被引量:108
  • 2蔡振江,康健一,张青,徐宏.数据融合技术在温室温度检测中的应用[J].农业机械学报,2006,37(10):101-103. 被引量:22
  • 3Cullar D, Estrin D, Strvastava M. Overview of sensor networks. IEEE Computer, 2004, 37(8): 41-49.
  • 4Madden S, Franklin M J, Hellerstein J M, Hong W. The design of an acquisitional query processor for sensor networks//Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data. San Diego, California, 2003: 491-502.
  • 5Manihi A, Nath S, Gibbons P B. Tributaries and deltas: Efficient and robust aggregation in sensor network streams// Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data. Baltimore, Maryland, 2005: 287-298.
  • 6Silberstein A, Munagala K, Yang J. Energy-efficient monitoring of extreme values in sensor networks//Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data. Chicago, Illinois, 2006:169-180.
  • 7Considine J, Li F, Kollios G, Byers J. Approximate aggregation techniques for sensor databases//Proceedings of the 20th International Conference on Data Engineering. Boston, MA, 2004:449-460.
  • 8Deshpande A, Guestrin C, Madden S, Hellerstein J M, Hong W. Model-driven data acquisition in sensor networks// Proceedings of the 30th International Conference on Very Large Data Bases. Toronto, Canada, 2004:588- 599.
  • 9Deshpande A, Guestrin C, Hong W, Madden S. Exploiting correlated attributes in acquisitional query processing//Proceedings of the 21st International Conference on Data Engineering. Tokyo, Japan, 2005: 143-154.
  • 10Chu D, Deshpand A, Hellerstein J M, Hong W. Approximate data collection in sensor networks using probabilistic models//Proceedings of the 22nd International Conference on Data Engineering. Atlanta, 2006:48.

共引文献72

同被引文献53

引证文献5

二级引证文献28

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部