Field-road segmentation is one of the key tasks in the processing of the trajectory of agricultural machinery.To improve the accuracy of the field-road segmentation,this study proposed an XGBoost model based on dual f...Field-road segmentation is one of the key tasks in the processing of the trajectory of agricultural machinery.To improve the accuracy of the field-road segmentation,this study proposed an XGBoost model based on dual feature extraction and recursive feature elimination called DR-XGBoost.DR-XGBoost takes only a small amount of agricultural machine trajectory features as input.Firstly,the model adopted the dual feature extraction method we designed to rapidly expand the number of features and then adequately extract local trajectory features by the time window and feature extraction operator.Secondly,the model applies the recursive feature elimination algorithm to eliminate redundant features from the perspective of the model segmentation effect and thus reduce the computational consumption of model training.Thirdly,it trains XGBoost to complete the trajectory segmentation.To evaluate the effectiveness of DR-XGBoost,we conducted a series of experiments on a real trajectory dataset of agricultural machines.The model achieves a 98.2%Macro-F1 score on the dataset,which is 10.9%higher than the previous state-of-art.The proposal of DR-XGBoost fills the knowledge gap of trajectory feature extraction for agricultural machinery and provides a reasonable and effective feature selection scheme for the field-road segmentation problem.展开更多
The surging accumulation of trajectory data has yielded invaluable insights into urban systems,but it has also presented challenges for data storage and management systems.In response,specialized storage systems based...The surging accumulation of trajectory data has yielded invaluable insights into urban systems,but it has also presented challenges for data storage and management systems.In response,specialized storage systems based on non-relational databases have been developed to support large data quantities in distributed approaches.However,these systems often utilize storage by point or storage by trajectory methods,both of which have drawbacks.In this study,we evaluate the effectiveness of segmented trajectory data storage with HBase optimizations for spatio-temporal queries.We develop a prototype system that includes trajectory segmentation,serialization,and spatio-temporal indexing and apply it to taxi trajectory data in Beijing.Ourfindings indicate that the segmented system provides enhanced query speed and reduced memory usage compared to the Geomesa system.展开更多
In recent years, the increasing development of traffic information collection technology based on floating car data has been recognized, which gives rise to the establishment of real-time traffic information dissemina...In recent years, the increasing development of traffic information collection technology based on floating car data has been recognized, which gives rise to the establishment of real-time traffic information dissemina- tion system in many cities. However, the recent massive construction of urban elevated roads hinders the processing of corresponding GPS data and further extraction of traffic information (e.g., identifying the real travel path), as a result of the frequent transfer of vehicles between ground and elevated road travel. Consequently, an algorithm for identifying the travel road type (i.e., elevated or ground road) of vehicles is designed based on the vehicle traveling features, geometric and topological characteristics of the elevated road network, and a trajectory model proposed in the present study. To be specific, the proposed algorithm can detect the places where a vehicle enters, leaves or crosses under elevated roads. An experiment of 10 sample taxis in Shanghai, China was conducted, and the comparison of our results and results that are obtained from visual interpretation validates the proposed algo- rithm.展开更多
基金This work was financially supported by the National Key Research and Development Program of China(Grant No.2021YFB3901300)the National Precision Agriculture Application Project(Grant No.JZNYYY001)National Innovation Training Project for University in China(Grant No.202310019034).
文摘Field-road segmentation is one of the key tasks in the processing of the trajectory of agricultural machinery.To improve the accuracy of the field-road segmentation,this study proposed an XGBoost model based on dual feature extraction and recursive feature elimination called DR-XGBoost.DR-XGBoost takes only a small amount of agricultural machine trajectory features as input.Firstly,the model adopted the dual feature extraction method we designed to rapidly expand the number of features and then adequately extract local trajectory features by the time window and feature extraction operator.Secondly,the model applies the recursive feature elimination algorithm to eliminate redundant features from the perspective of the model segmentation effect and thus reduce the computational consumption of model training.Thirdly,it trains XGBoost to complete the trajectory segmentation.To evaluate the effectiveness of DR-XGBoost,we conducted a series of experiments on a real trajectory dataset of agricultural machines.The model achieves a 98.2%Macro-F1 score on the dataset,which is 10.9%higher than the previous state-of-art.The proposal of DR-XGBoost fills the knowledge gap of trajectory feature extraction for agricultural machinery and provides a reasonable and effective feature selection scheme for the field-road segmentation problem.
基金support from the National Natural Science Foundation of China(42271471,42201454,41830645)the International Research Center of Big Data for Sustainable Development Goals(CBAS2022GSP06).
文摘The surging accumulation of trajectory data has yielded invaluable insights into urban systems,but it has also presented challenges for data storage and management systems.In response,specialized storage systems based on non-relational databases have been developed to support large data quantities in distributed approaches.However,these systems often utilize storage by point or storage by trajectory methods,both of which have drawbacks.In this study,we evaluate the effectiveness of segmented trajectory data storage with HBase optimizations for spatio-temporal queries.We develop a prototype system that includes trajectory segmentation,serialization,and spatio-temporal indexing and apply it to taxi trajectory data in Beijing.Ourfindings indicate that the segmented system provides enhanced query speed and reduced memory usage compared to the Geomesa system.
文摘In recent years, the increasing development of traffic information collection technology based on floating car data has been recognized, which gives rise to the establishment of real-time traffic information dissemina- tion system in many cities. However, the recent massive construction of urban elevated roads hinders the processing of corresponding GPS data and further extraction of traffic information (e.g., identifying the real travel path), as a result of the frequent transfer of vehicles between ground and elevated road travel. Consequently, an algorithm for identifying the travel road type (i.e., elevated or ground road) of vehicles is designed based on the vehicle traveling features, geometric and topological characteristics of the elevated road network, and a trajectory model proposed in the present study. To be specific, the proposed algorithm can detect the places where a vehicle enters, leaves or crosses under elevated roads. An experiment of 10 sample taxis in Shanghai, China was conducted, and the comparison of our results and results that are obtained from visual interpretation validates the proposed algo- rithm.