摘要
数据清洗技术及分词技术的应用对于挖掘海量交易信息潜在的数据价值至关重要,对不同类型交易“脏数据”按不同策略进行数据预处理,同时通过对LTP及Jieba分词技术在交易领域的应用研究,在提高交易关键信息的识别与处理效率及最终数据质量的同时,探索对信息搜索准确率及查全率的提升作用。
The application of data cleaning technology and word segmentation technology is crucial to mining the potential data value of massive bids information. Data preprocessing is carried out for “dirty data” of different types of bids according to different strategies. At the same time, through the research on the application of LTP and Jieba word segmentation technology in the bids field, while improving the identification and processing efficiency of key bids information and the final data quality, the role of improving the accuracy and recall of information search is explored.
出处
《软件工程与应用》
2022年第6期1415-1422,共8页
Software Engineering and Applications