摘要
针对现有四种装配工艺指令相似度评估方法(即词重叠、Jaccard、TF-IDF和潜在语义分析LSA)缺乏系统性量化评估的问题,提出采用十个工作指令集的45个基于文本的装配指令作为案例对现有四种文本比较方法进行分析。统计假设测试表明,Jaccard方法模拟装配工作指令的相似度高于其他三种方法至少3.75%,且对同义或多义词不敏感;而LSA方法对同义词或多义词的敏感度总体上低于其他三种方法至少6.02%,且可用于检索自由文本的装备工作指令,避免了TF-IDF方法依赖于装配工作指令数据库的弊端。故而LSA方法更适合用于装配工作指令的评估。
Considering the lack of systematic and quantitative evaluation of four existing similarity evaluation methods(i.e. word overlapping, Jaccard, TF-IDF and latent semantic analysis(LSA)) for assembly process instructions, a case study of 45 text-based assembly instructions from 10 work instruction sets is presented to analyze four existing text comparison methods. Statistical hypothesis tests show that the similarity of simulation assembly work instructions of Jaccard method is at least 3.75% higher than that of the other three methods,and the Jaccard method is insensitive to synonyms or polysemous words. In addition, the sensitivity of LSA to synonyms or polysemous words is at least 6.02% lower than that of the other three methods. Meanwhile, LSA method can also be used to retrieve free-text equipment work instructions, avoiding the disadvantage of TF-IDF method depending on assembly work instruction database. Therefore, LSA method is more suitable for the evaluation of assembly work instructions.
作者
王云飞
赵霞
屈美霞
张占荣
赵丽
WANG Yun-fei;ZHAO Xia;QU Mei-xia;ZHANG Zhan-rong;ZHAO Li(Ordos Vocational College of Eco environment,Ordos 017010,China;School of Software,Shanxi University,Taiyuan 030013,China)
出处
《控制工程》
CSCD
北大核心
2021年第3期592-599,共8页
Control Engineering of China
关键词
制造业
装配工艺
跨生产线
装配工作指令
文本比较
相似度计算
Manufacturing industry
assembly process
cross production line
assembly work instruction
text comparison
similarity computation