Purpose-In order to improve the accuracy of project cost prediction,considering the limitations of existing models,the construction cost prediction model based on SVM(Standard Support Vector Machine)and LSSVM(Least Sq...Purpose-In order to improve the accuracy of project cost prediction,considering the limitations of existing models,the construction cost prediction model based on SVM(Standard Support Vector Machine)and LSSVM(Least Squares Support Vector Machine)is put forward.Design/methodology/approach-In the competitive growth and industries 4.0,the prediction in the cost plays a key role.Findings-At the same time,the original data is dimensionality reduced.The processed data are imported into the SVM and LSSVM models for training and prediction respectively,and the prediction results are compared and analyzed and a more reasonable prediction model is selected.Originality/value-The prediction result is further optimized by parameter optimization.The relative error of the prediction model is within 7%,and the prediction accuracy is high and the result is stable.展开更多
近年来,使用恶意Excel 4.0宏(XLM)文档的攻击迎来了爆发,而XLM代码往往经过复杂的混淆,现有方法或检测系统难以分析海量样本的真实功能。因此,针对恶意样本中使用的各类混淆技术,基于抽象语法树和模拟执行,设计和实现了包含138个宏函数...近年来,使用恶意Excel 4.0宏(XLM)文档的攻击迎来了爆发,而XLM代码往往经过复杂的混淆,现有方法或检测系统难以分析海量样本的真实功能。因此,针对恶意样本中使用的各类混淆技术,基于抽象语法树和模拟执行,设计和实现了包含138个宏函数处理程序的自动化XLM反混淆与关键威胁指标(IOC,indicators of compromise)提取系统XLMRevealer;在此基础上,根据XLM代码特点提取Word和Token特征,通过特征融合能够捕获多层次细粒度特征,并在XLMRevealer中构造CNN-BiLSTM(convolution neural network-bidirectional long short term memory)模型,从不同维度挖掘家族样本的关联性和完成家族分类。最后,从5个来源构建包含2346个样本的数据集并用于反混淆实验和家族分类实验。实验结果表明,XLMRevealer的反混淆成功率达到71.3%,相比XLMMacroDeobfuscator和SYMBEXCEL工具分别提高了20.8%和15.8%;反混淆效率稳定,平均耗时仅为0.512 s。XLMRevealer对去混淆XLM代码的家族分类准确率高达94.88%,效果优于所有基线模型,有效体现Word和Token特征融合的优势。此外,为探索反混淆对家族分类的影响,并考虑不同家族使用的混淆技术可能有所不同,模型会识别到混淆技术的特征,分别对反混淆前和反混淆后再统一混淆的XLM代码进行实验,家族分类准确率为89.58%、53.61%,证明模型能够学习混淆技术特征,更验证了反混淆对家族分类极大的促进作用。展开更多
文摘Purpose-In order to improve the accuracy of project cost prediction,considering the limitations of existing models,the construction cost prediction model based on SVM(Standard Support Vector Machine)and LSSVM(Least Squares Support Vector Machine)is put forward.Design/methodology/approach-In the competitive growth and industries 4.0,the prediction in the cost plays a key role.Findings-At the same time,the original data is dimensionality reduced.The processed data are imported into the SVM and LSSVM models for training and prediction respectively,and the prediction results are compared and analyzed and a more reasonable prediction model is selected.Originality/value-The prediction result is further optimized by parameter optimization.The relative error of the prediction model is within 7%,and the prediction accuracy is high and the result is stable.
文摘近年来,使用恶意Excel 4.0宏(XLM)文档的攻击迎来了爆发,而XLM代码往往经过复杂的混淆,现有方法或检测系统难以分析海量样本的真实功能。因此,针对恶意样本中使用的各类混淆技术,基于抽象语法树和模拟执行,设计和实现了包含138个宏函数处理程序的自动化XLM反混淆与关键威胁指标(IOC,indicators of compromise)提取系统XLMRevealer;在此基础上,根据XLM代码特点提取Word和Token特征,通过特征融合能够捕获多层次细粒度特征,并在XLMRevealer中构造CNN-BiLSTM(convolution neural network-bidirectional long short term memory)模型,从不同维度挖掘家族样本的关联性和完成家族分类。最后,从5个来源构建包含2346个样本的数据集并用于反混淆实验和家族分类实验。实验结果表明,XLMRevealer的反混淆成功率达到71.3%,相比XLMMacroDeobfuscator和SYMBEXCEL工具分别提高了20.8%和15.8%;反混淆效率稳定,平均耗时仅为0.512 s。XLMRevealer对去混淆XLM代码的家族分类准确率高达94.88%,效果优于所有基线模型,有效体现Word和Token特征融合的优势。此外,为探索反混淆对家族分类的影响,并考虑不同家族使用的混淆技术可能有所不同,模型会识别到混淆技术的特征,分别对反混淆前和反混淆后再统一混淆的XLM代码进行实验,家族分类准确率为89.58%、53.61%,证明模型能够学习混淆技术特征,更验证了反混淆对家族分类极大的促进作用。