摘要
Having the ability to forecast cyberattacks before they happen will unquestionably change the landscape of cyber warfare and cyber crime.This work predicts specific types of attacks on a potential victim network before the actual malicious actions take place.The challenge to forecasting cyberattacks is to extract relevant and reliable signals to treat sporadic and seemingly random acts of adversaries.This paper builds on multi-faceted machine learning solutions and develops an integrated system to transform large volumes of public data to aggregate signals with imputation that are relevant and predictive of cyber incidents.A comprehensive analysis of the individual parts and the integrated whole demonstrates the effectiveness and trade-offs of the proposed approach.Using 16-months of reported cyber incidents by an anonymized victim organization,the integrated approach achieves up to 87%,90%,and 96% AUC for forecasting endpoint-malware,malicious-destination,and malicious-email attacks,respectively.When assessed month-by-month,the proposed approach shows robustness to perform consistently well,achieving F-Measure between 0.6 and 1.0.The framework also enables an examination of which unconventional signals are meaningful for cyberattack forecasting.
基金
Intelligence Advanced Research Projects Activity(IARPA)with contract number FA875016C0114.