A semantics-based approach to malware detection

被引:39
作者
Preda, Mila Dalla [1 ]
Christodorescu, Mihai [2 ]
Jha, Somesh [2 ]
Debray, Saumya [3 ]
机构
[1] Univ Verona, I-37100 Verona, Italy
[2] Univ Wisconsin, Madison, WI 53706 USA
[3] Univ Arizona, Tucson, AZ USA
来源
ACM TRANSACTIONS ON PROGRAMMING LANGUAGES AND SYSTEMS | 2008年 / 30卷 / 05期
基金
美国国家科学基金会;
关键词
security; languages; theory; verification; malware detection; obfuscation; trace semantics; abstract interpretation;
D O I
10.1145/1387673.1387674
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Malware detection is a crucial aspect of software security. Current malware detectors work by checking for signatures, which attempt to capture the syntactic characteristics of the machine-level byte sequence of the malware. This reliance on a syntactic approach makes current detectors vulnerable to code obfuscations, increasingly used by malware writers, that alter the syntactic properties of the malware byte sequence without significantly affecting their execution behavior. This paper takes the position that the key to malware identification lies in their semantics. It proposes a semantics-based framework for reasoning about malware detectors and proving properties such as soundness and completeness of these detectors. Our approach uses a trace semantics to characterize the behavior of malware as well as that of the program being checked for infection, and uses abstract interpretation to "hide" irrelevant aspects of these behaviors. As a concrete application of our approach, we show that (1) standard signature matching detection schemes are generally sound but not complete, (2) the semantics-aware malware detector proposed by Christodorescu et al. is complete with respect to a number of common obfuscations used by malware writers and (3) the malware detection scheme proposed by Kinder et al. and based on standard model-checking techniques is sound in general and complete on some, but not all, obfuscations handled by the semantics-aware malware detector.
引用
收藏
页数:54
相关论文
共 50 条
[1]  
Adleman L., 1988, LECT NOTES COMPUTER, V403
[2]  
[Anonymous], P 6 JOINT M EUR SOFT
[3]  
[Anonymous], 2001, In Virus Bulletin Conference
[4]  
[Anonymous], 2001, MODEL CHECKING
[5]  
BARAK B, 2001, LNCS, V2139, P1, DOI DOI 10.1007/3-540-44647-8
[6]  
Bergeron J., 2001, S REQ ENG INF SEC, P1
[7]  
BRIESEMEISTER L, 2005, SRICSL0503
[8]  
CHESS DM, 2000, VIRUS B
[9]  
CHOW S, 2001, LNCS, V2200, P144
[10]   Semantics-aware malware detection [J].
Christodorescu, M ;
Jha, S ;
Seshia, SA ;
Song, D ;
Bryant, RE .
2005 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, PROCEEDINGS, 2005, :32-46