390 likes | 542 Views
Hunting for Metamorphic Engines. Wing Wong Mark Stamp. In This Paper…. Analyze metamorphic malware Hacker-produced metamorphic code Measure similarity of software Based on n -gram analysis Compute scores Based on n -grams and Based on HMMs This paper is baseline for future work.
E N D
Hunting for Metamorphic Engines • Wing Wong • Mark Stamp Hunting for Metamorphic Engines
In This Paper… Hunting for Metamorphic Engines • Analyze metamorphic malware • Hacker-produced metamorphic code • Measure similarity of software • Based on n-gram analysis • Compute scores • Based on n-grams and • Based on HMMs • This paper is baseline for future work
Motivation Hunting for Metamorphic Engines • Many virus construction kits available • Many can produce metamorphic code • So anybody can create “new” version of existing malware • Virtually no technical expertise required • How “effective” is the resulting metamorphic code? • Can we detect metamorphic malware?
Background Hunting for Metamorphic Engines • Encrypted, polymorphic, metamorphic • Metamorphic == body polymorphic • Metamorphic vs cloned software • Clone is the norm, but metamorphic could offer advantages to the good guy too… • From the theory, we know malware detection is NP-complete • And metamorphic is at least as hard • But what about practical situation?
Metamorphism Hunting for Metamorphic Engines • Metamorphic code changes it “shape” • Well-known examples • W95/Regswap • W32/Ghost • W95/Zperm • MetaPHOR
Metamorphism Hunting for Metamorphic Engines • General techniques available • Insertion • Substitution • Transposition • Deletion • Some easier than others • Some more effective against certain detection strategies
Virus Construction Kits Hunting for Metamorphic Engines • In this paper, we consider • PS-MPC (Phalcon/Skism Mass Produced Code generator) • G2 (Second Generation virus generator) • MPCGEN (Mass Produced Code GENerator) • NGVCK (Next Generation Virus Construction Kit) • VCL32 (Virus Creation Lab for Win32)
Virus Construction Kits Hunting for Metamorphic Engines • Did not consider MetaPHOR • Difficult to work with, finicky • All of these claim to be metamorphic • Are they really? • How can we measure “metamorphism”? • If they are highly metamorphic, can we still detect them?
Brief Review of Malware Detection Hunting for Metamorphic Engines • First generation • Signature scanning, wildcards OK • Second generation • Approximate signature scanning; e.g., ignore NOP instructions • Code emulation • Heuristic analysis • Static or dynamic, false positives…
Machine Learning Hunting for Metamorphic Engines • Consider the following • Data Mining, Neural Networks, HMMs • Data Mining • Malware-related previous work • Generic approach • Neural Networks • Previous work based on byte trigrams • Developed and used at IBM
Hidden Markov Models Hunting for Metamorphic Engines • Train HMM on metamorphic family • Then we can score any file to see how “close” it is to the family • What to use to train such an HMM? • Raw bytes in exe? • Disassembled code? • Opcode sequence? • More on this later…
Software Similarity Hunting for Metamorphic Engines • How to quantify metamorphism? • In general, how to measure similarity of software? • Given program 1 and program 2.. • We develop a score • Score of 0 means “no similarity” • Score of 1 means “virtually identical”
N-gram Similarity Hunting for Metamorphic Engines • Given executable files X and Y • Extract opcode sequences from each • Suppose X has nopcodes • Suppose Y has mopcodes • How to compare the sequences? • Many possible ways --- here we use n-gram analysis • That is, we compare subsequences
N-gram Similarity Hunting for Metamorphic Engines • Extracted opcode sequences • X=(x0,x1,…,xn-1) and Y=(y0,y1,…,ym-1) • Compare subsequences of length k • Then xi,xi+1,…,xi+k-1 matches yj,yj+1,…,yj+k-1 if they are the same in any order • For each such match, plot the point (i,j) • Remove any segments less than p points • Then score = (x axis covered + y axis covered) / 2
N-gram Similarity Example Hunting for Metamorphic Engines
N-gram Similarity Hunting for Metamorphic Engines • Score is between 0 and 1 • If program X identical to program Y • Main diagonal is a solid line • And score = 1 • Minimum score is 0 • The smaller the score, the less similar are the programs
Typical N-gram Similarity Hunting for Metamorphic Engines Normal (cygwin utility) files
Typical N-gram Similarity Hunting for Metamorphic Engines NGVCK
Typical N-gram Similarity Hunting for Metamorphic Engines G2
N-gram Similarity Hunting for Metamorphic Engines Compare members of a “family” with each other
N-gram Similarity Hunting for Metamorphic Engines In graphical form…
N-gram Similarity Conclusion? Hunting for Metamorphic Engines • G2 more similar to each other than expected • So, they are not very metamorphic • Ditto for most of the other generators • But, NGVCK viruses more different from each other than expected • So, they are highly metamorphic • Implication wrt signature detection?
NGVCK Similarity Hunting for Metamorphic Engines Compare NGVCK to other families…
NGVCK Similarity Conclusion? Hunting for Metamorphic Engines • NGVCK viruses very different from each other • Implies highly metamorphic… • …so, signature detection will fail • But NGVCK viruses are even more different from normal files • Then what about detection?
Aside: Similar Similarity Measures to Consider? Hunting for Metamorphic Engines • Given opcode sequences • Edit distance • Other sequence comparison techniques • Statistical measures • Considering raw bytes • Statistical measures • Entropy and other “structural” measures
Hidden Markov Models Hunting for Metamorphic Engines Generic view of HMM
HMM Notation Hunting for Metamorphic Engines
HMM for Metamorphic Detection Hunting for Metamorphic Engines • Train HMM • Extract opcodes from family executables • Append opcode sequences • Train a model, i.e., determine matrices • Use trained HMM to score files • Given an file, extract opcode sequence • Score sequence against the model • Compare to predetermined threshold
HMM Scoring: Fine Points Hunting for Metamorphic Engines • Score computed as log likelihood of the scored sequence • We mormalize score to “log likelihood per opcode” (LLPO) • Why? • How to quantify effectiveness? • ROC curves are very useful • Specifically, area under ROC curve (AUC)
Results Hunting for Metamorphic Engines HMM scoring for NGVCK family
HMM Scoring: Bottom Line Hunting for Metamorphic Engines • Signature detection for metamorphic families, except NGVCK • For NGVCK, we can use HMM • Classification is 100% when compared to normal (benign) files • Some misclassifications of other malware (is that good or bad?) • Should include ROC curves, AUC, …
HMM States: 3 State Model Hunting for Metamorphic Engines
N-gram Score Hunting for Metamorphic Engines • Can also score files using N-grams • Randomly select NGVCK file • Extract its opcode sequence • Given a file we want to score • Extract its opcode sequence • N-gram similarity to NGVCK sequence • Higher similarity, classify as NGVCK • Lower similarity, classify as “not NGVCK”
N-gram Score Results? Hunting for Metamorphic Engines • For NGVCK, obtain ideal separation • There exists a threshold for which… • …we can separate NGVCK from normal • Surprisingly strong results • For such a simple similarity score • Why does this work? • We come back to this at the end…
Compare to Commercial AV Hunting for Metamorphic Engines • Tested following on our virus sets • eTrust, avast!, AVG • These scanners detected most of the viruses from weak families • That is, G2, VCL32, etc. • But none of the NGVCK viruses detected by any of the 3 scanners
Conclusion Hunting for Metamorphic Engines HMM effective at detecting a highly metamorphic NGVCK malware family N-gram similarity also effective NGVCK not detected by commercial AV So, this detection improves the state of the art Practical considerations?
Lessons Learned? Hunting for Metamorphic Engines Why can we detect NGVCK family? In spite of high metamorphism, code is statistically different from normal “Improved” metamorphic malware? Metamorphism must be sufficient to evade signature detection But, metamorphic family must be statistically similar to normal
Future Work Hunting for Metamorphic Engines • Build a better metamorphic generator • Some progress here, but still detectable using other detection methods • Still need better generators… • Develop and test other detection strategies • Lots of work done here too • But lots more to do
References Hunting for Metamorphic Engines W. Wong and M. Stamp, Hunting formetamorphic engines, Journal in Computer Virology 2(3):211-229, 2006 M. Stamp, A revealing introduction tohidden Markov models