Automated Signature Generation for Detecting Botnet-Based Spam Activity

Spamming Botnets: Signatures and Characteristics Authors:Yinglian Xie, Fang Yu, Kannan Achan, Rina Panigrahy, Geoff Hulten+, Ivan Osipkov+ Presenter: Chia-Li Lin

References • Y. Xie, F. Yu, K. Achan, R. Panigrahy, G. Hulten, and I. Osipkov. Spamming botnets: Signatures and characteristics. In SIGCOMM, 2008

Outline • Introduction • Spam Activity Trends • AutoRE Structure • Study Results • Conclusion

Introduction • Developed a spam signature generation framework called: • AutoRE • To detect botnet-based spam emails and botnet membership • It outputs high quality regular expression signatures

Contribution • Ability to detect frequent domain modifications • In-depth analysis of identified spamming botnet characteristics and their activity trends

Two Observations • First, spammers often add random, legitimate URLs to content • legitimate and very general (e.g.,http://www.w3.org) • Second, customize polymorphic URLs

Multi-URL spam emails

Polymorphic URLs

AutoRE • Automatically generating URL signatures to identify botnet-based spam campaigns • Produces two outputs: • a set of spam URL signatures • complete URL string (CU) • URL regular Expression (RE) • a related list of botnet host IP addresses

Three modules • AutoRE is comprised of the following three modules • URL preprocessor • Group selector • RegEx generator • domain-specific • domain-agnostic

AutoRE Structure[1/2]

AutoRE Structure[2/2]

Suffix-array algorithm

keyword-based signature tree

Detailing and Generalization • Detailing • returns a domain specific regular expression using a keyword-based signature as input. • Generalization • returns a more general domain-agnostic regular expression by merging very similar domain-specific regular expressions

Generalization

Detect Results • Using three months of sampled emails from Hotmail • November 2006, June 2007, July 2007 • AutoRE successfully detected • 7,721 spam campaigns • 340,050 distinct botnet host IP addresses • spanning 5,916 ASes.

CU& RE Statistics

False positive rate

Conclutions • This is the first successful attempt to automatically generate regular expression signatures • The existence of botnet spam signatures and the feasibility of detecting botnet hosts using them

Questions

Automated Signature Generation for Detecting Botnet-Based Spam Activity

Automated Signature Generation for Detecting Botnet-Based Spam Activity

Presentation Transcript

BOTNETS

Botnets

Botnets

Botnets

Botnets

Botnets

STUDYING SPAMMING BOTNETS USING BOTLAB THE NERD VERSION OF

Botnets

Botnets and Applications

Botnets

Botnets

Spamming Techniques and Control

Spamming Botnets: Signatures and Characteristics

Botnets

Botnets

Botnets

Report on “ Spamming Botnets: Signatures and Characteristics ”

Botnets

Botnets

Botnets