Identifying Botnets Using Anomaly Detection Techniques Applied to DNS Traffic Speaker: Chiang Hong-Ren
Outline • Introduction • Anomaly detection techniques • DDNS-based botnet detection • Methodology • Experimental Results • Discussion • Conclusion
Introduction • The first approach looks for domain names whose query rates are abnormally high or temporally concentrated. • The second approach looks for abnormally recurring DDNS replies indicating that the query is for a nonexistent name (NXDOMAIN). • This paper experimentally evaluates the effectiveness of these approaches for detecting botnets in enterprise and access provider networks.
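The second approach amounts to counting recurring NXDOMAIN replies per queried name. A minimal Python sketch of that idea (the domain names, reply log, and threshold below are hypothetical illustrations, not data from the paper):

```python
from collections import Counter

NXDOMAIN = 3  # DNS RCODE 3: "Name Error" (nonexistent domain)

def recurring_nxdomain(replies, threshold):
    """Count NXDOMAIN replies per queried name and flag names whose
    count meets or exceeds the threshold (hypothetical parameter)."""
    counts = Counter(name for name, rcode in replies if rcode == NXDOMAIN)
    return {name for name, n in counts.items() if n >= threshold}

# Hypothetical reply log: (queried name, DNS RCODE)
replies = [("cnc.example.net", 3)] * 5 + [("www.example.com", 0)] * 5
print(recurring_nxdomain(replies, threshold=3))  # {'cnc.example.net'}
```

A bot repeatedly resolving a C&C name whose DDNS record has been taken down produces exactly this kind of recurring NXDOMAIN pattern.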
Anomaly detection techniques • Dagon et al. use Chebyshev’s inequality and a simplified version of the Mahalanobis distance to quantify how anomalous the number of queries for each domain name is during a day, or during each hour of that day, respectively. • Considering that botnets often use Third-Level Domains (3LDs) instead of subdirectories, Dagon et al. aggregate lookups for each Second-Level Domain (SLD) with those of the respective 3LDs.
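The Chebyshev-based test can be sketched in a few lines: by Chebyshev’s inequality, any value more than k standard deviations from the mean occurs with probability at most 1/k². A minimal Python illustration (the historical query counts are hypothetical, not from the paper):

```python
import statistics

def chebyshev_anomalous(history, current, k):
    """Flag `current` as anomalous if it lies more than k standard
    deviations above the mean of historical query counts.
    Chebyshev's inequality bounds the probability of such a
    deviation by 1/k**2 for any distribution."""
    mu = statistics.mean(history)
    sigma = statistics.pstdev(history)
    return current > mu + k * sigma

history = [100, 120, 110, 95, 105]  # hypothetical daily query counts
print(chebyshev_anomalous(history, current=500, k=4.47))  # True
print(chebyshev_anomalous(history, current=115, k=4.47))  # False
```

With k = 4.47 (the value the methodology section uses), the bound 1/k² is roughly 0.05, i.e., at most about 5% of days would be flagged for a name behaving normally.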
DDNS-Based Botnet detection • In their method, a “Canonical DNS Request Rate” (CDRR) aggregates the query rate of an SLD with the query rates of the SLD’s children 3LDs. • When the CDRR of a name is anomalous according to Chebyshev’s inequality, that name has an abnormally high query rate and is likely to belong to a botnet C&C server. • They suggest that names whose feature vector differs from that of a normal name by more than a threshold are likely to belong to a botnet C&C server.
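The aggregation step can be sketched as follows. The paper’s exact CDRR formula is not reproduced on this slide; this sketch simply sums an SLD’s query rate with the rates of the names beneath it, per the description above (all rates are hypothetical):

```python
def cdrr(rates, sld):
    """Aggregate an SLD's query rate with the rates of names under it
    (its 3LD children). A sketch of the aggregation described in the
    text, not the paper's exact formula."""
    total = rates.get(sld, 0)
    for name, r in rates.items():
        if name != sld and name.endswith("." + sld):
            total += r
    return total

# Hypothetical per-name query rates (queries per interval)
rates = {"example.com": 10, "a.example.com": 5,
         "b.example.com": 7, "other.org": 3}
print(cdrr(rates, "example.com"))  # 22
```

The aggregated CDRR value is then fed into the same Chebyshev test used for individual names.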
Methodology (1/3) • A. Data Collection • We collected all DNS traffic at the University of Pittsburgh (Pitt) Computer Science (CS) department for a period of 192 hours (9 days), starting on 2/13/2007. • We used the tcpdump network sniffer to collect this data (11 GB) and stored it in the pcap format.
Methodology (2/3) • B. Data Selection • AA = Authoritative Answer • RR = Resource Record • NXDOMAIN = Name Error • ANS = Answer RR • AUTH = Authority RR • TTL = Time to Live • NS = Name Server • SOA = Start of Authority
Methodology (3/3) • C. Detection of abnormally high rates • We verified whether each SLD is anomalous according to Chebyshev’s inequality with k = 4.47. • We then investigated whether anomalous SLDs are indeed suspicious. • D. Detection of abnormally temporally concentrated rates • The top SLDs with distances exceeding a threshold were considered anomalous. • We then investigated whether anomalous SLDs are indeed suspicious.
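Step D compares a name’s hourly query-count vector against the historical per-hour mean using a simplified (diagonal) Mahalanobis distance; a temporally concentrated burst inflates the distance. A minimal Python sketch (the smoothing constant `alpha`, the 3-hour vectors, and the counts are assumptions for illustration, not the paper’s exact formulation):

```python
def simplified_mahalanobis(x, mean, std, alpha=1e-6):
    """Simplified Mahalanobis distance assuming a diagonal covariance:
    the sum of per-hour absolute deviations, each normalized by that
    hour's historical standard deviation. `alpha` (an assumption)
    guards against division by zero."""
    return sum(abs(xi - mi) / (si + alpha)
               for xi, mi, si in zip(x, mean, std))

mean = [10.0, 12.0, 11.0]   # hypothetical per-hour mean query counts
std  = [2.0, 2.0, 2.0]
burst = [10.0, 60.0, 11.0]  # queries concentrated in one hour
flat  = [11.0, 12.0, 10.0]  # queries spread normally
print(simplified_mahalanobis(burst, mean, std) >
      simplified_mahalanobis(flat, mean, std))  # True
```

Names whose distance exceeds the chosen threshold are the ones flagged as anomalous in step D.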
Experimental Results (1/8) • Summary of our results for detection based on abnormally high rates.
Experimental Results (2/8) • SLDs in CS_NS with anomalously high rates that were independently reported as suspicious.
Discussion • Distinguishing DDNS queries from other DNS queries is difficult in enterprise and access provider networks. • Many legitimate domains, such as google.com, yahoo.com, and weather.com, use low TTL values. • Some legitimate and popular domain names, such as mozilla.com, are also hosted by DDNS providers. • Smaller botnets can be expected to generate fewer queries for each C&C server, making the latter’s detection more difficult.
Conclusion • The first approach generated many false positives (legitimate names classified as C&C servers). • The second approach was effective: most of the names it detected were independently reported as suspicious by others. • Of the two botnet detection algorithms evaluated, only the second reliably detected botnet C&C activity. • Increasingly, popular legitimate names such as gmail.com and mozilla.com are using low TTL values or DDNS hosting, blurring these boundaries and confounding classification.