140 likes | 153 Views
Deriving Risk Events through the Analysis of Distributed Netstat Data. Timothy Wright Graduate Operating Systems Fall 2006. The Motivation. IT Risk Management (RM) Risk picture--accuracy depends on information Data Sources for Risk Events Network oriented Expense: keep it cheap
E N D
Deriving Risk Eventsthrough the Analysis ofDistributed Netstat Data Timothy Wright Graduate Operating Systems Fall 2006
The Motivation • IT Risk Management (RM) • Risk picture--accuracy depends on information • Data Sources for Risk Events • Network oriented • Expense: keep it cheap • Resource needs: keep it simple
The Problem What network data source is cheap and easy to obtain, yet provides great accuracy (e.g., when, what, where, how) and semantic (e.g., who, why) value?
The Netstat Solution • Netstat • “Inside” perspective • Ubiquitous • Distributed Collection • Leverage both sides of a connection • Aggregate observations
…but, the truly compelling thing about netstat is: All TCP/UDP activities can be tied to an account-level entity • User account associations • Program name associations (root only)
Data Drop Problem • Type 1: monitored host offline • Type 2: netstat-agent script failing to log • 83,184 connections logged; 1,371 Type 2 data drops detected (about 2%)
Types of Reports Generated from Distributed Netstat Data • Traffic Composition • Various Kinds of Inbound/Outbound Connections • Human User Activities • Bipartite Matching
Bipartite Matching • Match up listeners with clients • Only works for connections among monitored hosts • Example non-matched connection: • Why would there be an SSH connection between cluster hosts on a Saturday afternoon? • Any risk here?
Bipartite Matching (continued) Example matched connection: • Student account originated connection • User’s affiliation may diminish risk (e.g., in this case, activity seems legit)
Conclusions and Future Work • Netstat data offer a strong information resource for IT RM • The “who” in network transactions • The “how” in network transactions (if root) • Heightened semantics • Issues with DragNet • Not real time (fixed) • Harvesting and processing netstat data (fixed) • Data drops • Type 1 (monitored via heartbeat) • Type 2 (mitigated)