140 likes | 152 Views
This workshop explores the challenges and solutions for interpreting and combining internet monitoring data, addressing issues such as inconsistent formats and conflicting data. It also discusses the use of ontologies and RDF data models for encapsulating domain expertise. The workshop aims to develop queries and applications for accessing and understanding internet monitoring data.
E N D
Semantic Interpretation of Internet Monitoring Data Ian Napier PhD Student, Dept of Computer Science Loughborough University Supervisors : Dr Iain Phillips Dr Lin Guan Multi-Service Networks Workshop, Cosener’s House 2009
Example Scenario : Intersite VPN Multi-Service Networks Workshop, Cosener’s House 2009
Where’s the gap? Multi-Service Networks Workshop, Cosener’s House 2009
Data Usage Problems • Different published formats (PCAP files, CSV, HTML, spreadsheets, databases etc.) How to access and combine these? • Inconsistent or Conflicting data (eg : AS level topology from BGP updates or Active probing methods [4]). How to mediate? • Understanding the significance of, and relationship between disparate data. Interpretation and semantics need domain expertise. How to encapsulate this knowledge? • RDF data models and Ontologies are one approach. Multi-Service Networks Workshop, Cosener’s House 2009
Ontology Summary A nice general definition : “An Ontology is a structure capturing semantic knowledge about a certain domain by describing relevant concepts and relations between them.” [5] data Ontology (knowledge model) metadata ‘knowledge’ High-Level Information Multi-Service Networks Workshop, Cosener’s House 2009
Introduction to Semantic Web Tools Layered Approach, built on the foundation of the Resource Description Framework (RDF) and the RDF ‘triple’ Statement : predicate (property) SUBJECT OBJECT cos:locatedIn cos:Cosener’s Hotel cos:Oxfordshire cos:hasCountyTown “149,100” cos:Oxford cos:hasPopulation cos = “http://nets.lboro.ac.uk/home/coidn/cosenersontology.rdf/” Multi-Service Networks Workshop, Cosener’s House 2009
Semantic Web Languages Query (SPARQL) Reasoners (Pellet,Fact) / Rules (SWRL) Ontology Languages (RDFS and OWL) Data Modelling (RDF) Multi-Service Networks Workshop, Cosener’s House 2009
Conceptual System Outline Application “Local” data (eg:Packet traces, traceroutes) Queries Reasoner/Rules Ontology Data Converters (eg: PCAP to RDF) RDF Triple Store RDF/XML files Serialisers/Parsers “Public” data (eg: RIPE) Multi-Service Networks Workshop, Cosener’s House 2009
Sample Ontology : Graph snippet Multi-Service Networks Workshop, Cosener’s House 2009
Sample Ontology : RDF/XML snippet <rdf:Description rdf:about="http://nets.lboro.ac.uk/TCPFLOW72"> <lboro:hasRESET>false</lboro:hasRESET> <lboro:hasSRCIPAddr rdf:resource="http://nets.lboro.ac.uk/SRCIP72"/> <lboro:hasDSTPort>80</lboro:hasDSTPort> <rdf:type rdf:resource="http://nets.lboro.ac.uk/TCPFlow"/> </rdf:Description> − <rdf:Description rdf:about="http://nets.lboro.ac.uk/UDPFlow"> <owl:disjointWith rdf:resource="http://nets.lboro.ac.uk/TCPFlow"/> <rdfs:subClassOf rdf:resource="http://nets.lboro.ac.uk/IPCommunication"/> <rdf:type rdf:resource="http://www.w3.org/2002/07/owl#Class"/> </rdf:Description> Multi-Service Networks Workshop, Cosener’s House 2009
Next Steps • Expand ontology domain to include more data, specifically more converters. • Develop richer semantics (predicates). • Develop queries and application, liaising with industry. • Initially concentrate on AS topology data and local packet captures. Thank You. Any questions? Multi-Service Networks Workshop, Cosener’s House 2009
References and Reading References [1] RIPE. www.ripe.net. [2] Colleen Shannon; David Moore; Ken Keys; Marina Fomenkov; Brad- ley Huffaker; K.Claffy. The internet measurement data catalog. ACM SIG- COMM Computer Communication Review, 35:97100, 2005. [3] MOME. www.ist-mome.org. [4] Mahadevan; Krioukov; Formenkov; Huffaker; Dimitropoulos;Claffy;Vahdat. The internet as-level topology:three data sources and one definitive metric. In ACM/SIGCOMM Computer Communications Review Vol 36 Jan2006, 2006. [5] Marko Grobelnik; Dunja Mladenic. Semantic Web Technologies: Trends and Research in Ontology Based Systems, chapter 2. Knowledge Discovery for Ontology Construction. John Wiley and Sons, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, England, first edition, 2006. Recommended Reading Grigoris Antoniou; Frank van Harmelen. A Semantic Web Primer. The MIT Press, Cambridge, Massachusetts, second edition, 2008. Dean Allemang; Jim Hendler. Semantic Web for the Working Ontologist, Elsever Inc, Burlington, Massachusetts, first edition, 2007. Related Work http://www.fp7-moment.eu/ Multi-Service Networks Workshop, Cosener’s House 2009
Supplementary Notes : Some Properties rdf :type rdfs :subClassOf :domain :range :subPropertyOf owl :class :ObjectProperty :datatypeProperty :inverseOf :SymmetricProperty :ReflexiveProperty A p A for allA :IrreflexiveProperty :TransitiveProperty Multi-Service Networks Workshop, Cosener’s House 2009
Supplementary Notes : Some Possible Data Sources • “Locally generated” data, including Passive packet captures at Internet gateways, Active traceroute data, etc. • Regional Internet Registry data (eg: RIPE): delay measurements, AS to IP Address mappings etc [1]. • CAIDA: Many ongoing projects, including Archipelago (ARK) for AS level topology updates. Also CAIDA Internet Data Catalog [2]. • MoMe : EU FP6 initiative which gives a standardised interface to data from several ongoing projects and a data catalog [3]. Multi-Service Networks Workshop, Cosener’s House 2009