1 / 20

CS519 BGP Project Report

CS519 BGP Project Report. Kai-Wen Chung (kc279) San-Yiu Cheng (sc345). How to Proceed BGP Analysis. Collect Raw Data. Import into Database. Query Database and Analyze data. Collect Raw Data. MAE-EAST (1998.1 ~ 1998.11) http://archive.routeviews.org/ (2003.1 ~ 2003.3). Database Schema.

elton
Download Presentation

CS519 BGP Project Report

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CS519 BGP Project Report Kai-Wen Chung (kc279) San-Yiu Cheng (sc345)

  2. How to Proceed BGP Analysis Collect Raw Data Import into Database Query Database and Analyze data

  3. Collect Raw Data • MAE-EAST (1998.1 ~ 1998.11) • http://archive.routeviews.org/ (2003.1 ~ 2003.3)

  4. Database Schema • Original Schema

  5. Database Schema (cont.) • Record Size • Message: 94 bytes/record • MsgPath: 18 bytes/record • # Record • Message: 104,841,405 (98.1 ~ 98.11) • MsgPath: 251,442,478 (98.1 ~ 98.11)

  6. Database Schema (cont.) • Database space allocation: 20GB • About 12 hours to import raw data for 1 month (about 10,000,000 messages, and 20,000,000 paths) • Data volume reaches limitation soon

  7. Our Solution • Allocate larger space • Move Database from SQLServer -> Sparrow • Total 70GB • Modify data schema to reduced record size

  8. Data Schema Modification

  9. Data Schema Modification • Record Size • Message: 52 bytes/record • MsgPath: 14 bytes/record • Size Reduces • Message: 46.9% • MsgPath: 22.2% • Faster Data Importing

  10. Current Status • Database • P3-500 with 128MB ram, and Windows 2000 Server and SQL Server 2000 installed • Imported Data • 1998.1 ~ 1998.11. About 21GB in DB • 2003.3. About 34GB in DB

  11. Current Database Issue • SQL Server Performance • It could take several hours to run a query • Space problem • 70GB is only enough for data of 1 ~ 2 month (of 2003) • We need a “Tera-byte” Database to accommodate all data of 2002, and 2003

  12. Summary of Data • Total space used: • ~55G (1998 and 03/2003) • Number of Messages: • ~220.5 Million (1998 and 03/2003) • Number of DataSet: • ~30,000 (1998 and 03/2003)

  13. Summary of Data (cont.) • A small number of IP addresses dominate the routing table • 15 Source IP addresses occupy about 68% of the PeerIp field of the Messages • 15 Destination IP Addresses occupy about 47% of the NextHop field of the Messages

  14. Summary of Data (cont.) • Advertisement Vs. Withdrawal Messages • There are about 220 Million Messages • ~31.5% of all Messages are Withdrawal Messages • ~68.5% of all Messages are Advertisement Messages

  15. Data Analysis

  16. Data Analysis (cont.)

  17. Data Analysis (cont.)

  18. Data Analysis (cont.)

  19. Data Analysis (cont.)

  20. Some Advices • Optimize your query • Some queries are going to take several hours to execute • Test on bgpbaby first • This is a smaller version of bgpdata (~1G) • Don’t try to execute all your queries on last day • The SQL Server database is going to be overwhelmed

More Related