170 likes | 179 Views
This workshop focuses on developing a P2P client application based on 3rd generation P2P protocol for distributing biological databases and software over low-bandwidth networks. The performance of the application is tested with nodes in the Asia-Pacific region.
E N D
P2P Workshopfor Automatic Synchronization and Distribution of Biological Databases and Software over Low-Bandwidth Networks Unitsa Sangket, Amornrat Phongdara, Wilaiwan Chotigeat, Darran Nathan, Woo-Yeon Kim, Tin Wee Tan, Jong Bhak, Chumpol Ngamphiw, Sissades Tongsima, Asif M. Khan and Honghuang Lin By
Outline • Background • Objectives • Methods • Results • Setup of the P2P node
Background • Bioinformatics and the need for network bandwidth • Bioinformatics involves the collection, organization and analysis of large amounts of biological data. • normally update database by a file transfer over FTP • Network bandwidth within developing countries • still very low • the low reliability of connections means breaks / aborts in downloads are common
Background • the revolution in file sharing technology • Peer-to-Peer (P2P) file exchange • 1st generation • Napster • exchanging files among client through server • 2nd generation • FastTrack/Kazza • continued to evolve and improve, no server • 3rd generation • Azureus • advance over previous P2P protocols with BitTorrent • a large file will be broken up into smaller fragments and reassemble
Background • the revolution in file sharing technology traditional client/server distribution of files P2P distribution of files using BitTorrent protocol source: http://no.wikipedia.org/wiki/BitTorrent
Background • 3rd generation P2P technology solve • Low international bandwidth • provide additional bandwidth, speed up the overall download rate • Unreliable connections • download automatic from the best connections
Background • applied in three areas – the distribution of • biological software (size of files ~ < 1GB) • courseware (size of files ~ 1GB – 10GB) • databases(size of files ~ 10GB - 100GB)
Objectives • To extenda P2P client application base on 3rd generation P2P protocol for use in the distribution of biological software, courseware, and databases. • To set up and test the performance with nodes in countries in the Asia-Pacific region.
Methods • The Azureus P2P (http://azureus.sourceforge.net/) suite was selected because • open-source • runs on Java • well documented plugin interface
Methods • A RSSFeed Scanner Plugin was used to trigger automatic synchronization of data at regular intervals
Methods • Four trial nodes have been setup • Prince of Songkla University (PSU, Thailand) • Korean Bioinformation Center (KOBIC, Korea) • National University of Singapore (NUS, Singapore) • National Center for Genetic Engineering and Biotechnology (BIOTEC, Thailand).
Results Fig. Total data size downloaded
Acknowledgements • the International Development Research Centre (IDRC) Canada • the Asia-Pacific Bioinformatics Network (APBioNet) • MOST and KADO of MIC in Korea.
P2PWiki • http://everest.bic.nus.edu.sg/p2p/index.php/Main_Page
Setup of the P2P node 1. Install Azureus and Java 2. Set up Azureus • Set up client of Azureus • To download and upload data • Set up server of Azureus • To create the .torrent files and publish them 3. Install and set up RSSFeed Scanner Plugin • To synchronize data automatically 4. Install and set up Advanced Statistics Plugin • To test performance
1. Install Azureus and Java • ftp://192.168.1.2 • http://azureus.sourceforge.net/download.php