NTU Cloud 2010/05/30
Architecture • Gluster File System • Provides a distributed shared file system for VM migration • NFS • Storage space for Prototype Images [Diagram labels: Compute Img (C-Img ×3), Storage Img (S-Img), Prototype Img, NFS, Gluster File System]
Architecture • Prototype Image • Original image, e.g. Hadoop, MPI • Compute Image • Modified image for the user • Content is not preserved after cluster shutdown
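The image lifecycle above can be sketched as follows. This is a minimal illustration, assuming that a Compute Image is created by copying a Prototype Image file and that cluster shutdown simply deletes the copies. The function names and the directory layout are hypothetical and do not come from the slides.

```python
import shutil
from pathlib import Path


def create_compute_image(prototype: Path, compute_dir: Path, vm_name: str) -> Path:
    """Copy a prototype image into a per-VM compute image (assumed mechanism)."""
    compute_dir.mkdir(parents=True, exist_ok=True)
    target = compute_dir / f"{vm_name}.img"
    shutil.copyfile(prototype, target)  # the prototype itself is never modified
    return target


def discard_compute_images(compute_dir: Path) -> int:
    """Delete every compute image at cluster shutdown; return how many were removed."""
    removed = 0
    for image in compute_dir.glob("*.img"):
        image.unlink()
        removed += 1
    return removed
```

User changes land only in the copy, so the Prototype Image stays reusable for the next cluster.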
XEN • A hypervisor • Virtualization
Cloud Master • Monitor system state • Scheduling • Use NFS to store Prototype Image • Web server
OpenNebula • A middleware • Provides an interface to manage virtual infrastructure (computation and network) • VM Migration => We use OpenNebula to manage VM deployment and migration and to set up virtual local area networks (VLANs).
Gluster file system • User-level distributed file system • Client/Server Architecture • Uses TCP/IP to transfer data => We use GlusterFS to build our shared file system environment for VM live migration. => Our deployment is "symmetrical": every machine is both a server and a client.
Hadoop Benchmark • Case 1 • M1 : Master + Slave-01 + Slave-02 • Case 2 • M1 : Master • M2 : Slave-01 + Slave-02 • Case 3 • M1 : Master • M3 : Slave-01 + Slave-02 • Case 4 • M1 : Master • M2 : Slave-01 • M3 : Slave-02
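The four placements above can be written down as data, which makes it easy to see which cases put both slave VMs on one physical machine. This is only an illustrative sketch; the names `CASES` and `slaves_share_host` are hypothetical.

```python
# Physical machine -> Hadoop VMs hosted on it, copied from the four cases above.
CASES = {
    1: {"M1": ["Master", "Slave-01", "Slave-02"]},
    2: {"M1": ["Master"], "M2": ["Slave-01", "Slave-02"]},
    3: {"M1": ["Master"], "M3": ["Slave-01", "Slave-02"]},
    4: {"M1": ["Master"], "M2": ["Slave-01"], "M3": ["Slave-02"]},
}


def slaves_share_host(placement: dict) -> bool:
    """Return True when both slave VMs run on the same physical machine."""
    return any(
        "Slave-01" in vms and "Slave-02" in vms for vms in placement.values()
    )


for case_id, placement in CASES.items():
    # Print: case number, number of machines used, whether the slaves are co-located.
    print(case_id, len(placement), slaves_share_host(placement))
```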
[Chart: axes labeled Sec and Iteration]
Set 1 • M1 and M2 have the same CPU and memory size. • HADOOP_HEAPSIZE=500MB • mapred.child.java.opts=100MB • RandomWriter 10M for 30 maps • Sorting • HDFS_BYTES_READ=210543161 • HDFS_BYTES_WRITTEN=210541669
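As a sketch of how the two memory settings above are usually expressed in a Hadoop configuration: HADOOP_HEAPSIZE is set in conf/hadoop-env.sh as a number of megabytes, and mapred.child.java.opts holds JVM options for the child tasks. The helper names below are hypothetical, and the code assumes that the slide's "100MB" means a child JVM maximum heap of `-Xmx100m`, which the slides do not state explicitly.

```python
import xml.etree.ElementTree as ET


def hadoop_env_line(heap_mb: int) -> str:
    """Line for conf/hadoop-env.sh; HADOOP_HEAPSIZE is given in MB."""
    return f"export HADOOP_HEAPSIZE={heap_mb}"


def mapred_child_opts_xml(child_heap_mb: int) -> str:
    """Build a <configuration> element holding the child JVM heap limit."""
    configuration = ET.Element("configuration")
    prop = ET.SubElement(configuration, "property")
    ET.SubElement(prop, "name").text = "mapred.child.java.opts"
    # Assumption: "100MB" on the slide corresponds to -Xmx100m.
    ET.SubElement(prop, "value").text = f"-Xmx{child_heap_mb}m"
    return ET.tostring(configuration, encoding="unicode")


print(hadoop_env_line(500))
print(mapred_child_opts_xml(100))
```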
Reduce shuffle bytes=199629523 Sort Reduce shuffle bytes=203039958 Launched reduce tasks=4 Others=3 Exactly the same! Therefore, putting two VMs on one machine slows performance down to 88.92%: two machines / single machine = 88.92%
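The ratio quoted above is a plain quotient of the two-machine measurement over the single-machine measurement. A minimal sketch of that arithmetic follows; the function name is hypothetical, and the input numbers are placeholders, not the measurements behind the 88.92% figure, which the slides do not list.

```python
def two_vs_single_ratio(two_machine: float, single_machine: float) -> float:
    """Return two_machine / single_machine as a percentage."""
    if single_machine <= 0:
        raise ValueError("single_machine must be positive")
    return 100.0 * two_machine / single_machine


# Hypothetical measurements, NOT the values behind the 88.92% figure.
print(f"{two_vs_single_ratio(90.0, 120.0):.2f} %")  # prints 75.00 %
```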
Set 2 1. RandomWriter 10M for 30 maps 2. Sort HADOOP_HEAPSIZE=500MB mapred.child.java.opts=100MB
RandomWriter Therefore, putting two VMs on one machine slows performance down to 80.70%: two machines / single machine = 80.70%
Current Progress • Xen 4.0 is ready on each node. • We can offer two kinds of images • Hadoop • MPI • VMs are started on the destination node automatically. • The MPI and Hadoop environments are configured for use automatically.