130 likes | 268 Views
Present status of the ILMT software. Poels , J. and Bartczak, P. ARC Liège, 19 February 2009. The ideal solution, however out of reach (highly expensive ). One multiprocessor computer and RAID storage. Monolithic systems. Advantages In the box multiple CPUs (8,16,…)
E N D
Present status of the ILMT software Poels, J. and Bartczak, P. ARC Liège, 19 February 2009
The ideal solution, however out of reach (highly expensive) One multiprocessor computer and RAID storage
Monolithic systems Advantages • In the box multiple CPUs (8,16,…) • Very low latency due to share of RAM (16GB,32GB,…) between processors through high speed FSB (backbone memory bus). • Easy setup of multiple virtual machines • High speed disk transactions Drawbacks • Costs out of reach (purchase, maintenance, …) • Dedicated hardware • Contracting with a unique company
ILMT cluster PC PC PC PC PC PC PC PC
The clustering concept Advantages • A way to break up large programs and split them across a series of workstations. From 1997, words like « Beowulf » are tossed and cheap supercomputers running Linux OS. • Heterogeneous hardware brands • A modern PC (node) can now accommodate up to 4TB of SATA disks and plenty of RAM Drawbacks • High latency due to network communication between nodes • Data to process is not necessarily stored on the target node -> TCP/IP transfers on NAS (multi-TB Networks-attached storage) • Beowulf solutions even though well adapted to procedural languages like C is not adequate to OO (Object Oriented) programming. • Needs for a middleware Layer as an interface between software and hardware to ease distributed computing in OO environment.
Solutions • High latency can be overcome avoiding inter-nodes communications, therefore, processes must be independent. However, interaction with the cluster scheduler running the ORACLE database server is still necessary. • The cluster scheduler (implemented in Oracle) should have a knowledge of the location of the data to be used in the process and run it in the closest node if possible. • The overhead time decreases as Ethernet performance increases (100mb/s now)…1Gb is now affordable meaning that the future is bright. • As a rule of thumb one should write the code such that the total overhead time <<<< than the process execution time.
Database NODE PC PC Network LAN CORBA
Common Object Request Broker Architecture (CORBA)client side • class testclient { • testclient(); • ~testclient(); • // Resolved and narrowed CORBA object for proxy calls • Data::test_var m_Data; • bool RequestTestSum(); }; • bool testclient::RequestTestSum() • { • CORBA::Long num1=4; CORBA::Long num2=5; CORBA::Long num3=6; • long sum; • m_Data = test::_narrow(obj1.in()); • // This is the CORBA call which is to be executed remotely • sum=m_Data->testsum( num1, num2, num3); • cout << « sum=" << sum << endl; • return true; } • void main(int argc, char** argv) • { • // Constructor establishes the link with the CORBA server. • testclient testcl; • testcl.RequestTestSum()); • }
Common Object Request Broker Architecture (CORBA)server side • class test_i : public POA_Data::test, • public PortableServer::RefCountServantBase • { • public: • test_i(); • virtual ~test_i(); • virtual CORBA::Long testsum( CORBA::Long num1,CORBA::Long num2,CORBA::Long num3);}; • test_i::test_i(){} • test_i::~test_i(void){} • CORBA::Long test_i::testsum( • CORBA::Long num1, • CORBA::Long num2, • CORBA::Long num3) • { • long result; • result = num2 + num1 +num3; • return result; • }
Pipeline(s) • A pipeline (C++, Pyraf) is a stack of ordered tasks or processes where the output of each becomes the input to the next in the sequence, the final result being images or alphanumeric data: • Preliminary astrometry and photometry on incoming ILMT data (daily) • Image subtraction aimed to compute the variability of point-like sources melted in extended sources or early discovery of supernovae's (daily) • Build and refine reference images (monthly) • Extract potential periods from point-like sources photometry (SQL Oracle database) • Etc… • As the project is international, the cluster must accommodate independent pipelines developed by collaborators (Canada, India) • For cluster performance reasons a pipeline must be run on one and only one node providing its results to the cluster master
New observation data NODE computers Database server Storage JOBS list Welcomedeamon Results JOBS watchdog Pipeline 3 Pipeline 2 Pipeline 1
Pipeline 1 Pipeline 2 Pipeline 3