60 likes | 149 Views
Status of PDC’07. L. Betev ALICE-LCG Task Force, Aug 16, 2007. Central services - load. Central services - AliEn updates to use service aliases instead of host names Allows to extend the load balancing to all services (presently used for proxy)
E N D
Status of PDC’07 L. Betev ALICE-LCG Task Force, Aug 16, 2007
Central services - load • Central services - AliEn updates to use service aliases instead of host names • Allows to extend the load balancing to all services (presently used for proxy) • Adding / upgrading of individual hosts will be transparent • Increased reliability • Job load • 300K jobs (DONE+Error) in the last 7 days • 0.5 jobs/sec ALICE-LCG TF Meeting
Central services - load (2) • Most loaded hosts/services • db06a (Catalogue/Task queue DB) - average 17, peak 120 • This is the Task Queue - frequent job status changes (each job updates the DB at least 5 times) + additional remaining open connections to DB • Fixes for the above are being made • db1 (Job Optimizer, Authen, IS) - average 4, peak 50 • The Job Optimizer should go a dedicated host • The other 4 hosts are balanced very well ALICE-LCG TF Meeting
Job Errors • Mostly ERROR_V due to problems with application installaton • Three sources: • Installation by JA on mixed 32/64 bit WNs/VO-box - incompatibility of libraries (3 sites) • Incomplete installation by JAs from sites with pool accounts • Shared filesystem problems during installation • Total affected - 10 sites • Fix - PackMan flag (in LDAP) allowing only the VO-box to do the packages installation for certain sites ALICE-LCG TF Meeting
Resources usage ALICE-LCG TF Meeting
Data transfers and staging • Smooth data transfer to GSI • One more disk server will be available tomorrow as xrootd CASTOR2 buffer at CERN • We need to step-up the installation of remote storage (difficult in August) • We should complete the instegration of the tools for data transfer and staging ALICE-LCG TF Meeting