100 likes | 252 Views
ALMA Integrated Computing Team Coordination & Planning Meeting #1 Santiago, 17-19 April 2013. Monitoring: Report of Current Status Tzu-Chiang Shen. Monitoring Status. Insertion into ALMALOG database has been disabled since 24/Nov/2012. (APO-147)
E N D
ALMA Integrated Computing TeamCoordination & Planning Meeting #1Santiago, 17-19 April 2013 Monitoring: Report of Current Status Tzu-Chiang Shen
Monitoring Status • Insertion into ALMALOG database has been disabled since 24/Nov/2012. (APO-147) • This includes: Monitoring data, XML logs, WEATER data. • Monthly data volume is bigger the ASM GROUP4, causing: • Excessive DBA effort dedicate in monthly rotation, daily partition and backup activities • High risk to block operation since TMCDB data is being saved in the same database
Work Around • Monitoring data: • Are being persisted in text file format. • Accessible thought the web: • http://monitordata.osf.alma.cl • XML logs : • Equivalent files (but with DEBUG level logs) are available within STEs in short period. • For long term, files are moved to a storage and accessible through the web: • http://computing-logs.aiv.alma.cl • Weather data: • Storage in local MYSQL database • Accessible through the web • http://weather.aiv.alma.cl/data/data/files/
TMCTextArchiver’s Buffer Internal buffer Enqueue Dequeue Drop
Current Monitoring Data Rate and Data Size • 56 antennas + CentralLO’s devices • Total data rate: ~5000 clobs/s • ~ 89.2 clobs/s/antenna • # Monitor Points per antenna type • DV: 2,179; DA: 2,194; CM: 2,438; PM: 2,474 • Currents Size of Monitor Data per antenna type (daily) • DV: 241MB; DA:246MB; CM:301MB; PM:296MB • Current daily monitoring data size: ~ 20 GB (in ~120k files)
Fluctuation of Monitoring Data • Expected data rate with 66 antennas: • ~ 6000 - 7000 clobs/s ~ 25 - 30 GB/day • ~ equivalent a 310KByte/s or 2,485Mbit/s • Reasons of Fluctuations: • Sampling rate of BACI properties • # of system restarts • Type of observation • FE devices are being monitored only if the specific band has been turned on. • # of hwDevice in hwOperational state.