170 likes | 270 Views
Online Overview. L. Coney – UCR MICE CM35 – Feb 2013. Also in This Session:. Controls & Monitoring – Pierrick DAQ – Yordan Online MAUS – Alex Richards MICE Computing – Chris Rogers Will leave the specifics to them.. Note: Online focus here – Operations focus tomorrow.
E N D
Online Overview L. Coney – UCR MICE CM35 – Feb 2013
Also in This Session: • Controls & Monitoring – Pierrick • DAQ – Yordan • Online MAUS – Alex Richards • MICE Computing – Chris Rogers • Will leave the specifics to them.. • Note: Online focus here – Operations focus tomorrow Coney - CM35 - Feb 2013
Since CM34 in October • Significant events in the 4 months since the October CM: • December run • Christmas break – Failure of AC in PPD computing area – downtime of PPD-hosted MICE services/computing • Spectrometer Solenoid Controls Review • Restart of SS2 cooldown/testing • Activation run – Wednesday (13 Feb) Coney - CM35 - Feb 2013
December Run • Did not go smoothly • Similar to October run issues • Problems with DAQ • Worked initially but problems developed • Unable to solve remotely • Problems with C&M • HV control applications • Run Control • Problems with Online Monitoring – gone • Prompted evaluation of reliability within Online Systems • Need develop more robust pre-run procedures for DAQ, C&M, Online Reconstruction • Need DAQ pulser trigger • Need higher priority on documentation • More than one person to solve problems – must do better on handover of information Coney - CM35 - Feb 2013
December Run • Need fake data (more than cosmics) – test full chain of DAQ, unpacker, Online Reco • fake signals from TOFs? – no LED system ‘yet’ • Tracker will have LEDs • Emphasized the need for additional expertise • Personnel changes: • New network village manager – Chris Brew (RAL) • New RAL network liaison – Antony Wilson (RAL) • New DAQ deputy – David Adey (FNAL) • New Online Monitoring owner – Rhys Gardner (Brunel grad student) • New C&M deputy – ??????????? Coney - CM35 - Feb 2013
PPD Outage • PPD-hosted computing services loss over holiday • Loss of access to configuration database (CDB) prevented software development by US MICE • Loss of access to micemine • Highlighted confusion regarding these services and how they relate to MICE activities • Including Online Group, Data-taking, FC testing, SS testing • Prompted review of service loss – see Chris’ talk • Motivated improvement in computing documentation across the board – in PPD, on micenet, hardware and services • Micenet: http://micewww.pp.rl.ac.uk/projects/computing-software/wiki/Micenet_Computers • PPD: http://micewww.pp.rl.ac.uk/projects/computing-software/wiki/Computing_infrastructure Coney - CM35 - Feb 2013
Since CM34 cont’d • Spectrometer Solenoid Controls Review • See Pierrick’s talk • Good feedback g improvements to the system • Led to significant changes in priorities in C&M • Knock-on effect on non-SS C&M work • Restart of SS2 magnet training • See Pierrick’s talk • Activation Run • Even with beam only to DSA – still an exercise of Online Systems (DAQ, C&M, Online Reco) • Went much better than December Run Coney - CM35 - Feb 2013
Overall Online • Completed • Automated operating system updates including a MOM-accessible OFF switch for data-taking (stability/performance) • Spare hardware now organized in R9 (reliability) • Computing documentation agreed go on micemine (ease of operations/record keeping) • Installation of additional UPSs (reliability) • In progress • Finalize monitoring of new UPS units (reliability/stability) • Installation of new Online Reconstruction machines (reliability/stability/performance) • NOTE: Infrastructure largely in hand – consistently good effort by Matt and Antony Coney - CM35 - Feb 2013
Overall Online2 • Delayed • Installation of new iocpc1 – 21 Dec 2012 (related to C&M reliability) • Delayed – requires Pierrick to be at RAL and to coordinate with Matt – last visit December Run • Pierrick priority for trip was work on SS controls • Ran out of time • Computer monitoring info into Alarm Handler – 1 Feb 2013 (stability) • Requires Pierrick • Pierrick priority is SS controls • Restriction of access to micenet – 1 Feb 2013 • The link between Online Group, Software Group, and overall Computing has been weak • Confusion regarding services, ownership, etc. • Recent work to strengthen this area Coney - CM35 - Feb 2013
MICE must be able to take data without connection to services or computing external to micenet Strengths Use primary CDB – located in MLCR Can store data in MLCR for days (so far none deleted from local storage although is all on GRID) Use local EPICS, Alarm Handler, and Archiver for C&M DAQ local Online Reco uses local MAUS installation Weaknesses Elog – hosted on PPD – useful but not critical Micemine – hosted on PPD – run plans, documentation, etc. – useful but not critical Data g GRID (see above) External expert access – Archiver, EPICS gateway, mousehole – convenience rather than necessity Conclusions Current arrangement of services works well MICE data-taking not at risk Online Systems 1 Other Computing Coney - CM35 - Feb 2013
Online Activities • Resurrected remote readout of neutron monitor – Ian • Updated/improved Online Reco plots • TOFs – Durga • CKOVS – Gene • Online software – Alex • Major developments in C&M • Fully developed Focus Coil testing • Micenet to R9 (thanks Antony!) g running as if in MLCR • Transparent move (from Online Group perspective) to MICE Hall • C&M focused on Spectrometer Solenoid g changes in priorities • C&M milestones • Implement SS state machine – 1 Feb – see Pierrick’s talk • Full SS2 C&M – 1 Feb – see Pierrick’s talk Coney - CM35 - Feb 2013
C&M Milestones Shifting • Complete Run Control for Step I elements – 21 Dec g 1 March • Depends on data-taking – aimed at completion during December Run • Depends on Pierrick available – busy on Spectrometer Solenoid work • Rack room environment monitoring plan – 1 Jan g 1 Feb g ? • Require Pierrick – Pierrick busy on SS controls • Complete HV control user manual – 2 Feb g 1 May • Requires Pierrick to write documentation – Pierrick busy on SS controls – documentation suffers – trickledown effect on Operations • Complete Run Control manual and shifter guide – 1 Jan g 15 April • Requires Pierrick to write documentation – Pierrick busy on SS controls – documentation suffers – again, affects Operations Coney - CM35 - Feb 2013
C&M Milestones cont’d • Bench test pneumatic proton absorber controls – 1 April g 1 May • Requires Pierrick – busy on SS controls • Install pneumatic proton absorber controls – 15 April g maybe 1 June • Requires Pierrick – busy • Requires ISIS shutdown: April 1-28 or June 17-30 • Implement DS state-based Alarm Handler – 1 Feb g 15 June • Priority for first state-based system shifted from DS to SS • New HV control – 15 Jan g 1 May • Requires Pierrick & slow communication with CAEN – Pierrick busy • FC C&M review – 15 Feb • Depends on Pierrick availability and travel capability – not happening today • WE NEED ANOTHER C&M PERSON! Coney - CM35 - Feb 2013
Online Reco • Improved TOF & CKOV online plots – 21 Dec 2012 – DONE • Requires data-taking with beam to completely vet new plots – therefore tied into December Run • Install new online reco machines – 1 Feb 2013 • Likely to finish roughly on time – required new order by UniGeneve – installation now during Yordan’s MOM term • KL online plots – 15 April 2013 g 1 June 2013 • No real online plot participation by KL group • Anticipate possible KL plots available after PID paper done? • EMR online plots – 15 April 2013 g 1 June 2013 • EMR schedule slipped – online plots follow hardware completion • NOTE: KL online plots is a guess – no participation = no plots = no info Coney - CM35 - Feb 2013
Finish SS2 and SS1 fully developed C&M SS state machine completed Finish FC – move to Hall Documentation – needs higher priority Improve stability & reliability Complete installation of iocpc1 Complete computer monitoring Stabilize C&M applications Formalize development vs production C&M Create comprehensive pre-run test plan for DAQ and C&M Simplify Production version Run Control Better limits w/in Alarm Handler Automate datamover Improve functionality Incorporate EMR into DAQ Beam test EMR Bring back Online Monitoring EMR online plots Online analysis prototype New C&M deputy. By June CM36… Coney - CM35 - Feb 2013
Then….MICE Step IV • Both Spectrometer Solenoids – both Trackers – one FC and an Absorber module.. • Requires robust Online systems – DAQ, C&M, Online Reco & Analysis, Infrastructure Coney - CM35 - Feb 2013