440 likes | 589 Views
DT Upgrade overview and Phase 2 discussion. C. F. Bedoya. CMS Week - April 9th 2013. C. F. Bedoya April 9th , 2013. 2. DT Upgrade Overview. LS1 projects: * Theta TRB: (talk from Fabio). * SC relocation (next slides) Phase 1 Trigger : * L1 TDR finalized and now under review.
E N D
DT Upgrade overview and Phase 2 discussion C. F. Bedoya CMS Week - April 9th 2013
C. F. BedoyaApril 9th, 2013 2 DT Upgrade Overview • LS1 projects: • * Theta TRB: (talk from Fabio). • * SC relocation (next slides) • Phase 1 Trigger : • * L1 TDR finalized and now under review. • Goal is to make a unified • trigger system both in HW and • allow to combine information • from different detectors • at the earlier possible time • * Twin Mux board: (pre-DTTF board) • A. Triossi started tests of reading trigger inputs with Avago+FPGA • * Algorithms: DT+ RPC algorithms? Manpower more or less identified. Some general approaches established. We need to start performing the actual work. • Phase 2: • * Mixing Tracker + DT (or other subdetector) info studies moved to Tracking Trigger working group (CMS general, not subdetector based) • * New inputs from CMS regarding L1A parameters affect our electronics (this talk and next one)
READOUT -ROS TRIGGER -TSC COPPER OPTICAL C. F. BedoyaApril 9th, 2013 3 SC relocation Madrid Aachen Padova Torino Bologna
C. F. BedoyaApril 9th, 2013 4 SC relocation Many important milestones achieved but also a lot of work ahead us. Since last report (oct 2012): - New CUOF produced (attenuation pad that allowed larger margins for unbalanced data) - December tests at P5: 3 sectors in TRG+RO operated satisfactorily with CUOF+OFCU chain - ESR (Electronics System Review) on Feb 6th 2013. Went well. - February tests at P5: 72 TRG links tested satisfactorily with different trigger rates - Motherboard first boards production: boards in hands, firmware being finalized - First fanouts received: good results - Final order for fibers placed: all RO fibers delivered - Aachen crate prototype received and tested. New backplane prototype expected soon. - Slow control OCB and ECB first tests at 904 ok. - CUOF+OFCU-TRG temperature tests satisfactory results - Half of the Wiener crates ordered (delivery next week?) - Final VME_patch and TIMBUS-OFCU boards under assembly: tests ok - Alignment modules moved from X2 Near - DT-DSS relocation in USC underway - Final order of Pattern Units partly placed
C. F. BedoyaApril 9th, 2013 5 SC relocation • Next steps: • - Final motherboard tests (missing firmware) • - CUOF cooling test (missing motherboard) • - CUOF slow control tests at 904 (idem) • - Final integration tests in 904 with Minicrate to ROS and TSC chain. • - New OCB to be produced (waiting slow control tests). • - New backplane from Aachen expected soon • - OFCU-TRG slow control tests • - Final purchase of OFfanouts • - Fibers installation • - LV rack refurbishment (waiting for CUOF consumption) • - Place final order of Wiener crates • - Final orders of CUOF, motherboards, Slow control, OFCU-RO, TSCrear, OFCU-TRG.... • - IRR (Installation Readiness Review) 3 months prior installation (oct), i.e. June.... However, there is still one point I don´t think we have yet totally solved : Would we go ahead with full trigger relocation as it is? Deciding on some pre-production tests. Partially done at Torino with doubts... what else??
DT LS1 upgrade project milestones 2012 June -- new TRBs and CuOF prototypes tested for radiation tolerance Jun 2012 2012 June -- DT system Electronics Change Review 2012 Nov -- Installation plan review/ milestones reassessment 2013 March -- Fiber trunk cables ready for installation 2013 May -- new TRB theta ready for installation 2013 July -- CuOF-OFCu system ready for installation in one test wheel (YB-1?) 2013 Oct – Complete CuOF-OFCu system ready for installation in five wheels
C. F. BedoyaApril 9th, 2013 7 CMS Phase 2 TPSWG Interim report from the TPSWG (Trigger Performance Strategy Working Group): https://cms-docdb.cern.ch/cgi-bin/DocDB/RetrieveFile?docid=11688&version=1&filename=TPS-Interim-Report-v20.pdf TPSWG charges: to establish a plan for CMS triggering options during Phase 2. Even if you think you will be retired by 2022, think of what you want to do until then... (R&D takes years)
C. F. BedoyaApril 9th, 2013 8 CMS Phase 2 TPSWG • TPSWG charges: to establish a plan for CMS triggering options during Phase 2. • L1 Tracking trigger • Bandwidth increase (L1A latency 20 us and rate 1 MHz) • * More latency is desirable for implementing tracking trigger algorithms • * Rate increase of hadron (and photon) triggers will not be reduced with tracking trigger (need more than 100 kHz) • ROBs may be able to stand 20 us latency, but 1 MHz trigger rate is basically impossible. It could be possible in a fraction of them, mostly if we run the TDCs in continuous mode. • What do we do? • 1 MHz & 20 us => 1-2 years & 5 MCHF ? Probably more... Evaluate • Studies so far compiled in this document, you are welcomed to review it: • https://twiki.cern.ch/twiki/pub/CMS/DtUpgrade/subsystemarch_DT.docx AND
Other intermediate points that alleviate the cost and time of needed intervention should be given (if they exist) We are being asked this C. F. BedoyaApril 9th, 2013
C. F. Bedoya December 11th, 2012 10 CMS DT Phase 2 * L1A rate and latency will impact only readout chain * ROS will be redesigned between LS1 and LS2. * Will try to incorporate DDU functionality to remove further bottlenecks * i.e. impact is on the ROB • Higher LHC luminosity will imply larger occupancies in the DT chambers, which will turn into a higher hit frequency. The consequences of this higher occupancy are the following: • -increase buffers occupancy at the different levels • -slow down the time to process an event (reduce processing speed) • -increase the required bandwidth for the output link • Increase of trigger rate and/or trigger latency only makes this scenario even harder.
When can replacement happen? • Big constraint: large time needed to perform any intervention in our detector. • Very unlikely that we get access time for all of our needs: • LS1.5?: new pixel • LS2: HCAL, GEMs?, GRPC? • LS3: New Tracker, New ECAL FE • Whatever we invent, will likely need to coexist with present electronics • but at the same time, the architecture could be completely different if 20 us latency is granted... C. F. BedoyaApril 9th, 2013 11
C. F. BedoyaApril 9th, 2013 12 Phase 2: discussion • PHASE 2 • We need to come with a general plan for DT by the CMS Upgrade week in June (DESY) • Running until 2030 • 3000 fb-1 by 2030 • InstLumi of 7-8 e34 • After LS3 (2030) Level1 trigger 1 MHz rate and 20 us latency ? • Will chambers survive that? Do we want to make any special study on them? • Maybe not the full chambers, but some items? PADCs? FEBs? • Minicrates? • In reality probably the power supplies and the cooling are the point of major concern for the long reliability of the system... (though not much design we can do on that?...)
C. F. BedoyaApril 9th, 2013 13 Phase 2: Minicrates discussion
C. F. BedoyaApril 9th, 2013 14 A possible approach.... We can digitize every hit, assign it a time stamp (related to BC0?) and send it outside the Minicrate to USC With the time digital word, one should be able to recreate the trigger primitives Each time measurement will contain: tBX + tdrift + offsets...
C. F. BedoyaApril 9th, 2013 15 A possible approach.... This hits will be stored in memories until a L1A arrives and an event matching is performed (such as what ROB does). UXC USC
C. F. BedoyaApril 9th, 2013 16 WORKING GROUPS Minicrate mechanics, integration and interfaces: what is easier: replace full Aluminum structure or inner parts? How long it takes to perform the replacement? How can we install new fibers from MC to racks? Power dissipation Electronics design, time digitization: R&D of time digitization based in FPGAs (ACTEL not feasible 350 MHz max, Xilinx?) New techniques for scrubbing, etc and other rad tolerance actions (reloading firmware with TTC comands, etc) Delay lines or DESER approach? Power consumption How many channels per FPGA? (cost) Trigger algorithms and offsite electronics: Starting with time digital information: extract correct bunch crossing, obtain trigger primitives (meantimer? others?) Implement algorithms in FPGAs and interconnect with present trigger electronics READOUT: Perform event matching, etc Which is the latency required? can we survive with a mixed scheme in which we replace only a fraction of the minicrates?
C. F. BedoyaApril 9th, 2013 17 Phase 2: Minicrates discussion With our time resolution requirement, we should focus in FPGA implementations
C. F. BedoyaApril 9th, 2013 18 TDC options in FPGAs * Current consumption should also be a factor to take into account for the design, we could be talking of few Amps/board very easily (Critical point in my opinion) * Radiation tolerance is the other constrain * Chambers resolution aprox 4 ns * HPTDC working in 0.781 ns time bin (Actually signal integrity degrades this resolution to aprox 1 ns in some channels) * FPGA: There are basically two approaches to reach this resolutions: Deserializer option (+ coarse counter) Delay lines (+ coarse counter)
C. F. BedoyaApril 9th, 2013 19 TDC options in FPGA Using FPGA gates as delay elements. High resolution can be achieved (picoseconds) Dependencies on placement (and thus firmware version), on temperature, on power supply...
C. F. BedoyaApril 9th, 2013 20 TDC options in FPGA * Deserializer option: encoding hit as a input serial stream. It can be done with high speed deserializers (GTX) or with lower speed ones, which depend on clock speed) * 1 ns => 1 GHz clock speed (this is well within today´s reach) * GTX are for 11 Gbps and more * Number of GTX/FPGA is somehow limited, but with SERDES one can think of achieving large number of channels per chip 128? one ROB?) hit coarse clock deser coarse time coarse counter time digital word serdes fine time encoder high speed clock
C. F. BedoyaApril 9th, 2013 21 TDC options in FPGA Baseline approach: 128 channels /FPGA (one FPGA== one ROB) => 6-7 FPGAs/Minicrate => 2 -3 boards max/Minicrate? => probably one or two MTP optical fiber cords per Minicrate (24 links) At 10 times more luminosity, required output bandwidth could be 200 Mbps/ROB (30 kHz non correlated hits, 114 kHz tracks/ROB) 11.6 Gbps output link allows to transmit basically 9 time measurements/BX => no latency impact
C. F. BedoyaApril 9th, 2013 22 Phase 2: Minicrates discussion size of this connectors....
C. F. BedoyaApril 9th, 2013 23 Phase 2: Minicrates discussion How to obtain the trigger primitives with BC0 related digital time measurements? to be continued.......
C. F. BedoyaApril 9th, 2013 25 Phase 2: Minicrates discussion
C. F. BedoyaDecember 11th, 2012 27 CMS Phase 2 • TSWG (Trigger Strategy Working Group) • CMS is discussing the possibility of operating in Phase 2 with: • - 1 MHz L1A rate (instead of 100 kHz) • - 20 us latency (instead of 6 us as previously assumed. Now 3.2 us) • We are asked to provide statements about the implications in our subsystem. • Two meetings already took place: • - https://indico.cern.ch/conferenceDisplay.py?confId=195153 • - https://indico.cern.ch/conferenceDisplay.py?confId=200590 • It appeared unlikely because it meant replace all ECAL electronics, but now this is not considered so difficult. • (Wouldthis mean that tracking triggerisnot so mandatoryifwe can increasethetriggerrateto 1 Mhz? notclear)
C. F. Bedoya December 11th, 2012 28 CMS DT Phase 2 * L1A rate and latency will impact only readout chain * ROS will be redesigned between LS1 and LS2. * Will try to incorporate DDU functionality to remove further bottlenecks * i.e. impact is on the ROB • Higher LHC luminosity will imply larger occupancies in the DT chambers, which will turn into a higher hit frequency. The consequences of this higher occupancy are the following: • -increase buffers occupancy at the different levels • -slow down the time to process an event (reduce processing speed) • -increase the required bandwidth for the output link • Increase of trigger rate and/or trigger latency only makes this scenario even harder.
C. F. BedoyaDecember 11th, 2012 29 Background estimations Rate outside the trains, no muons included (basically neutrons) Big difference between chambers,: - YB-2 S4 MB4 27 kHz/TDC channel - YB-/+2 MB1s 12 kHz/TDC channel Accounting inside trains may increase 3.6 kHz in MB1 and 0.3 kHz in MB4 (¿?) 1035 From Gianni CMS week June 1035
C. F. BedoyaDecember 11th, 2012 30 Background estimations Gianni CMS week June * Upper sectors MB4 * Leaks between barrel and endcap in MB1s
C. F. BedoyaDecember 11th, 2012 31 Muon rate estimations Phi rates • YB-/+2 MB1s at 1035 : • 350 kHz “muon” rate (per chamber) • 114 kHz “muon” rate per ROB • 29 kHz “muon” rate per TDC This are chamber rates (DTTF input counters) From I. Redondo
C. F. BedoyaDecember 11th, 2012 32 Muon rate estimations eta From I. Redondo
C. F. BedoyaDecember 11th, 2012 33 Impact on the ROB • Hit rate 27 kHz => 37 us between hits. Since there are: • 128 channels/ROB, => one hit per ROB each 0.29 us(3.5 hits/ROB/event) • 32 channels/TDC, => one hit per TDC each 1.2 us. • Including muons, adds 8 hits per L1buffer FIFO(or more): • Worst muon rate 350 kHz => 29kHz/TDC => 1 muon every 35 us/TDC Hits stored in L1buffer until the L1A arrives. So if the L1A latency was 20 us, this means we will have to store max 1 hit/channel, i.e. 32 hits/TDC until the L1A arrive+ 8 hits/TDC from muons until the L1A arrive This represents 12 hits per L1buffer FIFO/event, while they are 256 positions in the FIFO. It seems it should be possible to work with 20 us latency in phase 2 LATENCY impact => L1 buffers (slower matching?)
C. F. BedoyaDecember 11th, 2012 34 Impact on the ROB L1A rate impact -Readout FIFO -Event processing speed -Output link bandwidth byte-wise mode at 20 MHz Header,trailer, hits = 32 bit words It takes 8 BXs to send each hit (minimum) + 16 BXs header+ trailer+ 8 BXs token sharing + seems there is more... (next slide) Time window is approximately 1 us, and we will have 1 hit/TDC each 1.2 us, that is the number of hits per event that are expected to be stored in the Readout FIFO But the limiting factor is the speed at which you output the data
C. F. BedoyaDecember 11th, 2012 35 ROB Trigger rate test Test perform on a ROB at lab with fixed trigger rate: No muons <- 3.5 hits/ROB Max 500 kHz... probably optimistic If one includes the worst case muon rate (350 kHz/chamber => 1 muon/ROB every 9 us) => MAX 300 kHz (one could reduce the time window, increase the threshold or start being inefficient...)
C. F. BedoyaDecember 11th, 2012 36 Options without replacing ROB? • trigger matching: reduces the payload. (100 kHz L1A rate => one L1A each 10 us with a sampling window of 1 us => we read 10% of the time) • At 1 MHz L1A rate => we will be reading 100% of the data, thus trigger matching is an overhead of processing time and bandwidth (headers and trailers) • In principle, it is possible to run the HPTDCs in continuous mode: • - hits sent as arrive from chambers, • - time measurement referred to BC0 • Latency and L1A rate becomes irrelevant for the ROB • Trigger matching in the new ROS may be possible (needs study and will imply detailed calibration of the different absolute BXs ID between the ROBs and the ROS). • It may open possibility to use TDC data in the trigger (?)... (output delay not deterministic and long...) • Limiting factor in this mode of operation is the bandwidth of the readout link: • 27 kHz * 128 ch * 32 bits/hit = 110 Mbps • With muons (350 kHz/chamber => 114 kHz/ROB of 8 hits muons => 114 Mbps • Max bandwidth is 160 Mbps => hit rate 39 Hz/cm2 (to be checked) MB4s are close... • This may work depending on the uncertainty of the background. If it is large, then we start o be inefficient
C. F. BedoyaDecember 11th, 2012 37 CONCLUSIONS • We don´t expect any problem for running present ROBs design with Phase 2 occupancies and 100 kHz L1A rate and 6 us L1A latency • With the background rates expected in Phase 2, it is unlikely that we could run at a trigger rate higher than 500 kHz. • Latencies of 20 us could likely be achieved, although proper testing should be done. • HPTDCs could be operated in continuous mode, which should allow operation with 20 us latency and 1 MHz trigger rate. However, this means no background reduction is possible playing with the time window (no filtering is done, all hits are sent). Therefore, we could start to loose efficiency for background rates larger than 39 Hz/cm2. (This number should also be checked). MB4 rates are too close to this number (27 Hz/cm2for YB-2 S4 MB4). • Any action to place a shielding in the MB4s of the upper sectors or in the MB1s of the external wheels will reduce any possible efficiency drop. • Considering the large uncertainties (energy, extrapolation, target luminosity, etc), even if the MB4s can be shielded, replacing the ROBs of the MB1s external wheels with a higher performance board, should also not be discarded • In order to operate in continuous mode, a DC balancing is mandatory at CUOF level.
C. F. BedoyaApril 9th, 2013 38 Phase 2: Minicrates discussion
Width of the board is fixed by the Minicrate width (aprox 9.5 cm) Length is variable, limitation are the input connectors which are 5.5 cm long Number of boards Otherwise, interface board with high density connectors This will allow higher density integration (up to 1 board/Minicrate? do we want this?)
C. F. BedoyaApril 9th, 2013 40 Some comments • Simulations of the impact of inefficiency in the detector in the physics analysis do not exist. A muon joint effort would make sense on this. • Probably even less if one refers to simulation of the electronics (readout), and this is one of the main points we are asked.... • In reality probably the power supplies and the cooling are the point of major concern for the long reliability of the system... (though not much design we can do on that?...) • But for custom electronics we need time for brainstorming and for putting together the manpower and the budget (different timescales for different institutes, etc), so discussions on what can be done should take place now. • Different L1A parameters may imply some challenge in the electronics.. it may be more interesting that to rebuilding the same wheel... • Or maybe what we want to think is in replacing subdetectors technology.. but I doubt that.
AND maybe not so much Pixel trigger? (or 25 us...) C. F. BedoyaApril 9th, 2013
Present knowledge ECAL: limit is 150 kHz & 6.4 us (maybe 300 kHz?). > => 26 months shutdown, 10 MCHF Tracker: very much in favor of increasing latency (20 us) if rate increased, happy with 500 kHz, 1 MHz not impossible HCAL: changes limited to USC, does not seem a major constraint at present. RPC: Probably changed by LS2. All in USC. Any increase L1A param => 100-150 keuros (84 RMBs, 3 FEDs) CSC: Good at increased L1A rate, bad at increased latency >300 kHz & 6.4 us => 4 months & 12 MCHF. 500 kHz possible at all? which implications Is it worth changing just a fraction of the CFEBs? Do you expect any longevity issue that would make worth a re-desing independently from this? 1 MHz & 20 us => 4 months & 12 MCHF ? Is that true? DT: Opposite to CSCs... latency should be OK, rate overkilling 300 kHz & 20 us => OK (to check) 500 kHz & 20 us => YB-/+2 MB1 and MB4s? 6 months?, cost? 1 MCHF? Evaluate Possibility open that we need to replace Minicrates either way.. maybe not all in one goal, but the R&D should be launched soon 1 MHz & 20 us => 1-2 years & 5 MCHF ? Probably more... Evaluate C. F. BedoyaApril 9th, 2013
C. F. BedoyaApril 9th, 2013 43 TIME Measurement
C. F. BedoyaApril 9th, 2013 44 WORKING GROUPS