310 likes | 450 Views
Project Overview. Ted Liu Fermilab Sept. 27 th , 2004 L2 Pulsar upgrade IRR Review. Materials for the review can be found at:. http://hep.uchicago.edu/~thliu/projects/Pulsar/L2_upgrade_meeting.html. Q&A. Are we ready for operation? No Why are we having this review then?
E N D
Project Overview Ted Liu Fermilab Sept. 27th, 2004 L2 Pulsar upgrade IRR Review Materials for the review can be found at: http://hep.uchicago.edu/~thliu/projects/Pulsar/L2_upgrade_meeting.html
Q&A • Are we ready for operation? • No • Why are we having this review then? • To help us getting ready for final operation. To review • what we have achieved so far • our plans for final commissioning • is there anything missing on our to-do-list? • is manpower enough? • Is the schedule realistic? • … • Will have another review early next year
Qutline • Baseline design for operation and overall status • Project Organization: who is working on what • General testing&commissioning methodology • Firmware Overview and necessary improvements • Firmware Version Control • Comments on possible future system optimization • Manpower issues • Summary
Baseline for operation • Pulsar-based system is very flexible • Should only focus on the baseline for operation • single algorithm node • deliver non-SVT and SVT data on separate PCI path • simple algorithm in pre-processor Pulsars • simple data merging • use S32PCI64 cards instead of FILAR … • Will comment on some system flexibility later • further possible improvements/optimization depend on initial operation experience, not required for operation
L1 muon L1 XTRP L1 trigger Muon Merger TS PC L2toTS Cluster SLINK L2 CAL (CLIST/Iso) PreFred SVT SVT Electron ShowMax (RECES) Merger Baseline for operation Pulsar pre-processors • Commissioning strategy: • Use Pulsar Tx Rx in teststand • Split all input signals to Run IIa system, run parasitically with detector and beam
Main tasks for the project • Hardware: custom: Pulsar,mezzanine&AUX, LVDS & fiber active splitter commercial: fiber passive splitter, SLINK mezzanine, SLINK-PCI cards, Linux processors • Firmware: transmitters and receivers for all data paths SLINK merger and L2 to trigger supervisor interfaces about 14 different types of Pulsar firmware • Software: board testing, VME DAQ readout, online/offline monitoring, software for algorithm&control node, interface with rest of world …
Overall Status • Hardware: All custom hardware in hand, and all fully tested except fiber active splitter (see Chris’s talk) need to order two Algorithm PCs soon, one in hand • Firmware: All firmware in place and tested in beam…some needs fine tuning for either performance optimization or robustness more on firmware later • Software: main work left is related to algorithm and control node see later talks for details
Who is working on what (I) • Muon/XTRP/L1 + SVT + Merger + RECES + L2TS • Tx Firmware: Tomi (work done, left) • Rx Firmware: Sakari • Testing: Burkard, Frans (left), Cheng-Ju (L2TS) + Sakari • Commissioning: Burkard + Cheng-Ju • Monitoring for Muon/XTRP/L1/SVT/Merger: Shawn • Monitoring for L2TS: Cheng-Ju • Monitoring for RECES: Chris with Daniel’s help earlier • Active Splitter for RECES: Chris • LVDS Splitter Board: Wojtek (only the main soldiers are shown here)
Who is working on what (II) • Cluster/PreFred • Tx Firmware: Tomi (work done, left) • Rx Firmware: DataIO FPGA: Vadim Control FPGA: Sakari (similar to other cases) • Testing/commissioning/monitoring: Vadim and Wojtek • Algorithm node software: Kristian • Control node software: Daniel (only the main soldiers are shown here)
Who is working on what (III) • General • Pulsar Hardware: Burkard • VME readout software: Jane from DAQ + Burkard’s help • PulsarMon design: Vadim • RunCTRL/TriggerDB etc interfaces: Daniel • Firmware coordination: TL • Commissioning coordination: in the past: TL in the future: Trigger SPLs (See Cheng-Ju’s talk on plan and schedule)
Testing and Commissioning Methodology • Pulsar is self-testable, at both board and system level • This design feature is the key to success • Pulsar Hardware testing all done with self-testing capability • Pulsar firmware testing mostly done with self-testing capability • Software development all done with Pulsar self-testing capability • Even at system commissioning, a few Tx boards are used • This has made the commissioning work much easier • System commissioning relies heavily on input signal splitting • Saved us LOTs of dedicated beam time • So far, only used a few hours dedicated beam time • Input signal splitting has been and will be extremely helpful • Will keep input signals split even after final commissioning • to have a “copy” of the new system running parasitically for hot spares, for firmware/software improvements
Pulsar Firmware: BIG effort • Pulsar has three main FPGAs • L2 decision upgrade:14 different type Pulsars (Tx/Rx) • Key to success: • Design: common re-usable modular design • Simulation: no simulation no testing • Dedicated testing support • Version Control • Careful firmware update procedures …
people 14 Pulsar types/firmware for L2 decision upgrade 081 L2 Pulsar Muon/XTRP Rx IIa083 L2 Pulsar SVT Road Warrior085 L2 Pulsar Muon/XTRP/L1 Tx or SVT XTRP-emu086 L2 Pulsar Muon/XTRP/L1 Rx IIb087 L2 Pulsar RECES Tx088 L2 Pulsar RECES Rx089 L2 Pulsar Cluster/PreFred Tx090 L2 Pulsar Cluster/PreFred Rx091 L2 Pulsar SVT Tx092 L2 Pulsar SVT Rx093 L2 Pulsar Merger Tx094 L2 Pulsar Merger Rx095 L2 Pulsar L2TS “Tx” (reserved spare ID) 096 L2 Pulsar L2TS097 L2 Pulsar L1 Scaler098 L2 Pulsar SVT Track Fitter 099 L2 Pulsar test Tx100 L2 Pulsar test Rx101 L2 Pulsar XFT Stereo Tx102 L2 Pulsar XFT Stereo Rx 103 L2 Pulsar SVT Hit Buffer 104 L2 Pulsar SVT AMSRW Board ID Sakari Tomi Sakari Tomi Sakari Tomi Vadim/Sakari Tomi Sakari Tomi Sakari Sakari Sakari Sakari One Hardware -- many applications other types for SVT/XFT or future improvement
FPGAs and data interfaces Interfaces (Tx/Rx): Muon Reces Cluster Iso Slink Interfaces (Tx/Rx): SVT/XTRP/L1/PreFred TS & SLINK (P3)
General simplified data flow CDF CTRL Mezz interface DataIO algo slink formatter DAQ buffers VME response DAQ buffers CDF CTRL slink formatter merge Same as above DataIO DAQ buffers VME response DAQ buffers XTRP/SVT
Well organized firmware effort is the key:clean/easy-to-understand/robust with minimal maintenance effort DataIO or Control FPGA CDF CTRL Input interface Slink formatter algo Output FIFO DAQ buffers DAQ buffers VME response • all common parts are shared by all firmware • also share many low level routines (latency timer, bunch • counter etc)
Common core firmware (library) muon VME responses SLINK formatter DAQ buffers CDFCtrl responses Mezzanine interfaces Latency timer Bunch counter … SVT track LVDS + ext. FIFO cluster hotlink L1 LVDS Taxi Iso TS showermax Knowing one data path Pulsar knows all Pulsars Data specific algorithm difference is just minor details core firmware developed by people dedicated to the task
SLINK data format Keep it simple 8 4 2 8 8 2 was defined as data size before, which means one has to wait until all data arrives before sending out SLINK package. will remove it, and use it as latency stamp instead. Format version Data source Region ID Reserved Bunch count Buffer # Latency (previous) Latency (current) 16 16 Data elements 16 16 Data size Error flags
Four Slink mezz cards AUX card Slink output Slink inputs SLINK Merger fragments into event package Slink merger output 0xE0F00000 Merger trailer End of fragment Input 4 trailer Slink input 4 data Input 4 header 2 Input 4 header 1 End of fragment Input 3 trailer Slink input 3 data Input 3 header 2 Input 3 header 1 End of fragment Input 2 trailer Slink input 2 data Input 2 header 2 Input 2 header 1 End of fragment Input 1 trailer Slink input 1 data Input 1 header 2 Input 1 header 1 Merger header 2 Merger header 1
32-bit SLINK to 64 bit PCI interface card: S32PCI64 • highly autonomous data reception • 32-bit SLINK, 64-bit PCI bus • 33MHz and 66 MHz PCI clock speed data Slink to PCI mem data Slink to PCI L2 decision CPU PCI to Slink
General comments on Firmware • Firmware is not hard if it just has to be “sort of working” • The hard part is to get it “rock solid” • Firmware is not a place to show how smart one is • rather to show how many mistakes one could make • Firmware testing is as important as writing/simulating • Firmware testing is often much more time consuming than the actual writing • How good the firmware depends on how well it is tested • Our general strategy: firmware developer supported by people dedicated to testing
Files kept • VHDL source code • Files necessary for recreating this version • Readme with revision history • FPGA configuration files the FPGAs • Source code (working version) under CVS control • Firmware archive updated on different PCs
How to keep Firmware in order • We have so many different types of firmware • Keeping them in order is non-trivial: • under CVS control: source VHDL code, pin files etc • Firmware version ID VME register per FPGA (has to update by hand) • README file update • Follow careful firmware update procedures • There has been no confusion about firmware versions when the procedure has been followed
slot 05: IDPROM: 0016 086 PULSAR MUON RX CTRL: 04408060,DataIO1: 08407020, DataIO2: 08407020 slot 13: IDPROM: 0004 092 PULSAR SVT RX CTRL: 05408060, DataIO1: 09407280, DataIO2: 09407280 slot 15: IDPROM: 0028 093 PULSAR SLINK TX CTRL: a0406300, DataIO1: 08407230, DataIO2: 08407230 slot 17: IDPROM: 0061 096 PULSAR TS INTERFACE CTRL: 0a407280, DataIO1: 09408030, DataIO2: 09406040 slot 19: IDPROM: 0012 094 PULSAR SLINK MERGER CTRL: 02408080, DataIO1: 01407220, DataIO2: 01407220 slot 21: IDPROM: 0014 094 PULSAR SLINK MERGER CTRL: 02408080, DataIO1: 01407220, DataIO2: 01407220 Example of Pulsar crate ID&version scan by Burkard on Aug 19th after Beam test: Not all boards are shown due to limited space Allow us to go back to the same configuration after shutdown How can we check these against database when in operation mode?
Necessary firmware optimizationbefore operation • Reces: zero-suppression + latency optimization • Reces data comes into Pulsar 8 us after L1A • current firmware is not optimized • needed to reduce Reces path latency by a few us • Status: in progress, estimated time: ~ one month • Slink formatting: sending data words out upon arrival (not waiting for all data to arrive) • improving Slink merging latency (~ a few us) • estimated time: ~ one month (including testing) • More timing measurements • add latency timer in a few other places (relatively easy) • Cluster path improvement (see Vadim’s talk)
Future possible system optimization after operation • Use FILAR cards instead of S32PCI64 • Remove the final merger, data directly goes into FILAR • Possibly also remove Reces merger with another FILAR • Also allow us to run SLINK faster: 40 MHz ~60 MHz • Should reduce the latency by ~ few us • Need PCI software modification (no firmware change) • Move L1 scaler into firmware if needed • Move to 4 PCs instead of single PC (how much do we gain?) • Additional firmware (L2TS) work and some software work • Muon and track matching in firmware • Bring upgrade XFT stereo info into L2 (this will happen) • …much more, depends on what we learn&need by then
FILAR can improve the system latency Higher bandwidth, less latency overhead 64-bit/66MHz PCI bus, four 2 Gbit/s S-LINK channels Reduced PCI protocol overhead (compare to S32PCI64)
System with FILARs Pulsar pre-processors L1 muon L1 XTRP L1 trigger Muon Merger TS PC L2toTS Cluster SLINK L2 CAL (CLIST/Iso) PreFred SVT Electron Send all data paths directly into 2 FILARs… ShowMax (RECES) Merger SVT
Manpower Issues • Why should we worry? • Tomi and Frans left (two key people on firmware/testing) • Only10% Burkard’s time available from now on (key person on hardware/VME software/firmware testing/system commissioning need replacement SOON) • Kristian needs to graduate … (key person on algorithm/PCI software need replacement SOON …) • Need new L2 software coordinator (Daniel?) … • anyone else will be full time on the project? • This is very serious …need this committee to help!
Summary • Review should focus on baseline for operation • Keep in mind that system is flexible • Hardware is in good shape (still need to fully test the active splitters) • Firmware is in reasonable shape, needs fine tuning/testing in some cases key is to follow our testing methodology • Software needs more work, no show stoppers clear spec is the key • Manpower issues serious (in a few areas): need new people! • Details are in the rest of talks …