1 / 16

Efficient Data Archiving System for Jefferson Lab

Explore .CZAR:.Jefferson Lab’s new archiving system featuring relational databases, CORBA, Java, and distributed systems for managing data acquisition and history retrieval. The system includes plug-ins, multi-threading capabilities, and a configuration database for efficient data storage and management. Discover how this scalable system offers fast access and manipulation of historical data.

Download Presentation

Efficient Data Archiving System for Jefferson Lab

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CZAR: Jefferson Lab’s New Archiving System Christopher A. Larrieu Chris Slominski Matthew Bickley

  2. Buzzword Conformance • Relational Database • SQL • CORBA • Java • Distributed System • Multi-threaded CA • Plug-ins • C++, Standard Library

  3. Components • Configuration Database • Data logging engine • Data store • History data server • Configuration management GUI • Engine control / monitor GUI • History data retrieval clients

  4. System Overview Database (MySQL) Embedded SQL DAQ Engine History Server JDBC CORBA Manager GUI XARR CORBA StripTool Filesystem DARTH

  5. Configuration Database • Arbitrary relationships between PVs • administrative groupings • logical groupings • convenience groupings • “Stream” associates PV with DAQ specs • journal configuration changes • multiple, evolving, specs per PV

  6. Data Acquisition Engine • Multi-threaded CA client • runs CA task in separate thread • SQL client • reads configuration information from database • CORBA server • exposes control API • pushes status events into CosEventService • DAQ plug-in modules

  7. DAQ Engine Remote API • Activate / Deactivate streams • Enable / Disable per-stream status events • connect, disconnect, flush • Query stream status • bytes written, when connected, buffer status • Asynchronous status events • engine state, bytes written, CA activity

  8. DAQ Plug-in Modules • Monitor • time threshold • buffer size • maximum buffer age • Trigger • ring buffer size • causal trigger • flush delay

  9. Data Store • Engine • streams raw data to disk • Converter • runs periodically, or on-demand • “cooks” raw data (convert, compress, index) • Database • connection history • data directory • file status

  10. History Data Server • Multi-threaded CORBA server • Accesses data store via CADataStore API • Consults database for data summary • Reads data from file system shared with Engine • Converts spooled but unprocessed data on demand

  11. History Server Remote API • Supplies archive accessor objects to clients • Client initializes range in archive accessor • data summary • first, last point • before, after point • num points • range accessor object • provides data pull interface • serves sequences of data in large chunks • timestamp, value, status, severity, control info

  12. Managment GUI • Java 1.3, Swing, JDBC, CORBA • truly platform-independent • View/manipulate hierarchical ordering • Specify/modify DAQ parameters • Enable/disable streams • Monitor/control engine • flush streams • display engine status • Multiple simultaneous instances

  13. History Clients • XARR • pure CORBA client • command-line tools • specify signal, time range • StripTool • work in progress • New java-based tools

  14. Status • Engine up since January 2002 • ~ 21,000 signals • ~ 2Gb / day raw; ~ 600Mb processed • < 20% CPU (HP B2000, 1GB RAM) • XARR • retrieve 100,000 points: ~ 3 sec. • Management GUI 1.0 99% done • Port to Solaris, Oracle: June

  15. Transportability • Entire system • Engine • History server • Database structure • CADataStore • CORBA API • Modular DAQ plug-ins

  16. Scalability • Data store decoupled from data acquisition • multiple engines can spool data • Database indexes, summarizes data • fast lookup • File system stores data • easily shuffle data offline/online • fast access to binary data • History server • reduce data before transporting

More Related