290 likes | 449 Views
October 28, 2010 NCC Seminar, Goyang , Korea. Belle II Data Handling System. Kihyeon Cho * (on behalf of the Belle II Computing Group) *Presenter High Energy Physics Team KISTI (Korea Institute of Science and Technology Information). Contents. High Energy Physics Team @ KISTI
E N D
October 28, 2010 NCC Seminar, Goyang, Korea Belle II Data Handling System Kihyeon Cho* (on behalf of the Belle II Computing Group) *Presenter High Energy Physics Team KISTI (Korea Institute of Science and Technology Information)
Contents • High Energy Physics Team @ KISTI • Belle II Experiment • Belle II Data Handling System • Meta-data system • Data cache system • To test Large Scale Data Handling • With Belle Data • With Belle II Data (Random data) • The interaction between HLT and Storage • Summary
HighEnergy Physics Team @ KISTI • High Energy Physics Team (Physicists) • Kihyeon Cho – P.I. • Soo-hyeonNam - Senior Researcher • Junghyun Kim - Senior Researcher • YoungJinKim - Postdoc • TaegilBae - Postdoc • JihyeMoon - Researcher • Former members • YongseokOh – Prof. of Physics @ KNU • Daejung Kong - Contract Prof. @ KNU • Ilsung Cho - Contract Prof. @ KyungHeeU. • Minho Jeung - Company • Hyunwoo Kim - Computing Team @ KISTI
HighEnergy Physics Team @ KISTI To probe the Standard Model and search for New Physics CDF Belle/Belle II cf. LHCb K.Cho and H.W.Kim, JKPS (2009)
국제 공동연구 CDF*@FNAL, USA IN2P3@France 한•불 입자물리연구소 RHIC@BNL, USA LHC@CERN,Europe e-Science @KISTI Belle*/Belle II* @ KEK, Japan *Collaboration • 한불입자물리연구소CDF 그룹 한국 파트너 (조기현) • Belle II Data Handling 워킹그룹장(조기현)
High Energy Physics in the LHC era LHC Energy frontier experiments LHC, ILC, … Higgs, SUSY, Dark matter, New understanding of space-time… ILC KEKB upgrade New particles and new interactions Three approaches to New Physics n exp., m LFV, t LFV, gm-2, … Lepton physics Quark flavor physics Super-B Factory, K exp., etc. Neutrino mixing/masses, Lepton number non-conservation… CP asymmetry, Baryogenesis, Left-right symmetry, New sources of flavor mixing… J-PARC M. Yamakuchi, Belle II meeting (2008)
Belle vs. Belle II Belle II To handle 50 times more data and to use grids ⇒ New data handling system Belle
Luminosity vs. Physics Major achievements Expected at Super B Factory Start 2012 => 2014
Belle II Time schedule Hara San
Count down • 2014 April – Collider turn on • 2012February – Ready for Computing • 2011 Summer – Rough final system • 2010 Summer – Prototype system • Spec & Test • 2010.5 PAC – Answers for questions • Computing & Pixel
Belle II Computing => To handle data, we need new data handling system.
Belle II Data Handling Group(Chair: Kihyeon Cho) • KISTI • HEP Team – JungHyun Kim, Kihyeon Cho, YoungJin Kim, … • AMGA Team – Soonwook Hwang, Sunil Ahn, HanGi Kim, Tae-sang Huh, … • Melbourne • Tom Fifield, Martin Sevior, … • Krokow • MaciejSitarz, MitoszZdybat, RafatGrzymkowski, Henryk Polka, … • KEK, Karlsruhe, etc…
Belle II Data Handling Group meeting First and Third Thursday 5:00 PM (KEK Time)
Belle II computing model Raw Data Storage And Processing Tape Raw Data KEK Grid Site Grid Site CPU mDST Data Disk mDST MC Ntuples AMGA Client gbast2 UI Data Tools DIRAC Data Tools Data Tools MC Production (optional) Data Tools MC Production AndNtuple Production Cloud Ntuple Analysis Local Resources Local Resources Local Resources Local Resources
Data Handling Outlines DIRAC plan KEK Grid sites
Belle II metadata system To construct the DH system for Belle II experiment • To improve the scalability and performance • To run based on grid farm • ⇒AMGA (ArdaMetadata Catalog for Grid Application) Data Cache AMGA DIRAC DIRAC
Belle II data cache system • We make the simple data tool • which is not based on database. Event-driven meta-data catalog ⇒ Condition-driven meta-data catalog
Large Scale data DH test with Belle Data • We perform searching for the interesting files with a table of meta-system and changing number of parallel processing. • The linearity of search is stable up to 50 parallel simultaneous processing. Output Input • # of files: 2013 files • # of events: 12 M events • # of luminosity: 5792 pb-1 • What queries? • - run #, exp#, stream#...
Large Scale data DH test with Belle II Data (Random generating) • Input: 70,000 files (140TB) • The linearity of search is stable up to 50 parallel simultaneous processing. • It is almost same between using a table and using 30 multi-tables. • With a table and multi-processing • Generating time: 400 files/sec • With 30 multi-tables and multi-processing • Generating time: 400 files/sec
The interface between HLT and Storage => To apply AMGA 30kHz Raw Data Storage And Processing KEK Grid Site Grid Site Tape Raw Data CPU • We assume two files/sec for both reading and writing for AMGA. • Read-write optimization for meta-data • Generating for writing only 400 files/sec • To test reading performance for 1Hz, 2Hz, 10Hz, 50Hz and 100 Hz Detector mDST Data Disk mDST MC 6kHz DAQ Ntuples AMGA Client HLT gbast2 UI Data Tools LFC AMGA DIRAC Data Tools Data Tools MC Production (optional) Data Tools MC Production AndNtuple Production Cloud Ntuple Analysis Local Resources Local Resources Local Resources Local Resources
Products • Conference talks • The Advanced Data Searching System with AMGA at the Belle II Experiment • J. H. Kim • CCP2009, Gaushung, Taiwan, Dec., 2009 • Data Cache System at Belle II experiment • K. Cho • CCP2010, Trondheim, Norway, June 2010 • Belle II Data Management system • K. Cho • CHEP 2010, Taipei, Taiwan, Nov. 2010
대표 성과 • 우주 기원 밝히는 국제실험 주도 그 연구 성과 SCI 논문 출판 • 9개국 12개 연구기관의 30여명의 Belle II DH 워킹그룹장(조기현) 수행 • KISTI 주도로 시스템 개발 • 그 연구성과 김정현(제1저자), 조기현 (교신저자)로 해외 SCI 저널 Computer physics Communication(IF: 1.958)에 출판 • 국제공동연구 참여 => 주도 => 연구성과 도출 • 실간의 협력 서비스 파이프 라인 => 공동저자 등재 • AMGA 관련 기반기술 개발실 (안선일, 황순욱) • 팜 관련 글로벌허브센터 (김법균, 유진승, 윤희준, 장행진) S. Ahn, K.Cho, S.W.Hwang, J.Kim* et al, JKPS 56 (2010.10.15)
Belle II DH Group – Daeduk net (2010.4.6) http://www.hellodd.com
The 3rd Belle II Computing Workshop • Date: November 22-24 (Mon-Wed.) 2010 • 1st day (Monday): Off-line and HLT software • 2nd day (Tuesday): Distributed Computing and Data Handling • 3rd day (Wednesday) • Morning – Overflow • Afternoon- Excursion • Place: KISTI, Daejeon, Korea • Asa chair of Belle II DH group, HEP team@ KISTI hosts • Just after Belle II General Meeting @KEK, on Nov. 17-20
Summary In order to handle 50 times more data of Belle, Belle II Data Handling Group works on: • Metadata system and data cache system based on Grids • Test of Large Scale Data Handling • Belle II Data • Belle Data at KEK • Application of AMGA at HLT • Data Handling and Job Management for Belle II Grid • This is a great success! • Keep going
Thank you. cho@kisti.re.kr