Jialin Liu , Bradly Crysler , Yin Lu , Yong Chen Oct. 15. 2013@U-REaSON Seminar

Jialin Liu, BradlyCrysler, Yin Lu, Yong Chen Oct. 15. 2013@U-REaSON Seminar Data-Intensive Scalable Computing Laboratory (DISCL) Locality-driven High-level I/O Aggregation for Processing Scientific Datasets

Introduction • Scientific simulations nowadays generate a few terabytes (TB) of data in a single run and the data sizes are expected to reach petabytes (PB) in the near future. • VPIC, Vector Particle in Cell, Plasma physics, 26 bytes per particle, 30TB • Accessing and analyzing the data reveals poor I/O performance due to the logical-physical mismatching.

Introduction • Scientific Datasets and Scientific I/O Libraries • PnetCDF, HDF5, ADIOS PnetCDF MPI-IO Parallel File Systems • Scientific I/O libraries allow users to specify array-based logical input • Logical-physical mismatching

Motivation I/O methods in scientific I/O libraries(PnetCDF, ADIOS, HDF5): Independent I/O • Processes collaboration: No • Calls collaboration : No Collective I/O • Processes collaboration: Yes • Calls collaboration : No Nonblocking I/O • Processes collaboration: Yes • Calls collaboration : Yes

Motivation Call0 Calli Call1 … … … … … … … Two Phase Collective I/O … ag02 ag12 ag00 ag01 ag10 ag11 ag03 ag13 agi2 agi0 agi1 agi3 Contention on Storage Server without Aware of Locality

Performance with Overlapping Calls Conclusion: Overlapping Should be Removed

Idea: High level I/O Aggregation Logical Input Decomposition Physical Layout Physical Layout sub0 Call0 start{0,0,0} length{100,200,100} start{0,0,0} length{100,200,200} sub0 sub1 start{0,0,100} length{100,200,100} sub2 sub2 Call1 start{10,20,100} length{10,150,400} sub1 start{10,20,100} length{10,300,400} sub3 sub3 start{10,170,100} length{10,150,400}

Idea: High level I/O Aggregation • Basic Idea • Figure out the overlapping among requests • Eliminate the overlapping before doing I/O • Challenges • How to decompose the requests • How to aggregate the sub-arrays at a high level

Hila: High Level I/O Aggregation • Way to figure out the physical layout • Sub-correlation Function • Sub-correlation Set • Lustre Striping: stripe size: t; stripe count: l; • Dataset : Dimension: d; subsets size: m

Hila Algorithm: Prior Step Prior Step: calculate sub-correlation set, one time analysis

Hila Algorithm: Decomposition Main Steps: Request Decomposition and Aggregation

Improvement with Hila Performance Improved with Hila

Improvement with Hila FASM Improved with Hila

Conclusion and Future Work • Conclusion • The mismatching between logical access and physical layout can lead to poor performance. • We propose the locality-driven high-level aggregation approach (HiLa) to facilitate the existing I/O methodsby eliminating the overlapping among sub-array requests. • Future Work • Apply to write operations • Integrate with file systems.

Locality-driven High-level I/O Aggregation for Processing Scientific Datasets Thanks Q&A http://discl.cs.ttu.edu

Jialin Liu , Bradly Crysler , Yin Lu , Yong Chen Oct. 15. 2013@U-REaSON Seminar

Jialin Liu , Bradly Crysler , Yin Lu , Yong Chen Oct. 15. 2013@U-REaSON Seminar

Presentation Transcript

Tuesday Oct. 15, 2013

Oct 2013

Oct. 15-16

PavanBalaji Yong Chen Abhinav Vishnu

DOS WEEK of Oct 15-18 2013

Warm-up Oct. 15, 2013

15 Oct

PavanBalaji Yong Chen Abhinav Vishnu

David Figlio , Northwestern U Mark Rush, U Florida Lu Yin, American Institutes for Research

Tuesday Oct 15, 2013

Reporter: Yong Wang Supervisor: Professor Xu Chen

Bradly French

Zhu Dahuan Liu yang Chen Junling

Yong Han, Yong Chen and Paul van Delst

Ssu-li Chang  , Yen-yin Chen  

Fengshan Liu Oct. 29, in Shanghai

Randy J. Chen Apr. 15, 2013

Oct 2013

Wed., Oct. 15

Monday, Oct. 15

Reading Report 9 Yin Chen 29 Mar 2004

Oct. 15, 2018