150 likes | 261 Views
Jialin Liu , Bradly Crysler , Yin Lu , Yong Chen Oct. 15. 2013@U-REaSON Seminar Data-Intensive Scalable Computing Laboratory (DISCL ). Locality-driven High-level I/O Aggregation for Processing Scientific Datasets. Introduction.
E N D
Jialin Liu, BradlyCrysler, Yin Lu, Yong Chen Oct. 15. 2013@U-REaSON Seminar Data-Intensive Scalable Computing Laboratory (DISCL) Locality-driven High-level I/O Aggregation for Processing Scientific Datasets
Introduction • Scientific simulations nowadays generate a few terabytes (TB) of data in a single run and the data sizes are expected to reach petabytes (PB) in the near future. • VPIC, Vector Particle in Cell, Plasma physics, 26 bytes per particle, 30TB • Accessing and analyzing the data reveals poor I/O performance due to the logical-physical mismatching.
Introduction • Scientific Datasets and Scientific I/O Libraries • PnetCDF, HDF5, ADIOS PnetCDF MPI-IO Parallel File Systems • Scientific I/O libraries allow users to specify array-based logical input • Logical-physical mismatching
Motivation I/O methods in scientific I/O libraries(PnetCDF, ADIOS, HDF5): Independent I/O • Processes collaboration: No • Calls collaboration : No Collective I/O • Processes collaboration: Yes • Calls collaboration : No Nonblocking I/O • Processes collaboration: Yes • Calls collaboration : Yes
Motivation Call0 Calli Call1 … … … … … … … Two Phase Collective I/O … ag02 ag12 ag00 ag01 ag10 ag11 ag03 ag13 agi2 agi0 agi1 agi3 Contention on Storage Server without Aware of Locality
Performance with Overlapping Calls Conclusion: Overlapping Should be Removed
Idea: High level I/O Aggregation Logical Input Decomposition Physical Layout Physical Layout sub0 Call0 start{0,0,0} length{100,200,100} start{0,0,0} length{100,200,200} sub0 sub1 start{0,0,100} length{100,200,100} sub2 sub2 Call1 start{10,20,100} length{10,150,400} sub1 start{10,20,100} length{10,300,400} sub3 sub3 start{10,170,100} length{10,150,400}
Idea: High level I/O Aggregation • Basic Idea • Figure out the overlapping among requests • Eliminate the overlapping before doing I/O • Challenges • How to decompose the requests • How to aggregate the sub-arrays at a high level
Hila: High Level I/O Aggregation • Way to figure out the physical layout • Sub-correlation Function • Sub-correlation Set • Lustre Striping: stripe size: t; stripe count: l; • Dataset : Dimension: d; subsets size: m
Hila Algorithm: Prior Step Prior Step: calculate sub-correlation set, one time analysis
Hila Algorithm: Decomposition Main Steps: Request Decomposition and Aggregation
Improvement with Hila Performance Improved with Hila
Improvement with Hila FASM Improved with Hila
Conclusion and Future Work • Conclusion • The mismatching between logical access and physical layout can lead to poor performance. • We propose the locality-driven high-level aggregation approach (HiLa) to facilitate the existing I/O methodsby eliminating the overlapping among sub-array requests. • Future Work • Apply to write operations • Integrate with file systems.
Locality-driven High-level I/O Aggregation for Processing Scientific Datasets Thanks Q&A http://discl.cs.ttu.edu