Data mining and discovery of access patterns
3a.i) Adaptive file caching in a distributed system (LBNL)
3b.i) Dimension reduction and sampling (LLNL)
3c.i) Multi-agent based high-dimensional cluster analysis (ORNL)
3c.ii) Analysis of application level query patterns (LLNL, NWU)
SDM kickoff meeting, July 10-11, 2001
Area 3 Report
People involved
• Adaptive file caching in a distributed system (LBNL)
  • Ekow Otoo, Frank Olken
• Dimension reduction and sampling (LLNL)
  • Chandrika Kamath, Imola Fodor
• Multi-agent based cluster analysis (ORNL)
  • Nagiza Samatova, George Ostrouchov
• Analysis of application level query patterns (LLNL, NWU)
  • Terence Critchlow, Ghaleb Abdulla, Alok Choudhary, …
• Agent technology (ORNL, NCSU)
  • Tom Potok, Mladen Vouk
Targeted Application Area(s)
• First Year: Climate, HEP, Astrophysics
• Future Years: others (to be determined)
Application(s) contact people
• High Energy Physics:
  • Ask Arie?
• Climate (SciDAC):
  • John Drake & David Erikson: Computer Science and Mathematics Division, ORNL
  • Ben Santer: Program for Climate Model Diagnosis and Intercomparison (PCMDI), LLNL
• Astrophysics (SciDAC):
  • Tony Mezzacappa: Physics Division, ORNL
Application Scenario
System Architecture
Year 1 Deliverables
Distributed simulation product query, search, and retrieval engine (proof-of-principle climate-centric search engine)
• VIPAR-based system architecture (Tom)
• Develop and test similarity measures & clustering algorithms for climate time series data comparison (Nagiza)
• Identify, collect & begin analyzing meta-data from simulation application (Terence)
• Optimization of file migration and replacement policies in distributed disk caching using simulation models (Ekow)
• (Alok)
• Serial implementations of climate-appropriate non-linear non-orthogonal dimension reduction methods; start dynamic EOFs (Chandrika)
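The replacement-policy deliverable above evaluates caching policies with simulation models. As a rough illustration of what such a simulation can look like, the sketch below replays a file-access trace against a size-limited disk cache and reports the hit ratio. The slides do not name the policies under study; LRU (least recently used) is assumed here only as a familiar baseline, and the function name `simulate_lru`, the trace, and the file sizes are all hypothetical.

```python
from collections import OrderedDict


def simulate_lru(trace, sizes, capacity):
    """Replay a file-access trace against an LRU disk cache; return the hit ratio.

    LRU is an assumed baseline policy, not one taken from the slides.
    """
    cache = OrderedDict()  # file name -> size, least recently used first
    used = 0               # space currently occupied in the cache
    hits = 0
    for name in trace:
        if name in cache:
            hits += 1
            cache.move_to_end(name)  # mark as most recently used
            continue
        size = sizes[name]
        if size > capacity:
            continue  # file can never fit: a miss, and it is not cached
        # Evict least recently used files until the new file fits.
        while used + size > capacity:
            _, evicted_size = cache.popitem(last=False)
            used -= evicted_size
        cache[name] = size
        used += size
    return hits / len(trace) if trace else 0.0


# Hypothetical file sizes (in GB) and access trace, for illustration only.
sizes = {"a": 2, "b": 2, "c": 2}
trace = ["a", "b", "a", "c", "b", "a"]
hit_ratio = simulate_lru(trace, sizes, capacity=4)
```

Running the same trace at several cache capacities, or swapping the eviction loop for another policy, gives a simple way to compare alternatives before deploying them on a real distributed cache.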