160 likes | 239 Views
Semantic Cyberinfrastructure for Knowledge and Information Discovery (SCiKID) Proposal. For NSF EAGER Grant. Principle Investigator: Eric Rozell Tetherless World Constellation Rensselaer Polytechnic Institute. Table of Contents. Cover Sheet Information Project Summary Project Description
E N D
Semantic Cyberinfrastructure for Knowledge and Information Discovery(SCiKID) Proposal For NSF EAGER Grant Principle Investigator: Eric Rozell Tetherless World Constellation Rensselaer Polytechnic Institute
Table of Contents • Cover Sheet Information • Project Summary • Project Description • Unfunded Collaborations • EArly Grants for Exploratory Research(EAGER) “Fitness” • References Cited • Biographical Sketch • Budget Justification • Review
Cover Sheet Information • Awardee: Eric Rozell • Primary Location: Rensselaer Polytechnic Institute • Program: EAGER • Unit of Consideration: NSF Office of Cyberinfrastructure (OCI) • Title: Semantic Cyberinfrastrucutre for Knowledge and Information Discovery (SciKID) • Budget: $300,000 • Duration: 2 years
Project Summary - Overview • Federated search capabilities • Multi-paradigm search tools (e.g., hierarchical, faceted, semantic, etc.) • Multi-disciplinary data discovery platform • Data integration and quality analysis tools
Project Summary – Intellectual Merit • General cyberinfrastructure contribution applicable to many science domains (breadth) • Accelerate scientific discovery • Extensible architecture for data analysis and visualization tools (depth) • Data scientists / informaticists focus on algorithms and tools
Project Summary –Broader Impacts • Applicable to any data-centric science domain • Range of tools supporting experts (e.g., specialized informaticists) to non-experts (e.g., undergraduates) in scientific discovery • Support education on best practices in data pipeline
Project Description - Content • Objectives • To advance the fields of data science and cyberinfrastructure • Significance • Will accelerate scientific discovery in data-centric science domains • Long-term Goals • To provide cyberinfrastructure covering all aspects of the data pipeline enabling the preservation of accessible and reusable data for centuries to come • Related Work • Virtual Observatories (many refs.) • Faceted Browse (many refs.) • Semantic Search (Noesis, other refs.) • Web service discovery (ESIP Discovery Cluster, P2P systems, et al.) • Data provenance (Peter’s DSRC talk)
Project Description - Content • (2 months) Extend S2S to support federated search • (2 months) Investigate search paradigms • E.g., faceted browse, semantic search algorithms, hierarchical search • (4 months) Extend S2S to support variety of search paradigms • (4 months) Investigate data discovery techniques • E.g., service metadata (ontology) models, registry architectures • (2 months) Extend S2S with necessary discovery infrastructure • (6 months) Investigate provenance models that will enable data integration and data QA as presented in Peter’s DSRC talk • (4 months) Extend S2S with more elaborate “data” model
Project Description – Results from Prior Work • No prior NSF funding (as PI / co-PI) • Worked as RA for SeSF? • Results include S2S (see refs.)
Unfunded Collaborations • Will utilize existing connections with… • High Altitude Observatory, NCAR, CO • Geology & Geophysics, WHOI, MA • Physical Oceanography, WHOI, MA • BCO-DMO, WHOI, MA
EAGER “Fitness” • Transformative • Federated search across web service standards • Persistent storage and reuse of user interface components and integration and analysis tools in a web environment • Interdisciplinary • Involving many science domains (for use cases and design input), visualization, web science, data science, software engineering
EAGER “Unfitness” • Not necessarily “Untested” • May work as “regular” NSF proposal
References Cited • S2S Publications • E. Rozell, P. Fox, A. Maffei, S. Zednik (2011), A Framework for Earth Science Search Interface Development, Abstract EGU2011-13413 presented at General Assembly 2011, EGU, Vienna, Austria, 03-08 Apr • E. Rozell, A. Maffei, S. Beaulieu, P. Fox (2010), A Framework for Integrating Oceanographic Data Repositories, Abstract IN23A-1349 presented at 2010 Fall Meeting, AGU, San Francisco, Calif., 13-17 Dec • E. Rozell, P. Fox, A. Maffei, Ontology and Application for Reusable Search Interface Design, to be submitted to Computers & Geosciences • Plus everything needed from the related work
Biographical Sketch • Eric Rozell, Ph.D. Student • Degrees • B.S., Computer Science, RPI • M.S., Management in TC&E*, RPI (in pursuit) • Ph.D., Computer Science, RPI (in pursuit) • Appointments • WHOI Summer Student Fellow • Darrin Fdn. Summer Ugrad. Research Fellow • Collaborators • TWC Professors • WHOI Scientists
Budget Justification • Annual Budget ($150,000 annually) • $30,000 (meager RA salary) • $50,000 (tuition & fees) • $55,000 (jr. software engineer salary) • $10,000 (consulting fees and travel) • $5,000 (hardware) • Cumulative Budget • Above x2
Review • Needs work…