
Towards Mammalian Feeding Database

Towards Mammalian Feeding Database. Vladimir Gapeyev, NESCent. 1st Working Group Meeting: Analysis and Synthesis of Physiologic Data from the Mammalian Feeding Apparatus. NESCent, Durham, NC, 26 February 2009. Goal of this talk: start soliciting application requirements.


Presentation Transcript


  1. Towards Mammalian Feeding Database
     Vladimir Gapeyev, NESCent
     1st Working Group Meeting: Analysis and Synthesis of Physiologic Data from the Mammalian Feeding Apparatus
     NESCent, Durham, NC, 26 February 2009

  2. Goal of this talk: start soliciting application requirements

  3. Plan of the talk
     • Web-based databases
     • NESCent Informatics
     • Mammalian Feeding
     • Data model
     • Application mock-up

  4. Biological Databases on the Web

  5. Dryad data repository http://www.datadryad.org -- Ryan Scherle

  6. Architecture of a Web database

  7. Notable points about Bio databases
     • Data vs. informational display
     • Accession IDs
       • Correspond to “units of contribution”
       • Enable linking between DBs
     • Layers of data
       • What are “annotations” in one DB are “primary data” in another

  8. NESCent Informatics
     • Science support mission:
       • IT infrastructure
       • Custom prototype applications
       • Directly-funded projects
     • People

  9. Sample prototypes from NESCent
     • about 1 full-time developer for 2-3 months
     • split among Vladimir Gapeyev, Xianhua Liu, Hilmar Lapp

  10. Prototype development for a WG
      • What to expect:
        • Scope: an app facilitating WG scientific activity
        • Quality: a functional and working prototype
        • Deliverable: open-source code
        • Support: near-term hosting and administration
      • What not to expect:
        • industrial-strength 24x7 system
        • data entry or transformation en masse
        • scientific data curation
        • on-going development after WG wraps up
      • … But: NESCent runs dedicated projects, too

  11. Mammalian Feeding: IT guy’s view

  12. Two major areas for IT support
      • Managing the data
        • collecting, storing, searching, downloading
        • “out” is the same as “in”: no added value
        • Primary data: parameters and digital recordings
        • Derived data: e.g., chewing cycle annotations
      • Facilitating analysis
        • rectify and integrate before download
        • extract and assemble relevant data subsets
        • transform into stats-ready formats

  13. Managing the data
      • Data model: structure of the data
        • Sample data records
        • Entity-relationship diagrams
        • Controlled vocabularies
      • User interface: manipulation of the data
        • Mock-ups of pages / forms
        • Usage sequences and scenarios

  14. Data model sketch

  15. Some concrete open issues
      • Do Animals belong to Sessions, Studies, or Labs?
      • Does File belong to a Behavior or to a Session?
      • What are the kinds of recordings to be collected?
      • Are there simultaneous recordings?
      • What are recording file formats?
      On the wiki: https://www.nescent.org/wg_feeding/Data_model_sketch -- add to and comment!
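
To make the open questions above concrete, here is a minimal Python sketch of one possible arrangement of the entities named on the slide. The entity names (Lab, Study, Session, Animal, Behavior, File) come from the slide; every field and every ownership choice is a hypothetical placeholder. In particular, it assumes that Animals belong to a Study and that a File belongs to a Behavior, which are exactly the points still under discussion.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical sketch: all fields and ownership links are assumptions,
# not decisions of the working group.

@dataclass
class RecordingFile:
    path: str
    file_format: str  # actual recording formats are an open question

@dataclass
class Behavior:
    label: str  # e.g. "chewing"
    files: List[RecordingFile] = field(default_factory=list)  # assumption: File belongs to a Behavior

@dataclass
class Session:
    date: str
    animal_id: str  # assumption: a Session refers to exactly one Animal
    behaviors: List[Behavior] = field(default_factory=list)

@dataclass
class Animal:
    animal_id: str
    species: str

@dataclass
class Study:
    title: str
    animals: List[Animal] = field(default_factory=list)  # assumption: Animals belong to a Study
    sessions: List[Session] = field(default_factory=list)

@dataclass
class Lab:
    name: str
    studies: List[Study] = field(default_factory=list)

def files_for_animal(lab: Lab, animal_id: str) -> List[RecordingFile]:
    """Collect every file recorded from one animal across the lab's studies."""
    return [
        f
        for study in lab.studies
        for session in study.sessions
        if session.animal_id == animal_id
        for behavior in session.behaviors
        for f in behavior.files
    ]
```

Choosing a different owner for Animals or Files changes which lookups are a single hop and which require a traversal like `files_for_animal`, which is one way to weigh the alternatives.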

  16. Another data model aspect: file structure
      • Refer to data points by number (#) or by second?
      • Bother about tape info, or assume recordings are digitized?
      • What is there besides EMG?
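
The choice between addressing data points by number or by second matters less if each recording stores its sampling rate, because the two are then interconvertible. A minimal sketch, assuming a constant sampling rate per recording (an assumption, since the recording formats are not yet known; the function names are hypothetical):

```python
def sample_to_seconds(index: int, sampling_rate_hz: float) -> float:
    """Time offset, in seconds, of the sample at the given zero-based index."""
    return index / sampling_rate_hz

def seconds_to_sample(t_seconds: float, sampling_rate_hz: float) -> int:
    """Zero-based index of the sample nearest to the given time offset."""
    return round(t_seconds * sampling_rate_hz)
```

For example, at a hypothetical 1000 Hz rate, sample 2500 sits 2.5 seconds into the recording. Recordings with a variable rate, or tapes digitized at an unknown rate, would need explicit timestamps instead.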

  17. Mammalian feeding: UI mock-up http://feeding-dev.nescent.org/

  18. How we can use the UI mock-up http://feeding-dev.nescent.org/
      • Try to populate with data
      • Try to do searches
      • Take note of what is awkward and what is lacking
      • Get ideas to refine the data model
      • Determine usage scenarios that require a more complex interface
      • Correct Lab names, etc. to ones we prefer

  19. Facilitating analysis
      • Rectify and integrate before download
        • Use one of a dozen pre-determined algorithms
        • Supply your own algorithm
      • Extract and assemble data subsets
        • Only get channels for anterior temporalis
        • Get each chew into a separate file
        • Do these for all apes in the database
      • Transform into stats-ready formats
        • Tab- or comma- delimited?
        • Special headers added?
        • Accompanied by a phylogenetic tree?
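
The extraction and export steps listed above can be sketched in a few lines of Python. This is an illustration only: it assumes a recording is held in memory as a mapping from channel name to a list of samples, and that chew boundaries are available as sample indices (for instance from chewing cycle annotations). The function names and the `# ` comment-header style are hypothetical.

```python
import csv
import io
from typing import Dict, List, Sequence

def select_channels(recording: Dict[str, List[float]], muscle: str) -> Dict[str, List[float]]:
    """Keep only channels whose name contains the muscle name (case-insensitive)."""
    return {name: s for name, s in recording.items() if muscle.lower() in name.lower()}

def split_into_chews(samples: Sequence[float], boundaries: Sequence[int]) -> List[Sequence[float]]:
    """Cut one channel into per-chew segments at the given sample indices."""
    edges = [0, *boundaries, len(samples)]
    return [samples[a:b] for a, b in zip(edges, edges[1:])]

def to_delimited(channels: Dict[str, List[float]], delimiter: str = "\t",
                 header_comments: Sequence[str] = ()) -> str:
    """Render channels as columns of a tab- or comma-delimited table."""
    out = io.StringIO()
    for line in header_comments:  # optional "special headers"
        out.write(f"# {line}\n")
    writer = csv.writer(out, delimiter=delimiter, lineterminator="\n")
    names = list(channels)
    writer.writerow(names)
    for row in zip(*(channels[n] for n in names)):  # one row per sample
        writer.writerow(row)
    return out.getvalue()
```

Each segment returned by `split_into_chews` could be passed through `to_delimited` and written to its own file, which covers the "each chew into a separate file" case. Whether comment headers are acceptable depends on the statistics package that reads the output.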

  20. Informatics’ goals for this meeting
      • Understand high-level needs of this WG for data storage and processing
      • Clarify ambiguities and fill in details of the data model
      • Document major usage scenarios
      • Establish a stakeholder group for day-to-day collaboration
      • Determine constraints on the timeline
