170 likes | 186 Views
Explore the unique challenges of neuroinformatics in handling vast data generated by neuroscientists and integrating molecular, anatomical, and behavioral information. Discover how the CARMEN project focuses on neural activity data and the efforts to resolve neural coding complexities. Differentiate between bioinformatics and neuroinformatics, emphasizing the need for standardized structures in the latter. Gain insights into the difficulties of sharing complex neuroscience data without adequate metadata and the cultural barriers hindering data exchange. Conclusions highlight the importance of collaborative efforts and leveraging existing bioinformatics knowledge to address neuroinformatics challenges effectively.
E N D
The Sociology of Ontologies in Neurosciences Phillip Lord, School of Computing Science, Newcastle University
Background to the CARMEN project The role that we see for ontologies. Why neurosciences is different. How we are planning to build them Overview
Research Challenge • Worldwide >100,000 neuroscientists(~ 5,000 in UK) are generating vast amounts of data • Principal experimental data formats: • molecular (genomic/proteomic) • anatomical (spatial) • behavioural • neurophysiological (time-series electrical measures of activity) • Neuroinformatics concerns how these data are handled and integrated, including the application of computational modelling Understanding the brain may be the greatest informatics challenge of the 21st century
CARMEN – Focus on Neural Activity • raw voltage signal data collected by patch-clamp and single & multi- electrode array recording Understanding the brain may be the greatest informatics challenge of the 21st century resolving the ‘neural code’ from the timing of action potential activity neurone 1 neurone 2 neurone 3
Potential Barriers • Technical • Multiple proprietary data formats Volume of the data to be analysed • Cultural • Multiple communities each acting independently • Concerns about the consequences of sharing data All of this will sound very familiar to biologists, and others
The project was funded starting from this October – hence it’s about 3 weeks old. Therefore, this talk is based on my initial impressions I don’t actually know anything about sociology A disclaimer
Neurosciences seems to have very similar problems to bioinformatics Bioinformatics is rich with metadata; this isn’t yet true with neuroinformatics What are the differences between bio and neuroinformatics Whats the difference?
DNA and Protein sequence form a core datatype for bioinformatics It’s simple to structure and to store, and it is of high-value Initially, there wasn’t much of it, and textual metadata was fine. Many people built tools over it, for transforming and manipulating. No sequences!
Neurosciences data is hard • Most neurosciences data is relatively simple in structure • But often contextually complex • And sometimes associated with behavioural features • Without additional metadata, the raw data is relatively meaningless • In this, it shares much with microarray data.
Data Sharing was an early tradition in biology. Gene patenting, NDAs and the like came as quite a surprise Many political battles were fought, culminating with Clinton/Blair statement Data Sharing in bioinformatics
The data is easy to structure, but the metadata is not Is therefore much harder to share data usefully Many neuroscientists come from a medical background tends to be more of a hierarchical, secretive profession – all worried about getting sued. A lot of neuroscientists use invasive, live animal experiments security is more than a passing concern. Data Sharing in Neurosciences
The achievements and processes of bioinformatics are familiar to neuroscience it seems to be easier to argue for the value of standardisation But less of a do-it-yourself attitude “But you can’t just make up a standard” “We’re just trying to build a list of terms, which we all understand. Then the experts can turn it into an ontology” A Following Wind
Currently, we are term gathering ignorance is our key weapon! Many of the analysis steps are straight-forward maths/stats Much of the experimental metadata should be transferable from bioinformatics. Approach
How to define the most essential metadata, for highest win. How to engage the community into providing the metadata Will we be able to adapt the knowledge from bio, or will it be too complex? Are we doomed to relieve our past? The issues
We need to avoid “ontology for everything” Probably easier to avoid “reinventing the wheel” Simple to start with a migratory path Conclusions
Frank Gibson Carmen Investigators Jim Austin, Colin Ingram, Paul Watson, Stuart Baker, Roman Borisyuk, Stephen Eglen, Jianfeng Feng, Kevin Gurney, Tom Jackson, Marcus Kaiser, Stefano Panzeri, Rodrigo Quian Quiroga, Simon Schultz, Evelyne Sernagor, V. Anne Smith, Tom Smulders, Miles Whittington Acknowledgements