Life Sciences Integrated Demo. Joyce Peng, Senior Product Manager, Life Sciences, Oracle Corporation. Yao-chun.Peng@oracle.com
Informatics Challenges • Manage vast quantities of data • Access heterogeneous data • Collaborate securely • Integrate a variety of data types • Find patterns and insights
Oracle Life Sciences Platform
• Transparent Gateways: Fast access using Oracle OCI
• Distributed Queries: Perform searches across domains
• Generic Gateways: Access any data using ODBC, e.g. MySQL
• External Tables: Ability to index and query external files
• UltraSearch: Search external sites & repositories
• MySQL Toolkit: Easily move MySQL data into Oracle
• Real Application Clusters: Linear scalability
• Oracle Portal: Build personalized portals
• Application Server: Provide scalability for the middle tier
• XML DB: Flexibly manage data
• interMedia: Store & manage images
• Security: Enforce security
• Auditing: Create audit trail to facilitate FDA compliance
• Workflow: Automate laboratory & business processes
• Collaboration Suite: Collaborate securely
• iFS/Files: Share documents
• Data Mining: Discover patterns & insights
• BLAST: Sequence similarity search
• Network Model: Pathways modeling
• Statistics: Perform basic statistics
• Table Functions: Implement complex algorithms
• OLAP & Discoverer: Interactive query & drill-down
• Extensibility Framework (Data cartridges): Manage complex scientific data
• LOBs: Manage unstructured data
• Text: Index & query text, e.g. literature searches
• SQL Loader: High performance data loader
• Web Services: Standard communication between applications
• Merge/Upsert: Enabling update and insert in one step
• Transportable Tablespaces: Rapidly exchange tables
• Oracle Streams: Rule-based subscription for information sharing
Example data sources labeled on the slide: GenBank, PubMed, SwissProt SP-ML
Platform Features Highlighted (feature list as on the preceding Oracle Life Sciences Platform slide)
BioOracle Project We are scientists at a life sciences company looking to find a cure for Lymphoma
BioOracle Portal Integrated data view and Single-Sign-On to many applications
Find a Cure for Lymphoma • Literature search on Lymphoma • Set up a project workspace • Set up a meeting • Check lab protocols • Store cell histology images • Analyze gene expression results • Study the markers • Find a lead
Literature Search Search document content.
BioOracle Project in Oracle Files Lymphoma project workspace after adding documents
BioOracle Project in Oracle Files Support revision control
BioOracle Project in Oracle Files Associate metadata (Categories) to a document.
BioOracle Project in Oracle Files Advanced Search
BioOracle Project in Oracle Files Access Control
BioOracle Project in Oracle Files Supported protocols: • HTTP/WebDAV (Web) • SMB (Windows) • NFS (UNIX) • AFP (Apple Mac) • FTP
Calendar Use calendar in Collaboration Suite to schedule meetings with collaborators
BioOracle Image Management Use interMedia to manage and query Lymphoma histology data
BioOracle Image Management Generate image thumbnails
BioOracle Image Management Integrated search across relational data and extracted image attributes
Gene Expression Analysis for Lymphoma
Use an analytical pipeline to identify the patterns that differentiate DLBC from Follicular Lymphoma:
• Samples: biopsies (DLBC, Follicular)
• Instruments: Affymetrix Microarray
• Filtering and Pre-Processing: SQL, XML, Java
• Feature Selection: SQL, Oracle Data Mining Feature Selection
• Molecular Pattern Recognition: Oracle Data Mining Bayesian Classifier
• Interpretation of Results: Discoverer, Reports, Portals, Java Servlets
• Prediction: DLBC or Follicular
Dataset from Golub et al., Science 286:531-537.
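The filtering and feature-selection stages of this pipeline can be sketched in a few lines of Python. This is only an illustration, not the SQL or Oracle Data Mining code used in the demo: the clamp thresholds, the gene names, and the intensity values are all made up, and the ranking uses the signal-to-noise score (difference of class means divided by the sum of class standard deviations) as a stand-in for whatever Oracle Data Mining Feature Selection computes.

```python
import math
from statistics import mean, stdev


def preprocess(values, floor=20.0, ceiling=16000.0):
    """Clamp raw intensities to [floor, ceiling], then log2-transform.

    The floor and ceiling defaults are assumptions chosen for illustration.
    """
    return [math.log2(min(max(v, floor), ceiling)) for v in values]


def signal_to_noise(class_a, class_b):
    """Score one gene: difference of class means over summed class std devs."""
    spread = stdev(class_a) + stdev(class_b)
    return (mean(class_a) - mean(class_b)) / spread if spread else 0.0


def select_features(expression, labels, top_n):
    """Return the top_n gene names ranked by absolute signal-to-noise score."""
    scores = {}
    for gene, raw in expression.items():
        values = preprocess(raw)
        a = [v for v, lab in zip(values, labels) if lab == "DLBC"]
        b = [v for v, lab in zip(values, labels) if lab == "Follicular"]
        scores[gene] = signal_to_noise(a, b)
    return sorted(scores, key=lambda g: abs(scores[g]), reverse=True)[:top_n]


# Made-up intensities for three hypothetical genes across six samples.
labels = ["DLBC", "DLBC", "DLBC", "Follicular", "Follicular", "Follicular"]
expression = {
    "GENE_A": [900.0, 1100.0, 1000.0, 100.0, 120.0, 90.0],
    "GENE_B": [500.0, 480.0, 530.0, 510.0, 495.0, 505.0],
    "GENE_C": [60.0, 75.0, 50.0, 400.0, 380.0, 450.0],
}
selected = select_features(expression, labels, top_n=2)
print(selected)
```

GENE_B barely differs between the two classes, so it ranks last and is dropped; the two genes that separate the classes are kept for the classifier stage.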
Oracle Data Mining Classification of Cancer Subtypes (DLBC versus Follicular) Oracle provides wizards to guide analysts through data mining model creation
Oracle Data Mining Build a classification model
Oracle Data Mining Select the target field, e.g. DLBC or Follicular Lymphoma
Oracle Data Mining Select the classification model
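For readers unfamiliar with the kind of model the wizard builds, here is a minimal textbook Naive Bayes classifier over binned expression levels. It is a sketch under stated assumptions, not Oracle Data Mining's implementation: the class name `NaiveBayes`, the "high"/"low" bins, the add-one smoothing, and the six training rows are all hypothetical.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Textbook categorical Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, rows, labels):
        self.class_counts = Counter(labels)
        # value_counts[label][feature_index][value] -> number of training rows
        self.value_counts = defaultdict(lambda: defaultdict(Counter))
        # Distinct values seen per feature, used in the smoothing denominator.
        self.values = [set() for _ in rows[0]]
        for row, label in zip(rows, labels):
            for i, value in enumerate(row):
                self.value_counts[label][i][value] += 1
                self.values[i].add(value)
        return self

    def log_score(self, row, label):
        """Log prior plus the sum of smoothed log likelihoods for one class."""
        total = sum(self.class_counts.values())
        score = math.log(self.class_counts[label] / total)
        for i, value in enumerate(row):
            count = self.value_counts[label][i][value] + 1
            denom = self.class_counts[label] + len(self.values[i])
            score += math.log(count / denom)
        return score

    def predict(self, row):
        """Return the class with the highest log score."""
        return max(self.class_counts, key=lambda lab: self.log_score(row, lab))


# Hypothetical training data: two binned gene expression levels per sample.
rows = [
    ("high", "low"), ("high", "low"), ("high", "high"),
    ("low", "high"), ("low", "high"), ("low", "low"),
]
labels = ["DLBC", "DLBC", "DLBC", "Follicular", "Follicular", "Follicular"]
model = NaiveBayes().fit(rows, labels)
print(model.predict(("high", "low")))
print(model.predict(("low", "high")))
```

Working in log space avoids numeric underflow when many genes are multiplied together, which matters for microarray data with thousands of features.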
Oracle Data Mining Test the model on the data set of interest
Naïve Bayes has built a model that distinguishes DLBC from Follicular with 77% accuracy. The confusion matrix shows, for each actual subtype, how many of the model’s predictions were correct and how many were wrong
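How an accuracy figure is read off a confusion matrix can be shown with a short Python sketch. The ten labels below are invented for illustration and do not reproduce the demo's results; `confusion_matrix` and `accuracy` are hypothetical helper names.

```python
from collections import Counter


def confusion_matrix(actual, predicted):
    """Count each (actual, predicted) pair."""
    return Counter(zip(actual, predicted))


def accuracy(matrix):
    """Fraction of predictions on the diagonal (actual == predicted)."""
    correct = sum(n for (a, p), n in matrix.items() if a == p)
    return correct / sum(matrix.values())


# Invented test-set outcome: 6 DLBC and 4 Follicular samples.
actual = ["DLBC"] * 6 + ["Follicular"] * 4
predicted = ["DLBC"] * 5 + ["Follicular"] + ["Follicular"] * 3 + ["DLBC"]

matrix = confusion_matrix(actual, predicted)
classes = ["DLBC", "Follicular"]
for a in classes:
    # One row per actual class, one column per predicted class.
    print(a, [matrix[(a, p)] for p in classes])
print("accuracy:", accuracy(matrix))
```

The diagonal cells hold the correct predictions and the off-diagonal cells hold the two kinds of error, so accuracy is the diagonal total divided by the number of test samples.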
Oracle Data Mining See if the Adaptive Bayes Network algorithm can build a better model
Oracle Data Mining Use wizards to define parameters for building a model
Oracle Data Mining Adaptive Bayes Network algorithm can predict Lymphoma subtype with 84% accuracy
Oracle Data Mining Adaptive Bayes Network algorithm generates rules for model interpretation
Oracle Data Mining in JDeveloper Automatically create the Java code needed to build analytical pipelines inside the database