Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g
Professor Yike Guo
Open Discovery Workflow
• Discovery Workflow
• Real Time Data Integration
• Discovery Services
• Operational Data
• Literature
• Instrument Data
• Databases
• Using Distributed Resources
• Intellectual Property Management
• Dynamic Application Integration
• Images
• Information Discovery
KDE Platform Architecture
• KDE Applications: End-user applications and user interface allowing scientists to construct and drive knowledge discovery activities
• KDE Middleware: Managing, executing and deploying discovery plans (workflows) for distributed knowledge discovery and access to distributed resources and services
• KDE Discovery Services: Distributed databases, compute servers and scientific devices
Other diagram labels: Open Discovery Workflow Model; SDK Model Web Service Support Infrastructure
KOE: Open Discovery Workflow for Oracle 10g
• Kensington: A discovery workflow environment that interfaces with Oracle's Data/Text Mining and Life Science components
  • Access Oracle 10g analytical functions without coding
  • Query data via SQL or workflow construction
  • Visualize and report discovery results and processes
  • Deploy reusable discovery processes as Web services
• Oracle 10g: The most powerful foundation for grid-based informatics
  • Scalability: Large-scale data analysis with grid support
  • Performance: Speed and quality
  • Reliability: Persistence, security and industry standards
• Kensington + Oracle 10g: Enterprise Discovery Information Framework
  • Fulfilling complex integrative discovery tasks with ease
  • Organizing large discovery projects
  • Turning your database into a knowledge base with immediate gain
KDE Oracle Edition Architecture
[Diagram; recoverable label: 10g ODM & Text]
Oracle Workflow Example
• Kensington components
• Oracle components
• Connect icons to form a uniform workflow
"In Oracle" Processing
• Drag-and-drop is used to create symbolic links to database tables
• A transparent process brings the data into the system, as if it were a normal table node
• A complex SQL query can be built via a workflow for pre-processing
• Such a query workflow is transparently executed within Oracle
• Oracle data/text mining, statistical functions and life science components are accessible through the KOE workflow and run transparently within Oracle
• Building complex discovery applications WITHOUT CODING
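The idea of building a query as a workflow and executing it inside the database can be sketched as follows. This is a minimal illustration only, not Kensington's implementation: the `Node` class, the `table`, `where`, `select` and `to_sql` helpers, and the `samples` table are all hypothetical names, and SQLite stands in for Oracle so that the sketch runs anywhere.

```python
import sqlite3
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Node:
    """One workflow step (hypothetical). `source` is the upstream node."""
    kind: str
    arg: object
    source: Optional["Node"] = None


def table(name):
    # Symbolic link to a database table: no rows are fetched here.
    return Node("table", name)


def where(src, condition):
    return Node("where", condition, src)


def select(src, columns):
    return Node("select", tuple(columns), src)


def to_sql(node):
    """Compile a chain of workflow nodes into one nested SQL statement."""
    if node.kind == "table":
        return f"SELECT * FROM {node.arg}"
    inner = to_sql(node.source)
    if node.kind == "where":
        return f"SELECT * FROM ({inner}) WHERE {node.arg}"
    if node.kind == "select":
        return f"SELECT {', '.join(node.arg)} FROM ({inner})"
    raise ValueError(f"unknown node kind: {node.kind}")


def run_demo():
    # SQLite is an assumption made for portability; the slides target Oracle 10g.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE samples (id INTEGER, gene TEXT, expr REAL)")
    conn.executemany(
        "INSERT INTO samples VALUES (?, ?, ?)",
        [(1, "A", 0.2), (2, "B", 1.5), (3, "C", 2.7)],
    )
    # Connect the nodes: table -> filter -> projection.
    workflow = select(where(table("samples"), "expr > 1.0"), ["id", "gene"])
    sql = to_sql(workflow)
    # The whole workflow runs as a single query inside the database.
    rows = conn.execute(sql).fetchall()
    conn.close()
    return sql, rows


if __name__ == "__main__":
    sql, rows = run_demo()
    print(sql)
    print(rows)
```

Because the nodes only describe the query, no data leaves the database until the compiled statement is executed, which is the property the slide refers to as transparent in-database execution.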
An Application Example (The Lymphoma Example)
Diagram labels: Generate and Compile Code; Accuracy Testing; Sequence Search; View Model; Choose A Build; Feature Selection; Choose algorithm/parameter
Kensington Workflow
• Pre-processing
• Naïve Bayes Mining and Evaluation
• Adaptive NB Mining and Evaluation
• Naïve Bayes Mining, Evaluation & BLAST
• Apply Classification
Rich visualizers to view results
• Visualize pre-processed data
• Select attribute importance visually
• Browse classification models
• Evaluate with ROC plot
• Evaluate the class distribution
• Inspect the sequence alignment
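For readers unfamiliar with the ROC plot mentioned above, the following sketch shows how the curve's points and the area under it can be computed from a classifier's scores. It is a generic illustration, not the visualizer's code: the function names `roc_points` and `auc` are hypothetical, and labels are assumed to be 0 (negative) or 1 (positive).

```python
def roc_points(labels, scores):
    """Return (false positive rate, true positive rate) points of the ROC curve.

    Assumes labels are 0/1 and a higher score means "more likely positive".
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one positive and one negative label")

    # Sweep the decision threshold from the highest score to the lowest.
    pairs = sorted(zip(scores, labels), key=lambda p: -p[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(pairs):
        threshold = pairs[i][0]
        # Items with tied scores cross the threshold together.
        while i < len(pairs) and pairs[i][0] == threshold:
            if pairs[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum(
        (x1 - x0) * (y0 + y1) / 2
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    )


if __name__ == "__main__":
    labels = [1, 1, 0, 1, 0, 0]
    scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
    pts = roc_points(labels, scores)
    print(pts)
    print(auc(pts))
```

An area of 1.0 corresponds to a classifier that ranks every positive above every negative, while 0.5 corresponds to random ranking.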
Summary
• "Empowered by Oracle Database 10g, InforSense's Open Discovery Workflow technology has its most powerful execution engine," commented Prof. Yike Guo, CEO of InforSense. "Using InforSense's discovery workflows built upon Oracle's data mining, text mining and R&D database functionality, researchers and organizations can now automate large scale and complex knowledge discovery activities with high performance and reliability."
• "Integrating Kensington Discovery Platform with Oracle Database 10g provides life science researchers with a flexible workflow solution for the intuitive integration of analytics from InforSense and Oracle," said Dr. Susie Stephens, Life Sciences Product Manager, Oracle Corporation.