A Framework for the Assessment and Selection of Software Components and Connectors in COTS-based Architectures
Jesal Bhuta, Chris Mattmann {jesal, mattmann}@usc.edu
USC Center for Systems & Software Engineering
http://csse.usc.edu
February 13, 2007
Outline • Motivation and Context • COTS Interoperability Evaluation Framework • Demonstration • Experimentation & Results • Conclusion and Future Work
COTS-Based Applications Growth Trend • The number of systems using OTS components is steadily increasing • USC e-Services projects show the share of CBAs rising from 28% in 1997 to 70% in 2002 • The Standish Group's 2000 survey found similar results (54%) in industry [Standish 2001 - Extreme Chaos]
[Figures: CBA Growth Trend in USC e-Services Projects; Standish Group Results]
COTS Integration: Issues • COTS products are built with their own sets of assumptions, which are not always compatible • Example: integrating a Java-based Customer Relationship Management (CRM) system with Microsoft SQL Server • The CRM supports JDBC; MS SQL Server supports ODBC
[Figure: Java CRM (JDBC) <-> (ODBC) Microsoft SQL Server]
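As a concrete illustration (not part of the original slides): one period-typical workaround for this particular mismatch was Sun's JDBC-ODBC bridge driver (removed in later Java releases), which lets JDBC calls reach an ODBC data source. A minimal sketch, assuming a hypothetical ODBC DSN named "CrmDb" configured for the SQL Server instance and a "customers" table:

import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class JdbcOdbcBridgeDemo {
    public static void main(String[] args) throws Exception {
        // Load the (since removed) Sun JDBC-ODBC bridge driver
        Class.forName("sun.jdbc.odbc.JdbcOdbcDriver");
        // "CrmDb" is a hypothetical ODBC data source name (DSN)
        try (Connection con = DriverManager.getConnection("jdbc:odbc:CrmDb");
             Statement st = con.createStatement();
             ResultSet rs = st.executeQuery("SELECT name FROM customers")) {
            while (rs.next()) {
                System.out.println(rs.getString("name"));
            }
        }
    }
}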
Case Study [Garlan et al. 1995] • Develop a software architecture toolkit • COTS selected • OBST, a public-domain object-oriented database • InterViews, a GUI toolkit • SoftBench, an event-based tool integration mechanism • Mach RPC interface generator, an RPC mechanism • Estimated time to integrate: 6 months and 1 person-year • Actual time to integrate: 2 years and 5 person-years
Problem: Reduced Trade-Off Space • Detailed interoperability assessment is effort-intensive • It requires detailed analysis of interfaces and COTS characteristics, plus prototyping • A large number of COTS products are available in the market • Over 100 CRM solutions x over 50 databases = over 5,000 possible combinations • As a result, interoperability assessment is often neglected until late in the development cycle • This reduces the trade-off space: medium- and low-priority requirements end up being weighed against the cost of integrating COTS products
Statement of Purpose To develop an efficient and effective COTS interoperability assessment framework by: • Utilizing existing research and observations to introduce concepts for representing COTS products • Developing rules that define when specific interoperability mismatches can occur • Synthesizing (1) and (2) into a comprehensive framework for performing interoperability assessment early (late inception) in the system development cycle Efficient: acting or producing effectively with a minimum of unnecessary effort Effective: producing the desired effect (effort reduction during COTS integration)
Proposed Framework: Scope • Specifically addresses the problem of technical interoperability • Does not address non-technical interoperability issues • Human-computer interaction incompatibilities • Inter- and intra-organizational incompatibilities
Motivating Example: Large-Scale Distributed Scenario • Manage and disseminate digital content (planetary science data) • Data disseminated in multiple intervals • Two user classes separated by geographically distributed networks (the Internet) • Scientists from the European Space Agency (ESA) • External users
Integration Rules • Interface analysis rules • Example: 'Failure due to incompatible error communication' • Internal assumption analysis rules • Example: 'Data connectors connecting components that are not always active' • Dependency analysis rules • Example: 'Parent node does not support dependencies required by the child components' • Each rule includes pre-conditions and results
Integration Rules: Interface Analysis • 'Failure due to incompatible error communication' • Pre-conditions • Two components (A and B) communicating via data and/or control (bidirectional) • One component's (A) error-handling mechanism is 'notify' • The two components have incompatible error output/error input methods • Result • A failure in component A will not be communicated to component B, causing a permanent block or failure in component B
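A minimal sketch of how such a rule might be encoded as a predicate over simple component descriptors; the class, field, and enum names are illustrative assumptions, not the framework's actual implementation:

import java.util.Set;

public class InterfaceRules {
    enum ErrorHandling { NOTIFY, LOG, IGNORE }

    static class Component {
        final String name;
        final ErrorHandling errorHandling;
        final Set<String> errorOutputs; // error formats this component emits
        final Set<String> errorInputs;  // error formats this component accepts
        Component(String name, ErrorHandling eh, Set<String> out, Set<String> in) {
            this.name = name; this.errorHandling = eh;
            this.errorOutputs = out; this.errorInputs = in;
        }
    }

    // Pre-conditions: A notifies on error, and no error format emitted by A
    // is accepted by B. (Bidirectional data/control communication between A
    // and B is assumed to have been established by an earlier check.)
    static boolean errorCommunicationMismatch(Component a, Component b) {
        boolean aNotifies = a.errorHandling == ErrorHandling.NOTIFY;
        boolean incompatible = a.errorOutputs.stream().noneMatch(b.errorInputs::contains);
        return aNotifies && incompatible; // result: B may block or fail permanently
    }

    public static void main(String[] args) {
        Component a = new Component("CRM", ErrorHandling.NOTIFY,
                Set.of("java-exception"), Set.of("java-exception"));
        Component b = new Component("DB", ErrorHandling.LOG,
                Set.of("sqlstate"), Set.of("sqlstate"));
        System.out.println(errorCommunicationMismatch(a, b)); // prints: true
    }
}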
Integration Rules: Internal Assumption Analysis • 'Data connectors connecting components that are not always active' • Pre-conditions • Two components connected via a data connector • One of the components does not have a central control unit • Result • Potential data loss
[Figure: Component A - Pipe - Component B]
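The same encoding pattern fits this internal-assumption rule; again a hedged sketch with illustrative names:

public class AssumptionRules {
    enum ConnectorType { DATA, CONTROL }

    static class Component {
        final String name;
        final boolean hasCentralControlUnit; // can it actively drive/consume I/O?
        Component(String name, boolean ccu) {
            this.name = name;
            this.hasCentralControlUnit = ccu;
        }
    }

    // Pre-conditions: a data connector (e.g., a pipe) joins the components,
    // and at least one endpoint lacks a central control unit.
    // Result: data written to the connector may never be consumed.
    static boolean potentialDataLoss(Component a, Component b, ConnectorType t) {
        return t == ConnectorType.DATA
                && (!a.hasCentralControlUnit || !b.hasCentralControlUnit);
    }

    public static void main(String[] args) {
        Component producer = new Component("A", true);
        Component passiveFilter = new Component("B", false);
        System.out.println(potentialDataLoss(producer, passiveFilter,
                ConnectorType.DATA)); // prints: true
    }
}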
Integration Rules: Dependency Analysis • 'Parent node does not support dependencies required by the child components' • Pre-condition: • A component in the system requires one or more software components to function • Result: • The component will not function as expected
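Dependency analysis reduces to set containment: collect what the parent (deployment) node provides and what each child component requires. A hedged sketch, with hypothetical dependency identifiers:

import java.util.HashSet;
import java.util.Set;

public class DependencyRules {
    // Pre-condition: a child component requires software dependencies that
    // its parent (deployment) node must provide.
    // Result: any dependency left in the returned set means the component
    // will not function as expected.
    static Set<String> unsupportedDependencies(Set<String> parentProvides,
                                               Set<String> childRequires) {
        Set<String> missing = new HashSet<>(childRequires);
        missing.removeAll(parentProvides);
        return missing;
    }

    public static void main(String[] args) {
        Set<String> node = Set.of("jre-1.5", "mysql-5.0");
        Set<String> component = Set.of("jre-1.5", "tomcat-5.5");
        System.out.println(unsupportedDependencies(node, component)); // [tomcat-5.5]
    }
}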
Voluminous Data Intensive Interaction Analysis • An extension-point implementation of the Level of Service Connector Selector • Distribution connector profiles (DCPs) • Data access, distribution, and streaming metadata [Mehta et al. 2000] captured for each profiled connector • Can be generated manually or using an automatic process • Distribution scenarios • Constraint queries phrased against the architectural vocabulary of data distribution • Total Volume • Number of Users • Number of User Types • Delivery Intervals • Data Types • Geographic Distribution • Access Policies • Performance Requirements
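To make this vocabulary concrete, here is a hedged sketch of a DCP and a distribution scenario as plain records; the field names mirror the dimensions listed above but are illustrative assumptions, not the tool's actual schema:

import java.util.Set;

public class DistributionModel {
    // A profiled connector's metadata (a DCP), reduced to a few fields
    static class ConnectorProfile {
        String name;                   // e.g., "bbFTP"
        Set<String> deliveryModes;     // data access / distribution / streaming
        long maxObservedVolumeGB;      // capacity hint from profiling
    }

    // A distribution scenario: one constraint value per dimension
    static class Scenario {
        long totalVolumeGB;
        int numUsers;
        int numUserTypes;
        int deliveryIntervals;
        Set<String> dataTypes;
        String geographicDistribution; // e.g., "WAN"
        Set<String> accessPolicies;
        String performanceRequirement;
    }
}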
Voluminous Data Intensive Interaction Analysis • Need to understand the relationship between the scenario dimensions and the connector metadata • If we understood this relationship, we would know which connectors to select for a given scenario • The current approach allows both Bayesian inference and linear equations as means of relating the connector metadata to the scenario dimensions • For our motivating example • 3 connectors, C1-C3 • Profiled 12 major OTS connector technologies • Including bbFTP, GridFTP, UDP-bursting technologies, FTP, etc. • Apply the selection framework to "rank" the most appropriate of the 12 OTS connector solutions for given example scenarios
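A hedged sketch of the Bayesian side of this idea: rank connectors by a naive Bayesian score, i.e., the prior P(connector) times the product of P(dimension value | connector) over the scenario dimensions. The probabilities below are made-up literals; in the framework they would come from the profiled connector metadata:

import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class BayesianSelector {
    // Naive Bayesian score: prior P(connector) times the product of
    // P(dimension value | connector) over the scenario dimensions.
    static double score(double prior, List<Double> likelihoods) {
        double s = prior;
        for (double l : likelihoods) s *= l;
        return s;
    }

    public static void main(String[] args) {
        Map<String, Double> ranking = new TreeMap<>();
        // Illustrative: a high-volume, many-user scenario with two dimensions
        ranking.put("GridFTP", score(1.0 / 12, List.of(0.9, 0.8)));
        ranking.put("FTP", score(1.0 / 12, List.of(0.3, 0.6)));
        ranking.entrySet().stream()
                .sorted(Map.Entry.<String, Double>comparingByValue().reversed())
                .forEach(e -> System.out.println(e.getKey() + " -> " + e.getValue()));
    }
}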
Voluminous Data Intensive Interaction Analysis • Precision-recall analysis • Evaluated the framework against 30 real-world data distribution scenarios • 10 high-volume, 9 medium-volume, and 11 low-volume scenarios • Used expert analysis to develop an "answer key" for the scenarios • Set of "right" connectors • Set of "wrong" connectors • Applied the Bayesian and linear-programming connector selection algorithms • Clustered the ranked connector lists using k-means clustering (k=2) to develop a comparable answer key for each algorithm • Bayesian selection algorithm: 80% precision; linear programming: 48% • The Bayesian algorithm is more "white box" • The linear algorithm is more "black box" • White box performed better
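For reference, a hedged sketch of the precision arithmetic used in such an evaluation (connector names and answer key are invented for illustration):

import java.util.HashSet;
import java.util.Set;

public class PrecisionRecall {
    // precision = |selected ∩ answer key| / |selected|
    static double precision(Set<String> selected, Set<String> rightAnswers) {
        Set<String> hits = new HashSet<>(selected);
        hits.retainAll(rightAnswers);
        return selected.isEmpty() ? 0.0 : (double) hits.size() / selected.size();
    }

    public static void main(String[] args) {
        Set<String> selected = Set.of("GridFTP", "bbFTP", "FTP"); // one algorithm's picks
        Set<String> answerKey = Set.of("GridFTP", "bbFTP");       // expert "right" set
        System.out.printf("precision = %.2f%n", precision(selected, answerKey)); // 0.67
    }
}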
Experiment 1 • Conducted in a graduate software engineering course on 8 projects • 6 of the projects were COTS-based applications • 2 web-based (3-tier) projects, 1 shared-data project, 1 client-server project, 1 web-service interaction project, and 1 single-user system • Teams applied the framework before the RLCA* milestone on their respective projects • Data collected using surveys • Immediately after the interoperability assessment • After the completion of the project * Rebaselined Life Cycle Architecture
Experiment 1 Results
[Results table not reproduced in this transcript]
* Accuracy of Dependency Assessment: 1 - (number of unidentified dependencies / total number of dependencies)
** Accuracy of Interface Assessment: 1 - (number of unidentified interface interaction mismatches / total number of interface interactions)
Accuracy: a quantitative measure of the magnitude of error [IEEE 1990]
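A worked example of these measures (numbers invented for illustration, not from the experiment): if 2 of 10 component dependencies go unidentified, dependency-assessment accuracy is 1 - 2/10 = 0.8; if 1 of 20 interface interactions harbors an unidentified mismatch, interface-assessment accuracy is 1 - 1/20 = 0.95.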
Conclusion and Future Work • Results (so far) indicate a "sweet spot" in small e-services projects • A framework-based tool automates the initial interoperability analysis: • Interface, internal assumption, and dependency mismatches • Further experimental analysis is ongoing • Different software development domains • Projects with greater COTS complexity • Additional quality-of-service extensions