Causal Models, Learning Algorithms and their Application to Performance Modeling

Causal Models, Learning Algorithms and their Application to Performance Modeling Jan Lemeire Parallel Systems lab November 15th 2006

Overview • I. Causal Models • II. Learning Algorithms • III. Performance Modeling • IV. Extensions Causal Performance Models

I. Multivariate Analysis • Variables • Experimental data Probabilistic model of joint distribution? Relational information? A priori unknown relations Causal Performance Models

A. Representation of distributions • Factorization • Reduction of factorization complexity • Bayesian Network Ordering 1 Ordering 2 Causal Performance Models

B. Representation of Independencies • Conditional independence • Qualitative property: P(rain|quality of speech)=P(rain)? • Markov condition in graph Variable becomes independent from all its non-descendants by conditioning on its direct parents. • graphical d-separation criterion Causal Performance Models

Faithfulness • Independence-map: • All independencies in the Bayesian network appear in the distribution • Faithfulness: Joint Distribution Directed Acyclic Graph Conditional independencies  d-separation • Theorem: if a faithful graph exists, it is the minimal factorization. Causal Performance Models

C. Representation of Causal Mechanisms Model of the underlying physical mechanisms • Definition through interventions • causal model + Conditional Probability Distributions + Causal Markov Condition = Bayesian network Causal Performance Models

Reductionism • Causal modeling = reductionism • Canonical representation: unique, minimal, independent • Building block = P(Xi|parents(Xi)) • Whole theory is based on this modularity • Intervention • = change of block Causal Performance Models

Ultimate motivation for causality If causal mechanisms are unrelated • model is faithful Model = canonical representation able to explain all qualitative properties (independencies) • close to reality Causal Performance Models

II. Learning Algorithms Two types: • Constraint-based based on the independencies • Scoring-based searches set of all models, give a score of how good they represent distribution Causal Performance Models

Step 1: Adjacency search • Property: adjacent nodes do not become independent • Algorithm: • start with full-connected graph • check for marginal independencies • check for conditional independencies Causal Performance Models

Step 2: Orientation • Property: • V-structure can be recognized • Algorithm: • look for v-structures • derived rules Causal Performance Models

Assumptions • General statistical assumptions: • No selection bias • Random sample • Sufficient data for correctness of statistical tests • Underlying network is faithful • Causal sufficiency • No unknown common causes Causal Performance Models

Criticism • Definition causality? • About predicting the effect of changes to the system • Faithfulness assumption • Eg.: accidental cancellation • Causal Markov Condition • “All relations are causal” • Learning algorithms are not robust • Statistical tests make mistakes Causal Performance Models

Part III: Performance Analysis • High-Performance computing parallel system 1 processor Performance Questions: • Performance prediction • Parameter-dependency? • Reasons of bad performance? • System-dependency? • Effect of Optimizations? Causal Performance Models

PhD?? Causal modeling (cf. COMO lab, VUB) • Representation form • Close to reality • Learning algorithms • TETRAD tool (open-source, java) Causal Performance Models

Performance Models • Aim performance analysis • Support software developer • High-performance applications • Expected properties • offer insight into causes performance degradation • prediction • estimate effect of optimizations • reusable submodels • separate application and system-dependency • reason under uncertainty • causal models Causal Performance Models

Integrated in statistical analysis • Statistical characteristics • Regression analysis • Probability table compression • Outlier detection Iterative process 1. Perform additional experiments 2. Extract additional characteristics 3. Indicate exceptions 4. Analyze the divergences of the data points with the current hypotheses Causal Performance Models

A. Model construction • Model of computation time of LU decom- position algorithm • elementsize (redundant variable) is sufficient for influence datatype -> cache misses • regression analysis on submodels X=f(parents) • analysis of parameters Causal Performance Models

B. Detection of unexpected dependencies • Point-to-point communication performance • background communication Causal Performance Models

C. Finding explanations for outliers Exceptional data in communication performance measurements Probability table compression => derived variable Interesting features Causal Performance Models

IV. Complexity of Performance Data • Mixture discrete and continuous variables • Mutual Information & Kernel Density Estimation • Non-linear relations • Mutual Information & Kernel Density Estimation • Deterministic relations • Augmented models & Complexity criterion • Context variables • Work in progress • Context-specific independencies • Work in progress Causal Performance Models

A. Information-theoretic Dependency • Discretized entropy for continuous variable • Entropy of random variable X • Mutual Information Causal Performance Models

B. Kernel Density Estimation • See applets Trade-off maximal entropy <> typicalness • Conclusions • Limited number data points needed • Discretization of continuous data justified • Form-free dependency measure Causal Performance Models

C. Deterministic relations • Y=f(X) • Y becomes independent from Z conditioned on X • ~ violation of the intersection condition (Pearl ’88) • Not faithfully describable Solution: augmented causal model - add regularity to model - adapt inference algorithms Causal Performance Models

The Complexity Criterion • X & Y contain equivalent information about Z • Select simplest relation Causal Performance Models

Augmented causal model • Restrict conditional independencies • Generalize d-separation • Reestablish faithfulness • Consistent models under Complexity Increase assumption { Causal Performance Models

Theory works! Deterministic B Probabilistic A Causal Performance Models

Conclusions • Benefit of the integration of statistical techniques • Causal modeling is a challenge • wants to know the inner from the outer • More information • http://parallel.vub.ac.be • http://parallel.vub.ac.be/~jan Causal Performance Models

Causal Models, Learning Algorithms and their Application to Performance Modeling

Causal Models, Learning Algorithms and their Application to Performance Modeling

Presentation Transcript

Causal Models for Regression Modeling Strategies

Performance Evaluation for Learning Algorithms

Tetrad: Machine Learning and Graphcial Causal Models

Causal Rasch Models

Mesopic vision models and their application

Modern Jet Algorithms and their performance

Models and Modeling

Simulation and Application on learning gene causal relationships

Causal Models for Performance Analysis of Computer Systems

Learning Styles and their application to StudyNet

Dynamic Causal Models

Modern Jet Algorithms and their performance

Dynamic Causal Models

Application Performance Analysis and Modeling

Graphical Causal Models

Causal Modeling

Causal Inference and Graphical Models

Learning Styles and their application to StudyNet

Dynamic Causal Models

Meshless wavelets and their application to terrain modeling

Discovering Causal Models