1 / 69

Evolutionary Computation

Evolutionary Computation. Genetic Algorithms Genetic Programming Learning Classifier Systems. Genetic Algorithms. Population-based technique for discovery of knowledge structures Based on idea that evolution represents search for optimum solution set Massively parallel.

Download Presentation

Evolutionary Computation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Evolutionary Computation Genetic Algorithms Genetic Programming Learning Classifier Systems

  2. Genetic Algorithms • Population-based technique for discovery of knowledge structures • Based on idea that evolution represents search for optimum solution set • Massively parallel

  3. The Vocabulary of GAs • Population • Set of individuals, each represented by one or more strings of characters • Chromosome • The string representing an individual

  4. The vocabulary of GAs, contd. • Gene • The basic informational unit on a chromosome • Allele • The value of a specific gene • Locus • The ordinal place on a chromosome where a specific gene is found

  5. Thus...

  6. Genetic operators • Reproduction • Increase representations of strong individuals • Crossover • Explore the search space • Mutation • Recapture “lost” genes due to crossover

  7. Genetic operators illustrated...

  8. GAs rely on the concept of “fitness” • Ability of an individual to survive into the next generation • “Survival of the fittest” • Usually calculated in terms of an objective fitness function • Maximization • Minimization • Other functions

  9. Genetic Programming • Based on adaptation and evolution • Structures undergoing adaptation are computer programs of varying size and shape • Computer programs are genetically “bred” over time

  10. The Learning Classifier System • Rule-based knowledge discovery and concept learning tool • Operates by means of evaluation, credit assignment, and discovery applied to a population of “chromosomes” (rules) each with a corresponding “phenotype” (outcome)

  11. Components of a Learning Classifier System • Performance • Provides interaction between environment and rule base • Performs matching function • Reinforcement • Rewards accurate classifiers • Punishes inaccurate classifiers • Discovery • Uses the genetic algorithm to search for plausible rules

  12. The Learning Classifier System • Rule-based knowledge discovery and concept learning tool • EpiCS • First Learning Classifier System designed for use in epidemiologic surveillance • Supervised learning environment

  13. Knowledge Representation • Classifiers • IF-THEN rules • Condition=“genotype” • Action=“phenotype” • Strength metric • Encoded as bit strings or numerics • Population • Fixed size collection of classifiers

  14. Low-level knowledge representation:The Classifier • Taxon is analogous to a condition (LHS) of an IF-THEN rule • Action bit is analogous to an action (RHS) of an IF-THEN rule • Strength is an internal fitness function

  15. High-level knowledge representation:Macrostate Population

  16. Components of a learning classifier system • Performance • Provides interaction between environment and classifier population • Performs matching function • Reinforcement • Rewards accurate classifiers • Punishes inaccurate classifiers • Discovery • Uses the genetic algorithm to search for plausible knowledge structures

  17. Generic Machine Learning Model

  18. A Generic Learning Classifier System

  19. EpiCS: A Learning Classifier System

  20. EpiCS: Performance Component

  21. Performance component • Creates a subset (the matchset, [M]) of all classifiers in population [P] whose conditions match a string received from the environment • From [M], a single classifier is selected, based on its strength as a proportion of the sum of all strengths in [M] • The action of this classifier is then used as the output of the system

  22. EpiCS: Reinforcement Component

  23. Reinforcement component • Correct set [C] is created from classifiers in [M] advocating correct decisions • Remaining classifiers in [M] form Not[C] • Tax is deducted from the strengths of all classifiers in [C] • Reward is added to the strengths of all classifiers in [C], biased for generality • Penalty is deducted from the strengths of all classifiers in Not[C]

  24. EpiCS: Discovery Component

  25. Discovery component • Genetic algorithm invoked once per iteration • One new offspring is created, from parents deterministically selected based on strength • The single offspring replaces weakest classifier in the population

  26. Features of EpiCS • Object-oriented implementation • Stimulus-response architecture • Payoff/Penalty reinforcement regime • Syntactic control of overgeneralization • Differential penalty control of undergeneralization • Ability to compute risk of outcome

  27. Discovering risk with EpiCS • Output decision of the learning classifier system is probability of disease (CSPD), rather than dichotomous decision • CSPD determined from proportion of classifiers matching a given input case’s taxon

  28. Discovering risk with EpiCS: The specifics

  29. Discovery of Predictive Models in an Injury Surveillance Database:An Application of Data Mining in Clinical Research

  30. Partners for Child Passenger SafetyInformation Infrastructure

  31. Why data mining is needed for PCPS • Large number of raw and derived variables renders traditional “manual” methods for discovering patters in data unwieldy • Hypothesis-driven (biased) analyses may lead to missed associations • Constantly changing patterns in prospective data require constantly changing analytic approaches that can be informed by data mining

  32. Candidate Predictors • Demographics • Kinematics • Characteristics of crash • Restraint use

  33. Outcome: Head Injury • Major burns involving the head • Skull fracture • Evidence of brain injury reported by respondent • Excessive sleepiness • Difficulty in arousing • Unresponsiveness • Amnesia after accident

  34. Data Preparation • Pool of 8,334 records • 20 separate datasets created • All cases of head injury included (N=415) • Equal number of non-head injury cases randomly drawn from pool • Each dataset randomly sampled to create mutually exclusive training and testing sets of equal size

  35. Comparison methods:Logistic Regression • Variables from training sets stepped into model to determine significant terms • Significant terms used to create new risk model: • Risk model applied to cases in testing set • Risk estimates categorized by deciles and used construct ROC curves

  36. Comparison Methods:Decision Tree Induction • C4.5 used to create decision trees from training sets • 10-fold cross-validation used to optimize trees • Optimized trees used by C4.5RULES to classify cases in testing set

  37. Experimental Procedure

  38. Results: Training

  39. Results: Training • EpiCS • 5,000 unique classifiers reduced to 2,314 by the end of training • Logistic regression • Single model with eight significant terms, no significant interactions • C4.5 • 11 rules created for each training set, most with single conjuncts

  40. Results: Prediction Area under the ROC curve obtained on testing, averaged over the 20 separate studies

  41. And now for something a little different The XCS model

  42. XCS: A little history • Wilson, SW: Evolutionary Computation, 2(1), 1-18 (1994) • ZCS • Wilson, SW: Evolutionary Computation, 3(2), 149-175 (1995) • The seminal work on XCS • Many papers by Lanzi, Barry, Butz, and others • Butz, M and Wilson, SW: Advances in Learning Classifier Systems. Third International Workshop (IWLCS-2000), Lecture Notes in Artificial Intelligence (LNAI-1996). Berlin: Springer-Verlag (2001) • The algorithm paper

  43. What is XCS? • An LCS that differs from traditional Holland model • Classifier fitness is based on the accuracy of the classifiers payoff prediction, rather than the prediction itself • The genetic algorithm is restricted to niches in the action set, rather than applied to the classifier population as a whole • The major feature is graceful, accurate generalization

  44. XCS in a nutshell ((43*99)+(27*3))/102 Action: 00 Action: 01 Source: Wilson, XCS tutorial

  45. EpiXCS: An XCS-Based Learning Classifier System for Epidemiologic Research

  46. Outline • What is it? • EpiXCS architecture • Data encoding • Evaluation metrics • Reinforcement • Missing values handling • Classifier ranking • Risk assessment • Test case: Pima Indians Diabetes Data

  47. What is EpiXCS? • Learning classifier system based on the XCS paradigm • Uses the Lanzi C++ kernel • Designed for use in epidemiologic research, specifically mining disease surveillance databases in supervised learning environments • Visualization by non-LCS users • Sensitive to demands of clinical data

  48. Data Encoding in EpiXCS • All numeric data formats permissible • Binary • Categorical • Ordinal • Real • Non-binary data represented using “center-spread” approach • Two genes per feature • Actions are limited to binary (for now)

  49. Sample input data format(Pima Indians Diabetes Database) ATTRIBUTE 0 <WILD "99"><REAL><STRING "Clump Thickness"> ATTRIBUTE 1 <WILD "99"><REAL><STRING "Uniformity of Cell Size"> ATTRIBUTE 2 <WILD "99"><REAL><STRING "Uniformity of Cell Shape"> ATTRIBUTE 3 <WILD "99"><REAL><STRING "Marginal Adhesion"> ATTRIBUTE 4 <WILD "99"><REAL><STRING "Single Epithelial Cell Size"> ATTRIBUTE 5 <WILD "99"><REAL><STRING "Bare Nuclei"> ATTRIBUTE 6 <WILD "99"><REAL><STRING "Bland Chromatin"> ATTRIBUTE 7 <WILD "99"><REAL><STRING "Normal Nucleoli"> ATTRIBUTE 8 <WILD "99"><REAL><STRING "Mitoses"> ACTION 9 <STRING "Malignant"> 5 4 4 5 7 10 3 2 1 0 3 1 1 1 2 2 3 1 1 0 8 10 10 8 7 10 9 7 1 1 …

  50. Classifier Population Initialization • Minima and maxima for each attribute determined automatically at start of run • Center values can be initialized by user • Mean • Median • Random value between spread • Spread values can be initialized by user • Standard deviation • Quantile

More Related