150 likes | 275 Views
Fuzzy rule-based system derive d from similarity to prototypes. Plan. What is it all about? Fuzzy rule systems and prototype rule based systems. From prototype rules to fuzzy rules and vice versa, with examples. Results of applications on real datasets. Conclusions. Motivation.
E N D
Fuzzy rule-based system derived from similarity to prototypes
Plan • What is it all about? • Fuzzy rulesystems and prototype rule based systems. • From prototype rules to fuzzy rules and vice versa, with examples. • Results of applications on real datasets. • Conclusions.
Motivation • Understanding data, situations, recognizing objects or making diagnosis people frequently use similarity to known cases, and rarely use logical reasoning, but soft computing experts use logic instead of similarity ... • Relations between similarity and logic are not clear. • Q1: How to obtain the same decision borders in Fuzzy Logic systems and Prototype Rule Based systems? • Q2: What type of similarity measure corresponds to a typical fuzzy functions and vice versa? • Q3: How to transform one type of a system into another type preserving their decision borders? • Q4: Are there any advantages of such transformations? • Q5: Can we understand data better using prototypes instead of logical rules?
Fuzzy Rule BasedSystem Learning process includes: • for each feature, select shapes of membership functions and the number of these functions; • optimize parameters of the membership functions (such as positions and spreads) using training data; • aggregate input information and calculate final rule activations for each category; • assign membership degrees to output classes; • write the set of F-rules and interpret them.
Prototype Rule Based System Learning process involves: • specify the number and positions of prototypes; • select similarity or dissimilarity (distance) functions (we use distance functions); • calculate distance (similarity) to each prototype; • assign P-rule to the output class as a rule; choices are: If P=argminp’(D(X,P’)) Then Class(X)=Class(P) This is a nearest prototype rule, similar to the fuzzy logic rule: If R=maxk MembFk(X) Then Class(X)<=Class(R) Another form of P-rules is based on similarity threshold: If D(X,P)≤dp Then C Taking D(X,P) distance crisp logic rules are obtained
Advantages of prototype based rules • Inspired by cognitive psychology: it may be easier to understand prototypes and similarity than fuzzy rules • P-rules may be defined for nominal features using probabilistic distance measures (such as VDM), while F-rules require numerical inputs. • Many algorithms for prototype selection and optimization exist but they have not been applied to understand data and their relation to fuzzy rules have not been explored; • Applications of P-rules to real datasets give excellent results generating small number of prototypes.
Value Difference Matrix (VDM) VDM – probability difference measure for 1 attribute for many attributes VDM measure can be also applied for continuous features, in the simplest way using discretization and interpolation, or other probability estimation techniques (Gaussian smoothing, Parzen windows, etc).
P-rules F-rules Condition: preserve classification borders Q: how are membership functions and distancefunctions related? Can one obtain new, interesting membership functions from known distance functions and vice versa? For all additive distance functions exp transformationchanges distances D of P-rules into products of MF of F-rules: MF=exp(-D) Example:Euclidean distance is equivalent to Gaussian MFs Algebraic (product) T-norm is obtained with Gaussian MFs
Visualization Euclidean distance function Square of Canberra distance function
VDM distance => membership functions DVDM distance function IVDM distance function
Inverse transformation For all product T-norm D = -ln(F) Advantages: New type of distance functions are generated. Example: distances generated from triangular functions.
Applications to real data • Gene expression data for 2 types of leukaemia (Golub et al, Science 286 (1999) 531-537 Description: 2 classes, 1100 features, 3 most relevant selected. Used methods: 1 prototype/class LVQ, DVDM similarity measure. Results (number of misclassified vectors): • Searching for Promoters in DNA strings Description: 2 classes, 57 features, all symbolic features. Used methods: 9 prototypes for promoters, 12 for nonpromoters, generated using C-means + LVQ, with VDM similarity measure. Results: 5 misclassified vectors in leave one out test.
Conclusions • First step in understanding relations between fuzzy and similarity-based systems was made. • Prototype rules can be expressed using fuzzy rules and vice versa. • New possibilities in both fields: • new type of membership functions; • new type of distance functions; • VDM measure used in P-rules leads to a natural shape of membership functions in fuzzy logic for symbolic data. • Expert knowledge can be captured in both types of rules, but sometimes it is easier to express as P-rules and sometimes as F-rules. • Many open problems remain.
Thank Youfor lending your ears ... Speaker: Marcin Blachnik