510 likes | 686 Views
Algorithmic Facets of Human Centricity in Computing with Fuzzy Sets. Witold Pedrycz Department of Electrical & Computer Engineering University of Alberta, Edmonton, Canada and Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland. pedrycz@ee.ualberta.ca.
E N D
Algorithmic Facets of Human Centricity in Computing with Fuzzy Sets Witold Pedrycz Department of Electrical & Computer Engineering University of Alberta, Edmonton, Canada and Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland pedrycz@ee.ualberta.ca ISDA-2009, Pisa, Italy, November 30-December 2, 2009
Agenda Human centricity and information granules Design of information granules – approaches of knowledge-based clustering Granular representation of computing with fuzzy sets
Human Centricity and information granules Information granules as conceptual entities inherently associated with human pursuits (decision-making, perception control, prediction) Interaction with and processing in intelligent systems realized at the level of information granules (fuzzy sets, rough sets, intervals…) Emergence of Human-Centric computing (HC2) Knowledge sharing and collaboration in distributed systems
Human Centricity and fuzzy sets Two fundamental quests: Construction of information granules (fuzzy sets); use of existing experimental evidence and its interpretation Cast in the framework of users/designer Qualitative, user-centric interpretation of results of computing with fuzzy sets
Clustering as a conceptual and algorithmic framework of information granulation Data information granules (clusters) abstraction of data Formalism of: set theory (K-Means) fuzzy sets (FCM) rough sets shadowed sets
Main categories of clustering Graph-oriented and hierarchical (single linkage, complete linkage, average linkage..) Objective function-based clustering Diversity of formalisms and optimization tools (e.g., methods of Evolutionary Computing)
Key challenges of clustering Data-driven methods Selection of distance function (geometry of clusters) Number of clusters Quality of clustering results
The dichotomy and the shift of paradigm supervised learning unsupervised learning Human-centricity Guidance mechanisms
Fuzzy Clustering: Fuzzy C-Means (FCM) Given data x1, x2, …, xN, determine its structure by forming a collection of information granules – fuzzy sets Objective function Minimize Q; structure in data (partition matrix and prototypes)
Fuzzy Clustering: Fuzzy C-Means (FCM) Vi– prototypes U- partition matrix
FCM – optimization Minimize subject to (a) prototypes (b) partition matrix
Domain Knowledge: Category of knowledge-oriented guidance Context-based guidance: clustering realized in a certain context specified with regard to some attribute Viewpoints: some structural information is provided Partially labeled data: some data are provided with labels (classes) Proximity knowledge: some pairs of data are quantified in terms of their proximity (resemblance, closeness)
Clustering with domain knowledge(Knowledge-based clustering)
Context-based clustering Clustering : construct clusters in input space X Active role of the designer [customization of processing] Context-based Clustering : construct clusters in input space X given some contextexpressed in output space Y
Context-based clustering: Conmputational considerations structure structure context Data Data • computationally more efficient, • well-focused, • designer-guided clustering process
Context-based clustering: focus mechanism Determine structure in input space given the output is high Determine structure in input space given the output is medium Determine structure in input space given the output is low Input space (data)
Context-based clustering: examples Find a structure of customer data [clustering] Find a structure of customer data considering customers making weekly purchases in the range [$1,000 $3,000] Find a structure of customer data considering customers making weekly purchases at the level of around $ 2,500 Find a structure of customer data considering customers making significant weekly purchases who are young no context context context context (compound)
Context-oriented FCM Data (xk, targetk), k=1,2,…,N Contexts: fuzzy sets W1, W2, …, Wp wjk = Wi(targetk) membership of j-th context for k-th data Context-driven partition matrix
Context-oriented FCM: Optimization flow Objective function Subject to constraint U in U(Wj) Iterative adjustment of partition matrix and prototypes
Viewpoints: definition Description of entity (concept) which is deemed essential in describing phenomenon (system) and helpful in casting an overall analysis in a required setting “external” , “reinforced” clusters
Viewpoints: definition viewpoint (a,b) viewpoint (a,?)
Viewpoints: definition Description of entity (concept) which is deemed essential in describing phenomenon (system) and helpful in casting an overall analysis in a required setting “external” , “reinforced” clusters
Viewpoints in fuzzy clustering B- Boolean matrix characterizing structure: viewpoints prototypes (induced by data)
Viewpoints in localization of “extreme” information granules specification of viewpoints through evolutionary/population-based optimization
Labelled data and their description Characterization in terms of membership degrees: F = [fik] i=12,…,c , k=1,2, …., N supervision indicator b = [bk], k=1,2,…, N
Proximity hints Prox(k,l) Prox(s,t) Characterization in terms of proximity degrees: Prox(k, l), k, l=1,2, …., N and supervision indicator matrix B = [bkl], k, l=1,2,…, N
Proximity measure • Properties of proximity: • Prox(k, k) =1 • Prox(k,l) = Prox(l,k) Proximity induced by partition matrix U: Linkages with kernel functions K(xk, xl)
Two general development strategies SELECTION OF A “MEANINGFUL” SUBSET OF INFORMATION GRANULES
Two general development strategies (1) HIERARCHICAL DEVELOPMENT OF INFORMATION GRANULES (INFORMMATION GRANULES OF HIGHER TYPE) Information granules Type -2 Information granules Type -1
Two general development strategies (2) HIERARCHICAL DEVELOPMENT OF INFORMATION GRANULES AND THE USE OF VIEWPOINTS viewpoints Information granules Type -2 Information granules Type -1
Two general development strategies (3) HIERARCHICAL DEVELOPMENT OF INFORMATION GRANULES – A MODE OF SUCCESSIVE CONSTRUCTION
Fuzzy Computing: Interpretability Interpretation of fuzzy sets - departure from pure numeric quantification of membership grades A= [0.11 0.19 0.34 0.45 1.00 0.98 0.821 0.447…]
Granulation of fuzzy sets Granulation of membership grades low, high, medium membership of alternative x Granulation of membership grades and universe of discourse low membership for a collection of alternatives….
Granulation of membership grades A= [L L L M M L L M…] A= [0.11 0.19 0.34 0.45 1.00 0.98 0.821 0.447…]
Granulation of membership grades: optimization A= [L L L M M L L M…] G= {G1, G2, …, Gc} Entropy minimization
Granulation of fuzzy sets A= [L M L M…]
Granulation of fuzzy sets: optimization Gc Gi Wi Wc G1 W1
Interpretability of fuzzy set computing Granulation of fuzzy sets Interpretability layer Fuzzy set computing
Interpretability of fuzzy set computing Granulation of fuzzy sets Interpretability layer Fuzzy set computing
Interpretability of fuzzy set computing • Equivalence sought with respect with assumed level • interpretability: • stability • Equivalence of models Non-distinguishability distinguishability
Fuzzy set computing: a retrospective interpretability ~1970 evolutionary after ~1990 Rule-based neurofuzzy accuracy