150 likes | 168 Views
Multivalued type dissimilarity measure and concept of mutual dissimilarity value for clustering symbolic patterns. Advisor : Dr. Hsu Presenter : Wen-Hsiang Hu Authors : D.S. Guru*; Bapu B. Kiranagi;. Pattern Recognition Society. Published by Elsevier Ltd, 2004, Pages:151 - 156.
E N D
Multivalued type dissimilarity measure and concept of mutual dissimilarity value for clustering symbolic patterns Advisor : Dr. Hsu Presenter : Wen-Hsiang Hu Authors :D.S. Guru*; Bapu B. Kiranagi; Pattern Recognition Society. Published by Elsevier Ltd, 2004, Pages:151 - 156
Outline • Motivation • Objective • Introduction • Dissimilarity Measure • Modified Agglomerative clustering technique by introducing the concept of MDV • Experiments • Conclusion • Personal Opinion
Motivation • For methods [2-5], the degree of proximity between two symbolic patterns is assumed to be crisp and symmetric. • However, the proximity can, in general, be expected either to lie within a certain range or to be an instance of multivalued type in addition to being non-symmetric. A special instance of crisp, non-symmetric and Boolean.
Objective • In order to work on such unconventional proximity value.
Introduction • We present a novel dissimilarity measure to estimate the degree of dissimilarity between two symbolic patterns. • The proposed measure unlike other methods [2–5] approximates degree of dissimilarity by multivalued type data and in addition, it is non-symmetric. [5] [2] [3][4]
A novel dissimilarity measure • Let Fi and Fj be two symbolic patterns described by n interval valued features as follows: • For example:
A novel dissimilarity measure (cont.) • The degree of dissimilarity of Fi to Fj , with respect to the kth feature (irrespective of overlapping or no overlapping) is characterized by Fik Fjk
Modified agglomerative clustering technique by introducing the concept of MDV • The “Mutual dissimilarity value” between two patterns is defined to be the magnitude of the vector, which is the sum of the scalar times of the vectors representing the degree of dissimilarity between the patterns. i.e. • MDV is derived by the use of non-symmetric values Di→j and Dj→i with different weight factors α and β. • If Di→j and Dj→i are one and the same (as in conventional techniques), the weight factors do not convey any meaning. where α and β are scalars (weights).
Modified agglomerative clustering technique by introducing the concept of MDV(cont.) • Initially m clusters
Experiments • For sake of simplicity the weight factors α and β are set to 1. [5] [2] [3] [4] [5] [2] [3][4]
Conclusion • In this paper, a novel dissimilarity measure and a modified clustering algorithm by introducing the concept of MDV is proposed for clustering symbolic patterns. • The proposed method is experimentally validated for its efficacy and is shown to have high consistency with human perception.
Conclusion (cont.) • Our method bears the following characteristics: • It is simple and computationally efficient. • It can be employed on quantitative, qualitative and multivalued qualitative symbolic data types. • It is non-parametric. • Being based on MDV it works on multivalued type proximity matrix.
Personal Opinion • Drawback • Does not explain what is suitable weight factors (αandβ)for obtaining a better cluster of symbolic patterns • Application • Apply dissimilarity to ART