20 likes | 192 Views
Open source clustering software. Clustering extension module for Python-language Can be used in association with Python-language to perform clustering routines Uses C Clustering Library Manual can be downloaded from following link The C Clustering Library
E N D
Open source clustering software • Clustering extensionmodule for Python-language • Canbeused in association withPython-language to perform clustering routines • Uses C Clustering Library • Manualcanbedownloadedfromfollowinglink • The C Clustering Library • The University of Tokyo, Institute of Medical Science, Human Genome Center • “implement the most commonly used clustering methods for gene expression data analysis” PyCluster
PyCluster The clustering algorithms are: • Hierarchical clustering (pairwisecentroid-, single-, complete-, and average-linkage) • k-means clustering • Self-Organizing Maps • Principal Component Analysis. To measure the similarity or distance between gene expression data, eight distance measures are available: • Pearson correlation • Absolute value of the Pearson correlation • UncenteredPearson correlation (equivalent to the cosine of the angle between two data • vectors) • Absolute uncentered Pearson correlation (equivalent to the cosine of the smallest angle • between two data vectors) • Spearman's rank correlation • Kendall's ¿ • Euclidean distance; • City-block distance.