1 / 15

Extending the Growing Hierarchal SOM for Clustering Documents in Graphs domain

Extending the Growing Hierarchal SOM for Clustering Documents in Graphs domain. Presenter : Cheng- Hui Chen Authors : Mahmoud F. Hussin , Mahmoud R. farra and Yasser El- Sonbaty IJCNN, 2008. Outlines. Motivation Objectives Methodology Experiments Conclusions Comments.

mostyn
Download Presentation

Extending the Growing Hierarchal SOM for Clustering Documents in Graphs domain

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Extending the Growing Hierarchal SOM for Clustering Documents in Graphs domain Presenter : Cheng-HuiChen Authors : Mahmoud F. Hussin, Mahmoud R. farra and Yasser El-Sonbaty IJCNN, 2008

  2. Outlines • Motivation • Objectives • Methodology • Experiments • Conclusions • Comments

  3. Motivation river mild rafting • The variants of SOM are limited by the fact that they use only the VSM for document representations. • It does not represent any relation between the words. • The space complexityto the VSM. • The sentences being broken down into their individual components without any representation of the sentence structure.

  4. river Objectives rafting mild Using graphs to represent documents helped the salient features of data through using edges to represent relations and using vertices to represent words. The decrease the space complexity comparing to the VSM. The extend the GHSOM to work in the graph domain to enhance the quality of clusters.

  5. Methodology

  6. Enhance the DIG to work with G-GHSOM river 3 • The Document Index Graph (DIG) model • the DIG for representing the document and Exploited it in the document clustering. • For example (the document table of word "river" is shown) • River rafting. (doc1) • mild river rafting. (doc2) • River fishing. (doc3)

  7. Enhance the DIG to work with G-GHSOM 1 2 2 1 3 3 4 4

  8. Enhance the DIG to work with G-GHSOM • Single-word similarity measure • Two document vectors similarity measure • The total similarity is the integration

  9. Ment of the G-GHSOM to work with graph • Neuron Initialization • Detecting the matching list to calculate the phrase based similarity.

  10. Ment of the G-GHSOM to work with graph

  11. Ment of the G-GHSOM to work with graph Gin

  12. Experiments • Data set • Reuter’s news articles (RNA) • University of Waterloo and Canadian Web sites (UW-CAN)

  13. Experiments

  14. Conclusions • The extend the GHSOM to a new graph based GHSOM: (G-GHSOM) to enhance the quality of the document clustering. • G-GHSOM works successfully with graph domain and achieves a better quality clustering than TGHSOM in document clustering. • The enhanced the DIG model to work with GHSOM algorithm.

  15. Comments • Advantages • Enhance the quality of the document clustering • Application • SOM • Clustering

More Related