1 / 11

Concept Detection

Concept Detection. Amir R. Tahamtan. Concept Detection. Goals: discover knowledge, find associations. Discussed Techniques: Concept Mining , Document Clustering Related works: Keyword-based search , Resource discovery, Wrapper information extraction, Web queries, User preferences.

helga
Download Presentation

Concept Detection

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Concept Detection Amir R. Tahamtan Web Information Extraction

  2. Concept Detection • Goals:discover knowledge, find associations. • Discussed Techniques: Concept Mining, Document Clustering • Related works: Keyword-based search , Resource discovery, Wrapper information extraction, Web queries, User preferences Web Information Extraction

  3. Fu Y., Bauer T., Mostafa J., Palakal M., and Mukhopadhyay S (2002): Concept Extraction and Association from Cancer Literature. Proceedings of the 4th international workshop on Web information and data management. McLean, Virginia, USA. • Introduction • Algorithm • Experiments & Conclusion Web Information Extraction

  4. The Algorithm • Token discovery tf.idf : Wik= tik X log(N/nk) • LSA • Data representation as a term-doc matrix • Factoriziation : Xtx0 = Ttxr.Srxr.Orxo • Approximation : Xtx0 ˜X´tx0 = Ttxk.Skxk.Okxo • Token Association Discovery Web Information Extraction

  5. Web Information Extraction

  6. Web Information Extraction

  7. Liu B., Chin CW., Ng HAT (2003): Mining Topic-Specific Concepts and Definitions on the Web. Proceedings of the twelfth international conference on World Wide Web. Budapest, Hungary. • Introduction • The proposed Technique • System Architecture • Experiments & Conclusion Web Information Extraction

  8. The Proposed Technique • Algorithm Weblearn (T) • Subtopic Discovery • Definition Finding • Dealing with Ambiguity • Mutual Reinforcement Web Information Extraction

  9. System Architecture Web Information Extraction

  10. Web Information Extraction

  11. THANK YOU ! Web Information Extraction

More Related