Concept Detection. Amir R. Tahamtan. Goals: discover knowledge, find associations. Discussed techniques: Concept Mining, Document Clustering. Related work: keyword-based search, resource discovery, wrapper information extraction, Web queries, user preferences.
Concept Detection Amir R. Tahamtan Web Information Extraction
Concept Detection
• Goals: discover knowledge, find associations
• Discussed techniques: Concept Mining, Document Clustering
• Related work: keyword-based search, resource discovery, wrapper information extraction, Web queries, user preferences
Fu Y., Bauer T., Mostafa J., Palakal M., and Mukhopadhyay S. (2002): Concept Extraction and Association from Cancer Literature. Proceedings of the 4th International Workshop on Web Information and Data Management. McLean, Virginia, USA.
• Introduction
• Algorithm
• Experiments & Conclusion
The Algorithm
• Token discovery with tf.idf: $W_{ik} = t_{ik} \times \log(N / n_k)$
• LSA
  • Data representation as a term-doc matrix
  • Factorization: $X_{t \times o} = T_{t \times r} \, S_{r \times r} \, O_{r \times o}$
  • Approximation: $X_{t \times o} \approx X'_{t \times o} = T_{t \times k} \, S_{k \times k} \, O_{k \times o}$
• Token Association Discovery
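The tf.idf weighting and the rank-k LSA approximation on this slide can be sketched in a few lines of NumPy. This is an illustrative sketch, not the authors' implementation. It assumes the usual reading of the tf.idf formula (t_ik is the count of term k in document i, N is the number of documents, n_k is the number of documents containing term k) and uses the singular value decomposition for the factorization. The function names `tfidf` and `lsa_approx` and the toy count matrix are hypothetical.

```python
import numpy as np


def tfidf(counts: np.ndarray) -> np.ndarray:
    """Return W with W[i, k] = t[i, k] * log(N / n[k]).

    counts[i, k] is assumed to be the count of term k in document i.
    """
    n_docs = counts.shape[0]                     # N
    doc_freq = np.count_nonzero(counts, axis=0)  # n_k for every term k
    idf = np.zeros(counts.shape[1])
    seen = doc_freq > 0                          # skip terms that never occur
    idf[seen] = np.log(n_docs / doc_freq[seen])
    return counts * idf


def lsa_approx(X: np.ndarray, k: int) -> np.ndarray:
    """Rank-k approximation X' = T_k S_k O_k of a term-document matrix X."""
    T, s, O = np.linalg.svd(X, full_matrices=False)
    return T[:, :k] @ np.diag(s[:k]) @ O[:k, :]


# Toy data (made up): 3 documents (rows) and 4 terms (columns).
counts = np.array([
    [2.0, 1.0, 0.0, 0.0],
    [1.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 2.0, 1.0],
])

W = tfidf(counts)           # document-by-term weights
X = W.T                     # term-doc matrix, as on the slide
X_k = lsa_approx(X, k=2)    # keep the 2 largest singular values
```

Keeping only the k largest singular values discards the weakest dimensions of the term-doc matrix, which is what lets LSA place related tokens close together even when they rarely co-occur in the same document.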
Liu B., Chin C.W., Ng H.T. (2003): Mining Topic-Specific Concepts and Definitions on the Web. Proceedings of the Twelfth International Conference on World Wide Web. Budapest, Hungary.
• Introduction
• The Proposed Technique
• System Architecture
• Experiments & Conclusion
The Proposed Technique
• Algorithm WebLearn (T)
• Subtopic Discovery
• Definition Finding
• Dealing with Ambiguity
• Mutual Reinforcement
System Architecture
THANK YOU!