Information Extraction: concluding remarks
Sunita Sarawagi
Points to Emphasize
• Multidisciplinary: machine learning, databases, web, information retrieval
• Entity extraction
• Rule-based systems: manual coding is being replaced by rule learning
• Statistical methods: feature-based, and particularly helpful where rule-based extractors are too brittle
• Relationship extraction (preliminary, 2008)
• Clues from the extracted text, the surrounding text, POS tags, and dependency graphs
• Seed-based bootstrapping
• Practical issues: performance, management, uncertainty, data integration
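The seed-based bootstrapping point can be illustrated with a toy loop. This is only a minimal sketch under stated assumptions, not a method taken from the slides: the function name `bootstrap`, the capitalized-word heuristic for candidate arguments, and the 30-character limit on the connecting text are all hypothetical choices made for illustration. Starting from a few seed pairs, the loop collects the text that connects them in a corpus, then reuses those connecting strings as patterns to harvest new pairs.

```python
import re


def bootstrap(corpus, seeds, rounds=2):
    """Toy seed-based bootstrapping for a binary relation.

    corpus: list of sentences (strings).
    seeds:  set of (arg1, arg2) pairs known to be in the relation.
    Returns the grown set of pairs and the connecting patterns found.
    """
    pairs = set(seeds)
    patterns = set()
    for _ in range(rounds):
        # Step 1: induce patterns, i.e. the text between the two
        # arguments of every pair we currently believe in.
        for sentence in corpus:
            for arg1, arg2 in list(pairs):
                m = re.search(
                    re.escape(arg1) + r"(.{1,30}?)" + re.escape(arg2),
                    sentence,
                )
                if m:
                    patterns.add(m.group(1))
        # Step 2: apply every pattern to find new candidate pairs.
        # Assumption: arguments are single capitalized words.
        for sentence in corpus:
            for middle in patterns:
                regex = r"([A-Z]\w+)" + re.escape(middle) + r"([A-Z]\w+)"
                for m in re.finditer(regex, sentence):
                    pairs.add((m.group(1), m.group(2)))
    return pairs, patterns


if __name__ == "__main__":
    example_corpus = [
        "Paris is the capital of France.",
        "Berlin is the capital of Germany.",
        "Tokyo lies in Japan.",
    ]
    found_pairs, found_patterns = bootstrap(example_corpus, {("Paris", "France")})
    print(sorted(found_pairs))
    print(sorted(found_patterns))
```

The sketch accepts every pattern and every match. A practical extractor would also need to score patterns and filter noisy matches, which ties back to the uncertainty and accuracy concerns in these slides.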
Points to Emphasize
• Accuracy remains the primary concern
• “The time is ripe now for … more exciting and useful work …”