430 likes | 575 Views
Research Introspection “ICML does ICML”. Andrew McCallum Computer Science Department University of Massachusetts Amherst. Relational Modeling of the Research Literature & other Entities. Better understand structure of our own research area. Tools to help us learn a new sub-field.
E N D
Research Introspection“ICML does ICML” Andrew McCallum Computer Science Department University of Massachusetts Amherst
Relational Modeling of theResearch Literature & other Entities • Better understand structure of our own research area. • Tools to help us learn a new sub-field. • Aid collaboration • Map how ideas travel through social networks of researchers. • Aids for hiring and finding reviewers! • Many opportunities for rich relational learning • ... in a domain we understand well.
Previous Systems Cites Research Paper
More Entities and Relations Expertise Cites Grant Research Paper Person Venue University Groups
Extract metadata (title, authors, abstract, venue, citations; 14 fields in total) Convert to text (with layout & format) Topic Analysis & other Data Mining Reference resolution (of papers, authors & grants) Browsable Web Interface Spider Web for PDFs Rexa System Overview NSF grant DB WWW Discriminativelytrainedgraph partitioning (competition-winningaccuracy) Home-grownJava+MySQL (~1m PDF/day) Enhancedps2text (better word stiching,plus layout in XML) ConditionalRandom Fields (99% word accuracy)
From Text to Actionable Knowledge Spider Filter Data Mining IE Segment Classify Associate Cluster Discover patterns - entity types - links / relations - events Database Documentcollection Actionableknowledge Prediction Outlier detection Decision support
Joint Inference Uncertainty Info Spider Filter Data Mining IE Segment Classify Associate Cluster Discover patterns - entity types - links / relations - events Database Documentcollection Actionableknowledge Emerging Patterns Prediction Outlier detection Decision support
Discriminatively-trained undirected graphical models Conditional Random Fields [Lafferty, McCallum, Pereira] Conditional PRMs [Koller…], [Jensen…], [Geetor…], [Domingos…] Complex Inference and Learning Just what we researchers like to sink our teeth into! Unified Model Spider Filter Data Mining IE Segment Classify Associate Cluster Discover patterns - entity types - links / relations - events Probabilistic Model Documentcollection Actionableknowledge Prediction Outlier detection Decision support
Information Extraction Markov dependencies ...and long-range & KB dependencies?
IE from Research Papers [McCallum et al ‘99] @article{ kaelbling96reinforcement, author = "Leslie Pack Kaelbling and Michael L. Littman and Andrew P. Moore", title = "Reinforcement Learning: A Survey", journal = "Journal of Artificial Intelligence Research", volume = "4", pages = "237-285", year = "1996",
where Wide-spread interest, positive experimental results in many applications. Noun phrase, Named entity [HLT’03], [CoNLL’03]Protein structure prediction [ICML’04] IE from Bioinformatics text [Bioinformatics ‘04],… Asian word segmentation [COLING’04], [ACL’04]IE from Research papers [HTL’04] Object classification in images [CVPR ‘04] (Linear Chain) Conditional Random Fields [Lafferty, McCallum, Pereira 2001] Undirected graphical model, trained to maximize conditional probability of output sequence given input sequence Finite state model Graphical model OTHERPERSONOTHERORGTITLE … output seq y y y y y t+2 t+3 t - 1 t t+1 FSM states . . . observations x x x x x t t +2 +3 t - t +1 t 1 input seq said Jones a Microsoft VP …
Entity Resolution Joint inference among all pairwise coref ...models of entities, attributes, first-order...
Joint Co-reference Decisions,Discriminative Model [Culotta & McCallum 2005] People Stuart Russell Y/N Stuart Russell Y/N Y/N S. Russel
Co-reference for Multiple Entity Types [Culotta & McCallum 2005] People Organizations Stuart Russell University of California at Berkeley Y/N Y/N Stuart Russell Y/N Berkeley Y/N Y/N Y/N S. Russel Berkeley
Joint Co-reference of Multiple Entity Types [Culotta & McCallum 2005] People Organizations Stuart Russell University of California at Berkeley Y/N Y/N Stuart Russell Y/N Berkeley Y/N Y/N Y/N Reduces error by 22% S. Russel Berkeley
Structured Topic Models Discovering latent structurein jointly modeling words, time, relations...
Topical N-gram Model [Wang, McCallum 2005] z1 z2 z3 z4 . . . y1 y2 y3 y4 . . . w1 w2 w3 w4 . . . D 2 1 1 2 W W T T
Finding Topics with TNG Traditional unigram LDArun on 1.6 milliontitles / abstracts (200 topics) ...select ~300k papers onML, NLP, robotics, vision... Find 200 TNG topics among those papers.
Topical Transfer Citation counts from one topic to another. Map “producers and consumers”
Topic Distributions Conditioned on Time topic mass (in vertical height) time
Topical Transfer Through Time • Can we predict which research topicswill be “hot” at ICML next year? • ...based on • the hot topics in “neighboring” venues last year • learned “neighborhood” distances for venue pairs
How do Ideas Progress Through Social Networks? Hypothetical Example: “ADA Boost” SIGIR(Info. Retrieval) COLT ICML ICCV(Vision) ACL(NLP)
How do Ideas Progress Through Social Networks? Hypothetical Example: “ADA Boost” SIGIR(Info. Retrieval) COLT ICML ICCV(Vision) ACL(NLP)
How do Ideas Progress Through Social Networks? Hypothetical Example: “ADA Boost” SIGIR(Info. Retrieval) COLT ICML ICCV(Vision) ACL(NLP)
Preliminary Results Mean Squared Prediction Error (Smaller Is better) TransferModel # Venues used for prediction Transfer Model with Ridge Regression is a good Predictor
Other Relational Opportunities • Categorizing citations. • Map transfer of ideas through science. • Rank CS departments by various criteria. • What 10 papers tell the story of ASR research? • Predicting when a student will graduate. • Help me find the right postdoc. • Suggest best collaborative opportunities. • Who should chair the next ICML?