220 likes | 401 Views
An Overview of Literature Management Systems. Qiaozhu Mei April 12, 2007. Example Systems. Digital Libraries: ACM Digital Library CiteSeer JSTOR PubMed Domain-specific Search Engines: Google Scholar ; Live academic search ; Libra ; Rexa ; SCOPUS ; DBLP ; Bibliography Search
E N D
An Overview of Literature Management Systems Qiaozhu Mei April 12, 2007
Example Systems • Digital Libraries: • ACM Digital Library • CiteSeer • JSTOR • PubMed • Domain-specific Search Engines: • Google Scholar; Live academic search; • Libra; Rexa; SCOPUS; • DBLP; Bibliography Search • Integrated Systems: • DBLife, bibSonomy, BeeSpace?
Outline • Characteristics • What’s unique with scientific literature? • Functionalities • Possible utilities based on the characteristics • Prototypes • http://en.wikipedia.org/wiki/List_of_academic_databases_and_search_engines
Scientific Literature: What’s Unique? • Structured content: • Title, author, abstract, conclusion, reference, .. • A latent network structure: • Reference; citations; co-authorship • Contexts: • Time, conference/area, authorship, ... • Citation context • What about the content & language? • Topic dense? Terminology rich? Definitive language? • Controlled vocabulary? Low noise?
Managing Literature: What can We Do? • Structured content: • Full text search, structured search, citation search • Similarity search • Latent Network structure: • Citation navigation, co-author navigation • Co-citation Analysis, etc • Community analysis • Context: • Trend analysis, author comparison, interdisciplinary research identification, … • Language: • Topic extraction & categorization, filtering, concept analysis, ontology, summarization, …
Existing Systems: What do They Provide? EN: Entity object 1: Paper 2: Author & Paper CS: content search BC: browsing by context CN: citation navigation AN: author/co-author navigation FT: full text download TA: tagging & annotation CC: citation context TC: topic catergorization SS: Similarity Search Bib: BibTex
Example Systems • CiteSeer, Libra, Rexa, SCOPUS, DBLife • Who did it? • What’s their unique feature? • What’s not covered?
CiteSeer • Penn State • Similarity Search • Citation Context (partially) • Citation Trend • Statistics
CiteSeer: Similarity Search Sentence-level similarity Content similarity Co-citation similarity
Libra • Zaiqing Nie, Microsoft Research Asia • Community search (topic extraction?)
Rexa • UMass, Andrew McCallum • Search for grants, Tagging • Topic extraction & categorization
Rexa: Topic Extraction & Categorization Trends Topic Relation: Citing, cited, co-occurring
SCOPUS • Subject Area: not CS • Life/health/physics/social science • Alerts (Simple Filtering) • Search (content) alerts • Document Citation alerts • Citation trends
DBLife • Anhai Doan, U of Wisc. • Integrating heterogonous academic information • Change monitoring • Not much on literature • Many prototype functionalities
Discussion: What is not Covered? • Features: • Citation Context • Functionalities: • Filtering; Summarization • Opportunities: • Comparison across contexts • In-depth community analysis • More useful classification • …