1 / 7

The BioText Project: Recent Work

The BioText Project: Recent Work. Marti Hearst SIMS, UC Berkeley http://biotext.berkeley.edu Supported by NSF DBI-0317510 and a gift from Genentech. Project Team. Project Leaders: PI: Marti Hearst Co-PI: Adam Arkin Computational Linguistics Preslav Nakov Emilia Stoica Sarah Poon

calla
Download Presentation

The BioText Project: Recent Work

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The BioText Project:Recent Work Marti Hearst SIMS, UC Berkeley http://biotext.berkeley.edu Supported by NSF DBI-0317510 and a gift from Genentech

  2. Project Team • Project Leaders: • PI: Marti Hearst • Co-PI: Adam Arkin • Computational Linguistics • Preslav Nakov • Emilia Stoica • Sarah Poon • IR/Databases/Software • Ariel Schwartz • Itai Brickner • Brian Wolf • Bioscience • Janice Hamer • Alumni • Dr. Barbara Rosario • Dr. TingTing Zhang • Gaurav Bhalotia

  3. BioText Project Goals • Provide flexible, intelligent access to information for use in biosciences applications. • Focus on • Textual Information from Journal Articles • Tightly integrated with other resources • Ontologies • Record-based databases

  4. BioText Architecture Sophisticated Text Analysis Annotations in Database Improved Search Interface

  5. Today’s Talks • Intro (Marti) • Design and Implementation of the Layered Query Language (Ariel & Brian) • Adding Fulltext to LQL (Itai) • Determining Gene Function from Text (Emilia) • Using the Web as an Implicit Training Corpus (Presley) • Identifing Protein-Protein Interactions (Marti, covering Barbara’s work) • Citances (Marti) • Discussion: what should our user interface do?

  6. Recent Papers • Predicting Gene Functions from Text Using a Cross-Species Approach, Emilia Stoica and Marti Hearst, to appear in PSB 2006. • Multi-way Relation Classification: Application to Protein-Protein Interaction, Barbara Rosario and Marti Hearst, in HLT/EMNLP 2005.   • Using the Web as an Implicit Training Set: Application to Structural Ambiguity Resolution, Preslav Nakov and Marti Hearst, in HLT/EMNLP 2005.

  7. Recent Papers • Scaling Up BioNLP: Application of a Text Annotation Architecture to Noun Compound Bracketing, Preslav Nakov, Ariel Schwartz, Brian Wolf, and Marti Hearst, in ACL/ISMB SIGLINK 2005.   • Search Engine Statistics Beyond the n-gram: Application to Noun Compound Bracketing , Preslav Nakov and Marti Hearst, in CoNNL 2005. • Citances: Citation Sentences for Semantic Analysis of Bioscience Text, Preslav Nakov, Ariel Schwartz, and Marti Hearst, in the SIGIR'04 workshop on Search and Discovery in Bioinformatics.  

More Related