1 / 7

Semantic Similarity Search

Semantic Similarity Search. IDB Lab. Kisung Kim Cheolhan Kim. OASIS Environment. GOA Team Investigate relationship between proteins from the point of view of GO annotation. RDF storage, RDBMS. GO Annotation DB (UniProt). PubMed. Blast DB. GO annotation. Biomedical Literature.

Download Presentation

Semantic Similarity Search

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Semantic Similarity Search IDB Lab. Kisung Kim Cheolhan Kim

  2. OASIS Environment GOA Team Investigate relationship between proteins from the point of view of GO annotation RDF storage, RDBMS GO Annotation DB(UniProt) PubMed Blast DB GO annotation Biomedical Literature Sequence matching SubcellularLocalization DB PPI DB KEGG pathway Molecular function Cellular component Biological process

  3. Introduction • Finding similar gene products is crucial for bioinformatics • Recently semantic similarity between gene products is focused on • Semantic similarity • Assessment of semantic relatedness between two objects • GO Annotation • Most bio-DBs provide the information of proteins annotated by GO • GO annotation provides the semantic information of gene products • Semantic similarity over GO • Measure similarity between gene products using the information encoded in the GO

  4. GORank System What gene products do the function similar with PI4KB_HUMAN? Similarity between GPs is calculated based on similarity between annotation terms Gene Ontology Similarity of Ontology terms PI4KB_HUMAN Gene products DB Annotation Ranked top-k results Similar gene products Similarity of gene products

  5. GORank System • Query input • Configuration • Method for calculating shared IC • Most informative common ancestor • Disjunctive common ancestor (GraSM) • Method for calculating term similarity • Lin • JiangConrath • Ontology • Molecular function • Biological process • Cellular component • Result size : k • Symbol of the query gene product • Names of terms with annotation weight

  6. GORank : Ranked similarity search for proteins over Gene ontology Configuration Method for calculating shared IC Most informative common ancestor Method for calculating term similarity Lin Ontology Molecular function Result size Symbol of the query gene product Search Names of terms with annotation weight Term name Weight (0~1) Search

  7. GORank : Ranked similarity search for proteins over Gene ontology Query Ontology : Molecular Function GeneProductSymbol : BACH_HUMAN Annotation Terms : acyl-CoA binding(TAS) serine esterase activity(IEA) palmitonyl-CoA hydrolase activity(IEA) hydrolase activity(IEA) Results TRI34_HUMAN (Similarity : 1.0) Splice Isoform 2 of Tripartite motif protein 34 Annotation Terms : acyl-CoA binding(TAS), serine esterase activity(IEA), palmitonyl-CoA hydrolase activity(IEA), hydrolase activity(IEA) Type : protein Source : MGI TRI35_HUMAN (Similarity : 0.9) Splice Isoform 2 of Tripartite motif protein 35 Annotation Terms : acyl-CoA binding(TAS), serine esterase activity(IEA), hydrolase activity(IEA) Type : protein Source : MGI TRI36_HUMAN (Similarity : 0.8) Splice Isoform 2 of Tripartite motif protein 36 Annotation Terms : acyl-CoA binding(TAS), palmitonyl-CoA hydrolase activity(IEA), hydrolase activity(IEA) Type : protein Source : MGI

More Related