NCBI Molecular Biology Resources: Using Entrez (March 2007)
WWW Access: Entrez & BLAST
Entrez: Database Integration
[Diagram: Entrez databases and the links between them. Databases: PubMed abstracts, 3-D Structure, Taxonomy, Genomes, Protein sequences, Nucleotide sequences. Link labels: Word weight, VAST, Phylogeny, BLAST, BLink, Domains, Hard Link. Neighbor labels: Related Structures, Related Sequences.]
Database Searching with Entrez
• Using limits and field restriction to find human MutL homolog
• Linking and neighboring with MutL
• Mapping SNPs onto structure and the genome
Global NCBI (Entrez) Search: Human hereditary nonpolyposis colon cancer
Nucleotide Sequences
• Nucleotide database now three parts
• EST: expressed sequence tags
• GSS: genome survey sequences
• CoreNucleotide: everything else
More Precise Nucleotide Search
nonpolyposis[All Fields] AND colon cancer[Title] AND human[Organism] AND biomol_mrna[Properties] AND srcdb_refseq[Properties]
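The field-restricted query above can also be assembled programmatically. The following is a minimal sketch, not part of the original slides: it joins (text, field) pairs with AND and builds a URL for the NCBI E-utilities `esearch.fcgi` endpoint. The helper names `build_query` and `esearch_url`, the choice of `db="nucleotide"`, and the `retmax` value are assumptions made for illustration. The code only constructs the URL and does not contact NCBI.

```python
from urllib.parse import urlencode

# Base URL of the E-utilities ESearch service.
ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_query(terms):
    """Join (text, field) pairs into one Entrez query, combined with AND."""
    return " AND ".join(f"{text}[{field}]" for text, field in terms)


def esearch_url(db, query, retmax=20):
    """Build an ESearch URL (hypothetical helper; no request is sent)."""
    params = {"db": db, "term": query, "retmax": retmax}
    return f"{ESEARCH}?{urlencode(params)}"


# The terms from the slide, in the same order.
terms = [
    ("nonpolyposis", "All Fields"),
    ("colon cancer", "Title"),
    ("human", "Organism"),
    ("biomol_mrna", "Properties"),
    ("srcdb_refseq", "Properties"),
]

query = build_query(terms)
url = esearch_url("nucleotide", query)
print(query)
print(url)
```

Keeping the terms as pairs makes it easy to add or drop a single restriction (for example, removing the RefSeq filter) without editing the query string by hand.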
Useful Field Restrictions
• [Title]: Definition line in GenBank / GenPept format, shown in Summary format
  • glyceraldehyde 3 phosphate dehydrogenase[Title]
• [Organism]: NCBI’s taxonomy. Organizing system for molecular databases
  • mouse[organism]; green plants[organism]; Streptomyces coelicolor[organism]
• [Properties]: molecule type, location, database source
  • biomol_mrna[properties]; biomol_genomic[properties]
  • gene_in_mitochondrion[properties]; srcdb_pdb[properties]
• [Filter]: subsets of data, Entrez links
  • all[filter]; nucleotide_mapview[filter]; nucleotide_omim[filter]
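To make the `term[Field]` convention concrete, here is a small sketch that splits a simple AND-joined query into its text and field parts. It is an illustration only: the function name `parse_terms` is hypothetical, and the parser assumes a flat query joined with AND. It does not handle OR, NOT, parentheses, or ranges, which the real Entrez syntax supports.

```python
import re

# One term: free text followed by a field name in square brackets.
TERM = re.compile(r"\s*(?P<text>[^\[\]]+?)\s*\[(?P<field>[^\[\]]+)\]")


def parse_terms(query):
    """Split an AND-joined query into (text, field) pairs.

    Field names are lower-cased because the slides write them in either
    case, e.g. [Organism] and [organism].
    """
    pairs = []
    for part in re.split(r"\s+AND\s+", query.strip()):
        match = TERM.fullmatch(part)
        if match is None:
            raise ValueError(f"not a field-restricted term: {part!r}")
        pairs.append((match.group("text"), match.group("field").lower()))
    return pairs


example = "green plants[organism] AND biomol_genomic[properties] AND all[filter]"
for text, field in parse_terms(example):
    print(f"{field:>10}: {text}")
```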
GenBank Records Human MutL RefSeq
Literature Links OMIM
Conserved Domain OMIM: Human Disease Genes
Sequence Links Finding Homologs and Structures
Protein Link BLAST Link Conserved Domains
BLink: BLAST Link Redundant GIs top 200 only
BLink: non-redundant relatives zebrafish homolog BLAST
E. coli MutL Structure: Cn3D viewer, Structure Neighbors, 3D Domain Neighbors, PubChem compound, Conserved Domains
MLH1 Domain Structure: CDD ATPase Domain Mismatch Repair Domain
ATPase domain GeneView: Variations Human MLH1
Mapping Variation Onto Structure
[Figure labels: Asn; Ile; Ile – Val; Conserved Asn]
The Map Viewer Genome BLAST
Map Viewer: Human MLH1
[Figure labels: Customizable; Transcripts; EST Hits; Download data and sequences; Models; NCBI Assembly; Gene Annotations]
Synteny: Mammalian Genomes Albumin Gene Family
Orthologs and paralogs
[Diagram: an early globin gene undergoes gene duplication, giving an A-chain gene and a B-chain gene. Frog A, chick A, and mouse A are labeled orthologs, as are frog B, chick B, and mouse B; the A and B genes are labeled paralogs.]
HomoloGene
• Completely Annotated Eukaryotic Genomes
• Homologous UniGene determined for other organisms
• Protein similarities first
• Guided by taxonomic tree
• Includes orthologs and paralogs
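The ortholog/paralog distinction in the globin diagram can be written down as a small rule. The sketch below is a deliberate simplification for illustration, not HomoloGene's actual procedure: each gene is modeled as a hypothetical (species, chain) pair, genes on the same chain in different species are called orthologs, and genes on different chains (descended from the duplication) are called paralogs.

```python
def relationship(gene_a, gene_b):
    """Classify two globin genes given as (species, chain) pairs.

    Simplified rule taken from the diagram: the A and B chains arose by
    gene duplication, so any A/B pair is paralogous; the same chain in two
    different species is orthologous.
    """
    species_a, chain_a = gene_a
    species_b, chain_b = gene_b
    if chain_a != chain_b:
        return "paralogs"
    if species_a != species_b:
        return "orthologs"
    return "same gene"


genes = [("frog", "A"), ("chick", "A"), ("mouse", "A"),
         ("mouse", "B"), ("chick", "B"), ("frog", "B")]

# Compare mouse A against every gene in the diagram.
for gene in genes:
    print("mouse A vs", gene[0], gene[1], "->", relationship(("mouse", "A"), gene))
```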
The Gene Database
• Gene Centered Information
• Unifies LocusLink and microbial Genomes
• 2.4 million records for 3,822 taxa