1 / 22

Introduction to ENTREZ Search Capability in NCBI Database

Learn how to use ENTREZ to search for specific sequences and refine your searches using advanced options. Download specific sets of data.

caguirre
Download Presentation

Introduction to ENTREZ Search Capability in NCBI Database

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. This module Introduces the ENTREZ search capability of the NCBI database. After following this module, you should be able to: Describe the different databases within NCBI Use ENTREZ to search for specific sequences Use ENTREZ advanced search options to refine and improve database searches Use ENTREZ to download specific sets of data

  2. http://www.ncbi.nlm.nih.gov/sites/gquery

  3. Entrez uses the following Boolean Operators: AND: Entrez will find all documents that contain BOTH terms OR: Entrez will find all documents that contain EITHER term. NOT: Entrez will find all documents that contain search term 1 BUT NOT search term 2. The Entrez search rules and syntax for using Boolean operators are: 1. Boolean operators AND, OR, NOT must be entered in UPPERCASE (e.g., promoters OR response elements). 2. Entrez processes all Boolean operators in a left-to-right sequence. The order in which Entrez processes a search statement can be changed by enclosing individual concepts in parentheses. The terms inside the parentheses are processed first as a unit and then incorporated into the overall strategy. For example, the search statement: g1p3 AND (response element OR promoter) is processed by Entrez by ORing the terms response element OR promoter first and then ANDing the resulting set of documents with g1p3. 3. The Details button shows how Entrez translated and executed your search strategy. 4. See Writing Advanced Search Statements for more information on using Boolean Operators and Entrez Search Field Qualifiers. The use of parentheses can change your search results significantly. Compare the number of records retrieved in the Nucleotide database as of February 2006 in each case below. Example g1p3 AND (response element OR promoter) retrieves three records g1p3 AND response element OR promoter retrieves 354,554

  4. Writing Advanced Search Statements Complex search statements can be written and executed directly from the the query box of any of the databases, as long as you obey some simple rules and use the correct syntax. Perform a search by specifying the search terms, their fields, and the Boolean operations to perform on the term. Use the following syntax: term [field] OPERATOR term [field] Where term(s) are the search terms, the field(s) are the Search Field qualifiers from Table 1, Table 2, and Table 3 , and the OPERATOR(s) are the Boolean Operators. Remember that Boolean operators are normally processed from left to right. If you want part of your Boolean expression to be processed out of order, enclose it in parentheses. EXAMPLE: Find all human nucleotide sequences with D-loop annotations. In the Nucleotide database, use the following expression: D-loop[FKEY] AND human[ORGN] EXAMPLE: Find all human protein sequences with lengths between 50 and 60 amino acids that were entered into the database during 1999. In the Protein database, use the following expression: human[ORGN] AND 50[SLEN]:60[SLEN] AND 1999[MDAT] EXAMPLE: Find Drosophila population studies published in the Journal of Molecular Evolution. In the PopSet database, use the following expression: j mol evol[JOUR] AND drosohila [ORGN]

  5. Lycopersicum [ORGN] AND Rio Grande 21973 Lycopersicum [ORGN] AND Rio Fuego 171 Lycopersicum [ORGN] AND MicroTom 120462 Lycopersicum [ORGN] AND TA496 116711 Lycopersicum [ORGN] AND Moneymaker 833 Phytophthora [ORGN] AND Judelson [AUTH] AND Tomato 3921 (comment ‘may be of host plant origin’)

  6. 1: CV969339 PI010H11 infected tomato, outside of lesion 3 dpi Phytophthora infestans cDNA, mRNA sequence gi|58159088|gb|CV969339.1| [58159088] 2: CV969338 PI010H8 infected tomato, outside of lesion 3 dpi Phytophthora infestans cDNA, mRNA sequence gi|58159087|gb|CV969338.1| [58159087] 3: CV969337 PI010H2 infected tomato, outside of lesion 3 dpi Phytophthora infestans cDNA, mRNA sequence gi|58159086|gb|CV969337.1| [58159086]

  7. http://www.ncbi.nlm.nih.gov/Tools/ http://www.ncbi.nlm.nih.gov/Ftp/

More Related