Developed by James Estill, Dept. of Plant Biology, University of Georgia
[Diagram: Pipeline to Annotate Wheat Sequences. Labels: TriAnnot (France); IOB Cluster (UGA); Perl; GAME XML]
>HEX0014K09 GCAATACT CGGCACTT
• BLAST -m 8 -d MIPS
• BLAST -m 8 -d RB_pln
• BLAST -m 8 -d TIGRGram
• BLAST -m 8 -d TREP9nr
Annotation Pipeline
• Gene Annotation
  • Homology: BLAST, BLAT, SIM4
  • De Novo: GENSCAN, GENID, FGENESH
• TE Annotation
  • Homology: HMMER, RepeatMasker, TE Nest, BLAST
  • De Novo: Findmite, LTR_Struc, LTR_Seq, Find_LTR, LTR_Finder
Individual Program Procedure: Configuration File + Directory of FASTA Files → Run Program → Raw Results → GFF Formatted
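The procedure above can be sketched as a job planner: every FASTA file is paired with every parameter set from the configuration file, and each pairing gets a raw-output path and a GFF path. This is only an illustration. The config layout (set name, a tab, then the program options), the output naming scheme, and the names `parse_config`, `plan_jobs`, and `some_program` are assumptions, not the DAWG-PAWS conventions.

```python
import shlex
from pathlib import Path


def parse_config(text):
    """Parse hypothetical config lines of the form: <set name><TAB><options>."""
    param_sets = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        name, _, options = line.partition("\t")
        param_sets.append((name, shlex.split(options)))
    return param_sets


def plan_jobs(fasta_paths, program, param_sets, out_dir):
    """Pair every FASTA file with every parameter set.

    Each job records the command to run, where the raw result would be
    written, and where the GFF-formatted result would be written.
    """
    jobs = []
    for fasta in fasta_paths:
        stem = Path(fasta).stem
        for name, options in param_sets:
            jobs.append({
                "command": [program, *options, str(fasta)],
                "raw": str(Path(out_dir) / f"{stem}_{name}.out"),
                "gff": str(Path(out_dir) / f"{stem}_{name}.gff"),
            })
    return jobs
```

The FASTA list would typically come from something like `sorted(Path(fasta_dir).glob("*.fasta"))`, and each planned command could then be handed to `subprocess.run`. Planning the jobs separately from running them makes it easy to inspect what will be executed first.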
!! THIS DOCUMENT IS UNDER CURRENT DEVELOPMENT !! This program manual and the scripts that make up the DAWG-PAWS package are under active development, and everything is subject to change without notice. This software comes as is, without any express or implied warranty. Use at your own risk.
File requirements:
• Each FASTA file contains a single record
• BAC scaffolds need to be merged into a single sequence
• Short header
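A quick pre-flight check for the first and third requirements might look like the sketch below. The slide does not say how short a header must be, so the 20-character limit is an assumption, and `check_fasta` is a hypothetical helper name.

```python
def check_fasta(text, max_header_len=20):
    """Return a list of problems; an empty list means the file passes.

    max_header_len is an assumed limit, since no exact length is specified.
    """
    problems = []
    # Every line starting with ">" opens a new FASTA record.
    headers = [line[1:].strip() for line in text.splitlines()
               if line.startswith(">")]
    if len(headers) != 1:
        problems.append(f"expected 1 record, found {len(headers)}")
    for header in headers:
        if len(header) > max_header_len:
            problems.append(
                f"header longer than {max_header_len} characters: {header}")
    return problems
```

Running this over each file before starting the pipeline, for example with `check_fasta(open(path).read())`, catches multi-record files early instead of after a long run.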
Repeat masking with RepeatMasker and TREP:
• Softmask (using RepeatMasker)
• Convert the softmask to a hardmask, because many gene prediction programs are not softmask-aware
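The softmask-to-hardmask conversion is simple enough to sketch directly: soft masking marks repeats in lowercase, and hard masking replaces those bases with N. The function name `hardmask` is hypothetical, and this is a minimal illustration rather than the package's own converter.

```python
def hardmask(fasta_text, mask_char="N"):
    """Replace lowercase (softmasked) bases with mask_char; keep headers."""
    out = []
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            out.append(line)  # header lines are copied unchanged
        else:
            out.append("".join(mask_char if base.islower() else base
                               for base in line))
    return "\n".join(out) + "\n"
```

Keeping both versions of the sequence is useful: the hardmasked file goes to the gene predictors, while the softmasked file still shows what was masked.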
Structural feature annotation: currently includes only the annotation of gaps
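Gap annotation can be illustrated by scanning a sequence for runs of N and reporting each run as a nine-column GFF line. This is a sketch under assumptions: the `gap_finder` source label, the `gap` feature name, and the `min_len` threshold are placeholders, not values taken from DAWG-PAWS.

```python
import re


def gaps_to_gff(seq_id, sequence, min_len=1, source="gap_finder"):
    """Report runs of N (either case) as GFF lines with 1-based coordinates."""
    lines = []
    for match in re.finditer(r"[Nn]+", sequence):
        if match.end() - match.start() < min_len:
            continue  # ignore runs shorter than the threshold
        start = match.start() + 1  # GFF coordinates are 1-based, inclusive
        end = match.end()
        lines.append("\t".join(
            [seq_id, source, "gap", str(start), str(end), ".", ".", ".", "."]))
    return lines
```

This should be run on the unmasked sequence. On a hardmasked sequence, masked repeats would be indistinguishable from real gaps.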
Gene annotation:
• Conduct gene prediction using the TriAnnot pipeline
• Run individual gene prediction programs
• GeneMark.hmm: can be run locally (free license required)
• GENSCAN: run on the web server and convert the output to a .gff file
• FGENESH: run on the web server and convert the output to a .gff file
Transposable element annotation:
• By homology: RepeatMasker, NCBI BLAST
• By structural criteria: LTR_Finder
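For the homology route, tabular BLAST output (the `-m 8` format, with twelve tab-separated columns per hit) has to become GFF before it can be viewed alongside the other results. The sketch below converts one hit line. The GFF source and feature labels, the use of the bit score as the GFF score, and the subject ID `TREP_123` in the usage note are illustrative assumptions.

```python
def blast_m8_to_gff(line, source="blastn", feature="match"):
    """Convert one BLAST -m 8 hit line to a GFF line on the query sequence.

    -m 8 columns: query, subject, % identity, alignment length, mismatches,
    gap openings, q.start, q.end, s.start, s.end, e-value, bit score.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 12:
        raise ValueError(f"expected 12 columns, found {len(fields)}")
    query, subject = fields[0], fields[1]
    q_start, q_end = int(fields[6]), int(fields[7])
    s_start, s_end = int(fields[8]), int(fields[9])
    # The hit is on the minus strand when query and subject run in
    # opposite directions.
    same_direction = (q_start <= q_end) == (s_start <= s_end)
    strand = "+" if same_direction else "-"
    start, end = min(q_start, q_end), max(q_start, q_end)
    bit_score = fields[11].strip()
    return "\t".join([query, source, feature, str(start), str(end),
                      bit_score, strand, ".", subject])
```

For example, a hit of `HEX0014K09` against a hypothetical subject `TREP_123` spanning query positions 1001 to 1200 and subject positions 350 down to 151 would be reported on the minus strand.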
De Novo LTR Annotation Software [Figure: programs compared on computation and annotation; rating scale: Best, Good, Neutral, Bad, Crap]
Preparing the computational results for Apollo:
• Audit the computational results
• Concatenate the .gff files
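The two steps can be combined in one pass, as in the sketch below. The slide does not define what the audit covers, so this version assumes a basic format check (nine columns, integer coordinates, start not after end). The names `audit_line` and `concatenate_gff` are hypothetical.

```python
def audit_line(line):
    """Return a problem description for a GFF line, or None if it looks fine."""
    fields = line.split("\t")
    if len(fields) != 9:
        return f"expected 9 columns, found {len(fields)}"
    if not (fields[3].isdigit() and fields[4].isdigit()):
        return "start and end must be positive integers"
    if int(fields[3]) > int(fields[4]):
        return "start is greater than end"
    return None


def concatenate_gff(gff_texts):
    """Merge several GFF texts into one, keeping only lines that pass the audit.

    Returns (merged_text, problems); comment and blank lines are dropped.
    """
    merged, problems = [], []
    for text in gff_texts:
        for line in text.splitlines():
            if not line.strip() or line.startswith("#"):
                continue
            problem = audit_line(line)
            if problem is None:
                merged.append(line)
            else:
                problems.append(f"{problem}: {line}")
    return "".join(line + "\n" for line in merged), problems
```

Reviewing the returned `problems` list before loading the merged file makes it easier to trace a bad line back to the program that produced it.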