TrypDB Analysis Workflow: Common Analysis; T Cruzi Analysis; T Brucei Analysis; L Braziliensis Analysis; L Infantum Analysis; L Major Analysis; Mercator
Common Analysis: Init Workflow Home Dir on Cluster; Init apiSiteFiles WebServices Dirs; Make Data Dir; Init User/Group/Project; Insert BlatAlignmentQuality Table with Xml; Copy PDB from Downloads; Copy NRDB from Downloads; Make Mercator Data Dir; Make NRDB Short Defline; Mirror Common Data Dir to Cluster
Organism Analysis Workflow: Make Data Dir; Mirror Data Dir to Cluster; Init apiSiteFilesDownloadSite Organism Dir; Genome Analysis; Proteome Analysis; Run Tuning Manager; Run Full Record Dump; Make Gff File; Make and Format Download Files
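The organism workflow reads as a template that is applied once per organism named in the top-level workflow. A minimal sketch of that expansion follows. The step and organism names are taken from the slides; the data layout and the function name `expand_workflow` are hypothetical and are not the workflow engine's actual interface.

```python
# Hypothetical sketch: names come from the slides, everything else is assumed.
ORGANISMS = ["T Cruzi", "T Brucei", "L Braziliensis", "L Infantum", "L Major"]

ORGANISM_STEPS = [
    "Make Data Dir",
    "Mirror Data Dir to Cluster",
    "Init apiSiteFilesDownloadSite Organism Dir",
    "Genome Analysis",
    "Proteome Analysis",
    "Run Tuning Manager",
    "Run Full Record Dump",
    "Make Gff File",
    "Make and Format Download Files",
]


def expand_workflow(organisms, steps):
    """Return one (organism, step) pair per step of each organism's analysis."""
    return [(organism, step) for organism in organisms for step in steps]


plan = expand_workflow(ORGANISMS, ORGANISM_STEPS)
print(len(plan))  # 5 organisms x 9 steps = 45
```

Keeping the step list in one place means a change to the template applies to every organism at once.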
Genome Analysis: Make Data Dir; Dump and Block Mixed Genome Seqs; Extract Genome Seqs; Make and Block Candidate Assem Seqs; Copy Genomic Seqs to Cluster; Find Tandem Repeats; Make ORFs; Filter Sequences; tRNA Scan; Map Candidate Assem Seqs to Genome; Load ORFs; BLASTX NRDB; Load Low Complexity Seqs; Load Tandem Repeats; Make and Block DoTS Assemblies; Map DoTS Assemblies to Genome
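Several genome steps can only run after another step has produced their input, so the diagram is a dependency graph and not a flat list. The sketch below orders a few steps with Python's standard `graphlib`. The dependencies shown are assumptions inferred from the step names alone (a "Load" step after the matching "Make" or "Find" step); the slide does not state them.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Assumed dependencies (step -> steps it must wait for), inferred from the
# step names only; the slide itself does not list them.
depends_on = {
    "Load Tandem Repeats": {"Find Tandem Repeats"},
    "Load ORFs": {"Make ORFs"},
    "Map Candidate Assem Seqs to Genome": {"Make and Block Candidate Assem Seqs"},
}

# static_order() yields every step after all of the steps it depends on.
order = list(TopologicalSorter(depends_on).static_order())
for step in order:
    print(step)
```

Steps with no dependency between them may come out in any relative order, which matches the idea that independent branches of the diagram can run in parallel.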
Proteome Analysis: Calculate Protein Seq; Make Data Dir; Update TaxonId for PDB ExternalAASequence; Calculate AASeq Attributes; Extract Protein Seqs; Find Seq Identity to NRDB; Filter Seqs; Run TMHMM; Run SignalP; Epitopes; Load NRDB xrefs; Load Low Complexity Seqs; Copy Protein Seqs to Cluster; Load TMHMM; Load SignalP; BLASTP NRDB; Psipred; InterproScan; BLASTP PDB
BLAST: Make Data Dir; Start BLAST; Wait for Cluster; Copy Files from Cluster; Filter by Subject; Extract IDs from BLAST Result; Optional Steps (runtime test); Load Subject Subset; Update TaxonId for Nrdb ExternalAASequence; Load Result
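The "extract IDs from Blast result" and "Load Subject subset" steps suggest that only subjects with at least one hit are loaded. A minimal sketch of the ID extraction follows. It assumes tab-separated hits with the subject ID in the second column, as in NCBI BLAST+ tabular output; the slides do not say which output format the workflow uses, and `extract_subject_ids` and the sample hits are hypothetical.

```python
def extract_subject_ids(blast_lines):
    """Collect the distinct subject IDs from tab-separated BLAST hits."""
    subject_ids = set()
    for line in blast_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        subject_ids.add(fields[1])  # assumed layout: query ID, subject ID, ...
    return subject_ids


# Made-up hits, for illustration only.
sample = [
    "prot1\tnr|A1\t98.5\t200",
    "prot1\tnr|B2\t75.0\t180",
    "prot2\tnr|A1\t88.1\t150",
]
ids = extract_subject_ids(sample)
print(sorted(ids))
```

The resulting set could then be used to pick the matching records out of the subject database before loading.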
Psipred: Make Data Dir; Fix Protein IDs for Psipred; Run pfilt on NRDB; Create Psipred Task Dir; Copy Data Dir to Cluster; Start Psipred on Cluster; Wait for Cluster; Copy Psipred Files from Cluster; Make Alg Inv; Fix Psipred File Names; Load Psipred
Epitopes: Make Data Dir; Make Blast Dir; Format NCBI Blast File; Create Epitopes Map File; Load Epitopes Map
InterproScan: Make Data Dir; Make InterproScan Cluster Task Input Dir; Mirror InterproScan to Cluster; Start Cluster Task; Wait for Cluster Task; Mirror InterproScan From Cluster; Insert IprScan Results
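The same cycle recurs across the cluster-based subworkflows: mirror the input to the cluster, start the task, wait for it, and mirror the results back. A minimal sketch of that cycle follows, with every action passed in as a callable. The function `run_cluster_task` and its parameters are hypothetical; the real workflow's commands and polling interval are not given in the slides.

```python
import time


def run_cluster_task(mirror_to, start, is_done, mirror_from,
                     poll_seconds=60.0, sleep=time.sleep):
    """Run one mirror/start/wait/mirror cycle using caller-supplied callables."""
    mirror_to()            # copy the task input dir to the cluster
    start()                # start the task on the cluster
    while not is_done():   # poll until the task reports completion
        sleep(poll_seconds)
    mirror_from()          # copy the results back from the cluster


# Usage with stand-in callables that only record what was called.
log = []
polls = iter([False, False, True])
run_cluster_task(
    mirror_to=lambda: log.append("mirror to cluster"),
    start=lambda: log.append("start cluster task"),
    is_done=lambda: next(polls),
    mirror_from=lambda: log.append("mirror from cluster"),
    sleep=lambda seconds: log.append("wait"),
)
print(log)
```

Passing the actions in as callables lets one function serve InterproScan, repeat masking, and BLAT alike, and lets the wait loop be exercised without a real cluster.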
Make and Block Candidate Assembly Seqs: Make Candidate Assembly Seqs; Make Data Dir; Extract Candidate Assembly Seqs; Make Cluster Task Input Dir; Mirror To Cluster; Start Cluster Task; Wait for Cluster Task; Mirror From Cluster
Map Candidate Assembly Seqs to Genome: Make Data Dir; Extract Genomic Seqs into Separate Fasta Files; Make Gf Client Cluster Task Input Dir; Mirror Gf Client to Cluster; Run Nib On Cluster; Start GF Cluster Task; Wait for GF Cluster Task; Mirror Gf Client From Cluster; Insert BLAT Alignment; Set Best BLAT Alignment
Make and Block Assemblies: Make Data Dir; Make Repeat Mask Cluster Task Input Dir; Cluster Transcripts by Genome Alignment; Put Unaligned Transcripts into One Cluster; Assemble Transcripts; Extract Assemblies; Mirror Assembly Repeat Mask To Cluster; Start RM Task on Cluster; Wait for RM Cluster Task
Map Assemblies to Genome: Make Data Dir; Make Assembly Gf Client Cluster Task Input Dir; Mirror Assembly Gf Client to Cluster; Start GF Task on Cluster; Wait for GF Cluster Task; Mirror Gf Client From Cluster; Insert BLAT Alignment; Set Best BLAT Alignment; Update Assembly Source Id
Dump and Block Mixed Genome Seqs: Make Data Dir; Dump Mixed Genomic Sequences; Make Repeat Mask Cluster Task Input Dir; Mirror Repeat Mask To Cluster; Start Cluster Task; Wait for Cluster Task; Mirror Virtual Sequence Repeat Mask From Cluster; Move Blocked Seq File to Mercator Data Dir
Mercator: Make Mercator Gff File; Correct Reading Frame in Mercator Gff File; Run MercatorMavid; Create External Database and Release for Synteny from Mercator; Insert Mercator Synteny Spans