80 likes | 384 Views
2. Sequence Read Archive (SRA). NGS reads and alignments mostly in BAM Manages high-throughput data including sequence variation (VCF) Data owned by the submitter, expected to be made public and
E N D
2 Sequence Read Archive (SRA) NGS reads and alignments mostly in BAM Manages high-throughput data including sequence variation (VCF) Data owned by the submitter, expected to be made public and freely without any restrictions. Pre-publication confidentiality is supported. Keep Metadata(stored as XML format) separate from data (in different formats).
3 SRA data upload / download SRA-FileUploader / SRA-FileDownloader http://asperasoft.com/
5 SRA data formats • As of now the recommended format is BAM • Other accepted formats • General Fastq, • SRF • Illumina native fastq • SOLiD native csfasta/qual • 454 sff, • IonTorrent sff, • PacBio HDF5, • Complete Genomics Data package • CRAM (very soon) • Binary data formats should be compressed • MD5 of data files are required
Status transitions Private Cancel Draft Public http://www.ebi.ac.uk/ena/about/data_availability_policy
7 Documentation • Subscribe to ena-announce mailing list: • http://listserver.ebi.ac.uk/mailman/listinfo/ena-announce • General submission instructions: • http://www.ebi.ac.uk/ena/about/sra_submissions • SRA data formats: • http://www.ebi.ac.uk/ena/about/sra_format • SRA metadata preparation: • http://www.ebi.ac.uk/ena/about/sra_preparing_metadata • Uploading data: • http://www.ebi.ac.uk/ena/about/sra_data_upload • SRA REST webservice: • http://www.ebi.ac.uk/ena/about/sra_rest_submissions Please email us with questions atdatasubs@ebi.ac.uk