Announcements

2012-02-01:   PDA authentication pages were merged with myNCBI pages

Please log in via myNCBI using your PDA credentials.

2012-01-05:   Updated script for downloading reference sequences supporting cSRA format

Please download the updated configuration-assistant.perl script for use with the SRA Toolkit. Because of firewall issues, the URI of the reference sequences has changed, which requires a corresponding change in the configuration-assistant.perl script. This script helps users download the reference sequence files needed to decompress NCBI cSRA format files created from BAM files. Files in other formats that do not include alignment data are not affected by this change.

2011-12-20:   NCBI Sequencing Archive of TCGA Data Has Been Further Extended

TCGA has arranged with NCBI to temporarily extend maintenance of the TCGA data in the Sequence Read Archive (SRA). Users may continue to download TCGA data from NCBI until February 29, 2012. All requests for data retrieval must be submitted by February 28, 2012, and all data downloads must be completed by February 29, 2012. After February 29, NCBI will no longer host sequencing data from TCGA in the SRA. Users can sign up for updates for future TCGA data releases and other TCGA-related information.

For immediate issues, email TCGA@mail.nih.gov

Status of the NCBI Sequence Read Archive (SRA)

Subsequent to an announcement in February 2011 that NCBI was planning to phase out the SRA due to funding constraints, NIH support has been provided that will enable the continuation of SRA. NCBI will continue to operate the SRA as NIH’s primary archive of high-throughput sequencing data and as part of the international partnership of archives at the NCBI, the European Bioinformatics Institute and the DNA Database of Japan. Data submitted to any of the three organizations are shared among them.

The SRA is managing high-throughput sequencing data from many large studies funded by NIH Institutes. The SRA will also continue to archive high-throughput sequencing data that are associated with:

  1. RNA-Seq, ChIP-Seq, and epigenomic data that are submitted to GEO
  2. Genomic and transcriptomic assemblies that are submitted to GenBank
  3. 16S ribosomal RNA data associated with metagenomics that are submitted to GenBank

It is NCBI's policy to make its publicly available data, including data in the SRA, available to others for redistribution so that they can provide value-added services, such as tool sets for analyzing data and alternate interfaces. NCBI will continue to work on new approaches for optimum storage and retrieval of raw sequencing data and their alignments.

Overview

The Sequence Read Archive (SRA) stores raw sequencing data from the "next" generation of sequencing platforms including Roche 454 GS System®, Illumina Genome Analyzer®, Applied Biosystems SOLiD® System, Helicos Heliscope®, Complete Genomics®, and others.