Download Sequences By
Gene IDs | Genomic Sequence IDs | ORF IDs


If you would like to download data in bulk, please visit our file download section


Retrieve Sequences By Gene IDs

Enter a list of Gene IDs (each ID on a separate line):
Choose the type of sequence: genomic | protein | CDS | transcript
Choose the region of the sequence(s):
begin at nucleotides
end at nucleotides
Choose the region of the protein sequence(s):
begin at amino acids
end at amino acids
Download Type: Save to File | Show in Browser
Fasta defline: Only Gene ID | Full Fasta Header
Sequence format: Default (60 chars on a line) | Single line

Note:
For "genomic" sequence: If UTRs have not been annotated for a gene, then choosing "transcription start" may have the same effect as choosing "translation start".
For "protein" sequence: you can only retrieve sequence contained within the listed ID(s), i.e. from downstream of the first amino acid (the initial methionine, offset 0) to upstream of the last amino acid in the protein (offset 0).
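
The two "Sequence format" options above can be illustrated with a short sketch. This is not the site's own code; the function name format_fasta and its width parameter are hypothetical, and the only behavior taken from the form is the 60-character default and the single-line alternative.

```python
def format_fasta(header, sequence, width=60):
    """Return one FASTA record as a string.

    width=60 mimics the "Default (60 chars on a line)" option;
    width=None mimics the "Single line" option.
    (Hypothetical helper, not part of the download tool.)
    """
    if width is None:
        body = sequence
    else:
        # Break the sequence into consecutive chunks of `width` characters.
        body = "\n".join(
            sequence[i:i + width] for i in range(0, len(sequence), width)
        )
    return f">{header}\n{body}\n"


if __name__ == "__main__":
    # Made-up ID and sequence, for illustration only.
    print(format_fasta("GENE_ID_1", "ATGC" * 40))
    print(format_fasta("GENE_ID_1", "ATGC" * 40, width=None))
```

Passing only the gene ID as the header corresponds to the "Only Gene ID" defline choice; a longer header string would stand in for "Full Fasta Header".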




Retrieve Sequences By Genomic Sequence IDs

Enter a list of Genomic Sequence IDs (each ID on a separate line):
Default region (for sequences in the list without a specified region):
Reverse & Complement
Nucleotide positions to
Download Type: Save to File | Show in Browser

Note: Valid formats for Genomic Sequence IDs are:
  'ID' for full sequence,
  'ID:start..end' for sequence from start to end,
  'ID:start..end:r' for sequence from start to end, reverse-complemented.
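
As a sketch of how the three ID formats above could be interpreted, the following Python parses each form and extracts the requested region from a sequence string. It assumes 1-based, inclusive coordinates and IDs that contain no colon or whitespace; neither assumption is stated on this page. The names SeqRequest, parse_spec, and extract are hypothetical.

```python
import re
from typing import NamedTuple, Optional


class SeqRequest(NamedTuple):
    seq_id: str
    start: Optional[int]  # None means "full sequence"
    end: Optional[int]
    reverse: bool         # True when the spec ends in ":r"


# ID, optionally followed by :start..end, optionally followed by :r
_SPEC = re.compile(
    r"^(?P<id>[^:\s]+)(?::(?P<start>\d+)\.\.(?P<end>\d+)(?P<rev>:r)?)?$"
)

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def parse_spec(line: str) -> SeqRequest:
    """Parse 'ID', 'ID:start..end' or 'ID:start..end:r'."""
    m = _SPEC.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognized Genomic Sequence ID spec: {line!r}")
    start = int(m["start"]) if m["start"] else None
    end = int(m["end"]) if m["end"] else None
    return SeqRequest(m["id"], start, end, m["rev"] is not None)


def extract(sequence: str, req: SeqRequest) -> str:
    """Apply a parsed request to a sequence (assumed 1-based, inclusive)."""
    sub = sequence if req.start is None else sequence[req.start - 1:req.end]
    if req.reverse:
        # Complement each base, then reverse the string.
        sub = sub.translate(_COMPLEMENT)[::-1]
    return sub


if __name__ == "__main__":
    # Made-up ID and sequence, for illustration only.
    req = parse_spec("contig1:3..6:r")
    print(req, extract("AACGTTTT", req))
```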


Retrieve Sequences By Open Reading Frame IDs

Enter a list of ORF IDs (each ID on a separate line):
Choose the type of sequence: protein | genomic
Choose the region of the sequence(s):
begin at nucleotides
end at nucleotides
Download Type: Save to File | Show in Browser

Help

Types of sequences:
protein: the predicted translation of the gene
CDS: the coding sequence, excluding UTRs (introns spliced out)
transcript: the processed transcript, including UTRs (introns spliced out)
genomic: a region of the genome. Genomic sequence is always returned from 5' to 3', on the proper strand.
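
To make the difference between "transcript" and "CDS" concrete, here is a minimal sketch that derives both from a genomic string. It assumes a forward-strand gene, exon coordinates given as 0-based, end-exclusive intervals, and a coding interval given in genomic coordinates. These conventions, the function names, and the toy sequence are all assumptions for illustration, not a description of how the tool stores its annotations.

```python
def splice(genomic, intervals):
    """Concatenate the given (start, end) intervals, dropping what lies between."""
    return "".join(genomic[s:e] for s, e in sorted(intervals))


def transcript_and_cds(genomic, exons, cds_start, cds_end):
    """Return (transcript, CDS) for a forward-strand gene.

    transcript: all exons joined, UTRs included, introns removed.
    CDS: the exons clipped to [cds_start, cds_end), so UTRs are excluded.
    """
    transcript = splice(genomic, exons)
    coding = [
        (max(s, cds_start), min(e, cds_end))
        for s, e in exons
        if s < cds_end and e > cds_start  # keep only exons that overlap the CDS
    ]
    return transcript, splice(genomic, coding)


if __name__ == "__main__":
    # Toy gene: 2 bp 5' UTR, one intron (lowercase), 3 bp 3' UTR.
    genomic = "GGATGAA" + "gtag" + "CTAACCC"
    exons = [(0, 7), (11, 18)]
    transcript, cds = transcript_and_cds(genomic, exons, cds_start=2, cds_end=15)
    print(transcript)
    print(cds)
```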

Regions:
relative to sequence start: to retrieve, e.g., the 100 bp upstream genomic region, use "begin at start - 100, end at start - 1".
relative to sequence stop: to retrieve, e.g., the last 10 bp of a sequence, use "begin at stop - 9, end at stop + 0".
relative to sequence start and stop: to retrieve, e.g., a CDS with the first and last 10 base pairs excised, use "begin at start + 10, end at stop - 10".
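
The arithmetic behind these three examples can be checked with a small sketch. It assumes that "start + 0" is the first base of the sequence, "stop + 0" is the last base, both ends are inclusive, and the feature lies on the forward strand; the help text implies the first three but does not state them outright. The function resolve_region and the example coordinates are hypothetical.

```python
def resolve_region(seq_start, seq_stop, begin, end):
    """Turn ("start" | "stop", offset) pairs into absolute inclusive positions.

    seq_start and seq_stop are the first and last positions of the sequence.
    """
    anchors = {"start": seq_start, "stop": seq_stop}
    first = anchors[begin[0]] + begin[1]
    last = anchors[end[0]] + end[1]
    if first > last:
        raise ValueError("region begins after it ends")
    return first, last


if __name__ == "__main__":
    # Hypothetical feature occupying positions 1001..2000 (1000 bp).
    start, stop = 1001, 2000

    # 100 bp upstream: begin at start - 100, end at start - 1
    print(resolve_region(start, stop, ("start", -100), ("start", -1)))

    # last 10 bp: begin at stop - 9, end at stop + 0
    print(resolve_region(start, stop, ("stop", -9), ("stop", 0)))

    # first and last 10 bp excised: begin at start + 10, end at stop - 10
    print(resolve_region(start, stop, ("start", 10), ("stop", -10)))
```

Because both ends are inclusive, the length of a resolved region is last - first + 1, which is why the last 10 bp use an offset of -9 rather than -10.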

Note: If UTRs have not been annotated for a gene, then choosing "transcription start" may have the same effect as choosing "translation start."