Codon list

From 22101
Jump to navigation Jump to search

20 Amino acids, their single-letter data-base codes (SLC), and their corresponding DNA codons

Amino Acid SLC DNA codons
Isoleucine I ATT, ATC, ATA
Leucine L CTT, CTC, CTA, CTG, TTA, TTG
Valine V GTT, GTC, GTA, GTG
Phenylalanine F TTT, TTC
Methionine M ATG
Cysteine C TGT, TGC
Alanine A GCT, GCC, GCA, GCG
Glycine G GGT, GGC, GGA, GGG
Proline P CCT, CCC, CCA, CCG
Threonine T ACT, ACC, ACA, ACG
Serine S TCT, TCC, TCA, TCG, AGT, AGC
Tyrosine Y TAT, TAC
Tryptophan W TGG
Glutamine Q CAA, CAG
Asparagine N AAT, AAC
Histidine H CAT, CAC
Glutamic acid E GAA, GAG
Aspartic acid D GAT, GAC
Lysine K AAA, AAG
Arginine R CGT, CGC, CGA, CGG, AGA, AGG
Stop codons Stop TAA, TAG, TGA

In this table, the twenty amino acids found in proteins are listed, along with the single-letter code used to represent these amino acids in protein data bases. The DNA codons representing each amino acid are also listed. All 64 possible 3-letter combinations of the DNA coding units T, C, A and G are used either to encode one of these amino acids or as one of the three stop codons that signals the end of a sequence. While DNA can be decoded unambiguously, it is not possible to predict a DNA sequence from its protein sequence. Because most amino acids have multiple codons, a number of possible DNA sequences might represent the same protein sequence.