The Triplet Genetic Code
Detailed experiments indicate that each series of three
nucleotides along a DNA strand orders where
particular amino acid building blocks will be placed when RNA synthesizes proteins. That is, the genetic
code is a triplet code. The 3-nucleotide units in RNA strands carrying this
information are called codons. For example, UUU or UUG are examples of codons,
the first corresponding to the 3-base sequence uracil-uracil-uracil and the second to
uracil-uracil-guanine. Therefore, the codons are transcribed from the DNA and strung out in
sequence to make mRNA. The transfer RNA (tRNA) responsible for transporting the correct amino
acid to the protein assembly point is made from anticodons, which are nucleotide
triplets that pair their bases with the mRNA codons. For example, AAA is the anticodon of
UUU, since the RNA base pairing rule is A with U and G with C.
Because there are 4 bases, there are 4 x 4 x 4 = 64 ways that the bases can combine in groups of
three. That is, given one of the 4 bases there are 4 possibilities to combine it with a second
base (giving 16 independent combinations), and each of these can combine with one of 4 bases,
giving a total of 4 x 16 = 64 possibilities for the number of independent ways to combine 3
bases. Of these 64 codons, 61 specify a unique
amino acid (this property
is called uniqueness), but because there are only 20 amino acids
and 61 codons available, most amino acids are ordered by more than one codon (this
property is
called degeneracy).
For example, the mRNA codons UGU and UGC both are found to order the amino acid called
cysteine. Since the mRNA is a reverse copy of the DNA which carries
the genetic code (because of the base pairing in the replication step),
the genetic code for cysteine corresponds to the sequences ACA or ACG in the DNA (that is, the
complement of UGU or UGC).
The remaining 3 codons of the 64 possibilities are
"stop commands" that end the addition of
amino acids to the chain and signal the completion of the protein under construction.
The following figure illustrates the 64 possible combinations for the genetic code and the
corresponding amino acids for which they code (see the table below for amino acid abbreviations).
The complete list of amino acids and the corresponding codons are given in the following table.
The Amino Acids and the Triplet Genetic Code
|
Symbol |
Name |
Codons
|
ala |
alanine |
GCU GCC GCA GCG
|
val |
valine |
GUU GUC GUA GUG
|
leu |
leucine |
UUA UUG CUU CUC CUA CUG
|
ile |
isoleucine |
AUU AUC AUA
|
pro |
proline |
CCU CCC CCA CCG
|
met |
methionine |
AUG
|
phe |
phenylalanine |
UUU UUC
|
trp |
tryptophan |
UGG
|
gly |
glycine |
GGU GGC GGA GGG
|
ser |
serine |
UCU UCC UCA UCG AGU AGC
|
thr |
threonine |
ACU ACC ACA ACG
|
cys |
cysteine |
UGU UGC
|
tyr |
tyrosine |
UAU UAC
|
asn |
asparagine |
AAU AAC
|
gln |
glutamine |
CAA CAG
|
lys |
lysine |
AAA AAG
|
arg |
arginine |
CGU CGC CGA CGG AGA AGG
|
his |
histidine |
CAU CAC
|
asp |
aspartic acid |
GAU GAC
|
glu |
glutamic acid |
GAA GAG
|
|
|
|
|