Partial human ThyA protein sequence with rhythm based on the human codon distribution. (a) Four different note lengths (eighth, quarter, half, whole note) were each assigned to a particular codon usage range based on frequency per 1,000. Zero to 10 (per 1,000) was assigned the eighth note, 11 to 20 the quarter note, 21 to 30 a half note, and a codon usage greater than 30 was assigned the whole note. The more frequently a particular codon is used, the longer the note length that represents such a codon. (b) Huntingtin protein translated into musical notes based on the reduced-note range and human codon distribution. The wild-type huntingtin protein contains 21 glutamines in the beginning portion of the sequence. The protein also contains proline-rich regions. The repetition in these regions can be distinctly heard in the musical translation.
Takahashi and Miller Genome Biology 2007 8:405 doi:10.1186/gb-2007-8-5-405