Email updates

Keep up to date with the latest news and content from Genome Biology and BioMed Central.

Paper report

Making high fidelity expression libraries

Rachel Brem

Genome Biology 2000, 1:reports0044  doi:10.1186/gb-2000-1-2-reports0044

The electronic version of this article is the complete one and can be found online at: http://genomebiology.com/2000/1/2/reports/0044


Received:1 April 2000
Published:11 May 2000

© 2000 BioMed Central Ltd

Significance and context

Many new biochemistry experiments involve the expression of DNA or RNA libraries of thousands of genes at once. A technical problem arises when some sequences of the library contain errors, such as frameshifts, deletions, or misplaced start or stop codons. Here, Cho et al. have developed a new technique to eliminate such mistakes. For each sequence in an RNA library, they translate the encoded protein and then covalently attach the protein to its RNA. From a pool of these protein-RNA fusions, they select only full-length proteins, guaranteeing, in theory, an error-free library. This new technique is important not only because it can improve the quality of nucleic acid libraries, but also because the final RNA-protein fusions can be used in further selection experiments for protein structure or function. The technique may turn out to be a viable cell-free alternative to phage display.

Key results

Cho et al. use a new procedure to generate full-length proteins that are coupled to their mRNAs. When the authors make a library of random 20 amino-acid peptides using their new procedure, 88% of them are perfect, compared with 34% for a control library that has no specific selection of full-length proteins. The authors then use their basic idea to make several different test libraries. For each library, they want proteins that are long (80-300 amino acids), but their error-free fusions are only 20 amino acids long. So they must ligate the short error-free RNA cassettes into longer sequences and then translate those sequences; the ligation introduces some error, however. The final libraries of long sequences are between 60 and 100% error-free.

Methodological innovations

The technique of Cho et al. is as follows. First, the authors synthesize cassettes of RNA coding for sequences of interest flanked by the code for protein purification tags at the amino and carboxyl termini. At the end of each RNA sequence is an adduct of the antibiotic puromycin. Next, the authors add ribosomes and free amino acids. Ribosomes translate the RNA into protein, then stall at the puromycin at the end of the RNA and the protein becomes covalently attached to its message. The covalent protein-RNA fusions are then purified against affinity columns corresponding to the new protein's amino- and carboxy-terminal tags. Proteins in fusions that are selected should have amino and carboxyl termini and, therefore, should be full length.

Reporter's comments

The new technology of Cho et al. is exciting and innovative, but it seems as though a few problems still need to be worked out. It would be interesting to check whether the final expressed proteins are functional, or at least structurally intact, on the RNA fusions. And now that Cho et al. have developed the main library strategy, they will probably need to go back to improve the fidelity of their ligation step and/or the chemical synthesis of their initial sequences.

Table of links

Assumptions that are made about each paper that is the subject of a report, unless otherwise specified:
The full text and figures are available only to subscribers of the journal, but are available over the internet from the journal's website. The paper itself is abstracted by PubMed. There is no supplementary material.