Genome Biology

official impact factor 6.89

Open Access Research

Prediction of unidentified human genes on the basis of sequence similarity to novel cDNAs from cynomolgus monkey brain

Naoki Osada1,2*, Munetomo Hida3, Jun Kusuda1, Reiko Tanuma1, Makoto Hirata1, Momoki Hirai4, Keiji Terao5, Yutaka Suzuki3, Sumio Sugano3 and Katsuyuki Hashimoto1

Author Affiliations

1 Division of Genetic Resources, National Institute of Infectious Diseases, 1-23-1 Toyama-cho, Shinjuku-ku, 162-8640, Japan

2 Department of Biological Science, Graduate School of Science, University of Tokyo, Tokyo, Japan

3 Department of Genome Structure Analysis, Institute of Medical Science, University of Tokyo, Tokyo, Japan

4 Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Tokyo, Japan

5 Tsukuba Primate Center For Medical Science, National Institute of Infectious Diseases, Tsukuba, Japan

For all author emails, please log on.

Genome Biology 2001, 3:research0006-research0006.5 doi:10.1186/gb-2001-3-1-research0006

Published: 19 December 2001

Abstract

Background

The complete assignment of the protein-coding regions of the human genome is a major challenge for genome biology today. We have already isolated many hitherto unknown full-length cDNAs as orthologs of unidentified human genes from cDNA libraries of the cynomolgus monkey (Macaca fascicularis) brain (parietal lobe and cerebellum). In this study, we used cDNA libraries of three other parts of the brain (frontal lobe, temporal lobe and medulla oblongata) to isolate novel full-length cDNAs.

Results

The entire sequences of novel cDNAs of the cynomolgus monkey were determined, and the orthologous human cDNA sequences were predicted from the human genome sequence. We predicted 29 novel human genes with putative coding regions sharing an open reading frame with the cynomolgus monkey, and we confirmed the expression of 21 pairs of genes by the reverse transcription-coupled polymerase chain reaction method. The hypothetical proteins were also functionally annotated by computer analysis.

Conclusions

The 29 new genes had not been discovered in recent explorations for novel genes in humans, and the ab initio method failed to predict all exons. Thus, monkey cDNA is a valuable resource for the preparation of a complete human gene catalog, which will facilitate post-genomic studies.