Email updates

Keep up to date with the latest news and content from Genome Biology and BioMed Central.

Open Access Highly Accessed Research

Primate-specific evolution of an LDLR enhancer

Qian-fei Wang12, Shyam Prabhakar12, Qianben Wang3, Alan M Moses1, Sumita Chanan1, Myles Brown3, Michael B Eisen1, Jan-Fang Cheng12, Edward M Rubin12* and Dario Boffelli12

  • * Corresponding author: Edward M Rubin EMRubin@lbl.gov

  • † Equal contributors

Author Affiliations

1 Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA

2 US Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA

3 Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts 02115, USA

For all author emails, please log on.

Genome Biology 2006, 7:R68  doi:10.1186/gb-2006-7-8-r68


The electronic version of this article is the complete one and can be found online at: http://genomebiology.com/2006/7/8/R68


Received:11 May 2006
Revisions received:28 June 2006
Accepted:2 August 2006
Published:2 August 2006

© 2006 Wang et al.; licensee BioMed Central Ltd.

This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Background

Sequence changes in regulatory regions have often been invoked to explain phenotypic divergence among species, but molecular examples of this have been difficult to obtain.

Results

In this study we identified an anthropoid primate-specific sequence element that contributed to the regulatory evolution of the low-density lipoprotein receptor. Using a combination of close and distant species genomic sequence comparisons coupled with in vivo and in vitro studies, we found that a functional cholesterol-sensing sequence motif arose and was fixed within a pre-existing enhancer in the common ancestor of anthropoid primates.

Conclusion

Our study demonstrates one molecular mechanism by which ancestral mammalian regulatory elements can evolve to perform new functions in the primate lineage leading to human.

Background

Since King and Wilson's provocative paper was published in 1975 [1], differences in gene regulatory sequences have been predicted to be among the major sources of phenotypic evolution and divergence among animals. Consistent with this hypothesis, cis-regulatory changes have been found to play an important role in the evolution of morphologic features in model organisms [2]. In contrast, evolution of physiology has been linked to changes in protein coding sequences, when studied in animal vision, digestive metabolism, and host defense [3-7]. The contribution of regulatory sequence changes to the evolution of physiologic differences, however, is largely unexplored [8,9].

To examine the role of cis-regulatory changes in the emergence of novel physiologic traits in primates, we investigated the evolution of regulatory elements of the low-density lipoprotein (LDL) receptor gene (LDLR), which is a key player in maintaining lipid homeostasis. Cholesterol metabolism in humans has diverged in a variety of ways from that of many distant mammals such as rodents and dogs, with humans in general being more susceptible to diet-induced hypercholesterolemia [10]. The pivotal role of LDLR in cholesterol metabolism, coupled with its known expression differences among mammals [11], makes it a prime candidate for investigating primate-specific evolution of regulatory sequences. Here, we present molecular data supporting the gain of a cholesterol-sensing DNA motif in an ancestral mammalian LDLR regulatory element at a specific stage in primate evolution.

Results and discussion

Identification of primate-specific noncoding elements in the LDLR locus

To identify putative primate-specific LDLR regulatory sequences, we examined orthologous regions from a panel of mammals closely and distantly related to human for the presence of evolutionarily conserved noncoding sequences using Gumby, an algorithm that detects sequence blocks evolving significantly more slowly than the local neutral rate (see Materials and methods, below) [12-14]. Because humans and nonhuman primates share many features of cholesterol metabolism, we specifically scanned for elements that are preferentially conserved in primates under the hypothesis that primate-specific regulatory sequences contribute to the distinctive biology of those species. We conducted pair-wise sequence comparisons of the 83 kilobase (kb) genomic region containing LDLR and its entire 5' and 3' intergenic regions between human and each of a panel of distantly related species consisting of the prosimian lemur, mouse, and dog. In these comparisons we identified either the known promoter sequence alone (Figure 1a and data not shown) or a limited number of noncoding elements (Additional data file 1 and data not shown). The promoter region was the only noncoding region consistently identified as being conserved in the three pair-wise comparisons. In contrast, multiple sequence comparisons between human and a set of five anthropoid primate species, chosen on the basis of their evolutionary relationship using the 'phylogenetic shadowing' strategy [15], identified two human noncoding DNA elements, named PS (primate specific) 1 and 2, which were found to be highly significantly conserved (P approximately 10-5) in primates (Figure 1b). However, they were undetected in comparisons involving human and each of the distant species (Figure 1 and Additional data file 1).

thumbnailFigure 1. Conservation profiles of the LDLR locus using close (primate) and distant (human-mouse) species comparisons. (a) Human-mouse and (b) multiple primate (human, baboon, colobus, dusky titi, marmoset, and owl monkey) conservation profiles were calculated using Gumby and visualized using RankVISTA (see Materials and methods) and displayed with the human sequence as reference. Only about 6 kilobases (kb) of the 5' intergenic region is shown because of incomplete primate sequence availability. The entire 3' intergenic region was included in the analysis. Vertical bars depict conserved exonic (light blue) and nonexonic (red) sequences, with height indicating statistical significance of sequence conservation (see Materials and methods). LDLR coding exons (dark blue) and untranslated regions (UTRs; magenta) are marked below the conservation plots. Arrows denote the two highest-scoring primate-specific elements (PS1 and PS2). The inset shows the human-mouse VISTA plot for element PS2, with the vertical axis representing sequence identity calculated over a 100 base pair (bp) window.

To confirm independently the lack of significant conservation of the PS1 and PS2 elements between human and distant mammals, we also analyzed human-mouse alignment using a sliding-window percentage identity conservation criterion. We found that the human-mouse percentage identities across PS1 and PS2 were below 50% (Figure 1 and data not shown). This is close to the background percentage identity in aligned intergenic DNA and is well below the threshold of 70% identity that is normally applied to the detection of conserved functional sequences [16]. We further verified that the phastCons program [17] detects no conserved sequences overlapping PS1 and PS2 (data not shown). Although the phastCons predictions, obtained from the UCSC Genome Browser, are in general based on alignment of 17 mammalian and nonmammalian species, conservation scores in the LDLR locus reflect only mammalian conservation because more distant genomes exhibit very limited nonexonic alignment in this locus.

To assess quantitatively the conservation level of PS1 and PS2 between human and distant mammals, we identified the orthologous aligned counterparts of the human PS1 and PS2 elements in lemur, mouse, and dog. Gumby analysis of conservation scores indicated that each of these nonanthropoid primate sequences exhibited a level of similarity to the human sequence consistent with unconstrained evolution at the neutral rate (conservation P value; Table 1). Together, these analyses strongly suggest a lack of significant sequence constraint between the anthropoid primate and mammalian PS1 and PS2 sequences.

Table 1. PS2 enhancer functional divergence correlates with sequence constraint

The human LDLR PS2 element exhibits significantly greater enhancer activity than its mammalian orthologs

To explore the potential regulatory function of these two primate-specific conserved elements, we examined their ability to drive reporter gene expression in both a transient transfection assay in human 293T cells and in an in vivo mouse liver gene transfer assay [18]. Each human element plus approximately 200 base pairs (bp) of flanking sequence on either side was cloned upstream of the human LDLR promoter [19] fused to a luciferase reporter gene. Human element PS2, but not PS1, consistently increased luciferase expression approximately fivefold relative to the human promoter alone in both the in vitro and in vivo assays (Figure 2). The human element PS2 also increased luciferase expression when cloned upstream of the generic SV40 promoter, albeit to a lesser extent (twofold; Additional data file 3). Enhancer activity of this element was further confirmed by the finding that genomic region corresponding to PS2, but not PS1, is a DNaseI hypersensitive site in human liver cells (Additional data file 2 and data not shown).

thumbnailFigure 2. Human LDLR PS2 enhancer exhibits significantly higher activity than orthologous lemur, mouse, and dog enhancers. Luciferase assay analysis of (a) transient transfections into human 293T cells and (b) plasmid DNA transfer into mouse liver. The luciferase reporter constructs tested are either the LDLR promoter alone (promoter) or the promoter in combination with the LDLR PS2 enhancer from one of the indicated species. Error bars indicate standard deviation. 'SRE mutant' refers to the mutagenized human sterol regulatory element (SRE) with four point substitutions relative to the wild-type (WT) SRE (Figure 4a). Luciferase activity is reported in arbitrary units. Each triangle in panel b represents luciferase activity in an individual mouse. Red bars denote the median activity of each construct.

To explore the regulatory function, if any, of mammalian sequences orthologous to human PS2, we cloned the PS2-aligned sequences from lemur, mouse, and dog into the luciferase reporter vector described above and compared their activities with that of the human sequence. Despite the lack of statistically significant sequence constraint between the human enhancer and its lemur, mouse, and dog orthologs, the latter three sequences exhibited enhancer activity both in vitro and in vivo (Figure 2). The human regulatory element, however, consistently exhibited stronger enhancer activity in both assays, driving twofold greater expression than lemur or dog PS2 and fourfold greater expression than mouse (Figure 2a). This observation, coupled with the evidence of negative selection acting on the primate enhancer and the lack of significant sequence constraint between the anthropoid primate and mammalian PS2 sequences (conservation P value; Table 1), suggests that the stronger enhancer activity in human is a gain of function in the anthropoid primate lineage with a potentially important adaptive role in these species.

An anthropoid-primate specific sterol regulatory element contributes to distinct human PS2 enhancer activity

To identify the molecular basis of the primate-specific activity of PS2, we computationally dissected the 860 bp human PS2 enhancer (see Materials and methods, below) and found a sterol regulatory element (SRE). This is a binding site specifically recognized by the cholesterol sensing proteins SREBPs (sterol regulatory element binding proteins), which are known to play a key role in the regulation of LDLR [20,21]. Phylogenetic analysis of the orthologous PS2 sequences from three distant mammals (mouse, rat, and dog), three prosimians (lemur, mouse lemur and galago), and nine anthropoid primates covering all major lineages including hominoids, and old-world and new-world monkeys revealed the presence of the SRE exclusively in anthropoid primates (Figure 3). This phylogenetic distribution of the SRE in mammals can most parsimoniously be explained by the appearance of the SRE in the ancestor of anthropoid primates after its divergence from prosimians (Figure 3).

thumbnailFigure 3. Phylogenetic analysis of the SRE. The human sterol regulatory element (SRE) motif and its orthologs were scored for transcription factor binding affinity, with low motif scores indicating low predicted affinity to SRE binding protein (SREBP; see Materials and methods). Because the SRE is present in all the analyzed anthropoid primates (indicated by the red branches in the tree) and absent from the prosimians, rodents, and dog, emergence in the lineage leading to anthropoid primates is the most parsimonious explanation.

The functional role of the binding motif identified by computational analysis was explored by site-specific mutagenesis. A 4 bp substitution was introduced into the SRE, which was expected to inactivate the site completely based on a previously reported mutagenesis study [22]. The 4 bp substitution in the SRE decreased human enhancer activity in the human cell culture assay and the in vivo mouse liver DNA transfer assay to a level comparable with that in lemur, mouse, and dog enhancers; these species lack a computationally predicted SRE (Figure 2). The functionality of the SRE, found exclusively in anthropoid primates, suggests that this element is likely to contribute to the stronger activity found in these species. We also identified within the 860 bp enhancer a 21 bp subregion that exhibits strong conservation across mammalian species including lemur, mouse lemur, galago, mouse and dog, and that contains predicted binding sites for transcription factors activating enhancer binding protein (AP)-4 and AP-1. Deletion of the conserved 21 bp sequence from either human or dog PS2 resulted in a significant reduction in enhancer activity (data not shown), suggesting that the evolutionarily conserved AP-4 and AP-1 sites are important for the core enhancer activity shared among mammals. It is worth noting that such short blocks of genuinely constrained sequence are not easily distinguishable from the numerous 'coincidentally conserved' sequence fragments that are likely to occur in large genomic regions as a consequence of stochastic variation in the incidence of neutral mutations. Incorporation of additional information, namely the binding specificities of transcription factors, was required to classify this 21 bp fragment as a functional candidate. Thus, conservation of this short subsequence in multiple mammals does not detract from the fact that the enhancer sequence is significantly conserved only in anthropoid primates, as described above.

Because SREBP-2, the major regulator of LDLR [20,21], specifically binds to the SRE [11], we examined the responsiveness of the human, lemur, mouse, and dog orthologous PS2 enhancers to this transcription factor. Co-transfection of the reporter gene driven by PS2 and the human LDLR promoter with a construct expressing the mature form of SREBP-2 indicated that the human enhancer was strongly activated by the exogenous SREBP-2, to a level fivefold higher than that of the human LDLR promoter alone, which is known to be SREBP responsive as well [23]. The lemur, mouse, and dog enhancers were activated to a significantly lesser extent, which is consistent with their much lower SRE prediction score and with their lack of additional consensus SRE motifs within the PS2 element (Figure 3, Figure 4a, and data not shown). To determine whether the observed differential SREBP-2 response among tested mammalian PS2 enhancers was directly mediated by the predicted SRE, we inactivated or restored the consensus SRE by site-specific mutagenesis at the orthologous positions of the human and dog PS2 element, respectively. Substituting four bases in the human SRE motif, which reduced the motif matrix score from 1 to 0.35 (see Materials and methods, below), resulted in a reduction in SREBP-2 enhancer response to a level comparable to that of the lemur, mouse, and dog enhancers. These results indicate that the anthropoid-specific SRE mediates the activation of the PS2 enhancer by SREBP-2 and contributes to the strong enhancer activity characterizing human and other anthropoid primates. Furthermore, substituting three bases in the dog SRE, so as to increase the SRE motif score from 0.47 to 1 (representing a perfect SRE), led to a significant increase in the dog enhancer response to SREBP-2, although only to half the level of the human PS2 enhancer (Figure 4b). This suggests that the anthropoid primate-specific SRE is part of a combinatorial mechanism [24], including possible additional substitutions in the core enhancer element that contribute to the stronger human PS2 enhancer activity.

thumbnailFigure 4. Relation between SRE motif score and response of PS2 enhancer to SREBP-2. (a) Alignment of sequences from indicated species at positions orthologous to human PS2 sterol regulatory element (SRE). 'Human mutant' refers to the mutated human SRE with four point substitutions relative to the human wild-type. 'Dog mutant' refers to the consensus SRE introduced into dog PS2 by means of three point substitutions. (b) Luciferase assay analysis of reporter construct and the SRE binding protein (SREBP)-2 expression vector cotransfection into 293T cells. The LDLR PS2 element from each of the indicated species was tested in combination with the LDLR promoter. The Y-axis denotes SRE motif score (likeness to known SRE motifs; see Materials and methods). Response of the PS2 element to SREBP-2 is shown as the increase in luciferase expression level (arbitrary units) on activation by 3 ng SREBP-2 expression vector. Expression level increase for LDLR promoter alone is indicated by the dotted line. Error bars indicate standard deviation.

The role of SREBP-2 in regulating the human PS2 enhancer was further explored in its native chromosomal context in HepG2 cells, which actively express SREBP-2 and are a well defined system for studying LDLR regulation [25-27]. Our analysis showed that the PS2 sequence is a DNaseI hypersensitive site in HepG2 cells (Additional data file 2), suggesting that the corresponding DNA element is involved in transcriptional regulation of the endogenous gene. Using the ChIP (chromatin immunoprecipitation) assay, we were able to show that fractionation of chromatin with an anti-SREBP-2 antibody specifically enriched for endogenous PS2 and LDLR promoter DNA relative to control region (Figure 5); the latter has previously been shown to be bound by SREBP-2 [28]. Together, the DNAseI hypersensitivity and ChIP assays provide strong evidence that SREBP-2 binds in the vicinity of the human PS2 enhancer in its native genomic locus. Regulation of the enhancer by SREBP-2 also suggests that the PS2 element plays a role in the activation of its upstream gene LDLR rather than the downstream gene Spbc24, which encodes a component of the kinetochore Ndc80 protein complex [29]. It was recently noted, based on genome-wide analysis of gene expression, that SREBP targets are largely restricted to lipid metabolism genes, including LDLR [20]. No connection was found between SREBP and kinetochore structural genes such as Spbc24.

thumbnailFigure 5. Anti-SREBP-2 antibody specifically enriches human PS2 DNA sequences in HepG2 cells in a ChIP assay. DNA precipitates were measured by real-time polymerase chain reaction using primers spanning the indicated regions. Control region (control) corresponds to the first coding exon of the neighboring gene SPBC24, which is approximately 7.5 kilobases away from the PS2 sequence. The results are presented as fold increase in the enrichment of precipitated DNA by anti-SREBP-2 (anti-sterol regulatory element binding protein-2) antibody over nonspecific IgG. Graphical representations of the mean ± standard error from three independent experiments are shown.

Conclusion

We have shown phylogenetic and molecular data supporting the evolution of differential gene expression of LDLR in mammals. Transcriptional control of LDLR is mainly effected through the intracellular cholesterol sensor SREBP-2. The latter was previously shown to mediate the increased transcription of LDLR in response to low cholesterol levels through an SRE in the LDLR promoter [23,30], which is conserved in all mammals examined. The additional SRE found in the PS2 enhancer in primates may lead to differential response to SREBP-2 among mammals. Although the contribution of the PS2 enhancer to the in vivo regulation of LDLR remains to be elucidated, these results suggest that species-specific regulation of LDLR is expected in conditions that result in decreased intracellular cholesterol levels, such as reduced availability of dietary cholesterol, and has implications for the study of LDLR response to cholesterol-lowering drugs in animal models.

Although the human LDLR coding sequence and promoter are well conserved in all sequenced mammals (Figure 1 and Additional data file 1), our data support the modification of the expression characteristics of this gene through the primate-specific evolution of a distal regulatory element. We have shown the emergence and fixation of a SRE in the common ancestor of anthropoid primates, which modifies the expression driven by a pre-existing mammalian enhancer shared by all tested mammals. This demonstrates one mechanism by which mammalian regulatory elements can evolve to perform new functions. Given the vital importance of LDLR in energy storage, the appearance of a new cholesterol sensing element in the LDLR enhancer might have played a role in the evolution of new physiologic features, because the ancestor of anthropoid primates adapted to different metabolic requirements and diets.

Materials and methods

Plasmid constructs

The human LDLR promoter was cloned in the proper orientation upstream of the luciferase cDNA in the pGL3Basic construct (Promega, Madison, WI, USA). The human PS1 element and the PS2 elements from human, lemur, mouse, and dog LDLR loci were polymerase chain reaction (PCR) cloned into poly-linker sites in the (-) orientation upstream of the promoter. The cloned human PS1 element corresponds to the Gumby predicted conserved sequence and approximately 200 bp of flanking sequence on either side (hg18, chr19:11067913-11068639). To clone the human PS2 element, the region containing human PS2 was PCR cloned into pGL3Basic (see Additional data file 4 for primer sequences), and digested with SpeI and NheI to only include the Gumby predicted conserved sequence and approximately 200 bp of flanking sequence on either side (hg18, chr19:11110333-11111194). Site-specific point mutations and deletions were introduced into human and dog PS2 elements using QuikChangeII site-directed mutagenesis kit (Stratagene, La Jolla, CA, USA), in accordance with the manufacturer's protocol and were confirmed by sequencing. The expression construct for human mature form of SREBP-2 (pcDNA.2FLAG SREBP-2) was kindly provided by Dr Timothy F Osborne (UC Irvine). The human LDLR promoter (hg18, chr19:11060880-11061099) was PCR amplified (see Additional data file 4 for primers used).

Transient transfection reporter assay

Cells of human embryonic kidney cell line 293T (ATCC CRL-11268) were grown at 37°C and 5% carbon dioxide CO2 in Dulbecco's modified Eagle's medium (ATCC), supplemented with 10% fetal bovine serum (Hyclone, Logan, UT, USA), l-glutamine, and penicillin-streptomycin. Cells with a passage number below 15 were used. The cells were grown in 12-well plates (4 × 104 cells/well) and transfected using Fugene (Roche Molecular Biochemicals, Indianapolis, IN, USA), in accordance with the manufacturer's protocol. Briefly, 100 ng of each assayed plasmid and 10 ng pCMVβ (BD Biosciences, Franklin Lakes, NJ, USA) were mixed with 1.5 μl Fugene and added to each well. Following 42-48 hours of incubation, cells were harvested and lysed. Activity of luciferase and β-galactosidase was measured using the Luciferase Assay System (Promega) and the Galacto-Light Plus (Applied Biosystems, Foster City, CA, USA), respectively. Luciferase activity for each sample was normalized to the β-galactosidase assay control. For co-transfection experiments, 100 ng of the lulciferase reporter gene construct, 3 ng of SREBP-2 expression vector, and 10 ng of pCMVβ were used. Transfections were carried out in duplicate. All experiments are representative of at least three independent transfections.

Tail vein plasmid DNA transfer assays

Tail vein injection was performed as described by Herweijer and Wolff [18], following the TransIT® In Vivo Gene Delivery System (Mirus Corporation, Madison, WI, USA) protocol. Six to nine FVB male mice (Charles River Laboratory, Wilmington, MA, USA) at age 7-8 weeks were used for each reporter gene construct. Ten micrograms of each reporter construct, along with 2 μg of pCMVβ (BD Biosciences) to correct for delivery efficiency, were injected into each mouse. The entire content of the syringe was delivered in 3-5 s. Animals were killed 24 hours later, livers extracted, measured to correct for size, homogenized, and centrifuged for 15 min at 4°C, 14,000 rpm. Activities of luciferase and β-galactosidase were measured as described above. All P values are from the two-sample Wilcoxon rank-sum (Mann-Whitney) test using STATA (STATA Corporation, College Station, TX, USA). All experimental results are representative of two independent plasmid DNA transfer assays.

Chromatin immunoprecipitation

HepG2 cells (ATCC HB-8065) were cultured in DMSF (defined medium serum free) medium for 24 hours for induction of endogenous SREBPs [31]. Chromatin immunoprecipitaion assays were performed as described previously [32]. Cross-linked chromatin was immunoprecipitated with a specific SREBP-2 antibody (Santa Cruz sc-8151) [8] or IgG control.

Sources of sequence data

Draft sequences of baboon, colobus, dusky titi, marmoset, owl monkey, lemur, and galago bacterial artificial chromosomes (BACs) were determined by sequencing ends of 3 kb subclones to 8- to 10-fold coverage using BigDye terminators (Applied Biosystems) and assembling reads into contigs with the Phred-Phrap-Consed suite, as described previously [33]. All BAC sequences were submitted to GenBank with the following species and accession numbers: baboon (Papio hamadryas), AC140974; colobus (Colobus guereza), AC150433; marmoset (Callithrix jacchus), AC145530; dusky titi (Callicebus moloch), AC144655; owl monkey (Aotus hybrid), AC171393; squirrel monkey (Saimiri boliviensis boliviensis), AC146467; lemur (lemur catta), AC118569; mouse lemur (microcebus murinus), AC175656; and galago (otolemur garnetti), AC175655). Human, chimpanzee, rhesus, mouse, rat, and dog sequences were downloaded from the UCSC Genome Bioinformatics website [34].

Analysis of sequence conservation

We aligned the human LDLR locus (chr19:11,055,219-11,117,169; NCBI Build 35) to its orthologs in baboon, colobus, dusky titi, marmoset, owl monkey, lemur, mouse, and dog using MLAGAN [35]. Because incomplete primate sequence availability, we included only about 6 kb of the 5' intergenic region in the analysis. Aligned sequences were scanned for statistically significant evolutionarily conserved regions using Gumby [12-14]. Gumby goes through the following three-step process to identify statistically significant conservation in the global alignment input. First, noncoding regions in the alignment are used to estimate the local neutral mismatch rates among all pairs of aligned sequences. The rates are used to derive a log-likelihood scoring scheme for slow versus neutral evolution, in which the slow rate is set to some fraction (in this case, half) of the neutral rate. Second, each alignment position is then assigned a conservation score using a phylogenetically weighted sum-of-pairs scheme. Third, conserved regions of any length are identified as alignment blocks with a high cumulative conservation score and assigned P values using Karlin-Altschul statistics [36]. We set a threshold P value of 0.1 in a baseline human sequence length of 10 kb. Conserved regions identified by Gumby were visualized using RankVISTA. In addition, human-mouse sequence conservation was analyzed using the VISTA web server [37,38], with the standard criterion of 70% sequence identity in window of size 100 bp.

Binding site prediction

We scanned the aligned enhancer sequences for predicted transcription factor binding sites using DiAlign TF [39]. Anthropoid primate (human, baboon, colobus, dusky titi, marmoset, and owl monkey) sequences were assessed for the presence of sites conserved across all six species that were predicted to bind one of the following liver-expressed transcription factors: C/EBP (CCAAT/enhancer-binding protein), LXR (liver X receptor), FXR (farnesoid X receptor), COUP-TF (Chicken Ovalbumin Upstream Promoter Transcription Factor), peroxisome proliferator-activated receptor (PPAR), HNF1 (hepatocyte nuclear factor 1), HNF3, HNF4, and SREBP. Binding sites common to primates and mammals were predicted on the basis of conservation of any vertebrate transcription factor motif in at least eight of the 10 analyzed species (six anthropoid primates, lemur, mouse, rat, and dog). Motif scores of the SREs or SRE orthologs of individual species were calculated using rVISTA [40] and normalized so that the maximum achievable score is 1.0, and the expected score of a random nucleotide sequence with the local GC content is zero. The score distribution of functional SREs was calculated from the binding profile of SREBP [41], assuming that nucleotide frequencies at each position in the motif are independent. We retrospectively augmented the species set with SRE orthologs from chimpanzee, rhesus, squirrel monkey, mouse lemur and galago, based on pair-wise alignments of those species to human.

Additional data files

The following additional data are available with the online version of this paper: A figure showing the conservation profiles of the LDLR locus using (A) human-dog and (B) human-lemur sequence comparisons (Additional data file 1); a figure showing the DNaseI hypersensitive site mapping around the LDLR PS2 region in human liver cell line HepG2 (Additional data file 2); a figure showing that human LDLR PS2 enhancer activity is independent of the LDLR promoter (Additional data file 3); a table listing the primers used in the cloning of human LDLR promoter and PS2 elements from indicated species (Additional data file 4); and a table listing the primers used in ChIP assay (Additional data file 5).

Additional data file 1. A figure showing the conservation profiles of the LDLR locus using (A) human-dog and (B) human-lemur sequence comparisons.

Format: EPS Size: 1.9MB Download fileOpen Data

Additional data file 2. A figure showing the DNaseI hypersensitive site mapping around the LDLR PS2 region in human liver cell line HepG2.

Format: EPS Size: 3.1MB Download fileOpen Data

Additional data file 3. A figure showing that human LDLR PS2 enhancer activity is independent of the LDLR promoter.

Format: EPS Size: 2.3MB Download fileOpen Data

Additional data file 4. A table listing the primers used in the cloning of human LDLR promoter and PS2 elements from indicated species.

Format: PDF Size: 11KB Download file

This file can be viewed with: Adobe Acrobat ReaderOpen Data

Additional data file 5. A table listing the primers used in ChIP assay.

Format: PDF Size: 10KB Download file

This file can be viewed with: Adobe Acrobat ReaderOpen Data

Acknowledgements

We thank J Noonan, L Pennacchio, A Visel, S Tringe, N Ahituv, and other Rubin laboratory members for suggestions and criticisms on the manuscript, and Tim Osborne for providing the SREBP-2 expression vector. B Kullgren and S Phouanenavong provided technical assistance for tail vein plasmid DNA transfer assay.

Research was conducted at the EO Lawrence Berkeley National Laboratory and at the Joint Genome Institute. This work was supported by the Director, Office of Science, of the US Department of Energy under Contract No. DE-AC02-05CH11231 and NIH-NHLBI grant numbers THL007279F and U1HL66681B.

References

  1. King MC, Wilson AC: Evolution at two levels in humans and chimpanzees.

    Science 1975, 188:107-116. PubMed Abstract OpenURL

  2. Carroll SB: Evolution at two levels: on genes and form.

    PLoS Biol 2005, 3:e245. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  3. Hughes AL: Natural selection and the diversification of vertebrate immune effectors.

    Immunol Rev 2002, 190:161-168. PubMed Abstract | Publisher Full Text OpenURL

  4. Dorus S, Vallender EJ, Evans PD, Anderson JR, Gilbert SL, Mahowald M, Wyckoff GJ, Malcom CM, Lahn BT: Accelerated evolution of nervous system genes in the origin of Homo sapiens.

    Cell 2004, 119:1027-1040. PubMed Abstract | Publisher Full Text OpenURL

  5. Zhang J, Zhang YP, Rosenberg HF: Adaptive evolution of a duplicated pancreatic ribonuclease gene in a leaf-eating monkey.

    Nat Genet 2002, 30:411-415. PubMed Abstract | Publisher Full Text OpenURL

  6. Jessen TH, Weber RE, Fermi G, Tame J, Braunitzer G: Adaptation of bird hemoglobins to high altitudes: demonstration of molecular mechanism by protein engineering.

    Proc Natl Acad Sci USA 1991, 88:6519-6522. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  7. Yokoyama S: Molecular evolution of color vision in vertebrates.

    Gene 2002, 300:69-78. PubMed Abstract | Publisher Full Text OpenURL

  8. Rockman MV, Hahn MW, Soranzo N, Zimprich F, Goldstein DB, Wray GA: Ancient and recent positive selection transformed opioid cis-regulation in humans.

    PLoS Biol 2005, 3:e387. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  9. Hahn MW, Rockman MV, Soranzo N, Goldstein DB, Wray GA: Population genetic and phylogenetic evidence for positive selection on regulatory mutations at the factor VII locus in humans.

    Genetics 2004, 167:867-877. PubMed Abstract | Publisher Full Text OpenURL

  10. Chiang JY: Bile acid regulation of gene expression: roles of nuclear hormone receptors.

    Endocr Rev 2002, 23:443-463. PubMed Abstract | Publisher Full Text OpenURL

  11. Horton JD, Goldstein JL, Brown MS: SREBPs: activators of the complete program of cholesterol and fatty acid synthesis in the liver.

    J Clin Invest 2002, 109:1125-1131. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  12. Prabhakar S, Poulin F, Shoukry M, Afzal V, Rubin EM, Couronne O, Pennacchio LA: Close sequence comparisons are sufficient to identify human cis-regulatory elements.

    Genome Res 2006, 16:855-863. PubMed Abstract | Publisher Full Text OpenURL

  13. Ahituv N, Prabhakar S, Poulin F, Rubin EM, Couronne O: Mapping cis-regulatory domains in the human genome using multi-species conservation of synteny.

    Hum Mol Genet 2005, 14:3057-3063. PubMed Abstract | Publisher Full Text OpenURL

  14. Hughes JR, Cheng JF, Ventress N, Prabhakar S, Clark K, Anguita E, De Gobbi M, de Jong P, Rubin E, Higgs DR: Annotation of cis-regulatory elements by identification, subclassification, and functional assessment of multispecies conserved sequences.

    Proc Natl Acad Sci USA 2005, 102:9830-9835. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  15. Boffelli D, McAuliffe J, Ovcharenko D, Lewis KD, Ovcharenko I, Pachter L, Rubin EM: Phylogenetic shadowing of primate sequences to find functional regions of the human genome.

    Science 2003, 299:1391-1394. PubMed Abstract | Publisher Full Text OpenURL

  16. Boffelli D, Nobrega MA, Rubin EM: Comparative genomics at the vertebrate extremes.

    Nat Rev Genet 2004, 5:456-465. PubMed Abstract | Publisher Full Text OpenURL

  17. Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, et al.: Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes.

    Genome Res 2005, 15:1034-1050. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  18. Herweijer H, Wolff JA: Progress and prospects: naked DNA gene transfer and therapy.

    Gene Ther 2003, 10:453-458. PubMed Abstract | Publisher Full Text OpenURL

  19. Sudhof TC, Van der Westhuyzen DR, Goldstein JL, Brown MS, Russell DW: Three direct repeats and a TATA-like sequence are required for regulated expression of the human low density lipoprotein receptor gene.

    J Biol Chem 1987, 262:10773-10779. PubMed Abstract | Publisher Full Text OpenURL

  20. Horton JD, Shah NA, Warrington JA, Anderson NN, Park SW, Brown MS, Goldstein JL: Combined analysis of oligonucleotide microarray data from transgenic and knockout mice identifies direct SREBP target genes.

    Proc Natl Acad Sci USA 2003, 100:12027-12032. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  21. Horton JD, Shimomura I, Brown MS, Hammer RE, Goldstein JL, Shimano H: Activation of cholesterol synthesis in preference to fatty acid synthesis in liver and adipose tissue of transgenic mice overproducing sterol regulatory element-binding protein-2.

    J Clin Invest 1998, 101:2331-2339. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  22. Smith JR, Osborne TF, Goldstein JL, Brown MS: Identification of nucleotides responsible for enhancer activity of sterol regulatory element in low density lipoprotein receptor gene.

    J Biol Chem 1990, 265:2306-2310. PubMed Abstract | Publisher Full Text OpenURL

  23. Hussain MM, Strickland DK, Bakillah A: The mammalian low-density lipoprotein receptor family.

    Annu Rev Nutr 1999, 19:141-172. PubMed Abstract | Publisher Full Text OpenURL

  24. Sanchez HB, Yieh L, Osborne TF: Cooperation by sterol regulatory element-binding protein and Sp1 in sterol regulation of low density lipoprotein receptor gene.

    J Biol Chem 1995, 270:1161-1169. PubMed Abstract | Publisher Full Text OpenURL

  25. Kong W, Wei J, Abidi P, Lin M, Inaba S, Li C, Wang Y, Wang Z, Si S, Pan H, et al.: Berberine is a novel cholesterol-lowering drug working through a unique mechanism distinct from statins.

    Nat Med 2004, 10:1344-1351. PubMed Abstract | Publisher Full Text OpenURL

  26. Maxwell KN, Fisher EA, Breslow JL: Overexpression of PCSK9 accelerates the degradation of the LDLR in a post-endoplasmic reticulum compartment.

    Proc Natl Acad Sci USA 2005, 102:2069-2074. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  27. Izem L, Rassart E, Kamate L, Falstrault L, Rhainds D, Brissette L: Effect of reduced low-density lipoprotein receptor level on HepG2 cell cholesterol metabolism.

    Biochem J 1998, 329:81-89. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  28. Zeng L, Lu M, Mori K, Luo S, Lee AS, Zhu Y, Shyy JY: ATF6 modulates SREBP2-mediated lipogenesis.

    EMBO J 2004, 23:950-958. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  29. Wei RR, Sorger PK, Harrison SC: Molecular organization of the Ndc80 complex, an essential kinetochore component.

    Proc Natl Acad Sci USA 2005, 102:5363-5367. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  30. Osborne TF, Gil G, Goldstein JL, Brown MS: Operator constitutive mutation of 3-hydroxy-3-methylglutaryl coenzyme A reductase promoter abolishes protein binding to sterol regulatory element.

    J Biol Chem 1988, 263:3380-3387. PubMed Abstract | Publisher Full Text OpenURL

  31. Toth JI, Datta S, Athanikar JN, Freedman LP, Osborne TF: Selective coactivator interactions in gene activation by SREBP-1a and -1c.

    Mol Cell Biol 2004, 24:8288-8300. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  32. Wang Q, Carroll JS, Brown M: Spatial and temporal recruitment of androgen receptor and its coactivators involves chromosomal looping and polymerase tracking.

    Mol Cell 2005, 19:631-642. PubMed Abstract | Publisher Full Text OpenURL

  33. Wang QF, Liu X, O'Connell J, Peng Z, Krauss RM, Rainwater DL, VandeBerg JL, Rubin EM, Cheng JF, Pennacchio LA: Haplotypes in the APOA1-C3-A4-A5 gene cluster affect plasma lipids in both humans and baboons.

    Hum Mol Genet 2004, 13:1049-1056. PubMed Abstract | Publisher Full Text OpenURL

  34. UCSC Genome Browser [http://genome.ucsc.edu] webcite

  35. Brudno M, Do CB, Cooper GM, Kim MF, Davydov E, Green ED, Sidow A, Batzoglou S: LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA.

    Genome Res 2003, 13:721-731. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  36. Karlin S, Altschul SF: Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.

    Proc Natl Acad Sci USA 1990, 87:2264-2268. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  37. VISTA web server [http://pipeline.lbl.gov/] webcite

  38. Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I: VISTA: computational tools for comparative genomics.

    Nucleic Acids Res 2004, (32 Web Server):W273-W279. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  39. Genomatix [http://www.genomatix.de] webcite

  40. Loots GG, Ovcharenko I, Pachter L, Dubchak I, Rubin EM: rVista for comparative sequence-based discovery of functional transcription factor binding sites.

    Genome Res 2002, 12:832-839. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  41. TRANSFAC [http://www.gene-regulation.com/pub/databases.html] webcite