<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
<ui>gb-2012-13-3-r17</ui>
<ji>1465-6906</ji>
<fm>
<dochead>Research</dochead>
<bibl>
<title><p>The RNA-binding landscapes of two SR proteins reveal unique functions and binding to diverse RNA classes</p></title>
<aug>
<au id="A1" ca="yes"><snm>&#196;nk&#246;</snm><fnm>Minna-Liisa</fnm><insr iid="I1"/><insr iid="I4"/><email>minna-liisa.anko@monash.edu</email></au>
<au id="A2"><snm>M&#252;ller-McNicoll</snm><fnm>Michaela</fnm><insr iid="I1"/><email>mmueller@mpi-cbg.de</email></au>
<au id="A3"><snm>Brandl</snm><fnm>Holger</fnm><insr iid="I1"/><email>brandl@mpi-cbg.de</email></au>
<au id="A4"><snm>Curk</snm><fnm>Tomaz</fnm><insr iid="I3"/><email>tomaz.curk@fri.uni-lj.si</email></au>
<au id="A5"><snm>Gorup</snm><fnm>Crtomir</fnm><insr iid="I3"/><email>crtomir.gorup@fri.uni-lj.si</email></au>
<au id="A6"><snm>Henry</snm><fnm>Ian</fnm><insr iid="I1"/><email>henry@mpi-cbg.de</email></au>
<au id="A7"><snm>Ule</snm><fnm>Jernej</fnm><insr iid="I2"/><email>jule@mrc-lmb.cam.ac.uk</email></au>
<au id="A8" ca="yes"><snm>Neugebauer</snm><mi>M</mi><fnm>Karla</fnm><insr iid="I1"/><email>neugebau@mpi-cbg.de</email></au>
</aug>
<insg>
<ins id="I1"><p>Max Planck Institute of Cell Biology and Genetics, Pfotenhauerstr. 108, Dresden, 01307, Germany</p></ins>
<ins id="I2"><p>Laboratory of Molecular Biology, Medical Research Council, Hills Road, Cambridge, CB2 0QH, UK</p></ins>
<ins id="I3"><p>Faculty of Computer and Information Science, University of Ljubljana, Trzaska cesta 25, Ljubljana, SI-1001, Slovenia</p></ins>
<ins id="I4"><p>Current address: Australian Regenerative Medicine Institute, Monash University, Wellington Road, Clayton, Melbourne, VIC-3800, Australia</p></ins>
</insg>
<source>Genome Biology</source>
<issn>1465-6906</issn>
<pubdate>2012</pubdate>
<volume>13</volume>
<issue>3</issue>
<fpage>R17</fpage>
<url>http://genomebiology.com/2012/13/3/R17</url>
<xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2012-13-3-r17</pubid><pubid idtype="pmpid">22436691</pubid></pubidlist></xrefbib></bibl>
<history><rec><date><day>26</day><month>1</month><year>2012</year></date></rec><revrec><date><day>20</day><month>3</month><year>2012</year></date></revrec><acc><date><day>21</day><month>3</month><year>2012</year></date></acc><pub><date><day>21</day><month>3</month><year>2012</year></date></pub></history>
<cpyrt><year>2012</year><collab>&#196;nk&#246; et al.; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec><st><p>Abstract</p></st>
<sec><st><p>Background</p></st>
<p>The SR proteins comprise a family of essential, structurally related RNA binding proteins. The complexity of their RNA targets and specificity of RNA recognition <it>in vivo </it>are not well understood. Here we use iCLIP to globally analyze and compare the RNA binding properties of two SR proteins, SRSF3 and SRSF4, in murine cells.</p>
</sec>
<sec><st><p>Results</p></st>
<p>SRSF3 and SRSF4 binding sites mapped to largely non-overlapping target genes, and <it>in vivo </it>consensus binding motifs were distinct. Interactions with intronless and intron-containing mRNAs as well as non-coding RNAs were detected. Surprisingly, both SR proteins bound to the 3' ends of the majority of intronless histone transcripts, implicating SRSF3 and SRSF4 in histone mRNA metabolism. In contrast, SRSF3 but not SRSF4 specifically bound transcripts encoding numerous RNA binding proteins. Remarkably, SRSF3 was shown to modulate alternative splicing of its own as well as three other transcripts encoding SR proteins. These SRSF3-mediated splicing events led to downregulation of heterologous SR proteins via nonsense-mediated decay.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>SRSF3 and SRSF4 display unique RNA binding properties underlying diverse cellular regulatory mechanisms, with shared as well as unique coding and non-coding targets. Importantly, CLIP analysis led to the discovery that SRSF3 cross-regulates the expression of other SR protein family members.</p>
</sec>
</sec>
</abs>
</fm>
<bdy>
<sec><st><p>Background</p></st>
<p>Gene expression in metazoans is regulated at multiple levels. While investigation of transcriptional regulation by transcription factors has led to a deep understanding of how DNA binding proteins direct RNA polymerases to genes, regulation of RNA processing by RNA-binding proteins is still enigmatic. Hundreds of proteins encoded by metazoan genomes have RNA-binding capacity conferred by specific protein structural domains, such as RNA recognition motifs (RRMs), KH domains and zinc fingers <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. RNA-binding proteins can change gene expression output at different steps of RNA metabolism, including pre-mRNA splicing, polyadenylation, RNA export, RNA stability, and translation. However, the <it>in vivo </it>binding specificity and function(s) of most RNA-binding proteins are not well understood.</p>
<p>SR proteins are a family of seven RNA-binding proteins with a functional repertoire that has expanded to many aspects of RNA metabolism <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. They are concentrated in the nucleus, where they participate in pre-mRNA splicing <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, yet nearly all SR proteins shuttle between the nucleus and cytoplasm. SR protein shuttling activity contributes to their roles in mRNA export, stability and translation <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. SR proteins share a modular structure of one or two RRMs at their amino terminus and an arginine-serine-rich RS domain of variable length at the carboxyl terminus. Both domains can directly contact RNA <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, although the RRM appears to determine RNA-binding specificity <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. <it>In vitro </it>binding specificities have been determined for some SR protein family members <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>, which bind to 4- to 10-nucleotide long degenerate sequences. Recently, <it>in vivo </it>crosslinking was used to define mRNA targets of SRSF1 (also called ASF/SF2); this study identified thousands of SRSF1 target sites, which resembled the sequences derived <it>in vitro </it><abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Mature mRNAs associated with SRSF3 (SRp20) and SRSF4 (SRp75) were also recently identified and represent functionally distinct messenger ribonucleoprotein particles (mRNPs) <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. However, the latter analysis provided information at the gene level and did not identify direct binding sites of SR proteins to RNA targets.</p>
<p>To understand the widespread functions of SR protein family members, the identification of endogenous RNA target sites is required. The development of ultraviolet (UV) crosslinking and immunoprecipitation (CLIP) followed by high-throughput sequencing has made possible the identification of <it>in vivo </it>binding sites of RNA-binding proteins in a genome-wide manner <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Here we used a modification of the CLIP protocol called iCLIP <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, which allows high-resolution identification of RNA-protein crosslink sites, to investigate the binding specificity and endogenous RNA targets of SRSF3 and SRSF4. We took advantage of our previously developed tagging and stable expression system, in which an enhanced green fluorescent protein (EGFP) tag is inserted at the carboxyl terminus of the SR protein by recombineering of bacterial artificial chromosomes (BACs); due to co-regulation of the endogenous and stably integrated transgenes, the total level of SR protein expression is unchanged in the diploid mouse P19 cells used here <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Using the EGFP tag as a universal epitope for iCLIP, we determined <it>in vivo </it>binding sites of SRSF3 and SRSF4. Our analysis shows that SRSF3 and SRSF4 bind to distinct sequences and target RNAs, including non-coding RNAs (ncRNAs). The subsequent analysis showed that SRSF3 or SRSF4 binding to these sites conferred regulatory functions in several steps of RNA metabolism in cells, supporting the widespread contribution of SR proteins in gene expression regulation.</p>
</sec>
<sec><st><p>Results</p></st>
<sec><st><p>SRSF3 and SRSF4 bind distinct RNAs</p></st>
<p>We used the iCLIP method <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> to identify SRSF3 and SRSF4 binding sites genome-wide in mouse P19 cells. SRSF3 and SRSF4 were immunopurified via the EGFP tag encoded on stable transgenes to allow direct comparison of the binding profiles of the two SR proteins <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Previous analyses showed that the EGFP-tagged SR proteins recapitulate interactions with nascent RNA and functionally rescue the endogenous proteins <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B13">13</abbr></abbrgrp>. Both SRSF3-EGFP and SRSF4-EGFP were specifically and efficiently immunopurified from cell extracts, and SR protein-RNA complexes were isolated after <it>in vivo </it>UV crosslinking (Figure S1a, b in Additional file <supplr sid="S1">1</supplr>). No RNA-protein complexes were detected in cells expressing only nuclear EGFP (EGFP-nuclear localization signal) or in the absence of UV crosslinking (Figure S1b in Additional file <supplr sid="S1">1</supplr>). In each replicate experiment, SRSF4 showed weaker signal intensity than SRSF3 (Figure S1b in Additional file <supplr sid="S1">1</supplr>), indicating either lower crosslinking efficiency or fewer RNA targets.</p>
<suppl id="S1">
<title><p>Additional file 1</p></title>
<text><p><b>Supplementary Information</b>. Supplementary Materials and methods, References, Figures S1 to S7 and Tables S1 and S3 to S5.</p></text>
<file name="gb-2012-13-3-r17-S1.PDF">
   <p>Click here for file</p>
</file>
</suppl>
<p>Crosslinked, immunopurified RNA was digested to lengths of 40 to 100 nucleotides, reverse transcribed and prepared for next-generation sequencing <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> (Figure S1c in Additional file <supplr sid="S1">1</supplr>). The resulting reads, referred to as CLIP-tags throughout the manuscript, were aligned to the mouse mm9 genome assembly. In total, iCLIP produced 1,212,480 and 243,501 unique CLIP-tags for SRSF3 and SRSF4, respectively (Table S1 in Additional file <supplr sid="S1">1</supplr>). SRSF4 reproducibly yielded fewer sequence reads, in agreement with the lower crosslinking levels observed (Figure S1b in Additional file <supplr sid="S1">1</supplr>). The EGFP-nuclear localization signal control iCLIP experiments performed in parallel did not produce any detectable PCR products and yielded a total of 2,611 CLIP-tags mapping to the mouse genome. Because the SRSF3 and SRSF4 iCLIPs generated roughly 90- to 460-fold more CLIP-tags than the control iCLIP, at most about 1% of the detected CLIP-tags could be due to nonspecific crosslinking.</p>
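<p>The comparison with the control library is simple arithmetic on the tag counts quoted above. The following minimal sketch reproduces it; the function names are illustrative, and it assumes that the control background carries over unchanged into each specific library, which is a simplification rather than a statement of how the estimate was originally made.</p>

```python
# Unique CLIP-tag counts quoted in the text.
TAGS = {"SRSF3": 1_212_480, "SRSF4": 243_501}
CONTROL_TAGS = 2_611  # EGFP-NLS control iCLIP


def fold_over_control(tags, control=CONTROL_TAGS):
    """How many times more tags the specific iCLIP yielded than the control."""
    return tags / control


def background_fraction(tags, control=CONTROL_TAGS):
    """Upper bound on the nonspecific fraction of a library.

    Assumption: the control background is carried over unchanged
    into each specific library.
    """
    return control / tags


if __name__ == "__main__":
    for name, n in TAGS.items():
        print(f"{name}: {fold_over_control(n):.0f}-fold over control, "
              f"background at most {background_fraction(n):.2%}")
    pooled = sum(TAGS.values())
    print(f"pooled: background at most {background_fraction(pooled):.2%}")
```

<p>With these counts the ratios are about 464-fold for SRSF3 and about 93-fold for SRSF4; the pooled background bound is about 0.18%, whereas for SRSF4 alone it is close to 1%.</p>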
<p>As a first step towards analyzing the RNAs and RNA regions bound by SRSF3 and SRSF4, crosslink sites were identified by mapping to the first nucleotide upstream of the start of each CLIP-tag, as previously described <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. We determined statistically significant SRSF3 and SRSF4 crosslink sites (33,458 and 10,393, respectively), and identified CLIP-tag clusters with a maximum spacing of 15 nucleotides and containing a significant CLIP-tag count when compared to randomized positions (false discovery rate &lt; 0.05) <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. To test whether the iCLIP captured only the most highly expressed genes, we compared the density of CLIP-tags to our global gene expression data in P19 cells <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. There was a slight positive correlation between the gene expression level and the density of CLIP-tags within the gene, yet CLIP-tags were identified in genes across the whole range of gene expression levels (Figure S1d in Additional file <supplr sid="S1">1</supplr>).</p>
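<p>The two geometric steps described above, assigning a crosslink site to the nucleotide immediately upstream of each CLIP-tag and merging nearby sites into clusters, can be sketched as follows. This is an illustration only: the coordinate convention (0-based, half-open intervals), the reading of 'maximum spacing' as a gap of at most 15 nucleotides between neighbouring sites, and the function names are assumptions, and the significance test against randomized positions is not reproduced.</p>

```python
def crosslink_site(start, end, strand):
    """Return the position one nucleotide upstream of the tag start.

    The tag covers [start, end) in 0-based coordinates (assumed convention).
    On the minus strand the tag starts at end - 1, so upstream is `end`.
    """
    if strand == "+":
        return start - 1
    return end


def cluster_sites(positions, max_gap=15):
    """Merge crosslink sites on one strand of one chromosome into clusters.

    Neighbouring sites separated by at most `max_gap` nucleotides are joined.
    Returns a list of (first_site, last_site) tuples.
    """
    clusters = []
    for pos in sorted(set(positions)):
        if not clusters or pos - clusters[-1][1] > max_gap:
            clusters.append([pos, pos])  # open a new cluster
        else:
            clusters[-1][1] = pos  # extend the current cluster
    return [tuple(c) for c in clusters]


if __name__ == "__main__":
    # Hypothetical tags on the plus strand, for illustration only.
    tags = [(101, 140, "+"), (111, 150, "+"), (126, 170, "+"), (201, 240, "+")]
    sites = [crosslink_site(*t) for t in tags]
    print(cluster_sites(sites))
```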
<p>Examination of SRSF3 and SRSF4 CLIP-tag clusters indicated that multiple reads were detected in limited RNA regions. The same transcript could display crosslinking to both SR proteins, albeit in different regions of the transcript, as exemplified by the <it>NPM1 </it>gene that contained CLIP-tag clusters for both SRSF3 and SRSF4 mapping to distinct exons (Figure <figr fid="F1">1a</figr>). Also at the chromosome level, a large proportion of the CLIP-tags and clusters were non-overlapping (Figure <figr fid="F1">1a</figr>; Figure S2 in Additional file <supplr sid="S1">1</supplr>). Significant crosslink sites were detected in 2,304 genes for SRSF3 and 1,055 genes for SRSF4, of which 83.3% and 83.2% were protein-coding, respectively. A list of genes with significant crosslink sites is provided in Additional file <supplr sid="S2">2</supplr>. These numbers are likely to be underestimates because our sequencing has not reached saturation. In agreement with our recent analysis showing that SRSF3 and SRSF4 associate with distinct mRNAs <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, the identity of the target RNAs bound by SRSF3 and SRSF4 only partially overlapped (Figure <figr fid="F1">1b</figr>). An even smaller overlap between SRSF3 and SRSF4 CLIP-tag clusters, rather than genes, was observed (compare Figure <figr fid="F1">1b</figr> and <figr fid="F1">1c</figr>), strongly suggesting differential RNA-binding specificities of SRSF3 and SRSF4.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>SRSF3 and SRSF4 CLIP-tags cluster to distinct positions in mouse RNAs</p></caption><text>
   <p><b>SRSF3 and SRSF4 CLIP-tags cluster to distinct positions in mouse RNAs</b>. <b>(a) </b><it>NPM1 </it>gene (green box) and the surrounding approximately 3 Mb region in chromosome 11 (black box) with SRSF3 and SRSF4 CLIP-tags and clusters. The numbers on the left represent the number of CLIP-tags within the window. The sense strand is marked in blue and the antisense strand in orange. Note that the genes in the antisense strand run from right to left. <b>(b) </b>Comparison of annotated genes with significant SRSF3 or SRSF4 crosslink sites (false discovery rate &lt; 0.05). <b>(c) </b>Comparison of significant SRSF3 and SRSF4 CLIP-tag clusters (overlap of clusters &#8805; 15 nucleotides).</p>
</text><graphic file="gb-2012-13-3-r17-1"/></fig>
<suppl id="S2">
<title><p>Additional file 2</p></title>
<text><p><b>Supplementary Table S2</b>. Genes with significant SRSF3 or SRSF4 crosslink sites.</p></text>
<file name="gb-2012-13-3-r17-S2.XLSX">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec><st><p>Consensus binding motif of SRSF3 and SRSF4</p></st>
<p>The <it>in vivo </it>binding specificities of SRSF3 and SRSF4 are unknown. The differences in the CLIP-tag cluster sites suggested that each of the two SR proteins binds to a distinct RNA sequence. To address this directly, we used the data to derive <it>in vivo </it>binding motifs for SRSF3 and SRSF4 by analyzing enriched pentamer sequences around the crosslink sites. To calculate a Z-score for each pentamer, iCLIP positions were randomized within the same regions. The pentamer enrichment analysis showed that SRSF3 and SRSF4 recognize distinct sequence motifs (Figure <figr fid="F2">2</figr>). The top five pentamers for SRSF3 (Figure <figr fid="F2">2a</figr>) were in close agreement with the core SELEX (systematic evolution of ligands by exponential enrichment) motif determined <it>in vitro </it><abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. SELEX has not been performed on SRSF4; interestingly, the SRSF4 top five pentamers (Figure <figr fid="F2">2b</figr>) were similar to one sequence (GAAGGA) previously shown to be an SRSF4 binding site in bovine papilloma virus pre-mRNA <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The SRSF3 binding motif was CU-rich excluding Gs, whereas SRSF4 bound to GA-rich sequences excluding Cs (Figure <figr fid="F2">2d</figr>). These results are consistent with the largely non-overlapping SRSF3 and SRSF4 crosslink sites and clusters (Figure <figr fid="F2">2c</figr>, and see above).</p>
<fig id="F2"><title><p>Figure 2</p></title><caption><p><it>In vivo </it>binding specificity of SRSF3 and SRSF4</p></caption><text>
   <p><b><it>In vivo </it>binding specificity of SRSF3 and SRSF4</b>. <b>(a, b) </b>The frequency distribution of SRSF3 (a) and SRSF4 (b) pentamer Z-scores. The Z-score was calculated relative to randomized genomic positions by shuffling the crosslink positions 100 times within the genes. The five pentamers with the highest Z-scores are shown. <b>(c) </b>Correlation of SRSF3 and SRSF4 pentamer Z-scores. The top five pentamers presented in (a, b) are marked as larger light grey dots. <b>(d) </b>Consensus motifs were derived from the top pentamers shown in (a, b).</p>
</text><graphic file="gb-2012-13-3-r17-2"/></fig>
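<p>The pentamer Z-score described above compares the observed pentamer count around crosslink sites with the counts obtained after randomizing the positions. A minimal single-gene sketch is given below. The flank width of 10 nucleotides, the uniform placement of shuffled sites, the use of the population standard deviation, and all function names are assumptions made for illustration; only the pentamer length and the 100 shuffles come from the text.</p>

```python
import random
from collections import Counter
from statistics import mean, pstdev


def pentamer_counts(seq, sites, flank=10, k=5):
    """Count k-mers lying entirely within +/- flank nt of each site."""
    counts = Counter()
    for site in sites:
        lo = max(0, site - flank)
        hi = min(len(seq), site + flank + 1)
        window = seq[lo:hi]
        for i in range(len(window) - k + 1):
            counts[window[i:i + k]] += 1
    return counts


def pentamer_zscores(seq, sites, n_shuffles=100, flank=10, seed=0):
    """Z-score of each observed pentamer relative to shuffled site positions.

    Shuffled sites are drawn uniformly within the same sequence (assumption).
    Pentamers whose shuffled counts show no variance are skipped.
    """
    rng = random.Random(seed)
    observed = pentamer_counts(seq, sites, flank)
    shuffled = [
        pentamer_counts(seq, [rng.randrange(len(seq)) for _ in sites], flank)
        for _ in range(n_shuffles)
    ]
    zscores = {}
    for kmer, obs in observed.items():
        background = [counts[kmer] for counts in shuffled]
        sd = pstdev(background)
        if sd > 0:
            zscores[kmer] = (obs - mean(background)) / sd
    return zscores


if __name__ == "__main__":
    # Toy sequence: an AG background with a CU-rich element at five positions.
    toy = list("AG" * 500)
    starts = (100, 300, 500, 700, 900)
    for p in starts:
        toy[p:p + 6] = "CUCUUC"
    z = pentamer_zscores("".join(toy), [p + 2 for p in starts])
    for kmer, score in sorted(z.items(), key=lambda kv: -kv[1])[:5]:
        print(kmer, round(score, 1))
```

<p>In a genome-wide setting the same calculation would be pooled over all genes with crosslink sites, with the shuffling restricted to each gene.</p>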
</sec>
<sec><st><p>SRSF3 and SRSF4 bind to coding and non-coding RNAs</p></st>
<p>Which categories of RNA and which functional RNA regions are bound by SR proteins? Analysis of the frequency with which SRSF3 and SRSF4 CLIP-tags were mapped to genes and gene regions revealed their common propensity to bind exons and introns in protein-coding genes (Figure <figr fid="F3">3a</figr>; Table S3 in Additional file <supplr sid="S1">1</supplr>). The high proportion of intronic CLIP-tags detected clearly reflects the fact that mammalian introns are much longer than exons; when the frequency of CLIP-tags was normalized to the length of the RNA region (Figure <figr fid="F3">3b</figr>), both SRSF3 and SRSF4 CLIP-tags were more highly enriched in exons than in introns. SR protein interactions with exons could reflect activities either in pre-mRNA splicing or in mRNPs after splicing (see below).</p>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>Distribution of SRSF3 and SRSF4 CLIP-tags within RNA classes and transcript regions</p></caption><text>
   <p><b>Distribution of SRSF3 and SRSF4 CLIP-tags within RNA classes and transcript regions</b>. <b>(a) </b>The proportion of CLIP-tags that mapped to different RNAs relative to the total number of CLIP-tags. <b>(b) </b>The fold enrichment of CLIP-tag density (the number of CLIP-tags divided by the length of each RNA feature) in different RNAs relative to the average CLIP-tag density in the genome.</p>
</text><graphic file="gb-2012-13-3-r17-3"/></fig>
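<p>The length normalization used for the enrichment values is a ratio of two densities: the CLIP-tag density of a feature class divided by the average density over the genome. The sketch below shows the calculation; the function names are illustrative and the numbers in the example are invented solely to show why a class with many tags, such as introns, can still be depleted once length is taken into account.</p>

```python
def tag_density(n_tags, length_nt):
    """CLIP-tags per nucleotide of a feature class."""
    return n_tags / length_nt


def fold_enrichment(n_tags, length_nt, total_tags, genome_length_nt):
    """Feature density relative to the genome-wide average density."""
    return tag_density(n_tags, length_nt) / tag_density(total_tags, genome_length_nt)


if __name__ == "__main__":
    # Invented numbers, for illustration only (not data from this study).
    total_tags, genome_nt = 1_000_000, 2_500_000_000
    features = {
        "exon": (300_000, 60_000_000),
        "intron": (500_000, 900_000_000),
    }
    for name, (tags, length) in features.items():
        share = tags / total_tags
        enrichment = fold_enrichment(tags, length, total_tags, genome_nt)
        print(f"{name}: {share:.0%} of tags, {enrichment:.1f}-fold enrichment")
```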
<p>The highest density of CLIP-tags was detected in ncRNAs (Figure <figr fid="F3">3b</figr>). Overall, 319 and 141 ncRNAs had SRSF3 and SRSF4 CLIP-tag clusters, respectively. The most abundant ncRNA classes with CLIP-tags were long intergenic ncRNAs (lincRNAs) and small nucleolar RNAs (snoRNAs) (Figure <figr fid="F4">4a</figr>). Similar to SRSF1 and TDP-43 <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B21">21</abbr></abbrgrp>, SRSF3 and SRSF4 crosslinked to the lincRNA <it>MALAT1 </it>(also known as <it>NEAT2</it>; Figure S3a in Additional file <supplr sid="S1">1</supplr>) that is enriched in nuclear speckles <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. In addition, another speckle-localized ncRNA, <it>7SK </it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, had abundant SRSF3 and SRSF4 CLIP-tag clusters (data not shown). An especially large proportion of ncRNAs with SRSF3 and SRSF4 crosslink sites belonged to snoRNAs, a class of small RNAs that guide RNA modifying enzymes <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Intriguingly, small Cajal body-specific RNAs (scaRNAs), a subclass of snoRNAs, were enriched in SRSF4 CLIP-tag clusters. SR protein binding could not be correlated with known elements within scaRNAs because the scaRNAs identified included those with H/ACA boxes alone, C/D boxes alone, and a combination of H/ACA and C/D boxes. The specificity of SR protein binding to this group of scaRNAs was investigated in two ways. First, we asked whether binding was biased to any particular region of the scaRNAs. Binding sites were localized near scaRNA 3' ends (Figure <figr fid="F4">4b</figr>; Figure S3a in Additional file <supplr sid="S1">1</supplr>). Second, the CLIP-tag clusters within the scaRNAs were used to determine a consensus binding motif independent of the global pentamer analysis. 
Multiple alignment of the CLIP-tag cluster regions using the MEME (Multiple EM for Motif Elicitation) algorithm identified a consensus sequence element (Figure S3c in Additional file <supplr sid="S1">1</supplr>) that was found in all scaRNAs with SRSF4 CLIP-tag clusters. The motif was GA-rich, similar to the pentamer motif determined for all crosslink sites with the exception that Cs were occasionally observed. This independent derivation of a binding sequence similar to the globally derived consensus indicates that SRSF4 binding to scaRNAs is specific.</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>ncRNAs with SRSF3 and SRSF4 crosslink sites</p></caption><text>
   <p><b>ncRNAs with SRSF3 and SRSF4 crosslink sites</b>. <b>(a) </b>The distribution of crosslink sites within the ncRNA subclasses. <b>(b) </b>The position of the SRSF4 CLIP-tag clusters relative to the scaRNA 3' end. 'Other ncRNAs' are processed transcripts with no known ORF or function.</p>
</text><graphic file="gb-2012-13-3-r17-4"/></fig>
</sec>
<sec><st><p>SRSF3 and SRSF4 bind to intronless histone mRNAs</p></st>
<p>SRSF3 and SRSF4 binding sites were found in intronless protein-coding genes, likely reflecting SRSF3 and SRSF4 participation in regulatory events other than splicing. In particular, SRSF3 and SRSF4 CLIP-tag clusters were detected within histone genes: 73.8% of the mouse histone genes annotated in <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> had SRSF3 clusters and 47.7% had SRSF4 clusters (Figure <figr fid="F5">5a</figr>; Figure S4a in Additional file <supplr sid="S1">1</supplr>). This was also reflected in the enriched Gene Ontology (GO) terms where categories related to chromatin and nucleosome assembly were present (Table S4 in Additional file <supplr sid="S1">1</supplr>). The SRSF3 and SRSF4 CLIP-tag clusters were located at the boundary between ORF and 3' UTR and/or within the 3' UTR of histone mRNAs (Figure <figr fid="F5">5b</figr>). The CLIP-tag clusters were located just upstream of conserved stem-loops that occur 14 to 50 nucleotides downstream of the ORF (Figure <figr fid="F5">5a</figr>); these stem loops specify the sites of endonucleolytic cleavage of replication-dependent histone mRNAs and therefore define their 3' ends <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
<fig id="F5"><title><p>Figure 5</p></title><caption><p>SRSF3 and SRSF4 bind to numerous intronless histone mRNAs at a consistent position</p></caption><text>
   <p><b>SRSF3 and SRSF4 bind to numerous intronless histone mRNAs at a consistent position</b>. <b>(a) </b>SRSF3 and SRSF4 CLIP-tags and clusters in <it>HIST2H2BB </it>and <it>HIST1H2AB </it>genes. Labels as in Figure 1a. The orange arrowheads mark the mRNA 3' end cleavage site. <b>(b) </b>Mapping of SRSF3 (left panel) and SRSF4 (right panel) crosslink sites to the ORF-3' UTR boundary of histone mRNAs. The position 0 marked with a dotted line represents the ORF-3' UTR boundary. <b>(c) </b>Cytoplasmic levels of histone mRNAs associated with SRSF3 or SRSF4 determined by UV-RNA immunoprecipitation and reverse transcription quantitative PCR. To prime the reverse transcription reactions, hexamers were used to detect total and oligo-dT to detect polyadenylated histone mRNAs. Data are presented relative to the input sample. Mock is the non-immune control. *<it>P </it>&lt; 0.05, **<it>P </it>&lt; 0.01, ***<it>P </it>&lt; 0.001 (Student's unpaired <it>t</it>-test, <it>n </it>= 3-6). Error bars are standard deviation. IP, immunoprecipitation.</p>
</text><graphic file="gb-2012-13-3-r17-5"/></fig>
<p>SRSF3 was previously shown to promote the export of histone <it>H2A </it>reporter mRNAs via a 22-nucleotide transport element within the coding region of <it>H2A </it>mRNAs, to which SRSF3 bound and recruited the mRNA export factor TAP <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. In our study, however, most SRSF3 and SRSF4 CLIP-tag clusters in histone <it>H2A </it>family mRNAs were found outside this 22-nucleotide transport element (Figure <figr fid="F5">5a, b</figr>; Figure S4a in Additional file <supplr sid="S1">1</supplr>). Furthermore, most SRSF3 and SRSF4 crosslink sites were present in mRNAs of histone families other than <it>H2A</it>, which do not contain the transport element (Additional file <supplr sid="S2">2</supplr>). Interestingly, SRSF3 and SRSF4 binding sites identified here are similar to those reported in another study that characterized export factor-binding sites in histone mRNAs <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
<p>SR proteins also promote polyadenylation in some contexts <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. We found this intriguing in the context of the histone mRNA targets because several recent studies have shown that a significant pool of histone mRNAs undergo polyadenylation instead of 3' end cleavage <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. To validate the association of SRSF3 and SRSF4 with histone mRNAs and to investigate polyadenylation, we adopted an RNA immunoprecipitation (RIP) assay from UV crosslinked cell extracts (UV-RIP); the immunoprecipitation was carried out from a cytoplasmic fraction in order to avoid contamination by genomic DNA that would later influence results obtained by reverse transcription quantitative PCR (RT-qPCR) (Figure S4b in Additional file <supplr sid="S1">1</supplr>). Both total and polyadenylated histone mRNA levels were measured in the SRSF3 and SRSF4 immunoprecipitates, using either random hexamers or oligo-dT as reverse primers. Figure <figr fid="F5">5c</figr> shows that both SR proteins immunoprecipitated histone mRNAs significantly above mock immunoprecipitates, irrespective of which reverse primer was used. Compared to input, detection of histone mRNAs was more robust when oligo-dT reverse primers were used, suggesting that SRSF3 and SRSF4 preferentially bind polyadenylated histone mRNAs. The detection of SRSF3 and SRSF4 bound to polyadenylated histone mRNAs in the cytoplasmic fraction suggests that both SR proteins may be involved in histone mRNA 3' end formation, export, and/or translation.</p>
</sec>
<sec><st><p>SRSF3 and SRSF4 make diverse contacts with exons and introns</p></st>
<p>Because SR proteins are known to regulate pre-mRNA splicing, we wondered whether the crosslink sites were correlated with particular locations within introns and/or exons. Data from <it>in vitro </it>studies suggest that SR proteins bind pre-mRNAs primarily within exons and thereby recruit spliceosomal components to adjacent 5' and 3' splice sites <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Therefore, crosslink sites were mapped to exon-intron and intron-exon boundaries. Variability in exon and intron length genome-wide leads to an apparent abundance of CLIP-tags close to the junctions (Figure S5a in Additional file <supplr sid="S1">1</supplr>). Therefore, we established a normalization factor derived from the length distribution of exons and introns to correct for these differences (Figure S5b in Additional file <supplr sid="S1">1</supplr>). Mapping of normalized crosslink sites showed exonic enrichment of SRSF3 and SRSF4 crosslink sites, which were most pronounced within 100 nucleotides of both 5' and 3' splice sites (Figure <figr fid="F6">6a</figr>). Peaks of SRSF3 and SRSF4 binding approximately 70 nucleotides upstream of 5' splice sites were more prominent than peaks observed downstream of 3' splice sites. Note that we did not map sequences falling onto exon-exon junctions, which explains the drop in crosslinking immediately upstream of 5' splice sites. Because SR proteins bind mRNA as well as pre-mRNA, it seems logical that exon sequences are overrepresented in the experimental data compared to intron sequences. However, similar patterns of enrichment in exons were observed when the pentamer motifs alone were considered (Figure <figr fid="F2">2</figr>; Figure S5c in Additional file <supplr sid="S1">1</supplr>), suggesting that the observed exon bias reflects the distribution of binding sequences within target RNAs. 
Interestingly, we noticed a peak of crosslink sites approximately 30 nucleotides upstream of 3' splice sites (Figure <figr fid="F6">6a</figr>). This corresponds to the approximate position of branch points in mammalian introns. However, the actual position of the branch point varies relative to the 3' splice site, with the longest observed distance of 400 nucleotides <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Therefore, crosslink sites were mapped to predicted mouse branch points <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. This mapping indicated that SRSF3 and SRSF4 bind at or slightly downstream of the branch point nucleotide (Figure <figr fid="F6">6b</figr>). In conclusion, SRSF3 and SRSF4 preferentially contact exonic sequences, especially upstream of 5' splice sites; they also interact with branch points as suggested by two previous studies <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B40">40</abbr></abbrgrp>, consistent with the model that SR proteins regulate splicing by contacting pre-mRNA in different functional regions.</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>SRSF3 and SRSF4 contact exons and introns</p></caption><text>
   <p><b>SRSF3 and SRSF4 contact exons and introns</b>. <b>(a) </b>SRSF3 and SRSF4 crosslink sites mapped around the 5' and 3' splice sites. The position 0 (dotted line) represents the indicated 5' or 3' splice site; the y-axis represents normalized crosslink sites per 10<sup>3 </sup>nucleotides. The normalization is based on the length distribution of exons and introns (Figure S5 in Additional file 1). The data were smoothed using a Gaussian window (half-width of the window = 5). <b>(b) </b>SRSF3 and SRSF4 crosslink sites mapped to predicted mouse branch points. The position 0 (dotted line) represents the branch point nucleotide. Smoothing as in (a).</p>
</text><graphic file="gb-2012-13-3-r17-6"/></fig>
</sec>
<sec><st><p>SRSF3: a regulator of splicing factors</p></st>
<p>The notion that different splicing factors might regulate transcripts with similar functions, creating an expression module regulated by splicing, has intrigued the field for decades. We therefore asked about the functional identity of SRSF3 and SRSF4 protein-coding targets. Similar to our previous findings by RIP-chip <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, GO analysis of the protein-coding genes with significant SRSF3 and SRSF4 crosslink sites revealed functions related to nucleic acid binding and RNA processing as the most enriched GO terms for both SRSF3 and SRSF4 (Table S4 in Additional file <supplr sid="S1">1</supplr>). SRSF3 binding sites were especially enriched within genes encoding components of RNP complexes, including splicing factors (Table S5 in Additional file <supplr sid="S1">1</supplr>). SRSF3 crosslink sites were found within the genes encoding other SR proteins, as well as in genes encoding proteins of heterogeneous nuclear ribonucleoprotein complexes and components of the core splicing machinery. SRSF3 is known to strictly regulate its own expression through inclusion of a premature termination codon (PTC)-containing cassette exon, which is referred to as a 'poison cassette exon' because it leads to transcript degradation by nonsense-mediated decay (NMD) <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B41">41</abbr></abbrgrp>. Poison cassette exons occur in all SR protein family members and are ultraconserved among species <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. The inclusion of the alternative cassette exon or intron retention leads to the introduction of a PTC in the SR protein mRNA in every case. Indeed, SRSF3 and SRSF4 CLIP-tag clusters were detected in the <it>SRSF3 </it>and <it>SRSF4 </it>autoregulatory cassette exons, respectively (Figure <figr fid="F7">7a</figr>, top panel; Figure S6, bottom panel, in Additional file <supplr sid="S1">1</supplr>).</p>
<fig id="F7"><title><p>Figure 7</p></title><caption><p>SRSF3 binds to poison cassette exons in SR proteins</p></caption><text>
   <p><b>SRSF3 binds to poison cassette exons in SR proteins</b>. <b>(a) </b>SRSF3 and SRSF4 CLIP-tags and clusters around the alternative cassette exon of <it>SRSF3 </it>and <it>SRSF7 </it>genes. Labels as in Figure 1a. The zoom in represents the ultraconserved regions identified in <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. Note that the genes in the antisense strand run from right to left. <b>(b) </b>The enrichment of mRNAs encoding different SR protein family members after UV crosslinking and SRSF3 or SRSF4 immunoprecipitation (IP). To prime the RT reactions, hexamers were used. Data are presented relative to the input sample. IP is the specific immunoprecipitation and mock is the non-immune control. *<it>P </it>&lt; 0.05, **<it>P </it>&lt; 0.01, ***<it>P </it>&lt; 0.001 (Student's unpaired <it>t</it>-test, <it>n </it>= 3-6). Error bars are standard deviation.</p>
</text><graphic file="gb-2012-13-3-r17-7"/></fig>
<p>To date, it has been assumed that poison cassette exons are recognized by the gene's own protein product, in an auto-regulatory feedback loop (see above). Intriguingly, the SRSF3 CLIP-tag clusters were also found in the NMD-associated exons or introns of three heterologous SR protein-encoding genes, <it>SRSF2, SRSF5 </it>and <it>SRSF7 </it>(Figure <figr fid="F7">7a</figr>; Figure S6 in Additional file <supplr sid="S1">1</supplr>). In contrast, SRSF4 CLIP-tag clusters were found only in the poison cassette exon of its own pre-mRNA. We sought to validate the specificity of these interactions by UV-RIP. SRSF3 specifically immunoprecipitated <it>SRSF2, SRSF3, SRSF5 </it>and <it>SRSF7 </it>(pre-)mRNAs, whereas SRSF4 only immunoprecipitated significant levels of its own (pre-)mRNA (Figure <figr fid="F7">7b</figr>). These data validate the specificity of SRSF3 interactions with heterologous transcripts encoding SR protein family members in the manner indicated by iCLIP; note that low recovery of some transcripts may be due to the short half-lives of the bound, PTC-containing messages.</p>
<p>The presence of SRSF3 CLIP-tag clusters in heterologous SR protein-encoding transcripts could indicate that SRSF3 either positively or negatively regulates poison-cassette exon usage. If so, we would predict that SRSF3 levels in cells should affect the alternative splicing and ultimately expression levels of the three target SR protein transcripts identified. To test this directly, minigenes including the genomic regions around SRSF3 CLIP-tag clusters were constructed for <it>SRSF2, SRSF3, SRSF5 </it>and <it>SRSF7 </it>(Figure <figr fid="F8">8a</figr>; Figure S7c in Additional file <supplr sid="S1">1</supplr>). Efficient SRSF3 or SRSF4 protein over-expression and knockdown were achieved by transfection of cDNA expression constructs and RNA interference, respectively (Figure S7a in Additional file <supplr sid="S1">1</supplr>). Under these conditions, the splicing patterns of the minigene-encoded transcripts were analyzed, using vector-specific primers for RT-PCR. Figure <figr fid="F8">8a</figr> shows that over-expression of SRSF3 led to a marked increase in poison cassette exon inclusion for both the <it>SRSF3 </it>and <it>SRSF7 </it>minigenes. Upon SRSF3 knockdown, this pattern was reversed (Figure S7b in Additional file <supplr sid="S1">1</supplr>). Similarly, SRSF3 over-expression led to alternative splicing changes for the <it>SRSF2 </it>and <it>SRSF5 </it>minigenes, leading to increased poison cassette usage and/or intron retention (Figure S7c in Additional file <supplr sid="S1">1</supplr>). Importantly, SRSF4 over-expression or knockdown did not detectably alter splicing patterns (Figure <figr fid="F8">8a</figr>; Figure S7b, c in Additional file <supplr sid="S1">1</supplr>).</p>
<fig id="F8"><title><p>Figure 8</p></title><caption><p>SRSF3 controls the level of SR proteins through splicing regulation</p></caption><text>
   <p><b>SRSF3 controls the level of SR proteins through splicing regulation</b>. <b>(a) </b>The splicing products of <it>SRSF3 </it>and <it>SRSF7 </it>minigenes determined after 24-hour over-expression of SRSF3, SRSF4 or EGFP (control). The alternative exons are marked with light grey. <b>(b) </b>The splicing products of endogenous <it>SRSF3 </it>and <it>SRSF7 </it>after inhibition of NMD by a 3-hour treatment with cycloheximide (CHX). <b>(c) </b>The expression level of endogenous, mature <it>SRSF2, SRSF5 </it>and <it>SRSF7 </it>mRNAs upon EGFP, SRSF3 or SRSF4 overexpression (24 hours) as measured by RT-qPCR. *<it>P </it>&lt; 0.05 (one-way ANOVA). Error bars are standard deviation. <it>ACTB </it>was used as the reference gene. <b>(d) </b>Schematic showing how SRSF3 controls the levels of other SR protein family members through alternative splicing. The inclusion of a poison cassette exon harboring a premature termination codon (PTC, red stop sign) leads to RNA degradation through NMD.</p>
</text><graphic file="gb-2012-13-3-r17-8"/></fig>
<p>The alternative splicing events regulated by SRSF3 documented above predict that the transcripts regulated by SRSF3 - namely <it>SRSF3 </it>itself as well as <it>SRSF2, SRSF5 </it>and <it>SRSF7 </it>- will undergo degradation through NMD when SRSF3 is over-expressed. To test this, the NMD pathway was inhibited by treating the cells with cycloheximide (CHX) <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. The use of CHX as a tool also enabled us to investigate the alternative splicing outcome of endogenous transcripts. Figure <figr fid="F8">8b</figr> shows that CHX treatment leads to detection of the otherwise highly unstable endogenous poison cassette exon-containing SR protein transcripts that increase in abundance upon SRSF3 over-expression. Another prediction of these findings is that the steady-state levels of heterologous SR protein transcripts will depend on SRSF3 levels. Through measurement of target mRNA levels by RT-qPCR, we show that <it>SRSF5 </it>and <it>SRSF7 </it>mRNA levels decrease significantly in cells over-expressing SRSF3 (Figure <figr fid="F8">8c</figr>). Upon CHX treatment, mRNA levels recovered to those of the control (Figure S7d in Additional file <supplr sid="S1">1</supplr>). Taken together, the data indicate that SRSF3 specifically binds not only its own but also other SR protein transcripts and that the binding leads to alternative splicing changes that increase the occurrence of PTCs, which in turn target the expressed transcripts for degradation through the NMD pathway. Thus, SRSF3 regulates the expression of its own mRNA and the mRNAs encoding three other SR protein family members (Figure <figr fid="F8">8d</figr>). This cross-regulation by SRSF3 and the observation that many other RNA binding proteins may similarly be regulated by SRSF3 (Table S5 in Additional file <supplr sid="S1">1</supplr>) raise the possibility that SRSF3 is a master regulator of the transcriptome acting through a network of feedback mechanisms.</p>
</sec>
</sec>
<sec><st><p>Discussion</p></st>
<p>Here we used iCLIP to investigate the RNA-binding landscape of two SR proteins, SRSF3 and SRSF4, in mouse cells. The value of this study is enhanced by the global comparison of RNA targets and binding sites for two members of this prominent family of RNA binding proteins with a variety of known roles in gene expression. Through detailed analysis of the transcripts and transcript regions bound by SRSF3 and SRSF4, we provide evidence for previously unknown functions of these highly conserved RNA binding proteins. Here we discuss our findings in the context of five major conclusions.</p>
<p>First, SRSF3 and SRSF4 exhibit largely non-overlapping binding sites and RNA targets, indicating that SRSF3 and SRSF4 regulate specific sets of genes through their interaction with different RNA sequences. This result is consistent with the previous finding that SRSF3 and SRSF4 are present in non-overlapping mRNPs <it>in vivo </it><abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, although the previous study did not examine direct binding sites. The genome-wide CLIP data provided large numbers of binding sites, enabling us to derive consensus <it>in vivo </it>binding sequences. We show that SRSF4 binds a consensus GA-rich sequence. The CU-rich SRSF3 <it>in vivo </it>consensus binding sequence is similar to that derived <it>in vitro </it>by SELEX <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>, supporting the validity of SELEX for determining binding specificity alone. However, bioinformatic analysis of SELEX sequences does not permit the identification of <it>in vivo </it>RNA targets because the shortness and degeneracy of consensus sequences lead to over-representation within the transcriptome <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. The identification of <it>in vivo </it>targets, accomplished here by CLIP, allowed us to further investigate the identified RNA classes and RNA regions bound.</p>
<p>Second, ncRNAs were an important class of RNAs bound by both SRSF3 and SRSF4. SRSF3 and SRSF4 crosslinked to multiple sites along the lincRNA <it>MALAT1</it>, which is enriched in nuclear speckles and interacts with a subset of SR proteins, including SRSF3 <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Other splicing factors localized in nuclear speckles, such as TDP-43 and SRSF1, also bind to <it>MALAT1 </it><abbrgrp><abbr bid="B12">12</abbr><abbr bid="B21">21</abbr></abbrgrp>; it appears, therefore, that <it>MALAT1 </it>interaction is common among RNA-binding proteins in nuclear speckles. Furthermore, SRSF3 and SRSF4 interacted with <it>7SK</it>, another ncRNA localized to speckles <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, but not with the paraspeckle ncRNA component <it>NEAT1 </it><abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. One especially overrepresented group of short ncRNAs with SRSF3- and SRSF4-binding sites was snoRNAs. We show that a subset of snoRNAs, the scaRNAs, are prominent targets of SRSF4 with binding sites near their 3' ends. SR proteins are likely required for the splicing of all introns, including those containing snoRNAs. The position of the snoRNAs, including scaRNAs, within the host intron is critical for snoRNA processing, implying that the spliceosome is actively involved in the release of snoRNAs from the debranched intron <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. It remains to be investigated whether SR proteins are required for snoRNA processing from host introns. An interesting possibility is that SR proteins interact with the snoRNA-derived short RNAs and co-regulate alternative splicing <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. Because the functions of many ncRNAs are currently poorly understood, it will be interesting to determine whether ncRNAs have functions as co-regulators of splicing.</p>
<p>Third, we provide evidence that SRSF3 and SRSF4 bind many intronless genes, further supporting their role as regulators of gene expression independent of pre-mRNA splicing <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Surprisingly, we show that SRSF3 and SRSF4 display clusters of binding sites at the ORF-3' UTR junction of the vast majority of histone mRNAs. This binding region was previously shown to be important for the export of histone H2A mRNAs in <it>Xenopus </it>oocytes <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. UV-RIP experiments indicate that both SR proteins bind preferentially to polyadenylated histone mRNAs and that these mRNP complexes are detectable in the cytoplasm. Replication-dependent histone mRNAs are mainly processed by 3' end cleavage <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, and are exported from the nucleus via the stem loop binding protein (SLBP) <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. It was known that histone mRNAs become polyadenylated when the 3' end cleavage machinery is compromised <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. However, recent high-throughput sequence analyses of human and mouse mRNAs identified significant pools of polyadenylated mRNAs encoding all four core histones even when the 3'-end processing machinery is functional <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. Thus, expression of polyadenylated histone mRNAs appears to be physiologically important. If polyadenylated histone transcripts fail to bind SLBP, they may require SRSF3 and SRSF4 for export from the nucleus; both SRSF3 and SRSF4 shuttle to the cytoplasm and SRSF3 binds the mRNA export receptor TAP <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B28">28</abbr></abbrgrp>. It will be of interest to determine how SRSF3 and SRSF4 regulate histone mRNA 3' end formation, export, or both.</p>
<p>Fourth, intron-containing protein-coding transcripts were a major class of SRSF3 and SRSF4 targets. SR proteins are thought to bind primarily to exonic splicing enhancers, where they influence recognition of adjacent 5' and 3' splice sites <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. The pattern of binding to exon-intron junctions resembles that observed for SRSF1 <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>; abundant binding within the exon bodies dropped sharply towards exon-intron boundaries. These data agree with the current concept that SR proteins promote adjacent splice site recognition. However, our data show that neither SRSF3 nor SRSF4 binding is limited to exons; instead, a large number of binding sites are found in introns and 3' UTRs. Crosslinking to exonic regions should be overrepresented in CLIP, owing to the low abundance of pre-mRNA in living cells. It is remarkable, therefore, that 65% of SRSF3 and 52% of SRSF4 binding sites were located in introns. SRSF3 was shown to regulate splicing by binding to an intronic splicing enhancer <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, providing a precedent for SRSF3 splicing regulatory function via intronic regulatory elements. Intriguingly, SRSF3 and SRSF4 binding was also detected at and around branch-point sequences within introns. <it>In vitro </it>studies suggested that SRSF1 is first recruited to an exonic splicing enhancer and the RS domain subsequently contacts the branch point <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B40">40</abbr></abbrgrp>. The present study does not distinguish whether the RRM or the RS domain of SRSF3 and/or SRSF4 binds to the branch point. Although some correlation between binding to the branch point and to the downstream exon was observed, a strict requirement for binding to both a downstream exonic splicing enhancer and the branch point was not detected. Therefore, SR proteins may have exon-independent functions as proposed previously <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>.</p>
<p>Finally, we have discovered that SRSF3 cross-regulates the expression of other SR protein family members. Autoregulation, in which SRSF1 and SRSF3 modulate expression of their own messages via inclusion of a PTC-containing exon, was previously known <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B52">52</abbr></abbrgrp>. This activity has been attributed to ultraconserved regions within alternative cassette exons and retained introns that introduce a PTC in mRNAs encoding SR proteins <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. Here we identify SRSF3 binding sites in the mRNAs encoding three additional members of the SR protein family and show that short-term overexpression of SRSF3 led to robust effects on the splicing and expression of four SR protein family members (<it>SRSF2, SRSF3, SRSF5</it>, and <it>SRSF7</it>). This reveals an unexpected role of SRSF3 in cross-regulating expression of other SR protein family members (Figure <figr fid="F8">8d</figr>). SRSF3 binding sites were also abundant in mRNAs encoding other splicing factors and spliceosomal components. These data, together with the observation that SRSF3 binds numerous transcripts encoding RNA binding proteins, underscore the notion that the splicing machinery is under strict cellular control and indicate that SRSF3 is a key regulator of RNA metabolism.</p>
</sec>
<sec><st><p>Conclusions</p></st>
<p>Our global analysis of SRSF3 and SRSF4 shows that these SR proteins are multifunctional RNA binding proteins interacting with distinct classes of RNA. Initially identified as splicing factors, SRSF3 and SRSF4 regulate constitutive and alternative exons by binding to both exonic and intronic positions. SRSF3 in particular seems to function as a master regulator of splicing machinery expression through its activities in alternative splicing. However, SRSF3 and SRSF4 also interact with RNAs that are not processed by the spliceosome, suggesting a network of interactions that control cellular programs of gene expression. In addition to the numerous functions already assigned to SR proteins, interactions with different RNA classes, including ncRNAs, imply that more is yet to come.</p>
</sec>
<sec><st><p>Materials and methods</p></st>
<sec><st><p>Cell culture and iCLIP</p></st>
<p>P19 cells were cultured as described <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. For iCLIP, P19 SRSF3-BAC or P19 SRSF4-BAC cells <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> were irradiated with 100 mJ/cm<sup>2 </sup>UV light. The iCLIP was performed as described <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Protein G Dynabeads coupled with goat anti-GFP antibody (a kind gift from D Drechsel, MPI-CBG, Dresden) were used for the immunopurification. The recovered RNA was reverse transcribed into cDNA. After size-purification and amplification the cDNA was subjected to high-throughput sequencing by Illumina Genome Analyser II (single-end 32-nucleotide reads). The adapter oligonucleotides, reverse transcription primers and primers for amplification were as described <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. A more detailed description is in the Supplementary Methods in Additional file <supplr sid="S1">1</supplr>.</p>
</sec>
<sec><st><p>Mapping of sequences to mouse genome and sequence analysis</p></st>
<p>The sequences corresponding to an experiment were identified by a defined barcode and random barcodes were registered. The barcodes were removed before mapping to the mouse mm9 sequence assembly using Bowtie version 0.12.5. Two mismatches were allowed in the mapping, and only CLIP-tags mapping to unique positions were considered. For analysis of significant crosslink sites, the iCLIP positions were randomized. The randomization was done within co-transcribed regions as described <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Ensembl59 annotation based on the mouse mm9 genome assembly was used. The statistical approach used to identify significant crosslink sites and CLIP-tag clusters was as described <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>.</p>
<p>Z-score analysis for enriched pentamers was performed essentially as described <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. Pentamers were used because they are the longest motifs that could be statistically derived from the data. Because we noticed that the inclusion of the actual crosslink nucleotide and the positions immediately surrounding it always resulted in a run of U nucleotides as the most enriched motif (data not shown), we excluded the crosslink site from the analysis and thus avoided bias towards any nucleotide due to differences in crosslinking efficiency. The positions of the crosslinking nucleotide were extended by 30 nucleotides in both directions. Only one occurrence of a pentamer within the evaluated interval [(-30, -10), (10, 30)] relative to each crosslink was counted and each occurrence of a crosslink site was weighted by 1.0. Reference data were generated 100 times by random shuffling of iCLIP crosslink positions within corresponding genome segments (within the same genes) and a Z-score was calculated relative to the randomized genomic positions. The top five pentamers were used to calculate the binding consensus motif <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>.</p>
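<p>The pentamer enrichment procedure described above can be illustrated with a minimal sketch. It is not the analysis code used in this study: the function names are hypothetical, a single sequence stands in for a gene, the interval bounds are assumed to refer to pentamer start positions, and shuffling draws uniform random positions within that one sequence rather than within annotated co-transcribed regions.</p>

```python
import random
from collections import Counter
from statistics import mean, pstdev

K = 5  # pentamer length
FLANKS = ((-30, -10), (10, 30))  # evaluated intervals relative to the crosslink


def pentamers_near(seq, pos):
    """Return the set of pentamers whose start lies in the flanking intervals.

    Assumption: the interval bounds refer to pentamer start positions.
    A set is returned so each pentamer is counted once per crosslink site.
    """
    found = set()
    for lo, hi in FLANKS:
        for start in range(pos + lo, pos + hi + 1):
            if 0 > start or start + K > len(seq):
                continue  # pentamer would run off the sequence
            found.add(seq[start:start + K])
    return found


def count_pentamers(seq, sites):
    """Count pentamers over all crosslink sites, each site weighted 1.0."""
    counts = Counter()
    for pos in sites:
        counts.update(pentamers_near(seq, pos))
    return counts


def zscores(seq, sites, n_shuffles=100, seed=0):
    """Z-score of observed pentamer counts against shuffled crosslink positions."""
    rng = random.Random(seed)
    observed = count_pentamers(seq, sites)
    shuffled = []
    for _ in range(n_shuffles):
        # Simplification: shuffle within one sequence standing in for a gene.
        random_sites = [rng.randrange(len(seq)) for _ in sites]
        shuffled.append(count_pentamers(seq, random_sites))
    scores = {}
    for kmer, obs in observed.items():
        background = [c[kmer] for c in shuffled]
        sd = pstdev(background)
        if sd > 0:  # skip pentamers with no variation in the background
            scores[kmer] = (obs - mean(background)) / sd
    return scores
```

<p>Calling <it>zscores(sequence, crosslink_positions)</it> returns a dictionary from pentamer to Z-score; sorting it by value gives a ranking from which the highest-scoring pentamers could be taken for motif construction.</p>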
</sec>
<sec><st><p>Mapping of crosslink sites to exon-intron junctions and branch points</p></st>
<p>Crosslink sites located within a maximum 600-nucleotide window [-300,+300] around exon-intron boundaries were mapped to these regions. Each occurrence of a crosslink site was assigned to the closest exon-intron boundary, counted as 1.0 and normalized by the number of junctions spanning the crosslink position. In the case of exon-intron junctions, only the last half of the exon and the first half of the intron were used to obtain the distribution of exons and introns spanning each position relative to the boundary. Similarly, the last half of introns and the first half of exons were used for intron-exon junctions. Junctions where exons were shorter than 60 nucleotides or introns shorter than 200 nucleotides were ignored (&lt; 15% of all possible junctions). For branch point RNA maps, we used computationally predicted branch points from Corvelo <it>et al</it>. <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Only the best-scoring branch point with a non-negative SVM score that resided in the last half of the intron was used. In the case of ties, we used the branch point closest to the intron-exon junction. For normalization, only the last half of the intron was used, and the branch point was at position zero. We ignored introns shorter than 240 nucleotides and introns where the branch point was closer than 20 nucleotides to the intron-exon junction.</p>
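<p>The normalization used for the exon-intron junction maps can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the code used in the study: the function names and the tuple layout (boundary position, upstream exon length, downstream intron length) are hypothetical, only plus-strand exon-intron junctions are handled, and the junction list is assumed to be sorted by boundary position.</p>

```python
from bisect import bisect_left
from collections import Counter

WINDOW = 300  # at most 300 nt on each side of the boundary
MIN_EXON, MIN_INTRON = 60, 200  # shorter exons or introns are ignored


def usable(junctions):
    """Drop junctions whose exon or intron is shorter than the cut-offs."""
    return [j for j in junctions if j[1] >= MIN_EXON and j[2] >= MIN_INTRON]


def span(exon_len, intron_len):
    """Relative positions covered by one junction.

    Only the last half of the exon (negative side) and the first half of
    the intron (positive side) are used, capped at the 300-nt window.
    """
    return -min(WINDOW, exon_len // 2), min(WINDOW, intron_len // 2)


def rna_map(junctions, crosslinks):
    """Return {relative position: crosslink count / junctions spanning it}.

    junctions: sorted list of (boundary, exon_len, intron_len) tuples.
    crosslinks: genomic positions of crosslink sites on the same strand.
    """
    junctions = usable(junctions)
    boundaries = [j[0] for j in junctions]
    coverage = Counter()
    for _, exon_len, intron_len in junctions:
        lo, hi = span(exon_len, intron_len)
        coverage.update(range(lo, hi + 1))
    counts = Counter()
    for pos in crosslinks:
        i = bisect_left(boundaries, pos)
        nearby = junctions[max(0, i - 1):i + 1]  # neighbours on either side
        if not nearby:
            continue
        boundary, exon_len, intron_len = min(
            nearby, key=lambda j: abs(pos - j[0]))
        rel = pos - boundary
        lo, hi = span(exon_len, intron_len)
        if rel >= lo and hi >= rel:
            counts[rel] += 1.0  # each crosslink occurrence counts as 1.0
    return {rel: n / coverage[rel] for rel, n in counts.items()}
```

<p>Dividing by the per-position coverage compensates for the fact that short exons and introns contribute only to positions close to the boundary, so distal positions are not under-represented merely because fewer junctions reach them.</p>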
</sec>
<sec><st><p>Minigene analysis</p></st>
<p>For the knockdown of <it>SRSF3 </it>or <it>SRSF4</it>, esiRNA sequences described previously were used <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. For the overexpression, human <it>SRSF3 </it>and <it>SRSF4 </it>cDNAs were cloned into pYFP-N2 and pEGFP-N3 vectors (Clontech), respectively. To construct the minigenes, the region of interest was PCR amplified from P19 genomic DNA and cloned into pcDNA3 vector (Invitrogen). The plasmids carrying the minigenes were co-transfected with either <it>SRSF3 </it>or <it>SRSF4 </it>esiRNA or overexpression construct into P19 cells using Lipofectamine 2000 (Invitrogen). Empty vectors or esiRNA targeting EGFP were used as controls. Twenty-four hours post-transfection, total RNA was isolated using acid phenol-chloroform extraction (Ambion). After DNaseI treatment the RNA was reverse transcribed with Superscript III (Invitrogen). The splicing patterns of minigenes were analyzed using vector-specific PCR primers. To abrogate the nonsense-mediated decay pathway, P19 wt cells were treated with 300 &#956;g/ml cycloheximide for 3 hours. Untreated samples were processed in parallel. Total RNA was extracted and the samples treated as described above. Gene-specific primers spanning the exons flanking the PTC-containing cassette exon were used. Total expression of <it>SRSF2, SRSF5 </it>and <it>SRSF7 </it>was determined by RT-qPCR using primers spanning an exon-exon junction. <it>ACTB </it>was used as a reference gene to normalize for cDNA synthesis efficiency. Primer sequences are available upon request.</p>
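<p>Normalization of RT-qPCR data to a reference gene can be illustrated with a short sketch. The text does not state which quantification model was applied; the example below assumes the commonly used comparative threshold-cycle (2 to the power of minus delta-delta Ct) calculation with perfect amplification efficiency, and the function name and Ct values are hypothetical.</p>

```python
def relative_expression(ct_target, ct_reference,
                        ct_target_ctrl, ct_reference_ctrl):
    """Fold change of a target mRNA relative to a control sample.

    Assumption: comparative Ct method with 100% amplification efficiency.
    Each target Ct is first normalized to the reference gene (for example
    ACTB) measured in the same sample, then compared with the control.
    """
    delta_sample = ct_target - ct_reference
    delta_control = ct_target_ctrl - ct_reference_ctrl
    return 2.0 ** -(delta_sample - delta_control)


# Hypothetical Ct values: the target crosses threshold one cycle later in
# the treated sample while the reference gene is unchanged.
fold_change = relative_expression(25.0, 18.0, 24.0, 18.0)
```

<p>With these illustrative values the target is at half the control level, the kind of decrease expected when a transcript is diverted into an NMD-sensitive isoform.</p>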
</sec>
<sec><st><p>UV crosslinking and RNA immunoprecipitation</p></st>
<p>UV-crosslinked cells (see above) were harvested and the cytoplasmic fraction was separated. The total cell pellet was suspended in NET-2 buffer (50 mM Tris-HCl, pH 7.4; 150 mM NaCl; 0.05% (v/v) Nonidet P-40) for western blot analysis or Trizol (Invitrogen) for RNA isolation. For the fractionation, the cell pellet was suspended in hypotonic buffer (10 mM Hepes, pH 7.4; 10 mM NaCl; 3 mM MgCl<sub>2</sub>; 0.3% (v/v) Nonidet P-40; RNaseOUT; complete protease inhibitor cocktail), the nuclear pellet was collected by centrifugation, and the supernatant was collected as the cytoplasmic fraction. The cytoplasmic fraction was used as such or RNA was extracted as above. Input, mock and immunoprecipitation samples were independently reverse transcribed with either oligo-dT or hexamer primers. Primers specific for individual histone mRNAs were used for qPCR amplification; <it>POFUT1 </it>was used as a reference gene for SRSF3 and <it>DTMYK </it>for SRSF4 to normalize for cDNA synthesis efficiency. Oligo-dT and hexamer samples were normalized independently of each other to their respective input. The primer sequences are available upon request.</p>
</sec>
<sec><st><p>Database accession</p></st>
<p>The sequencing data have been submitted to the Array Express database <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, accession number E-MTAB-747.</p>
</sec>
</sec>
<sec><st><p>Abbreviations</p></st>
<p>BAC: bacterial artificial chromosome; CHX: cycloheximide; CLIP: crosslinking and immunoprecipitation; EGFP: enhanced green fluorescent protein; GO: Gene Ontology; iCLIP: individual nucleotide-resolution UV crosslinking and immunoprecipitation; lincRNA: long intergenic non-coding RNA; mRNP: messenger ribonucleoprotein; ncRNA: non-coding RNA; NMD: nonsense-mediated decay; ORF: open reading frame; PTC: premature termination codon; RIP: RNA immunoprecipitation; RNP: ribonucleoprotein; RRM: RNA recognition motif; RT-(q)PCR: reverse transcription (quantitative) polymerase chain reaction; scaRNA: small Cajal body-specific RNA; SELEX: systematic evolution of ligands by exponential enrichment; SLBP: stem loop binding protein; snoRNA: small nucleolar RNA; UTR: untranslated region; UV: ultraviolet.</p>
</sec>
<sec><st><p>Competing interests</p></st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec><st><p>Authors' contributions</p></st>
<p>M-L&#196;, MMM and KMN designed the experiments; M-L&#196; and MMM performed the experiments; M-L&#196;, HB, IH, JU, TC and CG analyzed the data; M-L&#196;, MMM and KMN wrote the manuscript. The final manuscript has been read and approved by all the authors.</p>
</sec>
</bdy>
<bm>
<ack>
<sec><st><p>Acknowledgements</p></st>
<p>We thank members of the Neugebauer lab, Michael Hiller, and Tim Mercer for helpful discussions and comments on the manuscript. Financial support was provided by the Sigrid Juselius Foundation (to M-L&#196;), the Helsingin Sanomain Foundation (to M-L&#196;), an EMBO long-term fellowship (to MMM), the Slovenian Research Agency (Z7-3665 to TC), the Max Planck Society (to KMN), and the Deutsche Forschungsgemeinschaft (NE909/3-1 to KMN).</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>SMART 6: recent updates and new developments.</p></title><aug><au><snm>Letunic</snm><fnm>I</fnm></au><au><snm>Doerks</snm><fnm>T</fnm></au><au><snm>Bork</snm><fnm>P</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><fpage>D229</fpage><lpage>D232</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkn808</pubid><pubid idtype="pmcid">2686533</pubid><pubid idtype="pmpid" link="fulltext">18978020</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>SR proteins in vertical integration of gene expression from transcription to RNA processing to translation.</p></title><aug><au><snm>Zhong</snm><fnm>X-Y</fnm></au><au><snm>Wang</snm><fnm>P</fnm></au><au><snm>Han</snm><fnm>J</fnm></au><au><snm>Rosenfeld</snm><fnm>MG</fnm></au><au><snm>Fu</snm><fnm>X-D</fnm></au></aug><source>Mol Cell</source><pubdate>2009</pubdate><volume>35</volume><fpage>1</fpage><lpage>10</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2009.06.016</pubid><pubid idtype="pmcid">2744344</pubid><pubid idtype="pmpid" link="fulltext">19595711</pubid></pubidlist></xrefbib></bibl><bibl id="B3"><title><p>The SR protein family.</p></title><aug><au><snm>Shepard</snm><fnm>P</fnm></au><au><snm>Hertel</snm><fnm>K</fnm></au></aug><source>Genome Biol</source><pubdate>2009</pubdate><volume>10</volume><fpage>242</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2009-10-10-242</pubid><pubid idtype="pmcid">2784316</pubid><pubid idtype="pmpid" link="fulltext">19857271</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>SR proteins and related factors in alternative splicing.</p></title><aug><au><snm>Lin</snm><fnm>S</fnm></au><au><snm>Fu</snm><fnm>XD</fnm></au></aug><source>Adv Exp Med Biol</source><pubdate>2007</pubdate><volume>623</volume><fpage>107</fpage><lpage>122</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1007/978-0-387-77374-2_7</pubid><pubid 
idtype="pmpid">18380343</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>SR protein family members display diverse activities in the formation of nascent and mature mRNPs in vivo.</p></title><aug><au><snm>Sapra</snm><fnm>AK</fnm></au><au><snm>&#196;nk&#246;</snm><fnm>M-L</fnm></au><au><snm>Grishina</snm><fnm>I</fnm></au><au><snm>Lorenz</snm><fnm>M</fnm></au><au><snm>Pabis</snm><fnm>M</fnm></au><au><snm>Poser</snm><fnm>I</fnm></au><au><snm>Rollins</snm><fnm>J</fnm></au><au><snm>Weiland</snm><fnm>E-M</fnm></au><au><snm>Neugebauer</snm><fnm>KM</fnm></au></aug><source>Mol Cell</source><pubdate>2009</pubdate><volume>34</volume><fpage>179</fpage><lpage>190</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2009.02.031</pubid><pubid idtype="pmpid" link="fulltext">19394295</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>A specific subset of SR proteins shuttles continuously between the nucleus and the cytoplasm.</p></title><aug><au><snm>Caceres</snm><fnm>JF</fnm></au><au><snm>Screaton</snm><fnm>GR</fnm></au><au><snm>Krainer</snm><fnm>AR</fnm></au></aug><source>Genes Dev</source><pubdate>1998</pubdate><volume>12</volume><fpage>55</fpage><lpage>66</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.12.1.55</pubid><pubid idtype="pmcid">316398</pubid><pubid idtype="pmpid" link="fulltext">9420331</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>Arginine-serine-rich domains bound at splicing enhancers contact the branchpoint to promote prespliceosome assembly.</p></title><aug><au><snm>Shen</snm><fnm>H</fnm></au><au><snm>Kan</snm><fnm>JLC</fnm></au><au><snm>Green</snm><fnm>MR</fnm></au></aug><source>Mol Cell</source><pubdate>2004</pubdate><volume>13</volume><fpage>367</fpage><lpage>376</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1097-2765(04)00025-5</pubid><pubid idtype="pmpid" link="fulltext">14967144</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Substrate specificities of SR proteins in constitutive 
splicing are determined by their RNA recognition motifs and composite pre-mRNA exonic elements.</p></title><aug><au><snm>Mayeda</snm><fnm>A</fnm></au><au><snm>Screaton</snm><fnm>GR</fnm></au><au><snm>Chandler</snm><fnm>SD</fnm></au><au><snm>Fu</snm><fnm>X-D</fnm></au><au><snm>Krainer</snm><fnm>AR</fnm></au></aug><source>Mol Cell Biol</source><pubdate>1999</pubdate><volume>19</volume><fpage>1853</fpage><lpage>1863</lpage><xrefbib><pubidlist><pubid idtype="pmcid">83978</pubid><pubid idtype="pmpid" link="fulltext">10022872</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>Role of the modular domains of SR proteins in subnuclear localization and alternative splicing specificity.</p></title><aug><au><snm>Caceres</snm><fnm>JF</fnm></au><au><snm>Misteli</snm><fnm>T</fnm></au><au><snm>Screaton</snm><fnm>GR</fnm></au><au><snm>Spector</snm><fnm>DL</fnm></au><au><snm>Krainer</snm><fnm>AR</fnm></au></aug><source>J Cell Biol</source><pubdate>1997</pubdate><volume>138</volume><fpage>225</fpage><lpage>238</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1083/jcb.138.2.225</pubid><pubid idtype="pmcid">2138183</pubid><pubid idtype="pmpid" link="fulltext">9230067</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>Rapid and systematic analysis of the RNA recognition specificities of RNA-binding proteins.</p></title><aug><au><snm>Ray</snm><fnm>D</fnm></au><au><snm>Kazan</snm><fnm>H</fnm></au><au><snm>Chan</snm><fnm>ET</fnm></au><au><snm>Castillo</snm><fnm>LP</fnm></au><au><snm>Chaudhry</snm><fnm>S</fnm></au><au><snm>Talukder</snm><fnm>S</fnm></au><au><snm>Blencowe</snm><fnm>BJ</fnm></au><au><snm>Morris</snm><fnm>Q</fnm></au><au><snm>Hughes</snm><fnm>TR</fnm></au></aug><source>Nat Biotechnol</source><pubdate>2009</pubdate><volume>27</volume><fpage>667</fpage><lpage>670</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nbt.1550</pubid><pubid idtype="pmpid" link="fulltext">19561594</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>Alternative 
splicing: new insights from global analyses.</p></title><aug><au><snm>Blencowe</snm><fnm>BJ</fnm></au></aug><source>Cell</source><pubdate>2006</pubdate><volume>126</volume><fpage>37</fpage><lpage>47</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2006.06.023</pubid><pubid idtype="pmpid" link="fulltext">16839875</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts.</p></title><aug><au><snm>Sanford</snm><fnm>JR</fnm></au><au><snm>Wang</snm><fnm>X</fnm></au><au><snm>Mort</snm><fnm>M</fnm></au><au><snm>VanDuyn</snm><fnm>N</fnm></au><au><snm>Cooper</snm><fnm>DN</fnm></au><au><snm>Mooney</snm><fnm>SD</fnm></au><au><snm>Edenberg</snm><fnm>HJ</fnm></au><au><snm>Liu</snm><fnm>Y</fnm></au></aug><source>Genome Res</source><pubdate>2009</pubdate><volume>19</volume><fpage>381</fpage><lpage>394</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2661799</pubid><pubid idtype="pmpid" link="fulltext">19116412</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>Global analysis reveals SRp20- and SRp75-specific mRNPs in cycling and neural cells.</p></title><aug><au><snm>&#196;nk&#246;</snm><fnm>M-L</fnm></au><au><snm>Morales</snm><fnm>L</fnm></au><au><snm>Henry</snm><fnm>I</fnm></au><au><snm>Beyer</snm><fnm>A</fnm></au><au><snm>Neugebauer</snm><fnm>KM</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2010</pubdate><volume>17</volume><fpage>962</fpage><lpage>970</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb.1862</pubid><pubid idtype="pmpid" link="fulltext">20639886</pubid></pubidlist></xrefbib></bibl><bibl id="B14"><title><p>CLIP: Construction of cDNA libraries for high-throughput sequencing from RNAs cross-linked to proteins in 
vivo.</p></title><aug><au><snm>Wang</snm><fnm>Z</fnm></au><au><snm>Tollervey</snm><fnm>J</fnm></au><au><snm>Briese</snm><fnm>M</fnm></au><au><snm>Turner</snm><fnm>D</fnm></au><au><snm>Ule</snm><fnm>J</fnm></au></aug><source>Methods</source><pubdate>2009</pubdate><volume>48</volume><fpage>287</fpage><lpage>293</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.ymeth.2009.02.021</pubid><pubid idtype="pmpid" link="fulltext">19272451</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>iCLIP reveals the function of hnRNP particles in splicing at individual nucleotide resolution.</p></title><aug><au><snm>K&#246;nig</snm><fnm>J</fnm></au><au><snm>Zarnack</snm><fnm>K</fnm></au><au><snm>Rot</snm><fnm>G</fnm></au><au><snm>Curk</snm><fnm>T</fnm></au><au><snm>Kayikci</snm><fnm>M</fnm></au><au><snm>Zupan</snm><fnm>B</fnm></au><au><snm>Turner</snm><fnm>DJ</fnm></au><au><snm>Luscombe</snm><fnm>NM</fnm></au><au><snm>Ule</snm><fnm>J</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2010</pubdate><volume>17</volume><fpage>909</fpage><lpage>915</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb.1838</pubid><pubid idtype="pmcid">3000544</pubid><pubid idtype="pmpid" link="fulltext">20601959</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>An RNA code for the FOX2 splicing regulator revealed by mapping RNA-protein interactions in stem cells.</p></title><aug><au><snm>Yeo</snm><fnm>GW</fnm></au><au><snm>Coufal</snm><fnm>NG</fnm></au><au><snm>Liang</snm><fnm>TY</fnm></au><au><snm>Peng</snm><fnm>GE</fnm></au><au><snm>Fu</snm><fnm>X-D</fnm></au><au><snm>Gage</snm><fnm>FH</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2009</pubdate><volume>16</volume><fpage>130</fpage><lpage>137</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb.1545</pubid><pubid idtype="pmcid">2735254</pubid><pubid idtype="pmpid" link="fulltext">19136955</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>iCLIP predicts the dual splicing effects of 
TIA-RNA interactions.</p></title><aug><au><snm>Wang</snm><fnm>Z</fnm></au><au><snm>Kayikci</snm><fnm>M</fnm></au><au><snm>Briese</snm><fnm>M</fnm></au><au><snm>Zarnack</snm><fnm>K</fnm></au><au><snm>Luscombe</snm><fnm>NM</fnm></au><au><snm>Rot</snm><fnm>G</fnm></au><au><snm>Zupan</snm><fnm>B</fnm></au><au><snm>Curk</snm><fnm>T</fnm></au><au><snm>Ule</snm><fnm>J</fnm></au></aug><source>PLoS Biol</source><pubdate>2010</pubdate><volume>8</volume><fpage>e1000530</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pbio.1000530</pubid><pubid idtype="pmcid">2964331</pubid><pubid idtype="pmpid" link="fulltext">21048981</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>The splicing factors 9G8 and SRp20 transactivate splicing through different and specific enhancers.</p></title><aug><au><snm>Cavaloc</snm><fnm>Y</fnm></au><au><snm>Bourgeois</snm><fnm>CF</fnm></au><au><snm>Kister</snm><fnm>L</fnm></au><au><snm>Stevenin</snm><fnm>J</fnm></au></aug><source>RNA</source><pubdate>1999</pubdate><volume>5</volume><fpage>468</fpage><lpage>483</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1017/S1355838299981967</pubid><pubid idtype="pmcid">1369774</pubid><pubid idtype="pmpid" link="fulltext">10094314</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>Selection and characterization of pre-mRNA splicing enhancers: identification of novel SR protein-specific enhancer sequences.</p></title><aug><au><snm>Schaal</snm><fnm>TD</fnm></au><au><snm>Maniatis</snm><fnm>T</fnm></au></aug><source>Mol Cell Biol</source><pubdate>1999</pubdate><volume>19</volume><fpage>1705</fpage><lpage>1719</lpage><xrefbib><pubidlist><pubid idtype="pmcid">83964</pubid><pubid idtype="pmpid" link="fulltext">10022858</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Structural, functional, and protein binding analyses of bovine papillomavirus type 1 exonic splicing 
enhancers.</p></title><aug><au><snm>Zheng</snm><fnm>ZM</fnm></au><au><snm>He</snm><fnm>PJ</fnm></au><au><snm>Baker</snm><fnm>CC</fnm></au></aug><source>J Virol</source><pubdate>1997</pubdate><volume>71</volume><fpage>9096</fpage><lpage>9107</lpage><xrefbib><pubidlist><pubid idtype="pmcid">230210</pubid><pubid idtype="pmpid" link="fulltext">9371566</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>Characterizing the RNA targets and position-dependent splicing regulation by TDP-43.</p></title><aug><au><snm>Tollervey</snm><fnm>JR</fnm></au><au><snm>Curk</snm><fnm>T</fnm></au><au><snm>Rogelj</snm><fnm>B</fnm></au><au><snm>Briese</snm><fnm>M</fnm></au><au><snm>Cereda</snm><fnm>M</fnm></au><au><snm>Kayikci</snm><fnm>M</fnm></au><au><snm>K&#246;nig</snm><fnm>J</fnm></au><au><snm>Hortob&#225;gyi</snm><fnm>T</fnm></au><au><snm>Nishimura</snm><fnm>AL</fnm></au><au><snm>Zupunski</snm><fnm>V</fnm></au><au><snm>Patani</snm><fnm>R</fnm></au><au><snm>Chandran</snm><fnm>S</fnm></au><au><snm>Rot</snm><fnm>G</fnm></au><au><snm>Zupan</snm><fnm>B</fnm></au><au><snm>Shaw</snm><fnm>CE</fnm></au><au><snm>Ule</snm><fnm>J</fnm></au></aug><source>Nat Neurosci</source><pubdate>2011</pubdate><volume>14</volume><fpage>452</fpage><lpage>458</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nn.2778</pubid><pubid idtype="pmcid">3108889</pubid><pubid idtype="pmpid" link="fulltext">21358640</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>The nuclear-retained noncoding RNA MALAT1 regulates alternative splicing by modulating SR splicing factor 
phosphorylation.</p></title><aug><au><snm>Tripathi</snm><fnm>V</fnm></au><au><snm>Ellis</snm><fnm>JD</fnm></au><au><snm>Shen</snm><fnm>Z</fnm></au><au><snm>Song</snm><fnm>DY</fnm></au><au><snm>Pan</snm><fnm>Q</fnm></au><au><snm>Watt</snm><fnm>AT</fnm></au><au><snm>Freier</snm><fnm>SM</fnm></au><au><snm>Bennett</snm><fnm>CF</fnm></au><au><snm>Sharma</snm><fnm>A</fnm></au><au><snm>Bubulya</snm><fnm>PA</fnm></au><au><snm>Blencowe</snm><fnm>BJ</fnm></au><au><snm>Prasanth</snm><fnm>SG</fnm></au><au><snm>Prasanth</snm><fnm>KV</fnm></au></aug><source>Mol Cell</source><pubdate>2010</pubdate><volume>39</volume><fpage>925</fpage><lpage>938</lpage></bibl><bibl id="B23"><title><p>Nuclear organization and dynamics of 7SK RNA in regulating gene expression.</p></title><aug><au><snm>Prasanth</snm><fnm>KV</fnm></au><au><snm>Camiolo</snm><fnm>M</fnm></au><au><snm>Chan</snm><fnm>G</fnm></au><au><snm>Tripathi</snm><fnm>V</fnm></au><au><snm>Denis</snm><fnm>L</fnm></au><au><snm>Nakamura</snm><fnm>T</fnm></au><au><snm>Hubner</snm><fnm>MR</fnm></au><au><snm>Spector</snm><fnm>DL</fnm></au></aug><source>Mol Biol Cell</source><pubdate>2010</pubdate><volume>21</volume><fpage>4184</fpage><lpage>4196</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1091/mbc.E10-02-0105</pubid><pubid idtype="pmcid">2993747</pubid><pubid idtype="pmpid" link="fulltext">20881057</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>Small nucleolar RNA-guided post-transcriptional modification of cellular RNAs.</p></title><aug><au><snm>Kiss</snm><fnm>T</fnm></au></aug><source>EMBO J</source><pubdate>2001</pubdate><volume>20</volume><fpage>3617</fpage><lpage>3622</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/emboj/20.14.3617</pubid><pubid idtype="pmcid">125535</pubid><pubid idtype="pmpid" link="fulltext">11447102</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>The human and mouse replication-dependent histone 
genes.</p></title><aug><au><snm>Marzluff</snm><fnm>WF</fnm></au><au><snm>Gongidi</snm><fnm>P</fnm></au><au><snm>Woods</snm><fnm>KR</fnm></au><au><snm>Jin</snm><fnm>J</fnm></au><au><snm>Maltais</snm><fnm>LJ</fnm></au></aug><source>Genomics</source><pubdate>2002</pubdate><volume>80</volume><fpage>487</fpage><lpage>498</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1006/geno.2002.6850</pubid><pubid idtype="pmpid" link="fulltext">12408966</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Metabolism and regulation of canonical histone mRNAs: life without a poly(A) tail.</p></title><aug><au><snm>Marzluff</snm><fnm>WF</fnm></au><au><snm>Wagner</snm><fnm>EJ</fnm></au><au><snm>Duronio</snm><fnm>RJ</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2008</pubdate><volume>9</volume><fpage>843</fpage><lpage>854</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg2438</pubid><pubid idtype="pmcid">2715827</pubid><pubid idtype="pmpid" link="fulltext">18927579</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>Splicing factors SRp20 and 9G8 promote the nucleocytoplasmic export of mRNA.</p></title><aug><au><snm>Huang</snm><fnm>Y</fnm></au><au><snm>Steitz</snm><fnm>JA</fnm></au></aug><source>Mol Cell</source><pubdate>2001</pubdate><volume>7</volume><fpage>899</fpage><lpage>905</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1097-2765(01)00233-7</pubid><pubid idtype="pmpid" link="fulltext">11336712</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>SR splicing factors serve as adapter proteins for TAP-dependent mRNA export.</p></title><aug><au><snm>Huang</snm><fnm>Y</fnm></au><au><snm>Gattoni</snm><fnm>R</fnm></au><au><snm>St&#233;venin</snm><fnm>J</fnm></au><au><snm>Steitz</snm><fnm>JA</fnm></au></aug><source>Mol Cell</source><pubdate>2003</pubdate><volume>11</volume><fpage>837</fpage><lpage>843</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S1097-2765(03)00089-3</pubid><pubid idtype="pmpid" 
link="fulltext">12667464</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Nuclear export of metazoan replication-dependent histone mRNAs is dependent on RNA length and is mediated by TAP.</p></title><aug><au><snm>Erkmann</snm><fnm>JA</fnm></au><au><snm>Sanchez</snm><fnm>R</fnm></au><au><snm>Treichel</snm><fnm>N</fnm></au><au><snm>Marzluff</snm><fnm>WF</fnm></au><au><snm>Kutay</snm><fnm>U</fnm></au></aug><source>RNA</source><pubdate>2005</pubdate><volume>11</volume><fpage>45</fpage><lpage>58</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.7189205</pubid><pubid idtype="pmcid">1370690</pubid><pubid idtype="pmpid" link="fulltext">15611298</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Juxtaposition of two distant, serine-arginine-rich protein-binding elements is required for optimal polyadenylation in Rous sarcoma virus.</p></title><aug><au><snm>Hudson</snm><fnm>SW</fnm></au><au><snm>McNally</snm><fnm>MT</fnm></au></aug><source>J Virol</source><pubdate>2011</pubdate><volume>85</volume><fpage>11351</fpage><lpage>11360</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1128/JVI.00721-11</pubid><pubid idtype="pmpid" link="fulltext">21849435</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>Regulation of alternative polyadenylation by U1 snRNPs and SRp20.</p></title><aug><au><snm>Lou</snm><fnm>H</fnm></au><au><snm>Neugebauer</snm><fnm>KM</fnm></au><au><snm>Gagel</snm><fnm>RF</fnm></au><au><snm>Berget</snm><fnm>SM</fnm></au></aug><source>Mol Cell Biol</source><pubdate>1998</pubdate><volume>18</volume><fpage>4977</fpage><lpage>4985</lpage><xrefbib><pubidlist><pubid idtype="pmcid">109082</pubid><pubid idtype="pmpid" link="fulltext">9710581</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>Knockdown of SLBP results in nuclear retention of histone 
mRNA.</p></title><aug><au><snm>Sullivan</snm><fnm>KD</fnm></au><au><snm>Mullen</snm><fnm>TE</fnm></au><au><snm>Marzluff</snm><fnm>WF</fnm></au><au><snm>Wagner</snm><fnm>EJ</fnm></au></aug><source>RNA</source><pubdate>2009</pubdate><volume>15</volume><fpage>459</fpage><lpage>472</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.1205409</pubid><pubid idtype="pmcid">2657014</pubid><pubid idtype="pmpid" link="fulltext">19155325</pubid></pubidlist></xrefbib></bibl><bibl id="B33"><title><p>U7 snRNA mutations in Drosophila block histone pre-mRNA processing and disrupt oogenesis.</p></title><aug><au><snm>Godfrey</snm><fnm>AC</fnm></au><au><snm>Kupsco</snm><fnm>JM</fnm></au><au><snm>Burch</snm><fnm>BD</fnm></au><au><snm>Zimmerman</snm><fnm>RM</fnm></au><au><snm>Dominski</snm><fnm>Z</fnm></au><au><snm>Marzluff</snm><fnm>WF</fnm></au><au><snm>Duronio</snm><fnm>RJ</fnm></au></aug><source>RNA</source><pubdate>2006</pubdate><volume>12</volume><fpage>396</fpage><lpage>409</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.2270406</pubid><pubid idtype="pmcid">1383579</pubid><pubid idtype="pmpid" link="fulltext">16495235</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>A genome-wide RNA interference screen reveals that variant histones are necessary for replication-dependent histone pre-mRNA processing.</p></title><aug><au><snm>Wagner</snm><fnm>EJ</fnm></au><au><snm>Burch</snm><fnm>BD</fnm></au><au><snm>Godfrey</snm><fnm>AC</fnm></au><au><snm>Salzler</snm><fnm>HR</fnm></au><au><snm>Duronio</snm><fnm>RJ</fnm></au><au><snm>Marzluff</snm><fnm>WF</fnm></au></aug><source>Mol Cell</source><pubdate>2007</pubdate><volume>28</volume><fpage>692</fpage><lpage>699</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2007.10.009</pubid><pubid idtype="pmpid" link="fulltext">18042462</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Complex and dynamic landscape of RNA polyadenylation revealed by 
PAS-Seq.</p></title><aug><au><snm>Shepard</snm><fnm>PJ</fnm></au><au><snm>Choi</snm><fnm>E-A</fnm></au><au><snm>Lu</snm><fnm>J</fnm></au><au><snm>Flanagan</snm><fnm>LA</fnm></au><au><snm>Hertel</snm><fnm>KJ</fnm></au><au><snm>Shi</snm><fnm>Y</fnm></au></aug><source>RNA</source><pubdate>2011</pubdate><volume>17</volume><fpage>761</fpage><lpage>772</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1261/rna.2581711</pubid><pubid idtype="pmcid">3062186</pubid><pubid idtype="pmpid" link="fulltext">21343387</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Genomewide characterization of non-polyadenylated RNAs.</p></title><aug><au><snm>Yang</snm><fnm>L</fnm></au><au><snm>Duff</snm><fnm>M</fnm></au><au><snm>Graveley</snm><fnm>B</fnm></au><au><snm>Carmichael</snm><fnm>G</fnm></au><au><snm>Chen</snm><fnm>L-L</fnm></au></aug><source>Genome Biol</source><pubdate>2011</pubdate><volume>12</volume><fpage>R16</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2011-12-2-r16</pubid><pubid idtype="pmcid">3188798</pubid><pubid idtype="pmpid" link="fulltext">21324177</pubid></pubidlist></xrefbib></bibl><bibl id="B37"><title><p>SR proteins are 'locators' of the RNA splicing machinery.</p></title><aug><au><snm>Graveley</snm><fnm>BR</fnm></au><au><snm>Hertel</snm><fnm>KJ</fnm></au><au><snm>Maniatis</snm><fnm>T</fnm></au></aug><source>Curr Biol</source><pubdate>1999</pubdate><volume>9</volume><fpage>R6</fpage><lpage>R7</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">9889434</pubid></xrefbib></bibl><bibl id="B38"><title><p>A class of human exons with predicted distant branch points revealed by analysis of AG dinucleotide exclusion zones.</p></title><aug><au><snm>Gooding</snm><fnm>C</fnm></au><au><snm>Clark</snm><fnm>F</fnm></au><au><snm>Wollerton</snm><fnm>M</fnm></au><au><snm>Grellscheid</snm><fnm>S-N</fnm></au><au><snm>Groom</snm><fnm>H</fnm></au><au><snm>Smith</snm><fnm>C</fnm></au></aug><source>Genome 
Biol</source><pubdate>2006</pubdate><volume>7</volume><fpage>R1</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2006-7-1-r1</pubid><pubid idtype="pmcid">1431707</pubid><pubid idtype="pmpid" link="fulltext">16507133</pubid></pubidlist></xrefbib></bibl><bibl id="B39"><title><p>Genome-wide association between branch point properties and alternative splicing.</p></title><aug><au><snm>Corvelo</snm><fnm>A</fnm></au><au><snm>Hallegger</snm><fnm>M</fnm></au><au><snm>Smith</snm><fnm>CWJ</fnm></au><au><snm>Eyras</snm><fnm>E</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2010</pubdate><volume>6</volume><fpage>e1001016</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1371/journal.pcbi.1001016</pubid><pubid idtype="pmcid">2991248</pubid><pubid idtype="pmpid" link="fulltext">21124863</pubid></pubidlist></xrefbib></bibl><bibl id="B40"><title><p>RS domains contact splicing signals and promote splicing by a common mechanism in yeast through humans.</p></title><aug><au><snm>Shen</snm><fnm>H</fnm></au><au><snm>Green</snm><fnm>MR</fnm></au></aug><source>Genes Dev</source><pubdate>2006</pubdate><volume>20</volume><fpage>1755</fpage><lpage>1765</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.1422106</pubid><pubid idtype="pmcid">1522072</pubid><pubid idtype="pmpid" link="fulltext">16766678</pubid></pubidlist></xrefbib></bibl><bibl id="B41"><title><p>The splicing factor SRp20 modifies splicing of its own mRNA and ASF/SF2 antagonizes this regulation.</p></title><aug><au><snm>Jumaa</snm><fnm>H</fnm></au><au><snm>Nielsen</snm><fnm>PJ</fnm></au></aug><source>EMBO J</source><pubdate>1997</pubdate><volume>16</volume><fpage>5077</fpage><lpage>5085</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/emboj/16.16.5077</pubid><pubid idtype="pmcid">1170142</pubid><pubid idtype="pmpid" link="fulltext">9305649</pubid></pubidlist></xrefbib></bibl><bibl id="B42"><title><p>Ultraconserved elements are associated with homeostatic control of splicing regulators by 
alternative splicing and nonsense-mediated decay.</p></title><aug><au><snm>Ni</snm><fnm>JZ</fnm></au><au><snm>Grate</snm><fnm>L</fnm></au><au><snm>Donohue</snm><fnm>JP</fnm></au><au><snm>Preston</snm><fnm>C</fnm></au><au><snm>Nobida</snm><fnm>N</fnm></au><au><snm>O&apos;Brien</snm><fnm>G</fnm></au><au><snm>Shiue</snm><fnm>L</fnm></au><au><snm>Clark</snm><fnm>TA</fnm></au><au><snm>Blume</snm><fnm>JE</fnm></au><au><snm>Ares</snm><fnm>M</fnm><suf>Jr</suf></au></aug><source>Genes Dev</source><pubdate>2007</pubdate><volume>21</volume><fpage>708</fpage><lpage>718</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.1525507</pubid><pubid idtype="pmcid">1820944</pubid><pubid idtype="pmpid" link="fulltext">17369403</pubid></pubidlist></xrefbib></bibl><bibl id="B43"><title><p>Unproductive splicing of SR genes associated with highly conserved and ultraconserved DNA elements.</p></title><aug><au><snm>Lareau</snm><fnm>LF</fnm></au><au><snm>Inada</snm><fnm>M</fnm></au><au><snm>Green</snm><fnm>RE</fnm></au><au><snm>Wengrod</snm><fnm>JC</fnm></au><au><snm>Brenner</snm><fnm>SE</fnm></au></aug><source>Nature</source><pubdate>2007</pubdate><volume>446</volume><fpage>926</fpage><lpage>929</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature05676</pubid><pubid idtype="pmpid" link="fulltext">17361132</pubid></pubidlist></xrefbib></bibl><bibl id="B44"><title><p>Regulation of alternative splicing by the core spliceosomal machinery.</p></title><aug><au><snm>Saltzman</snm><fnm>AL</fnm></au><au><snm>Pan</snm><fnm>Q</fnm></au><au><snm>Blencowe</snm><fnm>BJ</fnm></au></aug><source>Genes Dev</source><pubdate>2011</pubdate><volume>25</volume><fpage>373</fpage><lpage>384</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gad.2004811</pubid><pubid idtype="pmcid">3042160</pubid><pubid idtype="pmpid" link="fulltext">21325135</pubid></pubidlist></xrefbib></bibl><bibl id="B45"><title><p>Distribution of SR protein exonic splicing enhancer motifs in human protein-coding 
genes.</p></title><aug><au><snm>Wang</snm><fnm>J</fnm></au><au><snm>Smith</snm><fnm>PJ</fnm></au><au><snm>Krainer</snm><fnm>AR</fnm></au><au><snm>Zhang</snm><fnm>MQ</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2005</pubdate><volume>33</volume><fpage>5053</fpage><lpage>5062</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gki810</pubid><pubid idtype="pmcid">1201331</pubid><pubid idtype="pmpid" link="fulltext">16147989</pubid></pubidlist></xrefbib></bibl><bibl id="B46"><title><p>Paraspeckles: nuclear bodies built on long noncoding RNA.</p></title><aug><au><snm>Bond</snm><fnm>CS</fnm></au><au><snm>Fox</snm><fnm>AH</fnm></au></aug><source>J Cell Biol</source><pubdate>2009</pubdate><volume>186</volume><fpage>637</fpage><lpage>644</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1083/jcb.200906113</pubid><pubid idtype="pmcid">2742191</pubid><pubid idtype="pmpid" link="fulltext">19720872</pubid></pubidlist></xrefbib></bibl><bibl id="B47"><title><p>Position within the host intron is critical for efficient processing of box C/D snoRNAs in mammalian cells.</p></title><aug><au><snm>Hirose</snm><fnm>T</fnm></au><au><snm>Steitz</snm><fnm>JA</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>2001</pubdate><volume>98</volume><fpage>12914</fpage><lpage>12919</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.231490998</pubid><pubid idtype="pmcid">60799</pubid><pubid idtype="pmpid" link="fulltext">11606788</pubid></pubidlist></xrefbib></bibl><bibl id="B48"><title><p>Regulation of alternative splicing by short non-coding nuclear RNAs.</p></title><aug><au><snm>Khanna</snm><fnm>A</fnm></au><au><snm>Stamm</snm><fnm>S</fnm></au></aug><source>RNA Biol</source><pubdate>2011</pubdate><volume>7</volume><fpage>480</fpage><lpage>485</lpage></bibl><bibl id="B49"><title><p>Exonic splicing enhancers: mechanism of action, diversity and role in human genetic diseases.</p></title><aug><au><snm>Blencowe</snm><fnm>BJ</fnm></au></aug><source>Trends 
Biochem Sci</source><pubdate>2000</pubdate><volume>25</volume><fpage>106</fpage><lpage>110</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0968-0004(00)01549-8</pubid><pubid idtype="pmpid" link="fulltext">10694877</pubid></pubidlist></xrefbib></bibl><bibl id="B50"><title><p>Regulation of alternative polyadenylation by U1 snRNPs and SRp20.</p></title><aug><au><snm>Lou</snm><fnm>H</fnm></au><au><snm>Neugebauer</snm><fnm>KM</fnm></au><au><snm>Gagel</snm><fnm>RF</fnm></au><au><snm>Berget</snm><fnm>SM</fnm></au></aug><source>Mol Cell Biol</source><pubdate>1998</pubdate><volume>18</volume><fpage>4977</fpage><lpage>4985</lpage><xrefbib><pubidlist><pubid idtype="pmcid">109082</pubid><pubid idtype="pmpid" link="fulltext">9710581</pubid></pubidlist></xrefbib></bibl><bibl id="B51"><title><p>Serine-arginine (SR)-rich splicing factors have an exon-independent function in pre-mRNA splicing.</p></title><aug><au><snm>Hertel</snm><fnm>KJ</fnm></au><au><snm>Maniatis</snm><fnm>T</fnm></au></aug><source>Proc Natl Acad Sci USA</source><pubdate>1999</pubdate><volume>96</volume><fpage>2651</fpage><lpage>2655</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1073/pnas.96.6.2651</pubid><pubid idtype="pmcid">15823</pubid><pubid idtype="pmpid" link="fulltext">10077565</pubid></pubidlist></xrefbib></bibl><bibl id="B52"><title><p>SF2/ASF autoregulation involves multiple layers of post-transcriptional and translational control.</p></title><aug><au><snm>Sun</snm><fnm>S</fnm></au><au><snm>Zhang</snm><fnm>Z</fnm></au><au><snm>Sinha</snm><fnm>R</fnm></au><au><snm>Karni</snm><fnm>R</fnm></au><au><snm>Krainer</snm><fnm>AR</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2010</pubdate><volume>17</volume><fpage>306</fpage><lpage>312</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb.1750</pubid><pubid idtype="pmcid">2921916</pubid><pubid idtype="pmpid" link="fulltext">20139984</pubid></pubidlist></xrefbib></bibl><bibl id="B53"><title><p>WebLogo: a sequence logo 
generator.</p></title><aug><au><snm>Crooks</snm><fnm>GE</fnm></au><au><snm>Hon</snm><fnm>G</fnm></au><au><snm>Chandonia</snm><fnm>J-M</fnm></au><au><snm>Brenner</snm><fnm>SE</fnm></au></aug><source>Genome Res</source><pubdate>2004</pubdate><volume>14</volume><fpage>1188</fpage><lpage>1190</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1101/gr.849004</pubid><pubid idtype="pmcid">419797</pubid><pubid idtype="pmpid" link="fulltext">15173120</pubid></pubidlist></xrefbib></bibl><bibl id="B54"><title><p>ArrayExpress.</p></title><url>http://www.ebi.ac.uk/arrayexpress/</url></bibl></refgrp>
</bm>
</art>