Table 4 |
||
|
Experimental feature definitions |
||
|
Feature |
Term |
Definition |
|
|
||
|
RNA transcription (coding and noncoding) |
CDS |
Coding sequence: well characterized transcribed regions with an annotated protein-coding open reading frame (ORF) |
|
RACEfrags |
5' and 3' rapid Amplification of cDNA ends (RACE), using polyA or total RNA to construct full-length cDNA. This technique has revealed previously unrecognized UTRs |
|
|
TARs/transfrags |
Transcriptionally active regions/transcribed fragments as determined by analyses of cellular RNA (polyA or total) hybridizations to multiple microarray platforms. For the analyses reported here, portions of TARs/transfrags overlapping any CDS, 5' or 3' UTR annotations were removed from the dataset |
|
|
Pseudo-exons |
A pre-mRNA sequence that resembles an exon but is not recognized as such by the splicing machinery |
|
|
TSS |
Transcription start site |
|
|
5' UTR |
Untranslated region: portions of CDS-containing transcripts before the start codon. For the analyses reported here, 5' UTRs overlapping alternatively transcribed CDS annotations were removed from the dataset |
|
|
TUF |
Transcripts of unknown function for noncoding transcripts |
|
|
3' UTR |
Untranslated region: portions of CDS-containing transcripts after the stop codon |
|
|
Transcript regulation: open chromatin/DNA-protein interaction |
DHS |
DNAse I hypersensitive sites are short regions of DNA that are relatively easily cleaved by deoxyribonuclease. Regions of open chromatin detected by quantitative chromatin profiling and novel microarray-based methods. For the analyses reported here, regions that overlap repetitive sequence were removed. Measures of DHS are reported using two sources: the ENCODE Regulome group and the NHGRI |
|
FAIRE-sites |
Formaldehyde assisted isolation of regulatory elements: a procedure used to isolate chromatin that is resistant to the formation of protein-DNA crosslinks. Data suggest that depletion of nucleosomes (the most basic organizational unit of chromatin) at active regulatory regions, such as promotors, is the primary underlying basis for FAIRE [38] |
|
|
HisPolTAF |
Histone modifications, RNA polymerase II (PolII), and transcription regulator TAF250 |
|
|
Sequence specific factors |
Regions of DNA determined to be bound by sequence-specific transcription factors through chromatin immunoprecipitation followed by microarray chip hybridization (so-called 'ChIP-Chip') analyses |
|
|
Sequence specific (all motifs) |
Computationally identified short sequence motifs found to be over-represented in the sequence specific factors dataset |
|
|
Ancestral repeats |
Mobile elements with well defined consensus sequences that inserted into the ancestral genome prior to mammalian radiation. These sequences are considered to be predominantly non-functional and are often used as models of neutrally evolving DNA |
|
|
Cell cycle |
EarlyRepSeg |
Early replicating segments |
|
MidRepSeg |
Mid replicating segments |
|
|
LateRepSeg |
Late replicating segments |
|
|
Evolutionary constraint |
MCS strict |
Multi-species conserved sequences: strict criteria |
|
MCS moderate |
Multi-species conserved sequences: modest criteria |
|
|
MCS loose |
Multi-species conserved sequences: loose criteria |
|
|
|
||
|
Clark et al. Genome Biology 2007 8:R180 doi:10.1186/gb-2007-8-9-r180 |
||