Schema for Affy RNA Loc - ENCODE Affymetrix/CSHL Subcellular RNA Localization by Tiling Array

JavaScript is disabled in your web browser

You must have JavaScript enabled in your web browser to use the Genome Browser

Database: hg18 Primary Table: wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya Row Count: 1,951,409 Data last updated: 2009-07-17
Format description: Wiggle track values to display as y-values (first 6 fields are bed6)
On download server: MariaDB table dump directory

field	example	SQL type	description
`bin`	585	`smallint(5) unsigned`	Indexing field to speed chromosome range queries.
`chrom`	chr1	`varchar(255)`	Reference sequence chromosome or scaffold
`chromStart`	2918	`int(10) unsigned`	Start position in chromosome
`chromEnd`	3839	`int(10) unsigned`	End position in chromosome
`name`	chr1.0	`varchar(255)`	Name of item
`span`	1	`int(10) unsigned`	each value spans this many bases
`count`	921	`int(10) unsigned`	number of values in this block
`offset`	0	`int(10) unsigned`	offset in File to fetch data
`file`	/gbdb/hg18/wib/wgEncodeAffy...	`varchar(255)`	path name to data file, one byte per value
`lowerLimit`	-584	`double`	lowest data value in this block
`dataRange`	1033	`double`	lowerLimit + dataRange = upperLimit
`validCount`	14	`int(10) unsigned`	number of valid data values in this block
`sumData`	-622	`double`	sum of the data points, for average and stddev calc
`sumSquares`	2713990	`double`	sum of data points squared, for stddev calc

Sample Rows

bin	chrom	chromStart	chromEnd	name	span	count	offset	file	lowerLimit	dataRange	validCount	sumData	sumSquares
585	chr1	2918	3839	chr1.0	1	921	0	/gbdb/hg18/wib/wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya.wib	-584	1033	14	-622	2713990
585	chr1	4779	5769	chr1.1	1	990	921	/gbdb/hg18/wib/wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya.wib	-455.5	930.5	10	97.5	2165530
585	chr1	5941	6654	chr1.2	1	713	1911	/gbdb/hg18/wib/wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya.wib	11	446.75	20	3941.5	1318960
585	chr1	7333	7354	chr1.3	1	21	2624	/gbdb/hg18/wib/wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya.wib	1574	0	5	7870	12387400
585	chr1	8689	9638	chr1.4	1	949	2645	/gbdb/hg18/wib/wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya.wib	531.5	950.5	10	10067.5	12394100
585	chr1	19236	19303	chr1.5	1	67	3594	/gbdb/hg18/wib/wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya.wib	-3793	18589.9	9	32955.7	751990000
585	chr1	28070	28092	chr1.6	1	22	3661	/gbdb/hg18/wib/wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya.wib	156.5	0	5	782.5	122461
585	chr1	36955	37151	chr1.7	1	196	3683	/gbdb/hg18/wib/wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya.wib	80	104.75	8	954.25	134398
585	chr1	38022	38033	chr1.8	1	11	3879	/gbdb/hg18/wib/wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya.wib	12.5	0	3	37.5	468.75
585	chr1	41315	41920	chr1.9	1	605	3890	/gbdb/hg18/wib/wgEncodeAffyRnaChipFiltRawSignalHepg2NucleusLongnonpolya.wib	-10	64.25	21	251.25	13086.9

Note: all start coordinates in our database are 0-based, not 1-based. See explanation here.

Affy RNA Loc (wgEncodeAffyRnaChip) Track Description


	Description This track is produced as part of the ENCODE Transcriptome Project. Transcription of different RNA extracts from different sub-cellular localizations in different cell lines is compared in companion experiments using three different technologies: tiling arrays, RNA-seq using Solexa, and RNA-seq using SOLiD. The tiling array data are shown in this track. The Raw Signal view shows an estimate of abundance of RNA molecules and the Transfrags view shows the locations of sites corresponding to these molecules. Display Conventions and Configuration To show only selected subtracks, uncheck the boxes next to the tracks that you wish to hide. Color differences among the views are arbitrary. They provide a visual cue for distinguishing between the different cell types and compartments. Transfrags The Transfrags view includes all transfrags before filtering. Filtered Transfrags The Filtered Transfrags view excludes repeats and other known annotations including: tRNAs and rRNAs, mi/snoRNAs, things mapping to the mitochondrial or Y chromosomes, and many predicted snoRNAs and miRNAs. Raw Signal The Raw Signal view shows the probe intensity based on all probes. Filtered Signal The Filtered Signal view shows the probe intensity based on the filtered set of probes. Regions with negative signal value are areas where the mismatch probe hybridizes better than the match probe. This could indicate presence of a SNP, a site of RNA editing, or sequencing error. Methods Cells were grown according to the approved ENCODE cell culture protocols. RNA molecules longer than 200 nt and present in RNA population isolated from different subcellular compartments (such as cytosol, nucleus, polysomes and others) were fractionated into polyA+ and polyA- fractions as described in these protocols. Each RNA fraction was converted into double-stranded cDNA using random hexamers, labeled and hybridized to a tiling 91-array set containing probes against the non-repetitive portion of the human genome tiled on average every 5 bp (center-to-center of each consecutive 25-mers). All arrays were scaled to a median array intensity of 330. Within a sliding 61 bp window centered on each probe, an estimate of RNA abundance (signal; see the Raw Signal view) was found by calculating the median of all pairwise average PM-MM values, where PM is a perfect match and MM is a mismatch. Kapranov et al. (2002), Cheng et al. (2005) , Kapranov et al. (2007), and Cawley et al. (2004) are good references for the experimental methods. Cawley et al. also describes the analytical methods. Verification The reproducibility of the labeling method was assessed separately. Three independent technical replicates were generated from the same RNA pool for each RNA preparation and hybridized to duplicate arrays (two technical replicates) that contain the ENCODE regions. Labeled RNA samples were then pooled and hybridized to the tiling 91-array set spanning the whole genome. Transcribed regions (transfrags; see the Transfrags view) were generated from the Raw Signal by merging genomic positions to which probes are mapped. This merging was based on a 5% false positive rate cutoff in negative bacterial controls, a maximum gap (MaxGap) of 40 base-pairs and minimum run (MinRun) of 40 base-pairs. Credits These data were generated and analyzed by the transcriptome group at Affymetrix and Cold Spring Harbor Laboratories: P. Kapranov, I. Bell, E. Dumais, J. Drenkow, J. Dumais, N. Garg, M. Lubinsky, Carrie A. Davis, Huaien Wang, Kimberly Bell, Jorg Drenkow, Chris Zaleski, and Thomas R. Gingeras. Contact: gingeras@cshl. edu (Tom Gingeras) References Cawley S, Bekiranov S, Ng HH, Kapranov P, Sekinger EA, Kampa D, Piccolboni A, Sementchenko V, Cheng J, Williams AJ et al. Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs. Cell. 2004 Feb 20;116(4):499-509. Cheng J, Kapranov P, Drenkow J, Dike S, Brubaker S, Patel S, Long J, Stern D, Tammana H, Helt G et al. Transcriptional Maps of 10 Human Chromosomes at 5 Nucleotide Resolution. Science. 2005 May 20;308(5725):1149-1154. Kapranov P, Cawley SE, Drenkow J, Bekiranov S, Strausberg RL, Fodor SP, Gingeras TR. Large-scale transcriptional activity in chromosomes 21 and 22. Science. 2002 May 3;296(5569):916-9. Kapranov P, Cheng J, Dike S, Nix DA, Duttagupta R, Willingham AT, Stadler PF, Hertel J, Hackermüller J, Hofacker IL, et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science. 2007 Jun 8;316(5830):1484-8. Data Release Policy Data users may freely use ENCODE data, but may not, without prior consent, submit publications that use an unpublished ENCODE dataset until nine months following the release of the dataset. This date is listed in the Restricted Until column, above. The full data release policy for ENCODE is available here.

Description

Display Conventions and Configuration

Methods

Verification

Credits

References

Data Release Policy