FANTOM5 summary Tracks
 
FANTOM5 summary tracks

Display mode:   

 All
TSS peaks  FANTOM5 DPI peak, robust set  Source data version: FANTOM5 reprocessed7
Total counts  Total counts of CAGE reads  Source data version: FANTOM5 reprocessed7
Max counts  Max counts of CAGE reads  Source data version: FANTOM5 reprocessed7
Source data version: FANTOM5 reprocessed7
Assembly: Human Dec. 2013 (GRCh38/hg38)

Description - FANTOM5

Regulated transcription controls the diversity, developmental pathways, and spatial organization of the hundreds of cell types that make up a mammal. In FANTOM5 (https://fantom.gsc.riken.jp/5/ ), we mapped transcription start sites (TSS) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of gene expression across the human body by using single molecule sequencing. This datahub is set up to provide the TSS activities in individual biological states and the identified regions. Please visit the web site for further data exploration and with details.

Methods

Protocol

Individual biological states are profiled by HeliScopeCAGE, which is a variation of CAGE (Cap Analysis Gene Expression) protocol based on a single molecule sequencer. The standard protocol requiring 5 µg of total RNA as a starting material is referred as hCAGE, and an optimized version for lower quantity (~ 100 ng) is referred as LQhCAGE (Kanamori-Katyama et al. 2011).

  • hCAGE
  • LQhCAGE

Samples

We mapped transcription start sites (TSSs) and their usage in human and mouse primary cells, cell lines and tissues to produce a comprehensive overview of mammalian gene expression across the human body. 5′-end of the mapped CAGE reads are counted at a single base pair resolution (CTSS, CAGE tag starting sites) on the genomic coordinates, which represent TSS activities in the sample. Individual samples shown in "TSS activity" tracks are grouped as below.

  • Primary cell
  • Tissue
  • Cell Line
  • Time course
  • Fractionation

TSS peaks

TSS (CAGE) peaks across the panel of the biological states (samples) are identified by DPI (decomposition based peak identification, Forrest et al 2014), where each of the peaks consists of neighboring and related TSSs. The peaks are used as anchor to define promoters and as unit of promoter-level expression analysis. Two subsets of the peaks are defined based on evidence of read counts, depending on scopes of subsequent analyses, and the first subset (referred as robust set of the peaks, thresholded for expression analysis is shown as TSS peaks. They are named as "p#@GENE_SYMBOL" if associated with 5'-end of known genes, or "p@CHROM:START..END,STRAND" otherwise. The summary tracks consist of the TSS (CAGE) peaks, and summary profiles of TSS activities (total and maximum values). The summary track consists of the following tracks.

  • TSS (CAGE) peaks
    • the robust peaks
  • TSS summary profiles
    • Total counts and TPM (tags per million) in all the samples
    • Maximum counts and TPM among the samples

Credits

Please send us any questions regarding to this trackHub and underlying data.

References

  • The FANTOM5 project
  • The FANTOM5 reprocessed data
  • Gateways to the FANTOM5 promoter level mammalian expression atlas. Lizio M, Harshbarger J, Shimoji H, Severin J, Kasukawa T, Sahin S, Abugessaisa I, Fukuda S, Hori F, Ishikawa-Kato S, et al. Genome Biol. 2015 Jan 5;16:22. doi: 10.1186/s13059-014-0560-6. [PMID: 25723102]
  • Unamplified cap analysis of gene expression on a single-molecule sequencer. Kanamori-Katayama M1, Itoh M, Kawaji H, Lassmann T, Katayama S, Kojima M, Bertin N, Kaiho A, Ninomiya N, Daub CO, Carninci P, Forrest AR, Hayashizaki Y.Genome Res. 2011 Jul;21(7):1150-9. [PMID: 21596820]
  • A promoter-level mammalian expression atlas. Forrest ARR, Kawaji H, Rehli M, Baillie JK, de Hoon MJL, Haberle V, Lassmann T, Kulakovskiy IV, Lizio M, Itoh M et al. Nature. 2014 Mar 27;507(7493):462-70. [PMID: 24670764]