hg38 CCDS Gene
 

Consensus CDS Gene CCDS5234.1

GeneESR1
Descriptionestrogen receptor isoform 1
SequencesCDS,  protein,  genomic
CCDS database CCDS5234.1

Associated Sequences

 mRNAProtein
UCSC GenesENST00000206249.8 
ENST00000338799.9 
ENST00000440973.5 
ENST00000443427.5 
RefSeqNM_000125.3 NP_000116.2
NM_001122740.1 NP_001116212.1
NM_001122741.1 NP_001116213.1
NM_001122742.1 NP_001116214.1
XM_011535543.2 XP_011533845.1
XM_011535544.2 XP_011533846.1
XM_011535545.2 XP_011533847.1
XM_017010376.1 XP_016865865.1
XM_017010377.1 XP_016865866.1
XM_017010378.1 XP_016865867.1
XM_017010379.1 XP_016865868.1
XM_017010380.1 XP_016865869.1
XM_017010381.1 XP_016865870.1
VegaOTTHUMT00000043308 OTTHUMP00000017718
OTTHUMT00000043309 OTTHUMP00000017719
EnsemblENST00000206249 ENSP00000206249
ENST00000338799 ENSP00000342630
ENST00000440973 ENSP00000405330
ENST00000443427 ENSP00000387500
MGCBC128574 
BC128573 

Note: mRNA and protein sequences in other gene collections may differ from the CCDS sequences.


RefSeq summary of CCDS5234.1

This gene encodes an estrogen receptor and ligand-activated transcription factor. The canonical protein contains an N-terminal ligand-independent transactivation domain, a central DNA binding domain, a hinge domain, and a C-terminal ligand-dependent transactivation domain. The protein localizes to the nucleus where it may form either a homodimer or a heterodimer with estrogen receptor 2. The protein encoded by this gene regulates the transcription of many estrogen-inducible genes that play a role in growth, metabolism, sexual development, gestation, and other reproductive functions and is expressed in many non-reproductive tissues. The receptor encoded by this gene plays a key role in breast cancer, endometrial cancer, and osteoporosis. This gene is reported to have dozens of transcript variants due to the use of alternate promoters and alternative splicing, however, the full-length nature of many of these variants remain uncertain. [provided by RefSeq, Jul 2020].


Data schema/format description and download

Go to CCDS track controls

Data last updated at UCSC: 2019-10-03

Description

This track shows human genome high-confidence gene annotations from the Consensus Coding Sequence (CCDS) project. This project is a collaborative effort to identify a core set of human protein-coding regions that are consistently annotated and of high quality. The long-term goal is to support convergence towards a standard set of gene annotations on the human genome.

Collaborators include:

For more information on the different gene tracks, see our Genes FAQ.

Methods

CDS annotations of the human genome were obtained from two sources: NCBI RefSeq and a union of the gene annotations from Ensembl and Vega, collectively known as Hinxton.

Genes with identical CDS genomic coordinates in both sets become CCDS candidates. The genes undergo a quality evaluation, which must be approved by all collaborators. The following criteria are currently used to assess each gene:

  • an initiating ATG (Exception: a non-ATG translation start codon is annotated if it has sufficient experimental support), a valid stop codon, and no in-frame stop codons (Exception: selenoproteins, which contain a TGA codon that is known to be translated to a selenocysteine instead of functioning as a stop codon)
  • ability to be translated from the genome reference sequence without frameshifts
  • recognizable splicing sites
  • no intersection with putative pseudogene predictions
  • supporting transcripts and protein homology
  • conservation evidence with other species

A unique CCDS ID is assigned to the CCDS, which links together all gene annotations with the same CDS. CCDS gene annotations are under continuous review, with periodic updates to this track.

Credits

This track was produced at UCSC from data downloaded from the CCDS project web site.

References

Hubbard T, Barker D, Birney E, Cameron G, Chen Y, Clark L, Cox T, Cuff J, Curwen V, Down T et al. The Ensembl genome database project. Nucleic Acids Res. 2002 Jan 1;30(1):38-41. PMID: 11752248; PMC: PMC99161

Pruitt KD, Harrow J, Harte RA, Wallin C, Diekhans M, Maglott DR, Searle S, Farrell CM, Loveland JE, Ruef BJ et al. The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes. Genome Res. 2009 Jul;19(7):1316-23. PMID: 19498102; PMC: PMC2704439

Pruitt KD, Tatusova T, Maglott DR. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2005 Jan 1;33(Database issue):D501-4. PMID: 15608248; PMC: PMC539979