Human Gene SPATA46 (ENST00000367935.10) Description and Page Index
  Description: Homo sapiens spermatogenesis associated 46 (SPATA46), mRNA. (from RefSeq NM_182581)
Gencode Transcript: ENST00000367935.10
Gencode Gene: ENSG00000171722.13
Transcript (Including UTRs)
   Position: hg38 chr1:162,373,203-162,376,854 Size: 3,652 Total Exon Count: 3 Strand: -
Coding Region
   Position: hg38 chr1:162,374,048-162,376,790 Size: 2,743 Coding Exon Count: 3 

Page IndexSequence and LinksUniProtKB CommentsRNA-Seq ExpressionRNA StructureProtein Structure
Other SpeciesGO AnnotationsmRNA DescriptionsOther NamesMethods
Data last updated: 2019-09-04

-  Sequence and Links to Tools and Databases
Genomic Sequence (chr1:162,373,203-162,376,854)mRNA (may differ from genome)Protein (261 aa)
Gene SorterGenome BrowserOther Species FASTATable SchemaBioGPSEnsembl
Entrez GeneExonPrimerGeneCardsHPRDLynxMGI
neXtProtOMIMPubMedStanford SOURCEUniProtKB

-  Comments and Description Text from UniProtKB
DESCRIPTION: RecName: Full=Uncharacterized protein C1orf111;
SEQUENCE CAUTION: Sequence=AAP20051.1; Type=Frameshift; Positions=224;

-  RNA-Seq Expression Data from GTEx (53 Tissues, 570 Donors)
  Highest median expression: 33.50 RPKM in Testis
Total median expression: 38.37 RPKM

View in GTEx track of Genome Browser    View at GTEx portal     View GTEx Body Map

-  mRNA Secondary Structure of 3' and 5' UTRs
RegionFold EnergyBasesEnergy/Base
Display As
5' UTR -17.2064-0.269 Picture PostScript Text
3' UTR -288.90845-0.342 Picture PostScript Text

The RNAfold program from the Vienna RNA Package is used to perform the secondary structure predictions and folding calculations. The estimated folding energy is in kcal/mol. The more negative the energy, the more secondary structure the RNA is likely to have.

-  Protein Domain and Structure Information
  ModBase Predicted Comparative 3D Structure on Q5T0L3
The pictures above may be empty if there is no ModBase structure for the protein. The ModBase structure frequently covers just a fragment of the protein. You may be asked to log onto ModBase the first time you click on the pictures. It is simplest after logging in to just click on the picture again to get to the specific info on that model.

-  Orthologous Genes in Other Species
  Orthologies between human, mouse, and rat are computed by taking the best BLASTP hit, and filtering out non-syntenic hits. For more distant species reciprocal-best BLASTP hits are used. Note that the absence of an ortholog in the table below may reflect incomplete annotations in the other species rather than a true absence of the orthologous gene.
MouseRatZebrafishD. melanogasterC. elegansS. cerevisiae
Genome BrowserGenome BrowserNo orthologNo orthologNo orthologNo ortholog
Gene Details     
Gene Sorter     
Protein SequenceProtein Sequence    

-  Gene Ontology (GO) Annotations with Structured Vocabulary
  Molecular Function:
GO:0005515 protein binding

Biological Process:
GO:0007283 spermatogenesis
GO:0007342 fusion of sperm to egg plasma membrane
GO:0030154 cell differentiation

Cellular Component:
GO:0005634 nucleus
GO:0016020 membrane
GO:0031965 nuclear membrane

-  Descriptions from all associated GenBank mRNAs
  BC032957 - Homo sapiens chromosome 1 open reading frame 111, mRNA (cDNA clone MGC:33651 IMAGE:4827863), complete cds.
CU689584 - Synthetic construct Homo sapiens gateway clone IMAGE:100020654 5' read C1orf111 mRNA.
HQ448371 - Synthetic construct Homo sapiens clone IMAGE:100071795; CCSB001921_01 chromosome 1 open reading frame 111 (C1orf111) gene, encodes complete protein.
KJ900574 - Synthetic construct Homo sapiens clone ccsbBroadEn_09968 C1orf111 gene, encodes complete protein.
AB463503 - Synthetic construct DNA, clone: pF1KB6331, Homo sapiens C1orf111 gene for chromosome 1 open reading frame 111, without stop codon, in Flexi system.
AY248900 - Homo sapiens HSD20 mRNA, complete cds.
JD469709 - Sequence 450733 from Patent EP1572962.

-  Other Names for This Gene
  Alternate Gene Symbols: C1orf111, CA111_HUMAN, HSD20, NM_182581, Q5T0L3, Q6X961, Q8NEC3, uc001gbx.1, uc001gbx.2, uc001gbx.3, uc001gbx.4
UCSC ID: uc001gbx.4
RefSeq Accession: NM_182581
Protein: Q5T0L3 (aka CA111_HUMAN)
CCDS: CCDS1238.1

-  Methods, Credits, and Use Restrictions
  Click here for details on how this gene model was made and data restrictions if any.