Human Gene CD209 (uc002mht.2)
  Description: Homo sapiens CD209 molecule (CD209), transcript variant 1, mRNA.
RefSeq Summary (NM_021155): This gene encodes a C-type lectin that functions in cell adhesion and pathogen recognition. This receptor recognizes a wide range of evolutionarily divergent pathogens with a large impact on public health, including leprosy and tuberculosis mycobacteria, the Ebola, hepatitis C, HIV-1 and Dengue viruses, and the SARS-CoV acute respiratory syndrome coronavirus. The protein is organized into four distinct domains: a C-terminal carbohydrate recognition domain, a flexible tandem-repeat neck domain, a transmembrane region and an N-terminal cytoplasmic domain involved in internalization. This gene is closely related in terms of both sequence and function to a neighboring gene, CLEC4M (Gene ID: 10332), also known as L-SIGN. The two genes differ in viral recognition and expression patterns, with this gene showing high expression on the surface of dendritic cells. Polymorphisms in the neck region are associated with protection from HIV-1 infection, while single nucleotide polymorphisms in the promoter of this gene are associated with differing resistance and susceptibility to and severity of infectious disease, including rs4804803, which is associated with SARS severity. [provided by RefSeq, May 2020].
Transcript (Including UTRs)
   Position: hg19 chr19:7,804,881-7,812,464 Size: 7,584 Total Exon Count: 7 Strand: -
Coding Region
   Position: hg19 chr19:7,807,925-7,812,397 Size: 4,473 Coding Exon Count: 7 

Page IndexSequence and LinksUniProtKB CommentsPrimersGenetic AssociationsMalaCards
CTDGene AllelesRNA-Seq ExpressionMicroarray ExpressionRNA StructureProtein Structure
Other SpeciesGO AnnotationsmRNA DescriptionsPathwaysOther NamesModel Information
Methods
Data last updated at UCSC: 2013-06-14

-  Sequence and Links to Tools and Databases
 
Genomic Sequence (chr19:7,804,881-7,812,464)mRNA (may differ from genome)Protein (404 aa)
Gene SorterGenome BrowserOther Species FASTAVisiGeneGene interactionsTable Schema
AlphaFoldBioGPSEnsemblEntrez GeneExonPrimerGeneCards
GeneNetworkHGNCHPRDLynxMalacardsMGI
neXtProtOMIMPubMedReactomeTreefamUniProtKB
WikipediaBioGrid CRISPR DB

-  Comments and Description Text from UniProtKB
  ID: CD209_HUMAN
DESCRIPTION: RecName: Full=CD209 antigen; AltName: Full=C-type lectin domain family 4 member L; AltName: Full=Dendritic cell-specific ICAM-3-grabbing non-integrin 1; Short=DC-SIGN; Short=DC-SIGN1; AltName: CD_antigen=CD209;
FUNCTION: Pathogen-recognition receptor expressed on the surface of immature dendritic cells (DCs) and involved in initiation of primary immune response. Thought to mediate the endocytosis of pathogens which are subsequently degraded in lysosomal compartments. The receptor returns to the cell membrane surface and the pathogen-derived antigens are presented to resting T-cells via MHC class II proteins to initiate the adaptive immune response. Probably recognizes in a calcium-dependent manner high mannose N-linked oligosaccharides in a variety of pathogen antigens, including HIV-1 gp120, HIV-2 gp120, SIV gp120, ebolavirus glycoproteins, cytomegalovirus gB, HCV E2, dengue virus gE, Leishmania pifanoi LPG, Lewis-x antigen in Helicobacter pylori LPS, mannose in Klebsiella pneumonae LPS, di-mannose and tri- mannose in Mycobacterium tuberculosis ManLAM and Lewis-x antigen in Schistosoma mansoni SEA.
FUNCTION: On DCs it is a high affinity receptor for ICAM2 and ICAM3 by binding to mannose-like carbohydrates. May act as a DC rolling receptor that mediates transendothelial migration of DC presursors from blood to tissues by binding endothelial ICAM2. Seems to regulate DC-induced T-cell proliferation by binding to ICAM3 on T-cells in the immunological synapse formed between DC and T-cells.
SUBUNIT: Homotetramer. Binds to many viral surface glycoproteins such as HIV-1 gp120, HIV-2 gp120, SIV gp120, ebolavirus envelope glycoproteins, cytomegalovirus gB, HCV E2 and dengue virus major envelope protein E.
SUBCELLULAR LOCATION: Isoform 1: Cell membrane; Single-pass type II membrane protein (Probable).
SUBCELLULAR LOCATION: Isoform 2: Cell membrane; Single-pass type II membrane protein (Probable).
SUBCELLULAR LOCATION: Isoform 3: Cell membrane; Single-pass type II membrane protein (Probable).
SUBCELLULAR LOCATION: Isoform 4: Cell membrane; Single-pass type II membrane protein (Probable).
SUBCELLULAR LOCATION: Isoform 5: Cell membrane; Single-pass type II membrane protein (Probable).
SUBCELLULAR LOCATION: Isoform 6: Secreted (Probable).
SUBCELLULAR LOCATION: Isoform 7: Secreted (Probable).
SUBCELLULAR LOCATION: Isoform 8: Secreted (Probable).
SUBCELLULAR LOCATION: Isoform 9: Secreted (Probable).
SUBCELLULAR LOCATION: Isoform 10: Secreted (Probable).
SUBCELLULAR LOCATION: Isoform 11: Secreted (Probable).
SUBCELLULAR LOCATION: Isoform 12: Secreted (Probable).
TISSUE SPECIFICITY: Predominantly expressed in dendritic cells and in DC-residing tissues. Also found in placental macrophages, endothelial cells of placental vascular channels, peripheral blood mononuclear cells, and THP-1 monocytes.
DOMAIN: The tandem repeat domain, also called neck domain, mediates oligomerization.
POLYMORPHISM: Genetic variations in CD209 determine Mycobacterium tuberculosis susceptibility [MIM:607948].
POLYMORPHISM: Genetic variations in CD209 may influence susceptibility or resistance to dengue virus infection, as well as disease progression and severity [MIM:614371]. A promoter polymorphism in the CD209 gene is associated with protection from dengue fever, but not dengue hemorrhagic fever.
MISCELLANEOUS: In vitro, is a receptor for HIV-1 and transmits HIV-1 either in trans without DC infection, or in cis following a DC infection to permissive T-cells to induce a robust infection. Bound HIV-1 remains infectious over a prolonged period of time and it is proposed that bound HIV-1 is not degraded but protected in non-lysosomal acidic organelles within the DCs close to the cell membrane thus contributing to the HIV-1 infectious potential during transport by DCs from the periphery to lymphoid organs.
SIMILARITY: Contains 1 C-type lectin domain.
SEQUENCE CAUTION: Sequence=AAK91858.1; Type=Miscellaneous discrepancy; Note=Aberrant splicing;
WEB RESOURCE: Name=Wikipedia; Note=DC-SIGN entry; URL="http://en.wikipedia.org/wiki/DC-SIGN";
WEB RESOURCE: Name=Functional Glycomics Gateway - Glycan Binding; Note=DC-SIGN; URL="http://www.functionalglycomics.org/glycomics/GBPServlet?&operationType=view&cbpId=cbp_hum_Ctlect_00121";

-  Primer design for this transcript
 

Primer3Plus can design qPCR Primers that straddle exon-exon-junctions, which amplify only cDNA, not genomic DNA.
Click here to load the transcript sequence and exon structure into Primer3Plus

Exonprimer can design one pair of Sanger sequencing primers around every exon, located in non-genic sequence.
Click here to open Exonprimer with this transcript

To design primers for a non-coding sequence, zoom to a region of interest and select from the drop-down menu: View > In External Tools > Primer3


-  Genetic Association Studies of Complex Diseases and Disorders
  Genetic Association Database (archive): CD209
CDC HuGE Published Literature: CD209
Positive Disease Associations: dengue disease , HIV
Related Studies:
  1. dengue disease
    Sakuntabhai, A. et al. 2005, A variant in the CD209 promoter is associated with severity of dengue disease., Nature genetics. 2005 May;37(5):507-13. [PubMed 15838506]
    These results indicate that CD209 has a crucial role in dengue pathogenesis, which discriminates between severe dengue fever and dengue hemorrhagic fever. This may have consequences for therapeutic and preventive strategies.
  2. HIV
    Martin, M. P. et al. 2004, Association of DC-SIGN promoter polymorphism with increased risk for parenteral, but not mucosal, acquisition of human immunodeficiency virus type 1 infection., Journal of virology. 2004 Dec;78(24):14053-6. [PubMed 15564514]
    A potential role for DC-SIGN specific to systemic acquisition and dissemination of infection is suggested.

-  MalaCards Disease Associations
  MalaCards Gene Search: CD209
Diseases sorted by gene-association score: dengue fever, protection against* (550), hiv-1* (428), japanese encephalitis (37), tuberculosis (34), dengue hemorrhagic fever (34), rift valley fever (34), dengue disease (31), measles (28), hemorrhagic fever (28), west nile virus (19), encephalitis (18), secondary syphilis (12), severe acute respiratory syndrome (11), common wart (11), tick-borne encephalitis (10), skin sarcoma (8), marburg hemorrhagic fever (8), lymphadenitis (7), herpes simplex (7), asymptomatic dengue (6), syphilis (5), inflammatory bowel disease 6 (4), papilloma (3), leprosy (3), viral infectious disease (2)
* = Manually curated disease association

-  Comparative Toxicogenomics Database (CTD)
  The following chemicals interact with this gene

+  Common Gene Haplotype Alleles
  Press "+" in the title bar above to open this section.

-  RNA-Seq Expression Data from GTEx (53 Tissues, 570 Donors)
  Highest median expression: 6.39 RPKM in Adipose - Visceral (Omentum)
Total median expression: 53.61 RPKM



View in GTEx track of Genome Browser    View at GTEx portal     View GTEx Body Map

+  Microarray Expression Data
  Press "+" in the title bar above to open this section.

-  mRNA Secondary Structure of 3' and 5' UTRs
 
RegionFold EnergyBasesEnergy/Base
Display As
5' UTR -19.6067-0.293 Picture PostScript Text
3' UTR -950.343044-0.312 Picture PostScript Text

The RNAfold program from the Vienna RNA Package is used to perform the secondary structure predictions and folding calculations. The estimated folding energy is in kcal/mol. The more negative the energy, the more secondary structure the RNA is likely to have.

-  Protein Domain and Structure Information
  InterPro Domains: Graphical view of domain structure
IPR001304 - C-type_lectin
IPR016186 - C-type_lectin-like
IPR018378 - C-type_lectin_CS
IPR016187 - C-type_lectin_fold

Pfam Domains:
PF00059 - Lectin C-type domain

SCOP Domains:
56436 - C-type lectin-like

Protein Data Bank (PDB) 3-D Structure
MuPIT help
1K9I - X-ray MuPIT 1SL4 - X-ray MuPIT 1SL5 - X-ray MuPIT 2B6B - EM MuPIT 2IT5 - X-ray MuPIT 2IT6 - X-ray MuPIT 2XR5 - X-ray MuPIT 2XR6 - X-ray MuPIT


ModBase Predicted Comparative 3D Structure on Q9NNX6
FrontTopSide
The pictures above may be empty if there is no ModBase structure for the protein. The ModBase structure frequently covers just a fragment of the protein. You may be asked to log onto ModBase the first time you click on the pictures. It is simplest after logging in to just click on the picture again to get to the specific info on that model.

-  Orthologous Genes in Other Species
  Orthologies between human, mouse, and rat are computed by taking the best BLASTP hit, and filtering out non-syntenic hits. For more distant species reciprocal-best BLASTP hits are used. Note that the absence of an ortholog in the table below may reflect incomplete annotations in the other species rather than a true absence of the orthologous gene.
MouseRatZebrafishD. melanogasterC. elegansS. cerevisiae
No orthologGenome BrowserNo orthologNo orthologNo orthologNo ortholog
 Gene Details    
 Gene Sorter    
 RGD    
 Protein Sequence    
 Alignment    

-  Gene Ontology (GO) Annotations with Structured Vocabulary
  Molecular Function:
GO:0001618 virus receptor activity
GO:0005515 protein binding
GO:0005537 mannose binding
GO:0030246 carbohydrate binding
GO:0042605 peptide antigen binding
GO:0046790 virion binding
GO:0046872 metal ion binding

Biological Process:
GO:0002223 stimulatory C-type lectin receptor signaling pathway
GO:0002250 adaptive immune response
GO:0002376 immune system process
GO:0006897 endocytosis
GO:0007155 cell adhesion
GO:0007157 heterophilic cell-cell adhesion via plasma membrane cell adhesion molecules
GO:0007159 leukocyte cell-cell adhesion
GO:0009988 cell-cell recognition
GO:0016032 viral process
GO:0019048 modulation by virus of host morphology or physiology
GO:0019062 virion attachment to host cell
GO:0019079 viral genome replication
GO:0019882 antigen processing and presentation
GO:0035556 intracellular signal transduction
GO:0042102 positive regulation of T cell proliferation
GO:0042129 regulation of T cell proliferation
GO:0045087 innate immune response
GO:0046718 viral entry into host cell
GO:0046968 peptide antigen transport
GO:0075733 intracellular transport of virus
GO:0097323 B cell adhesion

Cellular Component:
GO:0005576 extracellular region
GO:0005737 cytoplasm
GO:0005886 plasma membrane
GO:0009897 external side of plasma membrane
GO:0009986 cell surface
GO:0016020 membrane
GO:0016021 integral component of membrane
GO:0043657 host cell


-  Descriptions from all associated GenBank mRNAs
  AF290886 - Homo sapiens DC-SIGN mRNA, complete cds.
AK057212 - Homo sapiens cDNA FLJ32650 fis, clone SYNOV2001468.
AK293089 - Homo sapiens cDNA FLJ78046 complete cds, highly similar to Homo sapiens CD209 antigen (CD209), mRNA.
JD066926 - Sequence 47950 from Patent EP1572962.
JD063941 - Sequence 44965 from Patent EP1572962.
JD401241 - Sequence 382265 from Patent EP1572962.
JD285645 - Sequence 266669 from Patent EP1572962.
JD398575 - Sequence 379599 from Patent EP1572962.
JD040824 - Sequence 21848 from Patent EP1572962.
JD258710 - Sequence 239734 from Patent EP1572962.
JD259422 - Sequence 240446 from Patent EP1572962.
JD528092 - Sequence 509116 from Patent EP1572962.
JD547810 - Sequence 528834 from Patent EP1572962.
JD556679 - Sequence 537703 from Patent EP1572962.
JD148623 - Sequence 129647 from Patent EP1572962.
JD250973 - Sequence 231997 from Patent EP1572962.
JD379868 - Sequence 360892 from Patent EP1572962.
JD368121 - Sequence 349145 from Patent EP1572962.
JD349469 - Sequence 330493 from Patent EP1572962.
JD164009 - Sequence 145033 from Patent EP1572962.
JD405644 - Sequence 386668 from Patent EP1572962.
JD421063 - Sequence 402087 from Patent EP1572962.
JD505541 - Sequence 486565 from Patent EP1572962.
JD356252 - Sequence 337276 from Patent EP1572962.
JD246168 - Sequence 227192 from Patent EP1572962.
JD331009 - Sequence 312033 from Patent EP1572962.
JD545489 - Sequence 526513 from Patent EP1572962.
JD352269 - Sequence 333293 from Patent EP1572962.
JD542023 - Sequence 523047 from Patent EP1572962.
JD527704 - Sequence 508728 from Patent EP1572962.
JD488413 - Sequence 469437 from Patent EP1572962.
JD198439 - Sequence 179463 from Patent EP1572962.
JD527703 - Sequence 508727 from Patent EP1572962.
JD546050 - Sequence 527074 from Patent EP1572962.
JD140627 - Sequence 121651 from Patent EP1572962.
JD198438 - Sequence 179462 from Patent EP1572962.
JD272061 - Sequence 253085 from Patent EP1572962.
JD247459 - Sequence 228483 from Patent EP1572962.
JD325923 - Sequence 306947 from Patent EP1572962.
JD367232 - Sequence 348256 from Patent EP1572962.
JD140628 - Sequence 121652 from Patent EP1572962.
JD198440 - Sequence 179464 from Patent EP1572962.
JD523606 - Sequence 504630 from Patent EP1572962.
JD559320 - Sequence 540344 from Patent EP1572962.
JD140629 - Sequence 121653 from Patent EP1572962.
JD523607 - Sequence 504631 from Patent EP1572962.
JD153217 - Sequence 134241 from Patent EP1572962.
JD527702 - Sequence 508726 from Patent EP1572962.
JD527699 - Sequence 508723 from Patent EP1572962.
JD153216 - Sequence 134240 from Patent EP1572962.
JD153215 - Sequence 134239 from Patent EP1572962.
JD038247 - Sequence 19271 from Patent EP1572962.
JD312742 - Sequence 293766 from Patent EP1572962.
JD362847 - Sequence 343871 from Patent EP1572962.
JD483118 - Sequence 464142 from Patent EP1572962.
JD093654 - Sequence 74678 from Patent EP1572962.
JD416172 - Sequence 397196 from Patent EP1572962.
JD122571 - Sequence 103595 from Patent EP1572962.
JD257819 - Sequence 238843 from Patent EP1572962.
JD241508 - Sequence 222532 from Patent EP1572962.
JD531892 - Sequence 512916 from Patent EP1572962.
JD487053 - Sequence 468077 from Patent EP1572962.
JD137940 - Sequence 118964 from Patent EP1572962.
JD051895 - Sequence 32919 from Patent EP1572962.
JD419045 - Sequence 400069 from Patent EP1572962.
JD251825 - Sequence 232849 from Patent EP1572962.
JD460118 - Sequence 441142 from Patent EP1572962.
JD142458 - Sequence 123482 from Patent EP1572962.
JD256504 - Sequence 237528 from Patent EP1572962.
JD264547 - Sequence 245571 from Patent EP1572962.
JD342366 - Sequence 323390 from Patent EP1572962.
JD071705 - Sequence 52729 from Patent EP1572962.
JD131526 - Sequence 112550 from Patent EP1572962.
JD213754 - Sequence 194778 from Patent EP1572962.
JD120054 - Sequence 101078 from Patent EP1572962.
JD363601 - Sequence 344625 from Patent EP1572962.
JD268330 - Sequence 249354 from Patent EP1572962.
JD523546 - Sequence 504570 from Patent EP1572962.
JD296048 - Sequence 277072 from Patent EP1572962.
JD262397 - Sequence 243421 from Patent EP1572962.
JD154264 - Sequence 135288 from Patent EP1572962.
JD522522 - Sequence 503546 from Patent EP1572962.
JD355761 - Sequence 336785 from Patent EP1572962.
JD086895 - Sequence 67919 from Patent EP1572962.
AK304190 - Homo sapiens cDNA FLJ61677 complete cds, highly similar to CD209 antigen.
JD365402 - Sequence 346426 from Patent EP1572962.
JD053745 - Sequence 34769 from Patent EP1572962.
JD250722 - Sequence 231746 from Patent EP1572962.
JD535830 - Sequence 516854 from Patent EP1572962.
AK308595 - Homo sapiens cDNA, FLJ98636.
JD112872 - Sequence 93896 from Patent EP1572962.
BC110615 - Homo sapiens CD209 molecule, mRNA (cDNA clone MGC:129965 IMAGE:40032459), complete cds.
JD276142 - Sequence 257166 from Patent EP1572962.
JD342062 - Sequence 323086 from Patent EP1572962.
JD109543 - Sequence 90567 from Patent EP1572962.
M98457 - Homo sapiens membrane-associated lectin type-C mRNA, complete cds.
JD237136 - Sequence 218160 from Patent EP1572962.
JD432993 - Sequence 414017 from Patent EP1572962.
AY042233 - Homo sapiens sDC-SIGN1B type IV isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042221 - Homo sapiens mDC-SIGN1A type I isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042222 - Homo sapiens mDC-SIGN1A type II isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042223 - Homo sapiens mDC-SIGN1A type III isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042224 - Homo sapiens mDC-SIGN1A type IV isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042225 - Homo sapiens sDC-SIGN1A type I isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042226 - Homo sapiens sDC-SIGN1A TYPE II isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042227 - Homo sapiens sDC-SIGN1A type III isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042229 - Homo sapiens mDC-SIGN1B type I isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042230 - Homo sapiens sDC-SIGN1B type I isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042231 - Homo sapiens sDC-SIGN1B type II isoform (CD209) mRNA, complete cds, alternatively spliced.
AK313585 - Homo sapiens cDNA, FLJ94149, highly similar to Homo sapiens CD209 antigen (CD209), mRNA.
KJ902527 - Synthetic construct Homo sapiens clone ccsbBroadEn_11921 CD209 gene, encodes complete protein.
AB527563 - Synthetic construct DNA, clone: pF1KB6631, Homo sapiens CD209 gene for CD209 antigen, without stop codon, in Flexi system.
AY042232 - Homo sapiens sDC-SIGN1B type III isoform (CD209) mRNA, complete cds, alternatively spliced.
AY042228 - Homo sapiens sDC-SIGN1A type IV isoform (CD209) mRNA, complete cds, alternatively spliced.

-  Biochemical and Signaling Pathways
  Reactome (by CSHL, EBI, and GO)

Protein Q9NNX6 (Reactome details) participates in the following event(s):

R-HSA-5621571 CD209 binds ICAM2
R-HSA-5621606 CD209 binds ManLAM
R-HSA-5621615 CD209 binds ICAM3
R-HSA-8851018 BTN2A1 binds DC-SIGN
R-HSA-5621575 CD209 (DC-SIGN) signaling
R-HSA-8851680 Butyrophilin (BTN) family interactions
R-HSA-5621481 C-type lectin receptors (CLRs)
R-HSA-1280218 Adaptive Immune System
R-HSA-168249 Innate Immune System
R-HSA-168256 Immune System

-  Other Names for This Gene
  Alternate Gene Symbols: A8KAM4, A8MVQ9, CD209_HUMAN, CLEC4L, NM_021155, NP_066978, Q2TB19, Q96QP7, Q96QP8, Q96QP9, Q96QQ0, Q96QQ1, Q96QQ2, Q96QQ3, Q96QQ4, Q96QQ5, Q96QQ6, Q96QQ7, Q96QQ8, Q9NNX6
UCSC ID: uc002mht.2
RefSeq Accession: NM_021155
Protein: Q9NNX6 (aka CD209_HUMAN or C209_HUMAN)
CCDS: CCDS12186.1, CCDS45951.1

-  Gene Model Information
 
category: coding nonsense-mediated-decay: no RNA accession: NM_021155.3
exon count: 7CDS single in 3' UTR: no RNA size: 4328
ORF size: 1215CDS single in intron: no Alignment % ID: 100.00
txCdsPredict score: 1823.00frame shift in genome: no % Coverage: 99.95
has start codon: yes stop codon in genome: no # of Alignments: 1
has end codon: yes retained intron: no # AT/AC introns 0
selenocysteine: no end bleed into intron: 0# strange splices: 0
Click here for a detailed description of the fields of the table above.

-  Methods, Credits, and Use Restrictions
  Click here for details on how this gene model was made and data restrictions if any.