Human Gene HSPG2 (uc001bfj.3)
  Description: Homo sapiens heparan sulfate proteoglycan 2 (HSPG2), mRNA.
RefSeq Summary (NM_005529): This gene encodes the perlecan protein, which consists of a core protein to which three long chains of glycosaminoglycans (heparan sulfate or chondroitin sulfate) are attached. The perlecan protein is a large multidomain proteoglycan that binds to and cross-links many extracellular matrix components and cell-surface molecules. It has been shown that this protein interacts with laminin, prolargin, collagen type IV, FGFBP1, FBLN2, FGF7 and transthyretin, etc., and it plays essential roles in multiple biological activities. Perlecan is a key component of the vascular extracellular matrix, where it helps to maintain the endothelial barrier function. It is a potent inhibitor of smooth muscle cell proliferation and is thus thought to help maintain vascular homeostasis. It can also promote growth factor (e.g., FGF2) activity and thus stimulate endothelial growth and re-generation. It is a major component of basement membranes, where it is involved in the stabilization of other molecules as well as being involved with glomerular permeability to macromolecules and cell adhesion. Mutations in this gene cause Schwartz-Jampel syndrome type 1, Silverman-Handmaker type of dyssegmental dysplasia, and tardive dyskinesia. Alternative splicing of this gene results in multiple transcript variants. [provided by RefSeq, May 2014].
Transcript (Including UTRs)
   Position: hg19 chr1:22,148,737-22,263,750 Size: 115,014 Total Exon Count: 97 Strand: -
Coding Region
   Position: hg19 chr1:22,149,809-22,263,710 Size: 113,902 Coding Exon Count: 97 

Page IndexSequence and LinksUniProtKB CommentsPrimersGenetic AssociationsMalaCards
CTDGene AllelesRNA-Seq ExpressionMicroarray ExpressionRNA StructureProtein Structure
Other SpeciesGO AnnotationsmRNA DescriptionsPathwaysOther NamesModel Information
Methods
Data last updated at UCSC: 2013-06-14

-  Sequence and Links to Tools and Databases
 
Genomic Sequence (chr1:22,148,737-22,263,750)mRNA (may differ from genome)Protein (4391 aa)
Gene SorterGenome BrowserOther Species FASTAGene interactionsTable SchemaAlphaFold
BioGPSEnsemblEntrez GeneExonPrimerGeneCardsGeneNetwork
H-INVHGNCHPRDLynxMalacardsMGI
neXtProtOMIMPubMedReactomeTreefamUniProtKB
WikipediaBioGrid CRISPR DB

-  Comments and Description Text from UniProtKB
  ID: PGBM_HUMAN
DESCRIPTION: RecName: Full=Basement membrane-specific heparan sulfate proteoglycan core protein; Short=HSPG; AltName: Full=Perlecan; Short=PLC; Contains: RecName: Full=Endorepellin; Contains: RecName: Full=LG3 peptide; Flags: Precursor;
FUNCTION: Integral component of basement membranes. Component of the glomerular basement membrane (GBM), responsible for the fixed negative electrostatic membrane charge, and which provides a barrier which is both size- and charge-selective. It serves as an attachment substrate for cells. Plays essential roles in vascularization. Critical for normal heart development and for regulating the vascular response to injury. Also required for avascular cartilage development.
FUNCTION: Endorepellin in an anti-angiogenic and anti-tumor peptide that inhibits endothelial cell migration, collagen-induced endothelial tube morphogenesis and blood vessel growth in the chorioallantoic membrane. Blocks endothelial cell adhesion to fibronectin and type I collagen. Anti-tumor agent in neovascularization. Interaction with its ligand, integrin alpha2/beta1, is required for the anti-angiogenic properties. Evokes a reduction in phosphorylation of receptor tyrosine kinases via alpha2/beta1 integrin-mediated activation of the tyrosine phosphatase, PTPN6.
FUNCTION: The LG3 peptide has anti-angiogenic properties that require binding of calcium ions for full activity.
SUBUNIT: Purified perlecan has a strong tendency to aggregate in dimers or stellate structures. It interacts with other basement membrane components such as laminin, prolargin and collagen type IV. Interacts with COL13A1, FGFBP1 and VWA1. Interacts (via C- terminus) with ECM1 (via C-terminus).
SUBCELLULAR LOCATION: Secreted, extracellular space, extracellular matrix, basement membrane.
TISSUE SPECIFICITY: Found in the basement membranes.
PTM: Proteolytic processing produces the C-terminal angiogenic peptide, endorepellin. This peptide can be further processed to produce the LG3 peptide.
PTM: N- and O-glycosylated. O-glycosylated with core 1 or possibly core 8 glycans. Perlecan contains three heparan sulfate chains. The LG3 peptide contains at least three and up to five potential O-glycosylation sites but no N-glycosylation.
DISEASE: Defects in HSPG2 are the cause of Schwartz-Jampel syndrome (SJS1) [MIM:255800]; a rare autosomal recessive disorder characterized by permanent myotonia (prolonged failure of muscle relaxation) and skeletal dysplasia, resulting in reduced stature, kyphoscoliosis, bowing of the diaphyses and irregular epiphyses.
DISEASE: Defects in HSPG2 are the cause of dyssegmental dysplasia Silverman-Handmaker type (DDSH) [MIM:224410]. The dyssegmental dysplasias are rare, autosomal recessive skeletal dysplasias with anisospondyly and micromelia. There are two recognized types: the severe, lethal DDSH and the milder Rolland-Desbuquois form. Individuals with DDSH also have a flat face, micrognathia, cleft palate and reduced joint mobility, and frequently have an encephalocoele. The endochondral growth plate is short, the calcospherites (which are spherical calcium-phosphorus crystals produced by hypertrophic chondrocytes) are unfused, and there is mucoid degeneration of the resting cartilage.
MISCELLANEOUS: The LG3 peptide has been found in the urine of patients with end-stage renal disease and in the amniotic fluid of pregnant women with premature rupture of fetal membranes.
SIMILARITY: Contains 4 EGF-like domains.
SIMILARITY: Contains 22 Ig-like C2-type (immunoglobulin-like) domains.
SIMILARITY: Contains 11 laminin EGF-like domains.
SIMILARITY: Contains 3 laminin G-like domains.
SIMILARITY: Contains 3 laminin IV type A domains.
SIMILARITY: Contains 4 LDL-receptor class A domains.
SIMILARITY: Contains 1 SEA domain.
WEB RESOURCE: Name=Atlas of Genetics and Cytogenetics in Oncology and Haematology; URL="http://atlasgeneticsoncology.org//Genes/HSPG2ID40890ch1p36.html";
WEB RESOURCE: Name=Wikipedia; Note=Perlecan entry; URL="http://en.wikipedia.org/wiki/Perlecan";
WEB RESOURCE: Name=GeneReviews; URL="http://www.ncbi.nlm.nih.gov/sites/GeneTests/lab/gene/HSPG2";

-  Primer design for this transcript
 

Primer3Plus can design qPCR Primers that straddle exon-exon-junctions, which amplify only cDNA, not genomic DNA.
Click here to load the transcript sequence and exon structure into Primer3Plus

Exonprimer can design one pair of Sanger sequencing primers around every exon, located in non-genic sequence.
Click here to open Exonprimer with this transcript

To design primers for a non-coding sequence, zoom to a region of interest and select from the drop-down menu: View > In External Tools > Primer3


-  Genetic Association Studies of Complex Diseases and Disorders
  Genetic Association Database (archive): HSPG2
CDC HuGE Published Literature: HSPG2
Positive Disease Associations: periodontitis
Related Studies:
  1. periodontitis
    Suzuki A 2004, Single nucleotide polymorphisms associated with aggressive periodontitis and severe chronic periodontitis in Japanese., Biochemical and biophysical research communications. 2004 May;317(3):887-92. [PubMed 15081423]
    These appear to be good candidates as genetic factors for future study.

-  MalaCards Disease Associations
  MalaCards Gene Search: HSPG2
Diseases sorted by gene-association score: schwartz-jampel syndrome, type 1* (1684), dyssegmental dysplasia, silverman-handmaker type* (1390), gas gangrene (24), tardive dyskinesia (20), myotonia (18), hyperglobulinemic purpura (15), sjogren's syndrome (14), sialadenitis (11), hypersensitivity syndrome, carbamazepine-induced (11), dacryoadenitis (11), pseudomembranous conjunctivitis (10), keratoconjunctivitis sicca (9), mixed connective tissue disease (8), functional colonic disease (7), urbach-wiethe disease (7), aplasia of lacrimal and salivary glands (7), blepharospasm (7), xerophthalmia (6), eyelid disease (6), salivary gland disease (6), colonic pseudo-obstruction (6), skeletal dysplasias (5), hemometra (5), cenani-lenz syndactyly syndrome (5), lacrimal apparatus disease (5), kniest dysplasia (4), exanthem (4), muscular dystrophy-dystroglycanopathy , type b, 6 (4), behcet syndrome (1)
* = Manually curated disease association

-  Comparative Toxicogenomics Database (CTD)
  The following chemicals interact with this gene           more ... click here to view the complete list

+  Common Gene Haplotype Alleles
  Press "+" in the title bar above to open this section.

-  RNA-Seq Expression Data from GTEx (53 Tissues, 570 Donors)
  Highest median expression: 80.29 RPKM in Artery - Tibial
Total median expression: 1126.11 RPKM



View in GTEx track of Genome Browser    View at GTEx portal     View GTEx Body Map

+  Microarray Expression Data
  Press "+" in the title bar above to open this section.

-  mRNA Secondary Structure of 3' and 5' UTRs
 
RegionFold EnergyBasesEnergy/Base
Display As
5' UTR -18.4040-0.460 Picture PostScript Text
3' UTR -450.301072-0.420 Picture PostScript Text

The RNAfold program from the Vienna RNA Package is used to perform the secondary structure predictions and folding calculations. The estimated folding energy is in kcal/mol. The more negative the energy, the more secondary structure the RNA is likely to have.

-  Protein Domain and Structure Information
  InterPro Domains: Graphical view of domain structure
IPR008985 - ConA-like_lec_gl_sf
IPR013320 - ConA-like_subgrp
IPR000742 - EG-like_dom
IPR013032 - EGF-like_CS
IPR002049 - EGF_laminin
IPR007110 - Ig-like
IPR013783 - Ig-like_fold
IPR013098 - Ig_I-set
IPR003598 - Ig_sub2
IPR018031 - Laminin_B_subgr
IPR000034 - Laminin_B_type_IV
IPR001791 - Laminin_G
IPR023415 - LDLR_class-A_CS
IPR002172 - LDrepeatLR_classA_rpt
IPR000082 - SEA

Pfam Domains:
PF00008 - EGF-like domain
PF00047 - Immunoglobulin domain
PF00052 - Laminin B (Domain IV)
PF00053 - Laminin EGF domain
PF00054 - Laminin G domain
PF00057 - Low-density lipoprotein receptor domain class A
PF02210 - Laminin G domain
PF07679 - Immunoglobulin I-set domain
PF12661 - Human growth factor-like EGF
PF13895 - Immunoglobulin domain
PF13927 - Immunoglobulin domain

SCOP Domains:
48726 - Immunoglobulin
49899 - Concanavalin A-like lectins/glucanases
57424 - LDL receptor-like module
57196 - EGF/Laminin
57184 - Growth factor receptor domain

Protein Data Bank (PDB) 3-D Structure
MuPIT help
3SH4 - X-ray 3SH5 - X-ray


ModBase Predicted Comparative 3D Structure on P98160
FrontTopSide
The pictures above may be empty if there is no ModBase structure for the protein. The ModBase structure frequently covers just a fragment of the protein. You may be asked to log onto ModBase the first time you click on the pictures. It is simplest after logging in to just click on the picture again to get to the specific info on that model.

-  Orthologous Genes in Other Species
  Orthologies between human, mouse, and rat are computed by taking the best BLASTP hit, and filtering out non-syntenic hits. For more distant species reciprocal-best BLASTP hits are used. Note that the absence of an ortholog in the table below may reflect incomplete annotations in the other species rather than a true absence of the orthologous gene.
MouseRatZebrafishD. melanogasterC. elegansS. cerevisiae
No orthologNo orthologNo orthologNo orthologNo orthologNo ortholog
Gene Details     
Gene Sorter     
      
      
      

-  Gene Ontology (GO) Annotations with Structured Vocabulary
  Molecular Function:
GO:0005178 integrin binding
GO:0005509 calcium ion binding
GO:0005515 protein binding
GO:0008022 protein C-terminus binding
GO:0046872 metal ion binding
GO:0050750 low-density lipoprotein particle receptor binding

Biological Process:
GO:0001523 retinoid metabolic process
GO:0001525 angiogenesis
GO:0006024 glycosaminoglycan biosynthetic process
GO:0006027 glycosaminoglycan catabolic process
GO:0006629 lipid metabolic process
GO:0006898 receptor-mediated endocytosis
GO:0006954 inflammatory response
GO:0007420 brain development
GO:0016525 negative regulation of angiogenesis
GO:0030154 cell differentiation
GO:0030198 extracellular matrix organization
GO:0044267 cellular protein metabolic process
GO:0060548 negative regulation of cell death
GO:0072358 cardiovascular system development

Cellular Component:
GO:0005576 extracellular region
GO:0005604 basement membrane
GO:0005615 extracellular space
GO:0005796 Golgi lumen
GO:0005886 plasma membrane
GO:0005925 focal adhesion
GO:0031012 extracellular matrix
GO:0043202 lysosomal lumen
GO:0070062 extracellular exosome
GO:0098797 plasma membrane protein complex


-  Descriptions from all associated GenBank mRNAs
  AK126516 - Homo sapiens cDNA FLJ44552 fis, clone UTERU3007104, highly similar to Basement membrane-specific heparan sulfate proteoglycan core protein precursor.
M85289 - Human heparan sulfate proteoglycan (HSPG2) mRNA, complete cds.
BC033152 - Homo sapiens heparan sulfate proteoglycan 2, mRNA (cDNA clone IMAGE:4131756), with apparent retained intron.
AB209851 - Homo sapiens mRNA for Basement membrane-specific heparan sulfate proteoglycan core protein precursor variant protein.
KJ487142 - Homo sapiens clone St2-69 immunoglobulin heavy chain variable region (IG) mRNA, partial cds.
X62515 - H.sapiens mRNA for basement membrane heparan sulfate proteoglycan.
JD365626 - Sequence 346650 from Patent EP1572962.
JD107454 - Sequence 88478 from Patent EP1572962.
JD475108 - Sequence 456132 from Patent EP1572962.
JD239700 - Sequence 220724 from Patent EP1572962.
JD060015 - Sequence 41039 from Patent EP1572962.
JD247944 - Sequence 228968 from Patent EP1572962.
JD360901 - Sequence 341925 from Patent EP1572962.
JD317773 - Sequence 298797 from Patent EP1572962.
JD498461 - Sequence 479485 from Patent EP1572962.
JD223117 - Sequence 204141 from Patent EP1572962.
JD465804 - Sequence 446828 from Patent EP1572962.
JD128893 - Sequence 109917 from Patent EP1572962.
JD051321 - Sequence 32345 from Patent EP1572962.
BC109204 - Homo sapiens heparan sulfate proteoglycan 2, mRNA (cDNA clone IMAGE:40005996), partial cds.
M64283 - Human heparan sulfate proteoglycan (HSPG2) protein core mRNA, partial cds.
S76436 - Homo sapiens heparan sulfate proteoglycan core protein (HSPG2) mRNA, partial cds.
AF479675 - Homo sapiens perlecan variant miniperl mRNA, complete cds; alternatively spliced.
JD460755 - Sequence 441779 from Patent EP1572962.

-  Biochemical and Signaling Pathways
  KEGG - Kyoto Encyclopedia of Genes and Genomes
hsa04512 - ECM-receptor interaction

Reactome (by CSHL, EBI, and GO)

Protein P98160 (Reactome details) participates in the following event(s):

R-HSA-976734 Amyloid fibrils have additional components
R-HSA-1592314 HSPG2 (perlecan) degradation by MMP3, plasmin, (MMP12)
R-HSA-2396337 HSPG2 binds FGF2(10-155), Fibronectn matrix, Transthyretin tetramer, PDGFA homodimer, PDGFB homodimer
R-HSA-2396395 HSPG2 (perlecan) binds alpha-dystroglycan
R-HSA-2426530 Laminins:Nidogens binds HSPG2
R-HSA-2534160 HSPG2 (perlecan) degradation by MMP13, CTSS
R-NUL-2534170 Degradation of HSPG2 by Mmp13 and Ctss
R-HSA-2534240 HSPG2 (perlecan) degradation by MMP14, MMP15
R-HSA-3814820 HSPG2 (perlecan) is cleaved by BMP1, TLL1, TLL2, Cathepsin L1
R-HSA-4084505 Laminins bind HSPG2
R-HSA-4088220 Endorepellin binds alpha2beta1 integrin
R-HSA-4088281 Endorepellin binds KDR (VEGFR2)
R-HSA-1678694 Heparanase 2 (HPSE2) cleaves heparan sulfate from its proteoglycan (plasma membrane)
R-HSA-2396113 Dystroglycan binds AGRN and HSPG2
R-HSA-1878002 XYLTs transfer Xyl to core protein
R-HSA-1889955 B3GATs transfer GlcA to tetrasaccharide linker
R-HSA-1889978 B3GALT6 transfers Gal to the tetrasaccharide linker
R-HSA-1889981 B4GALT7 transfers Gal group to xylosyl-unit of the tetrasaccharide linker
R-HSA-2022919 EXT1:EXT2 transfers GlcNAc to the terminal GlcA residue
R-HSA-2076419 HS6STs sulfate GlcN at C6 in heparan sulfate/heparin
R-HSA-2076371 GLCE epimerises more GlcA to IdoA as sulfate content rises
R-HSA-2022887 NDST1-4 N-deacetylates GlcNAc residues in heparan
R-HSA-2076392 EXT1:EXT2 transfers GlcA to heparan
R-HSA-2022851 EXT1:EXT2 transfer GlcNAc to the heparan chain
R-HSA-2024100 GLCE epimerises GlcA to IdoA
R-HSA-2022860 NDST1-4 can sulfate a glucosamine residue in heparan to form heparan sulfate (HS)
R-HSA-2022856 EXT1:EXT2 transfers GlcNAc to heparan
R-HSA-2076383 HS3ST1 sulfates GlcN at C3 in heparan sulfate
R-HSA-2076611 HS3STs sulfate GlcN at C3 in heparan sulfate
R-HSA-2076508 HS2ST1 sulfates IdoA at C2 in heparan sulfate
R-HSA-1667005 Heparanase (HPSE) cleaves heparan sulfate from its proteoglycan (lysosome)
R-HSA-2423785 CR:atREs binds apoE and HSPG
R-HSA-2429643 NREH hydrolyses atREs (HSPG:apoE) to atROL and FAs
R-HSA-2404131 LRPs transport extracellular CR:atREs:HSPG:apoE to cytosol
R-HSA-977225 Amyloid fiber formation
R-HSA-1474228 Degradation of the extracellular matrix
R-HSA-3000171 Non-integrin membrane-ECM interactions
R-HSA-3000178 ECM proteoglycans
R-HSA-3000157 Laminin interactions
R-HSA-216083 Integrin cell surface interactions
R-HSA-2024096 HS-GAG degradation
R-HSA-392499 Metabolism of proteins
R-HSA-1474244 Extracellular matrix organization
R-HSA-1971475 A tetrasaccharide linker sequence is required for GAG synthesis
R-HSA-3560801 Defective B3GAT3 causes JDSSDHD
R-HSA-3560783 Defective B4GALT7 causes EDS, progeroid type
R-HSA-2022928 HS-GAG biosynthesis
R-HSA-3656253 Defective EXT1 causes exostoses 1, TRPS2 and CHDS
R-HSA-3656237 Defective EXT2 causes exostoses 2
R-HSA-4420332 Defective B3GALT6 causes EDSP2 and SEMDJL1
R-HSA-1638091 Heparan sulfate/heparin (HS-GAG) metabolism
R-HSA-1793185 Chondroitin sulfate/dermatan sulfate metabolism
R-HSA-3560782 Diseases associated with glycosaminoglycan metabolism
R-HSA-975634 Retinoid metabolism and transport
R-HSA-1630316 Glycosaminoglycan metabolism
R-HSA-3781865 Diseases of glycosylation
R-HSA-2187338 Visual phototransduction
R-HSA-6806667 Metabolism of fat-soluble vitamins
R-HSA-71387 Metabolism of carbohydrates
R-HSA-1643685 Disease
R-HSA-418594 G alpha (i) signalling events
R-HSA-196854 Metabolism of vitamins and cofactors
R-HSA-1430728 Metabolism
R-HSA-388396 GPCR downstream signalling
R-HSA-372790 Signaling by GPCR
R-HSA-162582 Signal Transduction

-  Other Names for This Gene
  Alternate Gene Symbols: NM_005529, NP_005520, P98160, PGBM_HUMAN, Q16287, Q5SZI3, Q9H3V5
UCSC ID: uc001bfj.3
RefSeq Accession: NM_005529
Protein: P98160 (aka PGBM_HUMAN)
CCDS: CCDS30625.1

-  Gene Model Information
 
category: coding nonsense-mediated-decay: no RNA accession: NM_005529.5
exon count: 97CDS single in 3' UTR: no RNA size: 14294
ORF size: 13176CDS single in intron: no Alignment % ID: 100.00
txCdsPredict score: 26552.00frame shift in genome: no % Coverage: 99.96
has start codon: yes stop codon in genome: no # of Alignments: 1
has end codon: yes retained intron: no # AT/AC introns 0
selenocysteine: no end bleed into intron: 0# strange splices: 0
Click here for a detailed description of the fields of the table above.

-  Methods, Credits, and Use Restrictions
  Click here for details on how this gene model was made and data restrictions if any.