PGGHG

gene
On this page

Also known as FLJ22635

Summary

PGGHG (protein-glucosylgalactosylhydroxylysine glucosidase, HGNC:26210) is a protein-coding gene on chromosome 11p15.5, encoding Protein-glucosylgalactosylhydroxylysine glucosidase (Q32M88). Catalyzes the hydrolysis of glucose from the disaccharide unit linked to hydroxylysine residues of collagen and collagen-like proteins.

Enables protein-glucosylgalactosylhydroxylysine glucosidase activity. Involved in carbohydrate metabolic process. Located in cytosol.

Source: NCBI Gene 80162 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 23 total
  • MANE Select transcript: NM_025092

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26210
Approved symbolPGGHG
Nameprotein-glucosylgalactosylhydroxylysine glucosidase
Location11p15.5
Locus typegene with protein product
StatusApproved
AliasesFLJ22635
Ensembl geneENSG00000142102
Ensembl biotypeprotein_coding
OMIM617032
Entrez80162

Gene structure

Transcript identifiers

Ensembl transcripts: 17 — 13 protein_coding, 3 retained_intron, 1 nonsense_mediated_decay

ENST00000397660, ENST00000409479, ENST00000409548, ENST00000409655, ENST00000474221, ENST00000476372, ENST00000482937, ENST00000529087, ENST00000865174, ENST00000865175, ENST00000865176, ENST00000865177, ENST00000865178, ENST00000935432, ENST00000935433, ENST00000943937, ENST00000943938

RefSeq mRNA: 1 — MANE Select: NM_025092 NM_025092

CCDS: CCDS31322

Canonical transcript exons

ENST00000409548 — 14 exons

ExonStartEnd
ENSE00001492785293366293502
ENSE00001492806293163293235
ENSE00001584341289126289239
ENSE00001607402290678291113
ENSE00002159295294556296107
ENSE00003470125294267294478
ENSE00003477800293594293727
ENSE00003549512291976292095
ENSE00003559650292886292997
ENSE00003596264292546292677
ENSE00003621402290390290600
ENSE00003648187289804290075
ENSE00003649557293830293925
ENSE00003683502294099294196

Expression profiles

Bgee: expression breadth ubiquitous, 236 present calls, max score 99.43.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 14.4109 / max 318.5008, expressed in 1732 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
1121679.19631641
1121684.95201455
2060700.2625120

Top tissues by expression

279 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
body of pancreasUBERON:000115099.43gold quality
granulocyteCL:000009498.61gold quality
right lungUBERON:000216798.27gold quality
tendon of biceps brachiiUBERON:000818898.23gold quality
small intestine Peyer’s patchUBERON:000345497.53gold quality
right uterine tubeUBERON:000130296.98gold quality
upper lobe of left lungUBERON:000895296.88gold quality
spleenUBERON:000210696.87gold quality
left lobe of thyroid glandUBERON:000112096.74gold quality
left uterine tubeUBERON:000130396.70gold quality
right lobe of thyroid glandUBERON:000111996.47gold quality
metanephros cortexUBERON:001053396.40gold quality
mucosa of transverse colonUBERON:000499196.35gold quality
right lobe of liverUBERON:000111495.88gold quality
thyroid glandUBERON:000204695.45gold quality
endocervixUBERON:000045895.40gold quality
upper lobe of lungUBERON:000894895.40gold quality
small intestineUBERON:000210895.33gold quality
tibial nerveUBERON:000132394.79gold quality
lymph nodeUBERON:000002994.72gold quality
minor salivary glandUBERON:000183094.60gold quality
descending thoracic aortaUBERON:000234594.40gold quality
bloodUBERON:000017894.28gold quality
ascending aortaUBERON:000149694.03gold quality
thoracic aortaUBERON:000151593.96gold quality
gall bladderUBERON:000211093.70gold quality
omental fat padUBERON:001041493.69gold quality
left coronary arteryUBERON:000162693.68gold quality
peritoneumUBERON:000235893.64gold quality
coronary arteryUBERON:000162193.12gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes17.22

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

19 targeting PGGHG, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-444799.8567.812900
HSA-MIR-6892-3P99.6866.401178
HSA-MIR-447299.5666.081478
HSA-MIR-4786-3P99.3668.351390
HSA-MIR-429199.2068.882969
HSA-MIR-544B99.1867.411632
HSA-MIR-6797-3P99.1766.94668
HSA-MIR-66199.0965.942062
HSA-MIR-92299.0267.231838
HSA-MIR-6804-3P98.7264.82852
HSA-MIR-607698.6165.69637
HSA-MIR-6826-3P98.1966.321153
HSA-MIR-615-5P98.1063.76591
HSA-MIR-607298.0066.47804
HSA-MIR-366597.7365.08975
HSA-MIR-6823-5P96.2665.69919
HSA-MIR-6891-3P95.8065.76683
HSA-MIR-2277-3P91.9462.27299
HSA-MIR-120489.5065.56109

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriopgghgENSDARG00000014311
mus_musculusPgghgENSMUSG00000062031
rattus_norvegicusPgghgENSRNOG00000014783
drosophila_melanogasterCG16965FBGN0032387

Protein

Protein identifiers

Protein-glucosylgalactosylhydroxylysine glucosidaseQ32M88 (reviewed: Q32M88)

Alternative names: Acid trehalase-like protein 1

All UniProt accessions (6): Q32M88, A0A024R1Z9, B8ZZ60, E7EMA9, H0YCL6, H3BM00

UniProt curated annotations — full annotation on UniProt →

Function. Catalyzes the hydrolysis of glucose from the disaccharide unit linked to hydroxylysine residues of collagen and collagen-like proteins.

Similarity. Belongs to the glycosyl hydrolase 65 family.

Isoforms (2)

UniProt IDNamesCanonical?
Q32M88-11yes
Q32M88-22

RefSeq proteins (1): NP_079368* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR005195Glyco_hydro_65_MDomain
IPR0089286-hairpin_glycosidase_sfHomologous_superfamily
IPR0123416hp_glycosidase-like_sfHomologous_superfamily

Pfam: PF03632

Catalyzed reactions (Rhea), 1 shown:

  • (5R)-5-O-[alpha-D-glucosyl-(1->2)-beta-D-galactosyl]-5-hydroxy-L-lysyl-[collagen] + H2O = (5R)-5-O-(beta-D-galactosyl)-5-hydroxy-L-lysyl-[collagen] + D-glucose (RHEA:11068)

UniProt features (14 total): mutagenesis site 5, binding site 2, splice variant 2, chain 1, region of interest 1, sequence conflict 1, compositionally biased region 1, active site 1

Structure

Experimental structures (PDB)

5 structures.

PDBMethodResolution (Å)
9SD5X-RAY DIFFRACTION1.36
9SD7X-RAY DIFFRACTION1.39
9SD8X-RAY DIFFRACTION1.61
9SD6X-RAY DIFFRACTION1.7
9SD4X-RAY DIFFRACTION2.39

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q32M88-F191.660.89

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 430 (proton donor)

Ligand- & substrate-binding residues (2): 300–301; 498–499

Mutagenesis-validated functional residues (5):

PositionPhenotype
429significantly impairs catalytic activity.
430abolishes catalytic activity.
574abolishes catalytic activity.
301abolishes catalytic activity.
429abolishes catalytic activity.

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 106 (showing top): ACEVEDO_LIVER_TUMOR_VS_NORMAL_ADJACENT_TISSUE_DN, GOBP_CARBOHYDRATE_METABOLIC_PROCESS, HOEBEKE_LYMPHOID_STEM_CELL_UP, DEURIG_T_CELL_PROLYMPHOCYTIC_LEUKEMIA_UP, KIM_WT1_TARGETS_DN, GOMF_GLUCOSIDASE_ACTIVITY, GOMF_HYDROLASE_ACTIVITY_ACTING_ON_GLYCOSYL_BONDS, GOMF_HYDROLASE_ACTIVITY_HYDROLYZING_O_GLYCOSYL_COMPOUNDS, GUTIERREZ_CHRONIC_LYMPHOCYTIC_LEUKEMIA_DN, VDR_Q6, LEE_DIFFERENTIATING_T_LYMPHOCYTE, CHEMNITZ_RESPONSE_TO_PROSTAGLANDIN_E2_DN, LEE_BMP2_TARGETS_UP, KRAS.600.LUNG.BREAST_UP.V1_DN, KRAS.BREAST_UP.V1_DN

GO Biological Process (1): carbohydrate metabolic process (GO:0005975)

GO Molecular Function (4): protein-glucosylgalactosylhydroxylysine glucosidase activity (GO:0047402), catalytic activity (GO:0003824), hydrolase activity (GO:0016787), hydrolase activity, acting on glycosyl bonds (GO:0016798)

GO Cellular Component (1): cytosol (GO:0005829)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
primary metabolic process1
glucosidase activity1
catalytic activity, acting on a protein1
molecular_function1
catalytic activity1
hydrolase activity1
cytoplasm1
cellular anatomical structure1

Protein interactions and networks

STRING

572 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
PGGHGB4GALNT4Q76KP1705
PGGHGIFITM5A6NNB3590
PGGHGTREHO43280575
PGGHGNLRP6P59044532
PGGHGIFITM10A6NMD0529
PGGHGPKP3Q9Y446521
PGGHGSYNDIG1LA6NDD5497
PGGHGIFITM2Q01629452
PGGHGIFITM1P13164420
PGGHGIFITM3Q01628415
PGGHGTMEM91Q6ZNR0409
PGGHGZNF329Q86UD4380
PGGHGPRRT1Q99946377
PGGHGGAPDHP00354375
PGGHGPSMD13Q9UNM6360

IntAct

4 interactions, top by confidence:

ABTypeScore
ECE1PGGHGpsi-mi:“MI:0915”(physical association)0.370
PGGHGECE1psi-mi:“MI:0915”(physical association)0.370

BioGRID (7): ATHL1 (Affinity Capture-RNA), ATHL1 (Affinity Capture-MS), ATHL1 (Affinity Capture-RNA), ATHL1 (Affinity Capture-MS), ATHL1 (Affinity Capture-RNA), ATHL1 (Two-hybrid), ATHL1 (Affinity Capture-Western)

ESM2 similar proteins: A0JND9, E1BPW0, O14773, O18956, O35795, O55026, O75173, O75355, O75356, O75578, O89023, O93295, P08514, P08648, P11688, P17405, P49961, P55772, P56201, P79784, P97687, Q04519, Q0VD19, Q12794, Q32M88, Q49HH9, Q49KI5, Q5DRK1, Q5IS74, Q5MY95, Q5RFL1, Q5RFQ8, Q60HH1, Q6P3E7, Q6P6S9, Q717C1, Q717C2, Q7RTX0, Q8BFW6, Q8BNJ2

Diamond homologs: A0A1H1XG33, A0JMP0, A5FBJ5, B8D049, F1NZI4, M1RNJ8, N1P212, O06993, P48016, P78617, Q32M88, Q54KX5, Q5AAU5, Q8BP56, Q8RBL8, E6ENP7, Q8GRC3, Q8L163, A9KT32, D6XZ22, P9WN14, P9WN15, A0PMI0, A0QKN5, A1KP65, A5U846, B2HDP8, E1WGG9, F4I1A6, F4KFG5, G4RK44, O64896, P0CL50, P55611, P65070, P9WFZ4, P9WFZ5, P9WKZ6, P9WKZ7, Q0D6F4

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

23 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance10
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

2541 predictions. Top by Δscore:

VariantEffectΔscore
11:290593:G:GTdonor_gain1.0000
11:290596:GCCCG:Gdonor_gain1.0000
11:292096:G:GAdonor_loss1.0000
11:292097:T:Gdonor_loss1.0000
11:292543:CAGGG:Cacceptor_gain1.0000
11:292544:A:AGacceptor_gain1.0000
11:292544:AG:Aacceptor_gain1.0000
11:292544:AGGGA:Aacceptor_gain1.0000
11:292545:G:GAacceptor_loss1.0000
11:292545:G:GGacceptor_gain1.0000
11:292545:GG:Gacceptor_gain1.0000
11:292545:GGGA:Gacceptor_gain1.0000
11:292545:GGGAG:Gacceptor_gain1.0000
11:292674:CCAGG:Cdonor_loss1.0000
11:292675:CAG:Cdonor_loss1.0000
11:292676:AGG:Adonor_loss1.0000
11:292677:GGTGA:Gdonor_loss1.0000
11:292679:T:Adonor_loss1.0000
11:293415:A:AGacceptor_gain1.0000
11:293416:G:GGacceptor_gain1.0000
11:293416:GCCA:Gacceptor_gain1.0000
11:293726:GG:Gdonor_gain1.0000
11:293727:GG:Gdonor_gain1.0000
11:293821:T:TAacceptor_gain1.0000
11:293828:A:AGacceptor_gain1.0000
11:293829:G:GGacceptor_gain1.0000
11:293829:GA:Gacceptor_gain1.0000
11:293829:GAGC:Gacceptor_gain1.0000
11:293829:GAGCA:Gacceptor_gain1.0000
11:294265:AG:Aacceptor_gain1.0000

AlphaMissense

4762 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
11:293830:A:CS539R0.985
11:293832:C:AS539R0.985
11:293832:C:GS539R0.985
11:293484:T:CF488L0.983
11:293486:C:AF488L0.983
11:293486:C:GF488L0.983
11:294138:T:CF584L0.982
11:294140:C:AF584L0.982
11:294140:C:GF584L0.982
11:292943:T:CF406L0.979
11:292945:T:AF406L0.979
11:292945:T:GF406L0.979
11:294102:T:AW572R0.979
11:294102:T:CW572R0.979
11:291048:A:CS281R0.975
11:291050:T:AS281R0.975
11:291050:T:GS281R0.975
11:293920:T:CF569L0.973
11:293922:C:AF569L0.973
11:293922:C:GF569L0.973
11:294137:C:AN583K0.973
11:294137:C:GN583K0.973
11:290395:T:CF89L0.971
11:290397:T:AF89L0.971
11:290397:T:GF89L0.971
11:292042:C:AR325S0.971
11:292567:A:CS350R0.969
11:292569:T:AS350R0.969
11:292569:T:GS350R0.969
11:291105:T:AW300R0.967

dbSNP variants (sampled 300 via entrez): RS1000116014 (11:295525 C>G,T), RS1000120544 (11:289157 G>A,C), RS1000274492 (11:289550 G>C), RS1000383447 (11:293665 C>T), RS1000514908 (11:288941 T>C), RS1000848576 (11:292392 C>T), RS1000922085 (11:292260 G>A), RS1001108474 (11:287919 G>A,C), RS1001540140 (11:288174 G>A), RS1001747142 (11:295709 T>C), RS1001777134 (11:288681 C>A,T), RS1001853854 (11:291693 C>T), RS1002099434 (11:295489 G>A,C), RS1002284161 (11:294027 G>A), RS1002380069 (11:290221 C>A,G,T)

Disease associations

OMIM: gene MIM:617032 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST006016_19Serum alkaline phosphatase levels3.000000e-38
GCST90011900_65Serum alkaline phosphatase levels1.000000e-71

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004533alkaline phosphatase measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

37 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Aaffects cotreatment, increases methylation, decreases expression2
sodium arsenitedecreases expression, increases expression2
Benzo(a)pyreneaffects methylation, increases expression2
aristolochic acid Idecreases expression1
bisphenol Fdecreases expression, affects cotreatment1
bufotalindecreases expression1
testosterone enanthateaffects expression1
triphenyl phosphateaffects expression1
2,5,2’,5’-tetrachlorobiphenyldecreases expression1
butyraldehydedecreases expression1
aflatoxin B2increases methylation1
di-n-butylphosphoric acidaffects expression1
Grape Seed Proanthocyanidinsaffects cotreatment, decreases expression1
bisphenol Saffects cotreatment, decreases expression1
jinfukangaffects cotreatment, increases expression1
Temozolomideincreases expression1
Sunitinibdecreases expression1
Arsenic Trioxidedecreases expression1
Fulvestrantincreases methylation, affects cotreatment1
Air Pollutantsincreases abundance, increases expression1
Catechinaffects cotreatment, decreases expression1
Cisplatinaffects cotreatment, increases expression1
Dexamethasonedecreases expression, affects cotreatment1
Diethylhexyl Phthalatedecreases expression1
Estradiolaffects cotreatment, increases expression1
Indomethacinaffects cotreatment, decreases expression1
Ivermectindecreases expression1
Nickelincreases expression1
Phthalic Acidsincreases methylation1
Tobacco Smoke Pollutiondecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.