GALNT5

gene
On this page

Also known as GalNAc-T5

Summary

GALNT5 (polypeptide N-acetylgalactosaminyltransferase 5, HGNC:4127) is a protein-coding gene on chromosome 2q24.1, encoding Polypeptide N-acetylgalactosaminyltransferase 5 (Q7Z7M9). Catalyzes the initial reaction in O-linked oligosaccharide biosynthesis, the transfer of an N-acetyl-D-galactosamine residue to a serine or threonine residue on the protein receptor.

The protein encoded by this gene is a membrane-bound polypeptide N-acetylgalactosaminyltransferase that is found in the Golgi. The encoded protein catalyzes the first step in the mucin-type O-glycosylation of Golgi proteins, transfering an N-acetyl-D-galactosamine residue to a serine or threonine residue on the protein receptor.

Source: NCBI Gene 11227 — RefSeq curated summary.

At a glance

  • GWAS associations: 1
  • Clinical variants (ClinVar): 156 total — 1 likely-pathogenic
  • MANE Select transcript: NM_014568

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:4127
Approved symbolGALNT5
Namepolypeptide N-acetylgalactosaminyltransferase 5
Location2q24.1
Locus typegene with protein product
StatusApproved
AliasesGalNAc-T5
Ensembl geneENSG00000136542
Ensembl biotypeprotein_coding
OMIM615129
Entrez11227

Gene structure

Transcript identifiers

Ensembl transcripts: 6 — 4 protein_coding, 2 protein_coding_CDS_not_defined

ENST00000259056, ENST00000461704, ENST00000463418, ENST00000899988, ENST00000953166, ENST00000953167

RefSeq mRNA: 2 — MANE Select: NM_014568 NM_001329868, NM_014568

CCDS: CCDS2203

Canonical transcript exons

ENST00000259056 — 10 exons

ExonStartEnd
ENSE00000778544157286015157286134
ENSE00000924647157295663157295798
ENSE00000924648157296394157296513
ENSE00000924650157300676157300999
ENSE00000924651157305749157305829
ENSE00000924652157308567157308728
ENSE00000964301157257705157259536
ENSE00000964302157311208157318491
ENSE00000964303157284282157284448
ENSE00000964305157299548157299665

Expression profiles

Bgee: expression breadth ubiquitous, 157 present calls, max score 93.61.

FANTOM5 (CAGE): breadth broad, TPM avg 11.4503 / max 640.9709, expressed in 863 samples.

FANTOM5 promoters (15 alternative TSS)

Promoter IDTPM avgSamples expressed
231875.3978692
231852.5490513
231950.9036351
231960.4894238
231940.4086207
231860.3551192
231930.2938151
231900.2486127
231910.155670
231830.154168

Top tissues by expression

233 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
nasal cavity epitheliumUBERON:000538493.61gold quality
cardiac muscle of right atriumUBERON:000337993.56gold quality
left ventricle myocardiumUBERON:000656692.85gold quality
kidney epitheliumUBERON:000481992.53gold quality
pancreatic ductal cellCL:000207989.62gold quality
rectumUBERON:000105287.92gold quality
mucosa of sigmoid colonUBERON:000499386.77gold quality
colonic mucosaUBERON:000031785.59gold quality
parotid glandUBERON:000183185.32silver quality
ileal mucosaUBERON:000033184.84gold quality
stromal cell of endometriumCL:000225584.29gold quality
epithelial cell of pancreasCL:000008383.87silver quality
myocardiumUBERON:000234982.90gold quality
oral cavityUBERON:000016782.70gold quality
tibiaUBERON:000097982.42gold quality
palpebral conjunctivaUBERON:000181282.37gold quality
esophagus squamous epitheliumUBERON:000692081.62gold quality
calcaneal tendonUBERON:000370179.34gold quality
spermCL:000001979.02silver quality
minor salivary glandUBERON:000183078.69gold quality
saliva-secreting glandUBERON:000104478.57gold quality
cartilage tissueUBERON:000241878.40gold quality
eyeUBERON:000097078.07gold quality
mouth mucosaUBERON:000372977.74gold quality
epithelium of mammary glandUBERON:000324477.66gold quality
mammary ductUBERON:000176577.58gold quality
secondary oocyteCL:000065576.93gold quality
vastus lateralisUBERON:000137976.88gold quality
nasal cavity mucosaUBERON:000182676.83gold quality
quadriceps femorisUBERON:000137776.27gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes9.11
E-MTAB-10290no632.23

Regulation

Is transcription factor: no

Literature-anchored findings (GeneRIF, showing 5)

  • of GalNAc-T5 expression in gastric cancer tissues might add some prognostic information for patients with this disease and lead to a more accurate classification under the TNM stage system (PMID:24619076)
  • Two identified candidate O-glycoprotein biomarkers (CD44 and GalNAc-T5) circulating with the STn glycoform were further validated as being expressed in gastric cancer tissue (PMID:25813380)
  • miR-196b-5p inhibition led to significantly increased colorectal cancer cell migration/invasion and metastases formation in mice, whereas ectopic overexpression showed the opposite phenotype. Molecular profiling and target confirmation identified an interaction between miR-196b-5p and HOXB7 and GALNT5, which in turn regulated colorectal cancer cell migration (PMID:28533224)
  • The 5’-end stem-loop motifs of GALNT5 uaRNA promoted the binding of HSP90. (PMID:29743591)
  • Oncogenic GALNT5 confers FOLFIRINOX resistance via activating the MYH9/ NOTCH/ DDR axis in pancreatic ductal adenocarcinoma. (PMID:39433745)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_rerioENSDARG00000114227
mus_musculusGalnt5ENSMUSG00000026828
rattus_norvegicusGalnt5ENSRNOG00000004645
drosophila_melanogasterPgant5FBGN0031681
drosophila_melanogasterPgant9FBGN0050463
caenorhabditis_elegansWBGENE00001628
caenorhabditis_elegansWBGENE00001630

Paralogs (19): GALNT16 (ENSG00000100626), GALNTL5 (ENSG00000106648), GALNT7 (ENSG00000109586), GALNT18 (ENSG00000110328), GALNT3 (ENSG00000115339), GALNT12 (ENSG00000119514), GALNT8 (ENSG00000130035), GALNT15 (ENSG00000131386), GALNT6 (ENSG00000139629), GALNT1 (ENSG00000141429), GALNT2 (ENSG00000143641), GALNT13 (ENSG00000144278), GALNT14 (ENSG00000158089), GALNT10 (ENSG00000164574), GALNTL6 (ENSG00000174473), GALNT11 (ENSG00000178234), GALNT9 (ENSG00000182870), GALNT17 (ENSG00000185274), GALNT4 (ENSG00000257594)

Protein

Protein identifiers

Polypeptide N-acetylgalactosaminyltransferase 5Q7Z7M9 (reviewed: Q7Z7M9)

Alternative names: Polypeptide GalNAc transferase 5, Protein-UDP acetylgalactosaminyltransferase 5, UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 5

All UniProt accessions (1): Q7Z7M9

UniProt curated annotations — full annotation on UniProt →

Function. Catalyzes the initial reaction in O-linked oligosaccharide biosynthesis, the transfer of an N-acetyl-D-galactosamine residue to a serine or threonine residue on the protein receptor. Has activity toward EA2 peptide substrate, but has a weak activity toward Muc2 or Muc1b substrates.

Subunit / interactions. Interacts with EXT2. Does not interact with EXT1, EXTL1 or EXTL3.

Subcellular location. Golgi apparatus membrane.

Domain organisation. There are two conserved domains in the glycosyltransferase region: the N-terminal domain (domain A, also called GT1 motif), which is probably involved in manganese coordination and substrate binding and the C-terminal domain (domain B, also called Gal/GalNAc-T motif), which is probably involved in catalytic reaction and UDP-Gal binding. The ricin B-type lectin domain binds to GalNAc and contributes to the glycopeptide specificity.

Pathway. Protein modification; protein glycosylation.

Similarity. Belongs to the glycosyltransferase 2 family. GalNAc-T subfamily.

RefSeq proteins (2): NP_001316797, NP_055383* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000772Ricin_B_lectinDomain
IPR001173Glyco_trans_2-likeDomain
IPR029044Nucleotide-diphossugar_transHomologous_superfamily
IPR035992Ricin_B-like_lectinsHomologous_superfamily
IPR045885GalNAc-TDomain

Pfam: PF00535, PF00652

Catalyzed reactions (Rhea), 2 shown:

  • L-seryl-[protein] + UDP-N-acetyl-alpha-D-galactosamine = a 3-O-[N-acetyl-alpha-D-galactosaminyl]-L-seryl-[protein] + UDP + H(+) (RHEA:23956)
  • L-threonyl-[protein] + UDP-N-acetyl-alpha-D-galactosamine = a 3-O-[N-acetyl-alpha-D-galactosaminyl]-L-threonyl-[protein] + UDP + H(+) (RHEA:52424)

UniProt features (44 total): glycosylation site 12, binding site 9, disulfide bond 5, region of interest 5, sequence variant 4, topological domain 2, compositionally biased region 2, chain 1, transmembrane region 1, modified residue 1, domain 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q7Z7M9-F168.510.46

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (9): 536; 565; 588; 589; 590; 695; 723; 726; 731

Post-translational modifications (1): 292

Disulfide bonds (5): 486–718, 709–789, 822–835, 858–873, 908–923

Glycosylation sites (12): 217, 256, 273, 316, 362, 395, 406, 578, 776, 827, 845, 912

Function

Pathways and Gene Ontology

Reactome pathways

1 pathways

IDPathway
R-HSA-913709O-linked glycosylation of mucins

MSigDB gene sets: 49 (showing top): ROVERSI_GLIOMA_COPY_NUMBER_UP, GOBP_CARBOHYDRATE_DERIVATIVE_METABOLIC_PROCESS, GOBP_AMINOGLYCAN_BIOSYNTHETIC_PROCESS, GOBP_CARBOHYDRATE_DERIVATIVE_BIOSYNTHETIC_PROCESS, chr2q24, WAGNER_APO2_SENSITIVITY, GOBP_AMINOGLYCAN_METABOLIC_PROCESS, GOBP_PROTEIN_O_LINKED_GLYCOSYLATION_VIA_N_ACETYL_GALACTOSAMINE, GOBP_PROTEIN_O_LINKED_GLYCOSYLATION, GOBP_GLYCOPROTEIN_METABOLIC_PROCESS, GOMF_ACETYLGALACTOSAMINYLTRANSFERASE_ACTIVITY, GOMF_HEXOSYLTRANSFERASE_ACTIVITY, GOMF_GLYCOSYLTRANSFERASE_ACTIVITY, GOMF_POLYPEPTIDE_N_ACETYLGALACTOSAMINYLTRANSFERASE_ACTIVITY, GOMF_UDP_GLYCOSYLTRANSFERASE_ACTIVITY

GO Biological Process (4): glycosaminoglycan biosynthetic process (GO:0006024), protein O-linked glycosylation (GO:0006493), protein O-linked glycosylation via N-acetylgalactosamine (GO:0016266), obsolete protein glycosylation (GO:0006486)

GO Molecular Function (5): polypeptide N-acetylgalactosaminyltransferase activity (GO:0004653), carbohydrate binding (GO:0030246), metal ion binding (GO:0046872), transferase activity (GO:0016740), glycosyltransferase activity (GO:0016757)

GO Cellular Component (3): Golgi membrane (GO:0000139), Golgi apparatus (GO:0005794), membrane (GO:0016020)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
O-linked glycosylation1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
aminoglycan biosynthetic process1
glycosaminoglycan metabolic process1
glycoprotein biosynthetic process1
protein O-linked glycosylation1
acetylgalactosaminyltransferase activity1
catalytic activity, acting on a protein1
binding1
cation binding1
catalytic activity1
transferase activity1
Golgi apparatus1
bounding membrane of organelle1
cytoplasm1
endomembrane system1
intracellular membrane-bounded organelle1
cellular anatomical structure1

Protein interactions and networks

STRING

644 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GALNT5HOXB7P09629595
GALNT5ST3GAL5Q9UNP4492
GALNT5C1GALT1Q9NS00483
GALNT5ERMNQ8TAM6454
GALNT5HSPBAP1Q96EW2454
GALNT5SEC23IPQ9Y6Y8435
GALNT5ST6GALNAC1Q9NSC7429
GALNT5COPB1P53618424
GALNT5MACO1Q8N5G2414
GALNT5ST8SIA2Q92186409
GALNT5DPYSL4O14531403
GALNT5GCNT3O95395398
GALNT5SEC24AO95486383
GALNT5SEC24DO94855376
GALNT5WDSUB1Q8N9V3362

IntAct

22 interactions, top by confidence:

ABTypeScore
FBXO2TMEM131Lpsi-mi:“MI:0914”(association)0.530
C1orf54EXTL3psi-mi:“MI:0914”(association)0.530
GALNT5NONOpsi-mi:“MI:0915”(physical association)0.400
GALNT5CBLIFpsi-mi:“MI:0915”(physical association)0.400
ESYT2psi-mi:“MI:0914”(association)0.350
PGRMC1psi-mi:“MI:0914”(association)0.350
HAX1psi-mi:“MI:0914”(association)0.350
TMEM106AQSOX1psi-mi:“MI:0914”(association)0.350
TMEM106ATMEM131Lpsi-mi:“MI:0914”(association)0.350
BTNL2TMEM131Lpsi-mi:“MI:0914”(association)0.350
SFTPCTMEM131Lpsi-mi:“MI:0914”(association)0.350
BRICD5TMEM131Lpsi-mi:“MI:0914”(association)0.350
NRG1TMEM131Lpsi-mi:“MI:0914”(association)0.350
HLA-DRB1TMEM131Lpsi-mi:“MI:0914”(association)0.350
ISLRpsi-mi:“MI:0914”(association)0.350
PDGFRAQSOX1psi-mi:“MI:0914”(association)0.350
CNTNAP3ADAM10psi-mi:“MI:0914”(association)0.350
HLA-DRAMGRN1psi-mi:“MI:0914”(association)0.350
ASGR1GALNT5psi-mi:“MI:0914”(association)0.350

BioGRID (28): GIF (Affinity Capture-MS), GALNT5 (Affinity Capture-MS), GALNT5 (Proximity Label-MS), GALNT5 (Proximity Label-MS), GALNT5 (Affinity Capture-MS), GALNT5 (Affinity Capture-MS), GIF (Affinity Capture-MS), GALNT5 (Two-hybrid), GALNT5 (Reconstituted Complex), GALNT5 (Affinity Capture-MS), GALNT5 (Affinity Capture-MS), GALNT5 (Affinity Capture-MS), GALNT5 (Affinity Capture-MS), GALNT5 (Affinity Capture-MS), GALNT5 (Affinity Capture-MS)

ESM2 similar proteins: A2AMT1, A8MU46, E9PT23, H3JU05, O54963, O70318, O88422, P10636, P10637, P15304, P19332, P46087, P51954, Q13438, Q14028, Q28181, Q3MHX6, Q3UH66, Q3UIZ8, Q3UVR3, Q4ZHG4, Q5I012, Q5RC98, Q5RKH6, Q5S6V2, Q5TCY1, Q5YCV9, Q5YCW0, Q5YCW1, Q6PCF9, Q6PCN3, Q7Z7M9, Q811D2, Q811Q2, Q8BHB9, Q8BWQ5, Q8C102, Q8K2C7, Q91XA2, Q91Z96

Diamond homologs: A8Y236, H0ZAB5, O08832, O08912, O45293, O45947, O61394, O61397, O88422, P34678, P70419, Q07537, Q10471, Q10472, Q10473, Q14435, Q29121, Q49A17, Q5EA41, Q5RFJ6, Q6DJR8, Q6IS24, Q6P6V1, Q6P9A2, Q6P9S7, Q6PB93, Q6UE39, Q6WV16, Q6WV17, Q6WV19, Q6WV20, Q7K755, Q7TT15, Q7Z4T8, Q7Z7M9, Q80VA0, Q86SF2, Q86SR1, Q8BGT9, Q8BVG5

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

156 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic1
Uncertain significance134
Likely benign12
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
814332GRCh37/hg19 2q24.1(chr2:157117939-158422574)x1Likely pathogenic

SpliceAI

2131 predictions. Top by Δscore:

VariantEffectΔscore
2:157284270:ATTCT:Aacceptor_gain1.0000
2:157284274:T:TAacceptor_gain1.0000
2:157284447:AGG:Adonor_loss1.0000
2:157284449:G:GAdonor_loss1.0000
2:157284450:T:Gdonor_loss1.0000
2:157286003:A:AGacceptor_gain1.0000
2:157286003:ACTCT:Aacceptor_gain1.0000
2:157286134:GGTA:Gdonor_loss1.0000
2:157286135:G:Tdonor_loss1.0000
2:157286136:T:Gdonor_loss1.0000
2:157296386:T:TAacceptor_gain1.0000
2:157296392:A:AGacceptor_gain1.0000
2:157296393:G:GGacceptor_gain1.0000
2:157300925:TGCAA:Tdonor_gain1.0000
2:157305743:TTTTA:Tacceptor_loss1.0000
2:157305744:TTTA:Tacceptor_loss1.0000
2:157305745:TTA:Tacceptor_loss1.0000
2:157305746:TAGC:Tacceptor_loss1.0000
2:157305747:A:AGacceptor_gain1.0000
2:157305747:AGCTT:Aacceptor_loss1.0000
2:157305748:G:GGacceptor_gain1.0000
2:157305748:GCTT:Gacceptor_gain1.0000
2:157305827:GAG:Gdonor_gain1.0000
2:157308561:CCACA:Cacceptor_loss1.0000
2:157308562:CACAG:Cacceptor_loss1.0000
2:157308563:ACAG:Aacceptor_loss1.0000
2:157308564:CA:Cacceptor_loss1.0000
2:157308565:A:ACacceptor_loss1.0000
2:157308565:A:AGacceptor_gain1.0000
2:157308566:G:GGacceptor_gain1.0000

AlphaMissense

6234 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:157286096:T:CL568S1.000
2:157295684:A:CD588A1.000
2:157295684:A:TD588V1.000
2:157295685:T:AD588E1.000
2:157295685:T:GD588E1.000
2:157296446:T:CF644L1.000
2:157296448:T:AF644L1.000
2:157296448:T:GF644L1.000
2:157299633:T:AW695R1.000
2:157299633:T:CW695R1.000
2:157299635:G:CW695C1.000
2:157299635:G:TW695C1.000
2:157299647:T:AN699K1.000
2:157299647:T:GN699K1.000
2:157300727:C:GH723D1.000
2:157300729:T:AH723Q1.000
2:157300729:T:GH723Q1.000
2:157300733:T:CF725L1.000
2:157300735:C:AF725L1.000
2:157300735:C:GF725L1.000
2:157300737:G:CR726T1.000
2:157300737:G:TR726I1.000
2:157300738:A:CR726S1.000
2:157300738:A:TR726S1.000
2:157300792:C:AN744K1.000
2:157300792:C:GN744K1.000
2:157300814:T:AW752R1.000
2:157300814:T:CW752R1.000
2:157259487:A:CS469R0.999
2:157259489:C:AS469R0.999

dbSNP variants (sampled 300 via entrez): RS1000002853 (2:157278143 T>A), RS1000040614 (2:157275914 G>T), RS1000065718 (2:157308095 A>G), RS1000068143 (2:157266747 A>C), RS1000093618 (2:157308398 T>C), RS1000102106 (2:157275232 C>A,T), RS1000122712 (2:157286853 G>A), RS1000169435 (2:157273102 A>C), RS1000219178 (2:157314749 C>T), RS1000258951 (2:157287208 T>A,C,G), RS1000297302 (2:157260821 C>A,T), RS1000433836 (2:157279174 T>C), RS1000461081 (2:157285527 C>A,T), RS1000512818 (2:157291531 C>A,T), RS1000620265 (2:157279212 A>C)

Disease associations

OMIM: gene MIM:615129 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 associations (top):

StudyTraitp-value
GCST012311_25Schizophrenia x sex interaction5.000000e-06

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0008343sex interaction measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

38 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arseniteaffects methylation, decreases expression, increases expression5
Benzo(a)pyrenedecreases methylation, increases expression2
Tobacco Smoke Pollutionaffects expression, increases expression2
geldanamycinincreases expression1
urushiolincreases expression1
methylmercuric chlorideincreases expression1
lasiocarpineincreases expression1
propionaldehydeincreases expression1
bisphenol Aincreases methylation, affects cotreatment1
tobacco tardecreases expression1
potassium chromate(VI)decreases expression1
CGP 52608increases reaction, affects binding1
nutlin 3affects cotreatment, increases expression1
abrinedecreases expression1
MRK 003decreases expression1
jinfukangaffects cotreatment, increases expression1
(+)-JQ1 compounddecreases expression1
Resveratrolaffects cotreatment, decreases expression1
Sunitinibdecreases expression1
Fulvestrantaffects cotreatment, increases methylation1
Acetaminophendecreases expression1
Glyphosateincreases expression1
Air Pollutantsincreases abundance, increases expression1
Arsenicaffects methylation1
Cadmiumdecreases expression1
Camptothecinincreases expression1
Cisplatinaffects cotreatment, increases expression1
Dactinomycinaffects cotreatment, increases expression1
Dimethyl Sulfoxideincreases expression1
Doxorubicinincreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.