GBGT1

gene
On this page

Also known as UDP-GalNAcA3GALNTMGC44848FS

Summary

GBGT1 (globoside alpha-1,3-N-acetylgalactosaminyltransferase 1 (FORS blood group), HGNC:20460) is a protein-coding gene on chromosome 9q34.2, encoding Globoside alpha-1,3-N-acetylgalactosaminyltransferase 1 (Q8N5D6). Has lost the ability to synthesize Forssman glycolipid antigen (FORS1/FG).

This gene encodes a glycosyltransferase that plays a role in the synthesis of Forssman glycolipid (FG), a member of the globoseries glycolipid family. Glycolipids such as FG form attachment sites for the binding of pathogens to cells; expression of this protein may determine host tropism to microorganisms. Alternative splicing results in multiple transcript variants.

Source: NCBI Gene 26301 — RefSeq curated summary.

At a glance

  • GWAS associations: 1
  • Clinical variants (ClinVar): 77 total — 1 pathogenic
  • MANE Select transcript: NM_021996

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:20460
Approved symbolGBGT1
Namegloboside alpha-1,3-N-acetylgalactosaminyltransferase 1 (FORS blood group)
Location9q34.2
Locus typegene with protein product
StatusApproved
AliasesUDP-GalNAc, A3GALNT, MGC44848, FS
Ensembl geneENSG00000148288
Ensembl biotypeprotein_coding
OMIM606074
Entrez26301

Gene structure

Transcript identifiers

Ensembl transcripts: 32 — 29 protein_coding, 2 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000372036, ENST00000372038, ENST00000372040, ENST00000372043, ENST00000470431, ENST00000472281, ENST00000487864, ENST00000540636, ENST00000642916, ENST00000644196, ENST00000645973, ENST00000646215, ENST00000900748, ENST00000900749, ENST00000900750, ENST00000900751, ENST00000913413, ENST00000913414, ENST00000913415, ENST00000953293, ENST00000953294, ENST00000953295, ENST00000953296, ENST00000953297, ENST00000953298, ENST00000953299, ENST00000953300, ENST00000953301, ENST00000953302, ENST00000953303, ENST00000953304, ENST00000953305

RefSeq mRNA: 5 — MANE Select: NM_021996 NM_001282629, NM_001282632, NM_001288572, NM_001288573, NM_021996

CCDS: CCDS65175, CCDS65176, CCDS6960

Canonical transcript exons

ENST00000372040 — 7 exons

ExonStartEnd
ENSE00002215040133152948133154261
ENSE00003468855133155901133155936
ENSE00003550334133162342133162532
ENSE00003642386133155178133155312
ENSE00003667465133161467133161532
ENSE00003684805133156015133156065
ENSE00003814885133163754133163914

Expression profiles

Bgee: expression breadth ubiquitous, 208 present calls, max score 88.72.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 5.0089 / max 101.5805, expressed in 1254 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
1029653.93571147
1029661.0732511

Top tissues by expression

241 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
body of stomachUBERON:000116188.72gold quality
monocyteCL:000057687.94gold quality
leukocyteCL:000073887.57gold quality
bloodUBERON:000017887.51gold quality
stomachUBERON:000094585.79gold quality
right ovaryUBERON:000211885.09gold quality
left ovaryUBERON:000211985.05gold quality
granulocyteCL:000009484.98gold quality
right adrenal glandUBERON:000123384.56gold quality
right adrenal gland cortexUBERON:003582784.52gold quality
apex of heartUBERON:000209883.54gold quality
left adrenal glandUBERON:000123483.40gold quality
left adrenal gland cortexUBERON:003582583.14gold quality
vermiform appendixUBERON:000115482.29gold quality
adrenal cortexUBERON:000123582.02gold quality
fundus of stomachUBERON:000116081.81gold quality
omental fat padUBERON:001041481.63gold quality
peritoneumUBERON:000235881.57gold quality
pancreatic ductal cellCL:000207981.35silver quality
adrenal glandUBERON:000236981.22gold quality
germinal epithelium of ovaryUBERON:000130481.12gold quality
placentaUBERON:000198780.97gold quality
ileal mucosaUBERON:000033180.94silver quality
adipose tissue of abdominal regionUBERON:000780880.80gold quality
upper lobe of left lungUBERON:000895280.71gold quality
upper lobe of lungUBERON:000894880.70gold quality
left uterine tubeUBERON:000130380.69gold quality
ovaryUBERON:000099280.21gold quality
right lobe of thyroid glandUBERON:000111979.79gold quality
esophagogastric junction muscularis propriaUBERON:003584179.79gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes6.51
E-MTAB-7303no2.17

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

12 targeting GBGT1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-568099.9169.833421
HSA-MIR-428499.3665.251293
HSA-MIR-887-5P98.8265.901347
HSA-MIR-92A-1-5P98.2864.51631
HSA-MIR-4664-5P98.1765.071020
HSA-MIR-6819-5P97.9666.591071
HSA-MIR-63797.9164.051517
HSA-MIR-6737-5P97.7566.541044
HSA-MIR-6812-5P97.5665.391059
HSA-MIR-4693-5P97.3567.021234
HSA-MIR-451395.0467.06727
HSA-MIR-6855-3P95.0466.57725

Literature-anchored findings (GeneRIF, showing 4)

  • The Fs synthase gene, GBGT1, in A(pae) individuals encoded an arginine to glutamine change at residue 296. (PMID:23255552)
  • GBGT1 expression is epigenetically silenced through promoter hypermethylation in ovarian cancer. (PMID:25294702)
  • an amino acid substitution at codon 69 from methionine to threonine or serine (Met69Thr/Ser) modified enzymatic specificity of Blood group A/B glycosyltransferases and permitted FORS1 biosynthesis (PMID:29898878)
  • extended the knowledge of GBGT1 variants, allele frequencies, and the characteristics of naturally occurring antibodies in our newest carbohydrate blood group system, FORS. The finding of c.363C>A-homozygous donors indicates that GBGT1 is dispensable. (PMID:30277576)

Cross-species orthologs

10 orthologs

OrganismSymbolGene ID
danio_reriogbgt1l9ENSDARG00000011283
danio_reriogbgt1l5ENSDARG00000019207
danio_reriogbgt1l1ENSDARG00000025275
danio_reriogbgt1l3ENSDARG00000035555
danio_reriogbgt1l4ENSDARG00000068503
danio_reriogbgt1l7ENSDARG00000091936
danio_reriogbgt1l8ENSDARG00000091944
danio_reriogbgt1l6ENSDARG00000091969
danio_reriogbgt1l2ENSDARG00000092718
mus_musculusGbgt1ENSMUSG00000026829

Paralogs (3): ABO (ENSG00000175164), A3GALT2 (ENSG00000184389), GLT6D1 (ENSG00000204007)

Protein

Protein identifiers

Globoside alpha-1,3-N-acetylgalactosaminyltransferase 1Q8N5D6 (reviewed: Q8N5D6)

Alternative names: Forssman glycolipid synthase-like protein

All UniProt accessions (7): Q8N5D6, A0A2R8Y4A7, A0A2R8YGC1, J7PW20, J7Q0Z1, Q5T7U8, Q5T7V0

UniProt curated annotations — full annotation on UniProt →

Function. Has lost the ability to synthesize Forssman glycolipid antigen (FORS1/FG). Might have acquired an alternative function in glycosphingolipid metabolism, but it remains to be established. It appears to have drifted more slowly than confirmed pseudogenes in the glycosyltransferase 6 family, suggesting that it has remained under evolutionary pressure.

Subcellular location. Golgi apparatus membrane.

Tissue specificity. Widely expressed. Expressed at higher level in placenta, ovary and peripheral blood leukocyte, whereas it is weakly expressed in liver, thymus, and testis. Expressed in bone marrow erythroid cells.

Cofactor. Binds 1 Mn(2+) ion per subunit.

Domain organisation. The conserved DXD motif is involved in cofactor binding. The manganese ion interacts with the beta-phosphate group of UDP and may also have a role in catalysis.

Pathway. Protein modification; protein glycosylation.

Polymorphism. Common alleles GBGT101N.01 and GBGT101N.02 do not synthesize Forssman glycolipid antigen (FORS1). A rare allele encoding an arginine to glutamine change at residue 296 is associated with the ability to synthesize Forssman antigen, which is expressed in erythrocytes and is inheritable, thus defining a new histo-blood group FORS, also known as Apae. This variation might have arised as a consequence of the selective pressure exerted by microorganisms. For instance, the uropathogenic E.coli expressing prsG adhesin only binds and agglutinates FORS1-expressing erythrocytes. Thus, FORS1-positive individuals might be more susceptible to certain pathogens.

Similarity. Belongs to the glycosyltransferase 6 family.

Isoforms (3)

UniProt IDNamesCanonical?
Q8N5D6-11yes
Q8N5D6-22
Q8N5D6-33

RefSeq proteins (5): NP_001269558, NP_001269561, NP_001275501, NP_001275502, NP_068836* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR005076Glyco_trans_6Family
IPR029044Nucleotide-diphossugar_transHomologous_superfamily

Pfam: PF03414

Enzyme classification (BRENDA):

  • EC 2.4.1.88 — globoside alpha-N-acetylgalactosaminyltransferase (BRENDA: 6 organisms, 21 substrates, 33 inhibitors, 5 Km, 0 kcat entries)

Substrate kinetics (BRENDA)

2 substrates with measured Km, best-characterized 2. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
GLOBOSIDE0.0016–0.53
UDP-N-ACETYLGALACTOSAMINE0.01–0.12

UniProt features (25 total): sequence variant 9, binding site 5, sequence conflict 3, topological domain 2, splice variant 2, chain 1, glycosylation site 1, transmembrane region 1, active site 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8N5D6-F186.390.67

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 298 (nucleophile)

Ligand- & substrate-binding residues (5): 116–121; 206–208; 206; 208; 228–231

Glycosylation sites (1): 108

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 60 (showing top): GSE45365_NK_CELL_VS_CD8_TCELL_DN, GOBP_GLYCOLIPID_BIOSYNTHETIC_PROCESS, GOBP_CARBOHYDRATE_DERIVATIVE_METABOLIC_PROCESS, GTGCCTT_MIR506, KEGG_GLYCOSPHINGOLIPID_BIOSYNTHESIS_GLOBO_SERIES, GOBP_CARBOHYDRATE_DERIVATIVE_BIOSYNTHETIC_PROCESS, GOBP_CARBOHYDRATE_METABOLIC_PROCESS, GOBP_LIPID_METABOLIC_PROCESS, GOBP_LIPID_BIOSYNTHETIC_PROCESS, GOBP_LIPOSACCHARIDE_METABOLIC_PROCESS, GOBP_MEMBRANE_LIPID_METABOLIC_PROCESS, GOBP_MEMBRANE_LIPID_BIOSYNTHETIC_PROCESS, GOMF_ACETYLGALACTOSAMINYLTRANSFERASE_ACTIVITY, GOMF_HEXOSYLTRANSFERASE_ACTIVITY, GOMF_GLYCOSYLTRANSFERASE_ACTIVITY

GO Biological Process (3): carbohydrate metabolic process (GO:0005975), glycolipid biosynthetic process (GO:0009247), obsolete protein glycosylation (GO:0006486)

GO Molecular Function (5): metal ion binding (GO:0046872), globoside alpha-N-acetylgalactosaminyltransferase activity (GO:0047277), transferase activity (GO:0016740), glycosyltransferase activity (GO:0016757), hexosyltransferase activity (GO:0016758)

GO Cellular Component (4): Golgi membrane (GO:0000139), Golgi apparatus (GO:0005794), vesicle (GO:0031982), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
primary metabolic process1
glycolipid metabolic process1
lipid biosynthetic process1
carbohydrate derivative biosynthetic process1
cation binding1
acetylgalactosaminyltransferase activity1
catalytic activity1
transferase activity1
glycosyltransferase activity1
Golgi apparatus1
bounding membrane of organelle1
cytoplasm1
endomembrane system1
intracellular membrane-bounded organelle1
membrane-bounded organelle1
cellular anatomical structure1

Protein interactions and networks

STRING

480 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GBGT1B3GALNT1O75752610
GBGT1A4GALTQ9NPC4507
GBGT1MED13LQ71F56390
GBGT1SUMF1Q8NBK3380
GBGT1C22orf42Q6IC83377
GBGT1FUT3P21217372
GBGT1DOK2O60496370
GBGT1FUT7Q11130356
GBGT1FUT1P19526350
GBGT1ENTHD1Q8IYW4348
GBGT1B3GALT2O43825348
GBGT1B3GALT5Q9Y2C3348
GBGT1ST3GAL6Q9Y274324
GBGT1FUT2Q10981323
GBGT1B3GALT1Q9Y5Z6323

IntAct

2 interactions, top by confidence:

ABTypeScore
GBGT1dppDpsi-mi:“MI:0915”(physical association)0.000

BioGRID (1): GLT6D1 (Negative Genetic)

ESM2 similar proteins: A0A0D3QS98, A0A0D3QS99, A4D0V7, C5H5C4, F6Q1T7, O70309, O75354, P17405, P18084, P18424, P22413, P50747, P52850, P58242, P61642, P80747, Q04519, Q0VBD0, Q0VD19, Q13219, Q52KP5, Q58CQ9, Q5QQ51, Q5STE3, Q64687, Q6DFZ6, Q6KFX9, Q6MZW2, Q6P988, Q6UWX4, Q6YGZ1, Q6ZXD2, Q71RP1, Q812F8, Q8BJQ9, Q8C1F4, Q8C419, Q8N5D6, Q8N6G5, Q8R116

Diamond homologs: A0A4Z3, A1YGR5, A1YGR6, A2AUQ7, D3ZNQ3, G3V9Q9, P14769, P16442, P23336, P38649, P50127, Q2NKH9, Q2YDM8, Q3L7M0, Q3V1N9, Q4R5T7, Q5ZLK4, Q7Z4J2, Q8CFC4, Q8HY56, Q8HYB2, Q8N5D6, Q8SPR2, Q8SQ20, Q8VI38, Q95158, Q9ET32, U3KPV4, Q4G0N0, Q5JBG6

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

77 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance58
Likely benign7
Benign1

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
4075962GRCh37/hg19 9q33.2-34.3(chr9:124095694-141020389)x3Pathogenic

SpliceAI

1570 predictions. Top by Δscore:

VariantEffectΔscore
9:133155176:A:ACdonor_gain1.0000
9:133155177:C:CCdonor_gain1.0000
9:133155179:T:TAdonor_gain1.0000
9:133161465:A:ACdonor_gain1.0000
9:133161466:C:CCdonor_gain1.0000
9:133163752:ACT:Adonor_gain1.0000
9:133163753:CTC:Cdonor_gain1.0000
9:133154243:C:CCacceptor_gain0.9900
9:133155176:ACTT:Adonor_gain0.9900
9:133155177:CTT:Cdonor_gain0.9900
9:133155177:CTTC:Cdonor_gain0.9900
9:133155180:C:Adonor_gain0.9900
9:133155283:C:CTacceptor_gain0.9900
9:133156061:TGTTG:Tacceptor_gain0.9900
9:133156063:TTG:Tacceptor_gain0.9900
9:133156064:TG:Tacceptor_gain0.9900
9:133156066:C:CCacceptor_gain0.9900
9:133156073:C:CTacceptor_gain0.9900
9:133157502:AG:Adonor_gain0.9900
9:133157549:TG:Tdonor_gain0.9900
9:133162943:G:Cdonor_gain0.9900
9:133162990:G:Cdonor_gain0.9900
9:133163752:A:ACdonor_gain0.9900
9:133163753:C:CCdonor_gain0.9900
9:133163753:CT:Cdonor_gain0.9900
9:133155176:ACTTC:Adonor_gain0.9800
9:133155177:CTTCC:Cdonor_gain0.9800
9:133155276:C:CTacceptor_gain0.9800
9:133155280:A:Tacceptor_gain0.9800
9:133155896:CCTA:Cdonor_loss0.9800

AlphaMissense

2268 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
9:133155252:G:CF95L0.930
9:133155252:G:TF95L0.930
9:133155254:A:GF95L0.930
9:133153847:G:CF258L0.925
9:133153847:G:TF258L0.925
9:133153849:A:GF258L0.925
9:133153988:A:CF211L0.909
9:133153988:A:TF211L0.909
9:133153990:A:GF211L0.909
9:133154012:G:CF203L0.876
9:133154012:G:TF203L0.876
9:133154014:A:GF203L0.876
9:133154216:G:CF135L0.872
9:133154216:G:TF135L0.872
9:133154218:A:GF135L0.872
9:133153871:A:CF250L0.867
9:133153871:A:TF250L0.867
9:133153873:A:GF250L0.867
9:133153826:G:CF265L0.854
9:133153826:G:TF265L0.854
9:133153828:A:GF265L0.854
9:133153989:A:GF211S0.845
9:133154237:G:CF128L0.844
9:133154237:G:TF128L0.844
9:133154239:A:GF128L0.844
9:133153669:A:GY318H0.841
9:133154180:G:CF147L0.841
9:133154180:G:TF147L0.841
9:133154182:A:GF147L0.841
9:133153694:C:AK309N0.839

dbSNP variants (sampled 300 via entrez): RS1000260653 (9:133156533 G>A), RS1000296669 (9:133152567 G>T), RS1000333933 (9:133156410 C>T), RS1000357839 (9:133157724 C>T), RS1000412334 (9:133152904 G>A,C), RS1000821196 (9:133163279 G>A,C), RS1000888622 (9:133161994 G>A,T), RS1000896882 (9:133161949 C>G), RS1000961512 (9:133156629 C>T), RS1001115320 (9:133161671 G>A), RS1001171353 (9:133163072 C>A), RS1001212207 (9:133156608 C>T), RS1001548734 (9:133164605 T>C), RS1001770956 (9:133164355 C>G), RS1001807227 (9:133156897 G>T)

Disease associations

OMIM: gene MIM:606074 | disease phenotypes: MIM:191100

GenCC curated gene-disease

Mondo (1): tuberous sclerosis 1 (MONDO:0008612)

Orphanet (1): Tuberous sclerosis complex (Orphanet:805)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 associations (top):

StudyTraitp-value
GCST009391_770Metabolite levels9.000000e-06

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0010392sphingomyelin 16:1 measurement

MeSH disease descriptors (1)

DescriptorNameTree numbers
C565346Tuberous Sclerosis 1 (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

26 total (human), top 26 by PubMed support.

ChemicalActions (top 5)PubMed papers
triphenyl phosphateaffects expression1
terbufosincreases methylation1
sodium arsenitedecreases expression1
benzo(e)pyreneincreases methylation1
aflatoxin B2increases methylation1
S-(1,2-dichlorovinyl)cysteineaffects cotreatment, decreases expression1
CGP 52608affects binding, increases reaction1
entinostatincreases expression1
abrinedecreases expression1
bisphenol Sdecreases methylation1
Acetaminophendecreases expression1
Benzo(a)pyreneaffects methylation1
Carbamazepineaffects expression1
Fonofosincreases methylation1
Estradiolaffects cotreatment, increases expression1
Gallic Acidincreases expression1
Leadaffects methylation1
Lipopolysaccharidesaffects cotreatment, decreases expression1
Methapyrileneincreases methylation1
Parathionincreases methylation1
Tobacco Smoke Pollutiondecreases expression1
Tretinoindecreases expression1
Vanadatesdecreases expression1
Lithium Chlorideincreases expression1
Antirheumatic Agentsdecreases expression1
Particulate Matterdecreases expression1

Clinical trials (associated diseases)

4 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT02201212PHASE2COMPLETEDEverolimus for Cancer With TSC1 or TSC2 Mutation
NCT05103358PHASE2ACTIVE_NOT_RECRUITINGPhase 2 Basket Trial of Nab-sirolimus in Patients With Malignant Solid Tumors With Pathogenic Alterations in TSC1/TSC2 Genes (PRECISION 1)
NCT03655223Not specifiedENROLLING_BY_INVITATIONEarly Check: Expanded Screening in Newborns
NCT03817515Not specifiedAPPROVED_FOR_MARKETINGExpanded Access for ABI-009 in Patients With Advanced PEComa and Patients With a Malignancy With Relevant Genetic Mutations or mTOR Pathway Activation
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): tuberous sclerosis 1