GCNT2

gene
On this page

Also known as IGNTNAGCT1bA421M1.1bA360O19.2ULG3

Summary

GCNT2 (glucosaminyl (N-acetyl) transferase 2 (I blood group), HGNC:4204) is a protein-coding gene on chromosome 6p24.3-p24.2, encoding N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase (Q8N0V5). Branching enzyme that converts linear into branched poly-N-acetyllactosaminoglycans.

This gene encodes the enzyme responsible for formation of the blood group I antigen. The i and I antigens are distinguished by linear and branched poly-N-acetyllactosaminoglycans, respectively. The encoded protein is the I-branching enzyme, a beta-1,6-N-acetylglucosaminyltransferase responsible for the conversion of fetal i antigen to adult I antigen in erythrocytes during embryonic development. Mutations in this gene have been associated with adult i blood group phenotype. Alternatively spliced transcript variants encoding different isoforms have been described.

Source: NCBI Gene 2651 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): cataract 13 with adult I phenotype (Definitive, GenCC) — +1 more curated relationship
  • GWAS associations: 9
  • Clinical variants (ClinVar): 215 total — 21 pathogenic, 3 likely-pathogenic
  • Phenotypes (HPO): 2
  • MANE Select transcript: NM_145649

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:4204
Approved symbolGCNT2
Nameglucosaminyl (N-acetyl) transferase 2 (I blood group)
Location6p24.3-p24.2
Locus typegene with protein product
StatusApproved
AliasesIGNT, NAGCT1, bA421M1.1, bA360O19.2, ULG3
Ensembl geneENSG00000111846
Ensembl biotypeprotein_coding
OMIM600429
Entrez2651

Gene structure

Transcript identifiers

Ensembl transcripts: 21 — 10 protein_coding, 8 protein_coding_CDS_not_defined, 3 retained_intron

ENST00000265012, ENST00000316170, ENST00000379597, ENST00000397423, ENST00000410107, ENST00000459872, ENST00000461400, ENST00000474518, ENST00000474983, ENST00000475577, ENST00000483204, ENST00000485764, ENST00000488742, ENST00000489225, ENST00000489819, ENST00000495262, ENST00000910531, ENST00000910532, ENST00000942490, ENST00000942491, ENST00000942492

RefSeq mRNA: 4 — MANE Select: NM_145649 NM_001374747, NM_001491, NM_145649, NM_145655

CCDS: CCDS34338, CCDS4512, CCDS4513

Canonical transcript exons

ENST00000495262 — 5 exons

ExonStartEnd
ENSE000015767181052863110529836
ENSE000015783771052747410527660
ENSE000034715941062135110621443
ENSE000039011611062641710629368
ENSE000039024601052135110521417

Expression profiles

Bgee: expression breadth ubiquitous, 136 present calls, max score 85.05.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 7.2563 / max 172.5324, expressed in 1040 samples.

FANTOM5 promoters (26 alternative TSS)

Promoter IDTPM avgSamples expressed
657491.1884424
657550.9455418
657520.8341315
657700.7345153
657540.6158293
657510.5576318
657630.4616219
657600.377675
657560.2832138
657580.225596

Top tissues by expression

136 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099185.05gold quality
heart left ventricleUBERON:000208484.75gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047384.73gold quality
apex of heartUBERON:000209884.27gold quality
duodenumUBERON:000211482.52gold quality
mucosa of transverse colonUBERON:000499182.45gold quality
prostate glandUBERON:000236781.92gold quality
stomachUBERON:000094581.91gold quality
adult mammalian kidneyUBERON:000008281.65gold quality
body of stomachUBERON:000116181.54gold quality
kidneyUBERON:000211381.32gold quality
monocyteCL:000057680.52gold quality
leukocyteCL:000073880.04gold quality
endometriumUBERON:000129579.88gold quality
tonsilUBERON:000237279.83gold quality
liverUBERON:000210779.42gold quality
ganglionic eminenceUBERON:000402379.33gold quality
bone marrow cellCL:000209278.70gold quality
olfactory segment of nasal mucosaUBERON:000538678.52gold quality
lungUBERON:000204878.50gold quality
fundus of stomachUBERON:000116078.21gold quality
bone marrowUBERON:000237178.20gold quality
heartUBERON:000094877.32gold quality
ventricular zoneUBERON:000305377.28gold quality
right lungUBERON:000216777.05gold quality
right lobe of liverUBERON:000111476.98gold quality
cortex of kidneyUBERON:000122576.83gold quality
minor salivary glandUBERON:000183076.83gold quality
upper lobe of left lungUBERON:000895276.68gold quality
saliva-secreting glandUBERON:000104476.65gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-CURD-119yes39.54
E-ANND-3no2.03

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

127 targeting GCNT2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-5692A100.0074.406850
HSA-MIR-3120-5P100.0065.56965
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-4425100.0067.591049
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-477599.9875.006394
HSA-MIR-6888-3P99.9765.951170
HSA-MIR-570-3P99.9672.414910
HSA-MIR-495-3P99.9672.814197
HSA-MIR-568899.9673.234504
HSA-MIR-128-3P99.9571.172484
HSA-MIR-216A-3P99.9571.192505
HSA-MIR-23A-3P99.9574.243163
HSA-MIR-23B-3P99.9574.243163
HSA-MIR-23C99.9573.923192
HSA-MIR-141-3P99.9472.792421
HSA-MIR-200A-3P99.9472.682420
HSA-MIR-144-3P99.9473.982698
HSA-MIR-651-3P99.9473.485177
HSA-MIR-338-5P99.9272.342951
HSA-MIR-515-5P99.9269.822343
HSA-MIR-519E-5P99.9269.622358
HSA-MIR-6768-5P99.9267.361942
HSA-MIR-450B-5P99.9271.483175
HSA-MIR-4753-3P99.9071.033786
HSA-MIR-990299.8969.152250

Literature-anchored findings (GeneRIF, showing 13)

  • The I carbohydrate antigen interacts simultaneously with the entire hydrophobic patch in framework region 1 and with the outside surface of Ig heavy chain complementarity-determining region 3, leaving most of the site available for binding other antigens. (PMID:12244172)
  • A nonsense mutation in the GCNT2 gene isoforms is associated with autosomal recessive congenital cataract in four distantly related Arab families from Israel. (PMID:15161861)
  • role of C/EBPalpha in the induction of the IGnTC gene as well as in I antigen expression (PMID:17855628)
  • In the family with the “ii” blood group a novel GCNT2 mutation was found in the cataract patients. (PMID:21541272)
  • Results show involvement of GCNT2 in EMT and TGF-beta signaling, and further glycosylation modification of E-cadherin by GCNT2, are the underlying integrative mechanisms for breast cancer metastasis. (PMID:21750175)
  • An Alu repeat-mediated genomic GCNT2 deletion underlies congenital cataracts and adult i blood group (PMID:21761136)
  • Hypomethylation of the GCNT2 variant 2 reflected lymph node metastasis of colorectal cancer in the tumor and normal tissues. (PMID:25750292)
  • GCNT2 expression is closely associated with invasive potential of prostate cancer. (PMID:26678556)
  • This study reports a novel insertion/deletion mutation at the GCNT2 locus that is responsible for congenital cataracts in a large consanguineous family. (PMID:27936067)
  • miR-199a/b-5p regulates GCNT2 and I antigen expression in colon cancer cells undergoing EMT (PMID:28542779)
  • study suggested that GCNT2 was highly expressed in patients with esophageal squamous cell carcinoma and predicted adverse outcome; overexpression of GCNT2 induces epithelial-mesenchymal transition and promotes migration and invasion in ESCC cells; therefore, GCNT2 may act as a candidate prognostic indicator of outcome and a novel target in ESCC patients (PMID:30575058)
  • I-branched glycans catalyzed principally by the I-branching enzyme GCNT2 are now indicated in several malignancies. (PMID:31213534)
  • Melanoma-associated glycosyltransferase GCNT2 as an emerging biomarker and therapeutic target. (PMID:33660254)

Cross-species orthologs

20 orthologs

OrganismSymbolGene ID
rattus_norvegicusGcnt2ENSRNOG00000023778
drosophila_melanogasteroxtFBGN0015360
drosophila_melanogasterCG9164FBGN0030634
caenorhabditis_elegansgly-15WBGENE00001640
caenorhabditis_elegansgly-16WBGENE00001641
caenorhabditis_elegansgly-17WBGENE00001642
caenorhabditis_elegansgly-18WBGENE00001643
caenorhabditis_elegansgly-19WBGENE00001644
caenorhabditis_elegansWBGENE00005024
caenorhabditis_elegansWBGENE00009148
caenorhabditis_elegansF30A10.4WBGENE00009263
caenorhabditis_elegansWBGENE00011090
caenorhabditis_elegansT09E11.6WBGENE00011655
caenorhabditis_elegansT09E11.9WBGENE00011658
caenorhabditis_elegansT27F6.1WBGENE00012102
caenorhabditis_elegansWBGENE00012135
caenorhabditis_elegansWBGENE00013119
caenorhabditis_elegansZK1225.2WBGENE00014236
caenorhabditis_elegansWBGENE00019270
caenorhabditis_elegansWBGENE00019919

Paralogs (8): XYLT2 (ENSG00000015532), WSCD2 (ENSG00000075035), XYLT1 (ENSG00000103489), GCNT7 (ENSG00000124091), GCNT3 (ENSG00000140297), GCNT4 (ENSG00000176928), WSCD1 (ENSG00000179314), GCNT1 (ENSG00000187210)

Protein

Protein identifiers

N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferaseQ8N0V5 (reviewed: Q8N0V5)

Alternative names: I-branching enzyme, IGNT

All UniProt accessions (2): Q8N0V5, B7ZBL3

UniProt curated annotations — full annotation on UniProt →

Function. Branching enzyme that converts linear into branched poly-N-acetyllactosaminoglycans. Introduces the blood group I antigen during embryonic development. It is closely associated with the development and maturation of erythroid cells. Determines the expression of the blood group I antigen in erythrocytes.

Subcellular location. Golgi apparatus membrane.

Tissue specificity. Expressed in lens epithelium cells. Expressed in reticulocytes.

Disease relevance. Cataract 13, with adult i phenotype (CTRCT13) [MIM:116700] An opacification of the crystalline lens of the eye that frequently results in visual impairment or blindness. Opacities vary in morphology, are often confined to a portion of the lens, and may be static or progressive. In general, the more posteriorly located and dense an opacity, the greater the impact on visual function. CTRCT13 is associated with the rare adult i phenotype, in which adult red blood cells are rich in i antigen and contain low levels of I antigen. The disease is caused by variants affecting the gene represented in this entry.

Pathway. Protein modification; protein glycosylation.

Polymorphism. GCNT2 is involved in determining the blood group I system (Ii) [MIM:110800]. The i (fetal) and I (adult) antigens are determined by linear and branched poly-N-acetyllactosaminoglycans, respectively. A replacement during development of i by I is dependent on the appearance of a beta-1,6-N-acetylglucosaminyltransferase, the I-branching enzyme. The expression of the blood group I antigen in erythrocytes is determined by isoform C of GCNT2.

Similarity. Belongs to the glycosyltransferase 14 family.

Isoforms (3)

UniProt IDNamesCanonical?
Q8N0V5-1A, IGnTA, IGNT1yes
Q8N0V5-2B, IGnTB, IGNT2
Q8N0V5-3C, IGnTC, IGNT3

RefSeq proteins (4): NP_001361676, NP_001482, NP_663624, NP_663630 (=MANE)

Domains & families (InterPro)

IDNameType
IPR003406Glyco_trans_14Family

Pfam: PF02485

Enzyme classification (BRENDA):

  • EC 2.4.1.150 — N-acetyllactosaminide beta-1,6-N-acetylglucosaminyltransferase (BRENDA: 14 organisms, 65 substrates, 15 inhibitors, 5 Km, 0 kcat entries)

Substrate kinetics (BRENDA)

5 substrates with measured Km, best-characterized 5. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
GALBETA(1-4)GLCNACBETA(1-3)GALBETA(1-4)GLC0.441
GALBETA(1-4)GLCNACBETA(1-3)GALBETA(1-4)GLCNACBET0.551
NEUACALPHA(2-6)GALBETA(1-4)GLCNACBETA(1-3)GALBET0.521
PYRIDYLAMINATED LACTO-N-NEOTETRAOSE0.961
UDP-N-ACETYL-D-GLUCOSAMINE2.591

Catalyzed reactions (Rhea), 1 shown:

  • a beta-D-Gal-(1->4)-beta-D-GlcNAc-(1->3)-beta-D-Gal-(1->4)-beta-D-GlcNAc derivative + UDP-N-acetyl-alpha-D-glucosamine = a beta-D-Gal-(1->4)-beta-D-GlcNAc-(1->3)-[beta-D-GlcNAc-(1->6)]-beta-D-Gal-(1->4)-N-acetyl-beta-D-GlcNAc derivative + UDP + H(+) (RHEA:54820)

UniProt features (13 total): sequence variant 5, topological domain 2, splice variant 2, chain 1, sequence conflict 1, transmembrane region 1, glycosylation site 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8N0V5-F191.660.80

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Glycosylation sites (1): 41

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 280 (showing top): GSE45365_NK_CELL_VS_CD11B_DC_DN, HNF3ALPHA_Q6, GOBP_POSITIVE_REGULATION_OF_EPITHELIAL_TO_MESENCHYMAL_TRANSITION, chr6p24, GOBP_REGULATION_OF_HETEROTYPIC_CELL_CELL_ADHESION, GRAESSMANN_APOPTOSIS_BY_SERUM_DEPRIVATION_DN, GRAESSMANN_RESPONSE_TO_MC_AND_SERUM_DEPRIVATION_DN, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_UP, GOBP_POSITIVE_REGULATION_OF_MAPK_CASCADE, GOBP_CARBOHYDRATE_DERIVATIVE_METABOLIC_PROCESS, GOBP_AMINOGLYCAN_BIOSYNTHETIC_PROCESS, GOBP_POSITIVE_REGULATION_OF_CELL_ADHESION, GOBP_NEGATIVE_REGULATION_OF_CELL_SUBSTRATE_ADHESION, GOBP_CELL_CELL_ADHESION, GOBP_POST_TRANSCRIPTIONAL_REGULATION_OF_GENE_EXPRESSION

GO Biological Process (14): glycosaminoglycan biosynthetic process (GO:0006024), transforming growth factor beta receptor signaling pathway (GO:0007179), multicellular organism development (GO:0007275), positive regulation of cell population proliferation (GO:0008284), glycoprotein biosynthetic process (GO:0009101), post-transcriptional regulation of gene expression (GO:0010608), positive regulation of epithelial to mesenchymal transition (GO:0010718), negative regulation of cell-substrate adhesion (GO:0010812), positive regulation of cell migration (GO:0030335), positive regulation of heterotypic cell-cell adhesion (GO:0034116), maintenance of lens transparency (GO:0036438), positive regulation of phosphatidylinositol 3-kinase/protein kinase B signal transduction (GO:0051897), positive regulation of ERK1 and ERK2 cascade (GO:0070374), obsolete protein glycosylation (GO:0006486)

GO Molecular Function (6): N-acetyllactosaminide beta-1,6-N-acetylglucosaminyltransferase activity (GO:0008109), acetylglucosaminyltransferase activity (GO:0008375), protein binding (GO:0005515), UDP-glycosyltransferase activity (GO:0008194), transferase activity (GO:0016740), glycosyltransferase activity (GO:0016757)

GO Cellular Component (3): Golgi membrane (GO:0000139), membrane (GO:0016020), Golgi apparatus (GO:0005794)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
aminoglycan biosynthetic process1
glycosaminoglycan metabolic process1
cellular response to transforming growth factor beta stimulus1
transforming growth factor beta receptor superfamily signaling pathway1
multicellular organismal process1
anatomical structure development1
cell population proliferation1
regulation of cell population proliferation1
positive regulation of cellular process1
macromolecule biosynthetic process1
glycoprotein metabolic process1
carbohydrate derivative biosynthetic process1
regulation of gene expression1
epithelial to mesenchymal transition1
regulation of epithelial to mesenchymal transition1
positive regulation of cell differentiation1
positive regulation of multicellular organismal process1
negative regulation of cell adhesion1
regulation of cell-substrate adhesion1
cell-substrate adhesion1
cell migration1
regulation of cell migration1
positive regulation of cell motility1
positive regulation of cell-cell adhesion1
heterotypic cell-cell adhesion1
regulation of heterotypic cell-cell adhesion1
tissue homeostasis1
phosphatidylinositol 3-kinase/protein kinase B signal transduction1
regulation of phosphatidylinositol 3-kinase/protein kinase B signal transduction1
positive regulation of intracellular signal transduction1
positive regulation of MAPK cascade1
ERK1 and ERK2 cascade1
regulation of ERK1 and ERK2 cascade1
acetylglucosaminyltransferase activity1
UDP-glycosyltransferase activity1
hexosyltransferase activity1
binding1
glycosyltransferase activity1
catalytic activity1
transferase activity1

Protein interactions and networks

STRING

840 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GCNT2B3GNT2Q9NY97876
GCNT2GYPCP04921843
GCNT2CRYBB3P26998815
GCNT2TRNT1Q96Q11779
GCNT2NHSQ6T4R5768
GCNT2CRYGDP07320748
GCNT2MGAT5Q09328685
GCNT2CRYBB2P43320684
GCNT2MGAT5BQ3V5L5667
GCNT2MGAT2Q10469646
GCNT2HSF4Q9ULV5636
GCNT2MGAT4AQ9UM21616
GCNT2ADGRL2O95490613
GCNT2BFSP1Q12934607
GCNT2MGAT4BQ9UQ53606

IntAct

6 interactions, top by confidence:

ABTypeScore
TMEM79GCNT2psi-mi:“MI:0915”(physical association)0.590
GCNT2ATP2A1psi-mi:“MI:0914”(association)0.350
GCNT2CLGNpsi-mi:“MI:0914”(association)0.350

BioGRID (64): ATP2A1 (Affinity Capture-MS), METTL9 (Affinity Capture-MS), GHITM (Affinity Capture-MS), COL14A1 (Affinity Capture-MS), TRIM68 (Affinity Capture-MS), FAM3C (Affinity Capture-MS), HLA-DPB1 (Affinity Capture-MS), GCNT2 (Affinity Capture-MS), GCNT2 (Affinity Capture-MS), TRIM68 (Affinity Capture-MS), ATP2A1 (Affinity Capture-MS), GHITM (Affinity Capture-MS), COL14A1 (Affinity Capture-MS), FAM3C (Affinity Capture-MS), HLA-DPB1 (Affinity Capture-MS)

ESM2 similar proteins: A0A2C9JXL4, E9Q649, O95395, P0DN25, P97402, Q08BL3, Q0VC84, Q18515, Q24342, Q3SX46, Q499P3, Q5F3G7, Q5HZL5, Q5U258, Q5XJP0, Q5YB40, Q66GS2, Q6A1G2, Q6DE15, Q6DJR8, Q6GNL1, Q6P3P5, Q6P6V1, Q6WV16, Q6Y288, Q7K237, Q7SYI5, Q7T3S5, Q7Z1Z1, Q864U8, Q866Z5, Q8BGY6, Q8BHT6, Q8L7M1, Q8LPF8, Q8N0V5, Q8NCW6, Q96EU7, Q99NB2, Q9BYG0

Diamond homologs: A5GFW8, E9Q649, O95395, P97402, Q02742, Q09324, Q1M0V6, Q3V3K7, Q5JCT0, Q5U258, Q6ZNI0, Q71SG7, Q7YQE1, Q805R1, Q80RC7, Q866Z4, Q866Z5, Q866Z6, Q8CH87, Q8N0V5, Q92180, Q99CW3, Q9IZK2, Q9P109, Q5QQ53, Q7KVA1, Q5QQ54, Q5QQ55, A0A1U9VX91, A2BGL3, D4AUF1, D4AUF4, P54867, P83097, P84675, Q0IIY2, Q505J3, Q5QQ49, Q5QQ50, Q5QQ51

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

215 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic21
Likely pathogenic3
Uncertain significance108
Likely benign33
Benign39

Top pathogenic / likely-pathogenic (24)

Variant IDHGVSClassification
1065633NM_145649.5(GCNT2):c.14G>A (p.Trp5Ter)Pathogenic
1437805NM_145649.5(GCNT2):c.1000A>T (p.Arg334Ter)Pathogenic
2424190NC_000006.11:g.(?10556657)(10626840_?)delPathogenic
252954NM_145649.5(GCNT2):c.1091T>C (p.Phe364Ser)Pathogenic
2685194GRCh37/hg19 6p24.3(chr6:10099993-10564232)x1Pathogenic
2918140NM_001491.3(GCNT2):c.760del (p.His254fs)Pathogenic
4028900NM_001491.3(GCNT2):c.760dup (p.His254fs)Pathogenic
4279311GRCh37/hg19 6p24.3-24.2(chr6:10535030-10630002)x1Pathogenic
571035NM_001491.3(GCNT2):c.60del (p.Ile20fs)Pathogenic
571381NM_001491.3(GCNT2):c.710_711insT (p.Lys237fs)Pathogenic
685194GRCh37/hg19 6p24.3-24.2(chr6:10541521-10645806)x1Pathogenic
686169GRCh37/hg19 6p24.3(chr6:10467100-10530991)x1Pathogenic
686306GRCh37/hg19 6p24.3(chr6:10528333-10575149)x1Pathogenic
687321GRCh37/hg19 6p24.3-24.2(chr6:10546386-10640180)x1Pathogenic
687866GRCh37/hg19 6p24.3(chr6:10483812-10560100)x1Pathogenic
830705NC_000006.12:g.(?10528892)(10557362_?)delPathogenic
830756NC_000006.12:g.(?10556404)(10626627_?)delPathogenic
9128NM_145649.5(GCNT2):c.1049G>A (p.Gly350Glu)Pathogenic
9130NM_001491.2(GCNT2):c.-117_*41delPathogenic
9131NM_145649.5(GCNT2):c.505G>A (p.Ala169Thr)Pathogenic
9132NM_145649.5(GCNT2):c.683G>A (p.Arg228Gln)Pathogenic
252955NM_145649.5(GCNT2):c.1169_1172del (p.Asn390fs)Likely pathogenic
4849340NM_145649.5(GCNT2):c.651dup (p.Gly218fs)Likely pathogenic
4849446NM_001491.3(GCNT2):c.707_708dup (p.Lys237fs)Likely pathogenic

SpliceAI

616 predictions. Top by Δscore:

VariantEffectΔscore
6:10586857:TCAA:Tdonor_gain0.9900
6:10586912:CAG:Cdonor_loss0.9900
6:10586913:AGGT:Adonor_loss0.9900
6:10586915:G:GAdonor_loss0.9900
6:10586916:T:Adonor_loss0.9900
6:10621350:GGT:Gacceptor_gain0.9900
6:10621439:CCACG:Cdonor_loss0.9900
6:10621440:CACGG:Cdonor_loss0.9900
6:10621441:ACGGT:Adonor_loss0.9900
6:10621442:CGGTG:Cdonor_loss0.9900
6:10621443:GGTGA:Gdonor_loss0.9900
6:10621444:G:GAdonor_loss0.9900
6:10621445:T:Cdonor_loss0.9900
6:10626411:TTGCA:Tacceptor_loss0.9900
6:10626412:TGCA:Tacceptor_loss0.9900
6:10626413:GCAG:Gacceptor_loss0.9900
6:10626414:CAGGC:Cacceptor_loss0.9900
6:10626415:A:AGacceptor_gain0.9900
6:10626415:AGGCC:Aacceptor_loss0.9900
6:10626416:G:GAacceptor_gain0.9900
6:10626416:G:GTacceptor_loss0.9900
6:10626416:GGC:Gacceptor_gain0.9900
6:10626416:GGCC:Gacceptor_gain0.9900
6:10626416:GGCCA:Gacceptor_gain0.9900
6:10586915:G:GGdonor_gain0.9800
6:10621344:A:AGacceptor_gain0.9800
6:10621344:ATT:Aacceptor_gain0.9800
6:10621345:TTGCA:Tacceptor_loss0.9800
6:10621346:TGCA:Tacceptor_loss0.9800
6:10621347:GCA:Gacceptor_loss0.9800

AlphaMissense

2654 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:10621409:G:CW328C0.997
6:10621409:G:TW328C0.997
6:10621437:T:AC338S0.997
6:10621438:G:CC338S0.997
6:10626437:T:AC347S0.997
6:10626438:G:AC347Y0.997
6:10626438:G:CC347S0.997
6:10626439:T:GC347W0.997
6:10626499:G:CK367N0.997
6:10626499:G:TK367N0.997
6:10621407:T:AW328R0.996
6:10621407:T:CW328R0.996
6:10626437:T:CC347R0.996
6:10626438:G:TC347F0.996
6:10621437:T:CC338R0.994
6:10621438:G:AC338Y0.994
6:10626552:G:CR385P0.994
6:10529553:T:AN214K0.993
6:10529553:T:GN214K0.993
6:10529812:T:AW301R0.993
6:10529812:T:CW301R0.993
6:10621439:C:GC338W0.993
6:10626492:C:AA365D0.993
6:10626497:A:GK367E0.993
6:10621406:G:CK327N0.992
6:10621406:G:TK327N0.992
6:10626498:A:TK367M0.992
6:10621438:G:TC338F0.991
6:10626466:G:CW356C0.991
6:10626466:G:TW356C0.991

dbSNP variants (sampled 300 via entrez): RS1000000606 (6:10619883 A>G,T), RS1000030889 (6:10626003 T>A,C), RS1000031283 (6:10563253 T>G), RS1000092113 (6:10557499 T>G), RS1000106410 (6:10620105 A>G), RS1000121216 (6:10597762 A>G), RS1000121624 (6:10593389 G>A), RS1000123339 (6:10557680 A>T), RS1000185654 (6:10561514 T>C), RS1000186442 (6:10528084 G>C,T), RS1000194516 (6:10594751 C>T), RS1000210542 (6:10533094 G>A), RS1000239291 (6:10598021 A>C,G,T), RS1000245419 (6:10598693 T>C), RS1000285372 (6:10609535 G>A,T)

Disease associations

OMIM: gene MIM:600429 | disease phenotypes: MIM:116700

GenCC curated gene-disease

DiseaseClassificationInheritance
cataract 13 with adult I phenotypeDefinitiveAutosomal recessive
total early-onset cataractSupportiveAutosomal dominant

Mondo (2): cataract 13 with adult I phenotype (MONDO:0007289), total early-onset cataract (MONDO:0021548)

Orphanet (1): Early onset non-syndromic cataract (Orphanet:91492)

HPO phenotypes

2 total (2 of 2 shown, HPO-id order):

HPOTerm
HP:0000007Autosomal recessive inheritance
HP:0000519Developmental cataract

GWAS associations

9 associations (top):

StudyTraitp-value
GCST004609_60Monocyte percentage of white cells8.000000e-10
GCST004625_22Monocyte count4.000000e-18
GCST008153_65Lean body mass8.000000e-07
GCST008769_5Perceived intensity of glucose8.000000e-06
GCST009172_2Response to (pegylated) interferon in HBeAg-negative hepatitis B3.000000e-06
GCST90002388_108Lymphocyte count2.000000e-10
GCST90002393_8Monocyte count3.000000e-44
GCST90002394_37Monocyte percentage of white cells2.000000e-23
GCST90002407_233White blood cell count5.000000e-12

EFO canonical traits (5, from GWAS)

EFO IDTrait name
EFO:0007989monocyte percentage of leukocytes
EFO:0005091monocyte count
EFO:0004995lean body mass
EFO:0007859response to interferon
EFO:0004587lymphocyte count

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

43 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, decreases expression, increases expression4
sodium arseniteaffects cotreatment, increases abundance, increases expression3
potassium chromate(VI)decreases expression, affects cotreatment2
Acetaminophendecreases expression2
Formaldehydedecreases expression2
Smokedecreases expression, increases expression2
arseniteincreases methylation1
manganese chlorideaffects cotreatment, increases abundance, increases expression1
nickel sulfateincreases expression1
vanadyl sulfateincreases expression1
epigallocatechin gallateaffects cotreatment, decreases expression1
chromium hexavalent iondecreases expression1
CGP 52608affects binding, increases reaction1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression1
dorsomorphinaffects cotreatment, decreases expression1
PCI 5002affects cotreatment, increases expression1
Bortezomibincreases expression1
Resveratrolaffects cotreatment, increases expression1
Temozolomideincreases expression1
Sunitinibdecreases expression1
Vorinostataffects cotreatment, decreases expression1
Arsenicaffects cotreatment, increases abundance, increases expression1
Azacitidinedecreases expression1
Cisplatinaffects response to substance1
Cytarabinedecreases expression1
Diazinonincreases methylation1
Bucladesineaffects cotreatment, decreases expression1
Doxorubicindecreases expression1
Estradiolaffects cotreatment, decreases expression1
Ethyl Methanesulfonatedecreases expression1

Clinical trials (associated diseases)

1 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT06068348Not specifiedACTIVE_NOT_RECRUITINGLiquid Biopsy Collection Study