CHST1

gene
On this page

Also known as C6STKSGal6ST

Summary

CHST1 (carbohydrate sulfotransferase 1, HGNC:1969) is a protein-coding gene on chromosome 11p11.2, encoding Carbohydrate sulfotransferase 1 (O43916). Sulfotransferase that utilizes 3’-phospho-5’-adenylyl sulfate (PAPS) as sulfonate donor to catalyze the transfer of sulfate to position 6 of internal galactose (Gal) residues of keratan.

This locus encodes a member of the keratin sulfotransferase family of proteins. The encoded enzyme catalyzes the sulfation of the proteoglycan keratin.

Source: NCBI Gene 8534 — RefSeq curated summary.

At a glance

  • GWAS associations: 6
  • Clinical variants (ClinVar): 45 total
  • MANE Select transcript: NM_003654

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:1969
Approved symbolCHST1
Namecarbohydrate sulfotransferase 1
Location11p11.2
Locus typegene with protein product
StatusApproved
AliasesC6ST, KSGal6ST
Ensembl geneENSG00000175264
Ensembl biotypeprotein_coding
OMIM603797
Entrez8534

Gene structure

Transcript identifiers

Ensembl transcripts: 28 — 26 protein_coding, 1 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000308064, ENST00000531322, ENST00000533673, ENST00000891796, ENST00000891797, ENST00000891798, ENST00000891799, ENST00000891800, ENST00000891801, ENST00000891802, ENST00000891803, ENST00000891804, ENST00000891805, ENST00000891806, ENST00000891807, ENST00000891808, ENST00000891809, ENST00000891810, ENST00000891811, ENST00000891812, ENST00000891813, ENST00000924349, ENST00000924350, ENST00000960329, ENST00000960330, ENST00000960331, ENST00000960332, ENST00000960333

RefSeq mRNA: 1 — MANE Select: NM_003654 NM_003654

CCDS: CCDS7913

Canonical transcript exons

ENST00000308064 — 4 exons

ExonStartEnd
ENSE000011824684565199345652090
ENSE000011824714565252145652606
ENSE000011824824564768945650965
ENSE000012525424566517845665622

Expression profiles

Bgee: expression breadth ubiquitous, 211 present calls, max score 97.03.

FANTOM5 (CAGE): breadth broad, TPM avg 5.3584 / max 194.6610, expressed in 717 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
1194765.1761712
1194750.151374
1194740.031015

Top tissues by expression

281 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
frontal poleUBERON:000279597.03gold quality
Brodmann (1909) area 10UBERON:001354196.76gold quality
middle frontal gyrusUBERON:000270294.29gold quality
parietal lobeUBERON:000187293.19gold quality
amygdalaUBERON:000187693.11gold quality
anterior cingulate cortexUBERON:000983592.95gold quality
right frontal lobeUBERON:000281092.94gold quality
cingulate cortexUBERON:000302792.93gold quality
postcentral gyrusUBERON:000258192.67gold quality
orbitofrontal cortexUBERON:000416792.66gold quality
frontal cortexUBERON:000187092.22gold quality
dorsolateral prefrontal cortexUBERON:000983492.09gold quality
temporal lobeUBERON:000187192.03gold quality
caudate nucleusUBERON:000187391.82gold quality
putamenUBERON:000187491.81gold quality
prefrontal cortexUBERON:000045191.72gold quality
neocortexUBERON:000195091.72gold quality
nucleus accumbensUBERON:000188291.71gold quality
superior frontal gyrusUBERON:000266191.60gold quality
cerebral cortexUBERON:000095691.42gold quality
Ammon’s hornUBERON:000195491.38gold quality
telencephalonUBERON:000189391.32gold quality
Brodmann (1909) area 9UBERON:001354091.15gold quality
Brodmann (1909) area 46UBERON:000648390.45gold quality
entorhinal cortexUBERON:000272890.28gold quality
forebrainUBERON:000189090.11gold quality
hypothalamusUBERON:000189889.94gold quality
ventral tegmental areaUBERON:000269189.53gold quality
superior vestibular nucleusUBERON:000722787.80gold quality
medial globus pallidusUBERON:000247787.63gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.31

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

135 targeting CHST1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-1252-5P100.0069.802774
HSA-MIR-4668-3P100.0068.742635
HSA-MIR-548AW99.9972.573559
HSA-MIR-1212199.9966.64255
HSA-MIR-428299.9975.366408
HSA-MIR-19A-3P99.9875.332762
HSA-MIR-19B-3P99.9875.442754
HSA-MIR-32-5P99.9875.211964
HSA-MIR-92A-3P99.9875.211960
HSA-MIR-92B-3P99.9875.251955
HSA-MIR-27A-3P99.9872.132955
HSA-MIR-27B-3P99.9872.132955
HSA-MIR-998599.9872.112939
HSA-MIR-25-3P99.9874.601817
HSA-MIR-363-3P99.9874.721821
HSA-MIR-367-3P99.9874.831819
HSA-MIR-569699.9872.364487
HSA-MIR-548P99.9872.253784
HSA-MIR-570-3P99.9672.414910
HSA-MIR-1468-3P99.9672.743797
HSA-MIR-548AJ-3P99.9673.385345
HSA-MIR-548X-3P99.9673.385345
HSA-MIR-96-5P99.9572.802140
HSA-MIR-391099.9571.132227
HSA-MIR-6835-3P99.9370.492904

Literature-anchored findings (GeneRIF, showing 6)

  • C4ST-1 and C6ST-1 differ from each other in the recognition of uronic acid residues adjacent to the targeted GalNAc residue (PMID:15324304)
  • The first to identify upregulated CHST1 protein gene expression correlating with the motility of Schwann cells that guide growing axons through both developmental and injured environments. (PMID:16495484)
  • sulfated keratan sulfate is produced by beta3GNT7, beta4GalT4, CGn6ST, and KSG6ST (PMID:17690104)
  • A new point mutation in C6ST-1 is associated with spondyloepiphyseal dysplasia, Omani type. (PMID:18698629)
  • Epitope expressing changes with suppression or over-expression of the Gal6ST (Gal 6-O-sulfotransferase) gene, suggesting that Gal6ST is involved in the biosynthesis of the unique epitopes of KL-6/mAb. (PMID:21880669)
  • found endogenous expression of the long form of the enzyme in human tissue, predominantly in the trans-Golgi network of endothelial cells that form human high endothelial venules. (PMID:22260995)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriochst1ENSDARG00000088593
mus_musculusChst1ENSMUSG00000027221
rattus_norvegicusChst1ENSRNOG00000007989
drosophila_melanogasterCG9550FBGN0031826
drosophila_melanogasterCG31637FBGN0051637

Paralogs (6): CHST3 (ENSG00000122863), CHST5 (ENSG00000135702), CHST4 (ENSG00000140835), CHST7 (ENSG00000147119), CHST2 (ENSG00000175040), CHST6 (ENSG00000183196)

Protein

Protein identifiers

Carbohydrate sulfotransferase 1O43916 (reviewed: O43916)

Alternative names: Galactose/N-acetylglucosamine/N-acetylglucosamine 6-O-sulfotransferase 1, Keratan sulfate Gal-6 sulfotransferase

All UniProt accessions (1): O43916

UniProt curated annotations — full annotation on UniProt →

Function. Sulfotransferase that utilizes 3’-phospho-5’-adenylyl sulfate (PAPS) as sulfonate donor to catalyze the transfer of sulfate to position 6 of internal galactose (Gal) residues of keratan. Cooperates with B4GALT4 and B3GNT7 glycosyltransferases and CHST6 sulfotransferase to construct and elongate disulfated disaccharide unit [->3(6-sulfoGalbeta)1->4(6-sulfoGlcNAcbeta)1->] within keratan sulfate polymer. Has a preference for sulfating keratan sulfate, but it also transfers sulfate to the unsulfated polymer. Involved in biosynthesis of phosphacan, a major keratan sulfate proteoglycan in the developing brain. Involved in biosynthesis of 6-sulfoGalbeta-containing O-linked glycans in high endothelial venules of lymph nodes. May act in a synergistic manner with CHST4 to generate sialyl 6’,6-disulfo Lewis X motif, a recognition determinant for immune cell receptors implicated in leukocyte trafficking. Catalyzes sulfation of N-acetyllactosamine (LacNAc) oligosaccharides with highest efficiency for sialylated LacNAc structures.

Subcellular location. Golgi apparatus membrane.

Tissue specificity. Widely expressed at low level. Expressed in brain and skeletal muscle. Expressed by high endothelial cells (HEVs) and leukocytes.

Pathway. Glycan metabolism.

Similarity. Belongs to the sulfotransferase 1 family. Gal/GlcNAc/GalNAc subfamily.

RefSeq proteins (1): NP_003645* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000863Sulfotransferase_domDomain
IPR016469Carbohydrate_sulfotransferaseFamily
IPR027417P-loop_NTPaseHomologous_superfamily
IPR051135Gal/GlcNAc/GalNAc_STFamily

Pfam: PF00685

Enzyme classification (BRENDA):

  • EC 2.8.2.21 — keratan sulfotransferase (BRENDA: 5 organisms, 27 substrates, 11 inhibitors, 3 Km, 0 kcat entries)

Substrate kinetics (BRENDA)

2 substrates with measured Km, best-characterized 2. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
KERATAN SULFATE0.025–0.382
3’-PHOSPHOADENYLYLSULFATE0.00061

UniProt features (11 total): glycosylation site 4, topological domain 2, binding site 2, chain 1, transmembrane region 1, short sequence motif 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O43916-F187.900.71

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (2): 69–75; 234–242

Glycosylation sites (4): 334, 56, 145, 189

Function

Pathways and Gene Ontology

Reactome pathways

1 pathways

IDPathway
R-HSA-2022854Keratan sulfate biosynthesis

MSigDB gene sets: 205 (showing top): MODULE_52, HORIUCHI_WTAP_TARGETS_DN, MODULE_255, GOBP_INFLAMMATORY_RESPONSE, RORA1_01, LI_WILMS_TUMOR, MODULE_45, GAUSSMANN_MLL_AF4_FUSION_TARGETS_G_DN, ROVERSI_GLIOMA_COPY_NUMBER_UP, MODULE_317, MODULE_16, MORF_RAD51L3, GOBP_CARBOHYDRATE_DERIVATIVE_METABOLIC_PROCESS, MODULE_66, AP1_Q4_01

GO Biological Process (8): polysaccharide metabolic process (GO:0005976), galactose metabolic process (GO:0006012), N-acetylglucosamine metabolic process (GO:0006044), sulfur compound metabolic process (GO:0006790), inflammatory response (GO:0006954), keratan sulfate proteoglycan biosynthetic process (GO:0018146), keratan sulfate proteoglycan metabolic process (GO:0042339), carbohydrate metabolic process (GO:0005975)

GO Molecular Function (5): N-acetylglucosamine 6-O-sulfotransferase activity (GO:0001517), sulfotransferase activity (GO:0008146), keratan sulfotransferase activity (GO:0045130), protein binding (GO:0005515), transferase activity (GO:0016740)

GO Cellular Component (3): Golgi membrane (GO:0000139), membrane (GO:0016020), Golgi apparatus (GO:0005794)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
Keratan sulfate/keratin metabolism1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
carbohydrate metabolic process1
macromolecule metabolic process1
hexose metabolic process1
amino sugar metabolic process1
metabolic process1
defense response1
proteoglycan biosynthetic process1
keratan sulfate proteoglycan metabolic process1
proteoglycan metabolic process1
primary metabolic process1
sulfotransferase activity1
transferase activity, transferring sulphur-containing groups1
proteoglycan sulfotransferase activity1
binding1
catalytic activity1
Golgi apparatus1
bounding membrane of organelle1
cellular anatomical structure1
cytoplasm1
endomembrane system1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

706 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CHST1LUMP51884756
CHST1SULT1C3Q6IMI6756
CHST1SULT1C4O75897729
CHST1KERAO60938727
CHST1SULT1B1O43704722
CHST1NTAN1Q96AB6636
CHST1CHST9Q7L1S5539
CHST1B3GNT7Q8NFL0499
CHST1GCNT1Q02742492
CHST1B4GALT4O60513435
CHST1SIAEQ9HAT2410
CHST1CHST12Q9NRB3410
CHST1SIGLEC8Q9NYZ4403
CHST1GAL3ST3Q96A11400
CHST1ELOAQ14241399

IntAct

9 interactions, top by confidence:

ABTypeScore
CHST1STOMpsi-mi:“MI:0915”(physical association)0.560
CHST1GPX8psi-mi:“MI:0915”(physical association)0.560
CHST1SNU13psi-mi:“MI:0915”(physical association)0.370
SFNCHST1psi-mi:“MI:0915”(physical association)0.370
CHST1GPX8psi-mi:“MI:0915”(physical association)0.000
CHST1STOMpsi-mi:“MI:0915”(physical association)0.000

BioGRID (6): CHST1 (Two-hybrid), CHST1 (Two-hybrid), CHST1 (Two-hybrid), CHST1 (Two-hybrid), HSP90AB1 (Cross-Linking-MS (XL-MS)), NHP2L1 (Two-hybrid)

ESM2 similar proteins: A0A8C2LVE3, A2BGL3, F4HXW9, O08889, O17645, O43909, O43916, O93336, O93403, O95461, P25722, P69478, P79948, Q0IIY2, Q2TBF2, Q5NDE4, Q5NDE5, Q5NDE6, Q5NDE7, Q5NDE8, Q5NVB3, Q5R621, Q5RJQ0, Q5XHM7, Q66PG1, Q66PG2, Q66PG3, Q6DBY9, Q6NVP8, Q6P9A2, Q6PA90, Q76EC5, Q76KB1, Q7LFX5, Q7LGA3, Q7LGC8, Q7T3S3, Q800H9, Q8BUB6, Q8CHI9

Diamond homologs: O43916, O88199, O93403, Q0VBN2, Q5RJQ0, Q6DBY9, Q6XQG8, Q7LGC8, Q80WV3, Q8IZU8, Q8NCG5, Q92179, Q9EP78, Q9EQC0, Q9GZS9, Q9GZX3, Q9NS84, Q9QUP4, Q9QZL2, Q9R1I1, Q9Y4C5

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

45 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance37
Likely benign1
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

862 predictions. Top by Δscore:

VariantEffectΔscore
11:45650966:C:CCacceptor_gain1.0000
11:45651988:CTTA:Cdonor_loss1.0000
11:45651989:TTA:Tdonor_loss1.0000
11:45651990:TA:Tdonor_loss1.0000
11:45651991:A:ACdonor_gain1.0000
11:45651992:C:Adonor_loss1.0000
11:45651992:C:CCdonor_gain1.0000
11:45652086:GCAGA:Gacceptor_gain1.0000
11:45652087:CAGA:Cacceptor_gain1.0000
11:45652087:CAGAC:Cacceptor_gain1.0000
11:45652088:AGA:Aacceptor_gain1.0000
11:45652089:GA:Gacceptor_gain1.0000
11:45652090:AC:Aacceptor_loss1.0000
11:45652091:C:CCacceptor_gain1.0000
11:45650961:GAGGT:Gacceptor_gain0.9900
11:45650963:GGT:Gacceptor_gain0.9900
11:45650964:GT:Gacceptor_gain0.9900
11:45650964:GTC:Gacceptor_loss0.9900
11:45650965:TCTG:Tacceptor_loss0.9900
11:45650966:C:Aacceptor_loss0.9900
11:45650967:T:Gacceptor_loss0.9900
11:45652088:AGACT:Aacceptor_gain0.9900
11:45652089:GACTG:Gacceptor_gain0.9900
11:45652090:ACTGC:Aacceptor_gain0.9900
11:45652091:CTGCA:Cacceptor_gain0.9900
11:45650962:AGGT:Aacceptor_gain0.9800
11:45651986:CACTT:Cdonor_loss0.9800
11:45651987:ACTTA:Adonor_loss0.9800
11:45652094:C:CTacceptor_gain0.9800
11:45651740:CTGGC:Cacceptor_gain0.9700

AlphaMissense

2650 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
11:45649844:C:AW360C1.000
11:45649844:C:GW360C1.000
11:45649846:A:GW360R1.000
11:45649846:A:TW360R1.000
11:45649931:C:AW331C1.000
11:45649931:C:GW331C1.000
11:45649933:A:GW331R1.000
11:45649933:A:TW331R1.000
11:45650306:C:AK206N1.000
11:45650306:C:GK206N1.000
11:45650468:G:CF152L1.000
11:45650468:G:TF152L1.000
11:45650470:A:GF152L1.000
11:45650649:G:TP92H1.000
11:45650652:T:AE91V1.000
11:45650654:A:CF90L1.000
11:45650654:A:TF90L1.000
11:45650656:A:GF90L1.000
11:45649796:G:CC376W0.999
11:45649797:C:GC376S0.999
11:45649797:C:TC376Y0.999
11:45649798:A:GC376R0.999
11:45649798:A:TC376S0.999
11:45649872:C:GR351P0.999
11:45649932:C:GW331S0.999
11:45650007:G:TA306D0.999
11:45650010:A:GL305P0.999
11:45650016:T:AE303V0.999
11:45650023:G:TR301S0.999
11:45650046:A:GL293P0.999

dbSNP variants (sampled 300 via entrez): RS1000103267 (11:45667408 A>C), RS1000256847 (11:45661489 C>T), RS1000484296 (11:45667074 C>T), RS1000594521 (11:45662708 G>A), RS1000627076 (11:45662942 C>A), RS1000696577 (11:45661833 A>AGGGCC), RS1000808888 (11:45649559 T>C,G), RS1000809366 (11:45657545 G>A,C), RS1000888858 (11:45665622 C>A,T), RS1000918353 (11:45665931 G>A), RS1000969290 (11:45657195 C>G), RS1001063246 (11:45660002 A>G), RS1001226186 (11:45667058 T>C), RS1001298228 (11:45660360 C>T), RS1001348098 (11:45661546 C>A)

Disease associations

OMIM: gene MIM:603797 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

6 associations (top):

StudyTraitp-value
GCST003671_3Diastolic blood pressure4.000000e-07
GCST004487_9Peak insulin response4.000000e-09
GCST004575_6Acute insulin response4.000000e-09
GCST006585_2780Blood protein levels3.000000e-08
GCST008437_4Alzheimer’s disease in hypertension-negative individuals4.000000e-07
GCST010002_237Refractive error1.000000e-10

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0006336diastolic blood pressure
EFO:0008000peak insulin response measurement
EFO:0006831acute insulin response measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

PharmGKB clinical annotations

1 annotations.

VariantTypeLevelDrugsPhenotypes
rs9787901Toxicity3imatinibGastrointestinal Stromal Tumors

CTD chemical–gene interactions

34 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects expression, increases expression3
sodium arseniteaffects cotreatment, decreases expression, increases abundance, increases expression2
Cyclosporineincreases expression2
propionaldehydeincreases expression1
bisphenol Adecreases methylation1
potassium perchloratedecreases expression1
methylparabenincreases expression1
butyraldehydeincreases expression1
manganese chlorideaffects cotreatment, decreases expression, increases abundance1
tri-o-cresyl phosphatedecreases expression1
butylparabenincreases expression1
S-(1,2-dichlorovinyl)cysteineaffects cotreatment, increases expression1
pentanalincreases expression1
S-1,2-dichlorovinyl-N-acetylcysteineaffects expression1
4-diethoxyphosphorylmethyl-N-(4-bromo-2-cyanophenyl)benzamideincreases expression1
jinfukangaffects cotreatment, increases expression1
Sunitinibincreases expression1
Arsenic Trioxidedecreases expression1
Acetaminophenincreases expression1
Aldehydesincreases expression1
Arsenicdecreases expression, increases abundance, affects cotreatment1
Arsenicalsdecreases expression1
Atrazineincreases expression1
Benzo(a)pyreneaffects methylation1
Cisplatinaffects cotreatment, increases expression1
Dustincreases expression1
Estradiolincreases expression, affects cotreatment1
Lipopolysaccharidesaffects cotreatment, increases expression1
Manganeseaffects cotreatment, decreases expression, increases abundance1
Silicon Dioxidedecreases expression1

Cellosaurus cell lines

1 cell lines: 1 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_E1MXHyCyte TE-1 KO-hCHST1Cancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.