CHST9

gene
On this page

Also known as GALNAC4ST-2GALNAC-4-ST2

Summary

CHST9 (carbohydrate sulfotransferase 9, HGNC:19898) is a protein-coding gene on chromosome 18q11.2, encoding Carbohydrate sulfotransferase 9 (Q7L1S5). Catalyzes the transfer of sulfate from 3’-phosphoadenylyl sulfate (PAPS) to position 4 of non-reducing N-acetylgalactosamine (GalNAc) residues in both N-glycans and O-glycans.

The protein encoded by this gene belongs to the sulfotransferase 2 family. It is localized to the golgi membrane, and catalyzes the transfer of sulfate to position 4 of non-reducing N-acetylgalactosamine (GalNAc) residues in both N-glycans and O-glycans. Sulfate groups on carbohydrates confer highly specific functions to glycoproteins, glycolipids, and proteoglycans, and are critical for cell-cell interaction, signal transduction, and embryonic development. Alternatively spliced transcript variants have been described for this gene.

Source: NCBI Gene 83539 — RefSeq curated summary.

At a glance

  • GWAS associations: 9
  • Clinical variants (ClinVar): 22 total — 1 pathogenic
  • MANE Select transcript: NM_031422

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:19898
Approved symbolCHST9
Namecarbohydrate sulfotransferase 9
Location18q11.2
Locus typegene with protein product
StatusApproved
AliasesGALNAC4ST-2, GALNAC-4-ST2
Ensembl geneENSG00000154080
Ensembl biotypeprotein_coding
OMIM610191
Entrez83539

Gene structure

Transcript identifiers

Ensembl transcripts: 5 — 5 protein_coding

ENST00000580774, ENST00000581714, ENST00000618847, ENST00000932774, ENST00000951345

RefSeq mRNA: 3 — MANE Select: NM_031422 NM_001256316, NM_001398493, NM_031422

CCDS: CCDS42422, CCDS58618

Canonical transcript exons

ENST00000618847 — 6 exons

ExonStartEnd
ENSE000017475872704846527048503
ENSE000017992492718513627185308
ENSE000018063622694432926944366
ENSE000022454322714268927142905
ENSE000037309002690648126917350
ENSE000037403522702411627024157

Expression profiles

Bgee: expression breadth ubiquitous, 139 present calls, max score 99.34.

FANTOM5 (CAGE): breadth broad, TPM avg 1.4858 / max 113.8425, expressed in 342 samples.

FANTOM5 promoters (5 alternative TSS)

Promoter IDTPM avgSamples expressed
1714870.6791261
1714880.4681234
1714860.283880
1714850.041326
1714890.01364

Top tissues by expression

237 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
bronchial epithelial cellCL:000232899.34gold quality
bronchusUBERON:000218598.80gold quality
olfactory segment of nasal mucosaUBERON:000538695.79gold quality
mucosa of paranasal sinusUBERON:000503095.32gold quality
nasal cavity epitheliumUBERON:000538494.38gold quality
nasal cavity mucosaUBERON:000182692.93gold quality
epithelium of nasopharynxUBERON:000195190.70gold quality
islet of LangerhansUBERON:000000689.99gold quality
tracheaUBERON:000312688.53gold quality
epithelial cell of pancreasCL:000008385.65gold quality
parotid glandUBERON:000183185.22gold quality
right uterine tubeUBERON:000130281.10gold quality
pancreasUBERON:000126479.11gold quality
saliva-secreting glandUBERON:000104478.38gold quality
minor salivary glandUBERON:000183077.79gold quality
ventricular zoneUBERON:000305377.31gold quality
oviduct epitheliumUBERON:000480476.58gold quality
body of pancreasUBERON:000115074.56gold quality
mouth mucosaUBERON:000372973.33gold quality
gall bladderUBERON:000211070.48gold quality
ganglionic eminenceUBERON:000402367.92gold quality
right lungUBERON:000216767.67gold quality
fallopian tubeUBERON:000388966.80gold quality
spleenUBERON:000210665.68gold quality
heart left ventricleUBERON:000208465.51gold quality
right testisUBERON:000453465.47gold quality
cardiac ventricleUBERON:000208265.23gold quality
urethraUBERON:000005765.11silver quality
left ventricle myocardiumUBERON:000656665.06gold quality
testisUBERON:000047364.56gold quality

Single-cell (SCXA)

Detected in 7 experiment(s), a significant marker in 6.

ExperimentMarker?Max mean expression
E-ANND-2yes522.18
E-CURD-114yes54.16
E-GEOD-130148yes12.56
E-ANND-3yes11.58
E-MTAB-9388yes10.15
E-HCAD-1yes8.04
E-GEOD-124858no0.89

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

59 targeting CHST9, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-570-3P99.9672.414910
HSA-MIR-55999.9572.283609
HSA-MIR-548AB99.9571.313488
HSA-MIR-6772-5P99.9467.01577
HSA-MIR-548A-5P99.9471.273482
HSA-MIR-548AD-5P99.9471.233502
HSA-MIR-548AE-5P99.9471.233502
HSA-MIR-548AK99.9471.243488
HSA-MIR-548AM-5P99.9471.243488
HSA-MIR-548AP-5P99.9471.143489
HSA-MIR-548AQ-5P99.9471.343426
HSA-MIR-548AR-5P99.9471.283515
HSA-MIR-548AS-5P99.9471.223482
HSA-MIR-548AU-5P99.9471.243488
HSA-MIR-548AY-5P99.9471.233502
HSA-MIR-548B-5P99.9471.233502
HSA-MIR-548BB-5P99.9471.273509
HSA-MIR-548C-5P99.9471.243488
HSA-MIR-548D-5P99.9471.233502
HSA-MIR-548H-5P99.9471.243488
HSA-MIR-548I99.9471.253481
HSA-MIR-548J-5P99.9471.143489
HSA-MIR-548O-5P99.9471.243488
HSA-MIR-548W99.9471.243488
HSA-MIR-548Y99.9471.283514
HSA-MIR-4760-3P99.9370.502385
HSA-MIR-137-3P99.8774.742401
HSA-MIR-391999.8769.452489
HSA-MIR-469899.8471.414303

Literature-anchored findings (GeneRIF, showing 3)

  • the copy number variations of CHST9 have been shown to associate with hematologic malignancies (PMID:21156230)
  • CHST9 rs1436904 polymorphism significantly contributes to prognosis of early-stage TNBC, suggesting its clinical potential in the screening of high-risk TNBC patients for recurrence and the possibility of patient-tailored therapeutic decisions. (PMID:28924212)
  • A Frameshift Variant in the CHST9 Gene Identified by Family-Based Whole Genome Sequencing Is Associated with Schizophrenia in Chinese Population. (PMID:31481703)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusChst9ENSMUSG00000047161
rattus_norvegicusChst9ENSRNOG00000015867

Paralogs (6): CHST10 (ENSG00000115526), CHST8 (ENSG00000124302), CHST12 (ENSG00000136213), CHST14 (ENSG00000169105), CHST11 (ENSG00000171310), CHST13 (ENSG00000180767)

Protein

Protein identifiers

Carbohydrate sulfotransferase 9Q7L1S5 (reviewed: Q7L1S5)

Alternative names: GalNAc-4-O-sulfotransferase 2, N-acetylgalactosamine-4-O-sulfotransferase 2

All UniProt accessions (1): Q7L1S5

UniProt curated annotations — full annotation on UniProt →

Function. Catalyzes the transfer of sulfate from 3’-phosphoadenylyl sulfate (PAPS) to position 4 of non-reducing N-acetylgalactosamine (GalNAc) residues in both N-glycans and O-glycans. Transfers sulfate to the C-4 hydroxyl of terminal beta-1,4-linked GalNAc in the sequence GalNAc-beta-1,4GlcNAcbeta-R found on N-linked oligosaccharides and to the nonterminal beta-1,4-linked GalNAc in chondroitin and dermatan. Required for biosynthesis of glycoprotein hormones lutropin and thyrotropin, by mediating sulfation of their carbohydrate structures. Active against chondroitin but not against terminal beta-1,4-linked GalNAc.

Subcellular location. Golgi apparatus membrane Secreted.

Tissue specificity. Highly expressed in trachea. Also expressed in fetal lung, adult pancreas, testis and salivary gland. Expressed at low level in pituitary gland, apex of the heart, adult lung, prostate and mammary gland. Weakly or not expressed in heart, liver and spinal cord.

Pathway. Protein modification; carbohydrate sulfation.

Similarity. Belongs to the sulfotransferase 2 family.

Isoforms (2)

UniProt IDNamesCanonical?
Q7L1S5-11yes
Q7L1S5-22

RefSeq proteins (3): NP_001243245, NP_001385422, NP_113610* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR005331SulfotransferaseFamily
IPR018011Carb_sulfotrans_8-10Family

Pfam: PF03567

UniProt features (20 total): glycosylation site 4, sequence conflict 4, topological domain 2, splice variant 2, sequence variant 2, binding site 2, chain 1, transmembrane region 1, region of interest 1, compositionally biased region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q7L1S5-F178.390.61

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (2): 220–226; 280–288

Glycosylation sites (4): 324, 437, 159, 243

Function

Pathways and Gene Ontology

Reactome pathways

1 pathways

IDPathway
R-HSA-2022870CS-GAG biosynthesis

MSigDB gene sets: 153 (showing top): GSE45365_CD8A_DC_VS_CD11B_DC_IFNAR_KO_UP, GSE45365_NK_CELL_VS_CD8A_DC_MCMV_INFECTION_DN, AAGCAAT_MIR137, MODULE_255, MODULE_317, GOBP_REGULATION_OF_HORMONE_LEVELS, TAL1ALPHAE47_01, GOBP_CARBOHYDRATE_DERIVATIVE_METABOLIC_PROCESS, CEBPB_01, GOBP_CHONDROITIN_SULFATE_PROTEOGLYCAN_BIOSYNTHETIC_PROCESS, SENGUPTA_NASOPHARYNGEAL_CARCINOMA_DN, GOBP_CARBOHYDRATE_DERIVATIVE_BIOSYNTHETIC_PROCESS, GOBP_HORMONE_BIOSYNTHETIC_PROCESS, GOBP_AMINOGLYCAN_METABOLIC_PROCESS, GRE_C

GO Biological Process (6): sulfur compound metabolic process (GO:0006790), carbohydrate biosynthetic process (GO:0016051), proteoglycan biosynthetic process (GO:0030166), glycosaminoglycan metabolic process (GO:0030203), hormone biosynthetic process (GO:0042446), chondroitin sulfate proteoglycan biosynthetic process (GO:0050650)

GO Molecular Function (4): dermatan 4-sulfotransferase activity (GO:0001537), sulfotransferase activity (GO:0008146), chondroitin 4-sulfotransferase activity (GO:0047756), transferase activity (GO:0016740)

GO Cellular Component (4): Golgi membrane (GO:0000139), extracellular region (GO:0005576), membrane (GO:0016020), Golgi apparatus (GO:0005794)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
Chondroitin sulfate/dermatan sulfate metabolism1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
biosynthetic process2
cellular anatomical structure2
metabolic process1
carbohydrate metabolic process1
proteoglycan metabolic process1
glycoprotein biosynthetic process1
aminoglycan metabolic process1
hormone metabolic process1
proteoglycan biosynthetic process1
chondroitin sulfate proteoglycan metabolic process1
protein O-linked glycosylation via xylose1
dermatan sulfotransferase activity1
transferase activity, transferring sulphur-containing groups1
chondroitin sulfotransferase activity1
catalytic activity1
Golgi apparatus1
bounding membrane of organelle1
cytoplasm1
endomembrane system1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

462 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CHST9CA6P23280858
CHST9CHST6Q9GZX3603
CHST9CHST1O43916539
CHST9TTC32Q5I0X7527
CHST9RGS13O14921487
CHST9RGS18Q9NS28485
CHST9CHST3Q7LGC8455
CHST9NOGQ13253429
CHST9Q3MI93Q3MI93397
CHST9CHST2Q9Y4C5390
CHST9PROSER2Q86WR7389
CHST9B4GALNT3Q6L9W6378
CHST9TPST2O60704377
CHST9TPBGLP0DKB5373
CHST9ANTXRLA6NF34370

IntAct

7 interactions, top by confidence:

ABTypeScore
CHST8CANXpsi-mi:“MI:0914”(association)0.640
CHST9LRP5psi-mi:“MI:0914”(association)0.530
CHST9STX6psi-mi:“MI:0914”(association)0.350
CHST9ISLRpsi-mi:“MI:0914”(association)0.350
S1PR1ISLRpsi-mi:“MI:0914”(association)0.350

BioGRID (16): CANX (Affinity Capture-MS), LRP5 (Affinity Capture-MS), LRP6 (Affinity Capture-MS), STX5 (Affinity Capture-MS), LRP6 (Affinity Capture-MS), LRP5 (Affinity Capture-MS), CANX (Affinity Capture-MS), CHST9 (Affinity Capture-MS), CHST9 (Affinity Capture-MS), LRP6 (Affinity Capture-MS), LRP5 (Affinity Capture-MS), CANX (Affinity Capture-MS), EDEM2 (Affinity Capture-MS), HLA-DRA (Affinity Capture-MS), LRRC15 (Affinity Capture-MS)

ESM2 similar proteins: A0A8C2LVE3, A0MGZ5, A0MGZ7, A1A4K5, A2BGL3, D4PHA7, O08889, O43916, O88199, O93336, O95461, P25722, P69478, P79948, Q24342, Q2TBF2, Q5R621, Q5RJQ0, Q5XHM7, Q64610, Q66PG1, Q66PG2, Q66PG3, Q6DBY9, Q6KFX9, Q6XQH0, Q76EC5, Q76KB1, Q7L1S5, Q7LFX5, Q7LGA3, Q7LGC8, Q7T3S3, Q800H9, Q8BQ86, Q8CHI9, Q8JHF2, Q8R3H7, Q91XQ5, Q92179

Diamond homologs: O43529, O54702, P69478, Q5RBZ6, Q5XHM7, Q5ZIE4, Q6AXM1, Q6GNS1, Q6PGK7, Q76EC5, Q7L1S5, Q7T3S3, Q805E5, Q8BQ86, Q8NET6, Q99LL3, Q9H2A9, Q9JME2, Q9NPF2, Q9NRB3, Q80V53, Q8NCH0

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

22 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance15
Likely benign1
Benign1

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
1340066GRCh37/hg19 18p11.32-q12.1(chr18:136226-25252276)x3Pathogenic

SpliceAI

2279 predictions. Top by Δscore:

VariantEffectΔscore
18:27024108:TTAC:Tdonor_loss1.0000
18:27024109:TACT:Tdonor_loss1.0000
18:27024110:ACT:Adonor_loss1.0000
18:27024111:CTCA:Cdonor_gain1.0000
18:27024112:TCAC:Tdonor_loss1.0000
18:27024113:CACTG:Cdonor_loss1.0000
18:27024114:A:ACdonor_gain1.0000
18:27024114:A:Tdonor_loss1.0000
18:27024115:C:CAdonor_gain1.0000
18:27024115:C:Tdonor_loss1.0000
18:27024115:CTG:Cdonor_gain1.0000
18:27024115:CTGA:Cdonor_gain1.0000
18:27024115:CTGAT:Cdonor_gain1.0000
18:27024153:CCATC:Cacceptor_gain1.0000
18:27024154:CATCC:Cacceptor_gain1.0000
18:27024157:CCTGA:Cacceptor_loss1.0000
18:27024158:C:CAacceptor_loss1.0000
18:27048462:TAC:Tdonor_loss1.0000
18:27048463:A:AGdonor_loss1.0000
18:27048464:C:CGdonor_loss1.0000
18:27142684:GTTAC:Gdonor_loss1.0000
18:27142685:TTA:Tdonor_loss1.0000
18:27142686:TAC:Tdonor_loss1.0000
18:27142906:CTAG:Cacceptor_loss1.0000
18:27185131:GTTAC:Gdonor_loss1.0000
18:27185132:TTACC:Tdonor_loss1.0000
18:27185133:TACC:Tdonor_loss1.0000
18:27185134:A:ATdonor_loss1.0000
18:27185135:C:CAdonor_loss1.0000
18:26917349:TT:Tacceptor_gain0.9900

AlphaMissense

2950 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
18:26916541:C:AW350C0.999
18:26916541:C:GW350C0.999
18:26916543:A:GW350R0.999
18:26916543:A:TW350R0.999
18:26916520:G:CC357W0.998
18:26916521:C:TC357Y0.998
18:26916546:G:CH349D0.998
18:26916712:T:AK293N0.998
18:26916712:T:GK293N0.998
18:26916713:T:AK293I0.998
18:26916752:C:GR280P0.998
18:26916912:A:GW227R0.998
18:26916912:A:TW227R0.998
18:26916940:A:CC217W0.998
18:26916941:C:TC217Y0.998
18:26916714:T:CK293E0.997
18:26916714:T:GK293Q0.997
18:26916721:A:CF290L0.997
18:26916721:A:TF290L0.997
18:26916723:A:GF290L0.997
18:26916928:C:AK221N0.997
18:26916928:C:GK221N0.997
18:26916522:A:GC357R0.996
18:26916544:G:CH349Q0.996
18:26916544:G:TH349Q0.996
18:26916552:C:GD347H0.996
18:26916592:A:CF333L0.996
18:26916592:A:TF333L0.996
18:26916593:A:GF333S0.996
18:26916594:A:GF333L0.996

dbSNP variants (sampled 300 via entrez): RS1000000772 (18:26935339 G>C), RS1000003074 (18:26956962 A>G), RS1000016813 (18:26940531 G>T), RS1000020581 (18:27185625 C>G,T), RS1000023176 (18:27052733 T>C), RS1000036271 (18:27185029 C>T), RS1000038659 (18:27003950 A>G), RS1000041010 (18:27047834 A>G,T), RS1000051666 (18:27186091 C>G), RS1000075101 (18:27065024 G>A), RS1000075412 (18:26984476 A>G,T), RS1000076180 (18:27054773 C>T), RS1000081618 (18:26906646 A>C), RS1000095925 (18:27011381 G>A), RS1000099829 (18:27046232 A>G)

Disease associations

OMIM: gene MIM:610191 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

9 associations (top):

StudyTraitp-value
GCST001786_24Dental caries5.000000e-06
GCST001937_14Breast cancer3.000000e-08
GCST002104_2Bronchopulmonary dysplasia3.000000e-06
GCST002363_11Response to anti-retroviral therapy (ddI/d4T) in HIV-1 infection (Grade 3 peripheral neuropathy)2.000000e-07
GCST004988_194Breast cancer1.000000e-14
GCST006585_959Blood protein levels2.000000e-27
GCST009391_1586Metabolite levels9.000000e-06
GCST009391_1725Metabolite levels3.000000e-06
GCST009391_47Metabolite levels8.000000e-06

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0000180HIV-1 infection
EFO:0010349cholesteryl ester 20:5 measurement
EFO:0010342cholesteryl ester 16:1 measurement
EFO:0010114citrate measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

36 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, increases expression, decreases methylation7
Benzo(a)pyreneaffects methylation, decreases expression, decreases methylation5
Cyclosporinedecreases expression, decreases methylation4
Aflatoxin B1affects expression, decreases expression, decreases methylation4
Tobacco Smoke Pollutionaffects expression, decreases expression3
methylmercuric chloridedecreases expression2
entinostatincreases expression, affects cotreatment2
Vorinostatincreases expression, affects cotreatment2
Phenylmercuric Acetateaffects cotreatment, increases expression2
Tetrachlorodibenzodioxinincreases expression2
aristolochic acid Idecreases expression1
perfluorotetradecanoic aciddecreases expression1
dicrotophosdecreases expression1
decabromobiphenyl etheraffects expression1
trichostatin Aincreases expression1
mono-(2-ethylhexyl)phthalatedecreases expression1
perfluorooctanoic aciddecreases expression1
perfluorobutyric aciddecreases expression1
dinophysistoxin 1decreases expression1
perfluorooctane sulfonic aciddecreases expression1
perfluoro-n-nonanoic aciddecreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, increases expression1
dorsomorphinincreases expression, affects cotreatment1
Azathioprinedecreases expression1
Biological Factorsdecreases expression1
Diethylhexyl Phthalatedecreases expression1
Formaldehydeincreases expression1
Hydrogen Peroxideincreases expression1
Methyl Methanesulfonateincreases expression1
Phenobarbitalaffects expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.