NDST4

gene
On this page

Summary

NDST4 (N-deacetylase and N-sulfotransferase 4, HGNC:20779) is a protein-coding gene on chromosome 4q26, encoding Bifunctional heparan sulfate N-deacetylase/N-sulfotransferase 4 (Q9H3R1). Essential bifunctional enzyme that catalyzes both the N-deacetylation and the N-sulfation of glucosamine (GlcNAc) of the glycosaminoglycan in heparan sulfate.

Predicted to enable deacetylase activity and heparan sulfate N-sulfotransferase activity. Predicted to be involved in heparan sulfate proteoglycan biosynthetic process and heparin proteoglycan biosynthetic process. Predicted to be located in Golgi membrane. Predicted to be active in Golgi apparatus.

Source: NCBI Gene 64579 — RefSeq curated summary.

At a glance

  • GWAS associations: 7
  • Clinical variants (ClinVar): 107 total
  • MANE Select transcript: NM_022569

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:20779
Approved symbolNDST4
NameN-deacetylase and N-sulfotransferase 4
Location4q26
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000138653
Ensembl biotypeprotein_coding
OMIM615039
Entrez64579

Gene structure

Transcript identifiers

Ensembl transcripts: 4 — 2 protein_coding, 1 nonsense_mediated_decay, 1 retained_intron

ENST00000264363, ENST00000504854, ENST00000514570, ENST00000613194

RefSeq mRNA: 1 — MANE Select: NM_022569 NM_022569

CCDS: CCDS3706

Canonical transcript exons

ENST00000264363 — 14 exons

ExonStartEnd
ENSE00000841882114935206114935334
ENSE00001121074114845823114845997
ENSE00001766555114827763114827935
ENSE00002292490115076059115077282
ENSE00002472984114870768114870950
ENSE00002511295114848215114848338
ENSE00002516593114852725114852821
ENSE00003529843114833606114833715
ENSE00003541002114839378114839548
ENSE00003543817114937318114937503
ENSE00003591952114977187114977274
ENSE00003606736114829790114829892
ENSE00003667195114970430114970584
ENSE00003892435115113444115113620

Expression profiles

Bgee: expression breadth broad, 65 present calls, max score 87.87.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 1.1457 / max 148.8944, expressed in 176 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
537100.633299
537110.5125144

Top tissues by expression

223 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
palpebral conjunctivaUBERON:000181287.87gold quality
ponsUBERON:000098882.10gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047381.38gold quality
dorsal root ganglionUBERON:000004463.15gold quality
ganglionic eminenceUBERON:000402362.52gold quality
hair follicleUBERON:000207362.44gold quality
buccal mucosa cellCL:000233661.24gold quality
ventricular zoneUBERON:000305359.61gold quality
adrenal tissueUBERON:001830358.69gold quality
trigeminal ganglionUBERON:000167556.94silver quality
deciduaUBERON:000245056.55gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099156.06gold quality
vena cavaUBERON:000408755.53gold quality
upper leg skinUBERON:000426255.41gold quality
smooth muscle tissueUBERON:000113553.72gold quality
nasal cavity epitheliumUBERON:000538451.54gold quality
spermCL:000001949.85gold quality
prefrontal cortexUBERON:000045148.94gold quality
superior vestibular nucleusUBERON:000722747.88silver quality
lateral nuclear group of thalamusUBERON:000273647.41gold quality
substantia nigra pars compactaUBERON:000196547.07silver quality
corpus callosumUBERON:000233646.71silver quality
pigmented layer of retinaUBERON:000178246.52silver quality
substantia nigra pars reticulataUBERON:000196645.95gold quality
gall bladderUBERON:000211045.83silver quality
deltoidUBERON:000147645.62gold quality
medulla oblongataUBERON:000189645.53silver quality
lateral globus pallidusUBERON:000247645.37gold quality
ventral tegmental areaUBERON:000269145.03gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451143.37gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes3.74

Regulation

Is transcription factor: no

Literature-anchored findings (GeneRIF, showing 4)

  • Our results suggest that genetic variants in TYW3/CRYZ and NDST4 loci may be involved in the regulation of circulating resistin levels (PMID:22843503)
  • NDST4 gene is a novel candidate tumor suppressor gene in human cancer. (PMID:23825612)
  • The rs12108602 near NDST4 showed significant associations with MaxDrinks. (PMID:23953852)
  • Highly-purified recombinant NDST-4 and a selective library of structurally-defined oligosaccharides were employed to determine the substrate specificity of rNDST-4. Full-length rNDST-4 lacks obvious N-deacetylase activity, and displays only N-sulfotransferase activity. NDST-4 did not show directional N-sulfotransferase activity while the N-deacetylase domain was inactive. (PMID:29158133)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
danio_reriondst3ENSDARG00000041776
mus_musculusNdst4ENSMUSG00000027971
rattus_norvegicusNdst4ENSRNOG00000009577

Paralogs (10): HS3ST1 (ENSG00000002587), NDST1 (ENSG00000070614), HS3ST2 (ENSG00000122254), HS3ST3B1 (ENSG00000125430), HS3ST3A1 (ENSG00000153976), HS3ST6 (ENSG00000162040), NDST3 (ENSG00000164100), NDST2 (ENSG00000166507), HS3ST4 (ENSG00000182601), HS3ST5 (ENSG00000249853)

Protein

Protein identifiers

Bifunctional heparan sulfate N-deacetylase/N-sulfotransferase 4Q9H3R1 (reviewed: Q9H3R1)

Alternative names: Glucosaminyl N-deacetylase/N-sulfotransferase 4, N-heparan sulfate sulfotransferase 4

All UniProt accessions (1): Q9H3R1

UniProt curated annotations — full annotation on UniProt →

Function. Essential bifunctional enzyme that catalyzes both the N-deacetylation and the N-sulfation of glucosamine (GlcNAc) of the glycosaminoglycan in heparan sulfate. Modifies the GlcNAc-GlcA disaccharide repeating sugar backbone to make N-sulfated heparosan, a prerequisite substrate for later modifications in heparin biosynthesis. Has low deacetylase activity but high sulfotransferase activity.

Subunit / interactions. Monomer.

Subcellular location. Golgi apparatus membrane.

Pathway. Glycan metabolism; heparan sulfate biosynthesis. Glycan metabolism; heparin biosynthesis.

Miscellaneous. The presence of 4 different heparan sulfate N-deacetylase/N-sulfotransferase enzymes in mammals, as well as differences in their enzyme activity suggest that some initiate heparan sulfate modification/sulfation reactions, whereas other later on fill in or extend already modified heparan sulfate sequences.

Similarity. Belongs to the sulfotransferase 1 family. NDST subfamily.

Isoforms (2)

UniProt IDNamesCanonical?
Q9H3R1-11yes
Q9H3R1-22

RefSeq proteins (1): NP_072091* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000863Sulfotransferase_domDomain
IPR021930Heparan_SO4_deacetylase_domDomain
IPR027417P-loop_NTPaseHomologous_superfamily
IPR037359NST/OSTFamily
IPR056793HSNSD_NDomain

Pfam: PF00685, PF12062, PF25119

Enzyme classification (BRENDA):

  • EC 2.8.2.8 — [heparan sulfate]-glucosamine N-sulfotransferase (BRENDA: 11 organisms, 86 substrates, 14 inhibitors, 13 Km, 0 kcat entries)

Substrate kinetics (BRENDA)

5 substrates with measured Km, best-characterized 5. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
3’-PHOSPHOADENYLYLSULFATE0.005–0.1086
N-DEACETYLASE K5-POLYSACCHARIDE0.0009–0.02243
N,O-DESULFOHEPARAN SULFATE TETRASACCHARIDE2.51
N-DESULFOHEPARAN SULFATE1.891
N-DESULFOHEPARIN0.0131

Catalyzed reactions (Rhea), 1 shown:

  • alpha-D-glucosaminyl-heparan sulfate + 3’-phosphoadenylyl sulfate = N-sulfo-alpha-D-glucosaminyl-heparan sulfate + adenosine 3’,5’-bisphosphate + 2 H(+) (RHEA:21980)

UniProt features (20 total): glycosylation site 5, splice variant 3, binding site 3, topological domain 2, region of interest 2, chain 1, disulfide bond 1, sequence variant 1, transmembrane region 1, active site 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9H3R1-F191.220.82

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 604 (for sulfotransferase activity)

Ligand- & substrate-binding residues (3): 604–608; 702; 823–827

Disulfide bonds (1): 808–818

Glycosylation sites (5): 226, 341, 391, 657, 793

Function

Pathways and Gene Ontology

Reactome pathways

5 pathways

IDPathway
R-HSA-2022928HS-GAG biosynthesis
R-HSA-1430728Metabolism
R-HSA-1630316Glycosaminoglycan metabolism
R-HSA-1638091Heparan sulfate/heparin (HS-GAG) metabolism
R-HSA-71387Metabolism of carbohydrates and carbohydrate derivatives

MSigDB gene sets: 100 (showing top): GCANCTGNY_MYOD_Q6, FOXO4_01, CHX10_01, AAAYRNCTG_UNKNOWN, CAGCTG_AP4_Q5, GOBP_CARBOHYDRATE_DERIVATIVE_METABOLIC_PROCESS, YGACNNYACAR_UNKNOWN, GOBP_HEPARAN_SULFATE_PROTEOGLYCAN_METABOLIC_PROCESS, GOBP_CARBOHYDRATE_DERIVATIVE_BIOSYNTHETIC_PROCESS, HEN1_01, WTGAAAT_UNKNOWN, TCF11_01, OCT1_06, TGANTCA_AP1_C, TGACATY_UNKNOWN

GO Biological Process (2): heparan sulfate proteoglycan biosynthetic process (GO:0015012), heparin proteoglycan biosynthetic process (GO:0030210)

GO Molecular Function (7): heparan sulfate N-sulfotransferase activity (GO:0015016), deacetylase activity (GO:0019213), N-acetylglucosamine deacetylase activity (GO:0050119), catalytic activity (GO:0003824), sulfotransferase activity (GO:0008146), transferase activity (GO:0016740), hydrolase activity (GO:0016787)

GO Cellular Component (3): Golgi membrane (GO:0000139), Golgi apparatus (GO:0005794), membrane (GO:0016020)

Reactome top-level categories

Rollup of top-4 pathways:

CategoryPathways
Heparan sulfate/heparin (HS-GAG) metabolism1
Metabolism of carbohydrates and carbohydrate derivatives1
Glycosaminoglycan metabolism1
Metabolism1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
protein O-linked glycosylation via xylose2
catalytic activity2
proteoglycan biosynthetic process1
heparan sulfate proteoglycan metabolic process1
heparin proteoglycan metabolic process1
heparan sulfate sulfotransferase activity1
deacylase activity1
hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amides1
deacetylase activity1
molecular_function1
transferase activity, transferring sulphur-containing groups1
Golgi apparatus1
bounding membrane of organelle1
cytoplasm1
endomembrane system1
intracellular membrane-bounded organelle1
cellular anatomical structure1

Protein interactions and networks

STRING

574 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
NDST4HS2ST1Q7LGA3661
NDST4GLCEO94923649
NDST4HS6ST3Q8IZP7628
NDST4HS6ST1O60243575
NDST4EXTL3O43909565
NDST4EXT1Q16394558
NDST4EXT2Q93063557
NDST4EXTL2Q9UBQ6542
NDST4EXTL1Q92935540
NDST4CNTNAP2Q9UHC6537
NDST4ZNF385DQ9H6B1523
NDST4HS6ST2Q96MM7518
NDST4ATP2C2O75185509
NDST4SSTP01166508
NDST4CHST12Q9NRB3497

IntAct

4 interactions, top by confidence:

ABTypeScore
NDST4NPM1psi-mi:“MI:0915”(physical association)0.400
NDST3PYGBpsi-mi:“MI:0914”(association)0.350
NDST4IGF2Rpsi-mi:“MI:0914”(association)0.350

BioGRID (6): NDST4 (Proximity Label-MS), NDST4 (Affinity Capture-MS), FAM3C (Affinity Capture-MS), IGF2R (Affinity Capture-MS), NDST1 (Affinity Capture-MS), GNPTAB (Affinity Capture-MS)

ESM2 similar proteins: A0A1D5PUP4, A2AFS3, M0R7X9, O70472, O75882, O95803, P01134, P26012, P52799, P52848, P69849, Q02353, Q05204, Q0VCJ8, Q12841, Q13635, Q15155, Q3UHN9, Q3ZBS2, Q58D84, Q5EA46, Q5JPE7, Q5R9Y1, Q5U4X8, Q5VV63, Q5ZJB7, Q5ZMH6, Q61115, Q62356, Q62632, Q6A051, Q6GQK9, Q6GQT9, Q6P988, Q6UXG2, Q7Z5A7, Q86TD4, Q90693, Q91WE9, Q96CW9

Diamond homologs: O14792, O35310, O95803, O97583, Q5GFD5, Q673U1, Q80W66, Q8BKN6, Q8BSL4, Q8IZT8, Q96QI5, Q9EQW8, Q9ESG5, Q9H3R1, Q9QZS6, Q9Y278, Q9Y661, Q9Y662, Q9Y663, P52849, Q60V90, Q966W3, Q9EQH7, Q9V3L1, P52848, P52850, Q02353, Q3UHN9, Q5U4X8, Q6GQK9, Q55GK8

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

107 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance98
Likely benign7
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

3355 predictions. Top by Δscore:

VariantEffectΔscore
4:114827931:CTAGA:Cacceptor_gain1.0000
4:114827936:C:CCacceptor_gain1.0000
4:114829784:TATTA:Tdonor_loss1.0000
4:114829786:TTA:Tdonor_loss1.0000
4:114829787:TACCT:Tdonor_loss1.0000
4:114829788:A:ACdonor_gain1.0000
4:114829789:C:CCdonor_gain1.0000
4:114829789:CCT:Cdonor_gain1.0000
4:114829888:CAAAC:Cacceptor_gain1.0000
4:114829890:AAC:Aacceptor_gain1.0000
4:114829893:C:CAacceptor_loss1.0000
4:114829893:C:CCacceptor_gain1.0000
4:114833604:A:ACdonor_gain1.0000
4:114833605:C:CCdonor_gain1.0000
4:114845993:TATAC:Tacceptor_gain1.0000
4:114845995:TAC:Tacceptor_gain1.0000
4:114845995:TACC:Tacceptor_loss1.0000
4:114845996:ACCT:Aacceptor_loss1.0000
4:114845997:CCTGA:Cacceptor_loss1.0000
4:114845998:C:CAacceptor_loss1.0000
4:114848209:TCTTA:Tdonor_loss1.0000
4:114848210:CTTA:Cdonor_loss1.0000
4:114848211:TTACC:Tdonor_loss1.0000
4:114848212:TACC:Tdonor_loss1.0000
4:114848213:A:ACdonor_gain1.0000
4:114848213:A:Cdonor_loss1.0000
4:114848214:C:CAdonor_loss1.0000
4:114848214:C:CCdonor_gain1.0000
4:114848214:CCAGT:Cdonor_gain1.0000
4:114848260:T:TAdonor_gain1.0000

AlphaMissense

5737 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
4:114829820:T:AK823N1.000
4:114829820:T:GK823N1.000
4:114870921:G:CN522K1.000
4:114870921:G:TN522K1.000
4:114935305:G:CF479L1.000
4:114935305:G:TF479L1.000
4:114935307:A:GF479L1.000
4:114937384:G:CS447R1.000
4:114937384:G:TS447R1.000
4:114937386:T:GS447R1.000
4:114970502:G:CH383Q1.000
4:114970502:G:TH383Q1.000
4:114827848:A:GW863R0.999
4:114827848:A:TW863R0.999
4:114829821:T:AK823I0.999
4:114829836:C:GC818S0.999
4:114829837:A:TC818S0.999
4:114829871:A:CF806L0.999
4:114829871:A:TF806L0.999
4:114829873:A:GF806L0.999
4:114839404:A:GW754R0.999
4:114839404:A:TW754R0.999
4:114839443:A:GC741R0.999
4:114845828:A:CY704D0.999
4:114845831:A:GW703R0.999
4:114845831:A:TW703R0.999
4:114845841:C:AR699S0.999
4:114845841:C:GR699S0.999
4:114848338:C:TG606E0.999
4:114852729:T:AK604N0.999

dbSNP variants (sampled 300 via entrez): RS1000008111 (4:114949081 A>C,G), RS1000008948 (4:114895752 C>T), RS1000026403 (4:114943090 A>G), RS1000042666 (4:115022989 G>A,C), RS1000047967 (4:115018350 T>C), RS1000051722 (4:115068688 G>C), RS1000052190 (4:114865758 C>A,T), RS1000060137 (4:114858617 G>A), RS1000063694 (4:114961306 G>GT), RS1000068863 (4:115105992 A>G), RS1000069203 (4:114943462 A>C,G), RS1000081599 (4:115095866 C>A,T), RS1000081984 (4:115099797 C>A), RS1000085455 (4:114937273 T>C), RS10000924 (4:114848341 G>A,T)

Disease associations

OMIM: gene MIM:615039 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

7 associations (top):

StudyTraitp-value
GCST001620_3Resistin levels6.000000e-18
GCST002104_19Bronchopulmonary dysplasia1.000000e-06
GCST009215_2Postcentral gyrus volume3.000000e-06
GCST010796_3506Electrocardiogram morphology (amplitude at temporal datapoints)2.000000e-08
GCST010796_3507Electrocardiogram morphology (amplitude at temporal datapoints)1.000000e-08
GCST011741_30LDL cholesterol levels in HIV infection6.000000e-07
GCST90000015_14Parkinson’s disease motor subtype (tremor to postural instability/gait difficulty score ratio)6.000000e-06

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0004819resistin measurement
EFO:0004327electrocardiography
EFO:0004611low density lipoprotein cholesterol measurement
EFO:0600011Parkinson’s disease symptom measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

21 total (human), top 21 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects expression, increases expression5
Phenylmercuric Acetateaffects cotreatment, increases expression2
bisphenol Aaffects cotreatment, decreases methylation1
trichostatin Aincreases expression1
sodium arseniteincreases expression1
mercuric bromideaffects cotreatment, increases expression1
CGP 52608affects binding, increases reaction1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, increases expression1
2,2’,4,4’,5-brominated diphenyl etherdecreases expression1
dorsomorphinaffects cotreatment, increases expression1
Fulvestrantaffects cotreatment, decreases methylation1
Vorinostatincreases expression1
Benzo(a)pyreneaffects methylation1
Carbamazepineaffects expression1
Diethylhexyl Phthalatedecreases expression1
Silicon Dioxidedecreases expression1
Cyclosporineincreases expression1
Aflatoxin B1increases methylation1
Paclitaxelincreases expression1
Antirheumatic Agentsdecreases expression1
Okadaic Aciddecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): bronchopulmonary dysplasia