SUMF1

gene
On this page

Also known as FGEUNQ3037

Summary

SUMF1 (sulfatase modifying factor 1, HGNC:20376) is a protein-coding gene on chromosome 3p26.1, encoding Formylglycine-generating enzyme (Q8NBK3). Oxidase that catalyzes the conversion of cysteine to 3-oxoalanine on target proteins, using molecular oxygen and an unidentified reducing agent. 3-oxoalanine modification, which is also named formylglycine (fGly), occurs in the maturation of arylsulfatases and some alkaline phos….

This gene encodes an enzyme that catalyzes the hydrolysis of sulfate esters by oxidizing a cysteine residue in the substrate sulfatase to an active site 3-oxoalanine residue, which is also known as C-alpha-formylglycine. Mutations in this gene cause multiple sulfatase deficiency, a lysosomal storage disorder. Alternative splicing results in multiple transcript variants.

Source: NCBI Gene 285362 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): mucosulfatidosis (Definitive, ClinGen)
  • GWAS associations: 15
  • Clinical variants (ClinVar): 734 total — 58 pathogenic, 51 likely-pathogenic
  • Phenotypes (HPO): 50
  • Dosage sensitivity (ClinGen): haploinsufficiency autosomal recessive, triplosensitivity no evidence
  • MANE Select transcript: NM_182760

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:20376
Approved symbolSUMF1
Namesulfatase modifying factor 1
Location3p26.1
Locus typegene with protein product
StatusApproved
AliasesFGE, UNQ3037
Ensembl geneENSG00000144455
Ensembl biotypeprotein_coding
OMIM607939
Entrez285362

Gene structure

Transcript identifiers

Ensembl transcripts: 16 — 12 protein_coding, 3 protein_coding_CDS_not_defined, 1 nonsense_mediated_decay

ENST00000272902, ENST00000383843, ENST00000405420, ENST00000448413, ENST00000458465, ENST00000469888, ENST00000470751, ENST00000484993, ENST00000881800, ENST00000881801, ENST00000881802, ENST00000881803, ENST00000933293, ENST00000948920, ENST00000948921, ENST00000948922

RefSeq mRNA: 3 — MANE Select: NM_182760 NM_001164674, NM_001164675, NM_182760

CCDS: CCDS2564, CCDS54548, CCDS54549

Canonical transcript exons

ENST00000272902 — 9 exons

ExonStartEnd
ENSE0000096522344528764453049
ENSE0000107260843611464362254
ENSE0000117375543763304376389
ENSE0000117375844108654410978
ENSE0000117376644171284417242
ENSE0000117377044180104418132
ENSE0000117377544200644420146
ENSE0000117378144492664449340
ENSE0000193982944669764467269

Expression profiles

Bgee: expression breadth ubiquitous, 258 present calls, max score 97.60.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 23.9706 / max 227.5841, expressed in 1818 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
4083023.97061818

Top tissues by expression

258 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
kidney epitheliumUBERON:000481997.60gold quality
renal medullaUBERON:000036297.52gold quality
upper arm skinUBERON:000426397.04gold quality
layer of synovial tissueUBERON:000761696.63gold quality
oviduct epitheliumUBERON:000480496.06gold quality
synovial jointUBERON:000221795.62gold quality
tibiaUBERON:000097994.62gold quality
penisUBERON:000098994.59gold quality
esophagus squamous epitheliumUBERON:000692094.54gold quality
upper leg skinUBERON:000426294.48gold quality
nippleUBERON:000203094.23gold quality
stromal cell of endometriumCL:000225593.90gold quality
pylorusUBERON:000116693.66gold quality
bronchial epithelial cellCL:000232893.61gold quality
urethraUBERON:000005793.60gold quality
epithelial cell of pancreasCL:000008393.57gold quality
bronchusUBERON:000218593.53gold quality
mammalian vulvaUBERON:000099793.35gold quality
ileal mucosaUBERON:000033193.19gold quality
tracheaUBERON:000312693.12gold quality
deciduaUBERON:000245093.03gold quality
skin of hipUBERON:000155492.94gold quality
visceral pleuraUBERON:000240192.94gold quality
parietal pleuraUBERON:000240092.66gold quality
nasal cavity epitheliumUBERON:000538492.53gold quality
buccal mucosa cellCL:000233692.36gold quality
palpebral conjunctivaUBERON:000181292.36gold quality
pharyngeal mucosaUBERON:000035592.34gold quality
saphenous veinUBERON:000731892.31gold quality
pancreatic ductal cellCL:000207992.07gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-GEOD-111727yes453.97
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

85 targeting SUMF1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-5692A100.0074.406850
HSA-MIR-371B-5P99.9975.344759
HSA-MIR-373-5P99.9875.364753
HSA-MIR-616-5P99.9875.584775
HSA-MIR-314899.9775.066478
HSA-MIR-4725-3P99.9669.532520
HSA-MIR-6780B-5P99.9669.602562
HSA-MIR-548AB99.9571.313488
HSA-MIR-55999.9572.283609
HSA-MIR-548BB-5P99.9471.273509
HSA-MIR-548A-5P99.9471.273482
HSA-MIR-548AD-5P99.9471.233502
HSA-MIR-548AE-5P99.9471.233502
HSA-MIR-548AK99.9471.243488
HSA-MIR-548AM-5P99.9471.243488
HSA-MIR-548AP-5P99.9471.143489
HSA-MIR-548AQ-5P99.9471.343426
HSA-MIR-548AR-5P99.9471.283515
HSA-MIR-548AS-5P99.9471.223482
HSA-MIR-548AU-5P99.9471.243488
HSA-MIR-548AY-5P99.9471.233502
HSA-MIR-548B-5P99.9471.233502
HSA-MIR-548C-5P99.9471.243488
HSA-MIR-548D-5P99.9471.233502
HSA-MIR-548H-5P99.9471.243488
HSA-MIR-548I99.9471.253481
HSA-MIR-548J-5P99.9471.143489
HSA-MIR-548O-5P99.9471.243488
HSA-MIR-548W99.9471.243488
HSA-MIR-548Y99.9471.283514

Functional genomics

ClinGen dosage: haploinsufficiency 30 (autosomal recessive), triplosensitivity 0 (no evidence). ClinGen Gene Dosage Map

Literature-anchored findings (GeneRIF, showing 32)

  • mutational analysis of SUMF1 in 20 Multiple sulfatase deficiency patients of different ethnic origin (PMID:15146462)
  • characterization of the human Calpha-formylglycine-generating enzyme (PMID:15657036)
  • FGE is a single-domain monomer with a surprising paucity of secondary structure and adopts a unique fold. The effect of all 18 missense mutations found in MSD patients is explained by the FGE structure, providing a molecular basis of MSD. (PMID:15907468)
  • Sulphatase-modifying factor 1 interacts with SUMF2 to regulate sulphatase activities (PMID:15962010)
  • co-delivery of SUMF1 may enhance the efficacy of gene therapy in several sulfatase deficiencies (PMID:17206939)
  • study demonstrates that upon secretion, SUMF1 can be taken up from the medium by several cell lines; following its uptake, SUMF1 shuttles from the plasma membrane to the endoplasmic reticulum (PMID:17446859)
  • Complete loss of SUMF1 function is likely to be lethal in humans. (PMID:17657823)
  • Our data provide evidence that haploinsufficiency of ITPR1 alone, but not SUMF1, causes SCA16 and SCA15. (PMID:17932120)
  • Molecular analysis of sulfatase modifying factor 1 mutations. (PMID:18157819)
  • ERp44-mediated retention of FGE, indicating that noncovalent interactions between ERp44 and FGE are sufficient to mediate ER retention. (PMID:18178549)
  • the non-catalytic N-terminal extension of formylglycine-generating enzyme is required for its biological activity and retention in the endoplasmic reticulum (PMID:18305113)
  • Study shows that SUMF1 interacts with protein disulfide isomerase (PDI) and ERp44, two thioredoxin family members residing in the early secretory pathway, and with ERGIC-53, a lectin that shuttles between the ER and the Golgi. (PMID:18508857)
  • This study present clinical findings of two consanguineous patients with multiple sulfatase deficiency. They were found to be homozygous for a novel missense mutation c.739G > C causing a p.G247R amino acid substitution in the SUMF1 protein. (PMID:18509892)
  • We have identified a 414-kb deletion including the entire ITPR1 and exon 1 of SUMF1 in patients in a Japanese family with Spinocerebellar ataxia type 15. (PMID:18579805)
  • This study identified genetic variation of SUMF1 in genes associated with in vivo glutamate measured using 1H magnetic resonance spectroscopic imaging in the grey matter of patients with multiple sclerosis. (PMID:20802204)
  • Phenotypic outcome in Multiple Sulfatase Deficiency depends on both residual FGE activity as well as protein stability. (PMID:21224894)
  • furin-mediated processing of FGE during secretion is a physiological means of higher eukaryotic cells to regulate FGE activity upon exit from the endoplasmic reticulum (PMID:23288839)
  • A novel missense mutation & an insertional truncating mutation in SUMF1 gene causing nultiple sulphatase deficiency. (PMID:25222778)
  • MSD presenting in the newborn period with hypotonia, apnoea, cyanosis and rolling eyes, hepato-splenomegaly and deafness. This patient was compound heterozygous for two so far undescribed SUMF1 mutations (c.191C > A; p.S64X and c.818A > G; p.D273G (PMID:25516103)
  • This detailed clinical description and follow-up of a cohort of patients, together with the molecular characterisation of their underlying defects, contribute to improved knowledge of multiple sulfatase deficiency. (PMID:25885655)
  • The complete kinetic parameters for both forms of FGE are described, along with a proposed mechanism for FGE catalysis that accounts for the copper-dependent activity. (PMID:25931126)
  • SUMF1 catalyses a monooxygenase type of reaction. (PMID:26077311)
  • We show that SUMF1 expression is affected in COPD patients compared to controls, and that SNPs in SUMF1 are associated with an increased risk of COPD. Certain COPD-associated SNPs have effects on either SUMF1 gene expression or on lung function. Collectively, this study shows that SUMF1 is associated with an increased risk of developing COPD. (PMID:28464818)
  • Herein, in this report, it was aimed to depict two rare cases from two different families with similar gene mutations and clinical presentation, neuro-imaging, and laboratory result of MSD in Iran. (PMID:29479672)
  • protein disulfide isomerase (PDI) plays a pivotal role in the recognition and quality control of Multiple sulfatase deficiency-causing FGE variants. (PMID:29972788)
  • Multiple sulfatase deficiency is an autosomal recessive lysosomal storage disorder due to a deficiency in formylglycine-generating enzyme, which is encoded by the Sulfatase Modifying Factor 1 ( SUMF1) gene. (PMID:30124108)
  • A homozygous missense variant of SUMF1 in the Bedouin population extends the clinical spectrum in ultrarare neonatal multiple sulfatase deficiency. (PMID:32048457)
  • Multiple Sulfatase Deficiency: A Disease Comprising Mucopolysaccharidosis, Sphingolipidosis, and More Caused by a Defect in Posttranslational Modification. (PMID:32414121)
  • A systematic review and meta-analysis of published cases reveals the natural disease history in multiple sulfatase deficiency. (PMID:32621519)
  • Natural history of multiple sulfatase deficiency: Retrospective phenotyping and functional variant analysis to characterize an ultra-rare disease. (PMID:32749716)
  • Association between SUMF1 polymorphisms and COVID-19 severity. (PMID:37344788)
  • SUMF1 overexpression promotes tumorous cell growth and migration and is correlated with the immune status of patients with glioma. (PMID:38460946)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriosumf1ENSDARG00000044203
mus_musculusSumf1ENSMUSG00000030101
rattus_norvegicusSumf1ENSRNOG00000006813
drosophila_melanogasterCG7049FBGN0035102

Paralogs (1): SUMF2 (ENSG00000129103)

Protein

Protein identifiers

Formylglycine-generating enzymeQ8NBK3 (reviewed: Q8NBK3)

Alternative names: C-alpha-formylglycine-generating enzyme 1, Sulfatase-modifying factor 1

All UniProt accessions (3): Q8NBK3, E9PF05, F5GXA0

UniProt curated annotations — full annotation on UniProt →

Function. Oxidase that catalyzes the conversion of cysteine to 3-oxoalanine on target proteins, using molecular oxygen and an unidentified reducing agent. 3-oxoalanine modification, which is also named formylglycine (fGly), occurs in the maturation of arylsulfatases and some alkaline phosphatases that use the hydrated form of 3-oxoalanine as a catalytic nucleophile. Known substrates include GALNS, ARSA, STS and ARSE.

Subunit / interactions. Monomer, homodimer and heterodimer with SUMF2.

Subcellular location. Endoplasmic reticulum lumen.

Tissue specificity. Ubiquitous. Highly expressed in kidney, pancreas and liver. Detected at lower levels in leukocytes, lung, placenta, small intestine, skeletal muscle and heart.

Post-translational modifications. N-glycosylated. Contains high-mannose-type oligosaccharides.

Disease relevance. Multiple sulfatase deficiency (MSD) [MIM:272200] A clinically and biochemically heterogeneous disorder caused by the simultaneous impairment of all sulfatases, due to defective post-translational modification and activation. It combines features of individual sulfatase deficiencies such as metachromatic leukodystrophy, mucopolysaccharidosis, chondrodysplasia punctata, hydrocephalus, ichthyosis, neurologic deterioration and developmental delay. The disease is caused by variants affecting the gene represented in this entry. SUMF1 mutations result in defective post-translational modification of sulfatases.

Cofactor. The catalytic copper is required to activate oxygen and catalyze oxidative C-H activation.

Pathway. Protein modification; sulfatase oxidation.

Miscellaneous. The resulting 3-oxoalanine in the substrate protein is called C(alpha)-formylglycine by many authors. It should not be confused with N-formylglycine.

Similarity. Belongs to the sulfatase-modifying factor family.

Isoforms (5)

UniProt IDNamesCanonical?
Q8NBK3-11yes
Q8NBK3-22
Q8NBK3-33
Q8NBK3-44
Q8NBK3-55

RefSeq proteins (3): NP_001158146, NP_001158147, NP_877437* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR005532SUMF_domDomain
IPR016187CTDL_foldHomologous_superfamily
IPR042095SUMF_sfHomologous_superfamily
IPR051043Sulfatase_Mod_Factor_KinaseFamily

Pfam: PF03781

Enzyme classification (BRENDA):

  • EC 1.8.3.7 — formylglycine-generating enzyme (BRENDA: 7 organisms, 18 substrates, 5 inhibitors, 11 Km, 14 kcat entries)

Substrate kinetics (BRENDA)

3 substrates with measured Km, best-characterized 3. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
ABZ-SAL-3-OXO-L-ALA-SPTRA-NH20.1–0.555
ABZ-ATTPLCGPSRASILSGR0.39–0.533
ABZ-SALCSPTRA-NH20.23–0.583

Catalyzed reactions (Rhea), 1 shown:

  • L-cysteinyl-[sulfatase] + 2 a thiol + O2 = an organic disulfide + 3-oxo-L-alanyl-[sulfatase] + hydrogen sulfide + H2O + H(+) (RHEA:51152)

UniProt features (71 total): sequence variant 18, binding site 11, strand 11, helix 7, mutagenesis site 6, splice variant 4, sequence conflict 4, disulfide bond 3, region of interest 2, turn 2, signal peptide 1, chain 1, glycosylation site 1

Structure

Experimental structures (PDB)

18 structures.

PDBMethodResolution (Å)
5SSXX-RAY DIFFRACTION1.02
5SSZX-RAY DIFFRACTION1.02
8ARUX-RAY DIFFRACTION1.08
1Z70X-RAY DIFFRACTION1.15
5SSYX-RAY DIFFRACTION1.29
2AFYX-RAY DIFFRACTION1.49
2AIIX-RAY DIFFRACTION1.54
1Y1JX-RAY DIFFRACTION1.55
2AIJX-RAY DIFFRACTION1.55
2HI8X-RAY DIFFRACTION1.64
2AFTX-RAY DIFFRACTION1.66
1Y1GX-RAY DIFFRACTION1.67
1Y1HX-RAY DIFFRACTION1.67
1Y1EX-RAY DIFFRACTION1.73
2AIKX-RAY DIFFRACTION1.73
1Y1FX-RAY DIFFRACTION1.8
2HIBX-RAY DIFFRACTION2
1Y1IX-RAY DIFFRACTION2.61

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8NBK3-F185.390.75

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (11): 296; 298; 300; 336; 341; 130; 259; 260; 273; 275; 293

Disulfide bonds (3): 50–52, 218–365, 235–346

Glycosylation sites (1): 141

Mutagenesis-validated functional residues (6):

PositionPhenotype
333loss of activity.
333reduces activity by 99%.
336loss of activity.
337reduces activity 5-fold.
340no effect.
341loss of activity.

Function

Pathways and Gene Ontology

Reactome pathways

9 pathways

IDPathway
R-HSA-1663150The activation of arylsulfatases
R-HSA-9840310Glycosphingolipid catabolism
R-HSA-1430728Metabolism
R-HSA-163841Gamma carboxylation, hypusinylation, hydroxylation, and arylsulfatase activation
R-HSA-1660662Glycosphingolipid metabolism
R-HSA-392499Metabolism of proteins
R-HSA-428157Sphingolipid metabolism
R-HSA-556833Metabolism of lipids
R-HSA-597592Post-translational protein modification

MSigDB gene sets: 287 (showing top): TGCACTT_MIR519C_MIR519B_MIR519A, GOBP_CARBOHYDRATE_DERIVATIVE_CATABOLIC_PROCESS, KEGG_LYSOSOME, GOBP_MEMBRANE_LIPID_CATABOLIC_PROCESS, GOBP_CARBOHYDRATE_DERIVATIVE_METABOLIC_PROCESS, GTGCCTT_MIR506, GOBP_GLYCOLIPID_CATABOLIC_PROCESS, GOBP_SPHINGOLIPID_METABOLIC_PROCESS, USF_01, TGTGTGA_MIR377, GOBP_POST_TRANSLATIONAL_PROTEIN_MODIFICATION, GOBP_LIPID_METABOLIC_PROCESS, REACTOME_SPHINGOLIPID_METABOLISM, USF_02, GOBP_LIPOSACCHARIDE_METABOLIC_PROCESS

GO Biological Process (3): protein oxidation (GO:0018158), post-translational protein modification (GO:0043687), glycosphingolipid catabolic process (GO:0046479)

GO Molecular Function (6): oxidoreductase activity (GO:0016491), identical protein binding (GO:0042802), formylglycine-generating oxidase activity (GO:0120147), cupric ion binding (GO:1903135), protein binding (GO:0005515), metal ion binding (GO:0046872)

GO Cellular Component (3): endoplasmic reticulum (GO:0005783), endoplasmic reticulum lumen (GO:0005788), membrane (GO:0016020)

Reactome top-level categories

Rollup of top-7 pathways:

CategoryPathways
Gamma carboxylation, hypusinylation, hydroxylation, and arylsulfatase activation1
Glycosphingolipid metabolism1
Post-translational protein modification1
Sphingolipid metabolism1
Metabolism of lipids1
Metabolism1
Metabolism of proteins1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
protein modification process2
glycosphingolipid metabolic process1
glycolipid catabolic process1
sphingolipid catabolic process1
catalytic activity1
protein binding1
oxidoreductase activity, acting on a sulfur group of donors, oxygen as acceptor1
copper ion binding1
binding1
cation binding1
cytoplasm1
endomembrane system1
intracellular membrane-bounded organelle1
endoplasmic reticulum1
intracellular organelle lumen1
cellular anatomical structure1

Protein interactions and networks

STRING

1391 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SUMF1GALNSP34059920
SUMF1ARSBP15848857
SUMF1ERP44Q9BS26837
SUMF1ITPR1Q14643801
SUMF1CNTN4Q8IWV2783
SUMF1ARSAP15289781
SUMF1SGSHP51688763
SUMF1GLB1P16278614
SUMF1IDSP22304614
SUMF1SETMARQ53H47602
SUMF1NAGLUP54802586
SUMF1DARS1P14868533
SUMF1IDUAP35475525
SUMF1LRRN1Q6UXK5501
SUMF1ZNF385AQ96PM9497

IntAct

83 interactions, top by confidence:

ABTypeScore
SUMF1SGSHpsi-mi:“MI:0914”(association)0.710
SGSHSUMF1psi-mi:“MI:0915”(physical association)0.710
B3GNT3PGRMC1psi-mi:“MI:0914”(association)0.670
STIM2PRKAB2psi-mi:“MI:0914”(association)0.640
SUMF1SUMF2psi-mi:“MI:0915”(physical association)0.640
SUMF2SUMF1psi-mi:“MI:0915”(physical association)0.640
SUMF1SUMF2psi-mi:“MI:0403”(colocalization)0.640
CTRCSUMF1psi-mi:“MI:0915”(physical association)0.590
ERP44MEX3Apsi-mi:“MI:0914”(association)0.530
IFNA21IFIT3psi-mi:“MI:0914”(association)0.530
ARSAZBTB43psi-mi:“MI:0914”(association)0.530
GNSCLPXpsi-mi:“MI:0914”(association)0.530
PNLIPLAMC1psi-mi:“MI:0914”(association)0.530
STSGJA1psi-mi:“MI:0914”(association)0.530
FBXO2TMEM131Lpsi-mi:“MI:0914”(association)0.530
SUN2POTEFpsi-mi:“MI:0914”(association)0.530
GALNSCLGNpsi-mi:“MI:0914”(association)0.530
SGSHFBXO21psi-mi:“MI:0914”(association)0.530

BioGRID (67): SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Co-fractionation), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS), SUMF1 (Affinity Capture-MS)

ESM2 similar proteins: A1A4K5, A2BGL3, D1A7C3, D4PHA7, I6Y8I5, O75808, P22413, P57110, P97812, Q09325, Q0P5L5, Q10979, Q13219, Q14DK5, Q16270, Q16549, Q2TBF2, Q3TCT4, Q505J3, Q5DTK1, Q5QQ49, Q5QQ50, Q5QQ51, Q5QQ56, Q5QQ57, Q5RCR5, Q61139, Q62849, Q64610, Q658N2, Q6P9A2, Q6UWX4, Q80XH4, Q811B1, Q86Y38, Q8BG58, Q8BPG6, Q8NBJ7, Q8NBK3, Q8R0F3

Diamond homologs: A0R5N0, G7CFI3, O69671, Q0P5L5, Q58CP2, Q5RCR5, Q7D513, Q8BPG6, Q8NBJ7, Q8NBK3, Q8R0F3, Q9F3C7, A0A0H3MBJ2, D1A7C3, I6Y8I5, P0DPS7, Q7AJA5, Q822R1, Q9PKP3, V9TSX0, O94632, O34722

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 83 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Gamma carboxylation, hypusinylation, hydroxylation, and arylsulfatase activation642.3×3e-07
Glycosphingolipid catabolism734.2×1e-07
Glycosphingolipid metabolism630.1×2e-06
Sphingolipid metabolism616.8×6e-05
Neutrophil degranulation166.2×3e-07
Metabolism of lipids94.7×3e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

734 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic58
Likely pathogenic51
Uncertain significance204
Likely benign311
Benign38

Top pathogenic / likely-pathogenic (30)

Variant IDHGVSClassification
1067600NM_182760.4(SUMF1):c.603-1G>CPathogenic
1070279NC_000003.11:g.(?4403818)(4494743_?)delPathogenic
1070280NC_000003.11:g.(?4403818)(4459826_?)delPathogenic
1072766NM_182760.4(SUMF1):c.43G>T (p.Glu15Ter)Pathogenic
1076822NM_182760.4(SUMF1):c.451A>T (p.Lys151Ter)Pathogenic
1323658NM_182760.4(SUMF1):c.706C>T (p.Arg236Ter)Pathogenic
1348585NC_000003.11:g.(?4494540)(4494753_?)delPathogenic
1366344NM_182760.4(SUMF1):c.323del (p.Ile108fs)Pathogenic
1378318NM_182760.4(SUMF1):c.603-2A>TPathogenic
1409518NM_182760.4(SUMF1):c.903G>A (p.Trp301Ter)Pathogenic
1422956NM_182760.4(SUMF1):c.668del (p.Lys223fs)Pathogenic
1431236NM_182760.4(SUMF1):c.826C>T (p.Gln276Ter)Pathogenic
1453143NM_182760.4(SUMF1):c.539G>A (p.Trp180Ter)Pathogenic
1456418NM_182760.4(SUMF1):c.790G>T (p.Glu264Ter)Pathogenic
1457189NC_000003.11:g.(?4461738)(4461840_?)delPathogenic
1459688NC_000003.11:g.(?4403818)(4461840_?)delPathogenic
1460004NC_000003.11:g.(?4403818)(4508939_?)delPathogenic
1460108NM_182760.4(SUMF1):c.632G>A (p.Trp211Ter)Pathogenic
1460160NM_182760.4(SUMF1):c.876C>G (p.Tyr292Ter)Pathogenic
2022738NM_182760.4(SUMF1):c.774del (p.Asn259fs)Pathogenic
2029578NM_182760.4(SUMF1):c.675dup (p.Pro226fs)Pathogenic
2066505NM_182760.4(SUMF1):c.572del (p.Pro191fs)Pathogenic
2081614NM_182760.4(SUMF1):c.552del (p.Gly185fs)Pathogenic
2083297NM_182760.4(SUMF1):c.1023T>A (p.Cys341Ter)Pathogenic
2095730NM_182760.4(SUMF1):c.908C>G (p.Ser303Ter)Pathogenic
2203301NM_182760.4(SUMF1):c.954+5G>TPathogenic
2427593NC_000003.11:g.(?4490940)(4491034_?)delPathogenic
2663NM_182760.4(SUMF1):c.519+5_519+8delPathogenic
2665NM_182760.4(SUMF1):c.979C>T (p.Arg327Ter)Pathogenic
2671NM_182760.4(SUMF1):c.661del (p.Ala221fs)Pathogenic

SpliceAI

435 predictions. Top by Δscore:

VariantEffectΔscore
3:3799917:GGG:Gdonor_gain0.9800
3:3799918:GG:Gdonor_gain0.9800
3:3799918:GGG:Gdonor_gain0.9800
3:3799919:GG:Gdonor_gain0.9800
3:3799917:GGGGT:Gdonor_loss0.9700
3:3799920:GT:Gdonor_loss0.9700
3:3799921:TAAG:Tdonor_loss0.9700
3:3801221:GC:Gdonor_gain0.9700
3:3799922:AA:Adonor_loss0.9600
3:3844362:A:AGacceptor_gain0.9600
3:3844363:G:GGacceptor_gain0.9600
3:3844442:T:Gdonor_gain0.9600
3:3799920:G:GGdonor_gain0.9500
3:3801244:GCC:Gdonor_gain0.9500
3:3843674:A:AGacceptor_gain0.9500
3:3844358:TTTCA:Tacceptor_loss0.9500
3:3844359:TTCA:Tacceptor_loss0.9500
3:3844360:TCA:Tacceptor_loss0.9500
3:3844362:A:Cacceptor_loss0.9500
3:3844363:G:GCacceptor_loss0.9500
3:3844363:GGCAC:Gacceptor_gain0.9500
3:3801191:TTCTG:Tdonor_gain0.9400
3:3801242:CCGCC:Cdonor_gain0.9400
3:3801243:CGCC:Cdonor_gain0.9400
3:3801244:GCCG:Gdonor_gain0.9400
3:3801247:G:GGdonor_gain0.9400
3:3801216:C:Tdonor_gain0.9300
3:3799483:C:Tdonor_gain0.9200
3:3799923:AG:Adonor_loss0.9200
3:3801222:C:Gdonor_gain0.9200

AlphaMissense

2418 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
3:4362241:C:AR343M0.999
3:4410918:A:GW301R0.999
3:4410918:A:TW301R0.999
3:4410924:A:GW299R0.999
3:4410924:A:TW299R0.999
3:4417173:A:CF265L0.999
3:4417173:A:TF265L0.999
3:4417175:A:GF265L0.999
3:4420128:A:GW180R0.999
3:4420128:A:TW180R0.999
3:4362241:C:GR343T0.998
3:4410922:C:AW299C0.998
3:4410922:C:GW299C0.998
3:4417232:A:GW246R0.998
3:4417232:A:TW246R0.998
3:4420126:C:AW180C0.998
3:4420126:C:GW180C0.998
3:4420131:A:GW179R0.998
3:4420131:A:TW179R0.998
3:4362201:G:CS356R0.997
3:4362201:G:TS356R0.997
3:4362203:T:GS356R0.997
3:4362246:A:CC341W0.997
3:4362247:C:TC341Y0.997
3:4362248:A:GC341R0.997
3:4376336:G:CC336W0.997
3:4376337:C:TC336Y0.997
3:4376338:A:GC336R0.997
3:4410928:G:CN297K0.997
3:4410928:G:TN297K0.997

dbSNP variants (sampled 300 via entrez): RS1000003691 (3:4215238 C>G), RS1000004641 (3:4280496 A>G), RS1000007571 (3:4233230 G>A), RS1000017400 (3:4154691 C>A,G), RS1000018727 (3:4133576 T>C), RS1000026870 (3:4418562 T>C), RS1000031095 (3:4055858 T>C,G), RS1000034758 (3:4430929 C>A), RS1000036921 (3:4349941 A>C), RS1000037108 (3:4400010 A>G), RS1000040558 (3:4426764 C>A), RS1000042118 (3:4075841 A>T), RS1000042683 (3:4312736 G>A), RS1000043696 (3:4346656 G>A), RS1000044365 (3:4362908 T>C)

Disease associations

OMIM: gene MIM:607939 | disease phenotypes: MIM:272200, MIM:606658

GenCC curated gene-disease

DiseaseClassificationInheritance
mucosulfatidosisDefinitiveAutosomal recessive

ClinGen Gene-Disease Validity (1)

Expert-panel classifications — Definitive > Strong > Moderate > Limited > Disputed > Refuted.

DiseaseClassificationInheritance
mucosulfatidosisDefinitiveAR

Mondo (3): mucosulfatidosis (MONDO:0010088), spinocerebellar ataxia type 15/16 (MONDO:0011694), neurodevelopmental disorder (MONDO:0700092)

Orphanet (3): Multiple sulfatase deficiency (Orphanet:585), Spinocerebellar ataxia type 15/16 (Orphanet:98769), Spinocerebellar ataxia type 16 (Orphanet:98770)

HPO phenotypes

50 total (30 of 50 shown, HPO-id order):

HPOTerm
HP:0000007Autosomal recessive inheritance
HP:0000238Hydrocephalus
HP:0000252Microcephaly
HP:0000256Macrocephaly
HP:0000280Coarse facial features
HP:0000319Smooth philtrum
HP:0000365Hearing impairment
HP:0000407Sensorineural hearing impairment
HP:0000463Anteverted nares
HP:0000505Visual impairment
HP:0000518Cataract
HP:0000546Retinal degeneration
HP:0000574Thick eyebrow
HP:0000648Optic atrophy
HP:0000943Dysostosis multiplex
HP:0001249Intellectual disability
HP:0001250Seizure
HP:0001251Ataxia
HP:0001257Spasticity
HP:0001263Global developmental delay
HP:0001272Cerebellar atrophy
HP:0001319Neonatal hypotonia
HP:0001387Joint stiffness
HP:0001744Splenomegaly
HP:0002003Large forehead
HP:0002059Cerebral atrophy
HP:0002119Ventriculomegaly
HP:0002208Coarse hair
HP:0002240Hepatomegaly
HP:0002376Developmental regression

GWAS associations

15 associations (top):

StudyTraitp-value
GCST000783_4Multiple sclerosis–Brain Glutamate Levels6.000000e-07
GCST003488_15Response to fenofibrate (triglyceride levels)9.000000e-06
GCST004283_1Midgestational circulating levels of PCBs1.000000e-07
GCST004283_14Midgestational circulating levels of PCBs1.000000e-07
GCST004283_17Midgestational circulating levels of PCBs9.000000e-08
GCST004283_19Midgestational circulating levels of PCBs2.000000e-07
GCST004283_2Midgestational circulating levels of PCBs9.000000e-08
GCST007478_7Non-word reading3.000000e-06
GCST008161_117Waist circumference adjusted for body mass index8.000000e-06
GCST008959_2Lipid traits4.000000e-08
GCST010516_2Fractures (paediatric)1.000000e-07
GCST012277_9Clostridioides difficle infection2.000000e-06
GCST012490_14Femur bone mineral density x serum urate levels interaction3.000000e-13
GCST012490_451Femur bone mineral density x serum urate levels interaction3.000000e-10
GCST012616_17Spondylosis5.000000e-06

EFO canonical traits (8, from GWAS)

EFO IDTrait name
EFO:0007681triglyceride change measurement
EFO:0007042polychlorinated biphenyls measurement
EFO:0007964gestational serum measurement
EFO:0005299non-word reading
EFO:0007789BMI-adjusted waist circumference
EFO:0004612high density lipoprotein cholesterol measurement
EFO:0009130clostridium difficile infection
EFO:0004531urate measurement

MeSH disease descriptors (3)

DescriptorNameTree numbers
D052517Multiple Sulfatase Deficiency DiseaseC10.228.140.163.100.435.825.850.750; C16.320.565.189.435.825.850.750; C16.320.565.398.641.803.925.750; C16.320.565.595.554.825.850.750; C18.452.132.100.435.825.850.750; C18.452.584.563.641.803.925.750; C18.452.648.189.435.825.850.750; C18.452.648.398.641.803.925.750; C18.452.648.595.554.825.850.750
D065886Neurodevelopmental DisordersF03.625
C564685Spinocerebellar Ataxia 15 (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

PharmGKB variants

1 variants.

VariantGenesLevelScore#Clin annotsDrugs
rs793391SUMF10.000

CTD chemical–gene interactions

40 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, increases expression, affects expression6
Benzo(a)pyrenedecreases expression, decreases methylation, increases expression3
bisphenol Aaffects cotreatment, increases expression, decreases methylation2
Air Pollutantsaffects expression, increases abundance, decreases expression2
Cyclosporinedecreases expression2
aristolochic acid Idecreases expression1
bisphenol Faffects cotreatment, increases expression1
methylmercuric chloridedecreases expression1
triphenyl phosphateaffects expression1
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
CGP 52608affects binding, increases reaction1
2-palmitoylglycerolincreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, increases expression1
abrinedecreases expression1
dorsomorphinaffects cotreatment, increases expression1
bisphenol Sincreases expression1
jinfukangincreases expression1
bisphenol AFincreases expression1
Temozolomidedecreases expression1
Sunitinibincreases expression1
Leflunomidedecreases expression1
Cadmiumincreases expression, increases abundance1
Dexamethasoneaffects cotreatment, increases expression1
Estradioldecreases expression1
Formaldehydeincreases expression1
Hydrogen Peroxideincreases expression1
Indomethacinaffects cotreatment, increases expression1
Ivermectindecreases expression1
Ozoneaffects expression, increases abundance1
Phenobarbitalaffects expression1

Clinical trials (associated diseases)

209 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT04586348PHASE4UNKNOWNPrenatal Iodine Supplementation and Early Childhood Neurodevelopment
NCT04873115PHASE4UNKNOWNDouble-blind, Placebo-controlled, Randomized Clinical Trial Comparing the Efficacy and Safety of Sialanar Plus orAl rehabiLitation Against Placebo Plus Oral Rehabilitation for chIldren and Adolescents With seVere Sialorrhoea and Neurodisabilties,
NCT02559102PHASE3COMPLETEDDexmedetomidine Sedation Versus General Anaesthesia for Inguinal Hernia Surgery in Infants
NCT02757079PHASE3COMPLETEDStudy of the Efficacy and Safety of NPC-15 for Sleep Disorders of Children With Neurodevelopmental Disorders
NCT06915480PHASE3RECRUITINGReducing Missed Appointments
NCT07377032PHASE3RECRUITINGTAP-GRIN: Interventional Study on Patients With GRIN-related Neurodevelopmental Disorders
NCT02909959PHASE2COMPLETEDSulforaphane for the Treatment of Young Men With Autism Spectrum Disorder
NCT06081348PHASE2RECRUITINGSertraline vs. Placebo in the Treatment of Anxiety in Children and AdoLescents With NeurodevelopMental Disorders
NCT06352372PHASE2COMPLETEDSafety and Efficacy of tPBM for Epileptiform Activity in Autism
NCT00503191PHASE1COMPLETEDNeuroModulation Technique Treatment of Autism
NCT04475848PHASE1COMPLETEDA Study to Investigate the Safety, Tolerability, Pharmacokinetics, Pharmacodynamics and Food Effect of RO6953958 in Healthy Participants
NCT06300398PHASE1COMPLETEDIAMA-6 Oral Dose Study in Healthy Adults
NCT01841983Not specifiedCOMPLETEDProject A: Integrated Approaches to Improving the Health and Safety of Health Care Workers
NCT02699190Not specifiedCOMPLETEDLeukoSEQ: Whole Genome Sequencing as a First-Line Diagnostic Tool for Leukodystrophies
NCT03047369Not specifiedRECRUITINGThe Myelin Disorders Biorepository Project
NCT03333200Not specifiedRECRUITINGLongitudinal Study of Neurodegenerative Disorders
NCT03831737Not specifiedCOMPLETEDTeam-based Ergonomics Educational Model for Workplace WELLNESS Improvement: A Pilot Study
NCT06036693Not specifiedRECRUITINGMPS (RaDiCo Cohort) (RaDiCo-MPS)
NCT01793168Not specifiedRECRUITINGRare Disease Patient Registry & Natural History Study - Coordination of Rare Diseases at Sanford
NCT01783041PHASE2/PHASE3COMPLETEDEffect of Early L-Carnitine Supplementation on Neurodevelopmental Outcomes in Very Preterm Infants
NCT05767385PHASE2/PHASE3RECRUITINGFetal Cerebrovascular Autoregulation in Congenital Heart Disease and Association With Neonatal Neurobehavior
NCT05675098EARLY_PHASE1NOT_YET_RECRUITINGCentral Nervous System Stimulants and Physical Function in Children With Cerebral Palsy
NCT00783783Not specifiedCOMPLETEDCYP2D6 Pharmacogenetics in Risperidone-Treated Children
NCT01778504Not specifiedRECRUITINGStudying Childhood-onset Behavioral, Psychiatric, and Developmental Disorders
NCT01850784Not specifiedUNKNOWNHigh Energy Formula Feeding in Infants With Congenital Heart Disease
NCT01922791Not specifiedCOMPLETEDNutrition and Pregnancy Intervention Study
NCT01942525Not specifiedUNKNOWNInfluence of Intrauterine Growth Restriction on Amplitude-integrated EEG in Preterm Infants
NCT02003170Not specifiedCOMPLETEDEtiology and Early Diagnosis of Neurodevelopmental Disorders
NCT02118649Not specifiedACTIVE_NOT_RECRUITINGEnhancing Behavior and Brain Response to Visual Targets Using a Computer Game
NCT02557191Not specifiedTERMINATEDBiomarkers, Neurodevelopment and Preterm Infants
NCT02690675Not specifiedCOMPLETEDIron Supplement Effect on Child Development
NCT02694003Not specifiedCOMPLETEDBetter Nights, Better Days for Children With Neurodevelopment Disorders
NCT02792894Not specifiedCOMPLETEDFamily Networks (FaNs) for Children With Developmental Disorders and Delays
NCT02871674Not specifiedUNKNOWNGood Night Project: Behavioural Sleep Interventions for Children With ADHD: A Randomised Controlled Trial
NCT02887157Not specifiedCOMPLETEDAnalyzing Retinal Microanatomy in ROP
NCT02898298Not specifiedCOMPLETEDPositive Emotion Regulation Training in Children, Adolescents and Young Adults With and Without Developmental Disorder
NCT02912780Not specifiedUNKNOWNIntroduction of Microsystems in a Level 3 Neonatal Intensive Care Unit
NCT03023293Not specifiedCOMPLETEDn-3 PUFAs, Irisin and Maternal Glucose Metabolism From Pregnancy to Postpartum
NCT03023644Not specifiedCOMPLETEDImproving Neurodevelopmental Outcomes in Children With Congenital Heart Disease: An Intervention Study
NCT03032991Not specifiedUNKNOWNEarly Biomarkers of Neurodevelopment in Offspring of Diabetic Mothers