NSD1

gene
On this page

Also known as ARA267FLJ22263KMT3B

Summary

NSD1 (nuclear receptor binding SET domain protein 1, HGNC:14234) is a protein-coding gene on chromosome 5q35.3, encoding Histone-lysine N-methyltransferase, H3 lysine-36 specific (Q96L73). Histone methyltransferase that dimethylates Lys-36 of histone H3 (H3K36me2). It is haploinsufficient (ClinGen: sufficient evidence).

This gene encodes a protein containing a SET domain, 2 LXXLL motifs, 3 nuclear translocation signals (NLSs), 4 plant homeodomain (PHD) finger regions, and a proline-rich region. The encoded protein enhances androgen receptor (AR) transactivation, and this enhancement can be increased further in the presence of other androgen receptor associated coregulators. This protein may act as a nucleus-localized, basic transcriptional factor and also as a bifunctional transcriptional regulator. Mutations of this gene have been associated with Sotos syndrome and Weaver syndrome. One version of childhood acute myeloid leukemia is the result of a cryptic translocation with the breakpoints occurring within nuclear receptor-binding Su-var, enhancer of zeste, and trithorax domain protein 1 on chromosome 5 and nucleoporin, 98-kd on chromosome 11. Multiple transcript variants encoding distinct isoforms have been identified for this gene.

Source: NCBI Gene 64324 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): Sotos syndrome (Definitive, ClinGen) — +2 more curated relationships
  • GWAS associations: 43
  • Clinical variants (ClinVar): 2,636 total — 463 pathogenic, 178 likely-pathogenic
  • Phenotypes (HPO): 194
  • Druggable target: yes — 7 molecules with ChEMBL bioactivity
  • Cancer driver (intOGen): loss-of-function (tumor-suppressor-like) across 11 cancer types
  • Dosage sensitivity (ClinGen): haploinsufficiency sufficient evidence, triplosensitivity no evidence
  • MANE Select transcript: NM_022455

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:14234
Approved symbolNSD1
Namenuclear receptor binding SET domain protein 1
Location5q35.3
Locus typegene with protein product
StatusApproved
AliasesARA267, FLJ22263, KMT3B
Ensembl geneENSG00000165671
Ensembl biotypeprotein_coding
OMIM606681
Entrez64324

Gene structure

Transcript identifiers

Ensembl transcripts: 28 — 14 protein_coding, 7 protein_coding_CDS_not_defined, 5 retained_intron, 2 nonsense_mediated_decay

ENST00000347982, ENST00000354179, ENST00000375350, ENST00000439151, ENST00000503056, ENST00000504457, ENST00000505395, ENST00000508029, ENST00000508896, ENST00000510954, ENST00000511258, ENST00000513736, ENST00000515735, ENST00000602285, ENST00000638627, ENST00000644863, ENST00000685206, ENST00000686385, ENST00000686993, ENST00000687095, ENST00000687453, ENST00000688613, ENST00000689326, ENST00000689345, ENST00000689549, ENST00000692024, ENST00000936190, ENST00000936191

RefSeq mRNA: 12 — MANE Select: NM_022455 NM_001365684, NM_001409301, NM_001409302, NM_001409303, NM_001409304, NM_001409305, NM_001409306, NM_001409307, NM_001409308, NM_001409309, NM_022455, NM_172349

CCDS: CCDS4412

Canonical transcript exons

ENST00000439151 — 23 exons

ExonStartEnd
ENSE00001378782177135087177136030
ENSE00001516123177133773177133952
ENSE00001547915177191884177192019
ENSE00001549090177204120177204292
ENSE00003808375177209636177212195
ENSE00004014201177256951177257151
ENSE00004014202177251730177251853
ENSE00004014203177283787177283928
ENSE00004014204177246678177246796
ENSE00004014205177273672177273784
ENSE00004014206177248181177248324
ENSE00004014208177239756177239865
ENSE00004014209177238237177238507
ENSE00004014210177291954177292158
ENSE00004014211177282465177282581
ENSE00004014212177269602177269807
ENSE00004014213177235821177235945
ENSE00004014214177259989177260168
ENSE00004014215177244195177244270
ENSE00004014216177280565177280834
ENSE00004014217177293832177300213
ENSE00004014218177267562177267718
ENSE00004014219177288819177288925

Expression profiles

Bgee: expression breadth ubiquitous, 235 present calls, max score 96.55.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 19.4492 / max 352.3882, expressed in 1796 samples.

FANTOM5 promoters (8 alternative TSS)

Promoter IDTPM avgSamples expressed
6046014.36041773
604581.3001865
604620.9439601
604590.9164619
604610.7893365
604570.6365332
2038090.4902264
604670.01233

Top tissues by expression

274 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
sural nerveUBERON:001548896.55gold quality
colonic epitheliumUBERON:000039795.88gold quality
calcaneal tendonUBERON:000370195.00gold quality
adrenal tissueUBERON:001830394.42gold quality
ventricular zoneUBERON:000305391.53gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047390.29gold quality
ganglionic eminenceUBERON:000402390.08gold quality
cortical plateUBERON:000534389.94gold quality
tendonUBERON:000004389.44gold quality
monocyteCL:000057687.31gold quality
leukocyteCL:000073886.85gold quality
mononuclear cellCL:000084286.79gold quality
spermCL:000001986.60silver quality
tonsilUBERON:000237286.16gold quality
corpus callosumUBERON:000233685.97gold quality
stromal cell of endometriumCL:000225585.54gold quality
cervix squamous epitheliumUBERON:000692285.50gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099185.19gold quality
hindlimb stylopod muscleUBERON:000425284.48gold quality
cerebellar vermisUBERON:000472084.44gold quality
muscle of legUBERON:000138384.32gold quality
male germ cellCL:000001584.28silver quality
granulocyteCL:000009484.27gold quality
islet of LangerhansUBERON:000000684.08gold quality
gastrocnemiusUBERON:000138884.03gold quality
left testisUBERON:000453383.84gold quality
bone marrow cellCL:000209283.76gold quality
right testisUBERON:000453483.76gold quality
bloodUBERON:000017883.74gold quality
testisUBERON:000047383.74gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.08

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

1 targets.

TargetRegulation
PRB4Activation

miRNA regulators (miRDB)

215 targeting NSD1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-29A-3P100.0073.111835
HSA-MIR-29B-3P100.0073.181833
HSA-MIR-29C-3P100.0073.151833
HSA-MIR-8485100.0077.574731
HSA-MIR-3120-5P100.0065.56965
HSA-MIR-4425100.0067.591049
HSA-MIR-6758-5P100.0066.211470
HSA-MIR-6856-5P100.0065.471298
HSA-MIR-450A-1-3P100.0069.331837
HSA-MIR-574-5P100.0066.01989
HSA-MIR-4747-5P100.0067.902681
HSA-MIR-4682100.0068.891258
HSA-MIR-513A-5P100.0069.772465
HSA-MIR-5196-5P100.0067.982761
HSA-MIR-6867-5P100.0082.213464
HSA-MIR-4481100.0066.421669
HSA-MIR-186-5P99.9970.833707
HSA-MIR-34A-5P99.9971.211784
HSA-MIR-449A99.9971.051776
HSA-MIR-607799.9968.042299
HSA-MIR-27A-3P99.9872.132955
HSA-MIR-27B-3P99.9872.132955
HSA-MIR-998599.9872.112939
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-MIR-433-3P99.9869.371203
HSA-MIR-4745-5P99.9865.951028
HSA-MIR-6888-3P99.9765.951170
HSA-MIR-34C-5P99.9770.451577
HSA-MIR-449B-5P99.9770.261580

Functional genomics

ClinGen dosage: haploinsufficiency 3 (sufficient evidence), triplosensitivity 0 (no evidence). ClinGen Gene Dosage Map

Literature-anchored findings (GeneRIF, showing 40)

  • The genomic structure of NSD1 consists of at least 23 exons, the cDNA is 8552 bp long, contains at least six functional domains (SET, PWWP-I, PWWP-II, PHD-I, PHD-II, and PHD-III) and ten putative nuclear localization signals. (PMID:11733144)
  • Haploinsufficiency of NSD1 causes Sotos syndrome (PMID:11896389)
  • NSD1 mutations are the major cause of Sotos syndrome and occur in some cases of Weaver syndrome but are rare in other overgrowth phenotypes (PMID:12464997)
  • A novel 1 base pair mutation in the NSD1 gene is the cause of familial Sotos syndrome in a Sotos syndrome family. (PMID:12525543)
  • In 26 patients with Soto’s syndrome 18 of the 20 microdeletions occurred prezygotically in the paternally derived chromosome 5, whereas those in the maternally derived chromosome were found in only two cases (PMID:12687502)
  • In Sotos and Weaver syndromes mental retardation was consistently more severe in patients with NSD1 deletions. (PMID:12807965)
  • fused to NUP98 and frequently transcribed in childhool AML (PMID:12931227)
  • genetic susceptibility in Beckwith-Wiedemann syndrome; NSD1 could be involved in imprinting of the chromosome 11p15 region (PMID:14997421)
  • NSD1(+/-) patients show endocrine and paracrine changes in the IGF system (PMID:15362962)
  • Mutation or microdeletion of NSD1 is diagnostic of Sotos syndrome. (PMID:15365454)
  • Haploinsufficiency of NSD1 is the major cause of Sotos syndrome, and NSD1 plays a role in growth and brain development in humans. (PMID:15539801)
  • androgen signaling pathway might cross talk with apoptosis signaling pathway through the interaction between ARA267-alpha and DR6 (PMID:15623156)
  • Deletion of Low-copy repeats that are centromeric and telomeric to NSD1 is associated with Sotos syndrome (PMID:15640245)
  • analysis of 266 Sotos syndrome patients with NSD1 aberrations (PMID:15942875)
  • REVIEW: mutational analysis in Sotos syndrome (PMID:16010675)
  • NSD1 mutation patients showed less severe behavior problems and an easier temperament than NSD1 non-mutation patients, and ADHD was not a consistent finding in these patients. (PMID:16780628)
  • The SET domain of NSD1 is involved in NIH3T3 cell growth by modulating serum dependence. (PMID:17437319)
  • investigated the NSD1 cDNA sequence in genetically confirmed Sos patients harbouring truncating and missense mutations (PMID:17561922)
  • confirms the heterogeneity of NSD1 alterations in Sotos syndrome and therefore the need to complete sequencing analysis by screening for partial deletions and duplications to ensure an accurate molecular diagnosis (PMID:17565729)
  • study shows that NUP98-NSD1 induces acute myeloid leukemia in vivo, sustains self-renewal of myeloid stem cells in vitro, and enforces expression of the HoxA7, HoxA9, HoxA10 and Meis1 proto-oncogenes (PMID:17589499)
  • Screening for NSD1 mutations and deletions in patients with autism and macrocephaly is not warranted in the absence of other features of Sotos syndrome. (PMID:18001468)
  • Mutation analysis was performed in 4 patients with Sotos syndrome with typical phenotypic characteristics. In each of the 4 patients a NSD1 mutation was found (2 frame shifts, 1 nonsense and 1 missense mutation). (PMID:19039236)
  • Familial Sotos syndrome caused by a novel missense mutation in NSD1 is reported. (PMID:19545651)
  • MLPA analysis in 30 Sotos syndrome patients revealed one total NSD1 deletion and two partial deletions not previously reported. (PMID:19596467)
  • The severity of the hypodontia seemed to increase with the severity of aberration of the NSD1. (PMID:19876911)
  • Data describe a NF-kappaB regulatory pathway that is driven by reversible lysine methylation of p65, carried out by nuclear receptor-binding SET domain-containing protein 1 (NSD1) and F-box and leucine-rich repeat protein 11 (FBXL11). (PMID:20080798)
  • study describes two boys with Sotos syndrome in whom PCR amplification & direct sequencing of the NSD1 gene identified 2 novel mutations not previously described: c.4736dupG in exon 12 and c.3938_3939insT in exon (PMID:20420030)
  • NSD1 regulates RNAP II recruitment to BMP4, and failure to do so leads to reduced gene expression and abrogated levels of H3K36Me and CTD phosphorylation. (PMID:20837538)
  • The structure of NSD1 reveals an autoregulatory mechanism underlying histone H3K36 methylation. (PMID:21196496)
  • Our cases suggest a contiguous gene deletion syndrome including NSD1 and SLC34A1 and provide a potential genetic basis for idiopathic infantile hypercalcemia. (PMID:21597970)
  • these finding exposes a key regulatory and recognition mechanism driven by the flexibility of a loop at the interface of the SET and postSET region in NSD1 protein. (PMID:21806967)
  • NUP98/NSD1 identifies a previously unrecognized group of young AML patients, with distinct characteristics and dismal prognosis, for whom new treatment strategies are urgently needed. (PMID:21813447)
  • data suggest that Sotos point mutations in NSD1 PHD domains disrupt its transcriptional regulation by interfering with its ability to bind epigenetic marks and recruit cofactors. (PMID:21972110)
  • NSD1 abnormalities were identified in 15 (83%) patients. Among them, eight patients (53%) had 5q35 microdeletions and the other seven patients (47%) had seven different NSD1 intragenic mutations including four novel mutations (PMID:23190751)
  • We performed a genetic analysis for Sotos syndrome which revealed a heterozygous mutation on the exon 23 of NSD1. (PMID:23333153)
  • NSD1 mutations that cause Sotos syndrome are loss-of-function, primarily truncating mutations or missense mutations at key residues in functional domains. EZH2 mutations that cause Weaver syndrome are primarily missense variants (PMID:23592277)
  • NSD1 interaction with liganded NRs including ERa, AR, RARa, RXRa, TRb, PPARg and VDR is mediated by an LXXLL motif. Interaction with the NR2E/F subfamily including COUP-TFI, COUP-TFII, EAR2 and TLX requires an overlapping F/YSXXLXXL/Y motif. (PMID:23975195)
  • support to the adoption of screening for NUP98-NSD1 in pediatric AML without otherwise favorable genetic markers (PMID:23999921)
  • all of the H3K36-specific methyltransferases, including ASH1L, HYPB, NSD1, and NSD2 were inhibited by ubH2A, whereas the other histone methyltransferases, including PRC2, G9a, and Pr-Set7 were not affected by ubH2A. (PMID:24019522)
  • NSD1 prefers aromatic, hydrophobic, and basic residues at the -2, -1 and +2, and +1 sites of its substrate peptide in histone H3. (PMID:24412544)

Cross-species orthologs

22 orthologs

OrganismSymbolGene ID
danio_rerionsd1aENSDARG00000060016
danio_rerionsd1bENSDARG00000060705
mus_musculusNsd1ENSMUSG00000021488
rattus_norvegicusNsd1ENSRNOG00000016680
drosophila_melanogasterash1FBGN0005386
drosophila_melanogastertrrFBGN0023518
drosophila_melanogasterSet2FBGN0030486
drosophila_melanogasterCG4565FBGN0037841
drosophila_melanogasterG9aFBGN0040372
drosophila_melanogastereggFBGN0086908
caenorhabditis_elegansset-32WBGENE00008062
caenorhabditis_elegansWBGENE00008206
caenorhabditis_elegansWBGENE00008527
caenorhabditis_elegansWBGENE00011729
caenorhabditis_elegansmet-1WBGENE00016603
caenorhabditis_elegansWBGENE00018023
caenorhabditis_elegansWBGENE00019584
caenorhabditis_elegansWBGENE00019690
caenorhabditis_elegansWBGENE00019883
caenorhabditis_elegansWBGENE00020006
caenorhabditis_elegansWBGENE00020919
caenorhabditis_elegansWBGENE00021282

Paralogs (19): KMT2C (ENSG00000055609), SETD1A (ENSG00000099381), SUV39H1 (ENSG00000101945), EZH2 (ENSG00000106462), EZH1 (ENSG00000108799), NSD2 (ENSG00000109685), ASH1L (ENSG00000116539), KMT2A (ENSG00000118058), SETDB2 (ENSG00000136169), SETD1B (ENSG00000139718), SETDB1 (ENSG00000143379), NSD3 (ENSG00000147548), SETBP1 (ENSG00000152217), SUV39H2 (ENSG00000152455), KMT2D (ENSG00000167548), EHMT1 (ENSG00000181090), SETD2 (ENSG00000181555), EHMT2 (ENSG00000204371), KMT2B (ENSG00000272333)

Protein

Protein identifiers

Histone-lysine N-methyltransferase, H3 lysine-36 specificQ96L73 (reviewed: Q96L73)

Alternative names: Androgen receptor coactivator 267 kDa protein, Androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, Lysine N-methyltransferase 3B, Nuclear receptor-binding SET domain-containing protein 1

All UniProt accessions (8): A0A1W2PS55, A0A2R8Y7J4, A0A8I5KV31, A0A8I5QJP2, Q96L73, D6RBP3, D6RG26, H7BYB0

UniProt curated annotations — full annotation on UniProt →

Function. Histone methyltransferase that dimethylates Lys-36 of histone H3 (H3K36me2). Transcriptional intermediary factor capable of both negatively or positively influencing transcription, depending on the cellular context.

Subunit / interactions. Interacts with the ligand-binding domains of RARA and THRA in the absence of ligand; in the presence of ligand the interaction is severely disrupted but some binding still occurs. Interacts with the ligand-binding domains of RXRA and ESRRA only in the presence of ligand. Interacts with ZNF496. Interacts with AR DNA- and ligand-binding domains.

Subcellular location. Nucleus. Chromosome.

Tissue specificity. Expressed in the fetal/adult brain, kidney, skeletal muscle, spleen, and the thymus, and faintly in the lung.

Disease relevance. Sotos syndrome (SOTOS) [MIM:117550] An autosomal dominant, childhood overgrowth syndrome characterized by pre- and postnatal overgrowth, developmental delay, intellectual disability, advanced bone age, and abnormal craniofacial morphology including macrodolichocephaly with frontal bossing, frontoparietal sparseness of hair, apparent hypertelorism, downslanting palpebral fissures, and facial flushing. Common oral findings include: premature eruption of teeth; high, arched palate; pointed chin and, more rarely, prognathism. The disease is caused by variants affecting the gene represented in this entry. Beckwith-Wiedemann syndrome (BWS) [MIM:130650] A disorder characterized by anterior abdominal wall defects including exomphalos (omphalocele), pre- and postnatal overgrowth, and macroglossia. Additional less frequent complications include specific developmental defects and a predisposition to embryonal tumors. The disease is caused by variants affecting the gene represented in this entry. A chromosomal aberration involving NSD1 is found in childhood acute myeloid leukemia. Translocation t(5;11)(q35;p15.5) with NUP98. A chromosomal aberration involving NSD1 is found in an adult form of myelodysplastic syndrome (MDS). Insertion of NUP98 into NSD1 generates a NUP98-NSD1 fusion product.

Similarity. Belongs to the class V-like SAM-binding methyltransferase superfamily.

Isoforms (3)

UniProt IDNamesCanonical?
Q96L73-11, ARA267-betayes
Q96L73-22, ARA267-alpha
Q96L73-33

RefSeq proteins (12): NP_001352613, NP_001396230, NP_001396231, NP_001396232, NP_001396233, NP_001396234, NP_001396235, NP_001396236, NP_001396237, NP_001396238, NP_071900, NP_758859 (=MANE)

Domains & families (InterPro)

IDNameType
IPR000313PWWP_domDomain
IPR001214SET_domDomain
IPR001965Znf_PHDDomain
IPR003616Post-SET_domDomain
IPR006560AWS_domDomain
IPR011011Znf_FYVE_PHDHomologous_superfamily
IPR013083Znf_RING/FYVE/PHDHomologous_superfamily
IPR019786Zinc_finger_PHD-type_CSConserved_site
IPR019787Znf_PHD-fingerDomain
IPR041306C5HCHDomain
IPR046341SET_dom_sfHomologous_superfamily
IPR047423PWWP_NSD1_rpt2Domain
IPR047426PHD1_NSD1_2Domain
IPR047428PHD2_NSD1Domain
IPR047429PHD3_NSD1Domain
IPR047430PHD4_NSD1Domain
IPR047432PHD5_NSD1Domain
IPR047433SET_NSD1Domain
IPR050777SET2_Histone-Lys_MeTrsfaseFamily
IPR055197PHDvar_NSDDomain
IPR055198NSD_PHDDomain
IPR059153NSD_PHD-1stDomain

Pfam: PF00628, PF00855, PF00856, PF17907, PF17982, PF22908, PF23004, PF23011

Enzyme classification (BRENDA):

  • EC 2.1.1.357 — [histone H3]-lysine36 N-dimethyltransferase (BRENDA: 4 organisms, 14 substrates, 5 inhibitors, 4 Km, 2 kcat entries)
  • EC 2.1.1.362 — [histone H4]-N-methyl-L-lysine20 N-methyltransferase (BRENDA: 4 organisms, 9 substrates, 0 inhibitors, 0 Km, 0 kcat entries)

Substrate kinetics (BRENDA)

3 substrates with measured Km, best-characterized 3. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
S-ADENOSYL-L-METHIONINE0.0028–0.00382
CHICKEN NUCLEOSOME0.00031
RECOMBINANT NUCLEOSOME1

Catalyzed reactions (Rhea), 1 shown:

  • L-lysyl(36)-[histone H3] + 2 S-adenosyl-L-methionine = N(6),N(6)-dimethyl-L-lysyl(36)-[histone H3] + 2 S-adenosyl-L-homocysteine + 2 H(+) (RHEA:60308)

UniProt features (136 total): sequence conflict 26, sequence variant 21, compositionally biased region 19, region of interest 18, strand 11, helix 9, modified residue 8, domain 5, binding site 5, zinc finger region 4, cross-link 3, splice variant 3, mutagenesis site 2, chain 1, turn 1

Structure

Experimental structures (PDB)

4 structures.

PDBMethodResolution (Å)
3OOIX-RAY DIFFRACTION1.75
6KQQX-RAY DIFFRACTION1.8
6KQPX-RAY DIFFRACTION2.4
8C5LX-RAY DIFFRACTION2.6

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q96L73-F145.260.13

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (5): 1952–1954; 1994–1997; 2020–2021; 2065; 2071

Post-translational modifications (11): 117, 483, 486, 766, 1510, 2369, 2462, 2471, 906, 1339, 2616

Mutagenesis-validated functional residues (2):

PositionPhenotype
1914reduced enzyme activity.
1952nearly abolished enzyme activity.

Function

Pathways and Gene Ontology

Reactome pathways

1 pathways

IDPathway
R-HSA-3214841PKMTs methylate histone lysines

MSigDB gene sets: 652 (showing top): GSE45365_CTRL_VS_MCMV_INFECTION_NK_CELL_DN, MYAATNNNNNNNGGC_UNKNOWN, GOBP_REGULATION_OF_PHOSPHORYLATION, NKX25_02, AAGTCCA_MIR422B_MIR422A, GOBP_PEPTIDYL_SERINE_MODIFICATION, GGCNKCCATNK_UNKNOWN, SHEPARD_BMYB_MORPHOLINO_DN, NFKB_Q6, NKX61_01, GOBP_REGULATION_OF_PEPTIDYL_SERINE_PHOSPHORYLATION, TGCTGAY_UNKNOWN, GRE_C, OCT1_06, KEGG_LYSINE_DEGRADATION

GO Biological Process (8): negative regulation of transcription by RNA polymerase II (GO:0000122), regulation of DNA-templated transcription (GO:0006355), methylation (GO:0032259), regulation of peptidyl-serine phosphorylation (GO:0033135), positive regulation of DNA-templated transcription (GO:0045893), regulation of RNA polymerase II regulatory region sequence-specific DNA binding (GO:1903025), chromatin organization (GO:0006325), chromatin remodeling (GO:0006338)

GO Molecular Function (20): RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), chromatin binding (GO:0003682), transcription coregulator activity (GO:0003712), transcription corepressor activity (GO:0003714), zinc ion binding (GO:0008270), nuclear estrogen receptor binding (GO:0030331), histone H4K20 methyltransferase activity (GO:0042799), nuclear retinoic acid receptor binding (GO:0042974), nuclear retinoid X receptor binding (GO:0046965), nuclear thyroid hormone receptor binding (GO:0046966), histone H3K36 methyltransferase activity (GO:0046975), nuclear androgen receptor binding (GO:0050681), histone H3 methyltransferase activity (GO:0140938), histone H3K36 dimethyltransferase activity (GO:0140954), protein binding (GO:0005515), methyltransferase activity (GO:0008168), transferase activity (GO:0016740), nuclear receptor binding (GO:0016922), histone methyltransferase activity (GO:0042054), metal ion binding (GO:0046872)

GO Cellular Component (4): chromatin (GO:0000785), nucleus (GO:0005634), nucleoplasm (GO:0005654), chromosome (GO:0005694)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
Chromatin modifying enzymes1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
nuclear receptor binding4
negative regulation of DNA-templated transcription2
DNA-templated transcription2
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
binding2
protein-lysine N-methyltransferase activity2
cellular anatomical structure2
regulation of transcription by RNA polymerase II1
transcription by RNA polymerase II1
regulation of gene expression1
regulation of RNA biosynthetic process1
metabolic process1
regulation of protein phosphorylation1
peptidyl-serine phosphorylation1
regulation of DNA-templated transcription1
positive regulation of RNA biosynthetic process1
regulation of transcription regulatory region DNA binding1
cellular component organization1
chromatin organization1
cis-regulatory region sequence-specific DNA binding1
transcription regulator activity1
transcription coregulator activity1
transition metal ion binding1
histone H4 methyltransferase activity1
nuclear retinoic acid receptor binding1
histone H3 methyltransferase activity1
histone methyltransferase activity1
histone H3K36 methyltransferase activity1
transferase activity, transferring one-carbon groups1
catalytic activity1
RNA polymerase II-specific DNA-binding transcription factor binding1
protein methyltransferase activity1
histone modifying activity1
cation binding1
chromosome1
intracellular membrane-bounded organelle1
nuclear lumen1
intracellular membraneless organelle1

Protein interactions and networks

STRING

2774 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
NSD1NUP98P52948931
NSD1DDX10Q13206731
NSD1PHF23Q9BUL5725
NSD1PSIP1O75475717
NSD1ZNF496Q96IT1671
NSD1SETBP1Q9Y6X0663
NSD1MLLT10P55197656
NSD1MLLT3P42568624
NSD1RAP1GDS1P52306620
NSD1HOXA9P31269614
NSD1DOT1LQ8TEK3612
NSD1SMYD2Q9NRG4604
NSD1SETD3Q86TU7602
NSD1KMT5BQ4FZB7597
NSD1H4C7Q99525592

IntAct

47 interactions, top by confidence:

ABTypeScore
NSD1H2BC21psi-mi:“MI:0915”(physical association)0.560
NSD1HMGA1psi-mi:“MI:0915”(physical association)0.560
G3BP1NSD1psi-mi:“MI:0407”(direct interaction)0.560
MECP2GTPBP10psi-mi:“MI:0914”(association)0.530
MAGEB2POLRMTpsi-mi:“MI:0914”(association)0.530
ZBTB48ZBTB24psi-mi:“MI:0914”(association)0.530
NSD1G3BP2psi-mi:“MI:0407”(direct interaction)0.440
RELANSD1psi-mi:“MI:0915”(physical association)0.400
NSD1H2BC9psi-mi:“MI:0915”(physical association)0.400
NSD1H1-2psi-mi:“MI:0915”(physical association)0.400
NSD1PABPC1psi-mi:“MI:0915”(physical association)0.400
NSD1YTHDC1psi-mi:“MI:0915”(physical association)0.400
NSD1H1-4psi-mi:“MI:0915”(physical association)0.400
ESR1ESYT2psi-mi:“MI:0914”(association)0.350
H2AZ1SUPT5Hpsi-mi:“MI:0914”(association)0.350
H2BC21SMCHD1psi-mi:“MI:0914”(association)0.350
NPM1RPS3Apsi-mi:“MI:0914”(association)0.350
ATG16L1psi-mi:“MI:0914”(association)0.350
ZNF467ZNF320psi-mi:“MI:0914”(association)0.350
CEACAM16SBNO1psi-mi:“MI:0914”(association)0.350
RPL36GTPBP10psi-mi:“MI:0914”(association)0.350
ILF3GTPBP10psi-mi:“MI:0914”(association)0.350
ANOS1ZNF724psi-mi:“MI:0914”(association)0.350
RPL4POLRMTpsi-mi:“MI:0914”(association)0.350
RPL23AMPHOSPH10psi-mi:“MI:0914”(association)0.350
RSL1D1MPHOSPH10psi-mi:“MI:0914”(association)0.350
LARP7U2SURPpsi-mi:“MI:0914”(association)0.350

BioGRID (158): HIST3H3 (Biochemical Activity), HIST1H1B (Biochemical Activity), ATRX (Biochemical Activity), HIST4H4 (Biochemical Activity), SNORD3A (Biochemical Activity), HIST1H1D (Biochemical Activity), HIST1H1C (Biochemical Activity), NSD1 (Affinity Capture-MS), NSD1 (Affinity Capture-MS), HIST1H3A (Biochemical Activity), HIST1H4A (Biochemical Activity), NSD1 (Affinity Capture-Western), NSD1 (Reconstituted Complex), NSD1 (Affinity Capture-MS), NSD1 (Affinity Capture-MS)

ESM2 similar proteins: A0A087WXM9, A0A2K1JJ00, A0JM83, A4IGL8, E1BC15, E9Q5F9, O14513, O35923, O60673, O88491, P46013, P97929, Q14B71, Q28DZ0, Q29RT4, Q3MHH3, Q3TNU4, Q3ZBP0, Q4QY64, Q4V7J0, Q5DTT3, Q5E9A0, Q5F2C3, Q5RD08, Q5VWN6, Q5VYV7, Q61493, Q69YH5, Q6NS59, Q703I1, Q80U59, Q86XD8, Q8IXS0, Q8IYL3, Q8L7I1, Q8N7Z5, Q8NFU7, Q8TEP8, Q92628, Q96BU1

Diamond homologs: A0A1L7TZE5, A4IGY9, A7E2Z2, A8XI75, J9VWH9, O43463, O54864, O60016, O65312, O82175, O88491, O88974, O96028, P20659, P42124, P45975, P46995, P55200, P70351, P93831, Q03164, Q06ZW3, Q08BR4, Q08BS4, Q0VD24, Q15047, Q15910, Q24742, Q28CQ7, Q28D84, Q294B9, Q2NL30, Q32PH7, Q4PB36, Q4R381, Q4R3E0, Q4V863, Q53H47, Q5DW34, Q5F3W5

SIGNOR signaling

4 interactions.

AEffectBMechanism
NSD1up-regulatesRELAmethylation
NSD1up-regulatesNfKb-p65/p50methylation
NSD1“up-regulates quantity by expression”PRB4“transcriptional regulation”

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 56 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Nonsense Mediated Decay (NMD) independent of the Exon Junction Complex (EJC)721.7×8e-06
Peptide chain elongation620.0×2e-05
Viral mRNA Translation620.0×2e-05
PELO:HBS1L and ABCE1 dissociate a ribosome on a non-stop mRNA619.8×2e-05
Selenocysteine synthesis619.0×2e-05
Eukaryotic Translation Termination619.0×2e-05
L13a-mediated translational silencing of Ceruloplasmin expression718.6×1e-05
ZNF598 and the Ribosome-associated Quality Trigger (RQT) complex dissociate a ribosome stalled on a no-go mRNA618.6×2e-05

GO biological processes:

GO termPartnersFoldFDR
cytoplasmic translation621.0×1e-04
nucleosome assembly513.2×3e-03
translation59.7×9e-03

Disease & clinical

Cancer significance

From intOGen — cancer-driver classification: loss-of-function (tumor-suppressor-like) across 11 cancer types — BLCA, CEAD, ESCA, HNSC, LUSC, MEL, MLYM, NPC, PAST, STOMACH, UCEC.

Clinical variants and AI predictions

ClinVar

2636 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic463
Likely pathogenic178
Uncertain significance935
Likely benign565
Benign122

Top pathogenic / likely-pathogenic (30)

Variant IDHGVSClassification
1028263NM_022455.5(NSD1):c.4378+1G>TPathogenic
1038714NM_022455.5(NSD1):c.6475T>A (p.Cys2159Ser)Pathogenic
1065474NM_022455.5(NSD1):c.3364dup (p.Ile1122fs)Pathogenic
1068588NM_022455.5(NSD1):c.4655T>A (p.Leu1552Ter)Pathogenic
1069132NM_022455.5(NSD1):c.5753_5754del (p.Leu1917_Tyr1918insTer)Pathogenic
1069920NM_022455.5(NSD1):c.6294del (p.Lys2098fs)Pathogenic
1071923NM_022455.5(NSD1):c.4514dup (p.His1505fs)Pathogenic
1073231NM_022455.5(NSD1):c.4672dup (p.Cys1558fs)Pathogenic
1073800NM_022455.5(NSD1):c.2674del (p.Ser892fs)Pathogenic
1074877NM_022455.5(NSD1):c.5190T>G (p.His1730Gln)Pathogenic
1074998NM_022455.5(NSD1):c.3145_3146dup (p.Thr1050fs)Pathogenic
1076373NM_022455.5(NSD1):c.3254del (p.Pro1085fs)Pathogenic
1076793NM_022455.5(NSD1):c.1828C>T (p.Gln610Ter)Pathogenic
1172670NM_022455.5(NSD1):c.3109C>T (p.Gln1037Ter)Pathogenic
1176497NM_022455.5(NSD1):c.858del (p.Ser287fs)Pathogenic
1191918NM_022455.5(NSD1):c.5855G>A (p.Arg1952Gln)Pathogenic
1202491NM_022455.5(NSD1):c.1318C>T (p.Arg440Ter)Pathogenic
1210027NM_022455.5(NSD1):c.5741G>T (p.Arg1914Leu)Pathogenic
1214278NM_022455.5(NSD1):c.1216_1219del (p.Glu406fs)Pathogenic
1283908NM_022455.5(NSD1):c.4220_4221del (p.Lys1407fs)Pathogenic
1299651NM_022455.5(NSD1):c.5824_5828del (p.Pro1942fs)Pathogenic
1320149NM_022455.5(NSD1):c.1727dup (p.Asn576fs)Pathogenic
1321321NM_022455.5(NSD1):c.4361C>G (p.Ser1454Ter)Pathogenic
1323377NM_022455.5(NSD1):c.6258+1G>TPathogenic
1331627NM_022455.5(NSD1):c.911C>A (p.Ser304Ter)Pathogenic
1338369NM_022455.5(NSD1):c.6488del (p.Gln2163fs)Pathogenic
1338653NM_022455.5(NSD1):c.2808C>A (p.Tyr936Ter)Pathogenic
1339482NM_022455.5(NSD1):c.3017del (p.Pro1006fs)Pathogenic
1365499NM_022455.5(NSD1):c.3701T>G (p.Leu1234Ter)Pathogenic
1366561NM_022455.5(NSD1):c.2163_2181del (p.Gly722fs)Pathogenic

SpliceAI

4415 predictions. Top by Δscore:

VariantEffectΔscore
5:177134527:G:GTdonor_gain1.0000
5:177136019:G:GTdonor_gain1.0000
5:177136020:A:Tdonor_gain1.0000
5:177136028:TTT:Tdonor_gain1.0000
5:177136031:G:GGdonor_gain1.0000
5:177169365:A:AGacceptor_gain1.0000
5:177169366:A:Gacceptor_gain1.0000
5:177169369:T:TAacceptor_gain1.0000
5:177191856:T:Aacceptor_gain1.0000
5:177191875:T:TAacceptor_gain1.0000
5:177191880:A:AGacceptor_gain1.0000
5:177191880:AAAGT:Aacceptor_gain1.0000
5:177191881:A:Gacceptor_gain1.0000
5:177191882:A:AGacceptor_gain1.0000
5:177191883:G:GGacceptor_gain1.0000
5:177191883:GT:Gacceptor_gain1.0000
5:177192016:AAAGG:Adonor_loss1.0000
5:177192017:AAGGT:Adonor_loss1.0000
5:177192020:G:GAdonor_loss1.0000
5:177192021:T:Adonor_loss1.0000
5:177204117:TA:Tacceptor_loss1.0000
5:177204118:A:AGacceptor_gain1.0000
5:177204118:AG:Aacceptor_loss1.0000
5:177204119:G:GAacceptor_gain1.0000
5:177204119:GT:Gacceptor_gain1.0000
5:177204119:GTTT:Gacceptor_gain1.0000
5:177204279:GATA:Gdonor_gain1.0000
5:177204289:TAAG:Tdonor_loss1.0000
5:177204290:AAG:Adonor_loss1.0000
5:177204292:GGTAG:Gdonor_loss1.0000

AlphaMissense

17693 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
5:177191938:T:AW328R1.000
5:177191938:T:CW328R1.000
5:177191939:G:CW328S1.000
5:177191940:G:CW328C1.000
5:177191940:G:TW328C1.000
5:177191962:T:AW336R1.000
5:177191962:T:CW336R1.000
5:177191964:G:CW336C1.000
5:177191964:G:TW336C1.000
5:177191965:T:AW337R1.000
5:177191965:T:CW337R1.000
5:177191969:C:AP338H1.000
5:177204185:T:AW377R1.000
5:177204185:T:CW377R1.000
5:177204240:T:CL395P1.000
5:177209660:T:AW421R1.000
5:177209660:T:CW421R1.000
5:177209661:G:CW421S1.000
5:177209662:G:CW421C1.000
5:177209662:G:TW421C1.000
5:177248319:T:AC1546S1.000
5:177248319:T:CC1546R1.000
5:177248320:G:AC1546Y1.000
5:177248320:G:CC1546S1.000
5:177248321:T:GC1546W1.000
5:177251752:T:CL1555P1.000
5:177251755:T:CL1556P1.000
5:177251760:T:AC1558S1.000
5:177251760:T:CC1558R1.000
5:177251761:G:AC1558Y1.000

dbSNP variants (sampled 300 via entrez): RS1000005820 (5:177268819 A>G), RS1000009801 (5:177289440 T>A), RS1000036439 (5:177139978 G>A), RS1000067308 (5:177150427 G>C), RS1000072133 (5:177146699 C>G), RS1000077764 (5:177241372 G>C,T), RS1000088425 (5:177140334 T>C), RS1000114112 (5:177159681 C>G,T), RS1000116471 (5:177162287 G>A), RS1000170661 (5:177162570 G>A), RS1000198684 (5:177281931 C>T), RS1000207383 (5:177241159 C>T), RS1000239371 (5:177227469 T>A,G), RS1000242235 (5:177244447 C>T), RS1000253837 (5:177144640 G>A,C)

Disease associations

OMIM: gene MIM:606681 | disease phenotypes: MIM:117550, MIM:236750, MIM:157170, MIM:130650, MIM:601626, MIM:189800, MIM:603563

GenCC curated gene-disease

DiseaseClassificationInheritance
Sotos syndrome 1DefinitiveAutosomal dominant
Sotos syndromeDefinitiveAutosomal dominant
Beckwith-Wiedemann syndrome due to NSD1 mutationDefinitiveAutosomal dominant

ClinGen Gene-Disease Validity (1)

Expert-panel classifications — Definitive > Strong > Moderate > Limited > Disputed > Refuted.

DiseaseClassificationInheritance
Sotos syndromeDefinitiveAD

Mondo (14): Sotos syndrome (MONDO:0019349), neurodevelopmental disorder (MONDO:0700092), non-immune hydrops fetalis (MONDO:0009369), holoprosencephaly 2 (MONDO:0007999), intellectual disability (MONDO:0001071), Beckwith-Wiedemann syndrome (MONDO:0007534), acute myeloid leukemia (MONDO:0018874), preeclampsia (MONDO:0005081), scoliosis (MONDO:0005392), choroid plexus carcinoma (MONDO:0016718), hereditary spastic paraplegia 8 (MONDO:0011339), microcephaly (MONDO:0001149), (MONDO:0007299), Beckwith-Wiedemann syndrome due to NSD1 mutation (MONDO:0016547)

Orphanet (9): Sotos syndrome (Orphanet:821), Non-immune hydrops fetalis (Orphanet:363999), Holoprosencephaly (Orphanet:2162), Beckwith-Wiedemann syndrome (Orphanet:116), Acute myeloid leukemia (Orphanet:519), Preeclampsia (Orphanet:275555), Choroid plexus carcinoma (Orphanet:251899), Autosomal dominant spastic paraplegia type 8 (Orphanet:100989), NON RARE IN EUROPE: Unexplained intellectual disability (Orphanet:319658)

HPO phenotypes

194 total (30 of 194 shown, HPO-id order):

HPOTerm
HP:0000006Autosomal dominant inheritance
HP:0000023Inguinal hernia
HP:0000028Cryptorchidism
HP:0000034Hydrocele testis
HP:0000047Hypospadias
HP:0000073Ureteral duplication
HP:0000074Ureteropelvic junction obstruction
HP:0000076Vesicoureteral reflux
HP:0000077Abnormality of the kidney
HP:0000083Renal insufficiency
HP:0000098Tall stature
HP:0000104Renal agenesis
HP:0000126Hydronephrosis
HP:0000144Decreased fertility
HP:0000164Abnormality of the dentition
HP:0000189Narrow palate
HP:0000218High palate
HP:0000239Large fontanelles
HP:0000252Microcephaly
HP:0000256Macrocephaly
HP:0000268Dolichocephaly
HP:0000275Narrow face
HP:0000276Long face
HP:0000278Retrognathia
HP:0000280Coarse facial features
HP:0000286Epicanthus
HP:0000303Mandibular prognathia
HP:0000307Pointed chin
HP:0000311Round face
HP:0000316Hypertelorism

GWAS associations

43 associations (top):

StudyTraitp-value
GCST001574_7Activated partial thromboplastin time6.000000e-88
GCST002647_130Height5.000000e-11
GCST002830_5Urate levels in lean individuals4.000000e-06
GCST004562_100Waist circumference adjusted for body mass index5.000000e-11
GCST004562_126Waist circumference adjusted for body mass index1.000000e-12
GCST004562_167Waist circumference adjusted for body mass index6.000000e-13
GCST004562_223Waist circumference adjusted for body mass index5.000000e-11
GCST004562_23Waist circumference adjusted for body mass index3.000000e-10
GCST004563_103Waist circumference adjusted for BMI (joint analysis main effects and physical activity interaction)6.000000e-12
GCST004563_186Waist circumference adjusted for BMI (joint analysis main effects and physical activity interaction)1.000000e-10
GCST004563_240Waist circumference adjusted for BMI (joint analysis main effects and physical activity interaction)6.000000e-12
GCST004563_6Waist circumference adjusted for BMI (joint analysis main effects and physical activity interaction)1.000000e-09
GCST004563_87Waist circumference adjusted for BMI (joint analysis main effects and physical activity interaction)3.000000e-10
GCST004564_35Waist circumference adjusted for BMI in active individuals2.000000e-07
GCST004564_36Waist circumference adjusted for BMI in active individuals3.000000e-10
GCST004564_37Waist circumference adjusted for BMI in active individuals5.000000e-07
GCST004564_38Waist circumference adjusted for BMI in active individuals9.000000e-08
GCST004564_39Waist circumference adjusted for BMI in active individuals3.000000e-10
GCST005956_15Waist-to-hip ratio adjusted for BMI1.000000e-07
GCST005957_13Waist-to-hip ratio adjusted for BMI (age <50)3.000000e-07
GCST005962_42Waist-to-hip ratio adjusted for BMI x sex x age interaction (4df test)1.000000e-08
GCST006491_7Circulating fibroblast growth factor 23 levels1.000000e-07
GCST007293_138Body fat distribution (arm fat ratio)1.000000e-08
GCST007293_79Body fat distribution (arm fat ratio)2.000000e-11
GCST007293_97Body fat distribution (arm fat ratio)2.000000e-10
GCST007294_102Body fat distribution (trunk fat ratio)2.000000e-10
GCST007294_155Body fat distribution (trunk fat ratio)4.000000e-09
GCST007640_1Narrowest width of the femoral neck2.000000e-08
GCST008839_461Height7.000000e-10
GCST008971_108Urate levels3.000000e-08

EFO canonical traits (12, from GWAS)

EFO IDTrait name
EFO:0004531urate measurement
EFO:0007789BMI-adjusted waist circumference
EFO:0008002physical activity measurement
EFO:0007788BMI-adjusted waist-hip ratio
EFO:0008007age at assessment
EFO:0008343sex interaction measurement
EFO:0004341body fat distribution
EFO:0004511femoral neck bone geometry
EFO:0003959cleft lip
EFO:0008039BMI-adjusted hip circumference
EFO:0004587lymphocyte count
EFO:0004833neutrophil count

MeSH disease descriptors (11)

DescriptorNameTree numbers
D001506Beckwith-Wiedemann SyndromeC16.131.077.133; C16.131.260.080; C16.320.180.080; C16.320.447.375
D008607Intellectual DisabilityC10.597.606.360; C23.888.592.604.646; F01.700.687; F03.625.539
D015470Leukemia, Myeloid, AcuteC04.557.337.539.275; C15.378.508.539.275
D008831MicrocephalyC05.660.207.620; C10.500.507.400.500; C16.131.621.207.620; C16.131.666.507.400.500
D065886Neurodevelopmental DisordersF03.625
D011225Pre-EclampsiaC12.050.703.395.249
D012600ScoliosisC05.116.900.800.875
D058495Sotos SyndromeC16.131.077.889; C16.131.260.905; C16.320.180.905
C562943Choroid Plexus Carcinoma (supp.)
C563579Holoprosencephaly 2 (supp.)
C580458Spastic Paraplegia Type 8 (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (2): CHEMBL3588738 (SINGLE PROTEIN), CHEMBL6066040 (PROTEIN-PROTEIN INTERACTION)

Molecules with ChEMBL bioactivity

7 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 70,511 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL3137309VENETOCLAX49,389
CHEMBL506PRIMAQUINE410,279
CHEMBL58510CHLOROQUINE PHOSPHATE45,477
CHEMBL265502SURAMIN336,848
CHEMBL1214186SINEFUNGIN22,165
CHEMBL1215331PF-03882845159
CHEMBL295316PLUMBAGIN16,294

PharmGKB: 1 entry (VIP=true, CPIC=false)

GtoPdb / IUPHAR curated pharmacology

(IUPHAR/BPS Guide to Pharmacology — expert-curated)

Target class: enzyme — 2.1.1.43 Histone methyltransferases (HMTs)

Most potent curated ligand interactions (1 total), top 1:

LigandActionAffinityParameter
compound 15a [PMID: 36642961]Inhibition6.16pIC50

ChEMBL bioactivities

28 potent at pChembl≥5 of 40 total, top 25 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
6.96IC50110nMCHEMBL337173
6.82IC50150nMCHEMBL337173
6.80IC50160nMCHEMBL5408212
6.67IC50214nMCHEMBL5438312
6.65IC50225nMCHEMBL5419719
6.61IC50248nMCHEMBL5396709
6.57IC50270nMCHEMBL5432302
6.47IC50340nMCHEMBL2028187
6.16IC50690nMCHEMBL5422549
5.92Kd1200nMCHEMBL5438496
5.85Kd1400nMQUINACRINE DIHYDROCHLORIDE
5.85IC501400nMTANSHINONE IIA
5.82IC501500nMCHEMBL6170408
5.75IC501800nMPRIMAQUINE
5.70IC502000nMNAPHTHOQUINONE
5.62IC502400nMVENETOCLAX
5.58IC502600nMCHEMBL1572280
5.58IC502600nMCHEMBL337173
5.51IC503100nMTANSHINONE I
5.46IC503500nMCRYPTOTANSHINONE
5.33Kd4700nMCHLOROQUINE PHOSPHATE
5.31IC504900nMCHEMBL1361596
5.24IC505800nMCHEMBL5270736
5.24IC505700nMPLUMBAGIN
5.23IC505900nMCHEMBL6149804

PubChem BioAssay actives

18 with measured affinity, of 115 total; 12 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
6-chloro-7-(2-morpholin-4-ylethylamino)quinoline-5,8-dione2017799: Inhibition of N-terminal polyhistidine-tagged recombinant human NSD1 (1538 to 2696 residues) expressed in baculovirus infected insect cell using SAM as substrate preincubated for 20 mins followed by substrate addition measured after 1 hrs by hotspot assayic500.1100uM
9-[[5-[(3R)-3-amino-3-(methoxymethyl)piperidin-1-yl]-2-(3,4-difluorophenyl)-4-pyridinyl]methyl]purin-6-amine2011299: Inhibition of human recombinant NSD1 incubated for 2 hrs by AlphaLISA methodic500.1600uM
9-[[5-[(3R)-3-amino-3-(cyclopropyloxymethyl)piperidin-1-yl]-2-(3,4-difluorophenyl)-4-pyridinyl]methyl]purin-6-amine2011299: Inhibition of human recombinant NSD1 incubated for 2 hrs by AlphaLISA methodic500.2140uM
9-[[5-[(3R)-3-amino-3-(methoxymethyl)piperidin-1-yl]-2-(2,4-difluorophenyl)-4-pyridinyl]methyl]purin-6-amine2011299: Inhibition of human recombinant NSD1 incubated for 2 hrs by AlphaLISA methodic500.2250uM
9-[[5-[(3R)-3-amino-3-(methoxymethyl)piperidin-1-yl]-2-(5-chloro-2-fluorophenyl)-4-pyridinyl]methyl]purin-6-amine2011299: Inhibition of human recombinant NSD1 incubated for 2 hrs by AlphaLISA methodic500.2480uM
9-[[5-[(3R)-3-amino-3-(cyclobutyloxymethyl)piperidin-1-yl]-2-(3-fluoro-4-methoxyphenyl)-4-pyridinyl]methyl]purin-6-amine2011299: Inhibition of human recombinant NSD1 incubated for 2 hrs by AlphaLISA methodic500.2700uM
N-[3-[(7-chloro-5,8-dioxoquinolin-6-yl)amino]-1-bicyclo[1.1.1]pentanyl]but-2-ynamide1966592: Inhibition of NSD1 (unknown origin)ic500.6900uM
5,8-dihydroxy-1,4-bis[2-(2-hydroxyethylamino)ethylamino]-4a,9a-dihydroanthracene-9,10-dione;dihydrochloride2011305: Binding affinity to NSD1 (unknown origin) assessed as dissociation constant by ITC assaykd1.2000uM
2-amino-6-(2-methylaziridin-1-yl)-1,3-benzothiazol-4-ol1980351: Inhibition of NSD1 (unknown origin) measured after 16 hrs by MS assayic501.4000uM
4-N-(6-chloro-2-methoxyacridin-9-yl)-1-N,1-N-diethylpentane-1,4-diamine;dihydrochloride2011305: Binding affinity to NSD1 (unknown origin) assessed as dissociation constant by ITC assaykd1.4000uM
Venetoclax2017799: Inhibition of N-terminal polyhistidine-tagged recombinant human NSD1 (1538 to 2696 residues) expressed in baculovirus infected insect cell using SAM as substrate preincubated for 20 mins followed by substrate addition measured after 1 hrs by hotspot assayic502.4000uM
Chloroquine Phosphate2011305: Binding affinity to NSD1 (unknown origin) assessed as dissociation constant by ITC assaykd4.7000uM

CTD chemical–gene interactions

30 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects expression, decreases expression, increases expression, increases methylation5
bisphenol Aaffects binding, affects folding, decreases reaction, decreases expression3
bisphenol AFdecreases reaction, affects binding, affects folding2
Estradiolaffects binding, increases reaction2
FR900359affects phosphorylation1
TAK-243increases sumoylation1
triphenyl phosphateaffects expression1
alpha-pineneaffects cotreatment, decreases expression, increases abundance1
pirinixic acidaffects binding, decreases expression, increases activity1
trichostatin Aaffects expression1
arseniteaffects binding, decreases reaction1
afimoxifeneaffects response to substance1
sodium arsenitedecreases expression1
perfluorooctanoic aciddecreases expression1
coumarinincreases phosphorylation1
methacrylaldehydeaffects cotreatment, decreases expression, increases abundance1
CGP 52608increases reaction, affects binding1
abrinedecreases expression1
Acroleinincreases abundance, affects cotreatment, decreases expression1
Air Pollutantsaffects cotreatment, decreases expression, increases abundance1
Benzo(a)pyreneaffects methylation1
Cisplatindecreases expression1
Diethylstilbestroldecreases expression1
Formaldehydeincreases expression1
Leadaffects expression1
Ozonedecreases expression, increases abundance, affects cotreatment1
Tobacco Smoke Pollutiondecreases expression1
Asbestos, Crocidoliteaffects expression1
Cadmium Chloridedecreases expression1
Volatile Organic Compoundsaffects cotreatment, decreases expression1

ChEMBL screening assays

90 unique, capped per target: 90 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL3592411BindingInhibition of NSD1 (unknown origin) at 1 uM after 1 hr by filter-based assayDiscovery of A-893, A New Cell-Active Benzoxazinone Inhibitor of Lysine Methyltransferase SMYD2. — ACS Med Chem Lett

Cellosaurus cell lines

11 cell lines: 5 cancer cell line, 3 transformed cell line, 3 induced pluripotent stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_B3CTAbcam HEK293T NSD1 KOTransformed cell lineFemale
CVCL_C7FTIGGi001-AInduced pluripotent stem cellFemale
CVCL_C7FUIGGi001-BInduced pluripotent stem cellFemale
CVCL_D6MPIGGi003-AInduced pluripotent stem cellFemale
CVCL_E3GBHCI-EC-23Cancer cell lineFemale
CVCL_F102GM19977Transformed cell lineMale
CVCL_F103GM19978Transformed cell lineMale
CVCL_TB11HAP1 NSD1 (-) 1Cancer cell lineMale
CVCL_TB12HAP1 NSD1 (-) 2Cancer cell lineMale
CVCL_TB13HAP1 NSD1 (-) 3Cancer cell lineMale

Clinical trials (associated diseases)

298 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT04586348PHASE4UNKNOWNPrenatal Iodine Supplementation and Early Childhood Neurodevelopment
NCT04873115PHASE4UNKNOWNDouble-blind, Placebo-controlled, Randomized Clinical Trial Comparing the Efficacy and Safety of Sialanar Plus orAl rehabiLitation Against Placebo Plus Oral Rehabilitation for chIldren and Adolescents With seVere Sialorrhoea and Neurodisabilties,
NCT05657860PHASE4COMPLETEDGuanfacine Extended Release for the Reduction of Aggression and Self-injurious Behavior Associated With Prader-Willi Syndrome
NCT05744479PHASE4RECRUITINGMetformin for Antipsychotic-induced Weight Gain in Adults With Intellectual Disability
NCT06107829PHASE4WITHDRAWNValbenazine Treatment of Tardive Dyskinesia in Adults With Intellectual/Developmental Disabilities
NCT06997198PHASE4NOT_YET_RECRUITINGDeutetrabenazine Treatment for Tardive Dyskinesia in Intellectual/Developmental Disabilities
NCT02559102PHASE3COMPLETEDDexmedetomidine Sedation Versus General Anaesthesia for Inguinal Hernia Surgery in Infants
NCT02757079PHASE3COMPLETEDStudy of the Efficacy and Safety of NPC-15 for Sleep Disorders of Children With Neurodevelopmental Disorders
NCT06915480PHASE3RECRUITINGReducing Missed Appointments
NCT07377032PHASE3RECRUITINGTAP-GRIN: Interventional Study on Patients With GRIN-related Neurodevelopmental Disorders
NCT02270736PHASE3COMPLETEDClinical Study to Investigate the Efficacy and Safety of NT 201 Compared to Placebo in the Treatment of Chronic Troublesome Drooling Associated With Neurological Disorders and/or Intellectual Disability
NCT02909959PHASE2COMPLETEDSulforaphane for the Treatment of Young Men With Autism Spectrum Disorder
NCT06081348PHASE2RECRUITINGSertraline vs. Placebo in the Treatment of Anxiety in Children and AdoLescents With NeurodevelopMental Disorders
NCT06352372PHASE2COMPLETEDSafety and Efficacy of tPBM for Epileptiform Activity in Autism
NCT02304302PHASE2COMPLETEDDown Syndrome Memantine Follow-up Study
NCT03862950PHASE2COMPLETEDA Trial of Metformin in Individuals With Fragile X Syndrome (Met)
NCT04529226PHASE2UNKNOWNStudy to Compare Clozapine vs Treatment as Usual in People With Intellectual Disability & Treatment-resistant Psychosis
NCT04821856PHASE2COMPLETEDEvaluation of the Effectiveness of Cannabidiol in Treating Severe Behavioural Problems in Children and Adolescents With Intellectual Disability
NCT00503191PHASE1COMPLETEDNeuroModulation Technique Treatment of Autism
NCT04475848PHASE1COMPLETEDA Study to Investigate the Safety, Tolerability, Pharmacokinetics, Pharmacodynamics and Food Effect of RO6953958 in Healthy Participants
NCT06300398PHASE1COMPLETEDIAMA-6 Oral Dose Study in Healthy Adults
NCT05273320PHASE1COMPLETEDClinical Trial of Nabilone for Aggression in Adults With Intellectual and Developmental Disabilities
NCT05301361PHASE1ENROLLING_BY_INVITATIONSensitivity of the NIH Toolbox to Stimulant Treatment in Intellectual Disabilities
NCT06016764PHASE1COMPLETEDUse of MRI and cTBS for Catatonia in Autism
NCT06586827PHASE1COMPLETEDImpact of Competency-Based Training and Technical Assistance Employment Outcomes of Individuals With ID/DD
NCT07531940PHASE1NOT_YET_RECRUITINGEscalating Doses of Memantine in Down Syndrome (MEDS-123)
NCT04993235Not specifiedUNKNOWNBody Perception and Representation in Overgrowth Syndromes, Behavioral Assessment and Neuropsychological Development
NCT01783041PHASE2/PHASE3COMPLETEDEffect of Early L-Carnitine Supplementation on Neurodevelopmental Outcomes in Very Preterm Infants
NCT05767385PHASE2/PHASE3RECRUITINGFetal Cerebrovascular Autoregulation in Congenital Heart Disease and Association With Neonatal Neurobehavior
NCT05675098EARLY_PHASE1NOT_YET_RECRUITINGCentral Nervous System Stimulants and Physical Function in Children With Cerebral Palsy
NCT00783783Not specifiedCOMPLETEDCYP2D6 Pharmacogenetics in Risperidone-Treated Children
NCT01778504Not specifiedRECRUITINGStudying Childhood-onset Behavioral, Psychiatric, and Developmental Disorders
NCT01850784Not specifiedUNKNOWNHigh Energy Formula Feeding in Infants With Congenital Heart Disease
NCT01922791Not specifiedCOMPLETEDNutrition and Pregnancy Intervention Study
NCT01942525Not specifiedUNKNOWNInfluence of Intrauterine Growth Restriction on Amplitude-integrated EEG in Preterm Infants
NCT02003170Not specifiedCOMPLETEDEtiology and Early Diagnosis of Neurodevelopmental Disorders
NCT02118649Not specifiedACTIVE_NOT_RECRUITINGEnhancing Behavior and Brain Response to Visual Targets Using a Computer Game
NCT02557191Not specifiedTERMINATEDBiomarkers, Neurodevelopment and Preterm Infants
NCT02690675Not specifiedCOMPLETEDIron Supplement Effect on Child Development
NCT02694003Not specifiedCOMPLETEDBetter Nights, Better Days for Children With Neurodevelopment Disorders