SALL4

gene
On this page

Also known as dJ1112F19.1ZNF797

Summary

SALL4 (spalt like transcription factor 4, HGNC:15924) is a protein-coding gene on chromosome 20q13.2, encoding Sal-like protein 4 (Q9UJQ4). Transcription factor with a key role in the maintenance and self-renewal of embryonic and hematopoietic stem cells. It is haploinsufficient (ClinGen: sufficient evidence).

This gene encodes a zinc finger transcription factor thought to play a role in the development of abducens motor neurons. Defects in this gene are a cause of Duane-radial ray syndrome (DRRS). Alternative splicing results in multiple transcript variants encoding different isoforms.

Source: NCBI Gene 57167 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): Duane-radial ray syndrome (Definitive, GenCC) — +2 more curated relationships
  • GWAS associations: 3
  • Clinical variants (ClinVar): 603 total — 42 pathogenic, 14 likely-pathogenic
  • Phenotypes (HPO): 151
  • Druggable target: yes
  • Cancer driver (intOGen): activating (oncogene-like) across 1 cancer types
  • Dosage sensitivity (ClinGen): haploinsufficiency sufficient evidence, triplosensitivity no evidence
  • Transcription factor: yes — 22 downstream targets (CollecTRI)
  • MANE Select transcript: NM_020436

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:15924
Approved symbolSALL4
Namespalt like transcription factor 4
Location20q13.2
Locus typegene with protein product
StatusApproved
AliasesdJ1112F19.1, ZNF797
Ensembl geneENSG00000101115
Ensembl biotypeprotein_coding
OMIM607343
Entrez57167

Gene structure

Transcript identifiers

Ensembl transcripts: 5 — 3 protein_coding, 2 protein_coding_CDS_not_defined

ENST00000217086, ENST00000371539, ENST00000395997, ENST00000481363, ENST00000483130

RefSeq mRNA: 2 — MANE Select: NM_020436 NM_001318031, NM_020436

CCDS: CCDS13438, CCDS82629

Canonical transcript exons

ENST00000217086 — 4 exons

ExonStartEnd
ENSE000006627515178886151789141
ENSE000006627525179002251792352
ENSE000019209225178233151784684
ENSE000026912955180227951802521

Expression profiles

Bgee: expression breadth ubiquitous, 150 present calls, max score 93.81.

FANTOM5 (CAGE): breadth broad, TPM avg 6.1743 / max 504.2526, expressed in 303 samples.

FANTOM5 promoters (8 alternative TSS)

Promoter IDTPM avgSamples expressed
1879702.5010203
1879711.5848175
1879671.3501200
1879690.2728118
1879680.165391
1879660.122448
1879720.104468
1879730.073546

Top tissues by expression

248 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
secondary oocyteCL:000065593.81gold quality
oocyteCL:000002389.85gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047387.38gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099184.73gold quality
right lobe of thyroid glandUBERON:000111977.96gold quality
left lobe of thyroid glandUBERON:000112077.76gold quality
thyroid glandUBERON:000204677.73gold quality
germinal epithelium of ovaryUBERON:000130475.34gold quality
trabecular bone tissueUBERON:000248375.17silver quality
tibiaUBERON:000097975.11gold quality
nasal cavity epitheliumUBERON:000538474.58gold quality
endothelial cellCL:000011574.31gold quality
mucosa of paranasal sinusUBERON:000503073.87gold quality
epithelial cell of pancreasCL:000008372.82gold quality
islet of LangerhansUBERON:000000670.02gold quality
buccal mucosa cellCL:000233669.88silver quality
testisUBERON:000047369.17gold quality
body of pancreasUBERON:000115068.82gold quality
pancreasUBERON:000126468.46gold quality
right testisUBERON:000453468.38gold quality
left testisUBERON:000453367.80gold quality
right lobe of liverUBERON:000111465.97gold quality
bronchial epithelial cellCL:000232865.22silver quality
amniotic fluidUBERON:000017364.36gold quality
bronchusUBERON:000218564.09silver quality
prostate glandUBERON:000236762.85gold quality
ileal mucosaUBERON:000033162.82silver quality
cauda epididymisUBERON:000436062.74gold quality
adult organismUBERON:000702362.73silver quality
superficial temporal arteryUBERON:000161462.71gold quality

Single-cell (SCXA)

Detected in 6 experiment(s), a significant marker in 3.

ExperimentMarker?Max mean expression
E-GEOD-75140yes319.93
E-GEOD-93593yes238.25
E-ANND-3yes4.86
E-GEOD-124858no52.11
E-GEOD-81608no37.24
E-MTAB-6142no18.03

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

22 targets.

TargetRegulation
ABCA3Activation
ABCG2Activation
BMI1Activation
CDH1Repression
CECR2Activation
EPCAM
HOXA9Activation
KIT
KRT19
LTARepression
NANOGActivation
POU5F1Unknown
PTENRepression
RUNX1Activation
SALL1Repression
SALL3Repression
SALL4Unknown
SOX17Activation
SOX2Unknown
TNFActivation
TSC1
ZEB1Activation

Upstream regulators (CollecTRI, top): LEF1, POU5F1, SALL3, SALL4, STAT3, TBX5, TCF7L2

miRNA regulators (miRDB)

123 targeting SALL4, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-4262100.0073.263931
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-9-5P100.0072.282361
HSA-MIR-518D-5P100.0067.51979
HSA-MIR-518E-5P100.0067.66954
HSA-MIR-518F-5P100.0067.51979
HSA-MIR-519A-5P100.0067.66954
HSA-MIR-519B-5P100.0067.66954
HSA-MIR-519C-5P100.0067.66954
HSA-MIR-520C-5P100.0067.51979
HSA-MIR-522-5P100.0067.66954
HSA-MIR-523-5P100.0067.66954
HSA-MIR-526A-5P100.0067.51979
HSA-MIR-4283100.0066.422097
HSA-MIR-450099.9972.722367
HSA-MIR-181A-5P99.9972.962995
HSA-MIR-181B-5P99.9972.972996
HSA-MIR-181C-5P99.9972.952996
HSA-MIR-181D-5P99.9973.042997
HSA-MIR-371B-5P99.9975.344759
HSA-LET-7A-5P99.9872.291790
HSA-LET-7B-5P99.9872.311790
HSA-LET-7C-5P99.9872.291790
HSA-LET-7E-5P99.9872.291790
HSA-LET-7F-5P99.9872.561784

Functional genomics

ClinGen dosage: haploinsufficiency 3 (sufficient evidence), triplosensitivity 0 (no evidence). ClinGen Gene Dosage Map

Literature-anchored findings (GeneRIF, showing 40)

  • Okihiro syndrome is caused by SALL4 mutations (PMID:12393809)
  • The first identified Duane syndrome gene and the second malformation syndrome resulting from mutations in SAL genes. (PMID:12395297)
  • Mutations and birth defects caused by thalidomide exposure. (PMID:15329836)
  • Okihiro syndrome may also be caused by heterozygous deletions either of the whole SALL4 coding region or of single exons (PMID:15342710)
  • no SALL4 mutations in twenty-five Duane Tretration “syndrome” patients (PMID:15386473)
  • analysis of SALL4 defects and associated syndromes (including Okihiro syndrome (Duane-radial ray syndrome), acro-renal-ocular syndrome) and description of the clinical distinctions with similar phenotypes caused by other gene defects (PMID:16086360)
  • A novel nonsense mutation in the SALL4 gene was detected in all affected family members and obligate carriers. (PMID:16411190)
  • SALL4 is constitutively expressed in primary acute myeloid leukemia (PMID:16763212)
  • mutation in the SALL4 gene causes IVIC syndrome(an autosomal dominant condition,hearing loss, heart murmur..) (PMID:17256792)
  • Results implied a possible association between epigenetic silencing of SALL4 and tumor cell aneuploidy. (PMID:17546590)
  • BMI1 is a target gene for SALL4 in leukemic cells. (PMID:17557835)
  • The detection and molecular characterization of four novel, overlapping microdeletions, all spanning SALL4 and flanking genes, in four unrelated cases with features of Okihiro syndrome and variable degrees of psychomotor delay, is reported. (PMID:17623483)
  • A not previously described mutation of the SALL4 gene in exon 4. (PMID:18299846)
  • the reprogramming activity of Sall4 is independent of an N-terminal domain implicated in recruiting the nucleosome remodeling and deacetylase corepressor complex, a global transcriptional repressor (PMID:18414659)
  • mapped the global gene targets of SALL4 and identified more than 2000 high-confidence, SALL4-binding genes in the human acute promyelocytic leukemic cell line, NB4 (PMID:18487508)
  • SALL4 and BMI-1 expressions are up-regulated significantly in acute leukemia. (PMID:19099625)
  • Down-regulation of the STAT3 activity using a dominant-negative construct resulted in a significant decrease in the expression of SALL4. (PMID:19151334)
  • Studies indicate that vertebrate sal orthologues (spalt-like/sall) have important developmental roles during neural development and organogenesis and gentic diseases. (PMID:19247946)
  • SALL4 is a novel sensitive and specific marker for ovarian primitive germ cell tumors (PMID:19295406)
  • SALL4 is a novel sensitive and relatively specific marker for testicular germ cell tumors (PMID:19390421)
  • Data demonstrate that stem cell protein SALL4 represses its target genes, PTEN and SALL1, through the epigenetic repressor Mi-2/NuRD complex. (PMID:19440552)
  • These results demonstrated that Sall4 plays positive roles in the generation of pluripotent stem cells from blastocysts and fibroblasts. (PMID:19476507)
  • SALL4 is a more sensitive marker than PLAP, AFP, or glypican-3 for extragonadal yolk sac tumors. (PMID:19574883)
  • This is the first report to suggest that SALL4 might be a potential candidate gene of premature ovarian failure. (PMID:19581335)
  • SALL4 may be a potential candidate gene for ventricular septal defect. (PMID:19619907)
  • SALL4 plays a critical role in the pathogenesis of MDS by causing the aberrant activation of the Wnt/beta-catenin pathway. (PMID:19781444)
  • SALL4 is a novel sensitive diagnostic marker for primary germ cell tumors of the central nervous system. (PMID:19820689)
  • SALL4 is a sensitive marker for AFP-producing gastric carcinoma and is especially useful to distinguish hepatoid gastric carcinoma from hepatocellular carcinoma. (PMID:20182341)
  • SALL4 is a master regulator that controls its own expression and the expression of OCT4. (PMID:20505821)
  • For primary extragonadal yolk sac tumors, combining LIN28 and SALL4 can achieve a higher diagnostic sensitivity than either alone. (PMID:21057460)
  • High SALL4 expression is associated with Wilms tumors. (PMID:21258884)
  • SALL4 is essential for cancer cell proliferation and is overexpressed at early clinical stages in breast cancer (PMID:21274508)
  • No genetic variations were detected in the coding region and in the neighboring intronic regions of the SALL4 gene suggesting an alternative mechanism in the pathogenesis (PMID:21405998)
  • Of the 16 malignant rhabdoid tumors, 14 (88%) tumors showed robust SALL4 and/or LIN28 expression. (PMID:21417895)
  • SALL4 has an important role in the proliferation and survival of CML cells & its expression is associated with an advanced stage of CML disease. This is mediated by the Bmi-1 pathway. (PMID:21468036)
  • SALL4 was able to bind to the promoter region of ABCA3 and activate its expression while regulating the expression of ABCG2 indirectly. (PMID:21526180)
  • SALL4 mRNA was highly expressed even in the early clinical stages and there was no difference in the positivity rate between stage IA and other stagesin lung cancer. (PMID:21725617)
  • stem cell self-renewal mediated by SALL4 via epigenetic machinery. (PMID:22128185)
  • SALL4 is a more sensitive and specific marker for yolk sac tumors than glypican-3 (PMID:22448662)
  • Alpha-fetoprotein-producing gastric carcinoma and combined hepatocellular and cholangiocarcinoma show similar morphology but different histogenesis with respect to SALL4 expression. (PMID:22516245)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
danio_reriosall4ENSDARG00000044485
mus_musculusSall4ENSMUSG00000027547
rattus_norvegicusSall4ENSRNOG00000050035

Paralogs (14): HIVEP2 (ENSG00000010818), HIVEP1 (ENSG00000095951), ZNF516 (ENSG00000101493), SALL1 (ENSG00000103449), BCL11A (ENSG00000119866), ZNF831 (ENSG00000124203), RREB1 (ENSG00000124782), HIVEP3 (ENSG00000127124), BCL11B (ENSG00000127152), ZNF219 (ENSG00000165804), SALL2 (ENSG00000165821), ZNF217 (ENSG00000171940), ZNF536 (ENSG00000198597), SALL3 (ENSG00000256463)

Protein

Protein identifiers

Sal-like protein 4Q9UJQ4 (reviewed: Q9UJQ4)

Alternative names: Zinc finger protein 797, Zinc finger protein SALL4

All UniProt accessions (2): Q9UJQ4, Q6Y8G5

UniProt curated annotations — full annotation on UniProt →

Function. Transcription factor with a key role in the maintenance and self-renewal of embryonic and hematopoietic stem cells.

Subunit / interactions. Interacts with POU5F1/OCT4. Interacts with NANOG. Interacts with BEND3. Interacts with NSD2 (via PHD-type zinc fingers 1, 2 and 3). Interacts with NRBP1.

Subcellular location. Cytoplasm. Nucleus.

Tissue specificity. Expressed in testis. Constitutively expressed in acute myeloid leukemia (AML).

Post-translational modifications. Isoform SALL4B exists primarily as a ubiquitinated form. Sumoylation with both SUMO1 and SUMO2 regulates the stability, subcellular localization, transcriptional activity, and may reduce interaction with POU5F1/OCT4.

Disease relevance. Duane-radial ray syndrome (DRRS) [MIM:607323] Disorder characterized by the association of forearm malformations with Duane retraction syndrome. The disease is caused by variants affecting the gene represented in this entry. IVIC syndrome (IVIC) [MIM:147750] An autosomal dominant condition characterized by upper limbs anomalies (radial ray defects, carpal bones fusion), extraocular motor disturbances, congenital bilateral non-progressive mixed hearing loss. Other less consistent malformations include heart involvement, mild thrombocytopenia and leukocytosis (before age 50), shoulder girdle hypoplasia, imperforate anus, kidney malrotation or rectovaginal fistula. The IVIC syndrome is an allelic disorder of Duane-radial ray syndrome with a similar phenotype. The disease is caused by variants affecting the gene represented in this entry.

Similarity. Belongs to the sal C2H2-type zinc-finger protein family.

Isoforms (2)

UniProt IDNamesCanonical?
Q9UJQ4-1SALL4Ayes
Q9UJQ4-2SALL4B

RefSeq proteins (2): NP_001304960, NP_065169* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR013087Znf_C2H2_typeDomain
IPR036236Znf_C2H2_sfHomologous_superfamily
IPR051565Sal_C2H2-zinc-fingerFamily

Pfam: PF00096

UniProt features (74 total): cross-link 19, zinc finger region 8, region of interest 7, compositionally biased region 7, modified residue 7, strand 6, turn 6, helix 6, sequence variant 3, sequence conflict 3, chain 1, splice variant 1

Structure

Experimental structures (PDB)

13 structures.

PDBMethodResolution (Å)
7BQVX-RAY DIFFRACTION1.8
7BQUX-RAY DIFFRACTION1.9
8CUCX-RAY DIFFRACTION2.09
7Y3IX-RAY DIFFRACTION2.45
7Y3KX-RAY DIFFRACTION2.5
5XWRX-RAY DIFFRACTION2.69
9NWSELECTRON MICROSCOPY2.7
9NWTELECTRON MICROSCOPY2.7
7Y3MX-RAY DIFFRACTION2.72
8U16X-RAY DIFFRACTION2.9
8U15X-RAY DIFFRACTION2.95
8U17X-RAY DIFFRACTION3.1
6UMLX-RAY DIFFRACTION3.58

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9UJQ4-F151.860.03

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (26): 57, 307, 541, 776, 789, 852, 1019, 156, 156, 175, 190, 290, 316, 316, 372, 374, 374, 436, 550, 597 …

Function

Pathways and Gene Ontology

Reactome pathways

8 pathways

IDPathway
R-HSA-2892247POU5F1 (OCT4), SOX2, NANOG activate genes related to proliferation
R-HSA-452723Transcriptional regulation of pluripotent stem cells
R-HSA-8943724Regulation of PTEN gene transcription
R-HSA-1257604PIP3 activates AKT signaling
R-HSA-1266738Developmental Biology
R-HSA-162582Signal Transduction
R-HSA-6807070PTEN Regulation
R-HSA-9006925Intracellular signaling by second messengers

MSigDB gene sets: 492 (showing top): GOBP_CARDIAC_CHAMBER_DEVELOPMENT, GOBP_MORPHOGENESIS_OF_AN_EPITHELIUM, GOBP_EMBRYO_DEVELOPMENT_ENDING_IN_BIRTH_OR_EGG_HATCHING, GOBP_EPITHELIUM_DEVELOPMENT, GOBP_CARDIAC_SEPTUM_DEVELOPMENT, GOBP_GROWTH, GOBP_NEURAL_TUBE_DEVELOPMENT, GOBP_MORPHOGENESIS_OF_EMBRYONIC_EPITHELIUM, GOBP_INNER_CELL_MASS_CELL_PROLIFERATION, MUELLER_PLURINET, GOBP_IN_UTERO_EMBRYONIC_DEVELOPMENT, GOBP_APPENDAGE_DEVELOPMENT, GOBP_BLASTOCYST_DEVELOPMENT, GOBP_CARDIAC_VENTRICLE_DEVELOPMENT, GOBP_MAINTENANCE_OF_CELL_NUMBER

GO Biological Process (12): negative regulation of transcription by RNA polymerase II (GO:0000122), inner cell mass cell proliferation (GO:0001833), neural tube closure (GO:0001843), ventricular septum development (GO:0003281), regulation of transcription by RNA polymerase II (GO:0006357), embryonic limb morphogenesis (GO:0030326), somatic stem cell population maintenance (GO:0035019), positive regulation of transcription by RNA polymerase II (GO:0045944), in utero embryonic development (GO:0001701), heart development (GO:0007507), tissue development (GO:0009888), neural tube development (GO:0021915)

GO Molecular Function (5): DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA binding (GO:0003677), zinc ion binding (GO:0008270), protein binding (GO:0005515), metal ion binding (GO:0046872)

GO Cellular Component (5): heterochromatin (GO:0000792), nucleus (GO:0005634), nucleoplasm (GO:0005654), cytoplasm (GO:0005737), protein-containing complex (GO:0032991)

Reactome top-level categories

Rollup of top-6 pathways:

CategoryPathways
Transcriptional regulation of pluripotent stem cells1
Developmental Biology1
PTEN Regulation1
Intracellular signaling by second messengers1
PIP3 activates AKT signaling1
Signal Transduction1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
regulation of transcription by RNA polymerase II3
transcription by RNA polymerase II3
chordate embryonic development2
chromatin2
cellular anatomical structure2
negative regulation of DNA-templated transcription1
blastocyst growth1
cell population proliferation1
primary neural tube formation1
tube closure1
cardiac ventricle development1
cardiac septum development1
regulation of DNA-templated transcription1
limb morphogenesis1
embryonic appendage morphogenesis1
stem cell population maintenance1
positive regulation of DNA-templated transcription1
animal organ development1
circulatory system development1
anatomical structure development1
nervous system development1
tube development1
epithelium development1
RNA polymerase II transcription regulatory region sequence-specific DNA binding1
DNA-binding transcription factor activity1
nucleic acid binding1
transition metal ion binding1
binding1
cation binding1
intracellular membrane-bounded organelle1
nuclear lumen1
intracellular anatomical structure1
cellular_component1

Protein interactions and networks

STRING

2186 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SALL4NANOGQ9H9S0997
SALL4POU5F1P31359955
SALL4ESRRBO95718919
SALL4SOX2P48431901
SALL4NR0B1P51843888
SALL4NSD2O96028874
SALL4CHN1P15882823
SALL4TBX3O15119819
SALL4CCNQQ8N1B3785
SALL4TBX5Q99593769
SALL4CTNNB1P35222741
SALL4LIN28AQ9H9Z2733
SALL4NKX2-5P52952726
SALL4UTF1Q5T230719
SALL4HOXA1P49639713

IntAct

6 interactions, top by confidence:

ABTypeScore
MBD3CDK2AP1psi-mi:“MI:0914”(association)0.730
SALL4MBD3psi-mi:“MI:0914”(association)0.530
Mta1MTA3psi-mi:“MI:0914”(association)0.350
SALL1MTA2psi-mi:“MI:0914”(association)0.350
SALL4MTA2psi-mi:“MI:0914”(association)0.350

BioGRID (51): SALL4 (Affinity Capture-MS), SALL4 (Affinity Capture-MS), SALL4 (Affinity Capture-MS), SALL4 (Reconstituted Complex), SALL4 (Affinity Capture-MS), SALL4 (Affinity Capture-MS), SALL4 (Affinity Capture-Western), SALL4 (Affinity Capture-Western), EZH2 (Affinity Capture-Western), SALL4 (Reconstituted Complex), SALL4 (Biochemical Activity), SALL4 (Affinity Capture-MS), CRBN (Affinity Capture-Western), SALL4 (Biochemical Activity), SALL4 (Reconstituted Complex)

ESM2 similar proteins: A0A1L8GSA2, A0A1L8H0H2, A0JN51, A0JP82, A2AWL7, A2BGM5, A2RRX6, F8VPJ6, K9JHZ4, O13186, O46567, O54826, O89091, P04150, P08235, P15822, P22199, P32519, P36197, P37275, P48552, P55197, P59759, P79269, P79686, Q29131, Q2KHR2, Q3YC04, Q4JM28, Q5R9P5, Q60775, Q61321, Q62947, Q64318, Q68DE3, Q6XLJ0, Q8AYC1, Q8AYC2, Q8BMA5, Q8IZQ8

Diamond homologs: B0K011, E9Q6W4, P23803, P41995, P86413, Q08DS3, Q0IHB8, Q32NK7, Q3T135, Q567J8, Q5XJQ7, Q62255, Q66JF8, Q6AY34, Q7PN68, Q8I7Z8, Q8N2R0, Q8TAX0, Q8WUU4, Q91ZD1, Q9BXA9, Q9N5X6, Q9UJQ4, Q9VQS6, Q9VQS7, Q9VQU9, Q9WVG7, A1L2U9, A2A884, A2ANX9, B0X0K1, B0X9H6, B0XS89, B1WAZ8, B1WBU4, E9PZZ1, G5EBU4, O15090, O60315, O62836

SIGNOR signaling

13 interactions.

AEffectBMechanism
SALL4“up-regulates quantity by expression”ABCA3“transcriptional regulation”
SALL4“up-regulates quantity by expression”ABCG2“transcriptional regulation”
SALL4“up-regulates quantity by expression”ZEB1“transcriptional regulation”
SALL4“down-regulates quantity by repression”CDH1“transcriptional regulation”
SALL4“up-regulates quantity by expression”HOXA9“transcriptional regulation”
SALL4“down-regulates quantity by repression”PTEN“transcriptional regulation”
SALL4“down-regulates quantity by repression”SALL1“transcriptional regulation”
STAT3“up-regulates quantity by expression”SALL4“transcriptional regulation”
SALL4“up-regulates quantity by expression”NANOG“transcriptional regulation”
CRBN“down-regulates quantity by destabilization”SALL4ubiquitination
CRBN“down-regulates quantity by destabilization”SALL4binding
SALL4“up-regulates quantity by expression”CECR2“transcriptional regulation”
SOX2/POU5F1“up-regulates quantity by expression”SALL4“transcriptional regulation”

Disease & clinical

Cancer significance

From intOGen — cancer-driver classification: activating (oncogene-like) across 1 cancer types — CLLSLL.

Clinical variants and AI predictions

ClinVar

603 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic42
Likely pathogenic14
Uncertain significance331
Likely benign122
Benign30

Top pathogenic / likely-pathogenic (30)

Variant IDHGVSClassification
1362243NC_000020.10:g.(?50400804)(50418947_?)delPathogenic
1411759NM_020436.5(SALL4):c.1959dup (p.Pro654fs)Pathogenic
1415336NM_020436.5(SALL4):c.1657C>T (p.Gln553Ter)Pathogenic
1455097NM_020436.5(SALL4):c.1717C>T (p.Arg573Ter)Pathogenic
1684297NM_020436.5(SALL4):c.1068del (p.Lys357fs)Pathogenic
1697971NM_020436.5(SALL4):c.712C>T (p.Gln238Ter)Pathogenic
1806356NM_020436.5(SALL4):c.2252del (p.Asn751fs)Pathogenic
2015460NM_020436.5(SALL4):c.2280dup (p.Asn761fs)Pathogenic
2035463NM_020436.5(SALL4):c.1472C>A (p.Ser491Ter)Pathogenic
2087818NM_020436.5(SALL4):c.474_475del (p.Glu158fs)Pathogenic
218947NM_020436.5(SALL4):c.410dup (p.Gly138fs)Pathogenic
2412780NM_020436.5(SALL4):c.1625del (p.Pro542fs)Pathogenic
2501946NM_020436.5(SALL4):c.1829_1833dup (p.His612fs)Pathogenic
2502302NM_020436.5(SALL4):c.2456dup (p.Met819fs)Pathogenic
2633659NM_020436.5(SALL4):c.1242_1245del (p.Cys415fs)Pathogenic
2728404NM_020436.5(SALL4):c.1994_1995del (p.Phe665fs)Pathogenic
2756162NM_020436.5(SALL4):c.2424dup (p.Ala809fs)Pathogenic
3318NM_020436.5(SALL4):c.1954C>T (p.Gln652Ter)Pathogenic
3319NM_020436.5(SALL4):c.1054del (p.Ala352fs)Pathogenic
3320NM_020436.5(SALL4):c.941dup (p.Leu315fs)Pathogenic
3321NM_020436.5(SALL4):c.1904del (p.Phe635fs)Pathogenic
3323NM_020436.5(SALL4):c.2425del (p.Ala809fs)Pathogenic
3324NM_020436.5(SALL4):c.326del (p.Pro109fs)Pathogenic
3325NM_020436.5(SALL4):c.523A>T (p.Lys175Ter)Pathogenic
3326NM_020436.5(SALL4):c.1849C>T (p.Arg617Ter)Pathogenic
3329NM_020436.5(SALL4):c.2663A>G (p.His888Arg)Pathogenic
3330NM_020436.5(SALL4):c.2607del (p.Gln869fs)Pathogenic
3347440NM_020436.5(SALL4):c.2241_2248delinsTT (p.Ser748_Glu750delinsTer)Pathogenic
3362635NM_020436.5(SALL4):c.1557del (p.Thr520fs)Pathogenic
3639620NM_020436.5(SALL4):c.1291C>T (p.Arg431Ter)Pathogenic

SpliceAI

614 predictions. Top by Δscore:

VariantEffectΔscore
20:51784680:TGAAC:Tacceptor_gain1.0000
20:51784681:GAAC:Gacceptor_gain1.0000
20:51784683:ACC:Aacceptor_loss1.0000
20:51784685:C:CCacceptor_gain1.0000
20:51784685:CTAT:Cacceptor_loss1.0000
20:51792351:ACC:Aacceptor_loss1.0000
20:51792353:C:CCacceptor_gain1.0000
20:51792353:CTGTA:Cacceptor_loss1.0000
20:51802277:AC:Adonor_gain1.0000
20:51802278:CC:Cdonor_gain1.0000
20:51802278:CCCAG:Cdonor_gain1.0000
20:51784682:AAC:Aacceptor_gain0.9900
20:51784683:AC:Aacceptor_gain0.9900
20:51784684:CCTA:Cacceptor_gain0.9900
20:51784687:A:ACacceptor_gain0.9900
20:51784687:A:Cacceptor_gain0.9900
20:51784694:C:CTacceptor_gain0.9900
20:51784695:A:Tacceptor_gain0.9900
20:51788857:CCAC:Cdonor_loss0.9900
20:51788857:CCACC:Cdonor_gain0.9900
20:51788859:A:ATdonor_loss0.9900
20:51788860:CCT:Cdonor_loss0.9900
20:51788870:G:Adonor_gain0.9900
20:51789939:T:Adonor_gain0.9900
20:51790024:T:TAdonor_gain0.9900
20:51792349:GCAC:Gacceptor_gain0.9900
20:51792350:CAC:Cacceptor_gain0.9900
20:51792350:CACC:Cacceptor_gain0.9900
20:51792351:AC:Aacceptor_gain0.9900
20:51792352:CC:Cacceptor_gain0.9900

AlphaMissense

6920 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:51784681:G:CH916D1.000
20:51788865:A:GL913S1.000
20:51788867:G:CN912K1.000
20:51788867:G:TN912K1.000
20:51788872:C:GG911R1.000
20:51788882:A:CF907L1.000
20:51788882:A:TF907L1.000
20:51788883:A:GF907S1.000
20:51788884:A:GF907L1.000
20:51788894:A:CC903W1.000
20:51788895:C:TC903Y1.000
20:51788896:A:GC903R1.000
20:51788903:G:CC900W1.000
20:51788905:A:GC900R1.000
20:51788927:G:CH892Q1.000
20:51788927:G:TH892Q1.000
20:51788934:C:GR890P1.000
20:51788939:G:CH888Q1.000
20:51788939:G:TH888Q1.000
20:51788941:G:CH888D1.000
20:51788941:G:TH888N1.000
20:51788966:G:CF879L1.000
20:51788966:G:TF879L1.000
20:51788967:A:GF879S1.000
20:51788968:A:GF879L1.000
20:51790551:A:CH644Q1.000
20:51790551:A:TH644Q1.000
20:51790552:T:GH644P1.000
20:51790578:G:CF635L1.000
20:51790578:G:TF635L1.000

dbSNP variants (sampled 300 via entrez): RS1000011679 (20:51792676 G>A), RS1000076739 (20:51793905 G>A), RS1000232958 (20:51799305 A>C,G), RS1000362657 (20:51785646 G>A,C,T), RS1000447884 (20:51797469 C>T), RS1000457860 (20:51797381 C>T), RS1000490560 (20:51797793 T>C), RS1000589490 (20:51792831 T>C), RS1000652774 (20:51791352 C>T), RS1000714156 (20:51785381 A>G), RS1000717553 (20:51792576 C>T), RS1001183024 (20:51803847 T>C), RS1001422484 (20:51784658 G>A), RS1001443340 (20:51781894 A>G), RS1001525601 (20:51785545 A>G)

Disease associations

OMIM: gene MIM:607343 | disease phenotypes: MIM:607323, MIM:147750, MIM:601457, MIM:610805

GenCC curated gene-disease

DiseaseClassificationInheritance
Duane-radial ray syndromeDefinitiveAutosomal dominant
IVIC syndromeSupportiveAutosomal dominant
Duane retraction syndromeSupportiveAutosomal dominant

Mondo (6): Duane-radial ray syndrome (MONDO:0011812), IVIC syndrome (MONDO:0007836), severe combined immunodeficiency, autosomal recessive, T cell-negative, B cell-negative, NK cell-positive (MONDO:0011086), congenital heart disease (MONDO:0005453), congenital anomaly of kidney and urinary tract (MONDO:0019719), Duane retraction syndrome (MONDO:0007473)

Orphanet (5): Okihiro syndrome (Orphanet:93293), Acro-renal-ocular syndrome (Orphanet:959), IVIC syndrome (Orphanet:2307), Severe combined immunodeficiency due to complete RAG1/2 deficiency (Orphanet:331206), Renal or urinary tract malformation (Orphanet:93545)

HPO phenotypes

151 total (30 of 151 shown, HPO-id order):

HPOTerm
HP:0000006Autosomal dominant inheritance
HP:0000015Bladder diverticulum
HP:0000076Vesicoureteral reflux
HP:0000085Horseshoe kidney
HP:0000086Ectopic kidney
HP:0000089Renal hypoplasia
HP:0000104Renal agenesis
HP:0000126Hydronephrosis
HP:0000143Rectovaginal fistula
HP:0000175Cleft palate
HP:0000232Everted lower lip vermilion
HP:0000252Microcephaly
HP:0000286Epicanthus
HP:0000316Hypertelorism
HP:0000324Facial asymmetry
HP:0000347Micrognathia
HP:0000365Hearing impairment
HP:0000377Abnormal pinna morphology
HP:0000384Preauricular skin tag
HP:0000402Stenosis of the external auditory canal
HP:0000405Conductive hearing impairment
HP:0000407Sensorineural hearing impairment
HP:0000431Wide nasal bridge
HP:0000452Choanal stenosis
HP:0000453Choanal atresia
HP:0000463Anteverted nares
HP:0000465Webbed neck
HP:0000470Short neck
HP:0000480Retinal coloboma
HP:0000482Microcornea

GWAS associations

3 associations (top):

StudyTraitp-value
GCST000412_1Male infertility5.000000e-07
GCST002682_15Tourette’s syndrome or obsessive-compulsive disorder8.000000e-06
GCST011053_13Neuroblastoma (pediatric)1.000000e-10

MeSH disease descriptors (5)

DescriptorNameTree numbers
D004370Duane Retraction SyndromeC10.292.562.700.375.500; C11.270.235; C11.590.436.400.500; C16.320.290.235
D006330Heart Defects, CongenitalC14.240.400; C14.280.400; C16.131.240.400
C566906Cakut (supp.)
C535544Oculootoradial syndrome (supp.)
C563311Severe Combined Immunodeficiency, Autosomal Recessive, T Cell-Negative, B Cell-Negative, NK Cell-Positive (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (2): CHEMBL4879536 (PROTEIN-PROTEIN INTERACTION), CHEMBL6067463 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

25 potent at pChembl≥5 of 25 total, top 25 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
7.72EC5019nMCHEMBL5618808
7.12EC5075nMCHEMBL5618646
6.96IC50109nMCHEMBL5804153
6.80EC50160nMCHEMBL5619144
6.79IC50162nMCHEMBL4548728
6.77EC50170nMCHEMBL5620230
6.54IC50288nMCHEMBL5918901
6.47IC50335nMCHEMBL5856428
6.39EC50410nMCHEMBL5618477
6.36IC50434nMCHEMBL5967769
6.30IC50498nMCHEMBL6003258
6.25EC50560nMCHEMBL5618970
6.23IC50594nMCHEMBL5936883
6.21IC50619nMCHEMBL5761009
6.21IC50623nMCHEMBL5816357
6.21IC50616nMCHEMBL6006795
6.11IC50783nMCHEMBL5849668
5.87IC501357nMCHEMBL5872511
5.82IC501506nMCHEMBL5852700
5.65IC502231nMCHEMBL5996102
5.62EC502400nMCHEMBL5618377
5.60IC502489nMCHEMBL5990283
5.56IC502770nMCHEMBL5837395
5.55EC502800nMCHEMBL5619492
5.44EC503600nMCHEMBL5619824

PubChem BioAssay actives

9 with measured affinity, of 56 total; 9 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
2,2,2-trichloroethyl (1S)-1-(4-bromophenyl)-2-[2-(2,6-dioxopiperidin-3-yl)-1-oxo-3H-isoindol-4-yl]cycloprop-2-ene-1-carboxylate2130920: Induction of C-terminal HiBiT tag knocked in SALL4 degradation in human NCCIT cells incubated for 60 mins by Nano-Glo HiBiT lytic reagent based assayec500.0190uM
2,2,2-trichloroethyl (1R)-1-(4-bromophenyl)-2-[2-(2,6-dioxopiperidin-3-yl)-1-oxo-3H-isoindol-5-yl]cycloprop-2-ene-1-carboxylate2130920: Induction of C-terminal HiBiT tag knocked in SALL4 degradation in human NCCIT cells incubated for 60 mins by Nano-Glo HiBiT lytic reagent based assayec500.0750uM
trans-2,2,2-trichloroethyl (1S,2R)-1-(4-bromophenyl)-2-[2-(2,6-dioxopiperidin-3-yl)-1-oxo-3H-isoindol-5-yl]cyclopropane-1-carboxylate2130920: Induction of C-terminal HiBiT tag knocked in SALL4 degradation in human NCCIT cells incubated for 60 mins by Nano-Glo HiBiT lytic reagent based assayec500.1600uM
2,2,2-trichloroethyl (1R)-1-(4-bromophenyl)-2-[2-(2,6-dioxopiperidin-3-yl)-1-oxo-3H-isoindol-4-yl]cycloprop-2-ene-1-carboxylate2130920: Induction of C-terminal HiBiT tag knocked in SALL4 degradation in human NCCIT cells incubated for 60 mins by Nano-Glo HiBiT lytic reagent based assayec500.1700uM
trans-2,2,2-trichloroethyl (1S,2R)-1-(4-bromophenyl)-2-[2-(2,6-dioxopiperidin-3-yl)-1,3-dioxoisoindol-5-yl]cyclopropane-1-carboxylate2130920: Induction of C-terminal HiBiT tag knocked in SALL4 degradation in human NCCIT cells incubated for 60 mins by Nano-Glo HiBiT lytic reagent based assayec500.4100uM
2,2,2-trichloroethyl (1S)-1-(4-bromophenyl)-2-[2-(2,6-dioxopiperidin-3-yl)-1,3-dioxoisoindol-4-yl]cycloprop-2-ene-1-carboxylate2130920: Induction of C-terminal HiBiT tag knocked in SALL4 degradation in human NCCIT cells incubated for 60 mins by Nano-Glo HiBiT lytic reagent based assayec500.5600uM
2,2,2-trichloroethyl (1R)-1-(4-bromophenyl)-2-[2-(2,6-dioxopiperidin-3-yl)-1,3-dioxoisoindol-5-yl]cycloprop-2-ene-1-carboxylate2130920: Induction of C-terminal HiBiT tag knocked in SALL4 degradation in human NCCIT cells incubated for 60 mins by Nano-Glo HiBiT lytic reagent based assayec502.4000uM
2,2,2-trichloroethyl (1R)-1-(4-bromophenyl)-2-[2-(2,6-dioxopiperidin-3-yl)-1,3-dioxoisoindol-4-yl]cycloprop-2-ene-1-carboxylate2130920: Induction of C-terminal HiBiT tag knocked in SALL4 degradation in human NCCIT cells incubated for 60 mins by Nano-Glo HiBiT lytic reagent based assayec502.8000uM
2,2,2-trichloroethyl (1S)-1-(4-bromophenyl)-2-[2-(2,6-dioxopiperidin-3-yl)-1,3-dioxoisoindol-5-yl]cycloprop-2-ene-1-carboxylate2130920: Induction of C-terminal HiBiT tag knocked in SALL4 degradation in human NCCIT cells incubated for 60 mins by Nano-Glo HiBiT lytic reagent based assayec503.6000uM

CTD chemical–gene interactions

48 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, decreases expression6
trichostatin Aaffects cotreatment, decreases expression3
TAK-243decreases reaction, increases degradation, decreases sumoylation, decreases expression2
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression2
Resveratrolaffects cotreatment, decreases expression2
Phenylmercuric Acetateaffects cotreatment, decreases expression2
Tretinoindecreases expression2
GSK-J4decreases expression1
Esketaminedecreases expression1
sotorasibaffects cotreatment, decreases expression1
methylmercuric chloridedecreases expression1
propionaldehydeincreases expression1
bisphenol Adecreases expression1
sulforaphanedecreases expression1
tris(1,3-dichloro-2-propyl)phosphateincreases expression1
sodium arsenitedecreases expression1
2,3-bis(3’-hydroxybenzyl)butyrolactoneaffects cotreatment, decreases expression1
CGP 52608affects binding, increases reaction1
ginsenoside Rg3decreases expression1
pomalidomideaffects binding, decreases expression, increases degradation1
dorsomorphinaffects cotreatment, decreases expression1
pevonedistatdecreases expression, decreases reaction, increases degradation1
LDN 193189affects cotreatment, decreases expression1
trametinibaffects cotreatment, decreases expression1
N-(2-(3H-indol-3-yl)ethyl)-9-isopropyl-2-(5-methyl-3-pyridyl)purin-6-amineincreases expression, decreases reaction1
NVP-BKM120affects cotreatment, decreases expression1
Bortezomibdecreases expression, decreases reaction, increases degradation1
Sunitinibdecreases expression1
Lenalidomideaffects binding, decreases expression, increases degradation1
Vorinostataffects cotreatment, decreases expression1

ChEMBL screening assays

12 unique, capped per target: 12 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL4822141BindingProtac activity at CRBN/SALL4 in human Kelly cells assessed as induction of SALL4 degradation by measuring downregulation of SALL4 protein expression at 1 uM measured after 5 hrs by proteomics analysis based LC-MS analysisA biphenyl inhibitor of eIF4E targeting an internal binding site enables the design of cell-permeable PROTAC-degraders. — Eur J Med Chem

Cellosaurus cell lines

7 cell lines: 3 embryonic stem cell, 3 cancer cell line, 1 transformed cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A6A9SEES3-1V human SALL4, clone1Embryonic stem cellMale
CVCL_A6B0SEES3-1V human SALL4, clone2Embryonic stem cellMale
CVCL_A6B1SEES3-1V human SALL4, clone3Embryonic stem cellMale
CVCL_B8P5Abcam HCT 116 SALL4 KOCancer cell lineMale
CVCL_B9BDAbcam MCF-7 SALL4 KOCancer cell lineFemale
CVCL_B9RHAbcam A-549 SALL4 KOCancer cell lineMale
CVCL_HC92HEK293 eGFP-SALL4Transformed cell lineFemale

Clinical trials (associated diseases)

300 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00668824PHASE4UNKNOWNImproved Diagnosis of Congenital Heart Disease by Magnetic Resonance Imaging Using Vasovist
NCT01368705PHASE4COMPLETEDNitrogen Balance in Infants After Post Cardiothoracic Surgery
NCT01619982PHASE4COMPLETEDPre-operative Prophylaxis With Vancomycin and Cefazolin in Pediatric Cardiovascular Surgery Patients
NCT02122679PHASE4WITHDRAWNTranexamic Acid Effect on Platelet Aggregation Following Infant Cardiopulmonary Bypass
NCT02527811PHASE4UNKNOWNUlinastatin Injection in in Pediatric Patients Undergoing Open Heart Surgery
NCT03014700PHASE4COMPLETEDFibrinogen Concentrate vs Cryoprecipitate
NCT03408340PHASE4TERMINATEDParavertebral Nerve Blocks in Neonates
NCT03630796PHASE4UNKNOWNEffect of Sevoflurane in Postoperative Troponin I Levels in Children Undergoing Congenital Heart Defects Surgery
NCT03667703PHASE4COMPLETEDStress Ulcer Prophylaxis Versus Placebo in Critically Ill Infants With Congenital Heart Disease
NCT04453761PHASE4UNKNOWNThiamine Influenced on Substrate Energy Effectiveness in Indonesian Children Undergoing Cardiopulmonary Bypass
NCT06668389PHASE4RECRUITINGSodium-Glucose Cotransporter 2 Inhibitors for Repaired Tetralogy of Fallot Patients for Enhancement of Cardio-Pulmonary Status Trial
NCT07499154PHASE4NOT_YET_RECRUITINGPerioperative Lidocaine for Lung Protection in Infants Undergoing Cardiac Surgery
NCT00000470PHASE3COMPLETEDInfant Heart Surgery: Central Nervous System Sequelae of Circulatory Arrest
NCT00000494PHASE3COMPLETEDManagement of Patent Ductus in Premature Infants
NCT01134302PHASE3UNKNOWNHybrid Versus Norwood Management Strategies in Infants Undergoing Single Ventricle Palliation
NCT01607983PHASE3WITHDRAWNEffects of Pulmonary Vasodilation Upon VA Coupling in Fontan Patients
NCT01662011PHASE3UNKNOWNApplication of Neurally Adjusted Ventilatory Assist to Children After Congenital Cardiac Surgery
NCT02320669PHASE3COMPLETEDPhase 3 Triiodothyronine Supplementation for Infants After Cardiopulmonary Bypass
NCT02615262PHASE3COMPLETEDIntraoperative Dexamethasone in Pediatric Cardiac Surgery
NCT03153137PHASE3COMPLETEDClinical Study Assessing the Efficacy and Safety of Macitentan in Fontan-palliated Subjects
NCT03154476PHASE3COMPLETEDRole of Sildenafil for Fontan Associated Liver Disease (SiFALD) Study
NCT04536194PHASE3COMPLETEDDopamine Versus Norepinephrine Under General Anesthesia
NCT04702373PHASE3ACTIVE_NOT_RECRUITINGTraining in Exercise Activities and Motion for Growth (TEAM 4 Growth) RCT
NCT05049590PHASE3COMPLETEDAcute Normovolemic Hemodilution in Complex Cardiac Surgery
NCT06406517PHASE3UNKNOWNComparative Effectiveness of Gadopiclenol for Evaluation of Adult Congenital Heart Anatomy and Hemodynamics
NCT06693674PHASE3RECRUITINGEffect of Sacubitril-Valsartan on Cardiac Structure and Function
NCT06955260PHASE3NOT_YET_RECRUITINGSGLT2 Inhibition With Empagliflozin in Fontan Circulatory Failure
NCT00115375PHASE2COMPLETEDPlatelet Aggregation Inhibition in Children on Clopidogrel (PICOLO)
NCT00350220PHASE2COMPLETEDTransfusion Strategies in Pediatric Cardiothoracic Surgery
NCT00374088PHASE2COMPLETEDN-Acetylcysteine in Neonatal Congenital Heart Surgery (INACT Study)
NCT00538785PHASE2COMPLETEDA Study to Evaluate MEDI-524 In Children With Hemodynamically Significant Congenital Heart Disease
NCT00770705PHASE2WITHDRAWNParenteral Phenoxybenzamine During Congenital Heart Disease Surgery
NCT00919945PHASE2TERMINATEDImpact of Early Enteral Feeding on Splanchnic Blood Flow After Surgery for Critical Heart Disease in the Newborn
NCT01063712PHASE2COMPLETEDSafety and Effectiveness of the Device Nit-Occlud® PDA-R
NCT01069510PHASE2COMPLETEDSpironolactone in Adult Congenital Heart Disease
NCT01189981PHASE2COMPLETEDEffect of eHealth Encouragements to Intensive Exercise in Adolescents With Congenital Heart Disease
NCT01330433PHASE2COMPLETEDEffects of CoSeal on Bleeding & Adhesions in Pediatric Heart Surgery
NCT01662037PHASE2COMPLETEDBosentan Therapy in Children With Functional Single Ventricle
NCT01668264PHASE2UNKNOWNImaging Assessment of Diastolic Function
NCT01827059PHASE2UNKNOWNBosentan In Exercise Induced Pulmonary Arterial Hypertension in CongenitaL Heart diseasE