SOX4

gene
On this page

Summary

SOX4 (SRY-box transcription factor 4, HGNC:11200) is a protein-coding gene on chromosome 6p22.3, encoding Transcription factor SOX-4 (Q06945). Transcriptional activator that binds with high affinity to the T-cell enhancer motif 5’-AACAAAG-3’ motif.

This intronless gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins, such as syndecan binding protein (syntenin). The protein may function in the apoptosis pathway leading to cell death as well as to tumorigenesis and may mediate downstream effects of parathyroid hormone (PTH) and PTH-related protein (PTHrP) in bone development. The solution structure has been resolved for the HMG-box of a similar mouse protein.

Source: NCBI Gene 6659 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): Coffin-Siris syndrome 10 (Strong, GenCC) — +2 more curated relationships
  • GWAS associations: 14
  • Clinical variants (ClinVar): 250 total — 8 pathogenic, 12 likely-pathogenic
  • Phenotypes (HPO): 85
  • Transcription factor: yes — 27 downstream targets (CollecTRI)
  • MANE Select transcript: NM_003107

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:11200
Approved symbolSOX4
NameSRY-box transcription factor 4
Location6p22.3
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000124766
Ensembl biotypeprotein_coding
OMIM184430
Entrez6659

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000244745

RefSeq mRNA: 1 — MANE Select: NM_003107 NM_003107

CCDS: CCDS4547

Canonical transcript exons

ENST00000244745 — 1 exons

ExonStartEnd
ENSE000008481862159375121598619

Expression profiles

Bgee: expression breadth ubiquitous, 295 present calls, max score 99.57.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 157.9500 / max 10801.9646, expressed in 1749 samples.

FANTOM5 promoters (17 alternative TSS)

Promoter IDTPM avgSamples expressed
66299109.23251725
6631732.71491586
663142.8659952
663152.5958906
663112.2300872
663101.6820743
663041.4083751
663121.2355610
663020.7598339
663030.5884377

Top tissues by expression

300 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
cortical plateUBERON:000534399.57gold quality
ganglionic eminenceUBERON:000402399.46gold quality
embryoUBERON:000092299.03gold quality
ventricular zoneUBERON:000305397.78gold quality
tendon of biceps brachiiUBERON:000818897.74gold quality
corpus epididymisUBERON:000435997.48gold quality
type B pancreatic cellCL:000016997.45gold quality
visceral pleuraUBERON:000240197.36gold quality
mammary ductUBERON:000176597.30gold quality
endometrium epitheliumUBERON:000481197.26gold quality
epithelium of mammary glandUBERON:000324497.22gold quality
adult organismUBERON:000702396.40gold quality
caput epididymisUBERON:000435895.94gold quality
endometriumUBERON:000129595.81gold quality
tibiaUBERON:000097995.68gold quality
thymusUBERON:000237095.60gold quality
stromal cell of endometriumCL:000225595.51gold quality
vena cavaUBERON:000408795.43gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047395.24gold quality
germinal epithelium of ovaryUBERON:000130495.23gold quality
olfactory bulbUBERON:000226495.04gold quality
cartilage tissueUBERON:000241894.89gold quality
mucosa of sigmoid colonUBERON:000499394.79gold quality
nasal cavity epitheliumUBERON:000538494.78gold quality
cauda epididymisUBERON:000436094.53gold quality
pleuraUBERON:000097794.50gold quality
skin of hipUBERON:000155494.34gold quality
bronchial epithelial cellCL:000232894.21gold quality
cardia of stomachUBERON:000116294.21gold quality
renal medullaUBERON:000036294.19gold quality

Single-cell (SCXA)

Detected in 59 experiment(s), a significant marker in 47.

ExperimentMarker?Max mean expression
E-CURD-7yes17643.80
E-HCAD-5yes8544.95
E-ENAD-21yes7855.32
E-MTAB-9435yes5421.71
E-MTAB-8221yes5211.99
E-HCAD-56yes5144.45
E-MTAB-10485yes4365.91
E-MTAB-11121yes3699.46
E-CURD-114yes3278.49
E-HCAD-31yes2746.19
E-MTAB-10432yes2681.25
E-CURD-122yes2228.19
E-MTAB-9154yes2117.22
E-MTAB-8884yes2110.13
E-MTAB-7008yes1467.78

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

27 targets.

TargetRegulation
AGO1Unknown
AXIN2Unknown
BAXActivation
BBC3Unknown
CALD1
CDH2Activation
CDKN1AActivation
CEL
DHX9Unknown
DICER1Unknown
DLL1Activation
EGFR
FZD5
GCGActivation
GCKR
HDAC8Activation
IGLL1Unknown
KMT2AUnknown
MECOMActivation
NCAM1Activation
NKX3-1Unknown
NOTCH1
PTCH1
SOX2Activation
SOX4Activation
TBXT
TNCActivation

JASPAR motifs

MotifNameFamily
MA0867.1SOX4SOX-related factors
MA0867.2SOX4SOX-related factors
MA0867.3SOX4SOX-related factors

JASPAR matrix evidence (PMIDs): PMID:12920151, PMID:17599239

Upstream regulators (CollecTRI, top): AR, CEBPA, ERG, FOSL2, JUND, KAT5, SOX4, SOX9, TP53

miRNA regulators (miRDB)

199 targeting SOX4, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-1252-5P100.0069.802774
HSA-MIR-5193100.0067.261744
HSA-MIR-3163100.0077.238605
HSA-MIR-8485100.0077.574731
HSA-MIR-190A-3P100.0080.355520
HSA-MIR-4533100.0069.482758
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-6740-5P100.0065.64932
HSA-MIR-6873-3P100.0071.422626
HSA-MIR-4476100.0068.182030
HSA-MIR-6876-5P100.0067.682126
HSA-MIR-196A-1-3P99.9972.152772
HSA-MIR-371B-5P99.9975.344759
HSA-MIR-453199.9969.703181
HSA-MIR-453499.9966.581907
HSA-MIR-6870-5P99.9968.552115
HSA-MIR-32-5P99.9875.211964
HSA-MIR-92A-3P99.9875.211960
HSA-MIR-92B-3P99.9875.251955
HSA-MIR-373-5P99.9875.364753
HSA-MIR-616-5P99.9875.584775
HSA-MIR-25-3P99.9874.601817
HSA-MIR-363-3P99.9874.721821
HSA-MIR-367-3P99.9874.831819
HSA-MIR-19A-3P99.9875.332762

Literature-anchored findings (GeneRIF, showing 40)

  • Sox-4 is a positive regulator of Hep3B and HepG2 cells’ apoptosis induced by prostaglandin (PG)A(2) and delta(12)-PGJ(2). (PMID:12216117)
  • Data suggest that the apoptotic activity of Sox4 can be dissociated from its transcriptional trans-activation and is mediated through its central pro-apoptotic CD domain. (PMID:15522200)
  • The human transcription factor SOX4 was 5-fold up-regulated in bladder tumors compared with normal tissue based on whole-genome expression profiling of 166 clinical bladder tumor samples and 27 normal urothelium samples. (PMID:16585165)
  • Stable transfection of SOX4 into nontransformed prostate cells enabled colony formation in soft agar, suggesting that, in the proper cellular context, SOX4 can be a transforming oncogene. (PMID:16618720)
  • It shows that Ubc9 interacts with SOX4 and represses its transcriptional activity independent of its SUMO-1-conjugating activity. (PMID:16631117)
  • indings suggest that Sox4 contributes to the malignant phenotype of adenoid cystic carcinoma cells by promoting cell survival (PMID:16636670)
  • Sox4 expression in trangenic mice counteracts differentiation of radial glia and has to be downregulated before full maturation can occur. (PMID:17507571)
  • mutation of SOX4 gene in the different tumor tissues with pathological stages and types of non-small cell lung cancer (NSCLC) (PMID:17922414)
  • overexpression of SOX4 potentiates metastasis in hepatocellular carcinoma. (PMID:18504433)
  • There was a trend toward better survival with increasing SOX4 expression in medulloblastoma. (PMID:18577562)
  • SOX4 affects developmental signaling pathways and these changes may influence cancer progression via regulation of gene networks involved in microRNA processing, transcriptional regulation, growth factor signaling, and tumor metastasis. (PMID:19147588)
  • In lung cancer, SOX4 is overexpressed due to gene amplification and provide evidence of oncogenic properties of SOX4. (PMID:19153074)
  • high SOX4 transcript levels correlated with recurrence of colorectal cancer (PMID:19156145)
  • SOX4, a new DNA damage sensor, is required for the activation of p53 tumor suppressor in response to DNA damage. (PMID:19234109)
  • A direct link between miR-129 and the two putative targets GALNT1 and SOX4 in bladder cancer. (PMID:19487295)
  • Results imply that the aberrant expression of SOX4 is caused by epigenetic repression of miR-129-2 in endometrial cancer. (PMID:19887623)
  • induction of SOX4 gene expression might be responsible for the CD56 expression in human myeloma cells (PMID:20049565)
  • SOX4 has been implicated in pancreas development and the regulation of insulin secretion and risk of type 2 diabetes. (PMID:20080751)
  • up-regulation of SOX4 was inversely associated with the epigenetic silencing of miR-129-2 in gastric cancer, and restoration of miR-129-2 down-regulated SOX4 expression. (PMID:20331975)
  • SOX4 contributes to hepatocarcinogenesis by inhibiting p53-mediated apoptosis. (PMID:20400479)
  • SOX4 and TGFBI expression is elevated in GBM tissues compared with normal brain tissues at both the RNA and protein levels (PMID:20419098)
  • SOX4 is closely associated with early recurrence of hepatocellular carcinoma after curative resection, and its overexpression may contribute to early recurrence. (PMID:20423819)
  • Sox4 regulates melanoma cell migration and invasion in an NF-kappaB p50-dependent manner and may serve as a prognostic marker and potential therapeutic target for human melanoma. (PMID:20952589)
  • Sox4 acts as an agonist of Wnt signaling in cancer cells. (PMID:21165564)
  • SOX4 gene may have a role in bladder cancer tumorigenesis. (PMID:21680105)
  • our data demonstrate that the Sox4 C-terminal domain regulates polyubiquitin-independent proteasomal degradation of Sox4 that can be modulated by interaction with syntenin (PMID:21986941)
  • findings identify a functional role for SOX genes in SCLC, particularly for SOX4 and several novel targets defined in this study (PMID:22084397)
  • Junction plakoglobin (JUP) interacts with SOX4 in both the cytosol and the nucleus and the interaction between SOX4 and plakoglobin is significantly increased when prostate and breast cancer cells are stimulated with WNT3A. (PMID:22098624)
  • The SOX4-plakoglobin complex affected the expression of Wnt pathway target genes and SOX4 downstream targets, such as AXIN2, DICER1, and DHX9. (PMID:22098624)
  • may serve as a positive regulator of beta-catenin signaling through alteration in TCF4 expression during morular differentiation of endometrial carcinoma cells, leading to inhibition of cell proliferation (PMID:22231735)
  • Our data indicate for the first time that the over-expression of SOX4 in primary gallbladder carcinoma was significantly correlated with favorable clinicopathologic features (PMID:22510499)
  • demonstrate that SPARC expression suppressed irradiation induced SOX-4 mediated DNA repair (PMID:22542805)
  • Sox4 and CREB cooperate and contribute to increased proliferation of hematopoietic progenitor cells. (PMID:22627767)
  • Knockdown of Sox4 induces a major change in the expression pattern of miRNAs in melanoma cells, mainly due to reduced expression of Dicer. (PMID:22689055)
  • Our findings define an important function for SOX4 in the progression of breast cancer by orchestrating EMT, and they implicate this gene product as a marker of poor prognosis in this disease. (PMID:22787120)
  • SOX4 gene expression was strongly up-regulated in human Ph+ pre-BALL cells by tyrosine kinase inhibitor treatment.High levels of SOX4 expression in ALL cells at the time of diagnosis predicted poor outcome in a pediatric clinical trial (COG P9906). (PMID:23152540)
  • we disclose a miR-129-2(miR-335)/SOX4/Semaphorin-Plexin regulatory axis in the tumorigenesis of pancreatic cancer. (PMID:23251334)
  • overexpression of nuclear SOX4 was a clear prognostic marker for GC (P=0.004). Overexpression of nuclear SOX4 can be used as a marker to predict the outcome of patients with GC. (PMID:23285187)
  • TGF-b-mediated increased expression of SOX4 is required for the induction of a mesenchymal phenotype during EMT in human mammary epithelial cells (PMID:23301048)
  • study demonstrates the role and clinical relevance of miR-138 in ovarian cancer cell invasion and metastasis, providing a potential therapeutic strategy for suppression of ovarian cancer metastasis by targeting SOX4 and HIF-1alpha pathways (PMID:23389731)

Cross-species orthologs

8 orthologs

OrganismSymbolGene ID
danio_reriosox4aENSDARG00000004588
danio_reriosox4bENSDARG00000098834
mus_musculusSox4ENSMUSG00000076431
rattus_norvegicusSox4ENSRNOG00000065643
drosophila_melanogasterSox14FBGN0005612
drosophila_melanogasterSox21aFBGN0036411
drosophila_melanogasterSox102FFBGN0039938
caenorhabditis_elegansWBGENE00001182

Paralogs (20): SOX8 (ENSG00000005513), SOX30 (ENSG00000039600), SOX10 (ENSG00000100146), SOX6 (ENSG00000110693), SOX21 (ENSG00000125285), SOX9 (ENSG00000125398), SOX15 (ENSG00000129194), SOX5 (ENSG00000134532), SOX3 (ENSG00000134595), SOX13 (ENSG00000143842), SOX17 (ENSG00000164736), SOX14 (ENSG00000168875), SOX7 (ENSG00000171056), SOX11 (ENSG00000176887), SOX12 (ENSG00000177732), CFAP65 (ENSG00000181378), SOX2 (ENSG00000181449), SOX1 (ENSG00000182968), SRY (ENSG00000184895), SOX18 (ENSG00000203883)

Protein

Protein identifiers

Transcription factor SOX-4Q06945 (reviewed: Q06945)

All UniProt accessions (1): Q06945

UniProt curated annotations — full annotation on UniProt →

Function. Transcriptional activator that binds with high affinity to the T-cell enhancer motif 5’-AACAAAG-3’ motif. Required for IL17A-producing Vgamma2-positive gamma-delta T-cell maturation and development, via binding to regulator loci of RORC to modulate expression. Involved in skeletal myoblast differentiation by promoting gene expression of CALD1.

Subunit / interactions. Interacts with UBE2I. Interacts with HDAC1; interaction inhibits the transcriptional activator activity.

Subcellular location. Nucleus.

Tissue specificity. Testis, brain, and heart.

Post-translational modifications. Acetylation at Lys-95 by KAT5 promotes the transcription activator activity and is required during myoblast differentiation. Acetylation by KAT5 abolishes the interaction between SOX4 and HDAC1 and switches SOX4 into a transcriptional activator.

Disease relevance. Intellectual developmental disorder with speech delay and dysmorphic facies (IDDSDF) [MIM:618506] An autosomal dominant disorder characterized by mild to severe intellectual disability, global developmental delay, mild but distinct facial dysmorphism, fifth finger clinodactyly, and small stature. Hypotonia, ventricular septal defect, and spastic quadriparesis may also be present. The disease is caused by variants affecting the gene represented in this entry.

Domain organisation. The 9aaTAD motif is a transactivation domain present in a large number of yeast and animal transcription factors.

RefSeq proteins (1): NP_003098* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR009071HMG_box_domDomain
IPR017386SOX-12/11/4Family
IPR036910HMG_box_dom_sfHomologous_superfamily
IPR050140SRY-related_HMG-box_TF-likeFamily

Pfam: PF00505

UniProt features (25 total): compositionally biased region 10, sequence variant 4, region of interest 4, mutagenesis site 2, chain 1, DNA-binding region 1, modified residue 1, sequence conflict 1, short sequence motif 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q06945-F155.710.19

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (1): 95

Mutagenesis-validated functional residues (2):

PositionPhenotype
45does not affect acetylation by kat5.
95abolished acetylation by kat5.

Function

Pathways and Gene Ontology

Reactome pathways

4 pathways

IDPathway
R-HSA-3769402Deactivation of the beta-catenin transactivating complex
R-HSA-162582Signal Transduction
R-HSA-195721Signaling by WNT
R-HSA-201681TCF dependent signaling in response to WNT

MSigDB gene sets: 899 (showing top): GOBP_CARDIAC_CHAMBER_DEVELOPMENT, GOBP_SPINAL_CORD_DEVELOPMENT, AHRARNT_01, GOBP_REGULATION_OF_CELL_ACTIVATION, RNGTGGGC_UNKNOWN, GOBP_HEMATOPOIETIC_PROGENITOR_CELL_DIFFERENTIATION, TONKS_TARGETS_OF_RUNX1_RUNX1T1_FUSION_MONOCYTE_UP, MODULE_52, GOBP_EPITHELIUM_DEVELOPMENT, KOBAYASHI_EGFR_SIGNALING_24HR_UP, GOBP_SOMATIC_STEM_CELL_POPULATION_MAINTENANCE, GOBP_CARDIAC_SEPTUM_DEVELOPMENT, GOBP_POSITIVE_REGULATION_OF_HEMOPOIESIS, GOBP_VENTRICULAR_SEPTUM_MORPHOGENESIS, CCAWYNNGAAR_UNKNOWN

GO Biological Process (39): negative regulation of transcription by RNA polymerase II (GO:0000122), pro-B cell differentiation (GO:0002328), mitral valve morphogenesis (GO:0003183), cardiac right ventricle morphogenesis (GO:0003215), atrial septum primum morphogenesis (GO:0003289), noradrenergic neuron differentiation (GO:0003357), regulation of DNA-templated transcription (GO:0006355), nervous system development (GO:0007399), brain development (GO:0007420), heart development (GO:0007507), positive regulation of cell population proliferation (GO:0008284), gene expression (GO:0010467), glial cell proliferation (GO:0014009), spinal cord development (GO:0021510), glial cell development (GO:0021782), neuron differentiation (GO:0030182), T cell differentiation (GO:0030217), endocrine pancreas development (GO:0031018), positive regulation of insulin secretion (GO:0032024), somatic stem cell population maintenance (GO:0035019), ascending aorta morphogenesis (GO:0035910), glucose homeostasis (GO:0042593), positive regulation of apoptotic process (GO:0043065), regulation of DNA damage response, signal transduction by p53 class mediator (GO:0043516), positive regulation of gamma-delta T cell differentiation (GO:0045588), negative regulation of myoblast differentiation (GO:0045662), positive regulation of myoblast differentiation (GO:0045663), positive regulation of DNA-templated transcription (GO:0045893), positive regulation of transcription by RNA polymerase II (GO:0045944), sympathetic nervous system development (GO:0048485), camera-type eye morphogenesis (GO:0048593), protein stabilization (GO:0050821), ventricular septum morphogenesis (GO:0060412), mesenchyme development (GO:0060485), neuroepithelial cell differentiation (GO:0060563), kidney morphogenesis (GO:0060993), hematopoietic stem cell homeostasis (GO:0061484), cellular response to glucose stimulus (GO:0071333), positive regulation of canonical Wnt signaling pathway (GO:0090263)

GO Molecular Function (9): transcription cis-regulatory region binding (GO:0000976), RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA-binding transcription activator activity, RNA polymerase II-specific (GO:0001228), DNA-binding transcription factor activity (GO:0003700), miRNA binding (GO:0035198), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677), protein binding (GO:0005515)

GO Cellular Component (6): chromatin (GO:0000785), nucleus (GO:0005634), nucleoplasm (GO:0005654), transcription regulator complex (GO:0005667), cytoplasm (GO:0005737), mitochondrion (GO:0005739)

Reactome top-level categories

Rollup of top-3 pathways:

CategoryPathways
TCF dependent signaling in response to WNT1
Signal Transduction1
Signaling by WNT1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA polymerase II transcription regulatory region sequence-specific DNA binding3
cellular anatomical structure3
regulation of transcription by RNA polymerase II2
central nervous system development2
animal organ development2
cell population proliferation2
anatomical structure development2
intracellular membrane-bounded organelle2
transcription by RNA polymerase II1
negative regulation of DNA-templated transcription1
lymphoid progenitor cell differentiation1
mitral valve development1
atrioventricular valve morphogenesis1
cardiac ventricle morphogenesis1
septum primum development1
atrial septum morphogenesis1
neuron differentiation1
DNA-templated transcription1
regulation of gene expression1
regulation of RNA biosynthetic process1
system development1
head development1
circulatory system development1
regulation of cell population proliferation1
positive regulation of cellular process1
macromolecule biosynthetic process1
gliogenesis1
glial cell differentiation1
cell development1
cell differentiation1
generation of neurons1
lymphocyte differentiation1
T cell activation1
pancreas development1
endocrine system development1
insulin secretion1
positive regulation of protein secretion1
regulation of insulin secretion1
positive regulation of peptide hormone secretion1
stem cell population maintenance1

Protein interactions and networks

STRING

2730 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SOX4POU5F1P31359914
SOX4EZH2Q15910871
SOX4SDCBPO00560803
SOX4CTNNB1P35222753
SOX4IL5RAQ01344707
SOX4STAT3P40763704
SOX4MYCP01106678
SOX4GATA3P23771655
SOX4UBE2IP50550645
SOX4LEF1Q9UJU2640
SOX4HMGB4Q8WW32633
SOX4SNAI2O43623628
SOX4ASCL1P50553610
SOX4HNF4AP41235602
SOX4SMAD4Q13485598

IntAct

31 interactions, top by confidence:

ABTypeScore
SOX4TP53psi-mi:“MI:0915”(physical association)0.590
TP53SOX4psi-mi:“MI:0915”(physical association)0.590
CREBBPSOX4psi-mi:“MI:0915”(physical association)0.400
EP300SOX4psi-mi:“MI:0915”(physical association)0.400
SOX4ACTBL2psi-mi:“MI:0915”(physical association)0.400
SOX4GATA3psi-mi:“MI:0915”(physical association)0.400
SOX4UBE2Mpsi-mi:“MI:0914”(association)0.350
MYO1BZMPSTE24psi-mi:“MI:0914”(association)0.350
SOX4SEC16Apsi-mi:“MI:2364”(proximity)0.270
FBXW7SOX4psi-mi:“MI:2364”(proximity)0.270
SMAD4SOX4psi-mi:“MI:2364”(proximity)0.270
SOX4SMARCA4psi-mi:“MI:2364”(proximity)0.270
SMARCA4SOX4psi-mi:“MI:2364”(proximity)0.270
SOX4PTPN11psi-mi:“MI:2364”(proximity)0.270
SPAG9SOX4psi-mi:“MI:0915”(physical association)0.000
MCM4SOX4psi-mi:“MI:0915”(physical association)0.000
MCM2SOX4psi-mi:“MI:0915”(physical association)0.000
MCM6SOX4psi-mi:“MI:0915”(physical association)0.000
EPS15L1SOX4psi-mi:“MI:0915”(physical association)0.000
USP15SOX4psi-mi:“MI:0915”(physical association)0.000

BioGRID (85): TAF5 (Affinity Capture-MS), TNNI3 (Two-hybrid), SCNN1G (Two-hybrid), MINPP1 (Two-hybrid), IGF1 (Two-hybrid), PLD3 (Two-hybrid), CAPN15 (Two-hybrid), FOXH1 (Two-hybrid), RGSL1 (Two-hybrid), LMNB2 (Two-hybrid), CTR9 (Two-hybrid), ITIH1 (Two-hybrid), APOC3 (Two-hybrid), EDF1 (Two-hybrid), NANOGNB (Two-hybrid)

ESM2 similar proteins: A1Z6W3, A7X8B3, A7X8B7, A7X8C4, B0WAQ0, O97960, P06401, P08155, P09631, P0C1G9, P15619, P23949, P25172, P35716, P40650, P41894, P47974, P48435, P57073, P57074, P78415, P81067, P84550, P84551, Q04886, Q05A36, Q06831, Q06945, Q0V9X5, Q14774, Q15464, Q292U2, Q292U5, Q297V5, Q2VWA4, Q5IS79, Q5U5Q3, Q66JF1, Q6PD21, Q6QT55

Diamond homologs: A0A0G2JTZ2, A2TED3, A5D8R3, B1H349, B3DLD3, B3DM43, F1M8W4, O42342, O42601, P0C1G9, P35710, P35711, P35712, P35713, P35716, P36389, P36390, P36393, P36394, P36396, P40645, P40646, P40647, P40649, P40650, P40656, P40657, P43680, P47792, P48433, P48435, Q03255, Q03257, Q04891, Q05738, Q06831, Q06945, Q20201, Q23045, Q27949

SIGNOR signaling

10 interactions.

AEffectBMechanism
CEBPAdown-regulatesSOX4“transcriptional regulation”
SOX4down-regulatesDifferentiation
SOX4“up-regulates activity”CTNNB1binding
SOX4“up-regulates quantity”DICER1“transcriptional regulation”
SOX4up-regulatesProliferation
miRNA-214-5p“down-regulates quantity by destabilization”SOX4“post transcriptional regulation”
IL5RA“up-regulates activity”SOX4binding
SDCBP“up-regulates activity”SOX4binding
FOXC1“up-regulates quantity by expression”SOX4“transcriptional regulation”

Disease & clinical

Clinical variants and AI predictions

ClinVar

250 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic8
Likely pathogenic12
Uncertain significance173
Likely benign33
Benign7

Top pathogenic / likely-pathogenic (20)

Variant IDHGVSClassification
1686223NM_003107.3(SOX4):c.975C>G (p.Tyr325Ter)Pathogenic
1691155NM_003107.3(SOX4):c.289T>G (p.Trp97Gly)Pathogenic
2443795NM_003107.3(SOX4):c.1333G>T (p.Glu445Ter)Pathogenic
2443796NM_003107.3(SOX4):c.130_133delinsCGCT (p.Gly44_Lys45delinsArgTer)Pathogenic
3254650NM_003107.3(SOX4):c.386_390del (p.Arg129fs)Pathogenic
3340821NM_003107.3(SOX4):c.291G>C (p.Trp97Cys)Pathogenic
4074737NM_003107.3(SOX4):c.1274C>A (p.Ser425Ter)Pathogenic
4795244NM_003107.3(SOX4):c.468_469del (p.Asp156fs)Pathogenic
2580163NM_003107.3(SOX4):c.281G>A (p.Gly94Asp)Likely pathogenic
2664011NM_003107.3(SOX4):c.373_378del (p.Asp125_Tyr126del)Likely pathogenic
3067564NM_003107.3(SOX4):c.998C>A (p.Ser333Ter)Likely pathogenic
3254652NM_003107.3(SOX4):c.335C>A (p.Ala112Glu)Likely pathogenic
3347110NM_003107.3(SOX4):c.744_745delinsA (p.Leu250fs)Likely pathogenic
3391688NM_003107.3(SOX4):c.280G>A (p.Gly94Ser)Likely pathogenic
3392529NM_003107.3(SOX4):c.200T>C (p.Met67Thr)Likely pathogenic
3629480NM_003107.3(SOX4):c.185C>T (p.Pro62Leu)Likely pathogenic
3774374NM_003107.3(SOX4):c.185C>G (p.Pro62Arg)Likely pathogenic
4279018NM_003107.3(SOX4):c.251T>G (p.Met84Arg)Likely pathogenic
4813418NM_003107.3(SOX4):c.332dup (p.Ala112fs)Likely pathogenic
548142NM_003107.3(SOX4):c.198C>A (p.Phe66Leu)Likely pathogenic

SpliceAI

62 predictions. Top by Δscore:

VariantEffectΔscore
6:21594038:GGA:Gacceptor_gain0.8000
6:21594033:TTGTA:Tacceptor_loss0.7600
6:21594034:TGTAG:Tacceptor_loss0.7600
6:21594035:GTAGG:Gacceptor_loss0.7600
6:21594036:TA:Tacceptor_loss0.7600
6:21594037:A:ACacceptor_loss0.7600
6:21594029:T:Aacceptor_loss0.7400
6:21594038:GGAGA:Gacceptor_gain0.7300
6:21594024:T:Aacceptor_loss0.7200
6:21594032:ATTGT:Aacceptor_loss0.7200
6:21594935:TG:Tdonor_gain0.6900
6:21594037:A:AGacceptor_gain0.6700
6:21594038:G:GGacceptor_gain0.6700
6:21595709:C:CAacceptor_gain0.6500
6:21595710:G:Aacceptor_gain0.6500
6:21594036:TAGG:Tacceptor_gain0.6300
6:21594037:AGGA:Aacceptor_gain0.6300
6:21594038:GGAG:Gacceptor_gain0.6300
6:21594032:A:AGacceptor_gain0.6100
6:21594037:AG:Aacceptor_gain0.6100
6:21594038:GG:Gacceptor_gain0.6100
6:21594025:GCTTT:Gacceptor_loss0.6000
6:21594026:CTTT:Cacceptor_loss0.5800
6:21594030:GCATT:Gacceptor_loss0.5800
6:21598072:A:Gacceptor_gain0.5800
6:21596448:T:Aacceptor_gain0.5700
6:21594936:GA:Gdonor_gain0.5200
6:21594937:AA:Adonor_gain0.5200
6:21594032:ATT:Aacceptor_gain0.4900
6:21594033:T:Gacceptor_gain0.4900

AlphaMissense

3041 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:21594685:T:AW51R1.000
6:21594685:T:CW51R1.000
6:21594687:G:CW51C1.000
6:21594687:G:TW51C1.000
6:21594688:T:AC52S1.000
6:21594688:T:CC52R1.000
6:21594689:G:AC52Y1.000
6:21594689:G:CC52S1.000
6:21594690:C:GC52W1.000
6:21594703:G:TG57W1.000
6:21594706:C:GH58D1.000
6:21594710:T:AI59N1.000
6:21594710:T:CI59T1.000
6:21594710:T:GI59S1.000
6:21594712:A:GK60E1.000
6:21594713:A:TK60M1.000
6:21594714:G:CK60N1.000
6:21594714:G:TK60N1.000
6:21594715:C:GR61G1.000
6:21594716:G:AR61Q1.000
6:21594716:G:TR61L1.000
6:21594718:C:AP62T1.000
6:21594718:C:GP62A1.000
6:21594718:C:TP62S1.000
6:21594719:C:AP62H1.000
6:21594719:C:GP62R1.000
6:21594719:C:TP62L1.000
6:21594722:T:AM63K1.000
6:21594722:T:CM63T1.000
6:21594722:T:GM63R1.000

dbSNP variants (sampled 300 via entrez): RS1000016194 (6:21596098 G>A,C,T), RS1000391858 (6:21594217 T>C), RS1000573405 (6:21598431 G>A), RS1000620262 (6:21594515 G>A,T), RS1000686943 (6:21593631 T>C), RS1001222943 (6:21598227 A>G,T), RS1001569510 (6:21597901 G>A,T), RS1001800156 (6:21595251 C>A,G,T), RS1002886917 (6:21596091 T>C), RS1003252136 (6:21592529 C>T), RS1003814845 (6:21596675 A>AGG), RS1003968598 (6:21596678 C>T), RS1004467409 (6:21596706 G>T), RS1004513030 (6:21598703 T>G), RS1004564341 (6:21592238 G>A)

Disease associations

OMIM: gene MIM:184430 | disease phenotypes: MIM:618506, MIM:135900, MIM:609943, MIM:614562

GenCC curated gene-disease

DiseaseClassificationInheritance
Coffin-Siris syndrome 10StrongAutosomal dominant
Coffin-Siris syndromeSupportiveAutosomal dominant
atrial fibrillationLimitedAutosomal dominant

Mondo (7): Coffin-Siris syndrome 10 (MONDO:0032791), intellectual disability (MONDO:0001071), disorder of sexual differentiation (MONDO:0002145), neurodevelopmental disorder (MONDO:0700092), Coffin-Siris syndrome 1 (MONDO:0007617), atrial fibrillation (MONDO:0004981), Coffin-Siris syndrome (MONDO:0015452)

Orphanet (3): Difference of sex development (Orphanet:90771), Coffin-Siris syndrome (Orphanet:1465), NON RARE IN EUROPE: Unexplained intellectual disability (Orphanet:319658)

HPO phenotypes

85 total (30 of 85 shown, HPO-id order):

HPOTerm
HP:0000006Autosomal dominant inheritance
HP:0000028Cryptorchidism
HP:0000047Hypospadias
HP:0000085Horseshoe kidney
HP:0000119Abnormality of the genitourinary system
HP:0000154Wide mouth
HP:0000179Thick lower lip vermilion
HP:0000219Thin upper lip vermilion
HP:0000243Trigonocephaly
HP:0000252Microcephaly
HP:0000280Coarse facial features
HP:0000286Epicanthus
HP:0000289Broad philtrum
HP:0000294Low anterior hairline
HP:0000358Posteriorly rotated ears
HP:0000365Hearing impairment
HP:0000369Low-set ears
HP:0000455Broad nasal tip
HP:0000463Anteverted nares
HP:0000486Strabismus
HP:0000505Visual impairment
HP:0000508Ptosis
HP:0000545Myopia
HP:0000574Thick eyebrow
HP:0000684Delayed eruption of teeth
HP:0000696Delayed eruption of permanent teeth
HP:0000708Atypical behavior
HP:0000718Aggressive behavior
HP:0000729Autistic behavior
HP:0000750Delayed speech and language development

GWAS associations

14 associations (top):

StudyTraitp-value
GCST000960_16Cardiac hypertrophy4.000000e-06
GCST001050_5Bone mineral density4.000000e-06
GCST001784_13Pulmonary function (smoking interaction)2.000000e-07
GCST001784_35Pulmonary function (smoking interaction)2.000000e-07
GCST003436_3Endometrial cancer2.000000e-08
GCST006464_8Endometrial cancer4.000000e-16
GCST006465_15Endometrial cancer (endometrioid histology)6.000000e-13
GCST009391_1002Metabolite levels8.000000e-06
GCST009391_2122Metabolite levels3.000000e-06
GCST90011898_138Alanine aminotransferase levels2.000000e-11
GCST90011899_19Aspartate aminotransferase levels4.000000e-10
GCST90013663_58Alanine aminotransferase levels1.000000e-12
GCST90013664_88Aspartate aminotransferase levels1.000000e-10
GCST90027899_4Eosinophilic esophagitis3.000000e-08

EFO canonical traits (6, from GWAS)

EFO IDTrait name
EFO:0002503cardiac hypertrophy
EFO:0003892pulmonary function measurement
EFO:0004713FEV/FVC ratio
EFO:0010358lysophosphatidylcholine 16:1 measurement
EFO:0010382phosphatidylcholine 36:4 measurement
EFO:0004736aspartate aminotransferase measurement

MeSH disease descriptors (5)

DescriptorNameTree numbers
D001281Atrial FibrillationC14.280.067.198; C23.550.073.198
D012734Disorders of Sex DevelopmentC12.050.351.875.253; C12.200.706.316; C12.800.316; C16.131.939.316; C19.391.119
D008607Intellectual DisabilityC10.597.606.360; C23.888.592.604.646; F01.700.687; F03.625.539
D065886Neurodevelopmental DisordersF03.625
C536436Coffin-Siris syndrome (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

96 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, affects expression, increases expression, increases methylation, affects reaction (+1 more)7
Benzo(a)pyrenedecreases methylation, increases expression5
sodium arsenitedecreases expression4
Estradioldecreases expression, increases expression, affects cotreatment4
Tetrachlorodibenzodioxindecreases expression, increases reaction, affects expression, affects binding, increases expression4
bisphenol Aaffects expression, decreases expression3
Ethinyl Estradiolaffects expression, decreases expression3
Progesteroneaffects cotreatment, decreases expression3
Cyclosporinedecreases expression, increases expression3
trichostatin Aaffects cotreatment, decreases expression2
entinostatdecreases expression, affects cotreatment2
Tretinoindecreases expression, increases expression2
Aflatoxin B1affects expression, decreases methylation2
Genisteindecreases expression2
bisphenol Fdecreases expression1
dicrotophosdecreases expression1
testosterone enanthateaffects expression1
methyleugenolincreases expression1
pirinixic acidaffects binding, decreases expression, increases activity1
deoxynivalenolincreases expression1
sulforaphanedecreases expression1
tris(1,3-dichloro-2-propyl)phosphateincreases expression1
cobaltous chloridedecreases expression1
potassium chromate(VI)decreases expression1
ferrous chlorideincreases expression1
2,3-bis(3’-hydroxybenzyl)butyrolactoneaffects cotreatment, decreases expression1
hydroquinonedecreases expression1
beta-methylcholineaffects expression1
celastroldecreases expression1
6-methyl-1,3,8-trichlorodibenzofuranaffects binding, decreases expression, increases reaction1

Clinical trials (associated diseases)

599 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00032591PHASE4COMPLETEDThe Home INR Study
NCT00127712PHASE4COMPLETEDPrevention of Atrial Fibrillation Following Noncardiac Thoracic Surgery
NCT00157781PHASE4COMPLETEDLEAF - Low Energy In Atrial Fibrillation
NCT00170313PHASE4TERMINATEDCORE: Study to Evaluate the Conducted AF-Response-Algorithm in Patients Suffering From Heart Failure and Atrial Fibrillation
NCT00189319PHASE4COMPLETEDTo Evaluate the Impact of Oral Flecainide on Quality of Life in Patients With Paroxysmal Atrial Fibrillation
NCT00196144PHASE4COMPLETEDFFS - Far Field Sensing Test Study in Cardiac Dual Chamber Pacemakers
NCT00196157PHASE4UNKNOWNLine Versus Spot Ablation in Persistent Atrial Fibrillation
NCT00196183PHASE4COMPLETEDTrigger- vs. Substrate-Ablation for Paroxysmal Atrial Fibrillation
NCT00196209PHASE4UNKNOWNCardioversion vs. Catheter Ablation for Persistent Atrial Fibrillation
NCT00227344PHASE4TERMINATEDCACAF2 Study: Catheter Ablation for Cure of Atrial Fibrillation
NCT00232219PHASE4COMPLETEDUse of Fish Oils to Reduce Recurrence of Atrial Fibrillation Following DC Cardioversion
NCT00232232PHASE4COMPLETEDUse of Fish Oils to Prevent Atrial Mechanical Stunning and Atrial Remodeling Due to Atrial Arrhythmia
NCT00232245PHASE4COMPLETEDUse of Fish Oils to Reduce the Frequency and Duration of Episodes of Atrial Fibrillation in Patients With Paroxysmal Atrial Fibrillation.
NCT00239226PHASE4COMPLETEDElectrophysiologically Guided PAcing Site Selection Study
NCT00247780PHASE4COMPLETEDCavotricuspid Isthmusblock and Circumferential Pulmonary Vein Isolation in Patients With Atrial Fibrillation
NCT00256152PHASE4COMPLETEDAsymptomatic Atrial Fibrillation and Stroke Evaluation in Pacemaker Patients and the Atrial Fibrillation Reduction Atrial Pacing Trial
NCT00262119PHASE4COMPLETEDMINERVA: MINimizE Right Ventricular Pacing to Prevent Atrial Fibrillation and Heart Failure
NCT00287209PHASE4COMPLETEDReduction of Atrial Fibrillation Study in Patients Undergoing Coronary Artery Bypass Grafting. (RASCABG 1 Study)
NCT00289042PHASE4COMPLETEDAssessment of Cardioversion Using Transesophageal Echocardiography II (ACUTE II)
NCT00313443PHASE4COMPLETEDConcentrations of Amiodarone in Fat Tissue During Chronic Treatment
NCT00340314PHASE4COMPLETEDA Trial of Circumferential Pulmonary Vein Ablation (CPVA) Versus Antiarrhythmic Drug Therapy in for Paroxysmal Atrial Fibrillation (AF)
NCT00343499PHASE4TERMINATEDThe Use of DIOVAN to Reduce Post-Cardioversion Recurrence of Atrial Fibrillation Trial (the DRAFT Trial)
NCT00408473PHASE4TERMINATEDComparative Study of Flecainide CR and Placebo in the Early Treatment of Atrial Fibrillation.
NCT00420017PHASE4COMPLETEDPrevention of Atrial Fibrillation Following Esophagectomy
NCT00438113PHASE4COMPLETEDAtrial Substrate Modification With Aggressive Blood Pressure Lowering to Prevent AF
NCT00446966PHASE4COMPLETEDFish Oil for Reduction of Atrial Fibrillation After Cardiac Surgery
NCT00449410PHASE4COMPLETEDSilent Cerebrovascular Lesion and Cognitive Decline Prevention by Cholesterol Lowering in Elderly AF Patients
NCT00466973PHASE4WITHDRAWNAtrial Fibrillation Ablation Device Comparison Study
NCT00511173PHASE4COMPLETEDComparison of Warfarin Dosing Using Decision Model Versus Pharmacogenetic Algorithm
NCT00512915PHASE4COMPLETEDAvoid FFS - Use of the Atrial Pacemaker Lead 1699 With Very Short Tip Ring Spacing to Avoid Far Field Sensing
NCT00552084PHASE4COMPLETEDEvaluating the Effectiveness of Fish Oil Supplements at Reducing the Recurrence of Atrial Fibrillation
NCT00559988PHASE4TERMINATEDCombined Use of BIOTRONIK Home Monitoring and Predefined Anticoagulation to Reduce Stroke Risk
NCT00579098PHASE4COMPLETEDThe Use of Statins Following a Left Atrial Catheter Ablation Procedure to Prevent Atrial Fibrillation
NCT00586287PHASE4COMPLETEDStudy to Find Out the Appropriate Initial Dose of the Anticoagulant Drug Phenprocoumon
NCT00597077PHASE4COMPLETEDAtrial Fibrillation and Congestive Heart Failure Trial
NCT00603317PHASE4COMPLETEDPharmacodynamic Drug Interaction Between Warfarin and Amoxicillin-clavulanic Acid
NCT00605748PHASE4UNKNOWNPulmonary Vein (PV) -Isolation: Arrhythmogenic Vein(s) Versus All Veins
NCT00643188PHASE4COMPLETEDCatheter Ablation vs. Standard Conventional Treatment in Patients With LV Dysfunction and AF
NCT00678340PHASE4COMPLETEDRandomized Trial of Two Ablation Catheters in Paroxysmal Atrial Fibrillation
NCT00680927PHASE4COMPLETEDReveal® XT Performance Trial (XPECT)