SOX5

gene
On this page

Also known as L-SOX5MGC35153

Summary

SOX5 (SRY-box transcription factor 5, HGNC:11201) is a protein-coding gene on chromosome 12p12.1, encoding Transcription factor SOX-5 (P35711). Transcription factor involved in chondrocytes differentiation and cartilage formation. It is haploinsufficient (ClinGen: sufficient evidence).

This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. The encoded protein may play a role in chondrogenesis. A pseudogene of this gene is located on chromosome 8. Multiple transcript variants encoding distinct isoforms have been identified for this gene.

Source: NCBI Gene 6660 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): Lamb-Shaffer syndrome (Definitive, ClinGen) — +1 more curated relationship
  • GWAS associations: 52
  • Clinical variants (ClinVar): 357 total — 65 pathogenic, 31 likely-pathogenic
  • Phenotypes (HPO): 69
  • Dosage sensitivity (ClinGen): haploinsufficiency sufficient evidence, triplosensitivity no evidence
  • Transcription factor: yes — 19 downstream targets (CollecTRI)
  • MANE Select transcript: NM_006940

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:11201
Approved symbolSOX5
NameSRY-box transcription factor 5
Location12p12.1
Locus typegene with protein product
StatusApproved
AliasesL-SOX5, MGC35153
Ensembl geneENSG00000134532
Ensembl biotypeprotein_coding
OMIM604975
Entrez6660

Gene structure

Transcript identifiers

Ensembl transcripts: 26 — 15 protein_coding, 7 protein_coding_CDS_not_defined, 3 nonsense_mediated_decay, 1 retained_intron

ENST00000367206, ENST00000396007, ENST00000429944, ENST00000441133, ENST00000446891, ENST00000451604, ENST00000456299, ENST00000535530, ENST00000536629, ENST00000536729, ENST00000536850, ENST00000536911, ENST00000537393, ENST00000538083, ENST00000538905, ENST00000542241, ENST00000545921, ENST00000646273, ENST00000704296, ENST00000704297, ENST00000704298, ENST00000704299, ENST00000704300, ENST00000900854, ENST00000900855, ENST00000943211

RefSeq mRNA: 6 — MANE Select: NM_006940 NM_001261414, NM_001261415, NM_001330785, NM_006940, NM_152989, NM_178010

CCDS: CCDS41761, CCDS58216, CCDS58217, CCDS81672, CCDS8699

Canonical transcript exons

ENST00000451604 — 15 exons

ExonStartEnd
ENSE000009155752354631623546424
ENSE000010981202353645323536669
ENSE000010981232356325823563403
ENSE000010981242374086723741039
ENSE000011462702384598323846193
ENSE000011959182357566123575838
ENSE000012277452375563823755724
ENSE000022225972352950423534522
ENSE000035238032389579323896024
ENSE000036217792394956423949670
ENSE000036509472364081223640897
ENSE000036544792360438723604533
ENSE000036633082373468423734752
ENSE000036674692354321123543384
ENSE000036929242366544423665564

Expression profiles

Bgee: expression breadth ubiquitous, 221 present calls, max score 98.41.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 7.8897 / max 264.7647, expressed in 972 samples.

FANTOM5 promoters (15 alternative TSS)

Promoter IDTPM avgSamples expressed
1301061.8784626
1300851.6001300
1301081.0384522
1300870.9242287
1301070.7309335
1301090.6782376
1300880.2617120
1301100.2572131
1301050.174183
1301040.151072

Top tissues by expression

283 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
cortical plateUBERON:000534398.41gold quality
calcaneal tendonUBERON:000370198.36gold quality
synovial jointUBERON:000221796.29gold quality
sural nerveUBERON:001548895.61gold quality
tibiaUBERON:000097995.22gold quality
ganglionic eminenceUBERON:000402393.93gold quality
ventricular zoneUBERON:000305393.00gold quality
cartilage tissueUBERON:000241892.54gold quality
tendonUBERON:000004390.48gold quality
left testisUBERON:000453388.97gold quality
right testisUBERON:000453487.80gold quality
testisUBERON:000047387.38gold quality
upper leg skinUBERON:000426286.47gold quality
colonic epitheliumUBERON:000039785.59gold quality
corpus callosumUBERON:000233684.21gold quality
pigmented layer of retinaUBERON:000178283.99gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047383.89gold quality
entorhinal cortexUBERON:000272883.64gold quality
medial globus pallidusUBERON:000247783.41gold quality
tendon of biceps brachiiUBERON:000818882.98gold quality
skin of hipUBERON:000155482.62gold quality
endothelial cellCL:000011581.48gold quality
layer of synovial tissueUBERON:000761681.47gold quality
globus pallidusUBERON:000187580.94gold quality
liverUBERON:000210780.88gold quality
popliteal arteryUBERON:000225080.81gold quality
tibial arteryUBERON:000761080.78gold quality
postcentral gyrusUBERON:000258180.61gold quality
spermCL:000001980.45silver quality
endometriumUBERON:000129580.11gold quality

Single-cell (SCXA)

Detected in 9 experiment(s), a significant marker in 8.

ExperimentMarker?Max mean expression
E-GEOD-75140yes1753.96
E-CURD-119yes1588.61
E-HCAD-5yes632.48
E-MTAB-10485yes447.19
E-HCAD-35yes101.27
E-HCAD-25yes23.23
E-ANND-3yes7.56
E-GEOD-93593yes6.97
E-GEOD-131882no1819.63

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

19 targets.

TargetRegulation
ACANActivation
ADAM2
CATSPER1Unknown
CDH1Activation
CDH2Activation
COL2A1Activation
COMPActivation
CYP19A1
DMRT1Repression
FN1Activation
MATN1Unknown
NFKBIBActivation
PTHLHActivation
RUNX2Activation
SPAG6Unknown
SPARCUnknown
ST6GAL2Activation
TNFSF11Activation
TWIST1Unknown

Upstream regulators (CollecTRI, top): BARX2, KAT5, MKX, RUNX1, SNAI2, TCF7L2, WWTR1, YAP1

miRNA regulators (miRDB)

151 targeting SOX5, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6873-3P100.0071.422626
HSA-MIR-7110-3P100.0073.182486
HSA-MIR-8485100.0077.574731
HSA-MIR-4668-3P100.0068.742635
HSA-MIR-1252-5P100.0069.802774
HSA-MIR-366299.9973.825684
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-453499.9966.581907
HSA-MIR-513B-5P99.9969.962150
HSA-MIR-499A-5P99.9870.791323
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-MIR-1213699.9872.815713
HSA-MIR-314899.9775.066478
HSA-MIR-6793-5P99.9765.95758
HSA-MIR-548AT-5P99.9670.832666
HSA-MIR-570-3P99.9672.414910
HSA-MIR-55999.9572.283609
HSA-MIR-548AB99.9571.313488
HSA-MIR-96-5P99.9572.802140
HSA-MIR-548A-5P99.9471.273482
HSA-MIR-548AD-5P99.9471.233502
HSA-MIR-548AE-5P99.9471.233502
HSA-MIR-548AK99.9471.243488
HSA-MIR-548AM-5P99.9471.243488
HSA-MIR-548AP-5P99.9471.143489
HSA-MIR-548AQ-5P99.9471.343426
HSA-MIR-548AR-5P99.9471.283515
HSA-MIR-548AS-5P99.9471.223482
HSA-MIR-548AU-5P99.9471.243488

Functional genomics

ClinGen dosage: haploinsufficiency 3 (sufficient evidence), triplosensitivity 0 (no evidence). ClinGen Gene Dosage Map

Literature-anchored findings (GeneRIF, showing 40)

  • DAD-R, SOX5, and EKI1 map within region of chromosome 12p whose duplication is related to reduced apoptosis in human testicular seminomas. (PMID:11912161)
  • SOX5 is aberrantly expressed in glioma (PMID:17230535)
  • SOX5 is upregulated by promoter swapping with the P2RY8 gene in primary splenic follicular lymphoma (PMID:17554380)
  • studies of SOX5 in cell lines, xenografts and human prostate specimens, at both the RNA and protein levels, found overexpression of the gene in tumors. This overexpression was then subsequently found by FISH to be caused by amplification of the region (PMID:19173284)
  • Sox5 may suppress the oncogenic effects of PDGFB signaling during glioma development by regulating p27(Kip1) in a p19(Arf)-dependent manner, leading to acute cellular senescence. (PMID:19219070)
  • SPAG6 is a S-SOX5 target gene, indicating a key role for S-SOX5 in the formation and function of motile cilia. (PMID:20668334)
  • identified SOX5 and SOX6 as the first two SHOX-interacting proteins and have shown that this interaction regulates aggrecan expression, an essential factor in chondrogenesis and skeletal development. (PMID:21262861)
  • Genetic variation in the transcription factor SOX5 is associated with COPD susceptibility (PMID:21330457)
  • SOX trio gene and protein decreased with advancement of osteoarthritis in human articular cartilage. (PMID:21728837)
  • Haploinsufficiency of SOX5 at 12p12.1 is associated with developmental delays with prominent language delay, behavior problems, and mild dysmorphic features (PMID:22290657)
  • MiR-194 regulates chondrogenic differentiation of adipose-derived stem cells by targeting Sox5 (PMID:22396742)
  • L-Sox5 and Sox6 proteins enhance chondrogenic miR-140 microRNA expression by strengthening dimeric Sox9 activity (PMID:22547066)
  • Our findings indicate that haploinsufficiency of SOX5 is a cause of intellectual disability without any striking physical anomalies. (PMID:23220431)
  • Two cases of 12p12.1 deletion involving SOX5 present with dysmorphic features and developmental delay. (PMID:23498568)
  • Sox5 regulates the proliferation of malignant B cells. (PMID:24418753)
  • depletion of Sox5 in breast cancer cells caused a dramatic decrease in Twist1 and chromosome immunoprecipitation assay showed that Sox5 can bind directly to the Twist1 promoter, suggesting that Sox5 transactivates Twist1 expression (PMID:24607904)
  • detection of SOX5 polymorphism in nonobstructive azoospermia contributing to predicting males at high risk of disease in Han Chinese population (PMID:24648396)
  • Identified two new splice variants of SOX5 in human B cells, encoding the known L-SOX5B isoform and a new shorter isoform L-SOX5F. SOX5 limits the proliferative capacity of human primary B lymphocytes and potentially affects the differentiation of human B cells during the germinal center responses. (PMID:24945754)
  • These findings describe for the first time a functional role of SOX5 during late B cell development reducing the proliferative capacity and thus potentially affecting the differentiation of B cells. (PMID:24945754)
  • Sox5 and Sox9 cause a significant increase in transactivation of the Catsper1 promoter. (PMID:25101494)
  • Sox5-PAX5 fusion transcript is associated with B-cell precursor acute lymphoblastic leukemia. (PMID:25304615)
  • High Sox5 expression is associated with pituitary tumor. (PMID:25305447)
  • Our results indicate for the first time that SOX5 is a novel regulator of epithelial-mesenchymal transition in hepatocellular carcinoma (PMID:25572815)
  • The results of the present study suggest a role for SOX5 in the molecular pathogenesis of FL. (PMID:26115875)
  • SOX5 has a strong inhibitory effect on MITF expression and seems to have a decisive clinical impact on melanoma during tumor progression. (PMID:26927636)
  • Results show that SOX5 expression levels is increased in the synovium of patients with rheumatoid arthritis (RA). Its overexpression regulates the expression of RANKL in RA synovial fibroblasts. (PMID:27550416)
  • Short-SOX5 regulates transcription of human SPAG16L gene via directly binding to the promoter of SPAG16L. (PMID:28137312)
  • Collectively, these findings indicate that SOX5 is a novel candidate gene for Alzheimer’s disease(AD) with an important role in neuronal function. The genetic findings warrant further studies to identify and characterize SOX5 variants that confer risk for AD, amyotrophic lateral sclerosis and intellectual disability. (PMID:28186563)
  • SOX5 promoted epithelial-mesenchymal transition (EMT) by regulation of Snail. (PMID:28365963)
  • results suggest that lnc-Sox5, which was stabilized by HuR, could regulate carcinogenesis of tongue cancer and may serve as a predicted target for tongue carcinoma therapies (PMID:28371600)
  • Sox5 is a previously unrecognized regulator of beta-cell gene expression and secretory function (PMID:28585545)
  • SOX5, candidate gene, associated with clinical features of Depression (e.g., earlier age at onset and recurrent and more severe forms of Depression). Gene expression patterns in the prefrontal and anterior cingulate cortex most closely matched the genetic findings. (PMID:28969442)
  • Likely pathogenic variants were identified in SOX5 gene, not previously associated with epilepsy, and UBA5, a recently associated with epilepsy gene. (PMID:29286531)
  • SOX5 overexpression increased Kruppel-like factor 4 (KLF4) gene expression, which was decreased by SOX5 silencing. KLF4 knockdown abrogated the suppressive effect of SOX5 overexpression on osteogenic differentiation of hMSCs. (PMID:29890823)
  • First reported meta-analysis of genome-wide association study for chronic back pain (CBP) in adults of European ancestry identified novel genetic associations with CBP at SOX5, CCDC26/GSDMC, and DCC.[meta-analysis] (PMID:30261039)
  • SOX5 is a direct target of miR-497-5P in the non-small-cell lung cancer cells. (PMID:30816573)
  • The expression levels of lncRNA ILF3-AS1 was up-regulated in osteosarcoma tissues and cell lines, and was modulated by SP1. LncRNA ILF3-AS1 promoted osteosarcoma cells proliferation and metastasis via modulation of miR-212-SOX5 pathway. (PMID:30819403)
  • Collectively, the rs2477686 in PEX10 , rs6080550 in SIRPA-SIRPG, and rs10842262 in SOX5 gene may indeed be the genetic risk factors for nonobstructive azoospermia (NOA), which requires further investigation using larger independent sets of samples in different ethnic populations. (PMID:30863997)
  • CircDOCK1 affected the progression of BC via modulation of circDOCK1/hsa-miR-132-3p/Sox5 pathway. (PMID:30983072)
  • The results revealed low expression of miR539 and high expression of SOX5 in gastric cancer tissues and cells as compared with the levels in normal tissues and cells, suggesting that there was a negative correlation between miR539 and SOX5. (PMID:31322222)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_reriosox5ENSDARG00000011582
mus_musculusSox5ENSMUSG00000041540
rattus_norvegicusSox5ENSRNOG00000027869
drosophila_melanogasterSox14FBGN0005612
drosophila_melanogasterSox21aFBGN0036411
drosophila_melanogasterSox102FFBGN0039938
caenorhabditis_elegansWBGENE00001182

Paralogs (20): SOX8 (ENSG00000005513), SOX30 (ENSG00000039600), SOX10 (ENSG00000100146), SOX6 (ENSG00000110693), SOX4 (ENSG00000124766), SOX21 (ENSG00000125285), SOX9 (ENSG00000125398), SOX15 (ENSG00000129194), SOX3 (ENSG00000134595), SOX13 (ENSG00000143842), SOX17 (ENSG00000164736), SOX14 (ENSG00000168875), SOX7 (ENSG00000171056), SOX11 (ENSG00000176887), SOX12 (ENSG00000177732), CFAP65 (ENSG00000181378), SOX2 (ENSG00000181449), SOX1 (ENSG00000182968), SRY (ENSG00000184895), SOX18 (ENSG00000203883)

Protein

Protein identifiers

Transcription factor SOX-5P35711 (reviewed: P35711)

All UniProt accessions (12): A0A2R8Y5Q1, A0A2R8Y7P3, A0A994J4A9, A0A994J4I4, A0A994J6X9, A0A994J7C0, P35711, F5GWL1, F5H0I3, G3V0H1, I3L0A5, T2CZM2

UniProt curated annotations — full annotation on UniProt →

Function. Transcription factor involved in chondrocytes differentiation and cartilage formation. Specifically binds the 5’-AACAAT-3’ DNA motif present in enhancers and super-enhancers and promotes expression of genes important for chondrogenesis, including cartilage matrix protein-coding genes, such as COL2A1 and AGC1. Required for overt chondrogenesis when condensed prechondrocytes differentiate into early stage chondrocytes: SOX5 and SOX6 cooperatively bind with SOX9 on active enhancers and super-enhancers associated with cartilage-specific genes, and thereby potentiate SOX9’s ability to transactivate. Not involved in precartilaginous condensation, the first step in chondrogenesis, during which skeletal progenitors differentiate into prechondrocytes. Together with SOX6, required to form and maintain a pool of highly proliferating chondroblasts between epiphyses and metaphyses, to form columnar chondroblasts, delay chondrocyte prehypertrophy but promote hypertrophy, and to delay terminal differentiation of chondrocytes on contact with ossification fronts. Binds to the proximal promoter region of the myelin protein MPZ gene.

Subunit / interactions. Forms homodimers and heterodimers with SOX6.

Subcellular location. Nucleus.

Disease relevance. Lamb-Shaffer syndrome (LAMSHF) [MIM:616803] An autosomal dominant, neurodevelopmental disorder characterized by global developmental delay, intellectual disability, language and motor impairment, and distinct facial features. Additional variable skeletal abnormalities may also be present. The disease is caused by variants affecting the gene represented in this entry.

Isoforms (5)

UniProt IDNamesCanonical?
P35711-11, L-SOX5Ayes
P35711-22, L-SOX5B
P35711-33
P35711-44
P35711-55

RefSeq proteins (6): NP_001248343, NP_001248344, NP_001317714, NP_008871, NP_694534, NP_821078 (=MANE)

Domains & families (InterPro)

IDNameType
IPR009071HMG_box_domDomain
IPR036910HMG_box_dom_sfHomologous_superfamily
IPR051356SOX/SOX-like_TFFamily

Pfam: PF00505

UniProt features (33 total): modified residue 7, compositionally biased region 6, sequence conflict 6, splice variant 5, region of interest 4, coiled-coil region 2, chain 1, DNA-binding region 1, sequence variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P35711-F158.950.21

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (7): 21, 131, 370, 372, 411, 414, 439

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 505 (showing top): FXR_IR1_Q6, CREL_01, MYAATNNNNNNNGGC_UNKNOWN, GOBP_CARTILAGE_DEVELOPMENT, GOBP_SKELETAL_SYSTEM_DEVELOPMENT, GOBP_REGULATION_OF_CARTILAGE_DEVELOPMENT, TGCACTT_MIR519C_MIR519B_MIR519A, LFA1_Q6, GCANCTGNY_MYOD_Q6, KORKOLA_CHORIOCARCINOMA_DN, DARWICHE_SKIN_TUMOR_PROMOTER_DN, DARWICHE_PAPILLOMA_RISK_LOW_DN, GOBP_NEUROGENESIS, DARWICHE_PAPILLOMA_RISK_HIGH_DN, RACCACAR_AML_Q6

GO Biological Process (12): cartilage condensation (GO:0001502), chondrocyte differentiation (GO:0002062), regulation of transcription by RNA polymerase II (GO:0006357), transcription by RNA polymerase II (GO:0006366), positive regulation of chondrocyte differentiation (GO:0032332), cell fate commitment (GO:0045165), cartilage development (GO:0051216), asymmetric neuroblast division (GO:0055059), positive regulation of cartilage development (GO:0061036), cellular response to transforming growth factor beta stimulus (GO:0071560), positive regulation of mesenchymal stem cell differentiation (GO:2000741), cell differentiation (GO:0030154)

GO Molecular Function (7): transcription cis-regulatory region binding (GO:0000976), RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA-binding transcription factor activity (GO:0003700), cis-regulatory region sequence-specific DNA binding (GO:0000987), DNA binding (GO:0003677), protein binding (GO:0005515)

GO Cellular Component (2): chromatin (GO:0000785), nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cartilage development3
cell differentiation2
regulation of DNA-templated transcription2
cellular developmental process2
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
transcription cis-regulatory region binding2
skeletal system morphogenesis1
cell aggregation1
transcription by RNA polymerase II1
DNA-templated transcription1
chondrocyte differentiation1
regulation of chondrocyte differentiation1
positive regulation of cell differentiation1
positive regulation of cartilage development1
skeletal system development1
animal organ development1
connective tissue development1
asymmetric cell division1
cell fate commitment1
neuroblast division1
positive regulation of developmental process1
positive regulation of multicellular organismal process1
regulation of cartilage development1
cellular response to growth factor stimulus1
response to transforming growth factor beta1
mesenchymal stem cell differentiation1
positive regulation of stem cell differentiation1
regulation of mesenchymal stem cell differentiation1
transcription regulatory region nucleic acid binding1
sequence-specific double-stranded DNA binding1
cis-regulatory region sequence-specific DNA binding1
chromatin1
DNA-binding transcription factor activity1
regulation of transcription by RNA polymerase II1
transcription regulator activity1
nucleic acid binding1
binding1
chromosome1
cellular anatomical structure1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

2034 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SOX5SOX6P35712941
SOX5SOX9P48436901
SOX5ETNK1Q9HBU6859
SOX5CTNNB1P35222851
SOX5COL2A1P02458845
SOX5TCF7P36402783
SOX5FEZF2Q8TBJ5758
SOX5ACANP16112715
SOX5SP7Q8TDD2704
SOX5SATB2Q9UPW6695
SOX5DMRT1Q9Y5R6626
SOX5COL10A1Q03692613
SOX5TBR1Q16650612
SOX5NEUROG2Q9H2A3607
SOX5DLX5P56178603

IntAct

84 interactions, top by confidence:

ABTypeScore
NFICNFIBpsi-mi:“MI:2364”(proximity)0.690
CRXSOX5psi-mi:“MI:0915”(physical association)0.670
SOX5CRXpsi-mi:“MI:0915”(physical association)0.670
SOX5TLE5psi-mi:“MI:0915”(physical association)0.660
LMO2SOX5psi-mi:“MI:0915”(physical association)0.560
SOX5LMO1psi-mi:“MI:0915”(physical association)0.560
SUMO1P1SOX5psi-mi:“MI:0915”(physical association)0.560
SOX5KIFC3psi-mi:“MI:0915”(physical association)0.560
SOX5PRR20Cpsi-mi:“MI:0915”(physical association)0.560
MORN3SOX5psi-mi:“MI:0915”(physical association)0.560
KAT5SOX5psi-mi:“MI:0915”(physical association)0.560
TENT5BSOX5psi-mi:“MI:0915”(physical association)0.560
CBX8SOX5psi-mi:“MI:0915”(physical association)0.560
SOX5ZNF581psi-mi:“MI:0915”(physical association)0.560
SOX5CDC23psi-mi:“MI:0915”(physical association)0.560
LMO1SOX5psi-mi:“MI:0915”(physical association)0.560
SOX5SUMO1P1psi-mi:“MI:0915”(physical association)0.560
SOX5MORN3psi-mi:“MI:0915”(physical association)0.560
SOX5KAT5psi-mi:“MI:0915”(physical association)0.560
SOX5TENT5Bpsi-mi:“MI:0915”(physical association)0.560
SOX5CBX8psi-mi:“MI:0915”(physical association)0.560

BioGRID (192): TAF6 (Affinity Capture-MS), SOX5 (Two-hybrid), SOX5 (Two-hybrid), SOX5 (Two-hybrid), SOX5 (Two-hybrid), SOX5 (Two-hybrid), SOX5 (Two-hybrid), CDC23 (Two-hybrid), KAT5 (Two-hybrid), ARID5A (Two-hybrid), ZNF581 (Two-hybrid), CBX8 (Two-hybrid), FAM46B (Two-hybrid), PRR20A (Two-hybrid), MORN3 (Two-hybrid)

ESM2 similar proteins: A0A0G2JTZ2, A2BEA6, A4IFD2, B1H349, B3DM43, B3DM47, F1M8W4, O15409, O75030, P0CF24, P23899, P27889, P35680, P35710, P35711, P35712, P40645, P40647, P58463, P70062, P70063, P70064, Q23045, Q2LE08, Q4VYR7, Q4VYS1, Q58NQ4, Q5QL03, Q5RCU4, Q5RER5, Q5W1J5, Q6GL68, Q800Q5, Q86MD3, Q8HZ00, Q8MJ97, Q8MJ98, Q8MJ99, Q8MJA0, Q8STF6

Diamond homologs: A0A0G2JTZ2, A2TED3, A5D8R3, B1H349, B3DLD3, B3DM43, F1M8W4, O42342, O42601, P0C1G9, P35710, P35711, P35712, P35713, P35716, P36389, P36390, P36393, P36394, P36396, P40645, P40646, P40647, P40649, P40650, P40656, P40657, P43680, P47792, P48433, P48435, Q03255, Q03257, Q04891, Q05738, Q06831, Q06945, Q20201, Q23045, Q27949

SIGNOR signaling

2 interactions.

AEffectBMechanism
SOX5“up-regulates quantity by expression”COL2A1“transcriptional regulation”
MKX“up-regulates quantity by expression”SOX5“transcriptional regulation”

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 52 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

GO biological processes:

GO termPartnersFoldFDR
gene expression58.2×1e-02

Disease & clinical

Clinical variants and AI predictions

ClinVar

357 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic65
Likely pathogenic31
Uncertain significance178
Likely benign43
Benign6

Top pathogenic / likely-pathogenic (30)

Variant IDHGVSClassification
1098341NM_006940.6(SOX5):c.352C>T (p.Arg118Ter)Pathogenic
1172669NM_006940.6(SOX5):c.755dup (p.Gln253fs)Pathogenic
1195659NM_006940.6(SOX5):c.932-1G>CPathogenic
1297543NM_006940.6(SOX5):c.1639C>T (p.Arg547Ter)Pathogenic
1300230NM_006940.6(SOX5):c.433dup (p.Thr145fs)Pathogenic
1340410GRCh37/hg19 12p12.1(chr12:23537708-23757164)x1Pathogenic
1341190GRCh37/hg19 12p12.1(chr12:23656814-23741624)x1Pathogenic
1421590NM_006940.6(SOX5):c.811-2A>GPathogenic
1526379NM_006940.6(SOX5):c.1748A>G (p.Asn583Ser)Pathogenic
1527696GRCh37/hg19 12p12.1(chr12:23594653-23814395)Pathogenic
1527935NM_006940.6(SOX5):c.1807C>T (p.Gln603Ter)Pathogenic
1686224NM_006940.6(SOX5):c.1489-2A>GPathogenic
1700126NM_006940.4:c.482_810delPathogenic
1709432NM_006940.6(SOX5):c.1136del (p.Ser379fs)Pathogenic
1879227NM_006940.6(SOX5):c.1697G>A (p.Trp566Ter)Pathogenic
222022NM_006940.6(SOX5):c.1060G>T (p.Gly354Ter)Pathogenic
2442685NM_006940.6(SOX5):c.359del (p.Ser120fs)Pathogenic
2501978NM_006940.6(SOX5):c.1676C>T (p.Pro559Leu)Pathogenic
2574979NM_006940.6(SOX5):c.1337_1338dup (p.Ile447fs)Pathogenic
2642785NM_006940.6(SOX5):c.700C>T (p.Gln234Ter)Pathogenic
2685435GRCh37/hg19 12p12.1(chr12:23594097-23768820)x1Pathogenic
2775436NM_006940.6(SOX5):c.1862del (p.Pro621fs)Pathogenic
3337464NM_006940.6:c.741+1981_931+5178delPathogenic
3377222NM_006940.6(SOX5):c.554_555del (p.Phe185fs)Pathogenic
3378162NM_006940.6(SOX5):c.1342+1G>TPathogenic
3382063NM_006940.6(SOX5):c.1423C>T (p.Gln475Ter)Pathogenic
3383983NM_006940.6(SOX5):c.1598-1G>APathogenic
3391917GRCh37/hg19 12p12.1(chr12:23624334-23720028)x1Pathogenic
3574544NM_006940.6(SOX5):c.785_786insTTAT (p.Asn263fs)Pathogenic
3895970NC_000012.11:g.(23908659_23998916)(24102605?)delPathogenic

SpliceAI

3937 predictions. Top by Δscore:

VariantEffectΔscore
12:23534258:C:CAdonor_gain1.0000
12:23536450:TA:Tdonor_loss1.0000
12:23536451:A:ACdonor_gain1.0000
12:23536451:AC:Adonor_gain1.0000
12:23536452:C:CAdonor_gain1.0000
12:23536452:CC:Cdonor_gain1.0000
12:23536452:CCCA:Cdonor_gain1.0000
12:23536452:CCCAA:Cdonor_gain1.0000
12:23536666:GATC:Gacceptor_gain1.0000
12:23536668:TC:Tacceptor_gain1.0000
12:23536669:CC:Cacceptor_gain1.0000
12:23536669:CCTAT:Cacceptor_loss1.0000
12:23536670:C:CCacceptor_gain1.0000
12:23543207:TTA:Tdonor_loss1.0000
12:23543210:C:Adonor_loss1.0000
12:23543381:CTTC:Cacceptor_gain1.0000
12:23563253:CTGA:Cdonor_loss1.0000
12:23563254:TGA:Tdonor_loss1.0000
12:23563255:GACCT:Gdonor_loss1.0000
12:23563256:A:Cdonor_loss1.0000
12:23563257:C:CAdonor_loss1.0000
12:23594107:TTGTA:Tdonor_gain1.0000
12:23640807:CTTA:Cdonor_loss1.0000
12:23640808:TTACC:Tdonor_loss1.0000
12:23640809:TA:Tdonor_loss1.0000
12:23640810:ACC:Adonor_loss1.0000
12:23640811:CCT:Cdonor_loss1.0000
12:23640895:CAC:Cacceptor_gain1.0000
12:23640898:C:CCacceptor_gain1.0000
12:23640898:CT:Cacceptor_loss1.0000

AlphaMissense

5029 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
12:23536492:C:GR650P1.000
12:23536495:A:TM649K1.000
12:23536502:C:GA647P1.000
12:23536503:C:AK646N1.000
12:23536503:C:GK646N1.000
12:23536504:T:AK646M1.000
12:23536505:T:CK646E1.000
12:23536507:T:CY645C1.000
12:23536507:T:GY645S1.000
12:23536508:A:CY645D1.000
12:23536508:A:GY645H1.000
12:23536508:A:TY645N1.000
12:23536510:T:AE644V1.000
12:23536516:A:CI642S1.000
12:23536516:A:TI642N1.000
12:23536520:G:TR641S1.000
12:23536522:A:GL640P1.000
12:23536524:C:AK639N1.000
12:23536524:C:GK639N1.000
12:23536526:T:CK639E1.000
12:23536531:C:AG637V1.000
12:23536532:C:GG637R1.000
12:23536542:G:CC633W1.000
12:23536543:C:GC633S1.000
12:23536543:C:TC633Y1.000
12:23536544:A:GC633R1.000
12:23536544:A:TC633S1.000
12:23536549:C:GR631P1.000
12:23536550:G:AR631C1.000
12:23536550:G:CR631G1.000

dbSNP variants (sampled 300 via entrez): RS1000000723 (12:23852264 A>G), RS1000004060 (12:23968833 A>G), RS1000004735 (12:23848988 C>T), RS1000010432 (12:23581578 A>G), RS1000013036 (12:23610208 A>G), RS1000013979 (12:24503728 A>G), RS1000014609 (12:23623881 A>G), RS1000019209 (12:23992054 A>G), RS1000023348 (12:24290074 T>C), RS1000023632 (12:24423457 T>G), RS1000031040 (12:24544827 A>G), RS1000033898 (12:23826895 C>T), RS1000036300 (12:24534339 T>C), RS1000039633 (12:24159912 G>A), RS1000040949 (12:23761485 G>A)

Disease associations

OMIM: gene MIM:604975 | disease phenotypes: MIM:616803, MIM:610805, MIM:137580

GenCC curated gene-disease

DiseaseClassificationInheritance
Lamb-Shaffer syndromeStrongAutosomal dominant
developmental and speech delay due to SOX5 deficiencySupportiveAutosomal dominant

ClinGen Gene-Disease Validity (1)

Expert-panel classifications — Definitive > Strong > Moderate > Limited > Disputed > Refuted.

DiseaseClassificationInheritance
Lamb-Shaffer syndromeDefinitiveAD

Mondo (7): Lamb-Shaffer syndrome (MONDO:0014778), autism spectrum disorder (MONDO:0005258), strabismus (MONDO:0003432), congenital anomaly of kidney and urinary tract (MONDO:0019719), intellectual disability (MONDO:0001071), Tourette syndrome (MONDO:0007661), developmental and speech delay due to SOX5 deficiency (MONDO:0017782)

Orphanet (7): 12p12.1 microdeletion syndrome (Orphanet:313884), Developmental and speech delay due to SOX5 deficiency (Orphanet:313892), Lamb-Shaffer syndrome (Orphanet:530983), Renal or urinary tract malformation (Orphanet:93545), NON RARE IN EUROPE: Autism (Orphanet:106), NON RARE IN EUROPE: Unexplained intellectual disability (Orphanet:319658), NON RARE IN EUROPE: Tourette syndrome (Orphanet:856)

HPO phenotypes

69 total (30 of 69 shown, HPO-id order):

HPOTerm
HP:0000006Autosomal dominant inheritance
HP:0000028Cryptorchidism
HP:0000078Abnormality of the genital system
HP:0000189Narrow palate
HP:0000194Open mouth
HP:0000238Hydrocephalus
HP:0000286Epicanthus
HP:0000324Facial asymmetry
HP:0000358Posteriorly rotated ears
HP:0000369Low-set ears
HP:0000414Bulbous nose
HP:0000431Wide nasal bridge
HP:0000486Strabismus
HP:0000494Downslanted palpebral fissures
HP:0000545Myopia
HP:0000577Exotropia
HP:0000648Optic atrophy
HP:0000678Dental crowding
HP:0000708Atypical behavior
HP:0000718Aggressive behavior
HP:0000733Motor stereotypy
HP:0000739Anxiety
HP:0000750Delayed speech and language development
HP:0000768Pectus carinatum
HP:0000958Dry skin
HP:0000970Anhidrosis
HP:0000989Pruritus
HP:0000998Hypertrichosis
HP:0001000Abnormality of skin pigmentation
HP:0001053Hypopigmented skin patches

GWAS associations

52 associations (top):

StudyTraitp-value
GCST000487_3AIDS2.000000e-06
GCST000562_6PR interval3.000000e-13
GCST000618_3Response to antipsychotic treatment1.000000e-07
GCST000635_33Response to statin therapy2.000000e-06
GCST001156_5Systemic sclerosis1.000000e-07
GCST001160_4Systemic sclerosis5.000000e-06
GCST001362_3Non-obstructive azoospermia2.000000e-09
GCST002783_414Body mass index6.000000e-06
GCST002989_19LDL peak particle diameter (total fat intake interaction)8.000000e-06
GCST003030_5Oppositional defiant disorder dimensions in attention-deficit hyperactivity disorder7.000000e-06
GCST003138_3Juvenile osteochondritis dissecans3.000000e-06
GCST003225_9Pelvic organ prolapse (moderate/severe)6.000000e-06
GCST003602_10Inflammatory bowel disease3.000000e-06
GCST003817_10Mortality in sepsis3.000000e-06
GCST003818_69Resting heart rate3.000000e-53
GCST005080_9PR interval9.000000e-15
GCST005787_23Heart rate response to exercise2.000000e-14
GCST005788_17Heart rate response to recovery post exercise2.000000e-11
GCST005789_20Resting heart rate7.000000e-18
GCST005839_33Depression3.000000e-08
GCST005984_57Glomerular filtration rate7.000000e-09
GCST005985_37Creatinine levels6.000000e-09
GCST006624_79Systolic blood pressure7.000000e-10
GCST006803_35Schizophrenia1.000000e-08
GCST006979_891Heel bone mineral density1.000000e-10
GCST007045_8PR interval2.000000e-41
GCST007152_1Chronic back pain5.000000e-19
GCST007448_14Normal facial asymmetry (angle of surface orientation score)1.000000e-13
GCST007448_3Normal facial asymmetry (angle of surface orientation score)1.000000e-12
GCST007576_244Chronotype7.000000e-11

EFO canonical traits (18, from GWAS)

EFO IDTrait name
EFO:0004462PR interval
EFO:0004340body mass index
EFO:0007677LDL peak particle diameter measurement
EFO:0007678total fat intake measurement
EFO:0007679oppositional defiant disorder measurement
EFO:0004352mortality
EFO:0009184heart rate response to exercise
EFO:0009185heart rate response to recovery post exercise
EFO:0006335systolic blood pressure
EFO:0009270heel bone mineral density
EFO:0009751facial asymmetry measurement
EFO:0008328chronotype measurement
EFO:0008398T wave morphology measurement
EFO:0008039BMI-adjusted hip circumference
EFO:0010100multisite chronic pain
EFO:0004980appendicular lean mass
EFO:0009749age at first sexual intercourse measurement
EFO:0007788BMI-adjusted waist-hip ratio

MeSH disease descriptors (4)

DescriptorNameTree numbers
D008607Intellectual DisabilityC10.597.606.360; C23.888.592.604.646; F01.700.687; F03.625.539
D013285StrabismusC10.292.562.887; C11.590.810
D005879Tourette SyndromeC10.228.140.079.898; C10.228.662.825.800; C10.574.500.850; C16.320.400.820; F03.625.992.850
C566906Cakut (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

46 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, decreases expression5
bisphenol Aaffects expression, affects methylation, affects cotreatment, increases methylation3
arseniteaffects binding, decreases reaction, increases methylation2
entinostatdecreases expression, affects cotreatment2
Phenylmercuric Acetateaffects cotreatment, decreases expression2
Tretinoindecreases expression, increases expression2
Valproic Acidaffects expression, decreases expression2
Aflatoxin B1decreases methylation2
p-Chloromercuribenzoic Acidaffects cotreatment, decreases expression2
FR900359increases phosphorylation1
methylmercuric chloridedecreases expression1
methyleugenoldecreases expression1
deoxynivalenolincreases expression1
tris(2-butoxyethyl) phosphateaffects expression1
beta-lapachonedecreases expression1
mono-(2-ethylhexyl)phthalateincreases expression1
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
zinc chromatedecreases expression, increases abundance1
benzo(e)pyreneincreases methylation1
aflatoxin B2increases methylation1
chromium hexavalent iondecreases expression, increases abundance1
deguelinincreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression1
dorsomorphindecreases expression, affects cotreatment1
bisphenol Sincreases methylation1
tianma gouteng yinincreases expression1
Fulvestrantaffects cotreatment, affects methylation1
Leflunomideincreases expression1
Caffeineaffects phosphorylation1
Diethylhexyl Phthalatedecreases expression1

Cellosaurus cell lines

5 cell lines: 3 embryonic stem cell, 2 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A6M9SEES3-1V human SOX5, clone1Embryonic stem cellMale
CVCL_A6N0SEES3-1V human SOX5, clone2Embryonic stem cellMale
CVCL_A6N1SEES3-1V human SOX5, clone3Embryonic stem cellMale
CVCL_TQ02HAP1 SOX5 (-) 1Cancer cell lineMale
CVCL_TQ03HAP1 SOX5 (-) 2Cancer cell lineMale

Clinical trials (associated diseases)

300 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00391261PHASE4COMPLETEDAn Open-label Trial of Metformin for Weight Control of Pediatric Patients on Antipsychotic Medications.
NCT01028820PHASE4COMPLETEDFMRI Brain Activation of Aripiprazole Treatment in Autism Spectrum Disorders
NCT01333865PHASE4COMPLETEDA Study of Memantine Hydrochloride (Namenda®) for Cognitive and Behavioral Impairment in Adults With Autism Spectrum Disorders
NCT01337700PHASE4COMPLETEDMilnacipran in Autism and the Functional Locus Coeruleus and Noradrenergic Model of Autism
NCT01695200PHASE4COMPLETEDOmega-3 Fatty Acids in Autism Spectrum Disorders
NCT02096952PHASE4COMPLETEDMethylphenidate ER Liquid Formulation in Adults With ASD and ADHD
NCT02235467PHASE4COMPLETEDMultisite Study: Parental Training Using Video Modelling to Develop Social Skills in Children With Autism
NCT02940574PHASE4COMPLETEDNeural and Behavioral Effects of Oxytocin in Autism Spectrum Disorders
NCT03333629PHASE4COMPLETEDPromoting Positive Outcomes for Individuals With ASD: Linking Early Detection, Treatment, and Long-term Outcomes
NCT03337646PHASE4COMPLETEDEvaluation of the Effect and Safety of Lisdexamfetamine in Children Aged 6-12 With ADHD and Autism
NCT03538431PHASE4COMPLETEDImproving Driving in Young People With Autism Spectrum Disorders
NCT03757585PHASE4COMPLETEDNatural Treatments for the Management of Emotional Dysregulation in Youth With Non-verbal Learning Disability (NVLD) and/or Autism Spectrum Disorders (ASD)
NCT04903353PHASE4COMPLETEDPragmatic Trial Comparing Weight Gain in Children With Autism Taking Risperidone Versus Aripiprazole
NCT05063656PHASE4COMPLETEDBiomarker-Driven Pharmacological Treatment of Adolescents With Autism Spectrum Disorder With Gabapentin
NCT05146245PHASE4UNKNOWNSafety and Pharmacokinetics of Antipsychotics in Children 2: Studying TDM in an RCT
NCT05916339PHASE4RECRUITINGAWARE: Management of ADHD in Autism Spectrum Disorder
NCT05954052PHASE4TERMINATEDA Study of Glutathione in Children With Autism Spectrum Disorder
NCT06853665PHASE4RECRUITINGThe TEAM Study - Treatment Efficacy for Autism/Attention Using Mixed Amphetamine
NCT07054697PHASE4COMPLETEDPilot-RCT With Individualized Homeopathic Treatment in the Children With Autism Spectrum Disorder
NCT07161804PHASE4COMPLETEDPilot RCT Using Homeopathic Medicines in ASD
NCT07439042PHASE4NOT_YET_RECRUITINGBuspirone for Anxiety in Autistic Youth
NCT01302964PHASE3COMPLETEDMirtazapine Treatment of Anxiety in Children and Adolescents With Pervasive Developmental Disorders
NCT01706523PHASE3TERMINATEDOpen Label Extension Study of STX209 (Arbaclofen) in Autism Spectrum Disorders
NCT01825798PHASE3COMPLETEDTreatment of Overweight Induced by Antipsychotic Medication in Young People With Autism Spectrum Disorders (ASD)
NCT01972074PHASE3COMPLETEDBehavioral and Neural Response to Memantine in Adolescents With Autism Spectrum Disorder
NCT02985749PHASE3COMPLETEDA Study of Oxytocin for the Treatment of Social Impairment in Individuals With High Functioning Autism Spectrum Disorder
NCT03197922PHASE3COMPLETEDTreatment of Encopresis in Children With Autism Spectrum Disorders
NCT03504917PHASE3TERMINATEDA Study of Balovaptan in Adults With Autism Spectrum Disorder With a 2-Year Open-Label Extension
NCT03553875PHASE3TERMINATEDMemantine for the Treatment of Social Deficits in Youth With Disorders of Impaired Social Interactions
NCT03640156PHASE3COMPLETEDModulating Socially Adaptive Mirror System Functioning in Autism by Oxytocin
NCT03715153PHASE3TERMINATEDEfficacy and Safety of Bumetanide Oral Liquid Formulation in Children Aged From 2 to Less Than 7 Years Old With Autism Spectrum Disorder.
NCT03715166PHASE3TERMINATEDEfficacy and Safety of Bumetanide Oral Liquid Formulation in Children and Adolescents Aged From 7 to Less Than 18 Years Old With Autism Spectrum Disorder
NCT04233502PHASE3WITHDRAWNEfficacy and Safety of Slenyto for Insomnia in Children With ASD
NCT04578756PHASE3COMPLETEDOpen-Label, Flexible-dose Study to Evaluate the Long-Term Safety and Tolerability of Cariprazine in the Treatment of Pediatric Participants With Schizophrenia, Bipolar I Disorder, or Autism Spectrum Disorder
NCT04623398PHASE3COMPLETEDEffect of Lithium in Patients With Autism Spectrum Disorder and Phelan-McDermid Syndrome (SHANK3 Haploinsufficiency)
NCT04725383PHASE3TERMINATEDAmitriptyline for Repetitive Behaviors in Autism Spectrum Disorders
NCT05212493PHASE3COMPLETEDThe Effects of Medical Cannabis in Children With Autistic Spectrum Disorder
NCT05361707PHASE3UNKNOWNEvaluating the Effects of Tasimelteon in Individuals With Autism Spectrum Disorder (ASD) and Sleep Disturbances
NCT05439616PHASE3COMPLETEDStudy of Cariprazine Oral Capsules or Solution to Assess Adverse Events and Change in Irritability Due to Autism Spectrum Disorder (ASD) in Participants Aged 5-17 Years With ASD
NCT06229210PHASE3RECRUITINGSafety and Tolerability Trial of Lumateperone in Pediatric Patients With Schizophrenia, Bipolar Disorder or Autism Spectrum Disorder