SOX14

gene
On this page

Also known as SOX28

Summary

SOX14 (SRY-box transcription factor 14, HGNC:11193) is a protein-coding gene on chromosome 3q22.3, encoding Transcription factor SOX-14 (O95416). Acts as a negative regulator of transcription.

This intronless gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins. Mutations in this gene are suggested to be responsible for the limb defects associated with blepharophimosis, ptosis, epicanthus inversus syndrome (BPES) and Mobius syndrome.

Source: NCBI Gene 8403 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 15 total
  • MANE Select transcript: NM_004189

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:11193
Approved symbolSOX14
NameSRY-box transcription factor 14
Location3q22.3
Locus typegene with protein product
StatusApproved
AliasesSOX28
Ensembl geneENSG00000168875
Ensembl biotypeprotein_coding
OMIM604747
Entrez8403

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000306087

RefSeq mRNA: 1 — MANE Select: NM_004189 NM_004189

CCDS: CCDS3094

Canonical transcript exons

ENST00000306087 — 1 exons

ExonStartEnd
ENSE00001154585137764315137766334

Expression profiles

Bgee: expression breadth broad, 15 present calls, max score 54.24.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.2017 / max 38.4047, expressed in 45 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
387600.201745

Top tissues by expression

233 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
placentaUBERON:000198754.24gold quality
hypothalamusUBERON:000189851.30gold quality
lower esophagus mucosaUBERON:003583446.90silver quality
lower lobe of lungUBERON:000894945.42silver quality
substantia nigraUBERON:000203845.18gold quality
midbrainUBERON:000189144.51gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451143.37gold quality
buccal mucosa cellCL:000233643.07gold quality
amniotic fluidUBERON:000017342.79gold quality
prostate glandUBERON:000236742.58gold quality
secondary oocyteCL:000065542.57gold quality
gingivaUBERON:000182842.56gold quality
vastus lateralisUBERON:000137941.41gold quality
quadriceps femorisUBERON:000137741.37gold quality
superficial temporal arteryUBERON:000161441.33gold quality
palpebral conjunctivaUBERON:000181241.10gold quality
mucosa of paranasal sinusUBERON:000503040.98gold quality
trabecular bone tissueUBERON:000248340.78gold quality
jejunal mucosaUBERON:000039940.59gold quality
biceps brachiiUBERON:000150740.57gold quality
epithelium of nasopharynxUBERON:000195140.45gold quality
myocardiumUBERON:000234940.45gold quality
gingival epitheliumUBERON:000194940.43gold quality
cortical plateUBERON:000534340.35silver quality
germinal epithelium of ovaryUBERON:000130440.33gold quality
esophagus squamous epitheliumUBERON:000692040.29gold quality
skeletal muscle tissue of biceps brachiiUBERON:000450240.27gold quality
dorsal plus ventral thalamusUBERON:000189740.22gold quality
mammary ductUBERON:000176540.19gold quality
jejunumUBERON:000211540.18gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no0.48

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

1 targets.

TargetRegulation
MICAL1Unknown

JASPAR motifs

MotifNameFamily
MA1562.1SOX14SOX-related factors
MA1562.2SOX14SOX-related factors

JASPAR matrix evidence (PMIDs): PMID:17852354

Upstream regulators (CollecTRI, top): FOXA2, FOXO4, GLI1

miRNA regulators (miRDB)

87 targeting SOX14, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3120-5P100.0065.56965
HSA-MIR-6758-5P100.0066.211470
HSA-MIR-6856-5P100.0065.471298
HSA-MIR-4668-3P100.0068.742635
HSA-MIR-607799.9968.042299
HSA-MIR-548N99.9871.944170
HSA-MIR-477599.9875.006394
HSA-MIR-6888-3P99.9765.951170
HSA-MIR-3065-5P99.9771.563281
HSA-MIR-314899.9775.066478
HSA-MIR-548AB99.9571.313488
HSA-MIR-55999.9572.283609
HSA-MIR-3912-5P99.9566.11925
HSA-MIR-651-3P99.9473.485177
HSA-MIR-548A-5P99.9471.273482
HSA-MIR-548AD-5P99.9471.233502
HSA-MIR-548AE-5P99.9471.233502
HSA-MIR-548AK99.9471.243488
HSA-MIR-548AM-5P99.9471.243488
HSA-MIR-548AP-5P99.9471.143489
HSA-MIR-548AQ-5P99.9471.343426
HSA-MIR-548AR-5P99.9471.283515
HSA-MIR-548AS-5P99.9471.223482
HSA-MIR-548AU-5P99.9471.243488
HSA-MIR-548AY-5P99.9471.233502
HSA-MIR-548B-5P99.9471.233502
HSA-MIR-548BB-5P99.9471.273509
HSA-MIR-548C-5P99.9471.243488
HSA-MIR-548D-5P99.9471.233502
HSA-MIR-548H-5P99.9471.243488

Literature-anchored findings (GeneRIF, showing 4)

  • identified the positive control element and provided the first evidence that FOXA2 is involved in up-regulation of SOX14 expression in HepG2 and U87MG cell lines (PMID:20074681)
  • these results demonstrate that SOX14 can promote proliferation and invasion capacity of cervical cancer cells by activating the Wnt/beta-catenin pathway. (PMID:25973056)
  • High SOX14 expression is associated with bladder cancer. (PMID:28258492)
  • SOX14 hypermethylation as a tumour biomarker in cervical cancer. (PMID:34098886)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_reriosox14ENSDARG00000070929
mus_musculusSox14ENSMUSG00000053747
rattus_norvegicusSox14ENSRNOG00000022084
drosophila_melanogasterSox14FBGN0005612
drosophila_melanogasterSox21aFBGN0036411
drosophila_melanogasterSox102FFBGN0039938
caenorhabditis_elegansWBGENE00001182

Paralogs (20): SOX8 (ENSG00000005513), SOX30 (ENSG00000039600), SOX10 (ENSG00000100146), SOX6 (ENSG00000110693), SOX4 (ENSG00000124766), SOX21 (ENSG00000125285), SOX9 (ENSG00000125398), SOX15 (ENSG00000129194), SOX5 (ENSG00000134532), SOX3 (ENSG00000134595), SOX13 (ENSG00000143842), SOX17 (ENSG00000164736), SOX7 (ENSG00000171056), SOX11 (ENSG00000176887), SOX12 (ENSG00000177732), CFAP65 (ENSG00000181378), SOX2 (ENSG00000181449), SOX1 (ENSG00000182968), SRY (ENSG00000184895), SOX18 (ENSG00000203883)

Protein

Protein identifiers

Transcription factor SOX-14O95416 (reviewed: O95416)

Alternative names: Protein SOX-28

All UniProt accessions (1): O95416

UniProt curated annotations — full annotation on UniProt →

Function. Acts as a negative regulator of transcription.

Subcellular location. Nucleus.

RefSeq proteins (1): NP_004180* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR009071HMG_box_domDomain
IPR022097SOX_famFamily
IPR036910HMG_box_dom_sfHomologous_superfamily
IPR050140SRY-related_HMG-box_TF-likeFamily

Pfam: PF00505, PF12336

UniProt features (2 total): chain 1, DNA-binding region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O95416-F168.130.35

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 150 (showing top): GSE37336_LY6C_POS_VS_NEG_NAIVE_CD4_TCELL_DN, GOBP_CIRCADIAN_RHYTHM, RNGTGGGC_UNKNOWN, YAATNRNNNYNATT_UNKNOWN, BENPORATH_ES_WITH_H3K27ME3, GCANCTGNY_MYOD_Q6, GOBP_NEUROGENESIS, GGGTGGRR_PAX4_03, EVI1_05, GOBP_REGULATION_OF_CIRCADIAN_RHYTHM, TCF4_Q5, NKX62_Q2, NF1_Q6_01, GOBP_SENSORY_PERCEPTION_OF_LIGHT_STIMULUS, OCT1_03

GO Biological Process (10): negative regulation of transcription by RNA polymerase II (GO:0000122), nervous system development (GO:0007399), brain development (GO:0007420), visual perception (GO:0007601), entrainment of circadian clock (GO:0009649), neuron differentiation (GO:0030182), negative regulation of DNA-templated transcription (GO:0045892), positive regulation of transcription by RNA polymerase II (GO:0045944), regulation of neuron migration (GO:2001222), regulation of DNA-templated transcription (GO:0006355)

GO Molecular Function (8): RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA-binding transcription activator activity, RNA polymerase II-specific (GO:0001228), chromatin binding (GO:0003682), sequence-specific DNA binding (GO:0043565), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677), protein binding (GO:0005515)

GO Cellular Component (3): chromatin (GO:0000785), nucleus (GO:0005634), transcription regulator complex (GO:0005667)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
regulation of transcription by RNA polymerase II3
RNA polymerase II transcription regulatory region sequence-specific DNA binding3
transcription by RNA polymerase II2
DNA-templated transcription2
binding2
negative regulation of DNA-templated transcription1
system development1
central nervous system development1
animal organ development1
head development1
sensory perception of light stimulus1
response to external stimulus1
regulation of circadian rhythm1
cell differentiation1
generation of neurons1
regulation of DNA-templated transcription1
negative regulation of RNA biosynthetic process1
positive regulation of DNA-templated transcription1
neuron migration1
regulation of cell migration1
regulation of gene expression1
regulation of RNA biosynthetic process1
cis-regulatory region sequence-specific DNA binding1
chromatin1
DNA-binding transcription factor activity1
DNA-binding transcription factor activity, RNA polymerase II-specific1
DNA-binding transcription activator activity1
positive regulation of transcription by RNA polymerase II1
DNA binding1
double-stranded DNA binding1
sequence-specific DNA binding1
nucleic acid binding1
chromosome1
cellular anatomical structure1
intracellular membrane-bounded organelle1
protein-containing complex1

Protein interactions and networks

STRING

1158 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SOX14C4BPAP04003551
SOX14NKX2-2O95096463
SOX14DBX1A6NMT0438
SOX14CDR2LQ86X02426
SOX14LHX9Q9NQ69425
SOX14TMEM14AQ9Y6G1417
SOX14LHX3Q9UBR4407
SOX14TAL2Q16559405
SOX14GBX2P52951399
SOX14DKK4Q9UBT3396
SOX14ANAPC2Q9UJX6395
SOX14AKR7A2O43488387
SOX14ESYT3A0FGR9381
SOX14LHX5Q9H2C1376
SOX14LHX1P48742376

IntAct

97 interactions, top by confidence:

ABTypeScore
SOX14NAIF1psi-mi:“MI:0915”(physical association)0.670
NAIF1SOX14psi-mi:“MI:0915”(physical association)0.670
CRXSOX14psi-mi:“MI:0915”(physical association)0.560
SOX14ROR2psi-mi:“MI:0915”(physical association)0.560
SOX14SPMIP9psi-mi:“MI:0915”(physical association)0.560
FOXH1SOX14psi-mi:“MI:0915”(physical association)0.560
VENTXSOX14psi-mi:“MI:0915”(physical association)0.560
SOX14KRTAP15-1psi-mi:“MI:0915”(physical association)0.560
APBB1SOX14psi-mi:“MI:0915”(physical association)0.560
GRNSOX14psi-mi:“MI:0915”(physical association)0.560
SOX14NR3C1psi-mi:“MI:0915”(physical association)0.560
PSEN2SOX14psi-mi:“MI:0915”(physical association)0.560
TGM2SOX14psi-mi:“MI:0915”(physical association)0.560
SOX14UQCRC1psi-mi:“MI:0915”(physical association)0.560
SOX14HTTpsi-mi:“MI:0915”(physical association)0.560
SOX14ATXN3psi-mi:“MI:0915”(physical association)0.560
SOX14ATXN1psi-mi:“MI:0915”(physical association)0.560

BioGRID (14): NAIF1 (Two-hybrid), SOX14 (Two-hybrid), SOX14 (Affinity Capture-MS), SOX14 (Affinity Capture-MS), SOX14 (Two-hybrid), SOX14 (Two-hybrid), SOX14 (Two-hybrid), ROR2 (Two-hybrid), TEX37 (Two-hybrid), KRTAP15-1 (Two-hybrid), SOX14 (Two-hybrid), SOX14 (Affinity Capture-MS), SOX14 (Affinity Capture-MS), SOX14 (Cross-Linking-MS (XL-MS))

ESM2 similar proteins: A4IIJ8, A4QNG3, B0ZTE1, B0ZTE2, B7ZR65, O42342, O95416, P04476, P09071, P23770, P27782, P46692, P47792, P48433, P55863, P61259, P70056, Q04892, Q08478, Q1KKY3, Q1KL17, Q28447, Q28GD5, Q28IU6, Q32PP9, Q66JW3, Q68FA4, Q6DFF5, Q6EJB7, Q6NVT7, Q6RVD7, Q6VVD7, Q7SZS1, Q8AWY9, Q8AXX8, Q90481, Q90X25, Q90ZH7, Q90ZH8, Q91169

Diamond homologs: A2TED3, A4QNG3, B0ZTE1, B0ZTE2, O00570, O42569, O57401, O60248, O95416, P36389, P36390, P36393, P36395, P36396, P41225, P43267, P47792, P48046, P48430, P48431, P48432, P48433, P51501, P53783, P53784, P54231, P55863, P61259, Q04892, Q05066, Q20201, Q21305, Q24533, Q28447, Q28778, Q28783, Q28798, Q2PG84, Q2Z1R2, Q32PP9

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 38 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Chemokine receptors bind chemokines638.7×1e-06

GO biological processes:

GO termPartnersFoldFDR
chemokine-mediated signaling pathway652.5×4e-07
antimicrobial humoral immune response mediated by antimicrobial peptide730.7×4e-07
chemotaxis622.0×2e-05
cellular response to lipopolysaccharide615.9×1e-04
positive regulation of cell migration813.3×1e-05
cell-cell signaling713.2×5e-05
inflammatory response1010.2×4e-06
immune response67.6×4e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

15 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance14
Likely benign1
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

45 predictions. Top by Δscore:

VariantEffectΔscore
3:137764397:A:Tdonor_gain0.9200
3:137764405:G:Tdonor_gain0.7800
3:137764424:TG:Tdonor_gain0.7700
3:137764425:GG:Gdonor_gain0.7700
3:137764418:C:Tdonor_gain0.6000
3:137765422:A:AGacceptor_gain0.4400
3:137765423:G:GGacceptor_gain0.4400
3:137765053:ACAG:Adonor_loss0.3600
3:137765054:CAGG:Cdonor_loss0.3600
3:137765055:AGGTA:Adonor_loss0.3600
3:137765057:G:Adonor_loss0.3600
3:137765058:T:Adonor_loss0.3600
3:137765538:TCTGA:Tdonor_gain0.3500
3:137765052:GACAG:Gdonor_gain0.3400
3:137765453:C:Tdonor_gain0.3400
3:137764355:A:Tdonor_gain0.3300
3:137765057:G:GGdonor_gain0.3300
3:137764440:G:GTdonor_gain0.3000
3:137765423:GC:Gacceptor_gain0.3000
3:137765494:C:Gdonor_gain0.2900
3:137764396:G:GTdonor_gain0.2800
3:137765392:ACT:Adonor_gain0.2700
3:137765503:T:Gdonor_gain0.2700
3:137764404:GGAAC:Gdonor_gain0.2600
3:137764935:G:GCacceptor_gain0.2500
3:137765659:G:GAacceptor_gain0.2400
3:137764405:G:GTdonor_gain0.2300
3:137765176:C:Aacceptor_gain0.2300
3:137765485:C:Adonor_gain0.2300
3:137764329:GC:Gdonor_gain0.2200

AlphaMissense

1553 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
3:137764803:C:GH7D1.000
3:137764807:T:AI8N1.000
3:137764809:A:GK9E1.000
3:137764810:A:TK9M1.000
3:137764811:G:CK9N1.000
3:137764811:G:TK9N1.000
3:137764812:C:GR10G1.000
3:137764812:C:TR10W1.000
3:137764813:G:AR10Q1.000
3:137764813:G:TR10L1.000
3:137764815:C:AP11T1.000
3:137764815:C:GP11A1.000
3:137764815:C:TP11S1.000
3:137764816:C:AP11H1.000
3:137764816:C:GP11R1.000
3:137764816:C:TP11L1.000
3:137764819:T:AM12K1.000
3:137764819:T:CM12T1.000
3:137764819:T:GM12R1.000
3:137764820:G:AM12I1.000
3:137764820:G:CM12I1.000
3:137764820:G:TM12I1.000
3:137764821:A:CN13H1.000
3:137764821:A:GN13D1.000
3:137764821:A:TN13Y1.000
3:137764822:A:TN13I1.000
3:137764823:C:AN13K1.000
3:137764823:C:GN13K1.000
3:137764824:G:AA14T1.000
3:137764825:C:AA14D1.000

dbSNP variants (sampled 300 via entrez): RS1000159882 (3:137765112 C>T), RS1000507646 (3:137763570 T>A), RS1001392390 (3:137764274 G>T), RS1001827458 (3:137762742 T>C), RS1003051101 (3:137763032 C>G,T), RS1004449220 (3:137764978 G>T), RS1004690849 (3:137763845 A>G), RS1005145250 (3:137765893 T>G), RS1005189632 (3:137763683 G>C), RS1005897070 (3:137764609 C>A,T), RS1007548183 (3:137762342 G>A), RS1008029384 (3:137766599 C>A), RS1008582887 (3:137766423 G>A), RS1008788316 (3:137766602 T>C), RS1009005250 (3:137766307 A>G)

Disease associations

OMIM: gene MIM:604747 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (1): Microphthalmia-anophthalmia-coloboma (Orphanet:98555)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST002817_1Alzheimer’s disease in APOE e4- carriers5.000000e-07
GCST003084_7Glucocorticoid-induced osteonecrosis6.000000e-06

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

9 total (human), top 9 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Fincreases methylation1
arseniteincreases methylation1
CGP 52608affects binding, increases reaction1
deguelinincreases expression1
Resveratrolaffects cotreatment, decreases expression1
Benzo(a)pyreneaffects methylation1
Plant Extractsaffects cotreatment, decreases expression1
Valproic Aciddecreases methylation1
Aflatoxin B1decreases methylation1

Cellosaurus cell lines

3 cell lines: 3 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A6L7SEES3-1V human SOX14, clone1Embryonic stem cellMale
CVCL_A6L8SEES3-1V human SOX14, clone2Embryonic stem cellMale
CVCL_A6L9SEES3-1V human SOX14, clone3Embryonic stem cellMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): Alzheimer disease, osteonecrosis