ESX1

gene
On this page

Also known as ESXR1

Summary

ESX1 (ESX homeobox 1, HGNC:14865) is a protein-coding gene on chromosome Xq22.2, encoding Homeobox protein ESX1 (Q8N693). May coordinately regulate cell cycle progression and transcription during spermatogenesis.

This gene encodes a dual-function 65 kDa protein that undergoes proteolytic cleavage to produce a 45 kDa N-terminal fragment with a paired-like homeodomain and a 20 kDa C-terminal fragment with a proline-rich domain. The C-terminal fragment localizes to the cytoplasm while the N-terminal fragment localizes exclusively to the nucleus. In contrast to human, the mouse homolog has a novel PN/PF motif in the C-terminus and is paternally imprinted in placental tissue. This gene likely plays a role in placental development and spermatogenesis.

Source: NCBI Gene 80712 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 79 total — 1 pathogenic, 2 likely-pathogenic
  • MANE Select transcript: NM_153448

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:14865
Approved symbolESX1
NameESX homeobox 1
LocationXq22.2
Locus typegene with protein product
StatusApproved
AliasesESXR1
Ensembl geneENSG00000123576
Ensembl biotypeprotein_coding
OMIM300154
Entrez80712

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000372588

RefSeq mRNA: 1 — MANE Select: NM_153448 NM_153448

CCDS: CCDS14516

Canonical transcript exons

ENST00000372588 — 4 exons

ExonStartEnd
ENSE00000840326104254154104254577
ENSE00000840327104252783104252828
ENSE00001458161104250038104250896
ENSE00001458163104254768104254933

Expression profiles

Bgee: expression breadth broad, 23 present calls, max score 81.24.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.1428 / max 69.8515, expressed in 38 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
2000360.105827
2000370.037016

Top tissues by expression

244 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047381.24gold quality
right testisUBERON:000453475.55gold quality
left testisUBERON:000453373.77gold quality
testisUBERON:000047373.45gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099171.09gold quality
adult organismUBERON:000702363.42gold quality
lower lobe of lungUBERON:000894954.22silver quality
epithelium of nasopharynxUBERON:000195153.07gold quality
colonic epitheliumUBERON:000039749.55gold quality
spermCL:000001943.92silver quality
skeletal muscle tissue of rectus abdominisUBERON:000451143.37gold quality
secondary oocyteCL:000065542.57gold quality
pharyngeal mucosaUBERON:000035542.45gold quality
vastus lateralisUBERON:000137941.41gold quality
quadriceps femorisUBERON:000137741.37gold quality
superficial temporal arteryUBERON:000161441.33gold quality
skin of hipUBERON:000155441.23silver quality
inferior vagus X ganglionUBERON:000536341.12gold quality
palpebral conjunctivaUBERON:000181241.10gold quality
subthalamic nucleusUBERON:000190641.03gold quality
mucosa of paranasal sinusUBERON:000503040.98gold quality
amniotic fluidUBERON:000017340.69gold quality
jejunal mucosaUBERON:000039940.59gold quality
biceps brachiiUBERON:000150740.57gold quality
myocardiumUBERON:000234940.45gold quality
gingival epitheliumUBERON:000194940.43gold quality
germinal epithelium of ovaryUBERON:000130440.33gold quality
esophagus squamous epitheliumUBERON:000692040.29gold quality
skeletal muscle tissue of biceps brachiiUBERON:000450240.27gold quality
dorsal plus ventral thalamusUBERON:000189740.22gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no0.65

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

2 targets.

TargetRegulation
KRASRepression
PTPRVP

JASPAR motifs

MotifNameFamily
MA0644.1ESX1Paired-related HD factors
MA0644.2ESX1Paired-related HD factors
MA0644.3ESX1Paired-related HD factors

JASPAR matrix evidence (PMIDs): PMID:18585359

Upstream regulators (CollecTRI, top): DLX3, SMAD6

miRNA regulators (miRDB)

13 targeting ESX1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-569699.9872.364487
HSA-MIR-314899.9775.066478
HSA-MIR-548AR-3P99.8571.263889
HSA-MIR-548AZ-3P99.8270.563549
HSA-MIR-548BC99.8270.613524
HSA-MIR-548E-3P99.8270.593514
HSA-MIR-548F-3P99.8270.593540
HSA-MIR-548A-3P99.7670.583524
HSA-MIR-548G-3P99.4868.672159
HSA-MIR-6868-5P99.0665.691284
HSA-MIR-451198.3267.971500
HSA-MIR-3126-3P97.1766.51468
HSA-MIR-7108-5P96.4266.17598

Literature-anchored findings (GeneRIF, showing 8)

  • proteolytic processing of ESXR1 plays a role in concerted regulation of the cell cycle and transcription in human cells (PMID:15235584)
  • ESX1L is expressed during all stages of placental development and is localized to sparse areas of trophoblast in terminal villi in association with cytotrophoblastic cells (PMID:15601583)
  • In FGR-affected placentae, ESX1L levels were lower than in gestation-matched controls, indicating a potential role for the ESX1L gene within the growth control mechanism of the fetus, through its effect on placental function. (PMID:16613891)
  • ESX1 emerges as a potentially reliable molecular marker of residual spermatogenesis in azoospermic men (PMID:20356899)
  • ESX1 mRNA level decreases as the severity of spermatogenic defects increase in non-obstructive azoospermic men. (PMID:25316452)
  • Could analysis of testis-specific genes, as biomarkers in seminal plasma, predict presence of focal spermatogenesis in non-obstructive azoospermia? (PMID:31793700)
  • Association of ESX1 gene variants with non-obstructive azoospermia in Chinese males. (PMID:33633269)
  • ESX1 gene as a potential candidate responsible for male infertility in nonobstructive azoospermia. (PMID:37783880)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusEsx1ENSMUSG00000023443
rattus_norvegicusAABR07040695.1ENSRNOG00000048011

Paralogs (50): ARX (ENSG00000004848), PAX6 (ENSG00000007372), PAX7 (ENSG00000009709), ALX4 (ENSG00000052850), GSC2 (ENSG00000063515), PITX1 (ENSG00000069011), PAX2 (ENSG00000075891), RHOXF1 (ENSG00000101883), CRX (ENSG00000105392), EVX1 (ENSG00000106038), PAX4 (ENSG00000106331), NOBOX (ENSG00000106410), PITX3 (ENSG00000107859), PHOX2B (ENSG00000109132), OTX1 (ENSG00000115507), PRRX1 (ENSG00000116132), VSX2 (ENSG00000119614), PAX8 (ENSG00000125618), PAX1 (ENSG00000125813), RHOXF2 (ENSG00000131721), GSC (ENSG00000133937), RAX (ENSG00000134438), PAX3 (ENSG00000135903), ALX3 (ENSG00000156150), HESX1 (ENSG00000163666), PITX2 (ENSG00000164093), UNCX (ENSG00000164853), PHOX2A (ENSG00000165462), OTX2 (ENSG00000165588), DRGX (ENSG00000165606), PRRX2 (ENSG00000167157), SHOX2 (ENSG00000168779), OTP (ENSG00000171540), RAX2 (ENSG00000173976), EVX2 (ENSG00000174279), PROP1 (ENSG00000175325), ISX (ENSG00000175329), ALX1 (ENSG00000180318), MIXL1 (ENSG00000185155), SHOX (ENSG00000185960)

Protein

Protein identifiers

Homeobox protein ESX1Q8N693 (reviewed: Q8N693)

Alternative names: Extraembryonic, spermatogenesis, homeobox 1

All UniProt accessions (1): Q8N693

UniProt curated annotations — full annotation on UniProt →

Function. May coordinately regulate cell cycle progression and transcription during spermatogenesis. Inhibits degradation of polyubiquitinated cyclin A and cyclin B1 and thereby arrests the cell cycle at early M phase. ESXR1-N acts as a transcriptional repressor. Binds to the sequence 5’-TAATGTTATTA-3’ which is present within the first intron of the KRAS gene and inhibits its expression. ESXR1-C has the ability to inhibit cyclin turnover.

Subcellular location. Cytoplasm. Nucleus.

Tissue specificity. Expressed in placenta and testis. Expressed in testicular germ cell tumors.

Post-translational modifications. Undergoes proteolytic cleavage; produces a 45 kDa N-terminal homeodomain-containing fragment (ESXR1-N) and a 20 kDa C-terminal fragment (ESXR1-C).

RefSeq proteins (1): NP_703149* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001356HDDomain
IPR009057Homeodomain-like_sfHomologous_superfamily
IPR013847POUDomain
IPR017970Homeobox_CSConserved_site
IPR050649Paired_Homeobox_TFsFamily

Pfam: PF00046

UniProt features (30 total): repeat 15, region of interest 4, chain 3, compositionally biased region 3, sequence conflict 2, DNA-binding region 1, short sequence motif 1, sequence variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8N693-F162.350.15

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 35 (showing top): BENPORATH_ES_WITH_H3K27ME3, GOBP_NEUROGENESIS, GOBP_REGULATION_OF_CELL_CYCLE, MARTIN_VIRAL_GPCR_SIGNALING_DN, CONRAD_GERMLINE_STEM_CELL, GOCC_NUCLEAR_SPECK, GOCC_NUCLEAR_BODY, GOCC_RIBONUCLEOPROTEIN_GRANULE, GOMF_SEQUENCE_SPECIFIC_DNA_BINDING, MEISSNER_BRAIN_HCP_WITH_H3K27ME3, HATADA_METHYLATED_IN_LUNG_CANCER_UP, chrXq22, GOBP_NEGATIVE_REGULATION_OF_TRANSCRIPTION_BY_RNA_POLYMERASE_II, GOBP_NEGATIVE_REGULATION_OF_NUCLEOBASE_CONTAINING_COMPOUND_METABOLIC_PROCESS, WANG_MLL_TARGETS

GO Biological Process (6): negative regulation of transcription by RNA polymerase II (GO:0000122), regulation of transcription by RNA polymerase II (GO:0006357), negative regulation of DNA-templated transcription (GO:0045892), neuron development (GO:0048666), regulation of cell cycle (GO:0051726), regulation of DNA-templated transcription (GO:0006355)

GO Molecular Function (6): RNA polymerase II transcription regulatory region sequence-specific DNA binding (GO:0000977), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA-binding transcription repressor activity, RNA polymerase II-specific (GO:0001227), sequence-specific DNA binding (GO:0043565), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677)

GO Cellular Component (4): chromatin (GO:0000785), nucleus (GO:0005634), cytoplasm (GO:0005737), nuclear speck (GO:0016607)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
regulation of transcription by RNA polymerase II2
transcription by RNA polymerase II2
regulation of DNA-templated transcription2
DNA-templated transcription2
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
cellular anatomical structure2
negative regulation of DNA-templated transcription1
negative regulation of RNA biosynthetic process1
neuron differentiation1
cell development1
cell cycle1
regulation of cellular process1
regulation of gene expression1
regulation of RNA biosynthetic process1
transcription cis-regulatory region binding1
chromatin1
DNA-binding transcription factor activity1
negative regulation of transcription by RNA polymerase II1
DNA-binding transcription factor activity, RNA polymerase II-specific1
DNA-binding transcription repressor activity1
DNA binding1
double-stranded DNA binding1
sequence-specific DNA binding1
nucleic acid binding1
chromosome1
intracellular membrane-bounded organelle1
intracellular anatomical structure1
nuclear ribonucleoprotein granule1

Protein interactions and networks

STRING

1538 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
ESX1PDE6BP35913869
ESX1LHX4Q969G2823
ESX1POU1F1P28069783
ESX1LHX3Q9UBR4774
ESX1ELF3P78545771
ESX1SOX3P35714718
ESX1SOX2P48431684
ESX1PROKR2Q8NFJ6647
ESX1GHRHRQ02643578
ESX1POU5F1P31359575
ESX1TBX19O60806554
ESX1CGASQ8N884549
ESX1RD3Q7Z3Z2519
ESX1ANOS1P23352489
ESX1CALCOCO2Q13137484

IntAct

8 interactions, top by confidence:

ABTypeScore
ESX1psi-mi:“MI:0915”(physical association)0.520
ESX1psi-mi:“MI:0915”(physical association)0.520
ESX1Dlg4psi-mi:“MI:0407”(direct interaction)0.440
TRAP1ESX1psi-mi:“MI:0915”(physical association)0.370
ESX1EXOC2psi-mi:“MI:0915”(physical association)0.370
ESX1psi-mi:“MI:0914”(association)0.350

BioGRID (6): ESX1 (Affinity Capture-MS), ESX1 (Co-fractionation), ESX1 (Co-fractionation), ESX1 (Affinity Capture-RNA), EXOC2 (Two-hybrid), TRAP1 (Two-hybrid)

ESM2 similar proteins: A0A087WUL8, A0A0J9YWL9, A0A0J9YY54, A0A0U1RQI7, A6NJ88, A6QL64, B4DH59, D3ZVV1, E9Q6E9, F1LWT0, O04492, O88799, P0DKJ7, P0DKJ8, P0DKL2, P0DPF3, P18583, P53353, Q08AG5, Q0P6D6, Q2EG98, Q3BBV2, Q4ZJZ1, Q5HY64, Q5JPF3, Q5QGU6, Q5TAG4, Q5TI25, Q5XHX6, Q6P3W6, Q6P902, Q6XPR3, Q6ZQX7, Q86T75, Q86VE3, Q86VQ3, Q8N2N9, Q8N660, Q8N693, Q96EQ9

Diamond homologs: A0A1W2PPK0, A0A1W2PPM1, A1A546, A1YEY5, A1YFI3, A1YG57, A1YGA2, A2T733, A2T777, A2T7P4, A6NFQ7, G5EC89, L8E946, O14813, O15499, O35690, O42250, O42356, O42357, O42477, O70137, O73917, O75360, O95076, O97670, P0DMV5, P26367, P26630, P29454, P41935, P47237, P47238, P53544, P53545, P53546, P54366, P55813, P55864, P56915, P56916

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

79 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic2
Uncertain significance54
Likely benign15
Benign2

Top pathogenic / likely-pathogenic (3)

Variant IDHGVSClassification
1706520GRCh37/hg19 Xq21.1-28(chrX:77670699-155233731)x1Pathogenic
1526837GRCh37/hg19 Xq22.2-22.3(chrX:103046942-104214535)Likely pathogenic
2690946NM_153448.4(ESX1):c.1094C>G (p.Pro365Arg)Likely pathogenic

SpliceAI

372 predictions. Top by Δscore:

VariantEffectΔscore
X:104251617:T:TAdonor_gain1.0000
X:104252830:T:Cacceptor_gain1.0000
X:104254147:CACTT:Cdonor_loss1.0000
X:104254148:ACTT:Adonor_loss1.0000
X:104254149:CTTA:Cdonor_loss1.0000
X:104254150:TTAC:Tdonor_loss1.0000
X:104254151:TA:Tdonor_loss1.0000
X:104254152:A:ACdonor_gain1.0000
X:104254152:A:AGdonor_loss1.0000
X:104254153:C:CGdonor_gain1.0000
X:104254153:CCG:Cdonor_gain1.0000
X:104254153:CCGCG:Cdonor_gain1.0000
X:104254776:T:TAdonor_gain1.0000
X:104250892:CAAAC:Cacceptor_gain0.9900
X:104250894:AACC:Aacceptor_loss0.9900
X:104250897:CTT:Cacceptor_loss0.9900
X:104250900:C:CTacceptor_gain0.9900
X:104251618:C:Adonor_gain0.9900
X:104252830:T:TCacceptor_gain0.9900
X:104254152:AC:Adonor_gain0.9900
X:104254152:ACCG:Adonor_gain0.9900
X:104254153:CC:Cdonor_gain0.9900
X:104254153:CCGC:Cdonor_gain0.9900
X:104254777:C:Adonor_gain0.9900
X:104250894:AAC:Aacceptor_gain0.9800
X:104250895:AC:Aacceptor_gain0.9800
X:104250896:CC:Cacceptor_gain0.9800
X:104250897:C:CCacceptor_gain0.9800
X:104250897:C:Tacceptor_gain0.9800
X:104250901:A:Tacceptor_gain0.9800

AlphaMissense

2614 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
X:104250888:A:CF187L0.999
X:104250888:A:TF187L0.999
X:104250890:A:GF187L0.999
X:104254222:G:CF146L0.999
X:104254222:G:TF146L0.999
X:104254224:A:GF146L0.999
X:104250889:A:GF187S0.998
X:104254186:G:CF158L0.998
X:104254186:G:TF158L0.998
X:104254188:A:GF158L0.998
X:104250876:T:AR191S0.997
X:104250876:T:GR191S0.997
X:104250879:T:AR190S0.997
X:104250879:T:GR190S0.997
X:104250891:C:AW186C0.997
X:104250891:C:GW186C0.997
X:104250893:A:GW186R0.997
X:104250893:A:TW186R0.997
X:104254199:A:GL154P0.997
X:104254223:A:CF146C0.997
X:104254223:A:GF146S0.997
X:104250880:C:AR190I0.996
X:104250880:C:GR190T0.996
X:104250884:T:CN189D0.996
X:104254172:T:CY163C0.996
X:104254208:A:GL151P0.996
X:104250870:C:AK193N0.995
X:104250870:C:GK193N0.995
X:104250877:C:GR191T0.995
X:104250883:T:CN189S0.995

dbSNP variants (sampled 300 via entrez): RS1000400132 (X:104255795 C>T), RS1000703516 (X:104253453 T>C), RS1001115927 (X:104253870 T>C), RS1002103890 (X:104252397 T>C), RS1003156087 (X:104250256 T>C), RS1004245810 (X:104256513 T>A,G), RS1004618930 (X:104255890 C>A,G), RS1006231366 (X:104252359 C>T), RS1006633442 (X:104251999 T>C,G), RS1007734351 (X:104251916 G>C), RS1009462543 (X:104253429 C>A), RS1010446 (X:104251066 C>A,G,T), RS1011178582 (X:104251264 T>C), RS1011976475 (X:104250207 A>T), RS1012334049 (X:104249847 G>A)

Disease associations

OMIM: gene MIM:300154 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (1): Male infertility with azoospermia or oligozoospermia due to single gene mutation (Orphanet:399805)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

9 total (human), top 9 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects expression, increases methylation2
sodium arsenitedecreases expression1
butyraldehydeincreases expression1
pentanalincreases expression1
entinostatincreases expression1
(+)-JQ1 compounddecreases expression1
Arsenic Trioxideincreases expression1
Benzo(a)pyreneaffects methylation, increases methylation1
Dustdecreases expression1

Cellosaurus cell lines

3 cell lines: 3 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A1L9SEES3-1V human ESX1, clone1Embryonic stem cellMale
CVCL_A1M0SEES3-1V human ESX1, clone2Embryonic stem cellMale
CVCL_A1M1SEES3-1V human ESX1, clone3Embryonic stem cellMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.