EMX1

gene
On this page

Summary

EMX1 (empty spiracles homeobox 1, HGNC:3340) is a protein-coding gene on chromosome 2p13.2, encoding Homeobox protein EMX1 (Q04741). Transcription factor, which in cooperation with EMX2, acts to generate the boundary between the roof and archipallium in the developing brain.

Enables sequence-specific double-stranded DNA binding activity. Predicted to be involved in brain development; neuron differentiation; and regulation of transcription by RNA polymerase II. Predicted to act upstream of or within several processes, including nervous system development; neuroepithelial cell differentiation; and regulation of oligodendrocyte progenitor proliferation. Located in chromosome and nucleolus.

Source: NCBI Gene 2016 — RefSeq curated summary.

At a glance

  • GWAS associations: 4
  • Clinical variants (ClinVar): 41 total — 1 pathogenic
  • MANE Select transcript: NM_004097

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:3340
Approved symbolEMX1
Nameempty spiracles homeobox 1
Location2p13.2
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000135638
Ensembl biotypeprotein_coding
OMIM600034
Entrez2016

Gene structure

Transcript identifiers

Ensembl transcripts: 7 — 4 protein_coding, 2 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000258106, ENST00000394111, ENST00000464675, ENST00000473732, ENST00000491023, ENST00000616281, ENST00000967897

RefSeq mRNA: 1 — MANE Select: NM_004097 NM_004097

CCDS: CCDS1921

Canonical transcript exons

ENST00000258106 — 3 exons

ExonStartEnd
ENSE000008466137291751972918372
ENSE000023248197293378772934891
ENSE000035832467292430972924493

Expression profiles

Bgee: expression breadth ubiquitous, 116 present calls, max score 95.86.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.6168 / max 60.1254, expressed in 179 samples.

FANTOM5 promoters (6 alternative TSS)

Promoter IDTPM avgSamples expressed
209340.3447140
209360.128163
209370.054831
209350.033914
209330.030510
209380.02489

Top tissues by expression

258 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
ganglionic eminenceUBERON:000402395.86gold quality
cortical plateUBERON:000534393.79gold quality
ventricular zoneUBERON:000305392.95gold quality
oocyteCL:000002390.78gold quality
metanephros cortexUBERON:001053389.53gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099188.65gold quality
right frontal lobeUBERON:000281087.38gold quality
adult mammalian kidneyUBERON:000008286.68gold quality
Brodmann (1909) area 9UBERON:001354085.37gold quality
prefrontal cortexUBERON:000045185.22gold quality
cingulate cortexUBERON:000302784.17gold quality
anterior cingulate cortexUBERON:000983584.00gold quality
dorsolateral prefrontal cortexUBERON:000983483.89gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047383.23gold quality
secondary oocyteCL:000065582.56gold quality
neocortexUBERON:000195081.44gold quality
frontal cortexUBERON:000187080.62gold quality
cerebral cortexUBERON:000095678.62gold quality
embryoUBERON:000092277.84gold quality
kidneyUBERON:000211377.38gold quality
amygdalaUBERON:000187676.02gold quality
primary visual cortexUBERON:000243675.53gold quality
cortex of kidneyUBERON:000122574.01gold quality
Ammon’s hornUBERON:000195471.82gold quality
mucosa of transverse colonUBERON:000499169.22gold quality
left testisUBERON:000453369.01gold quality
right testisUBERON:000453468.98gold quality
occipital lobeUBERON:000202168.48gold quality
temporal lobeUBERON:000187168.44gold quality
renal medullaUBERON:000036268.02gold quality

Single-cell (SCXA)

Detected in 3 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-CURD-119yes42.53
E-HCAD-5yes33.63
E-ANND-3no1.29

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

2 targets.

TargetRegulation
APC
NRP1Activation

JASPAR motifs

MotifNameFamily
MA0612.1EMX1NK
MA0612.2EMX1NK
MA0612.3EMX1NK

JASPAR matrix evidence (PMIDs): PMID:23332764, PMID:25215497

Upstream regulators (CollecTRI, top): SIX3

miRNA regulators (miRDB)

34 targeting EMX1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-4795-3P100.0074.624024
HSA-MIR-453199.9969.703181
HSA-MIR-314899.9775.066478
HSA-MIR-6778-3P99.9667.292693
HSA-MIR-612499.8769.783551
HSA-MIR-3151-5P99.8663.831069
HSA-MIR-6892-3P99.6866.401178
HSA-MIR-875-3P99.6369.472548
HSA-MIR-1915-3P99.5866.791988
HSA-MIR-18A-3P99.5665.681092
HSA-MIR-642A-5P99.5165.101152
HSA-MIR-427999.1966.702437
HSA-MIR-3190-5P98.8764.891345
HSA-MIR-7851-3P98.7264.88980
HSA-MIR-393898.7266.07834
HSA-MIR-1227-5P98.6565.321549
HSA-MIR-4646-3P98.6566.98693
HSA-MIR-4700-5P98.6367.431915
HSA-MIR-950098.6266.541845
HSA-MIR-6780A-3P98.4267.491518
HSA-MIR-3155A98.1666.09965
HSA-MIR-3155B98.1666.09965
HSA-MIR-48498.1666.921074
HSA-MIR-808997.7466.211698
HSA-MIR-4667-5P97.6166.671683
HSA-MIR-6779-3P97.5165.82789
HSA-MIR-311697.0765.781324
HSA-MIR-6773-5P97.0464.30595
HSA-MIR-6724-5P96.4163.11507
HSA-MIR-129196.2865.891224

Literature-anchored findings (GeneRIF, showing 5)

  • An intersectional fate-mapping analysis reveals that transgenic Emx1-lineage cells coexpressing Dlx exclusively generate medium spiny neurons but do not contribute to the excitatory neurons in the amygdala. (PMID:20016109)
  • Regulation of sarcomagenesis by the empty spiracles homeobox genes EMX1 and EMX2. (PMID:34016958)
  • Empty spiracles homeobox genes EMX1 and EMX2 regulate WNT pathway activation in sarcomagenesis. (PMID:34364391)
  • EMX1 functions as a tumor inhibitor in spinal cord glioma through transcriptional suppression of WASF2 and inactivation of the Wnt/beta-catenin axis. (PMID:35849030)
  • Generation of human excitatory forebrain neurons by cooperative binding of proneural NGN2 and homeobox factor EMX1. (PMID:38446849)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
danio_rerioemx1ENSDARG00000039569
mus_musculusEmx1ENSMUSG00000033726
rattus_norvegicusEmx1ENSRNOG00000015493

Paralogs (3): VAX2 (ENSG00000116035), VAX1 (ENSG00000148704), EMX2 (ENSG00000170370)

Protein

Protein identifiers

Homeobox protein EMX1Q04741 (reviewed: Q04741)

Alternative names: Empty spiracles homolog 1, Empty spiracles-like protein 1

All UniProt accessions (3): A0A087WZF2, Q04741, F8W1B5

UniProt curated annotations — full annotation on UniProt →

Function. Transcription factor, which in cooperation with EMX2, acts to generate the boundary between the roof and archipallium in the developing brain. May function in combinations with OTX1/2 to specify cell fates in the developing central nervous system.

Subunit / interactions. Interacts with WRD11 (via the N-terminal and the central portion of the protein); the interaction associates EMX1 with GLI3.

Subcellular location. Nucleus. Cytoplasm.

Tissue specificity. Cerebral cortex.

Similarity. Belongs to the EMX homeobox family.

Isoforms (2)

UniProt IDNamesCanonical?
Q04741-11yes
Q04741-22

RefSeq proteins (1): NP_004088* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001356HDDomain
IPR009057Homeodomain-like_sfHomologous_superfamily
IPR017970Homeobox_CSConserved_site
IPR020479HD_metazoaDomain
IPR050877EMX-VAX-Noto_Homeobox_TFsFamily

Pfam: PF00046

UniProt features (6 total): splice variant 2, chain 1, DNA-binding region 1, region of interest 1, compositionally biased region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q04741-F164.070.20

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 104 (showing top): RNGTGGGC_UNKNOWN, GOBP_EMBRYO_DEVELOPMENT_ENDING_IN_BIRTH_OR_EGG_HATCHING, GOBP_EPITHELIUM_DEVELOPMENT, MYOGENIN_Q6, GOBP_NEURON_PROJECTION_EXTENSION, GCANCTGNY_MYOD_Q6, GOBP_GROWTH, GOBP_NEUROGENESIS, GOBP_FOREBRAIN_DEVELOPMENT, GOBP_FOREBRAIN_REGIONALIZATION, GOBP_FOREBRAIN_GENERATION_OF_NEURONS, GOBP_NEURAL_PRECURSOR_CELL_PROLIFERATION, GOBP_CEREBRAL_CORTEX_NEURON_DIFFERENTIATION, GOBP_ANIMAL_ORGAN_MORPHOGENESIS, GOBP_IN_UTERO_EMBRYONIC_DEVELOPMENT

GO Biological Process (18): in utero embryonic development (GO:0001701), regulation of transcription by RNA polymerase II (GO:0006357), central nervous system development (GO:0007417), brain development (GO:0007420), response to xenobiotic stimulus (GO:0009410), post-embryonic development (GO:0009791), cerebral cortex regionalization (GO:0021796), cerebral cortex neuron differentiation (GO:0021895), neuron differentiation (GO:0030182), brain morphogenesis (GO:0048854), homeostasis of number of cells (GO:0048872), radial glial cell differentiation (GO:0060019), neuroepithelial cell differentiation (GO:0060563), regulation of oligodendrocyte progenitor proliferation (GO:0070445), neuron projection extension (GO:1990138), regulation of DNA-templated transcription (GO:0006355), telencephalon development (GO:0021537), cerebral cortex development (GO:0021987)

GO Molecular Function (5): RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677), protein binding (GO:0005515)

GO Cellular Component (5): chromatin (GO:0000785), nucleus (GO:0005634), chromosome (GO:0005694), nucleolus (GO:0005730), cytoplasm (GO:0005737)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
anatomical structure development2
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
cellular anatomical structure2
intracellular membraneless organelle2
chordate embryonic development1
regulation of DNA-templated transcription1
transcription by RNA polymerase II1
nervous system development1
system development1
central nervous system development1
animal organ development1
head development1
response to chemical1
multicellular organism development1
multicellular organismal process1
regionalization1
telencephalon regionalization1
cerebral cortex development1
forebrain neuron differentiation1
cell differentiation1
generation of neurons1
brain development1
animal organ morphogenesis1
multicellular organismal-level homeostasis1
glial cell differentiation1
columnar/cuboidal epithelial cell differentiation1
oligodendrocyte progenitor proliferation1
regulation of neural precursor cell proliferation1
developmental cell growth1
neuron projection morphogenesis1
developmental growth involved in morphogenesis1
DNA-templated transcription1
regulation of gene expression1
regulation of RNA biosynthetic process1
forebrain development1
pallium development1
cis-regulatory region sequence-specific DNA binding1
chromatin1
DNA-binding transcription factor activity1
regulation of transcription by RNA polymerase II1

Protein interactions and networks

STRING

1448 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
EMX1TBR1Q16650804
EMX1FOXG1P55315785
EMX1NEUROD6Q96NK8714
EMX1SHHQ15465668
EMX1FGF8P55075631
EMX1BCL11BQ9C0K0631
EMX1NESP48681626
EMX1SATB2Q9UPW6624
EMX1EOMESO95936621
EMX1FANCFQ9NPI8619
EMX1CUX1P39880615
EMX1NEUROG2Q9H2A3613
EMX1PVALBP20472608
EMX1FEZF2Q8TBJ5601
EMX1CAMK2AQ9UQM7587

IntAct

13 interactions, top by confidence:

ABTypeScore
WDR11EMX1psi-mi:“MI:0915”(physical association)0.500
GLI3EMX1psi-mi:“MI:0915”(physical association)0.500
GLI3EMX1psi-mi:“MI:0914”(association)0.500
EMX1GTF2A1Lpsi-mi:“MI:0915”(physical association)0.370
EMX1PRDM4psi-mi:“MI:0915”(physical association)0.370
ELK1EMX1psi-mi:“MI:0915”(physical association)0.370
EMX1HOXA10psi-mi:“MI:0915”(physical association)0.370
EMX1POLR2Dpsi-mi:“MI:0915”(physical association)0.370
EMX1ALX4psi-mi:“MI:0915”(physical association)0.370
EMX1TP63psi-mi:“MI:0915”(physical association)0.370
WDR11GLI3psi-mi:“MI:0914”(association)0.350

BioGRID (9): EMX1 (Protein-RNA), HSPH1 (Cross-Linking-MS (XL-MS)), EMX1 (Two-hybrid), HOXA10 (Two-hybrid), POLR2D (Two-hybrid), TP63 (Two-hybrid), GTF2A1L (Two-hybrid), PRDM4 (Two-hybrid), ALX4 (Two-hybrid)

ESM2 similar proteins: A0JPN1, A1YEY5, A1YF16, A1YFI3, A1YG57, A1YG93, A2T733, A2T764, A2T7P4, A6NJ46, A7MB54, O02786, O42230, O43711, O43763, O55144, O57601, O93367, O95096, P09633, P31274, P35548, P42586, P43120, P43345, P43697, P48031, P50223, P52946, P52947, P52951, P53545, P56915, P63156, P63157, Q02591, Q03014, Q03358, Q04741, Q04742

Diamond homologs: A2RU54, A5YC49, A6NHT5, A6NJ46, A8XJD0, D2KQB0, O08686, O35762, O42230, O43711, O43763, O55144, O57601, O70218, O88181, O93366, O93367, O93590, P02830, P15858, P19601, P20269, P22544, P22807, P22809, P28468, P31314, P40764, P42580, P42581, P43345, P43687, P48031, P52950, P53547, P53772, P53775, P56407, P63156, P63157

SIGNOR signaling

1 interactions.

AEffectBMechanism
EMX1“up-regulates quantity by expression”NRP1“transcriptional regulation”

Disease & clinical

Clinical variants and AI predictions

ClinVar

41 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance40
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
60181GRCh38/hg38 2p13.2(chr2:72861062-72992130)x1Pathogenic

SpliceAI

607 predictions. Top by Δscore:

VariantEffectΔscore
2:72924267:T:TAacceptor_gain1.0000
2:72924271:T:TAacceptor_gain1.0000
2:72924274:A:AGacceptor_gain1.0000
2:72924274:ACCT:Aacceptor_gain1.0000
2:72924275:C:Gacceptor_gain1.0000
2:72924277:T:TAacceptor_gain1.0000
2:72924279:C:CAacceptor_gain1.0000
2:72924491:CAGGT:Cdonor_loss1.0000
2:72924492:AGG:Adonor_loss1.0000
2:72924493:GGT:Gdonor_loss1.0000
2:72924494:GT:Gdonor_loss1.0000
2:72924495:T:Gdonor_loss1.0000
2:72933779:T:Aacceptor_gain1.0000
2:72918371:GG:Gdonor_gain0.9900
2:72918372:GG:Gdonor_gain0.9900
2:72924281:T:TAacceptor_gain0.9900
2:72924283:T:TAacceptor_gain0.9900
2:72924286:T:Aacceptor_gain0.9900
2:72924289:C:Aacceptor_gain0.9900
2:72924292:C:CAacceptor_gain0.9900
2:72924295:C:CAacceptor_gain0.9900
2:72924490:GCAG:Gdonor_gain0.9900
2:72933783:CCAG:Cacceptor_loss0.9900
2:72933784:CAGGT:Cacceptor_loss0.9900
2:72933785:A:Gacceptor_loss0.9900
2:72933786:G:GTacceptor_loss0.9900
2:72924298:C:CAacceptor_gain0.9800
2:72924494:G:GGdonor_gain0.9800
2:72933780:G:Aacceptor_gain0.9800
2:72933785:A:AGacceptor_gain0.9800

AlphaMissense

1862 scored. Top likely-pathogenic:

dbSNP variants (sampled 300 via entrez): RS1000459269 (2:72917019 G>A), RS1000488950 (2:72917279 C>G,T), RS1000605674 (2:72922795 C>T), RS1000792836 (2:72918981 G>A), RS1000805139 (2:72929268 C>G), RS1000948490 (2:72925118 T>C), RS1001002678 (2:72914903 G>T), RS1001038049 (2:72915303 G>A,C), RS1001240730 (2:72934613 C>A,T), RS1001521130 (2:72916056 T>C), RS1001594889 (2:72930365 G>A), RS1001599260 (2:72921262 G>A,T), RS1001672492 (2:72918555 G>T), RS1001744991 (2:72920965 T>C), RS1001777544 (2:72924231 C>T)

Disease associations

OMIM: gene MIM:600034 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

4 associations (top):

StudyTraitp-value
GCST004946_91Schizophrenia2.000000e-10
GCST007201_21Schizophrenia1.000000e-08
GCST010725_60Malaria7.000000e-06
GCST010725_79Malaria5.000000e-06

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

24 total (human), top 24 by PubMed support.

ChemicalActions (top 5)PubMed papers
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
S-(1,2-dichlorovinyl)cysteineincreases expression, affects response to substance1
CGP 52608increases reaction, affects binding1
pyrachlostrobindecreases expression1
bisphenol Sdecreases methylation1
jinfukangaffects cotreatment, increases expression1
picoxystrobindecreases expression1
Resveratrolaffects cotreatment, decreases expression1
Acetaminophendecreases expression1
Arsenicaffects methylation1
Benzo(a)pyreneincreases methylation1
Cisplatinaffects cotreatment, increases expression1
Copperdecreases expression, affects cotreatment1
Lipopolysaccharidesaffects response to substance, increases expression1
Rotenonedecreases expression1
Smokeincreases expression1
Testosteroneincreases expression1
Triclosanincreases expression1
Urethanedecreases expression1
Aflatoxin B1decreases methylation1
Asbestos, Crocidolitedecreases methylation1
Asbestos, Amositedecreases methylation1
Okadaic Aciddecreases expression1
Acrylamidedecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.