GSX2

gene
On this page

Also known as Gsh2

Summary

GSX2 (GS homeobox 2, HGNC:24959) is a protein-coding gene on chromosome 4q12, encoding GS homeobox 2 (Q9BZM3). Transcription factor that binds 5’-CNAATTAG-3’ DNA sequence and regulates the expression of numerous genes including genes important for brain development.

Enables DNA-binding transcription factor activity, RNA polymerase II-specific and sequence-specific double-stranded DNA binding activity. Predicted to be involved in generation of neurons; olfactory bulb development; and regulation of transcription by RNA polymerase II. Predicted to act upstream of or within several processes, including nervous system development; positive regulation of Notch signaling pathway; and positive regulation of oligodendrocyte differentiation. Located in cytoplasm and nucleus.

Source: NCBI Gene 170825 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): diencephalic-mesencephalic junction dysplasia syndrome 2 (Moderate, GenCC)
  • GWAS associations: 8
  • Clinical variants (ClinVar): 53 total — 1 pathogenic
  • Phenotypes (HPO): 11
  • MANE Select transcript: NM_133267

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:24959
Approved symbolGSX2
NameGS homeobox 2
Location4q12
Locus typegene with protein product
StatusApproved
AliasesGsh2
Ensembl geneENSG00000180613
Ensembl biotypeprotein_coding
OMIM616253
Entrez170825

Gene structure

Transcript identifiers

Ensembl transcripts: 3 — 2 protein_coding, 1 protein_coding_CDS_not_defined

ENST00000326902, ENST00000503800, ENST00000507839

RefSeq mRNA: 1 — MANE Select: NM_133267 NM_133267

CCDS: CCDS3494

Canonical transcript exons

ENST00000326902 — 2 exons

ExonStartEnd
ENSE000012496285410158254102498
ENSE000012496325410016354100918

Expression profiles

Bgee: expression breadth broad, 58 present calls, max score 72.25.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.1300 / max 86.0728, expressed in 34 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
476340.130034

Top tissues by expression

217 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
secondary oocyteCL:000065572.25gold quality
amygdalaUBERON:000187660.32gold quality
superficial temporal arteryUBERON:000161456.08gold quality
epithelium of nasopharynxUBERON:000195155.67gold quality
palpebral conjunctivaUBERON:000181255.05gold quality
putamenUBERON:000187454.55gold quality
caudate nucleusUBERON:000187354.35gold quality
temporal lobeUBERON:000187154.31gold quality
germinal epithelium of ovaryUBERON:000130453.78gold quality
esophagus squamous epitheliumUBERON:000692052.96gold quality
heart right ventricleUBERON:000208052.43gold quality
nasal cavity epitheliumUBERON:000538451.29gold quality
prefrontal cortexUBERON:000045150.67gold quality
postcentral gyrusUBERON:000258150.64gold quality
anterior cingulate cortexUBERON:000983549.58gold quality
nucleus accumbensUBERON:000188249.35gold quality
frontal cortexUBERON:000187049.11gold quality
parietal lobeUBERON:000187249.08gold quality
neocortexUBERON:000195048.95gold quality
cerebellar vermisUBERON:000472048.93gold quality
quadriceps femorisUBERON:000137748.88gold quality
C1 segment of cervical spinal cordUBERON:000646948.66gold quality
tibial nerveUBERON:000132348.50gold quality
Brodmann (1909) area 9UBERON:001354048.37gold quality
vastus lateralisUBERON:000137948.25gold quality
right frontal lobeUBERON:000281048.18gold quality
superior frontal gyrusUBERON:000266148.16gold quality
dorsolateral prefrontal cortexUBERON:000983448.11gold quality
spinal cordUBERON:000224048.00gold quality
cerebral cortexUBERON:000095647.76gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-HCAD-5yes20.50
E-ANND-3no1.15

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

3 targets.

TargetRegulation
ALDH1A3Activation
GSX2
INS

JASPAR motifs

MotifNameFamily
MA0893.1GSX2HOX
MA0893.2GSX2HOX
MA0893.3GSX2HOX

JASPAR matrix evidence (PMIDs): PMID:18585359

Upstream regulators (CollecTRI, top): GSX2, PAX6

Literature-anchored findings (GeneRIF, showing 8)

  • ectopically expressed in myeloid leukemic cells with t(4;12)(q11-q12;p13), suggesting that expression of GSH2 was deregulated by the translocation, indicating a variant leukemogenic mechanism for translocations involving the 5’ end of ETV6 (PMID:11861295)
  • Gsx2 specifies striatal projection neuron and olfactory bulb interneuron identity at distinct time points during neurogenesis in transgenic mice. (PMID:19709628)
  • Transgenic mice lacking microRNAs miR-9-2 and miR-9-3 exhibit multiple defects of telencephalic structures which may be brought about by dysregulation of Foxg1, Nr2e1, Gsh2, and Meis2 expression. (PMID:21368052)
  • Gsx2 negatively regulates neurogenesis from postnatal progenitor cells. (PMID:22242181)
  • 4q12 translocations with GSX2 expression identify a CD7(+) acute myeloid leukaemia subset. (PMID:25816740)
  • Data show that GS Homeobox 2 (Gsx2) and Early B-cell factor 1 (Ebf1) combined overexpression in human embryonic stem (hES) cells achieves high yields of medium spiny neurons (MSNs). (PMID:28137879)
  • This study identified and validated a three-gene (EMP3, GSX2 and EMILIN3) prognostic signature in Lower-Grade Gliomas. (PMID:29794476)
  • Silencing GS Homeobox 2 Alleviates Gemcitabine Resistance in Pancreatic Cancer Cells by Activating SHH/GLI1 Signaling Pathway. (PMID:34623580)

Cross-species orthologs

6 orthologs

OrganismSymbolGene ID
danio_reriogsx2ENSDARG00000043322
mus_musculusGsx2ENSMUSG00000035946
rattus_norvegicusGsx2ENSRNOG00000002266
drosophila_melanogasterzenFBGN0004053
drosophila_melanogasterzen2FBGN0004054
drosophila_melanogasterCG30480FBGN0050480

Paralogs (42): HOXA11 (ENSG00000005073), HOXC8 (ENSG00000037965), HOXA1 (ENSG00000105991), HOXA2 (ENSG00000105996), HOXA3 (ENSG00000105997), HOXA5 (ENSG00000106004), HOXA6 (ENSG00000106006), HOXA13 (ENSG00000106031), TLX1 (ENSG00000107807), HOXB6 (ENSG00000108511), TLX2 (ENSG00000115297), HOXB8 (ENSG00000120068), HOXB5 (ENSG00000120075), HOXB3 (ENSG00000120093), HOXB1 (ENSG00000120094), HOXA7 (ENSG00000122592), HOXC13 (ENSG00000123364), HOXC11 (ENSG00000123388), HOXC12 (ENSG00000123407), HOXD1 (ENSG00000128645), HOXD3 (ENSG00000128652), HOXD9 (ENSG00000128709), HOXD10 (ENSG00000128710), HOXD11 (ENSG00000128713), HOXD13 (ENSG00000128714), PDX1 (ENSG00000139515), HOXB13 (ENSG00000159184), TLX3 (ENSG00000164438), HOXD4 (ENSG00000170166), HOXD12 (ENSG00000170178), HOXB9 (ENSG00000170689), HOXC5 (ENSG00000172789), HOXB2 (ENSG00000173917), HOXD8 (ENSG00000175879), HOXC9 (ENSG00000180806), HOXC10 (ENSG00000180818), HOXB4 (ENSG00000182742), HOXA4 (ENSG00000197576), HOXC6 (ENSG00000197757), HOXC4 (ENSG00000198353)

Protein

Protein identifiers

GS homeobox 2Q9BZM3 (reviewed: Q9BZM3)

Alternative names: Genetic-screened homeobox 2, Homeobox protein GSH-2

All UniProt accessions (2): D6R903, Q9BZM3

UniProt curated annotations — full annotation on UniProt →

Function. Transcription factor that binds 5’-CNAATTAG-3’ DNA sequence and regulates the expression of numerous genes including genes important for brain development. During telencephalic development, causes ventralization of pallial progenitors and, depending on the developmental stage, specifies different neuronal fates. At early stages, necessary and sufficient to correctly specify the ventral lateral ganglionic eminence (LGE) and its major derivatives, the striatal projection neurons. At later stages, may specify LGE progenitors toward dorsal LGE fates, including olfactory bulb interneurons.

Subcellular location. Nucleus. Cytoplasm.

Disease relevance. Diencephalic-mesencephalic junction dysplasia syndrome 2 (DMJDS2) [MIM:618646] An autosomal recessive neurodevelopmental disorder with onset at birth, characterized by severe global developmental delay, hypotonia, spastic tetraparesis, generalized dystonia and severe intellectual impairment. Brain imaging shows a unique brain malformation characterized by agenesis of putamina and globi pallidi, dysgenesis of the caudate nuclei, olfactory bulbs hypoplasia, and anomaly of the diencephalic-mesencephalic junction with abnormal corticospinal tract course. The disease is caused by variants affecting the gene represented in this entry.

Similarity. Belongs to the Antp homeobox family.

RefSeq proteins (1): NP_573574* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001356HDDomain
IPR009057Homeodomain-like_sfHomologous_superfamily
IPR017970Homeobox_CSConserved_site
IPR020479HD_metazoaDomain
IPR042191GSH1/2Family

Pfam: PF00046

UniProt features (9 total): sequence variant 3, region of interest 2, compositionally biased region 2, chain 1, DNA-binding region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9BZM3-F161.180.20

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 122 (showing top): GOBP_SPINAL_CORD_DEVELOPMENT, GOBP_HINDBRAIN_DEVELOPMENT, GOBP_OLFACTORY_BULB_INTERNEURON_DIFFERENTIATION, GOBP_RESPIRATORY_GASEOUS_EXCHANGE_BY_RESPIRATORY_SYSTEM, BENPORATH_ES_WITH_H3K27ME3, GOBP_FOREBRAIN_MORPHOGENESIS, GOBP_REGULATION_OF_RESPIRATORY_SYSTEM_PROCESS, GOBP_POSITIVE_REGULATION_OF_GLIAL_CELL_DIFFERENTIATION, GOBP_POSITIVE_REGULATION_OF_NOTCH_SIGNALING_PATHWAY, GOBP_RESPIRATORY_SYSTEM_PROCESS, GOBP_NEUROGENESIS, GOBP_REGULATION_OF_NERVOUS_SYSTEM_DEVELOPMENT, GOBP_POSITIVE_REGULATION_OF_NERVOUS_SYSTEM_DEVELOPMENT, GOBP_CELL_DIFFERENTIATION_IN_SPINAL_CORD, GOBP_DORSAL_VENTRAL_PATTERN_FORMATION

GO Biological Process (23): regulation of respiratory gaseous exchange by nervous system process (GO:0002087), Notch signaling pathway (GO:0007219), spinal cord association neuron differentiation (GO:0021527), hindbrain morphogenesis (GO:0021575), forebrain dorsal/ventral pattern formation (GO:0021798), olfactory bulb interneuron differentiation (GO:0021889), telencephalon regionalization (GO:0021978), regulation of cell migration (GO:0030334), positive regulation of Notch signaling pathway (GO:0045747), neuron fate specification (GO:0048665), positive regulation of oligodendrocyte differentiation (GO:0048714), forebrain morphogenesis (GO:0048853), subpallium neuron fate commitment (GO:0060163), GABAergic neuron differentiation (GO:0097154), regulation of DNA-templated transcription (GO:0006355), regulation of transcription by RNA polymerase II (GO:0006357), pattern specification process (GO:0007389), central nervous system development (GO:0007417), brain development (GO:0007420), subpallium development (GO:0021544), olfactory bulb development (GO:0021772), neuron fate commitment (GO:0048663), generation of neurons (GO:0048699)

GO Molecular Function (4): DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677), DNA-binding transcription factor activity (GO:0003700)

GO Cellular Component (3): chromatin (GO:0000785), nucleus (GO:0005634), cytoplasm (GO:0005737)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
anatomical structure morphogenesis2
forebrain regionalization2
telencephalon development2
neuron fate commitment2
regulation of DNA-templated transcription2
cellular anatomical structure2
respiratory gaseous exchange by respiratory system1
regulation of respiratory system process1
nervous system process1
cell surface receptor signaling pathway1
cell differentiation in spinal cord1
dorsal spinal cord development1
central nervous system neuron differentiation1
hindbrain development1
dorsal/ventral pattern formation1
olfactory bulb development1
forebrain neuron differentiation1
regionalization1
cell migration1
regulation of cell motility1
Notch signaling pathway1
regulation of Notch signaling pathway1
positive regulation of signal transduction1
cell fate specification1
positive regulation of glial cell differentiation1
oligodendrocyte differentiation1
regulation of oligodendrocyte differentiation1
forebrain development1
brain morphogenesis1
subpallium development1
neuron differentiation1
DNA-templated transcription1
regulation of gene expression1
regulation of RNA biosynthetic process1
transcription by RNA polymerase II1
multicellular organism development1
multicellular organismal process1
nervous system development1
system development1
central nervous system development1

Protein interactions and networks

STRING

776 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GSX2ASCL1P50553684
GSX2FOXG1P55315662
GSX2TBR1Q16650628
GSX2LHX6Q9UPM6625
GSX2SP8Q8IXZ3621
GSX2NEUROG2Q9H2A3611
GSX2OLIG2Q13516595
GSX2SHHQ15465564
GSX2BCL11BQ9C0K0556
GSX2MEIS2O14770543
GSX2PPP1R1BQ9UD71541
GSX2CALB2P22676525
GSX2LHX8Q68G74512
GSX2EOMESO95936499
GSX2HES6Q96HZ4491

IntAct

3 interactions, top by confidence:

ABTypeScore
GSX2ABL1psi-mi:“MI:0915”(physical association)0.400
GSX2RNF4psi-mi:“MI:0914”(association)0.350

BioGRID (7): KLHL8 (Affinity Capture-MS), RNF4 (Affinity Capture-MS), C11orf31 (Affinity Capture-MS), GSX2 (Affinity Capture-Western), GSX2 (Affinity Capture-Western), SHH (Affinity Capture-Western), GLI1 (Affinity Capture-Western)

ESM2 similar proteins: B0W1V2, B0X9H6, B3DM47, B4H4H5, B5DLV1, E7FDX5, G5EC89, O15266, O22300, O35317, O35984, O73673, O76762, O77215, O93366, P05527, P10035, P10181, P22809, P23459, P29552, P39021, P40424, P40425, P40426, P41778, P49925, P53772, P53773, P56224, Q05007, Q05192, Q07961, Q1KKY1, Q1KL13, Q24648, Q27355, Q28ZA9, Q4V5A3, Q4VYR7

Diamond homologs: A9ZPC9, F1R2J1, O02491, O42370, O43365, O60479, O93353, O93528, P02831, P02836, P05527, P09015, P09065, P09066, P09075, P09076, P09089, P09090, P09145, P09532, P10105, P14150, P15858, P17487, P19622, P20009, P20269, P23397, P27609, P27610, P31316, P31533, P31534, P31535, P31536, P31537, P31538, P34326, P34684, P42580

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

53 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance37
Likely benign6
Benign7

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
57309GRCh38/hg38 4q12(chr4:51870025-55102392)x1Pathogenic

SpliceAI

440 predictions. Top by Δscore:

VariantEffectΔscore
4:54101572:C:Aacceptor_gain1.0000
4:54101581:GGA:Gacceptor_gain1.0000
4:54100994:G:GTdonor_gain0.9900
4:54101569:C:CAacceptor_gain0.9900
4:54101573:G:Aacceptor_gain0.9900
4:54101577:CGCA:Cacceptor_loss0.9900
4:54101578:GCA:Gacceptor_loss0.9900
4:54101579:CAGG:Cacceptor_gain0.9900
4:54101580:A:AGacceptor_gain0.9900
4:54101580:A:Gacceptor_loss0.9900
4:54101580:AGGA:Aacceptor_gain0.9900
4:54101580:AGGAG:Aacceptor_gain0.9900
4:54101581:G:GGacceptor_gain0.9900
4:54101581:GGAG:Gacceptor_gain0.9900
4:54101581:GGAGG:Gacceptor_gain0.9900
4:54101577:C:CAacceptor_gain0.9800
4:54101580:AG:Aacceptor_gain0.9800
4:54101581:GG:Gacceptor_gain0.9800
4:54102122:G:GTdonor_gain0.9800
4:54102126:T:Gdonor_gain0.9800
4:54101580:A:Tacceptor_loss0.9700
4:54100471:A:AGdonor_gain0.9600
4:54100917:GG:Gdonor_gain0.9600
4:54100918:GG:Gdonor_gain0.9600
4:54101108:GCCTG:Gdonor_gain0.9600
4:54102046:G:GTdonor_gain0.9600
4:54100915:ATGGG:Adonor_loss0.9500
4:54100916:TGGGT:Tdonor_loss0.9500
4:54100917:GGGTA:Gdonor_loss0.9500
4:54100919:G:Adonor_loss0.9500

AlphaMissense

1969 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
4:54101624:G:CR206T1.000
4:54101624:G:TR206M1.000
4:54101625:G:CR206S1.000
4:54101625:G:TR206S1.000
4:54101632:T:AF209I1.000
4:54101632:T:CF209L1.000
4:54101632:T:GF209V1.000
4:54101633:T:CF209S1.000
4:54101633:T:GF209C1.000
4:54101634:C:AF209L1.000
4:54101634:C:GF209L1.000
4:54101646:A:CQ213H1.000
4:54101646:A:TQ213H1.000
4:54101648:T:AL214H1.000
4:54101648:T:CL214P1.000
4:54101651:T:CL215P1.000
4:54101657:T:AL217Q1.000
4:54101657:T:CL217P1.000
4:54101657:T:GL217R1.000
4:54101660:A:TE218V1.000
4:54101661:G:CE218D1.000
4:54101661:G:TE218D1.000
4:54101668:T:AF221I1.000
4:54101668:T:CF221L1.000
4:54101668:T:GF221V1.000
4:54101669:T:CF221S1.000
4:54101669:T:GF221C1.000
4:54101670:C:AF221L1.000
4:54101670:C:GF221L1.000
4:54101683:T:CY226H1.000

dbSNP variants (sampled 300 via entrez): RS1000102491 (4:54102767 G>A), RS1000162520 (4:54099966 A>C,G), RS1000452885 (4:54102520 T>C), RS1000830511 (4:54098183 A>T), RS1002166620 (4:54101331 G>C,T), RS1003433221 (4:54100201 G>C,T), RS1003504568 (4:54102869 CTTCT>C), RS1003964952 (4:54100194 C>G), RS1005052943 (4:54098713 C>A,G), RS1005073765 (4:54099531 A>C), RS1006008654 (4:54102068 G>A), RS1006369061 (4:54101844 G>A), RS1006663111 (4:54098684 G>A,C), RS1008093986 (4:54100706 G>A,C), RS1010150612 (4:54102512 G>A)

Disease associations

OMIM: gene MIM:616253 | disease phenotypes: MIM:618646

GenCC curated gene-disease

DiseaseClassificationInheritance
diencephalic-mesencephalic junction dysplasia syndrome 2ModerateAutosomal recessive

ClinGen Gene-Disease Validity (1)

Expert-panel classifications — Definitive > Strong > Moderate > Limited > Disputed > Refuted.

DiseaseClassificationInheritance
diencephalic-mesencephalic junction dysplasia syndrome 2LimitedAR

Mondo (1): diencephalic-mesencephalic junction dysplasia syndrome 2 (MONDO:0020762)

Orphanet (0):

HPO phenotypes

11 total (11 of 11 shown, HPO-id order):

HPOTerm
HP:0000007Autosomal recessive inheritance
HP:0001332Dystonia
HP:0001344Absent speech
HP:0002134Abnormal basal ganglia morphology
HP:0002510Spastic tetraplegia
HP:0002540Inability to walk
HP:0003577Congenital onset
HP:0008872Feeding difficulties in infancy
HP:0011344Severe global developmental delay
HP:0012695Decreased thalamic volume
HP:0040326Hypoplasia of the olfactory bulb

GWAS associations

8 associations (top):

StudyTraitp-value
GCST002479_6Lupus nephritis in systemic lupus erythematosus5.000000e-07
GCST004639_30Prudent dietary pattern6.000000e-06
GCST006976_71Macular thickness8.000000e-10
GCST009462_28Optic disc size1.000000e-10
GCST009723_25Vertical cup-disc ratio (adjusted for vertical disc diameter)2.000000e-13
GCST009724_36Vertical cup-disc ratio (multi-trait analysis)1.000000e-15
GCST009724_37Vertical cup-disc ratio (multi-trait analysis)1.000000e-14
GCST011769_13Schizophrenia2.000000e-08

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0008111diet measurement
EFO:0006939cup-to-disc ratio measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

8 total (human), top 8 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Adecreases methylation1
terbufosincreases methylation1
CGP 52608affects binding, increases reaction1
Benzo(a)pyrenedecreases methylation1
Fonofosincreases methylation1
Parathionincreases methylation1
Valproic Aciddecreases methylation1
Aflatoxin B1decreases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.