GBX2

gene
On this page

Summary

GBX2 (gastrulation brain homeobox 2, HGNC:4186) is a protein-coding gene on chromosome 2q37.2, encoding Homeobox protein GBX-2 (P52951). May act as a transcription factor for cell pluripotency and differentiation in the embryo.

Enables sequence-specific double-stranded DNA binding activity. Predicted to be involved in regulation of nervous system development and regulation of transcription by RNA polymerase II. Predicted to act upstream of or within several processes, including branching involved in blood vessel morphogenesis; nervous system development; and neural crest cell migration. Predicted to be located in chromatin. Predicted to be active in nucleus.

Source: NCBI Gene 2637 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 50 total
  • MANE Select transcript: NM_001485

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:4186
Approved symbolGBX2
Namegastrulation brain homeobox 2
Location2q37.2
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000168505
Ensembl biotypeprotein_coding
OMIM601135
Entrez2637

Gene structure

Transcript identifiers

Ensembl transcripts: 3 — 2 protein_coding, 1 protein_coding_CDS_not_defined

ENST00000306318, ENST00000465889, ENST00000551105

RefSeq mRNA: 2 — MANE Select: NM_001485 NM_001301687, NM_001485

CCDS: CCDS2515, CCDS77545

Canonical transcript exons

ENST00000306318 — 2 exons

ExonStartEnd
ENSE00001874089236167449236168386
ENSE00001958948236165236236166437

Expression profiles

Bgee: expression breadth broad, 53 present calls, max score 80.39.

FANTOM5 (CAGE): breadth broad, TPM avg 0.5974 / max 37.1410, expressed in 211 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
346570.5974211

Top tissues by expression

256 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099180.39gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047375.80gold quality
tendon of biceps brachiiUBERON:000818866.07gold quality
putamenUBERON:000187465.16gold quality
stromal cell of endometriumCL:000225563.16gold quality
pancreatic ductal cellCL:000207962.86silver quality
caudate nucleusUBERON:000187361.09gold quality
gingival epitheliumUBERON:000194961.00gold quality
gluteal muscleUBERON:000200060.40gold quality
triceps brachiiUBERON:000150960.35gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451160.01gold quality
vena cavaUBERON:000408759.74gold quality
gingivaUBERON:000182858.81gold quality
parotid glandUBERON:000183158.44gold quality
C1 segment of cervical spinal cordUBERON:000646958.34gold quality
hypothalamusUBERON:000189856.77gold quality
spinal cordUBERON:000224056.74gold quality
myocardiumUBERON:000234955.84gold quality
nasal cavity epitheliumUBERON:000538455.65gold quality
medial globus pallidusUBERON:000247755.23gold quality
nucleus accumbensUBERON:000188254.79gold quality
skeletal muscle tissue of biceps brachiiUBERON:000450254.41gold quality
deciduaUBERON:000245054.16gold quality
mucosa of sigmoid colonUBERON:000499354.02gold quality
male germ cellCL:000001552.70gold quality
spermCL:000001952.62gold quality
pericardiumUBERON:000240752.61gold quality
substantia nigraUBERON:000203852.27gold quality
globus pallidusUBERON:000187551.98gold quality
epithelial cell of pancreasCL:000008351.69gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no0.81

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

9 targets.

TargetRegulation
BMP4
EEF1A1
FGF8Unknown
GBX2
HMGB1
IL6Activation
LMO3
OTX2Repression
WNT1Unknown

JASPAR motifs

MotifNameFamily
MA0890.1GBX2HOX
MA0890.2GBX2HOX

JASPAR matrix evidence (PMIDs): PMID:18585359

Upstream regulators (CollecTRI, top): DLX5, GBX2, MYB, PAX2

miRNA regulators (miRDB)

53 targeting GBX2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-5692B100.0071.322622
HSA-MIR-5692C100.0071.322622
HSA-MIR-4795-3P100.0074.624024
HSA-MIR-4673100.0066.641490
HSA-MIR-3924100.0072.092394
HSA-MIR-513B-5P99.9969.962150
HSA-MIR-23B-5P99.9866.07587
HSA-MIR-4645-5P99.9865.811284
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-LET-7C-3P99.9573.422862
HSA-MIR-6845-3P99.9466.881439
HSA-MIR-23A-5P99.9465.39468
HSA-MIR-10527-5P99.9172.283754
HSA-MIR-374A-5P99.9071.342923
HSA-MIR-374B-5P99.9069.982734
HSA-MIR-612499.8769.783551
HSA-MIR-797899.8666.90856
HSA-MIR-221-3P99.8671.561329
HSA-MIR-222-3P99.8671.351337
HSA-MIR-369-3P99.8570.522264
HSA-MIR-4659A-3P99.8072.624248
HSA-MIR-4659B-3P99.8072.624248
HSA-MIR-129999.7771.242389
HSA-MIR-1212999.7267.451311
HSA-MIR-6762-3P99.6666.941188
HSA-MIR-875-3P99.6369.472548
HSA-MIR-449999.6267.291470
HSA-MIR-24-3P99.5969.971934
HSA-MIR-1212399.5271.792990

Literature-anchored findings (GeneRIF, showing 9)

  • Altered expression is associated with therapy failure and death in patients with multiple types of cancer. (PMID:15931389)
  • Gbx2 and Otx2 interact with the WD40 domain of Groucho/Tle corepressors (PMID:17060451)
  • Data established a seven-gene (AR, ESR2, GATA3, GBX2, KRT16, MMP28 and WNT11) prognostic signature to define a subset of triple-negative breast cancer (TNBC). (PMID:23549873)
  • Gastrulation brain homeobox 2 (GBX2) methylation is associated with Gleason score (GS), pathologic stage (pT), and biochemical recurrence (BCR) in prostate cancer (PCa). Among intraductal carcinoma/cribriform architecture (IDC/C)-negative patients, GBX2 methylation improves prediction of BCR. Loss of TET2 in prostate cells results in greater GBX2 methylation. (PMID:31200836)
  • GBX2, as a tumor promoter in lung adenocarcinoma, enhances cells viability, invasion and migration by regulating the AKT/ERK signaling pathway. (PMID:31758726)
  • NCBP3/SNHG6 inhibits GBX2 transcription in a histone modification manner to facilitate the malignant biological behaviour of glioma cells. (PMID:32618493)
  • LncRNA FEZF1-AS1 accelerates the migration and invasion of laryngeal squamous cell carcinoma cells through miR-4497 targeting GBX2. (PMID:33550476)
  • circRERE regulates the expression of GBX2 through miR-1299 and ZC3H13/N(6)-methyladenosine (m(6)A) to promote growth and invasion of hepatocellular carcinoma cells. (PMID:36222159)
  • The accession number for the cDNA listed for the sequence data at the bottom of page 335 on the left side is incorrect. It should read U31468 instead of U31458. (PMID:8838315)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriogbx2ENSDARG00000002933
mus_musculusGbx2ENSMUSG00000034486
rattus_norvegicusGbx2ENSRNOG00000019495
drosophila_melanogasterunpgFBGN0015561

Paralogs (1): GBX1 (ENSG00000164900)

Protein

Protein identifiers

Homeobox protein GBX-2P52951 (reviewed: P52951)

Alternative names: Gastrulation and brain-specific homeobox protein 2

All UniProt accessions (2): P52951, F8VY47

UniProt curated annotations — full annotation on UniProt →

Function. May act as a transcription factor for cell pluripotency and differentiation in the embryo.

Subcellular location. Nucleus.

RefSeq proteins (2): NP_001288616, NP_001476* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001356HDDomain
IPR009057Homeodomain-like_sfHomologous_superfamily
IPR017970Homeobox_CSConserved_site
IPR020479HD_metazoaDomain
IPR042982GBX-1/2Family

Pfam: PF00046

UniProt features (10 total): region of interest 3, compositionally biased region 3, sequence conflict 2, chain 1, DNA-binding region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P52951-F163.070.17

Function

Pathways and Gene Ontology

Reactome pathways

3 pathways

IDPathway
R-HSA-9823739Formation of the anterior neural plate
R-HSA-9832991Formation of the posterior neural plate
R-HSA-9834899Specification of the neural plate border

MSigDB gene sets: 213 (showing top): GOBP_FOREBRAIN_NEURON_DEVELOPMENT, GOBP_MORPHOGENESIS_OF_AN_EPITHELIUM, GOBP_HINDBRAIN_DEVELOPMENT, GOBP_EMBRYO_DEVELOPMENT_ENDING_IN_BIRTH_OR_EGG_HATCHING, GOBP_EPITHELIUM_DEVELOPMENT, GOBP_METENCEPHALON_DEVELOPMENT, BENPORATH_ES_WITH_H3K27ME3, HNF3ALPHA_Q6, TATTATA_MIR374, GOBP_NEUROGENESIS, GOBP_NEURAL_TUBE_DEVELOPMENT, GOBP_REGULATION_OF_NERVOUS_SYSTEM_DEVELOPMENT, GOBP_FOREBRAIN_DEVELOPMENT, GOBP_AMEBOIDAL_TYPE_CELL_MIGRATION, TANG_SENESCENCE_TP53_TARGETS_UP

GO Biological Process (19): branching involved in blood vessel morphogenesis (GO:0001569), neural crest cell migration (GO:0001755), regulation of transcription by RNA polymerase II (GO:0006357), nervous system development (GO:0007399), axon guidance (GO:0007411), cerebellum development (GO:0021549), midbrain-hindbrain boundary morphogenesis (GO:0021555), rhombomere 2 development (GO:0021568), thalamus development (GO:0021794), forebrain neuron development (GO:0021884), cerebellar granule cell precursor proliferation (GO:0021930), inner ear morphogenesis (GO:0042472), autonomic nervous system development (GO:0048483), regulation of nervous system development (GO:0051960), regulation of DNA-templated transcription (GO:0006355), hindbrain development (GO:0030902), midbrain-hindbrain boundary development (GO:0030917), tube morphogenesis (GO:0035239), positive regulation of transcription by RNA polymerase II (GO:0045944)

GO Molecular Function (8): RNA polymerase II transcription regulatory region sequence-specific DNA binding (GO:0000977), RNA polymerase II core promoter sequence-specific DNA binding (GO:0000979), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA-binding transcription activator activity, RNA polymerase II-specific (GO:0001228), DNA-binding transcription factor activity (GO:0003700), RNA polymerase II-specific DNA-binding transcription factor binding (GO:0061629), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677)

GO Cellular Component (2): chromatin (GO:0000785), nucleus (GO:0005634)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
Gastrulation3

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
anatomical structure development4
RNA polymerase II transcription regulatory region sequence-specific DNA binding3
regulation of DNA-templated transcription2
transcription by RNA polymerase II2
system development2
embryonic morphogenesis2
nervous system development2
brain development2
regulation of transcription by RNA polymerase II2
transcription cis-regulatory region binding2
angiogenesis1
blood vessel morphogenesis1
branching morphogenesis of an epithelial tube1
neural crest cell development1
mesenchymal cell migration1
axonogenesis1
neuron projection guidance1
metencephalon development1
midbrain-hindbrain boundary development1
rhombomere development1
diencephalon development1
forebrain neuron differentiation1
central nervous system neuron development1
cell proliferation in external granule layer1
ear morphogenesis1
inner ear development1
regulation of multicellular organismal development1
DNA-templated transcription1
regulation of gene expression1
regulation of RNA biosynthetic process1
rostrocaudal neural tube patterning1
anatomical structure morphogenesis1
tube development1
positive regulation of DNA-templated transcription1
core promoter sequence-specific DNA binding1
chromatin1
DNA-binding transcription factor activity1
DNA-binding transcription factor activity, RNA polymerase II-specific1
DNA-binding transcription activator activity1
positive regulation of transcription by RNA polymerase II1

Protein interactions and networks

STRING

1318 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GBX2WNT1P04628881
GBX2FGF8P55075822
GBX2PAX2Q02962818
GBX2OTX2P32243723
GBX2SHHQ15465695
GBX2WNT3AP56704626
GBX2NEUROG1Q92886618
GBX2AGAP1Q9UPQ3617
GBX2FOXG1P55315602
GBX2ZIC1Q15915566
GBX2OLIG3Q7RTU3560
GBX2IRX3P78415553
GBX2LMX1AQ8TE12553
GBX2SIX3O95343549
GBX2TFCP2L1Q9NZI6547
GBX2EFNA5P52803547

IntAct

2 interactions, top by confidence:

ABTypeScore
GBX2FHL3psi-mi:“MI:0914”(association)0.350

BioGRID (6): CHD4 (Affinity Capture-MS), GBX1 (Affinity Capture-MS), FHL3 (Affinity Capture-MS), GBX2 (Affinity Capture-RNA), GBX2 (Proximity Label-MS), GBX2 (Proximity Label-MS)

ESM2 similar proteins: A0A8V0YY16, A0JPN1, A7MB54, A8MTJ6, O35762, O42115, O57601, O88181, O95096, P09065, P23683, P28356, P31311, P31315, P32443, P39020, P42581, P42586, P43697, P48031, P49640, P50222, P50476, P52951, P52954, P52955, P78426, P81067, P81068, P97334, Q14549, Q14774, Q1KKY1, Q1XID0, Q2NKI2, Q2VL76, Q2VL80, Q4V5A3, Q5SQQ9, Q60554

Diamond homologs: A2RU54, A5PKG8, A5YC49, A6NHT5, A6NMT0, A6YP92, A8XJD0, E7FDX5, F1R2J1, O08686, O35085, O42230, O43711, O43763, O55144, O57601, O70218, O88181, O93366, O93367, O97039, P15142, P15858, P19601, P20009, P20269, P22544, P22807, P26797, P29454, P31314, P42580, P42581, P43345, P43687, P48031, P52950, P52951, P52954, P52955

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

50 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance46
Likely benign1
Benign2

Top pathogenic / likely-pathogenic (0)

SpliceAI

194 predictions. Top by Δscore:

VariantEffectΔscore
2:236166433:CCCGA:Cacceptor_gain1.0000
2:236166434:CCGA:Cacceptor_gain1.0000
2:236166434:CCGAC:Cacceptor_gain1.0000
2:236166435:CGA:Cacceptor_gain1.0000
2:236166435:CGAC:Cacceptor_gain1.0000
2:236166436:GA:Gacceptor_gain1.0000
2:236166437:ACT:Aacceptor_loss1.0000
2:236166438:C:CCacceptor_gain1.0000
2:236166438:C:Tacceptor_loss1.0000
2:236166439:T:Cacceptor_loss1.0000
2:236166445:C:CTacceptor_gain1.0000
2:236167446:CA:Cdonor_loss1.0000
2:236167447:A:ACdonor_gain1.0000
2:236167448:C:CCdonor_gain1.0000
2:236167448:CCGAG:Cdonor_gain1.0000
2:236166446:A:Tacceptor_gain0.9900
2:236167448:CCG:Cdonor_gain0.9900
2:236166451:C:CTacceptor_gain0.9800
2:236166451:C:Tacceptor_gain0.9800
2:236167447:AC:Adonor_gain0.9800
2:236167448:CC:Cdonor_gain0.9800
2:236167448:CCGA:Cdonor_gain0.9800
2:236166436:G:Tacceptor_gain0.9700
2:236166452:A:Tacceptor_gain0.9100
2:236167136:AC:Adonor_gain0.8900
2:236167137:CC:Cdonor_gain0.8900
2:236167442:GAC:Gdonor_loss0.8700
2:236166439:T:Aacceptor_gain0.8600
2:236167131:AACCT:Adonor_loss0.8600
2:236167132:ACCTA:Adonor_loss0.8600

AlphaMissense

2262 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:236165963:C:AR333M1.000
2:236165965:G:CS332R1.000
2:236165965:G:TS332R1.000
2:236165967:T:GS332R1.000
2:236165969:A:TV331D1.000
2:236165975:A:TV329D1.000
2:236165978:G:CP328R1.000
2:236165978:G:TP328H1.000
2:236165979:G:AP328S1.000
2:236165981:A:CI327S1.000
2:236165981:A:GI327T1.000
2:236165981:A:TI327N1.000
2:236165984:G:CP326R1.000
2:236165984:G:TP326H1.000
2:236165985:G:AP326S1.000
2:236165985:G:TP326T1.000
2:236165987:A:GV325A1.000
2:236165987:A:TV325D1.000
2:236165990:A:TV324D1.000
2:236165993:A:CI323S1.000
2:236165993:A:GI323T1.000
2:236165993:A:TI323N1.000
2:236166050:C:GR304P1.000
2:236166052:T:AK303N1.000
2:236166052:T:GK303N1.000
2:236166053:T:AK303I1.000
2:236166054:T:CK303E1.000
2:236166055:C:AW302C1.000
2:236166055:C:GW302C1.000
2:236166056:C:GW302S1.000

dbSNP variants (sampled 300 via entrez): RS1000056972 (2:236161247 G>T), RS1000167521 (2:236164395 C>T), RS1000765688 (2:236164171 C>T), RS1000845033 (2:236164610 C>T), RS1001120963 (2:236170363 C>T), RS1001284865 (2:236168431 C>T), RS1001290705 (2:236167128 T>C), RS1001442332 (2:236161837 C>T), RS1001846896 (2:236163390 G>A), RS1001875347 (2:236161298 G>A), RS1002011537 (2:236165251 C>G,T), RS1002287084 (2:236169331 T>A,G), RS1002294603 (2:236165530 T>C), RS1002517440 (2:236160987 G>A,T), RS1002913809 (2:236161940 G>C)

Disease associations

OMIM: gene MIM:601135 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST004412_3Craniofacial microsomia5.000000e-13
GCST008396_1Educational attainment (years of education)3.000000e-08

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004784self reported educational attainment

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

27 total (human), top 27 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects expression, decreases expression, affects cotreatment, increases expression6
entinostatincreases expression, affects cotreatment2
Panobinostataffects cotreatment, increases expression2
aristolochic acid Iincreases expression1
bisphenol Fincreases expression1
methylmercuric chlorideincreases expression1
propionaldehydeincreases expression1
bisphenol Aincreases expression1
geranioldecreases expression1
trichostatin Aincreases expression1
arseniteincreases methylation1
butyraldehydeincreases expression1
tetrabromobisphenol Aincreases expression1
pentanalincreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideincreases expression, affects cotreatment1
dorsomorphinaffects cotreatment, increases expression1
licochalcone Bincreases expression1
jinfukangdecreases expression1
Resveratrolaffects cotreatment, decreases expression1
Acetaminophendecreases expression1
Air Pollutantsincreases abundance, increases expression1
Aldehydesincreases expression1
Benzo(a)pyreneincreases methylation1
Cisplatindecreases expression1
Estradiolincreases expression1
Plant Extractsaffects cotreatment, decreases expression1
Particulate Matterincreases abundance, increases expression1

Cellosaurus cell lines

3 cell lines: 3 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A2G3SEES3-1V human GBX2, clone1Embryonic stem cellMale
CVCL_A2G4SEES3-1V human GBX2, clone2Embryonic stem cellMale
CVCL_A2G5SEES3-1V human GBX2, clone3Embryonic stem cellMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): craniofacial microsomia