HMGXB4

gene
On this page

Also known as THC211630P53N

Summary

HMGXB4 (HMG-box containing 4, HGNC:5003) is a protein-coding gene on chromosome 22q12.3, encoding HMG domain-containing protein 4 (Q9UGU5). Negatively regulates Wnt/beta-catenin signaling during development.

High mobility group (HMG) proteins are nonhistone chromosomal proteins. See HMG2 (MIM 163906) for additional information on HMG proteins.

Source: NCBI Gene 10042 — RefSeq curated summary.

At a glance

  • GWAS associations: 12
  • Clinical variants (ClinVar): 68 total
  • MANE Select transcript: NM_001003681

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:5003
Approved symbolHMGXB4
NameHMG-box containing 4
Location22q12.3
Locus typegene with protein product
StatusApproved
AliasesTHC211630, P53N
Ensembl geneENSG00000100281
Ensembl biotypeprotein_coding
OMIM604702
Entrez10042

Gene structure

Transcript identifiers

Ensembl transcripts: 11 — 6 protein_coding, 3 retained_intron, 1 nonsense_mediated_decay, 1 protein_coding_CDS_not_defined

ENST00000216106, ENST00000418170, ENST00000420166, ENST00000455359, ENST00000464480, ENST00000466438, ENST00000498212, ENST00000498325, ENST00000912955, ENST00000912956, ENST00000943161

RefSeq mRNA: 2 — MANE Select: NM_001003681 NM_001003681, NM_001362972

CCDS: CCDS33641

Canonical transcript exons

ENST00000216106 — 11 exons

ExonStartEnd
ENSE000019092643529360735295807
ENSE000019565593525749335257557
ENSE000034793313528396235284043
ENSE000034949503529299235293114
ENSE000035050553528823835288407
ENSE000035677153526464835265603
ENSE000035797833526307835263226
ENSE000035828753526379635263874
ENSE000036282603528734735287452
ENSE000036445393528599735286061
ENSE000036513633526232335262421

Expression profiles

Bgee: expression breadth ubiquitous, 294 present calls, max score 95.57.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 25.0337 / max 484.7332, expressed in 1801 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
19195525.03371801

Top tissues by expression

298 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
oocyteCL:000002395.57gold quality
secondary oocyteCL:000065595.23gold quality
buccal mucosa cellCL:000233693.20gold quality
spermCL:000001991.78gold quality
sural nerveUBERON:001548891.43gold quality
male germ cellCL:000001590.66gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047389.60gold quality
adrenal tissueUBERON:001830388.41gold quality
ventricular zoneUBERON:000305388.12gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099187.20gold quality
placentaUBERON:000198786.67gold quality
tibiaUBERON:000097986.62gold quality
bone marrowUBERON:000237186.56gold quality
urethraUBERON:000005786.50gold quality
nippleUBERON:000203086.26gold quality
embryoUBERON:000092286.09gold quality
synovial jointUBERON:000221786.08gold quality
cartilage tissueUBERON:000241885.66gold quality
cranial nerve IIUBERON:000094185.58gold quality
trabecular bone tissueUBERON:000248385.34gold quality
islet of LangerhansUBERON:000000685.31gold quality
cortical plateUBERON:000534385.30gold quality
penisUBERON:000098985.27gold quality
bone marrow cellCL:000209285.26gold quality
calcaneal tendonUBERON:000370185.18gold quality
germinal epithelium of ovaryUBERON:000130485.10gold quality
ganglionic eminenceUBERON:000402385.04gold quality
adult organismUBERON:000702384.98gold quality
mucosa of paranasal sinusUBERON:000503084.95gold quality
stromal cell of endometriumCL:000225584.67gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.24

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

4 targets.

TargetRegulation
ACTA2Repression
CNN1Repression
MYH11Repression
TAGLNRepression

Upstream regulators (CollecTRI, top): APOE, SRF

Literature-anchored findings (GeneRIF, showing 4)

  • high-mobility group protein 2-like 1 and SNP mapping of a psychotic bipolar affective disorder linkage region on 22q12.3 (PMID:17671966)
  • the first evidence of this novel HMG2L1 molecule playing an important role in attenuating smooth muscle differentiation (PMID:20511232)
  • These results highlight a previously un-reported role of HMGXB4 in the hematopoietic system and transcriptional imbalances of HMGXB4 could contribute to the aberrant expansion of the megakaryocytic lineage in primary myelofibrosis patients (PMID:26547506)
  • Analysis of the P(53N) a Novel Protein Encoded on Chromosome 22q12.1-12.3 in Glioblastomas and Ependymomas Specimens. (PMID:33595778)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriohmgxb4aENSDARG00000075783
mus_musculusHmgxb4ENSMUSG00000034518
rattus_norvegicusHmgxb4ENSRNOG00000013878
caenorhabditis_eleganshmg-3WBGENE00001973
caenorhabditis_elegansWBGENE00001974

Paralogs (20): HMGB3 (ENSG00000029993), HMG20B (ENSG00000064961), SP100 (ENSG00000067066), SMARCE1 (ENSG00000073584), SP140 (ENSG00000079263), TOX4 (ENSG00000092203), TOX3 (ENSG00000103460), TFAM (ENSG00000108064), UBTF (ENSG00000108312), HMGB1P1 (ENSG00000124097), TOX2 (ENSG00000124191), SP110 (ENSG00000135899), HMG20A (ENSG00000140382), SSRP1 (ENSG00000149136), HMGB2 (ENSG00000164104), HMGB4 (ENSG00000176256), SP140L (ENSG00000185404), HMGB1 (ENSG00000189403), TOX (ENSG00000198846), UBTFL1 (ENSG00000255009)

Protein

Protein identifiers

HMG domain-containing protein 4Q9UGU5 (reviewed: Q9UGU5)

Alternative names: HMG box-containing protein 4, High mobility group protein 2-like 1, Protein HMGBCG

All UniProt accessions (4): Q9UGU5, B0QXZ8, B0QXZ9, F8WDU7

UniProt curated annotations — full annotation on UniProt →

Function. Negatively regulates Wnt/beta-catenin signaling during development.

Subcellular location. Nucleus.

RefSeq proteins (2): NP_001003681, NP_001349901 (=MANE)

Domains & families (InterPro)

IDNameType
IPR009071HMG_box_domDomain
IPR025228DUF4171Domain
IPR036910HMG_box_dom_sfHomologous_superfamily
IPR042477HMGXB4Family
IPR048016HMGXB4_HMG-boxDomain

Pfam: PF00505, PF13775

UniProt features (18 total): modified residue 5, compositionally biased region 5, cross-link 2, region of interest 2, chain 1, DNA-binding region 1, sequence variant 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9UGU5-F156.150.09

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (7): 197, 497, 502, 512, 8, 191, 102

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 201 (showing top): E2F_Q4, TAATAAT_MIR126, GOBP_LYSOSOMAL_TRANSPORT, E2F4DP1_01, GOBP_ENDOSOME_TO_LYSOSOME_TRANSPORT, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_DN, GRAESSMANN_RESPONSE_TO_MC_AND_DOXORUBICIN_DN, GOBP_VACUOLAR_TRANSPORT, GOBP_VESICLE_MEDIATED_TRANSPORT, GOBP_REGULATION_OF_WNT_SIGNALING_PATHWAY, YY1_Q6, E2F1DP1_01, E2F1DP2_01, YY1_02, BLALOCK_ALZHEIMERS_DISEASE_UP

GO Biological Process (3): endosome to lysosome transport (GO:0008333), Wnt signaling pathway (GO:0016055), negative regulation of Wnt signaling pathway (GO:0030178)

GO Molecular Function (3): DNA binding (GO:0003677), identical protein binding (GO:0042802), protein binding (GO:0005515)

GO Cellular Component (2): NURF complex (GO:0016589), nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
lysosomal transport1
intercellular transport1
vesicle-mediated transport1
cell surface receptor signaling pathway1
negative regulation of signal transduction1
Wnt signaling pathway1
regulation of Wnt signaling pathway1
nucleic acid binding1
protein binding1
binding1
ISWI-type complex1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

978 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
HMGXB4TOM1O60784924
HMGXB4MCM5P33992862
HMGXB4HMOX2P30519547
HMGXB4CCDC30Q5VVM6535
HMGXB4FOXRED2Q8IWF2490
HMGXB4AHSPQ9NZD4480
HMGXB4BPGMP07738475
HMGXB4TENT5CQ5VWP2468
HMGXB4WDR64B1ANS9458
HMGXB4IFIT1BQ5T764440
HMGXB4ZFYVE16Q7Z3T8438
HMGXB4GYPBP06028436
HMGXB4CELSR3Q9NYQ7429
HMGXB4HGSO14964428
HMGXB4HMOX1P09601425

IntAct

91 interactions, top by confidence:

ABTypeScore
KIFAP3KIF3Bpsi-mi:“MI:0914”(association)0.900
RBBP4CDK2AP1psi-mi:“MI:0914”(association)0.790
KPNA1TCERG1psi-mi:“MI:0914”(association)0.640
DPY30AKAP8psi-mi:“MI:0914”(association)0.610
ENKD1HMGXB4psi-mi:“MI:0915”(physical association)0.560
CCHCR1HMGXB4psi-mi:“MI:0915”(physical association)0.560
STAC3HMGXB4psi-mi:“MI:0915”(physical association)0.560
ZCCHC10HMGXB4psi-mi:“MI:0915”(physical association)0.560
TSPYL2HMGXB4psi-mi:“MI:0915”(physical association)0.560
INO80BHMGXB4psi-mi:“MI:0915”(physical association)0.560
AKAP17AHMGXB4psi-mi:“MI:0915”(physical association)0.560
HMGXB4HMGXB4psi-mi:“MI:0915”(physical association)0.560
LENG8HMGXB4psi-mi:“MI:0915”(physical association)0.560
NKAPD1HMGXB4psi-mi:“MI:0915”(physical association)0.560
PIAS2HMGXB4psi-mi:“MI:0915”(physical association)0.560
STRBPHMGXB4psi-mi:“MI:0915”(physical association)0.560
UBL5HMGXB4psi-mi:“MI:0915”(physical association)0.560
UBE2IHMGXB4psi-mi:“MI:0915”(physical association)0.560
TENT5BHMGXB4psi-mi:“MI:0915”(physical association)0.560
SMARCA5RBBP4psi-mi:“MI:0914”(association)0.530
RBBP4TNRC18psi-mi:“MI:0914”(association)0.530
BACC1SMARCA1psi-mi:“MI:0914”(association)0.530
BACC1SMARCA5psi-mi:“MI:0914”(association)0.530
H3C1SMCHD1psi-mi:“MI:2364”(proximity)0.410
HMGXB4HSP90B1psi-mi:“MI:0915”(physical association)0.400
HMGXB4NR1H2psi-mi:“MI:0915”(physical association)0.370

BioGRID (136): HMGXB4 (Protein-peptide), HMGXB4 (Affinity Capture-MS), HMGXB4 (Two-hybrid), HMGXB4 (Biochemical Activity), RAD51 (Affinity Capture-MS), RRAS (Affinity Capture-MS), TBCA (Affinity Capture-MS), RABGAP1L (Affinity Capture-MS), HMGXB4 (Affinity Capture-MS), HMGXB4 (Affinity Capture-MS), HMGXB4 (Affinity Capture-MS), HMGXB4 (Affinity Capture-MS), HMGXB4 (Affinity Capture-MS), HMGXB4 (Affinity Capture-MS), ARAP1 (Affinity Capture-MS)

ESM2 similar proteins: A0A087WPF7, A0A0R4IBL7, O09000, O54972, O70305, O75081, O75376, P15806, P15881, P15884, P15923, P21677, P30985, P51514, P98180, Q05AQ8, Q14157, Q14687, Q1LY51, Q2VPM4, Q3U3C9, Q4KKX4, Q4VCS5, Q566L4, Q5F3B1, Q5SFM8, Q5T6F2, Q60722, Q60974, Q61286, Q62655, Q6DIH5, Q7ZWN6, Q7ZXS3, Q80X50, Q86YP4, Q8BZ47, Q8CHY6, Q8IXK0, Q8VHG2

Diamond homologs: A9RA84, B0CM99, B1MTB0, B2RPK0, O01683, O04235, O15347, O15405, O49595, O49596, O49597, O54879, O64702, O94842, O94900, P07156, P07746, P09429, P0CO24, P0CO25, P10103, P11632, P11633, P11873, P12682, P17741, P23497, P26583, P26584, P26585, P26586, P27347, P30681, P40618, P40619, P40620, P40621, P40622, P40623, P40632

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 79 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
NuRD complex assembly514.7×2e-03
Activation of anterior HOX genes in hindbrain development during early embryogenesis611.4×2e-03
CHD1 and CHD2 subfamily511.3×5e-03
mRNA Polyadenylation59.2×9e-03
mRNA Splicing - Major Pathway66.8×9e-03
Dengue Virus-Host Interactions76.7×5e-03

GO biological processes:

GO termPartnersFoldFDR
nucleosome assembly612.4×1e-03
mRNA processing89.3×9e-04
RNA splicing79.1×1e-03
chromatin remodeling88.6×9e-04

Disease & clinical

Clinical variants and AI predictions

ClinVar

68 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance57
Likely benign0
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

1586 predictions. Top by Δscore:

VariantEffectΔscore
22:35257741:G:GTdonor_gain1.0000
22:35263057:A:AGacceptor_gain1.0000
22:35263058:A:Gacceptor_gain1.0000
22:35263060:T:Gacceptor_gain1.0000
22:35263061:A:AGacceptor_gain1.0000
22:35263062:A:Gacceptor_gain1.0000
22:35263062:AACCT:Aacceptor_gain1.0000
22:35263063:A:AGacceptor_gain1.0000
22:35263063:ACCT:Aacceptor_gain1.0000
22:35263064:C:Gacceptor_gain1.0000
22:35263066:T:Aacceptor_gain1.0000
22:35263068:T:TAacceptor_gain1.0000
22:35263073:CTCA:Cacceptor_loss1.0000
22:35263074:TCA:Tacceptor_loss1.0000
22:35263075:CAGA:Cacceptor_loss1.0000
22:35263076:A:AGacceptor_gain1.0000
22:35263076:A:Cacceptor_loss1.0000
22:35263077:G:GGacceptor_gain1.0000
22:35263077:G:GTacceptor_loss1.0000
22:35263223:GAAG:Gdonor_gain1.0000
22:35263227:G:GGdonor_gain1.0000
22:35263228:T:Adonor_loss1.0000
22:35263788:A:AGacceptor_gain1.0000
22:35263789:A:Gacceptor_gain1.0000
22:35263791:TATA:Tacceptor_loss1.0000
22:35263792:ATAG:Aacceptor_gain1.0000
22:35263793:TA:Tacceptor_loss1.0000
22:35263794:A:AGacceptor_gain1.0000
22:35263794:AG:Aacceptor_gain1.0000
22:35263795:G:Aacceptor_loss1.0000

AlphaMissense

3899 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
22:35283986:T:AY414N1.000
22:35283986:T:CY414H1.000
22:35283986:T:GY414D1.000
22:35283987:A:GY414C1.000
22:35283995:T:CF417L1.000
22:35283996:T:CF417S1.000
22:35283996:T:GF417C1.000
22:35283997:C:AF417L1.000
22:35283997:C:GF417L1.000
22:35284010:C:AR422S1.000
22:35285999:T:CF434L1.000
22:35286000:T:GF434C1.000
22:35286001:T:AF434L1.000
22:35286001:T:GF434L1.000
22:35286009:T:CL437P1.000
22:35286021:T:AL441Q1.000
22:35286021:T:CL441P1.000
22:35286024:C:AA442D1.000
22:35286032:T:AW445R1.000
22:35286032:T:CW445R1.000
22:35286033:G:CW445S1.000
22:35286034:G:CW445C1.000
22:35286034:G:TW445C1.000
22:35286058:A:CK453N1.000
22:35286058:A:TK453N1.000
22:35287350:T:AW456R1.000
22:35287350:T:CW456R1.000
22:35287351:G:CW456S1.000
22:35287352:G:CW456C1.000
22:35287352:G:TW456C1.000

dbSNP variants (sampled 300 via entrez): RS1000008720 (22:35241667 T>A), RS1000050365 (22:35253788 G>A), RS1000117191 (22:35282452 C>G), RS1000184578 (22:35283895 G>T), RS1000205825 (22:35288781 G>A), RS1000365778 (22:35259816 T>G), RS1000403889 (22:35244355 G>A), RS1000456865 (22:35250798 CCTCT>C,CCT), RS1000493889 (22:35261371 G>A), RS1000545773 (22:35261682 TTGG>T), RS1000643095 (22:35258334 C>G,T), RS1000716029 (22:35259537 G>A), RS1000725789 (22:35257067 C>T), RS1000775006 (22:35294245 C>T), RS1000787231 (22:35248644 A>G)

Disease associations

OMIM: gene MIM:604702 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

12 associations (top):

StudyTraitp-value
GCST002782_43Waist-to-hip ratio adjusted for body mass index5.000000e-06
GCST002782_44Waist-to-hip ratio adjusted for body mass index8.000000e-06
GCST004067_204Hip circumference adjusted for BMI2.000000e-09
GCST004067_7Hip circumference adjusted for BMI4.000000e-08
GCST005956_75Waist-to-hip ratio adjusted for BMI1.000000e-07
GCST005958_13Waist-to-hip ratio adjusted for BMI (age >50)1.000000e-06
GCST005962_33Waist-to-hip ratio adjusted for BMI x sex x age interaction (4df test)6.000000e-06
GCST012490_136Femur bone mineral density x serum urate levels interaction2.000000e-08
GCST012490_351Femur bone mineral density x serum urate levels interaction2.000000e-13
GCST90002402_645Platelet count4.000000e-10
GCST90020026_215Hip index4.000000e-09
GCST90020028_1547Hip circumference adjusted for BMI2.000000e-13

EFO canonical traits (6, from GWAS)

EFO IDTrait name
EFO:0007788BMI-adjusted waist-hip ratio
EFO:0008039BMI-adjusted hip circumference
EFO:0008007age at assessment
EFO:0008343sex interaction measurement
EFO:0004531urate measurement
EFO:0004309platelet count

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

33 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidincreases expression, affects expression, decreases expression, decreases methylation, affects cotreatment5
Air Pollutantsaffects cotreatment, decreases expression, increases abundance2
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxidedecreases expression, increases expression2
Cadmium Chloridedecreases expression, increases expression2
aristolochic acid Idecreases expression1
3-((6-(2-methoxyphenyl)pyrimidin-4-yl)amino)phenyl)methane sulfonamidedecreases expression1
GSK-J4increases expression1
FR900359affects phosphorylation1
TAK-243increases sumoylation1
alpha-pineneaffects cotreatment, decreases expression, increases abundance1
geranioldecreases expression1
coumarindecreases phosphorylation1
methacrylaldehydeaffects cotreatment, decreases expression, increases abundance1
di-n-butylphosphoric acidaffects expression1
CGP 52608affects binding, increases reaction1
Leflunomideincreases expression1
Acroleinincreases abundance, affects cotreatment, decreases expression1
Amiodaroneincreases expression1
Benzo(a)pyreneaffects methylation1
Caffeineaffects phosphorylation1
Dichlorodiphenyl Dichloroethyleneincreases expression1
Doxorubicindecreases expression1
Formaldehydedecreases expression1
Hydralazineaffects cotreatment, increases expression1
Ozoneaffects cotreatment, decreases expression, increases abundance1
Seleniumaffects cotreatment, decreases expression1
Thiramincreases expression1
Tretinoindecreases expression1
Vitamin Eaffects cotreatment, decreases expression1
Cyclosporineincreases expression1

Cellosaurus cell lines

3 cell lines: 3 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A2S0SEES3-1V human HMG2L1, clone1Embryonic stem cellMale
CVCL_A2S1SEES3-1V human HMG2L1, clone2Embryonic stem cellMale
CVCL_A2S2SEES3-1V human HMG2L1, clone3Embryonic stem cellMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.