TOX2

gene
On this page

Also known as dJ1108D11.2GCX-1

Summary

TOX2 (TOX high mobility group box family member 2, HGNC:16095) is a protein-coding gene on chromosome 20q13.12, encoding TOX high mobility group box family member 2 (Q96NM4). Putative transcriptional activator involved in the hypothalamo-pituitary-gonadal system.

Enables transcription coactivator activity. Involved in positive regulation of transcription by RNA polymerase II. Located in nucleoplasm.

Source: NCBI Gene 84969 — RefSeq curated summary.

At a glance

  • GWAS associations: 16
  • Clinical variants (ClinVar): 76 total
  • MANE Select transcript: NM_001098797

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:16095
Approved symbolTOX2
NameTOX high mobility group box family member 2
Location20q13.12
Locus typegene with protein product
StatusApproved
AliasesdJ1108D11.2, GCX-1
Ensembl geneENSG00000124191
Ensembl biotypeprotein_coding
OMIM611163
Entrez84969

Gene structure

Transcript identifiers

Ensembl transcripts: 10 — 9 protein_coding, 1 protein_coding_CDS_not_defined

ENST00000341197, ENST00000358131, ENST00000372999, ENST00000413823, ENST00000423191, ENST00000435864, ENST00000864665, ENST00000864666, ENST00000956506, ENST00000956507

RefSeq mRNA: 4 — MANE Select: NM_001098797 NM_001098796, NM_001098797, NM_001098798, NM_032883

CCDS: CCDS13324, CCDS42875, CCDS46603

Canonical transcript exons

ENST00000341197 — 9 exons

ExonStartEnd
ENSE000016729824400654744006792
ENSE000017713844391485243914990
ENSE000034756774406865044069616
ENSE000036076134397336743973432
ENSE000037034214406571244066107
ENSE000037043954406477744064857
ENSE000037071554406673044066857
ENSE000037088334405429944054526
ENSE000037102724405130644051545

Expression profiles

Bgee: expression breadth ubiquitous, 196 present calls, max score 96.44.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 9.7935 / max 302.7338, expressed in 1283 samples.

FANTOM5 promoters (8 alternative TSS)

Promoter IDTPM avgSamples expressed
1846858.53991265
1846910.489692
1846880.295274
1846920.2702142
1846870.163476
1846900.01504
1846890.01124
1846930.00902

Top tissues by expression

249 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
secondary oocyteCL:000065596.44gold quality
thymusUBERON:000237094.91gold quality
mucosa of stomachUBERON:000119994.10gold quality
right lungUBERON:000216793.54gold quality
oocyteCL:000002392.61gold quality
lower lobe of lungUBERON:000894991.56gold quality
upper lobe of lungUBERON:000894888.51gold quality
upper lobe of left lungUBERON:000895288.28gold quality
lower esophagus muscularis layerUBERON:003583387.99gold quality
esophagogastric junction muscularis propriaUBERON:003584187.93gold quality
lower esophagusUBERON:001347387.91gold quality
hypothalamusUBERON:000189887.40gold quality
lower esophagus mucosaUBERON:003583486.01gold quality
descending thoracic aortaUBERON:000234585.86gold quality
pituitary glandUBERON:000000785.26gold quality
adenohypophysisUBERON:000219684.51gold quality
lungUBERON:000204884.03gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047383.92gold quality
right frontal lobeUBERON:000281083.51gold quality
prefrontal cortexUBERON:000045183.33gold quality
thoracic aortaUBERON:000151582.79gold quality
anterior cingulate cortexUBERON:000983582.67gold quality
Brodmann (1909) area 9UBERON:001354082.44gold quality
ascending aortaUBERON:000149682.42gold quality
substantia nigraUBERON:000203882.18gold quality
esophagusUBERON:000104382.01gold quality
left testisUBERON:000453381.97gold quality
frontal cortexUBERON:000187081.57gold quality
dorsolateral prefrontal cortexUBERON:000983481.41gold quality
postcentral gyrusUBERON:000258181.22gold quality

Single-cell (SCXA)

Detected in 7 experiment(s), a significant marker in 4.

ExperimentMarker?Max mean expression
E-MTAB-8410yes13.28
E-CURD-122yes10.63
E-ANND-3yes6.74
E-MTAB-6678yes4.88
E-MTAB-6075no557.67
E-MTAB-8060no47.16
E-CURD-112no3.25

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

3 targets.

TargetRegulation
ETS1Activation
IL15Activation
TBX21Activation

miRNA regulators (miRDB)

61 targeting TOX2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-340-5P100.0072.504437
HSA-MIR-4455100.0065.481587
HSA-MIR-4795-3P100.0074.624024
HSA-MIR-3163100.0077.238605
HSA-MIR-366299.9973.825684
HSA-MIR-6772-5P99.9467.01577
HSA-MIR-651-3P99.9473.485177
HSA-MIR-449699.8868.892236
HSA-MIR-449299.8768.253611
HSA-MIR-5582-3P99.8672.484221
HSA-MIR-3121-3P99.8271.963630
HSA-MIR-60999.8264.26505
HSA-MIR-57799.7869.132479
HSA-MIR-2681-5P99.7567.641655
HSA-MIR-6752-3P99.7266.711587
HSA-MIR-3679-3P99.6469.881599
HSA-MIR-9851-3P99.6369.681110
HSA-MIR-76299.5866.611994
HSA-MIR-65799.4866.02848
HSA-MIR-142-5P99.4870.922416
HSA-MIR-5590-3P99.4870.912429
HSA-MIR-449899.4767.422360
HSA-MIR-208A-5P99.4270.831913
HSA-MIR-208B-5P99.4270.831952
HSA-MIR-568399.3668.592083
HSA-MIR-593-5P99.3469.50965
HSA-MIR-431199.3170.473041
HSA-MIR-6799-5P99.1465.722093
HSA-MIR-5001-5P99.0566.761972
HSA-MIR-6829-5P98.8665.121480

Literature-anchored findings (GeneRIF, showing 7)

  • Identification and functional analysis of the rat Gcx1 ortholog. (PMID:14764631)
  • A novel aberrantly hypermethylated CpG island in cancer was discovered within the TOX2 promoter. TOX2 was unmethylated in normal cells. Expression of two novel TOX2 transcripts was significantly reduced in primary lung tumors. (PMID:22496870)
  • TOX2 plays a crucial role in controlling normal NK cell development by acting upstream of TBX21 transcriptional regulation (PMID:25352127)
  • A haplotype block across a 24-kb region within the TOX2 gene reached genome-wide significance in haplotype-block-based regional heritability mapping. Single-SNP- and haplotype-based association tests demonstrated that five of nine genotyped SNPs and two haplotypes within this block were significantly associated with major depressive disorder. (PMID:28153336)
  • CD4+ T cells in classical Hodgkin lymphoma express exhaustion associated transcription factors TOX and TOX2: Characterizing CD4+ T cells in Hodgkin lymphoma. (PMID:35111387)
  • TOX2 coordinates with TET2 to positively regulate central memory differentiation in human CAR T cells. (PMID:37467321)
  • TOX2 nuclear-cytosol translocation is linked to leukemogenesis of acute T-cell leukemia by repressing TIM3 transcription. (PMID:39080376)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriotox2ENSDARG00000100055
mus_musculusTox2ENSMUSG00000074607
rattus_norvegicusTox2ENSRNOG00000008146
caenorhabditis_eleganshmg-3WBGENE00001973
caenorhabditis_elegansWBGENE00001974

Paralogs (20): HMGB3 (ENSG00000029993), HMG20B (ENSG00000064961), SP100 (ENSG00000067066), SMARCE1 (ENSG00000073584), SP140 (ENSG00000079263), TOX4 (ENSG00000092203), HMGXB4 (ENSG00000100281), TOX3 (ENSG00000103460), TFAM (ENSG00000108064), UBTF (ENSG00000108312), HMGB1P1 (ENSG00000124097), SP110 (ENSG00000135899), HMG20A (ENSG00000140382), SSRP1 (ENSG00000149136), HMGB2 (ENSG00000164104), HMGB4 (ENSG00000176256), SP140L (ENSG00000185404), HMGB1 (ENSG00000189403), TOX (ENSG00000198846), UBTFL1 (ENSG00000255009)

Protein

Protein identifiers

TOX high mobility group box family member 2Q96NM4 (reviewed: Q96NM4)

Alternative names: Granulosa cell HMG box protein 1

All UniProt accessions (2): Q96NM4, Q5JYD4

UniProt curated annotations — full annotation on UniProt →

Function. Putative transcriptional activator involved in the hypothalamo-pituitary-gonadal system.

Subcellular location. Nucleus.

Isoforms (4)

UniProt IDNamesCanonical?
Q96NM4-11yes
Q96NM4-22
Q96NM4-33
Q96NM4-44

RefSeq proteins (4): NP_001092266, NP_001092267, NP_001092268, NP_116272 (=MANE)

Domains & families (InterPro)

IDNameType
IPR009071HMG_box_domDomain
IPR036910HMG_box_dom_sfHomologous_superfamily
IPR051365TOX_HMG-box_domainFamily

Pfam: PF00505

UniProt features (18 total): compositionally biased region 5, region of interest 4, splice variant 3, sequence conflict 2, chain 1, DNA-binding region 1, sequence variant 1, short sequence motif 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q96NM4-F151.700.06

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 159 (showing top): RODRIGUES_THYROID_CARCINOMA_ANAPLASTIC_UP, RODRIGUES_THYROID_CARCINOMA_POORLY_DIFFERENTIATED_UP, AREB6_03, LINDGREN_BLADDER_CANCER_CLUSTER_2A_DN, ZHAN_MULTIPLE_MYELOMA_CD1_UP, KOYAMA_SEMA3B_TARGETS_UP, AACTTT_UNKNOWN, MYB_Q3, CHARAFE_BREAST_CANCER_LUMINAL_VS_BASAL_DN, TGGNNNNNNKCCAR_UNKNOWN, HAN_SATB1_TARGETS_DN, MODULE_48, GRADE_COLON_AND_RECTAL_CANCER_DN, MODULE_95, RIGGI_EWING_SARCOMA_PROGENITOR_UP

GO Biological Process (2): regulation of transcription by RNA polymerase II (GO:0006357), positive regulation of transcription by RNA polymerase II (GO:0045944)

GO Molecular Function (4): transcription coactivator activity (GO:0003713), chromatin DNA binding (GO:0031490), DNA binding (GO:0003677), protein binding (GO:0005515)

GO Cellular Component (2): nucleus (GO:0005634), nucleoplasm (GO:0005654)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
transcription by RNA polymerase II2
positive regulation of DNA-templated transcription2
regulation of DNA-templated transcription1
regulation of transcription by RNA polymerase II1
transcription coregulator activity1
DNA binding1
chromatin binding1
nucleic acid binding1
binding1
intracellular membrane-bounded organelle1
nuclear lumen1
cellular anatomical structure1

Protein interactions and networks

STRING

1176 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
TOX2ZNF461Q8TAF7923
TOX2PHGDHO43175686
TOX2LGALS4P56470542
TOX2TCF7P36402491
TOX2TBX21Q9UL17417
TOX2BATFQ16520404
TOX2DENND2BP78523403
TOX2TIGITQ495A1401
TOX2PDCD1Q15116400
TOX2EOMESO95936399
TOX2HAVCR2Q8TDQ0397
TOX2LAG3P18627368
TOX2PRDM1O75626353
TOX2SLC1A2P43004352
TOX2SLC32A1Q9H598349
TOX2GNG3P29798349

IntAct

7 interactions, top by confidence:

ABTypeScore
TOX2MFHAS1psi-mi:“MI:0407”(direct interaction)0.440
PKMTOX2psi-mi:“MI:0217”(phosphorylation reaction)0.440
S100A6VWA8psi-mi:“MI:0914”(association)0.350
TOX2TOX4psi-mi:“MI:0914”(association)0.350
TSEN2TMED8psi-mi:“MI:0914”(association)0.350

BioGRID (38): TOX (Affinity Capture-MS), TOX4 (Affinity Capture-MS), AKT3 (Affinity Capture-MS), GINS4 (Affinity Capture-MS), TOX2 (Reconstituted Complex), TOX2 (Two-hybrid), TOX2 (Two-hybrid), TOX2 (Two-hybrid), TOX2 (Two-hybrid), TOX2 (Two-hybrid), TOX2 (Two-hybrid), TOX2 (Two-hybrid), TOX2 (Two-hybrid), UNC5CL (Two-hybrid), P4HA3 (Two-hybrid)

ESM2 similar proteins: A1A5P0, A1XSY8, A1YF15, A1YG91, A2AFE9, A2D4Z7, A2T762, A5PJK7, A8WFF7, C0LZJ1, O36399, O43474, O75956, P03413, P08651, P09414, P17923, P21999, P22893, P26633, P26651, P41162, P46153, P47973, P97489, Q01196, Q02780, Q03347, Q08427, Q08775, Q12857, Q13761, Q13950, Q2HRB6, Q58CN7, Q60793, Q61169, Q63046, Q64131, Q76IQ7

Diamond homologs: A4QNP0, B2RPK0, B7SBD2, O15347, O15405, O54879, O94842, O94900, P0CO24, P0CO25, P11632, P11633, P11873, P12682, P40618, Q0P5K4, Q32L31, Q4IQX3, Q4PBZ9, Q4WY33, Q5B995, Q5R6A9, Q66JW3, Q6BRB4, Q6CC79, Q6CVH3, Q6DJL0, Q6IRR0, Q75B82, Q76IQ7, Q7S045, Q80W03, Q8BU11, Q96NM4, Q99PM1, Q9UVL1, Q9YH06, P07746, A9RA84, B0CM99

SIGNOR signaling

1 interactions.

AEffectBMechanism
TOX2“up-regulates quantity by expression”TBX21“transcriptional regulation”

Disease & clinical

Clinical variants and AI predictions

ClinVar

76 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance70
Likely benign3
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

2004 predictions. Top by Δscore:

VariantEffectΔscore
20:43973365:A:AGacceptor_gain1.0000
20:43973365:AGTTT:Aacceptor_gain1.0000
20:43973366:G:GGacceptor_gain1.0000
20:43973366:GTTTG:Gacceptor_gain1.0000
20:44006536:T:Aacceptor_gain1.0000
20:44006539:T:Aacceptor_gain1.0000
20:44006543:TTAG:Tacceptor_loss1.0000
20:44006544:TA:Tacceptor_loss1.0000
20:44006545:A:AGacceptor_gain1.0000
20:44006545:AG:Aacceptor_loss1.0000
20:44006546:G:GTacceptor_gain1.0000
20:44006546:GA:Gacceptor_gain1.0000
20:44006546:GAC:Gacceptor_gain1.0000
20:44006546:GACC:Gacceptor_gain1.0000
20:44006546:GACCT:Gacceptor_gain1.0000
20:44051304:A:AGacceptor_gain1.0000
20:44051305:G:GGacceptor_gain1.0000
20:44051305:GA:Gacceptor_gain1.0000
20:44051519:G:GTdonor_gain1.0000
20:44051531:G:GTdonor_gain1.0000
20:44051545:GGT:Gdonor_loss1.0000
20:44051546:G:GCdonor_loss1.0000
20:44051547:T:Gdonor_loss1.0000
20:44054523:GCAG:Gdonor_gain1.0000
20:44054524:CAGGT:Cdonor_loss1.0000
20:44054525:AGGTG:Adonor_loss1.0000
20:44054527:GT:Gdonor_loss1.0000
20:44054528:T:Adonor_loss1.0000
20:43957142:A:Gdonor_gain0.9900
20:43973366:GTTT:Gacceptor_gain0.9900

AlphaMissense

3289 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:44054404:T:CY262H1.000
20:44054404:T:GY262D1.000
20:44054405:A:GY262C1.000
20:44054411:T:CL264P1.000
20:44054413:T:CF265L1.000
20:44054414:T:CF265S1.000
20:44054414:T:GF265C1.000
20:44054415:C:AF265L1.000
20:44054415:C:GF265L1.000
20:44054416:T:AF266I1.000
20:44054416:T:CF266L1.000
20:44054416:T:GF266V1.000
20:44054417:T:CF266S1.000
20:44054417:T:GF266C1.000
20:44054418:C:AF266L1.000
20:44054418:C:GF266L1.000
20:44054420:G:CR267T1.000
20:44054420:G:TR267I1.000
20:44054421:A:CR267S1.000
20:44054421:A:TR267S1.000
20:44054430:G:CQ270H1.000
20:44054430:G:TQ270H1.000
20:44054438:T:AI273N1.000
20:44054438:T:CI273T1.000
20:44054438:T:GI273S1.000
20:44054440:A:GK274E1.000
20:44054442:G:CK274N1.000
20:44054442:G:TK274N1.000
20:44054464:T:CF282L1.000
20:44054465:T:CF282S1.000

dbSNP variants (sampled 300 via entrez): RS1000015465 (20:44061294 C>G,T), RS1000025640 (20:44067568 G>A), RS1000085334 (20:44060996 C>A), RS1000091428 (20:43914711 C>A,T), RS1000119553 (20:43970374 G>T), RS1000129791 (20:43983177 G>A), RS1000133547 (20:43945275 T>C), RS1000136583 (20:44022488 G>A), RS1000137066 (20:43993145 T>C), RS1000159573 (20:43926639 G>A), RS1000164111 (20:44043320 G>A), RS1000164552 (20:43949625 C>G,T), RS1000170804 (20:43927510 C>T), RS1000195242 (20:43988753 C>A,T), RS1000195583 (20:44011431 T>C,G)

Disease associations

OMIM: gene MIM:611163 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

16 associations (top):

StudyTraitp-value
GCST000713_10Conduct disorder (symptom count)6.000000e-06
GCST003265_339Post bronchodilator FEV1/FVC ratio in COPD6.000000e-07
GCST004601_197Red blood cell count4.000000e-09
GCST004602_232Mean corpuscular volume5.000000e-10
GCST007552_16Colorectal cancer4.000000e-07
GCST009597_105Multiple sclerosis6.000000e-09
GCST010002_68Refractive error1.000000e-08
GCST010219_20Attention deficit hyperactivity disorder (inattention symptoms)3.000000e-07
GCST90002381_361Eosinophil count4.000000e-22
GCST90002382_530Eosinophil percentage of white cells4.000000e-15
GCST90002390_670Mean corpuscular hemoglobin5.000000e-18
GCST90002392_107Mean corpuscular volume5.000000e-17
GCST90002397_279Mean spheric corpuscular volume8.000000e-10
GCST90002398_56Neutrophil count8.000000e-09
GCST90002403_319Red blood cell count2.000000e-14
GCST90002407_648White blood cell count1.000000e-10

EFO canonical traits (6, from GWAS)

EFO IDTrait name
EFO:0004713FEV/FVC ratio
EFO:0004305erythrocyte count
EFO:0004842eosinophil count
EFO:0007991eosinophil percentage of leukocytes
EFO:0004527mean corpuscular hemoglobin
EFO:0004833neutrophil count

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

53 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects expression, affects cotreatment, increases expression8
bisphenol Aaffects cotreatment, decreases methylation, increases expression2
belinostatdecreases expression, affects cotreatment2
Panobinostataffects cotreatment, decreases expression2
Cisplatinaffects expression, decreases expression2
Phenylmercuric Acetateaffects cotreatment, decreases expression2
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxidedecreases expression2
Particulate Matterdecreases expression, increases abundance, affects cotreatment2
triphenyl phosphateaffects expression1
propionaldehydeincreases expression1
trichostatin Aincreases expression1
sodium arseniteincreases expression1
butyraldehydeincreases expression1
aflatoxin B2decreases methylation1
S-(1,2-dichlorovinyl)cysteineaffects response to substance, increases expression, affects cotreatment, decreases expression1
pentanalincreases expression1
perfluorooctane sulfonic acidincreases expression1
2-palmitoylglycerolincreases expression1
entinostatincreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression, increases expression1
ICG 001increases expression1
abrinedecreases expression1
ormosilaffects binding, increases expression1
2-methyl-2H-pyrazole-3-carboxylic acid (2-methyl-4-o-tolylazophenyl)amideincreases expression1
dorsomorphinincreases expression, affects cotreatment, decreases expression1
jinfukangincreases expression1
4-(4-((5-(4,5-dimethyl-2-nitrophenyl)-2-furanyl)methylene)-4,5-dihydro-3-methyl-5-oxo-1H-pyrazol-1-yl)benzoic acidaffects expression1
Resveratrolaffects cotreatment, decreases expression1
Temozolomidedecreases expression1
Decitabineaffects expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): conduct disorder