RHOXF2

gene
On this page

Also known as THG1PEPP-2PEPP2CT107

Summary

RHOXF2 (Rhox homeobox family member 2, HGNC:30011) is a protein-coding gene on chromosome Xq24, encoding Rhox homeobox family member 2 (Q9BQY4). Transcription factor maybe involved in reproductive processes.

This gene, which encodes a transcriptional repressor, is one of two paralogous X-linked homeobox-containing genes and is highly expressed in a variety of cancers. In addition, the encoded protein associates with the cell membrane and with microtubules, and is concentrated at the leading edge of migratory cells.

Source: NCBI Gene 84528 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 33 total — 1 pathogenic
  • Druggable target: yes — 1 molecules with ChEMBL bioactivity
  • MANE Select transcript: NM_032498

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:30011
Approved symbolRHOXF2
NameRhox homeobox family member 2
LocationXq24
Locus typegene with protein product
StatusApproved
AliasesTHG1, PEPP-2, PEPP2, CT107
Ensembl geneENSG00000131721
Ensembl biotypeprotein_coding
OMIM300447
Entrez84528

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000371388

RefSeq mRNA: 1 — MANE Select: NM_032498 NM_032498

CCDS: CCDS14594

Canonical transcript exons

ENST00000371388 — 4 exons

ExonStartEnd
ENSE00001661159120158613120158829
ENSE00001678225120163458120165630
ENSE00001694270120160358120160403
ENSE00001794725120159015120159426

Expression profiles

Bgee: expression breadth broad, 22 present calls, max score 82.76.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.6132 / max 59.7410, expressed in 55 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
2097960.613255

Top tissues by expression

90 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047382.76gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099180.32gold quality
testisUBERON:000047366.95gold quality
right testisUBERON:000453465.55gold quality
left testisUBERON:000453364.90gold quality
colonic epitheliumUBERON:000039754.41gold quality
bone marrow cellCL:000209252.85gold quality
sural nerveUBERON:001548848.91gold quality
tonsilUBERON:000237244.88gold quality
bone marrowUBERON:000237142.50gold quality
olfactory segment of nasal mucosaUBERON:000538641.39silver quality
skeletal muscle tissueUBERON:000113441.18gold quality
right uterine tubeUBERON:000130240.41silver quality
corpus callosumUBERON:000233639.52silver quality
adrenal tissueUBERON:001830339.38silver quality
lower esophagus mucosaUBERON:003583439.04gold quality
liverUBERON:000210738.79gold quality
muscle tissueUBERON:000238538.22gold quality
bloodUBERON:000017838.16silver quality
primary visual cortexUBERON:000243638.10gold quality
urinary bladderUBERON:000125537.59gold quality
right hemisphere of cerebellumUBERON:001489037.08gold quality
monocyteCL:000057636.61gold quality
ventricular zoneUBERON:000305336.48gold quality
leukocyteCL:000073836.33gold quality
adult mammalian kidneyUBERON:000008236.26silver quality
granulocyteCL:000009435.96gold quality
ganglionic eminenceUBERON:000402335.49gold quality
endometriumUBERON:000129535.48gold quality
hindlimb stylopod muscleUBERON:000425235.40gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-GEOD-134144yes265.87
E-ANND-3no0.89

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

24 targeting RHOXF2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-8485100.0077.574731
HSA-MIR-548AW99.9972.573559
HSA-MIR-1468-3P99.9672.743797
HSA-MIR-146A-5P99.9668.93988
HSA-MIR-146B-5P99.9669.13977
HSA-MIR-493-5P99.9672.472382
HSA-MIR-7153-5P99.9468.891006
HSA-MIR-627-3P99.9071.423316
HSA-MIR-153-5P99.8973.866317
HSA-MIR-5003-3P99.8569.292517
HSA-MIR-451799.7669.191867
HSA-MIR-494-3P99.7071.452795
HSA-MIR-312899.5067.851258
HSA-MIR-149-5P99.2567.161315
HSA-MIR-60698.7267.34960
HSA-MIR-4664-5P98.1765.071020
HSA-MIR-18B-3P98.0565.55595
HSA-MIR-891A-3P98.0567.99970
HSA-MIR-5588-5P97.4665.70913
HSA-MIR-342-5P97.2564.10817
HSA-MIR-424-3P97.2065.86385
HSA-MIR-3157-3P95.8667.08454
HSA-MIR-31-3P95.1769.82575
HSA-MIR-371B-3P94.4866.59345

Literature-anchored findings (GeneRIF, showing 4)

  • RHOXF2 is highly expressed in some leukemia cell lines and a variety of human cancers. Results indicate that RHOXF2 is involved in carcinogenesis. (PMID:21874235)
  • In striking contrast to non-human primates, humans appear to have homogenized their two RHOXF2 copies by the endogenous retrovirus -mediated non-allelic recombination mechanism. (PMID:21988730)
  • PEPP2 has a role in regulating function of membrane and microtubules (PMID:23682428)
  • Human Thg1, two critical tRNA-interacting residues, His-152 and Lys-187, alanine-substituted variants maintain a similar overall interaction with the anticodon region, arguing against the sufficiency of this interaction for driving catalysis. Instead, conservative mutagenesis revealed a new direct function for these residues in recognition of a non-Watson-Crick G-1:A73 bp (PMID:31000629)

Cross-species orthologs

0 orthologs

Paralogs (50): ARX (ENSG00000004848), PAX6 (ENSG00000007372), PAX7 (ENSG00000009709), ALX4 (ENSG00000052850), GSC2 (ENSG00000063515), PITX1 (ENSG00000069011), PAX2 (ENSG00000075891), RHOXF1 (ENSG00000101883), CRX (ENSG00000105392), EVX1 (ENSG00000106038), PAX4 (ENSG00000106331), NOBOX (ENSG00000106410), PITX3 (ENSG00000107859), PHOX2B (ENSG00000109132), OTX1 (ENSG00000115507), PRRX1 (ENSG00000116132), VSX2 (ENSG00000119614), ESX1 (ENSG00000123576), PAX8 (ENSG00000125618), PAX1 (ENSG00000125813), GSC (ENSG00000133937), RAX (ENSG00000134438), PAX3 (ENSG00000135903), ALX3 (ENSG00000156150), HESX1 (ENSG00000163666), PITX2 (ENSG00000164093), UNCX (ENSG00000164853), PHOX2A (ENSG00000165462), OTX2 (ENSG00000165588), DRGX (ENSG00000165606), PRRX2 (ENSG00000167157), SHOX2 (ENSG00000168779), OTP (ENSG00000171540), RAX2 (ENSG00000173976), EVX2 (ENSG00000174279), PROP1 (ENSG00000175325), ISX (ENSG00000175329), ALX1 (ENSG00000180318), MIXL1 (ENSG00000185155), SHOX (ENSG00000185960)

Protein

Protein identifiers

Rhox homeobox family member 2Q9BQY4 (reviewed: Q9BQY4)

Alternative names: Paired-like homeobox protein PEPP-2, Testis homeobox gene 1

All UniProt accessions (1): Q9BQY4

UniProt curated annotations — full annotation on UniProt →

Function. Transcription factor maybe involved in reproductive processes. Modulates expression of target genes encoding proteins involved in processes relevant to spermatogenesis.

Subcellular location. Nucleus.

Tissue specificity. Testis. Not detected in epididymis nor placenta. In testis, mainly expressed in germ cells, but also detected in somatic cells such as Sertoli cells, Leydig cells and peritubular cells.

Similarity. Belongs to the paired-like homeobox family. PEPP subfamily.

RefSeq proteins (1): NP_115887* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001356HDDomain
IPR009057Homeodomain-like_sfHomologous_superfamily
IPR017970Homeobox_CSConserved_site

Pfam: PF00046

UniProt features (10 total): sequence variant 4, compositionally biased region 2, chain 1, DNA-binding region 1, region of interest 1, short sequence motif 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9BQY4-F164.380.22

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 22 (showing top): GOBP_NEUROGENESIS, GOMF_SEQUENCE_SPECIFIC_DNA_BINDING, chrXq24, WARTERS_IR_RESPONSE_5GY, NFE2L2.V2, GOMF_TRANSCRIPTION_REGULATOR_ACTIVITY, MIR1468_3P, MIR5003_3P, MIR3128, MIR149_5P, MIR31_3P, MIR18B_3P, MIR424_3P, GOBP_POSITIVE_REGULATION_OF_GENE_EXPRESSION, GOBP_GENERATION_OF_NEURONS

GO Biological Process (4): regulation of transcription by RNA polymerase II (GO:0006357), positive regulation of gene expression (GO:0010628), neuron development (GO:0048666), regulation of DNA-templated transcription (GO:0006355)

GO Molecular Function (6): RNA polymerase II transcription regulatory region sequence-specific DNA binding (GO:0000977), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), identical protein binding (GO:0042802), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677), protein binding (GO:0005515)

GO Cellular Component (2): chromatin (GO:0000785), nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
regulation of gene expression2
regulation of DNA-templated transcription1
transcription by RNA polymerase II1
gene expression1
positive regulation of macromolecule biosynthetic process1
neuron differentiation1
cell development1
DNA-templated transcription1
regulation of RNA biosynthetic process1
transcription cis-regulatory region binding1
chromatin1
RNA polymerase II transcription regulatory region sequence-specific DNA binding1
DNA-binding transcription factor activity1
regulation of transcription by RNA polymerase II1
protein binding1
double-stranded DNA binding1
sequence-specific DNA binding1
nucleic acid binding1
binding1
chromosome1
cellular anatomical structure1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

512 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
RHOXF2TGIF2LXQ8IUE1599
RHOXF2ODF4Q2M2E3542
RHOXF2UNC5CO95185479
RHOXF2SLC71A3PQ5VZR4448
RHOXF2TSGA10Q9BZW7424
RHOXF2TPRX1Q8N7U7419
RHOXF2AKAP4Q5JQC9415
RHOXF2INHAP05111394
RHOXF2DUXBA0A1W2PPF3373
RHOXF2NANOGNBQ7Z5D8370
RHOXF2REXO5Q96IC2336
RHOXF2ARGFXA6NJG6327
RHOXF2TET1Q8NFU7324
RHOXF2PRSS37A4D1T9312
RHOXF2DPRXA6NFQ7305

IntAct

297 interactions, top by confidence:

ABTypeScore
RHOXF2DAZAP2psi-mi:“MI:0915”(physical association)0.850
DAZAP2RHOXF2psi-mi:“MI:0915”(physical association)0.850
RHOXF2ZBTB32psi-mi:“MI:0915”(physical association)0.810
ZBTB32RHOXF2psi-mi:“MI:0915”(physical association)0.810
RHOXF2TOLLIPpsi-mi:“MI:0915”(physical association)0.800
TRIP13RHOXF2psi-mi:“MI:0915”(physical association)0.790
RHOXF2TRIP13psi-mi:“MI:0915”(physical association)0.790
KRTAP19-5RHOXF2psi-mi:“MI:0915”(physical association)0.720
RHOXF2KRTAP19-7psi-mi:“MI:0915”(physical association)0.720
VAC14RHOXF2psi-mi:“MI:0915”(physical association)0.720
RHOXF2MAGED1psi-mi:“MI:0915”(physical association)0.720
RHOXF2KRTAP19-5psi-mi:“MI:0915”(physical association)0.720

BioGRID (139): RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid), RHOXF2 (Two-hybrid)

ESM2 similar proteins: A0A1W2PPF3, A0A3B3IT52, E9Q3T6, O02747, O13161, O42173, O57374, O73622, O95238, P01103, P01104, P09632, P0C7M4, P10242, P14837, P17278, P30561, P31538, P41738, P46200, Q1KKS8, Q28G02, Q32NH9, Q4JM65, Q5REX1, Q5WM45, Q66IG8, Q68EH7, Q6U8D7, Q6ZTZ1, Q7T1K4, Q8BIL2, Q8BKE5, Q8CFH6, Q8IWB6, Q8JIT7, Q8NHV9, Q8R4S2, Q8R4S4, Q8R4S5

Diamond homologs: A1A546, A1YEV8, A1YEY5, A1YFI3, A1YG25, A1YG57, A2T711, A2T733, A2T7P4, A6NNA5, A6YP92, F6YCR7, G5EC89, G5EDS1, O14813, O18400, O35085, O35137, O35160, O35602, O35690, O42115, O42201, O42356, O42357, O42358, O42567, O43186, O54751, O70137, O73917, O75360, O75364, O88436, O93385, O95076, O97039, P06601, P0C7M4, P26367

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 64 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Keratinization1019.2×7e-09

Disease & clinical

Clinical variants and AI predictions

ClinVar

33 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance23
Likely benign8
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
2424705NC_000023.10:g.(?117629935)(119761021_?)delPathogenic

SpliceAI

432 predictions. Top by Δscore:

VariantEffectΔscore
X:120159414:G:GGdonor_gain1.0000
X:120159424:GCG:Gdonor_gain1.0000
X:120159427:GTA:Gdonor_loss1.0000
X:120159428:T:Adonor_loss1.0000
X:120163449:T:Aacceptor_gain1.0000
X:120159427:G:GGdonor_gain0.9900
X:120163453:T:Aacceptor_gain0.9900
X:120163456:A:AGacceptor_gain0.9900
X:120163457:G:GGacceptor_gain0.9900
X:120163457:GATTT:Gacceptor_gain0.9900
X:120160356:A:AGacceptor_gain0.9800
X:120160357:G:GGacceptor_gain0.9800
X:120163453:TGAA:Tacceptor_loss0.9800
X:120163454:GAAG:Gacceptor_loss0.9800
X:120163455:AAG:Aacceptor_loss0.9800
X:120163456:A:Cacceptor_loss0.9800
X:120163457:G:Aacceptor_loss0.9800
X:120158828:GG:Gdonor_gain0.9700
X:120158829:GG:Gdonor_gain0.9700
X:120163457:GAT:Gacceptor_gain0.9700
X:120160636:A:Gdonor_gain0.9600
X:120160636:A:AGdonor_gain0.9500
X:120163457:GA:Gacceptor_gain0.9400
X:120163457:GATT:Gacceptor_gain0.9400
X:120163454:GAAGA:Gacceptor_gain0.9300
X:120163453:TGAAG:Tacceptor_gain0.9200
X:120159272:G:GTdonor_gain0.9000
X:120160357:GAA:Gacceptor_gain0.9000
X:120163455:AAGAT:Aacceptor_gain0.9000
X:120163452:TTGAA:Tacceptor_gain0.8900

AlphaMissense

1918 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
X:120163464:T:CF182L0.996
X:120163466:T:AF182L0.996
X:120163466:T:GF182L0.996
X:120159356:T:CF141L0.995
X:120159358:C:AF141L0.995
X:120159358:C:GF141L0.995
X:120159392:T:CF153L0.995
X:120159394:C:AF153L0.995
X:120159394:C:GF153L0.995
X:120159357:T:GF141C0.989
X:120159407:T:CF158L0.989
X:120159409:C:AF158L0.989
X:120159409:C:GF158L0.989
X:120163465:T:CF182S0.989
X:120163478:A:CR186S0.989
X:120163478:A:TR186S0.989
X:120159393:T:CF153S0.988
X:120159393:T:GF153C0.986
X:120163465:T:GF182C0.986
X:120159357:T:CF141S0.985
X:120159381:T:CL149P0.985
X:120163463:G:CW181C0.980
X:120163463:G:TW181C0.980
X:120163475:A:CR185S0.979
X:120163475:A:TR185S0.979
X:120163477:G:CR186T0.977
X:120163464:T:AF182I0.976
X:120159392:T:AF153I0.975
X:120159392:T:GF153V0.975
X:120159372:T:CL146P0.974

dbSNP variants (sampled 300 via entrez): RS1157163197 (X:120165269 C>G,T), RS1158059642 (X:120166088 CAGAA>C), RS1158110171 (X:120162425 G>A,C), RS1158614524 (X:120160871 G>T), RS1158680036 (X:120164471 CAAAAG>C), RS1158722971 (X:120162988 G>C), RS1158809182 (X:120165077 T>C), RS1159039261 (X:120159552 C>G,T), RS1159234947 (X:120161599 GA>G), RS1160181564 (X:120158860 C>T), RS1161658741 (X:120159017 A>G), RS1162210873 (X:120161863 T>C), RS1164660199 (X:120163009 A>T), RS1164742912 (X:120158094 GA>G,GAA), RS1164767328 (X:120165483 G>C)

Disease associations

OMIM: gene MIM:300447 | disease phenotypes: MIM:300354, MIM:300676

GenCC curated gene-disease

Mondo (2): X-linked intellectual disability, Cabezas type (MONDO:0010306), syndromic X-linked intellectual disability 14 (MONDO:0010398)

Orphanet (3): Lujan-Fryns syndrome (Orphanet:776), X-linked intellectual disability, Vitale type (Orphanet:85289), X-linked intellectual disability, Cabezas type (Orphanet:85293)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

MeSH disease descriptors (1)

DescriptorNameTree numbers
C567063Mental Retardation, X-Linked, Syndromic 14 (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL5725135 (SINGLE PROTEIN)

Molecules with ChEMBL bioactivity

1 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 1,538 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL1232461MOLIBRESIB21,538

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

1 potent at pChembl≥5 of 1 total, top 1 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
6.60IC50250nMMOLIBRESIB

PubChem BioAssay actives

1 with measured affinity, of 6 total; 1 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
2-[(4S)-6-(4-chlorophenyl)-8-methoxy-1-methyl-4H-[1,2,4]triazolo[4,3-a][1,4]benzodiazepin-4-yl]-N-ethylacetamide2178798: Inhibition of RHOXF2 (unknown origin) incubated for 1 hr by colloidal coomassie staining based LC-MS/MS analysisic500.2500uM

CTD chemical–gene interactions

5 total (human), top 5 by PubMed support.

ChemicalActions (top 5)PubMed papers
Decitabineincreases expression2
cobaltous chloridedecreases expression1
Plant Extractsdecreases expression1
Valproic Acidincreases methylation1
Okadaic Acidincreases expression1

ChEMBL screening assays

6 unique, capped per target: 6 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5697528BindingInhibition of RHOXF2 (unknown origin) assessed as fold change at 10 uM incubated for 1 hr by colloidal coomassie staining based LC-MS/MS analysisInhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia. — Nature

Cellosaurus cell lines

3 cell lines: 3 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A5W2SEES3-1V human RHOXF2, clone1Embryonic stem cellMale
CVCL_A5W3SEES3-1V human RHOXF2, clone2Embryonic stem cellMale
CVCL_A5W4SEES3-1V human RHOXF2, clone3Embryonic stem cellMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.