TPRX1

gene
On this page

Also known as FLJ40321

Summary

TPRX1 (tetrapeptide repeat homeobox 1, HGNC:32174) is a protein-coding gene on chromosome 19q13.33, encoding Tetra-peptide repeat homeobox protein 1 (Q8N7U7). Transcription factor expressed after fertilization required for zygotic genome activation (ZGA), a critical event in early embryonic development during which the developmental control passes from maternally provided mRNAs to the expression of the zygotic genome after fertilizati….

Homeobox genes encode DNA-binding proteins, many of which are thought to be involved in early embryonic development. Homeobox genes encode a DNA-binding domain of 60 to 63 amino acids referred to as the homeodomain. This gene is a member of the TPRX homeobox gene family.

Source: NCBI Gene 284355 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 96 total
  • MANE Select transcript: NM_001397346

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:32174
Approved symbolTPRX1
Nametetrapeptide repeat homeobox 1
Location19q13.33
Locus typegene with protein product
StatusApproved
AliasesFLJ40321
Ensembl geneENSG00000178928
Ensembl biotypeprotein_coding
OMIM611166
Entrez284355

Gene structure

Transcript identifiers

Ensembl transcripts: 3 — 3 protein_coding

ENST00000322175, ENST00000535759, ENST00000698655

RefSeq mRNA: 2 — MANE Select: NM_001397346 NM_001397346, NM_198479

CCDS: CCDS33066, CCDS92654

Canonical transcript exons

ENST00000698655 — 3 exons

ExonStartEnd
ENSE000039743384780350447803673
ENSE000039743394780123247802980
ENSE000039743404780451547804551

Expression profiles

Bgee: expression breadth tissue_specific, 8 present calls, max score 52.00.

Top tissues by expression

128 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099152.00gold quality
left testisUBERON:000453345.45gold quality
right testisUBERON:000453445.20gold quality
testisUBERON:000047345.11gold quality
colonic epitheliumUBERON:000039737.20gold quality
ventricular zoneUBERON:000305336.48gold quality
cortical plateUBERON:000534336.47gold quality
bone marrow cellCL:000209236.16gold quality
sural nerveUBERON:001548835.87gold quality
ganglionic eminenceUBERON:000402335.49gold quality
skeletal muscle tissueUBERON:000113433.38gold quality
muscle tissueUBERON:000238532.20gold quality
hindlimb stylopod muscleUBERON:000425232.15gold quality
bone marrowUBERON:000237131.74gold quality
monocyteCL:000057630.18gold quality
prefrontal cortexUBERON:000045129.99gold quality
leukocyteCL:000073829.95gold quality
stromal cell of endometriumCL:000225529.87gold quality
urinary bladderUBERON:000125529.71gold quality
vermiform appendixUBERON:000115429.36gold quality
duodenumUBERON:000211428.14gold quality
smooth muscle tissueUBERON:000113527.61gold quality
lymph nodeUBERON:000002927.57gold quality
cerebellumUBERON:000203727.28silver quality
cerebellar hemisphereUBERON:000224527.19silver quality
cerebellar cortexUBERON:000212927.12silver quality
tonsilUBERON:000237227.05gold quality
superior frontal gyrusUBERON:000266126.94gold quality
descending thoracic aortaUBERON:000234526.89gold quality
myometriumUBERON:000129626.81gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-GEOD-36552yes923.85
E-ANND-3no1.14

Regulation

Is transcription factor: yes

JASPAR motifs

MotifNameFamily
MA2600.1TPRX1Paired-related HD

JASPAR matrix evidence (PMIDs): PMID:39605530

miRNA regulators (miRDB)

34 targeting TPRX1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-4262100.0073.263931
HSA-MIR-1252-5P100.0069.802774
HSA-MIR-3646100.0073.565283
HSA-MIR-181A-5P99.9972.962995
HSA-MIR-181B-5P99.9972.972996
HSA-MIR-181C-5P99.9972.952996
HSA-MIR-181D-5P99.9973.042997
HSA-MIR-60799.9773.625593
HSA-MIR-4725-3P99.9669.532520
HSA-MIR-6780B-5P99.9669.602562
HSA-MIR-7-1-3P99.9171.534384
HSA-MIR-7-2-3P99.9171.404394
HSA-MIR-367199.9073.043897
HSA-MIR-6756-5P99.8267.972466
HSA-MIR-6766-5P99.6867.702325
HSA-MIR-46699.6770.852863
HSA-MIR-3158-5P99.6567.511763
HSA-MIR-444199.4966.563216
HSA-MIR-312399.4767.152693
HSA-MIR-6507-5P99.3670.462524
HSA-MIR-427099.0266.261987
HSA-MIR-6769B-5P98.7364.911092
HSA-MIR-797798.6566.182590
HSA-MIR-6754-5P98.6065.541627
HSA-MIR-302F98.4469.021776
HSA-MIR-3155A98.1666.09965
HSA-MIR-3155B98.1666.09965
HSA-MIR-48498.1666.921074
HSA-MIR-6881-5P98.1667.38665
HSA-MIR-211-3P98.1466.771052

Cross-species orthologs

0 orthologs

Paralogs (1): TPRX2 (ENSG00000259009)

Protein

Protein identifiers

Tetra-peptide repeat homeobox protein 1Q8N7U7 (reviewed: Q8N7U7)

All UniProt accessions (3): A0A8V8TM49, A0A8V8TMK4, D2CFI5

UniProt curated annotations — full annotation on UniProt →

Function. Transcription factor expressed after fertilization required for zygotic genome activation (ZGA), a critical event in early embryonic development during which the developmental control passes from maternally provided mRNAs to the expression of the zygotic genome after fertilization. Binds and activates expression of key ZGA marker genes, such as NANOGNB, ZSCAN4, DUXB, KLF5 and DPPA3. Binds to regulatory DNA sequences containing a 5’-TAATCC-3’ sequence motif.

Subcellular location. Nucleus.

Similarity. Belongs to the paired homeobox family.

Isoforms (2)

UniProt IDNamesCanonical?
Q8N7U7-11yes
Q8N7U7-22

RefSeq proteins (2): NP_001384275, NP_940881 (=MANE)

Domains & families (InterPro)

IDNameType
IPR001356HDDomain
IPR009057Homeodomain-like_sfHomologous_superfamily

UniProt features (17 total): compositionally biased region 7, region of interest 4, sequence conflict 3, chain 1, DNA-binding region 1, splice variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8N7U7-F147.450.04

Function

Pathways and Gene Ontology

Reactome pathways

1 pathways

IDPathway
R-HSA-9819196Zygotic genome activation (ZGA)

MSigDB gene sets: 24 (showing top): GOMF_TRANSCRIPTION_REGULATOR_ACTIVITY, GLI1_TARGET_GENES, ZNF146_TARGET_GENES, ZNF486_TARGET_GENES, MIR607, MIR4262, MIR3671, MIR181A_5P_MIR181B_5P, MIR181D_5P, MIR181C_5P, MIR1252_5P, MIR3123, MIR12128, MIR1226_3P, MIR3936

GO Biological Process (3): maternal-to-zygotic transition of gene expression (GO:0160021), regulation of DNA-templated transcription (GO:0006355), regulation of transcription by RNA polymerase II (GO:0006357)

GO Molecular Function (2): DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA binding (GO:0003677)

GO Cellular Component (3): chromatin (GO:0000785), nucleoplasm (GO:0005654), nucleus (GO:0005634)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
Maternal to zygotic transition (MZT)1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
regulation of gene expression2
cellular anatomical structure2
DNA-templated transcription1
regulation of RNA biosynthetic process1
regulation of DNA-templated transcription1
transcription by RNA polymerase II1
chromatin1
RNA polymerase II transcription regulatory region sequence-specific DNA binding1
DNA-binding transcription factor activity1
regulation of transcription by RNA polymerase II1
nucleic acid binding1
chromosome1
nuclear lumen1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

408 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
TPRX1CRXO43186879
TPRX1DPRXA6NFQ7818
TPRX1ARGFXA6NJG6797
TPRX1DUXBA0A1W2PPF3768
TPRX1LEUTXA8MZ59729
TPRX1DUXAA6NLW8705
TPRX1NANOGNBQ7Z5D8667
TPRX1RHOXF2BP0C7M4570
TPRX1LBX1P52954527
TPRX1ANHXE9PGG2463
TPRX1RHOXF2Q9BQY4419
TPRX1DUX4L2P0CJ85409
TPRX1ZMAT1Q5H9K5408
TPRX1UNCXA6NJT0404
TPRX1DRGXA6NNA5403

IntAct

2 interactions, top by confidence:

ABTypeScore
TPRX1PARP4psi-mi:“MI:0915”(physical association)0.400

BioGRID (20): TPRX1 (Two-hybrid), TPRX1 (Two-hybrid), TPRX1 (Two-hybrid), TPRX1 (Two-hybrid), TPRX1 (Two-hybrid), TPRX1 (Two-hybrid), MKRN3 (Two-hybrid), KRTAP6-2 (Two-hybrid), NELFB (Two-hybrid), INCA1 (Two-hybrid), GOLGA6L9 (Two-hybrid), PRR20B (Two-hybrid), PRR20A (Two-hybrid), FANCG (Two-hybrid), PRR20D (Two-hybrid)

ESM2 similar proteins: A0A0J9YWL9, A0A0J9YY54, A0A0U1RQI7, A0A494C071, A6NNC1, A6QL64, B3KS81, D3YZV8, E2RYF7, E9Q6E9, F1LWT0, F8W0I5, O60732, P0C8Z4, P0DKJ7, P0DKJ8, P0DKL2, P0DPF3, P18751, P20930, P43537, P53353, P59797, P62521, Q01456, Q08AG5, Q12816, Q3BBV2, Q5H9R4, Q5HY64, Q5JPF3, Q6P902, Q6ZQX7, Q86T75, Q86VE3, Q86VQ3, Q8BGJ3, Q8N307, Q8N7U7, Q8N7X1

Diamond homologs: A1YEV8, A1YEY5, A1YFI3, A1YG25, A1YG57, A1YGA2, A2T711, A2T733, A2T777, A2T7P4, A6YP92, F1NEA7, G3X9P6, G3X9U1, O14813, O15499, O35085, O35160, O42115, O43186, O43316, O54751, O73592, O75364, O93385, O97670, P06601, P09082, P09083, P0DV77, P10181, P18264, P20270, P21711, P22810, P29454, P32242, P32243, P50223, P53544

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

96 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance77
Likely benign19
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

571 predictions. Top by Δscore:

VariantEffectΔscore
19:47802978:CAC:Cacceptor_gain1.0000
19:47803498:TCAGA:Tdonor_loss1.0000
19:47803499:CAGA:Cdonor_loss1.0000
19:47803500:AGAC:Adonor_loss1.0000
19:47803501:GAC:Gdonor_loss1.0000
19:47803502:AC:Adonor_loss1.0000
19:47803503:C:CTdonor_loss1.0000
19:47803518:C:CAdonor_gain1.0000
19:47803670:GGGC:Gacceptor_gain1.0000
19:47803671:GGC:Gacceptor_gain1.0000
19:47803672:GC:Gacceptor_gain1.0000
19:47803673:CC:Cacceptor_gain1.0000
19:47803674:C:CCacceptor_gain1.0000
19:47803674:CTGG:Cacceptor_loss1.0000
19:47803675:T:Cacceptor_loss1.0000
19:47802976:CACAC:Cacceptor_gain0.9900
19:47802977:ACAC:Aacceptor_gain0.9900
19:47802978:CACC:Cacceptor_gain0.9900
19:47802982:T:Cacceptor_loss0.9900
19:47802989:C:CTacceptor_gain0.9900
19:47802990:A:Tacceptor_gain0.9900
19:47803506:G:Adonor_gain0.9900
19:47803509:G:Cdonor_gain0.9900
19:47803669:AGGGC:Aacceptor_gain0.9900
19:47803680:G:Cacceptor_gain0.9900
19:47803680:G:GCacceptor_gain0.9900
19:47818967:CTCAC:Cdonor_loss0.9900
19:47818968:TCAC:Tdonor_loss0.9900
19:47818969:CACCT:Cdonor_loss0.9900
19:47818970:A:Cdonor_loss0.9900

AlphaMissense

2909 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
19:47802969:C:AK14N0.985
19:47802969:C:GK14N0.985
19:47803508:A:GL9P0.957
19:47802964:C:GR16P0.955
19:47802970:T:AK14M0.955
19:47802972:G:CF13L0.955
19:47802972:G:TF13L0.955
19:47802974:A:GF13L0.955
19:47802975:C:AW12C0.953
19:47802975:C:GW12C0.953
19:47802973:A:GF13S0.950
19:47802977:A:GW12R0.948
19:47802977:A:TW12R0.948
19:47803517:T:AE6V0.946
19:47802965:G:TR16S0.939
19:47802959:C:GA18P0.933
19:47802954:T:AK19N0.926
19:47802954:T:GK19N0.926
19:47802968:T:CN15D0.923
19:47802970:T:GK14T0.923
19:47802967:T:CN15S0.921
19:47802962:G:TR17S0.917
19:47803518:C:TE6K0.911
19:47802967:T:GN15T0.908
19:47802967:T:AN15I0.906
19:47802971:T:CK14E0.906
19:47802961:C:GR17P0.903
19:47802966:A:CN15K0.900
19:47802966:A:TN15K0.900
19:47802980:C:TV11M0.876

dbSNP variants (sampled 300 via entrez): RS1000067162 (19:47814000 A>C), RS1000241886 (19:47808271 T>C), RS1000299028 (19:47820997 C>A,T), RS1000574795 (19:47817930 C>T), RS1000645890 (19:47819154 T>G), RS1000777598 (19:47801393 A>G), RS1000862751 (19:47805122 A>G), RS1001025614 (19:47810677 T>G), RS1001037778 (19:47816213 C>T), RS1001088667 (19:47816033 C>T), RS1001149556 (19:47803036 T>A,G), RS1001268673 (19:47809515 T>C), RS1001280507 (19:47813225 G>A), RS1001331956 (19:47810873 G>A,C), RS1001361224 (19:47818927 T>A)

Disease associations

OMIM: gene MIM:611166 | disease phenotypes: MIM:120970

GenCC curated gene-disease

Mondo (1): cone-rod dystrophy 2 (MONDO:0007362)

Orphanet (1): Cone rod dystrophy (Orphanet:1872)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

3 total (human), top 3 by PubMed support.

ChemicalActions (top 5)PubMed papers
aristolochic acid Iincreases expression1
tris(1,3-dichloro-2-propyl)phosphateincreases expression1
CGP 52608affects binding, increases reaction1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): cone-rod dystrophy 2