THAP3

gene
On this page

Summary

THAP3 (THAP domain containing 3, HGNC:20855) is a protein-coding gene on chromosome 1p36.31, encoding THAP domain-containing protein 3 (Q8WTV1). Component of a THAP1/THAP3-HCFC1-OGT complex that is required for the regulation of the transcriptional activity of RRM1.

Predicted to enable DNA binding activity and zinc ion binding activity. Involved in positive regulation of transcription by RNA polymerase II.

Source: NCBI Gene 90326 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 42 total
  • MANE Select transcript: NM_001195753

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:20855
Approved symbolTHAP3
NameTHAP domain containing 3
Location1p36.31
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000041988
Ensembl biotypeprotein_coding
OMIM612532
Entrez90326

Gene structure

Transcript identifiers

Ensembl transcripts: 16 — 12 protein_coding, 2 protein_coding_CDS_not_defined, 1 retained_intron, 1 nonsense_mediated_decay

ENST00000054650, ENST00000307896, ENST00000377627, ENST00000472925, ENST00000480647, ENST00000484669, ENST00000484676, ENST00000487819, ENST00000866303, ENST00000866304, ENST00000866305, ENST00000866306, ENST00000922198, ENST00000922199, ENST00000922200, ENST00000953195

RefSeq mRNA: 8 — MANE Select: NM_001195753 NM_001195752, NM_001195753, NM_001394496, NM_001394497, NM_001394498, NM_001394499, NM_001394500, NM_138350

CCDS: CCDS55572, CCDS55573, CCDS86

Canonical transcript exons

ENST00000054650 — 6 exons

ExonStartEnd
ENSE0000140259166251506625292
ENSE0000147464766248686624954
ENSE0000295650966327966633562
ENSE0000340698166284996628691
ENSE0000347709266302886630353
ENSE0000367410166323916632495

Expression profiles

Bgee: expression breadth ubiquitous, 175 present calls, max score 91.59.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 7.8813 / max 93.0240, expressed in 1784 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
4136.96151766
4120.9197310

Top tissues by expression

286 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
apex of heartUBERON:000209891.59gold quality
lower esophagus mucosaUBERON:003583489.97gold quality
right hemisphere of cerebellumUBERON:001489088.16gold quality
cerebellar hemisphereUBERON:000224587.97gold quality
mucosa of transverse colonUBERON:000499187.94gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099187.89gold quality
cerebellar cortexUBERON:000212987.80gold quality
esophagogastric junction muscularis propriaUBERON:003584187.70gold quality
lower esophagus muscularis layerUBERON:003583387.50gold quality
endocervixUBERON:000045887.49gold quality
lower esophagusUBERON:001347387.49gold quality
right adrenal gland cortexUBERON:003582787.44gold quality
cortical plateUBERON:000534387.34gold quality
granulocyteCL:000009487.19gold quality
right adrenal glandUBERON:000123387.17gold quality
popliteal arteryUBERON:000225087.14gold quality
tibial arteryUBERON:000761087.13gold quality
frontal poleUBERON:000279587.08silver quality
hindlimb stylopod muscleUBERON:000425287.07gold quality
muscle layer of sigmoid colonUBERON:003580587.01gold quality
body of uterusUBERON:000985386.92gold quality
aortaUBERON:000094786.72gold quality
right coronary arteryUBERON:000162586.62gold quality
right lobe of thyroid glandUBERON:000111986.61gold quality
gastrocnemiusUBERON:000138886.61gold quality
descending thoracic aortaUBERON:000234586.59gold quality
ascending aortaUBERON:000149686.45gold quality
thoracic aortaUBERON:000151586.44gold quality
right frontal lobeUBERON:000281086.33gold quality
left adrenal glandUBERON:000123486.32gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-GEOD-110499no36.17
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

26 targeting THAP3, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-4795-3P100.0074.624024
HSA-MIR-126-5P100.0072.713180
HSA-MIR-656-3P100.0072.152788
HSA-MIR-3667-3P99.9967.171636
HSA-MIR-4731-5P99.8967.232537
HSA-MIR-3140-3P99.8868.472069
HSA-MIR-548AR-3P99.8571.263889
HSA-MIR-17-3P99.5566.771311
HSA-MIR-136-5P99.5067.261153
HSA-MIR-6727-3P99.4965.921333
HSA-MIR-608199.4866.071446
HSA-MIR-1213299.4768.901341
HSA-MIR-608399.4768.732393
HSA-MIR-425199.4069.193363
HSA-MIR-4722-3P99.3565.221099
HSA-MIR-751599.3168.221795
HSA-MIR-6814-5P99.0366.681273
HSA-MIR-6512-5P98.7669.291195
HSA-MIR-6516-5P98.4270.191551
HSA-MIR-6780A-3P98.4267.491518
HSA-MIR-126398.1369.18459
HSA-MIR-366597.7365.08975
HSA-MIR-6511A-3P97.6066.61713
HSA-MIR-6511B-3P97.6066.61713
HSA-MIR-3200-5P97.3465.97826
HSA-MIR-6854-3P90.9965.18155

Literature-anchored findings (GeneRIF, showing 1)

  • THAP3 recruits SMYD3 to OXPHOS genes and epigenetically promotes mitochondrial respiration in hepatocellular carcinoma. (PMID:38664231)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriosi:ch211-69l10.4ENSDARG00000098036
danio_reriosi:dkey-50i6.5ENSDARG00000101320
mus_musculusThap3ENSMUSG00000039759
rattus_norvegicusThap3ENSRNOG00000026840
drosophila_melanogasterCG13894FBGN0035157

Paralogs (7): THAP1 (ENSG00000131931), ARL14EP (ENSG00000152219), THAP8 (ENSG00000161277), THAP2 (ENSG00000173451), THAP6 (ENSG00000174796), THAP7 (ENSG00000184436), ARL14EPL (ENSG00000268223)

Protein

Protein identifiers

THAP domain-containing protein 3Q8WTV1 (reviewed: Q8WTV1)

All UniProt accessions (3): Q8WTV1, K7EIZ2, R4GNH0

UniProt curated annotations — full annotation on UniProt →

Function. Component of a THAP1/THAP3-HCFC1-OGT complex that is required for the regulation of the transcriptional activity of RRM1.

Subunit / interactions. Component of a THAP1/THAP3-HCFC1-OGT complex that contains at least, either THAP1 or THAP3, HCFC1 and OGT. Interacts directly with OGT and HCFC1 (via its HBM).

Tissue specificity. Highly expressed in heart, skeletal muscle and placenta. Weaker expression in brain, kidney and liver.

Isoforms (3)

UniProt IDNamesCanonical?
Q8WTV1-11yes
Q8WTV1-32
Q8WTV1-43

RefSeq proteins (8): NP_001182681, NP_001182682, NP_001381425, NP_001381426, NP_001381427, NP_001381428, NP_001381429, NP_612359 (=MANE)

Domains & families (InterPro)

IDNameType
IPR006612THAP_ZnfDomain
IPR026520THAP3Family

Pfam: PF05485

UniProt features (13 total): splice variant 4, mutagenesis site 3, chain 1, zinc finger region 1, sequence conflict 1, region of interest 1, short sequence motif 1, modified residue 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8WTV1-F171.570.37

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (1): 122

Mutagenesis-validated functional residues (3):

PositionPhenotype
178abolishes interaction with hcfc1.
180abolishes interaction with hcfc1.
177–180abolishes interaction with hcfc1.

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 50 (showing top): DACOSTA_UV_RESPONSE_VIA_ERCC3_UP, LASTOWSKA_NEUROBLASTOMA_COPY_NUMBER_DN, RAY_TUMORIGENESIS_BY_ERBB2_CDC25A_DN, GSE13762_CTRL_VS_125_VITAMIND_DAY5_DC_DN, chr1p36, GOBP_POSITIVE_REGULATION_OF_TRANSCRIPTION_BY_RNA_POLYMERASE_II, WAKABAYASHI_ADIPOGENESIS_PPARG_BOUND_8D, ALK_DN.V1_UP, PRKDC_TARGET_GENES, SETD7_TARGET_GENES, UBN1_TARGET_GENES, ZNF423_TARGET_GENES, ZNF711_TARGET_GENES, ZNF843_TARGET_GENES, ZSCAN30_TARGET_GENES

GO Biological Process (1): positive regulation of transcription by RNA polymerase II (GO:0045944)

GO Molecular Function (4): DNA binding (GO:0003677), zinc ion binding (GO:0008270), protein binding (GO:0005515), metal ion binding (GO:0046872)

GO Cellular Component (0):

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
regulation of transcription by RNA polymerase II1
transcription by RNA polymerase II1
positive regulation of DNA-templated transcription1
nucleic acid binding1
transition metal ion binding1
binding1
cation binding1

Protein interactions and networks

STRING

370 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
THAP3THAP12O43422882
THAP3THAP11Q96EK4877
THAP3DNAJC11Q9NVH1625
THAP3THAP6Q8TBB0621
THAP3THAP4Q8WY91574
THAP3THAP10Q9P2Z0570
THAP3RRM1P23921537
THAP3THAP8Q8NA92490
THAP3PAWRQ96IZ0469
THAP3OGTO15294459
THAP3KLHL21Q9UJP4424
THAP3HCFC1P51610405
THAP3TNFRSF25P78507385
THAP3ZNF251Q9BRH9373
THAP3SZRD1Q7Z422370

IntAct

81 interactions, top by confidence:

ABTypeScore
GTF2H4GTF2H1psi-mi:“MI:0914”(association)0.670
THAP3GTF2H4psi-mi:“MI:0915”(physical association)0.620
NUTF2THAP3psi-mi:“MI:0915”(physical association)0.560
THAP3UBL5psi-mi:“MI:0915”(physical association)0.560
CHATTHAP3psi-mi:“MI:0915”(physical association)0.560
FGFR3THAP3psi-mi:“MI:0915”(physical association)0.560
GRIN2CTHAP3psi-mi:“MI:0915”(physical association)0.560
LSAMPTHAP3psi-mi:“MI:0915”(physical association)0.560
NDUFS1THAP3psi-mi:“MI:0915”(physical association)0.560
POLR2ATHAP3psi-mi:“MI:0915”(physical association)0.560
PKN1THAP3psi-mi:“MI:0915”(physical association)0.560
RAC1THAP3psi-mi:“MI:0915”(physical association)0.560
SNRPBTHAP3psi-mi:“MI:0915”(physical association)0.560
TYRTHAP3psi-mi:“MI:0915”(physical association)0.560
UQCRBTHAP3psi-mi:“MI:0915”(physical association)0.560
UQCRC1THAP3psi-mi:“MI:0915”(physical association)0.560
VBP1THAP3psi-mi:“MI:0915”(physical association)0.560
EZRTHAP3psi-mi:“MI:0915”(physical association)0.560
THAP3psi-mi:“MI:0915”(physical association)0.560
DNALI1THAP3psi-mi:“MI:0915”(physical association)0.560
BAG6THAP3psi-mi:“MI:0915”(physical association)0.560
KLF11THAP3psi-mi:“MI:0915”(physical association)0.560
DNAJB6THAP3psi-mi:“MI:0915”(physical association)0.560
UBQLN1THAP3psi-mi:“MI:0915”(physical association)0.560
TARDBPTHAP3psi-mi:“MI:0915”(physical association)0.560

BioGRID (85): THAP3 (Affinity Capture-MS), THAP3 (Reconstituted Complex), THAP3 (Two-hybrid), THAP3 (Affinity Capture-MS), THAP3 (Affinity Capture-RNA), THAP3 (Negative Genetic), THAP3 (Negative Genetic), THAP3 (Negative Genetic), THAP3 (Negative Genetic), THAP3 (Positive Genetic), THAP3 (Positive Genetic), THAP3 (Positive Genetic), NUTF2 (Two-hybrid), UBL5 (Two-hybrid), THAP3 (Two-hybrid)

ESM2 similar proteins: A2AGX3, A4Q9F3, A6QPH9, D3YYI7, E9PGG2, P29590, Q0P5B4, Q0VC73, Q2TBI2, Q3T0G1, Q45KJ4, Q45KJ6, Q4R7H0, Q5BJT4, Q5E9N3, Q5NVM3, Q5PPH4, Q5U208, Q5XI57, Q642B6, Q6P3Z3, Q6P9L4, Q6ZN17, Q6ZVT0, Q70EL4, Q7L4P6, Q8AVK2, Q8BJ25, Q8BUM9, Q8C6D4, Q8CDF7, Q8CE64, Q8CHW1, Q8N554, Q8NA92, Q8NFT6, Q8TC41, Q8VCZ3, Q8WTV1, Q8WY91

Diamond homologs: B5XCB8, Q0IHI7, Q0P5B4, Q1JPT7, Q1RMM0, Q2TBI2, Q3T0G1, Q4R3Q6, Q4R7M0, Q5RCE4, Q5U208, Q5U560, Q5ZHN5, Q642B6, Q6DDT6, Q6DIN8, Q6P3Z3, Q7Z6K1, Q8BJ25, Q8CHW1, Q8WTV1, Q8WY91, Q9D305, Q9H0W7, Q9NVV9, Q8NA92, Q9H5L6, Q8VCZ3, Q9BT49, Q8TBB0

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

42 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance30
Likely benign4
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

988 predictions. Top by Δscore:

VariantEffectΔscore
1:6625272:G:GTdonor_gain1.0000
1:6625290:CCG:Cdonor_gain1.0000
1:6625292:GGT:Gdonor_loss1.0000
1:6625293:G:GGdonor_gain1.0000
1:6625293:GT:Gdonor_loss1.0000
1:6628494:CTTA:Cacceptor_loss1.0000
1:6628495:TTA:Tacceptor_loss1.0000
1:6628496:TA:Tacceptor_loss1.0000
1:6628497:A:AGacceptor_gain1.0000
1:6628498:G:GGacceptor_gain1.0000
1:6628689:CAGGT:Cdonor_loss1.0000
1:6625288:CACCG:Cdonor_gain0.9900
1:6625289:ACCG:Adonor_gain0.9900
1:6625291:CG:Cdonor_gain0.9900
1:6625292:GG:Gdonor_gain0.9900
1:6628494:CTTAG:Cacceptor_gain0.9900
1:6628495:TTAGG:Tacceptor_gain0.9900
1:6628496:TAG:Tacceptor_gain0.9900
1:6628497:AG:Aacceptor_gain0.9900
1:6628497:AGGTT:Aacceptor_gain0.9900
1:6628498:G:Tacceptor_gain0.9900
1:6628498:GG:Gacceptor_gain0.9900
1:6628498:GGT:Gacceptor_gain0.9900
1:6628498:GGTT:Gacceptor_gain0.9900
1:6628498:GGTTT:Gacceptor_gain0.9900
1:6630286:A:AGacceptor_gain0.9900
1:6630287:G:GGacceptor_gain0.9900
1:6633002:G:GTdonor_gain0.9900
1:6628490:T:TAacceptor_gain0.9800
1:6628493:CCTTA:Cacceptor_gain0.9800

AlphaMissense

1571 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
1:6625285:T:CF23L1.000
1:6625287:C:AF23L1.000
1:6625287:C:GF23L1.000
1:6628533:T:AW37R1.000
1:6628533:T:CW37R1.000
1:6628535:G:CW37C1.000
1:6628535:G:TW37C1.000
1:6628599:T:CF59L1.000
1:6628600:T:CF59S1.000
1:6628601:C:AF59L1.000
1:6628601:C:GF59L1.000
1:6625231:T:CC5R0.999
1:6625288:C:GH24D0.999
1:6628501:T:CF26S0.999
1:6628585:T:AI54N0.999
1:6628587:T:AC55S0.999
1:6628587:T:CC55R0.999
1:6628588:G:AC55Y0.999
1:6628588:G:CC55S0.999
1:6628589:C:GC55W0.999
1:6628600:T:GF59C0.999
1:6628659:C:AP79T0.999
1:6628659:C:TP79S0.999
1:6628660:C:AP79H0.999
1:6628668:T:CF82L0.999
1:6628670:C:AF82L0.999
1:6628670:C:GF82L0.999
1:6625286:T:CF23S0.998
1:6625288:C:AH24N0.998
1:6628500:T:CF26L0.998

dbSNP variants (sampled 300 via entrez): RS1000102480 (1:6628854 C>A), RS1000160269 (1:6634153 G>A,C), RS1000165263 (1:6624830 C>A), RS1000851247 (1:6633613 A>C), RS1001039185 (1:6624080 A>C,G), RS1001167064 (1:6625734 G>T), RS1001263963 (1:6634912 G>A), RS1001906718 (1:6634369 T>C), RS1002438719 (1:6630093 A>G), RS1002448009 (1:6629634 CT>C), RS1003159593 (1:6631926 T>C), RS1003440822 (1:6631092 C>A,T), RS1003507983 (1:6626993 G>A), RS1003641621 (1:6626409 C>G,T), RS1004005229 (1:6631078 C>T)

Disease associations

OMIM: gene MIM:612532 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST010988_520Adult body size3.000000e-08
GCST011769_1Schizophrenia8.000000e-09

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

23 total (human), top 23 by PubMed support.

ChemicalActions (top 5)PubMed papers
aristolochic acid Iincreases expression1
bisphenol Aaffects cotreatment, increases expression1
beta-lapachoneincreases expression1
sodium arseniteaffects splicing, increases expression1
di-n-butylphosphoric acidaffects expression1
CGP 52608increases reaction, affects binding1
abrineincreases expression1
bisphenol Saffects cotreatment, increases expression1
Arsenicaffects methylation1
Benzo(a)pyreneincreases methylation1
Cadmiumincreases abundance, increases expression1
Caffeinedecreases phosphorylation1
Dexamethasoneaffects cotreatment, increases expression1
Indomethacinaffects cotreatment, increases expression1
Smokedecreases expression1
Tobacco Smoke Pollutionincreases expression1
Urethaneincreases expression1
Valproic Aciddecreases expression1
1-Methyl-3-isobutylxanthineaffects cotreatment, increases expression1
Antirheumatic Agentsincreases expression1
Cadmium Chlorideincreases abundance, increases expression1
Copper Sulfatedecreases expression1
Acrylamideincreases expression1

Cellosaurus cell lines

3 cell lines: 3 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A7L4SEES3-1V human THAP3, clone1Embryonic stem cellMale
CVCL_A7L5SEES3-1V human THAP3, clone2Embryonic stem cellMale
CVCL_A7L6SEES3-1V human THAP3, clone3Embryonic stem cellMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.