THAP4

gene
On this page

Also known as CGI-36Nb(III)

Summary

THAP4 (THAP domain containing 4, HGNC:23187) is a protein-coding gene on chromosome 2q37.3, encoding Peroxynitrite isomerase THAP4 (Q8WY91). Heme-binding protein able to scavenge peroxynitrite and to protect free L-tyrosine against peroxynitrite-mediated nitration, by acting as a peroxynitrite isomerase that converts peroxynitrite to nitrate.

Enables several functions, including heme binding activity; nitric oxide binding activity; and peroxynitrite isomerase activity. Involved in nitrate metabolic process and tyrosine metabolic process. Located in cytosol and nucleoplasm.

Source: NCBI Gene 51078 — RefSeq curated summary.

At a glance

  • GWAS associations: 6
  • Clinical variants (ClinVar): 106 total — 1 pathogenic
  • MANE Select transcript: NM_015963

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:23187
Approved symbolTHAP4
NameTHAP domain containing 4
Location2q37.3
Locus typegene with protein product
StatusApproved
AliasesCGI-36, Nb(III)
Ensembl geneENSG00000176946
Ensembl biotypeprotein_coding
OMIM612533
Entrez51078

Gene structure

Transcript identifiers

Ensembl transcripts: 9 — 7 protein_coding, 1 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000321679, ENST00000402136, ENST00000402545, ENST00000407315, ENST00000497486, ENST00000863245, ENST00000863246, ENST00000863247, ENST00000863248

RefSeq mRNA: 2 — MANE Select: NM_015963 NM_001164356, NM_015963

CCDS: CCDS2551, CCDS54440

Canonical transcript exons

ENST00000407315 — 6 exons

ExonStartEnd
ENSE00001554858241636941241637158
ENSE00001561536241584405241584725
ENSE00001673370241602970241603079
ENSE00001723868241606314241606473
ENSE00001776546241632917241634079
ENSE00003594678241601896241601999

Expression profiles

Bgee: expression breadth ubiquitous, 287 present calls, max score 97.62.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 19.4056 / max 107.2680, expressed in 1803 samples.

FANTOM5 promoters (6 alternative TSS)

Promoter IDTPM avgSamples expressed
3493112.47291787
349323.20591529
349301.3608982
349331.0339691
349260.7857342
349270.5463281

Top tissues by expression

288 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
apex of heartUBERON:000209897.62gold quality
mucosa of transverse colonUBERON:000499196.21gold quality
right atrium auricular regionUBERON:000663196.19gold quality
hindlimb stylopod muscleUBERON:000425296.16gold quality
heart left ventricleUBERON:000208496.03gold quality
gastrocnemiusUBERON:000138896.01gold quality
cardiac ventricleUBERON:000208295.85gold quality
cardiac atriumUBERON:000208195.58gold quality
metanephros cortexUBERON:001053395.45gold quality
muscle of legUBERON:000138395.44gold quality
adult mammalian kidneyUBERON:000008295.19gold quality
lower esophagus mucosaUBERON:003583495.13gold quality
heartUBERON:000094894.86gold quality
muscle organUBERON:000163094.57gold quality
renal medullaUBERON:000036294.43gold quality
putamenUBERON:000187494.12gold quality
transverse colonUBERON:000115794.09gold quality
caudate nucleusUBERON:000187393.90gold quality
body of stomachUBERON:000116193.88gold quality
nucleus accumbensUBERON:000188293.77gold quality
olfactory segment of nasal mucosaUBERON:000538693.72gold quality
right frontal lobeUBERON:000281093.63gold quality
triceps brachiiUBERON:000150993.54gold quality
anterior cingulate cortexUBERON:000983593.51gold quality
cingulate cortexUBERON:000302793.50gold quality
prefrontal cortexUBERON:000045193.47gold quality
amygdalaUBERON:000187693.13gold quality
skin of legUBERON:000151193.07gold quality
skeletal muscle tissueUBERON:000113493.04gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451193.01gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes8.31

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

8 targeting THAP4, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3120-5P100.0065.56965
HSA-MIR-806399.9169.763146
HSA-MIR-5002-5P99.7670.841763
HSA-MIR-7159-3P99.5170.171920
HSA-MIR-6768-3P99.1467.381319
HSA-MIR-1245B-5P98.8866.55576
HSA-MIR-314298.8866.09529
HSA-MIR-4638-3P97.9065.75905

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriothap4ENSDARG00000077323
mus_musculusThap4ENSMUSG00000026279
rattus_norvegicusThap4ENSRNOG00000018351
caenorhabditis_elegansWBGENE00008993

Protein

Protein identifiers

Peroxynitrite isomerase THAP4Q8WY91 (reviewed: Q8WY91)

Alternative names: Ferric Homo sapiens nitrobindin, THAP domain-containing protein 4

All UniProt accessions (2): Q8WY91, B5MCC0

UniProt curated annotations — full annotation on UniProt →

Function. Heme-binding protein able to scavenge peroxynitrite and to protect free L-tyrosine against peroxynitrite-mediated nitration, by acting as a peroxynitrite isomerase that converts peroxynitrite to nitrate. Therefore, this protein likely plays a role in peroxynitrite sensing and in the detoxification of reactive nitrogen and oxygen species (RNS and ROS, respectively). Is able to bind nitric oxide (NO) in vitro, but may act as a sensor of peroxynitrite levels in vivo, possibly modulating the transcriptional activity residing in the N-terminal region.

Subunit / interactions. Homodimer.

Subcellular location. Cytoplasm. Nucleus.

Cofactor. Binds 1 heme b group per subunit, that coordinates a highly solvent-exposed Fe(III) atom.

Domain organisation. The C-terminal nitrobindin region coordinates a heme and bears the isomerase activity. The N-terminal zinc finger domain likely binds DNA and may be involved in transcriptional regulation.

Pathway. Nitrogen metabolism.

Similarity. In the C-terminal section; belongs to the nitrobindin family.

Isoforms (2)

UniProt IDNamesCanonical?
Q8WY91-11yes
Q8WY91-22

RefSeq proteins (2): NP_001157828, NP_057047* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR006612THAP_ZnfDomain
IPR012674CalycinHomologous_superfamily
IPR014878THAP4-like_heme-bdDomain
IPR038441THAP_Znf_sfHomologous_superfamily
IPR045165NitrobindinFamily

Pfam: PF05485, PF08768

Catalyzed reactions (Rhea), 1 shown:

  • peroxynitrite = nitrate (RHEA:63116)

UniProt features (31 total): strand 11, region of interest 3, compositionally biased region 3, binding site 2, modified residue 2, splice variant 2, turn 2, chain 1, zinc finger region 1, sequence variant 1, sequence conflict 1, helix 1, short sequence motif 1

Structure

Experimental structures (PDB)

1 structures.

PDBMethodResolution (Å)
3IA8X-RAY DIFFRACTION1.79

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8WY91-F168.260.35

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (2): 567 (axial binding residue); 444

Post-translational modifications (2): 163, 239

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 102 (showing top): GOBP_ALPHA_AMINO_ACID_METABOLIC_PROCESS, LINDGREN_BLADDER_CANCER_CLUSTER_3_DN, GOBP_REACTIVE_NITROGEN_SPECIES_METABOLIC_PROCESS, GOBP_AROMATIC_AMINO_ACID_METABOLIC_PROCESS, GOBP_ORGANIC_ACID_METABOLIC_PROCESS, GARY_CD5_TARGETS_UP, STEIN_ESRRA_TARGETS_UP, GOMF_TETRAPYRROLE_BINDING, GAZDA_DIAMOND_BLACKFAN_ANEMIA_ERYTHROID_DN, GOMF_ISOMERASE_ACTIVITY, KRIGE_RESPONSE_TO_TOSEDOSTAT_6HR_DN, CEBPZ_TARGET_GENES, CIITA_TARGET_GENES, DROSHA_TARGET_GENES, E2F5_TARGET_GENES

GO Biological Process (2): obsolete tyrosine metabolic process (GO:0006570), nitrate metabolic process (GO:0042126)

GO Molecular Function (9): DNA binding (GO:0003677), zinc ion binding (GO:0008270), heme binding (GO:0020037), identical protein binding (GO:0042802), peroxynitrite isomerase activity (GO:0062213), nitric oxide binding (GO:0070026), protein binding (GO:0005515), isomerase activity (GO:0016853), metal ion binding (GO:0046872)

GO Cellular Component (4): nucleoplasm (GO:0005654), cytosol (GO:0005829), nucleus (GO:0005634), cytoplasm (GO:0005737)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure3
oxoacid metabolic process1
reactive nitrogen species metabolic process1
nucleic acid binding1
transition metal ion binding1
tetrapyrrole binding1
protein binding1
isomerase activity1
small molecule binding1
binding1
catalytic activity1
cation binding1
nuclear lumen1
cytoplasm1
intracellular membrane-bounded organelle1
intracellular anatomical structure1

Protein interactions and networks

STRING

632 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
THAP4THAP11Q96EK4905
THAP4THAP12O43422904
THAP4THAP7Q9BT49584
THAP4THAP3Q8WTV1574
THAP4THAP10Q9P2Z0571
THAP4ANKMY1Q9P2S6562
THAP4MYO1FO00160526
THAP4STK25O00506509
THAP4PAWRQ96IZ0491
THAP4FARP2O94887487
THAP4HCFC1P51610480
THAP4SPINT3P49223479
THAP4DTYMKP23919462
THAP4RRM1P23921440
THAP4GLYATL3Q5SZD4394

IntAct

140 interactions, top by confidence:

ABTypeScore
TRADDTNFpsi-mi:“MI:0914”(association)0.790
THAP4CPNE2psi-mi:“MI:0915”(physical association)0.670
CMTM7THAP4psi-mi:“MI:0915”(physical association)0.560
THAP4MAL2psi-mi:“MI:0915”(physical association)0.560
AGTRAPTHAP4psi-mi:“MI:0915”(physical association)0.560
TPD52THAP4psi-mi:“MI:0915”(physical association)0.560
THAP4CMTM5psi-mi:“MI:0915”(physical association)0.560
SCAMP1THAP4psi-mi:“MI:0915”(physical association)0.560
AMD1THAP4psi-mi:“MI:0915”(physical association)0.560
DGAT2L6THAP4psi-mi:“MI:0915”(physical association)0.560
TMEM86BTHAP4psi-mi:“MI:0915”(physical association)0.560
LATTHAP4psi-mi:“MI:0915”(physical association)0.560
MS4A1THAP4psi-mi:“MI:0915”(physical association)0.560
CACNG1THAP4psi-mi:“MI:0915”(physical association)0.560
SLC30A8THAP4psi-mi:“MI:0915”(physical association)0.560
TMEM86ATHAP4psi-mi:“MI:0915”(physical association)0.560
THAP4NDRG4psi-mi:“MI:0915”(physical association)0.560
THAP4RMDN2psi-mi:“MI:0915”(physical association)0.560
ANKRD46THAP4psi-mi:“MI:0915”(physical association)0.560
IGFBP5THAP4psi-mi:“MI:0915”(physical association)0.560
GIMAP1THAP4psi-mi:“MI:0915”(physical association)0.560
PPGBTHAP4psi-mi:“MI:0915”(physical association)0.560
RTP2THAP4psi-mi:“MI:0915”(physical association)0.560
GPR37L1THAP4psi-mi:“MI:0915”(physical association)0.560
THAP4CMTM7psi-mi:“MI:0915”(physical association)0.560
THAP4THAP4psi-mi:“MI:0915”(physical association)0.560
THAP4FKBP7psi-mi:“MI:0915”(physical association)0.560
PLP2THAP4psi-mi:“MI:0915”(physical association)0.560
THAP4TNFRSF10Dpsi-mi:“MI:0915”(physical association)0.560

BioGRID (83): CORO2A (Affinity Capture-MS), THAP4 (Reconstituted Complex), AIFM1 (Affinity Capture-MS), ACTBL2 (Affinity Capture-MS), HSPA2 (Affinity Capture-MS), GFER (Affinity Capture-MS), ACTB (Affinity Capture-MS), HSPA8 (Affinity Capture-MS), THAP4 (Affinity Capture-RNA), THAP4 (Affinity Capture-RNA), THAP4 (Affinity Capture-MS), THAP4 (Affinity Capture-MS), THAP4 (Affinity Capture-MS), THAP4 (Two-hybrid), THAP4 (Two-hybrid)

ESM2 similar proteins: A0A1L8HTT5, A0JPQ7, A4FV61, A4IFK0, O75151, O88974, P35689, P49140, P55265, P55266, P97432, Q02040, Q0VEE6, Q14202, Q14596, Q15047, Q15554, Q501R9, Q53GL0, Q5EAN7, Q5F3F2, Q5HYC2, Q5R6F3, Q5RC94, Q5RF77, Q5SYB0, Q642B6, Q69Z66, Q69Z99, Q6P3Z3, Q76CY8, Q810L3, Q86XL3, Q8C4S8, Q8ND82, Q8NE31, Q8WY91, Q91VL8, Q95JV5, Q96EP1

Diamond homologs: A0PT73, A0QIQ0, A0R6J8, A1KM55, A1R8X8, A1SPL4, A1T297, A3PTJ4, A4FQE4, A4T0W5, A4X1P5, A5CTZ4, A5U669, A6W6D1, A8LDK0, A8LZD6, B0RAV9, B1VQH0, B1VS45, B2GIH3, B2HLY1, B8HD40, C5CC85, O64527, P9WFG6, P9WFG7, Q0RS51, Q0S4V4, Q0S749, Q1BEZ2, Q2JFV4, Q2TBI2, Q47KW7, Q4JXD1, Q5YS78, Q5Z2A5, Q642B6, Q6AAW4, Q6NFH6, Q6P3Z3

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 55 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

GO biological processes:

GO termPartnersFoldFDR
positive regulation of MAPK cascade610.1×4e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

106 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance79
Likely benign7
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
1340707GRCh37/hg19 2q37.3(chr2:239436367-242783384)x1Pathogenic

SpliceAI

1586 predictions. Top by Δscore:

VariantEffectΔscore
2:241601890:GCTCA:Gdonor_loss1.0000
2:241601891:CTCA:Cdonor_loss1.0000
2:241601892:TCACC:Tdonor_loss1.0000
2:241601893:CA:Cdonor_loss1.0000
2:241601894:A:Cdonor_loss1.0000
2:241601895:C:CGdonor_loss1.0000
2:241601895:CCTG:Cdonor_gain1.0000
2:241601995:CACGC:Cacceptor_gain1.0000
2:241601996:ACGCC:Aacceptor_loss1.0000
2:241601997:CGC:Cacceptor_gain1.0000
2:241601997:CGCCT:Cacceptor_loss1.0000
2:241601998:GCCTG:Gacceptor_loss1.0000
2:241601999:CCTGC:Cacceptor_loss1.0000
2:241602000:C:CAacceptor_loss1.0000
2:241602000:C:CCacceptor_gain1.0000
2:241602001:T:Aacceptor_loss1.0000
2:241602965:CTCAC:Cdonor_loss1.0000
2:241602966:TCA:Tdonor_loss1.0000
2:241603077:AACC:Aacceptor_loss1.0000
2:241603081:T:Gacceptor_loss1.0000
2:241606307:AACTT:Adonor_loss1.0000
2:241606308:ACTTA:Adonor_loss1.0000
2:241606309:CTT:Cdonor_loss1.0000
2:241606310:TTACG:Tdonor_loss1.0000
2:241606311:TACGA:Tdonor_loss1.0000
2:241606312:A:ACdonor_gain1.0000
2:241606313:C:CAdonor_gain1.0000
2:241606313:CG:Cdonor_gain1.0000
2:241606313:CGA:Cdonor_gain1.0000
2:241606313:CGAG:Cdonor_gain1.0000

AlphaMissense

3768 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:241633978:A:GF60S1.000
2:241634045:A:GW38R1.000
2:241634045:A:TW38R1.000
2:241584712:A:GF543S0.999
2:241603026:A:GF485S0.999
2:241606420:A:GW432R0.999
2:241606420:A:TW432R0.999
2:241633977:G:CF60L0.999
2:241633977:G:TF60L0.999
2:241633979:A:GF60L0.999
2:241633989:A:CC56W0.999
2:241633991:A:GC56R0.999
2:241633995:A:CF54L0.999
2:241633995:A:TF54L0.999
2:241633997:A:GF54L0.999
2:241634043:C:AW38C0.999
2:241634043:C:GW38C0.999
2:241634076:G:CF27L0.999
2:241634076:G:TF27L0.999
2:241634077:A:GF27S0.999
2:241634078:A:GF27L0.999
2:241636946:G:CF24L0.999
2:241636946:G:TF24L0.999
2:241636948:A:GF24L0.999
2:241637003:A:CC5W0.999
2:241637005:A:GC5R0.999
2:241584718:C:GR541P0.998
2:241603025:G:CF485L0.998
2:241603025:G:TF485L0.998
2:241603027:A:GF485L0.998

dbSNP variants (sampled 300 via entrez): RS1000003762 (2:241638313 A>G), RS1000016200 (2:241595918 G>T), RS1000061220 (2:241595663 A>G), RS1000071605 (2:241584367 C>T), RS1000113830 (2:241638565 C>A), RS1000171341 (2:241628301 C>A,G), RS1000195789 (2:241617614 A>G), RS1000237695 (2:241616322 A>G), RS1000241416 (2:241623349 C>G), RS1000276343 (2:241638430 C>G,T), RS1000327678 (2:241628859 G>A), RS1000406964 (2:241601679 CA>C,CAA), RS1000511012 (2:241592109 A>G), RS1000578258 (2:241634866 A>T), RS1000620179 (2:241597409 G>C)

Disease associations

OMIM: gene MIM:612533 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

6 associations (top):

StudyTraitp-value
GCST000597_2Brain structure6.000000e-07
GCST001942_6Prostate cancer5.000000e-09
GCST010653_8Thyroid stimulating hormone levels1.000000e-11
GCST012490_359Femur bone mineral density x serum urate levels interaction2.000000e-12
GCST90002402_296Platelet count2.000000e-12
GCST90020028_766Hip circumference adjusted for BMI1.000000e-08

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0004531urate measurement
EFO:0004309platelet count
EFO:0008039BMI-adjusted hip circumference

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

19 total (human), top 19 by PubMed support.

ChemicalActions (top 5)PubMed papers
Aflatoxin B1increases methylation2
GSK-J4increases expression1
TAK-243increases sumoylation1
triphenyl phosphateaffects expression1
sodium arsenitedecreases expression1
coumarindecreases phosphorylation1
ICG 001increases expression1
Cisplatindecreases expression1
Doxorubicindecreases expression1
Ethyl Methanesulfonatedecreases expression1
Formaldehydedecreases expression1
Methyl Methanesulfonatedecreases expression1
Thiramdecreases expression1
Tretinoindecreases expression1
Valproic Acidincreases methylation1
Vitamin Eincreases expression1
Sodium Seleniteincreases expression1
Copper Sulfatedecreases expression1
Vitamin K 3affects expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.