THAP9

gene
On this page

Also known as FLJ34093

Summary

THAP9 (THAP domain containing 9, HGNC:23192) is a protein-coding gene on chromosome 4q21.22, encoding DNA transposase THAP9 (Q9H5L6). Active transposase that specifically recognizes the bipartite 5’-TXXGGGX(A/T)-3’ consensus motif and mediates transposition.

Enables sequence-specific DNA binding activity and transposase activity. Involved in DNA integration and DNA transposition. Predicted to be located in nucleoplasm.

Source: NCBI Gene 79725 — RefSeq curated summary.

At a glance

  • GWAS associations: 1
  • Clinical variants (ClinVar): 112 total
  • MANE Select transcript: NM_024672

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:23192
Approved symbolTHAP9
NameTHAP domain containing 9
Location4q21.22
Locus typegene with protein product
StatusApproved
AliasesFLJ34093
Ensembl geneENSG00000168152
Ensembl biotypeprotein_coding
OMIM612537
Entrez79725

Gene structure

Transcript identifiers

Ensembl transcripts: 6 — 5 nonsense_mediated_decay, 1 protein_coding

ENST00000302236, ENST00000505901, ENST00000506208, ENST00000509353, ENST00000514244, ENST00000514440

RefSeq mRNA: 2 — MANE Select: NM_024672 NM_001317776, NM_024672

CCDS: CCDS3598

Canonical transcript exons

ENST00000302236 — 5 exons

ExonStartEnd
ENSE000011502668291694482919969
ENSE000012465388290473682904931
ENSE000012465448290073582900882
ENSE000036149508290632482906627
ENSE000036826008290778582907935

Expression profiles

Bgee: expression breadth ubiquitous, 207 present calls, max score 90.41.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 7.6254 / max 405.9647, expressed in 1535 samples.

FANTOM5 promoters (7 alternative TSS)

Promoter IDTPM avgSamples expressed
485873.99121279
485890.9999221
2032700.9130482
485880.6876264
485900.5512138
485920.262276
485910.220268

Top tissues by expression

276 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
spermCL:000001990.41gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047387.76gold quality
male germ cellCL:000001586.94gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099185.46gold quality
ventricular zoneUBERON:000305383.50gold quality
right testisUBERON:000453483.13gold quality
left testisUBERON:000453382.32gold quality
ganglionic eminenceUBERON:000402382.12gold quality
testisUBERON:000047381.60gold quality
calcaneal tendonUBERON:000370180.81gold quality
hindlimb stylopod muscleUBERON:000425279.84gold quality
biceps brachiiUBERON:000150779.43gold quality
secondary oocyteCL:000065579.16gold quality
muscle of legUBERON:000138377.85gold quality
gastrocnemiusUBERON:000138877.84gold quality
adrenal tissueUBERON:001830377.15gold quality
muscle organUBERON:000163075.56gold quality
corpus callosumUBERON:000233675.55gold quality
skeletal muscle tissue of biceps brachiiUBERON:000450274.50gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451174.44gold quality
C1 segment of cervical spinal cordUBERON:000646974.10gold quality
cortical plateUBERON:000534373.48gold quality
islet of LangerhansUBERON:000000673.07gold quality
right lobe of thyroid glandUBERON:000111972.78gold quality
tendonUBERON:000004371.59gold quality
right lobe of liverUBERON:000111471.46gold quality
thyroid glandUBERON:000204671.33gold quality
left lobe of thyroid glandUBERON:000112071.17gold quality
left ovaryUBERON:000211971.11gold quality
spinal cordUBERON:000224071.09gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no3.64

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

3 targets.

TargetRegulation
IL21
IL9
SPI1

miRNA regulators (miRDB)

55 targeting THAP9, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3163100.0077.238605
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-60799.9773.625593
HSA-MIR-590-3P99.9674.346478
HSA-MIR-141-3P99.9472.792421
HSA-MIR-200A-3P99.9472.682420
HSA-MIR-335-3P99.9373.364958
HSA-MIR-129799.9173.413162
HSA-MIR-430799.8270.453374
HSA-MIR-4659A-3P99.8072.624248
HSA-MIR-4659B-3P99.8072.624248
HSA-MIR-63699.8069.581500
HSA-MIR-26A-5P99.7873.522303
HSA-MIR-26B-5P99.7873.512305
HSA-MIR-199A-3P99.7570.48929
HSA-MIR-199B-3P99.7570.48929
HSA-MIR-3129-5P99.7570.46914
HSA-MIR-442899.7366.411733
HSA-MIR-446599.7172.562096
HSA-MIR-4756-3P99.6266.301319
HSA-MIR-205399.5769.151635
HSA-MIR-427699.5667.662514
HSA-MIR-5007-3P99.5168.141242
HSA-MIR-409-3P99.5066.331192
HSA-MIR-57899.4668.361787
HSA-MIR-16-2-3P99.2970.601954
HSA-MIR-195-3P99.2970.611954
HSA-MIR-3160-5P99.2869.071938
HSA-MIR-519099.1567.761234

Literature-anchored findings (GeneRIF, showing 5)

  • results indicate that human THAP9 is an active DNA transposase that, although “domesticated,” still retains the catalytic activity to mobilize P transposable elements across species (PMID:23349291)
  • THAP9-AS1/miR-133b/SOX4 positive feedback loop facilitates the progression of esophageal squamous cell carcinoma. (PMID:33854048)
  • Oligomerization of THAP9 Transposase via Amino-Terminal Domains. (PMID:34033475)
  • THAP9 Transposase Cleaves DNA via Conserved Acidic Residues in an RNaseH-Like Domain. (PMID:34072453)
  • THAP9-AS1 Promotes Tumorigenesis and Reduces ROS Generation through the JAK2/STAT3 Signaling Pathway by Increasing SOCS3 Promoter Methylation in Osteosarcoma. (PMID:34691358)

Cross-species orthologs

0 orthologs

Protein

Protein identifiers

DNA transposase THAP9Q9H5L6 (reviewed: Q9H5L6)

Alternative names: THAP domain-containing protein 9

All UniProt accessions (6): D6R957, D6RCT5, D6REM3, Q9H5L6, F2Z371, H0Y9F3

UniProt curated annotations — full annotation on UniProt →

Function. Active transposase that specifically recognizes the bipartite 5’-TXXGGGX(A/T)-3’ consensus motif and mediates transposition.

Miscellaneous. Able to mediate mobilization of P-elements when transfected in Drosophila.

RefSeq proteins (2): NP_001304705, NP_078948* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR006612THAP_ZnfDomain
IPR021896THAP9-like_HTHDomain
IPR038441THAP_Znf_sfHomologous_superfamily
IPR048365TNP-like_RNaseH_NDomain
IPR048366TNP-like_GBDDomain
IPR048367TNP-like_RNaseH_CDomain
IPR055035THAP9_CDomain

Pfam: PF05485, PF12017, PF21787, PF21788, PF21789, PF22824

UniProt features (10 total): sequence variant 4, sequence conflict 3, chain 1, zinc finger region 1, short sequence motif 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9H5L6-F173.490.24

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 60 (showing top): GOBP_DNA_INTEGRATION, chr4q21, TGGAAA_NFAT_Q4_01, GOBP_DNA_METABOLIC_PROCESS, GOBP_DNA_RECOMBINATION, GOMF_SEQUENCE_SPECIFIC_DNA_BINDING, GOBP_TRANSPOSITION, GOMF_CATALYTIC_ACTIVITY_ACTING_ON_DNA, TAVAZOIE_METASTASIS, KRAS.600.LUNG.BREAST_UP.V1_DN, KRAS.LUNG_UP.V1_DN, BANP_TARGET_GENES, E2F2_TARGET_GENES, ELF2_TARGET_GENES, NFE2L1_TARGET_GENES

GO Biological Process (5): DNA recombination (GO:0006310), DNA transposition (GO:0006313), DNA integration (GO:0015074), endothelial cell proliferation (GO:0001935), regulation of DNA-templated transcription (GO:0006355)

GO Molecular Function (8): DNA binding (GO:0003677), transposase activity (GO:0004803), zinc ion binding (GO:0008270), transferase activity (GO:0016740), sequence-specific DNA binding (GO:0043565), DNA-binding transcription factor activity (GO:0003700), protein binding (GO:0005515), metal ion binding (GO:0046872)

GO Cellular Component (2): nucleus (GO:0005634), nucleoplasm (GO:0005654)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
DNA metabolic process2
DNA recombination1
transposition1
epithelial cell proliferation1
DNA-templated transcription1
regulation of gene expression1
regulation of RNA biosynthetic process1
nucleic acid binding1
catalytic activity, acting on DNA1
transition metal ion binding1
catalytic activity1
DNA binding1
transcription cis-regulatory region binding1
regulation of DNA-templated transcription1
transcription regulator activity1
binding1
cation binding1
intracellular membrane-bounded organelle1
nuclear lumen1
cellular anatomical structure1

Protein interactions and networks

STRING

450 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
THAP9PGBD5Q8N414789
THAP9THAP12O43422607
THAP9THAP10Q9P2Z0599
THAP9HARBI1Q96MB7589
THAP9THAP8Q8NA92582
THAP9A0A1B0GW23A0A1B0GW23573
THAP9PGBD2Q6P3X8555
THAP9THAP11Q96EK4546
THAP9GIN1Q9NXP7531
THAP9PGBD1Q96JS3530
THAP9THAP1Q9NVV9517
THAP9ZBED8Q8IZ13477
THAP9MPHOSPH6Q99547471
THAP9SETMARQ53H47469
THAP9DBF4BQ8NFT6465

IntAct

14 interactions, top by confidence:

ABTypeScore
repTHAP9psi-mi:“MI:0915”(physical association)0.560
YWHAZBLTP3Bpsi-mi:“MI:0914”(association)0.530
KCTD17CBX4psi-mi:“MI:0914”(association)0.530
Kctd5psi-mi:“MI:0914”(association)0.350
MRPL50MRPL43psi-mi:“MI:0914”(association)0.350
MRPL9MRPL43psi-mi:“MI:0914”(association)0.350
KCTD17CBX4psi-mi:“MI:0914”(association)0.350
KCTD17PXDNLpsi-mi:“MI:0914”(association)0.350

BioGRID (11): THAP9 (Affinity Capture-MS), THAP9 (Affinity Capture-MS), THAP9 (Affinity Capture-MS), THAP9 (Affinity Capture-MS), THAP9 (Affinity Capture-MS), THAP9 (Affinity Capture-MS), THAP9 (Affinity Capture-MS), THAP9 (Affinity Capture-MS), THAP9 (Affinity Capture-MS), THAP9 (Two-hybrid), THAP9 (Two-hybrid)

ESM2 similar proteins: A3KMX0, A4IFA3, A4IGY9, A4Z943, A4Z944, B8QB46, D2EAC2, E1C2V1, O43422, O60290, P10911, P35125, P86452, Q13075, Q2NKX8, Q3UPF5, Q49AG3, Q5FWF4, Q5SVZ6, Q5T890, Q5TKR9, Q6DJS0, Q6EKJ0, Q6R2W3, Q6YI93, Q7Z2W4, Q80WE4, Q86UP8, Q86VD1, Q8BZ21, Q8N8K9, Q8QMP8, Q8TDB6, Q8WML3, Q92794, Q96JM7, Q96JS3, Q99388, Q99NI3, Q9CUX1

Diamond homologs: B5XCB8, Q0P5B4, Q1JPT7, Q1RMM0, Q3T0G1, Q4R3Q6, Q5RCE4, Q5U208, Q5U560, Q5ZHN5, Q6DDT6, Q6DIN8, Q8BJ25, Q8CHW1, Q8WTV1, Q9D305, Q9H0W7, Q9H5L6, Q9NVV9, Q0IHI7, Q8TBB0, Q8VCZ3, Q9BT49, Q4R7M0, Q7Z6K1, Q2TBI2, Q642B6, Q6P3Z3, Q8WY91

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

112 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance106
Likely benign5
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1213 predictions. Top by Δscore:

VariantEffectΔscore
4:82900883:G:GGdonor_gain1.0000
4:82906320:ATAG:Aacceptor_loss1.0000
4:82906321:TA:Tacceptor_loss1.0000
4:82906322:A:Gacceptor_loss1.0000
4:82906623:TTCAG:Tdonor_loss1.0000
4:82906624:TCAG:Tdonor_loss1.0000
4:82906625:CAG:Cdonor_loss1.0000
4:82906626:AGGT:Adonor_loss1.0000
4:82906627:GGTA:Gdonor_loss1.0000
4:82906628:G:Adonor_loss1.0000
4:82906629:T:Gdonor_loss1.0000
4:82900859:G:GTdonor_gain0.9900
4:82900880:CCA:Cdonor_gain0.9900
4:82900881:CA:Cdonor_gain0.9900
4:82900892:G:GTdonor_gain0.9900
4:82901123:A:Tdonor_gain0.9900
4:82906318:TCATA:Tacceptor_gain0.9900
4:82906319:CATAG:Cacceptor_gain0.9900
4:82906320:ATAGA:Aacceptor_gain0.9900
4:82906321:TAG:Tacceptor_gain0.9900
4:82906322:A:AGacceptor_gain0.9900
4:82906322:AGA:Aacceptor_gain0.9900
4:82906323:G:GGacceptor_gain0.9900
4:82907932:GAAC:Gdonor_gain0.9900
4:82907936:G:GGdonor_gain0.9900
4:82900843:ACACC:Adonor_gain0.9800
4:82900878:CACCA:Cdonor_gain0.9800
4:82900879:ACCA:Adonor_gain0.9800
4:82900880:CCAG:Cdonor_loss0.9800
4:82900881:CAG:Cdonor_loss0.9800

AlphaMissense

5973 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
4:82904767:T:AW38R0.999
4:82904767:T:CW38R0.999
4:82904839:T:CC62R0.999
4:82900815:T:CC5R0.998
4:82900878:C:GH26D0.998
4:82904769:G:CW38C0.998
4:82904769:G:TW38C0.998
4:82904852:T:CF66S0.998
4:82904738:T:CF28S0.997
4:82904768:G:CW38S0.997
4:82904812:T:AW53R0.997
4:82904812:T:CW53R0.997
4:82904839:T:AC62S0.997
4:82904840:G:CC62S0.997
4:82904759:G:CR35P0.996
4:82904841:T:GC62W0.996
4:82904851:T:CF66L0.996
4:82904853:T:AF66L0.996
4:82904853:T:GF66L0.996
4:82900815:T:AC5S0.995
4:82900816:G:AC5Y0.995
4:82900816:G:CC5S0.995
4:82900817:C:GC5W0.995
4:82900830:T:CC10R0.995
4:82904837:T:CL61P0.995
4:82904912:C:AP86H0.995
4:82900875:T:CF25L0.994
4:82900877:C:AF25L0.994
4:82900877:C:GF25L0.994
4:82904837:T:AL61Q0.994

dbSNP variants (sampled 300 via entrez): RS1000019032 (4:82900290 T>C), RS1000154176 (4:82899867 A>C), RS1000224887 (4:82919785 A>G,T), RS1000554053 (4:82912486 A>G), RS1000859753 (4:82908996 T>C), RS1000920858 (4:82902174 T>C), RS1000943217 (4:82905075 TA>T,TAA), RS1001078565 (4:82908700 T>G), RS1001095538 (4:82905414 CTG>C), RS1001182844 (4:82920083 TTTG>T), RS1001193035 (4:82907475 A>G), RS1001327965 (4:82919751 G>C,T), RS1001376284 (4:82901879 A>T), RS1001849434 (4:82919856 T>G), RS1001880703 (4:82919559 A>C,T)

Disease associations

OMIM: gene MIM:612537 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 associations (top):

StudyTraitp-value
GCST012490_87Femur bone mineral density x serum urate levels interaction3.000000e-10

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004531urate measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

26 total (human), top 26 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, decreases expression, affects expression7
trichostatin Adecreases expression, affects cotreatment3
Panobinostataffects cotreatment, decreases expression2
Phenylmercuric Acetateaffects cotreatment, decreases expression2
3-((6-(2-methoxyphenyl)pyrimidin-4-yl)amino)phenyl)methane sulfonamidedecreases expression1
dicrotophosdecreases expression1
methylmercuric chloridedecreases expression1
triphenyl phosphateaffects expression1
bisphenol Aincreases expression1
cupric oxideincreases phosphorylation1
entinostatdecreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression1
clothianidinincreases expression1
abrineincreases expression1
dorsomorphinaffects cotreatment, decreases expression1
jinfukangdecreases expression1
Sunitinibdecreases expression1
Leflunomideincreases expression1
Enzyme Inhibitorsdecreases activity, increases O-linked glycosylation1
Ethyl Methanesulfonateincreases expression1
Folic Aciddecreases expression1
Methyl Methanesulfonateincreases expression1
Tretinoindecreases expression1
Cyclosporineincreases expression1
Aflatoxin B1decreases methylation1
Antirheumatic Agentsincreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.