GTF2H2

gene
On this page

Also known as BTF2TFIIHBTF2P44T-BTF2P44p44

Summary

GTF2H2 (general transcription factor IIH subunit 2, HGNC:4656) is a protein-coding gene on chromosome 5q13.2, encoding General transcription factor IIH subunit 2 (Q13888). Component of the general transcription and DNA repair factor IIH (TFIIH) core complex, which is involved in general and transcription-coupled nucleotide excision repair (NER) of damaged DNA and, when complexed to CAK, in RNA transcription by RNA polymerase II.

This gene is part of a 500 kb inverted duplication on chromosome 5q13. This duplicated region contains at least four genes and repetitive elements which make it prone to rearrangements and deletions. The repetitiveness and complexity of the sequence have also caused difficulty in determining the organization of this genomic region. This gene is within the telomeric copy of the duplication. Deletion of this gene sometimes accompanies deletion of the neighboring SMN1 gene in spinal muscular atrophy (SMA) patients but it is unclear if deletion of this gene contributes to the SMA phenotype. This gene encodes the 44 kDa subunit of RNA polymerase II transcription initiation factor IIH which is involved in basal transcription and nucleotide excision repair. Transcript variants for this gene have been described, but their full length nature has not been determined. A second copy of this gene within the centromeric copy of the duplication has been described in the literature. It is reported to be different by either two or four base pairs; however, no sequence data is currently available for the centromeric copy of the gene.

Source: NCBI Gene 2966 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 64 total
  • MANE Select transcript: NM_001515

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:4656
Approved symbolGTF2H2
Namegeneral transcription factor IIH subunit 2
Location5q13.2
Locus typegene with protein product
StatusApproved
AliasesBTF2, TFIIH, BTF2P44, T-BTF2P44, p44
Ensembl geneENSG00000145736
Ensembl biotypeprotein_coding
OMIM601748
Entrez2966

Gene structure

Transcript identifiers

Ensembl transcripts: 23 — 14 protein_coding, 6 retained_intron, 2 nonsense_mediated_decay, 1 protein_coding_CDS_not_defined

ENST00000274400, ENST00000330280, ENST00000508592, ENST00000517900, ENST00000518275, ENST00000518898, ENST00000519783, ENST00000521602, ENST00000521858, ENST00000521942, ENST00000522654, ENST00000523003, ENST00000523693, ENST00000873591, ENST00000873592, ENST00000873593, ENST00000873594, ENST00000916278, ENST00000916279, ENST00000916280, ENST00000916281, ENST00000916282, ENST00000953259

RefSeq mRNA: 21 — MANE Select: NM_001515 NM_001364567, NM_001364568, NM_001364569, NM_001364570, NM_001364571, NM_001364572, NM_001364573, NM_001395387, NM_001395388, NM_001395389, NM_001395390, NM_001395391, NM_001395392, NM_001395393, NM_001395394, NM_001395395, NM_001395396, NM_001395397, NM_001395398, NM_001395399, NM_001515

CCDS: CCDS34183

Canonical transcript exons

ENST00000274400 — 16 exons

ExonStartEnd
ENSE000034600727104544471045507
ENSE000034992727104906871049158
ENSE000035008027106168871061765
ENSE000035237757105969071059744
ENSE000035542437105535271055457
ENSE000035596127104876271048849
ENSE000035793127103750771037546
ENSE000036259977106012071060170
ENSE000036360727106127271061308
ENSE000036592107104152271041624
ENSE000036593147104800671048113
ENSE000036597267106084471060930
ENSE000036824257106270871062796
ENSE000036891737104218171042284
ENSE000039783507106749471067676
ENSE000039783517103534771035796

Expression profiles

Bgee: expression breadth ubiquitous, 140 present calls, max score 91.65.

Top tissues by expression

140 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
endometriumUBERON:000129591.65gold quality
sural nerveUBERON:001548890.65gold quality
adrenal tissueUBERON:001830390.04gold quality
corpus callosumUBERON:000233689.74gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099189.15gold quality
calcaneal tendonUBERON:000370188.94gold quality
islet of LangerhansUBERON:000000686.76gold quality
monocyteCL:000057685.91gold quality
metanephros cortexUBERON:001053385.57gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047385.50gold quality
mucosa of transverse colonUBERON:000499185.25gold quality
smooth muscle tissueUBERON:000113585.19gold quality
vermiform appendixUBERON:000115485.17gold quality
leukocyteCL:000073885.15gold quality
pancreasUBERON:000126485.05gold quality
uterusUBERON:000099584.40gold quality
body of pancreasUBERON:000115084.28gold quality
lymph nodeUBERON:000002984.15gold quality
bone marrow cellCL:000209283.97gold quality
kidneyUBERON:000211383.97gold quality
tonsilUBERON:000237283.36gold quality
adult mammalian kidneyUBERON:000008283.20gold quality
cortex of kidneyUBERON:000122582.70gold quality
right lobe of thyroid glandUBERON:000111982.39gold quality
thyroid glandUBERON:000204682.16gold quality
left lobe of thyroid glandUBERON:000112082.12gold quality
mucosa of stomachUBERON:000119981.74gold quality
adrenal glandUBERON:000236981.67gold quality
right uterine tubeUBERON:000130281.65gold quality
bloodUBERON:000017881.48gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.10

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): MYC

miRNA regulators (miRDB)

28 targeting GTF2H2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-5692A100.0074.406850
HSA-MIR-3163100.0077.238605
HSA-MIR-4425100.0067.591049
HSA-MIR-428299.9975.366408
HSA-MIR-371B-5P99.9975.344759
HSA-MIR-480399.9871.993117
HSA-MIR-548P99.9872.253784
HSA-MIR-373-5P99.9875.364753
HSA-MIR-616-5P99.9875.584775
HSA-MIR-302C-5P99.9772.563642
HSA-MIR-570-3P99.9672.414910
HSA-MIR-6778-3P99.9667.292693
HSA-MIR-4778-3P99.9370.401818
HSA-MIR-430799.8270.453374
HSA-MIR-4659A-3P99.8072.624248
HSA-MIR-4659B-3P99.8072.624248
HSA-MIR-471999.7372.103329
HSA-MIR-217-5P99.4969.931419
HSA-MIR-4735-5P99.4368.491780
HSA-MIR-183-3P99.4169.411598
HSA-MIR-888-5P99.3070.151855
HSA-MIR-6807-3P99.1569.231275
HSA-MIR-450499.1069.141328
HSA-MIR-501-5P98.7768.881328
HSA-MIR-4680-3P98.6468.602093
HSA-MIR-5089-5P98.4566.061388
HSA-MIR-499B-5P98.3568.39988
HSA-MIR-4684-3P98.2469.911075

Literature-anchored findings (GeneRIF, showing 11)

  • p52 Mediates XPB function within the transcription/repair factor (PMID:12080057)
  • p44 of TFIIH has a role in binding to the nonstructural protein of Rift Valley fever virus to form nuclear filamentous structures (PMID:14980221)
  • solution structure of p44-(321-395) shows topology differs from other reported RING domains by a circular permutation of the extended secondary structure elements; mutagenesis suggests tight binding to p34 is mediated by hydrophobic interactions (PMID:15790571)
  • Impaired transcription in IPF is associated with decreased concentrations of transcription factor II-H in alveolar macrophages and may alter the intraalveolar milieu in IPF. (PMID:17332483)
  • miR-27a was identified as a key regulator of p44 mRNA. Moreover, miR-27a was shown to destabilize the p44 subunit of the TFIIH complex during the G2-M phase, thereby modulating the transcriptional shutdown observed during this transition. (PMID:21558443)
  • No correlation was found for p44 and occludin gene copy number and spinal muscular atrophy (PMID:21821450)
  • Together, these results suggest that p44/WDR77 expression causes the non-sensitivity of proliferating cells to TGFbeta signaling, thereby contributing to cellular proliferation during lung tumorigenesis. (PMID:24944016)
  • Advanced oxidation protein products down-regulate the expression of calcium transport channels through p44/42 MAPK signaling mechanisms in the small intestinal epithelium. (PMID:25801217)
  • interface variants between the p34 and p44 subunits only mildly affected the association between the full length proteins and did not impinge on TFIIH activities due to the presence of an additional interface involving the C4 domain of p34. (PMID:28977422)
  • The TFIIH subunits p44/p62 act as a damage sensor during nucleotide excision repair. (PMID:33166411)
  • Exosome-derived GTF2H2 from Huh7 cells can inhibit endothelial cell viability, migration, tube formation, and permeability. (PMID:36116407)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriogtf2h2ENSDARG00000016514
mus_musculusGtf2h2ENSMUSG00000021639
rattus_norvegicusGtf2h2ENSRNOG00000018230
drosophila_melanogasterSsl1FBGN0037202
caenorhabditis_elegansWBGENE00011814

Paralogs (1): GTF2H2C (ENSG00000183474)

Protein

Protein identifiers

General transcription factor IIH subunit 2Q13888 (reviewed: Q13888)

Alternative names: Basic transcription factor 2 44 kDa subunit, General transcription factor IIH polypeptide 2, TFIIH basal transcription factor complex p44 subunit

All UniProt accessions (5): D6RAW1, D6RGC9, Q13888, R4GMS9, R4GMV2

UniProt curated annotations — full annotation on UniProt →

Function. Component of the general transcription and DNA repair factor IIH (TFIIH) core complex, which is involved in general and transcription-coupled nucleotide excision repair (NER) of damaged DNA and, when complexed to CAK, in RNA transcription by RNA polymerase II. In NER, TFIIH acts by opening DNA around the lesion to allow the excision of the damaged oligonucleotide and its replacement by a new DNA fragment. In transcription, TFIIH has an essential role in transcription initiation. When the pre-initiation complex (PIC) has been established, TFIIH is required for promoter opening and promoter escape. Phosphorylation of the C-terminal tail (CTD) of the largest subunit of RNA polymerase II by the kinase module CAK controls the initiation of transcription. The N-terminus of GTF2H2 interacts with and regulates XPD whereas an intact C-terminus is required for a successful escape of RNAP II form the promoter.

Subunit / interactions. Component of the TFIID-containing RNA polymerase II pre-initiation complex that is composed of TBP and at least GTF2A1, GTF2A2, GTF2E1, GTF2E2, GTF2F1, GTF2H2, GTF2H3, GTF2H4, GTF2H5, GTF2B, TCEA1, ERCC2 and ERCC3. Component of the 7-subunit TFIIH core complex composed of XPB/ERCC3, XPD/ERCC2, GTF2H1, GTF2H2, GTF2H3, GTF2H4 and GTF2H5, which is active in NER. The core complex associates with the 3-subunit CDK-activating kinase (CAK) module composed of CCNH/cyclin H, CDK7 and MNAT1 to form the 10-subunit holoenzyme (holo-TFIIH) active in transcription. Interacts with XPB, XPD, GTF2H1 and GTF2H3. (Microbial infection) Interacts with varicella-zoster virus IE63 protein.

Subcellular location. Nucleus.

Tissue specificity. Widely expressed, with higher expression in skeletal muscle.

Similarity. Belongs to the GTF2H2 family.

Isoforms (1)

UniProt IDNamesCanonical?
Q13888-11yes

RefSeq proteins (21): NP_001351496, NP_001351497, NP_001351498, NP_001351499, NP_001351500, NP_001351501, NP_001351502, NP_001382316, NP_001382317, NP_001382318, NP_001382319, NP_001382320, NP_001382321, NP_001382322, NP_001382323, NP_001382324, NP_001382325, NP_001382326, NP_001382327, NP_001382328, NP_001506* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR002035VWF_ADomain
IPR004595TFIIH_C1-like_domDomain
IPR007198Ssl1-likeDomain
IPR012170TFIIH_SSL1/p44Family
IPR013083Znf_RING/FYVE/PHDHomologous_superfamily
IPR013087Znf_C2H2_typeDomain
IPR036465vWFA_dom_sfHomologous_superfamily
IPR046349C1-like_sfHomologous_superfamily

Pfam: PF04056, PF07975

UniProt features (48 total): helix 14, strand 14, mutagenesis site 8, turn 5, sequence variant 2, chain 1, domain 1, sequence conflict 1, zinc finger region 1, modified residue 1

Structure

Experimental structures (PDB)

52 structures, top 30 by resolution.

PDBMethodResolution (Å)
28JMELECTRON MICROSCOPY3.29
7EGBELECTRON MICROSCOPY3.3
8EBUELECTRON MICROSCOPY3.3
9PD3ELECTRON MICROSCOPY3.3
28JSELECTRON MICROSCOPY3.32
5O85X-RAY DIFFRACTION3.4
9PD4ELECTRON MICROSCOPY3.4
6RO4ELECTRON MICROSCOPY3.5
7AD8ELECTRON MICROSCOPY3.5
9XYUELECTRON MICROSCOPY3.5
28KEELECTRON MICROSCOPY3.6
8EBXELECTRON MICROSCOPY3.6
8EBYELECTRON MICROSCOPY3.6
6NMIELECTRON MICROSCOPY3.7
7EGCELECTRON MICROSCOPY3.9
7NVXELECTRON MICROSCOPY3.9
8EBTELECTRON MICROSCOPY3.9
28JVELECTRON MICROSCOPY3.91
8BVWELECTRON MICROSCOPY4
8EBSELECTRON MICROSCOPY4
7ENAELECTRON MICROSCOPY4.07
8BYQELECTRON MICROSCOPY4.1
7ENCELECTRON MICROSCOPY4.13
8GXSELECTRON MICROSCOPY4.16
7NVWELECTRON MICROSCOPY4.3
9PCPELECTRON MICROSCOPY4.3
5OF4ELECTRON MICROSCOPY4.4
6O9MELECTRON MICROSCOPY4.4
7NVRELECTRON MICROSCOPY4.5
7LBMELECTRON MICROSCOPY4.8

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q13888-F184.290.53

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (1): 95

Mutagenesis-validated functional residues (8):

PositionPhenotype
363no effect on the transcriptional activity of the reconstituted tfiih complex.
376no effect on the transcriptional activity of the reconstituted tfiih complex.
380no effect on the transcriptional activity of the reconstituted tfiih complex.
382no effect on the transcriptional activity of the reconstituted tfiih complex.
291reconstituted tfiih complex lacks p62 and has no transcriptional activity.
308reconstituted tfiih complex lacks p62 and has no transcriptional activity.
345no effect on the transcriptional activity of the reconstituted tfiih complex.
360no effect on the transcriptional activity of the reconstituted tfiih complex.

Function

Pathways and Gene Ontology

Reactome pathways

48 pathways

IDPathway
R-HSA-112382Formation of RNA Pol II elongation complex
R-HSA-113418Formation of the Early Elongation Complex
R-HSA-167152Formation of HIV elongation complex in the absence of HIV Tat
R-HSA-167158Formation of the HIV-1 Early Elongation Complex
R-HSA-167160RNA Pol II CTD phosphorylation and interaction with CE during HIV infection
R-HSA-167161HIV Transcription Initiation
R-HSA-167162RNA Polymerase II HIV Promoter Escape
R-HSA-167172Transcription of the HIV genome
R-HSA-167200Formation of HIV-1 elongation complex containing HIV-1 Tat
R-HSA-167246Tat-mediated elongation of the HIV-1 transcript
R-HSA-427413NoRC negatively regulates rRNA expression
R-HSA-5696395Formation of Incision Complex in GG-NER
R-HSA-5696400Dual Incision in GG-NER
R-HSA-674695RNA Polymerase II Pre-transcription Events
R-HSA-6781823Formation of TC-NER Pre-Incision Complex
R-HSA-6781827Transcription-Coupled Nucleotide Excision Repair (TC-NER)
R-HSA-6782135Dual incision in TC-NER
R-HSA-6782210Gap-filling DNA repair synthesis and ligation in TC-NER
R-HSA-6796648TP53 Regulates Transcription of DNA Repair Genes
R-HSA-72086mRNA Capping
R-HSA-73762RNA Polymerase I Transcription Initiation
R-HSA-73772RNA Polymerase I Promoter Escape
R-HSA-73776RNA Polymerase II Promoter Escape
R-HSA-73779RNA Polymerase II Transcription Pre-Initiation And Promoter Opening
R-HSA-73863RNA Polymerase I Transcription Termination
R-HSA-75953RNA Polymerase II Transcription Initiation
R-HSA-75955RNA Polymerase II Transcription Elongation
R-HSA-76042RNA Polymerase II Transcription Initiation And Promoter Clearance
R-HSA-77075RNA Pol II CTD phosphorylation and interaction with CE
R-HSA-162587HIV Life Cycle

MSigDB gene sets: 185 (showing top): REACTOME_FORMATION_OF_INCISION_COMPLEX_IN_GG_NER, GOBP_G_PROTEIN_COUPLED_RECEPTOR_INTERNALIZATION, RODRIGUES_THYROID_CARCINOMA_ANAPLASTIC_UP, BROWNE_HCMV_INFECTION_6HR_DN, RODRIGUES_THYROID_CARCINOMA_POORLY_DIFFERENTIATED_UP, REACTOME_RNA_POLYMERASE_I_TRANSCRIPTION_INITIATION, KANNAN_TP53_TARGETS_DN, GOBP_VESICLE_MEDIATED_TRANSPORT, KAUFFMANN_DNA_REPAIR_GENES, GOBP_NUCLEOTIDE_EXCISION_REPAIR, MUELLER_PLURINET, REACTOME_HIV_INFECTION, GOBP_RESPONSE_TO_UV, GOBP_RECEPTOR_INTERNALIZATION, GOBP_DNA_DAMAGE_RESPONSE

GO Biological Process (9): G protein-coupled receptor internalization (GO:0002031), nucleotide-excision repair (GO:0006289), regulation of transcription by RNA polymerase II (GO:0006357), transcription by RNA polymerase II (GO:0006366), transcription initiation at RNA polymerase II promoter (GO:0006367), response to UV (GO:0009411), DNA repair (GO:0006281), DNA-templated transcription (GO:0006351), DNA damage response (GO:0006974)

GO Molecular Function (4): zinc ion binding (GO:0008270), RNA polymerase II general transcription initiation factor activity (GO:0016251), protein binding (GO:0005515), metal ion binding (GO:0046872)

GO Cellular Component (7): core TFIIH complex portion of holo TFIIH complex (GO:0000438), transcription factor TFIIH core complex (GO:0000439), nucleus (GO:0005634), nucleoplasm (GO:0005654), transcription factor TFIID complex (GO:0005669), transcription factor TFIIH holo complex (GO:0005675), nuclear speck (GO:0016607)

Reactome top-level categories

Rollup of top-12 pathways:

CategoryPathways
Transcription of the HIV genome4
Transcription-Coupled Nucleotide Excision Repair (TC-NER)3
RNA Polymerase II Transcription Elongation2
HIV Transcription Elongation2
Global Genome Nucleotide Excision Repair (GG-NER)2
Late Phase of HIV Life Cycle1
Tat-mediated elongation of the HIV-1 transcript1
Negative epigenetic regulation of rRNA expression1
RNA Polymerase II Transcription1
Nucleotide Excision Repair1
Transcriptional Regulation by TP531
Metabolism of RNA1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
transcription by RNA polymerase II3
RNA polymerase II transcription regulator complex3
transcription factor TFIIH core complex2
RNA polymerase II, holoenzyme2
desensitization of G protein-coupled receptor signaling pathway1
receptor internalization1
DNA repair1
regulation of DNA-templated transcription1
DNA-templated transcription1
DNA-templated transcription initiation1
response to light stimulus1
DNA metabolic process1
DNA damage response1
gene expression1
RNA biosynthetic process1
cellular response to stress1
transition metal ion binding1
general transcription initiation factor activity1
binding1
cation binding1
transcription factor TFIIH holo complex1
intracellular membrane-bounded organelle1
nuclear lumen1
cellular anatomical structure1
nuclear cyclin-dependent protein kinase holoenzyme complex1
carboxy-terminal domain protein kinase complex1
nuclear ribonucleoprotein granule1

Protein interactions and networks

STRING

1193 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GTF2H2GTF2H4Q92759999
GTF2H2GTF2H1P32780999
GTF2H2ERCC3P19447998
GTF2H2ERCC2P18074997
GTF2H2GTF2H3Q13889997
GTF2H2GTF2H5Q6ZYL4945
GTF2H2CCNHP51946906
GTF2H2CDK7P50613878
GTF2H2ERCC8Q13216864
GTF2H2MNAT1P51948808
GTF2H2XPAP23025723
GTF2H2ERCC1P07992722
GTF2H2NAIPQ13075709
GTF2H2GTF2BQ00403681
GTF2H2SMN1Q16637623

IntAct

61 interactions, top by confidence:

ABTypeScore
GTF2H5GTF2H2psi-mi:“MI:0915”(physical association)0.800
GTF2H5GTF2H2psi-mi:“MI:0914”(association)0.800
CCNHERCC2psi-mi:“MI:0915”(physical association)0.750
GTF2H3GTF2H1psi-mi:“MI:0914”(association)0.740
GTF2H5GTF2H1psi-mi:“MI:0914”(association)0.730
ERCC2ERCC3psi-mi:“MI:0915”(physical association)0.670
ERCC3ERCC2psi-mi:“MI:0914”(association)0.670
GTF2H4GTF2H1psi-mi:“MI:0914”(association)0.670
GTF2H1MNAT1psi-mi:“MI:0914”(association)0.670
GTF2H2GTF2H1psi-mi:“MI:0915”(physical association)0.660
GTF2H5ERCC2psi-mi:“MI:0914”(association)0.650
GTF2H3ERCC3psi-mi:“MI:0914”(association)0.640
MNAT1ERCC2psi-mi:“MI:0914”(association)0.640
GTF2H3GTF2H2psi-mi:“MI:0914”(association)0.640
GTF2H2ERCC2psi-mi:“MI:0914”(association)0.620
GTF2H2ERCC2psi-mi:“MI:0915”(physical association)0.620
GTF2H4ERCC2psi-mi:“MI:0914”(association)0.530
ERCC3BCRpsi-mi:“MI:0914”(association)0.530
RNF135TXLNApsi-mi:“MI:0914”(association)0.530
RNF135XRCC4psi-mi:“MI:0914”(association)0.530
HSPB8VWA8psi-mi:“MI:0914”(association)0.530
GTF2H2ERCC8psi-mi:“MI:0407”(direct interaction)0.440
incESNX2psi-mi:“MI:0914”(association)0.430
KLF9GTF2H2psi-mi:“MI:0915”(physical association)0.400
GTF2H2H1-2psi-mi:“MI:0915”(physical association)0.400

BioGRID (202): GTF2H2 (Affinity Capture-MS), GTF2H2C (Affinity Capture-MS), GTF2H2 (Affinity Capture-MS), GTF2H2 (Affinity Capture-MS), GTF2H2C (Co-fractionation), GTF2H2 (Co-fractionation), GTF2H2C (Co-fractionation), GTF2H2 (Co-fractionation), GTF2H2C (Co-fractionation), GTF2H2 (Co-fractionation), GTF2H4 (Co-fractionation), GTF2H4 (Co-fractionation), GTF2H2C (Affinity Capture-MS), GTF2H2 (Affinity Capture-MS), GTF2H2 (Reconstituted Complex)

ESM2 similar proteins: A0A1L8F5J9, A0JN27, F1LTR1, F1NBL0, O15294, P35438, P35439, P56558, P61201, P61202, P61203, P61599, P61600, P63138, P79101, P81436, Q03555, Q05586, Q13888, Q15303, Q27HV0, Q2PFM2, Q2TBV5, Q4L208, Q58ED9, Q5R1P0, Q5SP67, Q5ZJ75, Q61527, Q62956, Q6IQT4, Q6IR75, Q6P1K8, Q6P632, Q7ZXR3, Q8BUV3, Q8C6G8, Q8CGY8, Q8R4D1, Q91854

Diamond homologs: A0JN27, O74995, P34567, Q04673, Q13888, Q2TBV5, Q6P1K8, Q86KZ2, Q9JIB4, Q9ZVN9

SIGNOR signaling

2 interactions.

AEffectBMechanism
GTF2H2“up-regulates activity”ESR1phosphorylation
GTF2H2“form complex”TFIIHbinding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 53 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
RNA Pol II CTD phosphorylation and interaction with CE during HIV infection10116.5×2e-17
RNA Pol II CTD phosphorylation and interaction with CE10116.5×2e-17
mRNA Capping10108.8×2e-17
Formation of the Early Elongation Complex1096.0×4e-17
Formation of the HIV-1 Early Elongation Complex1096.0×4e-17
Global Genome Nucleotide Excision Repair (GG-NER)791.4×4e-12
RNA Polymerase I Transcription Termination983.9×5e-15
Transcription-Coupled Nucleotide Excision Repair (TC-NER)1183.5×2e-17

GO biological processes:

GO termPartnersFoldFDR
transcription initiation at RNA polymerase II promoter970.2×2e-12
nucleotide-excision repair863.8×7e-11
transcription by RNA polymerase II913.2×3e-06
DNA repair79.3×7e-04

Disease & clinical

Clinical variants and AI predictions

ClinVar

64 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance19
Likely benign2
Benign32

Top pathogenic / likely-pathogenic (0)

SpliceAI

2277 predictions. Top by Δscore:

VariantEffectΔscore
5:71037505:A:ACdonor_gain1.0000
5:71037506:C:CCdonor_gain1.0000
5:71048004:AC:Adonor_gain1.0000
5:71048005:CC:Cdonor_gain1.0000
5:71048879:T:Cacceptor_gain1.0000
5:71048879:T:TCacceptor_gain1.0000
5:71049160:T:Cacceptor_gain1.0000
5:71049160:T:TCacceptor_gain1.0000
5:71055350:A:ACdonor_gain1.0000
5:71055351:C:CCdonor_gain1.0000
5:71059746:T:Cacceptor_gain1.0000
5:71059746:T:TCacceptor_gain1.0000
5:71060168:CAA:Cacceptor_gain1.0000
5:71060171:C:CCacceptor_gain1.0000
5:71060836:A:Cdonor_gain1.0000
5:71060838:TTTTA:Tdonor_loss1.0000
5:71060839:TTTA:Tdonor_loss1.0000
5:71060840:TTA:Tdonor_loss1.0000
5:71060841:TAC:Tdonor_loss1.0000
5:71060843:C:Tdonor_loss1.0000
5:71060926:CGCAT:Cacceptor_gain1.0000
5:71060928:CAT:Cacceptor_gain1.0000
5:71060929:ATCT:Aacceptor_loss1.0000
5:71060930:TCTA:Tacceptor_loss1.0000
5:71060931:C:CAacceptor_loss1.0000
5:71060931:C:CCacceptor_gain1.0000
5:71061762:CTCC:Cacceptor_gain1.0000
5:71061764:CC:Cacceptor_gain1.0000
5:71061765:CC:Cacceptor_gain1.0000
5:71061766:C:CGacceptor_loss1.0000

AlphaMissense

2591 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
5:71041597:A:GL318S1.000
5:71041606:G:TA315D1.000
5:71041615:A:CL312W1.000
5:71041621:A:GL310P1.000
5:71042182:A:CC308W1.000
5:71042183:C:TC308Y1.000
5:71042184:A:GC308R1.000
5:71042191:A:CC305W1.000
5:71042192:C:TC305Y1.000
5:71042193:A:GC305R1.000
5:71042201:G:TP302H1.000
5:71042224:A:CC294W1.000
5:71042226:A:GC294R1.000
5:71042233:G:CC291W1.000
5:71042234:C:TC291Y1.000
5:71042235:A:GC291R1.000
5:71045503:A:CF254L1.000
5:71045503:A:TF254L1.000
5:71045504:A:CF254C1.000
5:71045504:A:GF254S1.000
5:71045505:A:CF254V1.000
5:71045505:A:GF254L1.000
5:71048006:C:GG253R1.000
5:71048006:C:TG253R1.000
5:71049113:G:CS172R1.000
5:71049113:G:TS172R1.000
5:71049115:T:GS172R1.000
5:71060123:A:CS102R1.000
5:71060123:A:TS102R1.000
5:71060125:T:GS102R1.000

dbSNP variants (sampled 300 via entrez): RS1000330265 (5:71063919 A>C), RS1000927154 (5:71056263 G>C,T), RS1002328080 (5:71061435 C>T), RS1002931893 (5:71039973 A>G), RS1004375051 (5:71050736 G>A), RS1004810720 (5:71046348 A>C,G), RS1006464156 (5:71054969 T>C), RS1006891994 (5:71056093 T>C), RS1006926714 (5:71055231 T>C), RS1008471915 (5:71036515 TG>T), RS1008565273 (5:71063822 G>A), RS1008594959 (5:71061658 CT>C,CTT), RS1010264733 (5:71059415 G>A,C), RS1010574096 (5:71051944 T>C), RS1010606648 (5:71051209 G>A)

Disease associations

OMIM: gene MIM:601748 | disease phenotypes:

GenCC curated gene-disease

Mondo (1): primary amenorrhea (MONDO:1060208)

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

21 total (human), top 21 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arsenitedecreases expression, increases expression2
Tretinoindecreases expression2
Cadmium Chloridedecreases expression, increases abundance, increases expression2
myristicindecreases expression1
uranyl acetateaffects expression1
bisphenol Aincreases expression1
beta-lapachoneincreases expression1
bromoacetatedecreases expression1
abrinedecreases expression1
Arecolinedecreases expression1
Benzo(a)pyreneincreases expression1
Cadmiumincreases abundance, increases expression1
Doxorubicindecreases expression1
Formaldehydedecreases expression1
Seleniumdecreases expression, affects cotreatment, increases expression1
Uraniumaffects expression1
Urethanedecreases expression1
Vitamin Eaffects cotreatment, increases expression, decreases expression1
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxidedecreases expression1
Aflatoxin B1decreases methylation1
Lactic Aciddecreases expression1

Clinical trials (associated diseases)

1 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT07164248Not specifiedCOMPLETEDEvaluation of Bone Mineral Density Indications and Outcomes in Female Adolescents: Implications for Early Detection of Osteopenia/Osteoporosis and Gynecologic Practice
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): primary amenorrhea