GTF2H4

gene
On this page

Also known as TFB2TFIIHP52

Summary

GTF2H4 (general transcription factor IIH subunit 4, HGNC:4658) is a protein-coding gene on chromosome 6p21.33, encoding General transcription factor IIH subunit 4 (Q92759). Component of the general transcription and DNA repair factor IIH (TFIIH) core complex, which is involved in general and transcription-coupled nucleotide excision repair (NER) of damaged DNA and, when complexed to CAK, in RNA transcription by RNA polymerase II. It is a common-essential gene (DepMap: required in 95.3% of cancer cell lines).

Enables RNA polymerase II general transcription initiation factor activity. Involved in transcription by RNA polymerase II. Located in nuclear speck. Part of core TFIIH complex portion of holo TFIIH complex and transcription factor TFIID complex.

Source: NCBI Gene 2968 — RefSeq curated summary.

At a glance

  • GWAS associations: 26
  • Clinical variants (ClinVar): 66 total — 2 pathogenic
  • Phenotypes (HPO): 24
  • Cancer dependency (DepMap): dependent in 95.3% of screened cell lines (common-essential)
  • MANE Select transcript: NM_001517

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:4658
Approved symbolGTF2H4
Namegeneral transcription factor IIH subunit 4
Location6p21.33
Locus typegene with protein product
StatusApproved
AliasesTFB2, TFIIH, P52
Ensembl geneENSG00000213780
Ensembl biotypeprotein_coding
OMIM601760
Entrez2968

Gene structure

Transcript identifiers

Ensembl transcripts: 17 — 13 protein_coding, 3 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000259895, ENST00000376316, ENST00000453897, ENST00000475845, ENST00000483318, ENST00000487746, ENST00000903713, ENST00000903714, ENST00000903715, ENST00000903716, ENST00000903717, ENST00000903718, ENST00000924708, ENST00000924709, ENST00000942992, ENST00000942993, ENST00000942994

RefSeq mRNA: 1 — MANE Select: NM_001517 NM_001517

CCDS: CCDS34386

Canonical transcript exons

ENST00000259895 — 14 exons

ExonStartEnd
ENSE000016704953091232830912458
ENSE000017119123091201430912146
ENSE000019108283090820730908403
ENSE000034879303091168430911767
ENSE000034964673091115830911269
ENSE000035125053091381130914106
ENSE000035134043090993230910063
ENSE000035390363090943530909539
ENSE000035435663091143130911499
ENSE000035477083091330930913387
ENSE000035815673091085330910941
ENSE000035817403091066530910761
ENSE000036900533091311030913157
ENSE000037709883090903430909173

Expression profiles

Bgee: expression breadth ubiquitous, 139 present calls, max score 91.93.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 28.9870 / max 283.2635, expressed in 1800 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
6683628.98701800

Top tissues by expression

139 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right lobe of liverUBERON:000111491.93gold quality
pituitary glandUBERON:000000791.84gold quality
metanephros cortexUBERON:001053391.37gold quality
right hemisphere of cerebellumUBERON:001489091.32gold quality
cerebellar hemisphereUBERON:000224590.92gold quality
cerebellar cortexUBERON:000212990.88gold quality
cerebellumUBERON:000203790.83gold quality
adenohypophysisUBERON:000219690.73gold quality
left lobe of thyroid glandUBERON:000112090.39gold quality
right lobe of thyroid glandUBERON:000111990.25gold quality
thyroid glandUBERON:000204690.19gold quality
cortical plateUBERON:000534390.13gold quality
tibial arteryUBERON:000761089.97gold quality
body of uterusUBERON:000985389.97gold quality
popliteal arteryUBERON:000225089.96gold quality
right coronary arteryUBERON:000162589.56gold quality
apex of heartUBERON:000209889.43gold quality
tibial nerveUBERON:000132389.36gold quality
smooth muscle tissueUBERON:000113589.28gold quality
left coronary arteryUBERON:000162689.28gold quality
fundus of stomachUBERON:000116089.26gold quality
descending thoracic aortaUBERON:000234589.22gold quality
ascending aortaUBERON:000149689.21gold quality
thoracic aortaUBERON:000151589.19gold quality
myometriumUBERON:000129688.94gold quality
subcutaneous adipose tissueUBERON:000219088.84gold quality
esophagogastric junction muscularis propriaUBERON:003584188.80gold quality
endocervixUBERON:000045888.56gold quality
right ovaryUBERON:000211888.54gold quality
liverUBERON:000210788.53gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no0.97

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): E2F1, E2F4

miRNA regulators (miRDB)

4 targeting GTF2H4, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-4477A98.8369.752952
HSA-MIR-6779-3P97.5165.82789
HSA-MIR-797695.7565.671186
HSA-MIR-7108-3P94.3764.79183

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 95.3% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 18)

  • General transcription factor IIH protects promoters from PC4-mediated repression by relieving the topological constraint imposed by PC4 through the ERCC3 helicase activity rather than by reducing the repressive activity of PC4 via its phosphorylation. (PMID:12590132)
  • The second domain of TFIIH p62 subunit(residues 186-240) contains a region of high sequence conservation with an invariant tyrosine/phenylalanine motif which could play a key role as a protein-protein recognition module within TFIIH. (PMID:15533047)
  • TFIIH uses an expanded proximal promoter to regulate c-myc expression (PMID:15601838)
  • We conclude that the recruitment and activation of TFIIH represents a rate-limiting step for the emergence of HIV from latency. (PMID:16874302)
  • Alterations of chromatin at the RNA polymerase II stall site, which depend on CSB and TFIIH at least, are necessary for the UV-induced translocation of CSA to the nuclear matrix. (PMID:17242193)
  • XPG forms a stable complex with TFIIH, which is active in transcription and nucleotide excision repair (PMID:17466625)
  • mechanism in which the helicase activity of XPB is not used for the opening and repair of damaged DNA, which is instead only driven by its ATPase activity, in combination with the helicase activity of XPD (PMID:17466626)
  • The specific binding of the C-terminal acidic domain (AC-D) of the human TFIIEalpha subunit to the pleckstrin homology domain (PH-D) of the human TFIIH p62 subunit is demonstrated. (PMID:18354501)
  • TAF7 interacts with the transcription factors, TFIIH and P-TEFb, resulting in the inhibition of their Pol II CTD kinase activities (PMID:18391197)
  • TFIIH changes subunit composition in response to DNA damage. The CAK is released from the core during nucleotide excision repair (NER). (PMID:18614043)
  • For 6-4 photoproducts, we show that TFIIH complexes carrying an NH(2)-terminal XPD mutated protein are also deficient in recruitment of NER proteins downstream of TFIIH (PMID:18676829)
  • The frequency of congenital ichthyosis, collodion-baby type, was significantly higher in the TFIIH mutated group of trichothiodystrophy patients. (PMID:19681155)
  • Study demonstrates that an essential initiation factor, TFB2, forms a network of interactions with DNA near the transcription start site and facilitates promoter melting but may not be essential for promoter recognition. (PMID:19945377)
  • A single nucleotide polymorphism variant within the general transcription factor IIH, polypeptide 4 gene, GTF2H4, on chromosome 6p21.33 was significantly associated with MS (PMID:20522537)
  • GTF2H4 variants may not be associated with susceptibility to aspirin-exacerbated respiratory disease and obstructive symptoms in asthmatics. (PMID:22524621)
  • functional genetic variants of GTF2H4 confer susceptibility to lung cancer. (PMID:27288692)
  • Data demonstrates gene expression changes in differentially methylated GTF2H4 gene in patients with age-related macular degeneration. (PMID:30642396)
  • The role of Transcription Factor IIH complex in nucleotide excision repair. (PMID:37545038)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriogtf2h4ENSDARG00000036071
mus_musculusGtf2h4ENSMUSG00000001524
rattus_norvegicusGtf2h4ENSRNOG00000000831
drosophila_melanogastermrnFBGN0261109
caenorhabditis_elegansWBGENE00013529

Protein

Protein identifiers

General transcription factor IIH subunit 4Q92759 (reviewed: Q92759)

Alternative names: Basic transcription factor 2 52 kDa subunit, General transcription factor IIH polypeptide 4, TFIIH basal transcription factor complex p52 subunit

All UniProt accessions (2): A0A1U9X7S4, Q92759

UniProt curated annotations — full annotation on UniProt →

Function. Component of the general transcription and DNA repair factor IIH (TFIIH) core complex, which is involved in general and transcription-coupled nucleotide excision repair (NER) of damaged DNA and, when complexed to CAK, in RNA transcription by RNA polymerase II. In NER, TFIIH acts by opening DNA around the lesion to allow the excision of the damaged oligonucleotide and its replacement by a new DNA fragment. In transcription, TFIIH has an essential role in transcription initiation. When the pre-initiation complex (PIC) has been established, TFIIH is required for promoter opening and promoter escape. Phosphorylation of the C-terminal tail (CTD) of the largest subunit of RNA polymerase II by the kinase module CAK controls the initiation of transcription. Stimulates the ATPase activity of TFIIH subunit XPB/ERCC3.

Subunit / interactions. Component of the 7-subunit TFIIH core complex composed of XPB/ERCC3, XPD/ERCC2, GTF2H1, GTF2H2, GTF2H3, GTF2H4 and GTF2H5, which is active in NER. The core complex associates with the 3-subunit CDK-activating kinase (CAK) module composed of CCNH/cyclin H, CDK7 and MNAT1 to form the 10-subunit holoenzyme (holo-TFIIH) active in transcription. Part of TBP-based Pol II pre-initiation complex (PIC), in which Pol II core assembles with general transcription factors and other specific initiation factors including GTF2E1, GTF2E2, GTF2F1, GTF2F2, TCEA1, ERCC2, ERCC3, GTF2H2, GTF2H3, GTF2H4, GTF2H5, GTF2A1, GTF2A2, GTF2B and TBP; this large multi-subunit PIC complex mediates DNA unwinding and targets Pol II core to the transcription start site where the first phosphodiester bond forms.

Subcellular location. Nucleus.

Domain organisation. The C-terminus (residues 305-462) stimulates the ATPase activity of XPB/ERCC3.

Similarity. Belongs to the TFB2 family.

Isoforms (2)

UniProt IDNamesCanonical?
Q92759-11yes
Q92759-22

RefSeq proteins (1): NP_001508* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR004598TFIIH_p52/Tfb2Family
IPR040662Tfb2_CDomain

Pfam: PF03849, PF18307

UniProt features (47 total): helix 22, strand 17, splice variant 3, turn 3, chain 1, sequence variant 1

Structure

Experimental structures (PDB)

51 structures, top 30 by resolution.

PDBMethodResolution (Å)
7NVVELECTRON MICROSCOPY2.9
28JMELECTRON MICROSCOPY3.29
7EGBELECTRON MICROSCOPY3.3
8EBUELECTRON MICROSCOPY3.3
9PD3ELECTRON MICROSCOPY3.3
28JSELECTRON MICROSCOPY3.32
9PD4ELECTRON MICROSCOPY3.4
6RO4ELECTRON MICROSCOPY3.5
7AD8ELECTRON MICROSCOPY3.5
9XYUELECTRON MICROSCOPY3.5
28KEELECTRON MICROSCOPY3.6
8EBXELECTRON MICROSCOPY3.6
8EBYELECTRON MICROSCOPY3.6
6NMIELECTRON MICROSCOPY3.7
7EGCELECTRON MICROSCOPY3.9
7NVXELECTRON MICROSCOPY3.9
8EBTELECTRON MICROSCOPY3.9
28JVELECTRON MICROSCOPY3.91
8BVWELECTRON MICROSCOPY4
8EBSELECTRON MICROSCOPY4
7ENAELECTRON MICROSCOPY4.07
8BYQELECTRON MICROSCOPY4.1
7ENCELECTRON MICROSCOPY4.13
8GXSELECTRON MICROSCOPY4.16
7NVWELECTRON MICROSCOPY4.3
9PCPELECTRON MICROSCOPY4.3
5OF4ELECTRON MICROSCOPY4.4
6O9MELECTRON MICROSCOPY4.4
7NVRELECTRON MICROSCOPY4.5
7LBMELECTRON MICROSCOPY4.8

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q92759-F185.850.51

Function

Pathways and Gene Ontology

Reactome pathways

48 pathways

IDPathway
R-HSA-112382Formation of RNA Pol II elongation complex
R-HSA-113418Formation of the Early Elongation Complex
R-HSA-167152Formation of HIV elongation complex in the absence of HIV Tat
R-HSA-167158Formation of the HIV-1 Early Elongation Complex
R-HSA-167160RNA Pol II CTD phosphorylation and interaction with CE during HIV infection
R-HSA-167161HIV Transcription Initiation
R-HSA-167162RNA Polymerase II HIV Promoter Escape
R-HSA-167172Transcription of the HIV genome
R-HSA-167200Formation of HIV-1 elongation complex containing HIV-1 Tat
R-HSA-167246Tat-mediated elongation of the HIV-1 transcript
R-HSA-427413NoRC negatively regulates rRNA expression
R-HSA-5696395Formation of Incision Complex in GG-NER
R-HSA-5696400Dual Incision in GG-NER
R-HSA-674695RNA Polymerase II Pre-transcription Events
R-HSA-6781823Formation of TC-NER Pre-Incision Complex
R-HSA-6781827Transcription-Coupled Nucleotide Excision Repair (TC-NER)
R-HSA-6782135Dual incision in TC-NER
R-HSA-6782210Gap-filling DNA repair synthesis and ligation in TC-NER
R-HSA-6796648TP53 Regulates Transcription of DNA Repair Genes
R-HSA-72086mRNA Capping
R-HSA-73762RNA Polymerase I Transcription Initiation
R-HSA-73772RNA Polymerase I Promoter Escape
R-HSA-73776RNA Polymerase II Promoter Escape
R-HSA-73779RNA Polymerase II Transcription Pre-Initiation And Promoter Opening
R-HSA-73863RNA Polymerase I Transcription Termination
R-HSA-75953RNA Polymerase II Transcription Initiation
R-HSA-75955RNA Polymerase II Transcription Elongation
R-HSA-76042RNA Polymerase II Transcription Initiation And Promoter Clearance
R-HSA-77075RNA Pol II CTD phosphorylation and interaction with CE
R-HSA-162587HIV Life Cycle

MSigDB gene sets: 238 (showing top): GSE45365_NK_CELL_VS_CD8_TCELL_DN, REACTOME_FORMATION_OF_INCISION_COMPLEX_IN_GG_NER, YAGI_AML_WITH_INV_16_TRANSLOCATION, REACTOME_RNA_POLYMERASE_I_TRANSCRIPTION_INITIATION, KAUFFMANN_DNA_REPAIR_GENES, HUMMERICH_SKIN_CANCER_PROGRESSION_DN, PUJANA_CHEK2_PCC_NETWORK, GOBP_NUCLEOTIDE_EXCISION_REPAIR, KOKKINAKIS_METHIONINE_DEPRIVATION_96HR_UP, REACTOME_HIV_INFECTION, KOKKINAKIS_METHIONINE_DEPRIVATION_48HR_UP, GOBP_DNA_DAMAGE_RESPONSE, GOBP_DNA_TEMPLATED_TRANSCRIPTION_INITIATION, GOBP_TRANSCRIPTION_INITIATION_AT_RNA_POLYMERASE_II_PROMOTER, BENPORATH_ES_CORE_NINE_CORRELATED

GO Biological Process (4): DNA repair (GO:0006281), nucleotide-excision repair (GO:0006289), transcription by RNA polymerase II (GO:0006366), DNA damage response (GO:0006974)

GO Molecular Function (4): ATPase activator activity (GO:0001671), double-stranded DNA binding (GO:0003690), RNA polymerase II general transcription initiation factor activity (GO:0016251), protein binding (GO:0005515)

GO Cellular Component (7): core TFIIH complex portion of holo TFIIH complex (GO:0000438), transcription factor TFIIH core complex (GO:0000439), nucleus (GO:0005634), nucleoplasm (GO:0005654), transcription factor TFIID complex (GO:0005669), transcription factor TFIIH holo complex (GO:0005675), nuclear speck (GO:0016607)

Reactome top-level categories

Rollup of top-12 pathways:

CategoryPathways
Transcription of the HIV genome4
Transcription-Coupled Nucleotide Excision Repair (TC-NER)3
RNA Polymerase II Transcription Elongation2
HIV Transcription Elongation2
Global Genome Nucleotide Excision Repair (GG-NER)2
Late Phase of HIV Life Cycle1
Tat-mediated elongation of the HIV-1 transcript1
Negative epigenetic regulation of rRNA expression1
RNA Polymerase II Transcription1
Nucleotide Excision Repair1
Transcriptional Regulation by TP531
Metabolism of RNA1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA polymerase II transcription regulator complex3
transcription factor TFIIH core complex2
RNA polymerase II, holoenzyme2
DNA metabolic process1
DNA damage response1
DNA repair1
DNA-templated transcription1
cellular response to stress1
ATP-dependent activity1
molecular function activator activity1
DNA binding1
transcription by RNA polymerase II1
general transcription initiation factor activity1
binding1
transcription factor TFIIH holo complex1
intracellular membrane-bounded organelle1
nuclear lumen1
cellular anatomical structure1
nuclear cyclin-dependent protein kinase holoenzyme complex1
carboxy-terminal domain protein kinase complex1
nuclear ribonucleoprotein granule1

Protein interactions and networks

STRING

1632 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GTF2H4GTF2H1P32780999
GTF2H4GTF2H2Q13888999
GTF2H4ERCC3P19447998
GTF2H4ERCC2P18074997
GTF2H4GTF2H3Q13889997
GTF2H4GTF2H5Q6ZYL4995
GTF2H4CCNHP51946959
GTF2H4MNAT1P51948931
GTF2H4CDK7P50613885
GTF2H4XPAP23025763
GTF2H4GTF2H2CQ6P1K8738
GTF2H4PUF60Q9UHX1670
GTF2H4TFAMQ00059667
GTF2H4GTF2BQ00403625
GTF2H4A0A090J7P6A0A090J7P6623

IntAct

71 interactions, top by confidence:

ABTypeScore
GTF2H5GTF2H4psi-mi:“MI:0915”(physical association)0.860
GTF2H1CDK7psi-mi:“MI:0915”(physical association)0.820
CCNHERCC2psi-mi:“MI:0915”(physical association)0.750
GTF2H3GTF2H1psi-mi:“MI:0914”(association)0.740
CETN2SFI1psi-mi:“MI:0914”(association)0.740
GTF2H5GTF2H1psi-mi:“MI:0914”(association)0.730
ERCC2ERCC3psi-mi:“MI:0915”(physical association)0.670
EAF1GTF2H4psi-mi:“MI:0915”(physical association)0.670
GTF2H4EAF1psi-mi:“MI:0915”(physical association)0.670
GTF2H4GTF2H1psi-mi:“MI:0914”(association)0.670
GTF2H5ERCC2psi-mi:“MI:0914”(association)0.650
GTF2H3ERCC3psi-mi:“MI:0914”(association)0.640
GTF2H2ERCC2psi-mi:“MI:0914”(association)0.620
THAP3GTF2H4psi-mi:“MI:0915”(physical association)0.620
SREK1IP1GTF2H4psi-mi:“MI:0915”(physical association)0.560
EIF1ADGTF2H4psi-mi:“MI:0915”(physical association)0.560
GTF2H4ERCC2psi-mi:“MI:0914”(association)0.530
NEURL4APBB1psi-mi:“MI:0914”(association)0.530
ERCC3BCRpsi-mi:“MI:0914”(association)0.530
GTF2H2CERCC3psi-mi:“MI:0914”(association)0.530
THAP3CASC3psi-mi:“MI:0914”(association)0.530
HDGFL2CDC7psi-mi:“MI:0914”(association)0.530
Rpl35RPS6psi-mi:“MI:0914”(association)0.350
GTF2H5GTF2H3psi-mi:“MI:0914”(association)0.350

BioGRID (168): GTF2H4 (Affinity Capture-MS), GTF2H4 (Co-fractionation), GTF2H4 (Co-fractionation), GTF2H4 (Co-fractionation), GTF2H4 (Co-fractionation), GTF2H4 (Co-fractionation), GTF2H4 (Co-fractionation), GTF2H4 (Affinity Capture-MS), GTF2H4 (Affinity Capture-MS), GTF2H4 (Affinity Capture-MS), GTF2H4 (Affinity Capture-MS), GTF2H4 (Affinity Capture-MS), GTF2H4 (Affinity Capture-MS), GTF2H4 (Affinity Capture-Western), GTF2H5 (Reconstituted Complex)

ESM2 similar proteins: A4F267, A6QR22, E9PZQ0, F1LMY4, O70422, O96008, P07144, P11716, P16960, P21817, P42054, P42055, P42056, P45880, P46274, P60027, P68002, P68003, P81155, P82013, P86223, Q0JNK5, Q1LZB5, Q29380, Q53PC7, Q5E9L7, Q5R7V4, Q5U3I0, Q60930, Q60931, Q6K548, Q6P825, Q75Q40, Q7F4F8, Q7ZTM6, Q91W86, Q920Q4, Q92759, Q969M1, Q9CZR3

Diamond homologs: O70422, P60027, P87303, Q02939, Q54C29, Q680U9, Q6BGW8, Q6BZX4, Q6CLR2, Q6FP41, Q75B51, Q92759

SIGNOR signaling

1 interactions.

AEffectBMechanism
GTF2H4“form complex”TFIIHbinding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 58 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
RNA Pol II CTD phosphorylation and interaction with CE during HIV infection10110.2×5e-17
RNA Pol II CTD phosphorylation and interaction with CE10110.2×5e-17
mRNA Capping10102.9×7e-17
Formation of the Early Elongation Complex1090.8×1e-16
Formation of the HIV-1 Early Elongation Complex1090.8×1e-16
Global Genome Nucleotide Excision Repair (GG-NER)786.4×1e-11
Formation of Incision Complex in GG-NER1282.3×2e-18
RNA Polymerase I Transcription Termination979.4×1e-14

GO biological processes:

GO termPartnersFoldFDR
nucleotide-excision repair1186.0×1e-16
transcription initiation at RNA polymerase II promoter1076.4×9e-15
transcription by RNA polymerase II912.9×3e-06
DNA repair79.1×6e-04

Disease & clinical

Clinical variants and AI predictions

ClinVar

66 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic2
Likely pathogenic0
Uncertain significance42
Likely benign3
Benign2

Top pathogenic / likely-pathogenic (2)

Variant IDHGVSClassification
4533342GTF2H4, IVS2, G-A, -1Pathogenic
4533343GTF2H4, 2-BP DEL/3-BP INS, NT 1203Pathogenic

SpliceAI

1601 predictions. Top by Δscore:

VariantEffectΔscore
6:30909428:T:TAacceptor_gain1.0000
6:30909430:TACA:Tacceptor_loss1.0000
6:30909431:A:AGacceptor_gain1.0000
6:30909431:ACAG:Aacceptor_gain1.0000
6:30909431:ACAGG:Aacceptor_gain1.0000
6:30909432:C:Gacceptor_gain1.0000
6:30909432:CA:Cacceptor_loss1.0000
6:30909432:CAGGG:Cacceptor_gain1.0000
6:30909433:A:AGacceptor_gain1.0000
6:30909433:A:ATacceptor_loss1.0000
6:30909433:AG:Aacceptor_gain1.0000
6:30909433:AGG:Aacceptor_gain1.0000
6:30909433:AGGGA:Aacceptor_gain1.0000
6:30909434:G:GAacceptor_gain1.0000
6:30909434:GG:Gacceptor_gain1.0000
6:30909434:GGG:Gacceptor_gain1.0000
6:30909434:GGGA:Gacceptor_gain1.0000
6:30909434:GGGAG:Gacceptor_gain1.0000
6:30909536:GCAA:Gdonor_gain1.0000
6:30909537:CAA:Cdonor_gain1.0000
6:30909539:AGTAA:Adonor_loss1.0000
6:30909540:G:GGdonor_gain1.0000
6:30909540:GTA:Gdonor_loss1.0000
6:30909541:TAA:Tdonor_loss1.0000
6:30909930:A:AGacceptor_gain1.0000
6:30909930:AG:Aacceptor_gain1.0000
6:30909930:AGG:Aacceptor_gain1.0000
6:30909931:G:Aacceptor_gain1.0000
6:30909931:G:GAacceptor_gain1.0000
6:30909931:GGG:Gacceptor_gain1.0000

AlphaMissense

3000 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:30909159:T:GC41W1.000
6:30910042:T:CL118P1.000
6:30910733:T:AL148H1.000
6:30910745:C:AA152D1.000
6:30910756:T:AW156R1.000
6:30910756:T:CW156R1.000
6:30910758:G:CW156C1.000
6:30910758:G:TW156C1.000
6:30911195:G:CG200R1.000
6:30911196:G:AG200D1.000
6:30911198:T:CF201L1.000
6:30911200:C:AF201L1.000
6:30911200:C:GF201L1.000
6:30911204:T:CF203L1.000
6:30911205:T:CF203S1.000
6:30911206:C:AF203L1.000
6:30911206:C:GF203L1.000
6:30911208:T:CL204P1.000
6:30911727:T:CL262P1.000
6:30911736:T:CL265P1.000
6:30911747:G:AG269R1.000
6:30911747:G:CG269R1.000
6:30911747:G:TG269W1.000
6:30911748:G:AG269E1.000
6:30911748:G:TG269V1.000
6:30911754:T:AV271D1.000
6:30911763:G:CR274T1.000
6:30911763:G:TR274M1.000
6:30911764:G:CR274S1.000
6:30911764:G:TR274S1.000

dbSNP variants (sampled 300 via entrez): RS1000147397 (6:30909929 C>A,T), RS1000838947 (6:30914276 G>A), RS1001601716 (6:30910421 C>G), RS1001623670 (6:30908981 G>C), RS1001909214 (6:30907983 G>C), RS1001940463 (6:30908204 T>C), RS1003017365 (6:30911940 A>G), RS1005424937 (6:30909913 C>T), RS1005658139 (6:30912991 A>G), RS1006013568 (6:30911868 G>A), RS1006316243 (6:30908202 T>C), RS1006661418 (6:30914317 C>T), RS1007677254 (6:30914364 G>A,T), RS1008790953 (6:30912042 G>A), RS1009721713 (6:30908612 A>C)

Disease associations

OMIM: gene MIM:601760 | disease phenotypes: MIM:621435

GenCC curated gene-disease

Mondo (1): xeroderma pigmentosum, complementation group J (MONDO:0980987)

Orphanet (0):

HPO phenotypes

24 total (24 of 24 shown, HPO-id order):

HPOTerm
HP:0000007Autosomal recessive inheritance
HP:0000252Microcephaly
HP:0000490Deeply set eye
HP:0000613Photophobia
HP:0000750Delayed speech and language development
HP:0000958Dry skin
HP:0000992Cutaneous photosensitivity
HP:0001003Multiple lentigines
HP:0001059Pterygium
HP:0001256Mild intellectual disability
HP:0001518Small for gestational age
HP:0001595Abnormal hair morphology
HP:0001597Abnormal nail morphology
HP:0001882Decreased total leukocyte count
HP:0001903Anemia
HP:0002527Falls
HP:0002750Delayed skeletal maturation
HP:0003593Infantile onset
HP:0004322Short stature
HP:0008064Ichthyosis
HP:0010783Erythema
HP:0011342Mild global developmental delay
HP:0031936Delayed ability to walk
HP:0100699Scarring

GWAS associations

26 associations (top):

StudyTraitp-value
GCST000853_3Ulcerative colitis4.000000e-06
GCST004521_114Autism spectrum disorder or schizophrenia3.000000e-17
GCST004521_117Autism spectrum disorder or schizophrenia3.000000e-15
GCST004521_121Autism spectrum disorder or schizophrenia3.000000e-13
GCST004521_131Autism spectrum disorder or schizophrenia2.000000e-10
GCST004521_132Autism spectrum disorder or schizophrenia2.000000e-09
GCST004521_171Autism spectrum disorder or schizophrenia4.000000e-14
GCST004521_19Autism spectrum disorder or schizophrenia2.000000e-12
GCST004521_2Autism spectrum disorder or schizophrenia2.000000e-16
GCST004521_209Autism spectrum disorder or schizophrenia5.000000e-16
GCST004521_210Autism spectrum disorder or schizophrenia5.000000e-15
GCST004521_211Autism spectrum disorder or schizophrenia5.000000e-15
GCST004521_265Autism spectrum disorder or schizophrenia7.000000e-14
GCST004521_27Autism spectrum disorder or schizophrenia1.000000e-09
GCST004521_295Autism spectrum disorder or schizophrenia6.000000e-18
GCST004521_3Autism spectrum disorder or schizophrenia2.000000e-15
GCST004521_33Autism spectrum disorder or schizophrenia1.000000e-08
GCST004521_48Autism spectrum disorder or schizophrenia1.000000e-09
GCST004521_70Autism spectrum disorder or schizophrenia8.000000e-20
GCST004521_79Autism spectrum disorder or schizophrenia1.000000e-16
GCST004723_2Conotruncal heart defects (maternal effects)3.000000e-07
GCST004723_3Conotruncal heart defects (maternal effects)8.000000e-07
GCST004946_173Schizophrenia2.000000e-08
GCST005541_14Sarcoidosis (Lofgren’s syndrome vs non-Lofgren’s syndrome)2.000000e-24
GCST010725_70Malaria5.000000e-06
GCST010725_9Malaria6.000000e-06

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

36 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Aincreases abundance, affects cotreatment, increases methylation, decreases expression, increases expression (+1 more)4
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxideincreases expression, decreases expression2
Cyclosporineincreases expression2
ginger extractdecreases reaction, increases abundance, increases expression1
2,4,6-tribromophenoldecreases expression1
sodium arsenitedecreases expression, increases abundance1
tetrabromobisphenol Aincreases expression1
CGP 52608affects binding, increases reaction1
monomethylarsonous acidincreases expression1
CPG-oligonucleotidedecreases expression1
pentabrominated diphenyl ether 100increases expression1
hexabrominated diphenyl ether 153increases expression1
(+)-JQ1 compounddecreases expression1
MT19c compounddecreases expression1
Resveratrolincreases expression1
Temozolomideincreases expression1
Fulvestrantdecreases methylation, affects cotreatment, increases methylation1
Arsenicdecreases expression, increases abundance1
Atrazinedecreases expression1
Benzo(a)pyreneaffects methylation, increases methylation1
Carbamazepineaffects expression1
Coumestrolincreases expression1
Dexamethasoneincreases expression, affects cotreatment1
Diclofenacaffects expression1
Ellagic Acidincreases expression1
Indomethacinaffects cotreatment, increases expression1
Ivermectindecreases expression1
Niclosamidedecreases expression, increases expression1
Oils, Volatiledecreases reaction, increases abundance, increases expression1
Quercetinincreases expression1

Cellosaurus cell lines

3 cell lines: 3 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A2J0SEES3-1V human GTF2H4, clone1Embryonic stem cellMale
CVCL_A2J1SEES3-1V human GTF2H4, clone2Embryonic stem cellMale
CVCL_A2J2SEES3-1V human GTF2H4, clone3Embryonic stem cellMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.