GTF2I

gene
On this page

Also known as TFII-IBAP-135SPINBTKAP1DIWSIB291

Summary

GTF2I (general transcription factor IIi, HGNC:4659) is a protein-coding gene on chromosome 7q11.23, encoding General transcription factor II-I (P78347). Interacts with the basal transcription machinery by coordinating the formation of a multiprotein complex at the C-FOS promoter, and linking specific signal responsive activator complexes.

This gene encodes a phosphoprotein containing six characteristic repeat motifs. The encoded protein binds to the initiator element (Inr) and E-box element in promoters and functions as a regulator of transcription. This locus, along with several other neighboring genes, is deleted in Williams-Beuren syndrome. There are many closely related genes and pseudogenes for this gene on chromosome 7. This gene also has pseudogenes on chromosomes 9, 13, and 21. Alternatively spliced transcript variants encoding multiple isoforms have been observed.

Source: NCBI Gene 2969 — RefSeq curated summary.

At a glance

  • GWAS associations: 12
  • Clinical variants (ClinVar): 82 total — 1 pathogenic
  • Phenotypes (HPO): 186
  • Druggable target: yes — 1 molecules with ChEMBL bioactivity
  • Cancer driver (intOGen): activating (oncogene-like) across 1 cancer types
  • Dosage sensitivity (ClinGen): haploinsufficiency no evidence, triplosensitivity no evidence
  • Transcription factor: yes — 27 downstream targets (CollecTRI)
  • MANE Select transcript: NM_032999

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:4659
Approved symbolGTF2I
Namegeneral transcription factor IIi
Location7q11.23
Locus typegene with protein product
StatusApproved
AliasesTFII-I, BAP-135, SPIN, BTKAP1, DIWS, IB291
Ensembl geneENSG00000263001
Ensembl biotypeprotein_coding
OMIM601679
Entrez2969

Gene structure

Transcript identifiers

Ensembl transcripts: 111 — 95 protein_coding, 10 retained_intron, 4 protein_coding_CDS_not_defined, 2 nonsense_mediated_decay

ENST00000432143, ENST00000438130, ENST00000443166, ENST00000462915, ENST00000464471, ENST00000473333, ENST00000476035, ENST00000482232, ENST00000484840, ENST00000490431, ENST00000491325, ENST00000573035, ENST00000613513, ENST00000614048, ENST00000614986, ENST00000615596, ENST00000620631, ENST00000620879, ENST00000621040, ENST00000621640, ENST00000621734, ENST00000650807, ENST00000652150, ENST00000690345, ENST00000850946, ENST00000901261, ENST00000901262, ENST00000901263, ENST00000901264, ENST00000901265, ENST00000901266, ENST00000901267, ENST00000901268, ENST00000901269, ENST00000901270, ENST00000901271, ENST00000901272, ENST00000901273, ENST00000901274, ENST00000901275, ENST00000901276, ENST00000901277, ENST00000901278, ENST00000901279, ENST00000901280, ENST00000901281, ENST00000901282, ENST00000901283, ENST00000901284, ENST00000901285, ENST00000901286, ENST00000901287, ENST00000901288, ENST00000901289, ENST00000901290, ENST00000901291, ENST00000901292, ENST00000901293, ENST00000901294, ENST00000901295, ENST00000901296, ENST00000901297, ENST00000901298, ENST00000901299, ENST00000901300, ENST00000901301, ENST00000901302, ENST00000932146, ENST00000932147, ENST00000932148, ENST00000932149, ENST00000932150, ENST00000932151, ENST00000932152, ENST00000932153, ENST00000932154, ENST00000932155, ENST00000932156, ENST00000932157, ENST00000932158, ENST00000932159, ENST00000932160, ENST00000932161, ENST00000932162, ENST00000932163, ENST00000932164, ENST00000932165, ENST00000932166, ENST00000932167, ENST00000932168, ENST00000932169, ENST00000932170, ENST00000932171, ENST00000932172, ENST00000932173, ENST00000932174, ENST00000932175, ENST00000932176, ENST00000932177, ENST00000932178, ENST00000932179, ENST00000949269, ENST00000949270, ENST00000949271, ENST00000949272, ENST00000949273, ENST00000949274, ENST00000949275, ENST00000949276, ENST00000949277, ENST00000949278

RefSeq mRNA: 6 — MANE Select: NM_032999 NM_001163636, NM_001280800, NM_001518, NM_032999, NM_033000, NM_033001

CCDS: CCDS47614, CCDS5573, CCDS5574, CCDS5575, CCDS64680

Canonical transcript exons

ENST00000573035 — 35 exons

ExonStartEnd
ENSE000026339067469097374691111
ENSE000026368407471485774714916
ENSE000026381357475878074758855
ENSE000026402607471887974718941
ENSE000026466287471689474716950
ENSE000026492587465771874658068
ENSE000026789237475954874760692
ENSE000029948297474588374745941
ENSE000030119967474344974743520
ENSE000030285257475209074752170
ENSE000030410557473804574738103
ENSE000030534167472878674728896
ENSE000030555077473022974730294
ENSE000030780627473650074736683
ENSE000030836107473392374733981
ENSE000031020377474902574749090
ENSE000031132397475136174751416
ENSE000031162927474801574748116
ENSE000031379097474633774746411
ENSE000031422477475309474753177
ENSE000031624587473546174735532
ENSE000031674637475384674754029
ENSE000031807997474475874744941
ENSE000031847197475550974755537
ENSE000031896747474926874749451
ENSE000031921897475795974758000
ENSE000032064607475682074756861
ENSE000032065137473247974732662
ENSE000037154337470060674700634
ENSE000037293857469896174699095
ENSE000037329477468912474689227
ENSE000037375417470516474705218
ENSE000037409357470024774700430
ENSE000037508217470639074706433
ENSE000037531037471103274711109

Expression profiles

Bgee: expression breadth ubiquitous, 237 present calls, max score 98.93.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 42.2078 / max 495.0200, expressed in 1815 samples.

FANTOM5 promoters (10 alternative TSS)

Promoter IDTPM avgSamples expressed
7909833.76711807
790993.93961354
791021.4941910
791010.9440575
791000.6378357
791040.4718251
791030.3242153
790960.2238121
791050.216776
2044820.188773

Top tissues by expression

269 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
ganglionic eminenceUBERON:000402398.93gold quality
colonic epitheliumUBERON:000039798.78gold quality
sural nerveUBERON:001548898.75gold quality
ventricular zoneUBERON:000305398.70gold quality
right uterine tubeUBERON:000130298.62gold quality
adrenal tissueUBERON:001830398.57gold quality
right adrenal gland cortexUBERON:003582798.44gold quality
right lobe of thyroid glandUBERON:000111998.33gold quality
left lobe of thyroid glandUBERON:000112098.29gold quality
right adrenal glandUBERON:000123398.29gold quality
cortical plateUBERON:000534398.13gold quality
right ovaryUBERON:000211898.07gold quality
body of pancreasUBERON:000115098.05gold quality
left ovaryUBERON:000211998.03gold quality
metanephros cortexUBERON:001053397.94gold quality
left adrenal glandUBERON:000123497.90gold quality
skin of abdomenUBERON:000141697.79gold quality
left adrenal gland cortexUBERON:003582597.78gold quality
mucosa of stomachUBERON:000119997.77gold quality
ectocervixUBERON:001224997.71gold quality
endocervixUBERON:000045897.70gold quality
tibial nerveUBERON:000132397.68gold quality
skin of legUBERON:000151197.68gold quality
minor salivary glandUBERON:000183097.63gold quality
right hemisphere of cerebellumUBERON:001489097.63gold quality
left uterine tubeUBERON:000130397.43gold quality
cerebellar hemisphereUBERON:000224597.41gold quality
body of uterusUBERON:000985397.40gold quality
gall bladderUBERON:000211097.34gold quality
cerebellar cortexUBERON:000212997.33gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes9.02
E-MTAB-6524no189.82

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

27 targets.

TargetRegulation
ADAM2
AKAP12
BTK
CCND1
CCND2
CCND3
CD44
CD74
CISH
DNAAF4Unknown
E2F2
ERVW-4
FOSActivation
GSCActivation
GTF2I
HBB
HPS4
HSPA5Activation
ITSN1
KDR
POLR2F
SIRT1Activation
SLU7
SMARCA4
SRC
TGFB1
UCN

Upstream regulators (CollecTRI, top): FOXO3, GSC, GTF2I, HDAC3

miRNA regulators (miRDB)

131 targeting GTF2I, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-432-3P100.0067.86705
HSA-LET-7A-3P100.0074.033932
HSA-LET-7B-3P100.0074.083913
HSA-LET-7F-1-3P100.0074.023928
HSA-MIR-98-3P100.0074.083907
HSA-MIR-1193100.0065.93529
HSA-MIR-3134100.0066.43777
HSA-MIR-513A-5P100.0069.772465
HSA-MIR-4668-3P100.0068.742635
HSA-MIR-6867-5P100.0082.213464
HSA-MIR-450099.9972.722367
HSA-MIR-4789-3P99.9970.752484
HSA-MIR-548N99.9871.944170
HSA-LET-7F-2-3P99.9870.982588
HSA-MIR-1185-1-3P99.9871.042593
HSA-MIR-1185-2-3P99.9871.042593
HSA-LET-7A-5P99.9872.291790
HSA-LET-7B-5P99.9872.311790
HSA-LET-7C-5P99.9872.291790
HSA-LET-7E-5P99.9872.291790
HSA-LET-7F-5P99.9872.561784
HSA-LET-7G-5P99.9872.371784
HSA-LET-7I-5P99.9872.371788
HSA-MIR-98-5P99.9872.331787
HSA-MIR-4482-3P99.9872.503147
HSA-MIR-569699.9872.364487
HSA-MIR-27A-3P99.9872.132955
HSA-MIR-27B-3P99.9872.132955
HSA-MIR-998599.9872.112939
HSA-MIR-56899.9869.862084

Functional genomics

ClinGen dosage: haploinsufficiency 0 (no evidence), triplosensitivity 0 (no evidence). ClinGen Gene Dosage Map

Literature-anchored findings (GeneRIF, showing 40)

  • G-kinase I beta interacted specifically with TFII-I, an unusual transcriptional regulator that associates with multiple proteins to modulate both basal and signal-induced transcription (PMID:12082086)
  • GTF2IRD1 and GTF2I have roles in causing deficits on visual spatial functioning (PMID:12865760)
  • Comparison of these five families with reports of other individuals with partial deletions of the WS region most strongly implicates GTF2I in the mental retardation of WS. (PMID:14556246)
  • TFII-I is required for optimal induction of Grp78 by ER stress (PMID:15664986)
  • These results demonstrate that USF1/USF2 and TFII-I interact cooperatively at the upstream RBEIII element and are necessary for the induction of latent HIV-1 in response to T-cell activation signals. (PMID:15767439)
  • human VEGFR-2 promoter is functionally counter-regulated by TFII-I and TFII-IRD1. (PMID:15941713)
  • cGMP-dependent protein kinase Ibeta binds to TFII-I and IRAG through a common interaction motif (PMID:16166082)
  • TFII-I is recruited to the cyclin D1 promoter and transcriptionally activates this gene. (PMID:16314517)
  • TFII-I directly interacts with Bright through amino acids in Bright’s protein interaction domain (PMID:16738337)
  • The data suggest that TFII-I and USF regulate chromatin structure accessibility and recruitment of transcription complexes in the beta-globin gene locus. (PMID:16943425)
  • observations suggest a model in which TFII-I suppresses agonist-induced calcium entry by competing with TRPC3 for binding to phospholipase C-gamma (PMID:17023658)
  • TFII-I, PARP1, and SFPQ proteins, each previously implicated in gene regulation, form a complex controlling transcription of DYX1C1. Allelic differences in the promoter or 5’UTR of DYX1C1 may affect factor binding and thus regulation of the gene. (PMID:18445785)
  • Bioinformatics and microarray results were combined to identify TFII-I downstream targets in the vertebrate genome. (PMID:18579769)
  • These results demonstrate an essential role of TFII-I bound at an upstream LTR element for viral replication. (PMID:18976654)
  • analyzed promoter regions of TFII-I genes and described their additional exons, as well as tested tissue specificity of both previously reported and novel alternatively spliced isoforms (PMID:19111598)
  • Results suggest that TFII-I appears to have distinct roles in distinct phases of the mammalian cell cycle. (PMID:19182516)
  • these results provide an initial step in understanding the biological role of Itk-TFII-I signaling in T-cell function. (PMID:19701889)
  • TFII-I plays an inhibitory role in regulating genes that are essential in osteogenesis and intersects with the bone-specific transcription factor Runx2 and the retinoblastoma protein, pRb. (PMID:19880526)
  • functional hemizygosity for the GTF2I and GTF2IRD1 genes is the main cause of the neurocognitive profile and some aspects of the gestalt phenotype of Williams-Beuren syndrome (PMID:19897463)
  • These data indicate that Gtf2i is involved in several aspects of embryonic development and the development of social neurocircuitry and that GTF2I haploinsufficiency could be a contributor to the hypersociability in WBS patients. (PMID:21328569)
  • TFII-I may modulate the cellular functions of BRCA1. (PMID:21407215)
  • Data show that Igh 3’ enhancer-bound OCA-B and promoter-bound TFII-I mediate promoter-enhancer interactions, in both cis and trans, that are important for Igh transcription. (PMID:21549311)
  • These data indicate that an E-box motif (RBE1) within the core promoter in the long terminal repeat of HIV-1 is a bona fide binding site for the RBF-2 transcription factor complex USF1, USF2, and TFII-I. (PMID:21813151)
  • Our findings suggest the GTF2i gene is important in the etiology of autism in individuals with this duplication and in non-duplication cases with severe social interaction problems and repetitive behaviors. (PMID:22048961)
  • TFII-I gene deletion may explain the Williams-Beuren syndrome phenotype because it acts as a negative regulator of TRPC3 expression in human B lymphocytes. (PMID:22566418)
  • GTF2I duplication results in separation anxiety in mice and humans (PMID:22578324)
  • CLIP2 haploinsufficiency by itself does not lead to the physical or cognitive characteristics of the Williams-Beuren syndrome; GTF2IRD1 and GTF2I are the main genes causing the cognitive defects (PMID:22608712)
  • Results reveal novel mechanisms by which TFII-I and DBC1 can modulate cellular fate by affecting cell-cycle control as well as the homologous recombination pathway. (PMID:24231951)
  • TFII-I bridges Proliferating Cell Nuclear Antigen and Polzeta to promote Translesion synthesis (PMID:24922507)
  • GTF2I mutation correlated with better survival. (PMID:24974848)
  • Findings implicate the GTF2I gene in the neurogenetic basis of social communication and social anxiety, both in Williams syndrome and among individuals in healthy populations (PMID:25429715)
  • The GTF2I rs117026326 polymorphism is associated with anti-SSA-positive primary Sjogren’s syndrome. (PMID:25480810)
  • A proportion of this transcriptional dysregulation is caused by dosage imbalances in GTF2I, which encodes a key transcription factor at 7q11.23 that is associated with the LSD1 repressive chromatin complex and silences its dosage-sensitive targets. (PMID:25501393)
  • show that SUMOylation is critical for TFII-I to promote cell proliferation and colony formation. Our findings contribute to understanding the role of SUMOylation in liver cancer development (PMID:25869096)
  • Rather than contributing positively to promoter activity, a putative initiator element at the transcription start site acts as a target for negative regulation imposed on the L4P promoter of human adenovirus Type 5 by cellular TFII-I. (PMID:25926634)
  • Copy-number variation in the general transcription factor gene, GTF2I is associated with gene-dose-dependent anxiety in mouse models and in both Williams syndrome and Dup7. (PMID:26285132)
  • Study demonstrates a significant association between SLE in Chinese Han population and the GTF2I rs117026326 T allele/GTF2IRD1 rs4717901 C allele. (PMID:26320362)
  • A novel interaction between TFII-I and Mdm2 with a negative effect on TFII-I transcriptional activity has been documented. (PMID:26656605)
  • The authors found that human adenovirus 5 infection or ectopic E4-ORF3 expression leads to SUMOylation of TFII-I that precedes a rapid decline in TFII-I protein levels. (PMID:26814176)
  • A common polymorphism in the Williams syndrome gene GTF2I associated with reduced social anxiety predicts decreased threat-related amygdala reactivity, which mediates an association between genotype and increased warmth in women. (PMID:26853120)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusGtf2iENSMUSG00000060261
rattus_norvegicusGtf2iENSRNOG00000001479

Paralogs (18): GTF2IRD1 (ENSG00000006704), ZMYM2 (ENSG00000121741), ZMYM5 (ENSG00000132950), THAP12 (ENSG00000137492), ZMYM4 (ENSG00000146463), ZMYM3 (ENSG00000147130), ZMYM6 (ENSG00000163867), KIAA1958 (ENSG00000165185), GTF2IRD2B (ENSG00000174428), EPM2AIP1 (ENSG00000178567), GTF2IRD2 (ENSG00000196275), ZMYM1 (ENSG00000197056), QRICH1 (ENSG00000198218), FAM200C (ENSG00000221886), FAM200A (ENSG00000221909), SCAND3 (ENSG00000232040), ZBED5 (ENSG00000236287), FAM200B (ENSG00000237765)

Protein

Protein identifiers

General transcription factor II-IP78347 (reviewed: P78347)

Alternative names: Bruton tyrosine kinase-associated protein 135, SRF-Phox1-interacting protein, Williams-Beuren syndrome chromosomal region 6 protein

All UniProt accessions (10): P78347, A0A494C013, A0A494C0Q7, A0A494C1K3, A0A8I5KVB5, C9J6M0, X5D2J9, X5D939, X5DNP5, X5DR09

UniProt curated annotations — full annotation on UniProt →

Function. Interacts with the basal transcription machinery by coordinating the formation of a multiprotein complex at the C-FOS promoter, and linking specific signal responsive activator complexes. Promotes the formation of stable high-order complexes of SRF and PHOX1 and interacts cooperatively with PHOX1 to promote serum-inducible transcription of a reporter gene deriven by the C-FOS serum response element (SRE). Acts as a coregulator for USF1 by binding independently two promoter elements, a pyrimidine-rich initiator (Inr) and an upstream E-box. Required for the formation of functional ARID3A DNA-binding complexes and for activation of immunoglobulin heavy-chain transcription upon B-lymphocyte activation.

Subunit / interactions. Homodimer (Potential). Interacts with SRF and PHOX1. Binds a pyrimidine-rich initiator (Inr) and a recognition site (E-box) for upstream stimulatory factor 1 (USF1). Associates with the PH domain of Bruton’s tyrosine kinase (BTK). May be a component of a BHC histone deacetylase complex that contains HDAC1, HDAC2, HMG20B/BRAF35, KDM1A, RCOR1/CoREST, PHF21A/BHC80, ZMYM2, ZNF217, ZMYM3, GSE1 and GTF2I. Interacts with BTK and ARID3A. Interacts with isoform beta of PRKG1. Interacts with GPR50 (C-TERMINAL DOMAIN).

Subcellular location. Cytoplasm. Nucleus.

Tissue specificity. Ubiquitous. Isoform 1 is strongly expressed in fetal brain, weakly in adult brain, muscle, and lymphoblasts and is almost undetectable in other adult tissues, while the other isoforms are equally expressed in all adult tissues.

Post-translational modifications. Transiently phosphorylated on tyrosine residues by BTK in response to B-cell receptor stimulation. Phosphorylation on Tyr-248 and Tyr-398, and perhaps, on Tyr-503 contributes to BTK-mediated transcriptional activation. Sumoylated.

Disease relevance. GTF2I is located in the Williams-Beuren syndrome (WBS) critical region. WBS results from a hemizygous deletion of several genes on chromosome 7q11.23, thought to arise as a consequence of unequal crossing over between highly homologous low-copy repeat sequences flanking the deleted region. Haploinsufficiency of GTF2I may be the cause of certain cardiovascular and musculo-skeletal abnormalities observed in the disease.

Similarity. Belongs to the TFII-I family.

Isoforms (5)

UniProt IDNamesCanonical?
P78347-11yes
P78347-22
P78347-33
P78347-44
P78347-55

RefSeq proteins (6): NP_001157108, NP_001267729, NP_001509, NP_127492, NP_127493, NP_127494 (=MANE)

Domains & families (InterPro)

IDNameType
IPR004212GTF2IRepeat
IPR016659TF_II-IFamily
IPR036647GTF2I-like_rpt_sfHomologous_superfamily

Pfam: PF02946

UniProt features (120 total): cross-link 34, modified residue 24, helix 13, turn 11, strand 8, repeat 6, region of interest 5, sequence conflict 5, splice variant 4, mutagenesis site 4, compositionally biased region 2, initiator methionine 1, chain 1, short sequence motif 1, sequence variant 1

Structure

Experimental structures (PDB)

4 structures.

PDBMethodResolution (Å)
2D9BSOLUTION NMR
2DN4SOLUTION NMR
2ED2SOLUTION NMR
2EJESOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P78347-F169.390.31

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (58): 278, 298, 2, 19, 103, 130, 207, 210, 214, 248, 353, 398, 412, 450, 503, 517, 556, 558, 668, 674 …

Mutagenesis-validated functional residues (4):

PositionPhenotype
248abolishes btk-mediated transcriptional activation. abolishes btk-mediated phosphorylation and impairs btk-mediated trans
398abolishes btk-mediated transcriptional activation. abolishes btk-mediated phosphorylation and impairs btk-mediated trans
460no change on btk-mediated transcriptional activation.
503impairs btk-mediated transcriptional activation. abolishes btk-mediated phosphorylation and impairs btk-mediated transcr

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 691 (showing top): BERTUCCI_MEDULLARY_VS_DUCTAL_BREAST_CANCER_DN, PAL_PRMT5_TARGETS_UP, CHIANG_LIVER_CANCER_SUBCLASS_UNANNOTATED_DN, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_DN, DARWICHE_SKIN_TUMOR_PROMOTER_DN, DARWICHE_PAPILLOMA_RISK_LOW_DN, DARWICHE_PAPILLOMA_RISK_HIGH_DN, DARWICHE_SQUAMOUS_CELL_CARCINOMA_DN, ADDYA_ERYTHROID_DIFFERENTIATION_BY_HEMIN, CCATCCA_MIR432, GOBP_NEGATIVE_REGULATION_OF_MULTICELLULAR_ORGANISMAL_PROCESS, BLALOCK_ALZHEIMERS_DISEASE_UP, HNF4_DR1_Q3, GOBP_BLOOD_VESSEL_MORPHOGENESIS, GOBP_NEGATIVE_REGULATION_OF_VASCULATURE_DEVELOPMENT

GO Biological Process (3): transcription by RNA polymerase II (GO:0006366), negative regulation of angiogenesis (GO:0016525), positive regulation of transcription by RNA polymerase II (GO:0045944)

GO Molecular Function (6): DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA-binding transcription activator activity, RNA polymerase II-specific (GO:0001228), DNA binding (GO:0003677), DNA-binding transcription factor activity (GO:0003700), RNA polymerase II-specific DNA-binding transcription factor binding (GO:0061629), protein binding (GO:0005515)

GO Cellular Component (4): nucleus (GO:0005634), nucleoplasm (GO:0005654), cytoplasm (GO:0005737), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure3
regulation of transcription by RNA polymerase II2
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
DNA-templated transcription1
angiogenesis1
regulation of angiogenesis1
negative regulation of blood vessel morphogenesis1
transcription by RNA polymerase II1
positive regulation of DNA-templated transcription1
chromatin1
DNA-binding transcription factor activity1
DNA-binding transcription factor activity, RNA polymerase II-specific1
DNA-binding transcription activator activity1
positive regulation of transcription by RNA polymerase II1
nucleic acid binding1
transcription cis-regulatory region binding1
regulation of DNA-templated transcription1
transcription regulator activity1
DNA-binding transcription factor binding1
binding1
intracellular membrane-bounded organelle1
nuclear lumen1
intracellular anatomical structure1

Protein interactions and networks

STRING

1656 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GTF2IBTKQ06187887
GTF2IZMYM3Q14202859
GTF2IUSF1P22415849
GTF2IARHGAP35Q9NRY4829
GTF2IZMYM2Q9UBW7813
GTF2IKDM1AO60341804
GTF2IHYDINQ4G0P3797
GTF2ISRGAP2O75044796
GTF2IGPRIN2O60269780
GTF2ISRGAP3O43295775
GTF2ICLIP2Q9UDT6773
GTF2IHDAC1Q13547766
GTF2INPEPPSP55786763
GTF2IUGT2B17O75795763
GTF2INBPF1Q3BBV0763

IntAct

198 interactions, top by confidence:

ABTypeScore
RFXANKRFXAPpsi-mi:“MI:0914”(association)0.780
YY1TFPTpsi-mi:“MI:0914”(association)0.740
BTKGTF2Ipsi-mi:“MI:0915”(physical association)0.720
GTF2IBTKpsi-mi:“MI:0915”(physical association)0.720
BTKGTF2Ipsi-mi:“MI:0217”(phosphorylation reaction)0.720
CFTRESYT2psi-mi:“MI:2364”(proximity)0.710
E6TAX1BP3psi-mi:“MI:0914”(association)0.650
BRCA1GTF2Ipsi-mi:“MI:0915”(physical association)0.610
GTF2IBRCA1psi-mi:“MI:0915”(physical association)0.610
GTF2IBRCA1psi-mi:“MI:0403”(colocalization)0.610
CHEK2PPM1Gpsi-mi:“MI:0914”(association)0.560
MAD2L2CBX5psi-mi:“MI:0914”(association)0.530
NHLH2AP3B1psi-mi:“MI:0914”(association)0.530
SOSTKPNA4psi-mi:“MI:0914”(association)0.530
TIGD6MTHFRpsi-mi:“MI:0914”(association)0.530
LHFPL4ATP5F1Bpsi-mi:“MI:0914”(association)0.530
E6CASKpsi-mi:“MI:0914”(association)0.520
E6TAX1BP3psi-mi:“MI:0914”(association)0.520
TaxTAX1BP3psi-mi:“MI:0914”(association)0.520
GTF2IZMYM3psi-mi:“MI:0403”(colocalization)0.510
GTF2IZMYM2psi-mi:“MI:0403”(colocalization)0.510
GTF2IZMYM3psi-mi:“MI:2364”(proximity)0.510

BioGRID (513): GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-RNA), GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-MS), GTF2I (Biochemical Activity), GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-MS), GTF2I (Affinity Capture-MS)

ESM2 similar proteins: A0A2R6X6S3, A7MB80, B4GT53, B7ZQJ9, E1JH25, G5EC37, K8ERR8, O44757, O61366, P06536, P08970, P13864, P15130, P20193, P25049, P26358, P34402, P50534, P70047, P70475, P78347, P97500, Q01538, Q08875, Q08876, Q09449, Q09663, Q12830, Q22811, Q22992, Q24K09, Q581T1, Q5U2Y1, Q60JJ0, Q6E3D4, Q6FRS1, Q7ZXG4, Q80TY4, Q8CFC2, Q8IRH5

Diamond homologs: A4IFA3, A7MB80, B7ZQJ9, P78347, Q5U2Y1, Q6EKJ0, Q86UP8, Q99NI3, Q9ESZ8, Q9JI57, Q9UHL9

SIGNOR signaling

26 interactions.

AEffectBMechanism
GTF2I“up-regulates quantity by expression”HSPA5“transcriptional regulation”
GTF2I“up-regulates activity”ARID3Abinding
GTF2I“up-regulates activity”USF2binding
GTF2I“up-regulates activity”USF1binding
GTF2I“up-regulates activity”KDM1Abinding
GTF2I“up-regulates activity”KDM1Arelocalization
Gbetaup-regulatesGTF2Iphosphorylation
ERK1/2up-regulatesGTF2Iphosphorylation
BTKup-regulatesGTF2Iphosphorylation
GTF2Iup-regulatesPRRX1binding
MAPK1up-regulatesGTF2Iphosphorylation
MAPK3up-regulatesGTF2Iphosphorylation
PRKG1up-regulatesGTF2Iphosphorylation
JAK2“up-regulates activity”GTF2Iphosphorylation
BTK“up-regulates activity”GTF2Iphosphorylation
SRC“up-regulates activity”GTF2Iphosphorylation
GTF2IRD1down-regulatesGTF2I
GTF2I“up-regulates quantity by expression”FOS“transcriptional regulation”
GTF2I“up-regulates quantity by expression”GSC“transcriptional regulation”
GTF2I“up-regulates activity”“Polycomb repressive complex 2”relocalization

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 207 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
VEGFR2 mediated cell proliferation521.0×3e-04
Signaling by FGFR1 in disease510.8×4e-03
mRNA Splicing129.7×1e-06
Processing of Capped Intron-Containing Pre-mRNA159.1×4e-08
Signaling by BRAF and RAF1 fusions78.8×1e-03
Regulation of RAS by GAPs68.5×4e-03
mRNA Splicing - Minor Pathway58.2×1e-02
mRNA Splicing - Major Pathway187.2×4e-08

GO biological processes:

GO termPartnersFoldFDR
positive regulation of fibroblast proliferation610.0×7e-03
mRNA transport68.9×9e-03
mRNA splicing, via spliceosome168.2×1e-07
negative regulation of translation77.7×7e-03
chromatin remodeling135.3×7e-04
RNA splicing105.0×7e-03
mRNA processing114.9×6e-03

Disease & clinical

Cancer significance

From intOGen — cancer-driver classification: activating (oncogene-like) across 1 cancer types — THYM.

Clinical variants and AI predictions

ClinVar

82 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance52
Likely benign11
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
916130NC_000007.13:g.73944168_74138459dupPathogenic

SpliceAI

4802 predictions. Top by Δscore:

VariantEffectΔscore
7:74689119:CACA:Cacceptor_loss1.0000
7:74689120:A:AGacceptor_gain1.0000
7:74689120:ACAG:Aacceptor_gain1.0000
7:74689120:ACAGG:Aacceptor_gain1.0000
7:74689121:C:Gacceptor_gain1.0000
7:74689121:CA:Cacceptor_loss1.0000
7:74689122:A:AGacceptor_gain1.0000
7:74689122:AG:Aacceptor_gain1.0000
7:74689122:AGG:Aacceptor_gain1.0000
7:74689123:G:GCacceptor_loss1.0000
7:74689123:G:GGacceptor_gain1.0000
7:74689123:GG:Gacceptor_gain1.0000
7:74689123:GGG:Gacceptor_gain1.0000
7:74689224:CATG:Cdonor_loss1.0000
7:74689228:G:GAdonor_loss1.0000
7:74689228:G:GGdonor_gain1.0000
7:74689229:T:Adonor_loss1.0000
7:74690969:GTAG:Gacceptor_loss1.0000
7:74690970:TAGT:Tacceptor_loss1.0000
7:74690971:A:AGacceptor_gain1.0000
7:74690971:A:Tacceptor_loss1.0000
7:74690971:AGT:Aacceptor_gain1.0000
7:74690972:G:Aacceptor_loss1.0000
7:74690972:G:GAacceptor_gain1.0000
7:74690972:GT:Gacceptor_gain1.0000
7:74690972:GTG:Gacceptor_gain1.0000
7:74690972:GTGT:Gacceptor_gain1.0000
7:74690972:GTGTA:Gacceptor_gain1.0000
7:74691058:G:GTdonor_gain1.0000
7:74691080:A:Tdonor_gain1.0000

AlphaMissense

6557 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
7:74689205:T:CL26P1.000
7:74689217:T:AL30H1.000
7:74689217:T:CL30P1.000
7:74690973:T:CC34R1.000
7:74690983:T:CL37P1.000
7:74691007:C:AA45D1.000
7:74691009:T:CC46R1.000
7:74691011:C:GC46W1.000
7:74691013:T:AI47N1.000
7:74691015:G:CA48P1.000
7:74691016:C:AA48E1.000
7:74691045:G:AG58R1.000
7:74691045:G:CG58R1.000
7:74691046:G:AG58E1.000
7:74691057:G:AG62R1.000
7:74691057:G:CG62R1.000
7:74691058:G:AG62E1.000
7:74691100:T:CF76S1.000
7:74699069:T:AV116D1.000
7:74730234:T:AW354R1.000
7:74730234:T:CW354R1.000
7:74730236:G:CW354C1.000
7:74730236:G:TW354C1.000
7:74730256:T:CL361P1.000
7:74730258:C:AR362S1.000
7:74730268:T:AV365D1.000
7:74732484:G:CA376P1.000
7:74732590:C:AP411Q1.000
7:74732601:G:AG415R1.000
7:74732601:G:CG415R1.000

dbSNP variants (sampled 300 via entrez): RS1000073094 (7:74675105 G>A), RS1000119497 (7:74687112 A>C), RS1000197518 (7:74690475 A>G), RS1000254818 (7:74718622 T>G), RS1000322207 (7:74668960 T>A), RS1000842545 (7:74693508 A>C), RS1001070241 (7:74691715 A>G), RS1001108272 (7:74662670 G>A,C,T), RS1001125961 (7:74688392 A>G), RS1001154261 (7:74703633 G>T), RS1001504448 (7:74659190 T>C), RS1001554958 (7:74658903 C>T), RS1001882514 (7:74656632 T>C), RS1002058364 (7:74664231 T>C,G), RS1002179744 (7:74665833 T>G)

Disease associations

OMIM: gene MIM:601679 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

186 total (30 of 186 shown, HPO-id order):

HPOTerm
HP:0000010Recurrent urinary tract infections
HP:0000014Abnormality of the bladder
HP:0000015Bladder diverticulum
HP:0000023Inguinal hernia
HP:0000025Functional abnormality of male internal genitalia
HP:0000028Cryptorchidism
HP:0000044Hypogonadotropic hypogonadism
HP:0000075Renal duplication
HP:0000076Vesicoureteral reflux
HP:0000083Renal insufficiency
HP:0000089Renal hypoplasia
HP:0000093Proteinuria
HP:0000121Nephrocalcinosis
HP:0000125Pelvic kidney
HP:0000147Polycystic ovaries
HP:0000154Wide mouth
HP:0000158Macroglossia
HP:0000179Thick lower lip vermilion
HP:0000212Gingival overgrowth
HP:0000232Everted lower lip vermilion
HP:0000252Microcephaly
HP:0000275Narrow face
HP:0000280Coarse facial features
HP:0000286Epicanthus
HP:0000307Pointed chin
HP:0000337Broad forehead
HP:0000343Long philtrum
HP:0000347Micrognathia
HP:0000348High forehead
HP:0000358Posteriorly rotated ears

GWAS associations

12 associations (top):

StudyTraitp-value
GCST002217_1Sjögren’s syndrome1.000000e-53
GCST002541_60Menarche (age at onset)5.000000e-12
GCST003604_1Sjögren’s syndrome1.000000e-15
GCST005752_2Systemic lupus erythematosus3.000000e-14
GCST006061_65Atrial fibrillation3.000000e-09
GCST006414_123Atrial fibrillation3.000000e-10
GCST006630_11Diastolic blood pressure2.000000e-09
GCST006979_127Heel bone mineral density5.000000e-11
GCST007611_11Chronic obstructive pulmonary disease or high blood pressure (pleiotropy)5.000000e-11
GCST007615_25C-reactive protein levels3.000000e-08
GCST010242_355HDL cholesterol levels9.000000e-11
GCST011426_23Systemic lupus erythematosus2.000000e-11

EFO canonical traits (5, from GWAS)

EFO IDTrait name
EFO:0004703age at menarche
EFO:0006336diastolic blood pressure
EFO:0009270heel bone mineral density
EFO:0004458C-reactive protein measurement
EFO:0004612high density lipoprotein cholesterol measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL5724693 (SINGLE PROTEIN)

Molecules with ChEMBL bioactivity

1 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 1,538 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL1232461MOLIBRESIB21,538

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

5 potent at pChembl≥5 of 5 total, top 5 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
7.03Kd94.5nMCHEMBL5653589
7.03ED5094.5nMCHEMBL5653589
5.56Kd2772nMCHEMBL3752910
5.56ED502772nMCHEMBL3752910
5.12Kd7551nMMOLIBRESIB

PubChem BioAssay actives

3 with measured affinity, of 11 total; 3 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
4-methyl-3-[(2-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2148473: Binding affinity to human GTF2I incubated for 45 mins by Kinobead based pull down assaykd0.0945uM
4-methyl-3-[(1-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2148473: Binding affinity to human GTF2I incubated for 45 mins by Kinobead based pull down assaykd2.7718uM
2-[(4S)-6-(4-chlorophenyl)-8-methoxy-1-methyl-4H-[1,2,4]triazolo[4,3-a][1,4]benzodiazepin-4-yl]-N-ethylacetamide2179229: Binding affinity against GTF2I (unknown origin) assessed as apparent dissociation constant incubated for 1 hr by colloidal coomassie staining based LC-MS/MS analysiskd7.5510uM

CTD chemical–gene interactions

54 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, decreases expression, affects expression5
trichostatin Adecreases expression, affects expression, affects cotreatment3
sodium arsenitedecreases expression, increases expression2
Acetaminophendecreases expression2
Air Pollutantsincreases abundance, increases expression, decreases expression2
Benzo(a)pyrenedecreases expression, increases methylation2
Particulate Matterdecreases expression, increases abundance, increases expression2
aristolochic acid Idecreases expression1
TAK-243decreases sumoylation1
triphenyl phosphateaffects expression1
bisphenol Aincreases expression1
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
W 7decreases reaction, increases expression1
cobaltous chloridedecreases expression1
butyraldehydedecreases expression1
perfluorooctanoic aciddecreases expression1
ochratoxin Adecreases expression1
cerous chloridedecreases expression1
coumarindecreases phosphorylation1
beta-methylcholineaffects expression1
celastrolincreases expression1
perfluorooctane sulfonic aciddecreases expression1
2,3,5-(triglutathion-S-yl)hydroquinoneaffects localization, decreases phosphorylation, affects activity, increases abundance, increases ADP-ribosylation1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression1
abrinedecreases expression1
dorsomorphinaffects cotreatment, decreases expression1
1-(4-(6-bromobenzo(1,3)dioxol-5-yl)-3a,4,5,9b-tetrahydro-3H-cyclopenta(c)quinolin-8-yl)ethanonedecreases phosphorylation1
bisphenol AFincreases expression1
Zoledronic Acidincreases expression1
Vorinostatdecreases expression1

ChEMBL screening assays

8 unique, capped per target: 8 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5651515BindingBinding affinity to human GTF2I incubated for 45 mins by Kinobead based pull down assayNVP-BHG712: Effects of Regioisomers on the Affinity and Selectivity toward the EPHrin Family. — ChemMedChem

Cellosaurus cell lines

2 cell lines: 2 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_SQ72HAP1 GTF2I (-) 1Cancer cell lineMale
CVCL_XP44HAP1 GTF2I (-) 2Cancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): Sjogren syndrome