GTF2A1

gene
On this page

Also known as TFIIA

Summary

GTF2A1 (general transcription factor IIA subunit 1, HGNC:4646) is a protein-coding gene on chromosome 14q31.1, encoding Transcription initiation factor IIA subunit 1 (P52655). TFIIA is a component of the transcription machinery of RNA polymerase II and plays an important role in transcriptional activation. It is a selective cancer dependency (DepMap: 88.8% of cell lines).

Accurate transcription initiation on TATA-containing class II genes involves the ordered assembly of RNA polymerase II (POLR2A; MIM 180660) and several general initiation factors (summarized by DeJong and Roeder, 1993 [PubMed 8224848]). One of these factors is TFIIA, which when purified from HeLa extracts consists of 35-, 19-, and 12-kD subunits.

Source: NCBI Gene 2957 — RefSeq curated summary.

At a glance

  • GWAS associations: 7
  • Clinical variants (ClinVar): 37 total
  • Druggable target: yes
  • Cancer dependency (DepMap): dependent in 88.8% of screened cell lines
  • MANE Select transcript: NM_015859

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:4646
Approved symbolGTF2A1
Namegeneral transcription factor IIA subunit 1
Location14q31.1
Locus typegene with protein product
StatusApproved
AliasesTFIIA
Ensembl geneENSG00000165417
Ensembl biotypeprotein_coding
OMIM600520
Entrez2957

Gene structure

Transcript identifiers

Ensembl transcripts: 5 — 3 protein_coding, 1 nonsense_mediated_decay, 1 retained_intron

ENST00000298173, ENST00000434192, ENST00000553612, ENST00000556268, ENST00000865083

RefSeq mRNA: 3 — MANE Select: NM_015859 NM_001278940, NM_015859, NM_201595

CCDS: CCDS9873, CCDS9874

Canonical transcript exons

ENST00000553612 — 9 exons

ExonStartEnd
ENSE000024515078122048981220959
ENSE000024790988117545281180330
ENSE000034586368120390081204104
ENSE000034797378120159481201658
ENSE000034816718119610881196241
ENSE000035777948119740981197484
ENSE000035824848121641381216514
ENSE000035844208118553181185620
ENSE000036934648119251981192839

Expression profiles

Bgee: expression breadth ubiquitous, 271 present calls, max score 94.48.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 39.2074 / max 1587.4265, expressed in 1818 samples.

FANTOM5 promoters (9 alternative TSS)

Promoter IDTPM avgSamples expressed
14430519.58671799
14430610.92151780
1443044.93931668
1443021.1707774
1443081.0789675
1443090.5862342
1443070.4471185
1443100.2809131
1443030.196188

Top tissues by expression

298 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
esophagus squamous epitheliumUBERON:000692094.48gold quality
tibiaUBERON:000097994.24gold quality
corpus epididymisUBERON:000435994.14gold quality
ponsUBERON:000098892.41gold quality
trabecular bone tissueUBERON:000248392.35gold quality
jejunal mucosaUBERON:000039992.31gold quality
oral cavityUBERON:000016792.20gold quality
postcentral gyrusUBERON:000258192.14gold quality
caput epididymisUBERON:000435892.04gold quality
calcaneal tendonUBERON:000370192.02gold quality
tendon of biceps brachiiUBERON:000818891.99gold quality
cauda epididymisUBERON:000436091.97gold quality
parietal pleuraUBERON:000240091.93gold quality
lateral nuclear group of thalamusUBERON:000273691.81gold quality
tendonUBERON:000004391.78gold quality
upper leg skinUBERON:000426291.75gold quality
penisUBERON:000098991.61gold quality
adult organismUBERON:000702391.59gold quality
lateral globus pallidusUBERON:000247691.58gold quality
parietal lobeUBERON:000187291.49gold quality
substantia nigra pars compactaUBERON:000196591.40gold quality
medial globus pallidusUBERON:000247791.32gold quality
superficial temporal arteryUBERON:000161491.25gold quality
mucosa of paranasal sinusUBERON:000503091.17gold quality
globus pallidusUBERON:000187591.11gold quality
jejunumUBERON:000211591.10gold quality
biceps brachiiUBERON:000150790.89gold quality
superior vestibular nucleusUBERON:000722790.89gold quality
germinal epithelium of ovaryUBERON:000130490.84gold quality
seminal vesicleUBERON:000099890.82gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.93
E-CURD-10no521.39

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

1 targets.

TargetRegulation
TBPUnknown

Upstream regulators (CollecTRI, top): TAF1, TBP, TBPL1

miRNA regulators (miRDB)

315 targeting GTF2A1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-3163100.0077.238605
HSA-LET-7A-3P100.0074.033932
HSA-LET-7B-3P100.0074.083913
HSA-MIR-98-3P100.0074.083907
HSA-MIR-4510100.0066.602050
HSA-MIR-6127100.0066.762188
HSA-MIR-6129100.0066.462080
HSA-MIR-6130100.0066.692012
HSA-MIR-6133100.0066.482064
HSA-MIR-5692B100.0071.322622
HSA-MIR-5692C100.0071.322622
HSA-LET-7F-1-3P100.0074.023928
HSA-MIR-5692A100.0074.406850
HSA-MIR-513A-5P100.0069.772465
HSA-MIR-4533100.0069.482758
HSA-MIR-196A-5P100.0068.16684
HSA-MIR-196B-5P100.0068.16681
HSA-MIR-4476100.0068.182030
HSA-MIR-6876-5P100.0067.682126
HSA-MIR-4668-3P100.0068.742635
HSA-MIR-3646100.0073.565283
HSA-MIR-340-5P100.0072.504437
HSA-MIR-126-5P100.0072.713180
HSA-MIR-428299.9975.366408
HSA-MIR-366299.9973.825684
HSA-MIR-4789-3P99.9970.752484
HSA-MIR-548AW99.9972.573559
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-520D-5P99.9873.344883

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 88.8% of screened cell lines.

Literature-anchored findings (GeneRIF, showing 13)

  • V96F substitution of a deamidated human GTF2A1 (TFIIA) peptide promotes T-cell activation in an in vitro model of Celiac disease inflammation. (PMID:12054857)
  • spatial positioning of the DNA-bound activation domain is important for efficient activation, possibly by maximizing its interactions with the transcriptional machinery including the TBP-TFIIA-TFIIB-promoter quaternary complex (PMID:12538582)
  • Results present the X-ray structures of human and yeast TATA box-binding protein /transcription factor IIA/DNA complexes at 2.1A and 1.9A resolution, respectively (PMID:12972251)
  • transcription factor IIA may regulate the intracellular molecular state and the function of TATA-binding protein-like protein (TLP) through its property of binding to TLP (PMID:14570910)
  • Transfected taspase 1 enhances cleavage of TFIIA, and RNA interference knockdown of endogenous taspase 1 diminishes cleavage of TFIIA in vivo. (PMID:16537915)
  • TFIIA induces a conformational change within the TBP/TATA complex that enhances its stability under both in vitro and physiological salt conditions. (PMID:17681538)
  • GTF2A1 alone, or GTF2A1 plus HAAO are excellent candidate biomarkers for detecting Ovarian cancer (PMID:19724865)
  • Our results provide first evidence that Taspase1 processing affects TFIIA regulation of TFIID and suggest that Taspase1 processing of TFIIA is required to establish INR-selective core promoter activity in the presence of NC2. (PMID:25996597)
  • The interaction of TBP-like protein with TFIIA is essential to suppress proteasome-dependent turnover of TBP-like protein (PMID:27696626)
  • In a cohort of youth at risk for bipolar disorder, pathway analysis showed an enrichment of the glucocorticoid receptor (GR) pathway with the genes MED1, HSPA1L, GTF2A1 and TAF15, which might underlie the previously reported role of stress response in the risk for bipolar disorder in vulnerable populations. (PMID:28291257)
  • Data suggest that many TATA-box-containing promoters contain a TFIIA recognition element (IIARE) that activates transcription by increasing the recruitment of RNA polymerase II, TFIIA (TF2A1/TF2A2 subunits), TAF4, and P300. (TAF4 = TATA box binding protein (TBP)-associated factor, RNA polymerase II, C1; P300 = E1A binding protein p300) (PMID:28539359)
  • De novo produced TFIIA is rapidly confined to the cytoplasm via an evolutionary conserved nuclear export signal (NES, amino acids 21VINDVRDIFL30), interacting with the nuclear export receptor Exportin-1/chromosomal region maintenance 1 (Crm1). (PMID:28992066)
  • Proteolytic processing of TFIIA by Taspase1 was found to mask evolutionary conserved nuclear export signal, thereby promoting nuclear localization and transcriptional activation of TFIIA target genes, such as CDKN2A. (PMID:29097782)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriogtf2a1ENSDARG00000011000
mus_musculusGtf2a1ENSMUSG00000020962
rattus_norvegicusGtf2a1ENSRNOG00000004300
drosophila_melanogasterTfIIA-LFBGN0011289
caenorhabditis_elegansWBGENE00004136

Paralogs (1): GTF2A1L (ENSG00000242441)

Protein

Protein identifiers

Transcription initiation factor IIA subunit 1P52655 (reviewed: P52655)

Alternative names: General transcription factor IIA subunit 1, TFIIAL, Transcription initiation factor TFIIA 42 kDa subunit

All UniProt accessions (2): P52655, J3KNC0

UniProt curated annotations — full annotation on UniProt →

Function. TFIIA is a component of the transcription machinery of RNA polymerase II and plays an important role in transcriptional activation. TFIIA in a complex with TBP mediates transcriptional activity.

Subunit / interactions. TFIIA is a heterodimer of the large unprocessed subunit 1 and a small subunit gamma. It was originally believed to be a heterotrimer of an alpha (p35), a beta (p19) and a gamma subunit (p12). TFIIA forms a complex with TBP. Part of TBP-based Pol II pre-initiation complex (PIC), in which Pol II core assembles with general transcription factors and other specific initiation factors including GTF2E1, GTF2E2, GTF2F1, GTF2F2, TCEA1, ERCC2, ERCC3, GTF2H2, GTF2H3, GTF2H4, GTF2H5, GTF2A1, GTF2A2, GTF2B and TBP; this large multi-subunit PIC complex mediates DNA unwinding and targets Pol II core to the transcription start site where the first phosphodiester bond forms.

Subcellular location. Nucleus.

Post-translational modifications. The alpha and beta subunits are postranslationally produced from the precursor form by TASP1. The cleavage promotes proteasomal degradation.

Similarity. Belongs to the TFIIA subunit 1 family.

Isoforms (2)

UniProt IDNamesCanonical?
P52655-142 kDayes
P52655-237 kDa

RefSeq proteins (3): NP_001265869, NP_056943, NP_963889 (=MANE)

Domains & families (InterPro)

IDNameType
IPR004855TFIIA_asu/bsuFamily
IPR009088TFIIA_b-brlHomologous_superfamily

Pfam: PF03153

UniProt features (42 total): mutagenesis site 13, modified residue 5, strand 5, chain 3, helix 3, region of interest 3, compositionally biased region 3, binding site 2, sequence variant 2, initiator methionine 1, site 1, splice variant 1

Structure

Experimental structures (PDB)

52 structures, top 30 by resolution.

PDBMethodResolution (Å)
1NVPX-RAY DIFFRACTION2.1
5M4SX-RAY DIFFRACTION2.38
7NVUELECTRON MICROSCOPY2.5
7NVSELECTRON MICROSCOPY2.8
7NVTELECTRON MICROSCOPY2.9
8S52ELECTRON MICROSCOPY2.9
7ZWDELECTRON MICROSCOPY3
7ZX8ELECTRON MICROSCOPY3
8S51ELECTRON MICROSCOPY3.1
7ZWCELECTRON MICROSCOPY3.2
7EGBELECTRON MICROSCOPY3.3
7ZX7ELECTRON MICROSCOPY3.4
8S5NELECTRON MICROSCOPY3.4
7ZXEELECTRON MICROSCOPY3.5
7EG9ELECTRON MICROSCOPY3.7
8BZ1ELECTRON MICROSCOPY3.8
5IYBELECTRON MICROSCOPY3.9
5IYCELECTRON MICROSCOPY3.9
5IYDELECTRON MICROSCOPY3.9
7EGCELECTRON MICROSCOPY3.9
8BVWELECTRON MICROSCOPY4
7EGAELECTRON MICROSCOPY4.1
8BYQELECTRON MICROSCOPY4.1
8GXSELECTRON MICROSCOPY4.16
7EDXELECTRON MICROSCOPY4.5
7NVRELECTRON MICROSCOPY4.5
7LBMELECTRON MICROSCOPY4.8
8GXQELECTRON MICROSCOPY5.04
5IYAELECTRON MICROSCOPY5.4
8WAKELECTRON MICROSCOPY5.47

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P52655-F156.930.21

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 274–275 (cleavage; by tasp1)

Ligand- & substrate-binding residues (2): 343; 344

Post-translational modifications (5): 2, 280, 281, 316, 321

Mutagenesis-validated functional residues (13):

PositionPhenotype
270slightly affects cleavage and yields elevated levels of the precursor.
272abolishes cleavage.
273abolishes cleavage.
274abolishes cleavage.
275abolishes cleavage.
276does not affect cleavage.
277does not affect cleavage.
278significant reduction of cleavage.
280slightly affects cleavage, yields elevated levels of the precursor. eliminates phosphorylation; when associated with a-2
281eliminates phosphorylation; when associated with a-280; a-316 and a-321.
282slightly affects cleavage and yields elevated levels of the precursor.
316strongly reduces phosphorylation; when associated with a-321. eliminates phosphorylation; when associated with a-280; a-
321strongly reduces phosphorylation; when associated with a-316. eliminates phosphorylation; when associated with a-280; a-

Function

Pathways and Gene Ontology

Reactome pathways

21 pathways

IDPathway
R-HSA-167161HIV Transcription Initiation
R-HSA-167162RNA Polymerase II HIV Promoter Escape
R-HSA-167172Transcription of the HIV genome
R-HSA-674695RNA Polymerase II Pre-transcription Events
R-HSA-6807505RNA polymerase II transcribes snRNA genes
R-HSA-73776RNA Polymerase II Promoter Escape
R-HSA-73779RNA Polymerase II Transcription Pre-Initiation And Promoter Opening
R-HSA-75953RNA Polymerase II Transcription Initiation
R-HSA-76042RNA Polymerase II Transcription Initiation And Promoter Clearance
R-HSA-9018519Estrogen-dependent gene expression
R-HSA-162582Signal Transduction
R-HSA-162587HIV Life Cycle
R-HSA-162599Late Phase of HIV Life Cycle
R-HSA-162906HIV Infection
R-HSA-1643685Disease
R-HSA-5663205Infectious disease
R-HSA-73857RNA Polymerase II Transcription
R-HSA-74160Gene expression (Transcription)
R-HSA-8939211ESR-mediated signaling
R-HSA-9006931Signaling by Nuclear Receptors
R-HSA-9824446Viral Infection Pathways

MSigDB gene sets: 361 (showing top): AHRARNT_01, HNF3ALPHA_Q6, TGCGCANK_UNKNOWN, GOBP_POSITIVE_REGULATION_OF_DNA_TEMPLATED_TRANSCRIPTION_INITIATION, CGGAARNGGCNG_UNKNOWN, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_UP, IVANOVA_HEMATOPOIESIS_MATURE_CELL, CREBP1_Q2, RACCACAR_AML_Q6, BROWNE_HCMV_INFECTION_12HR_UP, USF_C, CREB_Q4, REACTOME_HIV_INFECTION, YY1_02, USF_01

GO Biological Process (5): transcription by RNA polymerase II (GO:0006366), positive regulation of transcription by RNA polymerase II (GO:0045944), RNA polymerase II preinitiation complex assembly (GO:0051123), positive regulation of transcription initiation by RNA polymerase II (GO:0060261), transcription initiation at RNA polymerase II promoter (GO:0006367)

GO Molecular Function (8): RNA polymerase II core promoter sequence-specific DNA binding (GO:0000979), RNA polymerase II general transcription initiation factor binding (GO:0001091), DNA binding (GO:0003677), RNA polymerase II general transcription initiation factor activity (GO:0016251), TBP-class protein binding (GO:0017025), protein heterodimerization activity (GO:0046982), RNA polymerase II-specific DNA-binding transcription factor binding (GO:0061629), protein binding (GO:0005515)

GO Cellular Component (7): nucleus (GO:0005634), nucleoplasm (GO:0005654), transcription factor TFIID complex (GO:0005669), transcription factor TFIIA complex (GO:0005672), cytosol (GO:0005829), transcription preinitiation complex (GO:0097550), cytoplasm (GO:0005737)

Reactome top-level categories

Rollup of top-12 pathways:

CategoryPathways
RNA Polymerase II Transcription4
Transcription of the HIV genome2
RNA Polymerase II Transcription Initiation And Promoter Clearance2
Late Phase of HIV Life Cycle1
ESR-mediated signaling1
HIV Infection1
HIV Life Cycle1
Viral Infection Pathways1
Disease1
Gene expression (Transcription)1
Signaling by Nuclear Receptors1
Signal Transduction1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
transcription by RNA polymerase II3
cellular anatomical structure3
transcription initiation at RNA polymerase II promoter2
general transcription initiation factor binding2
RNA polymerase II, holoenzyme2
RNA polymerase II transcription regulator complex2
DNA-templated transcription1
regulation of transcription by RNA polymerase II1
positive regulation of DNA-templated transcription1
transcription preinitiation complex assembly1
positive regulation of transcription by RNA polymerase II1
regulation of transcription initiation by RNA polymerase II1
positive regulation of DNA-templated transcription initiation1
DNA-templated transcription initiation1
RNA polymerase II transcription regulatory region sequence-specific DNA binding1
core promoter sequence-specific DNA binding1
RNA polymerase II complex binding1
nucleic acid binding1
general transcription initiation factor activity1
protein dimerization activity1
DNA-binding transcription factor binding1
binding1
intracellular membrane-bounded organelle1
nuclear lumen1
cytoplasm1
protein-DNA complex1
intracellular anatomical structure1

Protein interactions and networks

STRING

1418 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GTF2A1GTF2A2P52657962
GTF2A1TBPP20226861
GTF2A1GTF2BQ00403861
GTF2A1TBPL1P62380858
GTF2A1STON1Q9Y6Q2847
GTF2A1STON2Q8WXE9817
GTF2A1MED1Q15648781
GTF2A1POLR2AP24928772
GTF2A1ERCC3P19447766
GTF2A1GTF2E1P29083732
GTF2A1MED27Q6P2C8717
GTF2A1MED26O95402668
GTF2A1MED7O43513649
GTF2A1MED23Q9ULK4648
GTF2A1MED14O60244597

IntAct

75 interactions, top by confidence:

ABTypeScore
GTF2A1GTF2A2psi-mi:“MI:0915”(physical association)0.940
GTF2A2GTF2A1psi-mi:“MI:0915”(physical association)0.940
GTF2A1GTF2A2psi-mi:“MI:0407”(direct interaction)0.940
GTF2A1TBPL1psi-mi:“MI:0915”(physical association)0.850
TBPL1GTF2A1psi-mi:“MI:0915”(physical association)0.850
POLR2EPOLR2Dpsi-mi:“MI:0915”(physical association)0.790
TBPGTF2A1psi-mi:“MI:0915”(physical association)0.740
GTF2A1TBPpsi-mi:“MI:0914”(association)0.740
MAPK7PFDN6psi-mi:“MI:0914”(association)0.640
GTF2A1NFYApsi-mi:“MI:0915”(physical association)0.560
NFYAGTF2A1psi-mi:“MI:0915”(physical association)0.560
GTF2A1PRR20Dpsi-mi:“MI:0915”(physical association)0.560
GTF2A1SP4psi-mi:“MI:0915”(physical association)0.560
GTF2A1BIRC7psi-mi:“MI:0915”(physical association)0.560
GTF2A1PSMC6psi-mi:“MI:0915”(physical association)0.560
GTF2A1psi-mi:“MI:0915”(physical association)0.560
GTF2A1ERN1psi-mi:“MI:0915”(physical association)0.560
GTF2A1FGFR3psi-mi:“MI:0915”(physical association)0.560
GTF2A1FKBP1Apsi-mi:“MI:0915”(physical association)0.560
GTF2A1HRASpsi-mi:“MI:0915”(physical association)0.560
GTF2A1PMP22psi-mi:“MI:0915”(physical association)0.560

BioGRID (128): GTF2A1 (Affinity Capture-MS), GTF2A2 (Two-hybrid), NFYA (Two-hybrid), GTF2A1 (Affinity Capture-RNA), GTF2A1 (Affinity Capture-Western), TBP (Affinity Capture-Western), Tbp (Affinity Capture-Western), GTF2A1 (Affinity Capture-MS), C17orf62 (Affinity Capture-MS), INPPL1 (Affinity Capture-MS), GTF2A2 (Affinity Capture-MS), TBPL1 (Affinity Capture-MS), PTCD2 (Affinity Capture-MS), TBP (Affinity Capture-MS), GTF2A2 (Co-fractionation)

ESM2 similar proteins: A1Z9E2, A4QNP0, B0R0I6, B5DE69, F1MH24, F1SPM8, O08949, O94842, P0C1X8, P0CF24, P27699, P34333, P52655, P79145, Q01167, Q02086, Q03061, Q08CM4, Q09XV5, Q0IHV2, Q0P5K4, Q1LZH5, Q2M2I8, Q3TUF7, Q3UCQ1, Q3UHJ0, Q571G4, Q58NQ4, Q5E9U0, Q5QL03, Q5R6A9, Q5RBN8, Q5RCU0, Q641Z1, Q6DJL0, Q6IRR0, Q6MZP7, Q7ZUV7, Q7ZX03, Q86NP2

Diamond homologs: O08949, P32773, P52654, P52655, Q54G80, Q5RCU0, Q8R4I4, Q8SW23, Q99PM3, Q9UNN4, Q9USU9

SIGNOR signaling

13 interactions.

AEffectBMechanism
TAF1“up-regulates activity”GTF2A1phosphorylation
TAF1up-regulatesGTF2A1phosphorylation
GTF2A1“form complex”TFIIAbinding
CSNK2A1“up-regulates activity”GTF2A1phosphorylation
CSNK2A2“up-regulates activity”GTF2A1phosphorylation

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 36 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
HIV Transcription Initiation653.8×6e-08
RNA Polymerase II HIV Promoter Escape653.8×6e-08
RNA Polymerase II Promoter Escape653.8×6e-08
RNA Polymerase II Transcription Pre-Initiation And Promoter Opening653.8×6e-08
RNA Polymerase II Transcription Initiation653.8×6e-08
RNA Polymerase II Transcription Initiation And Promoter Clearance653.8×6e-08
Transcription of the HIV genome639.9×3e-07
Late Phase of HIV Life Cycle532.3×1e-05

GO biological processes:

GO termPartnersFoldFDR
transcription initiation at RNA polymerase II promoter555.1×4e-06
transcription by RNA polymerase II816.6×4e-06

Disease & clinical

Clinical variants and AI predictions

ClinVar

37 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance25
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

2181 predictions. Top by Δscore:

VariantEffectΔscore
14:81185526:CTTA:Cdonor_loss1.0000
14:81185527:TTACC:Tdonor_loss1.0000
14:81185528:TACC:Tdonor_loss1.0000
14:81185529:A:ACdonor_gain1.0000
14:81185529:A:ATdonor_loss1.0000
14:81185530:C:Adonor_loss1.0000
14:81185530:C:CCdonor_gain1.0000
14:81185616:GGCTC:Gacceptor_gain1.0000
14:81185618:CTC:Cacceptor_gain1.0000
14:81185619:TC:Tacceptor_gain1.0000
14:81185620:CC:Cacceptor_gain1.0000
14:81185621:C:CCacceptor_gain1.0000
14:81185627:A:ACacceptor_gain1.0000
14:81185627:A:Cacceptor_gain1.0000
14:81192512:AACTT:Adonor_loss1.0000
14:81192513:ACTTA:Adonor_loss1.0000
14:81192514:CTT:Cdonor_loss1.0000
14:81192515:TTA:Tdonor_loss1.0000
14:81192516:TA:Tdonor_loss1.0000
14:81192517:A:ACdonor_gain1.0000
14:81192517:A:ATdonor_loss1.0000
14:81192517:ACTT:Adonor_gain1.0000
14:81192518:C:CGdonor_gain1.0000
14:81192518:CT:Cdonor_gain1.0000
14:81192518:CTT:Cdonor_gain1.0000
14:81192518:CTTC:Cdonor_gain1.0000
14:81192520:T:TAdonor_gain1.0000
14:81192836:GCAC:Gacceptor_gain1.0000
14:81192837:CAC:Cacceptor_gain1.0000
14:81192837:CACC:Cacceptor_gain1.0000

AlphaMissense

2484 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
14:81180226:C:AW376C1.000
14:81180226:C:GW376C1.000
14:81180228:A:GW376R1.000
14:81180228:A:TW376R1.000
14:81180234:C:GA374P1.000
14:81180239:C:TG372E1.000
14:81180240:C:GG372R1.000
14:81180240:C:TG372R1.000
14:81180245:G:TA370D1.000
14:81180246:C:GA370P1.000
14:81180248:T:AK369I1.000
14:81180253:A:CF367L1.000
14:81180253:A:TF367L1.000
14:81180254:A:CF367C1.000
14:81180254:A:GF367S1.000
14:81180255:A:GF367L1.000
14:81180275:A:GL360P1.000
14:81180281:A:TM358K1.000
14:81180287:C:AG356V1.000
14:81180287:C:TG356D1.000
14:81180288:C:AG356C1.000
14:81180288:C:GG356R1.000
14:81180288:C:TG356S1.000
14:81180292:C:AK354N1.000
14:81180292:C:GK354N1.000
14:81180294:T:CK354E1.000
14:81180296:A:GL353P1.000
14:81180301:A:CF351L1.000
14:81180301:A:TF351L1.000
14:81180302:A:GF351S1.000

dbSNP variants (sampled 300 via entrez): RS1000042907 (14:81221684 C>G), RS1000095806 (14:81177870 C>T), RS1000116247 (14:81194398 A>C,G), RS1000164876 (14:81179548 A>G), RS1000367664 (14:81223205 G>A), RS1000449937 (14:81193217 A>C), RS1000477944 (14:81175914 C>T), RS1000488463 (14:81190753 G>A), RS1000589701 (14:81183173 T>C), RS1000593436 (14:81177558 G>A), RS1000773384 (14:81189173 T>C), RS1000775489 (14:81180753 TA>T,TAA), RS1000793252 (14:81187584 G>T), RS1000839249 (14:81190378 G>A), RS1000903846 (14:81195249 C>T)

Disease associations

OMIM: gene MIM:600520 | disease phenotypes: MIM:220200

GenCC curated gene-disease

Mondo (1): Dandy-Walker syndrome (MONDO:0009072)

Orphanet (1): Isolated Dandy-Walker malformation (Orphanet:217)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

7 associations (top):

StudyTraitp-value
GCST003542_76Night sleep phenotypes4.000000e-06
GCST010241_52Apolipoprotein A1 levels7.000000e-15
GCST010242_337HDL cholesterol levels3.000000e-13
GCST010653_82Thyroid stimulating hormone levels7.000000e-22
GCST010653_83Thyroid stimulating hormone levels1.000000e-12
GCST90002398_223Neutrophil count8.000000e-16
GCST90002407_578White blood cell count3.000000e-15

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0004614apolipoprotein A 1 measurement
EFO:0004612high density lipoprotein cholesterol measurement
EFO:0004833neutrophil count

MeSH disease descriptors (1)

DescriptorNameTree numbers
D003616Dandy-Walker SyndromeC10.228.140.252.300; C10.228.140.602.500; C10.500.205; C16.131.666.205

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL4832 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

Binding affinities (BindingDB)

22 measured of 22 human assays (22 total across all organisms); most potent 22 below. Values come from heterogeneous assays and are not directly comparable.

LigandMeasureValuePatent
(2S)-2-[[(2R)-2-(benzylsulfonylamino)-4-pyridin-4-ylbutanoyl]amino]-N-[(4-carbamimidoylphenyl)methyl]-4-piperidin-4-ylbutanamideKI20 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2S)-2-[[(2R)-2-(benzylsulfonylamino)-4-piperidin-4-ylbutanoyl]amino]-N-[(4-carbamimidoylphenyl)methyl]-4-piperidin-4-ylbutanamideKI90 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2S)-2-[[(2R)-2-(benzylsulfonylamino)-4-(1-oxidopyridin-1-ium-4-yl)butanoyl]amino]-N-[(4-carbamimidoylphenyl)methyl]-4-piperidin-4-ylbutanamideKI120 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2R)-2-(benzylsulfonylamino)-N-[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]-5-piperidin-4-ylpentanamideKI300 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2R)-2-(benzylsulfonylamino)-N-[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]-5-(1-oxidopyridin-1-ium-4-yl)pentanamideKI400 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
methyl 4-[(4R)-4-(benzylsulfonylamino)-5-[[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]amino]-5-oxopentyl]piperidine-1-carboxylateKI400 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
BDBM108100KI750 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2R)-2-(benzylsulfonylamino)-N-[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]-5-pyridin-4-ylpentanamideKI850 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2R)-2-(benzylsulfonylamino)-N-[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]-5-phenylpentanamideKI1100 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2S)-2-[[(2R)-2-(benzylsulfonylamino)-4-[1-(2-methoxyacetyl)piperidin-4-yl]butanoyl]amino]-N-[(4-carbamimidoylphenyl)methyl]-4-piperidin-4-ylbutanamideKI1600 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2S)-2-[[(2R)-2-(benzylsulfonylamino)-2-phenylacetyl]amino]-N-[(4-carbamimidoylphenyl)methyl]-4-piperidin-4-ylbutanamideKI1600 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
4-[(3R)-3-(benzylsulfonylamino)-4-[[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]amino]-4-oxobutyl]-N-methylpiperidine-1-carboxamideKI1700 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2S)-2-[[(2R)-4-(1-acetylpiperidin-4-yl)-2-(benzylsulfonylamino)butanoyl]amino]-N-[(4-carbamimidoylphenyl)methyl]-4-piperidin-4-ylbutanamideKI1900 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
methyl 4-[(3R)-3-(benzylsulfonylamino)-4-[[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]amino]-4-oxobutyl]piperidine-1-carboxylateKI2000 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2S)-2-[[(2R)-2-(benzylsulfonylamino)-4-(1-butanoylpiperidin-4-yl)butanoyl]amino]-N-[(4-carbamimidoylphenyl)methyl]-4-piperidin-4-ylbutanamideKI2200 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2S)-2-[[(2R)-2-(benzylsulfonylamino)-4-[1-(cyclopropanecarbonyl)piperidin-4-yl]butanoyl]amino]-N-[(4-carbamimidoylphenyl)methyl]-4-piperidin-4-ylbutanamideKI2300 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2S)-2-[[(2R)-2-(benzylsulfonylamino)-4-(1-propanoylpiperidin-4-yl)butanoyl]amino]-N-[(4-carbamimidoylphenyl)methyl]-4-piperidin-4-ylbutanamideKI2300 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
3-[[(2R)-1-[[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]amino]-1-oxo-5-phenylpentan-2-yl]sulfamoylmethyl]benzoic acidKI2400 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2R)-2-(benzylsulfonylamino)-N-[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]-5-(tetrazol-1-yl)pentanamideKI2400 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
4-[(3R)-3-(benzylsulfonylamino)-4-[[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]amino]-4-oxobutyl]-N,N-dimethylpiperidine-1-carboxamideKI2500 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
(2S)-2-[[(2R)-2-(benzylsulfonylamino)-4-[1-(2-methylpropanoyl)piperidin-4-yl]butanoyl]amino]-N-[(4-carbamimidoylphenyl)methyl]-4-piperidin-4-ylbutanamideKI3200 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use
methyl 3-[[(2R)-1-[[(2S)-1-[(4-carbamimidoylphenyl)methylamino]-1-oxo-4-piperidin-4-ylbutan-2-yl]amino]-1-oxo-5-phenylpentan-2-yl]sulfamoylmethyl]benzoateKI5900 nMUS-8598206: Trypsin-like serine protease inhibitors, and their preparation and use

ChEMBL bioactivities

20 potent at pChembl≥5 of 24 total, top 19 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
6.62Ki240nMCHEMBL3660186
6.10Ki800nMCHEMBL3660173
6.04Ki920nMCHEMBL3660190
5.92Ki1200nMCHEMBL3660191
5.77Ki1700nMCHEMBL3660187
5.44Ki3600nMCHEMBL3660189
5.42Ki3800nMCHEMBL3660176
5.41Ki3900nMCHEMBL3660179
5.38Ki4200nMCHEMBL3660177
5.36Ki4400nMCHEMBL3660171
5.35Ki4500nMCHEMBL3660178
5.33Ki4700nMCHEMBL3660192
5.32Ki4800nMCHEMBL3660181
5.31Ki4900nMCHEMBL3660184
5.26Ki5500nMCHEMBL3660182
5.25Ki5600nMCHEMBL3660175
5.10Ki8000nMCHEMBL3660172
5.07Ki8500nMCHEMBL3660174
5.05Ki9000nMCHEMBL3660188

CTD chemical–gene interactions

34 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
trichostatin Adecreases expression2
aristolochic acid Idecreases expression1
geldanamycinincreases expression1
triphenyl phosphateaffects expression1
bisphenol Aaffects cotreatment, increases expression1
sodium arsenatedecreases expression1
arseniteaffects binding, decreases reaction1
sodium arseniteincreases expression1
CGP 52608affects binding, increases reaction1
Resveratrolincreases expression, affects cotreatment1
Arsenic Trioxideincreases sumoylation1
Vorinostatdecreases expression1
Acetaminophenincreases expression1
Gemcitabineaffects expression1
Air Pollutantsdecreases expression, increases abundance1
Arsenicaffects methylation1
Benzo(a)pyreneaffects methylation1
Butyratesdecreases expression1
Dexamethasoneaffects cotreatment, increases expression1
Enzyme Inhibitorsincreases O-linked glycosylation, decreases activity1
Folic Aciddecreases expression1
Indomethacinaffects cotreatment, increases expression1
Ivermectindecreases expression1
Plant Extractsaffects cotreatment, increases expression1
Smokedecreases expression, increases abundance1
Thimerosaldecreases expression1
Tobacco Smoke Pollutionincreases expression1
Tretinoindecreases expression1
Valproic Aciddecreases methylation1
1-Methyl-3-isobutylxanthineaffects cotreatment, increases expression1

ChEMBL screening assays

2 unique, capped per target: 2 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL3705624BindingInhibition Assay: Inhibition of human FIIa was determined by the method described in [0092]-[0098] using human alpha-thrombin from Enzyme Research Laboratories at 0.1 NIH U/mL and Mes-d-Cha-Gly-Arg-pNA (Pefachrome tPA) at 2 mM, 1 mM, and 0.Trypsin-like serine protease inhibitors, and their preparation and use

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): Dandy-Walker syndrome