TAF4

gene
On this page

Also known as TAFII130TAFII135

Summary

TAF4 (TATA-box binding protein associated factor 4, HGNC:11537) is a protein-coding gene on chromosome 20q13.33, encoding Transcription initiation factor TFIID subunit 4 (O00268). The TFIID basal transcription factor complex plays a major role in the initiation of RNA polymerase II (Pol II)-dependent transcription. It is a selective cancer dependency (DepMap: 16.6% of cell lines).

Initiation of transcription by RNA polymerase II requires the activities of more than 70 polypeptides. The protein that coordinates these activities is transcription factor IID (TFIID), which binds to the core promoter to position the polymerase properly, serves as the scaffold for assembly of the remainder of the transcription complex, and acts as a channel for regulatory signals. TFIID is composed of the TATA-binding protein (TBP) and a group of evolutionarily conserved proteins known as TBP-associated factors or TAFs. TAFs may participate in basal transcription, serve as coactivators, function in promoter recognition or modify general transcription factors (GTFs) to facilitate complex assembly and transcription initiation. This gene encodes one of the larger subunits of TFIID that has been shown to potentiate transcriptional activation by retinoic acid, thyroid hormone and vitamin D3 receptors. In addition, this subunit interacts with the transcription factor CREB, which has a glutamine-rich activation domain, and binds to other proteins containing glutamine-rich regions. Aberrant binding to this subunit by proteins with expanded polyglutamine regions has been suggested as one of the pathogenetic mechanisms underlying a group of neurodegenerative disorders referred to as polyglutamine diseases.

Source: NCBI Gene 6874 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): intellectual developmental disorder, autosomal dominant 73 (Strong, GenCC)
  • GWAS associations: 7
  • Clinical variants (ClinVar): 293 total — 10 pathogenic, 8 likely-pathogenic
  • Phenotypes (HPO): 101
  • Cancer dependency (DepMap): dependent in 16.6% of screened cell lines
  • MANE Select transcript: NM_003185

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:11537
Approved symbolTAF4
NameTATA-box binding protein associated factor 4
Location20q13.33
Locus typegene with protein product
StatusApproved
AliasesTAFII130, TAFII135
Ensembl geneENSG00000130699
Ensembl biotypeprotein_coding
OMIM601796
Entrez6874

Gene structure

Transcript identifiers

Ensembl transcripts: 10 — 3 protein_coding, 3 protein_coding_CDS_not_defined, 2 retained_intron, 2 nonsense_mediated_decay

ENST00000252996, ENST00000436129, ENST00000474089, ENST00000486599, ENST00000488539, ENST00000608458, ENST00000608887, ENST00000609041, ENST00000609045, ENST00000692470

RefSeq mRNA: 1 — MANE Select: NM_003185 NM_003185

CCDS: CCDS33500

Canonical transcript exons

ENST00000252996 — 15 exons

ExonStartEnd
ENSE000008972126200905262009174
ENSE000008972166201004662010165
ENSE000008972216201281562012934
ENSE000008972266201454762014707
ENSE000013790376206445162065881
ENSE000016368786197479861976335
ENSE000037031096200651062006758
ENSE000037036396199755061997669
ENSE000037048476199898361999108
ENSE000037058976199813661998192
ENSE000037069576200316062003274
ENSE000037089086200754762007636
ENSE000037090786200012462000254
ENSE000037101486200373162003878
ENSE000037114686200055262000721

Expression profiles

Bgee: expression breadth ubiquitous, 139 present calls, max score 89.45.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 7.3722 / max 61.1785, expressed in 1674 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
1882375.98981633
1882361.0700621
1882350.3125131

Top tissues by expression

139 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right testisUBERON:000453489.45gold quality
left testisUBERON:000453389.28gold quality
testisUBERON:000047388.42gold quality
adrenal tissueUBERON:001830388.42gold quality
skeletal muscle tissueUBERON:000113487.55gold quality
bone marrow cellCL:000209287.50gold quality
ventricular zoneUBERON:000305386.96gold quality
ovaryUBERON:000099286.55gold quality
left ovaryUBERON:000211986.51gold quality
ganglionic eminenceUBERON:000402386.39gold quality
embryoUBERON:000092286.38gold quality
right ovaryUBERON:000211886.12gold quality
right hemisphere of cerebellumUBERON:001489085.78gold quality
cerebellumUBERON:000203785.00gold quality
cerebellar hemisphereUBERON:000224584.93gold quality
cerebellar cortexUBERON:000212984.90gold quality
gastrocnemiusUBERON:000138884.75gold quality
tonsilUBERON:000237284.45gold quality
cortical plateUBERON:000534384.18gold quality
muscle tissueUBERON:000238584.12gold quality
muscle of legUBERON:000138384.08gold quality
colonic epitheliumUBERON:000039783.95gold quality
hindlimb stylopod muscleUBERON:000425283.89gold quality
bone marrowUBERON:000237183.80gold quality
sural nerveUBERON:001548883.64gold quality
uterine cervixUBERON:000000283.17gold quality
duodenumUBERON:000211483.08gold quality
body of pancreasUBERON:000115082.67gold quality
body of uterusUBERON:000985382.61gold quality
cortex of kidneyUBERON:000122582.56gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-GEOD-124858no21.13
E-ANND-3no2.24

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

2 targets.

TargetRegulation
ATF7Repression
CDKN1AUnknown

Upstream regulators (CollecTRI, top): CREM

miRNA regulators (miRDB)

158 targeting TAF4, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3163100.0077.238605
HSA-MIR-5692A100.0074.406850
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-5196-5P100.0067.982761
HSA-MIR-4747-5P100.0067.902681
HSA-MIR-200B-3P100.0073.312693
HSA-MIR-200C-3P100.0073.352685
HSA-MIR-429100.0073.442698
HSA-MIR-5011-5P100.0083.465820
HSA-MIR-4533100.0069.482758
HSA-MIR-190A-3P100.0080.355520
HSA-MIR-366299.9973.825684
HSA-MIR-548AW99.9972.573559
HSA-MIR-548P99.9872.253784
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-MIR-19A-3P99.9875.332762
HSA-MIR-19B-3P99.9875.442754
HSA-MIR-4650-5P99.9864.69999
HSA-MIR-548N99.9871.944170
HSA-MIR-477599.9875.006394
HSA-MIR-485-3P99.9870.681585
HSA-MIR-539-3P99.9870.741616
HSA-MIR-1213699.9872.815713
HSA-MIR-1468-3P99.9672.743797
HSA-MIR-548AJ-3P99.9673.385345
HSA-MIR-548X-3P99.9673.385345
HSA-LET-7C-3P99.9573.422862
HSA-MIR-548AB99.9571.313488
HSA-MIR-55999.9572.283609

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 16.6% of screened cell lines.

Literature-anchored findings (GeneRIF, showing 14)

  • huntingtin interacts with Sp1 and TAFII130; transcriptional activity of SP1 and TAFII130 disrupted in early Huntingtin’s Disease (PMID:11988536)
  • Data present the crystal structure of a complex formed by the interacting domains from two subunits of the general transcription factor TFIID, the human TATA binding protein-associated factors hTAF4 (hTAF(II)135) and hTAF12 (hTAF(II)20). (PMID:12237304)
  • trans-activation of promoters by simian virus 40 small t-antigen may depend on a consensus TATA motif and such promoters recruit the general transcription factor hTAF(II)130/135 (PMID:12810884)
  • TAF4b incorporation into TFIID induces an open conformation at the lobe involved in TFIIA & putative activator interactions, correlating with differential activator-dependent transcription & promoter recognition by 4b/4-IID. (PMID:18206971)
  • Findings suggest that DNA binding by TAF4/4b-TAF12 facilitates the association of TFIID with the core promoter of a subset of genes. (PMID:19635797)
  • These interactions are important to the transcriptional activation of these genes by Rta since introducing TAF4 shRNA substantially reduces the ability of Rta to activate these promoters (PMID:23326574)
  • TAF4 isoforms generated by the alternative splicing participate in the conversion of the cellular transcriptional programs from the maintenance of stem cell state to differentiation (PMID:24098348)
  • Inactivation of hTAF4-TAFH domain accelerates differentiation of human neural progenitor cells. (PMID:24696168)
  • Targeted splicing of hTAF4-TAFH coordinates AS of other TFIID subunits, underscoring the role of TAF4 in synchronised changes of Pol II complex composition essential for efficient cellular reprogramming. (PMID:27499390)
  • the two Q-domains of Sp1 and four Q-domains of TAF4 were disordered under physiological conditions. (PMID:27515574)
  • Data suggest that many TATA-box-containing promoters contain a TFIIA recognition element (IIARE) that activates transcription by increasing the recruitment of RNA polymerase II, TFIIA (TF2A1/TF2A2 subunits), TAF4, and P300. (TAF4 = TATA box binding protein (TBP)-associated factor, RNA polymerase II, C1; P300 = E1A binding protein p300) (PMID:28539359)
  • Identification of heteromolecular binding sites in transcription factors Sp1 and TAF4 using high-resolution nuclear magnetic resonance spectroscopy has been reported. (PMID:28857320)
  • alphaalpha-hub coregulator structure and flexibility determine transcription factor binding and selection in regulatory interactomes. (PMID:35452682)
  • De novo putative loss-of-function variants in TAF4 are associated with a neuro-developmental disorder. (PMID:35904126)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriotaf4aENSDARG00000068192
mus_musculusTaf4ENSMUSG00000039117
rattus_norvegicusTaf4ENSRNOG00000054497
caenorhabditis_eleganstaf-4WBGENE00006385

Paralogs (1): TAF4B (ENSG00000141384)

Protein

Protein identifiers

Transcription initiation factor TFIID subunit 4O00268 (reviewed: O00268)

Alternative names: RNA polymerase II TBP-associated factor subunit C, TBP-associated factor 4, Transcription initiation factor TFIID 130 kDa subunit, Transcription initiation factor TFIID 135 kDa subunit

All UniProt accessions (5): O00268, A0A8I5KRH3, A0A8I5KW49, V9GY03, V9GY14

UniProt curated annotations — full annotation on UniProt →

Function. The TFIID basal transcription factor complex plays a major role in the initiation of RNA polymerase II (Pol II)-dependent transcription. TFIID recognizes and binds promoters with or without a TATA box via its subunit TBP, a TATA-box-binding protein, and promotes assembly of the pre-initiation complex (PIC). The TFIID complex consists of TBP and TBP-associated factors (TAFs), including TAF1, TAF2, TAF3, TAF4, TAF5, TAF6, TAF7, TAF8, TAF9, TAF10, TAF11, TAF12 and TAF13. TAF4 may maintain an association between the TFIID and TFIIA complexes, while bound to the promoter, together with TBP, during PIC assembly. Potentiates transcriptional activation by the AF-2S of the retinoic acid, vitamin D3 and thyroid hormone.

Subunit / interactions. Component of the TFIID basal transcription factor complex, composed of TATA-box-binding protein TBP, and a number of TBP-associated factors (TAFs), including TAF1, TAF2, TAF3, TAF4, TAF5, TAF6, TAF7, TAF8, TAF9, TAF10, TAF11, TAF12 and TAF13. Component of the TFTC-HAT complex, at least composed of TAF5L, TAF6L, TADA3L, SUPT3H, TAF2, TAF4, TAF5, GCN5L2/GCN5, TAF10, TAF12 and TRRAP. Component of some MLL1/MLL complex, at least composed of the core components KMT2A/MLL1, ASH2L, HCFC1/HCF1, WDR5 and RBBP5, as well as the facultative components BACC1, CHD8, E2F6, HSP70, INO80C, KANSL1, LAS1L, MAX, MCRS1, MGA, MYST1/MOF, PELP1, PHF20, PRP31, RING2, RUVB1/TIP49A, RUVB2/TIP49B, SENP3, TAF1, TAF4, TAF6, TAF7, TAF9 and TEX10. Interacts with ATF7; the interaction inhibits ATF7-mediated tranactivation. (Microbial infection) Interacts with SV40 Large T antigen.

Subcellular location. Nucleus.

Disease relevance. Intellectual developmental disorder, autosomal dominant 73 (MRD73) [MIM:620450] An autosomal dominant disorder characterized by intellectual disability ranging from mild to severe, developmental delay, speech delay, behavioral abnormalities, and non-specific dysmorphic facial features. The disease is caused by variants affecting the gene represented in this entry.

Similarity. Belongs to the TAF4 family.

RefSeq proteins (1): NP_003176* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR003894TAFH_NHR1Domain
IPR007900TAF4_CDomain
IPR009072Histone-foldHomologous_superfamily
IPR037249TAFH/NHR1_dom_sfHomologous_superfamily
IPR045144TAF4Family

Pfam: PF05236, PF07531

UniProt features (66 total): compositionally biased region 15, helix 15, modified residue 11, region of interest 7, sequence variant 5, sequence conflict 5, strand 4, turn 2, chain 1, domain 1

Structure

Experimental structures (PDB)

31 structures, top 30 by resolution.

PDBMethodResolution (Å)
2P6VX-RAY DIFFRACTION2
1H3OX-RAY DIFFRACTION2.3
7EGGELECTRON MICROSCOPY2.77
7EGFELECTRON MICROSCOPY3.16
7EGBELECTRON MICROSCOPY3.3
7EG9ELECTRON MICROSCOPY3.7
7EGCELECTRON MICROSCOPY3.9
7ENAELECTRON MICROSCOPY4.07
7EGAELECTRON MICROSCOPY4.1
7ENCELECTRON MICROSCOPY4.13
8GXSELECTRON MICROSCOPY4.16
6MZCELECTRON MICROSCOPY4.5
7EDXELECTRON MICROSCOPY4.5
8GXQELECTRON MICROSCOPY5.04
8WAKELECTRON MICROSCOPY5.47
8WAPELECTRON MICROSCOPY5.85
8WANELECTRON MICROSCOPY6.07
8WASELECTRON MICROSCOPY6.13
7EG7ELECTRON MICROSCOPY6.2
8WAQELECTRON MICROSCOPY6.29
8WAOELECTRON MICROSCOPY6.4
7EGDELECTRON MICROSCOPY6.75
8WARELECTRON MICROSCOPY7.2
7EG8ELECTRON MICROSCOPY7.4
6MZMELECTRON MICROSCOPY7.5
8WALELECTRON MICROSCOPY8.52
7EGJELECTRON MICROSCOPY8.64
7EGEELECTRON MICROSCOPY9
6MZDELECTRON MICROSCOPY9.8
7EGIELECTRON MICROSCOPY9.82

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O00268-F154.710.16

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (11): 109, 424, 435, 594, 595, 597, 605, 611, 621, 631, 658

Function

Pathways and Gene Ontology

Reactome pathways

20 pathways

IDPathway
R-HSA-167161HIV Transcription Initiation
R-HSA-167162RNA Polymerase II HIV Promoter Escape
R-HSA-167172Transcription of the HIV genome
R-HSA-674695RNA Polymerase II Pre-transcription Events
R-HSA-6804756Regulation of TP53 Activity through Phosphorylation
R-HSA-73776RNA Polymerase II Promoter Escape
R-HSA-73779RNA Polymerase II Transcription Pre-Initiation And Promoter Opening
R-HSA-75953RNA Polymerase II Transcription Initiation
R-HSA-76042RNA Polymerase II Transcription Initiation And Promoter Clearance
R-HSA-162587HIV Life Cycle
R-HSA-162599Late Phase of HIV Life Cycle
R-HSA-162906HIV Infection
R-HSA-1643685Disease
R-HSA-212436Generic Transcription Pathway
R-HSA-3700989Transcriptional Regulation by TP53
R-HSA-5633007Regulation of TP53 Activity
R-HSA-5663205Infectious disease
R-HSA-73857RNA Polymerase II Transcription
R-HSA-74160Gene expression (Transcription)
R-HSA-9824446Viral Infection Pathways

MSigDB gene sets: 0 (showing top):

GO Biological Process (12): ovarian follicle development (GO:0001541), regulation of DNA repair (GO:0006282), DNA-templated transcription initiation (GO:0006352), regulation of transcription by RNA polymerase II (GO:0006357), transcription by RNA polymerase II (GO:0006366), transcription initiation at RNA polymerase II promoter (GO:0006367), mRNA transcription by RNA polymerase II (GO:0042789), positive regulation of DNA-templated transcription (GO:0045893), RNA polymerase II preinitiation complex assembly (GO:0051123), positive regulation of transcription initiation by RNA polymerase II (GO:0060261), DNA-templated transcription (GO:0006351), regulation of DNA-templated transcription (GO:0006355)

GO Molecular Function (7): DNA binding (GO:0003677), aryl hydrocarbon receptor binding (GO:0017162), protein heterodimerization activity (GO:0046982), core promoter sequence-specific DNA binding (GO:0001046), protein binding (GO:0005515), RNA polymerase II general transcription initiation factor activity (GO:0016251), RNA polymerase II-specific DNA-binding transcription factor binding (GO:0061629)

GO Cellular Component (10): chromatin (GO:0000785), male germ cell nucleus (GO:0001673), nucleus (GO:0005634), nucleoplasm (GO:0005654), transcription factor TFIID complex (GO:0005669), cytosol (GO:0005829), protein-containing complex (GO:0032991), transcription factor TFTC complex (GO:0033276), MLL1 complex (GO:0071339), transcription regulator complex (GO:0005667)

Reactome top-level categories

Rollup of top-13 pathways:

CategoryPathways
RNA Polymerase II Transcription4
Transcription of the HIV genome2
RNA Polymerase II Transcription Initiation And Promoter Clearance2
Late Phase of HIV Life Cycle1
Regulation of TP53 Activity1
HIV Infection1
HIV Life Cycle1
Viral Infection Pathways1
Generic Transcription Pathway1
Transcriptional Regulation by TP531
Disease1
Gene expression (Transcription)1
Infectious disease1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
DNA-templated transcription4
transcription by RNA polymerase II4
cellular anatomical structure3
RNA biosynthetic process2
regulation of DNA-templated transcription2
transcription initiation at RNA polymerase II promoter2
RNA polymerase II, holoenzyme2
RNA polymerase II transcription regulator complex2
female gonad development1
anatomical structure development1
DNA repair1
regulation of DNA metabolic process1
regulation of cellular response to stress1
DNA-templated transcription initiation1
mRNA transcription1
positive regulation of RNA biosynthetic process1
transcription preinitiation complex assembly1
positive regulation of transcription by RNA polymerase II1
regulation of transcription initiation by RNA polymerase II1
positive regulation of DNA-templated transcription initiation1
gene expression1
regulation of gene expression1
regulation of RNA biosynthetic process1
nucleic acid binding1
signaling receptor binding1
RNA polymerase II-specific DNA-binding transcription factor binding1
protein dimerization activity1
transcription cis-regulatory region binding1
binding1
general transcription initiation factor activity1
DNA-binding transcription factor binding1
chromosome1
germ cell nucleus1
intracellular membrane-bounded organelle1
nuclear lumen1
cytoplasm1
cellular_component1
SAGA-type complex1
MLL1/2 complex1
protein-containing complex1

Protein interactions and networks

STRING

1404 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
TAF4TAF5Q15542998
TAF4TAF12Q16514997
TAF4TAF9Q16594993
TAF4TAF6P49848990
TAF4CREB1P16220985
TAF4TAF2Q6P1X5961
TAF4TBPP20226959
TAF4HTTP42858948
TAF4TAF1P21675942
TAF4TAF10Q12962919
TAF4TAF8Q7Z7C8888
TAF4TAF7Q15545841
TAF4TAF11Q15544822
TAF4TAF3Q5VWG9791
TAF4KAT2BQ92831767

IntAct

120 interactions, top by confidence:

ABTypeScore
TAF12TAF4psi-mi:“MI:0407”(direct interaction)0.760
TAF12TAF4psi-mi:“MI:0914”(association)0.760
TBPTAF4psi-mi:“MI:0914”(association)0.730
TAF12TAF4psi-mi:“MI:0915”(physical association)0.720
TAF4TAF12psi-mi:“MI:0407”(direct interaction)0.720
TAF4TAF12psi-mi:“MI:0915”(physical association)0.720
TAF12TAF4psi-mi:“MI:0914”(association)0.720
TAF10TAF4psi-mi:“MI:0914”(association)0.650
TAF10TAF4psi-mi:“MI:0915”(physical association)0.650
TAF4CREBZFpsi-mi:“MI:0407”(direct interaction)0.560
TAF4CREB3psi-mi:“MI:0407”(direct interaction)0.560

BioGRID (225): TAF4 (Affinity Capture-MS), TAF4 (Protein-peptide), TAF4 (Affinity Capture-MS), TAF4 (Affinity Capture-MS), TAF4 (Affinity Capture-MS), ANAPC1 (Co-fractionation), ANAPC7 (Co-fractionation), CDC16 (Co-fractionation), NAP1L4 (Co-fractionation), TAF1 (Co-fractionation), TAF10 (Co-fractionation), TAF4 (Co-fractionation), TAF4 (Co-fractionation), TAF4 (Co-fractionation), TAF4 (Co-fractionation)

ESM2 similar proteins: A4D2P6, A5PJV8, A6NFD8, D4AE48, O00268, O00287, O35274, O35779, O43566, P04198, P12755, P55199, Q08DA0, Q0D2I5, Q2KJ58, Q504T8, Q5XKK7, Q60698, Q61976, Q6NZ67, Q6P582, Q6R891, Q6T4P5, Q7Z6J2, Q80YR4, Q86UD0, Q86UK7, Q8BXL9, Q8CEG5, Q8R4T5, Q8TF61, Q8VCG9, Q969F2, Q969G9, Q96HZ4, Q96SB3, Q99PV5, Q9BQ61, Q9BUN5, Q9BZE9

Diamond homologs: G5E8Z2, O00268, O61707, P47825, Q92750, O43439, O54972, O70374, O75081, O75398, O77562, O88450, Q06455, Q5F3B1, Q61909, Q9IAB2, Q9Z1T5

SIGNOR signaling

2 interactions.

AEffectBMechanism
TAF4“form complex”TFIIDbinding
TAF4“down-regulates activity”ATF7binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 87 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
HIV Transcription Initiation1143.5×5e-14
RNA Polymerase II HIV Promoter Escape1143.5×5e-14
RNA Polymerase II Promoter Escape1143.5×5e-14
RNA Polymerase II Transcription Pre-Initiation And Promoter Opening1143.5×5e-14
RNA Polymerase II Transcription Initiation1143.5×5e-14
RNA Polymerase II Transcription Initiation And Promoter Clearance1143.5×5e-14
Transcription of the HIV genome1132.3×1e-12
Late Phase of HIV Life Cycle1028.5×7e-11

GO biological processes:

GO termPartnersFoldFDR
RNA polymerase II preinitiation complex assembly1344.7×9e-16
transcription initiation at RNA polymerase II promoter942.7×1e-10
positive regulation of transcription initiation by RNA polymerase II1137.9×2e-12
mRNA transcription by RNA polymerase II833.5×1e-08
regulation of DNA repair724.5×1e-06
regulation of RNA splicing616.6×1e-04
transcription by RNA polymerase II108.9×1e-05

Disease & clinical

Clinical variants and AI predictions

ClinVar

293 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic10
Likely pathogenic8
Uncertain significance209
Likely benign43
Benign3

Top pathogenic / likely-pathogenic (18)

Variant IDHGVSClassification
3173503NM_003185.4(TAF4):c.1519C>T (p.Gln507Ter)Pathogenic
3173508NM_003185.4(TAF4):c.2719C>T (p.Gln907Ter)Pathogenic
3242345NM_003185.4(TAF4):c.2845C>T (p.Gln949Ter)Pathogenic
3242346NM_003185.4(TAF4):c.1348C>T (p.Gln450Ter)Pathogenic
3242347NM_003185.4(TAF4):c.2185C>T (p.Gln729Ter)Pathogenic
3242348NM_003185.4(TAF4):c.2453dup (p.Asn818fs)Pathogenic
3242349NM_003185.4(TAF4):c.2664del (p.Lys888fs)Pathogenic
4729492NM_003185.4(TAF4):c.898_910del (p.Pro300fs)Pathogenic
4818966NM_003185.4(TAF4):c.2425dup (p.Ser809fs)Pathogenic
4819103NM_003185.4(TAF4):c.633dup (p.Ala212fs)Pathogenic
1687439NM_003185.4(TAF4):c.853_860del (p.Gly285fs)Likely pathogenic
3236097NM_003185.4(TAF4):c.213_268del (p.Gly72fs)Likely pathogenic
3376435NM_003185.4(TAF4):c.808_818del (p.Pro270fs)Likely pathogenic
3587593NM_003185.4(TAF4):c.962del (p.Gly321fs)Likely pathogenic
3765856NM_003185.4(TAF4):c.1290del (p.Val431fs)Likely pathogenic
3767129NM_003185.4(TAF4):c.1529del (p.Gly510fs)Likely pathogenic
3773682NM_003185.4(TAF4):c.3160_3163del (p.Arg1054fs)Likely pathogenic
4291844NM_003185.4(TAF4):c.756del (p.Ala253fs)Likely pathogenic

SpliceAI

4369 predictions. Top by Δscore:

VariantEffectΔscore
20:61995377:T:TAdonor_gain1.0000
20:61995380:A:Cdonor_gain1.0000
20:61997544:CCGTA:Cdonor_loss1.0000
20:61997545:CGTA:Cdonor_loss1.0000
20:61997547:TACC:Tdonor_loss1.0000
20:61997549:C:CAdonor_loss1.0000
20:61997549:CCT:Cdonor_gain1.0000
20:61997585:T:Adonor_gain1.0000
20:61997667:CAT:Cacceptor_gain1.0000
20:61997667:CATC:Cacceptor_loss1.0000
20:61997667:CATCT:Cacceptor_gain1.0000
20:61997670:C:CCacceptor_gain1.0000
20:61997671:T:Cacceptor_gain1.0000
20:61997671:T:TCacceptor_gain1.0000
20:61997673:T:Cacceptor_gain1.0000
20:61997673:T:TCacceptor_gain1.0000
20:61997675:A:Cacceptor_gain1.0000
20:61997676:T:TCacceptor_gain1.0000
20:61997677:T:Cacceptor_gain1.0000
20:61997678:T:Cacceptor_gain1.0000
20:61997678:T:TCacceptor_gain1.0000
20:61997720:CTTA:Cacceptor_gain1.0000
20:61998137:T:TAdonor_gain1.0000
20:61998188:CGAGA:Cacceptor_gain1.0000
20:61998189:GAGA:Gacceptor_gain1.0000
20:61998190:AGA:Aacceptor_gain1.0000
20:61998191:GA:Gacceptor_gain1.0000
20:61998193:C:CCacceptor_gain1.0000
20:61998199:T:Cacceptor_gain1.0000
20:61998199:T:TCacceptor_gain1.0000

AlphaMissense

6855 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:61976175:A:GL1084P1.000
20:61976188:A:CY1080D1.000
20:61976190:A:GL1079P1.000
20:61976238:A:GL1063P1.000
20:61976241:T:AD1062V1.000
20:61976241:T:CD1062G1.000
20:61976241:T:GD1062A1.000
20:61976253:A:TV1058D1.000
20:61997596:C:AR1015M1.000
20:61997602:C:TG1013E1.000
20:61997603:C:GG1013R1.000
20:61997603:C:TG1013R1.000
20:61997605:A:TI1012N1.000
20:61997608:G:TA1011E1.000
20:61997609:C:GA1011P1.000
20:61997612:C:GA1010P1.000
20:61997614:A:GL1009P1.000
20:61997617:G:TA1008E1.000
20:61997618:C:GA1008P1.000
20:61997625:G:CN1005K1.000
20:61997625:G:TN1005K1.000
20:61997630:C:GA1004P1.000
20:61997635:C:GR1002P1.000
20:61997653:A:GL996P1.000
20:61997665:T:GQ992P1.000
20:61997668:A:GM991T1.000
20:61998139:C:AK989N1.000
20:61998139:C:GK989N1.000
20:61998141:T:CK989E1.000
20:61998144:C:GA988P1.000

dbSNP variants (sampled 300 via entrez): RS1000003021 (20:61998821 A>T), RS1000061128 (20:62064405 C>A,T), RS1000062720 (20:62039670 T>C), RS1000067117 (20:62014446 C>T), RS1000071229 (20:61979912 G>A), RS1000084670 (20:61999366 C>T), RS1000117308 (20:62007274 C>A), RS1000168735 (20:62056770 C>T), RS1000170472 (20:61991538 G>A), RS1000201195 (20:61991129 A>G), RS1000212067 (20:62025162 T>C,G), RS1000228566 (20:62064070 G>A), RS1000252583 (20:62045879 T>A,C), RS1000310250 (20:61985559 G>A), RS1000375370 (20:62029916 A>G)

Disease associations

OMIM: gene MIM:601796 | disease phenotypes: MIM:620450

GenCC curated gene-disease

DiseaseClassificationInheritance
intellectual developmental disorder, autosomal dominant 73StrongAutosomal dominant

Mondo (2): intellectual developmental disorder, autosomal dominant 73 (MONDO:0957536), neurodevelopmental disorder (MONDO:0700092)

Orphanet (0):

HPO phenotypes

101 total (30 of 101 shown, HPO-id order):

HPOTerm
HP:0000006Autosomal dominant inheritance
HP:0000023Inguinal hernia
HP:0000028Cryptorchidism
HP:0000072Hydroureter
HP:0000098Tall stature
HP:0000179Thick lower lip vermilion
HP:0000185Cleft soft palate
HP:0000218High palate
HP:0000232Everted lower lip vermilion
HP:0000252Microcephaly
HP:0000278Retrognathia
HP:0000289Broad philtrum
HP:0000319Smooth philtrum
HP:0000322Short philtrum
HP:0000331Short chin
HP:0000358Posteriorly rotated ears
HP:0000369Low-set ears
HP:0000400Macrotia
HP:0000403Recurrent otitis media
HP:0000405Conductive hearing impairment
HP:0000426Prominent nasal bridge
HP:0000430Underdeveloped nasal alae
HP:0000431Wide nasal bridge
HP:0000444Convex nasal ridge
HP:0000445Wide nose
HP:0000448Prominent nose
HP:0000463Anteverted nares
HP:0000490Deeply set eye
HP:0000494Downslanted palpebral fissures
HP:0000540Hypermetropia

GWAS associations

7 associations (top):

StudyTraitp-value
GCST003486_11Response to fenofibrate (LDL cholesterol levels)9.000000e-06
GCST004904_9Body mass index4.000000e-09
GCST009391_1802Metabolite levels6.000000e-06
GCST009391_6Metabolite levels1.000000e-06
GCST011983_22Fasting glucose2.000000e-06
GCST90002397_291Mean spheric corpuscular volume9.000000e-12
GCST90002403_530Red blood cell count3.000000e-11

EFO canonical traits (5, from GWAS)

EFO IDTrait name
EFO:0007804LDL cholesterol change measurement
EFO:0004340body mass index
EFO:0010403triacylglycerol 48:0 measurement
EFO:0007813cotinine measurement
EFO:0004305erythrocyte count

MeSH disease descriptors (1)

DescriptorNameTree numbers
D065886Neurodevelopmental DisordersF03.625

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

31 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Cadmium Chloridedecreases expression, increases abundance, increases expression3
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxideaffects expression, decreases expression2
GSK-J4increases expression1
dicrotophosincreases expression1
bisphenol Aincreases methylation1
sodium arseniteincreases expression1
butyraldehydedecreases expression1
aflatoxin B2decreases methylation1
3-(2-hydroxy-4-(2-methylnonan-2-yl)phenyl)cyclohexan-1-oldecreases expression1
Air Pollutantsaffects expression, increases abundance1
Arsenicaffects methylation1
Cadmiumdecreases expression, increases abundance1
Caffeineincreases phosphorylation1
Cisplatindecreases expression1
Doxorubicindecreases expression1
Enzyme Inhibitorsdecreases activity, increases O-linked glycosylation1
Formaldehydedecreases expression1
Ivermectindecreases expression1
Leadaffects expression1
Ozoneincreases abundance, affects expression1
Pesticidesaffects methylation1
Smokedecreases expression1
Tamoxifenincreases expression1
Tretinoindecreases expression1
Tunicamycindecreases expression1
Valproic Acidincreases methylation1
Zincaffects expression1
Josamycinaffects response to substance1
Cyclosporinedecreases methylation1
Aflatoxin B1decreases methylation, increases methylation1

Clinical trials (associated diseases)

202 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT04586348PHASE4UNKNOWNPrenatal Iodine Supplementation and Early Childhood Neurodevelopment
NCT04873115PHASE4UNKNOWNDouble-blind, Placebo-controlled, Randomized Clinical Trial Comparing the Efficacy and Safety of Sialanar Plus orAl rehabiLitation Against Placebo Plus Oral Rehabilitation for chIldren and Adolescents With seVere Sialorrhoea and Neurodisabilties,
NCT02559102PHASE3COMPLETEDDexmedetomidine Sedation Versus General Anaesthesia for Inguinal Hernia Surgery in Infants
NCT02757079PHASE3COMPLETEDStudy of the Efficacy and Safety of NPC-15 for Sleep Disorders of Children With Neurodevelopmental Disorders
NCT06915480PHASE3RECRUITINGReducing Missed Appointments
NCT07377032PHASE3RECRUITINGTAP-GRIN: Interventional Study on Patients With GRIN-related Neurodevelopmental Disorders
NCT02909959PHASE2COMPLETEDSulforaphane for the Treatment of Young Men With Autism Spectrum Disorder
NCT06081348PHASE2RECRUITINGSertraline vs. Placebo in the Treatment of Anxiety in Children and AdoLescents With NeurodevelopMental Disorders
NCT06352372PHASE2COMPLETEDSafety and Efficacy of tPBM for Epileptiform Activity in Autism
NCT00503191PHASE1COMPLETEDNeuroModulation Technique Treatment of Autism
NCT04475848PHASE1COMPLETEDA Study to Investigate the Safety, Tolerability, Pharmacokinetics, Pharmacodynamics and Food Effect of RO6953958 in Healthy Participants
NCT06300398PHASE1COMPLETEDIAMA-6 Oral Dose Study in Healthy Adults
NCT01783041PHASE2/PHASE3COMPLETEDEffect of Early L-Carnitine Supplementation on Neurodevelopmental Outcomes in Very Preterm Infants
NCT05767385PHASE2/PHASE3RECRUITINGFetal Cerebrovascular Autoregulation in Congenital Heart Disease and Association With Neonatal Neurobehavior
NCT05675098EARLY_PHASE1NOT_YET_RECRUITINGCentral Nervous System Stimulants and Physical Function in Children With Cerebral Palsy
NCT00783783Not specifiedCOMPLETEDCYP2D6 Pharmacogenetics in Risperidone-Treated Children
NCT01778504Not specifiedRECRUITINGStudying Childhood-onset Behavioral, Psychiatric, and Developmental Disorders
NCT01850784Not specifiedUNKNOWNHigh Energy Formula Feeding in Infants With Congenital Heart Disease
NCT01922791Not specifiedCOMPLETEDNutrition and Pregnancy Intervention Study
NCT01942525Not specifiedUNKNOWNInfluence of Intrauterine Growth Restriction on Amplitude-integrated EEG in Preterm Infants
NCT02003170Not specifiedCOMPLETEDEtiology and Early Diagnosis of Neurodevelopmental Disorders
NCT02118649Not specifiedACTIVE_NOT_RECRUITINGEnhancing Behavior and Brain Response to Visual Targets Using a Computer Game
NCT02557191Not specifiedTERMINATEDBiomarkers, Neurodevelopment and Preterm Infants
NCT02690675Not specifiedCOMPLETEDIron Supplement Effect on Child Development
NCT02694003Not specifiedCOMPLETEDBetter Nights, Better Days for Children With Neurodevelopment Disorders
NCT02792894Not specifiedCOMPLETEDFamily Networks (FaNs) for Children With Developmental Disorders and Delays
NCT02871674Not specifiedUNKNOWNGood Night Project: Behavioural Sleep Interventions for Children With ADHD: A Randomised Controlled Trial
NCT02887157Not specifiedCOMPLETEDAnalyzing Retinal Microanatomy in ROP
NCT02898298Not specifiedCOMPLETEDPositive Emotion Regulation Training in Children, Adolescents and Young Adults With and Without Developmental Disorder
NCT02912780Not specifiedUNKNOWNIntroduction of Microsystems in a Level 3 Neonatal Intensive Care Unit
NCT03023293Not specifiedCOMPLETEDn-3 PUFAs, Irisin and Maternal Glucose Metabolism From Pregnancy to Postpartum
NCT03023644Not specifiedCOMPLETEDImproving Neurodevelopmental Outcomes in Children With Congenital Heart Disease: An Intervention Study
NCT03032991Not specifiedUNKNOWNEarly Biomarkers of Neurodevelopment in Offspring of Diabetic Mothers
NCT03088189Not specifiedTERMINATEDEffect of Parental Peri-conceptional Vitamin B12 Supplementation on Infant Neurocognitive Development in Offspring
NCT03096028Not specifiedCOMPLETEDDevelopmental Origins of Mental Health Disorders
NCT03148782Not specifiedCOMPLETEDBrain Plasticity Underlying Acquisition of New Organizational Skills in Children-R61 Phase
NCT03172104Not specifiedCOMPLETEDNeurobehavioural Development of Infants Born <30 Weeks Gestational Age Between Birth and Five Years of Age
NCT03222375Not specifiedRECRUITINGSQUED™ Series 28.1 Home-use and Treatment of Autowave Reverberator of Autism
NCT03229928Not specifiedCOMPLETEDClinical Testing of a Real-Time Behavior Measurement Tool: Measuring Outcomes for CHAnge
NCT03232489Not specifiedUNKNOWNStudy for the Evaluation of the Feasibility of Applying Advanced MRI Scanning in Pediatric Clinical Practice