TAF7L

gene
On this page

Also known as CT40

Summary

TAF7L (TATA-box binding protein associated factor 7 like, HGNC:11548) is a protein-coding gene on chromosome Xq22.1, encoding Transcription initiation factor TFIID subunit 7-like (Q5H9L4). Probably functions as a spermatogenesis-specific component of the DNA-binding general transcription factor complex TFIID, a multimeric protein complex that plays a central role in mediating promoter responses to various activators and repressors.

This gene is similar to a mouse gene that encodes a TATA box binding protein-associated factor, and shows testis-specific expression. The encoded protein could be a spermatogenesis-specific component of the DNA-binding general transcription factor complex TFIID. Alternatively spliced transcript variants encoding different isoforms have been found for this gene.

Source: NCBI Gene 54457 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 95 total — 1 pathogenic, 1 likely-pathogenic
  • MANE Select transcript: NM_001168474

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:11548
Approved symbolTAF7L
NameTATA-box binding protein associated factor 7 like
LocationXq22.1
Locus typegene with protein product
StatusApproved
AliasesCT40
Ensembl geneENSG00000102387
Ensembl biotypeprotein_coding
OMIM300314
Entrez54457

Gene structure

Transcript identifiers

Ensembl transcripts: 3 — 3 protein_coding

ENST00000324762, ENST00000356784, ENST00000372907

RefSeq mRNA: 3 — MANE Select: NM_001168474 NM_001168474, NM_001410720, NM_024885

CCDS: CCDS35347, CCDS55466, CCDS94640

Canonical transcript exons

ENST00000356784 — 13 exons

ExonStartEnd
ENSE00000673943101278049101278121
ENSE00000673944101281720101281775
ENSE00000868931101276307101276528
ENSE00000868932101277606101277719
ENSE00000868935101278994101279035
ENSE00000868937101282327101282453
ENSE00000868938101283450101283583
ENSE00000868940101286575101286653
ENSE00001295538101276000101276112
ENSE00001319364101275222101275281
ENSE00001428283101291224101291334
ENSE00001435623101268257101269237
ENSE00003545475101287478101287545

Expression profiles

Bgee: expression breadth ubiquitous, 136 present calls, max score 91.50.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.2556 / max 84.3570, expressed in 55 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
1999440.104427
1999430.101026
1999450.02592
1999460.02433

Top tissues by expression

266 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right testisUBERON:000453491.50gold quality
left testisUBERON:000453391.09gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099188.83gold quality
testisUBERON:000047388.80gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047386.36gold quality
adult organismUBERON:000702369.02gold quality
stromal cell of endometriumCL:000225565.67gold quality
endocervixUBERON:000045862.33gold quality
nucleus accumbensUBERON:000188261.64gold quality
endometriumUBERON:000129559.47gold quality
skin of legUBERON:000151158.79gold quality
right lobe of liverUBERON:000111457.12gold quality
seminal vesicleUBERON:000099857.05silver quality
Brodmann (1909) area 9UBERON:001354056.96gold quality
ectocervixUBERON:001224956.86gold quality
anterior cingulate cortexUBERON:000983556.34gold quality
cingulate cortexUBERON:000302756.33gold quality
ventricular zoneUBERON:000305356.15gold quality
dorsolateral prefrontal cortexUBERON:000983456.06gold quality
right frontal lobeUBERON:000281055.98gold quality
amygdalaUBERON:000187655.88gold quality
zone of skinUBERON:000001455.03gold quality
uterine cervixUBERON:000000254.04gold quality
caudate nucleusUBERON:000187353.81gold quality
skin of abdomenUBERON:000141653.63gold quality
prefrontal cortexUBERON:000045153.44gold quality
lateral globus pallidusUBERON:000247653.26gold quality
telencephalonUBERON:000189352.89gold quality
Ammon’s hornUBERON:000195452.64gold quality
neocortexUBERON:000195052.63gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes4.59

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): TBPL1

miRNA regulators (miRDB)

33 targeting TAF7L, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-366299.9973.825684
HSA-MIR-548AA99.9670.643753
HSA-MIR-548AP-3P99.9670.643753
HSA-MIR-548T-3P99.9670.643753
HSA-MIR-568099.9169.833421
HSA-MIR-137-3P99.8774.742401
HSA-MIR-548D-3P99.8770.674362
HSA-MIR-548BB-3P99.8670.584354
HSA-MIR-548AC99.8470.774351
HSA-MIR-548H-3P99.8470.804349
HSA-MIR-548Z99.8470.804349
HSA-MIR-520F-3P99.8271.321216
HSA-MIR-4799-5P99.8270.602663
HSA-MIR-430799.8270.453374
HSA-MIR-313399.8170.923506
HSA-MIR-57799.7869.132479
HSA-MIR-1212499.6869.172700
HSA-MIR-24-3P99.5969.971934
HSA-MIR-17-3P99.5566.771311
HSA-MIR-317699.2564.35954
HSA-MIR-3922-3P99.2564.961136
HSA-MIR-126499.2566.811317
HSA-MIR-548AS-3P99.1269.122294
HSA-MIR-193A-3P98.5966.36769
HSA-MIR-193B-3P98.5966.62748
HSA-MIR-426698.5367.291035
HSA-MIR-3144-3P98.1567.34677
HSA-MIR-6502-3P97.8665.43569
HSA-MIR-56297.6665.63698
HSA-MIR-64597.2866.30486

Literature-anchored findings (GeneRIF, showing 9)

  • TAF7L gene polymorphisms were found in male infertility patients and normal controls. (PMID:16597641)
  • The sequence variant in exon 13 of the TAF7L gene located on the X chromosome may represent a risk factor for spermatogenic failure. (PMID:17714218)
  • expression of AURKC, OIP5, PIWIL2 and TAF7L differed between patients with Acute myeloid leukemia, myelodysplastic syndrome and healthy controls in a gender-dependent manner (PMID:23292864)
  • Frameshift mutations of TAF7L gene is associated with stomach and colorectal cancers. (PMID:25098277)
  • Over-expression of TAF7L was observed in breast tumor cell lines. (PMID:26107214)
  • these findings support a putative role of TAF7L in tumor suppression (PMID:29307603)
  • Identification of a missense mutation (D136G) in the X-linked TAF7L gene as a potential cause of oligozoospermia in men. (PMID:35554494)
  • The association of testis-specific hTAF7L gene variants with idiopathic azoospermic and severe oligozoospermic male infertility. (PMID:36068176)
  • Deleterious variants in TAF7L cause human oligoasthenoteratozoospermia and its impairing histone to protamine exchange inducing reduced in vitro fertilization. (PMID:36714566)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
mus_musculusTaf7lENSMUSG00000009596
mus_musculusTaf7l2ENSMUSG00000121803
rattus_norvegicusTaf7l-ps1ENSRNOG00000022632
rattus_norvegicusTaf7lENSRNOG00000026315
drosophila_melanogasterTaf7FBGN0024909
caenorhabditis_elegansWBGENE00006388
caenorhabditis_elegansWBGENE00006389

Paralogs (1): TAF7 (ENSG00000178913)

Protein

Protein identifiers

Transcription initiation factor TFIID subunit 7-likeQ5H9L4 (reviewed: Q5H9L4)

Alternative names: Cancer/testis antigen 40, RNA polymerase II TBP-associated factor subunit Q, TATA box-binding protein-associated factor 50 kDa, Transcription initiation factor TFIID 50 kDa subunit

All UniProt accessions (1): Q5H9L4

UniProt curated annotations — full annotation on UniProt →

Function. Probably functions as a spermatogenesis-specific component of the DNA-binding general transcription factor complex TFIID, a multimeric protein complex that plays a central role in mediating promoter responses to various activators and repressors. May play a role in spermatogenesis.

Subunit / interactions. TFIID is composed of TATA binding protein (TBP) and a number of TBP-associated factors (TAFs). TAF7L may replace TAF7 in a spermatogenesis-specific form of TFIID. Interacts with TBP; the interaction occurs in a sub-population of cells (pachytene and haploid round spermatids) and is developmentally regulated through differential intracellular localization of the two proteins. Interacts with TAF1.

Subcellular location. Nucleus. Cytoplasm.

Tissue specificity. Testis-specific.

Similarity. Belongs to the TAF7 family.

Isoforms (3)

UniProt IDNamesCanonical?
Q5H9L4-11yes
Q5H9L4-22
Q5H9L4-33

RefSeq proteins (3): NP_001161946, NP_001397649, NP_079161 (=MANE)

Domains & families (InterPro)

IDNameType
IPR006751TAFII55_prot_cons_regDomain
IPR037817TAF7Family

Pfam: PF04658

UniProt features (15 total): sequence variant 6, compositionally biased region 3, region of interest 2, splice variant 2, chain 1, coiled-coil region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q5H9L4-F168.140.33

Function

Pathways and Gene Ontology

Reactome pathways

20 pathways

IDPathway
R-HSA-167161HIV Transcription Initiation
R-HSA-167162RNA Polymerase II HIV Promoter Escape
R-HSA-167172Transcription of the HIV genome
R-HSA-674695RNA Polymerase II Pre-transcription Events
R-HSA-6804756Regulation of TP53 Activity through Phosphorylation
R-HSA-73776RNA Polymerase II Promoter Escape
R-HSA-73779RNA Polymerase II Transcription Pre-Initiation And Promoter Opening
R-HSA-75953RNA Polymerase II Transcription Initiation
R-HSA-76042RNA Polymerase II Transcription Initiation And Promoter Clearance
R-HSA-162587HIV Life Cycle
R-HSA-162599Late Phase of HIV Life Cycle
R-HSA-162906HIV Infection
R-HSA-1643685Disease
R-HSA-212436Generic Transcription Pathway
R-HSA-3700989Transcriptional Regulation by TP53
R-HSA-5633007Regulation of TP53 Activity
R-HSA-5663205Infectious disease
R-HSA-73857RNA Polymerase II Transcription
R-HSA-74160Gene expression (Transcription)
R-HSA-9824446Viral Infection Pathways

MSigDB gene sets: 79 (showing top): SCHWAB_TARGETS_OF_BMYB_POLYMORPHIC_VARIANTS_DN, GOBP_MALE_GAMETE_GENERATION, REACTOME_HIV_INFECTION, SHETH_LIVER_CANCER_VS_TXNIP_LOSS_PAM5, GOBP_DNA_TEMPLATED_TRANSCRIPTION_INITIATION, GOBP_TRANSCRIPTION_INITIATION_AT_RNA_POLYMERASE_II_PROMOTER, SLEBOS_HEAD_AND_NECK_CANCER_WITH_HPV_UP, TATA_C, GOCC_RNA_POLYMERASE_COMPLEX, GOBP_DEVELOPMENTAL_PROCESS_INVOLVED_IN_REPRODUCTION, GOBP_PROTEIN_DNA_COMPLEX_ORGANIZATION, GOCC_TRANSCRIPTION_FACTOR_TFIID_COMPLEX, GOCC_TRANSFERASE_COMPLEX_TRANSFERRING_PHOSPHORUS_CONTAINING_GROUPS, GOCC_RNA_POLYMERASE_II_TRANSCRIPTION_REGULATOR_COMPLEX, GOCC_TRANSFERASE_COMPLEX

GO Biological Process (4): spermatogenesis (GO:0007283), cell differentiation (GO:0030154), RNA polymerase II preinitiation complex assembly (GO:0051123), transcription initiation at RNA polymerase II promoter (GO:0006367)

GO Molecular Function (2): protein binding (GO:0005515), RNA polymerase II general transcription initiation factor activity (GO:0016251)

GO Cellular Component (3): transcription factor TFIID complex (GO:0005669), cytoplasm (GO:0005737), nucleus (GO:0005634)

Reactome top-level categories

Rollup of top-13 pathways:

CategoryPathways
RNA Polymerase II Transcription4
Transcription of the HIV genome2
RNA Polymerase II Transcription Initiation And Promoter Clearance2
Late Phase of HIV Life Cycle1
Regulation of TP53 Activity1
HIV Infection1
HIV Life Cycle1
Viral Infection Pathways1
Generic Transcription Pathway1
Transcriptional Regulation by TP531
Disease1
Gene expression (Transcription)1
Infectious disease1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
transcription by RNA polymerase II2
developmental process involved in reproduction1
male gamete generation1
cellular developmental process1
transcription initiation at RNA polymerase II promoter1
transcription preinitiation complex assembly1
DNA-templated transcription initiation1
binding1
general transcription initiation factor activity1
RNA polymerase II, holoenzyme1
RNA polymerase II transcription regulator complex1
intracellular anatomical structure1
cellular anatomical structure1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

926 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
TAF7LTBPP20226914
TAF7LTBPL1P62380751
TAF7LTAF4O00268658
TAF7LTEX11Q8IYF3649
TAF7LTAF8Q7Z7C8644
TAF7LTAF9BQ9HBM6640
TAF7LTAF5Q15542634
TAF7LTBPL2Q6SJ96628
TAF7LTAF6P49848625
TAF7LNXF2BQ9GZY0618
TAF7LTAF3Q5VWG9616
TAF7LTAF10Q12962578
TAF7LTAF12Q16514571
TAF7LTAF9Q16594548
TAF7LTAF1P21675544

IntAct

12 interactions, top by confidence:

ABTypeScore
NAA10TAF7Lpsi-mi:“MI:0915”(physical association)0.560
TAF7LTAF4psi-mi:“MI:0914”(association)0.530
SRPK2TAF7Lpsi-mi:“MI:0217”(phosphorylation reaction)0.440
LRRK2TAF7Lpsi-mi:“MI:0407”(direct interaction)0.440
TAF7LTUFMpsi-mi:“MI:0915”(physical association)0.400
NEK4E2F8psi-mi:“MI:0914”(association)0.350
TAF7LKLRG2psi-mi:“MI:0914”(association)0.350
NAA10TAF7Lpsi-mi:“MI:0915”(physical association)0.000

BioGRID (46): TAF1 (Affinity Capture-MS), TAF3 (Affinity Capture-MS), TAF9B (Affinity Capture-MS), TAF2 (Affinity Capture-MS), TAF11 (Affinity Capture-MS), TAF6 (Affinity Capture-MS), TAF5 (Affinity Capture-MS), TAF4B (Affinity Capture-MS), TAF9 (Affinity Capture-MS), TAF8 (Affinity Capture-MS), TAF4 (Affinity Capture-MS), TAF12 (Affinity Capture-MS), TAF13 (Affinity Capture-MS), TAF10 (Affinity Capture-MS), UTP6 (Affinity Capture-MS)

ESM2 similar proteins: A2AU37, A5LFW4, A6QNM3, A6QPC8, A9SV60, A9SY64, B0BN28, C0SV12, C6KIE6, F4JET1, O15013, O60566, O65312, O82504, P43124, Q0V7M7, Q10SU5, Q24168, Q28GV1, Q2RBJ4, Q2TBI1, Q3V124, Q4V3E2, Q4V8G2, Q5H9L4, Q5RH01, Q60862, Q641G4, Q6AUQ7, Q75PQ8, Q86VD1, Q8C5W4, Q8H1E8, Q8N140, Q91628, Q96BD8, Q9C689, Q9CPV1, Q9FJX9, Q9LUR0

Diamond homologs: B9DG24, Q15545, Q2HJG8, Q4R5A5, Q5H9L4, Q5R7L9, Q6R1L1, Q9D3R9, Q9R1C0, Q9VHY5, O13701, Q05021

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

95 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic1
Uncertain significance41
Likely benign7
Benign3

Top pathogenic / likely-pathogenic (2)

Variant IDHGVSClassification
3246507NC_000023.10:g.(?100486637)(100662891_?)delPathogenic
4086106NM_001168474.2(TAF7L):c.145+1G>ALikely pathogenic

SpliceAI

1462 predictions. Top by Δscore:

VariantEffectΔscore
X:101275216:TCTTA:Tdonor_loss1.0000
X:101275217:CTTAC:Cdonor_loss1.0000
X:101275218:TTA:Tdonor_loss1.0000
X:101275219:TA:Tdonor_loss1.0000
X:101275221:CCTT:Cdonor_gain1.0000
X:101275279:ATTC:Aacceptor_loss1.0000
X:101275280:TT:Tacceptor_gain1.0000
X:101275281:TCT:Tacceptor_loss1.0000
X:101275282:C:CAacceptor_loss1.0000
X:101275282:C:CCacceptor_gain1.0000
X:101275283:T:Gacceptor_loss1.0000
X:101275995:TTTAC:Tdonor_loss1.0000
X:101275996:TTACC:Tdonor_loss1.0000
X:101275997:TA:Tdonor_loss1.0000
X:101275998:A:Cdonor_loss1.0000
X:101275999:C:CTdonor_loss1.0000
X:101276108:CATGA:Cacceptor_gain1.0000
X:101276109:ATGA:Aacceptor_gain1.0000
X:101276110:TGA:Tacceptor_gain1.0000
X:101276111:GA:Gacceptor_gain1.0000
X:101276113:C:CCacceptor_gain1.0000
X:101276527:TTCTA:Tacceptor_gain1.0000
X:101276529:CTA:Cacceptor_gain1.0000
X:101276532:C:CCacceptor_gain1.0000
X:101278057:A:ACdonor_gain1.0000
X:101278058:C:CAdonor_gain1.0000
X:101278058:CG:Cdonor_gain1.0000
X:101278058:CGG:Cdonor_gain1.0000
X:101278122:C:CCacceptor_gain1.0000
X:101278140:T:TCacceptor_gain1.0000

AlphaMissense

2543 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
X:101277717:A:GW280R0.988
X:101277717:A:TW280R0.988
X:101283501:G:CS162R0.987
X:101283501:G:TS162R0.987
X:101283503:T:GS162R0.987
X:101283562:A:TV142D0.987
X:101287494:A:TI103K0.986
X:101283535:G:TA151D0.984
X:101278061:C:GA275P0.982
X:101278081:A:GL268P0.981
X:101283464:C:GA175P0.980
X:101287488:C:GR105P0.980
X:101287491:A:GL104S0.979
X:101282329:C:TG221D0.978
X:101281738:G:CF234L0.976
X:101281738:G:TF234L0.976
X:101281740:A:GF234L0.976
X:101282442:G:CC183W0.975
X:101282444:A:GC183R0.974
X:101286637:A:TV114D0.974
X:101282449:A:TL181H0.973
X:101282452:A:GM180T0.973
X:101287489:G:TR105S0.972
X:101287494:A:CI103R0.972
X:101277715:C:AW280C0.968
X:101277715:C:GW280C0.968
X:101283499:A:GL163P0.968
X:101283520:A:GL156S0.968
X:101287485:A:GL106P0.968
X:101276031:A:GL418P0.965

dbSNP variants (sampled 300 via entrez): RS1000270309 (X:101284148 G>A), RS1000292111 (X:101288435 G>A), RS1000582636 (X:101290057 A>T), RS1000664021 (X:101288172 C>T), RS1000759115 (X:101280480 A>C), RS1000791768 (X:101280919 C>G,T), RS1000870154 (X:101270107 C>T), RS1000877026 (X:101271537 CTT>C), RS1000953236 (X:101267947 C>T), RS1000972144 (X:101280724 T>C), RS1001059443 (X:101290380 T>C), RS1001525325 (X:101294495 A>G), RS1001699481 (X:101286778 G>A), RS1001764718 (X:101287214 T>C), RS1001902227 (X:101276970 C>A)

Disease associations

OMIM: gene MIM:300314 | disease phenotypes: MIM:300643, MIM:301106

GenCC curated gene-disease

Mondo (3): rolandic epilepsy, intellectual disability, and speech dyspraxia, X-linked (MONDO:0010388), spermatogenic failure, X-linked, 7 (MONDO:0957202), congenital portosystemic shunt (MONDO:0018811)

Orphanet (1): Congenital portosystemic shunt (Orphanet:480531)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

MeSH disease descriptors (1)

DescriptorNameTree numbers
C564467Rolandic Epilepsy, Mental Retardation, and Speech Dyspraxia, X-Linked (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

6 total (human), top 6 by PubMed support.

ChemicalActions (top 5)PubMed papers
arseniteincreases methylation1
cupric chlorideincreases expression1
CGP 52608increases reaction, affects binding1
Decitabinedecreases expression, decreases reaction1
Benzo(a)pyrenedecreases methylation1
Smokedecreases expression, decreases reaction1

Clinical trials (associated diseases)

2 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT06041906Not specifiedENROLLING_BY_INVITATIONInternational Registry of Congenital Portosystemic Shunt (IRCPSS)
NCT07314814Not specifiedNOT_YET_RECRUITINGGenetic Hallmarks of Patients With Congenital Portosystemic Shunts and Portopulmonary Hypertension