CPSF6

gene
On this page

Also known as CFIMHPBRII-4HPBRII-7CFIM68CFIM72

Summary

CPSF6 (cleavage and polyadenylation specific factor 6, HGNC:13871) is a protein-coding gene on chromosome 12q15, encoding Cleavage and polyadenylation specificity factor subunit 6 (Q16630). Component of the cleavage factor Im (CFIm) complex that functions as an activator of the pre-mRNA 3’-end cleavage and polyadenylation processing required for the maturation of pre-mRNA into functional mRNAs. It is a common-essential gene (DepMap: required in 90.1% of cancer cell lines).

The protein encoded by this gene is one subunit of a cleavage factor required for 3’ RNA cleavage and polyadenylation processing. The interaction of the protein with the RNA is one of the earliest steps in the assembly of the 3’ end processing complex and facilitates the recruitment of other processing factors. The cleavage factor complex is composed of four polypeptides. This gene encodes the 68kD subunit. It has a domain organization reminiscent of spliceosomal proteins.

Source: NCBI Gene 11052 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): neurodevelopmental disorder (Limited, GenCC)
  • GWAS associations: 7
  • Clinical variants (ClinVar): 48 total
  • Druggable target: yes
  • Cancer dependency (DepMap): dependent in 90.1% of screened cell lines (common-essential)
  • MANE Select transcript: NM_007007

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:13871
Approved symbolCPSF6
Namecleavage and polyadenylation specific factor 6
Location12q15
Locus typegene with protein product
StatusApproved
AliasesCFIM, HPBRII-4, HPBRII-7, CFIM68, CFIM72
Ensembl geneENSG00000111605
Ensembl biotypeprotein_coding
OMIM604979
Entrez11052

Gene structure

Transcript identifiers

Ensembl transcripts: 41 — 37 protein_coding, 2 nonsense_mediated_decay, 1 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000266679, ENST00000435070, ENST00000456847, ENST00000547486, ENST00000550075, ENST00000550987, ENST00000551516, ENST00000650046, ENST00000886656, ENST00000886657, ENST00000886658, ENST00000886659, ENST00000886660, ENST00000886661, ENST00000886662, ENST00000886663, ENST00000886664, ENST00000886665, ENST00000886666, ENST00000886667, ENST00000927154, ENST00000927155, ENST00000927156, ENST00000927157, ENST00000927158, ENST00000927159, ENST00000927160, ENST00000927161, ENST00000927162, ENST00000927163, ENST00000927164, ENST00000927165, ENST00000927166, ENST00000927167, ENST00000927168, ENST00000927169, ENST00000941456, ENST00000941457, ENST00000941458, ENST00000941459, ENST00000941460

RefSeq mRNA: 2 — MANE Select: NM_007007 NM_001300947, NM_007007

CCDS: CCDS73494, CCDS8988

Canonical transcript exons

ENST00000435070 — 10 exons

ExonStartEnd
ENSE000007522666926004469260197
ENSE000007522676925942869259543
ENSE000007522696925859069259094
ENSE000007522716925773269257905
ENSE000013921186926237369262562
ENSE000017265416926951269274358
ENSE000017642026923956969239706
ENSE000035571506925669769256842
ENSE000035719776925112969251338
ENSE000035838016925305169253154

Expression profiles

Bgee: expression breadth ubiquitous, 291 present calls, max score 95.04.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 64.2456 / max 4716.2916, expressed in 1827 samples.

FANTOM5 promoters (8 alternative TSS)

Promoter IDTPM avgSamples expressed
12665244.97201824
12665313.58371773
1266545.21991481
1266710.2563117
1266700.092642
1266680.045212
1266690.04346
2067880.032511

Top tissues by expression

299 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
ganglionic eminenceUBERON:000402395.04gold quality
bone marrow cellCL:000209294.96gold quality
calcaneal tendonUBERON:000370194.93gold quality
ventricular zoneUBERON:000305394.89gold quality
tonsilUBERON:000237294.36gold quality
adrenal tissueUBERON:001830394.23gold quality
embryoUBERON:000092294.19gold quality
left ovaryUBERON:000211994.16gold quality
right ovaryUBERON:000211893.69gold quality
vermiform appendixUBERON:000115493.53gold quality
sural nerveUBERON:001548893.51gold quality
endometriumUBERON:000129593.48gold quality
cortical plateUBERON:000534393.48gold quality
ovaryUBERON:000099293.47gold quality
islet of LangerhansUBERON:000000693.38gold quality
tendonUBERON:000004393.27gold quality
lymph nodeUBERON:000002992.74gold quality
caecumUBERON:000115392.57gold quality
stromal cell of endometriumCL:000225592.51gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099192.42gold quality
bone marrowUBERON:000237192.32gold quality
palpebral conjunctivaUBERON:000181292.28gold quality
colonic epitheliumUBERON:000039792.16gold quality
seminal vesicleUBERON:000099892.14gold quality
body of uterusUBERON:000985392.14gold quality
right hemisphere of cerebellumUBERON:001489092.09gold quality
monocyteCL:000057691.98gold quality
mononuclear cellCL:000084291.88gold quality
uterusUBERON:000099591.85gold quality
epithelium of nasopharynxUBERON:000195191.82gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-CURD-88yes3.90
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

332 targeting CPSF6, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-8485100.0077.574731
HSA-MIR-3163100.0077.238605
HSA-MIR-4533100.0069.482758
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-3646100.0073.565283
HSA-MIR-4262100.0073.263931
HSA-MIR-5692A100.0074.406850
HSA-LET-7A-3P100.0074.033932
HSA-LET-7B-3P100.0074.083913
HSA-LET-7F-1-3P100.0074.023928
HSA-MIR-98-3P100.0074.083907
HSA-MIR-428299.9975.366408
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-366299.9973.825684
HSA-MIR-4789-3P99.9970.752484
HSA-MIR-181A-5P99.9972.962995
HSA-MIR-181B-5P99.9972.972996
HSA-MIR-181C-5P99.9972.952996
HSA-MIR-181D-5P99.9973.042997
HSA-MIR-513B-5P99.9969.962150
HSA-MIR-548AW99.9972.573559
HSA-MIR-34A-5P99.9971.211784
HSA-MIR-449A99.9971.051776
HSA-MIR-318599.9968.121959

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 90.1% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 40)

  • Data show that cleavage factor I(m) can stimulate both cleavage and poly(adenine) addition, and can act to suppress poly(A) site cleavage in a sequence-dependent manner. (PMID:14690600)
  • Elevated levels of human cleavage factor Im suppressed cleavage at the primary poly(A) site of the pre-mRNA encoding the 68 kDa subunit of CFI(m). (PMID:14690600)
  • region in the subunit of CF I(m) involved in RNA binding, protein-protein interactions, and subcellular localization (PMID:15169763)
  • Key paper demonstrating mechanism of CF Im binding to CPSF. Also challenges the long standing perception of mammalian mRNA 3’-end processing that insists that mammalian poly (A) sites are determined by two sequence elements alone. (PMID:15937220)
  • a direct role of the U2AF 65/CF I(m) 59 interaction in the functional coordination of splicing and 3’ end processing. (PMID:17024186)
  • CF Im68 subnuclear distribution and mobility (PMID:17267687)
  • These results reveal a novel function for the pre-mRNA 3’ end processing factor CF I(m)68 in mRNA export. (PMID:19864460)
  • HIV-1 harboring the N74D mutation in capsid protein fails to interact with CPSF6 and evades the nuclear import restriction. (PMID:20227665)
  • Data provide evidence that CFIm exists as a heterotetramer of 25-kD, 59-kD and 68-kD subunits of CFIm: CFIm25, CFIm59 and CFIm68. (PMID:20695905)
  • Crystal structure of CFIm25-CFIm68 RRM heterotetramer illustrated that CFIm25 homodimer is clamped by two CFIm68 monomers on each side of the dimer interface (PMID:21295486)
  • Structure of a CFI(m)25/CFI(m)68 RRM heterotetramer and biochemical data indicated that CFIm25 specifically recognized two UGUA elements, CFIm68 facilitates looping of the intervening RNA. CFIm-mediated RNA looping may regulate alternative polyadenylation (PMID:21295486)
  • The study reports the crystal structure of human cleavage factor I(m), comprising cleavage factor I(m)25 and the RNA recognition motif domain of cleavage factor I(m)68 kDa (CF I(m)68RRM). (PMID:21483454)
  • A 9-residue stretch of hydrophobic amino acids in CPSF6 restricts HIV-1 infection through an interaction with capsid. (PMID:22301135)
  • Knockdown of CF I(m)68 induced a systematic use of proximal polyadenylation sites, indicating that changes in relative abundance of a single 3’ end processing factor can modulate the length of 3’ untranslated regions across the transcriptome. (PMID:22813749)
  • CPSF6 binds specifically to a novel proteinprotein interface on the N-terminal domain of HIV-1 capsid protein (CA). (PMID:22956906)
  • Loss-of-function of CFIm68 and CFIm25 leads to a transcriptome-wide increase in the use of proximal polyadenylation sites in HEK293 cells. (PMID:23187700)
  • TNPO3 promotes HIV-1 infectivity indirectly, by shifting the CA-binding protein CPSF6 to the nucleus, thus preventing the excessive HIV-1 CA stability that would otherwise result from cytoplasmic accumulation of CPSF6. (PMID:23414560)
  • These results suggested that inhibition of HIV-1 by TNPO3-depleted cells requires CPSF6. (PMID:23622145)
  • A carboxy-terminally truncated human CPSF6 lacking residues encoded by exon 6 inhibits HIV-1 cDNA synthesis and promotes capsid disassembly. (PMID:23658440)
  • study uncovers two opposing CA-dependent functions of CPSF6 in HIV-1 replication in vivo; however, the benefit for binding CPSF6 appears to outweigh the cost, providing support for a vital function of CPSF6 during HIV-1 replication in vivo (PMID:24415937)
  • NUP153 and CPSF6 have overlapping binding sites, but each makes unique capsid monomers (CA) interactions. Multiple ligands share an overlapping interface in HIV-1 capsid that is lost upon viral disassembly. (PMID:25356722)
  • Structural basis of HIV-1 capsid recognition by PF74 and CPSF6. (PMID:25518861)
  • integration targeting proceeds via two distinct mechanisms: capsid-CPSF6 binding directs HIV-1 to actively transcribed euchromatin, where the integrase-LEDGF/p75 interaction drives integration into gene bodies. (PMID:26858452)
  • CPSF6 binding to its docking site in HIV-1 capsid leads to the recruitment of CFIm tetramer, suggesting that CFIm mediates CPSF6 function(s) in integration site targeting. (PMID:26994143)
  • Studies suggest that binding of cleavage factor Im (CFIm) may be one of the earliest steps in promoting formation of an active cleavage complex. (PMID:27528751)
  • we found CPSF6 and all core paraspeckles proteins to be overexpressed in human breast cancer cases and their expression to correlate with poor patient outcomes. (PMID:28673861)
  • The authors demonstrate that CFIm functions as an enhancer-dependent activator of mRNA 3’ processing. CFIm regulates global alternative polyadenylation by specifically binding and activating enhancer-containing poly(A) sites. (PMID:29276085)
  • A novel CPSF6-RARG fusion transcript in acute myeloid leukemia. (PMID:29568099)
  • High CPSF6 expression is associated with HIV infections. (PMID:29643241)
  • CPSF6-capsid interactions allow the virus to bypass peripheral heterochromatin and penetrate the nuclear structure for integration. (PMID:30173955)
  • Study shows that the cellular alternative RNA processing factor CPSF6 plays an important role in the ability of NP1 to suppress internal polyadenylation at (pA)p, in influencing viral mRNA export, and, thus, in modulating gene expression of minute virus of canines, a parvovirus.The cellular RNA processing factor CPSF6 interacted with NP1 in transfected cells and participated with NP1 to modulate its effects. (PMID:30355695)
  • nuclear entry of HIV-1 subviral complexes in macrophages is mediated by consecutive binding of Nup153 and CPSF6. (PMID:30672737)
  • The crystal structure of the RSLD-TNPO3 complex revealed potential CPSF6 interaction residues, which were confirmed to mediate TNPO3 binding and CPSF6 nuclear import. Both binding and nuclear import were independent of RSLD phosphorylation, though a hyperphosphorylated mimetic mutant failed to bind TNPO3 and mislocalized to the cell cytoplasm (PMID:30916345)
  • HIV-1 is more dependent on the K182 capsid residue than HIV-2 for interactions with CPSF6. (PMID:31071616)
  • pol II Chromatin immunoprecipitation-quantitative PCR on a subset of protein coding genes after knocking down CFIm68. (PMID:31477156)
  • The identification of the direct interaction between viral NP1 and host CPSF6 provides new insights into the mechanism by which a viral small nonstructural protein facilitates the multiple regulation of viral gene expression and replication and reveals a novel target for potent antiviral drug development. (PMID:31666379)
  • Largely intact HIV-1 capsids dock at the nuclear pore in infected SupT1-R5 cells, with CPSF6 being a facilitator of nucleoplasmic entry in this cell type. (PMID:31690677)
  • The 4th and 112th Residues of Viral Capsid Cooperatively Modulate Capsid-CPSF6 Interactions of HIV-1. (PMID:31941344)
  • Requirement for cleavage factor IIm in the control of alternative polyadenylation in breast cancer cells. (PMID:32295865)
  • MxB impedes the NUP358-mediated HIV-1 pre-integration complex nuclear import and viral replication cooperatively with CPSF6. (PMID:32600399)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriocpsf6ENSDARG00000018618
mus_musculusCpsf6ENSMUSG00000055531
rattus_norvegicusCpsf6ENSRNOG00000005927
drosophila_melanogasterCpsf6FBGN0035872
caenorhabditis_elegansWBGENE00008362

Paralogs (1): CPSF7 (ENSG00000149532)

Protein

Protein identifiers

Cleavage and polyadenylation specificity factor subunit 6Q16630 (reviewed: Q16630)

Alternative names: Cleavage and polyadenylation specificity factor 68 kDa subunit, Cleavage factor Im complex 68 kDa subunit, Pre-mRNA cleavage factor Im 68 kDa subunit, Protein HPBRII-4/7

All UniProt accessions (5): Q16630, A0A3B3ISW1, F8VNP8, F8W084, F8WJN3

UniProt curated annotations — full annotation on UniProt →

Function. Component of the cleavage factor Im (CFIm) complex that functions as an activator of the pre-mRNA 3’-end cleavage and polyadenylation processing required for the maturation of pre-mRNA into functional mRNAs. CFIm contributes to the recruitment of multiprotein complexes on specific sequences on the pre-mRNA 3’-end, so called cleavage and polyadenylation signals (pA signals). Most pre-mRNAs contain multiple pA signals, resulting in alternative cleavage and polyadenylation (APA) producing mRNAs with variable 3’-end formation. The CFIm complex acts as a key regulator of cleavage and polyadenylation site choice during APA through its binding to 5’-UGUA-3’ elements localized in the 3’-untranslated region (UTR) for a huge number of pre-mRNAs. CPSF6 enhances NUDT21/CPSF5 binding to 5’-UGUA-3’ elements localized upstream of pA signals and promotes RNA looping, and hence activates directly the mRNA 3’-processing machinery. Plays a role in mRNA export. (Microbial infection) Binds HIV-1 capsid-nucleocapsid (HIV-1 CA-NC) complexes and might thereby promote the integration of the virus in the nucleus of dividing cells (in vitro).

Subunit / interactions. Component of the cleavage factor Im (CFIm) complex which is a heterotetramer composed of two subunits of NUDT21/CPSF5 and two subunits of CPSF6 or CPSF7 or a heterodimer of CPSF6 and CPSF7. The cleavage factor Im (CFIm) complex associates with the CPSF and CSTF complexes to promote the assembly of the core mRNA 3’-processing machinery. Associates with the exon junction complex (EJC). Associates with the 80S ribosome particle. Interacts (via the RRM domain) with NUDT21/CPSF5; this interaction is direct and enhances binding to RNA. Interacts (via Arg/Ser-rich domain) with FIP1L1 (preferentially via unphosphorylated form and Arg/Glu/Asp-rich domain); this interaction mediates, at least in part, the interaction between the CFIm and CPSF complexes and may be inhibited by CPSF6 hyper-phosphorylation. Interacts (via N-terminus) with NXF1; this interaction is direct. Interacts with SRSF3. Interacts with SRSF7. Interacts with SNRNP70. Interacts with TRA2B/SFRS10. Interacts with UPF1. Interacts with UPF3B. Interacts with VIRMA. Interacts (via Arg/Ser-rich domain) with TNPO3; promoting nuclear import of CPSF6 independently of its phosphorylation status. Interacts with YTHDC1. (Microbial infection) Interacts (via C-terminus) with HIV-1 capsid protein p24 (CA).

Subcellular location. Nucleus. Nucleoplasm. Nucleus speckle. Cytoplasm.

Post-translational modifications. Phosphorylated. Phosphorylated in the Arg/Ser-rich domain by SRPK1, in vitro. Symmetrically dimethylated on arginine residues in the GAR motif by PRMT5 in a WDR77- and CLNS1A-dependent manner. Asymmetrically dimethylated on arginine residues in the GAR motif by PRMT1.

Domain organisation. Contains an Arg/Ser-rich domain composed of arginine-serine dipeptide repeats within the C-terminal region that is necessary and sufficient for activating mRNA 3’-processing and alternative polyadenylation (APA).

Similarity. Belongs to the RRM CPSF6/7 family.

Isoforms (3)

UniProt IDNamesCanonical?
Q16630-11yes
Q16630-22
Q16630-33

RefSeq proteins (2): NP_001287876, NP_008938* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000504RRM_domDomain
IPR012677Nucleotide-bd_a/b_plait_sfHomologous_superfamily
IPR034769CPSF6_RRMDomain
IPR034772CPSF6/7Family
IPR035979RBD_domain_sfHomologous_superfamily
IPR057951CPSF6/7_RSLD_NDomain

Pfam: PF00076, PF25524

UniProt features (56 total): region of interest 11, mutagenesis site 11, compositionally biased region 9, modified residue 8, strand 5, helix 4, splice variant 2, turn 2, chain 1, domain 1, short sequence motif 1, sequence conflict 1

Structure

Experimental structures (PDB)

16 structures.

PDBMethodResolution (Å)
4B4NX-RAY DIFFRACTION1.81
4U0AX-RAY DIFFRACTION2.05
6AY9X-RAY DIFFRACTION2.5
4WYMX-RAY DIFFRACTION2.6
3P5TX-RAY DIFFRACTION2.7
6GX9X-RAY DIFFRACTION2.7
4U0BX-RAY DIFFRACTION2.8
7SNQX-RAY DIFFRACTION2.81
3P6YX-RAY DIFFRACTION2.9
3Q2SX-RAY DIFFRACTION2.9
7ZUDX-RAY DIFFRACTION2.93
3Q2TX-RAY DIFFRACTION3.06
9CNVELECTRON MICROSCOPY3.16
8GDVX-RAY DIFFRACTION3.3
8CL1ELECTRON MICROSCOPY3.35
8EJLELECTRON MICROSCOPY3.9

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q16630-F165.110.23

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (8): 157, 404, 407, 494, 500, 511, 513, 525

Mutagenesis-validated functional residues (11):

PositionPhenotype
84reduces affinity for ugua rna by 40%; when associated with a-128.
86abolishes interaction with nudt21/cpsf5; when associated with v-87.
87abolishes interaction with nudt21/cpsf5; when associated with v-86.
90–91reduces affinity for ugua rna by 70%. strongly reduced affinity for ugua rna; when associated with a-94.
94strongly reduced affinity for ugua rna; when associated with 90-a-a-91.
111reduces affinity for ugua rna by 85%.
126increases affinity for ugua rna by 40%.
128reduces affinity for ugua rna by 40%; when associated with a-84.
202decreased methylation in presence of prmt5/wdr77. loss of methylation in presence of prmt5/wdr77 or prmt1; when associat
204decreased methylation in presence of prmt5/wdr77. loss of methylation in presence of prmt5/wdr77 or prmt1; when associat
206loss of methylation in presence of prmt5/wdr77 or prmt1.

Function

Pathways and Gene Ontology

Reactome pathways

8 pathways

IDPathway
R-HSA-1839117Signaling by cytosolic FGFR1 fusion mutants
R-HSA-5655302Signaling by FGFR1 in disease
R-HSA-72187mRNA 3’-end processing
R-HSA-72203Processing of Capped Intron-Containing Pre-mRNA
R-HSA-73856RNA Polymerase II Transcription Termination
R-HSA-77595Processing of Intronless Pre-mRNAs
R-HSA-9770562mRNA Polyadenylation
R-HSA-9918481Dengue Virus-Host Interactions

MSigDB gene sets: 298 (showing top): AAGCAAT_MIR137, GAUSSMANN_MLL_AF4_FUSION_TARGETS_A_UP, GNF2_MCM5, GOBP_REGULATION_OF_NUCLEOBASE_CONTAINING_COMPOUND_TRANSPORT, PUJANA_CHEK2_PCC_NETWORK, GOBP_POSITIVE_REGULATION_OF_INTRACELLULAR_TRANSPORT, GOBP_NUCLEAR_TRANSPORT, BROWNE_HCMV_INFECTION_48HR_DN, GOBP_PROTEIN_HETEROTETRAMERIZATION, GOBP_REGULATION_OF_NUCLEOCYTOPLASMIC_TRANSPORT, MILI_PSEUDOPODIA_HAPTOTAXIS_UP, GAZDA_DIAMOND_BLACKFAN_ANEMIA_PROGENITOR_DN, REACTOME_MRNA_3_END_PROCESSING, GOBP_REGULATION_OF_CELLULAR_LOCALIZATION, REACTOME_PROCESSING_OF_CAPPED_INTRON_CONTAINING_PRE_MRNA

GO Biological Process (7): mRNA processing (GO:0006397), mRNA 3’-end processing (GO:0031124), positive regulation of RNA export from nucleus (GO:0046833), protein tetramerization (GO:0051262), protein heterotetramerization (GO:0051290), mRNA alternative polyadenylation (GO:0110104), co-transcriptional mRNA 3’-end processing, cleavage and polyadenylation pathway (GO:0180010)

GO Molecular Function (6): RNA binding (GO:0003723), mRNA binding (GO:0003729), ribosomal large subunit binding (GO:0043023), exon-exon junction complex binding (GO:1990448), nucleic acid binding (GO:0003676), protein binding (GO:0005515)

GO Cellular Component (11): nucleus (GO:0005634), nucleoplasm (GO:0005654), perichromatin fibrils (GO:0005726), cytoplasm (GO:0005737), mRNA cleavage and polyadenylation specificity factor complex (GO:0005847), mRNA cleavage factor complex (GO:0005849), membrane (GO:0016020), nuclear speck (GO:0016607), interchromatin granule (GO:0035061), paraspeckles (GO:0042382), ribonucleoprotein complex (GO:1990904)

Reactome top-level categories

Rollup of top-8 pathways:

CategoryPathways
FGFR1 mutant receptor activation1
Signaling by FGFR in disease1
Processing of Capped Intron-Containing Pre-mRNA1
Metabolism of RNA1
RNA Polymerase II Transcription1
Processing of Capped Intronless Pre-mRNA1
mRNA 3’-end processing1
Dengue Virus Infection1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure4
mRNA 3’-end processing2
binding2
nuclear ribonucleoprotein granule2
RNA processing1
mRNA metabolic process1
mRNA processing1
RNA 3’-end processing1
RNA export from nucleus1
positive regulation of nucleobase-containing compound transport1
positive regulation of nucleocytoplasmic transport1
regulation of RNA export from nucleus1
protein complex oligomerization1
protein tetramerization1
protein heterooligomerization1
co-transcriptional RNA 3’-end processing, cleavage and polyadenylation pathway1
nucleic acid binding1
RNA binding1
ribosome binding1
protein-containing complex binding1
intracellular membrane-bounded organelle1
nuclear lumen1
nuclear chromosome1
chromatin1
intracellular anatomical structure1
mRNA cleavage factor complex1
nuclear protein-containing complex1
nuclear body1
protein-containing complex1

Protein interactions and networks

STRING

2172 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CPSF6NUDT21O43809999
CPSF6U2AF2P26368970
CPSF6VIRMAQ69YN4948
CPSF6YTHDC1Q96MU7925
CPSF6PCF11O94913869
CPSF6SYMPKQ92797866
CPSF6NXF1Q9UBU9857
CPSF6PAPOLGQ9BWT3851
CPSF6PAPOLAP51003850
CPSF6NUP153P49790849
CPSF6PAPOLBQ9NRJ5846
CPSF6CSTF2P33240839
CPSF6CLP1Q92989834
CPSF6CPSF3Q9UKF6827
CPSF6RANBP2P49792817

IntAct

193 interactions, top by confidence:

ABTypeScore
CPSF6NUDT21psi-mi:“MI:0407”(direct interaction)0.890
CPSF6NUDT21psi-mi:“MI:0914”(association)0.890
NUDT21CPSF6psi-mi:“MI:0915”(physical association)0.890
CPSF6NUDT21psi-mi:“MI:0915”(physical association)0.890
CPSF6WWP1psi-mi:“MI:0915”(physical association)0.780
WWP1CPSF6psi-mi:“MI:0915”(physical association)0.780
SCYL1SEC31Apsi-mi:“MI:0914”(association)0.710
ARMC7CPSF6psi-mi:“MI:0915”(physical association)0.670
PPIL1CPSF6psi-mi:“MI:0915”(physical association)0.670
CPSF6PPIL1psi-mi:“MI:0915”(physical association)0.670
CPSF6ARMC7psi-mi:“MI:0915”(physical association)0.670
CPSF6WWP2psi-mi:“MI:0915”(physical association)0.660
FXR2CSNK2A1psi-mi:“MI:0914”(association)0.640
CPSF6TOLLIPpsi-mi:“MI:0915”(physical association)0.560
OTUB2CPSF6psi-mi:“MI:0915”(physical association)0.560
TOLLIPCPSF6psi-mi:“MI:0915”(physical association)0.560
CPSF6OTUB2psi-mi:“MI:0915”(physical association)0.560
SRPK2RRP9psi-mi:“MI:0914”(association)0.530
PLSCR1CPSF6psi-mi:“MI:0915”(physical association)0.510
CPSF6PLSCR1psi-mi:“MI:0915”(physical association)0.510
CPSF6DDX39Apsi-mi:“MI:0914”(association)0.480
NUDT21HNRNPKpsi-mi:“MI:0914”(association)0.460

BioGRID (654): CPSF6 (Affinity Capture-MS), PPIL1 (Two-hybrid), TOLLIP (Two-hybrid), OTUB2 (Two-hybrid), ARMC7 (Two-hybrid), CPSF6 (Affinity Capture-MS), CPSF6 (Affinity Capture-MS), CPSF6 (Affinity Capture-MS), CPSF6 (Affinity Capture-MS), CPSF6 (Affinity Capture-MS), CPSF6 (Affinity Capture-MS), CPSF6 (Affinity Capture-MS), CPSF6 (Affinity Capture-MS), CPSF6 (Two-hybrid), GIN1 (Two-hybrid)

ESM2 similar proteins: A1A6K6, B9DFV2, F4HWF9, F4JCU0, F4JP52, F4KDN0, G7ID19, O22812, O80567, O80678, Q08A72, Q0D3J9, Q0P5D2, Q0VBL3, Q10M00, Q16630, Q3MHY8, Q5NVH8, Q5XI29, Q5ZL34, Q6ATR0, Q6DDW4, Q6JB11, Q6NWC6, Q6ZK57, Q7KMJ6, Q7Z5Q1, Q812E0, Q8BTV2, Q8H0P8, Q8LPQ9, Q8N684, Q8W4I9, Q940D0, Q94CJ8, Q96T37, Q9CQT2, Q9LES2, Q9LKA4, Q9LKA5

Diamond homologs: A0A0D1C8Z4, Q0P5D2, Q16630, Q5NVH8, Q5XI29, Q5ZL34, Q6DDW4, Q6NVF9, Q6NWC6, Q8BTV2, Q8N684, Q9VSH4, P10979, Q99070, Q03251, Q9SIX3

SIGNOR signaling

1 interactions.

AEffectBMechanism
CPSF6“form complex”“CFI complex”binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 185 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
mRNA 3’-end processing710.4×5e-04
Processing of Capped Intron-Containing Pre-mRNA159.3×2e-08
mRNA Polyadenylation149.2×7e-08
mRNA Splicing119.1×5e-06
mRNA Splicing - Major Pathway218.6×2e-11
SUMO E3 ligases SUMOylate target proteins68.1×9e-03
Dengue Virus-Host Interactions165.5×5e-06
Metabolism of RNA144.4×4e-04

GO biological processes:

GO termPartnersFoldFDR
negative regulation of mRNA splicing, via spliceosome523.2×4e-04
mRNA splicing, via spliceosome1810.0×3e-10
RNA splicing147.5×4e-06
mRNA processing146.7×1e-05
protein phosphorylation135.3×3e-04

Disease & clinical

Clinical variants and AI predictions

ClinVar

48 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance34
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

2163 predictions. Top by Δscore:

VariantEffectΔscore
12:69239704:CAGG:Cdonor_loss1.0000
12:69239707:GTAC:Gdonor_loss1.0000
12:69251124:CTCAG:Cacceptor_loss1.0000
12:69251127:AGG:Aacceptor_loss1.0000
12:69251128:G:Tacceptor_loss1.0000
12:69251191:A:AGacceptor_gain1.0000
12:69251192:A:Gacceptor_gain1.0000
12:69251302:A:Tdonor_gain1.0000
12:69251334:CATGG:Cdonor_gain1.0000
12:69251335:ATGG:Adonor_gain1.0000
12:69251336:TGG:Tdonor_gain1.0000
12:69251337:GG:Gdonor_gain1.0000
12:69251337:GGG:Gdonor_gain1.0000
12:69251337:GGGTA:Gdonor_loss1.0000
12:69251338:GG:Gdonor_gain1.0000
12:69251339:G:GAdonor_loss1.0000
12:69251339:G:GGdonor_gain1.0000
12:69253020:A:AGacceptor_gain1.0000
12:69253020:AAT:Aacceptor_gain1.0000
12:69253152:GGG:Gdonor_gain1.0000
12:69253153:GGG:Gdonor_gain1.0000
12:69256838:GAAAA:Gdonor_gain1.0000
12:69256843:G:GGdonor_gain1.0000
12:69257727:TGCA:Tacceptor_loss1.0000
12:69257728:GCAG:Gacceptor_loss1.0000
12:69257729:CA:Cacceptor_loss1.0000
12:69257730:A:AGacceptor_gain1.0000
12:69257731:G:GAacceptor_gain1.0000
12:69257731:GCT:Gacceptor_gain1.0000
12:69257731:GCTA:Gacceptor_gain1.0000

AlphaMissense

3529 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
12:69239671:G:CD9H1.000
12:69239675:T:GI10S1.000
12:69251163:T:CL32S1.000
12:69251318:T:GY84D1.000
12:69251322:T:AI85N1.000
12:69251324:G:AG86R1.000
12:69251324:G:CG86R1.000
12:69251325:G:AG86E1.000
12:69251325:G:TG86V1.000
12:69251331:T:CL88P1.000
12:69251331:T:GL88R1.000
12:69251336:T:AW90R1.000
12:69251336:T:CW90R1.000
12:69251338:G:CW90C1.000
12:69251338:G:TW90C1.000
12:69253051:T:AW91R1.000
12:69253051:T:CW91R1.000
12:69253053:G:CW91C1.000
12:69253053:G:TW91C1.000
12:69253070:T:CL97S1.000
12:69253120:T:CF114L1.000
12:69253121:T:CF114S1.000
12:69253122:T:AF114L1.000
12:69253122:T:GF114L1.000
12:69253126:G:AE116K1.000
12:69253127:A:TE116V1.000
12:69253131:T:AN117K1.000
12:69253131:T:GN117K1.000
12:69253140:T:AN120K1.000
12:69253140:T:GN120K1.000

dbSNP variants (sampled 300 via entrez): RS1000036530 (12:69242320 A>G,T), RS1000088484 (12:69242652 C>G), RS1000199292 (12:69257205 T>C), RS1000206522 (12:69267505 A>C), RS1000241726 (12:69248727 G>A,T), RS1000301788 (12:69253774 A>C,G), RS1000327943 (12:69254306 T>A), RS1000356932 (12:69247696 A>G), RS1000389587 (12:69247973 A>G), RS1000401484 (12:69254630 C>G,T), RS1000626408 (12:69261133 T>C), RS1000758668 (12:69274105 T>G), RS1000839188 (12:69243295 A>T), RS1001012722 (12:69237879 T>C), RS1001041054 (12:69243755 G>A,T)

Disease associations

OMIM: gene MIM:604979 | disease phenotypes:

GenCC curated gene-disease

DiseaseClassificationInheritance
neurodevelopmental disorderLimitedAutosomal dominant

Mondo (1): neurodevelopmental disorder (MONDO:0700092)

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

7 associations (top):

StudyTraitp-value
GCST000175_26Height3.000000e-06
GCST002515_1Pneumoconiosis in silica exposure2.000000e-08
GCST007327_89Smoking status (ever vs never smokers)7.000000e-10
GCST007928_2Medication use (diuretics)1.000000e-11
GCST007930_131Medication use (agents acting on the renin-angiotensin system)8.000000e-12
GCST008810_82Smoking initiation (ever regular vs never regular)2.000000e-09
GCST011703_10Smoking initiation2.000000e-10

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0004318smoking behavior
EFO:0009928Diuretic use measurement
EFO:0009931Agents acting on the renin-angiotensin system use measurement
EFO:0005670smoking initiation

MeSH disease descriptors (1)

DescriptorNameTree numbers
D065886Neurodevelopmental DisordersF03.625

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL6067065 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

2 potent at pChembl≥5 of 4 total, top 2 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
6.46Kd348.6nMCHEMBL5653589
6.46ED50348.6nMCHEMBL5653589

PubChem BioAssay actives

1 with measured affinity, of 4 total; 1 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
4-methyl-3-[(2-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2148135: Binding affinity to human CPSF6 incubated for 45 mins by Kinobead based pull down assaykd0.3486uM

CTD chemical–gene interactions

44 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, decreases expression, affects expression, decreases methylation9
bisphenol Aaffects expression, decreases expression, increases expression4
sodium arseniteincreases activity, decreases expression2
Phenylmercuric Acetateaffects cotreatment, decreases expression2
aristolochic acid Idecreases expression1
FR900359increases phosphorylation1
triphenyl phosphateaffects expression1
pyrogallol 1,3-dimethyl etheraffects localization, increases expression, affects cotreatment, decreases expression1
decabromobiphenyl etherdecreases expression1
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
cobaltous chloridedecreases expression1
tetrabromobisphenol Adecreases expression1
di-n-butylphosphoric acidaffects expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression1
2,2’,4,4’-tetrabromodiphenyl etherdecreases expression1
dorsomorphinaffects cotreatment, decreases expression1
pentabrominated diphenyl ether 100decreases expression1
hexabrominated diphenyl ether 153decreases expression1
NSC 689534affects binding, decreases expression1
3-(2-hydroxy-4-(2-methylnonan-2-yl)phenyl)cyclohexan-1-oldecreases expression1
Leflunomidedecreases expression1
Acetaminophenincreases expression1
Benzo(a)pyreneincreases methylation1
Caffeinedecreases phosphorylation1
Carbamazepineaffects expression1
Copperaffects binding, decreases expression1
Demecolcinedecreases expression1
Diclofenacaffects expression1
Doxorubicinaffects expression1
Ethyl Methanesulfonatedecreases expression1

ChEMBL screening assays

1 unique, capped per target: 1 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5651177BindingBinding affinity to human CPSF6 incubated for 45 mins by Kinobead based pull down assayNVP-BHG712: Effects of Regioisomers on the Affinity and Selectivity toward the EPHrin Family. — ChemMedChem

Clinical trials (associated diseases)

202 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT04586348PHASE4UNKNOWNPrenatal Iodine Supplementation and Early Childhood Neurodevelopment
NCT04873115PHASE4UNKNOWNDouble-blind, Placebo-controlled, Randomized Clinical Trial Comparing the Efficacy and Safety of Sialanar Plus orAl rehabiLitation Against Placebo Plus Oral Rehabilitation for chIldren and Adolescents With seVere Sialorrhoea and Neurodisabilties,
NCT02559102PHASE3COMPLETEDDexmedetomidine Sedation Versus General Anaesthesia for Inguinal Hernia Surgery in Infants
NCT02757079PHASE3COMPLETEDStudy of the Efficacy and Safety of NPC-15 for Sleep Disorders of Children With Neurodevelopmental Disorders
NCT06915480PHASE3RECRUITINGReducing Missed Appointments
NCT07377032PHASE3RECRUITINGTAP-GRIN: Interventional Study on Patients With GRIN-related Neurodevelopmental Disorders
NCT02909959PHASE2COMPLETEDSulforaphane for the Treatment of Young Men With Autism Spectrum Disorder
NCT06081348PHASE2RECRUITINGSertraline vs. Placebo in the Treatment of Anxiety in Children and AdoLescents With NeurodevelopMental Disorders
NCT06352372PHASE2COMPLETEDSafety and Efficacy of tPBM for Epileptiform Activity in Autism
NCT00503191PHASE1COMPLETEDNeuroModulation Technique Treatment of Autism
NCT04475848PHASE1COMPLETEDA Study to Investigate the Safety, Tolerability, Pharmacokinetics, Pharmacodynamics and Food Effect of RO6953958 in Healthy Participants
NCT06300398PHASE1COMPLETEDIAMA-6 Oral Dose Study in Healthy Adults
NCT01783041PHASE2/PHASE3COMPLETEDEffect of Early L-Carnitine Supplementation on Neurodevelopmental Outcomes in Very Preterm Infants
NCT05767385PHASE2/PHASE3RECRUITINGFetal Cerebrovascular Autoregulation in Congenital Heart Disease and Association With Neonatal Neurobehavior
NCT05675098EARLY_PHASE1NOT_YET_RECRUITINGCentral Nervous System Stimulants and Physical Function in Children With Cerebral Palsy
NCT00783783Not specifiedCOMPLETEDCYP2D6 Pharmacogenetics in Risperidone-Treated Children
NCT01778504Not specifiedRECRUITINGStudying Childhood-onset Behavioral, Psychiatric, and Developmental Disorders
NCT01850784Not specifiedUNKNOWNHigh Energy Formula Feeding in Infants With Congenital Heart Disease
NCT01922791Not specifiedCOMPLETEDNutrition and Pregnancy Intervention Study
NCT01942525Not specifiedUNKNOWNInfluence of Intrauterine Growth Restriction on Amplitude-integrated EEG in Preterm Infants
NCT02003170Not specifiedCOMPLETEDEtiology and Early Diagnosis of Neurodevelopmental Disorders
NCT02118649Not specifiedACTIVE_NOT_RECRUITINGEnhancing Behavior and Brain Response to Visual Targets Using a Computer Game
NCT02557191Not specifiedTERMINATEDBiomarkers, Neurodevelopment and Preterm Infants
NCT02690675Not specifiedCOMPLETEDIron Supplement Effect on Child Development
NCT02694003Not specifiedCOMPLETEDBetter Nights, Better Days for Children With Neurodevelopment Disorders
NCT02792894Not specifiedCOMPLETEDFamily Networks (FaNs) for Children With Developmental Disorders and Delays
NCT02871674Not specifiedUNKNOWNGood Night Project: Behavioural Sleep Interventions for Children With ADHD: A Randomised Controlled Trial
NCT02887157Not specifiedCOMPLETEDAnalyzing Retinal Microanatomy in ROP
NCT02898298Not specifiedCOMPLETEDPositive Emotion Regulation Training in Children, Adolescents and Young Adults With and Without Developmental Disorder
NCT02912780Not specifiedUNKNOWNIntroduction of Microsystems in a Level 3 Neonatal Intensive Care Unit
NCT03023293Not specifiedCOMPLETEDn-3 PUFAs, Irisin and Maternal Glucose Metabolism From Pregnancy to Postpartum
NCT03023644Not specifiedCOMPLETEDImproving Neurodevelopmental Outcomes in Children With Congenital Heart Disease: An Intervention Study
NCT03032991Not specifiedUNKNOWNEarly Biomarkers of Neurodevelopment in Offspring of Diabetic Mothers
NCT03088189Not specifiedTERMINATEDEffect of Parental Peri-conceptional Vitamin B12 Supplementation on Infant Neurocognitive Development in Offspring
NCT03096028Not specifiedCOMPLETEDDevelopmental Origins of Mental Health Disorders
NCT03148782Not specifiedCOMPLETEDBrain Plasticity Underlying Acquisition of New Organizational Skills in Children-R61 Phase
NCT03172104Not specifiedCOMPLETEDNeurobehavioural Development of Infants Born <30 Weeks Gestational Age Between Birth and Five Years of Age
NCT03222375Not specifiedRECRUITINGSQUED™ Series 28.1 Home-use and Treatment of Autowave Reverberator of Autism
NCT03229928Not specifiedCOMPLETEDClinical Testing of a Real-Time Behavior Measurement Tool: Measuring Outcomes for CHAnge
NCT03232489Not specifiedUNKNOWNStudy for the Evaluation of the Feasibility of Applying Advanced MRI Scanning in Pediatric Clinical Practice