CPSF3

gene
On this page

Also known as CPSF-73CPSF73YSH1

Summary

CPSF3 (cleavage and polyadenylation specific factor 3, HGNC:2326) is a protein-coding gene on chromosome 2p25.1, encoding Cleavage and polyadenylation specificity factor subunit 3 (Q9UKF6). Component of the cleavage and polyadenylation specificity factor (CPSF) complex that plays a key role in pre-mRNA 3’-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. It is a common-essential gene (DepMap: required in 99.3% of cancer cell lines).

This gene encodes a member of the metallo-beta-lactamase family. The encoded protein is a 73kDa subunit of the cleavage and polyadenylation specificity factor and functions as an endonuclease that recognizes the pre-mRNA 3’-cleavage site AAUAAA prior to polyadenylation. It also cleaves after the pre-mRNA sequence ACCCA during histone 3’-end pre-mRNA processing.

Source: NCBI Gene 51692 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): neurodevelopmental disorder with microcephaly, hypotonia, nystagmus, and seizures (Moderate, GenCC)
  • Clinical variants (ClinVar): 91 total — 1 pathogenic, 1 likely-pathogenic
  • Phenotypes (HPO): 29
  • Druggable target: yes
  • Cancer dependency (DepMap): dependent in 99.3% of screened cell lines (common-essential)
  • MANE Select transcript: NM_016207

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:2326
Approved symbolCPSF3
Namecleavage and polyadenylation specific factor 3
Location2p25.1
Locus typegene with protein product
StatusApproved
AliasesCPSF-73, CPSF73, YSH1
Ensembl geneENSG00000119203
Ensembl biotypeprotein_coding
OMIM606029
Entrez51692

Gene structure

Transcript identifiers

Ensembl transcripts: 11 — 9 protein_coding, 1 protein_coding_CDS_not_defined, 1 retained_intron

ENST00000238112, ENST00000460593, ENST00000475482, ENST00000489403, ENST00000489629, ENST00000882813, ENST00000882814, ENST00000921418, ENST00000921419, ENST00000921420, ENST00000947880

RefSeq mRNA: 5 — MANE Select: NM_016207 NM_001321833, NM_001321834, NM_001321835, NM_001321836, NM_016207

CCDS: CCDS1664, CCDS82417

Canonical transcript exons

ENST00000238112 — 18 exons

ExonStartEnd
ENSE0000080464094418189441976
ENSE0000080464194404919440666
ENSE0000080464394362119436361
ENSE0000080464494338719433960
ENSE0000100121694677079467776
ENSE0000100121994307529430880
ENSE0000100122194529139453021
ENSE0000100122294556599455757
ENSE0000100122394299239430020
ENSE0000100122494569339457027
ENSE0000183069694236549423823
ENSE0000185379794729169473101
ENSE0000357290694595319459618
ENSE0000362466594287659428828
ENSE0000363991394435159443661
ENSE0000364625094481989448350
ENSE0000365004994713439471439
ENSE0000378797994325119432688

Expression profiles

Bgee: expression breadth ubiquitous, 256 present calls, max score 94.76.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 34.0206 / max 494.4929, expressed in 1813 samples.

FANTOM5 promoters (5 alternative TSS)

Promoter IDTPM avgSamples expressed
1875926.62311808
187587.37131714
187600.01103
187620.00944
187610.00583

Top tissues by expression

258 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
ganglionic eminenceUBERON:000402394.76gold quality
embryoUBERON:000092294.75gold quality
ventricular zoneUBERON:000305394.70gold quality
left testisUBERON:000453393.62gold quality
right testisUBERON:000453493.50gold quality
testisUBERON:000047393.30gold quality
endothelial cellCL:000011593.04gold quality
tendon of biceps brachiiUBERON:000818892.28gold quality
adrenal tissueUBERON:001830392.02gold quality
spermCL:000001991.61gold quality
monocyteCL:000057691.52gold quality
stromal cell of endometriumCL:000225591.44gold quality
leukocyteCL:000073891.34gold quality
islet of LangerhansUBERON:000000691.19gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047390.62gold quality
tendonUBERON:000004390.29gold quality
cortical plateUBERON:000534390.03gold quality
epithelial cell of pancreasCL:000008389.82gold quality
calcaneal tendonUBERON:000370189.80gold quality
trabecular bone tissueUBERON:000248389.76gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099189.57gold quality
medial globus pallidusUBERON:000247789.57gold quality
bone marrowUBERON:000237189.20gold quality
rectumUBERON:000105289.18gold quality
right adrenal gland cortexUBERON:003582789.18gold quality
right adrenal glandUBERON:000123389.16gold quality
gingival epitheliumUBERON:000194988.95gold quality
smooth muscle tissueUBERON:000113588.91gold quality
vermiform appendixUBERON:000115488.83gold quality
visceral pleuraUBERON:000240188.81gold quality

Single-cell (SCXA)

Detected in 3 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes6.34
E-GEOD-124858no456.82
E-CURD-112no4.01

Regulation

Is transcription factor: no

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 99.3% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 18)

  • up-regulated by HIV-1 TAT protein; this could represent a novel mechanism by which HIV increases mRNA processing, causing an increase in both cell and viral gene expression. (PMID:15194760)
  • crystal structures of human CPSF-73 at 2.1 A resolution, complexed with zinc ions and a sulphate that might mimic the phosphate group of the substrate; CPSF-73 is the pre-mRNA 3’-end-processing endonuclease (PMID:17128255)
  • this study identifies CPSF-73 as an important regulatory protein that represses the basal transcriptional activity of the HIV-1 LTR promoter (PMID:17669424)
  • Conserved residues in CPSF-73 and CPSF2 are required for assembly of the endonuclease activity that cleaves histone pre-mRNAs. (PMID:18688255)
  • CSR1 appears to induce cell death through a novel mechanism by hijacking a critical RNA processing enzyme CPSF3. (PMID:18806823)
  • Degradation of the downstream cleavage product (DCP) does not depend on the Xrn2 5’ exonuclease, suggesting that CPSF-73 degrades the DCP both in vitro and in vivo. (PMID:18955505)
  • The tumor suppressor Cdc73 functionally associates with CPSF and CstF 3’ mRNA processing factors. (PMID:19136632)
  • The data support a model where Star-PAP binds to the pre-mRNA, recruits the CPSF complex to the 3’-end of pre-mRNA and then defines cleavage by CPSF 73 and subsequent polyadenylation of its target mRNAs. (PMID:21102410)
  • the molecular mechanism that recruits the CPSF73 endonuclease to histone pre-mRNAs (PMID:23071092)
  • CASC9 is a promising prognostic predictor for patients with CRC and the CASC9-CPSF3-TGFbeta2 axis is a potential therapeutic target for CRC treatment. (PMID:31186036)
  • CPSF3 represents a synthetic lethal node in a subset of acute myeloid leukemia and Ewing’s sarcoma cancer cell lines (PMID:31819276)
  • Cleavage and polyadenylation-specific factor 3 induces cell cycle arrest via PI3K/Akt/GSK-3beta signaling pathways and predicts a negative prognosis in hepatocellular carcinoma. (PMID:33666519)
  • A real-time fluorescence assay for CPSF73, the nuclease for pre-mRNA 3’-end processing. (PMID:34230059)
  • Population-level deficit of homozygosity unveils CPSF3 as an intellectual disability syndrome gene. (PMID:35121750)
  • An examination of the metal ion content in the active sites of human endonucleases CPSF73 and INTS11. (PMID:36822327)
  • Therapeutic targeting of CPSF3-dependent transcriptional termination in ovarian cancer. (PMID:37992178)
  • CPSF3 inhibition blocks pancreatic cancer cell proliferation through disruption of core histone mRNA processing. (PMID:38191171)
  • CPSF3 regulates alternative polyadenylation of CNIH2 to promote esophageal squamous cell carcinoma progression. (PMID:38718887)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriocpsf3ENSDARG00000027649
mus_musculusCpsf3ENSMUSG00000054309
rattus_norvegicusCpsf3ENSRNOG00000052418
drosophila_melanogasterCpsf73FBGN0261065
caenorhabditis_elegansWBGENE00013460

Paralogs (3): INTS9 (ENSG00000104299), INTS11 (ENSG00000127054), CPSF2 (ENSG00000165934)

Protein

Protein identifiers

Cleavage and polyadenylation specificity factor subunit 3Q9UKF6 (reviewed: Q9UKF6)

Alternative names: Cleavage and polyadenylation specificity factor 73 kDa subunit, mRNA 3’-end-processing endonuclease CPSF-73

All UniProt accessions (3): Q9UKF6, C9JZH6, G5E9W3

UniProt curated annotations — full annotation on UniProt →

Function. Component of the cleavage and polyadenylation specificity factor (CPSF) complex that plays a key role in pre-mRNA 3’-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. Has endonuclease activity, and functions as an mRNA 3’-end-processing endonuclease. Also involved in the histone 3’-end pre-mRNA processing. U7 snRNP-dependent protein that induces both the 3’-endoribonucleolytic cleavage of histone pre-mRNAs and acts as a 5’ to 3’ exonuclease for degrading the subsequent downstream cleavage product (DCP) of mature histone mRNAs. Cleavage occurs after the 5’-ACCCA-3’ sequence in the histone pre-mRNA leaving a 3’hydroxyl group on the upstream fragment containing the stem loop (SL) and 5’ phosphate on the downstream cleavage product (DCP) starting with CU nucleotides. The U7-dependent 5’ to 3’ exonuclease activity is processive and degrades the DCP RNA substrate even after complete removal of the U7-binding site. Binds to the downstream cleavage product (DCP) of histone pre-mRNAs and the cleaved DCP RNA substrate in a U7 snRNP dependent manner. Required for entering/progressing through S-phase of the cell cycle. Required for the selective processing of microRNAs (miRNAs) during embryonic stem cell differentiation via its interaction with ISY1. Required for the biogenesis of all miRNAs from the pri-miR-17-92 primary transcript except miR-92a. Only required for the biogenesis of miR-290 and miR-96 from the pri-miR-290-295 and pri-miR-96-183 primary transcripts, respectively.

Subunit / interactions. Component of the cleavage and polyadenylation specificity factor (CPSF) complex, composed of CPSF1, CPSF2, CPSF3, CPSF4 and FIP1L1. Interacts with CPSF2, CSTF2 and SYMPK. Interacts with TUT1; the interaction is direct and mediates the recruitment of the CPSF complex on the 3’UTR of pre-mRNAs. Interacts with WDR33. Interacts with ZC3H3.

Subcellular location. Nucleus.

Post-translational modifications. Sumoylated on Lys-462, Lys-465 and Lys-545, preferentially by SUMO3.

Disease relevance. Neurodevelopmental disorder with microcephaly, hypotonia, nystagmus, and seizures (NEDMHS) [MIM:619876] An autosomal recessive disorder characterized by global developmental delay and impaired intellectual development apparent from infancy. Affected individuals have hypotonia, poor or absent motor skills, feeding difficulties with poor overall growth, microcephaly, mild dysmorphic features, and early-onset seizures. Additional variable features include nystagmus, cortical blindness, and spasticity. The disease is caused by variants affecting the gene represented in this entry.

Cofactor. Binds 2 Zn(2+) ions per subunit.

Similarity. Belongs to the metallo-beta-lactamase superfamily. RNA-metabolizing metallo-beta-lactamase-like family. CPSF3 subfamily.

RefSeq proteins (5): NP_001308762, NP_001308763, NP_001308764, NP_001308765, NP_057291* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001279Metallo-B-lactamasDomain
IPR011108RMMBLDomain
IPR021718CPSF73-100_CDomain
IPR022712Beta_CaspDomain
IPR036866RibonucZ/Hydroxyglut_hydroHomologous_superfamily
IPR050698MBLFamily

Pfam: PF00753, PF07521, PF10996, PF11718

UniProt features (85 total): strand 28, helix 21, mutagenesis site 9, binding site 8, turn 6, sequence variant 4, modified residue 3, cross-link 3, initiator methionine 1, chain 1, active site 1

Structure

Experimental structures (PDB)

10 structures.

PDBMethodResolution (Å)
8T1QX-RAY DIFFRACTION1.7
2I7TX-RAY DIFFRACTION2.1
2I7VX-RAY DIFFRACTION2.1
8T1RX-RAY DIFFRACTION2.2
6M8QX-RAY DIFFRACTION2.49
9NH5ELECTRON MICROSCOPY2.82
9NH6ELECTRON MICROSCOPY2.82
9NGOELECTRON MICROSCOPY3.01
6V4XELECTRON MICROSCOPY3.2
9NB1ELECTRON MICROSCOPY3.85

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9UKF6-F190.410.65

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 396 (proton donor)

Ligand- & substrate-binding residues (8): 418; 71; 73; 75; 76; 158; 179; 179

Post-translational modifications (6): 2, 659, 681, 462, 465, 545

Mutagenesis-validated functional residues (9):

PositionPhenotype
73inhibits histone 3’-end processing.
75–76loss of histone 3’-end processing.
75inhibits histone 3’-end processing.
76inhibits histone 3’-end processing.
334does not inhibit histone 3’-end processing.
396inhibits histone 3’-end processing.
462reduced sumoylation; when associated with r-465 and r-545.
465reduced sumoylation; when associated with r-462 and r-545.
545reduced sumoylation; when associated with r-462 and r-465.

Function

Pathways and Gene Ontology

Reactome pathways

7 pathways

IDPathway
R-HSA-159231Transport of Mature mRNA Derived from an Intronless Transcript
R-HSA-72187mRNA 3’-end processing
R-HSA-73856RNA Polymerase II Transcription Termination
R-HSA-77595Processing of Intronless Pre-mRNAs
R-HSA-9770562mRNA Polyadenylation
R-HSA-9918481Dengue Virus-Host Interactions
R-HSA-72203Processing of Capped Intron-Containing Pre-mRNA

MSigDB gene sets: 223 (showing top): RODRIGUES_THYROID_CARCINOMA_ANAPLASTIC_UP, chr2p25, GOMF_ENDONUCLEASE_ACTIVITY, GOMF_RNA_NUCLEASE_ACTIVITY, RODRIGUES_THYROID_CARCINOMA_POORLY_DIFFERENTIATED_UP, GOMF_NUCLEASE_ACTIVITY, STARK_PREFRONTAL_CORTEX_22Q11_DELETION_DN, GOBP_CELL_CYCLE_PHASE_TRANSITION, GOBP_POSITIVE_REGULATION_OF_MITOTIC_CELL_CYCLE, GOBP_MACROMOLECULE_CATABOLIC_PROCESS, GOBP_POSITIVE_REGULATION_OF_G1_S_TRANSITION_OF_MITOTIC_CELL_CYCLE, GOMF_RNA_ENDONUCLEASE_ACTIVITY, REACTOME_MRNA_3_END_PROCESSING, REACTOME_PROCESSING_OF_CAPPED_INTRON_CONTAINING_PRE_MRNA, GOBP_REGULATION_OF_CELL_CYCLE

GO Biological Process (6): mRNA 3’-end processing by stem-loop binding and cleavage (GO:0006398), mRNA 3’-end processing (GO:0031124), co-transcriptional mRNA 3’-end processing, cleavage and polyadenylation pathway (GO:0180010), positive regulation of G1/S transition of mitotic cell cycle (GO:1900087), RNA processing (GO:0006396), mRNA processing (GO:0006397)

GO Molecular Function (8): RNA binding (GO:0003723), RNA endonuclease activity (GO:0004521), 5’-3’ RNA exonuclease activity (GO:0004534), metal ion binding (GO:0046872), nuclease activity (GO:0004518), endonuclease activity (GO:0004519), protein binding (GO:0005515), hydrolase activity (GO:0016787)

GO Cellular Component (4): nucleoplasm (GO:0005654), mRNA cleavage and polyadenylation specificity factor complex (GO:0005847), ribonucleoprotein complex (GO:1990904), nucleus (GO:0005634)

Reactome top-level categories

Rollup of top-7 pathways:

CategoryPathways
Transport of Mature mRNAs Derived from Intronless Transcripts1
Processing of Capped Intron-Containing Pre-mRNA1
RNA Polymerase II Transcription1
Processing of Capped Intronless Pre-mRNA1
mRNA 3’-end processing1
Dengue Virus Infection1
Metabolism of RNA1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
mRNA 3’-end processing2
histone mRNA metabolic process1
mRNA processing1
RNA 3’-end processing1
co-transcriptional RNA 3’-end processing, cleavage and polyadenylation pathway1
G1/S transition of mitotic cell cycle1
positive regulation of mitotic cell cycle phase transition1
positive regulation of cell cycle G1/S phase transition1
regulation of G1/S transition of mitotic cell cycle1
gene expression1
RNA biosynthetic process1
primary metabolic process1
RNA processing1
mRNA metabolic process1
nucleic acid binding1
endonuclease activity1
RNA nuclease activity1
5’-3’ exonuclease activity1
RNA exonuclease activity, producing 5’-phosphomonoesters1
cation binding1
catalytic activity, acting on a nucleic acid1
nuclease activity1
binding1
catalytic activity1
nuclear lumen1
cellular anatomical structure1
mRNA cleavage factor complex1
protein-containing complex1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

2928 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CPSF3CPSF1Q10570998
CPSF3CPSF4O95639997
CPSF3SYMPKQ92797997
CPSF3WDR33Q9C0J8997
CPSF3CPSF2Q9P2I0993
CPSF3FIP1L1Q6UN15979
CPSF3CSTF2P33240974
CPSF3SCARA3Q6AZY7966
CPSF3CSTF3Q12996944
CPSF3DCLRE1AQ6PJP8940
CPSF3PCF11O94913939
CPSF3LSM11P83369911
CPSF3SSU72Q9NP77885
CPSF3CSTF1Q05048875
CPSF3PAPOLGQ9BWT3854

IntAct

94 interactions, top by confidence:

ABTypeScore
MED4MED19psi-mi:“MI:2364”(proximity)0.900
CPSF2SYMPKpsi-mi:“MI:0915”(physical association)0.850
SYMPKCPSF2psi-mi:“MI:0914”(association)0.850
CPSF2SYMPKpsi-mi:“MI:0914”(association)0.850
LMO1ZBTB43psi-mi:“MI:0914”(association)0.830
CFTRESYT2psi-mi:“MI:2364”(proximity)0.710
CPSF3CPSF4psi-mi:“MI:0914”(association)0.640
CDC73CSTF2psi-mi:“MI:0914”(association)0.580
TUT1PIP5K1Apsi-mi:“MI:0914”(association)0.560
HTTCPSF3psi-mi:“MI:0915”(physical association)0.560
CPSF1CPSF4psi-mi:“MI:0915”(physical association)0.560
SYMPKCPSF4psi-mi:“MI:0914”(association)0.530
KSR2POLR3Apsi-mi:“MI:0914”(association)0.530
TUT1CSTF2psi-mi:“MI:0914”(association)0.530
UBE3DSTIP1psi-mi:“MI:0914”(association)0.530
NCBP3SAP18psi-mi:“MI:0914”(association)0.530
WDR33CPSF4psi-mi:“MI:0914”(association)0.530
CPSF4LCCNHpsi-mi:“MI:0914”(association)0.530
CPSF3CSTF2psi-mi:“MI:0914”(association)0.530
CDC73CPSF4psi-mi:“MI:0914”(association)0.500
CPSF2CSTF2psi-mi:“MI:0914”(association)0.500
CPSF6DDX39Apsi-mi:“MI:0914”(association)0.480

BioGRID (218): CPSF3 (Affinity Capture-RNA), CPSF3 (Affinity Capture-MS), CPSF3 (Two-hybrid), CPSF2 (Co-fractionation), CPSF3 (Co-fractionation), CPSF3 (Co-fractionation), CPSF3 (Co-fractionation), CPSF3 (Co-fractionation), DDX23 (Co-fractionation), WDR33 (Co-fractionation), CPSF3 (Affinity Capture-MS), FIP1L1 (Co-fractionation), FIP1L1 (Affinity Capture-MS), CPSF3 (Affinity Capture-Western), CPSF3 (Proximity Label-MS)

ESM2 similar proteins: A0A1L8F5J9, A0JN27, F1LTR1, F1NBL0, O15294, P35438, P35439, P56558, P61201, P61202, P61203, P61599, P61600, P63138, P79101, P81436, Q03555, Q05586, Q13888, Q15303, Q27HV0, Q2PFM2, Q2TBV5, Q4L208, Q58ED9, Q5R1P0, Q5SP67, Q5ZJ75, Q61527, Q62956, Q6IQT4, Q6IR75, Q6P1K8, Q6P632, Q7ZXR3, Q8BUV3, Q8C6G8, Q8CGY8, Q8R4D1, Q91854

Diamond homologs: A0QVT2, B7VIP5, B9XZG7, C4K7Z9, M4MR97, O31760, O50112, P0CM88, P0CM89, P47385, P54122, P54123, P56185, P75497, P79101, P9WGZ8, P9WGZ9, Q2FHG3, Q2FHZ1, Q2FZ19, Q2FZG9, Q2YX35, Q2YXP1, Q45493, Q49WL3, Q49X63, Q4L5A3, Q4L5X8, Q5HGF6, Q5HGZ5, Q5HPR6, Q5HQ80, Q5JH24, Q6G9T8, Q6GAC5, Q6GHG0, Q6GHZ6, Q72JJ7, Q74ZC0, Q7A115

SIGNOR signaling

3 interactions.

AEffectBMechanism
CPSF3“form complex”“CPSF complex”binding
CPSF3up-regulatesmRNA_polyadenylation

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 92 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Processing of Intronless Pre-mRNAs972.4×1e-13
RNA Polymerase II Transcription Termination1237.1×3e-14
mRNA 3’-end processing1336.0×4e-15
Transport of Mature mRNA Derived from an Intronless Transcript726.8×3e-07
mRNA Polyadenylation1822.3×2e-17
Processing of Capped Intron-Containing Pre-mRNA1112.7×5e-08
Dengue Virus-Host Interactions1912.2×5e-14
mRNA Splicing710.8×1e-04

GO biological processes:

GO termPartnersFoldFDR
negative regulation of mRNA splicing, via spliceosome546.7×7e-06
mRNA 3’-end processing641.1×1e-06
regulation of alternative mRNA splicing, via spliceosome617.9×9e-05
mRNA processing1413.4×1e-09
mRNA splicing, via spliceosome1112.3×4e-07
RNA splicing77.5×3e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

91 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic1
Uncertain significance50
Likely benign9
Benign0

Top pathogenic / likely-pathogenic (2)

Variant IDHGVSClassification
1686866NM_016207.4(CPSF3):c.1403G>A (p.Gly468Glu)Pathogenic
3899256NM_016207.4(CPSF3):c.761-1G>TLikely pathogenic

SpliceAI

2802 predictions. Top by Δscore:

VariantEffectΔscore
2:9423655:G:GTdonor_gain1.0000
2:9423845:G:Tdonor_gain1.0000
2:9428752:T:TAacceptor_gain1.0000
2:9428756:T:Gacceptor_gain1.0000
2:9428760:TACA:Tacceptor_loss1.0000
2:9428762:CAGT:Cacceptor_loss1.0000
2:9428763:A:AGacceptor_gain1.0000
2:9428763:AGT:Aacceptor_gain1.0000
2:9428763:AGTG:Aacceptor_gain1.0000
2:9428764:G:GAacceptor_gain1.0000
2:9428764:GT:Gacceptor_gain1.0000
2:9428764:GTG:Gacceptor_gain1.0000
2:9428764:GTGG:Gacceptor_gain1.0000
2:9428764:GTGGA:Gacceptor_gain1.0000
2:9428815:G:GTdonor_gain1.0000
2:9428825:AATGG:Adonor_loss1.0000
2:9428826:ATGG:Adonor_loss1.0000
2:9428827:TGGT:Tdonor_loss1.0000
2:9428829:G:GGdonor_gain1.0000
2:9428829:GT:Gdonor_loss1.0000
2:9428830:T:Adonor_loss1.0000
2:9429917:TTTCA:Tacceptor_loss1.0000
2:9429918:TTCAG:Tacceptor_loss1.0000
2:9429921:A:AGacceptor_gain1.0000
2:9429921:AGCTC:Aacceptor_gain1.0000
2:9429922:G:GAacceptor_gain1.0000
2:9429922:GC:Gacceptor_gain1.0000
2:9429922:GCT:Gacceptor_gain1.0000
2:9429922:GCTC:Gacceptor_gain1.0000
2:9429922:GCTCG:Gacceptor_gain1.0000

AlphaMissense

4561 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:9428766:G:AG18R1.000
2:9428766:G:CG18R1.000
2:9428767:G:AG18E1.000
2:9428772:G:TG20W1.000
2:9428773:G:AG20E1.000
2:9428784:G:AG24R1.000
2:9428784:G:CG24R1.000
2:9428785:G:AG24E1.000
2:9428785:G:TG24V1.000
2:9428788:G:CR25T1.000
2:9428788:G:TR25I1.000
2:9428789:A:CR25S1.000
2:9428789:A:TR25S1.000
2:9428794:G:AC27Y1.000
2:9428795:T:GC27W1.000
2:9429924:T:CL39P1.000
2:9429926:G:CD40H1.000
2:9429932:G:AG42R1.000
2:9429932:G:CG42R1.000
2:9429932:G:TG42W1.000
2:9429933:G:AG42E1.000
2:9429933:G:TG42V1.000
2:9429938:C:GH44D1.000
2:9429939:A:GH44R1.000
2:9429966:T:CL53P1.000
2:9429969:C:AP54H1.000
2:9429969:C:GP54R1.000
2:9430005:T:CL66P1.000
2:9430008:T:CL67P1.000
2:9430016:A:CS70R1.000

dbSNP variants (sampled 300 via entrez): RS1000032243 (2:9429088 C>A), RS1000075612 (2:9423385 G>A,T), RS1000140703 (2:9466366 A>C,G), RS1000151459 (2:9469174 G>A,T), RS1000281044 (2:9436425 G>A,C,T), RS1000305242 (2:9472393 T>C), RS1000558547 (2:9453452 G>C), RS1000567529 (2:9423399 T>C), RS1000791481 (2:9447706 T>G), RS1000841644 (2:9464129 C>T), RS1000864601 (2:9442014 C>G,T), RS1000894420 (2:9434688 A>C,G), RS1000936440 (2:9441602 G>A,C,T), RS1000952119 (2:9464508 G>C), RS1001012105 (2:9443651 C>T)

Disease associations

OMIM: gene MIM:606029 | disease phenotypes: MIM:618862, MIM:619876

GenCC curated gene-disease

DiseaseClassificationInheritance
neurodevelopmental disorder with microcephaly, hypotonia, nystagmus, and seizuresModerateAutosomal recessive

Mondo (2): neurodevelopmental disorder with hypotonia, microcephaly, and seizures (MONDO:0030025), neurodevelopmental disorder with microcephaly, hypotonia, nystagmus, and seizures (MONDO:0859250)

Orphanet (0):

HPO phenotypes

29 total (29 of 29 shown, HPO-id order):

HPOTerm
HP:0000007Autosomal recessive inheritance
HP:0000158Macroglossia
HP:0000218High palate
HP:0000252Microcephaly
HP:0000276Long face
HP:0000463Anteverted nares
HP:0000486Strabismus
HP:0000639Nystagmus
HP:0000648Optic atrophy
HP:0001249Intellectual disability
HP:0001250Seizure
HP:0001252Hypotonia
HP:0001257Spasticity
HP:0001263Global developmental delay
HP:0001270Motor delay
HP:0001320Cerebellar vermis hypoplasia
HP:0001508Failure to thrive
HP:0001558Decreased fetal movement
HP:0002020Gastroesophageal reflux
HP:0002059Cerebral atrophy
HP:0002181Cerebral edema
HP:0002719Recurrent infections
HP:0003593Infantile onset
HP:0007178Motor polyneuropathy
HP:0007371Corpus callosum atrophy
HP:0011461Fetal onset
HP:0011968Feeding difficulties
HP:0100021Cerebral palsy
HP:0100704Cerebral visual impairment

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL5724779 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

37 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Adecreases expression, increases expression2
sodium arseniteaffects cotreatment, decreases expression2
Air Pollutantsaffects expression, increases abundance, decreases expression2
Tretinoinaffects cotreatment, decreases expression2
Aflatoxin B1decreases methylation, increases methylation2
Cadmium Chloridedecreases expression, increases abundance, increases expression2
aristolochic acid Idecreases expression1
FR900359affects phosphorylation1
2,4,6-tribromophenoldecreases expression1
triphenyl phosphateaffects expression1
decabromobiphenyl etherdecreases expression1
arseniteincreases reaction, affects binding1
tetrabromobisphenol Adecreases expression1
potassium chromate(VI)affects cotreatment, decreases expression1
epigallocatechin gallateaffects cotreatment, decreases expression1
di-n-butylphosphoric acidaffects expression1
pentabrominated diphenyl ether 100increases expression1
jinfukangdecreases expression1
NSC 689534decreases expression, affects binding1
Acetaminophendecreases expression1
Air Pollutants, Occupationaldecreases expression1
Atrazinedecreases expression1
Benzo(a)pyreneaffects methylation1
Benztropineaffects cotreatment, decreases expression1
Cadmiumincreases expression, increases abundance1
Caffeineincreases phosphorylation1
Copperaffects binding, decreases expression1
Coumestrolincreases expression1
Cuprizoneaffects cotreatment, decreases expression1
Enzyme Inhibitorsdecreases activity, increases O-linked glycosylation1

ChEMBL screening assays

6 unique, capped per target: 6 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5697611BindingInhibition of CPSF3 (unknown origin) assessed as fold change at 10 uM incubated for 1 hr by colloidal coomassie staining based LC-MS/MS analysisInhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia. — Nature

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.