CPSF1

gene
On this page

Also known as CPSF160

Summary

CPSF1 (cleavage and polyadenylation specific factor 1, HGNC:2324) is a protein-coding gene on chromosome 8q24.3, encoding Cleavage and polyadenylation specificity factor subunit 1 (Q10570). Component of the cleavage and polyadenylation specificity factor (CPSF) complex that plays a key role in pre-mRNA 3’-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. It is a common-essential gene (DepMap: required in 98.8% of cancer cell lines).

Cleavage and polyadenylation specificity factor (CPSF) is a multisubunit complex that plays a central role in 3-prime processing of pre-mRNAs. CPSF recognizes the AAUAAA signal in the pre-mRNA and interacts with other proteins to facilitate both RNA cleavage and poly(A) synthesis. CPSF1 is the largest subunit of the CPSF complex (Murthy and Manley, 1995 [PubMed 7590244]).

Source: NCBI Gene 29894 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): myopia 27 (Strong, GenCC)
  • GWAS associations: 1
  • Clinical variants (ClinVar): 318 total — 5 pathogenic, 3 likely-pathogenic
  • Phenotypes (HPO): 3
  • Druggable target: yes — 1 molecules with ChEMBL bioactivity
  • Cancer dependency (DepMap): dependent in 98.8% of screened cell lines (common-essential)
  • MANE Select transcript: NM_013291

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:2324
Approved symbolCPSF1
Namecleavage and polyadenylation specific factor 1
Location8q24.3
Locus typegene with protein product
StatusApproved
AliasesCPSF160
Ensembl geneENSG00000071894
Ensembl biotypeprotein_coding
OMIM606027
Entrez29894

Gene structure

Transcript identifiers

Ensembl transcripts: 31 — 21 protein_coding, 9 retained_intron, 1 nonsense_mediated_decay

ENST00000526271, ENST00000527827, ENST00000527916, ENST00000529288, ENST00000531042, ENST00000531480, ENST00000531727, ENST00000532560, ENST00000532725, ENST00000532935, ENST00000533492, ENST00000616140, ENST00000620219, ENST00000622776, ENST00000886809, ENST00000886810, ENST00000886811, ENST00000886812, ENST00000886813, ENST00000886814, ENST00000886815, ENST00000886816, ENST00000886817, ENST00000886818, ENST00000886819, ENST00000913993, ENST00000913994, ENST00000913995, ENST00000913996, ENST00000913997, ENST00000913998

RefSeq mRNA: 1 — MANE Select: NM_013291 NM_013291

CCDS: CCDS34966

Canonical transcript exons

ENST00000616140 — 38 exons

ExonStartEnd
ENSE00003482940144401211144401291
ENSE00003614369144401430144401563
ENSE00003712245144398779144398868
ENSE00003713910144400354144400493
ENSE00003716011144400671144400817
ENSE00003717346144394234144394300
ENSE00003718378144400924144401075
ENSE00003718996144409015144409172
ENSE00003720349144398958144399038
ENSE00003725856144409289144409335
ENSE00003727674144399992144400085
ENSE00003738057144394113144394160
ENSE00003744072144393452144393590
ENSE00003744148144393231144393365
ENSE00003748479144393667144393796
ENSE00003788623144393883144394038
ENSE00003889080144395098144395182
ENSE00003889229144398525144398638
ENSE00003889326144399781144399868
ENSE00003889354144397954144398132
ENSE00003889632144395265144395355
ENSE00003889855144396348144396500
ENSE00003890292144397743144397879
ENSE00003890322144394882144395023
ENSE00003890453144401646144401673
ENSE00003890597144397207144397413
ENSE00003890813144396598144396741
ENSE00003890831144399276144399373
ENSE00003891118144396840144396929
ENSE00003891484144399128144399202
ENSE00003892421144395435144395551
ENSE00003893798144399452144399503
ENSE00003893840144394379144394555
ENSE00003894660144398302144398443
ENSE00003894668144399588144399710
ENSE00003895112144397487144397661
ENSE00003895529144400166144400276
ENSE00003895575144394644144394796

Expression profiles

Bgee: expression breadth ubiquitous, 134 present calls, max score 98.97.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 30.1606 / max 162.1670, expressed in 1805 samples.

FANTOM5 promoters (7 alternative TSS)

Promoter IDTPM avgSamples expressed
9566424.13461803
956655.04291540
956630.7342448
956660.2036102
956600.02177
956620.02039
956610.00343

Top tissues by expression

134 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right testisUBERON:000453498.97gold quality
left testisUBERON:000453398.90gold quality
right hemisphere of cerebellumUBERON:001489098.51gold quality
right uterine tubeUBERON:000130298.49gold quality
cerebellar hemisphereUBERON:000224598.38gold quality
cerebellumUBERON:000203798.36gold quality
cerebellar cortexUBERON:000212998.36gold quality
granulocyteCL:000009498.35gold quality
pituitary glandUBERON:000000798.07gold quality
lower esophagus mucosaUBERON:003583498.00gold quality
adenohypophysisUBERON:000219697.99gold quality
duodenumUBERON:000211497.92gold quality
testisUBERON:000047397.82gold quality
mucosa of transverse colonUBERON:000499197.79gold quality
spleenUBERON:000210697.77gold quality
small intestine Peyer’s patchUBERON:000345497.75gold quality
small intestineUBERON:000210897.51gold quality
right ovaryUBERON:000211897.44gold quality
body of pancreasUBERON:000115097.35gold quality
right lobe of thyroid glandUBERON:000111997.31gold quality
metanephros cortexUBERON:001053397.31gold quality
body of stomachUBERON:000116197.29gold quality
prostate glandUBERON:000236797.26gold quality
transverse colonUBERON:000115797.11gold quality
primary visual cortexUBERON:000243697.08gold quality
left lobe of thyroid glandUBERON:000112097.00gold quality
left ovaryUBERON:000211996.98gold quality
apex of heartUBERON:000209896.91gold quality
body of uterusUBERON:000985396.88gold quality
tibial nerveUBERON:000132396.78gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes4.85
E-GEOD-110499no51.92

Regulation

Is transcription factor: no

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 98.8% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 6)

  • CPSF1 was identified among protein-binding candidates. A consensus polyadenylation signal AAUAAA is present in intron 6 of IL7 receptor directly downstream from the 5’ splice site. Mutations to this site and CPSF1 knockdown increased exon 6 inclusion. (PMID:23151878)
  • The authors define the molecular architecture of the core human CPSF complex comprising CPSF160, WDR33, CPSF30 and Fip1 and identify specific domains involved in inter-subunit interactions. Together, these results shed light on the function of CPSF in mediating polyA signal-dependent RNA cleavage and polyadenylation. (PMID:29274231)
  • Knockdown of CPSF inhibits the proliferation, migration and invasion of ovarian carcinoma cells. (PMID:29358555)
  • cryo-electron microscopic analysis of a core CPSF module bound to the polyadenylation signal hexamer motif (PMID:29358758)
  • Heterozygous loss-of-function mutations in CPSF1 are associated with early-onset high myopia and that CPSF1 may play an important role in the development of retinal ganglion cell axon projection. (PMID:30689892)
  • Epidermal progenitors suppress GRHL3-mediated differentiation through intronic polyadenylation promoted by CPSF-HNRNPA3 collaboration. (PMID:33469008)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriocpsf1ENSDARG00000034178
mus_musculusCpsf1ENSMUSG00000034022
rattus_norvegicusCpsf1ENSRNOG00000030705
drosophila_melanogasterCpsf160FBGN0024698
caenorhabditis_elegansWBGENE00022301

Paralogs (2): DDB1 (ENSG00000167986), SF3B3 (ENSG00000189091)

Protein

Protein identifiers

Cleavage and polyadenylation specificity factor subunit 1Q10570 (reviewed: Q10570)

Alternative names: Cleavage and polyadenylation specificity factor 160 kDa subunit

All UniProt accessions (4): Q10570, A0A087WTV4, A0A087X101, E9PIM1

UniProt curated annotations — full annotation on UniProt →

Function. Component of the cleavage and polyadenylation specificity factor (CPSF) complex that plays a key role in pre-mRNA 3’-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. This subunit is involved in the RNA recognition step of the polyadenylation reaction. May play a role in eye morphogenesis and the development of retinal ganglion cell projections to the midbrain.

Subunit / interactions. Component of the cleavage and polyadenylation specificity factor (CPSF) complex, composed of CPSF1, CPSF2, CPSF3, CPSF4 and FIP1L1. Found in a complex with CPSF1, FIP1L1 and PAPOLA. Interacts with FIP1L1, TENT2/GLD2 and SRRM1. Interacts with TUT1; the interaction is direct and mediates the recruitment of the CPSF complex on the 3’UTR of selected pre-mRNAs.

Subcellular location. Nucleus. Nucleoplasm.

Tissue specificity. Widely expressed, with high expression in the retina.

Post-translational modifications. The N-terminus is blocked.

Disease relevance. Myopia 27, autosomal dominant (MYP27) [MIM:618827] A form of myopia, a refractive error of the eye, in which parallel rays from a distant object come to focus in front of the retina, vision being better for near objects than for far. MYP27 patients are affected by early-onset high myopia with increased axial lengths. Fundus changes include optic nerve head crescent and tigroid appearance of the posterior retina. The disease is caused by variants affecting the gene represented in this entry.

Similarity. Belongs to the CPSF1 family.

RefSeq proteins (1): NP_037423* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR004871RSE1/DDB1/CPSF1_CDomain
IPR015943WD40/YVTN_repeat-like_dom_sfHomologous_superfamily
IPR018846Beta-prop_RSE1/DDB1/CPSF1_1stDomain
IPR050358RSE1/DDB1/CFT1Family
IPR058543Beta-prop_RSE1/DDB1/CPSF1_2ndDomain

Pfam: PF03178, PF10433, PF23726

UniProt features (146 total): strand 103, turn 14, helix 14, region of interest 4, sequence variant 3, sequence conflict 2, compositionally biased region 2, modified residue 2, chain 1, short sequence motif 1

Structure

Experimental structures (PDB)

13 structures.

PDBMethodResolution (Å)
9T3XELECTRON MICROSCOPY2.1
6F9NX-RAY DIFFRACTION2.5
8E3IELECTRON MICROSCOPY2.53
9OXEELECTRON MICROSCOPY2.53
8E3QELECTRON MICROSCOPY2.68
8R8RELECTRON MICROSCOPY2.79
6URGELECTRON MICROSCOPY3
6FUWELECTRON MICROSCOPY3.07
9OXSELECTRON MICROSCOPY3.07
6BLYELECTRON MICROSCOPY3.36
6DNHELECTRON MICROSCOPY3.4
6UROELECTRON MICROSCOPY3.6
6BM0ELECTRON MICROSCOPY3.8

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q10570-F182.970.62

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (2): 756, 766

Function

Pathways and Gene Ontology

Reactome pathways

8 pathways

IDPathway
R-HSA-159231Transport of Mature mRNA Derived from an Intronless Transcript
R-HSA-6784531tRNA processing in the nucleus
R-HSA-72187mRNA 3’-end processing
R-HSA-73856RNA Polymerase II Transcription Termination
R-HSA-77595Processing of Intronless Pre-mRNAs
R-HSA-9770562mRNA Polyadenylation
R-HSA-9918481Dengue Virus-Host Interactions
R-HSA-72203Processing of Capped Intron-Containing Pre-mRNA

MSigDB gene sets: 133 (showing top): WANG_CLIM2_TARGETS_UP, ACEVEDO_NORMAL_TISSUE_ADJACENT_TO_LIVER_TUMOR_DN, BROWNE_HCMV_INFECTION_48HR_DN, ONKEN_UVEAL_MELANOMA_UP, BLALOCK_ALZHEIMERS_DISEASE_UP, REACTOME_MRNA_3_END_PROCESSING, REACTOME_PROCESSING_OF_CAPPED_INTRON_CONTAINING_PRE_MRNA, GOBP_MRNA_3_END_PROCESSING, BROWNE_HCMV_INFECTION_14HR_DN, LIAO_METASTASIS, DANG_BOUND_BY_MYC, DEURIG_T_CELL_PROLYMPHOCYTIC_LEUKEMIA_UP, SHEN_SMARCA2_TARGETS_DN, REACTOME_METABOLISM_OF_RNA, PARENT_MTOR_SIGNALING_UP

GO Biological Process (3): mRNA processing (GO:0006397), co-transcriptional RNA 3’-end processing, cleavage and polyadenylation pathway (GO:0180012), RNA 3’-end processing (GO:0031123)

GO Molecular Function (5): enzyme binding (GO:0019899), mRNA 3’-UTR AU-rich region binding (GO:0035925), nucleic acid binding (GO:0003676), RNA binding (GO:0003723), protein binding (GO:0005515)

GO Cellular Component (3): nucleus (GO:0005634), nucleoplasm (GO:0005654), mRNA cleavage and polyadenylation specificity factor complex (GO:0005847)

Reactome top-level categories

Rollup of top-8 pathways:

CategoryPathways
Transport of Mature mRNAs Derived from Intronless Transcripts1
tRNA processing1
Processing of Capped Intron-Containing Pre-mRNA1
RNA Polymerase II Transcription1
Processing of Capped Intronless Pre-mRNA1
mRNA 3’-end processing1
Dengue Virus Infection1
Metabolism of RNA1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA processing2
binding2
mRNA metabolic process1
RNA 3’-end processing1
protein binding1
mRNA 3’-UTR binding1
nucleic acid binding1
intracellular membrane-bounded organelle1
nuclear lumen1
cellular anatomical structure1
mRNA cleavage factor complex1

Protein interactions and networks

STRING

2282 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CPSF1CPSF4O95639999
CPSF1CPSF2Q9P2I0999
CPSF1CPSF3Q9UKF6998
CPSF1CSTF3Q12996998
CPSF1WDR33Q9C0J8998
CPSF1FIP1L1Q6UN15993
CPSF1PAPOLAP51003968
CPSF1SYMPKQ92797961
CPSF1CSTF1Q05048919
CPSF1CSTF2P33240919
CPSF1CPSF7Q8N684894
CPSF1NUDT21O43809859
CPSF1PAPOLGQ9BWT3828
CPSF1PAPOLBQ9NRJ5818
CPSF1CPSF6Q16630803

IntAct

132 interactions, top by confidence:

ABTypeScore
CPSF1CPSF4psi-mi:“MI:0915”(physical association)0.670
CPSF3CPSF4psi-mi:“MI:0914”(association)0.640
CDC73CSTF2psi-mi:“MI:0914”(association)0.580
CDC73CSTF2psi-mi:“MI:0915”(physical association)0.580
CPSF1RELpsi-mi:“MI:0915”(physical association)0.560
NS1PIK3R2psi-mi:“MI:0914”(association)0.560
CPSF1DDIT4Lpsi-mi:“MI:0915”(physical association)0.560
INCA1CPSF1psi-mi:“MI:0915”(physical association)0.560
CPSF1CPSF4psi-mi:“MI:0915”(physical association)0.560
SYMPKCPSF4psi-mi:“MI:0914”(association)0.530
NCBP3SAP18psi-mi:“MI:0914”(association)0.530
WDR33CPSF4psi-mi:“MI:0914”(association)0.530
NS1PIK3R2psi-mi:“MI:0914”(association)0.530
CDC73CPSF4psi-mi:“MI:0914”(association)0.500
CPSF1NPM1psi-mi:“MI:0915”(physical association)0.500
CPSF6DDX39Apsi-mi:“MI:0914”(association)0.480
DDX21MED19psi-mi:“MI:2364”(proximity)0.480
TUT1CPSF1psi-mi:“MI:0914”(association)0.460
RPP38CPSF1psi-mi:“MI:0915”(physical association)0.400
CPSF1WWOXpsi-mi:“MI:0915”(physical association)0.400
ECE1CPSF1psi-mi:“MI:0915”(physical association)0.370

BioGRID (357): CPSF1 (Affinity Capture-Western), CPSF1 (Two-hybrid), CPSF1 (Affinity Capture-RNA), CPSF1 (Affinity Capture-RNA), CPSF1 (Affinity Capture-MS), CPSF1 (Affinity Capture-MS), CPSF1 (Affinity Capture-MS), CPSF1 (Affinity Capture-MS), CPSF1 (Affinity Capture-MS), CPSF1 (Co-fractionation), CPSF1 (Co-fractionation), CPSF2 (Co-fractionation), CPSF3 (Co-fractionation), CPSF4 (Co-fractionation), CSTF2 (Co-fractionation)

ESM2 similar proteins: A0A0R4IC37, A1A4K3, A2CEI4, B1WC10, E9PY46, F1QEB7, F4IDS7, O08658, O13046, O75694, O75717, O95876, P33194, P37199, P59328, Q08D69, Q10569, Q10570, Q16531, Q32NR9, Q3U1J4, Q4ADV7, Q566H4, Q5DQR4, Q5R649, Q5U1Z0, Q5ZLG9, Q6P6Z0, Q6PGF3, Q6PJI9, Q7XWP1, Q802U2, Q805F9, Q8BMG7, Q8C0M0, Q8C456, Q8CEC0, Q8CJF7, Q8K1X1, Q8NFP9

Diamond homologs: A0A0R4IC37, A8XPU7, Q10569, Q10570, Q7XWP1, Q9EPU4, Q9FGR0, Q9N4C2, Q9V726, Q4WLI5, Q5B1X8

SIGNOR signaling

1 interactions.

AEffectBMechanism
CPSF1“form complex”“CPSF complex”binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 120 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Processing of Intronless Pre-mRNAs853.7×1e-10
RNA Polymerase II Transcription Termination1025.8×4e-10
mRNA 3’-end processing1125.5×6e-11
mRNA Polyadenylation2020.7×1e-18
Transport of Mature mRNA Derived from an Intronless Transcript619.2×4e-05
Processing of Capped Intron-Containing Pre-mRNA1312.6×2e-09
mRNA Splicing810.3×5e-05
Dengue Virus-Host Interactions189.7×6e-11

GO biological processes:

GO termPartnersFoldFDR
mRNA 3’-end processing736.4×5e-07
regulation of alternative mRNA splicing, via spliceosome511.3×8e-03
mRNA splicing, via spliceosome119.3×7e-06
mRNA processing118.0×2e-05

Disease & clinical

Clinical variants and AI predictions

ClinVar

318 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic5
Likely pathogenic3
Uncertain significance233
Likely benign18
Benign1

Top pathogenic / likely-pathogenic (8)

Variant IDHGVSClassification
1074846NM_013291.3(CPSF1):c.2524dup (p.Glu842fs)Pathogenic
829510NM_013291.3(CPSF1):c.4146-2A>GPathogenic
829512NM_013291.3(CPSF1):c.3823G>T (p.Asp1275Tyr)Pathogenic
829514NM_013291.3(CPSF1):c.1858C>T (p.Gln620Ter)Pathogenic
837526NM_013291.3(CPSF1):c.1603_1604del (p.Met535fs)Pathogenic
3068158NM_013291.3(CPSF1):c.539+1G>ALikely pathogenic
3235178NM_013291.3(CPSF1):c.1258_1259del (p.Lys420fs)Likely pathogenic
4293872NM_013291.3(CPSF1):c.2156del (p.Asp719fs)Likely pathogenic

SpliceAI

5358 predictions. Top by Δscore:

VariantEffectΔscore
8:144393450:A:ACdonor_gain1.0000
8:144393451:C:CCdonor_gain1.0000
8:144393451:CT:Cdonor_gain1.0000
8:144393451:CTA:Cdonor_gain1.0000
8:144393451:CTAT:Cdonor_gain1.0000
8:144393451:CTATG:Cdonor_gain1.0000
8:144393587:CATC:Cacceptor_gain1.0000
8:144393589:TC:Tacceptor_gain1.0000
8:144393590:CC:Cacceptor_gain1.0000
8:144393590:CCTGG:Cacceptor_loss1.0000
8:144393591:C:CCacceptor_gain1.0000
8:144393591:CT:Cacceptor_loss1.0000
8:144393592:T:Aacceptor_loss1.0000
8:144393661:GCTC:Gdonor_loss1.0000
8:144393662:CTCA:Cdonor_loss1.0000
8:144393663:TCA:Tdonor_loss1.0000
8:144393664:CACCG:Cdonor_loss1.0000
8:144393666:C:CTdonor_loss1.0000
8:144393792:GGTGG:Gacceptor_gain1.0000
8:144393793:GTGG:Gacceptor_gain1.0000
8:144393794:TGG:Tacceptor_gain1.0000
8:144393795:GG:Gacceptor_gain1.0000
8:144393796:GC:Gacceptor_loss1.0000
8:144393797:C:CAacceptor_loss1.0000
8:144393797:C:CCacceptor_gain1.0000
8:144393801:C:CTacceptor_gain1.0000
8:144393802:A:Tacceptor_gain1.0000
8:144393806:G:Cacceptor_gain1.0000
8:144393806:G:GCacceptor_gain1.0000
8:144394039:C:CCacceptor_gain1.0000

AlphaMissense

9421 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
8:144393468:C:TG1423D1.000
8:144393480:G:TA1419D1.000
8:144393715:A:GL1366P1.000
8:144393727:A:GL1362P1.000
8:144393736:A:GL1359P1.000
8:144394011:A:GL1296P1.000
8:144394028:A:CS1290R1.000
8:144394028:A:TS1290R1.000
8:144394030:T:GS1290R1.000
8:144394422:A:GL1234P1.000
8:144394446:T:AD1226V1.000
8:144394503:T:AD1207V1.000
8:144394503:T:CD1207G1.000
8:144394504:C:GD1207H1.000
8:144394508:G:CF1205L1.000
8:144394508:G:TF1205L1.000
8:144394509:A:CF1205C1.000
8:144394509:A:GF1205S1.000
8:144394510:A:GF1205L1.000
8:144394510:A:TF1205I1.000
8:144394651:C:TG1187D1.000
8:144394886:C:TG1137E1.000
8:144394887:C:GG1137R1.000
8:144394887:C:TG1137R1.000
8:144394926:C:AG1124W1.000
8:144395120:A:GW1084R1.000
8:144395120:A:TW1084R1.000
8:144395534:A:CS999R1.000
8:144395534:A:TS999R1.000
8:144395536:T:GS999R1.000

dbSNP variants (sampled 300 via entrez): RS1000004491 (8:144410450 T>A), RS1000079849 (8:144400198 C>G,T), RS1000327769 (8:144402263 G>A), RS1001277393 (8:144405090 T>G), RS1001364441 (8:144393113 T>C), RS1001460151 (8:144400042 G>A), RS1002563980 (8:144408891 C>A), RS1002674087 (8:144406210 G>T), RS1002854810 (8:144400890 C>G,T), RS1003237699 (8:144411153 G>T), RS1003544905 (8:144401132 T>C), RS1003673112 (8:144404831 C>A,T), RS1003835135 (8:144409872 A>G), RS1004352058 (8:144407424 G>A), RS1004581928 (8:144401726 T>C)

Disease associations

OMIM: gene MIM:606027 | disease phenotypes: MIM:618827

GenCC curated gene-disease

DiseaseClassificationInheritance
myopia 27StrongAutosomal dominant

Mondo (1): myopia 27 (MONDO:0032941)

Orphanet (0):

HPO phenotypes

3 total (3 of 3 shown, HPO-id order):

HPOTerm
HP:0000006Autosomal dominant inheritance
HP:0007800Increased axial length of the globe
HP:0011003High myopia

GWAS associations

1 associations (top):

StudyTraitp-value
GCST002598_30Educational attainment9.000000e-06

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004784self reported educational attainment

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL5724781 (SINGLE PROTEIN)

Molecules with ChEMBL bioactivity

1 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 1,538 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL1232461MOLIBRESIB21,538

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

3 potent at pChembl≥5 of 3 total, top 3 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
8.01Kd9.747nMCHEMBL3752910
7.93ED5011.83nMCHEMBL3752910
5.81IC501540nMMOLIBRESIB

PubChem BioAssay actives

2 with measured affinity, of 8 total; 2 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
4-methyl-3-[(1-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2149844: Binding affinity to human CPSF1 incubated for 45 mins by Kinobead based pull down assaykd0.0097uM
2-[(4S)-6-(4-chlorophenyl)-8-methoxy-1-methyl-4H-[1,2,4]triazolo[4,3-a][1,4]benzodiazepin-4-yl]-N-ethylacetamide2178616: Inhibition of CPSF1 (unknown origin) incubated for 1 hr by colloidal coomassie staining based LC-MS/MS analysisic501.5400uM

CTD chemical–gene interactions

31 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arsenitedecreases expression, affects cotreatment, increases abundance, increases expression4
FR900359affects phosphorylation1
bisphenol Adecreases expression1
potassium perchloratedecreases expression1
arseniteaffects binding, decreases reaction1
cobaltous chlorideincreases expression1
pyrrolidine dithiocarbamic acidaffects cotreatment, increases expression1
bathocuproine sulfonateaffects cotreatment, increases expression1
beta-methylcholineaffects expression1
ICG 001increases expression1
MT19c compounddecreases expression1
4-(4-((5-(4,5-dimethyl-2-nitrophenyl)-2-furanyl)methylene)-4,5-dihydro-3-methyl-5-oxo-1H-pyrazol-1-yl)benzoic acidincreases expression1
Arsenicaffects cotreatment, decreases expression, increases abundance, increases expression1
Caffeineaffects phosphorylation1
Cannabidiolaffects cotreatment, decreases expression1
Cisplatinaffects cotreatment, increases expression1
Cuprizoneaffects cotreatment, decreases expression1
Ivermectindecreases expression1
Leadaffects expression1
Methotrexateaffects response to substance1
Piroxicamaffects cotreatment, increases expression1
Ribonucleotidesaffects binding1
Rotenoneincreases expression1
Seleniumincreases expression1
Smokedecreases expression1
Tobacco Smoke Pollutiondecreases expression1
Valproic Acidincreases methylation1
Vitamin Eincreases expression1
beta-Naphthoflavonedecreases expression1
Acrylamidedecreases expression1

ChEMBL screening assays

7 unique, capped per target: 7 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5652886BindingBinding affinity to human CPSF1 incubated for 45 mins by Kinobead based pull down assayNVP-BHG712: Effects of Regioisomers on the Affinity and Selectivity toward the EPHrin Family. — ChemMedChem

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Associated diseases: myopia 27
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): myopia 27