CPSF4

gene
On this page

Also known as NARCPSF30

Summary

CPSF4 (cleavage and polyadenylation specific factor 4, HGNC:2327) is a protein-coding gene on chromosome 7q22.1, encoding Cleavage and polyadenylation specificity factor subunit 4 (O95639). Component of the cleavage and polyadenylation specificity factor (CPSF) complex that play a key role in pre-mRNA 3’-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. It is a common-essential gene (DepMap: required in 99.8% of cancer cell lines).

Inhibition of the nuclear export of poly(A)-containing mRNAs caused by the influenza A virus NS1 protein requires its effector domain. The NS1 effector domain functionally interacts with the cellular 30 kDa subunit of cleavage and polyadenylation specific factor 4, an essential component of the 3’ end processing machinery of cellular pre-mRNAs. In influenza virus-infected cells, the NS1 protein is physically associated with cleavage and polyadenylation specific factor 4, 30kD subunit. Binding of the NS1 protein to the 30 kDa protein in vitro prevents CPSF binding to the RNA substrate and inhibits 3’ end cleavage and polyadenylation of host pre-mRNAs. Thus the NS1 protein selectively inhibits the nuclear export of cellular, and not viral, mRNAs. Multiple alternatively spliced transcript variants that encode different isoforms have been described for this gene.

Source: NCBI Gene 10898 — RefSeq curated summary.

At a glance

  • GWAS associations: 8
  • Clinical variants (ClinVar): 3 total
  • Druggable target: yes
  • Cancer dependency (DepMap): dependent in 99.8% of screened cell lines (common-essential)
  • MANE Select transcript: NM_006693

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:2327
Approved symbolCPSF4
Namecleavage and polyadenylation specific factor 4
Location7q22.1
Locus typegene with protein product
StatusApproved
AliasesNAR, CPSF30
Ensembl geneENSG00000160917
Ensembl biotypeprotein_coding
OMIM603052
Entrez10898

Gene structure

Transcript identifiers

Ensembl transcripts: 26 — 20 protein_coding, 4 retained_intron, 1 nonsense_mediated_decay, 1 protein_coding_CDS_not_defined

ENST00000292476, ENST00000412686, ENST00000430038, ENST00000436336, ENST00000440514, ENST00000441580, ENST00000451876, ENST00000452047, ENST00000465132, ENST00000469897, ENST00000471455, ENST00000482251, ENST00000484112, ENST00000887788, ENST00000887790, ENST00000887791, ENST00000887794, ENST00000887795, ENST00000926770, ENST00000926771, ENST00000967753, ENST00000967754, ENST00000967755, ENST00000967756, ENST00000967757, ENST00000967758

RefSeq mRNA: 5 — MANE Select: NM_006693 NM_001081559, NM_001318160, NM_001318161, NM_001318162, NM_006693

CCDS: CCDS47652, CCDS5664, CCDS83205

Canonical transcript exons

ENST00000292476 — 8 exons

ExonStartEnd
ENSE000010554379945396699454136
ENSE000018429429943894399439185
ENSE000034875589945236899452440
ENSE000035125049945027699450371
ENSE000035494949944812199448273
ENSE000035963749944478999444839
ENSE000037848999945070299450795
ENSE000038489899945643299457373

Expression profiles

Bgee: expression breadth ubiquitous, 280 present calls, max score 98.81.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 9.8118 / max 132.3603, expressed in 1774 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
798746.83151712
798752.98031054

Top tissues by expression

293 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
secondary oocyteCL:000065598.81gold quality
oocyteCL:000002396.79gold quality
right hemisphere of cerebellumUBERON:001489096.50gold quality
mucosa of stomachUBERON:000119996.39gold quality
cerebellar hemisphereUBERON:000224596.11gold quality
cerebellar cortexUBERON:000212996.01gold quality
cerebellumUBERON:000203794.52gold quality
right frontal lobeUBERON:000281094.21gold quality
left testisUBERON:000453393.80gold quality
right testisUBERON:000453493.70gold quality
nerveUBERON:000102193.65gold quality
tibial nerveUBERON:000132393.65gold quality
endothelial cellCL:000011593.23gold quality
body of pancreasUBERON:000115093.11gold quality
ectocervixUBERON:001224992.93gold quality
granulocyteCL:000009492.76gold quality
type B pancreatic cellCL:000016992.62gold quality
left uterine tubeUBERON:000130392.62gold quality
skin of legUBERON:000151192.53gold quality
body of uterusUBERON:000985392.48gold quality
skin of abdomenUBERON:000141692.46gold quality
testisUBERON:000047392.29gold quality
lower esophagus mucosaUBERON:003583492.27gold quality
gastrocnemiusUBERON:000138892.23gold quality
adenohypophysisUBERON:000219692.21gold quality
Brodmann (1909) area 9UBERON:001354092.20gold quality
right ovaryUBERON:000211891.86gold quality
cingulate cortexUBERON:000302791.84gold quality
amygdalaUBERON:000187691.80gold quality
subcutaneous adipose tissueUBERON:000219091.80gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes3.68

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

57 targeting CPSF4, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-450099.9972.722367
HSA-LET-7A-5P99.9872.291790
HSA-LET-7B-5P99.9872.311790
HSA-LET-7C-5P99.9872.291790
HSA-LET-7E-5P99.9872.291790
HSA-LET-7F-5P99.9872.561784
HSA-LET-7G-5P99.9872.371784
HSA-LET-7I-5P99.9872.371788
HSA-MIR-98-5P99.9872.331787
HSA-MIR-60799.9773.625593
HSA-LET-7D-5P99.9671.761632
HSA-MIR-445899.9671.641650
HSA-MIR-1468-3P99.9672.743797
HSA-MIR-23A-3P99.9574.243163
HSA-MIR-23B-3P99.9574.243163
HSA-MIR-23C99.9573.923192
HSA-MIR-6721-5P99.9368.922981
HSA-MIR-367199.9073.043897
HSA-MIR-449699.8868.892236
HSA-MIR-797899.8666.90856
HSA-MIR-469899.8471.414303
HSA-MIR-3934-3P99.7665.511351
HSA-MIR-808499.7369.571760
HSA-MIR-4677-5P99.7070.091940
HSA-MIR-453099.6966.471509
HSA-MIR-3934-5P99.6764.04846
HSA-MIR-613299.6065.831554
HSA-MIR-6836-5P99.6065.621538
HSA-MIR-6832-3P99.5270.441726
HSA-MIR-5582-5P99.2771.421879

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 99.8% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 15)

  • Data show that the NS1A protein of the pathogenic H5N1 influenza A/Hong Kong/483/97 (HK97) virus isolated from humans has an intrinsic defect in CPSF30 binding. (PMID:17522219)
  • CPSF4 plays a critical role in regulating lung cancer cell proliferation and survival and may be a potential prognostic biomarker and therapeutic target for lung adenocarcinoma. (PMID:24358221)
  • Data show that 4 drug-like compounds from traditional Chinese Medicine (TCM) database were selected as potential inhibitors for the cleavage and polyadenylation specific factor CPSF30-binding site of influenza A virus of non-structural protein 1 (NS1A). (PMID:24562912)
  • CPSF4 upregulates human TERT promoter activity, human TERT expression and telomerase activity. (PMID:24618080)
  • unexpectedly found that CPSF subunits CPSF30 and Wdr33 directly contact AAUAAA (PMID:25301780)
  • These findings support a role for iron in some zinc-finger proteins. Using electrophoretic mobility shift assays and fluorescence anisotropy, we report that CPSF30 selectively recognizes the AU-rich hexamer (AAUAAA) sequence present in pre-mRNA, providing the first molecular-based evidence to our knowledge for CPSF30/RNA binding. (PMID:27071088)
  • Cleavage and polyadenylation specific factor 4 targets NF-kappaB/cyclooxygenase-2 signaling to promote the growth and survival of non-small cell lung carcinoma cells. (PMID:27450326)
  • Residues F103 and M106 within the NS1-CPSF4 binding region are important for viral replication. (PMID:28554059)
  • The authors define the molecular architecture of the core human CPSF complex comprising CPSF160, WDR33, CPSF30 and Fip1 and identify specific domains involved in inter-subunit interactions. Together, these results shed light on the function of CPSF in mediating polyA signal-dependent RNA cleavage and polyadenylation. (PMID:29274231)
  • MiR-4458 inhibits breast cancer cell growth, migration, and invasiveness by targeting CPSF4; results suggest that the miR-4458-CPSF4-COX-2-hTERT axis might serve as a potential target for the treatment of breast cancer patients (PMID:30970220)
  • Overproduced CPSF4 Promotes Cell Proliferation and Invasion via PI3K-AKT Signaling Pathway in Oral Squamous Cell Carcinoma. (PMID:33535057)
  • CPSF4 promotes triple negative breast cancer metastasis by upregulating MDM4. (PMID:34006850)
  • CPSF4 regulates circRNA formation and microRNA mediated gene silencing in hepatocellular carcinoma. (PMID:34103682)
  • CPSF4 promotes tumor-initiating phenotype by enhancing VEGF/NRP2/TAZ signaling in lung cancer. (PMID:36567417)
  • Cleavage and Polyadenylation-Specific Factor 4 (CPSF4) Expression Is Associated with Enhanced Prostate Cancer Cell Migration and Cell Cycle Dysregulation, In Vitro. (PMID:37629142)

Cross-species orthologs

6 orthologs

OrganismSymbolGene ID
danio_reriocpsf4ENSDARG00000020217
mus_musculusCpsf4ENSMUSG00000029625
rattus_norvegicusCpsf4ENSRNOG00000000985
rattus_norvegicusCpsf4l2ENSRNOG00000025217
rattus_norvegicusCpsf4-ps1ENSRNOG00000062241
drosophila_melanogasterClpFBGN0015621

Paralogs (2): ZC3H3 (ENSG00000014164), CPSF4L (ENSG00000187959)

Protein

Protein identifiers

Cleavage and polyadenylation specificity factor subunit 4O95639 (reviewed: O95639)

Alternative names: Cleavage and polyadenylation specificity factor 30 kDa subunit, NS1 effector domain-binding protein 1, No arches homolog

All UniProt accessions (7): O95639, B7Z7B0, C9JEV9, C9K0K2, F8WEL7, H7C016, H7C419

UniProt curated annotations — full annotation on UniProt →

Function. Component of the cleavage and polyadenylation specificity factor (CPSF) complex that play a key role in pre-mRNA 3’-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about cleavage and poly(A) addition. CPSF4 binds RNA polymers with a preference for poly(U).

Subunit / interactions. Component of the cleavage and polyadenylation specificity factor (CPSF) complex, composed of CPSF1, CPSF2, CPSF3, CPSF4 and FIP1L1. Interacts with FIP1L1. (Microbial infection) Interacts with influenza A virus NS1 blocks processing of pre-mRNAs, thereby preventing nuclear export of host cell mRNAs.

Subcellular location. Nucleus.

Miscellaneous. May be due to a competing acceptor splice site.

Similarity. Belongs to the CPSF4/YTH1 family.

Isoforms (3)

UniProt IDNamesCanonical?
O95639-11yes
O95639-22
O95639-33

RefSeq proteins (5): NP_001075028, NP_001305089, NP_001305090, NP_001305091, NP_006684* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000571Znf_CCCHDomain
IPR001878Znf_CCHCDomain
IPR036855Znf_CCCH_sfHomologous_superfamily
IPR036875Znf_CCHC_sfHomologous_superfamily
IPR041686Znf-CCCH_3Domain
IPR045348CPSF4/Yth1Family

Pfam: PF00098, PF00642, PF15663

UniProt features (32 total): helix 11, zinc finger region 6, strand 5, modified residue 4, splice variant 2, turn 2, chain 1, region of interest 1

Structure

Experimental structures (PDB)

13 structures.

PDBMethodResolution (Å)
7K95X-RAY DIFFRACTION1.9
2RHKX-RAY DIFFRACTION1.95
7ZYHX-RAY DIFFRACTION2.2
8E3IELECTRON MICROSCOPY2.53
9OXEELECTRON MICROSCOPY2.53
8E3QELECTRON MICROSCOPY2.68
8R8RELECTRON MICROSCOPY2.79
6URGELECTRON MICROSCOPY3
6FUWELECTRON MICROSCOPY3.07
9OXSELECTRON MICROSCOPY3.07
6DNHELECTRON MICROSCOPY3.4
6UROELECTRON MICROSCOPY3.6
2D9NSOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O95639-F176.600.38

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (4): 212, 267, 200, 202

Function

Pathways and Gene Ontology

Reactome pathways

9 pathways

IDPathway
R-HSA-159231Transport of Mature mRNA Derived from an Intronless Transcript
R-HSA-168315Inhibition of Host mRNA Processing and RNA Silencing
R-HSA-6784531tRNA processing in the nucleus
R-HSA-72187mRNA 3’-end processing
R-HSA-73856RNA Polymerase II Transcription Termination
R-HSA-77595Processing of Intronless Pre-mRNAs
R-HSA-9770562mRNA Polyadenylation
R-HSA-9918481Dengue Virus-Host Interactions
R-HSA-72203Processing of Capped Intron-Containing Pre-mRNA

MSigDB gene sets: 191 (showing top): MORF_MSH3, MORF_BRCA1, MORF_ATRX, AP2_Q3, PUJANA_CHEK2_PCC_NETWORK, MARTINEZ_RB1_TARGETS_DN, MORF_PPP5C, REACTOME_MRNA_3_END_PROCESSING, MORF_FANCG, REACTOME_PROCESSING_OF_CAPPED_INTRON_CONTAINING_PRE_MRNA, GOBP_MRNA_3_END_PROCESSING, MORF_RAP1A, DODD_NASOPHARYNGEAL_CARCINOMA_UP, PUJANA_BRCA_CENTERED_NETWORK, MORF_MT4

GO Biological Process (2): mRNA processing (GO:0006397), mRNA 3’-end processing (GO:0031124)

GO Molecular Function (6): RNA binding (GO:0003723), zinc ion binding (GO:0008270), sequence-specific double-stranded DNA binding (GO:1990837), nucleic acid binding (GO:0003676), protein binding (GO:0005515), metal ion binding (GO:0046872)

GO Cellular Component (3): nucleoplasm (GO:0005654), mRNA cleavage and polyadenylation specificity factor complex (GO:0005847), nucleus (GO:0005634)

Reactome top-level categories

Rollup of top-9 pathways:

CategoryPathways
Transport of Mature mRNAs Derived from Intronless Transcripts1
NS1 Mediated Effects on Host Pathways1
tRNA processing1
Processing of Capped Intron-Containing Pre-mRNA1
RNA Polymerase II Transcription1
Processing of Capped Intronless Pre-mRNA1
mRNA 3’-end processing1
Dengue Virus Infection1
Metabolism of RNA1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
binding2
RNA processing1
mRNA metabolic process1
mRNA processing1
RNA 3’-end processing1
nucleic acid binding1
transition metal ion binding1
double-stranded DNA binding1
sequence-specific DNA binding1
cation binding1
nuclear lumen1
cellular anatomical structure1
mRNA cleavage factor complex1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

1512 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CPSF4CPSF1Q10570999
CPSF4CPSF2Q9P2I0999
CPSF4CPSF3Q9UKF6997
CPSF4WDR33Q9C0J8997
CPSF4FIP1L1Q6UN15971
CPSF4CSTF3Q12996903
CPSF4NOC2LQ9Y3T9882
CPSF4CSTF2P33240878
CPSF4CSTF1Q05048874
CPSF4SYMPKQ92797856
CPSF4PCF11O94913834
CPSF4PABPN1Q86U42799
CPSF4CPSF7Q8N684796
CPSF4NUDT21O43809787
CPSF4PAPOLAP51003777

IntAct

100 interactions, top by confidence:

ABTypeScore
EAF1ELL2psi-mi:“MI:0914”(association)0.840
CPSF4FIP1L1psi-mi:“MI:0914”(association)0.660
CPSF4FIP1L1psi-mi:“MI:0915”(physical association)0.660
CPSF3CPSF4psi-mi:“MI:0914”(association)0.640
MEOX2CPSF4psi-mi:“MI:0915”(physical association)0.560
CPSF4psi-mi:“MI:0915”(physical association)0.560
CPSF4MEOX2psi-mi:“MI:0915”(physical association)0.560
CPSF4psi-mi:“MI:0915”(physical association)0.560
CPSF1CPSF4psi-mi:“MI:0915”(physical association)0.560
SYMPKCPSF4psi-mi:“MI:0914”(association)0.530
WDR33CPSF4psi-mi:“MI:0914”(association)0.530
ZC3H18AQRpsi-mi:“MI:0914”(association)0.530
NS1CPSF4psi-mi:“MI:0915”(physical association)0.500
CDC73CPSF4psi-mi:“MI:0914”(association)0.500
CPSF4CDC73psi-mi:“MI:0915”(physical association)0.500
CPSF6DDX39Apsi-mi:“MI:0914”(association)0.480
DDX21MED19psi-mi:“MI:2364”(proximity)0.480
NPM1CPSF4psi-mi:“MI:0914”(association)0.480
NS1CPSF4psi-mi:“MI:0915”(physical association)0.400
CPSF4NSpsi-mi:“MI:0915”(physical association)0.370
NS1CPSF4psi-mi:“MI:0915”(physical association)0.370
CPSF4NS1psi-mi:“MI:0915”(physical association)0.370
NSCPSF4psi-mi:“MI:0915”(physical association)0.370

BioGRID (169): CPSF4 (Affinity Capture-Western), CPSF4 (Two-hybrid), FIP1L1 (Two-hybrid), CPSF4 (Affinity Capture-MS), CPSF4 (Affinity Capture-MS), FIP1L1 (Two-hybrid), CPSF2 (Co-fractionation), CPSF4 (Co-fractionation), CPSF4 (Co-fractionation), CPSF4 (Co-fractionation), WDR33 (Co-fractionation), FIP1L1 (Affinity Capture-Western), CPSF4 (Reconstituted Complex), CPSF4 (Affinity Capture-MS), CPSF4 (Affinity Capture-MS)

ESM2 similar proteins: A0A1L1SUL6, A4QJJ7, A4QK16, A6H5F6, A6MM34, A6NH52, A6NMK7, A7M939, A8SEA1, A8W3M5, B0Z4S1, B0Z505, B0Z589, B0Z5H3, B1AR13, B1NWE8, B2LMJ1, O19048, O19137, O95639, P0C7P0, P0DI19, P46292, P60335, P83870, P83871, P98138, Q09G48, Q0G9M1, Q0WMV8, Q15365, Q1KXW1, Q332Y0, Q5E9A3, Q5FVR7, Q5R654, Q63789, Q66KE3, Q68S08, Q6DJP7

Diamond homologs: A6NMK7, A9LNK9, O19137, O95639, Q4P384, Q4WKD9, Q59T36, Q5BGN2, Q5FVR7, Q66KE3, Q6DJP7, Q8BQZ5, Q9VPT8, Q06102, Q2URI6, Q4IPA4, Q6BTT1, Q6C922, Q6CKU1, Q6FTL0, Q758T3, Q7SGR2, Q9UTD1, P0CS64, P0CS65, Q0DA50, Q13064, Q9FE91

SIGNOR signaling

1 interactions.

AEffectBMechanism
CPSF4“form complex”“CPSF complex”binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 97 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Processing of Intronless Pre-mRNAs971.4×1e-13
mRNA 3’-end processing1643.8×1e-20
RNA Polymerase II Transcription Termination1442.7×7e-18
Transport of Mature Transcript to Cytoplasm631.7×1e-06
mRNA Polyadenylation2024.4×1e-20
Transport of Mature mRNA Derived from an Intronless Transcript622.7×7e-06
mRNA Splicing1015.2×4e-08
Transport of Mature mRNA derived from an Intron-Containing Transcript714.8×1e-05

GO biological processes:

GO termPartnersFoldFDR
mRNA 3’-end processing745.2×4e-08
regulation of alternative mRNA splicing, via spliceosome616.8×1e-04
mRNA transport515.1×1e-03
mRNA processing1412.7×2e-09
mRNA splicing, via spliceosome1111.6×4e-07
RNA splicing1010.1×6e-06
chromatin remodeling75.9×9e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

3 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance1
Likely benign0
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

2297 predictions. Top by Δscore:

VariantEffectΔscore
7:99439186:G:GGdonor_gain1.0000
7:99444787:A:AGacceptor_gain1.0000
7:99444788:G:GGacceptor_gain1.0000
7:99444788:GA:Gacceptor_gain1.0000
7:99444840:GT:Gdonor_loss1.0000
7:99444841:T:Gdonor_loss1.0000
7:99448115:C:Aacceptor_gain1.0000
7:99448119:AGGG:Aacceptor_gain1.0000
7:99448120:GGGG:Gacceptor_gain1.0000
7:99448269:GTTCG:Gdonor_gain1.0000
7:99450367:GCACG:Gdonor_gain1.0000
7:99450370:CGGT:Cdonor_loss1.0000
7:99450373:T:Gdonor_loss1.0000
7:99450793:GCA:Gdonor_gain1.0000
7:99450796:G:GGdonor_gain1.0000
7:99452343:A:AGacceptor_gain1.0000
7:99452344:T:Gacceptor_gain1.0000
7:99452351:T:Aacceptor_gain1.0000
7:99452352:G:Aacceptor_gain1.0000
7:99452358:T:Aacceptor_gain1.0000
7:99454136:GGT:Gdonor_loss1.0000
7:99454137:GTG:Gdonor_loss1.0000
7:99439138:TG:Tdonor_gain0.9900
7:99439139:GG:Gdonor_gain0.9900
7:99439181:GGACA:Gdonor_gain0.9900
7:99439182:GACA:Gdonor_gain0.9900
7:99439182:GACAG:Gdonor_gain0.9900
7:99439183:A:Tdonor_gain0.9900
7:99444784:TACAG:Tacceptor_gain0.9900
7:99444785:A:AGacceptor_gain0.9900

AlphaMissense

1793 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
7:99439134:G:CA18P1.000
7:99439152:G:AG24R1.000
7:99439152:G:CG24R1.000
7:99439153:G:AG24E1.000
7:99439170:T:CF30L1.000
7:99439171:T:CF30S1.000
7:99439171:T:GF30C1.000
7:99439172:C:AF30L1.000
7:99439172:C:GF30L1.000
7:99439180:T:CM33T1.000
7:99439180:T:GM33R1.000
7:99439181:G:AM33I1.000
7:99439181:G:CM33I1.000
7:99439181:G:TM33I1.000
7:99439182:G:CD34H1.000
7:99439183:A:CD34A1.000
7:99439183:A:TD34V1.000
7:99444790:G:CK35N1.000
7:99444790:G:TK35N1.000
7:99444804:T:AV40D1.000
7:99444806:T:AC41S1.000
7:99444806:T:CC41R1.000
7:99444807:G:AC41Y1.000
7:99444807:G:CC41S1.000
7:99444807:G:TC41F1.000
7:99444808:T:GC41W1.000
7:99444815:T:CF44L1.000
7:99444817:T:AF44L1.000
7:99444817:T:GF44L1.000
7:99444830:T:AC49S1.000

dbSNP variants (sampled 300 via entrez): RS1000236639 (7:99440373 CAG>C), RS1000357355 (7:99445716 T>C), RS1000711537 (7:99438279 C>T), RS1000858065 (7:99444334 G>A,T), RS1000889385 (7:99444538 A>G,T), RS1001152144 (7:99449166 G>A), RS1001183304 (7:99449494 G>C), RS1001187819 (7:99438586 C>A,T), RS1001271470 (7:99450644 C>T), RS1001305109 (7:99456671 A>G), RS1001792642 (7:99439965 C>T), RS1001840351 (7:99452694 C>A), RS1002117584 (7:99439598 T>A,G), RS1002180223 (7:99437931 C>T), RS1002211235 (7:99438228 C>T)

Disease associations

OMIM: gene MIM:603052 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

8 associations (top):

StudyTraitp-value
GCST004730_2Facial emotion recognition (sad faces)3.000000e-06
GCST005144_1Tacrolimus trough concentration in kidney transplant patients2.000000e-17
GCST005335_2Diffuse cutaneous systemic sclerosis2.000000e-06
GCST006249_21Serum metabolite levels4.000000e-11
GCST006249_45Serum metabolite levels6.000000e-32
GCST006249_47Serum metabolite levels1.000000e-45
GCST90002389_160Lymphocyte percentage of white cells4.000000e-16
GCST90002399_184Neutrophil percentage of white cells2.000000e-15

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0008329facial emotion recognition measurement
EFO:0008458tacrolimus measurement
EFO:0007993lymphocyte percentage of leukocytes
EFO:0007990neutrophil percentage of leukocytes

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL6066178 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

25 total (human), top 25 by PubMed support.

ChemicalActions (top 5)PubMed papers
Cadmium Chloridedecreases expression, increases abundance, increases expression2
aristolochic acid Idecreases expression1
triphenyl phosphateaffects expression1
beta-methylcholineaffects expression1
K 7174decreases expression1
4-chloro-N-((4-(1,1-dimethylethyl)phenyl)methyl)-3-ethyl-1-methyl-1H-pyrazole-5-carboxamidedecreases expression1
erucylphospho-N,N,N-trimethylpropylammoniumdecreases expression1
pyrimidifendecreases expression1
thifluzamidedecreases expression1
jinfukangaffects cotreatment, increases expression1
Sunitinibincreases expression1
Cadmiumincreases abundance, increases expression1
Caffeinedecreases phosphorylation1
Cisplatinaffects cotreatment, increases expression1
Ethyl Methanesulfonateincreases expression1
Ivermectindecreases expression1
Leadaffects expression1
Methyl Methanesulfonateincreases expression1
Ribonucleotidesaffects binding1
Rotenonedecreases expression1
Smokedecreases expression1
Tretinoindecreases expression1
Valproic Acidincreases expression1
Aflatoxin B1increases expression1
Copper Sulfatedecreases expression1

ChEMBL screening assays

6 unique, capped per target: 6 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5697614BindingInhibition of CPSF4 (unknown origin) assessed as fold change at 10 uM incubated for 1 hr by colloidal coomassie staining based LC-MS/MS analysisInhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia. — Nature

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): diffuse scleroderma