PRPF40B

gene
On this page

Also known as HYPC

Summary

PRPF40B (pre-mRNA processing factor 40B, HGNC:25031) is a protein-coding gene on chromosome 12q13.12, encoding Pre-mRNA-processing factor 40 homolog B (Q6NWY9). May be involved in pre-mRNA splicing.

This gene encodes a WW-domain containing protein similar to yeast splicing factor PRP40. This protein has been shown to interact with Huntingtin and methyl CpG binding protein 2 (MeCP2). Alternative splicing results in different transcript variants.

Source: NCBI Gene 25766 — RefSeq curated summary.

At a glance

  • GWAS associations: 1
  • Clinical variants (ClinVar): 95 total
  • MANE Select transcript: NM_001031698

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:25031
Approved symbolPRPF40B
Namepre-mRNA processing factor 40B
Location12q13.12
Locus typegene with protein product
StatusApproved
AliasesHYPC
Ensembl geneENSG00000110844
Ensembl biotypeprotein_coding
OMIM621019
Entrez25766

Gene structure

Transcript identifiers

Ensembl transcripts: 28 — 16 protein_coding, 7 retained_intron, 4 protein_coding_CDS_not_defined, 1 nonsense_mediated_decay

ENST00000261897, ENST00000380281, ENST00000508736, ENST00000527253, ENST00000546626, ENST00000547764, ENST00000548399, ENST00000548436, ENST00000548825, ENST00000549547, ENST00000551063, ENST00000551111, ENST00000551269, ENST00000551320, ENST00000551418, ENST00000552301, ENST00000853189, ENST00000853190, ENST00000853191, ENST00000853192, ENST00000853193, ENST00000853194, ENST00000922971, ENST00000922972, ENST00000922973, ENST00000960369, ENST00000960370, ENST00000960371

RefSeq mRNA: 11 — MANE Select: NM_001031698 NM_001031698, NM_001363607, NM_001379030, NM_001379031, NM_001379032, NM_001379033, NM_001379034, NM_001379035, NM_001379036, NM_001379037, NM_012272

CCDS: CCDS31796, CCDS86298

Canonical transcript exons

ENST00000548825 — 26 exons

ExonStartEnd
ENSE000009197244963773349637824
ENSE000009197264964223549642372
ENSE000009197274964258049642675
ENSE000009197314964386149644004
ENSE000016573134963453849634602
ENSE000017083764963747049637584
ENSE000017172864963584349635993
ENSE000017667384963536549635473
ENSE000017853494963671649636849
ENSE000017979994963509949635263
ENSE000023428044962356149623593
ENSE000032665904964410049644665
ENSE000034724544963433249634455
ENSE000034912704963186049631925
ENSE000034993714963285549632880
ENSE000034995934963363749633661
ENSE000035221274964369149643752
ENSE000035262354964293049643016
ENSE000035355984963054549630625
ENSE000035743414963388649634092
ENSE000035832414964190849642024
ENSE000035989614963259649632623
ENSE000036088274963342749633547
ENSE000036265814963301449633124
ENSE000036407074964322349643397
ENSE000037596474963140149631544

Expression profiles

Bgee: expression breadth ubiquitous, 196 present calls, max score 94.47.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 4.6888 / max 57.4145, expressed in 1463 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
1253442.86981158
1253431.1958760
1253420.6231360

Top tissues by expression

242 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right lobe of thyroid glandUBERON:000111994.47gold quality
left lobe of thyroid glandUBERON:000112094.12gold quality
adenohypophysisUBERON:000219693.48gold quality
left testisUBERON:000453393.29gold quality
sural nerveUBERON:001548893.29gold quality
right testisUBERON:000453493.28gold quality
left ovaryUBERON:000211993.08gold quality
right uterine tubeUBERON:000130292.98gold quality
endocervixUBERON:000045892.83gold quality
right ovaryUBERON:000211892.83gold quality
thyroid glandUBERON:000204692.74gold quality
metanephros cortexUBERON:001053392.28gold quality
body of uterusUBERON:000985391.95gold quality
mucosa of stomachUBERON:000119991.87gold quality
pituitary glandUBERON:000000791.81gold quality
ectocervixUBERON:001224991.29gold quality
right hemisphere of cerebellumUBERON:001489090.87gold quality
apex of heartUBERON:000209890.78gold quality
testisUBERON:000047390.51gold quality
right adrenal gland cortexUBERON:003582790.46gold quality
left uterine tubeUBERON:000130390.33gold quality
right frontal lobeUBERON:000281090.30gold quality
gastrocnemiusUBERON:000138890.29gold quality
right adrenal glandUBERON:000123390.12gold quality
descending thoracic aortaUBERON:000234590.00gold quality
cerebellar hemisphereUBERON:000224589.98gold quality
esophagogastric junction muscularis propriaUBERON:003584189.93gold quality
ascending aortaUBERON:000149689.89gold quality
thoracic aortaUBERON:000151589.88gold quality
muscle of legUBERON:000138389.83gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no3.17

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

25 targeting PRPF40B, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-340-5P100.0072.504437
HSA-MIR-318599.9968.121959
HSA-MIR-185-3P99.9567.011743
HSA-MIR-4731-5P99.8967.232537
HSA-MIR-674599.7465.331321
HSA-MIR-4690-5P99.6566.24813
HSA-MIR-6715B-5P99.6469.631420
HSA-MIR-182799.6368.573265
HSA-MIR-4649-3P99.5666.901783
HSA-MIR-426999.5569.891373
HSA-MIR-766-5P99.4767.912225
HSA-MIR-4716-5P98.8268.571168
HSA-MIR-1212598.5967.541044
HSA-MIR-6764-3P98.4467.641153
HSA-MIR-6824-3P98.4467.621154
HSA-MIR-1910-3P98.4467.511695
HSA-MIR-6804-5P98.3965.771084
HSA-MIR-6511A-5P98.1367.471770
HSA-MIR-6880-5P98.0865.591282
HSA-MIR-6742-3P97.9564.501490
HSA-MIR-7113-5P97.8867.331735
HSA-MIR-452197.7367.64684
HSA-MIR-335-5P97.1068.121022
HSA-MIR-342-3P96.4467.481344
HSA-MIR-55495.2066.98341

Literature-anchored findings (GeneRIF, showing 2)

  • Placing our data in a functional context, we also show that PRPF40B depletion increased Fas/CD95 receptor number and cell apoptosis, which suggests the ability of PRPF40B to alter the alternative splicing of key apoptotic genes to regulate cell survival (PMID:25605964)
  • Study characterizes the transcriptomic regulation of PRPF40B in K562 cells upon knockout (KO) and rescue with MDS mutations, to reveal a set of targets with weak splice sites and A-rich motifs. Importantly, PRPF40B KO up-regulated a hypoxia signature which, together with its low expression in AML, suggests a role of PRPF40B in myeloid leukemias. (PMID:31088860)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
mus_musculusPrpf40bENSMUSG00000023007
rattus_norvegicusPrpf40bENSRNOG00000052539
drosophila_melanogasterPrp40FBGN0031492
caenorhabditis_elegansWBGENE00014218

Paralogs (1): PRPF40A (ENSG00000196504)

Protein

Protein identifiers

Pre-mRNA-processing factor 40 homolog BQ6NWY9 (reviewed: Q6NWY9)

Alternative names: Huntingtin yeast partner C, Huntingtin-interacting protein C

All UniProt accessions (4): A0A8Z5ADK7, Q6NWY9, F8VU11, H0YII7

UniProt curated annotations — full annotation on UniProt →

Function. May be involved in pre-mRNA splicing.

Subunit / interactions. Interacts with the N-terminus of HD.

Subcellular location. Nucleus speckle.

Tissue specificity. Expressed in the striatum and cortex of the brain (at protein level). Highly expressed in testis, fetal kidney and fetal brain. Moderately expressed in pancreas, skeletal muscle, placenta, brain and heart. Weakly expressed in colon, ileum, ovary, prostate, spleen, kidney and fetal lung.

Similarity. Belongs to the PRPF40 family.

Isoforms (4)

UniProt IDNamesCanonical?
Q6NWY9-11yes
Q6NWY9-22
Q6NWY9-33
Q6NWY9-44

RefSeq proteins (11): NP_001026868, NP_001350536, NP_001365959, NP_001365960, NP_001365961, NP_001365962, NP_001365963, NP_001365964, NP_001365965, NP_001365966, NP_036404 (=MANE)

Domains & families (InterPro)

IDNameType
IPR001202WW_domDomain
IPR002713FF_domainDomain
IPR036020WW_dom_sfHomologous_superfamily
IPR036517FF_domain_sfHomologous_superfamily
IPR039726Prp40-likeFamily

Pfam: PF00397, PF01846, PF25432

UniProt features (31 total): domain 8, compositionally biased region 8, splice variant 6, modified residue 4, region of interest 2, cross-link 2, chain 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q6NWY9-F169.160.35

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (6): 148, 764, 832, 852, 175, 838

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 95 (showing top): GAUSSMANN_MLL_AF4_FUSION_TARGETS_C_UP, GOBP_RNA_SPLICING, GOBP_MRNA_CIS_SPLICING_VIA_SPLICEOSOME, GOCC_U1_SNRNP, GOCC_U2_TYPE_SPLICEOSOMAL_COMPLEX, GOCC_NUCLEAR_SPECK, GOCC_NUCLEAR_BODY, GOCC_RIBONUCLEOPROTEIN_GRANULE, GOCC_PRESPLICEOSOME, GOCC_SPLICEOSOMAL_COMPLEX, GOCC_RIBONUCLEOPROTEIN_COMPLEX, KEGG_SPLICEOSOME, GOBP_MRNA_PROCESSING, GOCC_SUPRAMOLECULAR_COMPLEX, GOCC_SM_LIKE_PROTEIN_FAMILY_COMPLEX

GO Biological Process (4): mRNA splicing, via spliceosome (GO:0000398), mRNA cis splicing, via spliceosome (GO:0045292), mRNA processing (GO:0006397), RNA splicing (GO:0008380)

GO Molecular Function (2): RNA binding (GO:0003723), protein binding (GO:0005515)

GO Cellular Component (5): U1 snRNP (GO:0005685), nuclear matrix (GO:0016363), nuclear speck (GO:0016607), U2-type prespliceosome (GO:0071004), nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA processing2
RNA splicing, via transesterification reactions with bulged adenosine as nucleophile1
mRNA processing1
mRNA splicing, via spliceosome1
mRNA metabolic process1
nucleic acid binding1
binding1
spliceosomal snRNP complex1
nuclear lumen1
cellular anatomical structure1
nuclear ribonucleoprotein granule1
U2-type spliceosomal complex1
U1 snRNP1
U2 snRNP1
prespliceosome1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

862 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
PRPF40BFICDQ9BVA6833
PRPF40BU2AF2P26368830
PRPF40BSF3A1Q15459783
PRPF40BSETD2Q9BYW2664
PRPF40BZRSR2Q15696641
PRPF40BU2AF1Q01081610
PRPF40BSRSF2Q01130605
PRPF40BA0A0A6YYJ8A0A0A6YYJ8544
PRPF40BLUC7L2Q9Y383535
PRPF40BMAGEA3P43357529
PRPF40BZDHHC17Q8IUH5512
PRPF40BMECP2P51608510
PRPF40BHTTP42858500
PRPF40BBCORL1Q5H9F3494
PRPF40BSF1Q15637471

IntAct

9 interactions, top by confidence:

ABTypeScore
PRPF40BFUBP1psi-mi:“MI:0915”(physical association)0.560
PRPF40BP4HA1psi-mi:“MI:0915”(physical association)0.400
repTAF4psi-mi:“MI:0914”(association)0.350
PRPF40BGNB5psi-mi:“MI:0914”(association)0.350
PRPF40BRBM25psi-mi:“MI:0914”(association)0.350
ARGLU1PIAS2psi-mi:“MI:2364”(proximity)0.270
PRPF40BFUBP1psi-mi:“MI:0915”(physical association)0.000

BioGRID (44): PRPF40B (Co-fractionation), PRPF40B (Co-fractionation), PRPF40B (Co-fractionation), SRRM2 (Co-fractionation), PRPF40B (Affinity Capture-MS), PRPF40B (Affinity Capture-MS), PRPF40B (Affinity Capture-MS), RGS9 (Affinity Capture-MS), GNB5 (Affinity Capture-MS), RBM25 (Affinity Capture-MS), PRPF40B (Two-hybrid), HTT (Reconstituted Complex), FUBP1 (Two-hybrid), PRPF40B (Affinity Capture-MS), PRPF40B (Proximity Label-MS)

ESM2 similar proteins: A0JMV4, A4IFB1, A4IGK4, B6EUA9, D3ZTQ1, E6ZGB4, F1QLR3, O14776, O70523, O75376, O75400, P21127, P24788, P25440, P34600, P35922, P51114, P51115, Q12872, Q15059, Q2T9I5, Q2TBT7, Q3USH5, Q4KKX4, Q5BJ56, Q5R539, Q5T8P6, Q5TJG6, Q5XI81, Q60974, Q61584, Q6DDU9, Q6INA9, Q6MGA9, Q6NWY9, Q6NZN0, Q6P949, Q6PFK1, Q7JJ13, Q7ZTQ5

Diamond homologs: B6EUA9, F4JCC1, O14776, O75400, Q3B807, Q5VWI1, Q6NWY9, Q80W14, Q8CGF7, Q9R1C7, P34600, O04425

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

95 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance82
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

5343 predictions. Top by Δscore:

VariantEffectΔscore
12:49588635:A:Tacceptor_gain1.0000
12:49598749:GCCCA:Gdonor_loss1.0000
12:49598751:CCACC:Cdonor_loss1.0000
12:49598752:CA:Cdonor_loss1.0000
12:49598753:A:Tdonor_loss1.0000
12:49598943:CTTGC:Cacceptor_gain1.0000
12:49598944:TTGC:Tacceptor_gain1.0000
12:49598945:TGC:Tacceptor_gain1.0000
12:49598946:GC:Gacceptor_gain1.0000
12:49598947:CC:Cacceptor_gain1.0000
12:49598948:C:CCacceptor_gain1.0000
12:49598948:C:CGacceptor_loss1.0000
12:49598949:T:Gacceptor_loss1.0000
12:49599464:CCTA:Cdonor_loss1.0000
12:49599465:CTACC:Cdonor_loss1.0000
12:49599504:A:ACdonor_gain1.0000
12:49599504:ATGG:Adonor_gain1.0000
12:49599505:T:Cdonor_gain1.0000
12:49603181:CTA:Cdonor_loss1.0000
12:49603182:TA:Tdonor_loss1.0000
12:49603184:C:CAdonor_loss1.0000
12:49603184:CCTTG:Cdonor_gain1.0000
12:49603186:TTGTG:Tdonor_gain1.0000
12:49603364:TCAC:Tacceptor_gain1.0000
12:49603365:CACC:Cacceptor_gain1.0000
12:49603367:CCT:Cacceptor_loss1.0000
12:49603368:C:CCacceptor_gain1.0000
12:49603369:T:Cacceptor_loss1.0000
12:49604309:TCA:Tdonor_loss1.0000
12:49604310:CACC:Cdonor_loss1.0000

AlphaMissense

5843 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
12:49633089:T:AW120R1.000
12:49633089:T:CW120R1.000
12:49633091:G:CW120C1.000
12:49633091:G:TW120C1.000
12:49633448:T:AW139R1.000
12:49633448:T:CW139R1.000
12:49633514:T:AW161R1.000
12:49633514:T:CW161R1.000
12:49633516:G:CW161C1.000
12:49633516:G:TW161C1.000
12:49634433:T:CF283S1.000
12:49634442:T:CL286P1.000
12:49634445:T:CL287P1.000
12:49634542:T:AV292D1.000
12:49634559:T:AW298R1.000
12:49634559:T:CW298R1.000
12:49634560:G:CW298S1.000
12:49634561:G:CW298C1.000
12:49634561:G:TW298C1.000
12:49634568:G:CA301P1.000
12:49635123:A:CK320N1.000
12:49635123:A:TK320N1.000
12:49635124:A:GK321E1.000
12:49635126:G:CK321N1.000
12:49635126:G:TK321N1.000
12:49635130:G:CA323P1.000
12:49635133:T:CF324L1.000
12:49635135:C:AF324L1.000
12:49635135:C:GF324L1.000
12:49635147:G:CK328N1.000

dbSNP variants (sampled 300 via entrez): RS1000016086 (12:49637274 T>G), RS1000016184 (12:49634595 C>G,T), RS1000055983 (12:49641230 G>A), RS1000086690 (12:49641547 C>T), RS1000188901 (12:49627232 C>G), RS1000450063 (12:49640449 A>C,T), RS1000587176 (12:49620945 C>T), RS1000626197 (12:49626837 A>G), RS1000707385 (12:49629166 T>C), RS1001083128 (12:49628883 A>G), RS1001140810 (12:49622181 G>A), RS1001171856 (12:49622594 A>G), RS1001617506 (12:49637829 G>A), RS1001648496 (12:49638271 T>A,G), RS1002007240 (12:49640145 G>T)

Disease associations

OMIM: gene MIM:621019 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 associations (top):

StudyTraitp-value
GCST008595_117Cognitive ability, years of educational attainment or schizophrenia (pleiotropy)9.000000e-09

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0004337intelligence
EFO:0004784self reported educational attainment

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

28 total (human), top 28 by PubMed support.

ChemicalActions (top 5)PubMed papers
FR900359affects phosphorylation1
bisphenol Faffects cotreatment, increases expression1
TAK-243decreases sumoylation1
triphenyl phosphateaffects expression1
bisphenol Aincreases methylation1
tris(1,3-dichloro-2-propyl)phosphateincreases expression1
coumarinincreases phosphorylation1
entinostatincreases expression1
abrineincreases expression1
jinfukangaffects cotreatment, increases expression1
Resveratrolaffects cotreatment, decreases expression1
Sunitinibincreases expression1
Air Pollutantsdecreases expression, increases abundance1
Benzo(a)pyreneaffects methylation1
Caffeinedecreases phosphorylation1
Cisplatinincreases expression, affects cotreatment1
Dexamethasoneaffects cotreatment, increases expression1
Doxorubicinincreases expression1
Indomethacinaffects cotreatment, increases expression1
Phthalic Acidsincreases methylation1
Plant Extractsaffects cotreatment, decreases expression1
Tetrachlorodibenzodioxinaffects expression1
Tobacco Smoke Pollutiondecreases expression1
Valproic Acidaffects expression1
1-Methyl-3-isobutylxanthineaffects cotreatment, increases expression1
Cyclosporineincreases expression1
Acrylamidedecreases expression1
Particulate Matterdecreases expression, increases abundance1

Cellosaurus cell lines

3 cell lines: 3 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_E0M1Ubigene HeLa PRPF40B KOCancer cell lineFemale
CVCL_TH22HAP1 PRPF40B (-) 1Cancer cell lineMale
CVCL_XR93HAP1 PRPF40B (-) 2Cancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.