RPAP1

gene
On this page

Also known as DKFZP727M111KIAA1403MGC858FLJ12732

Summary

RPAP1 (RNA polymerase II associated protein 1, HGNC:24567) is a protein-coding gene on chromosome 15q15.1, encoding RNA polymerase II-associated protein 1 (Q9BWH6). Forms an interface between the RNA polymerase II enzyme and chaperone/scaffolding protein, suggesting that it is required to connect RNA polymerase II to regulators of protein complex formation. It is a common-essential gene (DepMap: required in 99.7% of cancer cell lines).

This protein forms part of the RNA polymerase II (RNAPII) enzyme complex and may recruit RNAPII to chromatin through its interaction with acetylated histones.

Source: NCBI Gene 26015 — RefSeq curated summary.

At a glance

  • GWAS associations: 9
  • Clinical variants (ClinVar): 242 total
  • Druggable target: yes
  • Cancer dependency (DepMap): dependent in 99.7% of screened cell lines (common-essential)
  • MANE Select transcript: NM_015540

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:24567
Approved symbolRPAP1
NameRNA polymerase II associated protein 1
Location15q15.1
Locus typegene with protein product
StatusApproved
AliasesDKFZP727M111, KIAA1403, MGC858, FLJ12732
Ensembl geneENSG00000103932
Ensembl biotypeprotein_coding
OMIM611475
Entrez26015

Gene structure

Transcript identifiers

Ensembl transcripts: 29 — 22 protein_coding, 5 retained_intron, 1 nonsense_mediated_decay, 1 protein_coding_CDS_not_defined

ENST00000304330, ENST00000561603, ENST00000561631, ENST00000562303, ENST00000563293, ENST00000564934, ENST00000565035, ENST00000565167, ENST00000566863, ENST00000567866, ENST00000568413, ENST00000569829, ENST00000867487, ENST00000867488, ENST00000867489, ENST00000867490, ENST00000867491, ENST00000867492, ENST00000867493, ENST00000867494, ENST00000867495, ENST00000912250, ENST00000912251, ENST00000912252, ENST00000912253, ENST00000912254, ENST00000912255, ENST00000912256, ENST00000970347

RefSeq mRNA: 1 — MANE Select: NM_015540 NM_015540

CCDS: CCDS10079

Canonical transcript exons

ENST00000304330 — 25 exons

ExonStartEnd
ENSE000006767004152377141523972
ENSE000006767114152742341527605
ENSE000006767144152786041528027
ENSE000006767244153612941536218
ENSE000006767264153694541537201
ENSE000008840624152276541522960
ENSE000009310394152499141525148
ENSE000011525424153471441534935
ENSE000011525484153650141536649
ENSE000012061254152716741527301
ENSE000012061764152689841527068
ENSE000026272284154421941544257
ENSE000034876234153102341531202
ENSE000035040964152039141521147
ENSE000035225404152823541528336
ENSE000035279304152173841521880
ENSE000035397574152324541523354
ENSE000035911354151779841517857
ENSE000036078514152947041529568
ENSE000036532944152986441529979
ENSE000036648684152409641524254
ENSE000036712074151717641517691
ENSE000036746254152209841522250
ENSE000036910754151800641518182
ENSE000037881994153551241535632

Expression profiles

Bgee: expression breadth ubiquitous, 194 present calls, max score 89.83.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 6.9423 / max 81.2498, expressed in 1741 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
1495286.94231741

Top tissues by expression

260 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
hindlimb stylopod muscleUBERON:000425289.83gold quality
gastrocnemiusUBERON:000138888.59gold quality
apex of heartUBERON:000209888.51gold quality
muscle of legUBERON:000138388.02gold quality
ventricular zoneUBERON:000305387.39gold quality
sural nerveUBERON:001548886.34gold quality
muscle organUBERON:000163086.30gold quality
skeletal muscle organUBERON:001489286.30gold quality
mucosa of transverse colonUBERON:000499186.25gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099186.20gold quality
ganglionic eminenceUBERON:000402386.06gold quality
lower esophagus mucosaUBERON:003583485.49gold quality
stromal cell of endometriumCL:000225585.05gold quality
muscle layer of sigmoid colonUBERON:003580584.51gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047384.46gold quality
cortical plateUBERON:000534384.39gold quality
right adrenal glandUBERON:000123384.13gold quality
lower esophagusUBERON:001347384.11gold quality
lower esophagus muscularis layerUBERON:003583384.10gold quality
heart left ventricleUBERON:000208483.99gold quality
right frontal lobeUBERON:000281083.96gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451183.94silver quality
prefrontal cortexUBERON:000045183.86gold quality
right adrenal gland cortexUBERON:003582783.81gold quality
body of stomachUBERON:000116183.75gold quality
skin of legUBERON:000151183.66gold quality
cardiac ventricleUBERON:000208283.61gold quality
esophagogastric junction muscularis propriaUBERON:003584183.53gold quality
esophagusUBERON:000104383.20gold quality
right ovaryUBERON:000211883.19gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes4.08

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): STAT3

miRNA regulators (miRDB)

14 targeting RPAP1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-117999.7168.701040
HSA-MIR-119799.7067.751027
HSA-MIR-488-3P99.6168.791731
HSA-MIR-1207-5P99.4969.112983
HSA-MIR-6722-3P99.4567.621919
HSA-MIR-1909-3P99.0366.561662
HSA-MIR-315498.9466.551455
HSA-MIR-887-5P98.8265.901347
HSA-MIR-126798.2469.05837
HSA-MIR-4768-3P98.1666.022330
HSA-MIR-4700-3P97.7468.641014
HSA-MIR-66597.6065.641781
HSA-MIR-1287-5P96.8065.30743
HSA-MIR-6828-3P96.0667.611155

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 99.7% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 2)

  • Data report the purification of RNA polymerase II-associated protein 1 (RPAP1), a 153-kDa polypeptide of unknown function. (PMID:15282305)
  • SII is important for preventing cellular death due to oxidative DNA damage, assisting RNAPII to bypass 8-oxoG (PMID:17374514)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriorpap1ENSDARG00000075072
mus_musculusRpap1ENSMUSG00000034032
rattus_norvegicusRpap1ENSRNOG00000005483
drosophila_melanogasterCG32104FBGN0052104

Protein

Protein identifiers

RNA polymerase II-associated protein 1Q9BWH6 (reviewed: Q9BWH6)

All UniProt accessions (5): Q9BWH6, H3BPM3, H3BPY8, H3BRE8, H3BTJ6

UniProt curated annotations — full annotation on UniProt →

Function. Forms an interface between the RNA polymerase II enzyme and chaperone/scaffolding protein, suggesting that it is required to connect RNA polymerase II to regulators of protein complex formation. Required for interaction of the RNA polymerase II complex with acetylated histone H3.

Subunit / interactions. Part of an RNA polymerase II complex that contains POLR2A, POLR2B, POLR2C, POLR2D, POLR2E, POLR2F, POLR2G, POLR2H, POLR2I, POLR2J, POLR2K, POLR2L, RPAP1, FCP1 plus the general transcription factors TFIIB and TFIIF.

Subcellular location. Nucleus.

Miscellaneous. May be produced at very low levels due to a premature stop codon in the mRNA, leading to nonsense-mediated mRNA decay.

Similarity. Belongs to the RPAP1 family.

Isoforms (3)

UniProt IDNamesCanonical?
Q9BWH6-11yes
Q9BWH6-22
Q9BWH6-33

RefSeq proteins (1): NP_056355* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR013929RPAP1_CDomain
IPR013930RPAP1_NDomain
IPR039913RPAP1/Rba50Family
IPR057989TPR_RPAP1/MINIYO-likeDomain

Pfam: PF08620, PF08621, PF25766

UniProt features (22 total): sequence variant 7, region of interest 4, modified residue 3, splice variant 3, compositionally biased region 3, chain 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9BWH6-F175.030.26

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (3): 1121, 72, 321

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 85 (showing top): MONNIER_POSTRADIATION_TUMOR_ESCAPE_UP, ICHIBA_GRAFT_VERSUS_HOST_DISEASE_35D_DN, GOCC_RNA_POLYMERASE_COMPLEX, MODULE_207, MILI_PSEUDOPODIA_CHEMOTAXIS_DN, GOCC_TRANSFERASE_COMPLEX_TRANSFERRING_PHOSPHORUS_CONTAINING_GROUPS, NUYTTEN_EZH2_TARGETS_DN, GOCC_TRANSFERASE_COMPLEX, SCGGAAGY_ELK1_02, GAZDA_DIAMOND_BLACKFAN_ANEMIA_ERYTHROID_DN, GOMF_TRANSFERASE_ACTIVITY_TRANSFERRING_PHOSPHORUS_CONTAINING_GROUPS, PASINI_SUZ12_TARGETS_UP, MTF1_Q4, MTOR_UP.N4.V1_UP, CIITA_TARGET_GENES

GO Biological Process (1): transcription by RNA polymerase II (GO:0006366)

GO Molecular Function (4): DNA binding (GO:0003677), nucleotidyltransferase activity (GO:0016779), protein binding (GO:0005515), transferase activity (GO:0016740)

GO Cellular Component (2): DNA-directed RNA polymerase complex (GO:0000428), nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
DNA-templated transcription1
nucleic acid binding1
transferase activity, transferring phosphorus-containing groups1
binding1
catalytic activity1
RNA polymerase complex1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

646 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
RPAP1GPN1Q9HCN4968
RPAP1RPAP2Q8IXW5932
RPAP1RPAP3Q9H6T3930
RPAP1POLR2JP52435865
RPAP1RUVBL1P82276852
RPAP1INTS3Q68E01810
RPAP1RUVBL2Q9Y230802
RPAP1PFDN6O15212776
RPAP1MED17Q9NVC6766
RPAP1GPN2Q9H9Y4724
RPAP1GPN3Q9UHW5691
RPAP1POLR2CP19387558
RPAP1SAXO2Q658L1475
RPAP1RNF41Q9H4P4454
RPAP1NDUFAF1Q9Y375453

IntAct

75 interactions, top by confidence:

ABTypeScore
POLR2JPOLR1Cpsi-mi:“MI:0914”(association)0.830
RPAP1POLR2Lpsi-mi:“MI:0915”(physical association)0.740
POLR2LRPAP1psi-mi:“MI:0915”(physical association)0.740
POLR2CPOLR2Dpsi-mi:“MI:0914”(association)0.730
POLR2JPOLR2Dpsi-mi:“MI:0914”(association)0.730
POLR2LRCCD1psi-mi:“MI:0914”(association)0.640
GPN2POLR2Bpsi-mi:“MI:0914”(association)0.640
POLR2LPOLR3Apsi-mi:“MI:0914”(association)0.640
RPAP1psi-mi:“MI:0915”(physical association)0.550
C1orf94USO1psi-mi:“MI:0914”(association)0.550
POLR2JSUPT5Hpsi-mi:“MI:0914”(association)0.530
POLR1CPOLR3Apsi-mi:“MI:0914”(association)0.530
GPN1POLR3Apsi-mi:“MI:0914”(association)0.530
POLR2MBIN1psi-mi:“MI:0914”(association)0.530
GPN2POLR2Cpsi-mi:“MI:0914”(association)0.530
POLR2JMED14psi-mi:“MI:0914”(association)0.530
POT1RPAP1psi-mi:“MI:0915”(physical association)0.510
RPAP1TERF2IPpsi-mi:“MI:0915”(physical association)0.370
POLKTIA1psi-mi:“MI:0914”(association)0.350
Xpo1IFT56psi-mi:“MI:0914”(association)0.350
PB2ESYT2psi-mi:“MI:0914”(association)0.350
PB2IPO5psi-mi:“MI:0914”(association)0.350
HSPA8PPP6Cpsi-mi:“MI:0914”(association)0.350
C9orf72CHD2psi-mi:“MI:0914”(association)0.350
Prdm16ESYT2psi-mi:“MI:0914”(association)0.350

BioGRID (138): RPAP1 (Two-hybrid), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS), RPAP1 (Co-fractionation), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS), RPAP1 (Affinity Capture-MS)

ESM2 similar proteins: A0A8I6ASZ5, A0JN53, A4IG66, D3Z8X7, D3ZND0, G3X992, O00750, O08836, O70576, P0DKR2, Q15021, Q1JQC5, Q1L5Z9, Q1LWH4, Q1LXZ7, Q2YD98, Q3T1I9, Q3TV65, Q3UJU9, Q4R5Q4, Q5EAU9, Q5JTW2, Q5R6Z1, Q5TC12, Q61249, Q66H15, Q6NY52, Q6P5E6, Q6PBQ2, Q6PI26, Q80TE0, Q80V31, Q80XC6, Q8BIW9, Q8BM55, Q8K2Z4, Q8R3L2, Q8VDP4, Q8WVB6, Q92574

Diamond homologs: A0JN53, O43088, Q04418, Q3T1I9, Q80TE0, Q9BWH6

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 94 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
FGFR2 mutant receptor activation674.9×6e-09
Signaling by FGFR2 IIIa TM659.1×9e-09
Abortive elongation of HIV-1 transcript in the absence of Tat757.0×6e-09
RNA Polymerase III Chain Elongation552.0×3e-07
FGFR2 alternative splicing748.5×6e-09
RNA Pol II CTD phosphorylation and interaction with CE during HIV infection746.8×6e-09
Signaling by FGFR2746.8×6e-09
RNA Pol II CTD phosphorylation and interaction with CE746.8×6e-09

Disease & clinical

Clinical variants and AI predictions

ClinVar

242 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance199
Likely benign13
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

4100 predictions. Top by Δscore:

VariantEffectΔscore
15:41518005:CTGAG:Cdonor_gain1.0000
15:41518183:C:CCacceptor_gain1.0000
15:41521736:A:ACdonor_gain1.0000
15:41521737:C:CCdonor_gain1.0000
15:41523350:CTTGG:Cacceptor_gain1.0000
15:41523351:TTGG:Tacceptor_gain1.0000
15:41523355:C:CCacceptor_gain1.0000
15:41524985:CCTCA:Cdonor_loss1.0000
15:41524986:CTCAC:Cdonor_loss1.0000
15:41524987:TCA:Tdonor_loss1.0000
15:41524990:C:Gdonor_loss1.0000
15:41525144:CTCAA:Cacceptor_gain1.0000
15:41525146:CAA:Cacceptor_gain1.0000
15:41525149:C:CCacceptor_gain1.0000
15:41527153:C:CAdonor_gain1.0000
15:41527161:TCTCA:Tdonor_loss1.0000
15:41527162:CTCAC:Cdonor_loss1.0000
15:41527163:TCA:Tdonor_loss1.0000
15:41527164:CACC:Cdonor_loss1.0000
15:41527165:A:ACdonor_gain1.0000
15:41527165:AC:Adonor_gain1.0000
15:41527165:ACC:Adonor_gain1.0000
15:41527165:ACCC:Adonor_loss1.0000
15:41527166:C:CCdonor_gain1.0000
15:41527166:C:CGdonor_loss1.0000
15:41527166:CC:Cdonor_gain1.0000
15:41527166:CCC:Cdonor_gain1.0000
15:41527166:CCCT:Cdonor_gain1.0000
15:41527297:AGCCC:Aacceptor_gain1.0000
15:41527298:GCCC:Gacceptor_gain1.0000

AlphaMissense

8871 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
15:41529552:A:GF359S0.997
15:41529551:G:CF359L0.994
15:41529551:G:TF359L0.994
15:41529553:A:GF359L0.994
15:41528302:A:GL398P0.991
15:41528292:G:CS401R0.990
15:41528292:G:TS401R0.990
15:41528294:T:GS401R0.990
15:41528311:A:GL395P0.986
15:41529552:A:CF359C0.986
15:41528329:C:TG389E0.985
15:41528327:A:CY390D0.984
15:41525034:A:GW678R0.983
15:41525034:A:TW678R0.983
15:41528327:A:GY390H0.983
15:41529493:G:CH379D0.983
15:41528326:T:GY390S0.982
15:41529495:A:GL378P0.982
15:41528266:G:TA410E0.981
15:41529908:A:GW339R0.981
15:41529908:A:TW339R0.981
15:41528277:C:AQ406H0.980
15:41528277:C:GQ406H0.980
15:41529906:C:AW339C0.980
15:41529906:C:GW339C0.980
15:41528330:C:AG389W0.979
15:41520783:A:GW1135R0.978
15:41520783:A:TW1135R0.978
15:41528314:T:AE394V0.977
15:41528274:T:AR407S0.976

dbSNP variants (sampled 300 via entrez): RS1000107201 (15:41533039 A>G), RS1000413624 (15:41533334 G>C), RS1000468839 (15:41520406 G>A,T), RS1000573540 (15:41520367 G>A), RS1000665908 (15:41525977 T>C), RS1000890547 (15:41529266 T>G), RS1000975269 (15:41533197 C>T), RS1001019947 (15:41545622 A>T), RS1001022257 (15:41546138 C>T), RS1001274490 (15:41539773 C>T), RS1001340927 (15:41523029 G>A,C), RS1001669988 (15:41535429 G>T), RS1001696757 (15:41543255 G>C), RS1001941866 (15:41529146 C>T), RS1002144972 (15:41518784 A>G)

Disease associations

OMIM: gene MIM:611475 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

9 associations (top):

StudyTraitp-value
GCST006666_16Lipid traits (pleiotropy) (HIPO component 1)4.000000e-09
GCST007430_126Peak expiratory flow3.000000e-07
GCST007431_120Lung function (FEV1/FVC)4.000000e-23
GCST007923_11Medication use (drugs used in diabetes)1.000000e-08
GCST012495_5Lung function (FEV1/FVC)2.000000e-13
GCST90020024_479A body shape index2.000000e-08
GCST90020025_444Waist-to-hip ratio adjusted for BMI4.000000e-11
GCST90020027_622Waist-hip index6.000000e-11
GCST90020029_285Waist circumference adjusted for body mass index4.000000e-08

EFO canonical traits (9, from GWAS)

EFO IDTrait name
EFO:0004530triglyceride measurement
EFO:0004574total cholesterol measurement
EFO:0004611low density lipoprotein cholesterol measurement
EFO:0004612high density lipoprotein cholesterol measurement
EFO:0009718peak expiratory flow
EFO:0004713FEV/FVC ratio
EFO:0009924Drugs used in diabetes use measurement
EFO:0007789BMI-adjusted waist circumference
EFO:0007788BMI-adjusted waist-hip ratio

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL6067272 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

2 potent at pChembl≥5 of 2 total, top 2 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
8.19Kd6.447nMCHEMBL5653589
8.19ED506.447nMCHEMBL5653589

PubChem BioAssay actives

1 with measured affinity, of 2 total; 1 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
4-methyl-3-[(2-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2149238: Binding affinity to human RPAP1 incubated for 45 mins by Kinobead based pull down assaykd0.0064uM

CTD chemical–gene interactions

30 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
methacrylaldehydeaffects cotreatment, affects expression, increases expression, increases abundance2
Acroleinincreases abundance, affects cotreatment, affects expression, increases expression2
Ozoneincreases expression, increases abundance, affects cotreatment, affects expression2
aristolochic acid Iincreases expression1
FR900359decreases phosphorylation1
dicrotophosincreases expression1
triphenyl phosphateaffects expression1
alpha-pineneaffects cotreatment, affects expression, increases abundance1
sodium arseniteincreases expression1
aflatoxin B2decreases methylation1
coumarinincreases phosphorylation1
epigallocatechin gallatedecreases expression1
di-n-butylphosphoric acidaffects expression1
CGP 52608affects binding, increases reaction1
Air Pollutantsaffects cotreatment, affects expression, increases abundance1
Atrazinedecreases expression1
Benzo(a)pyreneaffects methylation1
Cisplatindecreases expression1
Ivermectindecreases expression1
Potassium Dichromateincreases expression1
Quercetinincreases expression1
Rotenonedecreases expression1
Smokedecreases expression1
T-2 Toxinincreases expression1
Testosteroneincreases expression1
Tretinoindecreases expression1
Valproic Acidincreases methylation1
Aflatoxin B1decreases methylation1
Copper Sulfatedecreases expression1
Volatile Organic Compoundsaffects cotreatment, affects expression1

ChEMBL screening assays

1 unique, capped per target: 1 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5652280BindingBinding affinity to human RPAP1 incubated for 45 mins by Kinobead based pull down assayNVP-BHG712: Effects of Regioisomers on the Affinity and Selectivity toward the EPHrin Family. — ChemMedChem

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.