Section 1: Gene Identifiers
Primary Gene Identifiers
| Database | Identifier | Details |
|---|
| HGNC ID | HGNC:270 | Approved symbol: PARP1 |
| HGNC Name | poly(ADP-ribose) polymerase 1 | Status: Approved |
| Ensembl Gene | ENSG00000143799 | GRCh38 |
| NCBI Entrez | 142 | Gene ID |
| OMIM | 173870 | Gene/Locus |
Genomic Location (GRCh38/hg38)
| Attribute | Value |
|---|
| Chromosome | 1 |
| Cytogenetic Band | 1q42.12 |
| Start Position | 226,360,210 |
| End Position | 226,408,154 |
| Strand | Minus (-) |
| Genome Assembly | GRCh38 |
Gene Symbols and Aliases
| Type | Values |
|---|
| Approved Symbol | PARP1 |
| Previous Symbols | PPOL, ADPRT |
| Aliases | PARP, ARTD1, Poly-PARP, PARS, ADPRT1, pADPRT-1 |
| Locus Type | Gene with protein product |
| Gene Groups | Zinc fingers PARP-type; Poly(ADP-ribose) polymerases |
Section 2: Transcript Identifiers
Ensembl Transcripts
Total Transcript Count: 37
| Transcript ID | Biotype | Start | End | Notes |
|---|
| ENST00000366794 | protein_coding | 226,360,691 | 226,408,093 | Canonical |
| ENST00000677203 | protein_coding | 226,360,697 | 226,408,100 | |
| ENST00000874606 | protein_coding | 226,360,693 | 226,408,090 | |
| ENST00000874607 | protein_coding | 226,361,005 | 226,408,093 | |
| ENST00000874608 | protein_coding | 226,361,003 | 226,408,060 | |
| ENST00000874609 | protein_coding | 226,361,003 | 226,408,055 | |
| ENST00000922077 | protein_coding | 226,360,693 | 226,408,116 | |
| ENST00000922078 | protein_coding | 226,360,691 | 226,408,100 | |
| ENST00000922079 | protein_coding | 226,361,003 | 226,408,096 | |
| ENST00000922080 | protein_coding | 226,361,003 | 226,408,090 | |
| ENST00000922081 | protein_coding | 226,361,004 | 226,408,064 | |
| ENST00000963780 | protein_coding | 226,360,693 | 226,408,120 | |
| ENST00000963781 | protein_coding | 226,361,006 | 226,408,064 | |
| ENST00000366792 | protein_coding_CDS_not_defined | 226,392,258 | 226,408,075 | |
| ENST00000469663 | protein_coding_CDS_not_defined | 226,388,698 | 226,392,792 | |
| ENST00000676481 | protein_coding_CDS_not_defined | 226,360,579 | 226,364,336 | |
| ENST00000677189 | protein_coding_CDS_not_defined | 226,360,210 | 226,372,887 | |
| ENST00000677985 | protein_coding_CDS_not_defined | 226,360,210 | 226,372,915 | |
| ENST00000678226 | protein_coding_CDS_not_defined | 226,360,210 | 226,381,883 | |
| ENST00000677091 | nonsense_mediated_decay | 226,360,700 | 226,408,100 | |
| ENST00000678144 | nonsense_mediated_decay | 226,360,700 | 226,408,096 | |
| ENST00000678560 | nonsense_mediated_decay | 226,360,697 | 226,408,085 | |
| ENST00000366790 | retained_intron | 226,399,195 | 226,407,982 | |
| ENST00000463968 | retained_intron | 226,361,319 | 226,364,372 | |
| ENST00000468608 | retained_intron | 226,361,708 | 226,363,160 | |
| ENST00000490921 | retained_intron | 226,361,295 | 226,371,115 | |
| ENST00000491816 | retained_intron | 226,361,410 | 226,362,252 | |
| ENST00000498787 | retained_intron | 226,360,748 | 226,372,746 | |
| ENST00000676565 | retained_intron | 226,360,700 | 226,376,067 | |
| ENST00000676685 | retained_intron | 226,360,729 | 226,408,154 | |
| ENST00000676709 | retained_intron | 226,360,748 | 226,408,154 | |
| ENST00000677374 | retained_intron | 226,362,218 | 226,386,797 | |
| ENST00000677815 | retained_intron | 226,362,386 | 226,366,954 | |
| ENST00000677884 | retained_intron | 226,360,697 | 226,408,096 | |
| ENST00000678288 | retained_intron | 226,360,210 | 226,374,420 | |
| ENST00000678781 | retained_intron | 226,360,727 | 226,408,154 | |
| ENST00000679276 | retained_intron | 226,360,743 | 226,408,154 | |
RefSeq Transcripts (Human)
| RefSeq ID | Type | Status | MANE Select |
|---|
| NM_001618 | mRNA | REVIEWED | ✓ Yes |
| NM_179834 | mRNA | REVIEWED | No |
| NP_001609 | protein | REVIEWED | ✓ Yes |
| NP_850165 | protein | REVIEWED | No |
CCDS Identifier
Exons for Canonical Transcript (ENST00000366794)
Total Exon Count: 23
| Exon ID | Start | End | Strand |
|---|
| ENSE00001697768 | 226,407,810 | 226,408,093 | - |
| ENSE00000962007 | 226,402,214 | 226,402,379 | - |
| ENSE00003548024 | 226,392,199 | 226,392,314 | - |
| ENSE00003550925 | 226,390,410 | 226,390,624 | - |
| ENSE00000962010 | 226,388,656 | 226,388,755 | - |
| ENSE00000962011 | 226,386,326 | 226,386,442 | - |
| ENSE00000962012 | 226,385,504 | 226,385,680 | - |
| ENSE00000962013 | 226,383,036 | 226,383,183 | - |
| ENSE00000962014 | 226,381,068 | 226,381,208 | - |
| ENSE00000962015 | 226,379,922 | 226,380,164 | - |
| ENSE00000962016 | 226,379,573 | 226,379,641 | - |
| ENSE00000962017 | 226,379,142 | 226,379,274 | - |
| ENSE00000962018 | 226,377,108 | 226,377,303 | - |
| ENSE00000962019 | 226,374,226 | 226,374,354 | - |
| ENSE00003634072 | 226,368,199 | 226,368,321 | - |
| ENSE00003576366 | 226,367,480 | 226,367,608 | - |
| ENSE00003492408 | 226,365,954 | 226,366,052 | - |
| ENSE00003498847 | 226,365,002 | 226,365,154 | - |
| ENSE00003595606 | 226,363,943 | 226,364,070 | - |
| ENSE00003642486 | 226,363,099 | 226,363,160 | - |
| ENSE00000962020 | 226,370,434 | 226,370,517 | - |
| ENSE00003476295 | 226,361,969 | 226,362,083 | - |
| ENSE00001442633 | 226,360,691 | 226,361,541 | - |
Section 3: Protein Identifiers
UniProt Accessions
| UniProt ID | Status | Name |
|---|
| P09874 | ✓ Reviewed (Swiss-Prot) | Poly [ADP-ribose] polymerase 1 |
| A0A7I2V384 | Unreviewed (TrEMBL) | - |
| A0A7I2V3E1 | Unreviewed (TrEMBL) | - |
| A0A7I2V5E9 | Unreviewed (TrEMBL) | - |
| A0A7I2V625 | Unreviewed (TrEMBL) | - |
Canonical Protein Properties (P09874)
| Property | Value |
|---|
| Length | 1,014 amino acids |
| Mass | 113,084 Da |
| Alternative Names | ADP-ribosyltransferase diphtheria toxin-like 1; DNA ADP-ribosyltransferase PARP1; NAD(+) ADP-ribosyltransferase 1; Poly[ADP-ribose] synthase 1 |
RefSeq Proteins (Human)
| RefSeq ID | Status | MANE Select |
|---|
| NP_001609 | REVIEWED | ✓ Yes |
| NP_850165 | REVIEWED | No |
Protein Domains and Families (InterPro)
Total Domain Count: 14
| InterPro ID | Name | Type |
|---|
| IPR001357 | BRCT_dom | Domain |
| IPR001510 | Znf_PARP | Domain |
| IPR004102 | Poly(ADP-ribose)pol_reg_dom | Domain |
| IPR008893 | WGR_domain | Domain |
| IPR012317 | Poly(ADP-ribose)pol_cat_dom | Domain |
| IPR012982 | PARP1-like_PADR1_Zn_ribbon | Domain |
| IPR049296 | PARP1-like_PADR1_N | Domain |
| IPR008288 | PARP | Family |
| IPR050800 | ARTD/PARP | Family |
| IPR036420 | BRCT_dom_sf | Homologous_superfamily |
| IPR036616 | Poly(ADP-ribose)pol_reg_dom_sf | Homologous_superfamily |
| IPR036930 | WGR_dom_sf | Homologous_superfamily |
| IPR036957 | Znf_PARP_sf | Homologous_superfamily |
| IPR038650 | PADR1_C_dom_sf | Homologous_superfamily |
Section 4: Structure Identifiers
Experimental Structures
Total PDB Structure Count: 155
| PDB ID | Method | Resolution (Å) | Description |
|---|
| 6NRH | X-RAY | 1.50 | ART domain bound to inhibitor UTT63 |
| 9ETQ | X-RAY | 1.59 | Catalytic domain bound to AZD5305 (SARUPARIB) |
| 7AAC | X-RAY | 1.59 | Catalytic domain with veliparib |
| 6NRJ | X-RAY | 1.65 | ART domain bound to inhibitor UTT93 |
| 7KK5 | X-RAY | 1.70 | Catalytic domain with niraparib |
| 6NRG | X-RAY | 1.70 | ART domain bound to inhibitor UTT57 |
| 2RIQ | X-RAY | 1.70 | Third Zinc-binding domain |
| 7AAA | X-RAY | 1.74 | Catalytic domain (apo) |
| 7KK2 | X-RAY | 1.695 | Catalytic domain |
| 6NTU | X-RAY | 1.80 | ART domain bound to inhibitor UKTT-15 |
| 7ONT | X-RAY | 1.85 | Selective pyridine carboxamide inhibitor |
| 5WS1 | X-RAY | 1.90 | Benzoimidazole inhibitor |
| 4ZZZ | X-RAY | 1.90 | Isoindolinone inhibitor |
| 7KK4 | X-RAY | 1.96 | Catalytic domain with olaparib |
| 7ONS | X-RAY | 1.97 | Isoquinolone inhibitor |
| 6M3I | X-RAY | 1.98 | HPF1/PARP1 complex |
| 6NRF | X-RAY | 2.00 | ART domain with inhibitor UTT103 |
| 6XVW | X-RAY | 2.00 | Catalytic domain with MC2050 |
| 7KK3 | X-RAY | 2.06 | Catalytic domain with talazoparib |
| 7KK6 | X-RAY | 2.06 | Catalytic domain with veliparib |
| 7ONR | X-RAY | 2.05 | 8-chloroquinazolinone inhibitor |
| 8VYH | X-RAY | 2.05 | Compound complex |
| 6VKK | X-RAY | 2.10 | Catalytic domain with rucaparib |
| 8HE7 | X-RAY | 2.10 | Quinazoline-2,4(1H,3H)-dione inhibitor |
| 4UND | X-RAY | 2.20 | Catalytic domain with talazoparib |
| 4R6E | X-RAY | 2.20 | Catalytic domain with niraparib |
| 7AAD | X-RAY | 2.21 | Catalytic domain with olaparib |
| 6GHK | X-RAY | 2.28 | Catalytic domain with ME0527 |
| 2RD6 | X-RAY | 2.30 | Catalytic domain with inhibitor |
| 6BHV | X-RAY | 2.30 | NAD+ analog BAD complex |
| 4PJT | X-RAY | 2.35 | Catalytic domain with BMN 673 |
| 5DS3 | X-RAY | 2.60 | Constitutively active PARP-1 |
| 7CMW | X-RAY | 2.70 | Catalytic domain with pamiparib |
| 8HLR | X-RAY | 2.85 | Catalytic domain with Fluzoparib |
| 2COK | NMR | - | BRCT domain solution structure |
| 2CR9 | NMR | - | WGR domain solution structure |
| 2CS2 | NMR | - | Second Zn-finger domain |
| 2DMJ | NMR | - | First zf-PARP domain |
| 2JVN | NMR | - | Domain C |
| 2L30 | NMR | - | Zinc finger 1 |
| 2L31 | NMR | - | Zinc finger 2 |
| 7SCY | CRYO-EM | 4.10 | Nuc147 bound to single BRCT |
| 7SCZ | CRYO-EM | 3.50 | Nuc147 bound to multiple BRCTs |
| 9MJA | CRYO-EM | 4.30 | PARP1 ART with HPF1 and EB47 |
... and 112 additional structures
Predicted Structures
| Source | ID | pLDDT Score | Sequence Length | Fraction Very High Confidence |
|---|
| AlphaFold | P09874 | 83.23 | 7944 | 0.51 |
Section 5: Cross-Species Orthologs
| Organism | Gene ID | Symbol | Biotype |
|---|
| Mouse (Mus musculus) | ENSMUSG00000026496 | Parp1 | protein_coding |
| Rat (Rattus norvegicus) | ENSRNOG00000003084 | Parp1 | protein_coding |
| Zebrafish (Danio rerio) | ENSDARG00000019529 | parp1 | protein_coding |
| Fruit fly (Drosophila melanogaster) | FBGN0010247 | Parp1 | protein_coding |
| Worm (C. elegans) | WBGENE00004049 | parp-1 | protein_coding |
| Yeast (S. cerevisiae) | No ortholog found | - | - |
Section 6: Clinical Variants & AI Predictions
ClinVar Clinical Variants
Total Variant Count: 115
Variant Classification Summary
| Classification | Count |
|---|
| Pathogenic | 0 |
| Likely Pathogenic | 0 |
| Uncertain Significance (VUS) | ~85 |
| Likely Benign | ~15 |
| Benign | ~15 |
Representative ClinVar Variants
| ClinVar ID | HGVS Notation | Classification | Review Status |
|---|
| 1261945 | c.2285T>C (p.Val762Ala) | Benign | Multiple submitters |
| 715247 | c.942C>T (p.Asp314=) | Benign | Multiple submitters |
| 779066 | c.1129C>T (p.Pro377Ser) | Benign/Likely benign | Multiple submitters |
| 780306 | c.1245C>T (p.Leu415=) | Benign | Multiple submitters |
| 711740 | c.1914C>T (p.Phe638=) | Benign | Single submitter |
| 775674 | c.562G>A (p.Ala188Thr) | Benign | Single submitter |
| 725933 | c.339T>C (p.Phe113=) | Benign | Single submitter |
| 3041509 | c.2819A>G (p.Lys940Arg) | Benign | No criteria |
| 2328794 | c.886G>A (p.Glu296Lys) | Likely benign | Single submitter |
| 737642 | c.1000G>A (p.Val334Ile) | Likely benign | Single submitter |
| 722614 | c.1146C>G (p.Ser382=) | Likely benign | Single submitter |
| 725858 | c.2250G>A (p.Pro750=) | Likely benign | Single submitter |
| 761542 | c.121-4C>G | Likely benign | Single submitter |
| 2205461 | c.1840T>C (p.Phe614Leu) | Uncertain significance | Single submitter |
| 2205462 | c.1842C>A (p.Phe614Leu) | Uncertain significance | Single submitter |
| 2216601 | c.482G>T (p.Gly161Val) | Uncertain significance | Single submitter |
| 2254608 | c.159C>A (p.His53Gln) | Uncertain significance | Single submitter |
| 2263329 | c.472T>C (p.Tyr158His) | Uncertain significance | Single submitter |
| 2263390 | c.1190C>T (p.Thr397Ile) | Uncertain significance | Single submitter |
| 2270980 | c.1231A>G (p.Met411Val) | Uncertain significance | Single submitter |
AI-Based Variant Effect Predictions
AlphaMissense Predictions
Total Predictions: 6,722
TOP 50 Predicted Pathogenic Missense Variants:
| Variant ID | Protein Variant | AM Score | Classification |
|---|
| 1:226361465:A:G | W1014R | 0.984 | likely_pathogenic |
| 1:226361465:A:T | W1014R | 0.984 | likely_pathogenic |
| 1:226361478:A:C | F1009L | 0.983 | likely_pathogenic |
| 1:226361478:A:T | F1009L | 0.983 | likely_pathogenic |
| 1:226361486:A:G | F1007L | 0.998 | likely_pathogenic |
| 1:226361484:G:C | F1007L | 0.998 | likely_pathogenic |
| 1:226361484:G:T | F1007L | 0.998 | likely_pathogenic |
| 1:226361485:A:G | F1007S | 0.997 | likely_pathogenic |
| 1:226361504:A:C | Y1001D | 0.997 | likely_pathogenic |
| 1:226361497:A:G | L1003P | 0.995 | likely_pathogenic |
| 1:226361500:A:C | L1002R | 0.995 | likely_pathogenic |
| 1:226361500:A:G | L1002P | 0.999 | likely_pathogenic |
| 1:226361500:A:T | L1002Q | 0.986 | likely_pathogenic |
| 1:226361503:T:G | Y1001S | 0.990 | likely_pathogenic |
| 1:226361497:A:C | L1003R | 0.987 | likely_pathogenic |
| 1:226361491:A:G | L1005P | 0.996 | likely_pathogenic |
| 1:226361491:A:C | L1005R | 0.982 | likely_pathogenic |
| 1:226361486:A:C | F1007V | 0.981 | likely_pathogenic |
| 1:226361486:A:T | F1007I | 0.984 | likely_pathogenic |
| 1:226361485:A:C | F1007C | 0.978 | likely_pathogenic |
| 1:226361504:A:T | Y1001N | 0.975 | likely_pathogenic |
| 1:226361497:A:T | L1003Q | 0.970 | likely_pathogenic |
| 1:226361504:A:G | Y1001H | 0.962 | likely_pathogenic |
| 1:226361491:A:T | L1005Q | 0.949 | likely_pathogenic |
| 1:226361503:T:C | Y1001C | 0.947 | likely_pathogenic |
| 1:226361479:A:G | F1009S | 0.936 | likely_pathogenic |
| 1:226361463:C:A | W1014C | 0.933 | likely_pathogenic |
| 1:226361463:C:G | W1014C | 0.933 | likely_pathogenic |
| 1:226361493:T:A | K1004N | 0.932 | likely_pathogenic |
| 1:226361493:T:G | K1004N | 0.932 | likely_pathogenic |
| 1:226361509:A:C | L999R | 0.919 | likely_pathogenic |
| 1:226361505:C:A | K1000N | 0.912 | likely_pathogenic |
| 1:226361505:C:G | K1000N | 0.912 | likely_pathogenic |
| 1:226361509:A:G | L999P | 0.905 | likely_pathogenic |
| 1:226361464:C:G | W1014S | 0.856 | likely_pathogenic |
| 1:226361479:A:C | F1009C | 0.852 | likely_pathogenic |
| 1:226361480:A:T | F1009I | 0.844 | likely_pathogenic |
| 1:226361480:A:C | F1009V | 0.835 | likely_pathogenic |
| 1:226361494:T:A | K1004I | 0.825 | likely_pathogenic |
| 1:226361495:T:C | K1004E | 0.758 | likely_pathogenic |
| 1:226361485:A:T | F1007Y | 0.730 | likely_pathogenic |
| 1:226361494:T:G | K1004T | 0.729 | likely_pathogenic |
| 1:226361506:T:A | K1000M | 0.720 | likely_pathogenic |
| 1:226361465:A:C | W1014G | 0.718 | likely_pathogenic |
| 1:226361475:C:A | K1010N | 0.693 | likely_pathogenic |
| 1:226361475:C:G | K1010N | 0.693 | likely_pathogenic |
| 1:226361464:C:A | W1014L | 0.688 | likely_pathogenic |
| 1:226361467:A:G | L1013P | 0.669 | likely_pathogenic |
| 1:226361507:T:C | K1000E | 0.667 | likely_pathogenic |
| 1:226361467:A:C | L1013R | 0.655 | likely_pathogenic |
SpliceAI Predictions
Total Splice Predictions: 3,166
TOP 50 Predicted Splice-Altering Variants:
| Variant ID | Effect | Delta Score |
|---|
| 1:226362082:ACCT:A | acceptor_loss | 0.98 |
| 1:226362083:CCTGA:C | acceptor_loss | 0.98 |
| 1:226362084:CTGA:C | acceptor_loss | 0.98 |
| 1:226362085:T:A | acceptor_loss | 0.98 |
| 1:226362086:G:C | acceptor_loss | 0.94 |
| 1:226362960:ACACC:A | donor_gain | 0.94 |
| 1:226362961:CACCC:C | donor_gain | 0.94 |
| 1:226362962:ACC:A | donor_gain | 0.94 |
| 1:226362079:CAAAC:C | acceptor_gain | 0.90 |
| 1:226362087:AAAAA:A | acceptor_loss | 0.89 |
| 1:226362084:C:CC | acceptor_gain | 0.87 |
| 1:226362091:A:C | acceptor_loss | 0.42 |
| 1:226362080:AAAC:A | acceptor_gain | 0.61 |
| 1:226362089:A:C | acceptor_loss | 0.63 |
| 1:226362088:A:C | acceptor_loss | 0.59 |
| 1:226362955:CACA:C | donor_loss | 0.51 |
| 1:226362956:ACAC:A | donor_loss | 0.57 |
| 1:226362958:AC:A | donor_loss | 0.57 |
| 1:226362959:CACA:C | donor_loss | 0.57 |
| 1:226362963:C:CG | donor_loss | 0.57 |
| 1:226362962:AC:A | donor_gain | 0.55 |
| 1:226362081:AAC:A | acceptor_gain | 0.50 |
| 1:226362961:CACC:C | donor_gain | 0.49 |
| 1:226362960:ACAC:A | donor_gain | 0.49 |
| 1:226362097:G:C | acceptor_loss | 0.44 |
| 1:226361804:AGGCT:A | acceptor_gain | 0.42 |
| 1:226361805:GGCT:G | acceptor_gain | 0.42 |
| 1:226362055:TG:T | acceptor_loss | 0.42 |
| 1:226362954:G:T | donor_gain | 0.42 |
| 1:226362960:A:T | donor_gain | 0.42 |
| 1:226361967:A:AC | donor_gain | 0.40 |
| 1:226361968:C:CC | donor_gain | 0.40 |
| 1:226362956:A:AC | donor_gain | 0.40 |
| 1:226362957:C:CC | donor_gain | 0.40 |
| 1:226362608:TCA:T | donor_gain | 0.39 |
| 1:226361823:ACC:A | acceptor_gain | 0.38 |
| 1:226361824:C:A | acceptor_gain | 0.38 |
| 1:226362961:C:T | donor_gain | 0.38 |
| 1:226362962:A:AC | donor_gain | 0.37 |
| 1:226362963:C:CC | donor_gain | 0.37 |
| 1:226362092:C:CT | acceptor_gain | 0.36 |
| 1:226362954:GCACA:G | donor_loss | 0.36 |
| 1:226362082:AC:A | acceptor_gain | 0.35 |
| 1:226362083:CC:C | acceptor_gain | 0.35 |
| 1:226362040:C:CT | acceptor_loss | 0.32 |
| 1:226362036:C:CT | acceptor_loss | 0.33 |
| 1:226361822:GACC:G | acceptor_gain | 0.33 |
| 1:226361961:968:CT:C | donor_gain | 0.33 |
| 1:226362198:T:TA | donor_gain | 0.33 |
| 1:226361806:GCTAT:G | acceptor_gain | 0.33 |
Section 7: Biological Pathways & Gene Ontology
Reactome Pathways
Total Pathway Count: 8
| Pathway ID | Name | Disease Pathway |
|---|
| R-HSA-110362 | POLB-Dependent Long Patch Base Excision Repair | No |
| R-HSA-192814 | vRNA Synthesis | Yes |
| R-HSA-2173795 | Downregulation of SMAD2/3:SMAD4 transcriptional activity | No |
| R-HSA-3108214 | SUMOylation of DNA damage response and repair proteins | No |
| R-HSA-5685939 | HDR through MMEJ (alt-NHEJ) | No |
| R-HSA-5696394 | DNA Damage Recognition in GG-NER | No |
| R-HSA-5696395 | Formation of Incision Complex in GG-NER | No |
| R-HSA-5696400 | Dual Incision in GG-NER | No |
Gene Ontology Annotations
Total GO Terms: 102
Biological Process (TOP 20)
| GO ID | Term |
|---|
| GO:0006281 | DNA repair |
| GO:0006302 | double-strand break repair |
| GO:0006974 | DNA damage response |
| GO:0070212 | protein poly-ADP-ribosylation |
| GO:0070213 | protein auto-ADP-ribosylation |
| GO:0030592 | DNA ADP-ribosylation |
| GO:0000122 | negative regulation of transcription by RNA polymerase II |
| GO:0045944 | positive regulation of transcription by RNA polymerase II |
| GO:0000723 | telomere maintenance |
| GO:0006915 | apoptotic process |
| GO:0007005 | mitochondrion organization |
| GO:0032042 | mitochondrial DNA metabolic process |
| GO:0043504 | mitochondrial DNA repair |
| GO:0010332 | response to gamma radiation |
| GO:0034599 | cellular response to oxidative stress |
| GO:0034644 | cellular response to UV |
| GO:0071932 | replication fork reversal |
| GO:0045087 | innate immune response |
| GO:0043123 | positive regulation of canonical NF-kappaB signal transduction |
| GO:0060545 | positive regulation of necroptotic process |
Molecular Function (TOP 20)
| GO ID | Term |
|---|
| GO:0003950 | NAD+ poly-ADP-ribosyltransferase activity |
| GO:0140294 | NAD DNA ADP-ribosyltransferase activity |
| GO:1990404 | NAD+-protein mono-ADP-ribosyltransferase activity |
| GO:0140805 | NAD+-protein-serine ADP-ribosyltransferase activity |
| GO:0140806 | NAD+-protein-aspartate ADP-ribosyltransferase activity |
| GO:0140807 | NAD+-protein-glutamate ADP-ribosyltransferase activity |
| GO:0140808 | NAD+-protein-tyrosine ADP-ribosyltransferase activity |
| GO:0140815 | NAD+-protein-histidine ADP-ribosyltransferase activity |
| GO:0140816 | NAD+-histone H2BS6 serine ADP-ribosyltransferase activity |
| GO:0140817 | NAD+-histone H3S10 serine ADP-ribosyltransferase activity |
| GO:0140822 | NAD+-histone H2BE35 glutamate ADP-ribosyltransferase activity |
| GO:0003677 | DNA binding |
| GO:0003682 | chromatin binding |
| GO:0003684 | damaged DNA binding |
| GO:0003723 | RNA binding |
| GO:0051287 | NAD binding |
| GO:0008270 | zinc ion binding |
| GO:0042802 | identical protein binding |
| GO:0042803 | protein homodimerization activity |
| GO:0031491 | nucleosome binding |
Cellular Component (TOP 20)
| GO ID | Term |
|---|
| GO:0005634 | nucleus |
| GO:0005654 | nucleoplasm |
| GO:0005730 | nucleolus |
| GO:0005635 | nuclear envelope |
| GO:0005829 | cytosol |
| GO:0005739 | mitochondrion |
| GO:0000785 | chromatin |
| GO:0000781 | chromosome, telomeric region |
| GO:0035861 | site of double-strand break |
| GO:0090734 | site of DNA damage |
| GO:0043596 | nuclear replication fork |
| GO:0016604 | nuclear body |
| GO:0016020 | membrane |
| GO:0032991 | protein-containing complex |
| GO:0032993 | protein-DNA complex |
| GO:0005667 | transcription regulator complex |
Section 8: Protein Interactions & Molecular Networks
Protein-Protein Interactions
STRING Interactions: 5,700
IntAct Interactions: 541+
BioGRID Interactions: 2,099
TOP 50 STRING Interactors (by confidence score)
| UniProt ID | Gene | Score | Description |
|---|
| P09874 | PARP1 | 996 | Self-interaction |
| P18887 | XRCC1 | 996 | DNA repair |
| P51532 | SMARCA4 | 995 | Chromatin remodeling |
| P12956 | XRCC6 | 983 | DNA repair (Ku70) |
| P78527 | PRKDC | 976 | DNA-dependent protein kinase |
| P49916 | LIG3 | 973 | DNA ligase 3 |
| P13010 | XRCC5 | 968 | DNA repair (Ku80) |
| P06746 | POLB | 963 | DNA polymerase beta |
| P42574 | CASP3 | 954 | Caspase-3 |
| P06748 | NPM1 | 950 | Nucleophosmin |
| P51587 | BRCA2 | 942 | Breast cancer type 2 |
| Q86W56 | PARP10 | 940 | PARP family |
| P38398 | BRCA1 | 936 | Breast cancer type 1 |
| P55211 | CASP9 | 916 | Caspase-9 |
| P27695 | APEX1 | 912 | AP endonuclease |
| Q7Z2W4 | ZNF365 | 894 | Zinc finger protein |
| Q9NUW8 | TDP1 | 889 | Tyrosyl-DNA phosphodiesterase |
| Q9H0J9 | NBN | 886 | Nibrin |
| O75367 | H2AX | 885 | Histone H2AX |
| P10415 | BCL2 | 879 | Apoptosis regulator |
| P08758 | ANXA5 | 874 | Annexin A5 |
| P04637 | TP53 | 870 | Tumor protein p53 |
| Q9UNS1 | TIMELESS | 870 | Timeless |
| Q9NX46 | POLQ | 869 | DNA polymerase theta |
| Q7Z3E1 | SPRTN | 863 | DNA repair |
| P55210 | CASP7 | 849 | Caspase-7 |
| Q06609 | RAD51 | 844 | DNA recombination |
| P11387 | TOP1 | 832 | DNA topoisomerase 1 |
| Q12888 | TP53BP1 | 829 | p53-binding protein 1 |
| Q13315 | ATM | 829 | ATM kinase |
| Q14191 | WRN | 828 | Werner syndrome helicase |
| P31749 | AKT1 | 824 | Protein kinase B |
| P26358 | DNMT1 | 822 | DNA methyltransferase 1 |
| O95271 | TNKS | 821 | Tankyrase-1 |
| Q14790 | CASP8 | 820 | Caspase-8 |
| Q01844 | EWS | 802 | RNA binding protein |
| Q9NWS1 | HNRNPUL1 | 802 | hnRNP U-like |
| Q92851 | CASP10 | 800 | Caspase-10 |
| Q07817 | BCL2L1 | 796 | BCL-xL |
| P78554 | CYP2C9 | 794 | Cytochrome P450 |
| Q460N5 | PARP14 | 787 | PARP family |
| Q13485 | SMAD4 | 784 | SMAD family |
| Q460N3 | PARP9 | 784 | PARP family |
| O95863 | SNAI1 | 783 | Snail |
| O14757 | CHEK1 | 782 | Checkpoint kinase 1 |
| P39748 | FEN1 | 781 | Flap endonuclease |
| Q7Z2E3 | PARP16 | 781 | PARP family |
| Q8IXQ6 | PARP15 | 776 | PARP family |
| P04179 | SOD2 | 774 | Superoxide dismutase |
Protein Similarity
ESM2 Structural/Embedding Similarity
Total Similar Proteins: 123
| UniProt ID | Top Similarity | Average Similarity |
|---|
| O75531 | 1.0000 | 0.9665 |
| P49917 | 1.0000 | 0.9812 |
| P61283 | 1.0000 | 0.9665 |
| Q5R6L3 | 1.0000 | 0.9813 |
| Q5RBU9 | 1.0000 | 0.9665 |
| D4A4T9 | 0.9999 | 0.9813 |
| P10711 | 0.9999 | 0.9832 |
| Q8CDN6 | 0.9999 | 0.9705 |
| Q920J4 | 0.9999 | 0.9709 |
| P27008 | 0.9998 | 0.9834 |
| Q4R7U2 | 0.9998 | 0.9810 |
| Q5R7N2 | 0.9998 | 0.9760 |
| Q6AYK6 | 0.9998 | 0.9673 |
| Q12792 | 0.9998 | 0.9749 |
| A9YUB1 | 0.9997 | 0.9809 |
| G3GTP0 | 0.9997 | 0.9815 |
| O35047 | 0.9997 | 0.9742 |
| Q29RL2 | 0.9997 | 0.9805 |
| O89044 | 0.9997 | 0.9770 |
| O95433 | 0.9997 | 0.9738 |
DIAMOND Sequence Homology
Total Homologous Proteins: 27
| UniProt ID | Top Identity (%) | Top Bitscore |
|---|
| Q9R152 | 95.4 | 1886 |
| P27008 | 95.4 | 1880 |
| P11103 | 94.5 | 1867 |
| P09874 | 92.4 | 1833 |
| P18493 | 88.9 | 1773 |
| Q5RHR0 | 87.4 | 1446 |
| P49916 | 86.4 | 1720 |
| P97386 | 86.4 | 1722 |
| O88554 | 83.9 | 958 |
| Q9UGN5 | 83.9 | 950 |
| P26446 | 80.0 | 1563 |
| Q7EYV7 | 80.0 | 1576 |
| Q9ZSV1 | 80.0 | 1575 |
| Q3ULW8 | 79.8 | 835 |
| Q9Y6F1 | 79.8 | 831 |
| Q0JMY1 | 79.5 | 805 |
| Q5Z8Q9 | 79.5 | 979 |
| P31669 | 77.0 | 1503 |
| O50017 | 76.6 | 964 |
| Q11208 | 62.4 | 1228 |
Section 9: Transcription Factor Regulatory Data
Downstream Targets (Genes Regulated BY PARP1)
Total Target Gene Count: 159 (from CollecTRI)
TOP 50 Target Genes
| Target Gene | Regulation | Confidence |
|---|
| BRCA1 | - | High |
| BRCA2 | Repression | High |
| BCL2 | Repression | High |
| BCL2L1 | - | High |
| E2F1 | Activation | High |
| CDH1 | Activation | High |
| CDKN1A | Unknown | High |
| CDKN1B | - | High |
| CDKN2A | - | High |
| CCND1 | - | High |
| MMP9 | - | High |
| VIM | Activation | High |
| FN1 | Unknown | High |
| ESR1 | - | High |
| AR | - | High |
| ERBB2 | - | High |
| CASP3 | - | High |
| CASP4 | - | High |
| DNMT1 | - | High |
| OGG1 | - | High |
| XRCC1 | - | High |
| TOP2B | - | High |
| POLD1 | - | High |
| SIRT1 | - | High |
| PPARG | - | High |
| TNF | - | Low |
| IL2 | - | Low |
| IFNG | Unknown | High |
| CXCL1 | Unknown | High |
| CXCL10 | Repression | High |
| CXCL12 | Unknown | High |
| NOS2 | - | High |
| PTGS2 | Unknown | High |
| ICAM1 | - | High |
| VCAM1 | - | High |
| MCL1 | - | High |
| XIAP | - | High |
| HSPA1B | Repression | High |
| HTR1A | Repression | High |
| OPRM1 | Repression | High |
| SNCA | - | High |
| TGFBR1 | - | High |
| TGFBR2 | Activation | High |
| SERPINE1 | Activation | - |
| SERPINF1 | Repression | - |
| THBD | Activation | - |
| SNAI1 | Repression | - |
| FOXO1 | Repression | - |
| JUN | Unknown | - |
| NFKB1 | Unknown | - |
Upstream Regulators (TFs that Regulate PARP1)
| Transcription Factor | Regulation | Confidence |
|---|
| SP1 | - | High |
| SP3 | - | High |
| YY1 | Unknown | High |
| MYC | Unknown | High |
| AR | - | High |
| NFATC2 | - | High |
| TP53 | - | Low |
| RELA | - | Low |
| NFKB | - | Low |
| ETS1 | Activation | Low |
| MYBL2 | - | Low |
| EP300 | Unknown | - |
| FLT3 | Activation | - |
| PGR | Unknown | - |
| PAX3 | - | Low |
SIGNOR Regulatory Interactions
Total SIGNOR Interactions: 44
| Entity A | Entity B | Effect | Mechanism |
|---|
| CASP3 | PARP1 | down-regulates activity | cleavage |
| CASP7 | PARP1 | down-regulates | cleavage |
| MAPK1 | PARP1 | up-regulates | phosphorylation |
| CDK2 | PARP1 | up-regulates activity | phosphorylation |
| CDK5 | PARP1 | down-regulates activity | phosphorylation |
| EGFR | PARP1 | up-regulates activity | phosphorylation |
| MET | PARP1 | up-regulates activity | phosphorylation |
| PRKDC | PARP1 | down-regulates activity | phosphorylation |
| ATR | PARP1 | down-regulates activity | phosphorylation |
| PRKACA | PARP1 | up-regulates activity | phosphorylation |
| PRKCD | PARP1 | down-regulates activity | phosphorylation |
| NAT10 | PARP1 | up-regulates quantity | acetylation |
| CHFR | PARP1 | down-regulates quantity | polyubiquitination |
| RNF144A | PARP1 | down-regulates quantity | ubiquitination |
| PHRF1 | PARP1 | down-regulates quantity | ubiquitination |
| DNA_damage | PARP1 | up-regulates | - |
| PARP1 | TP53 | up-regulates activity | relocalization |
| PARP1 | POLA1 | up-regulates activity | binding |
| PARP1 | KDM5B | up-regulates activity | relocalization |
| PARP1 | MRE11 | up-regulates activity | relocalization |
Section 10: Drug & Pharmacology Data
ChEMBL Target Information
| ChEMBL Target ID | Name | Type |
|---|
| CHEMBL3105 | Poly [ADP-ribose] polymerase 1 | SINGLE PROTEIN |
| CHEMBL3390820 | PARP 1, 2 and 3 | PROTEIN FAMILY |
| CHEMBL4742271 | Protein cereblon/PARP1 | PROTEIN-PROTEIN INTERACTION |
| CHEMBL4748223 | von Hippel-Lindau disease tumor suppressor/PARP1 | PROTEIN-PROTEIN INTERACTION |
Targeting Molecules
Total Compounds Tested: 5,836+ (ChEMBL activity records)
FDA-Approved PARP Inhibitors (Phase 4)
| ChEMBL ID | Drug Name | Type | Phase | Indications |
|---|
| CHEMBL521686 | OLAPARIB | Small molecule | 4 | Ovarian, breast, pancreatic, prostate cancer |
| CHEMBL1094636 | NIRAPARIB | Small molecule | 4 | Ovarian cancer |
| CHEMBL1173055 | RUCAPARIB | Small molecule | 4 | Ovarian, prostate cancer |
| CHEMBL3833368 | RUCAPARIB CAMSYLATE | Small molecule | 4 | (salt form) |
| CHEMBL3137320 | TALAZOPARIB | Small molecule | 4 | Breast cancer |
| CHEMBL629 | AMITRIPTYLINE | Small molecule | 4 | (off-target activity) |
Clinical Development Stage Compounds (Phase 3)
| ChEMBL ID | Drug Name | Type | Phase |
|---|
| CHEMBL506871 | VELIPARIB | Small molecule | 3 |
| CHEMBL1170047 | INIPARIB | Small molecule | 3 |
| CHEMBL3930624 | FLUZOPARIB | Small molecule | 3 |
| CHEMBL4112930 | PAMIPARIB | Small molecule | 3 |
| CHEMBL5095220 | SARUPARIB | Small molecule | 3 |
PharmGKB Information
| Property | Value |
|---|
| PharmGKB ID | PA32 |
| VIP Gene | ✓ Yes (Very Important Pharmacogene) |
| CPIC Guideline | No |
| Chromosome | chr1 |
Known Drug-Gene Interactions (SIGNOR)
| Drug | Effect | Mechanism |
|---|
| Olaparib | down-regulates | chemical inhibition |
| Veliparib | down-regulates | chemical inhibition |
| 4-Iodo-3-nitrobenzamide (Iniparib) | down-regulates | chemical inhibition |
| A-966492 | down-regulates | chemical inhibition |
| LSM-1988 | down-regulates | chemical inhibition |
Section 11: Expression Profiles
Tissue Expression (Bgee)
| Metric | Value |
|---|
| Expression Breadth | Ubiquitous |
| Total Present Calls | 292 |
| Total Absent Calls | 3 |
| Total Conditions | 295 |
| Max Expression Score | 98.57 |
| Average Expression Score | 88.91 |
| Gold Quality Conditions | 268 |
TOP 30 Tissues by Expression Score
| Tissue (UBERON) | Expression | Score | Quality |
|---|
| Ventricular zone | present | 98.57 | gold |
| Ganglionic eminence | present | 98.30 | gold |
| Primordial germ cell in gonad | present | 98.15 | gold |
| Embryo | present | 97.98 | gold |
| C1 segment of cervical spinal cord | present | 97.23 | gold |
| Spinal cord | present | 96.92 | gold |
| Lymph node | present | 96.73 | gold |
| Middle frontal gyrus | present | 96.46 | gold |
| Cortical plate | present | 96.28 | gold |
| Vermiform appendix | present | 96.16 | gold |
| Male germ line stem cell in testis | present | 96.05 | gold |
| Corpus callosum | present | 95.95 | gold |
| Apex of heart | present | 95.72 | gold |
| Amygdala | present | 95.59 | gold |
| Caecum | present | 95.52 | gold |
| Islet of Langerhans | present | 95.50 | gold |
| Heart left ventricle | present | 95.40 | gold |
| Spleen | present | 95.36 | gold |
| Cardiac ventricle | present | 95.29 | gold |
| Prefrontal cortex | present | 95.28 | gold |
| Body of pancreas | present | 95.27 | gold |
| Putamen | present | 95.06 | gold |
| Inferior vagus X ganglion | present | 95.01 | gold |
| Caudate nucleus | present | 94.82 | gold |
| Adenohypophysis | present | 94.80 | gold |
| Pancreas | present | 94.72 | gold |
| Bone marrow cell | present | 94.64 | gold |
| Tonsil | present | 94.52 | gold |
| Pituitary gland | present | 94.52 | gold |
| Right frontal lobe | present | 94.52 | gold |
Expression Pattern: PARP1 shows ubiquitous expression across virtually all human tissues, with highest expression in developing neural tissues, germline cells, and immune tissues.
Single-Cell Expression Data (Expression Atlas)
Total Single-Cell Datasets: 11
| Experiment ID | Description | Cell Count |
|---|
| E-CURD-112 | Fetal bone marrow haematopoiesis (10X) | 56,592 |
| E-CURD-77 | COVID-19 convalescent patient lymphocytes | 62,076 |
| E-GEOD-114530 | Human fetal kidneys | 22,148 |
| E-GEOD-124263 | Neonatal and adult human testis | 64,206 |
| E-GEOD-139324 | Head and neck cancer immune landscape | 204,315 |
| E-GEOD-76312 | CML cancer stem cells during disease | 2,151 |
| E-HCAD-23 | First-trimester placenta and decidua | 41,132 |
| E-HCAD-56 | Developing human spinal cord | 245,394 |
| E-MTAB-6701 | First trimester fetal-maternal interface | 135,071 |
| E-MTAB-7008 | iPSC endoderm differentiation | 1,024 |
| E-MTAB-9689 | RPE1-MYCN-ER cells with MYCN activation | 10,476 |
Section 12: Disease Associations
Mendelian Disease Links (GenCC)
| Disease | Classification | Inheritance | Submitter |
|---|
| Neurodevelopmental disorder | Limited | Autosomal recessive | G2P |
GWAS Associations
Total GWAS Associations: 10
| Study ID | Disease/Trait | P-value | Mapped Gene |
|---|
| GCST010303_59 | Nevus count or cutaneous melanoma | 2×10⁻¹⁷ | PARP1 |
| GCST008366_15 | Leukocyte telomere length | 2×10⁻¹⁸ | PARP1 |
| GCST008366_5 | Leukocyte telomere length | 2×10⁻¹⁶ | PARP1 |
| GCST007505_16 | Nevus count or cutaneous melanoma | 1×10⁻¹¹ | PARP1 - RPS3AP7 |
| GCST009856_26 | Leukocyte telomere length | 1×10⁻¹⁰ | PARP1 |
| GCST005991_6 | Platelet count | 2×10⁻⁸ | PARP1 |
| GCST001266_2 | Melanoma | 9×10⁻⁸ | PARP1 |
| GCST005143_1 | Telomere length | 2×10⁻⁶ | CDKN2AIPNLP1 - STUM |
| GCST005194_33 | Coronary artery disease | 7×10⁻⁶ | YBX1P9 - PARP1 |
| GCST008394_1 | Mild to moderate CKD | 1×10⁻⁶ | YBX1P9 - PARP1 |
HPO Phenotype Associations
No direct HPO phenotype associations found.
Cancer Relevance
PARP1 is a major therapeutic target in cancer treatment due to:
- Synthetic lethality with BRCA1/2 mutations
- Role in DNA damage repair pathways
- Involvement in genomic stability maintenance
PARP inhibitors are approved for treatment of:
- Ovarian cancer (BRCA-mutated and HRD-positive)
- Breast cancer (BRCA-mutated)
- Pancreatic cancer (BRCA-mutated)
- Prostate cancer (HRR gene-mutated)
DATA SOURCES
| Database | Description |
|---|
| HGNC | HUGO Gene Nomenclature Committee |
| Ensembl | Genome browser and annotation |
| NCBI Entrez | Gene database |
| OMIM | Online Mendelian Inheritance in Man |
| UniProt | Protein sequence and function |
| InterPro | Protein domains and families |
| PDB | Protein Data Bank structures |
| AlphaFold | AI-predicted structures |
| ClinVar | Clinical variant interpretations |
| AlphaMissense | AI missense pathogenicity predictions |
| SpliceAI | AI splice variant predictions |
| Reactome | Biological pathways |
| Gene Ontology | Functional annotations |
| STRING | Protein interactions |
| IntAct | Molecular interactions |
| BioGRID | Biological interactions |
| SIGNOR | Signaling networks |
| ChEMBL | Bioactive molecules |
| PharmGKB | Pharmacogenomics |
| Bgee | Gene expression |
| Expression Atlas | Single-cell expression |
| GWAS Catalog | Genome-wide associations |
| GenCC | Gene-disease curation |
| CollecTRI | Transcription factor targets |
Reference generated: March 2026
Data retrieved via biobtree integrated biological database search
I have compiled a comprehensive cross-database identifier and functional mapping reference for human PARP1 (Poly(ADP-ribose) Polymerase 1).
Summary of Key Findings
| Category | Key Data |
|---|
| Gene IDs | HGNC:270, ENSG00000143799, Entrez:142, OMIM:173870 |
| Location | Chr 1q42.12: 226,360,210-226,408,154 (minus strand) |
| Transcripts | 37 total (13 protein-coding), MANE Select: NM_001618 |
| Protein | UniProt P09874, 1,014 aa, 113 kDa |
| Domains | 14 InterPro (BRCT, Zn-finger PARP, WGR, catalytic) |
| Structures | 155 PDB (mostly X-ray with inhibitors) |
| Clinical Variants | 115 ClinVar entries (no pathogenic reported) |
| AI Predictions | 6,722 AlphaMissense, 3,166 SpliceAI |
| Pathways | 8 Reactome (DNA repair, NER, base excision) |
| GO Terms | 102 total annotations |
| Interactions | 5,700+ STRING, 541+ IntAct |
| TF Targets | 159 downstream genes regulated |
| Approved Drugs | Olaparib, Niraparib, Rucaparib, Talazoparib |
| Expression | Ubiquitous (292/295 tissues positive) |
| GWAS | Telomere length, melanoma, platelet count |
PARP1 is one of the most extensively characterized drug targets, with 5 FDA-approved PARP inhibitors used in cancer therapy exploiting synthetic lethality with BRCA mutations.