PARP1 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human PARP1. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human PARP1. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 32 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, mim, ortholog, pdb, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-03-27 — For the latest data, query BioBTree directly via MCP or API.
View API calls (43)
PARP1

Section 1: Gene Identifiers Primary Gene Identifiers

DatabaseIdentifierDetails
HGNC IDHGNC:270Approved symbol: PARP1
HGNC Namepoly(ADP-ribose) polymerase 1Status: Approved
Ensembl GeneENSG00000143799GRCh38
NCBI Entrez142Gene ID
OMIM173870Gene/Locus
Genomic Location (GRCh38/hg38)
AttributeValue
Chromosome1
Cytogenetic Band1q42.12
Start Position226,360,210
End Position226,408,154
StrandMinus (-)
Genome AssemblyGRCh38
Gene Symbols and Aliases
TypeValues
Approved SymbolPARP1
Previous SymbolsPPOL, ADPRT
AliasesPARP, ARTD1, Poly-PARP, PARS, ADPRT1, pADPRT-1
Locus TypeGene with protein product
Gene GroupsZinc fingers PARP-type; Poly(ADP-ribose) polymerases

Section 2: Transcript Identifiers Ensembl Transcripts Total Transcript Count: 37

Transcript IDBiotypeStartEndNotes
ENST00000366794protein_coding226,360,691226,408,093Canonical
ENST00000677203protein_coding226,360,697226,408,100
ENST00000874606protein_coding226,360,693226,408,090
ENST00000874607protein_coding226,361,005226,408,093
ENST00000874608protein_coding226,361,003226,408,060
ENST00000874609protein_coding226,361,003226,408,055
ENST00000922077protein_coding226,360,693226,408,116
ENST00000922078protein_coding226,360,691226,408,100
ENST00000922079protein_coding226,361,003226,408,096
ENST00000922080protein_coding226,361,003226,408,090
ENST00000922081protein_coding226,361,004226,408,064
ENST00000963780protein_coding226,360,693226,408,120
ENST00000963781protein_coding226,361,006226,408,064
ENST00000366792protein_coding_CDS_not_defined226,392,258226,408,075
ENST00000469663protein_coding_CDS_not_defined226,388,698226,392,792
ENST00000676481protein_coding_CDS_not_defined226,360,579226,364,336
ENST00000677189protein_coding_CDS_not_defined226,360,210226,372,887
ENST00000677985protein_coding_CDS_not_defined226,360,210226,372,915
ENST00000678226protein_coding_CDS_not_defined226,360,210226,381,883
ENST00000677091nonsense_mediated_decay226,360,700226,408,100
ENST00000678144nonsense_mediated_decay226,360,700226,408,096
ENST00000678560nonsense_mediated_decay226,360,697226,408,085
ENST00000366790retained_intron226,399,195226,407,982
ENST00000463968retained_intron226,361,319226,364,372
ENST00000468608retained_intron226,361,708226,363,160
ENST00000490921retained_intron226,361,295226,371,115
ENST00000491816retained_intron226,361,410226,362,252
ENST00000498787retained_intron226,360,748226,372,746
ENST00000676565retained_intron226,360,700226,376,067
ENST00000676685retained_intron226,360,729226,408,154
ENST00000676709retained_intron226,360,748226,408,154
ENST00000677374retained_intron226,362,218226,386,797
ENST00000677815retained_intron226,362,386226,366,954
ENST00000677884retained_intron226,360,697226,408,096
ENST00000678288retained_intron226,360,210226,374,420
ENST00000678781retained_intron226,360,727226,408,154
ENST00000679276retained_intron226,360,743226,408,154
RefSeq Transcripts (Human)
RefSeq IDTypeStatusMANE Select
NM_001618mRNAREVIEWED✓ Yes
NM_179834mRNAREVIEWEDNo
NP_001609proteinREVIEWED✓ Yes
NP_850165proteinREVIEWEDNo
CCDS Identifier
CCDS ID
CCDS1554
Exons for Canonical Transcript (ENST00000366794) Total Exon Count: 23
Exon IDStartEndStrand
ENSE00001697768226,407,810226,408,093-
ENSE00000962007226,402,214226,402,379-
ENSE00003548024226,392,199226,392,314-
ENSE00003550925226,390,410226,390,624-
ENSE00000962010226,388,656226,388,755-
ENSE00000962011226,386,326226,386,442-
ENSE00000962012226,385,504226,385,680-
ENSE00000962013226,383,036226,383,183-
ENSE00000962014226,381,068226,381,208-
ENSE00000962015226,379,922226,380,164-
ENSE00000962016226,379,573226,379,641-
ENSE00000962017226,379,142226,379,274-
ENSE00000962018226,377,108226,377,303-
ENSE00000962019226,374,226226,374,354-
ENSE00003634072226,368,199226,368,321-
ENSE00003576366226,367,480226,367,608-
ENSE00003492408226,365,954226,366,052-
ENSE00003498847226,365,002226,365,154-
ENSE00003595606226,363,943226,364,070-
ENSE00003642486226,363,099226,363,160-
ENSE00000962020226,370,434226,370,517-
ENSE00003476295226,361,969226,362,083-
ENSE00001442633226,360,691226,361,541-

Section 3: Protein Identifiers UniProt Accessions

UniProt IDStatusName
P09874✓ Reviewed (Swiss-Prot)Poly [ADP-ribose] polymerase 1
A0A7I2V384Unreviewed (TrEMBL)-
A0A7I2V3E1Unreviewed (TrEMBL)-
A0A7I2V5E9Unreviewed (TrEMBL)-
A0A7I2V625Unreviewed (TrEMBL)-
Canonical Protein Properties (P09874)
PropertyValue
Length1,014 amino acids
Mass113,084 Da
Alternative NamesADP-ribosyltransferase diphtheria toxin-like 1; DNA ADP-ribosyltransferase PARP1; NAD(+) ADP-ribosyltransferase 1; Poly[ADP-ribose] synthase 1
RefSeq Proteins (Human)
RefSeq IDStatusMANE Select
NP_001609REVIEWED✓ Yes
NP_850165REVIEWEDNo
Protein Domains and Families (InterPro) Total Domain Count: 14
InterPro IDNameType
IPR001357BRCT_domDomain
IPR001510Znf_PARPDomain
IPR004102Poly(ADP-ribose)pol_reg_domDomain
IPR008893WGR_domainDomain
IPR012317Poly(ADP-ribose)pol_cat_domDomain
IPR012982PARP1-like_PADR1_Zn_ribbonDomain
IPR049296PARP1-like_PADR1_NDomain
IPR008288PARPFamily
IPR050800ARTD/PARPFamily
IPR036420BRCT_dom_sfHomologous_superfamily
IPR036616Poly(ADP-ribose)pol_reg_dom_sfHomologous_superfamily
IPR036930WGR_dom_sfHomologous_superfamily
IPR036957Znf_PARP_sfHomologous_superfamily
IPR038650PADR1_C_dom_sfHomologous_superfamily

Section 4: Structure Identifiers Experimental Structures Total PDB Structure Count: 155

PDB IDMethodResolution (Å)Description
6NRHX-RAY1.50ART domain bound to inhibitor UTT63
9ETQX-RAY1.59Catalytic domain bound to AZD5305 (SARUPARIB)
7AACX-RAY1.59Catalytic domain with veliparib
6NRJX-RAY1.65ART domain bound to inhibitor UTT93
7KK5X-RAY1.70Catalytic domain with niraparib
6NRGX-RAY1.70ART domain bound to inhibitor UTT57
2RIQX-RAY1.70Third Zinc-binding domain
7AAAX-RAY1.74Catalytic domain (apo)
7KK2X-RAY1.695Catalytic domain
6NTUX-RAY1.80ART domain bound to inhibitor UKTT-15
7ONTX-RAY1.85Selective pyridine carboxamide inhibitor
5WS1X-RAY1.90Benzoimidazole inhibitor
4ZZZX-RAY1.90Isoindolinone inhibitor
7KK4X-RAY1.96Catalytic domain with olaparib
7ONSX-RAY1.97Isoquinolone inhibitor
6M3IX-RAY1.98HPF1/PARP1 complex
6NRFX-RAY2.00ART domain with inhibitor UTT103
6XVWX-RAY2.00Catalytic domain with MC2050
7KK3X-RAY2.06Catalytic domain with talazoparib
7KK6X-RAY2.06Catalytic domain with veliparib
7ONRX-RAY2.058-chloroquinazolinone inhibitor
8VYHX-RAY2.05Compound complex
6VKKX-RAY2.10Catalytic domain with rucaparib
8HE7X-RAY2.10Quinazoline-2,4(1H,3H)-dione inhibitor
4UNDX-RAY2.20Catalytic domain with talazoparib
4R6EX-RAY2.20Catalytic domain with niraparib
7AADX-RAY2.21Catalytic domain with olaparib
6GHKX-RAY2.28Catalytic domain with ME0527
2RD6X-RAY2.30Catalytic domain with inhibitor
6BHVX-RAY2.30NAD+ analog BAD complex
4PJTX-RAY2.35Catalytic domain with BMN 673
5DS3X-RAY2.60Constitutively active PARP-1
7CMWX-RAY2.70Catalytic domain with pamiparib
8HLRX-RAY2.85Catalytic domain with Fluzoparib
2COKNMR-BRCT domain solution structure
2CR9NMR-WGR domain solution structure
2CS2NMR-Second Zn-finger domain
2DMJNMR-First zf-PARP domain
2JVNNMR-Domain C
2L30NMR-Zinc finger 1
2L31NMR-Zinc finger 2
7SCYCRYO-EM4.10Nuc147 bound to single BRCT
7SCZCRYO-EM3.50Nuc147 bound to multiple BRCTs
9MJACRYO-EM4.30PARP1 ART with HPF1 and EB47
... and 112 additional structures Predicted Structures
SourceIDpLDDT ScoreSequence LengthFraction Very High Confidence
AlphaFoldP0987483.2379440.51

Section 5: Cross-Species Orthologs

OrganismGene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000026496Parp1protein_coding
Rat (Rattus norvegicus)ENSRNOG00000003084Parp1protein_coding
Zebrafish (Danio rerio)ENSDARG00000019529parp1protein_coding
Fruit fly (Drosophila melanogaster)FBGN0010247Parp1protein_coding
Worm (C. elegans)WBGENE00004049parp-1protein_coding
Yeast (S. cerevisiae)No ortholog found--

Section 6: Clinical Variants & AI Predictions ClinVar Clinical Variants Total Variant Count: 115 Variant Classification Summary

ClassificationCount
Pathogenic0
Likely Pathogenic0
Uncertain Significance (VUS)~85
Likely Benign~15
Benign~15
Representative ClinVar Variants
ClinVar IDHGVS NotationClassificationReview Status
1261945c.2285T>C (p.Val762Ala)BenignMultiple submitters
715247c.942C>T (p.Asp314=)BenignMultiple submitters
779066c.1129C>T (p.Pro377Ser)Benign/Likely benignMultiple submitters
780306c.1245C>T (p.Leu415=)BenignMultiple submitters
711740c.1914C>T (p.Phe638=)BenignSingle submitter
775674c.562G>A (p.Ala188Thr)BenignSingle submitter
725933c.339T>C (p.Phe113=)BenignSingle submitter
3041509c.2819A>G (p.Lys940Arg)BenignNo criteria
2328794c.886G>A (p.Glu296Lys)Likely benignSingle submitter
737642c.1000G>A (p.Val334Ile)Likely benignSingle submitter
722614c.1146C>G (p.Ser382=)Likely benignSingle submitter
725858c.2250G>A (p.Pro750=)Likely benignSingle submitter
761542c.121-4C>GLikely benignSingle submitter
2205461c.1840T>C (p.Phe614Leu)Uncertain significanceSingle submitter
2205462c.1842C>A (p.Phe614Leu)Uncertain significanceSingle submitter
2216601c.482G>T (p.Gly161Val)Uncertain significanceSingle submitter
2254608c.159C>A (p.His53Gln)Uncertain significanceSingle submitter
2263329c.472T>C (p.Tyr158His)Uncertain significanceSingle submitter
2263390c.1190C>T (p.Thr397Ile)Uncertain significanceSingle submitter
2270980c.1231A>G (p.Met411Val)Uncertain significanceSingle submitter
AI-Based Variant Effect Predictions AlphaMissense Predictions Total Predictions: 6,722 TOP 50 Predicted Pathogenic Missense Variants:
Variant IDProtein VariantAM ScoreClassification
1:226361465:A:GW1014R0.984likely_pathogenic
1:226361465:A:TW1014R0.984likely_pathogenic
1:226361478:A:CF1009L0.983likely_pathogenic
1:226361478:A:TF1009L0.983likely_pathogenic
1:226361486:A:GF1007L0.998likely_pathogenic
1:226361484:G:CF1007L0.998likely_pathogenic
1:226361484:G:TF1007L0.998likely_pathogenic
1:226361485:A:GF1007S0.997likely_pathogenic
1:226361504:A:CY1001D0.997likely_pathogenic
1:226361497:A:GL1003P0.995likely_pathogenic
1:226361500:A:CL1002R0.995likely_pathogenic
1:226361500:A:GL1002P0.999likely_pathogenic
1:226361500:A:TL1002Q0.986likely_pathogenic
1:226361503:T:GY1001S0.990likely_pathogenic
1:226361497:A:CL1003R0.987likely_pathogenic
1:226361491:A:GL1005P0.996likely_pathogenic
1:226361491:A:CL1005R0.982likely_pathogenic
1:226361486:A:CF1007V0.981likely_pathogenic
1:226361486:A:TF1007I0.984likely_pathogenic
1:226361485:A:CF1007C0.978likely_pathogenic
1:226361504:A:TY1001N0.975likely_pathogenic
1:226361497:A:TL1003Q0.970likely_pathogenic
1:226361504:A:GY1001H0.962likely_pathogenic
1:226361491:A:TL1005Q0.949likely_pathogenic
1:226361503:T:CY1001C0.947likely_pathogenic
1:226361479:A:GF1009S0.936likely_pathogenic
1:226361463:C:AW1014C0.933likely_pathogenic
1:226361463:C:GW1014C0.933likely_pathogenic
1:226361493:T:AK1004N0.932likely_pathogenic
1:226361493:T:GK1004N0.932likely_pathogenic
1:226361509:A:CL999R0.919likely_pathogenic
1:226361505:C:AK1000N0.912likely_pathogenic
1:226361505:C:GK1000N0.912likely_pathogenic
1:226361509:A:GL999P0.905likely_pathogenic
1:226361464:C:GW1014S0.856likely_pathogenic
1:226361479:A:CF1009C0.852likely_pathogenic
1:226361480:A:TF1009I0.844likely_pathogenic
1:226361480:A:CF1009V0.835likely_pathogenic
1:226361494:T:AK1004I0.825likely_pathogenic
1:226361495:T:CK1004E0.758likely_pathogenic
1:226361485:A:TF1007Y0.730likely_pathogenic
1:226361494:T:GK1004T0.729likely_pathogenic
1:226361506:T:AK1000M0.720likely_pathogenic
1:226361465:A:CW1014G0.718likely_pathogenic
1:226361475:C:AK1010N0.693likely_pathogenic
1:226361475:C:GK1010N0.693likely_pathogenic
1:226361464:C:AW1014L0.688likely_pathogenic
1:226361467:A:GL1013P0.669likely_pathogenic
1:226361507:T:CK1000E0.667likely_pathogenic
1:226361467:A:CL1013R0.655likely_pathogenic
SpliceAI Predictions Total Splice Predictions: 3,166 TOP 50 Predicted Splice-Altering Variants:
Variant IDEffectDelta Score
1:226362082:ACCT:Aacceptor_loss0.98
1:226362083:CCTGA:Cacceptor_loss0.98
1:226362084:CTGA:Cacceptor_loss0.98
1:226362085:T:Aacceptor_loss0.98
1:226362086:G:Cacceptor_loss0.94
1:226362960:ACACC:Adonor_gain0.94
1:226362961:CACCC:Cdonor_gain0.94
1:226362962:ACC:Adonor_gain0.94
1:226362079:CAAAC:Cacceptor_gain0.90
1:226362087:AAAAA:Aacceptor_loss0.89
1:226362084:C:CCacceptor_gain0.87
1:226362091:A:Cacceptor_loss0.42
1:226362080:AAAC:Aacceptor_gain0.61
1:226362089:A:Cacceptor_loss0.63
1:226362088:A:Cacceptor_loss0.59
1:226362955:CACA:Cdonor_loss0.51
1:226362956:ACAC:Adonor_loss0.57
1:226362958:AC:Adonor_loss0.57
1:226362959:CACA:Cdonor_loss0.57
1:226362963:C:CGdonor_loss0.57
1:226362962:AC:Adonor_gain0.55
1:226362081:AAC:Aacceptor_gain0.50
1:226362961:CACC:Cdonor_gain0.49
1:226362960:ACAC:Adonor_gain0.49
1:226362097:G:Cacceptor_loss0.44
1:226361804:AGGCT:Aacceptor_gain0.42
1:226361805:GGCT:Gacceptor_gain0.42
1:226362055:TG:Tacceptor_loss0.42
1:226362954:G:Tdonor_gain0.42
1:226362960:A:Tdonor_gain0.42
1:226361967:A:ACdonor_gain0.40
1:226361968:C:CCdonor_gain0.40
1:226362956:A:ACdonor_gain0.40
1:226362957:C:CCdonor_gain0.40
1:226362608:TCA:Tdonor_gain0.39
1:226361823:ACC:Aacceptor_gain0.38
1:226361824:C:Aacceptor_gain0.38
1:226362961:C:Tdonor_gain0.38
1:226362962:A:ACdonor_gain0.37
1:226362963:C:CCdonor_gain0.37
1:226362092:C:CTacceptor_gain0.36
1:226362954:GCACA:Gdonor_loss0.36
1:226362082:AC:Aacceptor_gain0.35
1:226362083:CC:Cacceptor_gain0.35
1:226362040:C:CTacceptor_loss0.32
1:226362036:C:CTacceptor_loss0.33
1:226361822:GACC:Gacceptor_gain0.33
1:226361961:968:CT:Cdonor_gain0.33
1:226362198:T:TAdonor_gain0.33
1:226361806:GCTAT:Gacceptor_gain0.33

Section 7: Biological Pathways & Gene Ontology Reactome Pathways Total Pathway Count: 8

Pathway IDNameDisease Pathway
R-HSA-110362POLB-Dependent Long Patch Base Excision RepairNo
R-HSA-192814vRNA SynthesisYes
R-HSA-2173795Downregulation of SMAD2/3:SMAD4 transcriptional activityNo
R-HSA-3108214SUMOylation of DNA damage response and repair proteinsNo
R-HSA-5685939HDR through MMEJ (alt-NHEJ)No
R-HSA-5696394DNA Damage Recognition in GG-NERNo
R-HSA-5696395Formation of Incision Complex in GG-NERNo
R-HSA-5696400Dual Incision in GG-NERNo
Gene Ontology Annotations Total GO Terms: 102 Biological Process (TOP 20)
GO IDTerm
GO:0006281DNA repair
GO:0006302double-strand break repair
GO:0006974DNA damage response
GO:0070212protein poly-ADP-ribosylation
GO:0070213protein auto-ADP-ribosylation
GO:0030592DNA ADP-ribosylation
GO:0000122negative regulation of transcription by RNA polymerase II
GO:0045944positive regulation of transcription by RNA polymerase II
GO:0000723telomere maintenance
GO:0006915apoptotic process
GO:0007005mitochondrion organization
GO:0032042mitochondrial DNA metabolic process
GO:0043504mitochondrial DNA repair
GO:0010332response to gamma radiation
GO:0034599cellular response to oxidative stress
GO:0034644cellular response to UV
GO:0071932replication fork reversal
GO:0045087innate immune response
GO:0043123positive regulation of canonical NF-kappaB signal transduction
GO:0060545positive regulation of necroptotic process
Molecular Function (TOP 20)
GO IDTerm
GO:0003950NAD+ poly-ADP-ribosyltransferase activity
GO:0140294NAD DNA ADP-ribosyltransferase activity
GO:1990404NAD+-protein mono-ADP-ribosyltransferase activity
GO:0140805NAD+-protein-serine ADP-ribosyltransferase activity
GO:0140806NAD+-protein-aspartate ADP-ribosyltransferase activity
GO:0140807NAD+-protein-glutamate ADP-ribosyltransferase activity
GO:0140808NAD+-protein-tyrosine ADP-ribosyltransferase activity
GO:0140815NAD+-protein-histidine ADP-ribosyltransferase activity
GO:0140816NAD+-histone H2BS6 serine ADP-ribosyltransferase activity
GO:0140817NAD+-histone H3S10 serine ADP-ribosyltransferase activity
GO:0140822NAD+-histone H2BE35 glutamate ADP-ribosyltransferase activity
GO:0003677DNA binding
GO:0003682chromatin binding
GO:0003684damaged DNA binding
GO:0003723RNA binding
GO:0051287NAD binding
GO:0008270zinc ion binding
GO:0042802identical protein binding
GO:0042803protein homodimerization activity
GO:0031491nucleosome binding
Cellular Component (TOP 20)
GO IDTerm
GO:0005634nucleus
GO:0005654nucleoplasm
GO:0005730nucleolus
GO:0005635nuclear envelope
GO:0005829cytosol
GO:0005739mitochondrion
GO:0000785chromatin
GO:0000781chromosome, telomeric region
GO:0035861site of double-strand break
GO:0090734site of DNA damage
GO:0043596nuclear replication fork
GO:0016604nuclear body
GO:0016020membrane
GO:0032991protein-containing complex
GO:0032993protein-DNA complex
GO:0005667transcription regulator complex

Section 8: Protein Interactions & Molecular Networks Protein-Protein Interactions STRING Interactions: 5,700 IntAct Interactions: 541+ BioGRID Interactions: 2,099 TOP 50 STRING Interactors (by confidence score)

UniProt IDGeneScoreDescription
P09874PARP1996Self-interaction
P18887XRCC1996DNA repair
P51532SMARCA4995Chromatin remodeling
P12956XRCC6983DNA repair (Ku70)
P78527PRKDC976DNA-dependent protein kinase
P49916LIG3973DNA ligase 3
P13010XRCC5968DNA repair (Ku80)
P06746POLB963DNA polymerase beta
P42574CASP3954Caspase-3
P06748NPM1950Nucleophosmin
P51587BRCA2942Breast cancer type 2
Q86W56PARP10940PARP family
P38398BRCA1936Breast cancer type 1
P55211CASP9916Caspase-9
P27695APEX1912AP endonuclease
Q7Z2W4ZNF365894Zinc finger protein
Q9NUW8TDP1889Tyrosyl-DNA phosphodiesterase
Q9H0J9NBN886Nibrin
O75367H2AX885Histone H2AX
P10415BCL2879Apoptosis regulator
P08758ANXA5874Annexin A5
P04637TP53870Tumor protein p53
Q9UNS1TIMELESS870Timeless
Q9NX46POLQ869DNA polymerase theta
Q7Z3E1SPRTN863DNA repair
P55210CASP7849Caspase-7
Q06609RAD51844DNA recombination
P11387TOP1832DNA topoisomerase 1
Q12888TP53BP1829p53-binding protein 1
Q13315ATM829ATM kinase
Q14191WRN828Werner syndrome helicase
P31749AKT1824Protein kinase B
P26358DNMT1822DNA methyltransferase 1
O95271TNKS821Tankyrase-1
Q14790CASP8820Caspase-8
Q01844EWS802RNA binding protein
Q9NWS1HNRNPUL1802hnRNP U-like
Q92851CASP10800Caspase-10
Q07817BCL2L1796BCL-xL
P78554CYP2C9794Cytochrome P450
Q460N5PARP14787PARP family
Q13485SMAD4784SMAD family
Q460N3PARP9784PARP family
O95863SNAI1783Snail
O14757CHEK1782Checkpoint kinase 1
P39748FEN1781Flap endonuclease
Q7Z2E3PARP16781PARP family
Q8IXQ6PARP15776PARP family
P04179SOD2774Superoxide dismutase
Protein Similarity ESM2 Structural/Embedding Similarity Total Similar Proteins: 123
UniProt IDTop SimilarityAverage Similarity
O755311.00000.9665
P499171.00000.9812
P612831.00000.9665
Q5R6L31.00000.9813
Q5RBU91.00000.9665
D4A4T90.99990.9813
P107110.99990.9832
Q8CDN60.99990.9705
Q920J40.99990.9709
P270080.99980.9834
Q4R7U20.99980.9810
Q5R7N20.99980.9760
Q6AYK60.99980.9673
Q127920.99980.9749
A9YUB10.99970.9809
G3GTP00.99970.9815
O350470.99970.9742
Q29RL20.99970.9805
O890440.99970.9770
O954330.99970.9738
DIAMOND Sequence Homology Total Homologous Proteins: 27
UniProt IDTop Identity (%)Top Bitscore
Q9R15295.41886
P2700895.41880
P1110394.51867
P0987492.41833
P1849388.91773
Q5RHR087.41446
P4991686.41720
P9738686.41722
O8855483.9958
Q9UGN583.9950
P2644680.01563
Q7EYV780.01576
Q9ZSV180.01575
Q3ULW879.8835
Q9Y6F179.8831
Q0JMY179.5805
Q5Z8Q979.5979
P3166977.01503
O5001776.6964
Q1120862.41228

Section 9: Transcription Factor Regulatory Data Downstream Targets (Genes Regulated BY PARP1) Total Target Gene Count: 159 (from CollecTRI) TOP 50 Target Genes

Target GeneRegulationConfidence
BRCA1-High
BRCA2RepressionHigh
BCL2RepressionHigh
BCL2L1-High
E2F1ActivationHigh
CDH1ActivationHigh
CDKN1AUnknownHigh
CDKN1B-High
CDKN2A-High
CCND1-High
MMP9-High
VIMActivationHigh
FN1UnknownHigh
ESR1-High
AR-High
ERBB2-High
CASP3-High
CASP4-High
DNMT1-High
OGG1-High
XRCC1-High
TOP2B-High
POLD1-High
SIRT1-High
PPARG-High
TNF-Low
IL2-Low
IFNGUnknownHigh
CXCL1UnknownHigh
CXCL10RepressionHigh
CXCL12UnknownHigh
NOS2-High
PTGS2UnknownHigh
ICAM1-High
VCAM1-High
MCL1-High
XIAP-High
HSPA1BRepressionHigh
HTR1ARepressionHigh
OPRM1RepressionHigh
SNCA-High
TGFBR1-High
TGFBR2ActivationHigh
SERPINE1Activation-
SERPINF1Repression-
THBDActivation-
SNAI1Repression-
FOXO1Repression-
JUNUnknown-
NFKB1Unknown-
Upstream Regulators (TFs that Regulate PARP1)
Transcription FactorRegulationConfidence
SP1-High
SP3-High
YY1UnknownHigh
MYCUnknownHigh
AR-High
NFATC2-High
TP53-Low
RELA-Low
NFKB-Low
ETS1ActivationLow
MYBL2-Low
EP300Unknown-
FLT3Activation-
PGRUnknown-
PAX3-Low
SIGNOR Regulatory Interactions Total SIGNOR Interactions: 44
Entity AEntity BEffectMechanism
CASP3PARP1down-regulates activitycleavage
CASP7PARP1down-regulatescleavage
MAPK1PARP1up-regulatesphosphorylation
CDK2PARP1up-regulates activityphosphorylation
CDK5PARP1down-regulates activityphosphorylation
EGFRPARP1up-regulates activityphosphorylation
METPARP1up-regulates activityphosphorylation
PRKDCPARP1down-regulates activityphosphorylation
ATRPARP1down-regulates activityphosphorylation
PRKACAPARP1up-regulates activityphosphorylation
PRKCDPARP1down-regulates activityphosphorylation
NAT10PARP1up-regulates quantityacetylation
CHFRPARP1down-regulates quantitypolyubiquitination
RNF144APARP1down-regulates quantityubiquitination
PHRF1PARP1down-regulates quantityubiquitination
DNA_damagePARP1up-regulates-
PARP1TP53up-regulates activityrelocalization
PARP1POLA1up-regulates activitybinding
PARP1KDM5Bup-regulates activityrelocalization
PARP1MRE11up-regulates activityrelocalization

Section 10: Drug & Pharmacology Data ChEMBL Target Information

ChEMBL Target IDNameType
CHEMBL3105Poly [ADP-ribose] polymerase 1SINGLE PROTEIN
CHEMBL3390820PARP 1, 2 and 3PROTEIN FAMILY
CHEMBL4742271Protein cereblon/PARP1PROTEIN-PROTEIN INTERACTION
CHEMBL4748223von Hippel-Lindau disease tumor suppressor/PARP1PROTEIN-PROTEIN INTERACTION
Targeting Molecules Total Compounds Tested: 5,836+ (ChEMBL activity records) FDA-Approved PARP Inhibitors (Phase 4)
ChEMBL IDDrug NameTypePhaseIndications
CHEMBL521686OLAPARIBSmall molecule4Ovarian, breast, pancreatic, prostate cancer
CHEMBL1094636NIRAPARIBSmall molecule4Ovarian cancer
CHEMBL1173055RUCAPARIBSmall molecule4Ovarian, prostate cancer
CHEMBL3833368RUCAPARIB CAMSYLATESmall molecule4(salt form)
CHEMBL3137320TALAZOPARIBSmall molecule4Breast cancer
CHEMBL629AMITRIPTYLINESmall molecule4(off-target activity)
Clinical Development Stage Compounds (Phase 3)
ChEMBL IDDrug NameTypePhase
CHEMBL506871VELIPARIBSmall molecule3
CHEMBL1170047INIPARIBSmall molecule3
CHEMBL3930624FLUZOPARIBSmall molecule3
CHEMBL4112930PAMIPARIBSmall molecule3
CHEMBL5095220SARUPARIBSmall molecule3
PharmGKB Information
PropertyValue
PharmGKB IDPA32
VIP Gene✓ Yes (Very Important Pharmacogene)
CPIC GuidelineNo
Chromosomechr1
Known Drug-Gene Interactions (SIGNOR)
DrugEffectMechanism
Olaparibdown-regulateschemical inhibition
Veliparibdown-regulateschemical inhibition
4-Iodo-3-nitrobenzamide (Iniparib)down-regulateschemical inhibition
A-966492down-regulateschemical inhibition
LSM-1988down-regulateschemical inhibition

Section 11: Expression Profiles Tissue Expression (Bgee)

MetricValue
Expression BreadthUbiquitous
Total Present Calls292
Total Absent Calls3
Total Conditions295
Max Expression Score98.57
Average Expression Score88.91
Gold Quality Conditions268
TOP 30 Tissues by Expression Score
Tissue (UBERON)ExpressionScoreQuality
Ventricular zonepresent98.57gold
Ganglionic eminencepresent98.30gold
Primordial germ cell in gonadpresent98.15gold
Embryopresent97.98gold
C1 segment of cervical spinal cordpresent97.23gold
Spinal cordpresent96.92gold
Lymph nodepresent96.73gold
Middle frontal gyruspresent96.46gold
Cortical platepresent96.28gold
Vermiform appendixpresent96.16gold
Male germ line stem cell in testispresent96.05gold
Corpus callosumpresent95.95gold
Apex of heartpresent95.72gold
Amygdalapresent95.59gold
Caecumpresent95.52gold
Islet of Langerhanspresent95.50gold
Heart left ventriclepresent95.40gold
Spleenpresent95.36gold
Cardiac ventriclepresent95.29gold
Prefrontal cortexpresent95.28gold
Body of pancreaspresent95.27gold
Putamenpresent95.06gold
Inferior vagus X ganglionpresent95.01gold
Caudate nucleuspresent94.82gold
Adenohypophysispresent94.80gold
Pancreaspresent94.72gold
Bone marrow cellpresent94.64gold
Tonsilpresent94.52gold
Pituitary glandpresent94.52gold
Right frontal lobepresent94.52gold
Expression Pattern: PARP1 shows ubiquitous expression across virtually all human tissues, with highest expression in developing neural tissues, germline cells, and immune tissues. Single-Cell Expression Data (Expression Atlas) Total Single-Cell Datasets: 11
Experiment IDDescriptionCell Count
E-CURD-112Fetal bone marrow haematopoiesis (10X)56,592
E-CURD-77COVID-19 convalescent patient lymphocytes62,076
E-GEOD-114530Human fetal kidneys22,148
E-GEOD-124263Neonatal and adult human testis64,206
E-GEOD-139324Head and neck cancer immune landscape204,315
E-GEOD-76312CML cancer stem cells during disease2,151
E-HCAD-23First-trimester placenta and decidua41,132
E-HCAD-56Developing human spinal cord245,394
E-MTAB-6701First trimester fetal-maternal interface135,071
E-MTAB-7008iPSC endoderm differentiation1,024
E-MTAB-9689RPE1-MYCN-ER cells with MYCN activation10,476

Section 12: Disease Associations Mendelian Disease Links (GenCC)

DiseaseClassificationInheritanceSubmitter
Neurodevelopmental disorderLimitedAutosomal recessiveG2P
GWAS Associations Total GWAS Associations: 10
Study IDDisease/TraitP-valueMapped Gene
GCST010303_59Nevus count or cutaneous melanoma2×10⁻¹⁷PARP1
GCST008366_15Leukocyte telomere length2×10⁻¹⁸PARP1
GCST008366_5Leukocyte telomere length2×10⁻¹⁶PARP1
GCST007505_16Nevus count or cutaneous melanoma1×10⁻¹¹PARP1 - RPS3AP7
GCST009856_26Leukocyte telomere length1×10⁻¹⁰PARP1
GCST005991_6Platelet count2×10⁻⁸PARP1
GCST001266_2Melanoma9×10⁻⁸PARP1
GCST005143_1Telomere length2×10⁻⁶CDKN2AIPNLP1 - STUM
GCST005194_33Coronary artery disease7×10⁻⁶YBX1P9 - PARP1
GCST008394_1Mild to moderate CKD1×10⁻⁶YBX1P9 - PARP1
HPO Phenotype Associations No direct HPO phenotype associations found. Cancer Relevance PARP1 is a major therapeutic target in cancer treatment due to:
  1. Synthetic lethality with BRCA1/2 mutations
  2. Role in DNA damage repair pathways
  3. Involvement in genomic stability maintenance PARP inhibitors are approved for treatment of:
  • Ovarian cancer (BRCA-mutated and HRD-positive)
  • Breast cancer (BRCA-mutated)
  • Pancreatic cancer (BRCA-mutated)
  • Prostate cancer (HRR gene-mutated)

DATA SOURCES

DatabaseDescription
HGNCHUGO Gene Nomenclature Committee
EnsemblGenome browser and annotation
NCBI EntrezGene database
OMIMOnline Mendelian Inheritance in Man
UniProtProtein sequence and function
InterProProtein domains and families
PDBProtein Data Bank structures
AlphaFoldAI-predicted structures
ClinVarClinical variant interpretations
AlphaMissenseAI missense pathogenicity predictions
SpliceAIAI splice variant predictions
ReactomeBiological pathways
Gene OntologyFunctional annotations
STRINGProtein interactions
IntActMolecular interactions
BioGRIDBiological interactions
SIGNORSignaling networks
ChEMBLBioactive molecules
PharmGKBPharmacogenomics
BgeeGene expression
Expression AtlasSingle-cell expression
GWAS CatalogGenome-wide associations
GenCCGene-disease curation
CollecTRITranscription factor targets

Reference generated: March 2026 Data retrieved via biobtree integrated biological database search I have compiled a comprehensive cross-database identifier and functional mapping reference for human PARP1 (Poly(ADP-ribose) Polymerase 1). Summary of Key Findings

CategoryKey Data
Gene IDsHGNC:270, ENSG00000143799, Entrez:142, OMIM:173870
LocationChr 1q42.12: 226,360,210-226,408,154 (minus strand)
Transcripts37 total (13 protein-coding), MANE Select: NM_001618
ProteinUniProt P09874, 1,014 aa, 113 kDa
Domains14 InterPro (BRCT, Zn-finger PARP, WGR, catalytic)
Structures155 PDB (mostly X-ray with inhibitors)
Clinical Variants115 ClinVar entries (no pathogenic reported)
AI Predictions6,722 AlphaMissense, 3,166 SpliceAI
Pathways8 Reactome (DNA repair, NER, base excision)
GO Terms102 total annotations
Interactions5,700+ STRING, 541+ IntAct
TF Targets159 downstream genes regulated
Approved DrugsOlaparib, Niraparib, Rucaparib, Talazoparib
ExpressionUbiquitous (292/295 tissues positive)
GWASTelomere length, melanoma, platelet count
PARP1 is one of the most extensively characterized drug targets, with 5 FDA-approved PARP inhibitors used in cancer therapy exploiting synthetic lethality with BRCA mutations.