CASP3 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human CASP3. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human CASP3. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 33 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, bgee_evidence, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, go, gwas, hgnc, hpo, intact, interpro, mim, ortholog, paralog, pdb, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-04-02 — For the latest data, query BioBTree directly via MCP or API.
View API calls (45)
CASP3

Primary Identifiers

DatabaseIdentifierName
HGNCHGNC:1504caspase 3
HGNC SymbolCASP3Approved symbol
Ensembl GeneENSG00000164305CASP3
NCBI Entrez Gene836caspase 3
OMIM600636Gene/locus entry
PharmGKBPA26087VIP gene
Gene Aliases
  • CPP32, CPP32B, Yama, apopain, SCA-1 Previous Names

  • caspase 3, apoptosis-related cysteine protease

  • caspase 3, apoptosis-related cysteine peptidase Genomic Location

AttributeValue
Chromosome4
Cytogenetic Band4q35.1
Start Position (GRCh38)184,627,695
End Position (GRCh38)184,650,418
StrandMinus (-)
Genomic AccessionNC_000004.12
Gene Classification
- Locus Type- Locus Group- Gene Groups- Status
Gene with protein productProtein-coding geneCaspases; Small nucleolar RNA protein coding host genesApproved
Transcript Identifiers**
Ensembl Transcripts Total Count: 32 transcripts
Transcript IDBiotypeStartEndStatus
ENST00000308394protein_coding184,627,696184,649,447Canonical
ENST00000393585protein_coding184,627,696184,648,592-
ENST00000393588protein_coding184,629,384184,649,455-
ENST00000447121protein_coding184,631,896184,649,447-
ENST00000517513protein_coding184,629,256184,649,447-
ENST00000523916protein_coding184,627,696184,649,509-
ENST00000700100protein_coding184,627,698184,648,879-
ENST00000700101protein_coding184,627,698184,650,062-
ENST00000700102retained_intron184,627,707184,649,447-
ENST00000700103retained_intron184,627,715184,649,447-
ENST00000700104nonsense_mediated_decay184,627,715184,649,447-
ENST00000872339protein_coding184,627,698184,649,447-
ENST00000872340protein_coding184,627,699184,649,444-
ENST00000872341protein_coding184,627,703184,649,444-
ENST00000872342protein_coding184,627,695184,649,405-
ENST00000872343protein_coding184,627,699184,649,312-
ENST00000872344protein_coding184,627,698184,649,273-
ENST00000872345protein_coding184,627,703184,649,228-
ENST00000872346protein_coding184,628,721184,648,914-
ENST00000872347protein_coding184,627,696184,638,993-
ENST00000939939protein_coding184,627,696184,650,418-
ENST00000939940protein_coding184,627,696184,649,447-
ENST00000939941protein_coding184,627,696184,649,447-
ENST00000939942protein_coding184,627,696184,649,444-
ENST00000939943protein_coding184,627,699184,649,443-
ENST00000939944protein_coding184,627,704184,649,443-
ENST00000939945protein_coding184,627,705184,649,444-
ENST00000939946protein_coding184,628,470184,649,444-
ENST00000972040protein_coding184,627,704184,649,495-
ENST00000972041protein_coding184,627,703184,649,446-
ENST00000972042protein_coding184,627,703184,649,444-
ENST00000972043protein_coding184,627,705184,649,390-
Biotype Summary: 29 protein_coding, 2 retained_intron, 1 nonsense_mediated_decay RefSeq Transcripts (Human) mRNA Accessions:
AccessionStatusMANE Select
NM_004346REVIEWED✓ Yes (Canonical)
NM_032991REVIEWEDNo
NM_001354777REVIEWEDNo
NM_001354779REVIEWEDNo
NM_001354780REVIEWEDNo
NM_001354781REVIEWEDNo
NM_001354782REVIEWEDNo
NM_001354783REVIEWEDNo
NM_001354784REVIEWEDNo
NM_001440946REVIEWEDNo
CCDS Identifiers
CCDS IDStatus
CCDS3836.1Consensus
Exons for Canonical Transcript (ENST00000308394) Total Exon Count: 8
Exon IDStartEndLength
ENSE00001592717184,649,395184,649,44753 bp
ENSE00001215704184,648,465184,648,631167 bp
ENSE00002443273184,638,401184,638,46868 bp
ENSE00001083265184,635,294184,635,418125 bp
ENSE00001083266184,632,268184,632,396129 bp
ENSE00001083271184,631,765184,631,940176 bp
ENSE00001083268184,631,037184,631,157121 bp
ENSE00003514002184,627,696184,629,5011,806 bp

Section 3: Protein Identifiers UniProt Accessions

AccessionNameStatusLengthMass
P42574Caspase-3✓ Reviewed (Swiss-Prot)277 aa31,608 Da
Alternative Names:
  • Apopain
  • Cysteine protease CPP32
  • Protein Yama
  • SREBP cleavage activity 1 RefSeq Protein Accessions (Human)
AccessionStatusMANE Select
NP_004337REVIEWED✓ Yes
NP_116786REVIEWEDNo
NP_001341706REVIEWEDNo
NP_001341708REVIEWEDNo
NP_001341709REVIEWEDNo
NP_001341710REVIEWEDNo
NP_001341711REVIEWEDNo
NP_001341712REVIEWEDNo
NP_001341713REVIEWEDNo
NP_001427875REVIEWEDNo
Protein Domains and Families (InterPro) Total Count: 8 entries
InterPro IDNameType
IPR001309Pept_C14_p20Domain
IPR002138Pept_C14_p10Domain
IPR002398Pept_C14Family
IPR011600Pept_C14_caspaseDomain
IPR015917Pept_C14ADomain
IPR016129Caspase_his_ASActive_site
IPR029030Caspase-like_dom_sfHomologous_superfamily
IPR033139Caspase_cys_ASActive_site

Section 4: Structure Identifiers Experimental Structures (PDB) Total Count: 135+ structures

PDB IDMethodResolutionDescription
2DKOX-ray1.06 ÅHighest resolution - with inhibitor
2J32X-ray1.30 ÅLoop bundle hydrogen bonds study
5IBPX-ray1.38 ÅV266M mutant
2J30X-ray1.40 ÅMaturation and activity study
4JJEX-ray1.48 ÅUnnatural amino acid peptides
4QUJX-ray1.50 ÅT140G/V266H mutant
3PCXX-ray1.50 ÅE246A/K242A double mutant
2J31X-ray1.50 ÅMaturation study
6CKZX-ray1.50 ÅWith Ac-DW3-KE inhibitor
6BFJX-ray1.54 ÅT245D/S249D mutant
5IASX-ray1.54 ÅV266Y mutant
2XYGX-ray1.54 ÅWith CAS329306 inhibitor
5IAEX-ray1.55 ÅV266F mutant
4QU9X-ray1.56 ÅF128A mutant
1NMEX-ray1.60 ÅWith tethered salicylate
6BGSX-ray1.60 ÅS150Y mutant
4QTYX-ray1.60 ÅE190A mutant
3PD1X-ray1.62 ÅK242A mutant
4PS0X-ray1.63 ÅWith unnatural amino acid peptides
3ITNX-ray1.63 ÅPseudo-activated procaspase-3
5IBCX-ray1.66 ÅV266I mutant
2CJYX-ray1.67 ÅExtended substrate recognition
2CNLX-ray1.67 ÅWith aza-peptide epoxide inhibitor
2H5IX-ray1.69 ÅWith Ac-DEVD-Cho inhibitor
4EHNX-ray1.69 ÅAllosteric modulation study
4EHAX-ray1.70 ÅAllosteric modulation study
2CDRX-ray1.70 ÅWith aza-peptide epoxide inhibitor
2CJXX-ray1.70 ÅExtended substrate recognition
2CNNX-ray1.70 ÅWith aza-peptide epoxide inhibitor
4DCJX-ray1.70 ÅL168D mutant
4DCOX-ray1.70 ÅL168Y mutant
4DCPX-ray1.70 ÅL168F mutant
4PRYX-ray1.70 ÅWith unnatural amino acid peptides
1NMSX-ray1.70 ÅWith irreversible inhibitor
5IBRX-ray1.74 ÅV266K mutant
2CNKX-ray1.75 ÅWith aza-peptide epoxide inhibitor
5IARX-ray1.76 ÅV266W mutant
2C1EX-ray1.77 ÅWith Michael acceptor inhibitor
4EHHX-ray1.78 ÅAllosteric modulation study
5IABX-ray1.79 ÅV266D mutant
5I9BX-ray1.80 ÅV266A mutant
5IAKX-ray1.82 ÅV266S mutant
2C2KX-ray1.87 ÅWith Michael acceptor inhibitor
6BFLX-ray1.87 ÅD9A/D28A/T245D mutant
6BG4X-ray1.87 ÅT152D mutant
6BGKX-ray1.87 ÅD9A/D28A/T152D mutant
2XYHX-ray1.89 ÅWith CAS60254719 inhibitor
4QUAX-ray1.89 ÅY195F mutant
1QX3X-ray1.90 ÅUnliganded conformational study
3GJSX-ray1.90 ÅDiverse P4 residue binding
All 135 structures determined by X-ray crystallography Predicted Structures (AlphaFold)
Model IDConfidence (pLDDT)Sequence LengthHigh Confidence Fraction
AF-P42574-F186.642,21477% (pLDDT > 90)

Section 5: Cross-Species Orthologs

OrganismGene IDSymbolType
Mouse (Mus musculus)ENSMUSG00000031628Casp3protein_coding
Rat (Rattus norvegicus)ENSRNOG00000010475Casp3protein_coding
Zebrafish (Danio rerio)ENSDARG00000017905casp3aprotein_coding
Zebrafish (Danio rerio)ENSDARG00000055045casp3bprotein_coding
Fruit fly (D. melanogaster)FBGN0010501Dcp-1protein_coding
Fruit fly (D. melanogaster)FBGN0019972Driceprotein_coding
Worm (C. elegans)WBGENE00000821csp-3protein_coding
Worm (C. elegans)WBGENE00000417ced-3-
Worm (C. elegans)WBGENE00000820csp-2-

Reference generated from BioBTree integrated biological database. Data current as of database build date. I’ve compiled a comprehensive cross-database identifier and functional mapping reference for human CASP3 (Caspase-3). Key highlights: Gene Level:

  • HGNC:1504, ENSG00000164305, Entrez 836, OMIM 600636
  • Located at chromosome 4q35.1 (184,627,695-184,650,418, minus strand) Transcripts: 32 Ensembl transcripts, 10+ RefSeq mRNAs with NM_004346 as MANE Select canonical Protein: UniProt P42574 (277 aa, 31.6 kDa), containing caspase-specific domains (p20/p10 subunits) Structures: 135+ PDB structures (highest resolution: 1.06 Å), plus AlphaFold model with 86.64 pLDDT Clinical Variants: 27 ClinVar entries (1 pathogenic CNV), 1,335 SpliceAI predictions, 1,866 AlphaMissense predictions with many high-confidence pathogenic calls in catalytic residues Pathways: 17 Reactome pathways centered on apoptosis, pyroptosis, and cell death signaling Interactions: 8,154+ STRING interactors, 143 curated IntAct interactions - key partners include XIAP, APAF1, CASP8/9, BCL2, PARP1, TP53 Drug Targets: 5 molecules in clinical development (Diacerein Phase 3, Emricasan/Pralnacasan Phase 2, PAC-1 Phase 1) Expression: Ubiquitously expressed (245/277 tissues positive), highest in intestinal mucosa, immune cells, and nervous system Disease: Strong GWAS association with Kawasaki disease (p=1×10⁻¹⁰); functional roles in neurodegeneration, cancer, and inflammatory diseases