APP Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human APP — a definitive lookup resource covering: ### Section …

Provide a comprehensive cross-database identifier and functional mapping reference for human APP — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene APP, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene APP, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene APP protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene APP protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene APP, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene APP, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene APP, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene APP protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene APP, summarize transcription factor regulatory data. If APP is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate APP — names with evidence type (ChIP-seq / predicted / experimentally validated) If APP is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene APP protein as a drug target, summarize pharmacology data. If APP is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If APP is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene APP, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene APP, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in APP: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations

APP

Executive summary

APP (Amyloid Precursor Protein; HGNC:620, chromosome 21) is the central molecular actor in Alzheimer’s disease: proteolytic cleavage of its 770-amino-acid ectodomain by BACE1 and γ-secretase generates the amyloid-beta peptides that aggregate into the plaques defining the disease. It is ubiquitously expressed (average Bgee score 97.15/100), with highest levels across cortical and subcortical brain regions. Pathogenic and likely pathogenic variants in ClinVar (~27 entries) cluster tightly around codons 670–720 — the β- and γ-secretase cleavage region — causing autosomal dominant Alzheimer’s disease (type 1 and 3) and cerebral amyloid angiopathy; duplication of the APP locus is itself pathogenic. The protein is structurally exceptionally well-characterized, with 248 experimental PDB entries including cryo-EM fibril structures from patient-derived brain tissue. Despite 100 molecules in ChEMBL targeting APP, only four have reached Phase 4, all amyloid-β PET imaging tracers; no therapeutic agent has demonstrated robust clinical efficacy against the target itself.

APP — Reference

Cross-database identifier and functional mapping reference for APP.

Gene identifiers

  • HGNC ID: HGNC:620
  • Approved symbol: APP
  • Ensembl gene ID: ENSG00000142192
  • NCBI Entrez Gene ID: 351
  • OMIM locus ID: 104760
  • Genomic location (GRCh38):
    • Chromosome: 21
    • Start position: 25,880,535
    • End position: 26,171,128
    • Strand: − (minus)

Transcript identifiers

Ensembl Transcripts

Total: 40 transcripts

Transcript IDBiotype
ENST00000346798protein_coding
ENST00000348990protein_coding
ENST00000354192protein_coding
ENST00000357903protein_coding
ENST00000358918protein_coding
ENST00000359726protein_coding
ENST00000415997protein_coding
ENST00000439274protein_coding
ENST00000440126protein_coding
ENST00000448850protein_coding
ENST00000462267retained_intron
ENST00000463070protein_coding_CDS_not_defined
ENST00000464867retained_intron
ENST00000466453protein_coding_CDS_not_defined
ENST00000474136protein_coding_CDS_not_defined
ENST00000491395protein_coding_CDS_not_defined
ENST00000548570protein_coding_CDS_not_defined
ENST00000707132protein_coding_CDS_not_defined
ENST00000707133protein_coding_CDS_not_defined
ENST00000707134protein_coding_CDS_not_defined
ENST00000872524protein_coding
ENST00000872525protein_coding
ENST00000872526protein_coding
ENST00000872527protein_coding
ENST00000872528protein_coding
ENST00000872529protein_coding
ENST00000872530protein_coding
ENST00000872531protein_coding
ENST00000872532protein_coding
ENST00000872533protein_coding
ENST00000872534protein_coding
ENST00000872535protein_coding
ENST00000872536protein_coding
ENST00000872537protein_coding
ENST00000872538protein_coding
ENST00000932837protein_coding
ENST00000932838protein_coding
ENST00000965079protein_coding
ENST00000965080protein_coding
ENST00000965081protein_coding

RefSeq mRNA (NM_ accessions - human, chromosome 21)

Total: 11 human mRNA accessions

AccessionMANE Select
NM_000484
NM_001136016
NM_001136129
NM_001136130
NM_001136131
NM_001204301
NM_001204302
NM_001204303
NM_001385253
NM_201413
NM_201414

CCDS Identifiers

Total: 8 CCDS IDs

  • CCDS13576
  • CCDS13577
  • CCDS33523
  • CCDS46638
  • CCDS46639
  • CCDS56211
  • CCDS56212
  • CCDS56213

Canonical Transcript: ENST00000346798 (MANE Select: NM_000484)

Total exons: 18

Exon IDStartEndStrandLength (bp)
ENSE000039982972588055025881771-1221
ENSE000039982982602184026022042-202
ENSE000039982992608994326090072-129
ENSE000039982912611197926112146-167
ENSE000039982942598234425982477-133
ENSE000039982932597595425976028-74
ENSE000039982902597507025975228-158
ENSE000039982882595459025954689-99
ENSE000039982872591174125911962-221
ENSE000039982922590502425905077-53
ENSE000039982962589757325897673-100
ENSE000039982862589172225891868-146
ENSE000039982852605100026051193-193
ENSE000039982822605323626053348-112
ENSE000013093222599736025997416-56
ENSE000036917232600001526000182-167
ENSE000039982792595562725955755-128
ENSE000038454662617056426170770-206

Protein identifiers

UniProt Accessions

  • P05067 (reviewed, canonical entry)

RefSeq Protein Accessions (NP_)

MANE Select (main reference):

  • NP_000475

Transcript variants:

  • NP_001129488
  • NP_001129601
  • NP_001129602
  • NP_001129603
  • NP_001137936
  • NP_001137937
  • NP_001137938
  • NP_001137939
  • NP_001185752
  • NP_001185753
  • NP_001185754
  • NP_001185755
  • NP_001191230
  • NP_001191231
  • NP_001191232
  • NP_001246731
  • NP_001246732
  • NP_001246733
  • NP_001287043
  • NP_001372182
  • NP_031497
  • NP_062161
  • NP_648561
  • NP_958816
  • NP_958817
  • NP_996065

Protein Domains and Families

InterPro (IPR):

IDNameType
IPR002223Kunitz_BPTIDomain
IPR008154Amyloid_glyco_extraDomain
IPR008155Amyloid_glycoFamily
IPR011178Amyloid_glyco_Cu-bdDomain
IPR011993PH-like_dom_sfHomologous_superfamily
IPR013803Amyloid_glyco_AbetaDomain
IPR015849Amyloid_glyco_heparin-bdDomain
IPR019543APP_amyloid_CDomain
IPR019744APP_CUBD_CSConserved_site
IPR019745Amyloid_glyco_intracell_CSConserved_site
IPR020901Prtase_inh_Kunz-CSConserved_site
IPR024329Amyloid_glyco_E2_domainDomain
IPR036176E2_sfHomologous_superfamily
IPR036454Amyloid_glyco_heparin-bd_sfHomologous_superfamily
IPR036669Amyloid_Cu-bd_sfHomologous_superfamily
IPR036880Kunitz_BPTI_sfHomologous_superfamily
IPR037071Amyloid_glyco_Abeta_sfHomologous_superfamily

Pfam (PF):

  • PF00014
  • PF02177
  • PF03494
  • PF10515
  • PF12924
  • PF12925

SMART (SM):

  • SM00006
  • SM00131

CATH/Gene3D:

  • 1.20.120.770
  • 2.30.29.30
  • 3.30.1490.140
  • 3.90.570.10
  • 4.10.230.10
  • 4.10.410.10

Antibody Resources

No antibody resources mapped in biobtree for human APP protein.

Structure

Experimental Structures

Total PDB entries: 248

X-ray Crystallography (predominant method)

Including structures with resolutions ranging from 0.85 Å (2FMA - highest resolution) to 3.5007 Å. Notable entries include:

  • 2FMA: 0.85 Å (APP copper binding domain)
  • 6NB9: 1.05 Å (Amyloid-Beta 20-34 with L-isoaspartate 23)
  • 3JQL: 1.2 Å (Phospholipase A2 complex)
  • Multiple Fab-antibody complexes (resolution 1.5-2.95 Å)
  • APP domain structures (E1, E2, copper binding domain, intracellular domain)
  • Amyloid-beta peptide fragments with various binding partners

Solution NMR

Full-length and fragment structures including:

  • Amyloid-beta peptide (1-28, 1-40, 1-42, fragments 17-34, 25-35)
  • APP copper binding domain
  • APP transmembrane domain (WT and mutants: V44M, G38L, G38P, I45T)
  • APP intracellular domain
  • Complex structures with polyphenols and metal ions

Solid-State NMR

Fibril structures including:

  • 40-residue and 42-residue beta-amyloid fibrils
  • D23N “Iowa” mutant fibril
  • Osaka mutation fibril
  • S8 phosphorylated Abeta40
  • Mixed Abeta(1-40) and Abeta(1-42) fibrils

Cryo-EM / Electron Microscopy

High-resolution fibril structures (2.4-7.84 Å), including:

  • Wild-type and mutant Abeta(1-42) fibrils from human brain
  • Abeta(1-40) fibrils (multiple morphologies)
  • APP-C99 and gamma-secretase complex
  • CAA patient-derived fibrils (Arctic E22G, Dutch-type mutations)
  • Down syndrome-associated fibrils
  • Transgenic mouse models (APP23, APP/PS1, ARTE10, 5xFAD, tgAPPSwe)
  • Montreal/Tottori mutations

Electron Crystallography (MicroED)

Atomic-resolution peptide structures (1.05-1.402 Å):

  • Amyloid-beta peptide fragments
  • D23N Iowa mutation variant

Predicted Structures

AlphaFold Model ID: AF-P05067-F1 (Version 4)

Confidence Metrics (pLDDT-based):

  • Global pLDDT: 68.11
  • Very High confidence (pLDDT ≥ 90): 30.1% of sequence
  • Confident (pLDDT 70-90): 26.6% of sequence
  • Low (pLDDT 50-70): 8.6% of sequence
  • Very Low (pLDDT < 50): 34.6% of sequence

PAE Metrics:

  • Mean PAE: 25.19
  • Max PAE: 31.75
  • Fraction PAE ≤ 5 (high confidence): 5.4%

Sequence: 770 amino acids (full-length protein)

Cross-species orthologs

OrganismGene IDGene Symbol
Mouse (Mus musculus)ENSMUSG00000022892App
Rat (Rattus norvegicus)ENSRNOG00000006997App
Zebrafish (Danio rerio)ENSDARG00000055543, ENSDARG00000104279appb, appa
Fruit fly (Drosophila melanogaster)FBGN0000108Appl
Worm (C. elegans)WBGENE00000149apl-1
Yeast (S. cerevisiae)none

Clinical variants & AI predictions

ClinVar Summary

ClassificationCount
Pathogenic~21
Likely Pathogenic~6
Uncertain Significance~350
Likely Benign~150
Benign~50
Conflicting/Other~16
Total~593

TOP 30 Pathogenic/Likely Pathogenic Variants

Variant IDHGVS NotationClassificationAssociated Condition
18088NM_000484.4(APP):c.2149G>Ap.Val717IlePathogenic; Alzheimer’s disease-3
18089NM_000484.4(APP):c.2149G>Tp.Val717PhePathogenic; Alzheimer’s disease-3
18105NM_000484.4(APP):c.2149G>Cp.Val717LeuPathogenic; Alzheimer’s disease-3
18090NM_000484.4(APP):c.2150T>Gp.Val717GlyPathogenic; Alzheimer’s disease-3
18087NM_000484.4(APP):c.2077G>Cp.Glu693GlnPathogenic; Alzheimer’s disease-1
18098NM_000484.4(APP):c.2078A>Gp.Glu693GlyPathogenic; Alzheimer’s disease-1
18099NM_000484.4(APP):c.2077G>Ap.Glu693LysPathogenic; Alzheimer’s disease-1
18100NM_000484.4(APP):c.2141C>Tp.Thr714IlePathogenic; Alzheimer’s disease-3
18102NM_000484.4(APP):c.2140A>Gp.Thr714AlaPathogenic; Alzheimer’s disease-3
18101NM_000484.4(APP):c.2080G>Ap.Asp694AsnPathogenic; Alzheimer’s disease-1
18097NM_000484.4(APP):c.2143G>Ap.Val715MetPathogenic/Likely pathogenic; Alzheimer’s disease-3
18096NM_000484.4(APP):c.2146A>Gp.Ile716ValPathogenic; Alzheimer’s disease-3
1457308NM_000484.4(APP):c.2145_2146delinsTGp.Ile716ValPathogenic; Alzheimer’s disease-3
2498894NM_000484.4(APP):c.2146A>Tp.Ile716PhePathogenic; Alzheimer’s disease-3
18091NM_000484.4(APP):c.2075C>Gp.Ala692GlyPathogenic; Alzheimer’s disease-1
18106NM_000484.4(APP):c.2018C>Tp.Ala673ValPathogenic; Alzheimer’s disease-1
18093NM_000484.4(APP):c.2010_2011invp.Lys670_Met671delinsAsnLeuPathogenic; Alzheimer’s disease-1
18094NM_000484.4(APP):c.2137G>Ap.Ala713ThrLikely pathogenic; Alzheimer’s disease-3
18103NM_000484.4(APP):c.2113C>Gp.Leu705ValLikely pathogenic; Alzheimer’s disease-3
1298879NM_000484.4(APP):c.2148C>Gp.Ile716MetLikely pathogenic; Alzheimer’s disease-3
1342870NM_000484.4(APP):c.2155A>Cp.Thr719ProLikely pathogenic; Alzheimer’s disease-3
2628367NM_000484.4(APP):c.2061A>Cp.Lys687AsnLikely pathogenic; Alzheimer’s disease-1
3382968NM_000484.4(APP):c.2059A>Cp.Lys687GlnLikely pathogenic; Alzheimer’s disease-1
1526702GRCh37/hg19 21q11.2-21.3Copy number gainPathogenic; Alzheimer’s disease
2423147GRCh37/hg19 21q21.3DuplicationPathogenic; Alzheimer’s disease
3338917GRCh37/hg19 21q21.3DuplicationPathogenic; Alzheimer’s disease
1807236Single alleleDuplicationPathogenic; Alzheimer’s disease

AlphaMissense Pathogenicity Predictions

Total variants with predictions: ~10,000+
Likely pathogenic (filtered): ~5,000+

TOP 30 Likely Pathogenic Missense Variants (by am_pathogenicity score)

Protein Variantam_pathogenicityPosition
P760Q1.000~755-760 (near β-secretase cleavage)
N759K1.000~755-760
E766D0.991~760-770
M768T0.897~760-770
Q767H0.884~760-770
N770K0.879~770
K763N0.998~760-770
F764L0.999~760-770
F764S0.998~760-770
F765L0.989~760-770
F765C0.998~760-770
P760L0.999~755-760
P760R0.999~755-760
Y762H0.999~760-770
Y762C0.998~760-770
Y757H0.999~750-760
Y757C0.998~750-760
Y757D0.998~750-760
E758G0.993~750-760
E758V0.998~750-760
E766K0.993~760-770
T761I0.996~760
N770I0.853~770
M768I0.816~765-770
M768R0.853~765-770
Q769L0.598~765-770
Q767R0.760~765-770
Q767P0.979~765-770
N759D0.999~755-760
N759S0.995~755-760

SpliceAI Splice Effect Predictions

Total variants with splice predictions: ~4,341

Breakdown by effect type:

  • Acceptor gain: ~1,500+
  • Donor gain: ~1,500+
  • Acceptor loss: ~200+
  • Donor loss: ~140+

TOP 30 High-Confidence Splice Effect Variants (score ≥0.90)

Genomic PositionEffect TypeScoreDetails
21:25881767acceptor_gain1.0000Strong acceptor creation
21:25881768acceptor_gain1.0000Strong acceptor creation
21:25881769acceptor_gain1.0000Strong acceptor creation
21:25881770acceptor_gain1.0000Strong acceptor creation
21:25881771acceptor_gain1.0000Strong acceptor creation
21:25881772acceptor_gain0.9900Near-perfect acceptor
21:25881773acceptor_loss1.0000Loss of splice acceptor
21:25881780acceptor_gain0.9800Strong acceptor creation
21:25881781acceptor_gain0.9400Strong acceptor
21:25881862donor_gain0.9000Donor site creation
21:25881863donor_gain0.9300Strong donor creation
21:25881863donor_gain0.9200Strong donor creation
21:25881942donor_gain0.9100Strong donor creation
21:25881945donor_gain0.9100Strong donor creation
21:25881911donor_gain0.8600Donor site creation
21:25881769acceptor_gain0.8500Strong acceptor
21:25881774acceptor_gain0.8500Strong acceptor
21:25881775acceptor_gain0.8500Strong acceptor
21:25881937donor_gain0.6900Moderate donor
21:25881939donor_gain0.7300Moderate-strong donor
21:25881780acceptor_gain0.7600Strong acceptor
21:25881944donor_gain0.6800Moderate donor
21:25881859donor_gain0.6200Moderate donor
21:25881858donor_gain0.7200Moderate donor
21:25881903donor_gain0.6100Moderate donor
21:25881858donor_gain0.6900Moderate donor
21:25881770acceptor_gain0.9100Strong acceptor
21:25881772acceptor_gain0.9000Strong acceptor
21:25881769acceptor_gain0.9100Strong acceptor
21:25881704acceptor_gain0.6400Moderate acceptor

Key observations:

  • APP variants cluster around codons 690-720 (β-secretase and γ-secretase cleavage region)
  • Pathogenic missense variants predominantly locate in C-terminal domain
  • High-confidence splice predictions concentrated in exon/intron boundaries
  • Strong correlation between predicted pathogenicity and known Alzheimer’s disease-associated mutations

Pathways & Gene Ontology

Reactome Pathways

Total: 17 pathways

Pathway IDPathway Name
R-HSA-114608Platelet degranulation
R-HSA-3000178ECM proteoglycans
R-HSA-381426Regulation of Insulin-like Growth Factor (IGF) transport and uptake by Insulin-like Growth Factor Binding Proteins (IGFBPs)
R-HSA-416476G alpha (q) signalling events
R-HSA-418594G alpha (i) signalling events
R-HSA-432720Lysosome Vesicle Biogenesis
R-HSA-444473Formyl peptide receptors bind formyl peptides and many other ligands
R-HSA-445989TAK1-dependent IKK and NF-kappa-B activation
R-HSA-844456The NLRP3 inflammasome
R-HSA-879415Advanced glycosylation endproduct receptor signaling
R-HSA-8862803Deregulated CDK5 triggers multiple neurodegenerative pathways in Alzheimer’s disease models
R-HSA-8957275Post-translational protein phosphorylation
R-HSA-933542TRAF6 mediated NF-kB activation
R-HSA-9609523Insertion of tail-anchored proteins into the endoplasmic reticulum membrane
R-HSA-9660826Purinergic signaling in leishmaniasis infection
R-HSA-977225Amyloid fiber formation
R-HSA-9837999Mitochondrial protein degradation

Gene Ontology Annotations

Biological Process: 71 terms

Top 20:

GO IDTerm
GO:0001774microglial cell activation
GO:0002265astrocyte activation involved in immune response
GO:0006417regulation of translation
GO:0006816calcium ion transport
GO:0006878intracellular copper ion homeostasis
GO:0006897endocytosis
GO:0007155cell adhesion
GO:0007219Notch signaling pathway
GO:0007409axonogenesis
GO:0007417central nervous system development
GO:0007611learning or memory
GO:0007612learning
GO:0007617mating behavior
GO:0007626locomotory behavior
GO:0008088axo-dendritic transport
GO:0008285negative regulation of cell population proliferation
GO:0008344adult locomotory behavior
GO:0008542visual learning
GO:0010288response to lead ion
GO:0010468regulation of gene expression

Molecular Function: 13 terms

GO IDTerm
GO:0003677DNA binding
GO:0004867serine-type endopeptidase inhibitor activity
GO:0005102signaling receptor binding
GO:0008201heparin binding
GO:0016504peptidase activator activity
GO:0019899enzyme binding
GO:0030546signaling receptor activator activity
GO:0042802identical protein binding
GO:0046914transition metal ion binding
GO:0048018receptor ligand activity
GO:0051425PTB domain binding
GO:0070851growth factor receptor binding
GO:0120283protein serine/threonine kinase binding

Cellular Component: 38 terms

Top 20:

GO IDTerm
GO:0005576extracellular region
GO:0005615extracellular space
GO:0005641nuclear envelope lumen
GO:0005737cytoplasm
GO:0005743mitochondrial inner membrane
GO:0005768endosome
GO:0005769early endosome
GO:0005783endoplasmic reticulum
GO:0005788endoplasmic reticulum lumen
GO:0005794Golgi apparatus
GO:0005796Golgi lumen
GO:0005798Golgi-associated vesicle
GO:0005829cytosol
GO:0005886plasma membrane
GO:0005905clathrin-coated pit
GO:0009986cell surface
GO:0016020membrane
GO:0030424axon
GO:0030425dendrite
GO:0031093platelet alpha granule lumen

Protein interactions & networks

Protein-protein interactions

Interaction databases:

  • STRING: ~7,062 total interactions (experimental, database, and predicted)
  • BioGRID: ~2,961 total interactions (experimental systems documented)

TOP 30 highest-confidence STRING interactors (by combined score 0–1000):

RankUniProtGeneProtein NameScore
1P02649APOEApolipoprotein E999
2P56817BACE1Beta-secretase 1999
3O00213APPBPB1Amyloid-beta precursor protein binding family B member 1998
4Q92673SORL1Sortilin-related receptor994
5O75509TNFRSF21Tumor necrosis factor receptor superfamily member 21993
6P49768PSEN1Presenilin-1 (γ-secretase component)992
7Q99767APBA2Amyloid-beta A4 precursor protein-binding family A member 2988
8P10636MAPTMicrotubule-associated protein tau986
9Q13564NAE1NEDD8-activating enzyme E1 regulatory subunit986
10P04233CD74HLA class II histocompatibility antigen gamma chain985
11P04156PRNPMajor prion protein984
12Q9HCB6SPON1Spondin-1980
13Q92993KAT5Histone acetyltransferase KAT5973
14Q92870APPBPB2Amyloid-beta precursor protein binding family B member 2966
15P10909CLUClusterin964
16Q07954LRP1Low-density lipoprotein receptor-related protein 1963
17Q09472EP300Histone acetyltransferase p300958
18P49810PSEN2Presenilin-2 (γ-secretase component)955
19P37840SNCAAlpha-synuclein954
20Q92542NCSTNNicastrin (γ-secretase component)941
21P29354GRB2Growth factor receptor-bound protein 2926
22O95704APPBPB3Amyloid-beta A4 precursor protein-binding family B member 3922
23P08247SYPSynaptophysin916
24Q02410APBA1Amyloid-beta A4 precursor protein-binding family A member 1914
25P14735IDEInsulin-degrading enzyme912
26P02766TTRTransthyretin907
27P10997IAPPIslet amyloid polypeptide902
28O96008TOMM40Mitochondrial import receptor subunit TOM40 homolog898
29P07858CTSBCathepsin B897
30Q9H0B6, Q9NZ42MultipleMultiple interactors896

Key functional groups: Presenilin complex members (PSEN1, PSEN2, NCSTN), APBA adaptors (APBA1, APBA2), APP-binding proteins (APPBPB1-3), amyloidogenic proteases (BACE1), amyloid-binding lipoproteins (APOE, CLU, TTR), and neurodegenerative pathway proteins (MAPT, SNCA, IAPP).


Protein similarity

Structural/embedding similarity (ESM2): 81 proteins identified

  • Max similarity range: 0.94–1.0
  • Top matches represent orthologs and paralogs across mammals and vertebrates
  • Highest-scoring hits (≥0.998): A2WNF5, A5A6P7, P05067 (self), P08592, P09486, P12023, P15943, P53601, Q05IS80, Q60495, Q95241, Q9Y625 (avg similarity 0.97–0.98)

Sequence homology (DIAMOND): 77 proteins identified

  • Identity range: 54.5%–100%
  • Bitscore range: 77–1,402

Top 20 sequence homologs (by bitscore):

RankUniProtIdentity (%)BitscoreProtein
1P05067100.01402APP (human, self-reference)
2Q5IS80100.01402APP ortholog
3P1202399.01399APP paralog
4P53601100.01397APP ortholog
5P0859299.01395APP-related protein
6P79307100.01367APP variant
7Q60495100.01365APP ortholog (mouse)
8Q95241100.01353APP ortholog
9P1594391.41276APP-related protein
10Q0648191.41277APP homolog
11P1598965.33273Distant homolog (high-complexity match)
12Q0315788.11056Conserved APP domain
13Q0633587.61203APP-family protein
14P4922354.577.4Minimal identity match
15O4327876.3842Kunitz-type protease inhibitor 1
16O4329170.6361Kunitz-type protease inhibitor 2
17P0276084.6580Protein AMBP
18Q0745688.3643APP-domain protein
19Q2UY0986.7819Conserved homolog
20Q2UY1186.6825Conserved homolog

Top 20 matches predominantly span: APP orthologs across vertebrates (96–100% identity), APP-related family proteins, and proteins sharing Kunitz protease-inhibitor domains characteristic of APP.

Transcription factor regulatory data

APP (amyloid beta precursor protein) is not a transcription factor. It is a protein-coding gene (locus group: protein-coding gene) and does not encode a DNA-binding transcription factor. Therefore, no downstream targets or JASPAR DNA binding motifs are applicable.

Upstream regulators

APP is regulated by the following transcription factors:

High-confidence regulators:

Transcription FactorRegulationEvidence Type
CTCFActivationCurated (TRRUST, TFactS, SIGNOR, NTNU Curated, DoRothEA)
SMAD2ActivationExperimentally validated (ExTRI, NTNU Curated)
PURAUnknownExperimentally validated (ExTRI)
ETS2UnknownCurated/Experimentally validated (TRRUST, GEREDB, ExTRI)
MYCUnknownPredicted/Curated (DoRothEA, HTRI)
AP1UnknownCurated/Experimentally validated (ExTRI, HTRI, TRRUST, DoRothEA)

Low-confidence regulators (19 additional): AHR (Low), ESR2 (Low), ESR1 (Low), EGR1 (Low), BSX (Low), CTNNB1 (Repression, Low), AR (Low), CEBPG (Low), CNOT4 (Low), SP3 (Low), STAT1 (Unknown), RARA (Repression), PPARG (Repression)

Total: 19 upstream transcription factor regulators identified from COLLECTRI database (NTNU Curated, ExTRI, TRRUST, GEREDB, DoRothEA, TFactS, SIGNOR sources).

Drug & pharmacology data

APP (Amyloid-beta Precursor Protein) is a validated drug target with 100 targeting molecules in ChEMBL.

Targeting Molecules Summary

  • Total count: 100 molecules in ChEMBL targeting APP (P05067)
  • By development phase: Phase 4 (4), Phase 3 (6), Phase 2 (4), Phase 0-1 (~86)

TOP 30 Molecules by Development Phase

Phase 4 (Approved/Published):

  1. CHEMBL1774461 — FLORBETAPIR F 18 | Amyloid-β PET imaging tracer | 44 clinical trials
  2. CHEMBL1908919 — FLORBETAPIR | Amyloid-β imaging agent (parent molecule) | 8 clinical trials
  3. CHEMBL191083 — METHYLENE BLUE CATION | Multi-target (APP stabilization, tauopathy) | 76 clinical trials
  4. CHEMBL2042122 — FLUTEMETAMOL F 18 | Amyloid-β PET imaging agent | 2 clinical trials

Phase 3 (Clinical Development): 5. CHEMBL140 — CURCUMIN | Multi-target antioxidant; amyloid aggregation inhibitor | 204 clinical trials 6. CHEMBL145 — CAFFEIC ACID | Polyphenol; antioxidant mechanism | 3 clinical trials 7. CHEMBL149082 — TRAMIPROSATE | Amyloid-β aggregation inhibitor (homotaurine) | 3 clinical trials 8. CHEMBL165 — RESVERATROL | Sirtuins/NAD+ pathway activator; antioxidant | 14 clinical trials 9. CHEMBL2048308 — FLUTAFURANOL | Amyloid-β PET imaging agent | No clinical trials 10. CHEMBL151 — LUTEOLIN | Flavonoid; multi-target antioxidant | 8 clinical trials

Phase 2 (Clinical Development): 11. CHEMBL1368322 — PARAROSANILINE | Dye compound; unclear mechanism 12. CHEMBL150 — KAEMPFEROL | Flavonoid; antioxidant | Included in curcumin trials 13. CHEMBL207456 — PITTSBURGH COMPOUND B | Amyloid-β PET imaging tracer | Research phase 14. CHEMBL2364601 — AFTOBETIN | Phenolic compound | No trial data

Remaining Phase 0-1 molecules (16-100): 86 compounds in preclinical/early research stages, including natural products (resveratrol derivatives, polyphenols), synthetic small molecules, and imaging agents.

Clinical Trials

Top APP-targeting drugs with documented clinical trial activity:

  • CURCUMIN: 204 trials across multiple indications including Alzheimer’s disease, cognitive decline, neuroinflammation
  • METHYLENE BLUE: 76 trials; primary focus on cognitive/neurodegenerative conditions
  • FLORBETAPIR F 18: 44 trials; all diagnostic (PET imaging for amyloid pathology assessment)
  • RESVERATROL: 14 trials; aging, cardiovascular, metabolic indications
  • TRAMIPROSATE: 3 completed Phase 3 trials for mild-to-moderate Alzheimer’s disease (failed primary endpoints; discontinued development)

Pharmacogenomics

No known pharmacogenomics data or APP-gene dosing interactions documented in current databases. APP targeting drugs show no established genetic variants affecting drug response. Dosing is standard across populations with no APP genotype-guided adjustments in clinical practice.

Expression profiles

Tissue Expression (Bgee)

APP demonstrates ubiquitous expression across human tissues (expression breadth: ubiquitous, average score: 97.15/100). The following table shows the top 30 tissues and anatomical regions with expression scores from Bgee:

RankTissue/RegionExpression ScoreQuality
1Prefrontal cortex99.79Gold
2Renal medulla99.69Gold
3Brodmann area 999.69Gold
4Medial globus pallidus99.66Gold
5Stromal cell of endometrium99.65Gold
6Tendon of biceps brachii99.64Gold
7Cortical plate99.61Gold
8Globus pallidus99.58Gold
9Primary visual cortex99.58Gold
10Islet of Langerhans99.56Gold
11C1 segment of cervical spinal cord99.56Gold
12Choroid plexus epithelium99.55Gold
13Metanephros cortex99.55Gold
14Occipital lobe99.53Gold
15Dorsolateral prefrontal cortex99.51Gold
16Nucleus accumbens99.47Gold
17Brodmann area 2399.46Gold
18Colonic epithelium99.46Gold
19Frontal cortex99.45Gold
20Putamen99.45Gold
21Ventral tegmental area99.45Gold
22Gall bladder99.45Gold
23Caudate nucleus99.44Gold
24Spinal cord99.44Gold
25Hypothalamus99.43Gold
26Endothelial cell99.42Gold
27Inferior vagus X ganglion99.41Gold
28Neocortex99.41Gold
29Corpus callosum99.41Gold
30Amygdala99.40Gold

Key patterns: Strong enrichment in brain regions (prefrontal cortex, Brodmann areas, basal ganglia, neocortex), particularly in cortical and subcortical structures consistent with APP’s role in Alzheimer’s disease. Significant expression also detected in kidney, endothelium, and gastrointestinal tissues.

Single-Cell Expression Datasets (Single Cell Expression Atlas)

APP is detected across 38 independent single-cell experiments spanning diverse tissues and disease states. Notable datasets include:

Dataset IDDescriptionTissue/ContextCell Count
E-CURD-114Cellular specificity of smoking effectsHuman airway epithelium81,801
E-HCAD-15Profibrotic roles in diseasePulmonary fibrosis tissue372,319
E-HCAD-9Intrahepatic immune landscapeHuman liver79,612
E-MTAB-9543GI tract atlas across space/timeGastrointestinal tract319,479
E-MTAB-10287Endometrial and uterine tissueEndometrium/uterus340,047
E-MTAB-8142Skin cell typesBreast skin809,675
E-GEOD-180759Immune response in CNS diseaseMS brain lesions421,220
E-MTAB-10042Hematopoietic developmentFetal bone marrow21,485
E-GEOD-135922Retinal and choroidal cellsRetina/choroid55,571
E-GEOD-137537Retinal cell atlasRetina12,881
E-GEOD-139324Immune landscapeHead and neck cancer204,315
E-MTAB-6701Fetal-maternal interfaceFirst trimester placenta135,071
E-GEOD-131882Kidney in diabetesDiabetic kidney43,748

Key cell-type contexts: APP is broadly expressed across epithelial cells, endothelial cells, immune cells (macrophages, microglia), and neural cells across healthy and disease states. Particularly abundant in datasets from neuroinflammatory contexts (MS, CNS disease) and developmental tissues (fetal bone marrow, placenta).

Disease associations

Mendelian / Monogenic Diseases

Disease NameDisease IDInheritanceEvidence Level
Alzheimer disease type 1OMIM:104300 / MONDO:0007088Autosomal dominantStrong
Cerebral amyloid angiopathy, APP-relatedOMIM:605714 / MONDO:0011583Autosomal dominantStrong
Early-onset autosomal dominant Alzheimer diseaseOrphanet:1020 / MONDO:0015140Autosomal dominantSupportive
ABeta amyloidosis, Dutch typeOrphanet:100006 / MONDO:0015033Autosomal dominantSupportive
ABeta amyloidosis, Iowa typeOrphanet:324708 / MONDO:0017946Autosomal dominantSupportive
ABeta amyloidosis, Italian typeOrphanet:324713 / MONDO:0017947Autosomal dominantSupportive
ABetaA21G amyloidosisOrphanet:324718 / MONDO:0017948Autosomal dominantSupportive
ABeta amyloidosis, Arctic typeOrphanet:324723 / MONDO:0017949Autosomal dominantSupportive
ABetaL34V amyloidosisOrphanet:324703Autosomal dominantSupportive
Hereditary cerebral amyloid angiopathyOrphanet:85458Autosomal dominantSupportive
Familial focal epilepsy with variable fociOrphanet:98820 / MONDO:0020310Autosomal dominantSupportive

Phenotype Associations (Top 30 HPO Terms)

HPO IDPhenotype
HP:0000006Autosomal dominant inheritance
HP:0001250Seizure
HP:0002511Alzheimer disease
HP:0000726Dementia
HP:0002354Memory impairment
HP:0001268Mental deterioration
HP:0000708Atypical behavior
HP:0000734Disinhibition
HP:0000738Hallucinations
HP:0001289Confusion
HP:0002381Aphasia
HP:0011970Cerebral amyloid angiopathy
HP:0001342Cerebral hemorrhage
HP:0004968Recurrent cerebral hemorrhage
HP:0011695Cerebellar hemorrhage
HP:0001297Stroke
HP:0002637Cerebral ischemia
HP:0002120Cerebral cortical atrophy
HP:0002185Neurofibrillary tangles
HP:0003791Deposits immunoreactive to beta-amyloid protein
HP:0001300Parkinsonism
HP:0001251Ataxia
HP:0001336Myoclonus
HP:0002015Dysphagia
HP:0000657Oculomotor apraxia
HP:0000504Abnormality of vision
HP:0002076Migraine
HP:0001259Coma
HP:0001263Global developmental delay
HP:0012759Neurodevelopmental abnormality

Complex Disease / GWAS Associations (Top 7)

Trait/DiseaseEffect Size (p-value)Associated GeneChromosome
Alzheimer’s disease1.0 × 10⁻¹¹APP21
Cerebrospinal fluid α-synuclein levels2.0 × 10⁻⁷GABPA - APP21
Glaucoma (primary open-angle)3.0 × 10⁻⁷GABPA - APP21
Glaucoma (primary open-angle)7.0 × 10⁻⁶GABPA - APP21
Longitudinal change in brain amyloid plaque burden3.0 × 10⁻⁶JAM221
Spatial processing2.0 × 10⁻⁶APP21
HDL cholesterol levels in HIV infection8.0 × 10⁻⁶

Structured Data Sources

Generated with Claude Haiku 4.5 + BioBTree MCP, drawing on data BioBTree aggregates from 44 biological databases. Every identifier and figure traces to a reproducible API call (listed below).

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, antibody, bgee, biogrid_interaction, cathgene3d, ccds, chembl_molecule, chembl_target, clinical_trials, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, expressionatlas, gencc, go, gtex, gwas, hgnc, hpa, hpo, interpro, jaspar, mim, mondo, msigdb, orphanet, ortholog, pdb, pfam, reactome, refseq, scxa, scxa_expression, smart, spliceai, string_interaction, transcript, uniprot, wormbase
Generated: 2026-05-25 — For the latest data, query BioBTree directly via MCP or API.
View API calls (160)