APOE Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human APOE. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human APOE. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 38 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, biogrid_interaction, ccds, clinvar, collectri, diamond_similarity, drugbank, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, mim, orphanet, ortholog, pdb, pfam, pharmgkb, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, supfam, transcript, uberon, ufeature, uniprot
Generated: 2026-03-26 — For the latest data, query BioBTree directly via MCP or API.
View API calls (44)
APOE

I have gathered all the necessary data. Now let me compile the comprehensive cross-database reference document for human APOE. APOE (Apolipoprotein E) - Comprehensive Cross-Database Identifier Reference


Section 1: Gene Identifiers Primary Gene Identifiers

DatabaseIdentifierNotes
HGNC IDHGNC:613Approved symbol
HGNC SymbolAPOEApproved name: apolipoprotein E
Ensembl GeneENSG00000130203GRCh38
NCBI Entrez Gene348
OMIM Gene/Locus107741
Gene Metadata
  • Full Name: apolipoprotein E
  • Locus Type: gene with protein product
  • Locus Group: protein-coding gene
  • Gene Group: Apolipoproteins
  • Previous Symbols: AD2 (Alzheimer disease 2, APOE*E4-associated, late onset)
  • Synonyms: APO-E, ApoE4, LDLCQ5, LPG
  • Status: Approved Genomic Location (GRCh38/hg38)
AttributeValue
Chromosome19
Cytogenetic Band19q13.32
Start Position44,903,787
End Position44,909,396
Strand+ (positive)
Genomic AccessionNC_000019.10
Gene Span5,610 bp

Section 2: Transcript Identifiers Ensembl Transcripts Total Count: 32 transcripts (31 protein_coding, 1 retained_intron)

Transcript IDBiotypeStartEnd5'UTR3'UTR
ENST00000252486protein_coding44,905,79644,909,39344,906,602-62444,909,251-393
ENST00000425718protein_coding44,906,36044,908,95444,906,360-624-
ENST00000434152protein_coding44,905,81244,909,02544,905,812-868-
ENST00000446996protein_coding44,905,79144,908,94444,906,587-624-
ENST00000485628retained_intron44,905,79644,907,326--
ENST00000864817protein_coding44,903,78744,909,39244,906,602-62444,909,251-392
ENST00000864820protein_coding44,905,42144,909,39244,906,602-62444,909,251-392
ENST00000864822protein_coding44,905,47144,909,39244,906,602-62444,909,251-392
ENST00000864824protein_coding44,905,79144,909,39544,906,602-62444,909,251-395
ENST00000864826-864849protein_codingVariousVariousVariousVarious
(Additional 22 protein_coding transcripts with minor variations) RefSeq Transcripts (Human) Total Human Count: 5 mRNA + 5 protein accessions
AccessionTypeStatusMANE Select
NM_000041mRNAREVIEWED✓ Yes
NM_001302688mRNAREVIEWEDNo
NM_001302689mRNAREVIEWEDNo
NM_001302690mRNAREVIEWEDNo
NM_001302691mRNAREVIEWEDNo
RefSeq Protein Accessions (Human)
AccessionStatusMANE Select
NP_000032REVIEWED✓ Yes (Canonical)
NP_001289617REVIEWEDNo
NP_001289618REVIEWEDNo
NP_001289619REVIEWEDNo
NP_001289620REVIEWEDNo
CCDS Identifiers Total Count: 1
CCDS IDNotes
CCDS12647Consensus coding sequence
Canonical Transcript Exons (ENST00000252486) Total Exon Count: 4
Exon IDStartEndLengthStrand
ENSE0000104857644,905,79644,905,84146 bp+
ENSE0000357708644,906,60244,906,66766 bp+
ENSE0000089395244,907,76044,907,952193 bp+
ENSE0000089395444,908,53344,909,393861 bp+

Section 3: Protein Identifiers UniProt Accessions Total Count: 5

UniProt IDStatusLengthMassNotes
P02649Reviewed (Swiss-Prot)317 aa36,154 DaCanonical
A0A0S2Z3D5Unreviewed (TrEMBL)--Isoform
E7ERP7Unreviewed (TrEMBL)--Isoform
E9PEV4Unreviewed (TrEMBL)--Isoform
H0Y7L5Unreviewed (TrEMBL)--Isoform
Protein Domains and Families Total Domain Annotations: 4
DatabaseIDNameType
InterProIPR000074ApoA_EFamily
InterProIPR050163Apolipoprotein_A1/A4/EFamily
PfamPF01442ApolipoproteinDomain
SUPFAMSSF58113ApolipoproteinSuperfamily
Key Functional Regions (from UniProt)
RegionResiduesDescription
Signal peptide1-18Cleaved
LDL receptor binding164-174Receptor interaction
Lipid binding216-296Lipoprotein association
Homooligomerization272-323Self-association
VLDL specificity284-296VLDL binding
Heparin binding168-171, 235-242Heparan sulfate binding
Phosphoserine153PTM site
O-glycosylation218GalNAc-Thr

Section 4: Structure Identifiers Experimental Structures (PDB) Total PDB Count: 29

PDB IDMethodResolutionTitle
7FCRX-ray1.40 ÅN-terminal domain mutants
8AX9X-ray1.549 ÅApoE4 N-terminal domain
8AX8X-ray1.551 ÅApoE4 N-terminal domain
6NCOX-ray1.707 ÅFragment-based stabilizer complex
1GS9X-ray1.70 ÅApoE4 22kD domain
1OR3X-ray1.73 ÅApoE3 truncation mutant 165
8CE0X-ray1.75 ÅN-terminal domain
1NFNX-ray1.80 ÅApoE3
6NCNX-ray1.82 ÅFragment-based stabilizer
1BZ4X-ray1.85 ÅApoE3 truncation mutant
1H7IX-ray1.90 ÅApoE3 K146Q mutant
8CDYX-ray1.90 ÅN-terminal domain
1EA8X-ray1.95 ÅApoE3 K146E mutant
1B68X-ray2.00 ÅApoE4 22K fragment
1NFOX-ray2.00 ÅApoE2 D154A
6V7MX-ray2.00 ÅCleaved N-terminal domain
7UVJX-ray1.99 ÅEpitope-scaffold immunogen
1LPEX-ray2.25 ÅLDL receptor-binding domain
1LE4X-ray2.50 ÅCommon mutants structure
1OR2X-ray2.50 ÅApoE3 truncation
6IWBX-ray2.50 ÅComplex with BCL-2
1LE2X-ray3.00 ÅMutant structure
8GRXCryo-EM3.00 ÅApoE4 receptor complex
2KC3NMR-Receptor binding domain
2KNYNMR-CR17-ApoE fusion
2L7BNMR-Full length ApoE3
1OEFNMR-Peptide residues 263-286
1OEGNMR-Peptide residues 267-289
7FCSX-ray1.60 ÅN-terminal domain mutants
Predicted Structures (AlphaFold)
AlphaFold IDSequence LengthGlobal pLDDTHigh Confidence (%)
AF-P02649-F12,53976.1333%

Section 5: Cross-Species Orthologs Total Ortholog Count: 4 (from Ensembl Compara)

SpeciesCommon NameEnsembl Gene IDSymbolBiotype
Mus musculusMouseENSMUSG00000002985Apoeprotein_coding
Rattus norvegicusRatENSRNOG00000018454Apoeprotein_coding
Danio rerioZebrafishENSDARG00000040295apoebprotein_coding
Danio rerioZebrafishENSDARG00000102004apoeaprotein_coding
Additional Entrez Orthologs
  • Mouse Entrez: 11816
  • Rat Entrez: 25728
  • Zebrafish Entrez: 30314 (apoeb) Note: No orthologs identified in Drosophila melanogaster, C. elegans, or S. cerevisiae. APOE appears to be a vertebrate-specific gene.

Section 6: Clinical Variants & AI Predictions ClinVar Variant Summary Total Variant Count: 263

ClassificationCount
Pathogenic11
Likely pathogenic6
Pathogenic/Likely pathogenic1
Uncertain significance (VUS)~30
Likely benign~80
Benign~10
Conflicting classifications~8
Drug response1
Risk factor2
TOP 50 Pathogenic/Likely Pathogenic Variants
ClinVar IDHGVS NotationTypeClassificationReview Status
126456c.497TCC[1] (p.Leu167del)MicrosatellitePathogenicMultiple submitters
17849c.61G>A (p.Glu21Lys)SNVPathogenicNo assertion criteria
17852c.237-2A>GSNV (splice)PathogenicNo assertion criteria
17853c.415_435dup (p.Glu139_Gly145dup)DuplicationPathogenicNo assertion criteria
17857c.490A>G (p.Lys164Glu)SNVPathogenicNo assertion criteria
17858c.490A>C (p.Lys164Gln)SNVPathogenicNo assertion criteria
17859c.736C>T (p.Arg246Cys)SNVPathogenicNo assertion criteria
17861c.146del (p.Gly49fs)DeletionPathogenicNo assertion criteria
17874c.455G>A (p.Arg152Gln)SNVPathogenicNo assertion criteria
17875c.875G>A (p.Arg292His)SNVPathogenicNo assertion criteria
17879c.488G>C (p.Arg163Pro)SNVPathogenicNo assertion criteria
17880c.127C>T (p.Arg43Cys)SNVPathogenic/LPMultiple submitters
1077013c.494G>C (p.Arg165Pro)SNVLikely pathogenicSingle submitter
17850c.460C>A (p.Arg154Ser)SNVLikely pathogenicSingle submitter
17862c.683G>A (p.Trp228Ter)SNVLikely pathogenicMultiple submitters
17865c.488G>A (p.Arg163His)SNVLikely pathogenicSingle submitter
1341575c.548G>C (p.Gly183Ala)SNVLikely pathogenicSingle submitter
17848c.526C>T (p.Arg176Cys)SNVDrug responseExpert panel
SpliceAI Predictions Total Count: 450 TOP 50 Predicted Splice-Altering Variants (by delta score):
VariantEffectDelta Score
19:44905838:GCCG:Gdonor_gain0.99
19:44905821:G:GTdonor_gain0.98
19:44905842:G:GGdonor_gain0.98
19:44905839:CCGG:Cdonor_loss0.97
19:44905840:CGGTG:Cdonor_loss0.97
19:44905841:GGT:Gdonor_loss0.97
19:44905843:T:Adonor_loss0.97
19:44905844:GA:Gdonor_loss0.97
19:44905816:TG:Tdonor_gain0.95
19:44905845:A:ACdonor_loss0.95
19:44905935:A:Tdonor_gain0.94
19:44905846:G:Cdonor_loss0.94
19:44905866:GGGA:Gdonor_gain0.91
19:44905818:AAG:Adonor_gain0.89
19:44905812:G:GTdonor_gain0.85
19:44905836:G:GTdonor_gain0.83
19:44905817:GA:Gdonor_gain0.82
19:44905919:TCCAG:Tdonor_loss0.78
.........
AlphaMissense Predictions Total Count: 2,023 missense predictions TOP 50 Predicted Pathogenic Missense Variants:
VariantProtein ChangeAM ScoreClass
19:44906637:T:AW5R0.869likely_pathogenic
19:44906637:T:CW5R0.869likely_pathogenic
19:44906644:C:AA7E0.836likely_pathogenic
19:44906653:T:AV10D0.827likely_pathogenic
19:44906656:C:AT11K0.790likely_pathogenic
19:44906641:C:AA6D0.783likely_pathogenic
19:44906667:G:AG15R0.730likely_pathogenic
19:44906667:G:CG15R0.730likely_pathogenic
19:44907760:G:AG15E0.728likely_pathogenic
19:44906650:T:GL9R0.692likely_pathogenic
19:44906656:C:GT11R0.690likely_pathogenic
19:44907762:T:CC16R0.684likely_pathogenic
19:44906665:C:AA14E0.633likely_pathogenic
19:44907764:C:GC16W0.575likely_pathogenic
19:44906650:T:AL9Q0.567likely_pathogenic

Section 7: Biological Pathways & Gene Ontology Reactome Pathways Total Pathway Count: 34

Pathway IDPathway NameDisease Pathway
R-HSA-174824Plasma lipoprotein assembly, remodeling, and clearanceNo
R-HSA-977225Amyloid fiber formationNo
R-HSA-8963888Chylomicron assemblyNo
R-HSA-8963898Plasma lipoprotein assemblyNo
R-HSA-8963899Plasma lipoprotein remodelingNo
R-HSA-8963901Chylomicron remodelingNo
R-HSA-8964026Chylomicron clearanceNo
R-HSA-8964043Plasma lipoprotein clearanceNo
R-HSA-8964058HDL remodelingNo
R-HSA-9029569NR1H3 & NR1H2 regulate cholesterol transport/effluxNo
R-HSA-9024446NR1H2 and NR1H3-mediated signalingNo
R-HSA-975634Retinoid metabolism and transportNo
R-HSA-6806667Metabolism of fat-soluble vitaminsNo
R-HSA-1236394Signaling by ERBB4No
R-HSA-1251985Nuclear signaling by ERBB4No
R-HSA-2173782Binding and Uptake by Scavenger ReceptorsNo
R-HSA-3000480Scavenging by Class A ReceptorsNo
R-HSA-2187338Visual phototransductionNo
R-HSA-8864260Transcriptional regulation by AP-2 (TFAP2)No
R-HSA-381426Regulation of IGF transport by IGFBPsNo
Gene Ontology Annotations Total GO Count: 166 Biological Process (TOP 20)
GO IDTerm
GO:0006869lipid transport
GO:0008203cholesterol metabolic process
GO:0042632cholesterol homeostasis
GO:0033344cholesterol efflux
GO:0043691reverse cholesterol transport
GO:0010875positive regulation of cholesterol efflux
GO:0006707cholesterol catabolic process
GO:0042157lipoprotein metabolic process
GO:0042158lipoprotein biosynthetic process
GO:0042159lipoprotein catabolic process
GO:0034372very-low-density lipoprotein particle remodeling
GO:0034374low-density lipoprotein particle remodeling
GO:0034375high-density lipoprotein particle remodeling
GO:0034380high-density lipoprotein particle assembly
GO:0034382chylomicron remnant clearance
GO:0006641triglyceride metabolic process
GO:0006898receptor-mediated endocytosis
GO:0007616long-term memory
GO:0042982amyloid precursor protein metabolic process
GO:0031175neuron projection development
Molecular Function (TOP 20)
GO IDTerm
GO:0008289lipid binding
GO:0005319lipid transporter activity
GO:0005543phospholipid binding
GO:0008201heparin binding
GO:0043395heparan sulfate proteoglycan binding
GO:0001540amyloid-beta binding
GO:0005515protein binding
GO:0005102signaling receptor binding
GO:0042802identical protein binding
GO:0042803protein homodimerization activity
GO:0019899enzyme binding
GO:0044877protein-containing complex binding
GO:0005198structural molecule activity
GO:0016209antioxidant activity
Cellular Component (TOP 20)
GO IDTerm
GO:0005576extracellular region
GO:0005615extracellular space
GO:0034364high-density lipoprotein particle
GO:0034362low-density lipoprotein particle
GO:0034361very-low-density lipoprotein particle
GO:0034363intermediate-density lipoprotein particle
GO:0042627chylomicron
GO:0034360chylomicron remnant
GO:0034365discoidal high-density lipoprotein particle
GO:0005886plasma membrane
GO:0016020membrane
GO:0005783endoplasmic reticulum
GO:0005788endoplasmic reticulum lumen
GO:0005794Golgi apparatus
GO:0005768endosome
GO:0005769early endosome
GO:0043025neuronal cell body
GO:0030425dendrite
GO:0043083synaptic cleft
GO:0042470melanosome

Section 8: Protein Interactions & Molecular Networks Protein-Protein Interactions Summary

DatabaseTotal Interactions
STRING6,284
IntAct293
BioGRID216
SIGNOR6
TOP 50 Highest-Confidence Interacting Proteins (STRING)
UniProt IDGeneScoreDescription
P02649APOE999Self-interaction
P02647APOA1999Apolipoprotein A-I
P04114APOB999Apolipoprotein B-100
P05067APP999Amyloid precursor protein
P02652APOA2998Apolipoprotein A-II
P02655APOC2998Apolipoprotein C-II
P02656APOC3998Apolipoprotein C-III
P10909CLU998Clusterin
P98155VLDLR997VLDL receptor
Q07954LRP1997LDL receptor-related protein 1
Q14114LRP8997LDL receptor-related protein 8
P02654APOC1996Apolipoprotein C-I
Q92673SORL1995Sortilin-related receptor
Q9NZC2TREM2994Triggering receptor on myeloid cells 2
P01141NGF989Nerve growth factor
P98164LRP2988LDL receptor-related protein 2
O95477ABCA1979ATP-binding cassette A1
P06727APOA4979Apolipoprotein A-IV
P10636MAPT973Microtubule-associated protein tau
P37840SNCA967Alpha-synuclein
Q8WTV0SCARB1963Scavenger receptor class B member 1
P49810PSEN2961Presenilin-2
P01023A2M957Alpha-2-macroglobulin
P49768PSEN1956Presenilin-1
Q86X55LHFPL5948LHFPL tetraspan subfamily member 5
P02768ALB947Albumin
P08603CFH940Complement factor H
P16070CD44937CD44 antigen
P05090APOD932Apolipoprotein D
Key IntAct Interactions (Direct)
InteractorTypeConfidence
LRP1Direct interaction0.78
APPDirect interaction0.75
SNCA (α-synuclein)Physical association0.81
LDLRDirect interaction0.71
CFHDirect interaction0.67
APOE (self)Physical association0.64
TREM2Direct interaction0.60
VLDLRDirect interaction0.44
SIGNOR Signaling Relationships
Entity AEntity BEffectMechanism
APOA1APOEUp-regulates activityRelocalization
APOEVLDLRUp-regulatesBinding
APOESORL1Up-regulatesBinding
APOEMAPTUp-regulates activityBinding
SORT1APOEUp-regulates quantityBinding
SRSF11APOEUp-regulates (stabilization)Post-transcriptional
Protein Structural Similarity (ESM2) Total Similar Proteins: 71
UniProt IDTop SimilarityAvg Similarity
P057701.00000.9990
P0DKY21.00000.9990
P105171.00000.9990
P0DO941.00000.9990
P0DO951.00000.9989
P0DTR81.00000.9989
.........
Sequence Homology (DIAMOND) Total Homologous Proteins: 74
UniProt IDTop IdentityBitscore
P05770100.0%358
P0DKW7100.0%289
P0DKW8100.0%284
P0DUI5100.0%445
P0DUI6100.0%445
P0DTT2100.0%384
P02650 (APOE2)97.1%367

Section 9: Transcription Factor Regulatory Data Note: APOE is primarily a secreted protein, not a transcription factor. However, regulatory data exists for TFs that regulate APOE expression, and APOE has been reported to have some transcriptional regulatory effects. Upstream Regulators (TFs that Regulate APOE) Total Count: 30 TF-target relationships

TF GeneRegulationConfidence
NR1H3 (LXRα)ActivationHigh
NR1H2 (LXRβ)ActivationLow
ATF4ActivationHigh
PPARGActivationLow
PPARDUnknownHigh
TREM2Activation-
DNAJA4Activation-
BMP2Activation-
AP1RepressionHigh
JUNRepressionHigh
SPI1Repression-
NFKB1UnknownHigh
NFKBUnknownHigh
RELAUnknownLow
SP1UnknownLow
STAT1Unknown-
TP53UnknownHigh
TFAP2AUnknownHigh
TFAP2BUnknownHigh
TCF3UnknownHigh
ZIC1UnknownHigh
ZIC2UnknownHigh
ZNF202UnknownHigh
NR2C2UnknownHigh
IRF6UnknownHigh
FOXM1UnknownLow
BHLHA15UnknownLow
AEBP1UnknownLow
CEBPGUnknownLow
NR1H4UnknownLow
Downstream Targets (Genes Regulated BY APOE) Based on CollecTRI data, APOE has been reported to affect:
Target GeneRegulation
HMGXB4Repression
LGMNRepression

Section 10: Drug & Pharmacology Data PharmGKB Gene Information

  • PharmGKB ID: PA55
  • VIP Gene: ✓ Yes (Very Important Pharmacogene)
  • Has Variant Annotations: ✓ Yes
  • CPIC Guideline: No DrugBank Associations Total DrugBank Entries: 8
DrugBank IDNotes
DB00460
DB00877
DB01593
DB09130
DB11886
DB14487
DB14533
DB14548
PharmGKB Drug-Gene Associations Total Drug Associations: 29
Drug/Drug ClassTypeClinical AnnotationsVariant Annotations
HMG-CoA reductase inhibitors (Statins)Drug Class2441,092
nicotineDrug136437
risperidoneDrug78385
simvastatinDrug55212
bupropionDrug34112
rosuvastatinDrug24170
quetiapineDrug2198
fluvastatinDrug2068
Antivirals for HIV (combinations)Drug Class26140
SSRIsDrug Class2221,181
ritonavirDrug1974
tenofovirProdrug17109
fenofibrateDrug1751
acenocoumarolDrug17231
lovastatinDrug1217
aripiprazoleDrug11105
ranibizumabDrug779
donepezilDrug561
rivastigmineDrug538
galantamineDrug438
ziprasidoneDrug324
apixabanDrug337
stavudineDrug212
lecanemabDrug00
aducanumabDrug00
donanemabDrug00
Clinical Relevance APOE genotype (ε2/ε3/ε4 alleles) affects:
  • Statin response: LDL cholesterol reduction varies by genotype
  • Alzheimer’s disease drugs: Response to donepezil, rivastigmine, galantamine
  • Cardiovascular risk: Interaction with lipid-lowering therapies
  • Anti-amyloid therapies: Emerging relevance for lecanemab, aducanumab, donanemab

Section 11: Expression Profiles Bgee Expression Summary

  • Expression Breadth: Ubiquitous
  • Total Present Calls: 267
  • Max Expression Score: 99.93 TOP 30 Tissues with High Expression (Bgee/UBERON)
UBERON IDTissue/OrganExpression
UBERON:0002107LiverVery High
UBERON:0000955BrainVery High
UBERON:0001235Adrenal cortexHigh
UBERON:0002369Adrenal glandHigh
UBERON:0001876AmygdalaHigh
UBERON:0001898HypothalamusHigh
UBERON:0001882Nucleus accumbensHigh
UBERON:0001873Caudate nucleusHigh
UBERON:0001874PutamenHigh
UBERON:0002037CerebellumHigh
UBERON:0000956Cerebral cortexHigh
UBERON:0001950NeocortexHigh
UBERON:0001954Ammon's horn (Hippocampus)High
UBERON:0002728Entorhinal cortexHigh
UBERON:0009834Dorsolateral prefrontal cortexHigh
UBERON:0000451Prefrontal cortexHigh
UBERON:0001870Frontal cortexHigh
UBERON:0001871Temporal lobeHigh
UBERON:0002038Substantia nigraHigh
UBERON:0001891MidbrainHigh
UBERON:0001896Medulla oblongataHigh
UBERON:0000988PonsHigh
UBERON:0002240Spinal cordHigh
UBERON:0001013Adipose tissueModerate
UBERON:0002113KidneyModerate
UBERON:0002106SpleenModerate
UBERON:0000029Lymph nodeModerate
UBERON:0000992OvaryModerate
UBERON:0000473TestisModerate
UBERON:0002046Thyroid glandModerate
Single-Cell Expression Data Total Single-Cell Experiments: 38
Experiment IDDescriptionSpeciesCells
E-ANND-5Developing human immune system across organsHuman911,873
E-HCAD-15Pulmonary fibrosisHuman372,319
E-MTAB-10287Endometrial biopsiesHuman340,047
E-HCAD-56Mouse & human spinal cordHuman245,394
E-MTAB-8894Fetal lateral ganglionic eminenceHuman150,129
E-GEOD-134144Testis development during pubertyHuman150,071
E-MTAB-6701First trimester fetal-maternal interfaceHuman135,071
E-MTAB-6308Lung tumor endothelial cellsHuman113,132
E-HCAD-30T cells in CNSHuman109,642
E-MTAB-8322Synovial tissue macrophages (RA)Human99,512
E-MTAB-8381Ovary cortex cellsHuman46,428
E-HCAD-23First-trimester placenta (Drop-seq)Human41,132
E-MTAB-10662Fetal lungHuman39,900
E-HCAD-10Kidney immune zonationHuman38,813
............
Notable Cell Population Patterns:
  • High in hepatocytes and liver macrophages
  • Enriched in brain astrocytes and microglia
  • Present in retinal pigment epithelium
  • Expressed in adipose tissue macrophages

Section 12: Disease Associations Mendelian/Monogenic Disease Links (GenCC) Total GenCC Entries: 13

DiseaseOMIM/OrphanetClassificationInheritanceSubmitter
Alzheimer disease 2OMIM:104310DefinitiveADAmbry Genetics
Alzheimer disease 2OMIM:104310LimitedUnknownLabcorp Genetics
Hyperlipoproteinemia type IIIOMIM:617347StrongAD/ARGenomics England, Labcorp
Hyperlipoproteinemia type IIIOMIM:617347ModerateARAmbry Genetics
Lipoprotein glomerulopathyOMIM:611771StrongADGenomics England, Labcorp
Lipoprotein glomerulopathyOMIM:611771ModerateADAmbry Genetics
Lipoprotein glomerulopathyORPHANET:329481SupportiveADOrphanet
Sea-blue histiocyte syndromeOMIM:269600LimitedADLabcorp Genetics
Sea-blue histiocyte syndromeORPHANET:158029SupportiveADOrphanet
Orphanet Disease Associations
Orphanet IDDisease NameTypeGene CountPhenotypes
329481Lipoprotein glomerulopathyDisease10
412DysbetalipoproteinemiaDisease126
Phenotype Associations (HPO) Total HPO Terms: 84 TOP 50 Phenotype Terms:
HPO IDPhenotype
HP:0002511Alzheimer disease
HP:0000726Dementia
HP:0002354Memory impairment
HP:0100543Cognitive impairment
HP:0002155Hypertriglyceridemia
HP:0003124Hypercholesterolemia
HP:0003141Increased LDL cholesterol
HP:0003233Decreased HDL cholesterol
HP:0004943Accelerated atherosclerosis
HP:0005181Premature coronary artery atherosclerosis
HP:0001681Angina pectoris
HP:0000608Macular degeneration
HP:0030499Macular drusen
HP:0011506Choroidal neovascularization
HP:0011970Cerebral amyloid angiopathy
HP:0100256Senile plaques
HP:0002185Neurofibrillary tangles
HP:0002120Cerebral cortical atrophy
HP:0007112Temporal cortical atrophy
HP:0001013Eruptive xanthomas
HP:0001114Xanthelasma
HP:0010874Tendon xanthomatosis
HP:0025530Xanthomas of palmar creases
HP:0031290Tuberous xanthoma
HP:0001084Corneal arcus
HP:0000660Lipemia retinalis
HP:0001300Parkinsonism
HP:0001250Seizure
HP:0002197Generalized-onset seizure
HP:0001260Dysarthria
HP:0001288Gait disturbance
HP:0001289Confusion
HP:0002186Apraxia
HP:0030784Anomic aphasia
HP:0001336Myoclonus
HP:0001332Dystonia
HP:0000751Personality changes
HP:0000741Apathy
HP:0000718Aggressive behavior
HP:0006979Sleep-wake cycle disturbance
HP:0000083Renal insufficiency
HP:0000093Proteinuria
HP:0100820Glomerulopathy
HP:0012574Mesangial hypercellularity
HP:0000799Renal steatosis
HP:0001397Hepatic steatosis
HP:0002240Hepatomegaly
HP:0001394Cirrhosis
HP:0001744Splenomegaly
HP:0001982Sea-blue histiocytosis
GWAS Associations Total GWAS Count: 403+ TOP 30 GWAS Associations:
Study IDTrait/DiseaseP-valueMapped Gene
GCST002898LDL cholesterol4.94e-324APOE-APOC1
GCST000682Alzheimer's disease1e-295TOMM40
GCST002222LDL cholesterol2e-178APOC1
GCST000479Alzheimer's disease2e-157TOMM40
GCST002221Total cholesterol1e-149APOC1
GCST001529Alzheimer's disease8e-149APOC1
GCST002896Total cholesterol8e-239APOE
GCST000759LDL cholesterol9e-147APOC1
GCST002422Alzheimer's disease9e-116TOMM40
GCST000760Total cholesterol5e-111APOC1
GCST001087Alzheimer's disease8e-89TOMM40
GCST000134LDL cholesterol1e-60APOC1
GCST001392Lipid metabolism phenotypes3e-58APOE
GCST001727Lp-PLA2 activity1e-56APOE-APOC1
GCST002174Lipid traits3e-53APOE
GCST001285Psychosis and Alzheimer's disease9e-52TOMM40
GCST001408Statin response (LDL change)2e-47APOE
GCST000124Alzheimer's disease2e-44APOC1
GCST000132LDL cholesterol3e-43APOC1
GCST003219Advanced AMD2e-42APOE
GCST001639Metabolite levels6e-42APOE-APOC1
GCST000313Alzheimer's disease1e-40TOMM40
GCST000018Late-onset Alzheimer's1e-39APOC1
GCST000121Alzheimer's disease1e-39APOC1
GCST001947Late-onset Alzheimer's5e-39NECTIN2
GCST002897Triglycerides9e-37APOC1
GCST002404Longevity (90+ years)3e-36APOC1
GCST000808Late-onset Alzheimer's5e-36TOMM40
GCST001530Hippocampal atrophy2e-33-
GCST003073Cerebral amyloid (PET)8e-32APOE

SUMMARY STATISTICS

CategoryCount
Gene Identifiers5 (HGNC, Ensembl, Entrez, OMIM, CCDS)
Transcripts (Ensembl)32
RefSeq Transcripts (Human)5 mRNA + 5 protein
Exons (Canonical)4
UniProt Entries5 (1 reviewed)
Protein Domains4
PDB Structures29
AlphaFold Models1
Orthologs4 (Mouse, Rat, Zebrafish x2)
ClinVar Variants263
Pathogenic Variants11
SpliceAI Predictions450
AlphaMissense Predictions2,023
Reactome Pathways34
GO Annotations166
STRING Interactions6,284
IntAct Interactions293
BioGRID Interactions216
TF Regulators30
PharmGKB Drug Associations29
DrugBank Entries8
Tissues with Expression267+
Single-Cell Datasets38
HPO Phenotypes84
GWAS Associations403+
GenCC Disease Links13
ESM2 Similar Proteins71
DIAMOND Homologs74

Reference compiled from: HGNC, Ensembl, NCBI Entrez, OMIM, UniProt, PDB, AlphaFold, ClinVar, SpliceAI, AlphaMissense, Reactome, Gene Ontology, STRING, IntAct, BioGRID, SIGNOR, CollecTRI, PharmGKB, DrugBank, Bgee, Single Cell Expression Atlas, HPO, GWAS Catalog, GenCC, Orphanet