I have gathered all the necessary data. Now let me compile the comprehensive cross-database reference document for human APOE.
APOE (Apolipoprotein E) - Comprehensive Cross-Database Identifier Reference
Section 1: Gene Identifiers
Primary Gene Identifiers
| Database | Identifier | Notes |
|---|
| HGNC ID | HGNC:613 | Approved symbol |
| HGNC Symbol | APOE | Approved name: apolipoprotein E |
| Ensembl Gene | ENSG00000130203 | GRCh38 |
| NCBI Entrez Gene | 348 | |
| OMIM Gene/Locus | 107741 | |
Gene Metadata
- Full Name: apolipoprotein E
- Locus Type: gene with protein product
- Locus Group: protein-coding gene
- Gene Group: Apolipoproteins
- Previous Symbols: AD2 (Alzheimer disease 2, APOE*E4-associated, late onset)
- Synonyms: APO-E, ApoE4, LDLCQ5, LPG
- Status: Approved
Genomic Location (GRCh38/hg38)
| Attribute | Value |
|---|
| Chromosome | 19 |
| Cytogenetic Band | 19q13.32 |
| Start Position | 44,903,787 |
| End Position | 44,909,396 |
| Strand | + (positive) |
| Genomic Accession | NC_000019.10 |
| Gene Span | 5,610 bp |
Section 2: Transcript Identifiers
Ensembl Transcripts
Total Count: 32 transcripts (31 protein_coding, 1 retained_intron)
| Transcript ID | Biotype | Start | End | 5'UTR | 3'UTR |
|---|
| ENST00000252486 | protein_coding | 44,905,796 | 44,909,393 | 44,906,602-624 | 44,909,251-393 |
| ENST00000425718 | protein_coding | 44,906,360 | 44,908,954 | 44,906,360-624 | - |
| ENST00000434152 | protein_coding | 44,905,812 | 44,909,025 | 44,905,812-868 | - |
| ENST00000446996 | protein_coding | 44,905,791 | 44,908,944 | 44,906,587-624 | - |
| ENST00000485628 | retained_intron | 44,905,796 | 44,907,326 | - | - |
| ENST00000864817 | protein_coding | 44,903,787 | 44,909,392 | 44,906,602-624 | 44,909,251-392 |
| ENST00000864820 | protein_coding | 44,905,421 | 44,909,392 | 44,906,602-624 | 44,909,251-392 |
| ENST00000864822 | protein_coding | 44,905,471 | 44,909,392 | 44,906,602-624 | 44,909,251-392 |
| ENST00000864824 | protein_coding | 44,905,791 | 44,909,395 | 44,906,602-624 | 44,909,251-395 |
| ENST00000864826-864849 | protein_coding | Various | Various | Various | Various |
(Additional 22 protein_coding transcripts with minor variations)
RefSeq Transcripts (Human)
Total Human Count: 5 mRNA + 5 protein accessions
| Accession | Type | Status | MANE Select |
|---|
| NM_000041 | mRNA | REVIEWED | ✓ Yes |
| NM_001302688 | mRNA | REVIEWED | No |
| NM_001302689 | mRNA | REVIEWED | No |
| NM_001302690 | mRNA | REVIEWED | No |
| NM_001302691 | mRNA | REVIEWED | No |
RefSeq Protein Accessions (Human)
| Accession | Status | MANE Select |
|---|
| NP_000032 | REVIEWED | ✓ Yes (Canonical) |
| NP_001289617 | REVIEWED | No |
| NP_001289618 | REVIEWED | No |
| NP_001289619 | REVIEWED | No |
| NP_001289620 | REVIEWED | No |
CCDS Identifiers
Total Count: 1
| CCDS ID | Notes |
|---|
| CCDS12647 | Consensus coding sequence |
Canonical Transcript Exons (ENST00000252486)
Total Exon Count: 4
| Exon ID | Start | End | Length | Strand |
|---|
| ENSE00001048576 | 44,905,796 | 44,905,841 | 46 bp | + |
| ENSE00003577086 | 44,906,602 | 44,906,667 | 66 bp | + |
| ENSE00000893952 | 44,907,760 | 44,907,952 | 193 bp | + |
| ENSE00000893954 | 44,908,533 | 44,909,393 | 861 bp | + |
Section 3: Protein Identifiers
UniProt Accessions
Total Count: 5
| UniProt ID | Status | Length | Mass | Notes |
|---|
| P02649 | Reviewed (Swiss-Prot) | 317 aa | 36,154 Da | Canonical |
| A0A0S2Z3D5 | Unreviewed (TrEMBL) | - | - | Isoform |
| E7ERP7 | Unreviewed (TrEMBL) | - | - | Isoform |
| E9PEV4 | Unreviewed (TrEMBL) | - | - | Isoform |
| H0Y7L5 | Unreviewed (TrEMBL) | - | - | Isoform |
Protein Domains and Families
Total Domain Annotations: 4
| Database | ID | Name | Type |
|---|
| InterPro | IPR000074 | ApoA_E | Family |
| InterPro | IPR050163 | Apolipoprotein_A1/A4/E | Family |
| Pfam | PF01442 | Apolipoprotein | Domain |
| SUPFAM | SSF58113 | Apolipoprotein | Superfamily |
Key Functional Regions (from UniProt)
| Region | Residues | Description |
|---|
| Signal peptide | 1-18 | Cleaved |
| LDL receptor binding | 164-174 | Receptor interaction |
| Lipid binding | 216-296 | Lipoprotein association |
| Homooligomerization | 272-323 | Self-association |
| VLDL specificity | 284-296 | VLDL binding |
| Heparin binding | 168-171, 235-242 | Heparan sulfate binding |
| Phosphoserine | 153 | PTM site |
| O-glycosylation | 218 | GalNAc-Thr |
Section 4: Structure Identifiers
Experimental Structures (PDB)
Total PDB Count: 29
| PDB ID | Method | Resolution | Title |
|---|
| 7FCR | X-ray | 1.40 Å | N-terminal domain mutants |
| 8AX9 | X-ray | 1.549 Å | ApoE4 N-terminal domain |
| 8AX8 | X-ray | 1.551 Å | ApoE4 N-terminal domain |
| 6NCO | X-ray | 1.707 Å | Fragment-based stabilizer complex |
| 1GS9 | X-ray | 1.70 Å | ApoE4 22kD domain |
| 1OR3 | X-ray | 1.73 Å | ApoE3 truncation mutant 165 |
| 8CE0 | X-ray | 1.75 Å | N-terminal domain |
| 1NFN | X-ray | 1.80 Å | ApoE3 |
| 6NCN | X-ray | 1.82 Å | Fragment-based stabilizer |
| 1BZ4 | X-ray | 1.85 Å | ApoE3 truncation mutant |
| 1H7I | X-ray | 1.90 Å | ApoE3 K146Q mutant |
| 8CDY | X-ray | 1.90 Å | N-terminal domain |
| 1EA8 | X-ray | 1.95 Å | ApoE3 K146E mutant |
| 1B68 | X-ray | 2.00 Å | ApoE4 22K fragment |
| 1NFO | X-ray | 2.00 Å | ApoE2 D154A |
| 6V7M | X-ray | 2.00 Å | Cleaved N-terminal domain |
| 7UVJ | X-ray | 1.99 Å | Epitope-scaffold immunogen |
| 1LPE | X-ray | 2.25 Å | LDL receptor-binding domain |
| 1LE4 | X-ray | 2.50 Å | Common mutants structure |
| 1OR2 | X-ray | 2.50 Å | ApoE3 truncation |
| 6IWB | X-ray | 2.50 Å | Complex with BCL-2 |
| 1LE2 | X-ray | 3.00 Å | Mutant structure |
| 8GRX | Cryo-EM | 3.00 Å | ApoE4 receptor complex |
| 2KC3 | NMR | - | Receptor binding domain |
| 2KNY | NMR | - | CR17-ApoE fusion |
| 2L7B | NMR | - | Full length ApoE3 |
| 1OEF | NMR | - | Peptide residues 263-286 |
| 1OEG | NMR | - | Peptide residues 267-289 |
| 7FCS | X-ray | 1.60 Å | N-terminal domain mutants |
Predicted Structures (AlphaFold)
| AlphaFold ID | Sequence Length | Global pLDDT | High Confidence (%) |
|---|
| AF-P02649-F1 | 2,539 | 76.13 | 33% |
Section 5: Cross-Species Orthologs
Total Ortholog Count: 4 (from Ensembl Compara)
| Species | Common Name | Ensembl Gene ID | Symbol | Biotype |
|---|
| Mus musculus | Mouse | ENSMUSG00000002985 | Apoe | protein_coding |
| Rattus norvegicus | Rat | ENSRNOG00000018454 | Apoe | protein_coding |
| Danio rerio | Zebrafish | ENSDARG00000040295 | apoeb | protein_coding |
| Danio rerio | Zebrafish | ENSDARG00000102004 | apoea | protein_coding |
Additional Entrez Orthologs
- Mouse Entrez: 11816
- Rat Entrez: 25728
- Zebrafish Entrez: 30314 (apoeb)
Note: No orthologs identified in Drosophila melanogaster, C. elegans, or S. cerevisiae. APOE appears to be a vertebrate-specific gene.
Section 6: Clinical Variants & AI Predictions
ClinVar Variant Summary
Total Variant Count: 263
| Classification | Count |
|---|
| Pathogenic | 11 |
| Likely pathogenic | 6 |
| Pathogenic/Likely pathogenic | 1 |
| Uncertain significance (VUS) | ~30 |
| Likely benign | ~80 |
| Benign | ~10 |
| Conflicting classifications | ~8 |
| Drug response | 1 |
| Risk factor | 2 |
TOP 50 Pathogenic/Likely Pathogenic Variants
| ClinVar ID | HGVS Notation | Type | Classification | Review Status |
|---|
| 126456 | c.497TCC[1] (p.Leu167del) | Microsatellite | Pathogenic | Multiple submitters |
| 17849 | c.61G>A (p.Glu21Lys) | SNV | Pathogenic | No assertion criteria |
| 17852 | c.237-2A>G | SNV (splice) | Pathogenic | No assertion criteria |
| 17853 | c.415_435dup (p.Glu139_Gly145dup) | Duplication | Pathogenic | No assertion criteria |
| 17857 | c.490A>G (p.Lys164Glu) | SNV | Pathogenic | No assertion criteria |
| 17858 | c.490A>C (p.Lys164Gln) | SNV | Pathogenic | No assertion criteria |
| 17859 | c.736C>T (p.Arg246Cys) | SNV | Pathogenic | No assertion criteria |
| 17861 | c.146del (p.Gly49fs) | Deletion | Pathogenic | No assertion criteria |
| 17874 | c.455G>A (p.Arg152Gln) | SNV | Pathogenic | No assertion criteria |
| 17875 | c.875G>A (p.Arg292His) | SNV | Pathogenic | No assertion criteria |
| 17879 | c.488G>C (p.Arg163Pro) | SNV | Pathogenic | No assertion criteria |
| 17880 | c.127C>T (p.Arg43Cys) | SNV | Pathogenic/LP | Multiple submitters |
| 1077013 | c.494G>C (p.Arg165Pro) | SNV | Likely pathogenic | Single submitter |
| 17850 | c.460C>A (p.Arg154Ser) | SNV | Likely pathogenic | Single submitter |
| 17862 | c.683G>A (p.Trp228Ter) | SNV | Likely pathogenic | Multiple submitters |
| 17865 | c.488G>A (p.Arg163His) | SNV | Likely pathogenic | Single submitter |
| 1341575 | c.548G>C (p.Gly183Ala) | SNV | Likely pathogenic | Single submitter |
| 17848 | c.526C>T (p.Arg176Cys) | SNV | Drug response | Expert panel |
SpliceAI Predictions
Total Count: 450
TOP 50 Predicted Splice-Altering Variants (by delta score):
| Variant | Effect | Delta Score |
|---|
| 19:44905838:GCCG:G | donor_gain | 0.99 |
| 19:44905821:G:GT | donor_gain | 0.98 |
| 19:44905842:G:GG | donor_gain | 0.98 |
| 19:44905839:CCGG:C | donor_loss | 0.97 |
| 19:44905840:CGGTG:C | donor_loss | 0.97 |
| 19:44905841:GGT:G | donor_loss | 0.97 |
| 19:44905843:T:A | donor_loss | 0.97 |
| 19:44905844:GA:G | donor_loss | 0.97 |
| 19:44905816:TG:T | donor_gain | 0.95 |
| 19:44905845:A:AC | donor_loss | 0.95 |
| 19:44905935:A:T | donor_gain | 0.94 |
| 19:44905846:G:C | donor_loss | 0.94 |
| 19:44905866:GGGA:G | donor_gain | 0.91 |
| 19:44905818:AAG:A | donor_gain | 0.89 |
| 19:44905812:G:GT | donor_gain | 0.85 |
| 19:44905836:G:GT | donor_gain | 0.83 |
| 19:44905817:GA:G | donor_gain | 0.82 |
| 19:44905919:TCCAG:T | donor_loss | 0.78 |
| ... | ... | ... |
AlphaMissense Predictions
Total Count: 2,023 missense predictions
TOP 50 Predicted Pathogenic Missense Variants:
| Variant | Protein Change | AM Score | Class |
|---|
| 19:44906637:T:A | W5R | 0.869 | likely_pathogenic |
| 19:44906637:T:C | W5R | 0.869 | likely_pathogenic |
| 19:44906644:C:A | A7E | 0.836 | likely_pathogenic |
| 19:44906653:T:A | V10D | 0.827 | likely_pathogenic |
| 19:44906656:C:A | T11K | 0.790 | likely_pathogenic |
| 19:44906641:C:A | A6D | 0.783 | likely_pathogenic |
| 19:44906667:G:A | G15R | 0.730 | likely_pathogenic |
| 19:44906667:G:C | G15R | 0.730 | likely_pathogenic |
| 19:44907760:G:A | G15E | 0.728 | likely_pathogenic |
| 19:44906650:T:G | L9R | 0.692 | likely_pathogenic |
| 19:44906656:C:G | T11R | 0.690 | likely_pathogenic |
| 19:44907762:T:C | C16R | 0.684 | likely_pathogenic |
| 19:44906665:C:A | A14E | 0.633 | likely_pathogenic |
| 19:44907764:C:G | C16W | 0.575 | likely_pathogenic |
| 19:44906650:T:A | L9Q | 0.567 | likely_pathogenic |
Section 7: Biological Pathways & Gene Ontology
Reactome Pathways
Total Pathway Count: 34
| Pathway ID | Pathway Name | Disease Pathway |
|---|
| R-HSA-174824 | Plasma lipoprotein assembly, remodeling, and clearance | No |
| R-HSA-977225 | Amyloid fiber formation | No |
| R-HSA-8963888 | Chylomicron assembly | No |
| R-HSA-8963898 | Plasma lipoprotein assembly | No |
| R-HSA-8963899 | Plasma lipoprotein remodeling | No |
| R-HSA-8963901 | Chylomicron remodeling | No |
| R-HSA-8964026 | Chylomicron clearance | No |
| R-HSA-8964043 | Plasma lipoprotein clearance | No |
| R-HSA-8964058 | HDL remodeling | No |
| R-HSA-9029569 | NR1H3 & NR1H2 regulate cholesterol transport/efflux | No |
| R-HSA-9024446 | NR1H2 and NR1H3-mediated signaling | No |
| R-HSA-975634 | Retinoid metabolism and transport | No |
| R-HSA-6806667 | Metabolism of fat-soluble vitamins | No |
| R-HSA-1236394 | Signaling by ERBB4 | No |
| R-HSA-1251985 | Nuclear signaling by ERBB4 | No |
| R-HSA-2173782 | Binding and Uptake by Scavenger Receptors | No |
| R-HSA-3000480 | Scavenging by Class A Receptors | No |
| R-HSA-2187338 | Visual phototransduction | No |
| R-HSA-8864260 | Transcriptional regulation by AP-2 (TFAP2) | No |
| R-HSA-381426 | Regulation of IGF transport by IGFBPs | No |
Gene Ontology Annotations
Total GO Count: 166
Biological Process (TOP 20)
| GO ID | Term |
|---|
| GO:0006869 | lipid transport |
| GO:0008203 | cholesterol metabolic process |
| GO:0042632 | cholesterol homeostasis |
| GO:0033344 | cholesterol efflux |
| GO:0043691 | reverse cholesterol transport |
| GO:0010875 | positive regulation of cholesterol efflux |
| GO:0006707 | cholesterol catabolic process |
| GO:0042157 | lipoprotein metabolic process |
| GO:0042158 | lipoprotein biosynthetic process |
| GO:0042159 | lipoprotein catabolic process |
| GO:0034372 | very-low-density lipoprotein particle remodeling |
| GO:0034374 | low-density lipoprotein particle remodeling |
| GO:0034375 | high-density lipoprotein particle remodeling |
| GO:0034380 | high-density lipoprotein particle assembly |
| GO:0034382 | chylomicron remnant clearance |
| GO:0006641 | triglyceride metabolic process |
| GO:0006898 | receptor-mediated endocytosis |
| GO:0007616 | long-term memory |
| GO:0042982 | amyloid precursor protein metabolic process |
| GO:0031175 | neuron projection development |
Molecular Function (TOP 20)
| GO ID | Term |
|---|
| GO:0008289 | lipid binding |
| GO:0005319 | lipid transporter activity |
| GO:0005543 | phospholipid binding |
| GO:0008201 | heparin binding |
| GO:0043395 | heparan sulfate proteoglycan binding |
| GO:0001540 | amyloid-beta binding |
| GO:0005515 | protein binding |
| GO:0005102 | signaling receptor binding |
| GO:0042802 | identical protein binding |
| GO:0042803 | protein homodimerization activity |
| GO:0019899 | enzyme binding |
| GO:0044877 | protein-containing complex binding |
| GO:0005198 | structural molecule activity |
| GO:0016209 | antioxidant activity |
Cellular Component (TOP 20)
| GO ID | Term |
|---|
| GO:0005576 | extracellular region |
| GO:0005615 | extracellular space |
| GO:0034364 | high-density lipoprotein particle |
| GO:0034362 | low-density lipoprotein particle |
| GO:0034361 | very-low-density lipoprotein particle |
| GO:0034363 | intermediate-density lipoprotein particle |
| GO:0042627 | chylomicron |
| GO:0034360 | chylomicron remnant |
| GO:0034365 | discoidal high-density lipoprotein particle |
| GO:0005886 | plasma membrane |
| GO:0016020 | membrane |
| GO:0005783 | endoplasmic reticulum |
| GO:0005788 | endoplasmic reticulum lumen |
| GO:0005794 | Golgi apparatus |
| GO:0005768 | endosome |
| GO:0005769 | early endosome |
| GO:0043025 | neuronal cell body |
| GO:0030425 | dendrite |
| GO:0043083 | synaptic cleft |
| GO:0042470 | melanosome |
Section 8: Protein Interactions & Molecular Networks
Protein-Protein Interactions Summary
| Database | Total Interactions |
|---|
| STRING | 6,284 |
| IntAct | 293 |
| BioGRID | 216 |
| SIGNOR | 6 |
TOP 50 Highest-Confidence Interacting Proteins (STRING)
| UniProt ID | Gene | Score | Description |
|---|
| P02649 | APOE | 999 | Self-interaction |
| P02647 | APOA1 | 999 | Apolipoprotein A-I |
| P04114 | APOB | 999 | Apolipoprotein B-100 |
| P05067 | APP | 999 | Amyloid precursor protein |
| P02652 | APOA2 | 998 | Apolipoprotein A-II |
| P02655 | APOC2 | 998 | Apolipoprotein C-II |
| P02656 | APOC3 | 998 | Apolipoprotein C-III |
| P10909 | CLU | 998 | Clusterin |
| P98155 | VLDLR | 997 | VLDL receptor |
| Q07954 | LRP1 | 997 | LDL receptor-related protein 1 |
| Q14114 | LRP8 | 997 | LDL receptor-related protein 8 |
| P02654 | APOC1 | 996 | Apolipoprotein C-I |
| Q92673 | SORL1 | 995 | Sortilin-related receptor |
| Q9NZC2 | TREM2 | 994 | Triggering receptor on myeloid cells 2 |
| P01141 | NGF | 989 | Nerve growth factor |
| P98164 | LRP2 | 988 | LDL receptor-related protein 2 |
| O95477 | ABCA1 | 979 | ATP-binding cassette A1 |
| P06727 | APOA4 | 979 | Apolipoprotein A-IV |
| P10636 | MAPT | 973 | Microtubule-associated protein tau |
| P37840 | SNCA | 967 | Alpha-synuclein |
| Q8WTV0 | SCARB1 | 963 | Scavenger receptor class B member 1 |
| P49810 | PSEN2 | 961 | Presenilin-2 |
| P01023 | A2M | 957 | Alpha-2-macroglobulin |
| P49768 | PSEN1 | 956 | Presenilin-1 |
| Q86X55 | LHFPL5 | 948 | LHFPL tetraspan subfamily member 5 |
| P02768 | ALB | 947 | Albumin |
| P08603 | CFH | 940 | Complement factor H |
| P16070 | CD44 | 937 | CD44 antigen |
| P05090 | APOD | 932 | Apolipoprotein D |
Key IntAct Interactions (Direct)
| Interactor | Type | Confidence |
|---|
| LRP1 | Direct interaction | 0.78 |
| APP | Direct interaction | 0.75 |
| SNCA (α-synuclein) | Physical association | 0.81 |
| LDLR | Direct interaction | 0.71 |
| CFH | Direct interaction | 0.67 |
| APOE (self) | Physical association | 0.64 |
| TREM2 | Direct interaction | 0.60 |
| VLDLR | Direct interaction | 0.44 |
SIGNOR Signaling Relationships
| Entity A | Entity B | Effect | Mechanism |
|---|
| APOA1 | APOE | Up-regulates activity | Relocalization |
| APOE | VLDLR | Up-regulates | Binding |
| APOE | SORL1 | Up-regulates | Binding |
| APOE | MAPT | Up-regulates activity | Binding |
| SORT1 | APOE | Up-regulates quantity | Binding |
| SRSF11 | APOE | Up-regulates (stabilization) | Post-transcriptional |
Protein Structural Similarity (ESM2)
Total Similar Proteins: 71
| UniProt ID | Top Similarity | Avg Similarity |
|---|
| P05770 | 1.0000 | 0.9990 |
| P0DKY2 | 1.0000 | 0.9990 |
| P10517 | 1.0000 | 0.9990 |
| P0DO94 | 1.0000 | 0.9990 |
| P0DO95 | 1.0000 | 0.9989 |
| P0DTR8 | 1.0000 | 0.9989 |
| ... | ... | ... |
Sequence Homology (DIAMOND)
Total Homologous Proteins: 74
| UniProt ID | Top Identity | Bitscore |
|---|
| P05770 | 100.0% | 358 |
| P0DKW7 | 100.0% | 289 |
| P0DKW8 | 100.0% | 284 |
| P0DUI5 | 100.0% | 445 |
| P0DUI6 | 100.0% | 445 |
| P0DTT2 | 100.0% | 384 |
| P02650 (APOE2) | 97.1% | 367 |
Section 9: Transcription Factor Regulatory Data
Note: APOE is primarily a secreted protein, not a transcription factor. However, regulatory data exists for TFs that regulate APOE expression, and APOE has been reported to have some transcriptional
regulatory effects.
Upstream Regulators (TFs that Regulate APOE)
Total Count: 30 TF-target relationships
| TF Gene | Regulation | Confidence |
|---|
| NR1H3 (LXRα) | Activation | High |
| NR1H2 (LXRβ) | Activation | Low |
| ATF4 | Activation | High |
| PPARG | Activation | Low |
| PPARD | Unknown | High |
| TREM2 | Activation | - |
| DNAJA4 | Activation | - |
| BMP2 | Activation | - |
| AP1 | Repression | High |
| JUN | Repression | High |
| SPI1 | Repression | - |
| NFKB1 | Unknown | High |
| NFKB | Unknown | High |
| RELA | Unknown | Low |
| SP1 | Unknown | Low |
| STAT1 | Unknown | - |
| TP53 | Unknown | High |
| TFAP2A | Unknown | High |
| TFAP2B | Unknown | High |
| TCF3 | Unknown | High |
| ZIC1 | Unknown | High |
| ZIC2 | Unknown | High |
| ZNF202 | Unknown | High |
| NR2C2 | Unknown | High |
| IRF6 | Unknown | High |
| FOXM1 | Unknown | Low |
| BHLHA15 | Unknown | Low |
| AEBP1 | Unknown | Low |
| CEBPG | Unknown | Low |
| NR1H4 | Unknown | Low |
Downstream Targets (Genes Regulated BY APOE)
Based on CollecTRI data, APOE has been reported to affect:
| Target Gene | Regulation |
|---|
| HMGXB4 | Repression |
| LGMN | Repression |
Section 10: Drug & Pharmacology Data
PharmGKB Gene Information
- PharmGKB ID: PA55
- VIP Gene: ✓ Yes (Very Important Pharmacogene)
- Has Variant Annotations: ✓ Yes
- CPIC Guideline: No
DrugBank Associations
Total DrugBank Entries: 8
| DrugBank ID | Notes |
|---|
| DB00460 | |
| DB00877 | |
| DB01593 | |
| DB09130 | |
| DB11886 | |
| DB14487 | |
| DB14533 | |
| DB14548 | |
PharmGKB Drug-Gene Associations
Total Drug Associations: 29
| Drug/Drug Class | Type | Clinical Annotations | Variant Annotations |
|---|
| HMG-CoA reductase inhibitors (Statins) | Drug Class | 244 | 1,092 |
| nicotine | Drug | 136 | 437 |
| risperidone | Drug | 78 | 385 |
| simvastatin | Drug | 55 | 212 |
| bupropion | Drug | 34 | 112 |
| rosuvastatin | Drug | 24 | 170 |
| quetiapine | Drug | 21 | 98 |
| fluvastatin | Drug | 20 | 68 |
| Antivirals for HIV (combinations) | Drug Class | 26 | 140 |
| SSRIs | Drug Class | 222 | 1,181 |
| ritonavir | Drug | 19 | 74 |
| tenofovir | Prodrug | 17 | 109 |
| fenofibrate | Drug | 17 | 51 |
| acenocoumarol | Drug | 17 | 231 |
| lovastatin | Drug | 12 | 17 |
| aripiprazole | Drug | 11 | 105 |
| ranibizumab | Drug | 7 | 79 |
| donepezil | Drug | 5 | 61 |
| rivastigmine | Drug | 5 | 38 |
| galantamine | Drug | 4 | 38 |
| ziprasidone | Drug | 3 | 24 |
| apixaban | Drug | 3 | 37 |
| stavudine | Drug | 2 | 12 |
| lecanemab | Drug | 0 | 0 |
| aducanumab | Drug | 0 | 0 |
| donanemab | Drug | 0 | 0 |
Clinical Relevance
APOE genotype (ε2/ε3/ε4 alleles) affects:
- Statin response: LDL cholesterol reduction varies by genotype
- Alzheimer’s disease drugs: Response to donepezil, rivastigmine, galantamine
- Cardiovascular risk: Interaction with lipid-lowering therapies
- Anti-amyloid therapies: Emerging relevance for lecanemab, aducanumab, donanemab
Section 11: Expression Profiles
Bgee Expression Summary
- Expression Breadth: Ubiquitous
- Total Present Calls: 267
- Max Expression Score: 99.93
TOP 30 Tissues with High Expression (Bgee/UBERON)
| UBERON ID | Tissue/Organ | Expression |
|---|
| UBERON:0002107 | Liver | Very High |
| UBERON:0000955 | Brain | Very High |
| UBERON:0001235 | Adrenal cortex | High |
| UBERON:0002369 | Adrenal gland | High |
| UBERON:0001876 | Amygdala | High |
| UBERON:0001898 | Hypothalamus | High |
| UBERON:0001882 | Nucleus accumbens | High |
| UBERON:0001873 | Caudate nucleus | High |
| UBERON:0001874 | Putamen | High |
| UBERON:0002037 | Cerebellum | High |
| UBERON:0000956 | Cerebral cortex | High |
| UBERON:0001950 | Neocortex | High |
| UBERON:0001954 | Ammon's horn (Hippocampus) | High |
| UBERON:0002728 | Entorhinal cortex | High |
| UBERON:0009834 | Dorsolateral prefrontal cortex | High |
| UBERON:0000451 | Prefrontal cortex | High |
| UBERON:0001870 | Frontal cortex | High |
| UBERON:0001871 | Temporal lobe | High |
| UBERON:0002038 | Substantia nigra | High |
| UBERON:0001891 | Midbrain | High |
| UBERON:0001896 | Medulla oblongata | High |
| UBERON:0000988 | Pons | High |
| UBERON:0002240 | Spinal cord | High |
| UBERON:0001013 | Adipose tissue | Moderate |
| UBERON:0002113 | Kidney | Moderate |
| UBERON:0002106 | Spleen | Moderate |
| UBERON:0000029 | Lymph node | Moderate |
| UBERON:0000992 | Ovary | Moderate |
| UBERON:0000473 | Testis | Moderate |
| UBERON:0002046 | Thyroid gland | Moderate |
Single-Cell Expression Data
Total Single-Cell Experiments: 38
| Experiment ID | Description | Species | Cells |
|---|
| E-ANND-5 | Developing human immune system across organs | Human | 911,873 |
| E-HCAD-15 | Pulmonary fibrosis | Human | 372,319 |
| E-MTAB-10287 | Endometrial biopsies | Human | 340,047 |
| E-HCAD-56 | Mouse & human spinal cord | Human | 245,394 |
| E-MTAB-8894 | Fetal lateral ganglionic eminence | Human | 150,129 |
| E-GEOD-134144 | Testis development during puberty | Human | 150,071 |
| E-MTAB-6701 | First trimester fetal-maternal interface | Human | 135,071 |
| E-MTAB-6308 | Lung tumor endothelial cells | Human | 113,132 |
| E-HCAD-30 | T cells in CNS | Human | 109,642 |
| E-MTAB-8322 | Synovial tissue macrophages (RA) | Human | 99,512 |
| E-MTAB-8381 | Ovary cortex cells | Human | 46,428 |
| E-HCAD-23 | First-trimester placenta (Drop-seq) | Human | 41,132 |
| E-MTAB-10662 | Fetal lung | Human | 39,900 |
| E-HCAD-10 | Kidney immune zonation | Human | 38,813 |
| ... | ... | ... | ... |
Notable Cell Population Patterns:
- High in hepatocytes and liver macrophages
- Enriched in brain astrocytes and microglia
- Present in retinal pigment epithelium
- Expressed in adipose tissue macrophages
Section 12: Disease Associations
Mendelian/Monogenic Disease Links (GenCC)
Total GenCC Entries: 13
| Disease | OMIM/Orphanet | Classification | Inheritance | Submitter |
|---|
| Alzheimer disease 2 | OMIM:104310 | Definitive | AD | Ambry Genetics |
| Alzheimer disease 2 | OMIM:104310 | Limited | Unknown | Labcorp Genetics |
| Hyperlipoproteinemia type III | OMIM:617347 | Strong | AD/AR | Genomics England, Labcorp |
| Hyperlipoproteinemia type III | OMIM:617347 | Moderate | AR | Ambry Genetics |
| Lipoprotein glomerulopathy | OMIM:611771 | Strong | AD | Genomics England, Labcorp |
| Lipoprotein glomerulopathy | OMIM:611771 | Moderate | AD | Ambry Genetics |
| Lipoprotein glomerulopathy | ORPHANET:329481 | Supportive | AD | Orphanet |
| Sea-blue histiocyte syndrome | OMIM:269600 | Limited | AD | Labcorp Genetics |
| Sea-blue histiocyte syndrome | ORPHANET:158029 | Supportive | AD | Orphanet |
Orphanet Disease Associations
| Orphanet ID | Disease Name | Type | Gene Count | Phenotypes |
|---|
| 329481 | Lipoprotein glomerulopathy | Disease | 1 | 0 |
| 412 | Dysbetalipoproteinemia | Disease | 1 | 26 |
Phenotype Associations (HPO)
Total HPO Terms: 84
TOP 50 Phenotype Terms:
| HPO ID | Phenotype |
|---|
| HP:0002511 | Alzheimer disease |
| HP:0000726 | Dementia |
| HP:0002354 | Memory impairment |
| HP:0100543 | Cognitive impairment |
| HP:0002155 | Hypertriglyceridemia |
| HP:0003124 | Hypercholesterolemia |
| HP:0003141 | Increased LDL cholesterol |
| HP:0003233 | Decreased HDL cholesterol |
| HP:0004943 | Accelerated atherosclerosis |
| HP:0005181 | Premature coronary artery atherosclerosis |
| HP:0001681 | Angina pectoris |
| HP:0000608 | Macular degeneration |
| HP:0030499 | Macular drusen |
| HP:0011506 | Choroidal neovascularization |
| HP:0011970 | Cerebral amyloid angiopathy |
| HP:0100256 | Senile plaques |
| HP:0002185 | Neurofibrillary tangles |
| HP:0002120 | Cerebral cortical atrophy |
| HP:0007112 | Temporal cortical atrophy |
| HP:0001013 | Eruptive xanthomas |
| HP:0001114 | Xanthelasma |
| HP:0010874 | Tendon xanthomatosis |
| HP:0025530 | Xanthomas of palmar creases |
| HP:0031290 | Tuberous xanthoma |
| HP:0001084 | Corneal arcus |
| HP:0000660 | Lipemia retinalis |
| HP:0001300 | Parkinsonism |
| HP:0001250 | Seizure |
| HP:0002197 | Generalized-onset seizure |
| HP:0001260 | Dysarthria |
| HP:0001288 | Gait disturbance |
| HP:0001289 | Confusion |
| HP:0002186 | Apraxia |
| HP:0030784 | Anomic aphasia |
| HP:0001336 | Myoclonus |
| HP:0001332 | Dystonia |
| HP:0000751 | Personality changes |
| HP:0000741 | Apathy |
| HP:0000718 | Aggressive behavior |
| HP:0006979 | Sleep-wake cycle disturbance |
| HP:0000083 | Renal insufficiency |
| HP:0000093 | Proteinuria |
| HP:0100820 | Glomerulopathy |
| HP:0012574 | Mesangial hypercellularity |
| HP:0000799 | Renal steatosis |
| HP:0001397 | Hepatic steatosis |
| HP:0002240 | Hepatomegaly |
| HP:0001394 | Cirrhosis |
| HP:0001744 | Splenomegaly |
| HP:0001982 | Sea-blue histiocytosis |
GWAS Associations
Total GWAS Count: 403+
TOP 30 GWAS Associations:
| Study ID | Trait/Disease | P-value | Mapped Gene |
|---|
| GCST002898 | LDL cholesterol | 4.94e-324 | APOE-APOC1 |
| GCST000682 | Alzheimer's disease | 1e-295 | TOMM40 |
| GCST002222 | LDL cholesterol | 2e-178 | APOC1 |
| GCST000479 | Alzheimer's disease | 2e-157 | TOMM40 |
| GCST002221 | Total cholesterol | 1e-149 | APOC1 |
| GCST001529 | Alzheimer's disease | 8e-149 | APOC1 |
| GCST002896 | Total cholesterol | 8e-239 | APOE |
| GCST000759 | LDL cholesterol | 9e-147 | APOC1 |
| GCST002422 | Alzheimer's disease | 9e-116 | TOMM40 |
| GCST000760 | Total cholesterol | 5e-111 | APOC1 |
| GCST001087 | Alzheimer's disease | 8e-89 | TOMM40 |
| GCST000134 | LDL cholesterol | 1e-60 | APOC1 |
| GCST001392 | Lipid metabolism phenotypes | 3e-58 | APOE |
| GCST001727 | Lp-PLA2 activity | 1e-56 | APOE-APOC1 |
| GCST002174 | Lipid traits | 3e-53 | APOE |
| GCST001285 | Psychosis and Alzheimer's disease | 9e-52 | TOMM40 |
| GCST001408 | Statin response (LDL change) | 2e-47 | APOE |
| GCST000124 | Alzheimer's disease | 2e-44 | APOC1 |
| GCST000132 | LDL cholesterol | 3e-43 | APOC1 |
| GCST003219 | Advanced AMD | 2e-42 | APOE |
| GCST001639 | Metabolite levels | 6e-42 | APOE-APOC1 |
| GCST000313 | Alzheimer's disease | 1e-40 | TOMM40 |
| GCST000018 | Late-onset Alzheimer's | 1e-39 | APOC1 |
| GCST000121 | Alzheimer's disease | 1e-39 | APOC1 |
| GCST001947 | Late-onset Alzheimer's | 5e-39 | NECTIN2 |
| GCST002897 | Triglycerides | 9e-37 | APOC1 |
| GCST002404 | Longevity (90+ years) | 3e-36 | APOC1 |
| GCST000808 | Late-onset Alzheimer's | 5e-36 | TOMM40 |
| GCST001530 | Hippocampal atrophy | 2e-33 | - |
| GCST003073 | Cerebral amyloid (PET) | 8e-32 | APOE |
SUMMARY STATISTICS
| Category | Count |
|---|
| Gene Identifiers | 5 (HGNC, Ensembl, Entrez, OMIM, CCDS) |
| Transcripts (Ensembl) | 32 |
| RefSeq Transcripts (Human) | 5 mRNA + 5 protein |
| Exons (Canonical) | 4 |
| UniProt Entries | 5 (1 reviewed) |
| Protein Domains | 4 |
| PDB Structures | 29 |
| AlphaFold Models | 1 |
| Orthologs | 4 (Mouse, Rat, Zebrafish x2) |
| ClinVar Variants | 263 |
| Pathogenic Variants | 11 |
| SpliceAI Predictions | 450 |
| AlphaMissense Predictions | 2,023 |
| Reactome Pathways | 34 |
| GO Annotations | 166 |
| STRING Interactions | 6,284 |
| IntAct Interactions | 293 |
| BioGRID Interactions | 216 |
| TF Regulators | 30 |
| PharmGKB Drug Associations | 29 |
| DrugBank Entries | 8 |
| Tissues with Expression | 267+ |
| Single-Cell Datasets | 38 |
| HPO Phenotypes | 84 |
| GWAS Associations | 403+ |
| GenCC Disease Links | 13 |
| ESM2 Similar Proteins | 71 |
| DIAMOND Homologs | 74 |
Reference compiled from: HGNC, Ensembl, NCBI Entrez, OMIM, UniProt, PDB, AlphaFold, ClinVar, SpliceAI, AlphaMissense, Reactome, Gene Ontology, STRING, IntAct, BioGRID, SIGNOR, CollecTRI, PharmGKB,
DrugBank, Bgee, Single Cell Expression Atlas, HPO, GWAS Catalog, GenCC, Orphanet