PCSK9 Gene Complete Identifier and Functional Mapping Reference
Provide a comprehensive cross-database identifier and functional mapping reference for human PCSK9. This should serve as a definitive lookup resource …
Provide a comprehensive cross-database identifier and functional mapping reference for human PCSK9. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section
Structured Data Sources
This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 31 biological databases.
Further analyze this answer or run your own queries with BioBTree MCP.
alphafold, alphamissense, bgee, bgee_evidence, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, mim, ortholog, pdb, pharmgkb_gene, reactome, refseq, spliceai, string_interaction, transcript, uniprotView API calls (48)
- search(PCSK9) https://sugi.bio/biobtree/api/search?i=PCSK9
- search(PCSK9, hgnc) https://sugi.bio/biobtree/api/search?i=PCSK9&s=hgnc
- search(PCSK9, ensembl) https://sugi.bio/biobtree/api/search?i=PCSK9&s=ensembl
- entry(HGNC:20001, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A20001&s=hgnc
- entry(ENSG00000169174, ensembl) https://sugi.bio/biobtree/api/entry?i=ENSG00000169174&s=ensembl
- entry(255738, entrez) https://sugi.bio/biobtree/api/entry?i=255738&s=entrez
- map(HGNC:20001, >>hgnc>>mim) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Emim
- map(HGNC:20001, >>hgnc>>uniprot) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Euniprot
- map(ENSG00000169174, >>ensembl>>transcript) https://sugi.bio/biobtree/api/map?i=ENSG00000169174&m=%3E%3Eensembl%3E%3Etranscript
- map(HGNC:20001, >>hgnc>>refseq) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Erefseq
- map(HGNC:20001, >>hgnc>>ccds) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Eccds
- entry(Q8NBP7, uniprot) https://sugi.bio/biobtree/api/entry?i=Q8NBP7&s=uniprot
- map(ENST00000302118, >>transcript>>exon) https://sugi.bio/biobtree/api/map?i=ENST00000302118&m=%3E%3Etranscript%3E%3Eexon
- map(Q8NBP7, >>uniprot>>interpro) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Einterpro
- map(Q8NBP7, >>uniprot>>pdb) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Epdb
- map(Q8NBP7, >>uniprot>>alphafold) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Ealphafold
- map(ENSG00000169174, >>ensembl>>ortholog) https://sugi.bio/biobtree/api/map?i=ENSG00000169174&m=%3E%3Eensembl%3E%3Eortholog
- map(HGNC:20001, >>hgnc>>clinvar) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Eclinvar
- map(Q8NBP7, >>uniprot>>go) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Ego
- map(Q8NBP7, >>uniprot>>reactome) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Ereactome
- map(HGNC:20001, >>hgnc>>clinvar[germline_classification=="Pathogenic"]) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Eclinvar%5Bgermline_classification%3D%3D%22Pathogenic%22%5D
- map(HGNC:20001, >>hgnc>>clinvar[germline_classification=="Likely pathogenic"]) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Eclinvar%5Bgermline_classification%3D%3D%22Likely%20pathogenic%22%5D
- map(HGNC:20001, >>hgnc>>spliceai) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Espliceai
- map(Q8NBP7, >>uniprot>>alphamissense) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Ealphamissense
- map(Q8NBP7, >>uniprot>>alphamissense[am_class=="likely_pathogenic"]) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Ealphamissense%5Bam_class%3D%3D%22likely_pathogenic%22%5D
- map(Q8NBP7, >>uniprot>>string_interaction) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Estring_interaction
- map(Q8NBP7, >>uniprot>>intact) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Eintact
- map(Q8NBP7, >>uniprot>>chembl_target) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Echembl_target
- map(CHEMBL2929, >>chembl_target>>chembl_molecule) https://sugi.bio/biobtree/api/map?i=CHEMBL2929&m=%3E%3Echembl_target%3E%3Echembl_molecule
- map(Q8NBP7, >>uniprot>>esm2_similarity) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Eesm2_similarity
- map(Q8NBP7, >>uniprot>>diamond_similarity) https://sugi.bio/biobtree/api/map?i=Q8NBP7&m=%3E%3Euniprot%3E%3Ediamond_similarity
- map(ENSG00000169174, >>ensembl>>bgee) https://sugi.bio/biobtree/api/map?i=ENSG00000169174&m=%3E%3Eensembl%3E%3Ebgee
- map(HGNC:20001, >>hgnc>>collectri) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Ecollectri
- map(HGNC:20001, >>hgnc>>gwas) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Egwas
- map(HGNC:20001, >>hgnc>>hpo) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Ehpo
- map(HGNC:20001, >>hgnc>>gencc) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Egencc
- map(ENSG00000169174, >>ensembl>>bgee>>bgee_evidence) https://sugi.bio/biobtree/api/map?i=ENSG00000169174&m=%3E%3Eensembl%3E%3Ebgee%3E%3Ebgee_evidence
- search(evolocumab, chembl_molecule) https://sugi.bio/biobtree/api/search?i=evolocumab&s=chembl_molecule
- search(alirocumab, chembl_molecule) https://sugi.bio/biobtree/api/search?i=alirocumab&s=chembl_molecule
- search(inclisiran, chembl_molecule) https://sugi.bio/biobtree/api/search?i=inclisiran&s=chembl_molecule
- entry(CHEMBL2364655, chembl_molecule) https://sugi.bio/biobtree/api/entry?i=CHEMBL2364655&s=chembl_molecule
- entry(CHEMBL2109540, chembl_molecule) https://sugi.bio/biobtree/api/entry?i=CHEMBL2109540&s=chembl_molecule
- entry(CHEMBL3990033, chembl_molecule) https://sugi.bio/biobtree/api/entry?i=CHEMBL3990033&s=chembl_molecule
- map(HGNC:20001, >>hgnc>>pharmgkb_gene) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Epharmgkb_gene
- map(HGNC:20001, >>hgnc>>clinvar[germline_classification=="Benign"]) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Eclinvar%5Bgermline_classification%3D%3D%22Benign%22%5D
- map(HGNC:20001, >>hgnc>>clinvar[germline_classification=="Likely benign"]) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Eclinvar%5Bgermline_classification%3D%3D%22Likely%20benign%22%5D
- map(HGNC:20001, >>hgnc>>clinvar[germline_classification=="Uncertain significance"]) https://sugi.bio/biobtree/api/map?i=HGNC%3A20001&m=%3E%3Ehgnc%3E%3Eclinvar%5Bgermline_classification%3D%3D%22Uncertain%20significance%22%5D
- entry(PA38617, pharmgkb_gene) https://sugi.bio/biobtree/api/entry?i=PA38617&s=pharmgkb_gene
Read(hugo/content/biobtree/braf-gene-complete-identifier-and-functional-mapping.md)
| Section 3 | - Canonical |
|---|---|
| Protein Identifiers | |
| Q8NBP7 (UniProt reviewed) - 692 aa, 74.3 kDa |
| Section 4 | - 62 PDB structures (X-ray | - AlphaFold model | Section 5 |
|---|---|---|---|
| Structures | |||
| 1.27-7.01Å; 3 Cryo-EM) | pLDDT 85.03 | Orthologs |
| Section 6 | - 1,456 ClinVar variants |
|---|---|
| Clinical Variants & AI Predictions | |
| 9 Pathogenic, 4 Likely Pathogenic, 160+ VUS |
4 Reactome pathways (LDL clearance, IGF transport, VLDLR degradation)
54 GO terms (cholesterol metabolism, LDL receptor binding) Section 8: Interactions
2,840 STRING interactions | 71 IntAct curated interactions
Key partners: LDLR, APOB, APOE, SREBF2, HMGCR Section 9: Transcriptional Regulation
10 upstream TF regulators including SREBF1/2, HNF1A (activators), PPARA, FXR (repressors)
| Section 10 | - 3 FDA-approved PCSK9 inhibitors |
|---|---|
| Drug & Pharmacology | |
| Evolocumab (Repatha), Alirocumab (Praluent), Inclisiran (Leqvio) |
Highest in liver (91.35 score), also gastrointestinal tract, cerebellum
Ubiquitous expression pattern (147 tissues) Section 12: Disease Associations
Definitive link to autosomal dominant hypercholesterolemia type 3
37 HPO phenotypes | 179 GWAS associations (LDL cholesterol: p=3e-257)