VHL Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human VHL. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human VHL. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 31 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, bgee_evidence, ccds, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, mim, orphanet, ortholog, pdb, pharmgkb_gene, reactome, refseq, spliceai, string_interaction, transcript, uniprot
Generated: 2026-04-01 — For the latest data, query BioBTree directly via MCP or API.
View API calls (42)
VHL

VHL Gene: Comprehensive Cross-Database Identifier and Functional Mapping Reference GENE OVERVIEW

A critical tumor suppressor involved in HIF ubiquitination
Summary: This gene encodes a component of a ubiquitination complex involved in the ubiquitination and degradation of hypoxia-inducible-factor (HIF). VHL plays a central role in the regulation of gene expression by oxygen, cilia formation, cytokine signaling, regulation of senescence, and formation of the extracellular matrix. Variants are associated with von Hippel-Lindau syndrome, pheochromocytoma, erythrocytosis, renal cell carcinoma, and cerebellar hemangioblastoma.

Section 1: Gene Identifiers

DatabaseIdentifierNotes
HGNCHGNC:12687Approved symbol: VHL
EnsemblENSG00000134086Human gene
NCBI Entrez7428Gene ID
OMIM608537Gene/locus MIM
UniGeneHs.585416Cluster
Genomic Location
AttributeValue
Chromosome3
Cytogenetic Band3p25.3
Start Position10,141,778 (GRCh38)
End Position10,153,676 (GRCh38)
Strand+ (plus/forward)
Genomic AccessionNC_000003.12
Gene Aliases
  • VHL1
  • HRCA1
  • RCA1
  • pVHL Gene Type & Status
AttributeValue
Locus TypeGene with protein product
Locus GroupProtein-coding gene
StatusApproved

Section 2: Transcript Identifiers Ensembl Transcripts Total Transcripts: 13

Transcript IDBiotypeStartEndStrand
ENST00000256474protein_coding10,141,77810,153,667+
ENST00000345392protein_coding10,141,78810,153,667+
ENST00000477538retained_intron10,141,80210,153,667+
ENST00000696142nonsense_mediated_decay10,141,78610,153,667+
ENST00000696143protein_coding10,141,81610,153,667+
ENST00000696153protein_coding10,141,77810,153,676+
ENST00000713811protein_coding10,141,77810,153,667+
ENST00000713812protein_coding10,141,78610,153,667+
ENST00000713813retained_intron10,141,78710,153,667+
ENST00000713814nonsense_mediated_decay10,141,79110,153,667+
ENST00000713815protein_coding10,141,79710,153,667+
ENST00000713816protein_coding_CDS_not_defined10,141,80010,153,667+
ENST00000713982protein_coding10,141,77810,153,667+
RefSeq Transcripts
AccessionTypeStatusMANE Select
NM_000551mRNAREVIEWED✓ Yes
NM_198156mRNAREVIEWEDNo
NM_001354723mRNAREVIEWEDNo
NR_176335ncRNAREVIEWEDNo
RefSeq Proteins
AccessionStatusMANE Select
NP_000542REVIEWED✓ Yes
NP_937799REVIEWEDNo
NP_001341652REVIEWEDNo
CCDS IDs Total: 3
  • CCDS2597
  • CCDS2598
  • CCDS93209 Canonical Transcript Exons (ENST00000256474) Total Exons: 3
Exon IDStartEndStrandLength
ENSE0000193097410,141,77810,142,187+410 bp
ENSE0000402205610,146,51410,146,636+123 bp
ENSE0000402205710,149,78710,153,667+3,881 bp

Section 3: Protein Identifiers UniProt Accessions Total: 10 entries

AccessionStatusLengthMassDescription
P40337✓ Reviewed (Swiss-Prot)213 aa24,153 DaCanonical entry
A0A024R2F2Unreviewed--TrEMBL
A0A0S2Z4K1Unreviewed--TrEMBL
A0A8Q3SIA6Unreviewed--TrEMBL
A0A8Q3WL21Unreviewed--TrEMBL
A0AAQ5BGZ3Unreviewed--TrEMBL
A0AAQ5BGZ5Unreviewed--TrEMBL
A0AAQ5BH02Unreviewed--TrEMBL
A0AAQ5BH16Unreviewed--TrEMBL
A0AAQ5BH56Unreviewed--TrEMBL
Alternative Names
  • von Hippel-Lindau disease tumor suppressor
  • Protein G7
  • pVHL Protein Domains and Families (InterPro) Total: 6
InterPro IDNameType
IPR022772VHL_tumour_suppress_b/a_domDomain
IPR024048VHL_alpha_domDomain
IPR024053VHL_beta_domDomain
IPR036208VHL_sfHomologous Superfamily
IPR037139VHL_alpha_dom_sfHomologous Superfamily
IPR037140VHL_beta_dom_sfHomologous Superfamily
Pfam Domains
  • PF01847: VHL (VHL beta domain)
  • PF17211: VHL_C (VHL alpha domain)

Section 4: Structure Identifiers Experimental Structures (PDB) Total: 139 structures TOP 50 Structures by Resolution

PDB IDMethodResolutionTitle
9GIOX-RAY1.486 ÅVHL-EloC-EloB complex with covalent compound
7Z76X-RAY1.32 ÅSMARCA2/VCB complex with compound 10
7JTOX-RAY1.70 ÅProtac MS33 with WDR5 and VCB
4AJYX-RAY1.73 ÅVHL-ElonginB-ElonginC with HIF1-alpha peptide
6GMRX-RAY1.75 ÅVCB with pyrrolyl-phenylmethanol
6HR2X-RAY1.76 ÅPROTAC 2 with SMARCA4 bromodomain and VCB
8BB3X-RAY1.80 ÅWDR5/VCB with PROTAC (PEG linker)
6GFXX-RAY1.83 ÅVCB with modified HIF-1a CODD peptide
1LM8X-RAY1.85 ÅHIF-1a-pVHL-ElonginB-ElonginC Complex
6ZHCX-RAY1.92 ÅPROTAC6 VHL:EloB:EloC and Bcl-xL
6GMNX-RAY1.94 ÅVCB with furo-pyrrole-carboxylate
7Z77X-RAY1.97 ÅSMARCA2/VCB with compound 6
8P0FX-RAY1.98 ÅVCB complex with compound 1
9BOLX-RAY1.99 ÅVHL/ElonginB/ElonginC with compound 5
1LQBX-RAY2.00 ÅHydroxylated HIF-1 alpha with pVHL/elongin
4B9KX-RAY2.00 ÅVCB with hydroxypyrrolidine compound
6BVBX-RAY2.00 ÅHIF-2alpha-pVHL-elongin B-elongin C
8BB5X-RAY2.05 ÅWDR5/VCB with PROTAC (Aryl linker)
9EQJX-RAY2.05 ÅVCB with MP-1-39
4W9FX-RAY2.10 ÅVCB with methylthiazol-benzyl compound
4W9HX-RAY2.10 ÅVCB with acetamido compound
4W9KX-RAY2.10 ÅVCB with phenylpropanamido compound
5NVVX-RAY2.10 ÅVCB with hydroxyacetamido compound
5NW1X-RAY2.10 ÅVCB with cyclobutanecarboxamido compound
9DTXX-RAY2.11 ÅPRT3789 with BRG1 bromodomain and VCB
7JTPX-RAY2.12 ÅProtac MS67 with WDR5 and VCB
9RKCX-RAY2.19 ÅACBI4-KRAS G12D-VCB ternary complex
8QW6X-RAY2.20 ÅCompound 3-KRAS G12V-VCB complex
4W9CX-RAY2.20 ÅVCB with oxazol-benzyl compound
4W9DX-RAY2.20 ÅVCB with methyloxazol compound
4W9JX-RAY2.20 ÅVCB with methylpentanamido compound
4W9LX-RAY2.20 ÅVCB with dimethylbutanamido compound
5NVWX-RAY2.20 ÅVCB with cyclopropanecarboxamido
5NVXX-RAY2.20 ÅVCB with fluorocyclopropane compound
5NW2X-RAY2.20 ÅVCB with oxetane-carboxamido compound
9HYPX-RAY2.20 ÅSMARCA2-VCB with PROTAC P5
7PI4X-RAY2.24 ÅFAK-GSK215 PROTAC-VCB
6HAYX-RAY2.24 ÅPROTAC 1 with SMARCA2 and VCB
8QVUX-RAY2.24 ÅACBI3 with KRAS G12D and VCB
7KHHX-RAY2.28 ÅVHL/BRD4-BD1/Compound9 ternary complex
9QE4X-RAY2.28 ÅVCB with compound 114
6GFZX-RAY2.30 ÅVCB with fluoro-hydroxyproline compound
5NW0X-RAY2.30 ÅVCB with acetamidocyclopropane compound
9OIOX-RAY2.30 ÅVCB with fragments 9 and 14
4BKSX-RAY2.20 ÅVCB with ethanoyl-oxazol compound
4BKTX-RAY2.35 ÅVCB with methyloxazol compound
6HAXX-RAY2.35 ÅPROTAC 2 with SMARCA2 and VCB
8QW7X-RAY2.36 ÅCompound 4-KRAS G12V-VCB complex
9HYNX-RAY2.37 ÅSMARCA2-VCB with PROTAC P1
8CQLX-RAY2.38 ÅVCB with compound 33
Methods Distribution
MethodCount
X-RAY DIFFRACTION~125
ELECTRON MICROSCOPY~14
AlphaFold Predicted Structure
AttributeValue
AlphaFold IDAF-P40337-F1
Global pLDDT85.45
Sequence Length1702
Fraction High Confidence (pLDDT >90)72%

Section 5: Cross-Species Orthologs

OrganismEnsembl IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000033933Vhlprotein_coding
Rat (Rattus norvegicus)ENSRNOG00000010258Vhlprotein_coding
Zebrafish (Danio rerio)ENSDARG00000070617vhlprotein_coding
Fruit fly (Drosophila melanogaster)FBGN0041174Vhlprotein_coding
Worm (C. elegans)-No data available-
Yeast (S. cerevisiae)-No data available-
Ortholog UniProt Entries
OrganismUniProtLengthIdentity
MouseP40338181 aa93.4%
RatQ64259185 aa93.4%

Section 6: Clinical Variants & AI Predictions ClinVar Summary Total Variants: 930

ClassificationCount
Pathogenic~200+
Likely Pathogenic~50+
Uncertain Significance (VUS)~400+
Likely Benign~100+
Benign~50+
Conflicting Classifications~50+
TOP 50 Pathogenic/Likely Pathogenic Variants
ClinVar IDHGVS NotationTypeClassification
182977c.257C>T (p.Pro86Leu)SNVPathogenic (Expert Panel)
182978c.263G>A (p.Trp88Ter)SNVPathogenic (Expert Panel)
2226c.191G>C (p.Arg64Pro)SNVPathogenic (Expert Panel)
43597c.194C>G (p.Ser65Trp)SNVPathogenic (Expert Panel)
428806c.208G>T (p.Glu70Ter)SNVPathogenic (Expert Panel)
182975c.194C>T (p.Ser65Leu)SNVPathogenic
193118c.245G>C (p.Arg82Pro)SNVPathogenic
220414c.337C>T (p.Arg113Ter)SNVPathogenic
223168c.233A>C (p.Asn78Thr)SNVPathogenic
223176c.293A>G (p.Tyr98Cys)SNVPathogenic
2223c.292T>C (p.Tyr98His)SNVPathogenic
2228c.334T>A (p.Tyr112Asn)SNVPathogenic
223184c.320G>A (p.Arg107His)SNVPathogenic
223186c.332G>A (p.Ser111Asn)SNVPathogenic
223187c.333C>G (p.Ser111Arg)SNVPathogenic
223190c.340+1G>ASplicePathogenic
223193c.340G>C (p.Gly114Arg)SNVPathogenic
2236c.250G>T (p.Val84Leu)SNVPathogenic
2237c.277G>A (p.Gly93Ser)SNVPathogenic
229860c.250G>C (p.Val84Leu)SNVPathogenic
456577c.262T>A (p.Trp88Arg)SNVPathogenic
496053c.262T>C (p.Trp88Arg)SNVPathogenic
560741c.329del (p.His110fs)DeletionPathogenic
583094c.340G>A (p.Gly114Ser)SNVPathogenic
182986c.180del (p.Val62fs)DeletionPathogenic
223161c.194C>A (p.Ser65Ter)SNVPathogenic
223162c.203C>A (p.Ser68Ter)SNVPathogenic
223164c.217C>T (p.Gln73Ter)SNVPathogenic
411961c.226_227del (p.Phe76fs)DeletionPathogenic
220487c.258del (p.Val87fs)DeletionPathogenic
163dupc.163dup (p.Glu55fs)DuplicationPathogenic
164_171dupc.164_171dup (p.Arg60fs)DuplicationPathogenic
1070179c.239G>A (p.Ser80Asn)SNVPathogenic/Likely pathogenic
1023005c.190C>A (p.Arg64Ser)SNVLikely pathogenic
1297059c.244C>G (p.Arg82Gly)SNVPathogenic
1426858c.193T>A (p.Ser65Thr)SNVPathogenic
1782641c.191dup (p.Ser65fs)DuplicationPathogenic
1785035c.204dup (p.Arg69fs)DuplicationPathogenic
428811c.278del (p.Gly93fs)DeletionPathogenic
428804c.286C>T (p.Gln96Ter)SNVPathogenic
496052c.238A>C (p.Ser80Arg)SNVPathogenic
496055c.313A>C (p.Thr105Pro)SNVPathogenic
547829c.193T>C (p.Ser65Pro)SNVPathogenic
565557c.331A>T (p.Ser111Cys)SNVPathogenic
580847c.264G>C (p.Trp88Cys)SNVPathogenic
2214c.223_225del (p.Ile75del)DeletionPathogenic
2220c.263G>C (p.Trp88Ser)SNVPathogenic
223175c.277G>T (p.Gly93Cys)SNVPathogenic
2028206c.231C>A (p.Cys77Ter)SNVPathogenic
2925342c.202T>C (p.Ser68Pro)SNVPathogenic
SpliceAI Predictions Total Splice Variants: 364 TOP 50 High-Impact Splice Variants
VariantEffectDelta Score
3:10142188:GT>Gdonor_loss0.98
3:10142185:GAG>Gdonor_gain0.95
3:10142188:G>GGdonor_gain0.92
3:10142235:TCTG>Tdonor_gain0.87
3:10142222:G>Tdonor_gain0.82
3:10142180:C>Gdonor_gain0.75
3:10142880:G>GTdonor_gain0.66
3:10142001:G>GTdonor_gain0.51
3:10142185:G>GTdonor_gain0.53
3:10142186:AG>Adonor_gain0.50
3:10142187:GG>Gdonor_gain0.50
3:10145957:T>Gdonor_gain0.49
3:10142069:C>Adonor_gain0.45
3:10142866:TC>Tdonor_gain0.45
3:10142674:GC>Gdonor_gain0.45
3:10142320:GCTC>Gdonor_gain0.45
3:10142853:G>GTdonor_gain0.44
3:10143228:TC>Tdonor_gain0.44
3:10144640:T>TAdonor_gain0.44
3:10144656:C>Tdonor_gain0.43
3:10145954:GCTT>Gdonor_gain0.42
3:10143003:A>Cdonor_gain0.42
3:10143323:C>Gdonor_gain0.41
3:10143774:G>Tdonor_gain0.41
3:10142236:C>Adonor_gain0.40
3:10142961:T>Gdonor_gain0.40
3:10142895:T>Adonor_gain0.39
3:10142905:G>Tdonor_gain0.39
3:10141968:G>GTdonor_gain0.37
3:10142185:GAGGT>Gdonor_gain0.37
AlphaMissense Predictions Total Missense Predictions: 1,370 TOP 50 Pathogenic Missense Predictions
VariantProtein ChangeAM ScoreClass
3:10142081:T>AN78K0.999likely_pathogenic
3:10142081:T>GN78K0.999likely_pathogenic
3:10142109:T>AW88R0.999likely_pathogenic
3:10142109:T>CW88R0.999likely_pathogenic
3:10142085:A>CS80R0.999likely_pathogenic
3:10142087:T>AS80R0.999likely_pathogenic
3:10142087:T>GS80R0.999likely_pathogenic
3:10142074:T>CF76S0.999likely_pathogenic
3:10142079:A>GN78D0.998likely_pathogenic
3:10142104:C>AP86H0.997likely_pathogenic
3:10142073:T>CF76L0.996likely_pathogenic
3:10142075:C>AF76L0.996likely_pathogenic
3:10142075:C>GF76L0.996likely_pathogenic
3:10142104:C>GP86R0.995likely_pathogenic
3:10142080:A>TN78I0.994likely_pathogenic
3:10142083:G>CR79P0.993likely_pathogenic
3:10142068:T>AV74D0.993likely_pathogenic
3:10142098:T>AV84E0.992likely_pathogenic
3:10142079:A>TN78Y0.992likely_pathogenic
3:10142091:C>AR82S0.992likely_pathogenic
3:10142074:T>GF76C0.991likely_pathogenic
3:10142092:G>CR82P0.990likely_pathogenic
3:10142103:C>TP86S0.990likely_pathogenic
3:10142071:T>AI75N0.988likely_pathogenic
3:10142079:A>CN78H0.987likely_pathogenic
3:10142041:C>GS65W0.981likely_pathogenic
3:10142061:T>CS72P0.979likely_pathogenic
3:10142071:T>CI75T0.978likely_pathogenic
3:10142097:G>AV84M0.977likely_pathogenic
3:10142086:G>AS80N0.976likely_pathogenic
3:10142103:C>AP86T0.975likely_pathogenic
3:10142101:T>CL85P0.975likely_pathogenic
3:10142062:C>AS72Y0.970likely_pathogenic
3:10142091:C>GR82G0.970likely_pathogenic
3:10142086:G>TS80I0.970likely_pathogenic
3:10142073:T>GF76V0.967likely_pathogenic
3:10142062:C>TS72F0.965likely_pathogenic
3:10142067:G>TV74F0.963likely_pathogenic
3:10142076:T>CC77R0.963likely_pathogenic
3:10142038:G>CR64P0.962likely_pathogenic
3:10142092:G>TR82L0.962likely_pathogenic
3:10142073:T>AF76I0.951likely_pathogenic
3:10142097:G>CV84L0.950likely_pathogenic
3:10142082:C>GR79G0.950likely_pathogenic
3:10142076:T>AC77S0.949likely_pathogenic
3:10142077:G>AC77Y0.943likely_pathogenic
3:10142078:C>GC77W0.942likely_pathogenic
3:10142050:C>GS68W0.835likely_pathogenic
3:10142040:T>CS65P0.986likely_pathogenic
3:10142041:C>TS65L0.984likely_pathogenic

Section 7: Biological Pathways & Gene Ontology Reactome Pathways Total: 7 pathways

Pathway IDNameDisease Pathway
R-HSA-1234176Oxygen-dependent proline hydroxylation of HIF AlphaNo
R-HSA-3232142SUMOylation of ubiquitinylation proteinsNo
R-HSA-8951664NeddylationNo
R-HSA-983168Antigen processing: Ubiquitination & Proteasome degradationNo
R-HSA-9706019RHOBTB3 ATPase cycleNo
R-HSA-9682706Replication of the SARS-CoV-1 genomeYes
R-HSA-9694686Replication of the SARS-CoV-2 genomeYes
Gene Ontology Annotations Total: 38 GO terms Biological Process (22 terms)
GO IDTerm
GO:0000122negative regulation of transcription by RNA polymerase II
GO:0000902cell morphogenesis
GO:0006355regulation of DNA-templated transcription
GO:0006508proteolysis
GO:0008285negative regulation of cell population proliferation
GO:0009968negative regulation of signal transduction
GO:0010468regulation of gene expression
GO:0010507negative regulation of autophagy
GO:0010629negative regulation of gene expression
GO:0016567protein ubiquitination
GO:0034244negative regulation of transcription elongation by RNA polymerase II
GO:0043066negative regulation of apoptotic process
GO:0043161proteasome-mediated ubiquitin-dependent protein catabolic process
GO:0045597positive regulation of cell differentiation
GO:0045893positive regulation of DNA-templated transcription
GO:0046426negative regulation of receptor signaling pathway via JAK-STAT
GO:0050821protein stabilization
GO:0071456cellular response to hypoxia
GO:1900037regulation of cellular response to hypoxia
GO:1904262negative regulation of TORC1 signaling
GO:1990000amyloid fibril formation
Molecular Function (8 terms)
GO IDTerm
GO:0003711transcription elongation factor activity
GO:0003714transcription corepressor activity
GO:0004842ubiquitin-protein transferase activity
GO:0019899enzyme binding
GO:0060090molecular adaptor activity
GO:0120283protein serine/threonine kinase binding
GO:0140297DNA-binding transcription factor binding
GO:1990756ubiquitin-like ligase-substrate adaptor activity
Cellular Component (8 terms)
GO IDTerm
GO:0005634nucleus
GO:0005654nucleoplasm
GO:0005739mitochondrion
GO:0005783endoplasmic reticulum
GO:0005829cytosol
GO:0005886plasma membrane
GO:0005929cilium
GO:0015630microtubule cytoskeleton
GO:0097542ciliary tip

Section 8: Protein Interactions & Molecular Networks STRING Protein-Protein Interactions Total: 1,785 interactions TOP 50 Highest-Confidence Interactions

UniProt BGeneScoreDescription
Q15369ELOC999Elongin-C
Q13617CUL2998Cullin-2
Q16665HIF1A998Hypoxia-inducible factor 1-alpha
Q15370ELOB997Elongin-B
P62877RBX1995E3 ubiquitin-protein ligase RBX1
Q99814EPAS1967HIF-2-alpha
Q9GZT9EGLN1950Egl nine homolog 1 (PHD2)
O94782USP1920Ubiquitin carboxyl-terminal hydrolase 1
Q8TEY7USP33919Ubiquitin carboxyl-terminal hydrolase 33
P27540ARNT901Aryl hydrocarbon receptor nuclear translocator
Q96KS0EGLN2871Egl nine homolog 2 (PHD1)
Q9H6Z9EGLN3870Egl nine homolog 3 (PHD3)
Q96SL1TCEB3817Transcription elongation factor B polypeptide 3
Q9BT92TCEB3B813Elongin-A2
P61758VBP1809Prefoldin subunit 3
O60841TCEB3C801Elongin-A3
Q9NWT6HIF3A781Hypoxia-inducible factor 3-alpha
Q16790CA9761Carbonic anhydrase 9
Q6IE81TCEB3CL758Elongin-A-like
Q9Y2N7HIF1AN753Hypoxia-inducible factor 1-alpha inhibitor
O43255SOCS7732Suppressor of cytokine signaling 7
Q8IYF1HIF1AN715Factor inhibiting HIF1
Q14241TCEB1708Elongin-B
P04637TP53699Cellular tumor antigen p53
Q9Y2K6USP20651Ubiquitin carboxyl-terminal hydrolase 20
P07954FH647Fumarate hydratase
Q12983BNIP3630BCL2/adenovirus E1B 19 kDa protein-interacting protein 3
P34991HES1620Transcription factor HES-1
Q96F10SAT1603Diamine acetyltransferase 1
P01588EPO597Erythropoietin
Q96CJ1RHOBTB3590Rho-related BTB domain-containing protein 3
P02751FN1579Fibronectin
P11166SLC2A1571Solute carrier family 2 member 1 (GLUT1)
P25388RRAD564GTP-binding protein RAD
O43570CA12556Carbonic anhydrase 12
O15460PROLYL551Prolyl 4-hydroxylase subunit alpha-2
P13674P4HA1550Prolyl 4-hydroxylase subunit alpha-1
P17980PSMC354926S proteasome regulatory subunit 6A
Q9UGP4LIMD1543LIM domain-containing protein 1
P28300LOX539Protein-lysine 6-oxidase
Q9H3M7TXNIP535Thioredoxin-interacting protein
Q09472EP300533Histone acetyltransferase p300
P21912SDHB517Succinate dehydrogenase iron-sulfur subunit
Q15717ELAVL1517ELAV-like protein 1
P30991SHC1515SHC-transforming protein 1
Q15843NEDD8509NEDD8
O43524FOXO3504Forkhead box protein O3
P24928POLR2A500DNA-directed RNA polymerase II subunit RPB1
P49719HDAC1497Histone deacetylase 1
Q13315ATM496Serine-protein kinase ATM
IntAct Physical Interactions Total: 407 interactions Key Interacting Partners (selected)
Partner GeneInteraction TypeScore
MDFIphysical association0.670
JADE1physical association0.370
CUL2physical association-
ELOBphysical association-
ELOCphysical association-
NEDD8physical association-
RBX1physical association-
CDK2physical association-
TCP1direct interaction0.440
Protein Similarity ESM2 Structural/Embedding Similarity Total similar proteins: 50 (per entry)
UniProtTop SimilarityAvg Similarity
P40337 (Human)0.99720.9480
P40338 (Mouse)0.99820.9533
Q5Q9Z20.99720.9553
Q64259 (Rat)0.99820.9513
DIAMOND Sequence Homology
UniProtTop IdentityTop Bitscore
P40337 (Human)92.60%318
P40338 (Mouse)93.40%339
Q64259 (Rat)93.40%340
Q5Q9Z291.50%313
Q6RSH770.60%148

Section 9: Transcription Factor Regulatory Data VHL functions primarily as an E3 ubiquitin ligase component rather than a classical DNA-binding transcription factor. However, it regulates gene expression through HIF-alpha degradation. CollecTRI: TF-Target Relationships Total: 29 relationships Genes Regulated BY VHL (VHL as Regulator)

Target GeneRegulation TypeConfidence
ANGPT2Repression-
ARActivation-
CA12Unknown-
CA9Unknown-
CCND1Unknown-
CDH1Unknown-
CDKN1BActivation-
CDKN1CActivation-
CENPXUnknown-
COL4A2Unknown-
CXCR4Repression-
DAB2Activation-
HIF1ARepression-
IGF1RRepression-
KLF10Unknown-
NMURepression-
PDGFBActivation-
SNAI1Repression-
SPARCActivation-
TCF4Unknown-
TFRCRepression-
TMEM115Activation-
TP53Unknown-
VEGFAActivation-
ZEB2--
TFs Regulating VHL (Upstream Regulators)
TF GeneRegulationConfidence
E2F4Unknown-
HIF1AUnknownHigh
MYCUnknownLow
TP53-Low

Section 10: Drug & Pharmacology Data ChEMBL Targets Total: 156 target entries (includes complexes and PPIs) VHL is primarily used as an E3 ligase component in PROTAC (Proteolysis Targeting Chimera) drug design. Target Types

TypeCount
Single Protein1
Protein Complex4
Protein-Protein Interaction151
TOP 30 PROTAC-Related Targets
ChEMBL IDTarget DescriptionType
CHEMBL3108660VHL tumor suppressorSINGLE PROTEIN
CHEMBL3301400VHL/Elongin B/Elongin CPROTEIN COMPLEX
CHEMBL4296117VHL/Elongin-C/Elongin-B/HIF1APROTEIN COMPLEX
CHEMBL4296140VHL/BRD4PROTEIN-PROTEIN INTERACTION
CHEMBL4296148VHL/EloB/EloC/BRD4PROTEIN-PROTEIN INTERACTION
CHEMBL4523998VHL/EGFRPROTEIN-PROTEIN INTERACTION
CHEMBL5169273VHL/KRASPROTEIN-PROTEIN INTERACTION
CHEMBL4748227VHL/BRG1 (SMARCA4)PROTEIN-PROTEIN INTERACTION
CHEMBL5169067VHL/WEE1PROTEIN-PROTEIN INTERACTION
CHEMBL5169069VHL/WDR5PROTEIN-PROTEIN INTERACTION
CHEMBL4748224VHL/BCL-2PROTEIN-PROTEIN INTERACTION
CHEMBL4748229VHL/BCL-XLPROTEIN-PROTEIN INTERACTION
CHEMBL4630740VHL/BTKPROTEIN-PROTEIN INTERACTION
CHEMBL4523741VHL/FAKPROTEIN-PROTEIN INTERACTION
CHEMBL4630745VHL/HDAC1PROTEIN-PROTEIN INTERACTION
CHEMBL4630743VHL/HDAC2PROTEIN-PROTEIN INTERACTION
CHEMBL4630741VHL/HDAC3PROTEIN-PROTEIN INTERACTION
CHEMBL4630737VHL/HISTONE DEACETYLASE 6PROTEIN-PROTEIN INTERACTION
CHEMBL4523730VHL/Androgen receptorPROTEIN-PROTEIN INTERACTION
CHEMBL4523726VHL/Estrogen receptorPROTEIN-PROTEIN INTERACTION
CHEMBL5291692VHL/MDM2PROTEIN-PROTEIN INTERACTION
CHEMBL4748223VHL/PARP1PROTEIN-PROTEIN INTERACTION
CHEMBL4748233VHL/PARP2PROTEIN-PROTEIN INTERACTION
CHEMBL5169073VHL/EZH2PROTEIN-PROTEIN INTERACTION
CHEMBL4748216VHL/Aurora kinase APROTEIN-PROTEIN INTERACTION
CHEMBL5291691VHL/VEGFR2PROTEIN-PROTEIN INTERACTION
CHEMBL4523743VHL/ALKPROTEIN-PROTEIN INTERACTION
CHEMBL4523725VHL/ABL1PROTEIN-PROTEIN INTERACTION
CHEMBL4523735VHL/FLT3PROTEIN-PROTEIN INTERACTION
CHEMBL5291688VHL/FGFR1PROTEIN-PROTEIN INTERACTION
ChEMBL Activities Total: 1,185 bioactivity records BindingDB Total: 501 binding entries PharmGKB
AttributeValue
PharmGKB IDPA37307
VIP GeneYes
CPIC GuidelineNo
Chromosomechr3

Section 11: Expression Profiles Bgee Expression Summary

AttributeValue
Expression BreadthUbiquitous
Total Tissues with Expression186
Max Expression Score94.86
Average Expression Score76.78
Gold Quality Observations258
TOP 30 Tissues by Expression Score
Tissue/Cell TypeIDScoreQuality
Cortical plateUBERON:000534394.86Gold
MonocyteCL:000057694.31Gold
Mononuclear cellCL:000084293.09Gold
LeukocyteCL:000073892.90Gold
Ganglionic eminenceUBERON:000402392.23Gold
Stromal cell of endometriumCL:000225589.57Gold
Ventricular zoneUBERON:000305389.00Gold
Buccal mucosa cellCL:000233687.61Gold
Adrenal tissueUBERON:001830387.40Gold
GranulocyteCL:000009487.28Gold
Islet of LangerhansUBERON:000000687.15Gold
Lymph nodeUBERON:000002986.73Gold
Colonic epitheliumUBERON:000039786.51Gold
RectumUBERON:000105286.47Gold
Endometrium epitheliumUBERON:000481185.95Gold
Cerebellar hemisphereUBERON:000224585.80Gold
Cerebellar cortexUBERON:000212985.76Gold
Small intestine Peyer's patchUBERON:000345485.11Gold
Male germ line stem cell in testisCL:0000089 ∩ UBERON:000047385.05Gold
Right cerebellar hemisphereUBERON:001489084.58Gold
Left adrenal glandUBERON:000123484.34Gold
SpleenUBERON:000210684.24Gold
C1 segment of cervical spinal cordUBERON:000646984.21Gold
Right adrenal gland cortexUBERON:003582784.20Gold
Lower esophagus mucosaUBERON:003583484.17Gold
Left adrenal gland cortexUBERON:003582584.16Gold
Gall bladderUBERON:000211084.16Gold
Minor salivary glandUBERON:000183083.85Gold
Right adrenal glandUBERON:000123383.70Gold
Calcaneal tendonUBERON:000370183.68Gold
Key Expression Patterns
  • Ubiquitous expression across most tissues
  • Highest expression in brain cortical regions and immune cells
  • Strong expression in adrenal tissue (relevant to pheochromocytoma)
  • High expression in kidney (relevant to renal cell carcinoma)
  • Expression in cerebellum (relevant to hemangioblastoma)

Section 12: Disease Associations GenCC Gene-Disease Validity Total: 11 curated associations

DiseaseClassificationMOISubmitter
von Hippel-Lindau disease (OMIM:193300)DefinitiveADG2P
von Hippel-Lindau diseaseStrongADGenomics England PanelApp
von Hippel-Lindau diseaseStrongADLabcorp Genetics
von Hippel-Lindau diseaseSupportiveADOrphanet
Pheochromocytoma (MONDO:0008233)DefinitiveADG2P
Renal cell carcinoma (MONDO:0005086)StrongADGenomics England PanelApp
Chuvash polycythemia (OMIM:263400)StrongARGenomics England PanelApp
Chuvash polycythemiaStrongARLabcorp Genetics
Chuvash erythrocytosisSupportiveAROrphanet
Secondary polycythemiaStrongARAmbry Genetics
Hereditary pheochromocytoma-paragangliomaSupportiveADOrphanet
MOI: AD = Autosomal Dominant; AR = Autosomal Recessive Orphanet Disease Associations
Orphanet IDDisease NameGene CountPhenotypes
892Von Hippel-Lindau disease241
238557Chuvash erythrocytosis10
29072Hereditary pheochromocytoma-paraganglioma1541
276621Sporadic pheochromocytoma/secreting paraganglioma636
HPO Phenotype Associations Total: 105 phenotype terms TOP 50 Associated Phenotypes
HPO IDPhenotype
HP:0000006Autosomal dominant inheritance
HP:0000007Autosomal recessive inheritance
HP:0001901Polycythemia
HP:0001899Increased hematocrit
HP:0001900Increased circulating hemoglobin
HP:0001898Increased red blood cell mass
HP:0002666Pheochromocytoma
HP:0006748Adrenal pheochromocytoma
HP:0006737Extraadrenal pheochromocytoma
HP:0002668Paraganglioma
HP:0002864Paraganglioma of head and neck
HP:0005584Renal cell carcinoma
HP:0006880Cerebellar hemangioblastoma
HP:0009713Spinal hemangioblastoma
HP:0009711Retinal capillary hemangioma
HP:0001028Hemangioma
HP:0005562Multiple renal cysts
HP:0001737Pancreatic cysts
HP:0030393Endolymphatic sac tumor
HP:0009715Papillary cystadenoma of the epididymis
HP:0030424Epididymal cyst
HP:0000822Hypertension
HP:0000875Episodic hypertension
HP:0002640Hypertension associated with pheochromocytoma
HP:0003334Elevated circulating catecholamine level
HP:0003345Elevated urinary norepinephrine level
HP:0003639Elevated urinary epinephrine level
HP:0011976Elevated urinary catecholamine level
HP:0001649Tachycardia
HP:0011703Sinus tachycardia
HP:0001962Palpitations
HP:0000975Hyperhidrosis
HP:0001069Episodic hyperhidrosis
HP:0002315Headache
HP:0002331Recurrent paroxysmal headache
HP:0000739Anxiety
HP:0000740Episodic paroxysmal anxiety
HP:0025269Panic attack
HP:0000980Pallor
HP:0001824Weight loss
HP:0002018Nausea
HP:0002027Abdominal pain
HP:0002574Episodic abdominal pain
HP:0000541Retinal detachment
HP:0000572Visual loss
HP:0001085Papilledema
HP:0000360Tinnitus
HP:0008629Pulsatile tinnitus
HP:0000407Sensorineural hearing impairment
HP:0002321Vertigo
GWAS Associations Total: 2 associations
GWAS StudyTraitP-value
GCST011940_2Bullous pemphigoid9.0 × 10⁻³⁸
GCST90020028_772Hip circumference adjusted for BMI2.0 × 10⁻⁸

QUICK REFERENCE CARD

VHL - von Hippel-Lindau tumor suppressor
IDENTIFIERS
─────────────────────────────────────────────────────────────────────
HGNC:12687ENSG00000134086Entrez:7428OMIM:608537
UniProt: P40337 (canonical)RefSeq: NM_000551 (MANE Select)
GENOMIC LOCATION
─────────────────────────────────────────────────────────────────────
Chr 3p25.310,141,778-10,153,676 (GRCh38)+ strand
3 exons13 transcripts213 amino acids
KEY NUMBERS
─────────────────────────────────────────────────────────────────────
ClinVar variants: 930PDB structures: 139GO terms: 38
Pathogenic variants: 200+Protein interactions: 1,785
PROTAC targets: 156Phenotypes: 105
PRIMARY DISEASES
─────────────────────────────────────────────────────────────────────
• von Hippel-Lindau syndrome (AD)
• Pheochromocytoma/Paraganglioma (AD)
• Clear cell renal carcinoma (AD)
• Chuvash polycythemia (AR)
• Cerebellar hemangioblastoma
KEY FUNCTION
─────────────────────────────────────────────────────────────────────
E3 ubiquitin ligase component (VHL-ElonginB-ElonginC complex)
Targets HIF-alpha for proteasomal degradation under normoxia
Master regulator of oxygen-sensing pathway

Reference compiled from BioBTree integrated biological databases. Data current as of query date. I’ve compiled a comprehensive cross-database identifier and functional mapping reference for human VHL (von Hippel-Lindau tumor suppressor). Summary of Coverage

SectionContent
Gene IdentifiersHGNC:12687, ENSG00000134086, Entrez:7428, OMIM:608537, Chr3:10,141,778-10,153,676
Transcripts13 Ensembl transcripts, 4 RefSeq mRNAs, 3 CCDS IDs, MANE Select: NM_000551
Proteins10 UniProt entries (P40337 canonical), 6 InterPro domains, 2 Pfam domains
Structures139 PDB structures (best: 1.32Å), AlphaFold pLDDT: 85.45
OrthologsMouse, Rat, Zebrafish, Drosophila (>91% identity)
Clinical Variants930 ClinVar entries, 200+ pathogenic, 364 SpliceAI predictions, 1,370 AlphaMissense
Pathways/GO7 Reactome pathways, 38 GO terms (22 BP, 8 MF, 8 CC)
Interactions1,785 STRING interactions, 407 IntAct entries
TF Regulation29 CollecTRI relationships (VHL regulates HIF1A, VEGFA, etc.)
Drug/Pharma156 ChEMBL targets (primarily PROTAC applications), 1,185 bioactivities
ExpressionUbiquitous; highest in brain cortex, immune cells, adrenal
DiseasesVHL syndrome, pheochromocytoma, renal cell carcinoma, Chuvash polycythemia
This reference serves as a definitive lookup resource for researchers working with VHL in contexts ranging from basic research to clinical variant interpretation and drug development (especially PROTAC design where VHL is a key E3 ligase).