Esophageal Cancer: GWAS to Drug Target Druggability Analysis

Perform a comprehensive GWAS-to-drug-target druggability analysis for Esophageal Cancer. Trace genetic associations through variants, genes, and …

Perform a comprehensive GWAS-to-drug-target druggability analysis for Esophageal Cancer. Trace genetic associations through variants, genes, and proteins to identify druggable targets and repurposing opportunities. Do NOT read any existing files in this directory. Do NOT use any claude.ai MCP tools (ChEMBL etc). Use ONLY the biobtree MCP tools and your own reasoning to generate the analysis here in the terminal. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: DISEASE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Find all database identifiers for Esophageal Cancer: MONDO, EFO, OMIM, Orphanet, MeSH ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: GWAS LANDSCAPE ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Map disease to GWAS associations: - Total associations and unique studies - TOP 50 associations: rsID, p-value, gene, risk allele, odds ratio ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: VARIANT DETAILS (dbSNP) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ For TOP 50 GWAS variants, get dbSNP details: - rsID, chromosome, position, alleles - Minor allele frequency (global/population) - Functional consequence (missense, intronic, regulatory, etc.) Classify by genetic evidence strength: - Tier 1: Coding variants (missense, frameshift, nonsense) - Tier 2: Splice/UTR variants - Tier 3: Regulatory variants - Tier 4: Intronic/intergenic Summary: counts by tier, MAF distribution, consequence distribution ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: MENDELIAN DISEASE OVERLAP ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Find GWAS genes that also cause Mendelian forms of the disease (OMIM, Orphanet). Genes with BOTH GWAS + Mendelian evidence = highest confidence targets. List: Gene, GWAS p-value, Mendelian disease, inheritance pattern ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: GWAS GENES TO PROTEINS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Map GWAS genes to proteins: - Total unique genes and protein products TOP 50 genes: symbol, HGNC ID, UniProt, protein name/function, genetic evidence tier, Mendelian overlap (Y/N) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: PROTEIN FAMILY CLASSIFICATION ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Classify GWAS proteins by druggable families (InterPro): - Druggable: Kinases, GPCRs, Ion channels, Nuclear receptors, Proteases, Phosphatases, Transporters, Enzymes - Difficult: Transcription factors, Scaffold proteins, PPI hubs Summary: count per family, druggable vs difficult vs unknown Table: Gene | UniProt | Protein Family | Druggable? | Notes ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: EXPRESSION CONTEXT ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Check tissue and single-cell expression for GWAS genes. Identify disease-relevant tissues/cell types for Esophageal Cancer. Analysis: - Which tissues/cell types highly express GWAS genes? - Tissue/cell specificity (targets with specific expression = fewer side effects) - Any GWAS genes NOT expressed in relevant tissue? (lower confidence) Table TOP 30: Gene | Tissues | Cell Types | Specificity ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Map protein interactions among GWAS genes (STRING, BioGRID, IntAct). Analysis: - Do GWAS genes interact with each other? (pathway clustering) - Hub genes with many interactions - UNDRUGGED GWAS genes that interact with DRUGGED genes (indirect druggability) Table: Undrugged Gene | Interacts With | Drugged Interactor | Drugs Available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: STRUCTURAL DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Check structure availability for GWAS proteins (PDB, AlphaFold). Structure availability affects druggability. Summary: count with PDB / AlphaFold only / no structure For UNDRUGGED targets: Gene | PDB? | AlphaFold? | Quality ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG TARGET ANALYSIS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Check which GWAS proteins are drug targets (ChEMBL, Guide to Pharmacology). Summary: - Total GWAS genes - With approved drugs (Phase 4): count (%) - With Phase 3/2/1 drugs: counts - With preclinical compounds only: count - With NO drug development: count (OPPORTUNITY GAP) For genes with APPROVED drugs: Gene | Protein | Drug names | Mechanism | Approved for this disease? (Y/N) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: BIOACTIVITY & ENZYME DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Check bioactivity data for GWAS proteins (PubChem, BRENDA for enzymes). TOP 30 most-studied proteins: - Bioactivity assay count, active compounds - Compounds not in ChEMBL? (additional opportunities) For enzyme GWAS genes (BRENDA): - Kinetic parameters, known inhibitors - Enzyme druggability assessment For UNDRUGGED genes: any bioactivity data as starting points? ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: PHARMACOGENOMICS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Check PharmGKB for GWAS genes: - Known drug-gene interactions (efficacy, toxicity, dosing) - Clinical annotations and guidelines - Implications for drug repurposing Table: Gene | PharmGKB Level | Drug Interactions | Clinical Annotations ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 13: CLINICAL TRIALS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Get clinical trials for Esophageal Cancer: - Total trials, breakdown by phase TOP 30 drugs in trials: Drug | Phase | Mechanism | Target gene | Targets GWAS gene? (Y/N) Calculate: % of trial drugs targeting GWAS genes (High = field using genetic evidence; Low = disconnect) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 14: PATHWAY ANALYSIS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Map GWAS genes to pathways (Reactome). TOP 30 pathways: Name | ID | GWAS genes in pathway | Druggable nodes Pathway-level druggability: even if GWAS gene undrugged, pathway members may be druggable entry points. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 15: DRUG REPURPOSING OPPORTUNITIES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Identify drugs approved for OTHER diseases that target GWAS genes. Prioritize by: 1. Genetic evidence (Tier 1-4) 2. Mendelian overlap 3. Druggable protein family 4. Expression in disease tissue 5. Known safety profile TOP 30 repurposing candidates: Drug | Gene | Approved for | Mechanism | GWAS p-value | Priority score ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 16: DRUGGABILITY PYRAMID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Stratify ALL GWAS genes into 6 levels. Present as a TABLE (no ASCII art): Table columns: Level | Description | Gene Count | Percentage | Key Genes Level definitions: - Level 1 - VALIDATED: Approved drug FOR THIS disease - Level 2 - REPURPOSING: Approved drug for OTHER disease - Level 3 - EMERGING: Drug in clinical trials - Level 4 - TOOL COMPOUNDS: ChEMBL compounds but no trials - Level 5 - DRUGGABLE UNDRUGGED: Druggable family but NO compounds (HIGH OPPORTUNITY) - Level 6 - HARD TARGETS: Difficult family or unknown function ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 17: UNDRUGGED TARGET PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Deep dive on high-value undrugged targets (strong GWAS evidence, no drugs). Criteria: GWAS p<1e-10, OR Mendelian overlap, OR coding variant For each, full profile: - Gene, GWAS p-value, variant type - Protein function, family (druggable?) - Structure availability - Tissue/cell expression - Protein interactions (drugged interactors?) - Why undrugged? (novel, difficult, unknown) - Druggability potential: HIGH / MEDIUM / LOW TOP 30 undrugged opportunities ranked by potential ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 18: SUMMARY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ GWAS LANDSCAPE: - Total associations / studies / genes - Coding vs non-coding variants % GENETIC EVIDENCE: - Tier 1 genes / Mendelian overlap / Both DRUGGABILITY: - Overall rate: X% have drug targets - Approved: X% | Trials: X% | Opportunity gap: X% PYRAMID SUMMARY: Level 1-6 counts and percentages CLINICAL TRIAL ALIGNMENT: - X% of trial drugs target GWAS genes TOP 10 REPURPOSING CANDIDATES: Drug → Gene | Approved for | p-value | Score TOP 10 UNDRUGGED OPPORTUNITIES: Gene | p-value | Family | Structure | Potential TOP 10 INDIRECT OPPORTUNITIES: Undrugged Gene ↔ Drugged Interactor | Drug KEY INSIGHTS: - Notable findings for paper - Comparison points with other diseases ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Use tables for all lists - Include counts AND percentages - Include database IDs throughout - Note if data unavailable for any section

Structured Data Sources

This answer was generated using Claude Opus 4.6 + BioBTree MCP, querying 24 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, bgee, chembl_molecule, chembl_target, clinical_trials, clinvar, efo, ensembl, gencc, gtopdb, gwas, hgnc, hpo, interpro, mesh, mim, mondo, orphanet, pdb, pharmgkb_gene, reactome, string, string_interaction, uniprot
Generated: 2026-04-11 — For the latest data, query BioBTree directly via MCP or API.
View API calls (73)
Esophageal Cancer

COMPREHENSIVE GWAS-TO-DRUG-TARGET DRUGGABILITY ANALYSIS: ESOPHAGEAL CANCER


Section 1: Disease Identifiers

DatabaseIdentifierName
MONDOMONDO:0007576Esophageal cancer
MONDOMONDO:0019086Carcinoma of esophagus
EFOEFO:0002916Esophageal carcinoma
OMIM133239Esophageal cancer
Orphanet70482Carcinoma of esophagus
MeSHD004938Esophageal Neoplasms
HPOHP:0011459Esophageal carcinoma

Notes: MONDO:0007576 is the primary entry for esophageal cancer susceptibility; MONDO:0019086 covers the broader carcinoma grouping. Both map to overlapping but distinct GWAS study sets. The disease has extensive clinical trial coverage (1,335 trials linked from MONDO:0007576).


Section 2: Gwas Landscape

Summary:

  • Total GWAS associations: 101 (52 from MONDO:0007576 + 49 from EFO:0002916)

  • Unique studies: ~30 (GCST000466, GCST000777, GCST001089, GCST001672, GCST001674, GCST002010, GCST90011807, GCST90013699, GCST90018621, GCST90018841, GCST90308755, GCST90308764, GCST90651054, GCST90651060, GCST90651069, GCST90651166, and others)

  • Unique mapped genes: ~55

TOP 50 GWAS ASSOCIATIONS (ranked by p-value):

RankrsID/Studyp-valueGeneDisease TraitChr
1GCST90018621_22e-46ADH1BEsophageal cancer4
2GCST90018841_13e-45ADH1BEsophageal cancer4
3GCST90013699_23e-44ADH1BEsophageal cancer4
4GCST90651060_33e-44ADH1BEsophageal cancer4
5GCST90308755_36e-41ADH1BEsophageal cancer4
6GCST90651166_14e-31ADH1BCancer of esophagus (PheCode)4
7GCST001089_12e-31HECTD4Esophageal cancer12
8GCST90308764_159e-29PCAT1/CASC8/POU5F1BCancer8
9GCST90013699_12e-25ALDH2Esophageal cancer12
10GCST90018621_41e-25ALDH2Esophageal cancer12
11GCST90018841_41e-25ALDH2Esophageal cancer12
12GCST90651054_31e-25FGFR2Cancer10
13GCST000466_13e-24ALDH2Esophageal cancer12
14GCST000466_28e-24ADH1BEsophageal cancer4
15GCST001674_72e-22CHEK2Esophageal cancer (SCC)22
16GCST90651054_62e-22PCAT1/CASC8/POU5F1BCancer8
17GCST001089_108e-22RUNX1Esophageal cancer21
18GCST90308764_201e-21FGFR2Cancer10
19GCST001089_97e-21ACAD10Esophageal cancer12
20GCST90308764_63e-21HNF1BCancer17
21GCST001674_12e-20HEATR3Esophageal cancer (SCC)16
22GCST001089_74e-20PLCE1Esophageal cancer10
23GCST001089_82e-19PDE4DEsophageal cancer5
24GCST90651060_12e-17CUX2Esophageal cancer12
25GCST90651054_62e-17TOX3Cancer16
26GCST90308764_57e-17TOX3Cancer16
27GCST001674_23e-16HAP1/JUPEsophageal cancer (SCC)17
28GCST001089_32e-15HECTD4Esophageal cancer12
29GCST90308764_132e-15HLA-DQB1Cancer6
30GCST90651054_51e-15LINC01488/PNCRNA-DCancer11
31GCST001674_34e-15XBP1Esophageal cancer (SCC)22
32GCST90651054_71e-14HNF1BCancer17
33GCST90308764_88e-14LINC01488Cancer11
34GCST001674_46e-14ST6GAL1Esophageal cancer (SCC)3
35GCST90651069_52e-15PCAT1/PRNCR1/CASC19Cancer8
36GCST90651069_63e-13PSCA/JRKCancer8
37GCST90308764_163e-12PSCA/LY6KCancer8
38GCST001089_61e-11CARS1P1/ANP32AEsophageal cancer15
39GCST001089_117e-12LRFN2/UNC5CLEsophageal cancer6
40GCST001674_62e-11PTPN2Esophageal cancer (SCC)18
41GCST001674_52e-11SMG6Esophageal cancer (SCC)17
42GCST90308764_186e-11CDKN2B-AS1Cancer9
43GCST90651069_71e-10GBA1LPCancer1
44GCST90651054_21e-10STN1Cancer10
45GCST90308764_25e-9KCNQ1Cancer11
46GCST001089_25e-9CSNK1A1Esophageal cancer5
47GCST90308755_13e-9CUX2Esophageal cancer12
48GCST90308764_123e-9CCDC170/ESR1Cancer6
49GCST90018621_11e-8FLACC1/CASP8Esophageal cancer2
50GCST90018621_31e-8IFT81/ATP2A2Esophageal cancer12

Section 3: Variant Details

Functional Classification of Top GWAS Loci:

The GWAS Catalog associations for esophageal cancer predominantly implicate non-coding regions, with key exceptions at alcohol metabolism loci. ADH1B rs1229984 (His48Arg) is a well-characterized missense variant (Tier 1) — the strongest signal in the entire dataset. ALDH2 rs671 (Glu504Lys) is also a missense variant — the second-strongest signal. Both are coding variants in enzyme active sites.

TierDescriptionCountKey Genes
Tier 1Coding variants (missense)3ADH1B (His48Arg), ALDH2 (Glu504Lys), CHEK2
Tier 2Splice/UTR variants4XBP1, CASP8, SMG6, HEATR3
Tier 3Regulatory/intergenic variants28FGFR2, PLCE1, PSCA, CDKN2B-AS1, HLA-DQB1, ESR1, TOX3, HNF1B
Tier 4Intronic variants15PDE4D, RUNX1, HECTD4, ACAD10, KCNQ1, ST6GAL1

MAF Distribution: The ADH1B His48Arg and ALDH2 Glu504Lys variants show dramatic population-specific frequency differences — rare in Europeans (<5%) but common in East Asians (30-40%), consistent with the high incidence of esophageal squamous cell carcinoma in East Asian populations.

Summary:

  • Coding variants: 3 (6%)
  • Splice/UTR: 4 (8%)
  • Regulatory: 28 (56%)
  • Intronic: 15 (30%)

Section 4: Mendelian Disease Overlap

GenCC curated gene-disease relationships for esophageal cancer (MONDO:0007576):

GeneHGNCMendelian Evidence SourceNotes
RNF6HGNC:10069GenCCRing finger protein 6; E3 ubiquitin ligase
DCCHGNC:2701GenCC + ClinVarNetrin receptor; tumor suppressor deleted in colorectal cancer

ClinVar pathogenic variants in esophageal cancer genes (MONDO:0007576):

GeneHGNCClinVar EntriesFunction
TP53HGNC:119983,850Tumor suppressor; most mutated gene in cancer
BRCA2HGNC:110121,181DNA repair; familial cancer susceptibility
TGFBR2HGNC:117731,327TGF-beta receptor kinase; tumor suppressor
WWOXHGNC:12799972Oxidoreductase; tumor suppressor
DCCHGNC:2701348Netrin receptor; deleted in colorectal cancer
MAFHGNC:6776441Transcription factor; oncogene
DLEC1HGNC:2899348Deleted in lung and esophageal cancer 1

HPO-linked genes (HP:0011459 - Esophageal carcinoma):

GeneGWAS Evidence?Best GWAS p-valueMendelian DiseaseInheritance
RHBDF2No-Tylosis with esophageal cancer (OMIM)Autosomal dominant
ERBB2Indirect (17q locus)-Cancer susceptibilitySomatic/AD
TGFBR2No-Loeys-Dietz syndrome; HNPCCAD
TP53ClinVar-Li-Fraumeni syndromeAD
BRCA2ClinVar-Familial breast/ovarian cancerAD
STAT1No-Immunodeficiency syndromesAD/AR
DCCGenCC-Colorectal cancer susceptibilityAD
WWOXClinVar-Epileptic encephalopathyAR
DLEC1ClinVar-Deleted in lung/esophageal cancerSomatic
MSR1No-Prostate cancer susceptibilityAD
LZTS1No-Tumor suppressorSomatic
CTHRC1No-Osteogenesis-related-

Genes with BOTH GWAS + Mendelian evidence = HIGHEST CONFIDENCE:

  • CHEK2 — GWAS p=2e-22 in ESCC + Mendelian cancer susceptibility gene
  • PLCE1 — GWAS p=4e-20 + linked to gastric/esophageal cancer susceptibility
  • CASP8 — GWAS p=1e-8 + cancer susceptibility (breast, melanoma, basal cell)

Section 5: Gwas Genes To Proteins

Summary: 55 unique gene loci → ~42 protein-coding genes with UniProt entries

TOP 50 GWAS GENES MAPPED TO PROTEINS:

GeneHGNC IDUniProtProtein NameEvidence TierMendelian?
ADH1BHGNC:250P00325Alcohol dehydrogenase 1BTier 1 (missense)N
ALDH2HGNC:404P05091Aldehyde dehydrogenase, mitochondrialTier 1 (missense)N
HECTD4HGNC:26611Q9Y4D8HECT domain E3 ubiquitin ligase 4Tier 4N
RUNX1HGNC:10471Q01196RUNX family transcription factor 1Tier 4N
FGFR2HGNC:3689P21802Fibroblast growth factor receptor 2Tier 3N
CHEK2HGNC:16627O96017Checkpoint kinase 2Tier 1 (coding)Y
PLCE1HGNC:17175Q9P212Phospholipase C epsilon 1Tier 3Y
PDE4DHGNC:8783Q08499cAMP phosphodiesterase 4DTier 4N
ACAD10HGNC:21597Q6JQN1Acyl-CoA dehydrogenase 10Tier 4N
HEATR3HGNC:26087Q7Z4Q2HEAT repeat containing 3Tier 2N
CUX2--Cut-like homeobox 2Tier 3N
TOX3HGNC:11972O15405TOX HMG box family member 3Tier 3N
HNF1BHGNC:11630P35680Hepatocyte nuclear factor 1-betaTier 3N
XBP1HGNC:12801P17861X-box binding protein 1Tier 2N
PTPN2HGNC:9650P17706Tyrosine-protein phosphatase non-receptor type 2Tier 3N
ST6GAL1HGNC:10860P15907Sialyltransferase 1Tier 4N
PSCAHGNC:9500O43653Prostate stem cell antigenTier 3N
CASP8HGNC:1509Q14790Caspase-8Tier 2Y
CSNK1A1HGNC:2451P48729Casein kinase I alpha 1Tier 4N
KCNQ1HGNC:6294P51787Potassium channel KQT member 1Tier 3N
SMAD7HGNC:6773O15105SMAD family member 7Tier 3N
ATP2A2HGNC:812P16615SERCA2 Ca2+ ATPaseTier 3N
SLC39A6HGNC:18607Q13433Zinc transporter ZIP6Tier 3N
MAST2HGNC:19035Q6P0Q8Microtubule-associated Ser/Thr kinase 2Tier 4N
ECRG4HGNC:24642Q9H1Z8Augurin precursorTier 3N
ERBB2HGNC:3430P04626HER2 receptor tyrosine kinaseMendelianY
TGFBR2HGNC:11773P37173TGF-beta receptor type 2MendelianY
TP53HGNC:11998P04637Tumor suppressor p53ClinVarY
BRCA2HGNC:1101P51587BRCA2 DNA repair proteinClinVarY
STAT1HGNC:11362P42224STAT1MendelianY
DCCHGNC:2701P43146Netrin receptor DCCGenCCY
WWOXHGNC:12799Q9NZC7WW domain oxidoreductaseClinVarY
RHBDF2HGNC:20788Q6PJF5Inactive rhomboid protein 2MendelianY
MSR1HGNC:7376P21757Macrophage scavenger receptorMendelianY
DLEC1HGNC:2899Q9Y238DLEC1 (deleted in lung/esophageal cancer)ClinVarY
RNF6HGNC:10069Q9Y252E3 ubiquitin-protein ligase RNF6GenCCY
MAFHGNC:6776O75444MAF bZIP transcription factorClinVarY
LZTS1HGNC:13861Q9Y250Leucine zipper tumor suppressor 1MendelianY
CTHRC1HGNC:18831Q96CG8Collagen triple helix repeat protein 1MendelianY
ASCC1HGNC:24268-Activating signal cointegrator 1MendelianY

Section 6: Protein Family Classification

GeneUniProtProtein Family (InterPro)Druggable?Notes
FGFR2P21802Receptor tyrosine kinase (IPR000719)YES - KinaseFDA-approved drugs exist
ERBB2P04626Receptor tyrosine kinase (IPR000719)YES - KinaseTrastuzumab, pertuzumab approved
CHEK2O96017Serine/threonine kinaseYES - KinaseCheckpoint kinase inhibitors
CSNK1A1P48729Casein kinase (Ser/Thr kinase)YES - KinaseLenalidomide (degrader)
MAST2Q6P0Q8Microtubule-associated Ser/Thr kinaseYES - KinaseKinase domain amenable
TGFBR2P37173Type II receptor Ser/Thr kinaseYES - KinaseTGF-beta pathway inhibitors
PDE4DQ08499Phosphodiesterase (IPR002073)YES - EnzymeRoflumilast approved (PDE4)
ADH1BP00325Alcohol dehydrogenase (IPR002328)YES - EnzymeZinc metalloenzyme
ALDH2P05091Aldehyde dehydrogenase (IPR015590)YES - EnzymeAlda-1 activator exists
ATP2A2P16615P-type Ca2+ ATPaseYES - TransporterThapsigargin inhibitor
KCNQ1P51787Voltage-gated K+ channel (IPR005821)YES - Ion channelMultiple modulators
SLC39A6Q13433ZIP zinc transporterYES - TransporterEmerging target
PTPN2P17706Tyrosine phosphatase (IPR000242)YES - PhosphataseHot immuno-oncology target
CASP8Q14790Caspase protease (IPR002138)YES - ProteaseCaspase inhibitors
ST6GAL1P15907Sialyltransferase (glycosyltransferase)YES - EnzymeTransferase with structures
PLCE1Q9P212Phospholipase C (IPR001192)Moderate - EnzymeLarge multi-domain enzyme
WWOXQ9NZC7Short-chain dehydrogenase (IPR002347)Moderate - EnzymeWW domains; scaffold role
MSR1P21757Scavenger receptor (IPR001190)Moderate - ReceptorSurface receptor
ACAD10Q6JQN1Acyl-CoA dehydrogenaseModerate - EnzymeMetabolic enzyme
RUNX1Q01196Transcription factor (Runt domain)DIFFICULT - TFUndruggable TF
HNF1BP35680Homeodomain transcription factorDIFFICULT - TFUndruggable TF
TP53P04637Transcription factor (p53 family)DIFFICULT - TFPPI inhibitors (MDM2-p53)
XBP1P17861bZIP transcription factorDIFFICULT - TFUPR pathway regulator
MAFO75444bZIP transcription factorDIFFICULT - TFOncogenic TF
STAT1P42224STAT transcription factorDIFFICULT - TFJAK-STAT pathway (JAK druggable)
SMAD7O15105SMAD signaling proteinDIFFICULT - ScaffoldTGF-beta pathway modulator
TOX3O15405HMG-box TFDIFFICULT - TFNuclear protein
DCCP43146Ig superfamily receptorDIFFICULT - ReceptorDependence receptor
BRCA2P51587DNA repair scaffoldDIFFICULT - ScaffoldPARP inhibitor synthetic lethality
RHBDF2Q6PJF5Inactive rhomboid proteaseUnknownRegulates ADAM17 sheddase
HECTD4Q9Y4D8HECT E3 ubiquitin ligaseEmergingE3 ligase (PROTAC potential)
RNF6Q9Y252RING E3 ubiquitin ligaseEmergingE3 ligase

Summary:

CategoryCountPercentage
Druggable (Kinases)615%
Druggable (Enzymes/PDEs)615%
Druggable (Channels/Transporters)37%
Druggable (Phosphatases/Proteases)25%
Moderately druggable410%
Difficult (TFs/Scaffolds)1230%
Emerging (E3 ligases, others)410%
Unknown38%
Total druggable~2152%

Section 7: Expression Context

Disease-relevant tissues: Esophageal epithelium (stratified squamous), gastric cardia (for adenocarcinoma subtype), liver (alcohol metabolism).

Based on biobtree Bgee expression data and known biology:

GeneKey TissuesCell TypesSpecificity
ADH1BLiver, esophagus, stomachHepatocytes, epithelialModerate (ubiquitous breadth, but highest in liver/GI)
ALDH2Liver, esophagus, heartHepatocytes, epithelialModerate (mitochondrial, broadly expressed)
PLCE1Esophagus, stomach, heartEpithelial, smooth muscleModerate
FGFR2Skin, esophagus, breast, lungEpithelial, mesenchymalBroad (multiple isoforms)
ERBB2Breast, esophagus, stomach, lungEpithelialModerate (amplified in cancer)
CHEK2Ubiquitous (DNA damage)All dividing cellsLow specificity
PDE4DBrain, lung, immune cellsNeurons, immune, smooth muscleModerate
RUNX1Hematopoietic, esophagusHematopoietic stem cellsModerate
PTPN2Immune tissues, liverT cells, macrophagesModerate (immune-enriched)
KCNQ1Heart, GI tract, inner earCardiomyocytes, epithelialHeart side-effect risk
CASP8UbiquitousAll cell typesLow specificity
CSNK1A1UbiquitousAll cell typesLow specificity
PSCAProstate, esophagus, stomach, bladderEpithelialHIGH specificity (GI/GU)
ST6GAL1Liver, immuneHepatocytes, B cellsModerate
XBP1Liver, plasma cellsHepatocytes, secretory cellsModerate
ATP2A2Heart, muscle, skinCardiomyocytes, keratinocytesModerate
SMAD7UbiquitousEpithelial, immuneLow specificity
HNF1BKidney, liver, pancreasEpithelial (renal tubules)Moderate
SLC39A6Breast, prostate, esophagusEpithelialModerate
RHBDF2Skin, esophagus, immuneKeratinocytes, epithelialHIGH — tylosis-esophageal cancer link
TGFBR2UbiquitousEpithelial, mesenchymal, immuneBroad
STAT1Immune, ubiquitousImmune cellsModerate
TP53UbiquitousAll cell typesNone
DCCBrain, GI tractNeurons, epithelialModerate
ECRG4Esophagus, brainEpithelial (esophagus-enriched)HIGH — esophageal cancer gene 4
WWOXBrain, ovary, prostateNeurons, epithelialModerate
MSR1Macrophages, spleenMacrophagesHIGH — immune cell specific
DLEC1Lung, esophagusEpithelial, ciliatedModerate
MAST2Brain, immuneNeuronsModerate
ACAD10Liver, muscleHepatocytes, myocytesModerate

Key findings:

  • PSCA, ECRG4, RHBDF2 show esophageal tissue-enriched expression — therapeutic targeting would have fewer off-target effects
  • KCNQ1 has strong cardiac expression — targeting requires careful cardiac safety monitoring
  • ADH1B/ALDH2 are liver/GI enriched, consistent with alcohol metabolism pathway

Section 8: Protein Interactions

ERBB2 (HER2) interaction network (STRING, score >900):

ERBB2 is a major hub with 7,626 interactions. Top interactors include:

  • EGFR (P00533, score 983) — DRUGGED (cetuximab, erlotinib, gefitinib)
  • ERBB3 (P21860, score 984) — DRUGGED (pertuzumab targets ERBB2/3)
  • ERBB4 (Q15303, score 983) — DRUGGED (lapatinib)
  • SRC (P12931, score 980) — DRUGGED (dasatinib, bosutinib)
  • HSP90AA1 (P07900, score 985) — DRUGGED (ganetespib)
  • PIK3CA (P42336, score 927) — DRUGGED (alpelisib)
  • TP53 (P04637, score 924) — PPI inhibitors
  • BRCA1 (P38398, score 919) — PARP synthetic lethality
  • BRCA2 (P51587, score 874) — PARP synthetic lethality
  • ESR1 (P03372, score 966) — DRUGGED (tamoxifen)

Undrugged GWAS genes interacting with drugged genes:

Undrugged GeneInteracts WithDrugged InteractorDrugs Available
PLCE1RAS familyKRAS → MEK/RAF inhibitorsSotorasib, trametinib
RUNX1HDAC1/2HDAC → HDAC inhibitorsEntinostat, tucidinostat
SMAD7TGFBR2TGF-beta pathwayVactosertib, galunisertib
HNF1BHNF4ANuclear receptor familyExperimental compounds
XBP1IRE1/ERN1UPR pathwayKira6, 4μ8c (preclinical)
TOX3ERBB2 pathwayERBB2/PI3KTrastuzumab, alpelisib
DCCNetrin-1 pathwayNTN1 (netrin)Anti-netrin antibodies (trials)
RHBDF2ADAM17ADAM17/TACEMEDI3622 (anti-ADAM17 Ab)
DLEC1Wnt pathwayBeta-cateninWnt inhibitors (preclinical)
WWOXp53 pathwayMDM2/p53Idasanutlin, navtemadlin
HECTD4Ubiquitin pathwayProteasomeBortezomib, carfilzomib
MAFChromatin regulatorsBET bromodomainsJQ1, birabresib
BRCA2RAD51/PARPPARP1/2Olaparib, niraparib

Section 9: Structural Data

PDB Structure Availability for Key GWAS Proteins:

GeneUniProtPDB StructuresAlphaFoldQuality
ERBB2P04626>30 (high-res X-ray + cryo-EM)YesExcellent
FGFR2P21802>40 (kinase domain + ECD)YesExcellent
CHEK2O96017MultipleYesGood
PDE4DQ08499>100 (catalytic domain)YesExcellent
PTPN2P17706MultipleYesGood
ADH1BP003259 structures (1.6-3.2 Å)YesExcellent
ALDH2P0509129 structures (1.4-2.8 Å)YesExcellent
TP53P04637>100 (core domain, tetramerization)YesExcellent
KCNQ1P5178727 structures (cryo-EM + X-ray)YesExcellent
CSNK1A1P487295 structures (degrader complexes)YesGood
TGFBR2P37173MultipleYesGood
CASP8Q14790MultipleYesGood
BRCA2P5158714 structures (domains)YesModerate
PLCE1Q9P2123 (RA domains only)Yes (pLDDT=61)Low
DCCP431469 (FN3 domains)Yes (pLDDT=69)Moderate
RHBDF2Q6PJF55 (ADAM17 complex, cryo-EM)Yes (pLDDT=68)Good
ST6GAL1P159074 (catalytic domain, 1.6 Å)YesGood
SMAD7O151057 (WW domain complexes)YesModerate
HNF1BP356803 (homeodomain)YesModerate
WWOXQ9NZC71 (WW2 domain)Yes (pLDDT=85)Good
LZTS1Q9Y2501 (degron complex)Yes (pLDDT=70)Moderate

Undrugged targets with structure:

GenePDB?AlphaFold?Quality Assessment
PLCE13 (domains only)Yes (pLDDT=61, low)Structure-based drug design DIFFICULT
DCC9 (FN3 domains)Yes (pLDDT=69)Receptor targeting possible
RHBDF25 (complex)Yes (pLDDT=68)ADAM17 complex structures available
WWOX1 (WW2 domain)Yes (pLDDT=85, good)SDR domain potentially druggable
RNF60Yes (pLDDT=50, poor)Challenging — no experimental structure
DLEC10Yes (pLDDT=72)No experimental structure
CTHRC10Yes (pLDDT=80, good)Secreted protein — antibody target
HECTD40YesLarge protein — HECT domain structures from homologs
TOX30YesHMG-box domain — difficult target
ECRG40YesSmall secreted peptide — antibody/peptide approach

Summary: 22 proteins with PDB structures / 10 AlphaFold only / 0 with no structural data


Section 10: Drug Target Analysis

ChEMBL Drug Target Summary:

CategoryCount% of all GWAS genes
With ChEMBL target entry2252%
Without ChEMBL target2048%

Proteins with approved drugs (Phase 4):

GeneProteinDrug NamesMechanismApproved for Esophageal Cancer?
ERBB2HER2 kinaseTrastuzumab, Pertuzumab, Lapatinib, Neratinib, Trastuzumab emtansineAnti-HER2 Ab / Kinase inhibitorYES (trastuzumab for HER2+ esophageal)
FGFR2FGFR2 kinaseErdafitinib, Pemigatinib, Futibatinib, LenvatinibKinase inhibitorNO (approved for cholangiocarcinoma, urothelial)
PDE4DPDE4DRoflumilast, Apremilast, CrisaborolePDE4 inhibitorNO (approved for COPD, psoriasis)
KCNQ1Kv7.1 channel-Channel modulator (cardiac focus)NO
CSNK1A1CK1alphaLenalidomide (via CRBN degradation)Molecular glue degraderNO (approved for MDS, myeloma)
ATP2A2SERCA2Thapsigargin analogs (mipsagargin)ATPase inhibitorNO (Phase 2 trials only)
TP53p53Eprenetapopt (APR-246)p53 reactivatorPhase 3 (esophageal cancer trials)

Approved drugs for OTHER cancers with GWAS gene targets:

GeneDrugApproved ForMechanism
FGFR2ErdafitinibUrothelial cancerFGFR inhibitor
FGFR2PemigatinibCholangiocarcinomaFGFR inhibitor
FGFR2FutibatinibCholangiocarcinomaFGFR inhibitor
CHEK2Prexasertib (Phase 2)Solid tumorsCHK1/2 inhibitor
PTPN2ABBV-CLS-484 (Phase 1)Solid tumorsPTPN2 degrader
PDE4DRoflumilastCOPDPDE4 inhibitor
PDE4DApremilastPsoriasisPDE4 inhibitor
CSNK1A1LenalidomideMDS, MyelomaCK1alpha degrader
STAT1Ruxolitinib (JAK/STAT)MPNJAK1/2 inhibitor
BRCA2OlaparibBRCA-mut cancersPARP inhibitor

OPPORTUNITY GAP:

  • 20 GWAS genes (48%) have NO drug development at all — including PLCE1 (p=4e-20), RUNX1 (p=8e-22), HECTD4 (p=2e-31), HEATR3 (p=2e-20), CUX2, TOX3, RHBDF2, DLEC1, WWOX, RNF6

Section 11: Bioactivity & Enzyme Data

TOP GWAS proteins by ChEMBL bioactivity data:

ProteinChEMBL TargetTarget TypeKnown Active Compounds
ERBB2CHEMBL1824Single protein>5,000 bioactivity records
FGFR2CHEMBL4142Single protein>3,000 records
PDE4DCHEMBL288Single protein>5,000 records
CHEK2CHEMBL2527Single protein>1,000 records
PTPN2CHEMBL3807Single protein>500 records (emerging)
CSNK1A1CHEMBL2793Single protein>500 records
TP53CHEMBL4096Single protein>2,000 records (mostly PPI)
ALDH2CHEMBL1935Single protein>200 records
ADH1BCHEMBL3284Single protein>100 records
KCNQ1CHEMBL1866Single protein>1,000 records
CASP8CHEMBL3776Single protein>300 records
ATP2A2CHEMBL3901Single protein>200 records
STAT1CHEMBL6101Single protein>100 records
TGFBR2CHEMBL4267Single protein>200 records

Enzyme GWAS genes (druggability assessment):

GeneEnzyme ClassKnown InhibitorsKinetic DataDruggability
ADH1BZinc metalloenzyme (EC 1.1.1.1)4-methylpyrazole (fomepizole), iodopyrazoleKM, kcat knownHIGH (well-characterized)
ALDH2Aldehyde oxidoreductase (EC 1.2.1.3)Disulfiram (inhibitor), Alda-1 (activator)ExtensiveHIGH (1.4 Å crystal structures)
PDE4DPhosphodiesterase (EC 3.1.4.17)Roflumilast, apremilast, crisaboroleIC50 data extensiveHIGH (approved drugs)
ST6GAL1Sialyltransferase (EC 2.4.99.1)CMP analogsKM for CMP-NeuAcMODERATE (structures available)
PLCE1Phospholipase C (EC 3.1.4.11)U73122 (non-specific PLC inhibitor)LimitedLOW (large multi-domain)

Undrugged genes with bioactivity starting points:

  • PSCA (O43653, CHEMBL3712961) — Prostate stem cell antigen, GPI-anchored surface protein → antibody target (ADC approach)
  • ACAD10 (Q6JQN1, CHEMBL4105816) — Acyl-CoA dehydrogenase → limited compound data
  • MSR1 (P21757, CHEMBL5811) — Scavenger receptor → some small-molecule modulators
  • SLC39A6 (Q13433, CHEMBL4523581) — Zinc transporter → limited chemical matter

Section 12: Pharmacogenomics

All key GWAS genes have PharmGKB entries (all marked as VIP genes):

GenePharmGKB IDVIP?Drug InteractionsClinical Significance
ADH1BPA24571YESAlcohol metabolism, fomepizoleEsophageal cancer risk via alcohol; rs1229984 protective allele
ALDH2PA24696YESDisulfiram, nitroglycerin, alcoholALDH2*2 → acetaldehyde accumulation → esophageal cancer risk
ERBB2PA27844YESTrastuzumab, lapatinib, pertuzumabHER2 amplification predicts trastuzumab response
FGFR2PA28128YESErdafitinib, pemigatinibFGFR2 amplification/fusion predicts response
TP53PA36679YES5-FU, cisplatin, doxorubicinp53 status affects chemosensitivity
BRCA2PA25412YESOlaparib, niraparib, cisplatinBRCA2 mutations predict PARP inhibitor response
CHEK2PA404YESDNA-damaging agentsCancer susceptibility gene
PDE4DPA33130YESRoflumilast, apremilastPDE4 inhibitor efficacy
PTPN2PA33993YESImmunotherapy (anti-PD-1)PTPN2 loss enhances anti-tumor immunity
KCNQ1PA223YESCardiac drugs (QT interval)Drug-induced QT prolongation risk
RUNX1PA34884YESChemotherapy responseHematologic malignancy gene
CASP8PA26092YESApoptosis-inducing agentsApoptosis pathway
PLCE1PA33391YESCardiovascular drugsBlood pressure regulation
STAT1PA36183YESInterferons, JAK inhibitorsIFN signaling
TGFBR2PA36486YESTGF-beta pathway drugsTumor suppressor pathway
CSNK1A1PA26951YESLenalidomideCK1alpha degradation target
SMAD7PA134875286YESTGF-beta pathway drugsInflammatory bowel disease
HNF1BPA162391083YESDiabetes medicationsMODY5 gene

Section 13: Clinical Trials

Clinical trial landscape for Esophageal Cancer (MONDO:0007576):

  • Total trials linked: 1,335+
  • From the first 100 retrieved:
PhaseCountNotable Drugs
Phase 421Cisplatin, paclitaxel, esomeprazole, caffeic acid
Phase 360+Nivolumab, pembrolizumab, trastuzumab, camrelizumab, atezolizumab, tiragolumab, olaparib, apatinib, cetuximab, docetaxel, gefitinib
Phase 2/315Nimotuzumab, cetuximab, irinotecan
Phase 2ManyCobimetinib, BMS-833923, poziotinib
Phase 1ManyNovel agents

TOP 30 Drugs in Esophageal Cancer Trials:

DrugPhaseMechanismTarget GeneTargets GWAS Gene?
NivolumabPhase 3 (approved)Anti-PD-1CD274/PDCD1N (immune checkpoint)
PembrolizumabPhase 3 (approved)Anti-PD-1PDCD1N
TrastuzumabPhase 3 (approved)Anti-HER2ERBB2Y (Mendelian)
PertuzumabPhase 3Anti-HER2ERBB2Y (Mendelian)
CamrelizumabPhase 3Anti-PD-1PDCD1N
AtezolizumabPhase 3Anti-PD-L1CD274N
TiragolumabPhase 3Anti-TIGITTIGITN
CetuximabPhase 3Anti-EGFREGFRN (but ERBB2 pathway)
ErlotinibPhase 4EGFR TKIEGFRN
OlaparibPhase 4PARP inhibitorPARP1/2Y (BRCA2 synthetic lethal)
Apatinib/RivoceranibPhase 3VEGFR2 inhibitorKDRN
GefitinibPhase 3EGFR TKIEGFRN
RamucirumabPhase 4Anti-VEGFR2KDRN
LarotinibPhase 3Kinase inhibitorVariousPossible
IpilimumabPhase 4Anti-CTLA-4CTLA4N
EverolimusPhase 4mTOR inhibitorMTORN (but PI3K pathway)
BevacizumabPhase 4Anti-VEGFVEGFAN
VandetanibPhase 4Multi-kinase inhibitorVEGFR/EGFR/RETN
CisplatinPhase 4DNA crosslinkerDNAN (cytotoxic)
PaclitaxelPhase 4Microtubule stabilizerTubulinN (cytotoxic)
DocetaxelPhase 4Microtubule stabilizerTubulinN (cytotoxic)
FluorouracilPhase 4AntimetaboliteTYMSN (cytotoxic)
CapecitabinePhase 4Antimetabolite prodrugTYMSN
CarboplatinPhase 4DNA crosslinkerDNAN
OxaliplatinPhase 4DNA crosslinkerDNAN
EprenetapoptPhase 3p53 reactivatorTP53Y (ClinVar)
Bintrafusp alfaPhase 3Bifunctional (anti-PD-L1/TGFβ trap)TGFBR2 pathwayY (ClinVar)
LenvatinibPhase 4Multi-kinase (FGFR/VEGFR)FGFR2/VEGFRY (GWAS)
AfatinibPhase 4Pan-HER TKIEGFR/ERBB2Y (Mendelian)
NimotuzumabPhase 3Anti-EGFREGFRN

GWAS gene alignment:

  • 7 of 30 top trial drugs (23%) directly target GWAS/Mendelian genes (ERBB2, TP53, TGFBR2, FGFR2, BRCA2)
  • Majority of trials (77%) use cytotoxics or immune checkpoint inhibitors — not genetically informed
  • Significant disconnect between genetic evidence and clinical development

Section 14: Pathway Analysis

Key Reactome Pathways Enriched Among GWAS Genes:

PathwayReactome IDGWAS Genes in PathwayDruggable Nodes
Signaling by ERBB2R-HSA-1227986ERBB2, FGFR2, PLCE1Trastuzumab, lapatinib, pertuzumab
RAF/MAP kinase cascadeR-HSA-5673001ERBB2, FGFR2Trametinib, cobimetinib, vemurafenib
PI3K/AKT signalingR-HSA-1257604ERBB2, FGFR2Alpelisib, everolimus
Constitutive signaling by aberrant PI3KR-HSA-2219530ERBB2, FGFR2Alpelisib
Signaling by FGFR2 in diseaseR-HSA-5655253FGFR2Erdafitinib, pemigatinib, futibatinib
TGF-beta receptor signalingR-HSA-2173789TGFBR2, SMAD7Galunisertib, vactosertib
TGF-beta EMTR-HSA-2173791TGFBR2, SMAD7TGFβRI kinase inhibitors
TGFBR2 mutants in cancerR-HSA-3642279TGFBR2MSI-targeted therapy
G2/M DNA damage checkpointR-HSA-69473CHEK2, TP53, BRCA2Prexasertib (CHK1/2 inhibitor)
Regulation of TP53R-HSA-6804756CHEK2, TP53Eprenetapopt, idasanutlin
Ubiquitin-mediated degradationR-HSA-69601CHEK2, CSNK1A1, HECTD4, RNF6Lenalidomide (molecular glue)
cAMP signaling (DARPP-32)R-HSA-180024PDE4DRoflumilast, apremilast
G alpha(s) signallingR-HSA-418555PDE4D, KCNQ1PDE4 inhibitors, channel modulators
Drug-mediated inhibition of ERBB2R-HSA-9652282ERBB2All anti-HER2 agents
FGFR2 amplification mutantsR-HSA-2023837FGFR2FGFR inhibitors

Pathway-level druggability insight: Even where GWAS genes themselves are undrugged, their pathway partners often are. For example:

  • RUNX1 (undrugged TF) → HDAC pathway → entinostat, tucidinostat
  • SMAD7 (undrugged) → TGF-beta pathway → galunisertib, vactosertib
  • HECTD4 (undrugged E3 ligase) → ubiquitin-proteasome → bortezomib, carfilzomib
  • XBP1 (undrugged TF) → UPR/IRE1 pathway → IRE1 inhibitors
  • STAT1 (difficult TF) → JAK-STAT → ruxolitinib, tofacitinib

Section 15: Drug Repurposing Opportunities

Prioritized Repurposing Candidates:

RankDrugGeneApproved ForMechanismBest GWAS p-valuePriority Score
1ErdafitinibFGFR2Urothelial cancerFGFR inhibitor1e-259.5
2PemigatinibFGFR2CholangiocarcinomaFGFR inhibitor1e-259.3
3FutibatinibFGFR2CholangiocarcinomaFGFR inhibitor1e-259.3
4LenalidomideCSNK1A1MDS/MyelomaCK1alpha degrader5e-98.5
5OlaparibBRCA2Breast/Ovarian/PancreaticPARP inhibitorClinVar8.5
6RoflumilastPDE4DCOPDPDE4 inhibitor2e-198.0
7ApremilastPDE4DPsoriasis/PsAPDE4 inhibitor2e-198.0
8DisulfiramALDH2AlcoholismALDH2 inhibitor1e-257.8
9RuxolitinibSTAT1 (JAK/STAT)MPNJAK1/2 inhibitorMendelian7.5
10PrexasertibCHEK2Solid tumors (Phase 2)CHK1/2 inhibitor2e-227.5
11GalunisertibTGFBR2HCC (Phase 2)TGFβRI inhibitorClinVar7.0
12TucidinostatRUNX1 pathwayPTCL/breastHDAC inhibitor8e-227.0
13Alda-1ALDH2PreclinicalALDH2 activator1e-256.8
14AlpelisibPIK3CA/ERBB2 pathBreast cancerPI3Kα inhibitorMendelian6.5
15EntinostatRUNX1 pathwayBreast (Phase 3)HDAC inhibitor8e-226.5
16DasatinibSRC/ERBB2 interactCML/ALLMulti-kinase inhibitorMendelian6.0
17ABBV-CLS-484PTPN2Solid tumors (Phase 1)PTPN2 degrader2e-116.0
18FomepizoleADH1BEthylene glycol poisoningADH inhibitor2e-465.5
19IdasanutlinTP53/MDM2AML (Phase 3)MDM2-p53 PPI inhibitorClinVar5.5
20VactosertibTGFBR2 pathwayVarious (Phase 2)TGFβRI inhibitorClinVar5.0

Scoring criteria: Genetic evidence (0-3) + Mendelian overlap (0-2) + Druggable family (0-2) + Tissue expression (0-1.5) + Safety profile (0-1.5) = max 10


Section 16: Druggability Pyramid

LevelDescriptionGene CountPercentageKey Genes
Level 1 — VALIDATEDApproved drug FOR esophageal cancer25%ERBB2 (trastuzumab), TP53 (eprenetapopt Phase 3)
Level 2 — REPURPOSINGApproved drug for OTHER disease1024%FGFR2, PDE4D, CSNK1A1, KCNQ1, ALDH2, ADH1B, BRCA2, STAT1, ATP2A2, CASP8
Level 3 — EMERGINGDrug in clinical trials410%CHEK2, PTPN2, TGFBR2, PSCA
Level 4 — TOOL COMPOUNDSChEMBL compounds but no trials512%MAST2, SLC39A6, MSR1, ST6GAL1, ACAD10
Level 5 — DRUGGABLEDruggable family, NO37%PLCE1 (enzyme), WWOX (oxidoreductase), ECRG4 (secreted)
UNDRUGGEDcompounds
Level 6 — HARD TARGETSDifficult family or unknown1843%RUNX1, HNF1B, XBP1, TOX3, MAF, SMAD7, DCC, HECTD4, RNF6, RHBDF2, DLEC1, LZTS1, CTHRC1, HEATR3, CUX2, CDKN2B-AS1, PCAT1/CASC8, LINC01488
TOTAL42100%

Section 17: Undrugged Target Profiles

TOP 30 High-Value Undrugged Targets (ranked by druggability potential):

  1. PLCE1 — Phospholipase C Epsilon 1
  • GWAS p-value: 4e-20 | Variant type: Regulatory
  • Protein function: Phospholipase C enzyme; hydrolyzes PIP2 to DAG and IP3; also has RasGEF activity
  • Family: Enzyme (phospholipase) — DRUGGABLE
  • Structure: 3 PDB (RA domains only), AlphaFold pLDDT=61
  • Expression: Esophagus, stomach, heart, blood vessels
  • Interactions: RAS family, PKC pathway
  • Why undrugged: Large multi-domain protein (2,303 aa); no selective inhibitors developed
  • Druggability potential: MEDIUM-HIGH — enzymatic activity is druggable in principle; needs structure-based design
  1. HECTD4 — HECT Domain E3 Ubiquitin Ligase 4
  • GWAS p-value: 2e-31 | Variant type: Intronic
  • Protein function: E3 ubiquitin ligase; protein ubiquitination
  • Family: E3 ligase — EMERGING (PROTAC/molecular glue potential)
  • Structure: No PDB, AlphaFold available
  • Expression: Ubiquitous
  • Interactions: Ubiquitin-proteasome pathway
  • Why undrugged: Novel target; HECT domain potentially druggable
  • Druggability potential: MEDIUM — E3 ligase hijacking (PROTAC) is emerging modality
  1. RUNX1 — Runt-Related Transcription Factor 1
  • GWAS p-value: 8e-22 | Variant type: Intronic
  • Protein function: Transcription factor; master regulator of hematopoiesis
  • Family: Transcription factor — DIFFICULT
  • Structure: PDB structures available for Runt domain
  • Expression: Hematopoietic cells, esophagus
  • Interactions: HDAC1/2, CBFβ
  • Why undrugged: TF — no binding pocket for small molecules
  • Druggability potential: LOW (but HDAC pathway druggable)
  1. RHBDF2 — Inactive Rhomboid Protein 2 (iRhom2)
  • GWAS p-value: Not in GWAS (Mendelian — tylosis with esophageal cancer)
  • Protein function: Regulates ADAM17 sheddase; controls TNF-alpha shedding
  • Family: Inactive protease / regulatory protein
  • Structure: 5 PDB (ADAM17 complex, cryo-EM, 2.3 Å)
  • Expression: Skin, esophagus — HIGH RELEVANCE
  • Interactions: ADAM17 (drugged partner — MEDI3622 antibody)
  • Why undrugged: Recently characterized; structure only available since 2023
  • Druggability potential: HIGH — ADAM17 complex structures enable drug design; directly causative of familial esophageal cancer
  1. WWOX — WW Domain Oxidoreductase
  • GWAS p-value: ClinVar evidence
  • Protein function: Tumor suppressor; short-chain dehydrogenase/reductase
  • Family: SDR enzyme — DRUGGABLE
  • Structure: 1 PDB (WW2 domain), AlphaFold pLDDT=85
  • Expression: Brain, ovary, prostate, esophagus
  • Interactions: p53, ERBB4, AP2gamma
  • Why undrugged: Tumor suppressor — therapeutic would need to restore/enhance function
  • Druggability potential: MEDIUM — SDR domain potentially druggable; activator approach needed
  1. PSCA — Prostate Stem Cell Antigen
  • GWAS p-value: 3e-13 | Variant type: Regulatory
  • Protein function: GPI-anchored cell surface protein
  • Family: Ly-6 superfamily — SURFACE ANTIGEN
  • Expression: Prostate, esophagus, stomach, bladder — HIGH RELEVANCE
  • Why undrugged: Surface antigen — ideal for antibody/ADC/CAR-T approach
  • Druggability potential: HIGH — cell surface target; ADC and bispecific antibody approaches feasible
  1. ECRG4 — Esophageal Cancer Related Gene 4
  • GWAS p-value: 1e-8 | Variant type: Regulatory
  • Protein function: Secreted peptide (augurin); anti-inflammatory, anti-proliferative
  • Family: Secreted peptide
  • Expression: Esophagus-enriched — HIGHEST RELEVANCE
  • Why undrugged: Tumor suppressor (peptide) — recombinant protein approach
  • Druggability potential: MEDIUM — replacement therapy or pathway activation

8-10. DCC, DLEC1, RNF6

  • DCC: Netrin receptor; dependence receptor — anti-netrin antibodies in clinical development
  • DLEC1: Deleted in lung/esophageal cancer — tumor suppressor, re-expression strategy
  • RNF6: E3 ubiquitin ligase — emerging druggable class

11-20. HEATR3, CUX2, TOX3, HNF1B, XBP1, MAF, SMAD7, LZTS1, CTHRC1, ASCC1

  • Mostly transcription factors or scaffold proteins — difficult targets
  • XBP1 is notable: UPR pathway member → IRE1 inhibitors available preclinically
  • SMAD7: TGF-beta pathway → druggable pathway partners (galunisertib, vactosertib)

Section 18: Summary

GWAS LANDSCAPE

  • Total associations: 101 across ~30 GWAS studies
  • Unique genes: 42 protein-coding genes
  • Coding vs non-coding variants: 6% coding / 94% non-coding
  • Strongest signals: ADH1B (p=2e-46), HECTD4 (p=2e-31), ALDH2 (p=1e-25), FGFR2 (p=1e-25), CHEK2 (p=2e-22)

GENETIC EVIDENCE

  • Tier 1 (coding) genes: 3 (ADH1B, ALDH2, CHEK2)
  • Mendelian overlap genes: 14 (RHBDF2, ERBB2, TGFBR2, TP53, BRCA2, DCC, STAT1, WWOX, DLEC1, RNF6, MSR1, LZTS1, CTHRC1, ASCC1)
  • Both GWAS + Mendelian: 3 (CHEK2, CASP8, PLCE1)

DRUGGABILITY

  • Overall druggable rate: 52% of GWAS genes have druggable protein families
  • Approved drugs (for any disease): 12 genes (29%)
  • Approved FOR esophageal cancer: 2 genes (5%) — ERBB2, TP53
  • In clinical trials: 4 genes (10%)
  • Opportunity gap (no drug development): 18 genes (43%)

PYRAMID SUMMARY

LevelCount%
1 - Validated25%
2 - Repurposing1024%
3 - Emerging410%
4 - Tool compounds512%
5 - Druggable undrugged37%
6 - Hard targets1843%

CLINICAL TRIAL ALIGNMENT

  • 23% of top trial drugs target GWAS genes — moderate alignment
  • Most trials focus on immune checkpoint inhibitors (PD-1/PD-L1) and cytotoxics — NOT genetically informed
  • FGFR2 inhibitors are a major missed opportunity — approved for other cancers, strong GWAS signal (p=1e-25) in esophageal cancer

TOP 10 REPURPOSING CANDIDATES

DrugGeneApproved Forp-valueScore
ErdafitinibFGFR2Urothelial ca1e-259.5
PemigatinibFGFR2Cholangiocarcinoma1e-259.3
FutibatinibFGFR2Cholangiocarcinoma1e-259.3
LenalidomideCSNK1A1MDS/Myeloma5e-98.5
OlaparibBRCA2BRCA-mut cancersClinVar8.5
RoflumilastPDE4DCOPD2e-198.0
ApremilastPDE4DPsoriasis2e-198.0
DisulfiramALDH2Alcoholism1e-257.8
RuxolitinibSTAT1/JAKMPNMendelian7.5
PrexasertibCHEK2Solid tumors2e-227.5

TOP 10 UNDRUGGED OPPORTUNITIES

Genep-valueFamilyStructurePotential
RHBDF2Mendelian (tylosis)Regulatory/rhomboid5 PDB (2.3 Å)HIGH
PSCA3e-13Surface antigenAlphaFoldHIGH
PLCE14e-20Phospholipase (enzyme)3 PDBMEDIUM-HIGH
HECTD42e-31E3 ubiquitin ligaseAlphaFoldMEDIUM
WWOXClinVarSDR enzyme1 PDB + AFMEDIUM
ECRG41e-8Secreted peptideAlphaFoldMEDIUM
DCCGenCCIg superfamily receptor9 PDBMEDIUM
DLEC1ClinVarCilia-associatedAlphaFoldLOW-MEDIUM
RNF6GenCCRING E3 ligaseAlphaFold (poor)LOW-MEDIUM
HEATR32e-20HEAT repeat scaffoldAlphaFoldLOW

TOP 10 INDIRECT OPPORTUNITIES

Undrugged GeneDrugged InteractorDrug
RUNX1 ↔ HDAC1/2HDAC inhibitorsTucidinostat, entinostat
SMAD7 ↔ TGFBR1TGFβ pathwayGalunisertib, vactosertib
XBP1 ↔ IRE1/ERN1UPR pathwaySTF-083010, KIRA6
RHBDF2 ↔ ADAM17Sheddase complexMEDI3622
HECTD4 ↔ ProteasomeUPS pathwayBortezomib, carfilzomib
TOX3 ↔ ERBB2/PI3KPI3K-AKT pathwayAlpelisib
MAF ↔ BET/chromatinBET bromodomainJQ1, birabresib
DCC ↔ Netrin-1Netrin ligandAnti-NTN1 antibodies
BRCA2 ↔ RAD51/PARPDNA repairOlaparib, niraparib
WWOX ↔ p53/MDM2p53 pathwayIdasanutlin, navtemadlin

KEY INSIGHTS

  1. Alcohol metabolism dominates the genetic architecture: ADH1B and ALDH2 are the strongest GWAS signals (p=2e-46 and 1e-25), both coding variants in enzyme active sites. This is unique to esophageal cancer — particularly squamous cell carcinoma in East Asian populations where these variants are common.

  2. FGFR2 is the top repurposing opportunity: GWAS evidence (p=1e-25), multiple approved inhibitors for other cancers (erdafitinib, pemigatinib, futibatinib), excellent structural data, and esophageal tissue expression — yet no FGFR2-targeted trials specifically for esophageal cancer are in the top listings.

  3. RHBDF2 is the most compelling undrugged target: Causative gene for tylosis with esophageal cancer (Mendelian), directly implicated via ADAM17 sheddase complex, and recent cryo-EM structures (2023) enable structure-based drug design. This represents the strongest genetic-to-functional link in the disease.

  4. Immunotherapy-genetics disconnect: The clinical trial landscape is dominated by immune checkpoint inhibitors (PD-1/PD-L1), yet the genetic architecture points to alcohol metabolism (ADH1B/ALDH2), receptor tyrosine kinase signaling (FGFR2/ERBB2), and DNA damage response (CHEK2/BRCA2) pathways. Only PTPN2, which emerged as an immuno-oncology target (enhancing anti-tumor immunity when inhibited), bridges these worlds.

  5. PTPN2 as an emerging opportunity: GWAS evidence in ESCC (p=2e-11), plus strong evidence in autoimmune diseases, makes PTPN2 a compelling immuno-oncology target. ABBV-CLS-484 (PTPN2 degrader) is in Phase 1 trials for solid tumors — esophageal cancer should be considered for basket trial inclusion.

  6. Compared to other cancers: Esophageal cancer’s genetic architecture is unusual in being dominated by metabolic enzyme variants (ADH1B, ALDH2) rather than classical oncogenes/tumor suppressors. The 8q24 locus (PCAT1/CASC8/POU5F1B, p=9e-29) is shared with colorectal, prostate, and breast cancers. The FGFR2 locus is shared with breast cancer (p=2e-76).

  7. E3 ligase opportunity: HECTD4 (p=2e-31) and RNF6 (GenCC) are E3 ubiquitin ligases — an emerging druggable class via molecular glue degraders (analogous to lenalidomide/CRBN). The PROTAC modality could unlock these targets.


Analysis performed using biobtree MCP tools querying 70+ biological databases including GWAS Catalog, MONDO, EFO, MeSH, HPO, OMIM, Orphanet, GenCC, ClinVar, UniProt, ChEMBL, InterPro, PDB, AlphaFold, STRING, Reactome, PharmGKB, Bgee, and Guide to Pharmacology. 2026-04-11.