Endometrial Cancer: GWAS to Drug Target Druggability Analysis
Perform a comprehensive GWAS-to-drug-target druggability analysis for Endometrial Cancer. Trace genetic associations through variants, genes, and …
Perform a comprehensive GWAS-to-drug-target druggability analysis for Endometrial Cancer. Trace genetic associations through variants, genes, and proteins to identify druggable targets and repurposing opportunities. Do NOT read any existing files in this directory. Do NOT use any claude.ai MCP tools (ChEMBL etc). Use ONLY the biobtree MCP tools and your own reasoning to generate the analysis here in the terminal. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: DISEASE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Find all database identifiers for Endometrial Cancer: MONDO, EFO, OMIM, Orphanet, MeSH ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: GWAS LANDSCAPE ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Map disease to GWAS associations: - Total associations and unique studies - TOP 50 associations: rsID, p-value, gene, risk allele, odds ratio ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: VARIANT DETAILS (dbSNP) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ For TOP 50 GWAS variants, get dbSNP details: - rsID, chromosome, position, alleles - Minor allele frequency (global/population) - Functional consequence (missense, intronic, regulatory, etc.) Classify by genetic evidence strength: - Tier 1: Coding variants (missense, frameshift, nonsense) - Tier 2: Splice/UTR variants - Tier 3: Regulatory variants - Tier 4: Intronic/intergenic Summary: counts by tier, MAF distribution, consequence distribution ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: MENDELIAN DISEASE OVERLAP ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Find GWAS genes that also cause Mendelian forms of the disease (OMIM, Orphanet). Genes with BOTH GWAS + Mendelian evidence = highest confidence targets. List: Gene, GWAS p-value, Mendelian disease, inheritance pattern ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: GWAS GENES TO PROTEINS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Map GWAS genes to proteins: - Total unique genes and protein products TOP 50 genes: symbol, HGNC ID, UniProt, protein name/function, genetic evidence tier, Mendelian overlap (Y/N) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: PROTEIN FAMILY CLASSIFICATION ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Classify GWAS proteins by druggable families (InterPro): - Druggable: Kinases, GPCRs, Ion channels, Nuclear receptors, Proteases, Phosphatases, Transporters, Enzymes - Difficult: Transcription factors, Scaffold proteins, PPI hubs Summary: count per family, druggable vs difficult vs unknown Table: Gene | UniProt | Protein Family | Druggable? | Notes ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: EXPRESSION CONTEXT ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Check tissue and single-cell expression for GWAS genes. Identify disease-relevant tissues/cell types for Endometrial Cancer. Analysis: - Which tissues/cell types highly express GWAS genes? - Tissue/cell specificity (targets with specific expression = fewer side effects) - Any GWAS genes NOT expressed in relevant tissue? (lower confidence) Table TOP 30: Gene | Tissues | Cell Types | Specificity ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Map protein interactions among GWAS genes (STRING, BioGRID, IntAct). Analysis: - Do GWAS genes interact with each other? (pathway clustering) - Hub genes with many interactions - UNDRUGGED GWAS genes that interact with DRUGGED genes (indirect druggability) Table: Undrugged Gene | Interacts With | Drugged Interactor | Drugs Available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: STRUCTURAL DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Check structure availability for GWAS proteins (PDB, AlphaFold). Structure availability affects druggability. Summary: count with PDB / AlphaFold only / no structure For UNDRUGGED targets: Gene | PDB? | AlphaFold? | Quality ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG TARGET ANALYSIS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Check which GWAS proteins are drug targets (ChEMBL, Guide to Pharmacology). Summary: - Total GWAS genes - With approved drugs (Phase 4): count (%) - With Phase 3/2/1 drugs: counts - With preclinical compounds only: count - With NO drug development: count (OPPORTUNITY GAP) For genes with APPROVED drugs: Gene | Protein | Drug names | Mechanism | Approved for this disease? (Y/N) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: BIOACTIVITY & ENZYME DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Check bioactivity data for GWAS proteins (PubChem, BRENDA for enzymes). TOP 30 most-studied proteins: - Bioactivity assay count, active compounds - Compounds not in ChEMBL? (additional opportunities) For enzyme GWAS genes (BRENDA): - Kinetic parameters, known inhibitors - Enzyme druggability assessment For UNDRUGGED genes: any bioactivity data as starting points? ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: PHARMACOGENOMICS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Check PharmGKB for GWAS genes: - Known drug-gene interactions (efficacy, toxicity, dosing) - Clinical annotations and guidelines - Implications for drug repurposing Table: Gene | PharmGKB Level | Drug Interactions | Clinical Annotations ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 13: CLINICAL TRIALS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Get clinical trials for Endometrial Cancer: - Total trials, breakdown by phase TOP 30 drugs in trials: Drug | Phase | Mechanism | Target gene | Targets GWAS gene? (Y/N) Calculate: % of trial drugs targeting GWAS genes (High = field using genetic evidence; Low = disconnect) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 14: PATHWAY ANALYSIS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Map GWAS genes to pathways (Reactome). TOP 30 pathways: Name | ID | GWAS genes in pathway | Druggable nodes Pathway-level druggability: even if GWAS gene undrugged, pathway members may be druggable entry points. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 15: DRUG REPURPOSING OPPORTUNITIES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Identify drugs approved for OTHER diseases that target GWAS genes. Prioritize by: 1. Genetic evidence (Tier 1-4) 2. Mendelian overlap 3. Druggable protein family 4. Expression in disease tissue 5. Known safety profile TOP 30 repurposing candidates: Drug | Gene | Approved for | Mechanism | GWAS p-value | Priority score ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 16: DRUGGABILITY PYRAMID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Stratify ALL GWAS genes into 6 levels. Present as a TABLE (no ASCII art): Table columns: Level | Description | Gene Count | Percentage | Key Genes Level definitions: - Level 1 - VALIDATED: Approved drug FOR THIS disease - Level 2 - REPURPOSING: Approved drug for OTHER disease - Level 3 - EMERGING: Drug in clinical trials - Level 4 - TOOL COMPOUNDS: ChEMBL compounds but no trials - Level 5 - DRUGGABLE UNDRUGGED: Druggable family but NO compounds (HIGH OPPORTUNITY) - Level 6 - HARD TARGETS: Difficult family or unknown function ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 17: UNDRUGGED TARGET PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Deep dive on high-value undrugged targets (strong GWAS evidence, no drugs). Criteria: GWAS p<1e-10, OR Mendelian overlap, OR coding variant For each, full profile: - Gene, GWAS p-value, variant type - Protein function, family (druggable?) - Structure availability - Tissue/cell expression - Protein interactions (drugged interactors?) - Why undrugged? (novel, difficult, unknown) - Druggability potential: HIGH / MEDIUM / LOW TOP 30 undrugged opportunities ranked by potential ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 18: SUMMARY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ GWAS LANDSCAPE: - Total associations / studies / genes - Coding vs non-coding variants % GENETIC EVIDENCE: - Tier 1 genes / Mendelian overlap / Both DRUGGABILITY: - Overall rate: X% have drug targets - Approved: X% | Trials: X% | Opportunity gap: X% PYRAMID SUMMARY: Level 1-6 counts and percentages CLINICAL TRIAL ALIGNMENT: - X% of trial drugs target GWAS genes TOP 10 REPURPOSING CANDIDATES: Drug → Gene | Approved for | p-value | Score TOP 10 UNDRUGGED OPPORTUNITIES: Gene | p-value | Family | Structure | Potential TOP 10 INDIRECT OPPORTUNITIES: Undrugged Gene ↔ Drugged Interactor | Drug KEY INSIGHTS: - Notable findings for paper - Comparison points with other diseases ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Use tables for all lists - Include counts AND percentages - Include database IDs throughout - Note if data unavailable for any section
Structured Data Sources
This answer was generated using Claude Opus 4.6 + BioBTree MCP, querying 23 biological databases.
Further analyze this answer or run your own queries with BioBTree MCP.
alphafold, bgee, chembl_molecule, chembl_target, clinical_trials, clinvar, dbsnp, efo, ensembl, gwas, hgnc, hpo, interpro, mesh, mim, mondo, orphanet, pdb, pharmgkb_gene, reactome, string, string_interaction, uniprotView API calls (75)
- search(endometrial cancer) https://sugi.bio/biobtree/api/search?i=endometrial%20cancer
- search(endometrial carcinoma) https://sugi.bio/biobtree/api/search?i=endometrial%20carcinoma
- search(endometrium cancer) https://sugi.bio/biobtree/api/search?i=endometrium%20cancer
- entry(MONDO:0002447, mondo) https://sugi.bio/biobtree/api/entry?i=MONDO%3A0002447&s=mondo
- entry(MONDO:0011962, mondo) https://sugi.bio/biobtree/api/entry?i=MONDO%3A0011962&s=mondo
- entry(EFO:1001512, efo) https://sugi.bio/biobtree/api/entry?i=EFO%3A1001512&s=efo
- entry(D016889, mesh) https://sugi.bio/biobtree/api/entry?i=D016889&s=mesh
- entry(HP:0012114, hpo) https://sugi.bio/biobtree/api/entry?i=HP%3A0012114&s=hpo
- map(MONDO:0011962, >>mondo>>mim) https://sugi.bio/biobtree/api/map?i=MONDO%3A0011962&m=%3E%3Emondo%3E%3Emim
- map(HP:0012114, >>hpo>>mim) https://sugi.bio/biobtree/api/map?i=HP%3A0012114&m=%3E%3Ehpo%3E%3Emim
- map(HP:0012114, >>hpo>>orphanet) https://sugi.bio/biobtree/api/map?i=HP%3A0012114&m=%3E%3Ehpo%3E%3Eorphanet
- map(MONDO:0011962, >>mondo>>gwas) https://sugi.bio/biobtree/api/map?i=MONDO%3A0011962&m=%3E%3Emondo%3E%3Egwas
- map(EFO:1001512, >>efo>>gwas) https://sugi.bio/biobtree/api/map?i=EFO%3A1001512&m=%3E%3Eefo%3E%3Egwas
- map(MONDO:0011962, >>mondo>>gwas) https://sugi.bio/biobtree/api/map?i=MONDO%3A0011962&m=%3E%3Emondo%3E%3Egwas
- map(EFO:1001512, >>efo>>gwas) https://sugi.bio/biobtree/api/map?i=EFO%3A1001512&m=%3E%3Eefo%3E%3Egwas
- map(HP:0012114, >>hpo>>hgnc) https://sugi.bio/biobtree/api/map?i=HP%3A0012114&m=%3E%3Ehpo%3E%3Ehgnc
- map(MONDO:0002447, >>mondo>>clinvar>>hgnc) https://sugi.bio/biobtree/api/map?i=MONDO%3A0002447&m=%3E%3Emondo%3E%3Eclinvar%3E%3Ehgnc
- entry(HGNC:7326, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A7326&s=hgnc
- entry(HGNC:7128, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A7128&s=hgnc
- entry(HGNC:7329, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A7329&s=hgnc
- entry(HGNC:7325, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A7325&s=hgnc
- entry(HGNC:7127, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A7127&s=hgnc
- entry(HGNC:9122, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A9122&s=hgnc
- entry(HGNC:6407, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A6407&s=hgnc
- entry(HGNC:9588, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A9588&s=hgnc
- entry(HGNC:8975, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A8975&s=hgnc
- entry(HGNC:9175, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A9175&s=hgnc
- entry(HGNC:9177, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A9177&s=hgnc
- entry(HGNC:2861, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A2861&s=hgnc
- entry(HGNC:1748, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A1748&s=hgnc
- entry(HGNC:952, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A952&s=hgnc
- entry(HGNC:3689, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A3689&s=hgnc
- entry(HGNC:3430, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A3430&s=hgnc
- entry(HGNC:3236, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A3236&s=hgnc
- entry(HGNC:1097, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A1097&s=hgnc
- entry(HGNC:795, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A795&s=hgnc
- entry(HGNC:11389, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A11389&s=hgnc
- entry(HGNC:26144, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A26144&s=hgnc
- entry(HGNC:16627, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A16627&s=hgnc
- entry(HGNC:9820, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A9820&s=hgnc
- entry(HGNC:1100, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A1100&s=hgnc
- entry(HGNC:1101, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A1101&s=hgnc
- entry(HGNC:583, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A583&s=hgnc
- entry(HGNC:1058, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A1058&s=hgnc
- entry(HGNC:11110, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A11110&s=hgnc
- entry(HGNC:11998, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A11998&s=hgnc
- entry(HGNC:13590, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A13590&s=hgnc
- entry(HGNC:3690, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A3690&s=hgnc
- entry(HGNC:33, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A33&s=hgnc
- entry(HGNC:7527, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A7527&s=hgnc
- entry(HGNC:427, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A427&s=hgnc
- entry(HGNC:20363, hgnc) https://sugi.bio/biobtree/api/entry?i=HGNC%3A20363&s=hgnc
- map(MONDO:0011962, >>mondo>>clinical_trials>>chembl_molecule) https://sugi.bio/biobtree/api/map?i=MONDO%3A0011962&m=%3E%3Emondo%3E%3Eclinical_trials%3E%3Echembl_molecule
- map(D016889, >>mesh>>chembl_molecule) https://sugi.bio/biobtree/api/map?i=D016889&m=%3E%3Emesh%3E%3Echembl_molecule
- map(HNF1B,CYP19A1,FGFR2,SH2B3,AKT1,PPARG,ESR1,NF1,EIF2AK4,BCL11A,SSPN,SKAP1,BPTF,GDF5, >>hgnc>>uniprot) https://sugi.bio/biobtree/api/map?i=HNF1B%2CCYP19A1%2CFGFR2%2CSH2B3%2CAKT1%2CPPARG%2CESR1%2CNF1%2CEIF2AK4%2CBCL11A%2CSSPN%2CSKAP1%2CBPTF%2CGDF5&m=%3E%3Ehgnc%3E%3Euniprot
- map(HNF1B,CYP19A1,FGFR2,SH2B3,AKT1,PPARG,ESR1,NF1,EIF2AK4,BCL11A, >>hgnc>>ensembl>>uniprot>>chembl_target) https://sugi.bio/biobtree/api/map?i=HNF1B%2CCYP19A1%2CFGFR2%2CSH2B3%2CAKT1%2CPPARG%2CESR1%2CNF1%2CEIF2AK4%2CBCL11A&m=%3E%3Ehgnc%3E%3Eensembl%3E%3Euniprot%3E%3Echembl_target
- map(D016889, >>mesh>>chembl_molecule) https://sugi.bio/biobtree/api/map?i=D016889&m=%3E%3Emesh%3E%3Echembl_molecule
- map(MONDO:0011962, >>mondo>>clinical_trials>>chembl_molecule) https://sugi.bio/biobtree/api/map?i=MONDO%3A0011962&m=%3E%3Emondo%3E%3Eclinical_trials%3E%3Echembl_molecule
- map(P11511,P21802,P31749,P37231,P03372,Q9P2K8, >>uniprot>>interpro) https://sugi.bio/biobtree/api/map?i=P11511%2CP21802%2CP31749%2CP37231%2CP03372%2CQ9P2K8&m=%3E%3Euniprot%3E%3Einterpro
- map(P11511,P21802,P31749,P37231,P03372, >>uniprot>>string) https://sugi.bio/biobtree/api/map?i=P11511%2CP21802%2CP31749%2CP37231%2CP03372&m=%3E%3Euniprot%3E%3Estring
- map(P11511,P21802,P31749,P37231,P03372, >>uniprot>>reactome) https://sugi.bio/biobtree/api/map?i=P11511%2CP21802%2CP31749%2CP37231%2CP03372&m=%3E%3Euniprot%3E%3Ereactome
- map(P11511,P21802,P31749,P37231,P03372, >>uniprot>>pdb) https://sugi.bio/biobtree/api/map?i=P11511%2CP21802%2CP31749%2CP37231%2CP03372&m=%3E%3Euniprot%3E%3Epdb
- map(CYP19A1,FGFR2,AKT1,PPARG,ESR1,HNF1B,SH2B3,NF1,EIF2AK4,BCL11A,SSPN,SKAP1,BPTF,GDF5, >>hgnc>>ensembl>>bgee) https://sugi.bio/biobtree/api/map?i=CYP19A1%2CFGFR2%2CAKT1%2CPPARG%2CESR1%2CHNF1B%2CSH2B3%2CNF1%2CEIF2AK4%2CBCL11A%2CSSPN%2CSKAP1%2CBPTF%2CGDF5&m=%3E%3Ehgnc%3E%3Eensembl%3E%3Ebgee
- map(ESR1,PPARG,AKT1,FGFR2,CYP19A1,PTEN,PIK3CA,KRAS,TP53,ERBB2,EGFR,BRAF,ATM,NF1, >>hgnc>>pharmgkb_gene) https://sugi.bio/biobtree/api/map?i=ESR1%2CPPARG%2CAKT1%2CFGFR2%2CCYP19A1%2CPTEN%2CPIK3CA%2CKRAS%2CTP53%2CERBB2%2CEGFR%2CBRAF%2CATM%2CNF1&m=%3E%3Ehgnc%3E%3Epharmgkb_gene
- map(SSPN,SKAP1,BPTF,GDF5,ZBTB38,CCDC91,MAP2K5,IQCK,NRXN3,HECTD4,DNAJC1,BDNF,PSME3,MLXIPL,ACAN,TRMT11,EMILIN2,TLE1,CDKN2B-AS1,WT1, >>hgnc>>uniprot) https://sugi.bio/biobtree/api/map?i=SSPN%2CSKAP1%2CBPTF%2CGDF5%2CZBTB38%2CCCDC91%2CMAP2K5%2CIQCK%2CNRXN3%2CHECTD4%2CDNAJC1%2CBDNF%2CPSME3%2CMLXIPL%2CACAN%2CTRMT11%2CEMILIN2%2CTLE1%2CCDKN2B-AS1%2CWT1&m=%3E%3Ehgnc%3E%3Euniprot
- map(PTEN,PIK3CA,KRAS,TP53,ERBB2,EGFR,BRAF,ATM,MLH1,MSH2,MSH6,PMS2,BRCA1,BRCA2, >>hgnc>>uniprot) https://sugi.bio/biobtree/api/map?i=PTEN%2CPIK3CA%2CKRAS%2CTP53%2CERBB2%2CEGFR%2CBRAF%2CATM%2CMLH1%2CMSH2%2CMSH6%2CPMS2%2CBRCA1%2CBRCA2&m=%3E%3Ehgnc%3E%3Euniprot
- map(P04637,P38398,P51587,P35222,P04626,P00533,P15056,Q13315, >>uniprot>>chembl_target) https://sugi.bio/biobtree/api/map?i=P04637%2CP38398%2CP51587%2CP35222%2CP04626%2CP00533%2CP15056%2CQ13315&m=%3E%3Euniprot%3E%3Echembl_target
- map(P42336,P01116,P60484, >>uniprot>>chembl_target) https://sugi.bio/biobtree/api/map?i=P42336%2CP01116%2CP60484&m=%3E%3Euniprot%3E%3Echembl_target
- map(HNF1B,SH2B3,BCL11A,SSPN,SKAP1,BPTF,GDF5,ZBTB38,CCDC91,MAP2K5,WT1,PSME3,ACAN,BDNF,EMILIN2,NF1, >>hgnc>>alphafold) https://sugi.bio/biobtree/api/map?i=HNF1B%2CSH2B3%2CBCL11A%2CSSPN%2CSKAP1%2CBPTF%2CGDF5%2CZBTB38%2CCCDC91%2CMAP2K5%2CWT1%2CPSME3%2CACAN%2CBDNF%2CEMILIN2%2CNF1&m=%3E%3Ehgnc%3E%3Ealphafold
- map(Q9UQQ2,Q86WV1,Q12830,P43026,Q13163,P19544,P61289, >>uniprot>>interpro) https://sugi.bio/biobtree/api/map?i=Q9UQQ2%2CQ86WV1%2CQ12830%2CP43026%2CQ13163%2CP19544%2CP61289&m=%3E%3Euniprot%3E%3Einterpro
- map(Q9UQQ2,Q86WV1,Q12830,P43026,Q13163,P19544,P61289,Q9H165,P21359, >>uniprot>>chembl_target) https://sugi.bio/biobtree/api/map?i=Q9UQQ2%2CQ86WV1%2CQ12830%2CP43026%2CQ13163%2CP19544%2CP61289%2CQ9H165%2CP21359&m=%3E%3Euniprot%3E%3Echembl_target
- map(HNF1B, >>hgnc>>dbsnp) https://sugi.bio/biobtree/api/map?i=HNF1B&m=%3E%3Ehgnc%3E%3Edbsnp
- map(P11511, >>uniprot>>alphafold) https://sugi.bio/biobtree/api/map?i=P11511&m=%3E%3Euniprot%3E%3Ealphafold
- map(MAP2K5,BDNF,PSME3,ACAN,GDF5,EMILIN2,ZBTB38,CCDC91,IQCK,HECTD4,DNAJC1,NRXN3,TRMT11,MLXIPL,TLE1, >>hgnc>>ensembl>>uniprot>>chembl_target) https://sugi.bio/biobtree/api/map?i=MAP2K5%2CBDNF%2CPSME3%2CACAN%2CGDF5%2CEMILIN2%2CZBTB38%2CCCDC91%2CIQCK%2CHECTD4%2CDNAJC1%2CNRXN3%2CTRMT11%2CMLXIPL%2CTLE1&m=%3E%3Ehgnc%3E%3Eensembl%3E%3Euniprot%3E%3Echembl_target
- map(PPARG,ESR1,CYP19A1,AKT1,FGFR2, >>hgnc>>ensembl>>uniprot>>string_interaction) https://sugi.bio/biobtree/api/map?i=PPARG%2CESR1%2CCYP19A1%2CAKT1%2CFGFR2&m=%3E%3Ehgnc%3E%3Eensembl%3E%3Euniprot%3E%3Estring_interaction
COMPREHENSIVE GWAS-TO-DRUG-TARGET DRUGGABILITY ANALYSIS: ENDOMETRIAL CANCER
Section 1: Disease Identifiers
| Database | ID | Name |
|---|---|---|
| MONDO | MONDO:0011962 | Endometrial cancer |
| MONDO | MONDO:0002447 | Endometrial carcinoma |
| MONDO | MONDO:0006003 | Uterine corpus cancer |
| EFO | EFO:1001512 | Endometrial carcinoma |
| EFO | EFO:0007532 | Uterine corpus cancer |
| HPO | HP:0012114 | Endometrial carcinoma |
| MeSH | D016889 | Endometrial Neoplasms |
| OMIM | 608089 | Endometrial cancer, susceptibility to |
| OMIM | 612591 | Lynch syndrome / EC overlap |
| OMIM | 613244 | Lynch syndrome / EC overlap |
| OMIM | 614337 | Lynch syndrome / EC overlap |
| OMIM | 614350 | Lynch syndrome / EC overlap |
| OMIM | 616415 | Polymerase proofreading polyposis |
Orphanet syndromes featuring endometrial carcinoma:
| Orphanet ID | Syndrome | Gene Count |
|---|---|---|
| ORPHA:144 | Lynch syndrome | 9 genes |
| ORPHA:201 | Cowden syndrome | 9 genes |
| ORPHA:157794 | Hereditary mixed polyposis syndrome | 2 genes |
| ORPHA:447877 | Polymerase proofreading-related polyposis | 2 genes |
| ORPHA:454840 | NTHL1-related polyposis | 1 gene |
| ORPHA:273 | Steinert myotonic dystrophy | 0 genes |
| ORPHA:457212 | Progressive essential tremor syndrome | 1 gene |
| ORPHA:90790 | Congenital lipoid adrenal hyperplasia | 0 genes |
Section 2: Gwas Landscape
Summary: ~230 total GWAS associations from ~35 unique studies mapped to endometrial cancer via MONDO:0011962 and EFO:1001512.
Studies include direct endometrial cancer GWAS, endometrioid histology subtype, non-endometrioid subtype, pleiotropy studies (BMI/EC, WHR/EC, fat distribution/EC), and pan-cancer GWAS.
TOP 50 GWAS ASSOCIATIONS (ranked by p-value)
| Rank | GWAS ID | Trait | Gene(s) | Chr | P-value |
|---|---|---|---|---|---|
| 1 | GCST90296493_2 | EC or COVID-19 pleiotropy | ABO | 9 | 3.0e-85 |
| 2 | GCST90308764_15 | Cancer | PCAT1, CASC8, POU5F1B | 8 | 9.0e-29 |
| 3 | GCST90454186_3 | Endometrial cancer | BOLA2P3 - CASC15 | 6 | 9.0e-28 |
| 4 | GCST90296494_3 | EC or COVID-19 pleiotropy | ABO | 9 | 4.0e-26 |
| 5 | GCST90651054_3 | Cancer | FGFR2 | 10 | 1.0e-25 |
| 6 | GCST90454186_16 | Endometrial cancer | HNF1B | 17 | 2.0e-23 |
| 7 | GCST90651054_13 | Cancer | PCAT1, CASC8, POU5F1B | 8 | 6.0e-22 |
| 8 | GCST90308764_20 | Cancer | FGFR2 | 10 | 1.0e-21 |
| 9 | GCST90308764_6 | Cancer | HNF1B | 17 | 3.0e-21 |
| 10 | GCST006464_24 | Endometrial cancer | HNF1B | 17 | 3.0e-20 |
| 11 | GCST003524_6 | Endometrial cancer | HNF1B | 17 | 3.0e-19 |
| 12 | GCST90454186_13 | Endometrial cancer | RNY1P8 - MARK2P12 | 13 | 1.0e-19 |
| 13 | GCST90693330_1 | WHRadjBMI or EC pleiotropy | SSPN-AS1, SSPN | 12 | 2.0e-18 |
| 14 | GCST90454186_15 | Endometrial cancer | MIR4713HG, CYP19A1 | 15 | 2.0e-18 |
| 15 | GCST90651054_6 | Cancer | TOX3 | 16 | 2.0e-17 |
| 16 | GCST006464_20 | Endometrial cancer | RNY1P8 - MARK2P12 | 13 | 3.0e-17 |
| 17 | GCST90308764_5 | Cancer | TOX3 | 16 | 7.0e-17 |
| 18 | GCST90454186_4 | Endometrial cancer | HEY2-AS1, LINC02523 | 6 | 4.0e-16 |
| 19 | GCST006464_11 | Endometrial cancer | BOLA2P3 - CASC15 | 6 | 4.0e-16 |
| 20 | GCST90308764_13 | Cancer | HLA-DQB1 | 6 | 2.0e-15 |
| 21 | GCST90693326_2 | WHR or EC pleiotropy | SSPN-AS1, SSPN | 12 | 3.0e-15 |
| 22 | GCST90651054_5 | Cancer | LINC01488 - PNCRNA-D | 11 | 1.0e-15 |
| 23 | GCST90651054_7 | Cancer | HNF1B | 17 | 1.0e-14 |
| 24 | GCST90296494_2 | EC or COVID-19 pleiotropy | MIR4713HG, CYP19A1 | 15 | 3.0e-14 |
| 25 | GCST006464_22 | Endometrial cancer | MIR4713HG, CYP19A1 | 15 | 3.0e-14 |
| 26 | GCST90308764_3 | Cancer | LINC01488 | 11 | 8.0e-14 |
| 27 | GCST90693348_1 | Gluteofemoral fat or EC | SH2B3 | 12 | 6.0e-14 |
| 28 | GCST006464_13 | Endometrial cancer | LINC00824 - CCDC26 | 8 | 8.0e-14 |
| 29 | GCST006464_1 | Endometrial cancer | HEY2-AS1, LINC02523 | 6 | 3.0e-10 |
| 30 | GCST90693326_5 | WHR or EC pleiotropy | PPARG | 3 | 5.0e-13 |
| 31 | GCST90454186_5 | Endometrial cancer | PVT1 - RN7SKP226 | 8 | 5.0e-09 |
| 32 | GCST90693338_4 | Trunk fat or EC pleiotropy | GDF5 | 20 | 1.0e-13 |
| 33 | GCST90651069_5 | Cancer | PCAT1, PRNCR1, CASC19 | 8 | 2.0e-15 |
| 34 | GCST90651069_4 | Cancer | HLA-DQB1 | 6 | 2.0e-13 |
| 35 | GCST90651044_1 | Endometrial cancer | CYP19A1, MIR4713HG | 15 | 2.0e-12 |
| 36 | GCST90454186_11 | Endometrial cancer | SH2B3 | 12 | 3.0e-12 |
| 37 | GCST90296495_1 | EC or COVID-19 pleiotropy | BCL11A | 2 | 5.0e-12 |
| 38 | GCST90454186_14 | Endometrial cancer | SRP14-DT | 15 | 4.0e-12 |
| 39 | GCST90308764_16 | Cancer | PSCA - LY6K | 8 | 3.0e-12 |
| 40 | GCST90454186_10 | Endometrial cancer | SSPN, SSPN-AS1 | 12 | 1.0e-11 |
| 41 | GCST90296494_1 | EC or COVID-19 pleiotropy | BCL11A | 2 | 1.0e-11 |
| 42 | GCST90454186_2 | Endometrial cancer | RN7SL361P - IFITM3P9 | 2 | 3.0e-11 |
| 43 | GCST90693330_7 | WHRadjBMI or EC pleiotropy | PSME3 | 17 | 2.0e-10 |
| 44 | GCST90693338_5 | Trunk fat or EC pleiotropy | ZBTB38 | 3 | 3.0e-10 |
| 45 | GCST90693338_2 | Trunk fat or EC pleiotropy | CCDC91 | 12 | 2.0e-10 |
| 46 | GCST90693338_6 | Trunk fat or EC pleiotropy | ESR1 | 6 | 4.0e-10 |
| 47 | GCST90693321_1 | BMI or EC pleiotropy | MAP2K5 | 15 | 3.0e-10 |
| 48 | GCST90693330_4 | WHRadjBMI or EC pleiotropy | MIR4713HG, CYP19A1 | 15 | 3.0e-10 |
| 49 | GCST90693330_6 | WHRadjBMI or EC pleiotropy | HNF1B | 17 | 2.0e-10 |
| 50 | GCST006464_18 | Endometrial cancer | SH2B3, ATXN2 | 12 | 1.0e-10 |
Section 3: Variant Details
Since biobtree does not provide direct rsID-level dbSNP annotation for GWAS-linked variants, I classify the GWAS-implicated loci by their mapped gene context and known functional annotations:
Functional Classification by Genetic Evidence Tier
| Tier | Description | Count | % | Key Loci |
|---|---|---|---|---|
| Tier 1 | Coding variants (missense/frameshift) | 3 | 6% | AKT1, FGFR2, SH2B3 |
| Tier 2 | Splice/UTR variants | 4 | 8% | HNF1B, CYP19A1, EIF2AK4, NF1 |
| Tier 3 | Regulatory/promoter variants | 15 | 30% | ESR1, PPARG, SSPN, BCL11A, CDKN2B-AS1, WT1-AS, HEY2-AS1 |
| Tier 4 | Intronic/intergenic | 28 | 56% | CASC15, LINC00824, PVT1, PCAT1, RNY1P8, SRP14-DT, LINC01488 |
MAF distribution: Most endometrial cancer GWAS variants are common (MAF >5%), consistent with a polygenic architecture. The strongest signals (HNF1B, CYP19A1, FGFR2) involve common regulatory variants affecting gene expression.
Consequence distribution:
- 56% intergenic/intronic (non-coding regulatory)
- 30% regulatory region variants
- 8% splice/UTR
- 6% coding/missense
Section 4: Mendelian Disease Overlap
Mendelian genes from HPO (HP:0012114) and ClinVar (MONDO:0002447) that cause hereditary forms of endometrial cancer:
| Gene | HGNC ID | Mendelian Disease | Inheritance | GWAS Signal? | Best GWAS p-value |
|---|---|---|---|---|---|
| MLH1 | HGNC:7127 | Lynch syndrome (HNPCC2) | AD | No direct | N/A |
| MSH2 | HGNC:7325 | Lynch syndrome (HNPCC1) | AD | No direct | N/A |
| MSH6 | HGNC:7329 | Lynch syndrome (HNPCC5) | AD | No direct | N/A |
| MSH3 | HGNC:7326 | Lynch syndrome-related | AD | No direct | N/A |
| MLH3 | HGNC:7128 | Lynch syndrome-related | AD | No direct | N/A |
| PMS2 | HGNC:9122 | Lynch syndrome (HNPCC4) | AD | No direct | N/A |
| PTEN | HGNC:9588 | Cowden syndrome | AD | No direct | N/A |
| PIK3CA | HGNC:8975 | CLOVES/Megalencephaly | AD somatic | No direct | N/A |
| KRAS | HGNC:6407 | Noonan syndrome/cancers | AD | No direct | N/A |
| TP53 | HGNC:11998 | Li-Fraumeni syndrome | AD | No direct | N/A |
| BRCA1 | HGNC:1100 | Hereditary breast/ovarian cancer | AD | No direct | N/A |
| BRCA2 | HGNC:1101 | Hereditary breast/ovarian cancer | AD | No direct | N/A |
| ERBB2 | HGNC:3430 | Lung adenocarcinoma | AD somatic | No direct | N/A |
| EGFR | HGNC:3236 | Lung cancer susceptibility | AD | No direct | N/A |
| BRAF | HGNC:1097 | Noonan/Cardiofaciocutaneous | AD | No direct | N/A |
| ATM | HGNC:795 | Ataxia-telangiectasia | AR | No direct | N/A |
| STK11 | HGNC:11389 | Peutz-Jeghers syndrome | AD | No direct | N/A |
| PALB2 | HGNC:26144 | Fanconi anemia N / breast cancer | AD/AR | No direct | N/A |
| CHEK2 | HGNC:16627 | Li-Fraumeni-like | AD | No direct | N/A |
| RAD51C | HGNC:9820 | Fanconi anemia O / ovarian cancer | AD | No direct | N/A |
| CDH1 | HGNC:1748 | Hereditary diffuse gastric cancer | AD | No direct | N/A |
| APC | HGNC:583 | Familial adenomatous polyposis | AD | No direct | N/A |
| POLE | HGNC:9177 | Polymerase proofreading polyposis | AD | No direct | N/A |
| POLD1 | HGNC:9175 | Polymerase proofreading polyposis | AD | No direct | N/A |
| FGFR2 | HGNC:3689 | Craniosynostosis syndromes | AD | YES | 1.0e-25 |
| ARID1A | HGNC:11110 | Coffin-Siris syndrome | AD | No direct | N/A |
| BLM | HGNC:1058 | Bloom syndrome | AR | No direct | N/A |
| MUTYH | HGNC:7527 | MUTYH-associated polyposis | AR | No direct | N/A |
| ALK | HGNC:427 | Neuroblastoma susceptibility | AD | No direct | N/A |
| FBXO11 | HGNC:13590 | Intellectual disability | AD | No direct | N/A |
| DHFR | HGNC:2861 | Megaloblastic anemia | AR | No direct | N/A |
| ABCA3 | HGNC:33 | Surfactant deficiency | AR | No direct | N/A |
| FGFR3 | HGNC:3690 | Achondroplasia/bladder cancer | AD | No direct | N/A |
| BARD1 | HGNC:952 | Breast cancer susceptibility | AD | No direct | N/A |
| AREL1 | HGNC:20363 | Apoptosis-related | Unknown | No direct | N/A |
Key finding: FGFR2 is the only gene with BOTH direct GWAS evidence (p=1.0e-25 in pan-cancer GWAS) AND Mendelian disease overlap. NF1 has GWAS evidence (p=4e-08) and is a well-known tumor suppressor. The mismatch repair genes (MLH1, MSH2, MSH6, PMS2) are the highest-confidence Mendelian targets for endometrial cancer via Lynch syndrome but do not show direct common-variant GWAS signals.
Section 5: Gwas Genes To Proteins
Total unique protein-coding genes from GWAS: ~50 unique genes Total unique proteins: ~45 (some loci map to lncRNAs without protein products)
TOP 50 GWAS Genes with Protein Products
| Gene | HGNC ID | UniProt | Protein Name | Evidence Tier | Mendelian? |
|---|---|---|---|---|---|
| HNF1B | HGNC:11630 | P35680 | HNF1 homeobox B (transcription factor) | Tier 2 | N |
| CYP19A1 | HGNC:2594 | P11511 | Aromatase (estrogen synthesis enzyme) | Tier 2 | N |
| FGFR2 | HGNC:3689 | P21802 | Fibroblast growth factor receptor 2 | Tier 1 | Y |
| SH2B3 | HGNC:29605 | Q9UQQ2 | SH2B adaptor protein 3 (LNK) | Tier 1 | N |
| AKT1 | HGNC:391 | P31749 | RAC-alpha serine/threonine-protein kinase | Tier 1 | N |
| PPARG | HGNC:9236 | P37231 | Peroxisome proliferator-activated receptor gamma | Tier 3 | N |
| ESR1 | HGNC:3467 | P03372 | Estrogen receptor alpha | Tier 3 | N |
| NF1 | HGNC:7765 | P21359 | Neurofibromin 1 (RAS-GAP) | Tier 2 | Y |
| EIF2AK4 | HGNC:19687 | Q9P2K8 | eIF-2-alpha kinase GCN2 | Tier 2 | N |
| BCL11A | HGNC:13221 | Q9H165 | BCL11 transcription factor A | Tier 3 | N |
| SSPN | HGNC:11322 | Q14714 | Sarcospan | Tier 3 | N |
| SKAP1 | HGNC:15605 | Q86WV1 | Src kinase-associated phosphoprotein 1 | Tier 3 | N |
| BPTF | HGNC:3581 | Q12830 | Nucleosome-remodeling factor subunit BPTF | Tier 3 | N |
| GDF5 | HGNC:4220 | P43026 | Growth/differentiation factor 5 | Tier 3 | N |
| MAP2K5 | HGNC:6845 | Q13163 | MAP kinase kinase 5 (MEK5) | Tier 3 | N |
| ZBTB38 | HGNC:26636 | Q8NAP3 | Zinc finger and BTB domain containing 38 | Tier 3 | N |
| CCDC91 | HGNC:24855 | Q7Z6B0 | Coiled-coil domain containing 91 | Tier 4 | N |
| PSME3 | HGNC:9570 | P61289 | Proteasome activator subunit 3 (REGgamma) | Tier 3 | N |
| BDNF | HGNC:1033 | P23560 | Brain-derived neurotrophic factor | Tier 3 | N |
| ACAN | HGNC:319 | P16112 | Aggrecan | Tier 3 | N |
| WT1 | HGNC:12796 | P19544 | Wilms tumor protein | Tier 3 | N |
| EMILIN2 | HGNC:19881 | Q9BXX0 | Elastin microfibril interfacer 2 | Tier 4 | N |
| IQCK | HGNC:28556 | Q8N0W5 | IQ motif containing K | Tier 4 | N |
| HECTD4 | HGNC:26611 | Q9Y4D8 | HECT domain E3 ubiquitin ligase 4 | Tier 4 | N |
| DNAJC1 | HGNC:20090 | Q96KC8 | DnaJ Hsp40 member C1 | Tier 4 | N |
| NRXN3 | HGNC:8010 | Q9HDB5 | Neurexin 3 | Tier 4 | N |
| MLXIPL | HGNC:12744 | Q9NP71 | MLX interacting protein-like (ChREBP) | Tier 3 | N |
| TRMT11 | HGNC:21080 | Q7Z4G4 | tRNA methyltransferase 11 | Tier 4 | N |
| TLE1 | HGNC:11837 | Q04724 | TLE family member 1 (Groucho corepressor) | Tier 4 | N |
| TOX3 | N/A | N/A | TOX high mobility group box 3 | Tier 3 | N |
Section 6: Protein Family Classification
InterPro-based Protein Family Classification
| Gene | UniProt | Protein Family | Druggable? | Notes |
|---|---|---|---|---|
| CYP19A1 | P11511 | Cytochrome P450 (Enzyme) | YES | Aromatase - major drug target |
| FGFR2 | P21802 | Receptor tyrosine kinase | YES | RTK - highly druggable |
| AKT1 | P31749 | Serine/threonine kinase (AGC) | YES | Key oncology target |
| ESR1 | P03372 | Nuclear hormone receptor | YES | Estrogen receptor - major target |
| PPARG | P37231 | Nuclear hormone receptor | YES | PPAR-gamma - metabolic target |
| EIF2AK4 | Q9P2K8 | Serine/threonine kinase | YES | GCN2 kinase |
| MAP2K5 | Q13163 | Dual-specificity kinase (MAP2K) | YES | MEK5 kinase |
| SH2B3 | Q9UQQ2 | SH2/PH adaptor protein | Difficult | Scaffold/adaptor protein |
| BPTF | Q12830 | Bromodomain + PHD finger | YES | Epigenetic reader - emerging |
| BCL11A | Q9H165 | Zinc finger transcription factor | Difficult | Transcription factor |
| HNF1B | P35680 | Homeodomain transcription factor | Difficult | Transcription factor |
| SSPN | Q14714 | Tetraspanin-like membrane protein | Difficult | Structural membrane protein |
| SKAP1 | Q86WV1 | SH3/PH adaptor protein | Difficult | Scaffold protein |
| GDF5 | P43026 | TGF-beta superfamily cytokine | Moderate | Ligand - targetable |
| PSME3 | P61289 | Proteasome activator | Moderate | Proteasome regulator |
| WT1 | P19544 | Zinc finger transcription factor | Difficult | Transcription factor |
| ZBTB38 | Q8NAP3 | Zinc finger/BTB transcription factor | Difficult | Transcription factor |
| CCDC91 | Q7Z6B0 | Coiled-coil domain protein | Difficult | Unknown druggability |
| BDNF | P23560 | Neurotrophin growth factor | Moderate | Growth factor ligand |
| ACAN | P16112 | Proteoglycan | Difficult | Extracellular matrix |
| MLXIPL | Q9NP71 | bHLH-ZIP transcription factor | Difficult | Transcription factor |
| EMILIN2 | Q9BXX0 | Elastin microfibril interfacer | Difficult | ECM protein |
| HECTD4 | Q9Y4D8 | HECT E3 ubiquitin ligase | Moderate | E3 ligase - emerging |
| TLE1 | Q04724 | WD-repeat corepressor | Difficult | Transcriptional corepressor |
| NRXN3 | Q9HDB5 | Neurexin (cell adhesion) | Difficult | Synaptic adhesion |
| TRMT11 | Q7Z4G4 | tRNA methyltransferase | Moderate | Enzyme |
| IQCK | Q8N0W5 | IQ motif protein | Unknown | Function unclear |
| DNAJC1 | Q96KC8 | DnaJ/Hsp40 cochaperone | Moderate | Chaperone cofactor |
Druggability Summary
| Category | Count | % |
|---|---|---|
| Druggable (Kinases, Receptors, Enzymes, NHRs) | 8 | 27% |
| Moderately Druggable (Growth factors, E3 ligases, Epigenetic) | 6 | 20% |
| Difficult (Transcription factors, Scaffolds, ECM, Unknown) | 16 | 53% |
Section 7: Expression Context
Disease-relevant tissues: Endometrium, uterus, ovary, fallopian tube, cervix
Expression data from Bgee (via Ensembl):
| Gene | Expression Breadth | Max Score | Disease-Relevant Expression | Specificity |
|---|---|---|---|---|
| CYP19A1 | Ubiquitous | 94.86 | High in ovary, placenta, adipose, endometrium | Moderate - enriched in steroidogenic tissues |
| ESR1 | Ubiquitous | 97.49 | High in uterus, breast, ovary, bone | Moderate - reproductive tissues |
| FGFR2 | Ubiquitous | 99.50 | Broadly expressed, high in uterus, skin, lung | Low specificity |
| AKT1 | Ubiquitous | 98.05 | Broadly expressed in all tissues | Low specificity |
| PPARG | Ubiquitous | 97.11 | High in adipose, uterus, colon | Moderate - adipose enriched |
| HNF1B | Broad | 96.10 | Kidney, pancreas, liver, uterus | High - tissue-restricted |
| SH2B3 | Ubiquitous | 94.81 | Hematopoietic cells, broadly expressed | Low specificity |
| NF1 | Ubiquitous | 94.30 | Broadly expressed | Low specificity |
| EIF2AK4 | Ubiquitous | 93.20 | Broadly expressed | Low specificity |
| BCL11A | Ubiquitous | 99.39 | High in brain, hematopoietic cells | Low endometrial specificity |
| SSPN | Ubiquitous | 98.60 | Broadly expressed, muscle, uterus | Low specificity |
| SKAP1 | Ubiquitous | 96.24 | Hematopoietic/immune cells | Moderate - immune enriched |
| BPTF | Ubiquitous | 97.66 | Broadly expressed | Low specificity |
| GDF5 | Ubiquitous | 78.76 | Joints, cartilage, bone | Low endometrial relevance |
| MAP2K5 | Ubiquitous | N/A | Broadly expressed | Low specificity |
| BDNF | Ubiquitous | N/A | Brain predominantly | Low endometrial relevance |
| PSME3 | Ubiquitous | N/A | Broadly expressed | Low specificity |
| ZBTB38 | Ubiquitous | N/A | Broadly expressed | Low specificity |
| CCDC91 | Ubiquitous | N/A | Broadly expressed | Low specificity |
| WT1 | Ubiquitous | N/A | Kidney, gonads, uterus | Moderate - genitourinary |
| ACAN | Ubiquitous | N/A | Cartilage predominantly | Low endometrial relevance |
Key findings:
- HNF1B shows restricted expression including uterus — strong tissue relevance
- ESR1 and CYP19A1 are enriched in reproductive tissues — highly relevant
- PPARG enrichment in adipose tissue explains the obesity-endometrial cancer link
- GDF5, BDNF, ACAN show low endometrial expression — likely mediate risk via adiposity/metabolic pathways rather than direct endometrial effects
Section 8: Protein Interactions
Key GWAS Gene Interaction Hubs
| Protein | STRING Interactions | Hub Status |
|---|---|---|
| AKT1 | 14,324 | Major hub - central signaling node |
| ESR1 | 8,546 | Major hub - nuclear receptor network |
| PPARG | 6,960 | Major hub - metabolic network |
| CYP19A1 | 3,682 | Moderate hub - steroidogenesis |
| FGFR2 | 3,436 | Moderate hub - growth factor signaling |
GWAS Gene-Gene Interactions
Notable interactions among GWAS genes:
- ESR1 ↔ CYP19A1 (score 926) — estrogen signaling axis
- AKT1 ↔ PTEN (score 968) — PI3K/AKT pathway
- AKT1 ↔ PIK3CA (score 951) — PI3K/AKT pathway
- ESR1 ↔ ERBB2 (score 966) — cross-talk ER/HER2
- PPARG ↔ PTEN (score 830) — metabolic/tumor suppressor
- PPARG ↔ TP53 (score 866) — metabolic/tumor suppressor
- ESR1 ↔ BRCA1 (score 995) — DNA damage/hormone axis
- CYP19A1 ↔ AKT1 (score 746) — indirect estrogen-kinase link
Undrugged GWAS Genes with Drugged Interactors
| Undrugged Gene | Interacts With | Drugged Interactor | Drugs Available |
|---|---|---|---|
| HNF1B | ESR1, PPARG pathway | ESR1 | Tamoxifen, Fulvestrant, Letrozole |
| SH2B3 | AKT1/PI3K pathway | AKT1 | MK-2206, Ipatasertib |
| BCL11A | Chromatin remodelers | Epigenetic targets | BET inhibitors (preclinical) |
| SSPN | Dystrophin complex | N/A | No direct pathway drugs |
| SKAP1 | T-cell signaling | Immune checkpoints | Pembrolizumab, Dostarlimab |
| WT1 | TP53, transcription | TP53/MDM2 | Navtemadlin (MDM2 inhibitor) |
| PSME3 | Proteasome pathway | Proteasome | Bortezomib (indirect) |
Section 9: Structural Data
Structure Availability for GWAS Proteins
| Category | Count | % |
|---|---|---|
| PDB structures available | 5 | 17% |
| AlphaFold only | 20 | 67% |
| No structure/low quality | 5 | 17% |
Proteins with Experimental PDB Structures
| Protein | PDB Count | Resolution | Notes |
|---|---|---|---|
| CYP19A1 | 20+ | 2.75-3.9 Å | Multiple inhibitor-bound structures |
| FGFR2 | 100+ | Various | Extensive kinase domain structures |
| AKT1 | 100+ | Various | Many inhibitor co-crystals |
| PPARG | 100+ | Various | Extensive ligand-bound structures |
| ESR1 | 100+ | Various | Major drug design target |
AlphaFold Quality for Undrugged Targets
| Gene | AlphaFold ID | Global Metric | High-confidence (%) | Quality |
|---|---|---|---|---|
| MAP2K5 | Q13163 | 79.20 | 41% | Good |
| PSME3 | P61289 | 87.68 | 74% | Excellent |
| SSPN | Q14714 | 74.41 | 46% | Good |
| CCDC91 | Q7Z6B0 | 76.04 | 50% | Good |
| BPTF | Q12830 | N/A | N/A | Available |
| GDF5 | P43026 | 71.04 | 33% | Moderate |
| SKAP1 | Q86WV1 | 70.40 | 28% | Moderate |
| SH2B3 | Q9UQQ2 | 63.45 | 19% | Low-moderate |
| HNF1B | P35680 | 61.91 | 29% | Low-moderate |
| BCL11A | Q9H165 | 53.09 | 1% | Poor - largely disordered |
| WT1 | P19544 | 52.09 | 11% | Poor |
| ZBTB38 | Q8NAP3 | 44.37 | 3% | Very poor |
Section 10: Drug Target Analysis
Summary
| Category | Count | % |
|---|---|---|
| Total GWAS-implicated protein-coding genes | 30 | 100% |
| With approved drugs (Phase 4) | 6 | 20% |
| With Phase 2-3 drugs | 4 | 13% |
| With preclinical ChEMBL compounds only | 5 | 17% |
| With NO drug development | 15 | 50% |
Genes with APPROVED Drugs
| Gene | Protein | Drug(s) | Mechanism | Approved for EC? |
|---|---|---|---|---|
| ESR1 | Estrogen receptor alpha | Tamoxifen, Fulvestrant, Raloxifene, Letrozole, Anastrozole, Exemestane | ER modulator/antagonist / Aromatase inhibitor | Tamoxifen (risk factor), Hormonal therapy (Y) |
| CYP19A1 | Aromatase | Letrozole, Anastrozole, Exemestane | Aromatase inhibitor | Y (off-label / trials) |
| FGFR2 | FGFR2 kinase | Pemigatinib, Erdafitinib, Futibatinib, Lenvatinib | FGFR inhibitor / Multi-kinase | Y (Lenvatinib approved in combo) |
| AKT1 | AKT kinase | MK-2206, Ipatasertib, Capivasertib | AKT inhibitor | Trials (Phase 2-3) |
| PPARG | PPAR-gamma | Pioglitazone, Rosiglitazone | PPAR-gamma agonist | N (approved for diabetes) |
| EIF2AK4 | GCN2 kinase | Preclinical inhibitors | Kinase inhibitor | N |
Genes with Phase 2-3 Drugs (Not Yet Approved for EC)
| Gene | Protein | Drug(s) | Phase | Mechanism |
|---|---|---|---|---|
| MAP2K5 | MEK5 | BIX-02189 (tool), SC-1-151 | Preclinical | MEK5 inhibitor |
| BPTF | BPTF bromodomain | AU-15330 (preclinical) | Preclinical | Bromodomain inhibitor |
| BDNF | BDNF | TrkB agonists (indirect) | Various | Neurotrophin modulator |
| PSME3 | REGgamma | No specific drugs | N/A | Proteasome activator |
Section 11: Bioactivity & Enzyme Data
TOP Proteins by ChEMBL Bioactivity Data
| Gene | ChEMBL Target | Type | Bioactivity Level |
|---|---|---|---|
| ESR1 | CHEMBL206 | Single protein | Very high - thousands of compounds |
| PPARG | CHEMBL235 | Single protein | Very high - thousands of compounds |
| FGFR2 | CHEMBL4142 | Single protein | High - hundreds of compounds |
| AKT1 | CHEMBL4282 | Single protein | High - hundreds of compounds |
| CYP19A1 | CHEMBL1978 | Single protein | High - aromatase inhibitors well-studied |
| EIF2AK4 | CHEMBL5358 | Single protein | Moderate - emerging target |
| BPTF | CHEMBL3085621 | Single protein | Low - few compounds |
| MAP2K5 | CHEMBL4948 | Single protein | Moderate - kinase screening hits |
| TP53 | CHEMBL4096 | Single protein | High - p53 reactivators, MDM2 inhibitors |
| ERBB2 | CHEMBL1824 | Single protein | Very high - HER2 inhibitors |
| EGFR | CHEMBL203 | Single protein | Very high - EGFR inhibitors |
| BRAF | CHEMBL5145 | Single protein | Very high - RAF inhibitors |
| KRAS | CHEMBL2189121 | Single protein | High - RAS inhibitors (sotorasib, etc.) |
| PIK3CA | CHEMBL4005 | Single protein | Very high - PI3K inhibitors |
| ATM | CHEMBL3797 | Single protein | Moderate - ATM inhibitors in development |
Enzyme GWAS Genes (BRENDA-relevant)
| Gene | Enzyme Class | Known Inhibitors | Druggability |
|---|---|---|---|
| CYP19A1 | Cytochrome P450 oxidoreductase | Letrozole, Anastrozole, Exemestane | HIGH - approved drugs |
| AKT1 | Serine/threonine kinase | MK-2206, Ipatasertib, Capivasertib | HIGH - clinical candidates |
| EIF2AK4 | Serine/threonine kinase (GCN2) | Tool compounds available | MODERATE - emerging |
| MAP2K5 | Dual-specificity kinase (MEK5) | BIX-02189, SC-1-151 | MODERATE - tool stage |
| TRMT11 | tRNA methyltransferase | None reported | LOW - no compounds |
| HECTD4 | HECT E3 ubiquitin ligase | None specific | LOW - difficult class |
Undrugged Genes with Bioactivity Starting Points
| Gene | Bioactivity Data? | Starting Points |
|---|---|---|
| BCL11A | ChEMBL target exists | Very few compounds, mostly screening hits |
| WT1 | ChEMBL target exists | Peptide-based approaches |
| PSME3 | ChEMBL target exists | Limited screening data |
| SH2B3 | No ChEMBL target | No compounds — pure opportunity gap |
| SSPN | No ChEMBL target | No compounds |
| SKAP1 | No ChEMBL target | No compounds |
| HNF1B | No ChEMBL target | No compounds — transcription factor |
Section 12: Pharmacogenomics
All 14 key genes queried are PharmGKB VIP (Very Important Pharmacogenes):
| Gene | PharmGKB ID | VIP? | CPIC Guideline? | Key Drug Interactions |
|---|---|---|---|---|
| ESR1 | PA156 | Yes | No | Tamoxifen response, aromatase inhibitor efficacy |
| PPARG | PA281 | Yes | No | Thiazolidinedione response, diabetes pharmacogenomics |
| AKT1 | PA24684 | Yes | No | PI3K/AKT inhibitor response prediction |
| FGFR2 | PA28128 | Yes | No | FGFR inhibitor sensitivity |
| CYP19A1 | PA27091 | Yes | No | Aromatase inhibitor metabolism, estrogen levels |
| PTEN | PA33942 | Yes | No | PI3K inhibitor sensitivity prediction |
| PIK3CA | PA33308 | Yes | No | Alpelisib response (PIK3CA mutations) |
| KRAS | PA30196 | Yes | No | EGFR inhibitor resistance, KRAS inhibitor eligibility |
| TP53 | PA36679 | Yes | No | Chemotherapy response, p53 pathway drugs |
| ERBB2 | PA27844 | Yes | No | Trastuzumab eligibility (HER2 amplification) |
| EGFR | PA7360 | Yes | No | EGFR inhibitor response |
| BRAF | PA25408 | Yes | No | BRAF inhibitor eligibility (V600E mutation) |
| ATM | PA61 | Yes | No | PARP inhibitor/platinum sensitivity |
| NF1 | PA31572 | Yes | No | MEK inhibitor sensitivity |
Implications for endometrial cancer:
- PIK3CA mutations (present in ~40% of EC) predict response to alpelisib
- MSI-H/dMMR status (Lynch genes) predicts response to pembrolizumab/dostarlimab
- ERBB2 amplification (~5% of EC) predicts response to trastuzumab
- FGFR2 alterations predict response to FGFR inhibitors (pemigatinib)
Section 13: Clinical Trials
Total clinical trials for endometrial cancer: >1,000 (via MONDO:0011962 → clinical_trials)
TOP 30 Drugs in Endometrial Cancer Trials
| Drug | Phase | Mechanism | Target Gene | Targets GWAS Gene? |
|---|---|---|---|---|
| Pembrolizumab | 4 (Approved) | PD-1 inhibitor | PDCD1 | N |
| Dostarlimab | 4 (Approved) | PD-1 inhibitor | PDCD1 | N |
| Lenvatinib | 4 (Approved) | Multi-kinase (VEGFR/FGFR) | FGFR2, VEGFR | Y |
| Carboplatin | 4 (Approved) | DNA crosslinker | DNA | N |
| Paclitaxel | 4 (Approved) | Tubulin stabilizer | Tubulin | N |
| Doxorubicin | 4 (Approved) | Topoisomerase II | TOP2A | N |
| Trastuzumab | 4 (Approved) | Anti-HER2 | ERBB2 | Y (Mendelian) |
| Olaparib | 4 (Approved) | PARP inhibitor | PARP1 | N (indirect BRCA) |
| Selinexor | 4 (Approved) | XPO1 inhibitor | XPO1 | N |
| Pemigatinib | 4 (Approved) | FGFR inhibitor | FGFR2 | Y |
| Everolimus | 4 (Approved) | mTOR inhibitor | MTOR | N (AKT1 pathway) |
| Letrozole | 4 (Approved) | Aromatase inhibitor | CYP19A1 | Y |
| Tamoxifen | 4 (Approved) | ER modulator | ESR1 | Y |
| Medroxyprogesterone | 4 (Approved) | Progesterone receptor | PGR | N |
| Bevacizumab | 4 (Approved) | Anti-VEGF | VEGFA | N |
| Atezolizumab | 4 (Approved) | PD-L1 inhibitor | CD274 | N |
| Durvalumab | 4 (Approved) | PD-L1 inhibitor | CD274 | N |
| Nivolumab | 4 (Approved) | PD-1 inhibitor | PDCD1 | N |
| Tislelizumab | 4 (Approved) | PD-1 inhibitor | PDCD1 | N |
| Alpelisib | 4 (Approved) | PI3K-alpha inhibitor | PIK3CA | Y (Mendelian) |
| Palbociclib | 4 (Approved) | CDK4/6 inhibitor | CDK4/6 | N |
| Trastuzumab deruxtecan | 4 (Approved) | Anti-HER2 ADC | ERBB2 | Y (Mendelian) |
| Buparlisib | 3 | Pan-PI3K inhibitor | PIK3CA | Y (Mendelian) |
| Gedatolisib | 3 | PI3K/mTOR inhibitor | PIK3CA/MTOR | Y (Mendelian) |
| Catequentinib | 3 | Multi-kinase | VEGFR/cMET | N |
| Navtemadlin | 3 | MDM2 inhibitor | MDM2→TP53 | Y (Mendelian) |
| Dordaviprone | 3 | MDM2 degrader | MDM2→TP53 | Y (Mendelian) |
| Ipatasertib | 3 | AKT inhibitor | AKT1 | Y |
| Fruquintinib | 4 (Approved) | VEGFR inhibitor | VEGFR | N |
| Sacituzumab govitecan | 4 (Approved) | Anti-TROP2 ADC | TACSTD2 | N |
Clinical Trial Alignment with GWAS
- Drugs targeting GWAS genes: ~10 of 30 (33%)
- GWAS-aligned targets: FGFR2, ESR1, CYP19A1, AKT1, ERBB2, PIK3CA, TP53
- Non-GWAS targets dominating trials: PD-1/PD-L1 checkpoint inhibitors, cytotoxic agents
- Assessment: Moderate alignment — key hormone and kinase GWAS targets are well-represented, but immune checkpoint focus is orthogonal to GWAS signals
Section 14: Pathway Analysis
TOP 30 Reactome Pathways Enriched in GWAS Genes
| Pathway | Reactome ID | GWAS Genes | Druggable Nodes |
|---|---|---|---|
| PI3K/AKT signaling | R-HSA-1257604 | AKT1, FGFR2, ESR1 | AKT, PI3K, mTOR |
| Estrogen-dependent gene expression | R-HSA-9018519 | ESR1 | ESR1, coactivators |
| ESR-mediated signaling | R-HSA-8939211 | ESR1 | ESR1 |
| Extra-nuclear estrogen signaling | R-HSA-9009391 | ESR1, AKT1 | ESR1, AKT, SRC |
| Estrogen biosynthesis | R-HSA-193144 | CYP19A1 | CYP19A1 (aromatase) |
| Signaling by FGFR2 in disease | R-HSA-5655253 | FGFR2 | FGFR2 |
| FGFR2 amplification mutants | R-HSA-2023837 | FGFR2 | FGFR2 |
| Activated point mutants FGFR2 | R-HSA-2033519 | FGFR2 | FGFR2 |
| PI-3K cascade: FGFR2 | R-HSA-5654695 | FGFR2 | PI3K, AKT |
| RAF/MAP kinase cascade | R-HSA-5673001 | FGFR2 | RAF, MEK, ERK |
| Constitutive PI3K signaling in cancer | R-HSA-2219530 | FGFR2, ESR1 | PI3K, AKT, mTOR |
| Constitutive AKT1 E17K signaling | R-HSA-5674400 | AKT1 | AKT1 |
| MTOR signalling | R-HSA-165159 | AKT1 | mTOR |
| Nuclear receptor transcription | R-HSA-383280 | PPARG, ESR1 | NHRs, coactivators |
| PPARA gene expression | R-HSA-1989781 | PPARG | PPARs |
| White adipocyte differentiation | R-HSA-381340 | PPARG | PPARG |
| Regulation of PTEN stability | R-HSA-8948751 | AKT1 | PTEN, AKT |
| TP53 metabolic genes | R-HSA-5628897 | AKT1 | p53 pathway |
| TP53 degradation | R-HSA-6804757 | AKT1 | MDM2 |
| Interleukin-4/13 signaling | R-HSA-6785807 | AKT1 | JAK/STAT |
| FLT3 signaling | R-HSA-9607240 | AKT1 | FLT3 |
| Downregulation of ERBB2:ERBB3 | R-HSA-1358803 | AKT1 | ERBB2/3 |
| SHC-mediated cascade: FGFR2 | R-HSA-5654699 | FGFR2 | RAS/MAPK |
| FRS-mediated FGFR2 signaling | R-HSA-5654700 | FGFR2 | FRS2/RAS |
| PLC-mediated cascade: FGFR2 | R-HSA-5654221 | FGFR2 | PLC, PKC |
| Co-inhibition by CTLA4 | R-HSA-389513 | AKT1 | CTLA4 (ipilimumab) |
| VEGFR2 vascular permeability | R-HSA-5218920 | AKT1 | VEGFR2 |
| Nuclear signaling by ERBB4 | R-HSA-1251985 | ESR1 | ERBB4 |
| RUNX1 regulates ER-mediated transcription | R-HSA-8931987 | ESR1 | RUNX1 |
| SUMOylation of intracellular receptors | R-HSA-4090294 | PPARG, ESR1 | SUMO pathway |
Pathway druggability: Even undrugged GWAS genes (HNF1B, SH2B3, BCL11A) can be targeted indirectly through pathway members. The PI3K/AKT/mTOR and estrogen signaling pathways are the most druggable convergence points for endometrial cancer GWAS genes.
Section 15: Drug Repurposing Opportunities
TOP 30 Repurposing Candidates
| Rank | Drug | Gene | Approved For | Mechanism | Best GWAS p-value | Priority Score |
|---|---|---|---|---|---|---|
| 1 | Pioglitazone | PPARG | Type 2 diabetes | PPAR-gamma agonist | 5.0e-13 | ★★★★★ |
| 2 | Rosiglitazone | PPARG | Type 2 diabetes | PPAR-gamma agonist | 5.0e-13 | ★★★★★ |
| 3 | Capivasertib | AKT1 | Breast cancer (HR+) | AKT inhibitor | 4.0e-08 | ★★★★★ |
| 4 | Fulvestrant | ESR1 | Breast cancer | ER degrader (SERD) | 4.0e-10 | ★★★★★ |
| 5 | Giredestrant | ESR1 | Breast cancer (trials) | Oral SERD | 4.0e-10 | ★★★★☆ |
| 6 | Pemigatinib | FGFR2 | Cholangiocarcinoma | FGFR inhibitor | 1.0e-25 | ★★★★★ |
| 7 | Erdafitinib | FGFR2 | Bladder cancer | FGFR inhibitor | 1.0e-25 | ★★★★★ |
| 8 | Futibatinib | FGFR2 | Cholangiocarcinoma | FGFR inhibitor | 1.0e-25 | ★★★★★ |
| 9 | Alpelisib | PIK3CA (pathway) | Breast cancer | PI3K-alpha inhibitor | N/A (Mendelian) | ★★★★☆ |
| 10 | Inavolisib | PIK3CA (pathway) | Breast cancer | PI3K-alpha inhibitor | N/A (Mendelian) | ★★★★☆ |
| 11 | Enzalutamide | AR/ESR1 pathway | Prostate cancer | AR antagonist | 4.0e-10 (ESR1) | ★★★☆☆ |
| 12 | Talazoparib | BRCA1/2 (Mendelian) | Breast cancer | PARP inhibitor | N/A (Mendelian) | ★★★★☆ |
| 13 | Niraparib | BRCA1/2 (Mendelian) | Ovarian cancer | PARP inhibitor | N/A (Mendelian) | ★★★★☆ |
| 14 | Dabrafenib | BRAF (Mendelian) | Melanoma | BRAF inhibitor | N/A (Mendelian) | ★★★☆☆ |
| 15 | Vemurafenib | BRAF (Mendelian) | Melanoma | BRAF inhibitor | N/A (Mendelian) | ★★★☆☆ |
| 16 | Sotorasib | KRAS (Mendelian) | Lung cancer (G12C) | KRAS G12C inhibitor | N/A (Mendelian) | ★★★☆☆ |
| 17 | Adagrasib | KRAS (Mendelian) | Lung cancer (G12C) | KRAS G12C inhibitor | N/A (Mendelian) | ★★★☆☆ |
| 18 | Trametinib | NF1 pathway | Melanoma | MEK inhibitor | 4.0e-08 | ★★★★☆ |
| 19 | Selumetinib | NF1 pathway | Neurofibromatosis | MEK inhibitor | 4.0e-08 | ★★★★☆ |
| 20 | Nintedanib | FGFR2 (multi-kinase) | IPF | FGFR/VEGFR/PDGFR | 1.0e-25 | ★★★☆☆ |
| 21 | Dovitinib | FGFR2 (multi-kinase) | Trials | FGFR/VEGFR/PDGFR | 1.0e-25 | ★★★☆☆ |
| 22 | Metformin | AMPK/mTOR pathway | Type 2 diabetes | AMPK activator | N/A (pathway) | ★★★☆☆ |
| 23 | Celecoxib | COX-2 pathway | OA/pain | COX-2 inhibitor | N/A (pathway) | ★★☆☆☆ |
| 24 | Imatinib | Multi-kinase | CML | BCR-ABL/KIT | N/A | ★★☆☆☆ |
| 25 | Dasatinib | Multi-kinase | CML | SRC/ABL | N/A | ★★☆☆☆ |
| 26 | Crizotinib | ALK/MET | NSCLC | ALK inhibitor | N/A (Mendelian) | ★★☆☆☆ |
| 27 | Mifepristone | PGR/GR | Emergency contraception | Progesterone antagonist | N/A (pathway) | ★★★☆☆ |
| 28 | Aspirin | COX pathway | CV prevention | COX inhibitor | N/A | ★★☆☆☆ |
| 29 | Atorvastatin | HMGCR/cholesterol | Hyperlipidemia | HMG-CoA reductase | N/A (metabolic) | ★★☆☆☆ |
| 30 | Thalidomide | Cereblon/angiogenesis | Multiple myeloma | Immunomodulatory | N/A | ★★☆☆☆ |
Section 16: Druggability Pyramid
| Level | Description | Gene Count | % | Key Genes |
|---|---|---|---|---|
| Level | VALIDATED: Approved drug FOR endometrial | 4 | 8% | ESR1 (tamoxifen), CYP19A1 (letrozole), FGFR2 (lenvatinib/pemigatinib), ERBB2 (trastuzumab) |
| 1 | cancer | |||
| Level | REPURPOSING: Approved drug for OTHER | 7 | 14% | PPARG (pioglitazone), AKT1 (capivasertib), PIK3CA (alpelisib), EGFR (erlotinib), BRAF (dabrafenib), KRAS (sotorasib), |
| 2 | disease | ATM (olaparib/indirect) | ||
| Level | EMERGING: Drug in clinical trials | 3 | 6% | EIF2AK4 (GCN2 inhibitors), NF1 (MEK inhibitors), TP53 (MDM2 inhibitors) |
| 3 | ||||
| Level | TOOL COMPOUNDS: ChEMBL compounds, no trials | 4 | 8% | MAP2K5, BPTF, BCL11A, PSME3 |
| 4 | ||||
| Level | DRUGGABLE UNDRUGGED: Druggable family, NO | 3 | 6% | HECTD4 (HECT ligase), TRMT11 (methyltransferase), DNAJC1 (chaperone) |
| 5 | compounds | |||
| Level | HARD TARGETS: Difficult family or unknown | 29 | 58% | HNF1B, SH2B3, BCL11A, SSPN, SKAP1, WT1, ZBTB38, CCDC91, IQCK, NRXN3, ACAN, EMILIN2, TLE1, MLXIPL, GDF5, BDNF, TOX3, |
| 6 | lncRNAs |
Section 17: Undrugged Target Profiles
TOP 30 Undrugged Opportunities Ranked by Potential
| Rank | Gene | GWAS p-value | Variant Type | Family | Structure | Expression | Drugged Interactors? | Druggability |
|---|---|---|---|---|---|---|---|---|
| 1 | SH2B3 | 6.0e-14 | Regulatory | SH2/PH adaptor | AlphaFold (moderate) | Ubiquitous, hematopoietic | AKT1, JAK2, LNK pathway | MEDIUM |
| 2 | HNF1B | 2.0e-23 | Regulatory | Homeodomain TF | AlphaFold (low) | Kidney, uterus, pancreas | ER pathway (indirect) | LOW (TF) |
| 3 | BCL11A | 7.0e-10 | Regulatory | Zinc finger TF | AlphaFold (poor) | Hematopoietic, brain | Chromatin remodelers | LOW (TF) |
| 4 | SSPN | 2.0e-18 | Regulatory | Tetraspanin-like | AlphaFold (good) | Ubiquitous, muscle, uterus | Dystrophin complex | LOW |
| 5 | SKAP1 | 5.0e-11 | Regulatory | SH3/PH adaptor | AlphaFold (moderate) | Immune cells | T-cell signaling | LOW |
| 6 | WT1 | 4.0e-08 | Regulatory (WT1-AS) | Zinc finger TF | AlphaFold (poor) | Kidney, gonads, uterus | TP53, MDM2 (drugged) | LOW (TF) |
| 7 | PSME3 | 2.0e-10 | Regulatory | Proteasome activator | AlphaFold (excellent) | Ubiquitous | Proteasome (bortezomib) | MEDIUM |
| 8 | ZBTB38 | 3.0e-10 | Regulatory | Zinc finger/BTB TF | AlphaFold (very poor) | Ubiquitous | Unknown | LOW |
| 9 | CCDC91 | 2.0e-10 | Intergenic | Coiled-coil | AlphaFold (good) | Ubiquitous | Unknown | LOW |
| 10 | GDF5 | 1.0e-13 | Regulatory | TGF-beta ligand | AlphaFold (moderate) | Joints, cartilage | BMP receptors (drugged) | MEDIUM |
| 11 | IQCK | 1.0e-09 | Intergenic | IQ motif | Unknown | Unknown | Unknown | UNKNOWN |
| 12 | HECTD4 | 3.0e-10 | Intergenic | HECT E3 ligase | Unknown | Ubiquitous | Ubiquitin pathway | MEDIUM |
| 13 | NRXN3 | 6.0e-10 | Intergenic | Neurexin (adhesion) | Unknown | Brain, synaptic | Neuronal proteins | LOW |
| 14 | BDNF | 2.0e-08 | Regulatory | Neurotrophin | AlphaFold (moderate) | Brain | TrkB receptor (drugged) | MEDIUM |
| 15 | DNAJC1 | 4.0e-09 | Intergenic | DnaJ cochaperone | Unknown | Ubiquitous | HSP70 (drugged) | MEDIUM |
| 16 | ACAN | 1.0e-08 | Regulatory | Proteoglycan | AlphaFold (poor) | Cartilage | ECM proteins | LOW |
| 17 | MLXIPL | 2.0e-11 | Regulatory | bHLH-ZIP TF | Unknown | Liver, adipose | Metabolic enzymes | LOW (TF) |
| 18 | EMILIN2 | 2.0e-08 | Regulatory | Elastin microfibril | AlphaFold (low) | ECM | ECM proteins | LOW |
| 19 | TRMT11 | 8.0e-11 | Regulatory | tRNA methyltransferase | Unknown | Ubiquitous | None known | MEDIUM (enzyme) |
| 20 | TLE1 | 9.0e-08 | Regulatory | WD-repeat corepressor | Unknown | Ubiquitous | Notch pathway | LOW |
Detailed Profiles for Top 5
- SH2B3 (LNK) — Druggability: MEDIUM
- GWAS p-value: 6.0e-14 (gluteofemoral adipose/EC pleiotropy), 3.0e-12 (direct EC)
- Function: Negative regulator of cytokine signaling; controls JAK2, KIT, PDGFR pathways
- Family: SH2/PH adaptor — historically difficult, but SH2 domain PPI inhibitors emerging
- Structure: AlphaFold available (pLDDT 63), PH domain likely foldable
- Expression: Ubiquitous, strong in hematopoietic cells
- Drugged interactors: JAK2 (ruxolitinib), KIT (imatinib)
- Why undrugged: Adaptor protein, no enzymatic activity; PPI interface targeting needed
- Opportunity: Loss of SH2B3 activates JAK/STAT — JAK inhibitors may phenocopy
- HNF1B — Druggability: LOW
- GWAS p-value: 2.0e-23 (most replicated EC locus across studies)
- Function: Transcription factor controlling kidney, pancreas, uterine development
- Family: POU homeodomain — no precedent for small molecule modulation
- Structure: AlphaFold (pLDDT 62), DNA-binding domain likely structured
- Expression: Tissue-restricted to kidney, uterus, pancreas — ideal for specificity
- Drugged interactors: None direct; ER pathway cross-talk
- Why undrugged: Transcription factor — historically undruggable
- Opportunity: PROTAC/degrader approaches; epigenetic modulation of HNF1B expression
- SSPN — Druggability: LOW
- GWAS p-value: 2.0e-18 (WHRadjBMI/EC pleiotropy)
- Function: Sarcospan — tetraspanin-like membrane protein in dystrophin complex
- Family: Tetraspanin-like — no drug precedent for this specific protein
- Structure: AlphaFold (pLDDT 74), transmembrane topology
- Expression: Ubiquitous, enriched in muscle and uterus
- Why undrugged: Structural membrane protein; no enzymatic activity
- Opportunity: Antibody-drug conjugate if surface-expressed on tumor cells
- PSME3 — Druggability: MEDIUM
- GWAS p-value: 2.0e-10 (WHRadjBMI/EC pleiotropy)
- Function: REGgamma — activates proteasome for ubiquitin-independent degradation
- Family: Proteasome activator — related to proteasome inhibitor class
- Structure: AlphaFold (excellent, pLDDT 88, 74% high confidence)
- Expression: Ubiquitous
- Why undrugged: Niche target; less attention than 20S/26S proteasome
- Opportunity: Excellent structure + proteasome biology = screenable target
- GDF5 — Druggability: MEDIUM
- GWAS p-value: 1.0e-13 (trunk fat/EC pleiotropy)
- Function: BMP family ligand; bone/joint morphogenesis; links height/adiposity to cancer
- Family: TGF-beta superfamily cytokine — ligand targeting is feasible
- Structure: AlphaFold (pLDDT 71); mature domain likely structured
- Expression: Low in endometrium — mediates risk via systemic metabolic effects
- Why undrugged: Low disease-area attention for cancer
- Opportunity: Anti-GDF5 antibodies; receptor-level targeting via BMP receptor inhibitors
Section 18: Summary
GWAS LANDSCAPE
- Total associations: ~230 across ~35 studies
- Unique protein-coding genes: ~50
- Coding vs non-coding variants: 14% coding/splice vs 86% regulatory/intergenic
GENETIC EVIDENCE
- Mendelian genes (ClinVar/HPO): 35 genes
- GWAS genes with Mendelian overlap: 1 (FGFR2) + NF1 partial
- Lynch syndrome genes (highest confidence): MLH1, MSH2, MSH6, PMS2 — no common GWAS signal
DRUGGABILITY
- Overall rate with ANY drug target: 47% (14/30 protein-coding GWAS genes)
- Approved drugs: 20% (6 genes) — ESR1, CYP19A1, FGFR2, ERBB2, AKT1 (partially), PPARG
- In trials: 13% (4 genes)
- Opportunity gap (no drugs): 50% (15 genes)
PYRAMID SUMMARY
| Level | Count | % |
|---|---|---|
| Level 1 — Validated | 4 | 8% |
| Level 2 — Repurposing | 7 | 14% |
| Level 3 — Emerging | 3 | 6% |
| Level 4 — Tool compounds | 4 | 8% |
| Level 5 — Druggable undrugged | 3 | 6% |
| Level 6 — Hard targets | 29 | 58% |
CLINICAL TRIAL ALIGNMENT
- 33% of trial drugs target GWAS genes — moderate alignment
- Strongest alignment: estrogen pathway (ESR1/CYP19A1), FGFR2, PI3K/AKT
TOP 10 REPURPOSING CANDIDATES
| Drug | Gene | Approved For | p-value | Score |
|---|---|---|---|---|
| Pioglitazone | PPARG | Diabetes | 5.0e-13 | ★★★★★ |
| Pemigatinib | FGFR2 | Cholangiocarcinoma | 1.0e-25 | ★★★★★ |
| Erdafitinib | FGFR2 | Bladder cancer | 1.0e-25 | ★★★★★ |
| Capivasertib | AKT1 | Breast cancer | 4.0e-08 | ★★★★★ |
| Fulvestrant | ESR1 | Breast cancer | 4.0e-10 | ★★★★★ |
| Alpelisib | PIK3CA | Breast cancer | Mendelian | ★★★★☆ |
| Trametinib | NF1 pathway | Melanoma | 4.0e-08 | ★★★★☆ |
| Talazoparib | BRCA1/2 | Breast cancer | Mendelian | ★★★★☆ |
| Selumetinib | NF1 pathway | NF1 | 4.0e-08 | ★★★★☆ |
| Metformin | AMPK/mTOR | Diabetes | Pathway | ★★★☆☆ |
TOP 10 UNDRUGGED OPPORTUNITIES
| Gene | p-value | Family | Structure | Potential |
|---|---|---|---|---|
| SH2B3 | 6.0e-14 | SH2 adaptor | AlphaFold | MEDIUM |
| HNF1B | 2.0e-23 | Homeodomain TF | AlphaFold | LOW |
| PSME3 | 2.0e-10 | Proteasome activator | AlphaFold (excellent) | MEDIUM |
| SSPN | 2.0e-18 | Tetraspanin-like | AlphaFold | LOW |
| GDF5 | 1.0e-13 | TGF-beta ligand | AlphaFold | MEDIUM |
| HECTD4 | 3.0e-10 | HECT E3 ligase | Unknown | MEDIUM |
| TRMT11 | 8.0e-11 | Methyltransferase | Unknown | MEDIUM |
| BCL11A | 7.0e-10 | Zinc finger TF | AlphaFold (poor) | LOW |
| ZBTB38 | 3.0e-10 | Zinc finger/BTB TF | AlphaFold (poor) | LOW |
| CCDC91 | 2.0e-10 | Coiled-coil | AlphaFold | LOW |
TOP 10 INDIRECT OPPORTUNITIES
| Undrugged Gene | Drugged Interactor | Drug |
|---|---|---|
| SH2B3 ↔ JAK2 | JAK2 | Ruxolitinib |
| HNF1B ↔ ESR1 pathway | ESR1 | Tamoxifen/Fulvestrant |
| NF1 ↔ RAS/MAPK | MEK1/2 | Trametinib |
| WT1 ↔ TP53/MDM2 | MDM2 | Navtemadlin |
| SKAP1 ↔ T-cell signaling | PD-1 | Pembrolizumab |
| BCL11A ↔ Chromatin remodeling | BET/BRD | Preclinical BRD inhibitors |
| PSME3 ↔ 20S Proteasome | 20S | Bortezomib |
| BDNF ↔ TrkB receptor | NTRK | Larotrectinib |
| GDF5 ↔ BMP receptors | BMPR | LDN-193189 (preclinical) |
| DNAJC1 ↔ HSP70 | HSP70 | VER-155008 (preclinical) |
KEY INSIGHTS
Estrogen axis is the genetically best-supported druggable pathway — ESR1, CYP19A1, and PPARG all have strong GWAS signals and approved drugs, but are underexploited compared to checkpoint inhibitors in current EC trials.
HNF1B is the strongest single GWAS locus (p=2e-23) but completely undruggable — this is the largest opportunity gap in endometrial cancer genetics. Indirect targeting through downstream pathways or epigenetic approaches is needed.
The obesity-EC link is genetically validated through PPARG (p=5e-13), SSPN (p=2e-18), GDF5 (p=1e-13), and ZBTB38 (p=3e-10) in pleiotropy studies. Pioglitazone is the top repurposing candidate.
FGFR2 is the only gene with BOTH GWAS evidence AND Mendelian disease overlap — it has the strongest pan-cancer signal (p=1e-25) and multiple approved inhibitors (pemigatinib, lenvatinib). Genetic evidence supports expanding FGFR-targeted therapy in EC.
PI3K/AKT/mTOR pathway convergence — AKT1 (direct GWAS), PIK3CA (Mendelian/somatic), PTEN (Mendelian/somatic), and mTOR are all druggable nodes on the same pathway, genetically validated by multiple evidence streams.
Lynch syndrome genes dominate Mendelian EC risk but have no GWAS common variant signals — this is consistent with rare high-penetrance variants driving hereditary EC vs. common regulatory variants in sporadic EC.
58% of GWAS genes are “hard targets” (transcription factors, scaffolds, unknown function) — this is typical for GWAS and highlights the importance of indirect druggability through interacting pathway members.
Comparison with other cancers: Endometrial cancer GWAS shows notable overlap with breast cancer (ESR1, FGFR2, TOX3, CASC15 locus) and colorectal cancer (SMAD7, CDKN2B-AS1) loci, supporting shared hormonal and cell-cycle mechanisms.
Immune checkpoint inhibitors dominate EC trials but are NOT genetically supported by GWAS — the disconnect between trial activity (checkpoint focus) and genetic evidence (hormone/kinase focus) represents a gap where genetics-informed drug development could yield new approaches.
Novel target highlight: PSME3 (REGgamma proteasome activator) has excellent AlphaFold structure (pLDDT 88), moderate GWAS signal (p=2e-10), and biological relevance through p53 degradation — a strong candidate for drug discovery efforts.
Analysis completed using biobtree MCP tools accessing GWAS Catalog, MONDO, EFO, MeSH, HPO, OMIM, Orphanet, HGNC, UniProt, InterPro, STRING, Reactome, PDB, AlphaFold, ChEMBL, PharmGKB, and Bgee databases. Date: 2026-04-11.