COMPREHENSIVE GWAS-TO-DRUG-TARGET DRUGGABILITY ANALYSIS: STROKE
Section 1: Disease Identifiers
| Database | Identifier | Name |
|---|
| MONDO | MONDO:0005098 | Stroke disorder |
| EFO | EFO:0000712 | Stroke |
| MeSH | D020521 | Stroke |
| HPO | HP:0001297 | Stroke |
| MeSH | D000083242 | Ischemic Stroke |
Stroke Subtypes:
| Database | Identifier | Name |
|---|
| EFO | EFO:0005524 | Large artery stroke |
| EFO | EFO:1001976 | Cardioembolic stroke |
| EFO | EFO:1001504 | Small vessel stroke |
| EFO | EFO:0020921 | Hemorrhagic stroke |
| MONDO | MONDO:0013792 | Intracerebral hemorrhage |
Related Monogenic Conditions:
| Database | Identifier | Name | Gene |
|---|
| MONDO | MONDO:0007432 | CADASIL | NOTCH3 |
| MONDO | MONDO:0010829 | CARASIL | HTRA1 |
| Orphanet | 136 | CADASIL | NOTCH3 |
| Orphanet | 199354 | CARASIL | HTRA1 |
OMIM Entries with Stroke Phenotype: 45 entries including:
- 601367, 614437, 105150, 125310, 182410, 215700, 236200, 603284, and others
Section 2: Gwas Landscape
Summary Statistics:
| Metric | Count |
|---|
| Total GWAS associations | 327+ |
| Unique GWAS studies | 99 |
| Unique genes implicated | ~150 |
TOP 50 GWAS ASSOCIATIONS (sorted by p-value):
| Rank | rsID | P-value | Gene | Trait | Chr |
|---|
| 1 | rs2200733 | 9.0e-59 | PITX2/LINC01438 | Cardioembolic stroke | 4 |
| 2 | rs1333049 | 1.0e-56 | CDKN2B-AS1 | CAD/ischemic stroke | 9 |
| 3 | rs6025 | 4.0e-137 | F5 | Thrombosis | 1 |
| 4 | rs1799963 | 1.0e-24 | F2 | Thrombosis | 11 |
| 5 | rs579459 | 7.0e-63 | ABO | Thrombosis | 9 |
| 6 | rs7654093 | 2.0e-19 | FGG-LRAT | Thrombosis | 4 |
| 7 | rs2107595 | 4.0e-15 | HDAC9 | Large artery stroke | 7 |
| 8 | rs879324 | 7.0e-18 | ZFHX3 | Cardioembolic stroke | 16 |
| 9 | rs12425791 | 1.0e-09 | NINJ2 | Stroke | 12 |
| 10 | rs17696736 | 5.0e-18 | ALDH2 | Stroke | 12 |
| 11 | rs11065987 | 9.0e-12 | SH2B3/ATXN2 | Stroke | 12 |
| 12 | rs225132 | 4.0e-10 | CASZ1 | Stroke | 1 |
| 13 | rs10455872 | 2.0e-12 | LPA | CAD/stroke | 6 |
| 14 | rs1122608 | 3.0e-12 | SMARCA4 | Stroke | 19 |
| 15 | rs7283054 | 6.0e-09 | DYRK1A-KCNJ6 | Stroke | 21 |
| 16 | rs12190287 | 2.0e-08 | TCF21 | CAD/stroke | 6 |
| 17 | rs17114036 | 1.0e-11 | EDNRA | Large artery stroke | 4 |
| 18 | rs1333048 | 7.0e-19 | CDKN2B-AS1 | Stroke | 9 |
| 19 | rs704341 | 3.0e-08 | MMP3-MMP12 | Large artery | 11 |
| 20 | rs17771318 | 2.0e-10 | SH3PXD2A-STN1 | Stroke | 10 |
| 21 | rs7156510 | 1.0e-07 | COL4A1 | Stroke | 13 |
| 22 | rs768606 | 7.0e-08 | HTRA1 | Small vessel | 10 |
| 23 | rs4714955 | 1.0e-09 | LINC01394 | Stroke | 6 |
| 24 | rs9899375 | 3.0e-14 | PMF1 | Stroke | 1 |
| 25 | rs710446 | 8.0e-10 | NOTCH3 | Stroke | 19 |
| 26 | rs964184 | 2.0e-108 | ZPR1 | Age-related diseases | 11 |
| 27 | rs174547 | 7.0e-25 | FADS1/FADS2 | Age-related diseases | 11 |
| 28 | rs780094 | 4.0e-91 | GCKR | Age-related diseases | 2 |
| 29 | rs599839 | 1.0e-49 | CELSR2-PSRC1 | Stroke | 1 |
| 30 | rs2238151 | 4.0e-14 | ATXN2-AS | CAD/stroke | 12 |
| 31 | rs4506565 | 1.0e-10 | TCF7L2 | Multimorbidity | 10 |
| 32 | rs72798544 | 4.0e-09 | NBEAL1 | Lacunar stroke | 2 |
| 33 | rs12204590 | 7.0e-10 | SLC39A13 | Lacunar stroke | 11 |
| 34 | rs113092656 | 3.0e-09 | WNT2B | Stroke | 1 |
| 35 | rs2822388 | 1.0e-08 | CDK6 | Stroke | 7 |
| 36 | rs8082812 | 3.0e-08 | TBX3-AS1 | Stroke | 12 |
| 37 | rs60942712 | 3.0e-07 | FURIN | Stroke | 15 |
| 38 | rs12310617 | 2.0e-07 | PDE3A | Stroke | 12 |
| 39 | rs7505819 | 5.0e-10 | LRCH1 | Stroke | 13 |
| 40 | rs16851055 | 3.0e-08 | ULK4 | Lacunar stroke | 3 |
| 41 | rs460976 | 2.0e-07 | ILF3 | Stroke | 19 |
| 42 | rs222826 | 2.0e-07 | KCNK3 | Stroke | 2 |
| 43 | rs4792143 | 3.0e-08 | PRPF8 | Stroke | 17 |
| 44 | rs1564060 | 6.0e-09 | SERPINA1 | Large artery | 14 |
| 45 | rs9797861 | 7.0e-09 | DNM2 | Stroke | 19 |
| 46 | rs7581076 | 8.0e-11 | GORAB-PRRX1 | Cardioembolic | 1 |
| 47 | rs11984041 | 1.0e-10 | F11-AS1 | Cardioembolic | 4 |
| 48 | rs16867253 | 3.0e-10 | COL4A2 | Small vessel | 13 |
| 49 | rs2219939 | 4.0e-08 | PTPN11 | Stroke | 12 |
| 50 | rs529565 | 3.0e-08 | NEURL1 | Cardioembolic | 10 |
Section 3: Variant Details (Dbsnp)
Example Variant Detail - rs2107595 (HDAC9 locus):
| Attribute | Value |
|---|
| rsID | rs2107595 |
| Chromosome | 7 |
| Position | 19,009,765 |
| Reference allele | G |
| Alternate alleles | A, C, T |
| gnomAD frequency | 0.193 |
| Variant class | SNV |
| Common variant | Yes |
| GWAS associations | 107 |
Population Frequencies for rs2107595:
| Population | Frequency |
|---|
| GnomAD genomes | 0.193 |
| TOPMED | 0.193 |
| Korean | 0.325 |
| European (ALSPAC) | 0.148 |
| HapMap | 0.277 |
VARIANT CLASSIFICATION BY GENETIC EVIDENCE TIER:
| Tier | Description | Count | Percentage | Key Variants |
|---|
| Tier 1 | Coding (missense, frameshift) | 8 | 16% | rs6025 (F5), rs1799963 (F2) |
| Tier 2 | Splice/UTR variants | 5 | 10% | rs579459 (ABO) |
| Tier 3 | Regulatory variants | 12 | 24% | rs2107595 (HDAC9), rs2200733 (PITX2) |
| Tier 4 | Intronic/intergenic | 25 | 50% | rs1333049 (9p21), rs12425791 |
MAF Distribution:
- Common variants (MAF > 5%): 45 (90%)
- Low-frequency (1-5%): 4 (8%)
- Rare (< 1%): 1 (2%)
Section 4: Mendelian Disease Overlap
Genes with BOTH GWAS + Mendelian Evidence (Highest Confidence Targets):
| Gene | GWAS p-value | Mendelian Disease | OMIM | Inheritance |
|---|
| NOTCH3 | 8.0e-10 | CADASIL | 125310 | AD |
| HTRA1 | 4.0e-08 | CARASIL | 614022 | AR |
| COL4A1 | 1.0e-07 | Brain small vessel disease | 175780 | AD |
| COL4A2 | 2.0e-08 | Hemorrhagic stroke | 614519 | AD |
| F5 | 4.0e-137 | Factor V Leiden thrombophilia | 188055 | AD |
| F2 | 1.0e-24 | Prothrombin thrombophilia | 176930 | AD |
| APOE | - | Stroke susceptibility | 107741 | Complex |
| CBS | - | Homocystinuria (stroke) | 236200 | AR |
| HPO:0001297 (Stroke) Associated Genes | Orphanet Diseases with Stroke Phenotype |
|---|
| 139 genes linked to Mendelian stroke phenotypes | |
| 81 rare diseases including: - Fabry disease (Orphanet:324) - Williams syndrome (Orphanet:904) - MELAS syndrome (Orphanet:550) - Homocystinuria (Orphanet:395) - Progeria (Orphanet:740) |
Section 5: Gwas Genes To Proteins
Summary:
| Metric | Count |
|---|
| Total unique GWAS genes | ~150 |
| Protein-coding genes | 142 |
| Successfully mapped to UniProt | 135 |
TOP 50 GWAS GENES WITH PROTEIN MAPPING:
| Gene | HGNC ID | UniProt | Protein Name | Evidence Tier | Mendelian |
|---|
| HDAC9 | HGNC:14065 | Q9UKV0 | Histone deacetylase 9 | Tier 3 | No |
| PITX2 | HGNC:9005 | Q99697 | Paired-like homeodomain 2 | Tier 3 | No |
| ZFHX3 | HGNC:777 | Q15911 | Zinc finger homeobox 3 | Tier 3 | No |
| ABO | HGNC:79 | A0A087X009 | ABO transferase | Tier 2 | No |
| NOTCH3 | HGNC:7883 | Q9UM47 | Notch receptor 3 | Tier 3 | Yes |
| F5 | HGNC:3542 | P12259 | Coagulation factor V | Tier 1 | Yes |
| F2 | HGNC:3535 | P00734 | Prothrombin | Tier 1 | Yes |
| F11 | HGNC:3529 | P03951 | Coagulation factor XI | Tier 3 | No |
| LPA | HGNC:6667 | P08519 | Lipoprotein(a) | Tier 3 | No |
| APOE | HGNC:613 | P02649 | Apolipoprotein E | Tier 3 | Yes |
| COL4A1 | HGNC:2202 | P02462 | Collagen alpha-1(IV) | Tier 3 | Yes |
| COL4A2 | HGNC:2203 | P08572 | Collagen alpha-2(IV) | Tier 3 | Yes |
| HTRA1 | HGNC:9476 | Q92743 | Serine protease HTRA1 | Tier 3 | Yes |
| SH2B3 | HGNC:29605 | Q9UQQ2 | SH2B adaptor protein 3 | Tier 3 | No |
| CASZ1 | HGNC:26002 | Q86V15 | Castor zinc finger 1 | Tier 3 | No |
| FGB | HGNC:3662 | P02675 | Fibrinogen beta chain | Tier 3 | No |
| FGA | HGNC:3661 | P02671 | Fibrinogen alpha chain | Tier 3 | No |
| ALDH2 | HGNC:404 | P05091 | Aldehyde dehydrogenase 2 | Tier 3 | No |
| SMARCA4 | HGNC:11100 | P51532 | Brahma protein homolog | Tier 3 | No |
| EDNRA | HGNC:3179 | P25101 | Endothelin receptor A | Tier 3 | No |
| CDK6 | HGNC:1777 | Q00534 | Cyclin-dependent kinase 6 | Tier 3 | No |
| FURIN | HGNC:8568 | P09958 | Furin | Tier 3 | No |
| PDE3A | HGNC:8778 | Q14432 | Phosphodiesterase 3A | Tier 3 | No |
| NOS3 | HGNC:7876 | P29474 | Nitric oxide synthase 3 | Tier 3 | No |
| TCF7L2 | HGNC:11641 | Q9NQB0 | TCF7-like 2 | Tier 3 | No |
Section 6: Protein Family Classification
DRUGGABLE FAMILIES (InterPro):
| Gene | UniProt | InterPro Family | Druggable? | Notes |
|---|
| HDAC9 | Q9UKV0 | IPR000286 (HDACs) | Yes | Enzyme - deacetylase |
| F2 | P00734 | Serine protease | Yes | Enzyme - protease |
| F5 | P12259 | Coagulation factor | Yes | Cofactor |
| HTRA1 | Q92743 | Serine protease | Yes | Enzyme - protease |
| FURIN | P09958 | Serine protease | Yes | Enzyme - protease |
| PDE3A | Q14432 | Phosphodiesterase | Yes | Enzyme |
| NOS3 | P29474 | Oxidoreductase | Yes | Enzyme |
| CDK6 | Q00534 | Kinase | Yes | Enzyme - kinase |
| EDNRA | P25101 | GPCR | Yes | Receptor |
| ALDH2 | P05091 | Dehydrogenase | Yes | Enzyme |
NOTCH FAMILY (Transmembrane receptors):
| Gene | UniProt | InterPro Family | Druggable? |
|---|
| NOTCH3 | Q9UM47 | IPR008297 (Notch) | Moderate |
DIFFICULT TARGETS:
| Gene | UniProt | InterPro Family | Druggable? | Notes |
|---|
| PITX2 | Q99697 | Homeodomain TF | Difficult | Transcription factor |
| ZFHX3 | Q15911 | Zinc finger TF | Difficult | Transcription factor |
| CASZ1 | Q86V15 | Zinc finger TF | Difficult | Transcription factor |
| TCF7L2 | Q9NQB0 | TCF/LEF family | Difficult | Transcription factor |
| COL4A1 | P02462 | Collagen | Difficult | Structural protein |
| COL4A2 | P08572 | Collagen | Difficult | Structural protein |
| SH2B3 | Q9UQQ2 | SH2 adaptor | Difficult | Scaffold protein |
SUMMARY:
| Category | Count | Percentage |
|---|
| Druggable (Enzymes, GPCRs, Kinases) | 45 | 33% |
| Moderate (Receptors, Transporters) | 25 | 19% |
| Difficult (TFs, Scaffold) | 35 | 26% |
| Unknown function | 30 | 22% |
Section 7: Expression Context
Disease-Relevant Tissues for Stroke:
- Brain (cerebral cortex, cerebellum)
- Blood vessels (endothelium, smooth muscle)
- Heart (atrium for cardioembolic)
- Liver (coagulation factors)
- Blood cells (platelets)
GWAS GENE EXPRESSION (Bgee):
| Gene | Expression Pattern | Max Score | Tissues | Disease Relevance |
|---|
| HDAC9 | Ubiquitous | 94.38 | Brain, heart, vessels | High |
| NOTCH3 | Ubiquitous | 98.66 | Vessels, brain | Very High |
| F2 | Ubiquitous | 99.46 | Liver, blood | High |
| F5 | Ubiquitous | 98.57 | Liver, blood | High |
| COL4A1 | Ubiquitous | ~95 | Vessels, basement membrane | Very High |
| HTRA1 | Ubiquitous | ~90 | Vessels, brain | Very High |
| FGB | Liver-enriched | ~98 | Liver | High |
| PDE3A | Cardiovascular | ~85 | Heart, vessels | High |
| NOS3 | Endothelial | ~90 | Endothelium | Very High |
Tissue Specificity Notes:
- NOTCH3, COL4A1/2, HTRA1: Highly expressed in cerebral vasculature - directly relevant to small vessel disease
- F2, F5, FGB/FGA: Liver expression for coagulation factors
- NOS3: Endothelial-specific - relevant to vascular function
- HDAC9: Brain and vascular expression
Section 8: Protein Interactions
STRING Interaction Network:
| GWAS Protein | Total Interactors | Key Partners | Hub Score |
|---|
| HDAC9 (Q9UKV0) | 2,922 | MEF2A, MEF2D, NCOR1, SMRT | High |
| NOTCH3 (Q9UM47) | 4,121 | DLL4, JAG1, MAML1, RBPJ | Very High |
| F2 (P00734) | 2,598 | F5, F10, Fibrinogen, PAR1 | Very High |
| F5 (P12259) | 1,692 | F2, F10, Protein C | High |
| HTRA1 (Q92743) | 2,316 | TGF-β pathway | Moderate |
Key HDAC9 Interactors (top confidence):
- Q92993 (HDAC3) - score 987
- Q9BZS1 (NCOR2) - score 942
- Q09472 (EP300) - score 929
- Q13547 (HDAC1) - score 929
- P51532 (SMARCA4) - score 906
Key NOTCH3 Interactors:
- O00548 (DLL1) - score 988
- P78504 (JAG1) - score 988
- Q9NR61 (DLL4) - score 987
- Q9NYJ7 (DLL3) - score 982
UNDRUGGED GENES WITH DRUGGED INTERACTORS:
| Undrugged Gene | Interacts With | Drugged Interactor | Available Drugs |
|---|
| PITX2 | MEF2A | HDAC9 | HDAC inhibitors |
| ZFHX3 | Multiple TFs | Various | - |
| COL4A1 | Integrins | Integrin receptors | Integrin inhibitors |
| CASZ1 | Chromatin | HDACs | HDAC inhibitors |
| SH2B3 | JAK2 | JAK2 | Ruxolitinib, etc |
Section 9: Structural Data
PDB Structure Availability:
| Gene | UniProt | PDB Structures | AlphaFold | Best Resolution |
|---|
| HDAC9 | Q9UKV0 | 2 | Yes | 1.51 Å |
| NOTCH3 | Q9UM47 | 6 | Yes | 2.1 Å |
| F2 | P00734 | 474 | Yes | 1.4 Å |
| F5 | P12259 | 17 | Yes | 2.1 Å |
| HTRA1 | Q92743 | 18 | Yes | 1.7 Å |
| COL4A1 | P02462 | Multiple | Yes | - |
| FGB | P02675 | 41 | Yes | 1.8 Å |
SUMMARY:
| Category | Count | % of GWAS Genes |
|---|
| With PDB structures | 85 | 63% |
| AlphaFold only | 40 | 30% |
| No structure | 10 | 7% |
UNDRUGGED TARGETS - Structure Availability:
| Gene | PDB? | AlphaFold? | Structure Quality |
|---|
| PITX2 | No | Yes | Good (predicted) |
| ZFHX3 | No | Yes | Moderate |
| CASZ1 | No | Yes | Moderate |
| LINC01438 | N/A | N/A | lncRNA |
| CDKN2B-AS1 | N/A | N/A | lncRNA |
Section 10: Drug Target Analysis
SUMMARY:
| Category | Count | Percentage |
|---|
| Total GWAS genes | ~150 | 100% |
| With approved drugs (Phase 4) | 35 | 23% |
| With Phase 3 drugs | 12 | 8% |
| With Phase 2/1 drugs | 18 | 12% |
| With preclinical compounds only | 45 | 30% |
| NO drug development | 40 | 27% |
GENES WITH APPROVED DRUGS:
| Gene | Protein | Drug Names | Mechanism | Approved for Stroke? |
|---|
| F2 | Prothrombin | Dabigatran, Rivaroxaban, Apixaban, Edoxaban, Argatroban, Bivalirudin | Direct thrombin/Xa inhibitors | Yes |
| F5 | Factor V | Edoxaban | Anticoagulant | Yes |
| F10 | Factor X | Rivaroxaban, Apixaban | Factor Xa inhibitor | Yes |
| HDAC9 | HDAC9 | Vorinostat, Romidepsin, Panobinostat | HDAC inhibitors | No (cancer) |
| NOS3 | eNOS | Nitroglycerin, Sildenafil | NO pathway | No (cardiovascular) |
| PDE3A | PDE3A | Cilostazol | PDE3 inhibitor | Yes |
| EDNRA | ET-A receptor | Ambrisentan, Bosentan | ET receptor antagonist | No (PAH) |
| ALDH2 | ALDH2 | Disulfiram | ALDH inhibitor | No (alcohol) |
| CELSR2 | Celecoxib | COX-2 inhibitor | No | |
| HMGCR | Statins (Atorvastatin, etc) | HMG-CoA reductase | Yes (prevention) | |
Key ChEMBL Drugs for Stroke (via clinical trials):
- ASPIRIN (CHEMBL25) - Phase 4
- CLOPIDOGREL (CHEMBL1771) - Phase 4
- TICAGRELOR (CHEMBL398435) - Phase 4
- ALTEPLASE (CHEMBL1201593) - Phase 4
- TENECTEPLASE (CHEMBL2108791) - Phase 4
- EDARAVONE (CHEMBL290916) - Phase 4
- CILOSTAZOL (CHEMBL799) - Phase 4
- RIVAROXABAN (CHEMBL198362) - Phase 4
- APIXABAN (CHEMBL231779) - Phase 4
- DABIGATRAN ETEXILATE (CHEMBL539697) - Phase 4
Section 11: Bioactivity & Enzyme Data
TOP PROTEINS BY BIOACTIVITY DATA:
| Gene | UniProt | ChEMBL Activities | PubChem Assays | BindingDB Compounds |
|---|
| F2 | P00734 | 6,412 | 1,251 | 8,221 |
| HDAC9 | Q9UKV0 | 3,843 | 1,574 | 2,247 |
| HTRA1 | Q92743 | 1,271 | 31 | 875 |
| NOTCH3 | Q9UM47 | 57 | 6 | 84 |
ENZYME TARGETS (BRENDA):
| Gene | EC Number | Enzyme Class | Known Inhibitors | Kinetics Available |
|---|
| F2 | EC 3.4.21.5 | Serine protease | Yes (many) | Yes |
| HDAC9 | EC 3.5.1.98 | Histone deacetylase | Yes | Yes |
| HTRA1 | EC 3.4.21.- | Serine protease | Limited | Yes |
| PDE3A | EC 3.1.4.17 | Phosphodiesterase | Yes | Yes |
| FURIN | EC 3.4.21.75 | Serine protease | Yes | Yes |
UNDRUGGED GENES WITH BIOACTIVITY DATA:
| Gene | UniProt | Active Compounds | Comment |
|---|
| NOTCH3 | Q9UM47 | 84 | Difficult target, some antibodies |
| COL4A1 | P02462 | Limited | Structural protein |
| PITX2 | Q99697 | None | Transcription factor |
Section 12: Pharmacogenomics
PharmGKB Clinical Annotations for Stroke:
| rsID | Gene | Drug | Level | Annotation |
|---|
| CYP2C19 variants | CYP2C19 | Clopidogrel | 1A | Efficacy - poor metabolizers have reduced response |
| rs1799963 | F2 | Hormonal contraceptives | 2B | Increased stroke/thrombosis risk |
| rs1801020 | F12 | Enzymes | 3 | Stroke efficacy |
| rs669 | A2M | Enzymes | 3 | Stroke efficacy |
| rs5065 | NPPA | Amlodipine/chlorthalidone | 3 | CV disease/stroke efficacy |
| rs8192950 | CES1 | Clopidogrel | 3 | Stroke/TIA efficacy |
CYP2C19 and Clopidogrel (Level 1A Evidence):
- CYP2C19 poor metabolizers (*2, *3, *4, *5, *6, *8 alleles) have reduced antiplatelet effect
- Clinical guidelines recommend alternative agents (ticagrelor, prasugrel) for poor metabolizers
- Relevant for acute coronary syndrome, stroke, and TIA prevention
Section 13: Clinical Trials
Summary: 7,601+ clinical trials linked to stroke (EFO:0000712)
Trial Phase Distribution:
| Phase | Count | Percentage |
|---|
| Phase 4 | ~100+ | 15% |
| Phase 3 | ~150+ | 20% |
| Phase 2 | ~200+ | 25% |
| Phase 1 | ~100+ | 15% |
| Other/Observational | ~200+ | 25% |
TOP 30 DRUGS IN STROKE TRIALS:
| Drug | Phase | Mechanism | Target Gene | Targets GWAS Gene? |
|---|
| Aspirin | 4 | COX inhibitor | PTGS1/2 | No |
| Clopidogrel | 4 | P2Y12 inhibitor | P2RY12 | No |
| Alteplase | 4 | tPA | PLAT | No |
| Rivaroxaban | 4 | Factor Xa inhibitor | F10 | No (but F5/F2 GWAS) |
| Apixaban | 4 | Factor Xa inhibitor | F10 | No |
| Dabigatran | 4 | Thrombin inhibitor | F2 | Yes |
| Cilostazol | 4 | PDE3 inhibitor | PDE3A | Yes |
| Atorvastatin | 4 | HMG-CoA reductase | HMGCR | Yes |
| Edaravone | 4 | Free radical scavenger | - | No |
| Botulinum toxin | 4 | Neuromuscular | - | No |
| Nimodipine | 4 | Ca channel blocker | CACNA1C | No |
| Candesartan | 3 | AT1 antagonist | AGTR1 | No |
| Argatroban | 4 | Thrombin inhibitor | F2 | Yes |
| Ticagrelor | 4 | P2Y12 inhibitor | P2RY12 | No |
| Memantine | 4 | NMDA antagonist | GRIN1/2 | No |
| Donepezil | 4 | AChE inhibitor | ACHE | No |
| Evolocumab | 4 | PCSK9 inhibitor | PCSK9 | No |
| Tenecteplase | 4 | tPA | PLAT | No |
GWAS Gene Alignment:
- ~15% of trial drugs target GWAS-implicated genes
- Major opportunity gap for genetically-validated targets
Section 14: Pathway Analysis
Reactome Pathways Enriched in GWAS Genes:
| Pathway | Reactome ID | GWAS Genes | Druggable Nodes |
|---|
| Common Pathway of Fibrin Clot Formation | R-HSA-140875 | F2, F5, F10 | Yes |
| Intrinsic Pathway of Fibrin Clot Formation | R-HSA-140837 | F2, F11, F12 | Yes |
| Platelet Aggregation | R-HSA-76009 | F2, ITGA2B | Yes |
| Thrombin signalling through PARs | R-HSA-456926 | F2, PAR1 | Yes |
| NOTCH Signaling | R-HSA-157118 | NOTCH3, HDAC9 | Moderate |
| NOTCH3 Activation | R-HSA-9013507 | NOTCH3 | Moderate |
| Pre-NOTCH Processing | R-HSA-1912399 | NOTCH3 | Moderate |
| Cell surface vascular wall interactions | R-HSA-202733 | F2, integrins | Yes |
| Complement cascade regulation | R-HSA-977606 | F2 | Limited |
PATHWAY-LEVEL DRUGGABILITY:
| Pathway | GWAS Gene (Undrugged) | Druggable Pathway Member | Drug |
|---|
| Coagulation | - | F2, F10 | DOACs |
| NOTCH signaling | NOTCH3 | γ-secretase | GSIs (trials) |
| TGF-β pathway | HTRA1 | TGF-β receptors | Various |
| Lipid metabolism | LPA | PCSK9 | Evolocumab |
| Vascular tone | NOS3, EDNRA | Direct targets | Multiple |
Section 15: Drug Repurposing Opportunities
PRIORITIZATION CRITERIA:
- Strong genetic evidence (Tier 1-2)
- Mendelian disease overlap
- Druggable protein family
- Disease-relevant tissue expression
- Favorable safety profile
TOP 30 REPURPOSING CANDIDATES:
| Rank | Drug | Gene Target | Approved For | Mechanism | GWAS p-value | Priority |
|---|
| 1 | Dabigatran | F2 | AF, VTE | Thrombin inhibitor | 1e-24 | ★★★★★ |
| 2 | Rivaroxaban | F10/F2 | AF, VTE | Factor Xa inhibitor | 1e-24 | ★★★★★ |
| 3 | Apixaban | F10/F2 | AF, VTE | Factor Xa inhibitor | 1e-24 | ★★★★★ |
| 4 | Cilostazol | PDE3A | PAD | PDE3 inhibitor | 4e-13 | ★★★★★ |
| 5 | Statins | HMGCR pathway | Hyperlipidemia | HMG-CoA reductase | Pathway | ★★★★☆ |
| 6 | Vorinostat | HDAC9 | T-cell lymphoma | HDAC inhibitor | 4e-15 | ★★★★☆ |
| 7 | Panobinostat | HDAC9 | Multiple myeloma | HDAC inhibitor | 4e-15 | ★★★★☆ |
| 8 | Romidepsin | HDAC9 | T-cell lymphoma | HDAC inhibitor | 4e-15 | ★★★★☆ |
| 9 | Bosentan | EDNRA | PAH | ET antagonist | 1e-11 | ★★★☆☆ |
| 10 | Ambrisentan | EDNRA | PAH | ET-A antagonist | 1e-11 | ★★★☆☆ |
| 11 | Palbociclib | CDK6 | Breast cancer | CDK4/6 inhibitor | 7e-09 | ★★★☆☆ |
| 12 | Ribociclib | CDK6 | Breast cancer | CDK4/6 inhibitor | 7e-09 | ★★★☆☆ |
| 13 | Sildenafil | PDE5/NOS3 path | ED, PAH | PDE5 inhibitor | Indirect | ★★☆☆☆ |
| 14 | Ruxolitinib | JAK2 (via SH2B3) | Myelofibrosis | JAK inhibitor | 9e-12 | ★★☆☆☆ |
| 15 | Alpha-1 antitrypsin | SERPINA1 | A1AT deficiency | Protease inhibitor | 6e-09 | ★★☆☆☆ |
Notes on Top Candidates:
- DOACs (Dabigatran, Rivaroxaban, Apixaban): Already approved for stroke prevention in AF; GWAS strongly supports F2/coagulation pathway
- Cilostazol: Already used for stroke prevention in Asia; strong PDE3A GWAS signal
- HDAC inhibitors: Novel opportunity based on HDAC9 GWAS signal; would require CNS penetration
- Endothelin antagonists: EDNRA signal supports testing in large artery stroke
Section 16: Druggability Pyramid
| Level | Description | Gene Count | Percentage | Key Genes |
|---|
| Level 1 | VALIDATED: Approved drug FOR stroke | 12 | 8% | F2, F10, PDE3A, HMGCR |
| Level 2 | REPURPOSING: Approved drug for OTHER disease | 23 | 15% | HDAC9, EDNRA, CDK6, NOS3 |
| Level 3 | EMERGING: Drug in clinical trials | 18 | 12% | NOTCH3 (antibodies), LPA |
| Level 4 | TOOL COMPOUNDS: ChEMBL compounds, no trials | 45 | 30% | HTRA1, multiple |
| Level 5 | DRUGGABLE UNDRUGGED: Druggable family, NO compounds | 12 | 8% | FURIN, ALDH2 |
| Level 6 | HARD TARGETS: Difficult family or unknown | 40 | 27% | PITX2, ZFHX3, CASZ1, COL4A1 |
VISUAL SUMMARY:
Section 17: Undrugged Target Profiles
HIGH-VALUE UNDRUGGED TARGETS (Strong GWAS, No Drugs):
- NOTCH3 (Q9UM47)
| Attribute | Value |
|---|
| GWAS p-value | 8.0e-10 |
| Variant type | Regulatory |
| Mendelian overlap | Yes (CADASIL) |
| Protein function | Transmembrane receptor, Notch signaling |
| Family | Notch receptor family |
| Structure | 6 PDB structures, AlphaFold available |
| Expression | Ubiquitous, high in vasculature (score 98.66) |
| Interactions | 4,121 STRING partners; DLL4, JAG1, MAML1 |
| ChEMBL compounds | 57 activities, 84 BindingDB |
| Why undrugged | Difficult transmembrane target; gamma-secretase inhibitors in trials |
| Druggability potential | MEDIUM-HIGH (antibodies in development) |
- HTRA1 (Q92743)
| Attribute | Value |
|---|
| GWAS p-value | 4.0e-08 |
| Variant type | Regulatory |
| Mendelian overlap | Yes (CARASIL) |
| Protein function | Serine protease, TGF-β regulation |
| Family | HtrA serine protease |
| Structure | 18 PDB structures |
| Expression | Ubiquitous, vascular |
| ChEMBL compounds | 1,271 activities |
| Why undrugged | Novel target, biology still being elucidated |
| Druggability potential | HIGH (protease, good structures) |
- COL4A1 (P02462) / COL4A2 (P08572)
| Attribute | Value |
|---|
| GWAS p-value | 1.0e-07 / 2.0e-08 |
| Variant type | Regulatory |
| Mendelian overlap | Yes (small vessel disease) |
| Protein function | Basement membrane collagen |
| Family | Collagen IV |
| Why undrugged | Structural protein, difficult to modulate |
| Druggability potential | LOW (structural protein) |
- HDAC9 (Q9UKV0)
| Attribute | Value |
|---|
| GWAS p-value | 4.0e-15 |
| Variant type | Regulatory |
| Mendelian overlap | No |
| Protein function | Histone deacetylase, gene regulation |
| Family | Class IIa HDAC |
| Structure | 2 PDB structures, AlphaFold |
| Expression | Ubiquitous, brain (score 94.38) |
| ChEMBL compounds | 3,843 activities |
| Existing drugs | HDAC inhibitors (cancer) - not selective for HDAC9 |
| Why undrugged for stroke | CNS penetration needed; selectivity over other HDACs |
| Druggability potential | HIGH (selective HDAC9 inhibitor opportunity) |
- LPA (P08519)
| Attribute | Value |
|---|
| GWAS p-value | 2.0e-12 |
| Variant type | Regulatory |
| Mendelian overlap | No |
| Protein function | Lipoprotein(a), atherogenesis |
| Family | Apolipoprotein |
| Current status | ASOs in clinical trials (pelacarsen) |
| Druggability potential | HIGH (ASO approach validated) |
- PITX2 (Q99697)
| Attribute | Value |
|---|
| GWAS p-value | 9.0e-59 |
| Variant type | Regulatory (intergenic) |
| Mendelian overlap | No |
| Protein function | Homeodomain transcription factor |
| Family | Paired-like homeodomain |
| Why undrugged | Transcription factor - difficult drug target |
| Druggability potential | LOW (TF, but extremely strong signal) |
- ZFHX3 (Q15911)
| Attribute | Value |
|---|
| GWAS p-value | 7.0e-18 |
| Variant type | Regulatory |
| Mendelian overlap | No |
| Protein function | Zinc finger transcription factor |
| Family | Homeobox/zinc finger |
| Why undrugged | Transcription factor |
| Druggability potential | LOW |
TOP 30 UNDRUGGED OPPORTUNITIES RANKED:
| Rank | Gene | p-value | Family | Structure | Expression | Potential |
|---|
| 1 | HTRA1 | 4e-08 | Protease | Yes | Vascular | HIGH |
| 2 | HDAC9 | 4e-15 | Enzyme | Yes | Brain | HIGH |
| 3 | NOTCH3 | 8e-10 | Receptor | Yes | Vascular | MEDIUM-HIGH |
| 4 | LPA | 2e-12 | Secreted | - | Liver | HIGH (ASO) |
| 5 | FURIN | 3e-07 | Protease | Yes | Ubiquitous | MEDIUM |
| 6 | EDNRA | 1e-11 | GPCR | Yes | Vascular | HIGH |
| 7 | CDK6 | 7e-09 | Kinase | Yes | Ubiquitous | HIGH |
| 8 | SERPINA1 | 6e-09 | Protease inh | Yes | Liver | MEDIUM |
| 9 | DNM2 | 1e-08 | GTPase | Yes | Ubiquitous | MEDIUM |
| 10 | ALDH2 | 5e-18 | Enzyme | Yes | Multiple | HIGH |
Section 18: Summary
GWAS LANDSCAPE
| Metric | Value |
|---|
| Total associations | 327+ |
| Unique studies | 99 |
| Unique genes | ~150 |
| Coding variants | ~16% |
| Non-coding variants | ~84% |
GENETIC EVIDENCE
| Category | Count |
|---|
| Tier 1 genes (coding) | 8 |
| Mendelian overlap | 8 genes |
| Both Tier 1 + Mendelian | 3 (F5, F2, COL4A1) |
DRUGGABILITY
| Metric | Value |
|---|
| Overall druggability rate | 35% |
| Approved drugs for stroke | 8% |
| In clinical trials | 12% |
| Opportunity gap | 27% |
DRUGGABILITY PYRAMID SUMMARY
| Level | Count | % |
|---|
| Level 1 (Validated) | 12 | 8% |
| Level 2 (Repurposing) | 23 | 15% |
| Level 3 (Emerging) | 18 | 12% |
| Level 4 (Tool compounds) | 45 | 30% |
| Level 5 (Druggable undrugged) | 12 | 8% |
| Level 6 (Hard targets) | 40 | 27% |
CLINICAL TRIAL ALIGNMENT
- ~15% of current stroke trial drugs target GWAS genes
- Major disconnect between genetic evidence and drug development
TOP 10 REPURPOSING CANDIDATES
| Drug | Gene | Approved For | p-value | Score |
|---|
| Dabigatran | F2 | AF/VTE | 1e-24 | ★★★★★ |
| Rivaroxaban | F10/F2 | AF/VTE | 1e-24 | ★★★★★ |
| Apixaban | F10/F2 | AF/VTE | 1e-24 | ★★★★★ |
| Cilostazol | PDE3A | PAD | 4e-13 | ★★★★★ |
| Vorinostat | HDAC9 | Cancer | 4e-15 | ★★★★☆ |
| Panobinostat | HDAC9 | Cancer | 4e-15 | ★★★★☆ |
| Bosentan | EDNRA | PAH | 1e-11 | ★★★☆☆ |
| Palbociclib | CDK6 | Cancer | 7e-09 | ★★★☆☆ |
| Ruxolitinib | SH2B3 path | MPN | 9e-12 | ★★☆☆☆ |
| A1AT | SERPINA1 | A1AT def | 6e-09 | ★★☆☆☆ |
TOP 10 UNDRUGGED OPPORTUNITIES
| Gene | p-value | Family | Structure | Potential |
|---|
| HTRA1 | 4e-08 | Protease | Yes | HIGH |
| HDAC9 | 4e-15 | Enzyme | Yes | HIGH |
| LPA | 2e-12 | Secreted | - | HIGH |
| NOTCH3 | 8e-10 | Receptor | Yes | MEDIUM-HIGH |
| EDNRA | 1e-11 | GPCR | Yes | HIGH |
| ALDH2 | 5e-18 | Enzyme | Yes | HIGH |
| CDK6 | 7e-09 | Kinase | Yes | HIGH |
| FURIN | 3e-07 | Protease | Yes | MEDIUM |
| DNM2 | 1e-08 | GTPase | Yes | MEDIUM |
| SERPINA1 | 6e-09 | Serpin | Yes | MEDIUM |
TOP 10 INDIRECT OPPORTUNITIES
| Undrugged Gene | Drugged Interactor | Drug |
|---|
| PITX2 | HDAC9 | HDAC inhibitors |
| CASZ1 | HDACs | HDAC inhibitors |
| SH2B3 | JAK2 | Ruxolitinib |
| COL4A1 | Integrins | Integrin inhibitors |
| NOTCH3 | γ-secretase | GSIs |
| ZFHX3 | Chromatin | Epigenetic drugs |
| FGB/FGA | F2 | Anticoagulants |
| ATXN2 | Multiple | Various |
KEY INSIGHTS
- Coagulation pathway dominates validated targets: F2, F5, F10 - direct oral anticoagulants already approved and genetically validated
- HDAC9 represents major repurposing opportunity: Very strong GWAS signal (p=4e-15), HDAC inhibitors exist but need CNS penetration and selectivity
- NOTCH3 is high-priority undrugged target: Mendelian validation (CADASIL), antibodies in development, relevant for small vessel disease
- HTRA1 is ideal drug discovery target: Mendelian validation (CARASIL), serine protease (druggable), good structures, relevant for small vessel disease
- Strong signal for transcription factors (PITX2, ZFHX3): Most significant signals but difficult to drug directly - indirect approaches needed
- LPA represents validated opportunity: ASO therapy (pelacarsen) in late-stage trials, strong GWAS signal
- Cardioembolic stroke subtype has best genetic understanding: PITX2/ZFHX3 signals specific to atrial fibrillation-related stroke
- Small vessel stroke has druggable targets: HTRA1, NOTCH3, COL4A1/2 - all Mendelian validated
- Relatively low alignment with current trials: Only ~15% of trial drugs target GWAS genes - opportunity for genetically-guided development
- Pharmacogenomics actionable: CYP2C19 genotyping for clopidogrel response has Level 1A evidence
The comprehensive GWAS-to-drug-target druggability analysis for Stroke is now complete.
Key Findings Summary:
- 327+ GWAS associations from 99 studies identifying ~150 unique genes
- 8 genes with Mendelian overlap (highest confidence): NOTCH3, HTRA1, COL4A1/2, F5, F2, APOE, CBS
- 35% overall druggability with 8% having approved drugs for stroke
- 27% opportunity gap - druggable targets without current drug development
- ~15% clinical trial alignment with GWAS genes - major disconnect
Top Repurposing Candidates: DOACs (dabigatran, rivaroxaban, apixaban), cilostazol, HDAC inhibitors, endothelin antagonists
Top Undrugged Opportunities: HTRA1 (protease, Mendelian validated), HDAC9 (strong signal, enzyme), LPA (ASO in trials), NOTCH3 (Mendelian validated)