Summary
Large artery stroke (MONDO:0005490) is a disease with 41 cohort genes (122 GWAS associations across 27 studies).
At a glance
- Cohort genes: 41
- GWAS associations: 122
Clinical features
No curated clinical features (Orphanet) for this disease.
Identifiers
Disease identifiers
| Field | Value |
|---|
| Canonical name | large artery stroke |
| Mondo ID | MONDO:0005490 |
| EFO | EFO:0005524 |
| Is cancer (heuristic) | no |
Data availability: 122 GWAS associations (27 studies).
Disease family
Classification path: disease › human disease › disease by body system or component › cardiovascular disorder › vascular disorder › arterial disorder › large artery stroke
Related subtypes (29): vertebral artery insufficiency, splenic artery aneurysm, basilar artery insufficiency, arteriosclerosis disorder, subclavian artery aneurysm, pulmonary artery choriocarcinoma, pulmonary artery leiomyosarcoma, coronary artery disorder, hypertensive disorder, carotid artery disorder, pulmonary embolism, peripheral arterial disease, hypotensive disorder, aortic disorder, cervical artery dissection, anterior spinal artery syndrome, fibromuscular dysplasia, retinal arterial tortuosity, Sneddon syndrome, celiac trunk compression syndrome, pediatric arterial ischemic stroke, absence of the pulmonary artery, arterial occlusion, aberrant subclavian artery, anterior spinal artery stroke, arteritis, pulmonary artery disease, fibromuscular dysplasia, multifocal, carotid web
Genetics & variants
GWAS landscape
122 GWAS associations across 27 studies. Top hits map to 37 distinct genes (as reported by GWAS).
Top associations by p-value
| rsID | p-value | Gene | Risk allele | Odds ratio |
|---|
| rs1333049 | 1e-59 | CDKN2B-AS1 | C | |
| rs55730499 | 7e-43 | LPA | T | 1.27 |
| rs2107595 | 5e-17 | HDAC9 - TWIST1 | A | 1.1 |
| rs599839 | 8e-17 | CELSR2 - PSRC1 | ? | |
| rs532436 | 4e-15 | ABO | A | 1.09 |
| rs1412444 | 4e-15 | LIPA | T | 1.07 |
| rs10455872 | 9e-14 | LPA | ? | |
| rs11172113 | 4e-13 | LRP1 | ? | 0.02 |
| rs71524263 | 2e-12 | HDAC9 | ? | 7.05 |
| rs2487928 | 7e-12 | JCAD | A | 1.06 |
| rs10774625 | 8e-12 | ATXN2 | A | 1.06 |
| rs17612742 | 1e-11 | EDNRA | ? | |
| rs1122608 | 2e-11 | SMARCA4 | ? | |
| rs4714955 | 2e-11 | PHACTR1 | ? | |
| rs12936587 | 2e-10 | PEMT - SMCR2 | ? | |
| rs11065987 | 3e-10 | ATXN2-AS | ? | |
| rs11556924 | 8e-10 | UBE2H-DT, ZC3HC1 | ? | |
| rs964184 | 9e-10 | ZPR1 | ? | |
| rs76866386 | 9e-10 | ABCG8 | T | 1.11 |
| rs12122341 | 1e-09 | LINC01765 | G | 1.19 |
| rs1878406 | 1e-09 | PRMT5P1 - EDNRA | T | 1.08 |
| rs12792912 | 2e-09 | RNU7-159P - MMP13 | T | 1.05 |
| rs8027450 | 2e-09 | FURIN | T | 1.06 |
| rs748431 | 3e-09 | FGD5 | G | 1.05 |
| rs181401679 | 4e-09 | ATP6V1G1P7 - RPL7P45 | G | 152128576 |
| rs11984041 | 5e-09 | HDAC9 | T | 1.24 |
| rs10840293 | 5e-09 | SWAP70 | A | 1.05 |
| rs6647 | 6e-09 | SERPINA1 | G | 1.22 |
| rs12124533 | 6e-09 | LINC01765 | ? | 1.16 |
| rs138364069 | 7e-09 | FAF1 | T | 3.57 |
Top studies (by case count)
| Study | Lead author | Year | Cases | Controls | Title |
|---|
| GCST90104542 | Mishra A | 2022 | 6,399 | 1,234,808 | Stroke genetics informs drug discovery and risk prediction across ancestries. |
| GCST90162544 | Mishra A | 2022 | 6,399 | 1,234,808 | Stroke genetics informs drug discovery and risk prediction across ancestries. |
| GCST90162545 | Mishra A | 2022 | 5,532 | 237,242 | Stroke genetics informs drug discovery and risk prediction across ancestries. |
| GCST90103796 | Daghals I | 2022 | 4,373 | 613,368 | Migraine, Stroke, and Cervical Arterial Dissection: Shared Genetics for a Triad of Brain Disorders With Vascular Involvement. |
| GCST005840 | Malik R | 2018 | 4,373 | 297,290 | Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. |
| GCST006907 | Malik R | 2018 | 4,373 | 297,290 | Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. |
| GCST010158 | von Berg J | 2020 | 3,495 | 28,026 | Alternate approach to stroke phenotyping identifies a genetic risk locus for small vessel stroke. |
| GCST010161 | von Berg J | 2020 | 2,449 | 28,026 | Alternate approach to stroke phenotyping identifies a genetic risk locus for small vessel stroke. |
| GCST003259 | Pulit SL | 2015 | 2,349 | 26,690 | Loci associated with ischaemic stroke and its subtypes (SiGN): a genome-wide association study. |
| GCST010160 | von Berg J | 2020 | 2,318 | 28,026 | Alternate approach to stroke phenotyping identifies a genetic risk locus for small vessel stroke. |
Variant details and genetic-evidence tiers
Tier distribution (top 50 variants)
| Tier | Variants |
|---|
| Tier 1: coding | 2 |
| Tier 2: splice/UTR | 2 |
| Tier 3: regulatory | 2 |
| Tier 4: intronic/intergenic | 44 |
MAF distribution
| Bucket | Variants |
|---|
| common (>=0.05) | 49 |
| low_freq (0.01-0.05) | 0 |
| rare (<0.01) | 0 |
| unknown | 1 |
Functional consequences
| Consequence | Count |
|---|
| intron_variant | 30 |
| intergenic_variant | 11 |
| non_coding_transcript_exon_variant | 3 |
| regulatory_region_variant | 2 |
| missense_variant | 2 |
| 3_prime_UTR_variant | 2 |
Top variants
| rsID | Chr | Pos | Alleles | MAF | Consequence | Gene | p-value | Tier |
|---|
| rs1333049 | 9 | 22125504 | G>A,C | 0.05 | intergenic_variant | CDKN2B-AS1 | 1e-59 | Tier 4: intronic/intergenic |
| rs55730499 | 6 | 160584578 | C>T | 0.065 | intron_variant | LPA | 7e-43 | Tier 4: intronic/intergenic |
| rs2107595 | 7 | 19009765 | G>A,C,T | 0.168 | regulatory_region_variant | HDAC9 - TWIST1 | 5e-17 | Tier 3: regulatory |
| rs599839 | 1 | 109279544 | G>A,C,T | 0.05 | intergenic_variant | CELSR2 - PSRC1 | 8e-17 | Tier 4: intronic/intergenic |
| rs532436 | 9 | 133274414 | A>G,T | 0.194 | intron_variant | ABO | 4e-15 | Tier 4: intronic/intergenic |
| rs1412444 | 10 | 89243170 | C>A,G,T | 0.332 | intron_variant | LIPA | 4e-15 | Tier 4: intronic/intergenic |
| rs10455872 | 6 | 160589086 | A>G | 0.05 | intron_variant | LPA | 9e-14 | Tier 4: intronic/intergenic |
| rs11172113 | 12 | 57133500 | T>C,G | 0.05 | intron_variant | LRP1 | 4e-13 | Tier 4: intronic/intergenic |
| rs71524263 | 7 | 18990655 | C>A,G,T | 0.05 | intron_variant | HDAC9 | 2e-12 | Tier 4: intronic/intergenic |
| rs2487928 | 10 | 30034963 | G>A | 0.451 | intron_variant | JCAD | 7e-12 | Tier 4: intronic/intergenic |
| rs10774625 | 12 | 111472415 | A>C,G,T | 0.489 | intron_variant | ATXN2 | 8e-12 | Tier 4: intronic/intergenic |
| rs17612742 | 4 | 147493499 | T>C | 0.05 | non_coding_transcript_exon_variant | EDNRA | 1e-11 | Tier 4: intronic/intergenic |
| rs1122608 | 19 | 11052925 | G>T | 0.05 | intron_variant | SMARCA4 | 2e-11 | Tier 4: intronic/intergenic |
| rs4714955 | 6 | 12903203 | C>A,T | 0.05 | intron_variant | PHACTR1 | 2e-11 | Tier 4: intronic/intergenic |
| rs12936587 | 17 | 17640408 | G>A | 0.05 | intergenic_variant | PEMT - SMCR2 | 2e-10 | Tier 4: intronic/intergenic |
| rs11065987 | 12 | 111634620 | A>G | 0.05 | intron_variant | ATXN2-AS | 3e-10 | Tier 4: intronic/intergenic |
| rs11556924 | 7 | 130023656 | C>A,T | 0.05 | missense_variant | UBE2H-DT, ZC3HC1 | 8e-10 | Tier 1: coding |
| rs964184 | 11 | 116778201 | G>C | 0.05 | 3_prime_UTR_variant | ZPR1 | 9e-10 | Tier 2: splice/UTR |
| rs76866386 | 2 | 43848344 | T>C | 0.064 | intron_variant | ABCG8 | 9e-10 | Tier 4: intronic/intergenic |
| rs12122341 | 1 | 115113069 | C>G | 0.257 | intron_variant | LINC01765 | 1e-09 | Tier 4: intronic/intergenic |
| rs1878406 | 4 | 147472512 | C>A,G,T | 0.142 | intergenic_variant | PRMT5P1 - EDNRA | 1e-09 | Tier 4: intronic/intergenic |
| rs12792912 | 11 | 102930574 | T>A,G | 0.43 | intergenic_variant | RNU7-159P - MMP13 | 2e-09 | Tier 4: intronic/intergenic |
| rs8027450 | 15 | 90875164 | C>A,T | 0.326 | intron_variant | FURIN | 2e-09 | Tier 4: intronic/intergenic |
| rs748431 | 3 | 14886570 | G>C,T | 0.379 | intron_variant | FGD5 | 3e-09 | Tier 4: intronic/intergenic |
| rs181401679 | 13 | 104244508 | C>G | | intergenic_variant | ATP6V1G1P7 - RPL7P45 | 4e-09 | Tier 4: intronic/intergenic |
| rs11984041 | 7 | 18992312 | C>T | 0.093 | intron_variant | HDAC9 | 5e-09 | Tier 4: intronic/intergenic |
| rs10840293 | 11 | 9729649 | G>A,C | 0.423 | intron_variant | SWAP70 | 5e-09 | Tier 4: intronic/intergenic |
| rs6647 | 14 | 94381078 | A>C,G | 0.18 | missense_variant | SERPINA1 | 6e-09 | Tier 1: coding |
| rs12124533 | 1 | 115115178 | C>T | 0.05 | intron_variant | LINC01765 | 6e-09 | Tier 4: intronic/intergenic |
| rs138364069 | 1 | 50832533 | C>T | 0.05 | intron_variant | FAF1 | 7e-09 | Tier 4: intronic/intergenic |
Genes & proteins
Mendelian disease overlap and somatic drivers
GenCC: 0 · Orphanet: 25 · OMIM-shared: 0 · Dual-evidence (GWAS+Mendelian): 0
Orphanet rare-disease linkage (cohort genes)
| Gene | Orphanet ID | Rare disease |
|---|
| CNNM2 | Orphanet:620363 | Primary hypomagnesemia-generalized seizures-intellectual disability-obesity syndrome |
| SLCO1B1 | Orphanet:3111 | Rotor syndrome |
| SMARCA4 | Orphanet:1465 | Coffin-Siris syndrome |
| SMARCA4 | Orphanet:231108 | Rhabdoid tumor predisposition syndrome |
| SMARCA4 | Orphanet:370396 | Small cell carcinoma of the ovary |
| SMARCA4 | Orphanet:466962 | SMARCA4-deficient sarcoma of thorax |
| TWIST1 | Orphanet:35093 | Non-syndromic sagittal craniosynostosis |
| TWIST1 | Orphanet:35099 | Non-syndromic bicoronal craniosynostosis |
| TWIST1 | Orphanet:794 | Saethre-Chotzen syndrome |
| CARD9 | Orphanet:457088 | Predisposition to invasive fungal disease due to CARD9 deficiency |
| TTLL5 | Orphanet:1872 | Cone rod dystrophy |
| PHACTR1 | Orphanet:697160 | Infantile epileptic spasms syndrome |
| ALKBH8 | Orphanet:88616 | Autosomal recessive non-syndromic intellectual disability |
| SH2B3 | Orphanet:3318 | Essential thrombocythemia |
| SH2B3 | Orphanet:391366 | Growth retardation-mild developmental delay-chronic hepatitis syndrome |
| EDNRA | Orphanet:443995 | Mandibulofacial dysostosis with alopecia |
| EDNRA | Orphanet:586 | Cystic fibrosis |
| LDLR | Orphanet:391665 | Homozygous familial hypercholesterolemia |
| NT5C2 | Orphanet:320396 | Autosomal recessive spastic paraplegia type 45 |
| SERPINA1 | Orphanet:178396 | Hemorrhagic disease due to alpha-1-antitrypsin Pittsburgh mutation |
| SERPINA1 | Orphanet:586 | Cystic fibrosis |
| SERPINA1 | Orphanet:60 | Alpha-1-antitrypsin deficiency |
| RAI1 | Orphanet:1713 | 17p11.2 microduplication syndrome |
| RAI1 | Orphanet:477817 | PMP22-RAI1 contiguous gene duplication syndrome |
| RAI1 | Orphanet:819 | Smith-Magenis syndrome |
Cohort genes → proteins
41 cohort genes, 39 distinct canonical proteins.
Evidence partition
Cohort genes (full)
| Symbol | HGNC | Ensembl | UniProt | Name | Evidence |
|---|
| CNNM2 | HGNC:103 | ENSG00000148842 | Q9H8M5 | Metal transporter CNNM2 | gwas |
| SLCO1B1 | HGNC:10959 | ENSG00000134538 | Q9Y6L6 | Solute carrier organic anion transporter family member 1B1 | gwas |
| SLC22A3 | HGNC:10967 | ENSG00000146477 | O75751 | Solute carrier family 22 member 3 | gwas |
| SMARCA4 | HGNC:11100 | ENSG00000127616 | P51532 | SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A member 4 | gwas |
| SORT1 | HGNC:11186 | ENSG00000134243 | Q99523 | Sortilin | gwas |
| SUPT3H | HGNC:11466 | ENSG00000196284 | O75486 | Transcription initiation protein SPT3 homolog | gwas |
| TCF21 | HGNC:11632 | ENSG00000118526 | O43680 | Transcription factor 21 | gwas |
| TWIST1 | HGNC:12428 | ENSG00000122691 | Q15672 | Twist-related protein 1 | gwas |
| ZPR1 | HGNC:13051 | ENSG00000109917 | O75312 | Zinc finger protein ZPR1 | gwas |
| HDAC9 | HGNC:14065 | ENSG00000048052 | Q9UKV0 | Histone deacetylase 9 | gwas |
| WDR12 | HGNC:14098 | ENSG00000138442 | Q9GZL7 | Ribosome biogenesis protein WDR12 | gwas |
| RASD1 | HGNC:15828 | ENSG00000108551 | Q9Y272 | Dexamethasone-induced Ras-related protein 1 | gwas |
| CARD9 | HGNC:16391 | ENSG00000187796 | Q9H257 | Caspase recruitment domain-containing protein 9 | gwas |
| CDKN2C | HGNC:1789 | ENSG00000123080 | P42773 | Cyclin-dependent kinase 4 inhibitor C | gwas |
| TTLL5 | HGNC:19963 | ENSG00000119685 | Q6EMB2 | Tubulin polyglutamylase TTLL5 | gwas |
| ZDHHC22 | HGNC:20106 | ENSG00000177108 | Q8N966 | Palmitoyltransferase ZDHHC22 | gwas |
| OBI1 | HGNC:20308 | ENSG00000152193 | Q5W0B1 | ORC ubiquitin ligase 1 | gwas |
| TSPAN2 | HGNC:20659 | ENSG00000134198 | O60636 | Tetraspanin-2 | gwas |
| PHACTR1 | HGNC:20990 | ENSG00000112137 | Q9C0D0 | Phosphatase and actin regulator 1 | gwas |
| LPAL2 | HGNC:21210 | ENSG00000213071 | Q16609 | Putative apolipoprotein(a)-like protein 2 | gwas |
| ADAMTS7 | HGNC:223 | ENSG00000136378 | Q9UKP4 | A disintegrin and metalloproteinase with thrombospondin motifs 7 | gwas |
| ALKBH8 | HGNC:25189 | ENSG00000137760 | Q96BT7 | tRNA (carboxymethyluridine(34)-5-O)-methyltransferase ALKBH8 | gwas |
| TMEM163 | HGNC:25380 | ENSG00000152128 | Q8TC26 | Transmembrane protein 163 | gwas |
| SH3YL1 | HGNC:29546 | ENSG00000035115 | Q96HL8 | SH3 domain-containing YSC84-like protein 1 | gwas |
| SH2B3 | HGNC:29605 | ENSG00000111252 | Q9UQQ2 | SH2B adapter protein 3 | gwas |
| ZC3HC1 | HGNC:29913 | ENSG00000091732 | Q86WB0 | Zinc finger C3HC-type protein 1 | gwas |
| DTL | HGNC:30288 | ENSG00000143476 | Q9NZJ0 | Denticleless protein homolog | gwas |
| EDNRA | HGNC:3179 | ENSG00000151617 | P25101 | Endothelin-1 receptor | gwas |
| CDKN2B-AS1 | HGNC:34341 | ENSG00000240498 | | CDKN2B and CDKN2A antisense cis and trans regulatory RNA 1 | gwas |
| FAF1 | HGNC:3578 | ENSG00000185104 | Q9UNN5 | FAS-associated factor 1 | gwas |
| MCHR1 | HGNC:4479 | ENSG00000128285 | Q99705 | Melanin-concentrating hormone receptor 1 | gwas |
| LINC01492 | HGNC:51149 | ENSG00000225564 | | long intergenic non-protein coding RNA 1492 | gwas |
| LDLR | HGNC:6547 | ENSG00000130164 | P01130 | Low-density lipoprotein receptor | gwas |
| LPA | HGNC:6667 | ENSG00000198670 | P08519 | Apolipoprotein(a) | gwas |
| NDUFAB1 | HGNC:7694 | ENSG00000004779 | O14561 | Acyl carrier protein, mitochondrial | gwas |
| ABO | HGNC:79 | ENSG00000175164 | P16442 | Histo-blood group ABO system transferase | gwas |
| NT5C2 | HGNC:8022 | ENSG00000076685 | P49902 | Cytosolic purine 5’-nucleotidase | gwas |
| PEMT | HGNC:8830 | ENSG00000133027 | Q9UBM1 | Phosphatidylethanolamine N-methyltransferase | gwas |
| SERPINA1 | HGNC:8941 | ENSG00000197249 | P01009 | Alpha-1-antitrypsin | gwas |
| PLPP3 | HGNC:9229 | ENSG00000162407 | O14495 | Phospholipid phosphatase 3 | gwas |
| RAI1 | HGNC:9834 | ENSG00000108557 | Q7Z5J4 | Retinoic acid-induced protein 1 | gwas |
Cohort function summary
Lead sentence per gene, UniProt-curated.
| Symbol | Protein name | Function (lead sentence) |
|---|
| CNNM2 | Metal transporter CNNM2 | Divalent metal cation transporter. |
| SLCO1B1 | Solute carrier organic anion transporter family member 1B1 | Mediates the Na(+)-independent uptake of organic anions. |
| SLC22A3 | Solute carrier family 22 member 3 | Electrogenic voltage-dependent transporter that mediates the transport of a variety of organic cations such as endogenous bioactive amines, cationic drugs and xenobiotics. |
| SMARCA4 | SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A member 4 | ATPase involved in transcriptional activation and repression of select genes by chromatin remodeling (alteration of DNA-nucleosome topology). |
| SORT1 | Sortilin | Functions as a sorting receptor in the Golgi compartment and as a clearance receptor on the cell surface. |
| SUPT3H | Transcription initiation protein SPT3 homolog | Probable transcriptional activator. |
| TCF21 | Transcription factor 21 | Involved in epithelial-mesenchymal interactions in kidney and lung morphogenesis that include epithelial differentiation and branching morphogenesis. |
| TWIST1 | Twist-related protein 1 | Acts as a transcriptional regulator. |
| ZPR1 | Zinc finger protein ZPR1 | Acts as a signaling molecule that communicates proliferative growth signals from the cytoplasm to the nucleus. |
| HDAC9 | Histone deacetylase 9 | Responsible for the deacetylation of lysine residues on the N-terminal part of the core histones (H2A, H2B, H3 and H4). |
| WDR12 | Ribosome biogenesis protein WDR12 | Component of the PeBoW complex, which is required for maturation of 28S and 5.8S ribosomal RNAs and formation of the 60S ribosome. |
| RASD1 | Dexamethasone-induced Ras-related protein 1 | Small GTPase. |
| CARD9 | Caspase recruitment domain-containing protein 9 | Adapter protein that plays a key role in innate immune response against fungi by forming signaling complexes downstream of C-type lectin receptors. |
| CDKN2C | Cyclin-dependent kinase 4 inhibitor C | Interacts strongly with CDK6, weakly with CDK4. |
| TTLL5 | Tubulin polyglutamylase TTLL5 | Polyglutamylase which modifies tubulin, generating polyglutamate side chains on the gamma-carboxyl group of specific glutamate residues within the C-terminal tail of tubulin. |
| ZDHHC22 | Palmitoyltransferase ZDHHC22 | Palmitoyltransferase that could catalyze the addition of palmitate onto various protein substrates and be involved in a variety of cellular processes. |
| OBI1 | ORC ubiquitin ligase 1 | E3 ubiquitin ligase essential for DNA replication origin activation during S phase. |
| TSPAN2 | Tetraspanin-2 | May play a role in signalling in oligodendrocytes in the early stages of their terminal differentiation into myelin-forming glia and may also function in stabilizing the mature sheath. |
| PHACTR1 | Phosphatase and actin regulator 1 | Binds actin monomers (G actin) and plays a role in multiple processes including the regulation of actin cytoskeleton dynamics, actin stress fibers formation, cell motility and survival, formation of tubules by endothelial cells, and regula… |
| ADAMTS7 | A disintegrin and metalloproteinase with thrombospondin motifs 7 | Metalloprotease. |
| ALKBH8 | tRNA (carboxymethyluridine(34)-5-O)-methyltransferase ALKBH8 | Catalyzes the methylation of 5-carboxymethyl uridine to 5-methylcarboxymethyl uridine at the wobble position of the anticodon loop in tRNA via its methyltransferase domain. |
| TMEM163 | Transmembrane protein 163 | Zinc ion transporter that mediates zinc efflux and plays a crucial role in intracellular zinc homeostasis. |
| SH2B3 | SH2B adapter protein 3 | Links T-cell receptor activation signal to phospholipase C-gamma-1, GRB2 and phosphatidylinositol 3-kinase. |
| ZC3HC1 | Zinc finger C3HC-type protein 1 | Required for proper positioning of a substantial amount of TPR at the nuclear basket (NB) through interaction with TPR. |
| DTL | Denticleless protein homolog | Substrate-specific adapter of a DCX (DDB1-CUL4-X-box) E3 ubiquitin-protein ligase complex required for cell cycle control, DNA damage response and translesion DNA synthesis. |
| EDNRA | Endothelin-1 receptor | Receptor for endothelin-1. |
| FAF1 | FAS-associated factor 1 | Ubiquitin-binding protein. |
| MCHR1 | Melanin-concentrating hormone receptor 1 | Receptor for melanin-concentrating hormone, coupled to both G proteins that inhibit adenylyl cyclase and G proteins that activate phosphoinositide hydrolysis. |
| LDLR | Low-density lipoprotein receptor | Binds low density lipoprotein /LDL, the major cholesterol-carrying lipoprotein of plasma, and transports it into cells by endocytosis. |
| LPA | Apolipoprotein(a) | Apo(a) is the main constituent of lipoprotein(a) (Lp(a)). |
| NDUFAB1 | Acyl carrier protein, mitochondrial | Carrier of the growing fatty acid chain in fatty acid biosynthesis. |
| ABO | Histo-blood group ABO system transferase | This protein is the basis of the ABO blood group system. |
| NT5C2 | Cytosolic purine 5’-nucleotidase | Broad specificity cytosolic 5’-nucleotidase that catalyzes the dephosphorylation of 6-hydroxypurine nucleoside 5’-monophosphates. |
| PEMT | Phosphatidylethanolamine N-methyltransferase | Catalyzes the three sequential steps of the methylation pathway for the biosynthesis of phosphatidylcholine, a critical and essential component for membrane structure. |
| SERPINA1 | Alpha-1-antitrypsin | Inhibitor of serine proteases. |
| PLPP3 | Phospholipid phosphatase 3 | Magnesium-independent phospholipid phosphatase of the plasma membrane that catalyzes the dephosphorylation of a variety of glycerolipid and sphingolipid phosphate esters including phosphatidate/PA, lysophosphatidate/LPA, diacylglycerol pyr… |
| RAI1 | Retinoic acid-induced protein 1 | Transcriptional regulator of the circadian clock components: CLOCK, BMAL1, BMAL2, PER1/3, CRY1/2, NR1D1/2 and RORA/C. |
Protein-family classification
Druggable: 13 · Difficult: 11 · Unknown: 17 · Druggable fraction: 0.32
Family distribution
Cohort families vs a genome-wide background (hypergeometric, BH-FDR; fold = observed/expected). Counts kept; sorted by enrichment, so the catch-all Other/Unknown bucket no longer leads.
| Family | Genes | Fold | FDR |
|---|
| Scaffold/PPI | 6 | 2.5× | 0.205 |
| Transporter | 2 | 3.8× | 0.215 |
| Protease | 3 | 2.7× | 0.215 |
| Enzyme (other) | 6 | 1.8× | 0.215 |
| GPCR | 2 | 1.2× | 0.659 |
| Transcription factor | 5 | 1.0× | 0.659 |
| Other/Unknown | 17 | 0.7× | 0.977 |
Per-gene assignment
| Symbol | Family | Druggable? | EC | InterPro (top 3) |
|---|
| CNNM2 | Other/Unknown | no | | CBS_dom, CNNM, RmlC-like_jellyroll |
| SLCO1B1 | Transporter | yes | | Kazal_dom, OATP, MFS_dom |
| SLC22A3 | Transporter | yes | | Orgcat_transp/SVOP, Sugar_transporter_CS, MFS |
| SMARCA4 | Other/Unknown | no | | SNF2_N, Bromodomain, Helicase_C-like |
| SORT1 | Scaffold/PPI | no | | VPS10, WD40/YVTN_repeat-like_dom_sf, Sortilin_C |
| SUPT3H | Other/Unknown | no | | TFIID_TAF13, Histone-fold |
| TCF21 | Transcription factor | no | | bHLH_dom, HLH_DNA-bd_sf, E-box_TF_Regulators |
| TWIST1 | Transcription factor | no | | bHLH_dom, HLH_DNA-bd_sf, TWIST1_bHLH |
| ZPR1 | Transcription factor | no | | Znf_ZPR1, ZPR1, ZPR1_A/B_dom |
| HDAC9 | Enzyme (other) | yes | 3.5.1.98 | HDACs, Ureohydrolase_dom_sf, His_deacetylse_dom |
| WDR12 | Scaffold/PPI | no | | WD40_rpt, NLE, WD40/YVTN_repeat-like_dom_sf |
| RASD1 | Other/Unknown | no | | Small_GTPase, Small_GTP-bd, P-loop_NTPase |
| CARD9 | Other/Unknown | no | | CARD, DEATH-like_dom_sf, CARD_CARD9 |
| CDKN2C | Scaffold/PPI | no | | Ankyrin_rpt, Ankyrin_rpt-contain_sf, Ank_Repeat/CDKN_Inhibitor |
| TTLL5 | Other/Unknown | no | | TTL/TTLL_fam |
| ZDHHC22 | Other/Unknown | no | | SSD, Palmitoyltrfase_DHHC, PFA4/ZDH16/20/ERF2-like |
| OBI1 | Transcription factor | no | | Znf_RING, Znf_RING/FYVE/PHD, OBI1_RING-HC |
| TSPAN2 | Other/Unknown | no | | Tetraspanin_animals, Tetraspanin_EC2_sf, Tetraspanin/Peripherin |
| PHACTR1 | Other/Unknown | no | | RPEL_repeat |
| LPAL2 | Protease | yes | | Kringle, Kringle-like, Kringle_CS |
| ADAMTS7 | Protease | yes | | TSP1_rpt, Peptidase_M12B, Peptidase_M12B_N |
| ALKBH8 | Enzyme (other) | yes | 2.1.1.229 | RRM_dom, Oxoglu/Fe-dep_dioxygenase_dom, Nucleotide-bd_a/b_plait_sf |
| TMEM163 | Other/Unknown | no | | Tmem163, Cation_efflux_TMD_sf |
| SH3YL1 | Scaffold/PPI | no | | SH3_domain, Ysc84_actin-binding, SYLF_SH3YL1-like |
| SH2B3 | Scaffold/PPI | no | | SH2, PH_domain, PH-like_dom_sf |
| ZC3HC1 | Other/Unknown | no | | NuBaID_N, NuBaID_C |
| DTL | Scaffold/PPI | no | | WD40_rpt, WD40/YVTN_repeat-like_dom_sf, WD40_repeat_CS |
| EDNRA | GPCR | yes | | GPCR_Rhodpsn, Endthln_rcpt, ETA_rcpt |
| CDKN2B-AS1 | Other/Unknown | no | | |
| FAF1 | Other/Unknown | no | | UBX_dom, UAS, Ubiquitin-like_domsf |
| MCHR1 | GPCR | yes | | GPCR_Rhodpsn, MCHR1, MCH_rcpt |
| LINC01492 | Other/Unknown | no | | |
| LDLR | Other/Unknown | no | | LDLR_classB_rpt, EGF-type_Asp/Asn_hydroxyl_site, EGF |
| LPA | Protease | yes | | Kringle, Trypsin_dom, Peptidase_S1A |
| NDUFAB1 | Other/Unknown | no | | ACP, Ppantetheine_attach_site, PP-bd_ACP |
| ABO | Enzyme (other) | yes | 2.4.1.37 | Glyco_trans_6, Nucleotide-diphossugar_trans |
| NT5C2 | Enzyme (other) | yes | 3.1.3.5 | HAD-SF_hydro_IG_5-nucl, Pur_nucleotidase, HAD_sf |
| PEMT | Enzyme (other) | yes | 2.1.1.17 | Phopholipid_MeTrfase, PEMT/MFAP |
| SERPINA1 | Other/Unknown | no | | Serpin_fam, Serpin_CS, Serpin_dom |
| PLPP3 | Enzyme (other) | yes | 3.1.3.4 | PAP2/HPO, PAP2/HPO_sf, PAP-like |
| RAI1 | Transcription factor | no | | Znf_PHD, Znf_RING/FYVE/PHD, EPHD |
Expression context
Cohort genes with no expression data: 0.
40 cohort genes are a single-cell marker in ≥1 SCXA experiment.
Breadth distribution (Bgee present_calls)
| Bucket | Genes |
|---|
| narrow (1-5 tissues) | 0 |
| moderate (6-20) | 0 |
| broad (>20) | 41 |
| unknown | 0 |
Top tissues across cohort
| Tissue | Cohort genes |
|---|
| oocyte | 7 |
| secondary oocyte | 5 |
| male germ line stem cell (sensu Vertebrata) in testis | 5 |
| right lobe of liver | 5 |
| cortical plate | 5 |
| liver | 4 |
| ganglionic eminence | 3 |
| right testis | 3 |
| monocyte | 3 |
| ventricular zone | 3 |
| left testis | 3 |
| right adrenal gland | 2 |
| corpus epididymis | 2 |
| primordial germ cell in gonad | 2 |
| leukocyte | 2 |
| mononuclear cell | 2 |
| popliteal artery | 2 |
| lateral nuclear group of thalamus | 2 |
| parotid gland | 2 |
| adrenal tissue | 2 |
Per-gene tissue summary (top 30)
| Symbol | Bgee breadth | FANTOM5 breadth | SCXA | Top tissues |
|---|
| CNNM2 | 234 | ubiquitous | marker | secondary oocyte, oocyte, right adrenal gland |
| SLCO1B1 | 29 | tissue_specific | marker | right lobe of liver, liver, male germ line stem cell (sensu Vertebrata) in testis |
| SLC22A3 | 189 | broad | marker | thoracic aorta, ascending aorta, descending thoracic aorta |
| SMARCA4 | 295 | ubiquitous | marker | ganglionic eminence, cortical plate, cervix squamous epithelium |
| SORT1 | 291 | ubiquitous | marker | inferior vagus X ganglion, corpus epididymis, subthalamic nucleus |
| SUPT3H | 206 | ubiquitous | yes | primordial germ cell in gonad, sperm, male germ line stem cell (sensu Vertebrata) in testis |
| TCF21 | 175 | broad | marker | primordial germ cell in gonad, right lung, upper lobe of left lung |
| TWIST1 | 233 | ubiquitous | marker | periodontal ligament, mucosa of paranasal sinus, oocyte |
| ZPR1 | 267 | ubiquitous | marker | cortical plate, ganglionic eminence, right testis |
| HDAC9 | 277 | ubiquitous | marker | oocyte, monocyte, secondary oocyte |
| WDR12 | 288 | ubiquitous | marker | ventricular zone, ganglionic eminence, cortical plate |
| RASD1 | 252 | ubiquitous | marker | pericardium, renal medulla, kidney epithelium |
| CARD9 | 172 | broad | marker | monocyte, mononuclear cell, leukocyte |
| CDKN2C | 286 | ubiquitous | marker | ventricular zone, popliteal artery, tibial artery |
| TTLL5 | 288 | ubiquitous | marker | left testis, right testis, testis |
| ZDHHC22 | 106 | broad | marker | lateral nuclear group of thalamus, prefrontal cortex, primary visual cortex |
| OBI1 | 274 | ubiquitous | marker | secondary oocyte, oocyte, male germ line stem cell (sensu Vertebrata) in testis |
| TSPAN2 | 228 | broad | marker | body of uterus, muscle layer of sigmoid colon, popliteal artery |
| PHACTR1 | 210 | ubiquitous | marker | cortical plate, postcentral gyrus, nucleus accumbens |
| LPAL2 | 164 | tissue_specific | marker | right lobe of liver, granulocyte, liver |
| ADAMTS7 | 151 | ubiquitous | marker | right atrium auricular region, cardiac atrium, apex of heart |
| ALKBH8 | 226 | ubiquitous | marker | calcaneal tendon, male germ line stem cell (sensu Vertebrata) in testis, cortical plate |
| TMEM163 | 201 | broad | marker | oocyte, secondary oocyte, lateral nuclear group of thalamus |
| SH3YL1 | 294 | ubiquitous | marker | nephron tubule, parotid gland, upper leg skin |
| SH2B3 | 260 | ubiquitous | marker | monocyte, mononuclear cell, leukocyte |
| ZC3HC1 | 245 | ubiquitous | marker | tibialis anterior, oocyte, deltoid |
| DTL | 196 | ubiquitous | marker | secondary oocyte, oocyte, trabecular bone tissue |
| EDNRA | 253 | ubiquitous | marker | cauda epididymis, seminal vesicle, visceral pleura |
| CDKN2B-AS1 | 169 | ubiquitous | marker | mucosa of transverse colon, rectum, transverse colon |
| FAF1 | 280 | ubiquitous | marker | gastrocnemius, muscle of leg, left testis |
Protein interactions among cohort
Intra-cohort edges: 15.
Hub genes (top 10 by interactor count)
| Symbol | Interactor count |
|---|
| SMARCA4 | 8,138 |
| NDUFAB1 | 5,533 |
| WDR12 | 3,702 |
| CARD9 | 3,636 |
| SERPINA1 | 3,617 |
| TWIST1 | 3,507 |
| RASD1 | 3,278 |
| HDAC9 | 3,047 |
| ZPR1 | 2,581 |
| SORT1 | 2,514 |
Intra-cohort edges
| A | B | Sources |
|---|
| ADAMTS7 | PHACTR1 | string_interaction |
| ADAMTS7 | SORT1 | string_interaction |
| ADAMTS7 | ZC3HC1 | string_interaction |
| CDKN2C | FAF1 | string_interaction |
| CNNM2 | NT5C2 | string_interaction |
| CNNM2 | ZC3HC1 | string_interaction |
| EDNRA | ZC3HC1 | string_interaction |
| HDAC9 | SERPINA1 | biogrid_interaction |
| HDAC9 | SMARCA4 | string_interaction |
| PHACTR1 | SORT1 | string_interaction |
| PHACTR1 | TSPAN2 | string_interaction |
| PHACTR1 | WDR12 | string_interaction |
| PHACTR1 | ZC3HC1 | string_interaction |
| SH2B3 | ZC3HC1 | string_interaction |
| WDR12 | ZC3HC1 | string_interaction |
Structural data
PDB: 25 · AlphaFold-only: 14 · No structure: 2
Cohort genes with PDB structures (top 30)
| Symbol | UniProt | PDB entries |
|---|
| ABO | P16442 | 151 |
| SERPINA1 | P01009 | 46 |
| NT5C2 | P49902 | 43 |
| NDUFAB1 | O14561 | 42 |
| LDLR | P01130 | 36 |
| SMARCA4 | P51532 | 31 |
| SORT1 | Q99523 | 17 |
| FAF1 | Q9UNN5 | 16 |
| LPA | P08519 | 16 |
| MCHR1 | Q99705 | 10 |
| SLCO1B1 | Q9Y6L6 | 9 |
| CARD9 | Q9H257 | 8 |
| CNNM2 | Q9H8M5 | 7 |
| CDKN2C | P42773 | 6 |
| PHACTR1 | Q9C0D0 | 6 |
| DTL | Q9NZJ0 | 5 |
| EDNRA | P25101 | 5 |
| SUPT3H | O75486 | 4 |
| SLC22A3 | O75751 | 3 |
| WDR12 | Q9GZL7 | 3 |
| ALKBH8 | Q96BT7 | 3 |
| TWIST1 | Q15672 | 2 |
| HDAC9 | Q9UKV0 | 2 |
| TTLL5 | Q6EMB2 | 1 |
| SH3YL1 | Q96HL8 | 1 |
AlphaFold-only cohort genes (top 30 by pLDDT)
| Symbol | UniProt | pLDDT |
|---|
| PEMT | Q9UBM1 | 93.45 |
| ZDHHC22 | Q8N966 | 88.21 |
| TSPAN2 | O60636 | 87.26 |
| PLPP3 | O14495 | 84.44 |
| LPAL2 | Q16609 | 82.18 |
| RASD1 | Q9Y272 | 80.75 |
| ZPR1 | O75312 | 80.11 |
| TMEM163 | Q8TC26 | 76.65 |
| ZC3HC1 | Q86WB0 | 69.40 |
| TCF21 | O43680 | 66.67 |
| ADAMTS7 | Q9UKP4 | 64.26 |
| SH2B3 | Q9UQQ2 | 63.45 |
| OBI1 | Q5W0B1 | 57.65 |
| RAI1 | Q7Z5J4 | 39.62 |
Function
Pathway analysis
Distinct Reactome pathways touched by cohort: 157. Enrichment computed across 41 evidence-associated genes (27 with Reactome annotation).
Pathways by enrichment
Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 27 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.
| Pathway | Cohort genes | Fold | FDR | Sample cohort genes |
|---|
| Abacavir ADME | 2 | 105.7× | 0.017 | SLC22A3, NT5C2 |
| Drug ADME | 3 | 25.4× | 0.017 | SLCO1B1, SLC22A3, NT5C2 |
| Defective SLCO1B1 causes hyperbilirubinemia, Rotor type (HBLRR) | 1 | 423.0× | 0.120 | SLCO1B1 |
| Nuclear events stimulated by ALK signaling in cancer | 2 | 24.2× | 0.120 | TWIST1, ZC3HC1 |
| Plasma lipoprotein assembly, remodeling, and clearance | 2 | 16.9× | 0.193 | LDLR, LPA |
| Abacavir metabolism | 1 | 105.7× | 0.209 | NT5C2 |
| Abacavir transmembrane transport | 1 | 84.6× | 0.209 | SLC22A3 |
| Chylomicron clearance | 1 | 84.6× | 0.209 | LDLR |
| LDL remodeling | 1 | 70.5× | 0.209 | LPA |
| Metabolism of porphyrins | 1 | 52.9× | 0.209 | SLCO1B1 |
| Atorvastatin ADME | 1 | 52.9× | 0.209 | SLCO1B1 |
| Nucleotide catabolism | 1 | 47.0× | 0.209 | NT5C2 |
| Purine catabolism | 1 | 38.5× | 0.209 | NT5C2 |
| Organic anion transport by SLCO transporters | 1 | 38.5× | 0.209 | SLCO1B1 |
| Ribavirin ADME | 1 | 38.5× | 0.209 | NT5C2 |
| Protein lipoylation | 1 | 38.5× | 0.209 | NDUFAB1 |
| Sphingolipid catabolism | 1 | 32.5× | 0.209 | PLPP3 |
| Heme degradation | 1 | 30.2× | 0.209 | SLCO1B1 |
| SLC-mediated transport of organic cations | 1 | 28.2× | 0.209 | SLC22A3 |
| R-HSA-549132 | 1 | 28.2× | 0.209 | SLC22A3 |
| Negative regulation of FLT3 | 1 | 26.4× | 0.209 | SH2B3 |
| Formation of the non-canonical BAF (ncBAF) complex | 1 | 24.9× | 0.209 | SMARCA4 |
| Formation of the canonical BAF (cBAF) complex | 1 | 23.5× | 0.209 | SMARCA4 |
| Formation of the polybromo-BAF (pBAF) complex | 1 | 23.5× | 0.209 | SMARCA4 |
| Regulation of KIT signaling | 1 | 22.3× | 0.209 | SH2B3 |
| Recycling of bile acids and salts | 1 | 22.3× | 0.209 | SLCO1B1 |
| Formation of the embryonic stem cell BAF (esBAF) complex | 1 | 22.3× | 0.209 | SMARCA4 |
| Differentiation of naive CD4+ T cells to T helper 2 cells (Th2 cells) | 2 | 10.8× | 0.209 | SMARCA4, HDAC9 |
| Negative Regulation of CDH1 Gene Transcription | 2 | 8.9× | 0.209 | SMARCA4, TWIST1 |
| Chromatin organization | 2 | 6.0× | 0.209 | SMARCA4, SUPT3H |
GO biological processes by enrichment
Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 38 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.
| GO term | Cohort genes | Fold | FDR | Sample cohort genes |
|---|
| cardiac neural crest cell migration involved in outflow tract morphogenesis | 2 | 126.7× | 0.047 | TWIST1, EDNRA |
| obsolete GMP catabolic process to guanine | 1 | 443.5× | 0.067 | NT5C2 |
| DNA endoreduplication | 1 | 443.5× | 0.067 | ZPR1 |
| regulation of striated muscle cell differentiation | 1 | 443.5× | 0.067 | HDAC9 |
| regulation of sphingolipid mediated signaling pathway | 1 | 443.5× | 0.067 | PLPP3 |
| regulation of protein localization to cell leading edge | 1 | 443.5× | 0.067 | EDNRA |
| pre-mRNA catabolic process | 1 | 443.5× | 0.067 | ZPR1 |
| cell proliferation involved in heart valve development | 1 | 443.5× | 0.067 | TWIST1 |
| positive regulation of endocardial cushion to mesenchymal transition involved in heart valve formation | 1 | 443.5× | 0.067 | TWIST1 |
| regulation of phosphatidylcholine catabolic process | 1 | 221.7× | 0.067 | LDLR |
| regulation of interleukin-2 production | 1 | 221.7× | 0.067 | CARD9 |
| cellular response to human chorionic gonadotropin stimulus | 1 | 221.7× | 0.067 | EDNRA |
| gland development | 1 | 221.7× | 0.067 | TCF21 |
| epinephrine uptake | 1 | 221.7× | 0.067 | SLC22A3 |
| embryonic camera-type eye formation | 1 | 221.7× | 0.067 | TWIST1 |
| receptor-mediated endocytosis involved in cholesterol transport | 1 | 221.7× | 0.067 | LDLR |
| zinc ion import into synaptic vesicle | 1 | 221.7× | 0.067 | TMEM163 |
| negative regulation of Kit signaling pathway | 1 | 221.7× | 0.067 | SH2B3 |
| podocyte apoptotic process | 1 | 221.7× | 0.067 | EDNRA |
| cardiac chamber formation | 1 | 147.8× | 0.067 | EDNRA |
| branchiomeric skeletal muscle development | 1 | 147.8× | 0.067 | TCF21 |
| monocyte homeostasis | 1 | 147.8× | 0.067 | SH2B3 |
| Fas signaling pathway | 1 | 147.8× | 0.067 | FAF1 |
| GMP metabolic process | 1 | 147.8× | 0.067 | NT5C2 |
| dGMP metabolic process | 1 | 147.8× | 0.067 | NT5C2 |
| regulation of skeletal muscle fiber development | 1 | 147.8× | 0.067 | HDAC9 |
| histamine transport | 1 | 147.8× | 0.067 | SLC22A3 |
| negative regulation of astrocyte activation | 1 | 147.8× | 0.067 | LDLR |
| metanephric glomerular capillary formation | 1 | 147.8× | 0.067 | TCF21 |
| purine-containing compound transmembrane transport | 1 | 147.8× | 0.067 | SLC22A3 |
Therapeutics
Drug target analysis
Approved (phase 4): 5 · Phase ≥3: 6 · Phased (≥1): 7 · Undrugged: 34
Druggability breadth: 14 of 41 evidence-associated genes (34%) have a ChEMBL target (buckets above are over the deeply-mined display cohort).
Genes with an approved drug
The molecule shown is one approved compound that hits the gene — not necessarily a drug of choice or one indicated for this disease.
Top cohort targets by molecule count
| Symbol | Molecules | Max phase |
|---|
| SLCO1B1 | 57 | 4 |
| EDNRA | 31 | 4 |
| HDAC9 | 28 | 4 |
| SLC22A3 | 20 | 4 |
| SMARCA4 | 2 | 2 |
| SORT1 | 1 | 3 |
| LDLR | 1 | 4 |
| CNNM2 | 0 | 0 |
| SUPT3H | 0 | 0 |
| TCF21 | 0 | 0 |
Drugs targeting cohort genes (top 30)
Bioactivity and enzyme data
Enzyme cohort genes (≥1 EC): 6.
Cohort genes with ChEMBL bioactivity (full, sorted by assay count)
| Symbol | Assays | Type breakdown |
|---|
| HDAC9 | 1,625 | Binding:1612, ADMET:8, Functional:4, Toxicity:1 |
| EDNRA | 418 | Binding:342, Functional:73, Toxicity:2, ADMET:1 |
| MCHR1 | 267 | Binding:158, Functional:109 |
| SLCO1B1 | 242 | Functional:106, ADMET:82, Binding:53, Toxicity:1 |
| SMARCA4 | 230 | Binding:207, ADMET:12, Functional:11 |
| LDLR | 55 | Binding:54, Functional:1 |
| SLC22A3 | 33 | Binding:17, Functional:11, ADMET:5 |
| SORT1 | 17 | Binding:15, ADMET:2 |
| NT5C2 | 7 | Binding:7 |
| ADAMTS7 | 6 | Binding:6 |
| ZC3HC1 | 6 | Binding:6 |
| ABO | 6 | Binding:6 |
| WDR12 | 5 | Binding:5 |
| NDUFAB1 | 5 | Binding:5 |
| FAF1 | 2 | Binding:2 |
Cohort enzymes (BRENDA EC)
| Symbol | EC numbers | Names |
|---|
| HDAC9 | 3.5.1.98 | histone deacetylase |
| ALKBH8 | 2.1.1.229 | tRNA (carboxymethyluridine34-5-O)-methyltransferase |
| ABO | 2.4.1.37, 2.4.1.40, 2.4.1.88 | fucosylgalactoside 3-alpha-galactosyltransferase, glycoprotein-fucosylgalactoside alpha-N-acetylgalactosaminyltransferase, globoside alpha-N-acetylgalactosaminyltransferase |
| NT5C2 | 3.1.3.5 | 5’-nucleotidase |
| PEMT | 2.1.1.17 | phosphatidylethanolamine N-methyltransferase |
| PLPP3 | 3.1.3.4 | phosphatidate phosphatase |
Cohort genes with high screening signal
≥100 ChEMBL assays — a studied-ness signal; see Therapeutics for approved-drug status.
| Symbol | ChEMBL assays |
|---|
| SLCO1B1 | 242 |
| SMARCA4 | 230 |
| HDAC9 | 1,625 |
| EDNRA | 418 |
| MCHR1 | 267 |
Pharmacogenomics
Cohort genes with a PharmGKB record: 40; with CPIC/DPWG dosing guidelines: 1.
Cohort genes with a CPIC/DPWG dosing guideline
| Symbol | CPIC guidelines |
|---|
| SLCO1B1 | 1 |
Chemical tractability of cohort targets
30 approved/phased compounds have measured bioactivity against a cohort gene (and aren’t yet in disease-level trials). This is a research / tractability signal, NOT a therapeutic recommendation — a bioactivity row often reflects off-target or screening binding (e.g. promiscuous kinase inhibitors against a cohort kinase), implying no disease mechanism.
| Compound | Max phase | Cohort target (bioactivity) |
|---|
| CANDESARTAN CILEXETIL | 4 | SLCO1B1 |
| TELMISARTAN | 4 | SLCO1B1 |
| SIMVASTATIN | 4 | SLCO1B1 |
| TELITHROMYCIN | 4 | SLCO1B1 |
| PRAVASTATIN | 4 | SLCO1B1 |
| MOMETASONE FUROATE | 4 | SLCO1B1 |
| ATAZANAVIR | 4 | SLCO1B1 |
| HYDROXYZINE PAMOATE | 4 | SLCO1B1 |
| ERYTHROMYCIN ETHYLSUCCINATE | 4 | SLCO1B1 |
| OLMESARTAN MEDOXOMIL | 4 | SLCO1B1 |
| DICLOXACILLIN SODIUM | 4 | SLCO1B1 |
| BETA CAROTENE | 4 | SLCO1B1 |
| NONOXYNOL 9 | 4 | SLCO1B1 |
| ATORVASTATIN | 4 | SLCO1B1 |
| VINBLASTINE | 4 | SLCO1B1 |
| CYCLOSPORINE | 4 | SLCO1B1 |
| RITONAVIR | 4 | SLCO1B1 |
| CARBENOXOLONE SODIUM | 4 | SLCO1B1 |
| CLARITHROMYCIN | 4 | SLCO1B1 |
| DIGOXIN | 4 | SLCO1B1 |
| LOSARTAN | 4 | SLCO1B1 |
| ERYTHROMYCIN ESTOLATE | 4 | SLCO1B1 |
| TACROLIMUS ANHYDROUS | 4 | SLCO1B1 |
| RIFAMPIN | 4 | SLCO1B1 |
| ATORVASTATIN CALCIUM | 4 | SLCO1B1 |
| SIROLIMUS | 4 | SLCO1B1 |
| SULFASALAZINE | 4 | SLCO1B1 |
| PACLITAXEL | 4 | SLCO1B1 |
| RIFAMYCIN | 4 | SLCO1B1 |
| GEMFIBROZIL | 4 | SLCO1B1 |
Druggability pyramid
Cohort genes binned by druggability tier (high → low):
| Tier | Definition | Genes | Symbols |
|---|
| A | Approved (phase 4 drug) | 5 | SLCO1B1, SLC22A3, HDAC9, EDNRA, LDLR |
| B | Phased (≥1) drug, not yet approved | 2 | SMARCA4, SORT1 |
| C | Druggable family + PDB, no drug | 5 | ALKBH8, MCHR1, LPA, ABO, NT5C2 |
| D | Druggable family + AlphaFold only, no drug | 4 | LPAL2, ADAMTS7, PEMT, PLPP3 |
| E | Difficult family or no structure, no drug | 25 | CNNM2, SUPT3H, TCF21, TWIST1, ZPR1, WDR12, RASD1, CARD9, CDKN2C, TTLL5 (+15 more) |
Undrugged target profiles
34 cohort genes are undrugged. Ranked by ‘starting-point quality’ (assay depth + drugged-partner adjacency).
| Symbol | ChEMBL assays | Drugged partners (top 3) |
|---|
| MCHR1 | 267 | — |
| CNNM2 | 0 | — |
| SUPT3H | 0 | — |
| TCF21 | 0 | — |
| TWIST1 | 0 | — |
| ZPR1 | 0 | — |
| WDR12 | 5 | — |
| RASD1 | 0 | — |
| CARD9 | 0 | — |
| CDKN2C | 0 | — |
| TTLL5 | 0 | — |
| ZDHHC22 | 0 | — |
| OBI1 | 0 | — |
| TSPAN2 | 0 | — |
| PHACTR1 | 0 | — |
| LPAL2 | 0 | — |
| ADAMTS7 | 6 | — |
| ALKBH8 | 0 | — |
| TMEM163 | 0 | — |
| SH3YL1 | 0 | — |
| SH2B3 | 0 | — |
| ZC3HC1 | 6 | — |
| DTL | 0 | — |
| CDKN2B-AS1 | 0 | — |
| FAF1 | 2 | — |
| LINC01492 | 0 | — |
| LPA | 0 | — |
| NDUFAB1 | 5 | — |
| ABO | 6 | — |
| NT5C2 | 7 | — |
Clinical trials & evidence
Clinical trials
Clinical trials: 0.
- Cohort genes: CNNM2, SLCO1B1, SLC22A3, SMARCA4, SORT1, SUPT3H, TCF21, TWIST1, ZPR1, HDAC9, WDR12, RASD1, CARD9, CDKN2C, TTLL5, ZDHHC22, OBI1, TSPAN2, PHACTR1, ADAMTS7, ALKBH8, TMEM163, SH3YL1, SH2B3, ZC3HC1, DTL, EDNRA, CDKN2B-AS1, FAF1, MCHR1, LINC01492, LDLR, LPA, NDUFAB1, ABO, NT5C2, PEMT, SERPINA1, PLPP3, RAI1