Anemia

disease
On this page

Also known as anaemia (disease)anemia (disease)

Summary

Anemia (MONDO:0002280) is a disease (an umbrella term covering 19 Mondo subtypes) with 16 cohort genes (36 GWAS associations across 28 studies) and 1,177 clinical trials. Top therapeutic interventions include epoetin alfa, darbepoetin alfa, and epoetin beta.

At a glance

  • Umbrella term: 19 Mondo subtypes
  • Cohort genes: 16
  • GWAS associations: 36
  • ClinVar variants: 18
  • Clinical trials: 1,177

Clinical features

No curated clinical features (Orphanet) for this disease.

Identifiers

Disease identifiers

FieldValue
Canonical nameanemia
Mondo IDMONDO:0002280
MeSHD000740
DOIDDOID:2355
NCITC2869
SNOMED CT271737000
UMLSC0002871
MedGen1526
Is cancer (heuristic)no

Also known as: anaemia (disease) · anemia · anemia (disease)

Data availability: 18 ClinVar variants · 36 GWAS associations (28 studies) · 1 HPO phenotype · 3 cell lines.

Disease family

An umbrella term covering 19 Mondo subtypes.

Classification path: disease › human disease › disease by body system or component › hematologic disorderanemia

Related subtypes (26): autoimmune disorder of blood, blood coagulation disease, hemorrhagic disease, blood platelet disease, splenic disorder, hematopoietic and lymphoid system neoplasm, blood group incompatibility, bone marrow disorder, thymus gland disorder, leukocyte disorder, monoclonal gammopathy, septicemic plague, hyperamylasemia, alpha thalassemia-intellectual disability syndrome type 1, Bloom syndrome, congenital hematological disorder, alpha-thalassemia-myelodysplastic syndrome, deafness-lymphedema-leukemia syndrome, L-ferritin deficiency, dyskeratosis congenita, autosomal dominant 6, polyclonal hyperviscosity syndrome, parasitemia, erythrocyte disorder, premalignant hematological system disease, GATA1-Related X-Linked Cytopenia, paraneoplastic hematological syndrome

Subtypes (19): congenital anemia, neonatal anemia, microcytic anemia, hypochromic anemia, pancytopenia, deficiency anemia, pure red-cell aplasia, macrocytic anemia, normocytic anemia, sideroblastic anemia, aplastic anemia, hemoglobin C disease, hemoglobin E disease, beta-thalassemia and related diseases, hemoglobinopathy Toms River, hereditary methemoglobinemia, hemoglobin D disease, anemia due to enzyme disorder, anemia due to chronic disorder

Genetics & variants

GWAS landscape

36 GWAS associations across 28 studies. Top hits map to 13 distinct genes (as reported by GWAS).

Top associations by p-value

rsIDp-valueGeneRisk alleleOdds ratio
rs3727554521e-55LUC7L?
chr6:326597031e-36A0.11
rs96223632e-30APOL1A0.24
rs1995983953e-29RNF43, TSPOAP1-AS1T
rs779246156e-19PDILTG0.17
chr15:450932983e-18T0.12
chr1:1695498111e-17T0.25
rs3754988572e-16PGAP6?
rs79031465e-16TCF7L2C0.11
chr22:370668961e-13G0.06
rs737282793e-13PRKAG2G0.11
rs3344e-13HBBT0.34
rs71882505e-13FTOT0.11
rs15589022e-12FTOT0.1
rs1896640456e-10RPL6P5 - METAP2P1?
chr6:10687778e-09T2.57
rs5543582e-08NKAIN2?
chr18:127512162e-08G2.3
chr1:594803422e-08G2.58
chr11:611444502e-08T0.05
chr12:550129372e-08A2.43
chr8:401182753e-08C1.72
chr17:448559583e-08GA0.06
rs2013393664e-08ANO2?
chr4:1372255844e-08G1.69
chr4:1636132064e-08G1.4
chr4:1819563754e-08C1.93
chr10:213666184e-08T2.06
chr12:1094427974e-08C0.05
chr16:886403574e-08G2.44

Top studies (by case count)

StudyLead authorYearCasesControlsTitle
GCST90473113UK Biobank Whole-Genome Sequencing Consortium202532,917425,523Whole-genome sequencing of 490,640 UK Biobank participants.
GCST90667911UK Biobank Whole-Genome Sequencing Consortium202532,917425,523Whole-genome sequencing of 490,640 UK Biobank participants.
GCST90297583Auwerx C202417,388300,091Rare copy-number variants as modulators of common disease susceptibility.
GCST90297637Auwerx C202417,388300,091Rare copy-number variants as modulators of common disease susceptibility.
GCST90297688Auwerx C202417,388300,091Rare copy-number variants as modulators of common disease susceptibility.
GCST90297740Auwerx C202417,388300,091Rare copy-number variants as modulators of common disease susceptibility.
GCST90079711Backman JD202114,923372,152Exome sequencing and analysis of 454,787 UK Biobank participants.
GCST90083697Backman JD202114,923372,152Exome sequencing and analysis of 454,787 UK Biobank participants.
GCST90651140Liu TY202512,821215,243Diversity and longitudinal records: Genetic architecture of disease associations and polygenic risk in the Taiwanese Han population.
GCST90475792Verma A20248,844436,730Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program.

Variant details and genetic-evidence tiers

Tier distribution (top 50 variants)

TierVariants
Tier 1: coding3
Tier 2: splice/UTR0
Tier 3: regulatory0
Tier 4: intronic/intergenic29

MAF distribution

BucketVariants
common (>=0.05)7
low_freq (0.01-0.05)2
rare (<0.01)0
unknown23

Functional consequences

ConsequenceCount
unknown18
intron_variant10
missense_variant3
intergenic_variant1

Top variants

rsIDChrPosAllelesMAFConsequenceGenep-valueTier
rs37275545216199622AG>Aintron_variantLUC7L1e-55Tier 4: intronic/intergenic
chr6:326597031e-36Tier 4: intronic/intergenic
rs96223632236260509A>G0.47intron_variantAPOL12e-30Tier 4: intronic/intergenic
rs1995983951758358769C>A,G,T0.01missense_variantRNF43, TSPOAP1-AS13e-29Tier 1: coding
rs779246151620381010G>A0.174intron_variantPDILT6e-19Tier 4: intronic/intergenic
chr15:450932983e-18Tier 4: intronic/intergenic
chr1:1695498111e-17Tier 4: intronic/intergenic
rs37549885716377772C>Amissense_variantPGAP62e-16Tier 1: coding
rs790314610112998590C>G,T0.292intron_variantTCF7L25e-16Tier 4: intronic/intergenic
chr22:370668961e-13Tier 4: intronic/intergenic
rs737282797151714408G>A,T0.236intron_variantPRKAG23e-13Tier 4: intronic/intergenic
rs334115227002T>A,C,G0.038missense_variantHBB4e-13Tier 1: coding
rs71882501653800695T>C0.411intron_variantFTO5e-13Tier 4: intronic/intergenic
rs15589021653769662T>A0.331intron_variantFTO2e-12Tier 4: intronic/intergenic
rs1896640452145520951G>A,Cintergenic_variantRPL6P5 - METAP2P16e-10Tier 4: intronic/intergenic
chr6:10687778e-09Tier 4: intronic/intergenic
rs5543586124539810T>A,C0.05intron_variantNKAIN22e-08Tier 4: intronic/intergenic
chr18:127512162e-08Tier 4: intronic/intergenic
chr1:594803422e-08Tier 4: intronic/intergenic
chr11:611444502e-08Tier 4: intronic/intergenic
chr12:550129372e-08Tier 4: intronic/intergenic
chr8:401182753e-08Tier 4: intronic/intergenic
chr17:448559583e-08Tier 4: intronic/intergenic
rs201339366125714068C>Tintron_variantANO24e-08Tier 4: intronic/intergenic
chr4:1372255844e-08Tier 4: intronic/intergenic
chr4:1636132064e-08Tier 4: intronic/intergenic
chr4:1819563754e-08Tier 4: intronic/intergenic
chr10:213666184e-08Tier 4: intronic/intergenic
chr12:1094427974e-08Tier 4: intronic/intergenic
chr16:886403574e-08Tier 4: intronic/intergenic

ClinVar germline variants

18 retrieved; paginated sample, class counts are floors:

7 likely pathogenic, 5 uncertain significance, 4 pathogenic, 1 conflicting classifications of pathogenicity, 1 pathogenic/likely pathogenic

ClinVarVariant (HGVS)GeneClassificationReview
15655NM_000517.6(HBA2):c.410T>C (p.Leu137Pro)HBA1Pathogeniccriteria provided, single submitter
15061NC_000011.10:g.5226570_5233984delHBBPathogeniccriteria provided, single submitter
15161NM_000518.5(HBB):c.79G>A (p.Glu27Lys)HBBPathogeniccriteria provided, multiple submitters, no conflicts
644617NM_001291415.2(KDM6A):c.4207C>T (p.Arg1403Ter)KDM6APathogenic/Likely pathogeniccriteria provided, multiple submitters, no conflicts
1810246NM_001355436.2(SPTB):c.1628G>A (p.Trp543Ter)SPTBPathogeniccriteria provided, single submitter
26789246;XX;t(4;14)(p15.2;q13)dnLikely pathogeniccriteria provided, single submitter
812956Single alleleC1QTNF5Likely pathogenicno assertion criteria provided
812972NM_002049.4(GATA1):c.680C>T (p.Ala227Val)GATA1Likely pathogenicno assertion criteria provided
812886NM_002524.5(NRAS):c.176C>A (p.Ala59Asp)NRASLikely pathogenicno assertion criteria provided
812883NM_000975.5(RPL11):c.142_143dup (p.Val49fs)RPL11Likely pathogenicno assertion criteria provided
812897NM_001029.5(RPS26):c.312+2T>ARPS26Likely pathogenicno assertion criteria provided
812890NM_003126.4(SPTA1):c.2464+1G>ASPTA1Likely pathogeniccriteria provided, multiple submitters, no conflicts
638278NM_001042424.3(NSD2):c.3295G>A (p.Glu1099Lys)NSD2Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
373944NM_000037.4(ANK1):c.1948A>G (p.Met650Val)ANK1Uncertain significancecriteria provided, single submitter
523417NM_032638.5(GATA2):c.971A>G (p.Lys324Arg)GATA2Uncertain significancecriteria provided, multiple submitters, no conflicts
1339481NM_002204.4(ITGA3):c.1766G>C (p.Arg589Pro)ITGA3Uncertain significancecriteria provided, multiple submitters, no conflicts
1338810NM_000193.4(SHH):c.86G>C (p.Gly29Ala)SHHUncertain significancecriteria provided, single submitter
986742NM_003680.4(YARS1):c.611A>C (p.Tyr204Ser)YARS1Uncertain significanceno assertion criteria provided

Genes & proteins

Mendelian disease overlap and somatic drivers

GenCC: 0 · Orphanet: 78 · OMIM-shared: 0 · Dual-evidence (GWAS+Mendelian): 0

Orphanet rare-disease linkage (cohort genes)

GeneOrphanet IDRare disease
RPL11Orphanet:124Diamond-Blackfan anemia
RPS26Orphanet:124Diamond-Blackfan anemia
SHHOrphanet:220386Semilobar holoprosencephaly
SHHOrphanet:280195Septopreoptic holoprosencephaly
SHHOrphanet:280200Microform holoprosencephaly
SHHOrphanet:476119Autosomal dominant preaxial polydactyly-upperback hypertrichosis syndrome
SHHOrphanet:485275Acquired schizencephaly
SHHOrphanet:93321Isolated radial hemimelia
SHHOrphanet:93336Polydactyly of a triphalangeal thumb
SHHOrphanet:93405Syndactyly type 4
SHHOrphanet:93924Lobar holoprosencephaly
SHHOrphanet:93925Alobar holoprosencephaly
SHHOrphanet:93926Midline interhemispheric variant of holoprosencephaly
SHHOrphanet:988Tibial hemimelia-polysyndactyly-triphalangeal thumb syndrome
SHHOrphanet:98938Colobomatous microphthalmia
SPTA1Orphanet:288Hereditary elliptocytosis
SPTA1Orphanet:822Hereditary spherocytosis
SPTBOrphanet:288Hereditary elliptocytosis
SPTBOrphanet:822Hereditary spherocytosis
KDM6AOrphanet:2322Kabuki syndrome
NSD2Orphanet:280Wolf-Hirschhorn syndrome
YARS1Orphanet:100045Autosomal dominant intermediate Charcot-Marie-Tooth disease type C
C1QTNF5Orphanet:67042Late-onset retinal degeneration
GATA1Orphanet:124Diamond-Blackfan anemia
GATA1Orphanet:231393Beta-thalassemia-X-linked thrombocytopenia syndrome
GATA1Orphanet:363727X-linked dyserythropoietic anemia with abnormal platelets and neutropenia
GATA1Orphanet:420611Transient myeloproliferative syndrome
GATA1Orphanet:67044Thrombocytopenia with congenital dyserythropoietic anemia
GATA1Orphanet:79277Congenital erythropoietic porphyria
GATA1Orphanet:86849Acute basophilic leukemia
GATA1Orphanet:99887Acute megakaryoblastic leukemia in children with Down syndrome
GATA2Orphanet:228423GATA2 deficiency spectrum
HBA1Orphanet:163596Hemoglobin Bart’s fetalis syndrome
HBA1Orphanet:247511Autosomal dominant secondary polycythemia
HBA1Orphanet:330041Hemoglobin M disease
HBA1Orphanet:707789Unstable alpha globin chain variant disease
HBA1Orphanet:715143Heterozygous beta-thalassemia intermedia with supernumerary alpha-globin gene
HBA1Orphanet:93616Hemoglobin H disease
HBA1Orphanet:98791Alpha-thalassemia-intellectual disability syndrome linked to chromosome 16
HBBOrphanet:2132Hemoglobin C disease
HBBOrphanet:2133Hemoglobin E disease
HBBOrphanet:231214Beta-thalassemia major
HBBOrphanet:231222Beta-thalassemia intermedia
HBBOrphanet:231226Unstable beta globin chain variant disease
HBBOrphanet:231237Delta-beta-thalassemia
HBBOrphanet:231242Hemoglobin C-beta-thalassemia syndrome
HBBOrphanet:231249Hemoglobin E-beta-thalassemia syndrome
HBBOrphanet:232Sickle cell anemia
HBBOrphanet:247511Autosomal dominant secondary polycythemia
HBBOrphanet:251365Sickle cell S-C disease

Cohort genes → proteins

16 cohort genes, 16 distinct canonical proteins.

Evidence partition

SubsetGenes
multi_evidence16

Cohort genes (full)

SymbolHGNCEnsemblUniProtNameEvidence
RPL11HGNC:10301ENSG00000142676P62913Large ribosomal subunit protein uL5clinvar
RPS26HGNC:10414ENSG00000197728P62854Small ribosomal subunit protein eS26clinvar
SHHHGNC:10848ENSG00000164690Q15465Sonic hedgehog proteinclinvar
SPTA1HGNC:11272ENSG00000163554P02549Spectrin alpha chain, erythrocytic 1clinvar
SPTBHGNC:11274ENSG00000070182P11277Spectrin beta chain, erythrocyticclinvar
KDM6AHGNC:12637ENSG00000147050O15550Lysine-specific demethylase 6Aclinvar
NSD2HGNC:12766ENSG00000109685O96028Histone-lysine N-methyltransferase NSD2clinvar
YARS1HGNC:12840ENSG00000134684P54577Tyrosine–tRNA ligase, cytoplasmicclinvar
C1QTNF5HGNC:14344ENSG00000223953Q9BXJ0Complement C1q tumor necrosis factor-related protein 5clinvar
GATA1HGNC:4170ENSG00000102145P15976Erythroid transcription factorclinvar
GATA2HGNC:4171ENSG00000179348P23769Endothelial transcription factor GATA-2clinvar
HBA1HGNC:4823ENSG00000206172P69905Hemoglobin subunit alphaclinvar
HBBHGNC:4827ENSG00000244734P68871Hemoglobin subunit betaclinvar
ANK1HGNC:492ENSG00000029534P16157Ankyrin-1clinvar
ITGA3HGNC:6139ENSG00000005884P26006Integrin alpha-3clinvar
NRASHGNC:7989ENSG00000213281P01111GTPase NRasclinvar

Cohort function summary

Lead sentence per gene, UniProt-curated.

SymbolProtein nameFunction (lead sentence)
RPL11Large ribosomal subunit protein uL5Component of the ribosome, a large ribonucleoprotein complex responsible for the synthesis of proteins in the cell.
RPS26Small ribosomal subunit protein eS26Component of the small ribosomal subunit.
SHHSonic hedgehog proteinThe C-terminal part of the sonic hedgehog protein precursor displays an autoproteolysis and a cholesterol transferase activity.
SPTA1Spectrin alpha chain, erythrocytic 1Spectrin is the major constituent of the cytoskeletal network underlying the erythrocyte plasma membrane.
SPTBSpectrin beta chain, erythrocyticSpectrin is the major constituent of the cytoskeletal network underlying the erythrocyte plasma membrane.
KDM6ALysine-specific demethylase 6AHistone demethylase that specifically demethylates ‘Lys-27’ of histone H3, thereby playing a central role in histone code.
NSD2Histone-lysine N-methyltransferase NSD2Histone methyltransferase which specifically dimethylates nucleosomal histone H3 at ‘Lys-36’ (H3K36me2).
YARS1Tyrosine–tRNA ligase, cytoplasmicTyrosine–tRNA ligase that catalyzes the attachment of tyrosine to tRNA(Tyr) in a two-step reaction: tyrosine is first activated by ATP to form Tyr-AMP and then transferred to the acceptor end of tRNA(Tyr).
GATA1Erythroid transcription factorTranscriptional activator or repressor which serves as a general switch factor for erythroid development.
GATA2Endothelial transcription factor GATA-2Transcriptional activator which regulates endothelin-1 gene expression in endothelial cells.
HBA1Hemoglobin subunit alphaInvolved in oxygen transport from the lung to the various peripheral tissues.
HBBHemoglobin subunit betaInvolved in oxygen transport from the lung to the various peripheral tissues.
ANK1Ankyrin-1Component of the ankyrin-1 complex, a multiprotein complex involved in the stability and shape of the erythrocyte membrane.
ITGA3Integrin alpha-3Integrin alpha-3/beta-1 is a receptor for fibronectin, laminin, collagen, epiligrin, thrombospondin and CSPG4.
NRASGTPase NRasRas proteins bind GDP/GTP and possess intrinsic GTPase activity.

Protein-family classification

Druggable: 3 · Difficult: 5 · Unknown: 8 · Druggable fraction: 0.19

Family distribution

Cohort families vs a genome-wide background (hypergeometric, BH-FDR; fold = observed/expected). Counts kept; sorted by enrichment, so the catch-all Other/Unknown bucket no longer leads.

FamilyGenesFoldFDR
Scaffold/PPI22.2×0.535
Antibody/Immunoglobulin11.8×0.535
Transcription factor31.6×0.535
Enzyme (other)21.5×0.535
Other/Unknown80.9×0.765

Per-gene assignment

SymbolFamilyDruggable?ECInterPro (top 3)
RPL11Other/UnknownnoRibosomal_uL5, Ribosomal_uL5_CS, Ribosomal_uL5_dom_sf
RPS26Other/UnknownnoRibosomal_eS26, Ribosomal_eS26_sf, Ribosomal_eS26_CS
SHHOther/UnknownnoHedgehog_signalling_dom, Hedgehog, Hedgehog_Hint
SPTA1Scaffold/PPInoSH3_domain, Spectrin_repeat, EF_hand_dom
SPTBOther/UnknownnoActinin_actin-bd_CS, CH_dom, Spectrin_repeat
KDM6AEnzyme (other)yes1.14.11.68JmjC_dom, TPR-like_helical_dom_sf, TPR_rpt
NSD2Transcription factorno2.1.1.356PWWP_dom, SET_dom, Znf_RING
YARS1Enzyme (other)yes6.1.1.1aa-tRNA-synth_Ic, Tyr-tRNA-ligase, tRNA-bd_dom
C1QTNF5Other/UnknownnoC1q_dom, Collagen, Tumour_necrosis_fac-like_dom
GATA1Transcription factornoZnf_GATA, Znf_NHR/GATA, Transcription_factor_GATA
GATA2Transcription factornoZnf_GATA, Znf_NHR/GATA, TF_GATA-2/3
HBA1Other/UnknownnoGlobin, Hemoglobin_a-typ, Hemoglobin_pi
HBBOther/UnknownnoGlobin, Hemoglobin_b, Globin-like_sf
ANK1Scaffold/PPInoDeath_dom, ZU5_dom, Ankyrin_rpt
ITGA3Antibody/ImmunoglobulinyesIntegrin_alpha, FG-GAP, Int_alpha_beta-p
NRASOther/UnknownnoSmall_GTPase, Small_GTP-bd, Small_GTPase_Ras-type

Expression context

Cohort genes with no expression data: 0.

14 cohort genes are a single-cell marker in ≥1 SCXA experiment.

Breadth distribution (Bgee present_calls)

BucketGenes
narrow (1-5 tissues)0
moderate (6-20)0
broad (>20)16
unknown0

Top tissues across cohort

TissueCohort genes
monocyte3
bone marrow3
trabecular bone tissue3
cortical plate2
ganglionic eminence2
bone marrow cell2
secondary oocyte2
blood2
granulocyte1
left adrenal gland cortex1
mucosa of transverse colon1
buccal mucosa cell1
epithelial cell of pancreas1
right lobe of liver1
gastrocnemius1
hindlimb stylopod muscle1
muscle of leg1
oocyte1
ventricular zone1
islet of Langerhans1

Per-gene tissue summary (top 30)

SymbolBgee breadthFANTOM5 breadthSCXATop tissues
RPL11293ubiquitousmarkerganglionic eminence, cortical plate, monocyte
RPS26140ubiquitousmarkergranulocyte, left adrenal gland cortex, mucosa of transverse colon
SHH131broadmarkerbuccal mucosa cell, right lobe of liver, epithelial cell of pancreas
SPTA1147tissue_specificmarkertrabecular bone tissue, bone marrow, bone marrow cell
SPTB220broadmarkergastrocnemius, hindlimb stylopod muscle, muscle of leg
KDM6A286ubiquitousyessecondary oocyte, oocyte, bone marrow cell
NSD2281ubiquitousmarkerventricular zone, ganglionic eminence, cortical plate
YARS1290ubiquitousmarkerislet of Langerhans, right adrenal gland, left adrenal gland
C1QTNF5128ubiquitousyesapex of heart, gall bladder, ascending aorta
GATA1138tissue_specificmarkertrabecular bone tissue, blood, bone marrow
GATA2273ubiquitousmarkerseminal vesicle, right lung, left uterine tube
HBA1133tissue_specificmarkermonocyte, blood, bone marrow
HBB284broadmarkermonocyte, trabecular bone tissue, vena cava
ANK1226broadmarkerskeletal muscle tissue of rectus abdominis, triceps brachii, body of tongue
ITGA3149ubiquitousmarkermetanephric glomerulus, right coronary artery, upper lobe of left lung
NRAS278ubiquitousmarkergingival epithelium, epithelium of nasopharynx, secondary oocyte

Protein interactions among cohort

Intra-cohort edges: 5.

Hub genes (top 10 by interactor count)

SymbolInteractor count
KDM6A8,825
NRAS7,598
ANK15,705
GATA24,979
SHH4,953
GATA14,810
YARS14,793
NSD23,530
ITGA32,079
SPTA11,551

Intra-cohort edges

ABSources
ANK1SPTA1biogrid_interaction, string_interaction
ANK1SPTBbiogrid_interaction, string_interaction
GATA1GATA2string_interaction
HBA1HBBintact
SPTA1SPTBintact, string_interaction

Structural data

PDB: 15 · AlphaFold-only: 1 · No structure: 0

Cohort genes with PDB structures (top 30)

SymbolUniProtPDB entries
HBA1P69905356
HBBP68871350
RPS26P62854190
RPL11P62913185
NRASP0111135
NSD2O9602822
ANK1P1615721
SHHQ1546520
YARS1P545778
SPTBP112776
KDM6AO155505
SPTA1P025493
C1QTNF5Q9BXJ02
GATA2P237692
GATA1P159761

AlphaFold-only cohort genes (top 30 by pLDDT)

SymbolUniProtpLDDT
ITGA3P2600683.88

Function

Pathway analysis

Distinct Reactome pathways touched by cohort: 196. Enrichment computed across 16 evidence-associated genes (15 with Reactome annotation).

Pathways by enrichment

Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 15 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.

PathwayCohort genesFoldFDRSample cohort genes
Heme assimilation2507.6×8e-04HBA1, HBB
Interaction between L1 and Ankyrins373.7×8e-04SPTA1, SPTB, ANK1
NCAM signaling for neurite out-growth354.4×0.001SPTA1, SPTB, NRAS
Erythrocytes take up oxygen and release carbon dioxide2169.2×0.003HBA1, HBB
Erythrocytes take up carbon dioxide and release oxygen2117.1×0.004HBA1, HBB
Scavenging of heme from plasma2117.1×0.004HBA1, HBB
ER to Golgi Anterograde Transport326.6×0.005SPTA1, SPTB, ANK1
L1CAM interactions324.0×0.005SPTA1, SPTB, ANK1
COPI-mediated anterograde transport322.0×0.005SPTA1, SPTB, ANK1
Axon guidance412.0×0.005RPS26, SPTA1, SPTB, ANK1
Nervous system development411.4×0.005RPS26, SPTA1, SPTB, ANK1
Developmental Biology65.8×0.005RPS26, SHH, SPTA1, SPTB, KDM6A, ANK1
Transport to the Golgi and subsequent modification320.6×0.006SPTA1, SPTB, ANK1
Factors involved in megakaryocyte development and platelet production313.3×0.019GATA1, GATA2, HBB
RAF/MAP kinase cascade312.2×0.022SPTA1, SPTB, NRAS
Asparagine N-linked glycosylation312.0×0.022SPTA1, SPTB, ANK1
Heme signaling228.7×0.025HBA1, HBB
Cytoprotection by HMOX1224.6×0.032HBA1, HBB
Signaling by RAS GAP mutants1253.8×0.039NRAS
Signaling by RAS GTPase mutants1253.8×0.039NRAS
Activation of RAS in B cells1152.3×0.041NRAS
HHAT G278V doesn’t palmitoylate Hh-Np1152.3×0.041SHH
Formation of lateral plate mesoderm1152.3×0.041SHH
NrCAM interactions1108.8×0.041ANK1
RAS signaling downstream of NF1 loss-of-function variants1108.8×0.041NRAS
Estrogen-stimulated signaling through PRKCZ1108.8×0.041NRAS
SOS-mediated signalling195.2×0.041NRAS
Neurofascin interactions195.2×0.041ANK1
Release of Hh-Np from the secreting cell195.2×0.041SHH
Hh mutants abrogate ligand secretion195.2×0.041SHH

GO biological processes by enrichment

Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 16 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.

GO termCohort genesFoldFDRSample cohort genes
regulation of primitive erythrocyte differentiation21053.2×1e-04GATA1, GATA2
eosinophil fate commitment21053.2×1e-04GATA1, GATA2
erythrocyte development398.7×4e-04GATA1, HBA1, HBB
nitric oxide transport2421.3×7e-04HBA1, HBB
cellular oxidant detoxification2234.1×0.002HBA1, HBB
positive regulation of mast cell degranulation2191.5×0.002GATA1, GATA2
actin filament capping2191.5×0.002SPTA1, SPTB
carbon dioxide transport2162.0×0.003HBA1, HBB
oxygen transport2131.7×0.003HBA1, HBB
hydrogen peroxide catabolic process284.3×0.008HBA1, HBB
tyrosyl-tRNA aminoacylation11053.2×0.009YARS1
polarity specification of anterior/posterior axis11053.2×0.009SHH
trachea morphogenesis11053.2×0.009SHH
right lung development11053.2×0.009SHH
left lung development11053.2×0.009SHH
primary prostatic bud elongation11053.2×0.009SHH
regulation of prostatic bud formation11053.2×0.009SHH
obsolete regulation of mesenchymal cell proliferation involved in prostate gland development11053.2×0.009SHH
mesenchymal smoothened signaling pathway involved in prostate gland development11053.2×0.009SHH
semicircular canal development11053.2×0.009GATA2
positive regulation of sclerotome development11053.2×0.009SHH
tracheoesophageal septum formation11053.2×0.009SHH
negative regulation of ureter smooth muscle cell differentiation11053.2×0.009SHH
positive regulation of ureter smooth muscle cell differentiation11053.2×0.009SHH
negative regulation of kidney smooth muscle cell differentiation11053.2×0.009SHH
positive regulation of kidney smooth muscle cell differentiation11053.2×0.009SHH
regulation of forebrain neuron differentiation11053.2×0.009GATA2
positive regulation of erythrocyte differentiation263.8×0.009GATA1, GATA2
response to hydrogen peroxide258.5×0.009HBA1, HBB
homeostasis of number of cells within a tissue255.4×0.009GATA1, GATA2

Therapeutics

Drugs indicated for this disease

54 approved, 18 in late-stage (phase 3) trials. Disease-direct ChEMBL indications, not inferred from the associated-gene cohort below.

DrugDevelopment status
Ascorbic AcidApproved (phase 4)
Betamethasone AcetateApproved (phase 4)
BiotinApproved (phase 4)
Calcium CarbonateApproved (phase 4)
CholecalciferolApproved (phase 4)
CladribineApproved (phase 4)
Cupric SulfateApproved (phase 4)
CyanocobalaminApproved (phase 4)
DaprodustatApproved (phase 4)
Darbepoetin AlfaApproved (phase 4)
DecitabineApproved (phase 4)
Epoetin AlfaApproved (phase 4)
Epoetin BetaApproved (phase 4)
Epoetin DeltaApproved (phase 4)
Epoetin ThetaApproved (phase 4)
Epoetin ZetaApproved (phase 4)
FelbamateApproved (phase 4)
Ferric CarboxymaltoseApproved (phase 4)
Ferric DerisomaltoseApproved (phase 4)
Ferric MaltolApproved (phase 4)
Ferrous FumarateApproved (phase 4)
Ferrous SuccinateApproved (phase 4)
Ferrous SulfateApproved (phase 4)
FerumoxytolApproved (phase 4)
Folic AcidApproved (phase 4)
HydroxocobalaminApproved (phase 4)
Imetelstat SodiumApproved (phase 4)
ImigluceraseApproved (phase 4)
Intrinsic FactorApproved (phase 4)
Iron DextranApproved (phase 4)
Iron SucroseApproved (phase 4)
Isopropyl AlcoholApproved (phase 4)
LenalidomideApproved (phase 4)
LevoleucovorinApproved (phase 4)
LuspaterceptApproved (phase 4)
MecobalaminApproved (phase 4)
Methoxy Polyethylene Glycol-Epoetin BetaApproved (phase 4)
Methylprednisolone AcetateApproved (phase 4)
NiacinApproved (phase 4)
NiacinamideApproved (phase 4)
PentostatinApproved (phase 4)
Potassium IodideApproved (phase 4)
Prednisolone AcetateApproved (phase 4)
PrednisoneApproved (phase 4)
PyridoxineApproved (phase 4)
RiboflavinApproved (phase 4)
RoxadustatApproved (phase 4)
Sodium AscorbateApproved (phase 4)
Sodium CitrateApproved (phase 4)
Triamcinolone AcetonideApproved (phase 4)
VadadustatApproved (phase 4)
VoxelotorApproved (phase 4)
Zinc OxideApproved (phase 4)
Zinc SulfateApproved (phase 4)
Blood Cells, RedPhase 3 (in late-stage trials)
CarboplatinPhase 3 (in late-stage trials)
CisplatinPhase 3 (in late-stage trials)
DexamethasonePhase 3 (in late-stage trials)
Efepoetin AlfaPhase 3 (in late-stage trials)
Ferric PyrophosphatePhase 3 (in late-stage trials)
Ferrous AscorbatePhase 3 (in late-stage trials)
Ferrous BisglycinatePhase 3 (in late-stage trials)
FilgrastimPhase 3 (in late-stage trials)
MebendazolePhase 3 (in late-stage trials)
PaclitaxelPhase 3 (in late-stage trials)
PeginesatidePhase 3 (in late-stage trials)
Polyethylene GlycolPhase 3 (in late-stage trials)
PyrimethaminePhase 3 (in late-stage trials)
Sodium ChloridePhase 3 (in late-stage trials)
Sodium Ferric Gluconate ComplexPhase 3 (in late-stage trials)
SulfadoxinePhase 3 (in late-stage trials)
Tranexamic AcidPhase 3 (in late-stage trials)

Earlier-phase candidates (phase 2, investigational — efficacy not yet established): Acetylcholine, Blood, Whole, Deferasirox, Ruxolitinib, Sirolimus, Sotatercept, Tilarginine, Tretinoin.

Drug target analysis

Approved (phase 4): 7 · Phase ≥3: 7 · Phased (≥1): 8 · Undrugged: 8

Druggability breadth: 11 of 16 evidence-associated genes (69%) have a ChEMBL target (buckets above are over the deeply-mined display cohort).

Genes with an approved drug

The molecule shown is one approved compound that hits the gene — not necessarily a drug of choice or one indicated for this disease.

SymbolExample approved molecule
RPL11GENTAMICIN SULFATE
RPS26GENTAMICIN SULFATE
SHHVISMODEGIB
KDM6ADEFERIPRONE
NSD2VENETOCLAX
YARS1CAPSAICIN
HBBCANDESARTAN CILEXETIL

Top cohort targets by molecule count

SymbolMoleculesMax phase
HBB234
NSD284
YARS124
RPL1114
RPS2614
SHH14
KDM6A14
NRAS11
SPTA100
SPTB00

Drugs targeting cohort genes (top 30)

MoleculeMax phaseTargets in cohort
GENTAMICIN SULFATE4RPL11, RPS26
VISMODEGIB4SHH
DEFERIPRONE4KDM6A
VENETOCLAX4NSD2
MITOXANTRONE4NSD2
CAPSAICIN4YARS1
CANDESARTAN CILEXETIL4HBB
MECHLORETHAMINE HYDROCHLORIDE4HBB
PHENAZOPYRIDINE HYDROCHLORIDE4HBB
MERCAPTOPURINE ANHYDROUS4HBB
AZACITIDINE4HBB
AZATHIOPRINE4HBB
TOPOTECAN HYDROCHLORIDE4HBB
ACYCLOVIR4HBB
FLUOROURACIL4HBB
RAUWOLFIA SERPENTINA4HBB
HYDROQUINONE4HBB
MENADIONE4HBB
THIOTEPA4HBB
THIOGUANINE4HBB
RESERPINE4HBB
SURAMIN3NSD2
CRENOLANIB3YARS1
CURCUMIN3HBB
HYDROXYCAMPTOTHECIN3HBB
SINEFUNGIN2NSD2
MOLIBRESIB2HBB, NSD2
HOMIDIUM BROMIDE2NSD2
FISETIN2HBB
TEROXIRONE2HBB

Bioactivity and enzyme data

Enzyme cohort genes (≥1 EC): 3.

Cohort genes with ChEMBL bioactivity (full, sorted by assay count)

SymbolAssaysType breakdown
NSD2264Binding:256, Functional:8
RPL1190Binding:90
RPS2689Binding:89
HBB68Binding:50, Functional:18
HBA159Binding:46, Functional:13
KDM6A40Binding:36, Functional:4
SHH27Binding:23, Functional:4
NRAS18Binding:18
YARS116Binding:16
ITGA35Binding:5
GATA21Binding:1

Cohort enzymes (BRENDA EC)

SymbolEC numbersNames
KDM6A1.14.11.68[histone H3]-trimethyl-L-lysine27 demethylase
NSD22.1.1.356, 2.1.1.357, 2.1.1.359[histone H3]-lysine27 N-trimethyltransferase, [histone H3]-lysine36 N-dimethyltransferase, [histone H3]-lysine36 N-trimethyltransferase
YARS16.1.1.1tyrosine-tRNA ligase

Cohort genes with high screening signal

≥100 ChEMBL assays — a studied-ness signal; see Therapeutics for approved-drug status.

SymbolChEMBL assays
NSD2264

Pharmacogenomics

Cohort genes with a PharmGKB record: 16; with CPIC/DPWG dosing guidelines: 0.

No cohort gene has a CPIC/DPWG genotype-guided dosing guideline (PharmGKB).

Chemical tractability of cohort targets

29 approved/phased compounds have measured bioactivity against a cohort gene (and aren’t yet in disease-level trials). This is a research / tractability signal, NOT a therapeutic recommendation — a bioactivity row often reflects off-target or screening binding (e.g. promiscuous kinase inhibitors against a cohort kinase), implying no disease mechanism.

CompoundMax phaseCohort target (bioactivity)
GENTAMICIN SULFATE4RPL11, RPS26
VISMODEGIB4SHH
DEFERIPRONE4KDM6A
VENETOCLAX4NSD2
MITOXANTRONE4NSD2
CAPSAICIN4YARS1
CANDESARTAN CILEXETIL4HBB
MECHLORETHAMINE HYDROCHLORIDE4HBB
PHENAZOPYRIDINE HYDROCHLORIDE4HBB
MERCAPTOPURINE ANHYDROUS4HBB
AZACITIDINE4HBB
AZATHIOPRINE4HBB
TOPOTECAN HYDROCHLORIDE4HBB
ACYCLOVIR4HBB
FLUOROURACIL4HBB
RAUWOLFIA SERPENTINA4HBB
HYDROQUINONE4HBB
MENADIONE4HBB
THIOGUANINE4HBB
RESERPINE4HBB
SURAMIN3NSD2
CRENOLANIB3YARS1
CURCUMIN3HBB
HYDROXYCAMPTOTHECIN3HBB
SINEFUNGIN2NSD2
MOLIBRESIB2HBB, NSD2
HOMIDIUM BROMIDE2NSD2
FISETIN2HBB
TEROXIRONE2HBB

Druggability pyramid

Cohort genes binned by druggability tier (high → low):

TierDefinitionGenesSymbols
AApproved (phase 4 drug)7RPL11, RPS26, SHH, KDM6A, NSD2, YARS1, HBB
BPhased (≥1) drug, not yet approved1NRAS
CDruggable family + PDB, no drug0
DDruggable family + AlphaFold only, no drug1ITGA3
EDifficult family or no structure, no drug7SPTA1, SPTB, C1QTNF5, GATA1, GATA2, HBA1, ANK1

Undrugged target profiles

8 cohort genes are undrugged. Ranked by ‘starting-point quality’ (assay depth + drugged-partner adjacency).

SymbolChEMBL assaysDrugged partners (top 3)
HBA159HBB
SPTA10
SPTB0
C1QTNF50
GATA10
GATA21
ANK10
ITGA35

Clinical trials & evidence

Clinical trials

Clinical trials: 1,177.

Phase distribution (across all retrieved trials)

PhaseTrials
Not specified427
PHASE3243
PHASE2186
PHASE4146
PHASE155
PHASE1/PHASE222
PHASE2/PHASE316
EARLY_PHASE15

Top trials by phase / activity

NCTPhaseStatusTitle
NCT04168346PHASE4NOT_YET_RECRUITINGPreoperative Intravenous Iron Therapy in Patients With Gastric Cancer
NCT04707261PHASE4RECRUITINGAssociation Between Dapagliflozin-induced Improvement and Anemia in Heart Failure Patients (ADIDAS)
NCT05060731PHASE4NOT_YET_RECRUITINGIron Supplementation in Upper Non-variceal Gastrointestinal Bleeding
NCT05891249PHASE4ACTIVE_NOT_RECRUITINGA Study to Evaluate the Safety and Effectiveness of Luspatercept for the Treatment of Transfusion-dependent (TD) Anemia Associated With Myelodysplastic Syndromes (MDS) & Beta-thalassemia (β-Thal) in India
NCT05929729PHASE4RECRUITINGIron Deficiency Anemia (IDA) and the Brain
NCT06624631PHASE4RECRUITINGPDMC Implementation Trial in Kenya
NCT06670963PHASE4RECRUITINGUse of Epoetin Alfa and Iron Derisomaltose in Treatment of Anemia in Patients With Sepsis or Septic Shock: a Randomized Controlled Trial
NCT06898814PHASE4RECRUITINGThe Effects of Intravenous Iron on Mobility in Elderly Patients Following Hip Fracture Surgery
NCT07038330PHASE4NOT_YET_RECRUITINGDOSE FINDING PROSPECTIVE ANALYSIS AFTER HEART SURGERY for Sucrosomial Iron
NCT07160452PHASE4NOT_YET_RECRUITINGLevocarnitine for Reducing ESA Requirements in Hemodialysis Patients With Renal Anemia
NCT07238972PHASE4NOT_YET_RECRUITINGPostoperative Intravenous Ferric Derisomaltose for Prevention of Long-term Anemia After Cardiac Surgery
NCT07287371PHASE4NOT_YET_RECRUITINGSucrosomial Vs Intravenous Iron for Preoperative Anemia
NCT07308769PHASE4RECRUITINGEfficacy and Tolerability of Sucrosomial® Iron vs Ferric Maltol in Iron Deficient Women
NCT07314424PHASE4NOT_YET_RECRUITINGThe Effect of Preoperative Intravenous Ferric Derisomaltose and EPO and Tranexamic Acid in Patients With Preoperative Anemia or Iron Deficiency in Bone Tumor
NCT07523646PHASE4RECRUITINGAnemia Therapy in Patients With Infective Endocarditis
NCT00003398PHASE4COMPLETEDBone Marrow Transplantation in Treating Patients With Hematologic Cancer
NCT00022386PHASE4COMPLETEDEpoetin Alfa in Treating Chemotherapy-Related Anemia in Women With Stage I, Stage II, or Stage III Breast Cancer
NCT00046969PHASE4COMPLETEDEpoetin Beta in Treating Anemia in Patients With Cervical Cancer
NCT00111995PHASE4COMPLETEDEvaluating Aranesp® for the Treatment of Anemia in African-American Subjects With Chronic Renal Failure (CRF) Receiving Hemodialysis
NCT00117039PHASE4COMPLETEDA Study to Evaluate the Effectiveness of Aranesp® for Cancer Patients With Anemia
NCT00117065PHASE4COMPLETEDStudy of Transplant Related Anemia Treated With Aranesp® (STRATA)
NCT00117117PHASE4COMPLETEDA Study to Assess Symptom Burden in Subjects With Nonmyeloid Malignancies Receiving Chemotherapy and Aranesp®
NCT00126334PHASE4COMPLETEDConservative Versus Liberal Red Cell Transfusion in Myocardial Infarction Trial: The CRIT Pilot
NCT00153868PHASE4COMPLETEDA Web-based Study of Quality of Life Benefits Associated Aranesp in Anemic Patients With Cancer
NCT00168948PHASE4UNKNOWNIntermittent Antimalaria Treatment With SP in African Children
NCT00173706PHASE4UNKNOWNEvaluation of the Effects of L-Carnitine Injection in Patients Undergoing Hemodialysis
NCT00194857PHASE4TERMINATEDTreatment of Anemia and Neutropenia in HIV/HCV Coinfected Patients Treated With Pegylated Interferon and Ribavirin
NCT00204334PHASE4COMPLETEDEffects of Anemia Correction on Vascular and Monocyte Function in Renal Transplant Recipients
NCT00206739PHASE4COMPLETEDIntermittent Treatment With Sulfadoxine-pyrimethamine for Malaria Control in Infants
NCT00211120PHASE4TERMINATEDCorrection of Hemoglobin and Outcomes in Renal Insufficiency (CHOIR)
NCT00216541PHASE4COMPLETEDA Study of the Safety and Effectiveness of Epoetin Alfa on Hemoglobin Levels and Blood Transfusions in Cancer Patients Receiving Chemotherapy
NCT00223938PHASE4TERMINATEDStudy of the Efficacy and Safety of Ferrlecit in the Maintenance Dosing in Hemodialysis Patients.
NCT00223964PHASE4COMPLETEDStudy of the Efficacy of Two Doses of Ferrlecit in the Treatment of Iron Deficiency in Pediatric Hemodialysis Patients
NCT00224003PHASE4COMPLETEDStudy of the Safety and Efficacy of Ferrlecit® Maintenance Dosing in Pediatric Hemodialysis Patients
NCT00224068PHASE4COMPLETEDEffect of Iron Therapy as an Adjunct to Epoetin Alfa in the Anemia of Cancer Chemotherapy
NCT00239642PHASE4COMPLETEDSafety and Efficacy of Iron Sucrose in Children
NCT00247507PHASE4UNKNOWNThe Effects of Acetylcysteine on Alleviating Damage of Oxidative Stress in Hemodialysis Patients
NCT00248716PHASE4UNKNOWNTreatment of Anemia in the 2nd Year of Life. Comparison of the Efficacy of Two Different Iron Preparations.
NCT00283465PHASE4COMPLETEDA Study of the Effectiveness and Safety of Treatment With Epoetin Alfa on Hemoglobin Levels, Red Blood Cell Transfusions, and Quality of Life in Patients With Cancer Receiving Platinum-containing Chemotherapy
NCT00312871PHASE4TERMINATEDEffects of Early Correction of Anemia in Patients With Chronic Renal Insufficiency

Drugs tested across these trials (top 30)

MoleculeMax phaseTrials referencing
EPOETIN ALFA467
DARBEPOETIN ALFA458
EPOETIN BETA424
FERRIC CARBOXYMALTOSE422
METHOXY POLYETHYLENE GLYCOL-EPOETIN BETA422
PEGINESATIDE418
FERROUS SULFATE417
IRON SUCROSE412
FERROUS FUMARATE48
CYCLOPHOSPHAMIDE ANHYDROUS46
FERRIC DERISOMALTOSE46
ALBENDAZOLE45
FERUMOXYTOL44
CINACALCET43
CYANOCOBALAMIN43
FOLIC ACID43
SULFADOXINE43
ASCORBIC ACID42
CHOLECALCIFEROL42
EPOETIN DELTA42
FERROUS GLUCONATE42
FILGRASTIM42
LEVOCARNITINE42
PARICALCITOL42
QUININE42
THIOTEPA42
ACETYLCYSTEINE41
ARTESUNATE41
CALCITRIOL41
CARBETOCIN41