Megaloblastic anemia

disease
On this page

Also known as megaloblastic anaemia (disease)megaloblastic anemia (disease)

Summary

Megaloblastic anemia (MONDO:0001700) is a disease (an umbrella term covering 6 Mondo subtypes) with 2 cohort genes (10 GWAS associations across 6 studies).

At a glance

  • Umbrella term: 6 Mondo subtypes
  • Cohort genes: 2
  • GWAS associations: 10
  • ClinVar variants: 2

Clinical features

No curated clinical features (Orphanet) for this disease.

Identifiers

Disease identifiers

FieldValue
Canonical namemegaloblastic anemia
Mondo IDMONDO:0001700
DOIDDOID:13382
NCITC34382
SNOMED CT53165003
UMLSC0002888
MedGen1527
Is cancer (heuristic)no

Also known as: megaloblastic anaemia (disease) · megaloblastic anemia · megaloblastic anemia (disease)

Data availability: 2 ClinVar variants · 10 GWAS associations (6 studies) · 1 HPO phenotype.

Disease family

An umbrella term covering 6 Mondo subtypes.

Classification path: disease › human disease › disease by body system or component › hematologic disorderanemiamacrocytic anemiamegaloblastic anemia

Related subtypes (1): myelodysplastic syndrome associated with isolated del(5q)

Subtypes (6): pernicious anemia, hereditary folate malabsorption, formiminoglutamic aciduria, Imerslund-Grasbeck syndrome, constitutional megaloblastic anemia with severe neurologic disease, vitamin B12- and folate-independent constitutional megaloblastic anemia

Genetics & variants

GWAS landscape

10 GWAS associations across 6 studies. Top hits map to 5 distinct genes (as reported by GWAS).

Top associations by p-value

rsIDp-valueGeneRisk alleleOdds ratio
rs343242193e-63TCN1C0.3
rs5036444e-56TCN1 - OOSP3T0.9
rs18012224e-46CUBNA0.18
rs6013381e-33FUT2G0.15
rs11316033e-24TCN2T0.34
chr6:494453064e-20T0.11
chr19:58397463e-13G0.18
chr19:83737781e-11G0.19
rs1160756621e-11MMAAG0.18
rs1878968637e-10PRDX4P1 - THAP12P9?

Top studies (by case count)

StudyLead authorYearCasesControlsTitle
GCST90475774Verma A202413,771426,909Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program.
GCST90475773Verma A20243,353115,939Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program.
GCST90480950Verma A20243,353115,939Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program.
GCST90477448Verma A20241,56657,186Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program.
GCST90435797Zhou W20181,076390,026Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies.
GCST90651294Liu TY2025359215,243Diversity and longitudinal records: Genetic architecture of disease associations and polygenic risk in the Taiwanese Han population.

Variant details and genetic-evidence tiers

Tier distribution (top 50 variants)

TierVariants
Tier 1: coding4
Tier 2: splice/UTR0
Tier 3: regulatory0
Tier 4: intronic/intergenic6

MAF distribution

BucketVariants
common (>=0.05)7
low_freq (0.01-0.05)2
rare (<0.01)0
unknown1

Functional consequences

ConsequenceCount
missense_variant3
unknown3
intergenic_variant2
stop_gained1
intron_variant1

Top variants

rsIDChrPosAllelesMAFConsequenceGenep-valueTier
rs343242191159855905C>A0.113missense_variantTCN13e-63Tier 1: coding
rs5036441159873981T>A,C0.032intergenic_variantTCN1 - OOSP34e-56Tier 4: intronic/intergenic
rs18012221017114152A>C,G,T0.343missense_variantCUBN4e-46Tier 1: coding
rs6013381948703417G>A0.488stop_gainedFUT21e-33Tier 1: coding
rs11316032230622988T>C0.05missense_variantTCN23e-24Tier 1: coding
chr6:494453060.3644e-20Tier 4: intronic/intergenic
chr19:58397460.4813e-13Tier 4: intronic/intergenic
chr19:83737780.3021e-11Tier 4: intronic/intergenic
rs1160756624145653003G>A0.049intron_variantMMAA1e-11Tier 4: intronic/intergenic
rs187896863445143384C>Aintergenic_variantPRDX4P1 - THAP12P97e-10Tier 4: intronic/intergenic

ClinVar germline variants

2 retrieved; paginated sample, class counts are floors:

1 uncertain significance, 1 pathogenic

ClinVarVariant (HGVS)GeneClassificationReview
523538NM_030943.4(AMN):c.320_321dup (p.Asp108fs)AMNPathogeniccriteria provided, single submitter
523537NM_030943.4(AMN):c.149T>C (p.Phe50Ser)AMNUncertain significancecriteria provided, multiple submitters, no conflicts

Genes & proteins

Mendelian disease overlap and somatic drivers

GenCC: 0 · Orphanet: 5 · OMIM-shared: 0 · Dual-evidence (GWAS+Mendelian): 0

Orphanet rare-disease linkage (cohort genes)

GeneOrphanet IDRare disease
AMNOrphanet:35858Imerslund-Gräsbeck syndrome
ABCD1Orphanet:139396X-linked cerebral adrenoleukodystrophy
ABCD1Orphanet:139399Adrenomyeloneuropathy
ABCD1Orphanet:369942CADDS
ABCD1Orphanet:388Hirschsprung disease

Cohort genes → proteins

2 cohort genes, 2 distinct canonical proteins.

Evidence partition

SubsetGenes
multi_evidence2

Cohort genes (full)

SymbolHGNCEnsemblUniProtNameEvidence
AMNHGNC:14604ENSG00000166126Q9BXJ7Protein amnionlessclinvar
ABCD1HGNC:61ENSG00000101986P33897ATP-binding cassette sub-family D member 1clinvar

Cohort function summary

Lead sentence per gene, UniProt-curated.

SymbolProtein nameFunction (lead sentence)
AMNProtein amnionlessMembrane-bound component of the endocytic receptor formed by AMN and CUBN.
ABCD1ATP-binding cassette sub-family D member 1ATP-dependent transporter of the ATP-binding cassette (ABC) family involved in the transport of very long chain fatty acid (VLCFA)-CoA from the cytosol to the peroxisome lumen.

Protein-family classification

Druggable: 1 · Difficult: 0 · Unknown: 1 · Druggable fraction: 0.5

Family distribution

Cohort families vs a genome-wide background (hypergeometric, BH-FDR; fold = observed/expected). Counts kept; sorted by enrichment, so the catch-all Other/Unknown bucket no longer leads.

FamilyGenesFoldFDR
Transporter138.9×0.051
Other/Unknown10.9×0.805

Per-gene assignment

SymbolFamilyDruggable?ECInterPro (top 3)
AMNOther/UnknownnoAMN
ABCD1Transporteryes7.6.2.4ABC_transporter-like_ATP-bd, AAA+_ATPase, FA_transporter

Expression context

Cohort genes with no expression data: 0.

2 cohort genes are a single-cell marker in ≥1 SCXA experiment.

Breadth distribution (Bgee present_calls)

BucketGenes
narrow (1-5 tissues)0
moderate (6-20)0
broad (>20)2
unknown0

Top tissues across cohort

TissueCohort genes
duodenum1
jejunal mucosa1
mucosa of transverse colon1
ileal mucosa1
left adrenal gland1
left adrenal gland cortex1

Per-gene tissue summary (top 30)

SymbolBgee breadthFANTOM5 breadthSCXATop tissues
AMN223broadmarkermucosa of transverse colon, jejunal mucosa, duodenum
ABCD1201ubiquitousmarkerileal mucosa, left adrenal gland cortex, left adrenal gland

Protein interactions among cohort

Intra-cohort edges: 0.

Hub genes (top 10 by interactor count)

SymbolInteractor count
ABCD11,181
AMN414

Structural data

PDB: 2 · AlphaFold-only: 0 · No structure: 0

Cohort genes with PDB structures (top 30)

SymbolUniProtPDB entries
ABCD1P3389714
AMNQ9BXJ71

Function

Pathway analysis

Distinct Reactome pathways touched by cohort: 29. Enrichment computed across 2 evidence-associated genes (2 with Reactome annotation).

Pathways by enrichment

Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 2 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.

PathwayCohort genesFoldFDRSample cohort genes
Defective ABCD1 causes ALD12855.0×0.005ABCD1
Defective AMN causes MGA111903.3×0.005AMN
Defective CUBN causes MGA111903.3×0.005AMN
HDL clearance11142.0×0.006AMN
alpha-linolenic (omega3) and linoleic (omega6) acid metabolism1951.7×0.006ABCD1
Linoleic acid (LA) metabolism1571.0×0.006ABCD1
Uptake of dietary cobalamins into enterocytes1571.0×0.006AMN
Transport of small molecules225.1×0.006AMN, ABCD1
Beta-oxidation of very long chain fatty acids1439.2×0.006ABCD1
Defects in cobalamin (B12) metabolism1407.9×0.006AMN
alpha-linolenic acid (ALA) metabolism1356.9×0.006ABCD1
Peroxisomal lipid metabolism1335.9×0.006ABCD1
Cobalamin (Cbl, vitamin B12) transport and metabolism1317.2×0.006AMN
ABC transporters in lipid homeostasis1300.5×0.006ABCD1
Defects in vitamin and cofactor metabolism1300.5×0.006AMN
Class I peroxisomal membrane protein import1259.6×0.007ABCD1
Plasma lipoprotein clearance1237.9×0.007AMN
ABC transporter disorders1219.6×0.007ABCD1
Disease213.1×0.009AMN, ABCD1
Metabolism211.6×0.011AMN, ABCD1
Plasma lipoprotein assembly, remodeling, and clearance1114.2×0.012AMN
Protein localization195.2×0.014ABCD1
Metabolism of water-soluble vitamins and cofactors190.6×0.014AMN
Disorders of transmembrane transporters169.6×0.017ABCD1
Fatty acid metabolism165.6×0.018ABCD1
ABC-family protein mediated transport160.7×0.018ABCD1
Metabolism of vitamins and cofactors158.3×0.018AMN
Diseases of metabolism140.2×0.026AMN
Metabolism of lipids115.8×0.062ABCD1

GO biological processes by enrichment

Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 2 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.

GO termCohort genesFoldFDRSample cohort genes
peroxisomal membrane transport14213.0×0.002ABCD1
very long-chain fatty-acyl-CoA catabolic process14213.0×0.002ABCD1
renal protein absorption12808.7×0.002AMN
positive regulation of unsaturated fatty acid biosynthetic process12808.7×0.002ABCD1
sterol homeostasis12106.5×0.002ABCD1
long-chain fatty acid import into peroxisome11685.2×0.002ABCD1
regulation of fatty acid beta-oxidation11404.3×0.002ABCD1
long-chain fatty acid catabolic process11404.3×0.002ABCD1
myelin maintenance11404.3×0.002ABCD1
regulation of mitochondrial depolarization11404.3×0.002ABCD1
fatty acid elongation11203.7×0.002ABCD1
very long-chain fatty acid catabolic process11203.7×0.002ABCD1
cobalamin transport1936.2×0.003AMN
cobalamin metabolic process1766.0×0.003AMN
positive regulation of fatty acid beta-oxidation1766.0×0.003ABCD1
fatty acid derivative biosynthetic process1766.0×0.003ABCD1
regulation of cellular response to oxidative stress1648.1×0.003ABCD1
regulation of oxidative phosphorylation1601.9×0.003ABCD1
neuron projection maintenance1561.7×0.003ABCD1
negative regulation of reactive oxygen species biosynthetic process1495.6×0.003ABCD1
fatty acid homeostasis1468.1×0.003ABCD1
alpha-linolenic acid metabolic process1443.5×0.003ABCD1
peroxisome organization1401.2×0.003ABCD1
very long-chain fatty acid metabolic process1383.0×0.003ABCD1
linoleic acid metabolic process1351.1×0.004ABCD1
unsaturated fatty acid biosynthetic process1324.1×0.004ABCD1
Golgi to plasma membrane protein transport1263.3×0.004AMN
long-chain fatty acid biosynthetic process1221.7×0.005ABCD1
negative regulation of cytokine production involved in inflammatory response1210.7×0.005ABCD1
fatty acid beta-oxidation1187.2×0.006ABCD1

Therapeutics

Drug target analysis

Approved (phase 4): 0 · Phase ≥3: 0 · Phased (≥1): 0 · Undrugged: 2

Druggability breadth: 0 of 2 evidence-associated genes (0%) have a ChEMBL target (buckets above are over the deeply-mined display cohort).

Top cohort targets by molecule count

SymbolMoleculesMax phase
AMN00
ABCD100

Bioactivity and enzyme data

Enzyme cohort genes (≥1 EC): 1.

Cohort enzymes (BRENDA EC)

SymbolEC numbersNames
ABCD17.6.2.4ABC-type fatty-acyl-CoA transporter

Pharmacogenomics

Cohort genes with a PharmGKB record: 2; with CPIC/DPWG dosing guidelines: 0.

No cohort gene has a CPIC/DPWG genotype-guided dosing guideline (PharmGKB).

Chemical tractability of cohort targets

0 approved/phased compounds have measured bioactivity against a cohort gene (and aren’t yet in disease-level trials). This is a research / tractability signal, NOT a therapeutic recommendation — a bioactivity row often reflects off-target or screening binding (e.g. promiscuous kinase inhibitors against a cohort kinase), implying no disease mechanism.

Druggability pyramid

Cohort genes binned by druggability tier (high → low):

TierDefinitionGenesSymbols
AApproved (phase 4 drug)0
BPhased (≥1) drug, not yet approved0
CDruggable family + PDB, no drug1ABCD1
DDruggable family + AlphaFold only, no drug0
EDifficult family or no structure, no drug1AMN

Undrugged target profiles

2 cohort genes are undrugged. Ranked by ‘starting-point quality’ (assay depth + drugged-partner adjacency).

SymbolChEMBL assaysDrugged partners (top 3)
AMN0
ABCD10

Clinical trials & evidence

Clinical trials

Clinical trials: 0.