Congenital dyserythropoietic anemia type 4

disease
On this page

Also known as anemia, congenital dyserythropoietic, type IVCDA due to KLF1 mutationCDA IVCDA type 4CDA type IVCDAN4congenital dyserythropoietic anemia due to KLF1 mutationdyserythropoietic anemia, congenital, type IV

Summary

Congenital dyserythropoietic anemia type 4 (MONDO:0013355) is a disease caused by KLF1 (GenCC Definitive), with 2 cohort genes.

At a glance

  • Prevalence: <1 / 1 000 000 (Worldwide) [Orphanet-validated]
  • Causal gene: KLF1 (GenCC Definitive)
  • Cohort genes: 2
  • ClinVar variants: 43

Clinical features

Epidemiology

Prevalence records

2 prevalence record(s), Orphanet:

TypeClassValueGeographyValidation
Cases/families4WorldwideValidated
Point prevalence<1 / 1 000 000WorldwideValidated

Identifiers

Disease identifiers

FieldValue
Canonical namecongenital dyserythropoietic anemia type 4
Mondo IDMONDO:0013355
OMIM613673
Orphanet293825
DOIDDOID:0111400
SNOMED CT719453009
UMLSC3150926
MedGen462276
GARD0017344
Is cancer (heuristic)no

Also known as: anemia, congenital dyserythropoietic, type IV · CDA due to KLF1 mutation · CDA IV · CDA type 4 · CDA type IV · CDAN4 · congenital dyserythropoietic anemia due to KLF1 mutation · congenital dyserythropoietic anemia type 4 · dyserythropoietic anemia, congenital, type IV

Data availability: 43 ClinVar variants · 5 GenCC gene-disease records.

Disease family

Classification path: disease › human disease › disease by body system or component › hematologic disorderanemiacongenital anemiacongenital dyserythropoietic anemia type 4

Related subtypes (7): myopathy, lactic acidosis, and sideroblastic anemia, congenital nonspherocytic hemolytic anemia, congenital dyserythropoietic anemia type 3, congenital dyserythropoietic anemia type 2, severe congenital hypochromic anemia with ringed sideroblasts, Fanconi anemia, congenital dyserythropoietic anemia type 1

Genetics & variants

GWAS landscape

No GWAS associations recorded — common-variant (GWAS) studies don’t cover this disease (typical for Mendelian / rare diseases). See the curated gene cohort and Mendelian overlap below.

Variant details and genetic-evidence tiers

ClinVar germline variants

43 retrieved; paginated sample, class counts are floors:

14 uncertain significance, 8 conflicting classifications of pathogenicity, 8 benign, 6 benign/likely benign, 4 pathogenic, 2 likely pathogenic, 1 likely benign

ClinVarVariant (HGVS)GeneClassificationReview
2087NM_000159.4(GCDH):c.91+5G>TGCDHPathogeniccriteria provided, multiple submitters, no conflicts
56891NM_006563.5(KLF1):c.892G>C (p.Ala298Pro)KLF1Pathogeniccriteria provided, multiple submitters, no conflicts
56892NM_006563.5(KLF1):c.1012C>T (p.Pro338Ser)KLF1Pathogenicno assertion criteria provided
18447NM_006563.5(KLF1):c.973G>A (p.Glu325Lys)LOC117125591Pathogeniccriteria provided, multiple submitters, no conflicts
100801NM_006563.5(KLF1):c.913+1G>AKLF1Likely pathogeniccriteria provided, single submitter
3765538NM_006563.5(KLF1):c.858C>A (p.Cys286Ter)KLF1Likely pathogeniccriteria provided, single submitter
1693737NM_006563.5(KLF1):c.649C>T (p.Gln217Ter)KLF1Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
328313NM_006563.5(KLF1):c.690C>T (p.Ala230=)KLF1Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
328315NM_006563.5(KLF1):c.606G>A (p.Gly202=)KLF1Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
328321NM_006563.5(KLF1):c.311C>T (p.Ala104Val)KLF1Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
328323NM_006563.5(KLF1):c.259C>G (p.Pro87Ala)KLF1Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
891355NM_006563.5(KLF1):c.803G>T (p.Arg268Leu)KLF1Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
891356NM_006563.5(KLF1):c.757G>A (p.Glu253Lys)KLF1Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
892550NM_006563.5(KLF1):c.541T>A (p.Tyr181Asn)LOC130063673Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
328304NM_006563.5(KLF1):c.*367C>GKLF1Uncertain significancecriteria provided, single submitter
328305NM_006563.5(KLF1):c.*311A>GKLF1Uncertain significancecriteria provided, single submitter
328309NM_006563.5(KLF1):c.*263C>TKLF1Uncertain significancecriteria provided, multiple submitters, no conflicts
328318NM_006563.5(KLF1):c.478G>A (p.Glu160Lys)KLF1Uncertain significancecriteria provided, multiple submitters, no conflicts
328319NM_006563.5(KLF1):c.353G>A (p.Gly118Asp)KLF1Uncertain significancecriteria provided, single submitter
328324NM_006563.5(KLF1):c.255G>A (p.Pro85=)KLF1Uncertain significancecriteria provided, single submitter
889116NM_006563.5(KLF1):c.*312G>CKLF1Uncertain significancecriteria provided, single submitter
889805NM_006563.5(KLF1):c.*2C>TKLF1Uncertain significancecriteria provided, single submitter
891357NM_006563.5(KLF1):c.676G>A (p.Ala226Thr)KLF1Uncertain significancecriteria provided, single submitter
892551NM_006563.5(KLF1):c.409C>T (p.Arg137Cys)KLF1Uncertain significancecriteria provided, multiple submitters, no conflicts
931601NM_006563.5(KLF1):c.1079G>A (p.Arg360His)KLF1Uncertain significancecriteria provided, multiple submitters, no conflicts
891353NM_006563.5(KLF1):c.1036C>T (p.Arg346Cys)LOC117125591Uncertain significancecriteria provided, single submitter
891354NM_006563.5(KLF1):c.1016T>C (p.Phe339Ser)LOC117125591Uncertain significancecriteria provided, single submitter
3242138NM_006563.5(KLF1):c.587C>T (p.Pro196Leu)LOC130063673Uncertain significancecriteria provided, single submitter
260001NM_006563.5(KLF1):c.304T>C (p.Ser102Pro)KLF1Benigncriteria provided, multiple submitters, no conflicts
328303NM_006563.5(KLF1):c.*454C>GKLF1Benigncriteria provided, single submitter

Genes & proteins

Mendelian disease overlap and somatic drivers

GenCC: 8 · Orphanet: 4 · OMIM-shared: 0 · Dual-evidence (GWAS+Mendelian): 0

GenCC gene–disease validity (cohort genes)

the Disease column is the GenCC-asserted condition — a cohort gene’s strongest validity may be for a related predisposition syndrome.

GeneClassificationInheritanceDiseaseRecords
KLF1DefinitiveAutosomal dominantcongenital dyserythropoietic anemia type 48

Orphanet rare-disease linkage (cohort genes)

GeneOrphanet IDRare disease
KLF1Orphanet:251380Hereditary persistence of fetal hemoglobin-sickle cell disease syndrome
KLF1Orphanet:293825Congenital dyserythropoietic anemia type IV
KLF1Orphanet:46532Hereditary persistence of fetal hemoglobin-beta-thalassemia syndrome
GCDHOrphanet:25Glutaryl-CoA dehydrogenase deficiency

Cohort genes → proteins

2 cohort genes, 2 distinct canonical proteins.

Evidence partition

SubsetGenes
multi_evidence2

Cohort genes (full)

SymbolHGNCEnsemblUniProtNameEvidence
KLF1HGNC:6345ENSG00000105610Q13351Krueppel-like factor 1gencc,clinvar
GCDHHGNC:4189ENSG00000105607Q92947Glutaryl-CoA dehydrogenase, mitochondrialclinvar

Cohort function summary

Lead sentence per gene, UniProt-curated.

SymbolProtein nameFunction (lead sentence)
KLF1Krueppel-like factor 1Transcription regulator of erythrocyte development that probably serves as a general switch factor during erythropoiesis.
GCDHGlutaryl-CoA dehydrogenase, mitochondrialCatalyzes the oxidative decarboxylation of glutaryl-CoA to crotonyl-CoA and CO(2) in the degradative pathway of L-lysine, L-hydroxylysine, and L-tryptophan metabolism.

Protein-family classification

Druggable: 1 · Difficult: 1 · Unknown: 0 · Druggable fraction: 0.5

Family distribution

Cohort families vs a genome-wide background (hypergeometric, BH-FDR; fold = observed/expected). Counts kept; sorted by enrichment, so the catch-all Other/Unknown bucket no longer leads.

FamilyGenesFoldFDR
Enzyme (other)16.0×0.228
Transcription factor14.1×0.228

Per-gene assignment

SymbolFamilyDruggable?ECInterPro (top 3)
KLF1Transcription factornoZnf_C2H2_type, EKLF_TAD2, EKLF_TAD1
GCDHEnzyme (other)yes1.3.8.6Acyl-CoA_DH_CS, AcylCoA_DH/ox_M, AcylCo_DH/oxidase_C

Expression context

Cohort genes with no expression data: 0.

2 cohort genes are a single-cell marker in ≥1 SCXA experiment.

Breadth distribution (Bgee present_calls)

BucketGenes
narrow (1-5 tissues)0
moderate (6-20)0
broad (>20)2
unknown0

Top tissues across cohort

TissueCohort genes
blood1
bone marrow1
trabecular bone tissue1
apex of heart1
liver1
right lobe of liver1

Per-gene tissue summary (top 30)

SymbolBgee breadthFANTOM5 breadthSCXATop tissues
KLF186tissue_specificmarkertrabecular bone tissue, bone marrow, blood
GCDH259ubiquitousmarkerright lobe of liver, liver, apex of heart

Protein interactions among cohort

Intra-cohort edges: 0.

Hub genes (top 10 by interactor count)

SymbolInteractor count
GCDH2,573
KLF11,682

Structural data

PDB: 2 · AlphaFold-only: 0 · No structure: 0

Cohort genes with PDB structures (top 30)

SymbolUniProtPDB entries
GCDHQ929474
KLF1Q133513

Function

Pathway analysis

Distinct Reactome pathways touched by cohort: 1. Enrichment computed across 2 evidence-associated genes (1 with Reactome annotation).

Pathways by enrichment

Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 1 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.

PathwayCohort genesFoldFDRSample cohort genes
Lysine catabolism11142.0×9e-04GCDH

GO biological processes by enrichment

Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 2 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.

GO termCohort genesFoldFDRSample cohort genes
obsolete L-tryptophan metabolic process12808.7×0.004GCDH
fatty-acyl-CoA biosynthetic process1936.2×0.004GCDH
fatty acid beta-oxidation using acyl-CoA dehydrogenase1702.2×0.004GCDH
cellular response to endothelin1702.2×0.004KLF1
maternal process involved in female pregnancy1468.1×0.005KLF1
protein destabilization1145.3×0.012KLF1
erythrocyte differentiation1133.8×0.012KLF1
ubiquitin-dependent protein catabolic process137.1×0.037KLF1
regulation of DNA-templated transcription115.8×0.076KLF1
positive regulation of DNA-templated transcription114.0×0.077KLF1
regulation of transcription by RNA polymerase II15.8×0.164KLF1

Therapeutics

Drug target analysis

Approved (phase 4): 1 · Phase ≥3: 1 · Phased (≥1): 1 · Undrugged: 1

Druggability breadth: 2 of 2 evidence-associated genes (100%) have a ChEMBL target (buckets above are over the deeply-mined display cohort).

Genes with an approved drug

The molecule shown is one approved compound that hits the gene — not necessarily a drug of choice or one indicated for this disease.

SymbolExample approved molecule
GCDHBALSALAZIDE

Top cohort targets by molecule count

SymbolMoleculesMax phase
GCDH14
KLF100

Drugs targeting cohort genes (top 30)

MoleculeMax phaseTargets in cohort
BALSALAZIDE4GCDH

Bioactivity and enzyme data

Enzyme cohort genes (≥1 EC): 1.

Cohort genes with ChEMBL bioactivity (full, sorted by assay count)

SymbolAssaysType breakdown
GCDH15Binding:14, ADMET:1
KLF14Binding:4

Cohort enzymes (BRENDA EC)

SymbolEC numbersNames
GCDH1.3.8.6glutaryl-CoA dehydrogenase (ETF)

Pharmacogenomics

Cohort genes with a PharmGKB record: 2; with CPIC/DPWG dosing guidelines: 0.

No cohort gene has a CPIC/DPWG genotype-guided dosing guideline (PharmGKB).

Chemical tractability of cohort targets

1 approved/phased compounds have measured bioactivity against a cohort gene (and aren’t yet in disease-level trials). This is a research / tractability signal, NOT a therapeutic recommendation — a bioactivity row often reflects off-target or screening binding (e.g. promiscuous kinase inhibitors against a cohort kinase), implying no disease mechanism.

CompoundMax phaseCohort target (bioactivity)
BALSALAZIDE4GCDH

Druggability pyramid

Cohort genes binned by druggability tier (high → low):

TierDefinitionGenesSymbols
AApproved (phase 4 drug)1GCDH
BPhased (≥1) drug, not yet approved0
CDruggable family + PDB, no drug0
DDruggable family + AlphaFold only, no drug0
EDifficult family or no structure, no drug1KLF1

Undrugged target profiles

1 cohort genes are undrugged. Ranked by ‘starting-point quality’ (assay depth + drugged-partner adjacency).

SymbolChEMBL assaysDrugged partners (top 3)
KLF14

Clinical trials & evidence

Clinical trials

Clinical trials: 0.