Hereditary persistence of fetal hemoglobin-beta-thalassemia syndrome

disease
On this page

Also known as HPFH-beta-thalassemia syndrome

Summary

Hereditary persistence of fetal hemoglobin-beta-thalassemia syndrome (MONDO:0018749) is a disease caused by variants in HBG1 and HBG2, with 6 cohort genes. The dominant Reactome pathway is Factors involved in megakaryocyte development and platelet production (3 cohort genes).

At a glance

  • Prevalence: Unknown (Worldwide)
  • Causal genes: HBG1 (GenCC Strong), HBG2 (GenCC Strong)
  • Cohort genes: 6
  • ClinVar variants: 4
  • Phenotypes (HPO): 6

Clinical features

Signs & symptoms

Clinical features (HPO)

6 HPO clinical features (Orphanet curated; top 6 by frequency):

HPO IDTermFrequency
HP:0000980PallorVery frequent (80-99%)
HP:0001744SplenomegalyVery frequent (80-99%)
HP:0001903AnemiaVery frequent (80-99%)
HP:0011904Persistence of hemoglobin FVery frequent (80-99%)
HP:0002240HepatomegalyFrequent (30-79%)
HP:0003330Abnormal bone structureFrequent (30-79%)

Identifiers

Disease identifiers

FieldValue
Canonical namehereditary persistence of fetal hemoglobin-beta-thalassemia syndrome
Mondo IDMONDO:0018749
Orphanet46532
ICD-11418601307
UMLSC0271994
MedGen543715
GARD0018642
Is cancer (heuristic)no

Also known as: HPFH-beta-thalassemia syndrome

Data availability: 4 ClinVar variants · 6 GenCC gene-disease records · 4 cell lines.

Disease family

Classification path: disease › human disease › disease by body system or component › hematologic disorderanemiabeta-thalassemia and related diseaseshereditary persistence of fetal hemoglobin-beta-thalassemia syndrome

Related subtypes (6): beta-thalassemia-X-linked thrombocytopenia syndrome, delta-beta-thalassemia, hemoglobin C-beta-thalassemia syndrome, hemoglobin E-beta-thalassemia syndrome, hemoglobin Lepore-beta-thalassemia syndrome, beta thalassemia

Genetics & variants

GWAS landscape

No GWAS associations recorded — common-variant (GWAS) studies don’t cover this disease (typical for Mendelian / rare diseases). See the curated gene cohort and Mendelian overlap below.

Variant details and genetic-evidence tiers

ClinVar germline variants

4 retrieved; paginated sample, class counts are floors:

2 uncertain risk allele, 1 uncertain significance, 1 likely benign

ClinVarVariant (HGVS)GeneClassificationReview
4083487NM_015898.4(ZBTB7A):c.722C>T (p.Pro241Leu)ZBTB7AUncertain risk alleleno assertion criteria provided
4083489NM_015898.4(ZBTB7A):c.830AGG[2] (p.Glu279del)ZBTB7AUncertain risk alleleno assertion criteria provided
1048099NM_000559.3(HBG1):c.-29G>AHBG1Uncertain significancecriteria provided, single submitter
4083488NM_015898.4(ZBTB7A):c.1032C>T (p.Asp344=)ZBTB7ALikely benigncriteria provided, single submitter

Genes & proteins

Mendelian disease overlap and somatic drivers

GenCC: 55 · Orphanet: 40 · OMIM-shared: 0 · Dual-evidence (GWAS+Mendelian): 0

GenCC gene–disease validity (cohort genes)

the Disease column is the GenCC-asserted condition — a cohort gene’s strongest validity may be for a related predisposition syndrome.

GeneClassificationInheritanceDiseaseRecords
HBBDefinitiveSemidominantbeta-thalassemia HBB/LCRB33
ACSBG1StrongAutosomal dominanthereditary persistence of fetal hemoglobin-beta-thalassemia syndrome4
HBG1StrongAutosomal dominanthereditary persistence of fetal hemoglobin-beta-thalassemia syndrome4
HBG2StrongAutosomal dominanthereditary persistence of fetal hemoglobin-beta-thalassemia syndrome6
KLF1SupportiveAutosomal recessivehereditary persistence of fetal hemoglobin-sickle cell disease syndrome8

Orphanet rare-disease linkage (cohort genes)

GeneOrphanet IDRare disease
HBG1Orphanet:231237Delta-beta-thalassemia
HBG1Orphanet:251380Hereditary persistence of fetal hemoglobin-sickle cell disease syndrome
HBG1Orphanet:46532Hereditary persistence of fetal hemoglobin-beta-thalassemia syndrome
HBBOrphanet:2132Hemoglobin C disease
HBBOrphanet:2133Hemoglobin E disease
HBBOrphanet:231214Beta-thalassemia major
HBBOrphanet:231222Beta-thalassemia intermedia
HBBOrphanet:231226Unstable beta globin chain variant disease
HBBOrphanet:231237Delta-beta-thalassemia
HBBOrphanet:231242Hemoglobin C-beta-thalassemia syndrome
HBBOrphanet:231249Hemoglobin E-beta-thalassemia syndrome
HBBOrphanet:232Sickle cell anemia
HBBOrphanet:247511Autosomal dominant secondary polycythemia
HBBOrphanet:251365Sickle cell S-C disease
HBBOrphanet:251370Sickle cell S-D Punjab disease
HBBOrphanet:251375Sickle cell S-E disease
HBBOrphanet:251380Hereditary persistence of fetal hemoglobin-sickle cell disease syndrome
HBBOrphanet:330041Hemoglobin M disease
HBBOrphanet:46532Hereditary persistence of fetal hemoglobin-beta-thalassemia syndrome
HBBOrphanet:695140Sickle cell-beta zero-thalassemia
HBBOrphanet:695147Sickle cell-beta plus-thalassemia
HBBOrphanet:699822Sickle cell S-Lepore disease
HBBOrphanet:700090Sickle cell S-O Arab disease
HBBOrphanet:700107Sickle cell S-other specified hemoglobin variant
HBBOrphanet:700111Homozygous hemoglobin O Arab disease
HBBOrphanet:715125Hemoglobin E-beta-thalassemia intermedia
HBBOrphanet:715128Hemoglobin E-beta-thalassemia major
HBBOrphanet:715135Hemoglobin Lepore-beta-thalassemia intermedia
HBBOrphanet:715140Hemoglobin Lepore-beta-thalassemia major
HBBOrphanet:715143Heterozygous beta-thalassemia intermedia with supernumerary alpha-globin gene
HBBOrphanet:715157Low oxygen affinity beta chain hemoglobin disease
HBBOrphanet:90039Hemoglobin D disease
HBG2Orphanet:251380Hereditary persistence of fetal hemoglobin-sickle cell disease syndrome
HBG2Orphanet:280615Low oxygen affinity gamma chain hemoglobin disease
HBG2Orphanet:46532Hereditary persistence of fetal hemoglobin-beta-thalassemia syndrome
HBG2Orphanet:707792Unstable gamma globin chain variant disease
KLF1Orphanet:251380Hereditary persistence of fetal hemoglobin-sickle cell disease syndrome
KLF1Orphanet:293825Congenital dyserythropoietic anemia type IV
KLF1Orphanet:46532Hereditary persistence of fetal hemoglobin-beta-thalassemia syndrome
ZBTB7AOrphanet:694956Intellectual disability-lymphoid hypertrophy-macrocephaly syndrome

Cohort genes → proteins

6 cohort genes, 6 distinct canonical proteins.

Evidence partition

SubsetGenes
multi_evidence6

Cohort genes (full)

SymbolHGNCEnsemblUniProtNameEvidence
ACSBG1HGNC:29567ENSG00000103740Q96GR2Long-chain-fatty-acid–CoA ligase ACSBG1gencc,clinvar
HBG1HGNC:4831ENSG00000213934P69891Hemoglobin subunit gamma-1gencc,clinvar
HBBHGNC:4827ENSG00000244734P68871Hemoglobin subunit betagencc
HBG2HGNC:4832ENSG00000196565P69892Hemoglobin subunit gamma-2gencc
KLF1HGNC:6345ENSG00000105610Q13351Krueppel-like factor 1gencc
ZBTB7AHGNC:18078ENSG00000178951O95365Zinc finger and BTB domain-containing protein 7Aclinvar

Cohort function summary

Lead sentence per gene, UniProt-curated.

SymbolProtein nameFunction (lead sentence)
ACSBG1Long-chain-fatty-acid–CoA ligase ACSBG1Catalyzes the conversion of fatty acids such as long-chain and very long-chain fatty acids to their active form acyl-CoAs for both synthesis of cellular lipids, and degradation via beta-oxidation.
HBG1Hemoglobin subunit gamma-1Gamma chains make up the fetal hemoglobin F, in combination with alpha chains.
HBBHemoglobin subunit betaInvolved in oxygen transport from the lung to the various peripheral tissues.
HBG2Hemoglobin subunit gamma-2Gamma chains make up the fetal hemoglobin F, in combination with alpha chains.
KLF1Krueppel-like factor 1Transcription regulator of erythrocyte development that probably serves as a general switch factor during erythropoiesis.
ZBTB7AZinc finger and BTB domain-containing protein 7ATranscription factor that represses the transcription of a wide range of genes involved in cell proliferation and differentiation.

Protein-family classification

Druggable: 0 · Difficult: 2 · Unknown: 4 · Druggable fraction: 0.0

Family distribution

Cohort families vs a genome-wide background (hypergeometric, BH-FDR; fold = observed/expected). Counts kept; sorted by enrichment, so the catch-all Other/Unknown bucket no longer leads.

FamilyGenesFoldFDR
Transcription factor22.8×0.316
Other/Unknown41.2×0.458

Per-gene assignment

SymbolFamilyDruggable?ECInterPro (top 3)
ACSBG1Other/UnknownnoAMP-dep_synth/lig_dom, AMP-binding_CS, ANL_N_sf
HBG1Other/UnknownnoGlobin, Hemoglobin_b, Globin-like_sf
HBBOther/UnknownnoGlobin, Hemoglobin_b, Globin-like_sf
HBG2Other/UnknownnoGlobin, Hemoglobin_b, Globin-like_sf
KLF1Transcription factornoZnf_C2H2_type, EKLF_TAD2, EKLF_TAD1
ZBTB7ATranscription factornoBTB/POZ_dom, SKP1/BTB/POZ_sf, Znf_C2H2_type

Expression context

Cohort genes with no expression data: 0.

6 cohort genes are a single-cell marker in ≥1 SCXA experiment.

Breadth distribution (Bgee present_calls)

BucketGenes
narrow (1-5 tissues)0
moderate (6-20)0
broad (>20)6
unknown0

Top tissues across cohort

TissueCohort genes
blood2
placenta2
trabecular bone tissue2
C1 segment of cervical spinal cord1
inferior olivary complex1
upper leg skin1
male germ line stem cell (sensu Vertebrata) in testis1
monocyte1
vena cava1
adrenal tissue1
ganglionic eminence1
bone marrow1
buccal mucosa cell1
nipple1
tendon of biceps brachii1

Per-gene tissue summary (top 30)

SymbolBgee breadthFANTOM5 breadthSCXATop tissues
ACSBG1207broadmarkerupper leg skin, inferior olivary complex, C1 segment of cervical spinal cord
HBG1121tissue_specificmarkerplacenta, blood, male germ line stem cell (sensu Vertebrata) in testis
HBB284broadmarkermonocyte, trabecular bone tissue, vena cava
HBG2129tissue_specificmarkerplacenta, adrenal tissue, ganglionic eminence
KLF186tissue_specificmarkertrabecular bone tissue, bone marrow, blood
ZBTB7A281ubiquitousmarkerbuccal mucosa cell, tendon of biceps brachii, nipple

Protein interactions among cohort

Intra-cohort edges: 1.

Hub genes (top 10 by interactor count)

SymbolInteractor count
ZBTB7A1,737
ACSBG11,697
KLF11,682
HBB454
HBG266
HBG133

Intra-cohort edges

ABSources
HBG1HBG2intact

Structural data

PDB: 5 · AlphaFold-only: 1 · No structure: 0

Cohort genes with PDB structures (top 30)

SymbolUniProtPDB entries
HBBP68871350
ZBTB7AO9536511
HBG2P698924
KLF1Q133513
HBG1P698912

AlphaFold-only cohort genes (top 30 by pLDDT)

SymbolUniProtpLDDT
ACSBG1Q96GR284.99

Function

Pathway analysis

Distinct Reactome pathways touched by cohort: 15. Enrichment computed across 6 evidence-associated genes (4 with Reactome annotation).

Pathways by enrichment

Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 4 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.

PathwayCohort genesFoldFDRSample cohort genes
Factors involved in megakaryocyte development and platelet production349.8×2e-04HBG1, HBB, HBG2
Heme assimilation1951.7×0.008HBB
Erythrocytes take up oxygen and release carbon dioxide1317.2×0.014HBB
Erythrocytes take up carbon dioxide and release oxygen1219.6×0.014HBB
Scavenging of heme from plasma1219.6×0.014HBB
Chaperone Mediated Autophagy1124.1×0.017HBB
Synthesis of very long-chain fatty acyl-CoAs1114.2×0.017ACSBG1
Fatty acyl-CoA biosynthesis1109.8×0.017ACSBG1
Late endosomal microautophagy181.6×0.020HBB
Heme signaling153.9×0.028HBB
Cytoprotection by HMOX1146.0×0.029HBB
Fatty acid metabolism132.8×0.038ACSBG1
Metabolism of lipids17.9×0.140ACSBG1
Neutrophil degranulation15.8×0.174HBB
Metabolism12.9×0.302ACSBG1

GO biological processes by enrichment

Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 6 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.

GO termCohort genesFoldFDRSample cohort genes
carbon dioxide transport3648.1×3e-07HBG1, HBB, HBG2
oxygen transport3526.6×3e-07HBG1, HBB, HBG2
erythrocyte development3263.3×2e-06HBG1, HBB, HBG2
regulation of transcription regulatory region DNA binding1702.2×0.016ZBTB7A
nitric oxide transport1561.7×0.016HBB
cellular oxidant detoxification1312.1×0.020HBB
renal absorption1280.9×0.020HBB
double-strand break repair via classical nonhomologous end joining1280.9×0.020ZBTB7A
obsolete regulation of DNA-binding transcription factor activity1255.3×0.020ZBTB7A
cellular response to endothelin1234.1×0.020KLF1
regulation of glycolytic process1200.6×0.021ZBTB7A
maternal process involved in female pregnancy1156.0×0.022KLF1
negative regulation of androgen receptor signaling pathway1156.0×0.022ZBTB7A
long-chain fatty-acyl-CoA biosynthetic process1140.4×0.022ACSBG1
erythrocyte maturation1140.4×0.022ZBTB7A
very long-chain fatty acid metabolic process1127.7×0.022ACSBG1
hydrogen peroxide catabolic process1112.3×0.023HBB
long-chain fatty acid metabolic process1104.0×0.023ACSBG1
blood vessel diameter maintenance1104.0×0.023HBB
response to hydrogen peroxide178.0×0.028HBB
positive regulation of nitric oxide biosynthetic process175.9×0.028HBB
long-chain fatty acid biosynthetic process173.9×0.028ACSBG1
negative regulation of Notch signaling pathway172.0×0.028ZBTB7A
protein localization to nucleus158.5×0.033ZBTB7A
platelet aggregation156.2×0.033HBB
response to glucocorticoid154.0×0.033ACSBG1
protein destabilization148.4×0.035KLF1
erythrocyte differentiation144.6×0.037KLF1
myelination141.9×0.037ACSBG1
regulation of alternative mRNA splicing, via spliceosome140.7×0.037ZBTB7A

Therapeutics

Drug target analysis

Approved (phase 4): 1 · Phase ≥3: 1 · Phased (≥1): 1 · Undrugged: 5

Druggability breadth: 5 of 6 evidence-associated genes (83%) have a ChEMBL target (buckets above are over the deeply-mined display cohort).

Genes with an approved drug

The molecule shown is one approved compound that hits the gene — not necessarily a drug of choice or one indicated for this disease.

SymbolExample approved molecule
HBBCANDESARTAN CILEXETIL

Top cohort targets by molecule count

SymbolMoleculesMax phase
HBB234
ACSBG100
HBG100
HBG200
KLF100
ZBTB7A00

Drugs targeting cohort genes (top 30)

MoleculeMax phaseTargets in cohort
CANDESARTAN CILEXETIL4HBB
MECHLORETHAMINE HYDROCHLORIDE4HBB
PHENAZOPYRIDINE HYDROCHLORIDE4HBB
MERCAPTOPURINE ANHYDROUS4HBB
AZACITIDINE4HBB
AZATHIOPRINE4HBB
TOPOTECAN HYDROCHLORIDE4HBB
ACYCLOVIR4HBB
FLUOROURACIL4HBB
RAUWOLFIA SERPENTINA4HBB
HYDROQUINONE4HBB
MENADIONE4HBB
THIOTEPA4HBB
THIOGUANINE4HBB
RESERPINE4HBB
CURCUMIN3HBB
HYDROXYCAMPTOTHECIN3HBB
MOLIBRESIB2HBB
FISETIN2HBB
TEROXIRONE2HBB
5-FLUOROURIDINE2HBB
ELLAGIC ACID2HBB
BAICALEIN2HBB

Bioactivity and enzyme data

Enzyme cohort genes (≥1 EC): 0.

Cohort genes with ChEMBL bioactivity (full, sorted by assay count)

SymbolAssaysType breakdown
HBB68Binding:50, Functional:18
KLF14Binding:4
ZBTB7A2Binding:2
HBG11Binding:1
HBG21Binding:1

Pharmacogenomics

Cohort genes with a PharmGKB record: 6; with CPIC/DPWG dosing guidelines: 0.

No cohort gene has a CPIC/DPWG genotype-guided dosing guideline (PharmGKB).

Chemical tractability of cohort targets

23 approved/phased compounds have measured bioactivity against a cohort gene (and aren’t yet in disease-level trials). This is a research / tractability signal, NOT a therapeutic recommendation — a bioactivity row often reflects off-target or screening binding (e.g. promiscuous kinase inhibitors against a cohort kinase), implying no disease mechanism.

CompoundMax phaseCohort target (bioactivity)
CANDESARTAN CILEXETIL4HBB
MECHLORETHAMINE HYDROCHLORIDE4HBB
PHENAZOPYRIDINE HYDROCHLORIDE4HBB
MERCAPTOPURINE ANHYDROUS4HBB
AZACITIDINE4HBB
AZATHIOPRINE4HBB
TOPOTECAN HYDROCHLORIDE4HBB
ACYCLOVIR4HBB
FLUOROURACIL4HBB
RAUWOLFIA SERPENTINA4HBB
HYDROQUINONE4HBB
MENADIONE4HBB
THIOTEPA4HBB
THIOGUANINE4HBB
RESERPINE4HBB
CURCUMIN3HBB
HYDROXYCAMPTOTHECIN3HBB
MOLIBRESIB2HBB
FISETIN2HBB
TEROXIRONE2HBB
5-FLUOROURIDINE2HBB
ELLAGIC ACID2HBB
BAICALEIN2HBB

Druggability pyramid

Cohort genes binned by druggability tier (high → low):

TierDefinitionGenesSymbols
AApproved (phase 4 drug)1HBB
BPhased (≥1) drug, not yet approved0
CDruggable family + PDB, no drug0
DDruggable family + AlphaFold only, no drug0
EDifficult family or no structure, no drug5ACSBG1, HBG1, HBG2, KLF1, ZBTB7A

Undrugged target profiles

5 cohort genes are undrugged. Ranked by ‘starting-point quality’ (assay depth + drugged-partner adjacency).

SymbolChEMBL assaysDrugged partners (top 3)
ACSBG10
HBG11
HBG21
KLF14
ZBTB7A2

Clinical trials & evidence

Clinical trials

Clinical trials: 0.