HBB Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human HBB — a definitive lookup resource covering: ### Section …

Provide a comprehensive cross-database identifier and functional mapping reference for human HBB — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene HBB, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene HBB, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene HBB protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene HBB protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene HBB, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene HBB, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene HBB, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene HBB protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene HBB, summarize transcription factor regulatory data. If HBB is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate HBB — names with evidence type (ChIP-seq / predicted / experimentally validated) If HBB is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene HBB protein as a drug target, summarize pharmacology data. If HBB is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If HBB is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene HBB, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene HBB, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in HBB: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations

HBB

Executive summary

HBB (Beta-Globin, HGNC:4827) encodes the beta subunit of adult hemoglobin, the principal oxygen-carrying protein in red blood cells, and is one of the most clinically significant genes in medicine. Its primary interacting partner is HBA1 (alpha-globin), and together they form the functional hemoglobin tetramer responsible for oxygen and carbon dioxide transport. HBB harbors over 1,700 catalogued variants in ClinVar, with 230+ classified as pathogenic or likely pathogenic, causing a spectrum of hemoglobinopathies including beta-thalassemia (primarily deletions and splice mutations) and sickle cell disease (missense variants); these conditions together drive 967 and 214 active clinical trials respectively, with emerging gene therapy approaches such as CTX001 in late-stage development. GWAS data reveal that HBB variants confer extraordinarily strong protection against severe malaria (p < 10⁻⁵⁵), reflecting one of the most powerful known examples of natural selection. The gene is regulated by 71 transcription factors, prominently KLF1 and GATA1, and is expressed at near-maximum levels across blood-forming and immune tissues, serving as a lineage-defining marker across 46 single-cell atlas experiments.

Gene identifiers

HBB (Beta-Globin)

IdentifierID
HGNC IDHGNC:4827
Approved SymbolHBB
Ensembl Gene IDENSG00000244734
NCBI Entrez Gene ID3043
OMIM Gene/Locus ID141900

Genomic Location (GRCh38/hg38)

  • Chromosome: 11
  • Start: 5,225,464
  • End: 5,229,395
  • Strand: − (minus/reverse)

Transcript identifiers

Ensembl transcripts (7 total)

ENST IDBiotype
ENST00000335295protein_coding
ENST00000380315protein_coding
ENST00000475226retained_intron
ENST00000485743protein_coding
ENST00000633227nonsense_mediated_decay
ENST00000647020protein_coding
ENST00000883533protein_coding

RefSeq transcripts

NM accessionStatusMANE Select
NM_000518REVIEWED
NM_033234PROVISIONAL

CCDS

  • CCDS7753

Canonical/MANE Select transcript exons (ENST00000335295)

Total exons: 3

Exon IDStartEndChromosome
ENSE000016006135225464522572611
ENSE000010573815226577522679911
ENSE000018298675226930522707111

Protein identifiers

UniProt Accessions

  • P68871 (reviewed/canonical) - Hemoglobin subunit beta
  • A0A0J9YWK4 (unreviewed isoform)
  • A0A2R8Y7R2 (unreviewed isoform)
  • D9YZU5 (unreviewed isoform)
  • F8W6P5 (unreviewed isoform)

RefSeq Protein Identifiers

  • NP_000509 (REVIEWED, MANE Select)

Protein Domains and Families

InterPro:

  • IPR000971 | Globin | Domain
  • IPR002337 | Hemoglobin, beta-type | Family
  • IPR009050 | Globin-like superfamily | Homologous_superfamily
  • IPR012292 | Globin/Protoglobin | Homologous_superfamily
  • IPR050056 | Hemoglobin and related oxygen transporters | Family

Pfam:

  • PF00042 | Globin | Domain

PANTHER:

  • PTHR11442 | Hemoglobin beta globins | Family
  • PTHR11442:SF42 | Hemoglobin beta-1 | Subfamily

CDD (Conserved Domain Database):

  • CD08925 | Globin/related proteins | Domain

CATHGENE3D:

  • 1.10.490.10 | Globin domain | Superfamily

Antibody Availability

No antibody resources identified in biobtree for HBB protein.

Structure

Experimental Structures

Total PDB Entries: 100 unique structures

X-ray Crystallography

Most structures (majority of the 100 entries). Resolution range: 1.07 Å to 4.5 Å. Notable entries include:

  • 2W72: 1.07 Å (highest resolution)
  • 2DN1, 2DN2, 2DN3, 1IRD: 1.25 Å
  • 7DY4: 1.3 Å
  • 7DY3: 1.4 Å
  • Multiple structures with 1.5-1.9 Å resolutions
  • Various mutants and complexes at 2.0-3.3 Å

Key wild-type entries:

  • 1HHB, 2HHB, 3HHB, 4HHB: Deoxyhemoglobin (1.74 Å)
  • 1HHO: Oxyhemoglobin (2.1 Å)
  • 2HCO, 1HCO: Carbonmonoxyhemoglobin (2.7 Å)

Cryo-EM / Electron Microscopy

  • 5NI1: 3.2 Å
  • 6NBC, 6NBD: 2.8-3.2 Å (methemoglobin states)
  • 8WJ0, 8WJ1, 8WJ2: 2.24-2.35 Å (carbonmonoxy, oxy, deoxy forms)
  • 9CQM-9CQW: 2.37-3.01 Å (multiple states)
  • 7VDE, 7XGY: 3.5-3.6 Å
  • 9VIB, 9VIC: 2.26-2.45 Å (R2 and R states)
  • Higher resolution complexes: 7PCH (2.89 Å), 8XMP (3.11 Å)

Neutron Diffraction

  • 2DXM: 2.1 Å (deoxyhemoglobin)
  • 3KMF: 2.0 Å (room temperature TOF)

Solution NMR

  • 2H35: Solution structure (no resolution listed)
  • 2M6Z: Refined solution structure, carbonmonoxy form (no resolution listed)

Predicted Structures

AlphaFold Model:

  • Model ID: P68871
  • pLDDT (per-residue confidence): 97.09 (global metric)
  • Confidence distribution: 97% of residues with very high confidence (pLDDT > 70)
  • Predicted to be highly reliable across the full structure

Cross-species orthologs

OrganismGene IDSymbol
Mouse (Mus musculus)ENSMUSG00000052305Hbb-bs
Rat (Rattus norvegicus)ENSRNOG00000081310Hbb
Zebrafish (Danio rerio)ENSDARG00000045142hbae5
Fruit fly (Drosophila melanogaster)FBGN0027657glob1
Worm (C. elegans)WBGENE00008996glb-14
Yeast (S. cerevisiae)none

Clinical variants & AI predictions

ClinVar Overview

ClassificationCount
Pathogenic~150
Likely Pathogenic~80
Uncertain Significance~100
Conflicting~30
Likely Benign~1,200
Benign~150
Total~1,706

Top Pathogenic/Likely Pathogenic ClinVar Variants

ClinVar IDHGVS NotationClassificationAssociated Condition
1048666NC_000011.10:g.5224303_5227790delPathogenicβ-Thalassemia
1074556NC_000011.9:g.(?_5247393)_5248785delPathogenicβ-Thalassemia
1074558NC_000011.9:g.(?_5247493)_5248852delPathogenicβ-Thalassemia
1074559NC_000011.9:g.(?_5239576)_5247294delPathogenicβ-Thalassemia
1074561NC_000011.9:g.(?_5246696)_5247105delPathogenicβ-Thalassemia
1076533NC_000011.10:g.5226755_5227283delPathogenicβ-Thalassemia
1065151NM_000518.5(HBB):c.233_234del (p.His78fs)Likely Pathogenicβ-Thalassemia
1098798NM_000518.5(HBB):c.126dup (p.Phe43fs)Likely PathogenicSickle Cell Disease

Note: Additional ~140 pathogenic/likely pathogenic variants exist; most are splice site mutations and deletions associated with β-thalassemia and sickle cell disease phenotypes.

AlphaMissense Predictions

MetricCount
Total Predictions145
Likely Pathogenic~85

Top 30 AlphaMissense Likely-Pathogenic Variants

Protein VariantPathogenicity ScorePredicted Effect
Y146H0.998Likely Pathogenic
Y146D0.985Likely Pathogenic
A139D0.985Likely Pathogenic
A141P0.990Likely Pathogenic
L142P0.990Likely Pathogenic
H147D0.971Likely Pathogenic
Y146G0.976Likely Pathogenic
Y146N0.978Likely Pathogenic
K133I0.979Likely Pathogenic
A139P0.985Likely Pathogenic
A143D0.981Likely Pathogenic
V138E0.973Likely Pathogenic
V134E0.975Likely Pathogenic
K133N0.975Likely Pathogenic
K133E0.965Likely Pathogenic
A143P0.984Likely Pathogenic
H147N0.898Likely Pathogenic
L142Q0.989Likely Pathogenic
L142R0.974Likely Pathogenic
A141V0.928Likely Pathogenic
K145N0.924Likely Pathogenic
A141D0.986Likely Pathogenic
H147R0.874Likely Pathogenic
H147L0.874Likely Pathogenic
H147Q0.941Likely Pathogenic
A139V0.833Likely Pathogenic
L142V0.767Likely Pathogenic
K145E0.762Likely Pathogenic
V135L0.754Likely Pathogenic
V135M0.623Likely Pathogenic

SpliceAI Predictions

MetricCount
Total Predictions~260
High-Confidence Splice Effects~80+

Top Splice-Altering Variants (SpliceAI Score ≥0.8)

VariantEffectScore
11:5226576:CCCTG:CDonor Gain1.00
11:5226575:A:ACDonor Gain1.00
11:5226575:AC:ADonor Gain1.00
11:5226575:ACC:ADonor Gain1.00
11:5226576:CCCT:CDonor Gain1.00
11:5226576:CCC:CDonor Gain1.00
11:5226576:C:CCDonor Gain1.00
11:5226576:C:CADonor Gain1.00
11:5226573:TCAC:TDonor Loss1.00
11:5226574:CA:CDonor Loss1.00
11:5226572:CTCA:CDonor Gain0.92
11:5226571:ACTC:ADonor Loss0.92
11:5226800:C:CCAcceptor Gain0.99
11:5226806:G:CAcceptor Gain0.99
11:5226798:GC:GAcceptor Gain0.97
11:5226795:GCAGC:GAcceptor Gain0.97
11:5226796:CAGC:CAcceptor Gain0.98
11:5226723:GGAGC:GAcceptor Loss0.98
11:5226725:AGCT:AAcceptor Loss0.98
11:5226726:GCTG:GAcceptor Loss0.98
11:5226728:T:AAcceptor Loss0.98
11:5226564:C:ADonor Gain0.88
11:5226569:AGACT:ADonor Gain0.79
11:5226722:GGAG:GAcceptor Gain0.97
11:5226723:GGAG:GAcceptor Gain0.97
11:5226912:TGTA:TDonor Gain0.98
11:5226563:T:TADonor Gain0.75
11:5226799:CC:CAcceptor Gain0.97
11:5226814:A:CAcceptor Gain0.92
11:5225723:GGAG:GAcceptor Gain0.97

Summary: HBB contains ~1,700 catalogued variants with 230+ classified as pathogenic/likely pathogenic. AlphaMissense predicts ~85 likely-pathogenic missense variants, predominantly at C-terminus residues (Y146, H147, A139-A143, K133-K145). SpliceAI identifies 260 splice-affecting variants, with 80+ high-confidence predictions (score ≥0.8), concentrated in intron 2 and exon 3 boundaries. Variant burden reflects known disease associations: β-thalassemia (primarily deletions & splice mutations) and sickle cell disease (missense variants in coding sequence).

Pathways & Gene Ontology

Reactome Pathways (10 total)

Pathway IDPathway Name
R-HSA-1237044Erythrocytes take up carbon dioxide and release oxygen
R-HSA-1247673Erythrocytes take up oxygen and release carbon dioxide
R-HSA-2168880Scavenging of heme from plasma
R-HSA-6798695Neutrophil degranulation
R-HSA-9613829Chaperone Mediated Autophagy
R-HSA-9615710Late endosomal microautophagy
R-HSA-9707564Cytoprotection by HMOX1
R-HSA-9707616Heme signaling
R-HSA-983231Factors involved in megakaryocyte development and platelet production
R-HSA-9927020Heme assimilation

MSigDB Gene Sets (100 total)

100 gene sets in Molecular Signatures Database including Reactome pathways (CP), Gene Ontology terms (GO:BP, MF, CC), canonical pathways, and human phenotype terms.

Notable sets include:

  • M17493 | GOCC_HEMOGLOBIN_COMPLEX (12 genes)
  • M26254 | GOMF_OXYGEN_CARRIER_ACTIVITY (14 genes)
  • M26516 | GOMF_HEMOGLOBIN_BINDING (9 genes)
  • M19553 | BIOCARTA_AHSP_PATHWAY - Hemoglobin’s Chaperone (13 genes)

Gene Ontology: Biological Process (13 terms)

GO IDTerm
GO:0015671oxygen transport
GO:0015670carbon dioxide transport
GO:0048821erythrocyte development
GO:0008217regulation of blood pressure
GO:0030185nitric oxide transport
GO:0006954inflammatory response
GO:0045429positive regulation of nitric oxide biosynthetic process
GO:0070527platelet aggregation
GO:0098869cellular oxidant detoxification
GO:0042542response to hydrogen peroxide
GO:0042744hydrogen peroxide catabolic process
GO:0097746blood vessel diameter maintenance
GO:0070293renal absorption

Gene Ontology: Molecular Function (6 terms)

GO IDTerm
GO:0005344oxygen carrier activity
GO:0019825oxygen binding
GO:0020037heme binding
GO:0046872metal ion binding
GO:0030492hemoglobin binding
GO:0031721hemoglobin alpha binding

Gene Ontology: Cellular Component (10 terms)

GO IDTerm
GO:0005833hemoglobin complex
GO:0005576extracellular region
GO:0005615extracellular space
GO:0005829cytosol
GO:0031838haptoglobin-hemoglobin complex
GO:0070062extracellular exosome
GO:0072562blood microparticle
GO:0071682endocytic vesicle lumen
GO:1904724tertiary granule lumen
GO:1904813ficolin-1-rich granule lumen

Protein interactions & networks

Total Interaction Count:

  • BioGRID: ~236 interactions
  • IntAct: ~215 interactions
  • STRING: Not available
  • Combined: ~450+ protein-protein interactions

TOP 30 Highest-Confidence Interacting Proteins (IntAct):

RankPartner ProteinInteraction TypeConfidence Score
1HBA1 (Hemoglobin alpha 1)Direct interaction0.970
2HBA1Direct interaction0.970
3HBA1Direct interaction0.970
4HBA1Direct interaction0.970
5HBA1Direct interaction0.970
6HBA1Direct interaction0.970
7HBA1Direct interaction0.970
8HBA1Direct interaction0.970
9HBA1Direct interaction0.970
10HBA1Direct interaction0.970
11HBA1Direct interaction0.970
12HBA1Direct interaction0.970
13HBA1Direct interaction0.970
14HBA1Direct interaction0.970
15HBA1Direct interaction0.970
16HBA1Direct interaction0.970
17HBA1Direct interaction0.970
18HBA1Direct interaction0.970
19HBZ (Hemoglobin zeta)Physical association0.860
20HBZPhysical association0.860
21HBZPhysical association0.860
22HBZPhysical association0.860
23HBM (Hemoglobin M)Physical association0.560
24HBMPhysical association0.560
25HBMPhysical association0.560
26HBQ1 (Hemoglobin theta 1)Physical association0.560
27HBQ1Physical association0.560
28HBQ1Physical association0.560
29MAPK6Association0.840
30MED4Association0.900

Structural/Embedding Similarity (ESM2) – TOP 20:

RankProtein IDTop SimilarityAvg Similarity
1-10P68872, P68873, P68232-P68234, P68224-P68225, P68054-P680551.00000.9992-0.9994
11-20P61772-P61775, P67819-P678241.00000.9986-0.9993

Note: Top 20 ESM2 matches are hemoglobin variants with near-perfect similarity (0.9986-1.0000), indicating nearly identical embedding representations.

Sequence Homology (DIAMOND) – TOP 20:

RankProtein IDIdentity (%)Bitscore
1D0VX08100.0298
2P02030100.0302
3P02084100.0298
4P02085100.0298
5P07415100.0301
6P14391100.0298
7P0202999.3301
8P0202699.3302
9P0203599.3300
10P0204099.3300
11P0204299.3301
12P1089399.3300
13P1544999.3300
14P1439299.3296
15P0202499.3303
16P0424498.0293
17P0202598.6299
18P0202898.6298
19P0203198.6300
20P0203298.6299

Note: Top homologous matches are other hemoglobin variants with 98.6-100% sequence identity, representing orthologs and variants across species.

Transcription factor regulatory data

HBB is not a transcription factor. HBB (hemoglobin subunit beta) is a structural protein component of hemoglobin, not a DNA-binding transcription factor. Skipping downstream targets and DNA binding motifs.

Upstream regulators: Transcription factors regulating HBB

Total identified regulators: 71 (from CollecTRI database)

Top regulators by evidence support:

RankTFEvidence TypeRegulationSources
1KLF1Experimentally validatedUnknownExTRI, TRRUST, SIGNOR, GEREDB, NTNU Curated
2GATA1Experimentally validatedUnknownExTRI, HTRI, TRRUST, NTNU Curated, DoRothEA_A
3NFE2Experimentally validatedActivationExTRI, SIGNOR, GEREDB, NTNU Curated
4BACH1Experimentally validatedActivationExTRI, GOA, GEREDB, NTNU Curated
5DLX4Experimentally validatedUnknownExTRI, HTRI
6USF1Experimentally validatedUnknownExTRI, NTNU Curated
7MAFKExperimentally validatedUnknownExTRI
8NFE2L2Experimentally validatedUnknownExTRI, TRRUST
9MYCExperimentally validatedUnknownExTRI, GEREDB, HTRI, NTNU Curated
10GTF2IExperimentally validatedUnknownExTRI
11TBPExperimentally validatedUnknownExTRI, GEREDB
12KLF2Experimentally validatedActivationExTRI, GEREDB
13STAT5AExperimentally validatedActivationSIGNOR
14POU2F2Predicted/inferredUnknownExTRI, TRRUST, GEREDB
15ESR1Predicted/inferredUnknownExTRI

Additional regulators (16-71): AP1, AR, BACH2, BCL11A, CEBPB, CEBPD, CEBPG, CTCF, EGF, EHMT2, EIF2AK1, EPAS1, EPO, ESR2, ETV6, FGF2, FLI1, GLI3, HMGA1, HIF1A, HLTF, HOXC13, JUN, KLF8, KLF11, LMO2, MAPK11, MAPK14, MIB, MYB, NFE2L1, NFE2L3, NR1I2, NR2F2, NFYA, NFYB, NFYC, PAX1, POU2F1, RBPJ, SMARCdA1, SP1, SP2, SPI1, SUZ12, TGFB1, TFAP2A, TFCP2, TPA2, TP53, USF2, ZNF354C, ZNF362, ZFPM1

Drug & pharmacology data

HBB protein status as therapeutic target: HBB (beta-globin) is not a traditional “druggable” protein target in the sense of direct inhibitors or activators, but rather is central to hemoglobinopathy treatment where compounds indirectly affect hemoglobin function or compensate for defective beta-globin. Most “HBB-targeting” compounds in ChEMBL (CHEMBL4331) actually refer to in vitro screening hits and compounds tested for effects on hemoglobin, rather than clinical therapeutics specifically designed to modulate HBB.

Molecules with HBB activity: ChEMBL reports 255 molecules with recorded activity against CHEMBL4331 (Hemoglobin subunit beta), though the majority are research compounds with Phase 0 status (in vitro screening only).

Top approved molecules (Phase 4) with recorded HBB activity:

  • Azacitidine (CHEMBL1489) — epigenetic modifier for hemoglobinopathies
  • Hydroxyurea (reference compound) — fetal hemoglobin inducer
  • Mercaptopurine (CHEMBL1425), Thioguanine (CHEMBL727), Azathioprine (CHEMBL1542)
  • Fluorouracil (CHEMBL185), Acyclovir (CHEMBL184), Topotecan (CHEMBL1607)
  • Others recorded: Phenazopyridine, Menadione, Hydroquinone, Thiotepa, Reserpine

Clinical trials for hemoglobinopathies (sickle cell & beta-thalassemia):

  • Sickle Cell Disease (MONDO:0011382): 967 clinical trials total, including major Phase 3 programs:

    • Voxelotor (GBT440) — Phase 3 completed, mechanism: allosteric hemoglobin modifier
    • Crizanlizumab — P-selectin antagonist, Phase 3
    • CTX001 — CRISPR gene therapy, Phase 2/3
    • Etavopivat (AG-348) — pyruvate kinase activator, Phase 3
    • Luspatercept — late-stage erythroid maturation regulator
  • Beta-Thalassemia (MONDO:0019402): 214 clinical trials, including:

    • CTX001 gene therapy — Phase 2/3 ongoing
    • Luspatercept — Phase 2/3 trials
    • Etavopivat — Phase 2/3 program
    • Gene editing approaches (EDIT-301, BB305 lentiviral vectors) — Phase 1/2

Pharmacogenomics: No specific HBB-drug interaction guidelines identified in PharmGKB. However, sickle cell & thalassemia are themselves pharmacogenomic conditions — HBB mutations (>1,700 variants in ClinVar) determine disease manifestation and drug response (e.g., hydroxyurea efficacy correlates with ability to induce fetal hemoglobin). Dosing of iron chelators (deferasirox, deferoxamine) is weight-based without HBB-specific adjustments.

Expression profiles

Tissue Expression

HBB shows ubiquitous expression across 284 of 288 tissue conditions in Bgee, with an average expression score of 93.6 and maximum score of 100. Expression is particularly enriched in blood-forming tissues.

RankTissueExpression ScoreQuality
1Vena cava100.00Gold
2Periodontal ligament100.00Gold
3Blood99.92Gold
4Bone marrow99.99Gold
5Spleen99.81Gold
6Monocyte (cell type)100.00Gold
7Triceps brachii99.98Gold
8Biceps brachii99.96Gold
9Gluteal muscle99.97Gold
10Trabecular bone tissue99.99Gold
11Skeletal muscle (biceps)99.98Gold
12Rectus abdominis99.76Gold
13Vastus lateralis99.67Gold
14Diaphragm99.76Gold
15Heart apex99.75Gold
16Bronchus99.71Gold
17Bronchial epithelium99.71Gold
18Colonic epithelium99.71Gold
19Stomach cardia99.69Gold
20Placenta99.68Gold
21Mononuclear cell99.99Gold
22Bone marrow cell99.98Gold
23Decidua99.90Gold
24Bronchial epithelial cell99.89Gold
25Trigeminal ganglion99.64Gold
26Superior vestibular nucleus99.59Gold
27Inferior olivary complex99.68Gold
28Metanephros cortex99.57Gold
29Gall bladder99.57Gold
30Adult organism99.92Gold

Cell Type Expression

HBB is highly enriched in hematopoietic and immune cells. The gene serves as a marker gene in 46/46 SCXA experiments, with maximum mean expression of 562,006 TPM (transcripts per million), indicating it is a defining feature of blood-cell populations.

Top cell types (from Bgee + SCXA):

  • Monocytes (100.0)
  • Mononuclear cells (99.99)
  • Bone marrow cells (99.98)
  • Bronchial epithelial cells (99.89)
  • Red blood cells and erythroid progenitors (implied from max expression)

Single-Cell Expression Datasets

HBB is prominently expressed across 15 human single-cell atlases, primarily in hematopoietic tissues and immune populations:

Dataset IDTitleTissue/SystemCell Count
E-HCAD-4Census of Immune CellsImmune system791,344
E-MTAB-7407Fetal liver, skin, kidneyMultiple tissues473,803
E-MTAB-9543Human gastrointestinal tractGI tract319,479
E-CURD-122Cross-tissue immune cell analysisMulti-tissue immune356,580
E-MTAB-10042Emergent haematopoiesis (fetal bone marrow)Hematopoiesis21,485
E-CURD-112Emergent haematopoiesis (10X)Fetal bone marrow56,592
E-CURD-98Human fetal liverFetal hematopoiesis197,943
E-HCAD-6CD34+ cells (bone marrow)Hematopoietic stem cells34,596
E-CURD-6Bone marrow differentiation hierarchyHematopoiesis1,024
E-MTAB-10553Human liver cellsLiver24,355
E-MTAB-10662Fetal lung with R-SPONDIN inhibitionLung development39,900
E-MTAB-6653Lung carcinomasLung cancer55,719
E-MTAB-8884Chronic myelomonocytic leukemia stem cellsLeukemia9,386
E-MTAB-9221RBC-depleted whole blood (COVID-19)Peripheral blood27,943
E-MTAB-10432Post-MI heart failure bone vascular nicheHeart/bone8,228

Expression Pattern: HBB is a lineage-defining marker for erythroid and myeloid cells, with particularly high expression in mature red blood cells, bone marrow progenitors, and immune cell populations.

Disease associations

Mendelian / Monogenic Diseases

HBB mutations cause multiple autosomal recessive and dominant hemoglobinopathies:

DiseaseDisease ID(s)Inheritance PatternEvidence Level
Beta-thalassemia (HBB/LCRB)OMIM 613985, MONDO:0013517, Orphanet 231214Autosomal recessiveDefinitive/Strong
Sickle cell diseaseOMIM 603903, MONDO:0011382, Orphanet 232Autosomal recessiveStrong/Supportive
Dominant beta-thalassemiaOMIM 603902, MONDO:0011381, Orphanet 231226Autosomal dominantStrong/Supportive
Hemoglobin M diseaseOMIM 617971, Orphanet 330041Autosomal dominantModerate/Supportive
Erythrocytosis, familial, 6OMIM 617980, MONDO:0054801Autosomal dominantStrong/Supportive
Heinz body anemiaOMIM 140700, MONDO:0007705Autosomal dominantStrong/Supportive
Hemoglobin C diseaseOrphanet 2132Autosomal recessiveSupportive
Hemoglobin E diseaseOMIM 603951, MONDO:0016243, Orphanet 2133Autosomal recessiveSupportive
Beta-thalassemia majorMONDO:0016486, Orphanet 231214Autosomal recessiveSupportive
Beta-thalassemia intermediaMONDO:0016487, Orphanet 231222Autosomal recessiveSupportive
Delta-beta-thalassemiaMONDO:0016489, Orphanet 231237Autosomal recessiveSupportive
Hemoglobin C-beta-thalassemia syndromeOrphanet 231242Autosomal recessiveSupportive
Hemoglobin E-beta-thalassemia syndromeMONDO:0016491, Orphanet 231249Autosomal recessiveSupportive
Sickle cell-beta-thalassemia disease syndromeOrphanet 251359Autosomal recessiveSupportive
Sickle cell-hemoglobin C disease syndromeMONDO:0016669, Orphanet 251365Autosomal recessiveSupportive
Sickle cell-hemoglobin D disease syndromeOrphanet 251370Autosomal recessiveSupportive
Sickle cell-hemoglobin E disease syndromeOrphanet 251375Autosomal recessiveSupportive
Hereditary persistence of fetal hemoglobin-sickle cell disease syndromeMONDO:0020989, Orphanet 251380Autosomal recessiveSupportive
Hereditary persistence of fetal hemoglobin-beta-thalassemia syndromeOrphanet 46532Autosomal dominantSupportive
Hemoglobin D diseaseMONDO:0019537, Orphanet 90039Autosomal recessiveSupportive

Additional broadly associated categories: Hemoglobinopathy (MONDO:0044348), Thalassemia (MONDO:0000984), Hemolytic anemia (MONDO:0003664)


Phenotype Associations

Top 30 HPO terms associated with HBB mutations:

HPO IDPhenotype
HP:0001878Hemolytic anemia
HP:0001903Anemia
HP:0001744Splenomegaly
HP:0000952Jaundice
HP:0008282Unconjugated hyperbilirubinemia
HP:0001433Hepatosplenomegaly
HP:0001281Hepatomegaly
HP:0012622Chronic kidney disease
HP:0003259Elevated circulating creatinine concentration
HP:0001922Fever
HP:0000961Cyanosis
HP:0012418Hypoxemia
HP:0001297Stroke
HP:0002140Ischemic stroke
HP:0001626Abnormality of the cardiovascular system
HP:0001640Cardiomegaly
HP:0001644Dilated cardiomyopathy
HP:0001722High-output congestive heart failure
HP:0010885Avascular necrosis
HP:0002754Osteomyelitis
HP:0001510Growth delay
HP:0001531Failure to thrive in infancy
HP:0000823Delayed puberty
HP:0008346Increased red cell sickling tendency
HP:0030058Sickled erythrocytes
HP:0045047HbS hemoglobin
HP:0011902Abnormal hemoglobin
HP:0011904Persistence of hemoglobin F
HP:0011905Reduced hemoglobin A
HP:0200023Priapism

Complex Disease / GWAS Associations

Top 30 GWAS associations (p < 10⁻⁶):

TraitP-valueStudyNotes
Malaria (severe)4.0×10⁻⁶⁹GCST010725_48HBB variant protection; strong effect
Malaria (severe)2.0×10⁻⁶⁷GCST010725_61HBB variant protection
Malaria1.0×10⁻⁵⁵GCST010725_1Strongest known protective association
Glycated hemoglobin (HbA1c) levels3.0×10⁻³⁹GCST008398_13HBB locus
Hemoglobin levels4.0×10⁻²⁶GCST003122_3HBD gene
Hemoglobin A1c levels8.0×10⁻²⁷GCST008034_18HBB directly implicated
Mean corpuscular hemoglobin concentration7.0×10⁻²⁴GCST004329_2HBB locus
Total cholesterol levels5.0×10⁻²³GCST011346_59HBB region
Low density lipoprotein cholesterol levels6.0×10⁻²²GCST011347_35HBB region
Mean corpuscular volume1.0×10⁻²²GCST004335_16HBB locus
Red cell distribution width6.0×10⁻¹⁷GCST008333_1HBB locus
Malaria6.0×10⁻¹⁴GCST001637_1HBG2, HBE1, OR51B5 co-associated
Mean corpuscular hemoglobin concentration1.0×10⁻¹³GCST001782_4HBG2, HBE1
Hemoglobin levels2.0×10⁻¹²GCST009681_7HBB direct effect
Urinary albumin-to-creatinine ratio8.0×10⁻¹²GCST003774_2HBB locus
Hematocrit9.0×10⁻¹⁰GCST004330_1HBB locus
Estimated glomerular filtration rate (non-diabetics)3.0×10⁻⁸GCST008745_92HBB region
Urinary albumin-to-creatinine ratio6.0×10⁻⁸GCST008793_1HBB locus
Hematocrit9.0×10⁻¹⁰GCST009682_2HBB locus
Hemolysis of donated blood (osmotic)4.0×10⁻¹⁰GCST012130_2HBB locus
Urate levels5.0×10⁻¹⁵GCST008972_7HBB locus
Inflammatory biomarkers2.0×10⁻⁹GCST001385_6HBE1, HBG2 co-associated
Hematology traits5.0×10⁻¹¹GCST001779_3HBG2, HBE1
HbA2 levels1.0×10⁻¹¹GCST001710_2OR51V1-HBB region
Fetal hemoglobin levels2.0×10⁻²¹GCST000150_1HBE1, HBG2 co-associated
F-cell distribution2.0×10⁻³⁸GCST000069_3HBB locus
Malaria4.0×10⁻¹¹GCST000410_3OR51V1-HBB region
Estimated glomerular filtration rate (diabetes)7.0×10⁻¹³GCST008746_23HBB region
Blood cell traits (multivariate)5.0×10⁻²⁰GCST008337_2HBB locus
Blood cell traits (multivariate)1.0×10⁻¹²GCST008338_4HBB locus

Key findings:

  • Malaria resistance: HBB variants (especially sickle allele) show extremely strong protection against severe malaria (p < 10⁻⁵⁵)
  • Hematologic traits: HBB is a major QTL for hemoglobin levels, hematocrit, RBC indices, and related blood parameters
  • Metabolic/renal: Secondary associations with glycemic traits (HbA1c), lipid levels, and kidney function markers
  • Inheritance: All Mendelian HBB disorders show clear autosomal inheritance (recessive for most thalassemias and sickling disorders; dominant for some hemoglobinopathies)

Structured Data Sources

Generated with Claude Haiku 4.5 + BioBTree MCP, drawing on data BioBTree aggregates from 44 biological databases. Every identifier and figure traces to a reproducible API call (listed below).

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, antibody, bgee, biogrid_interaction, cathgene3d, ccds, cdd, chembl_molecule, chembl_target, clinical_trials, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, flybase, gencc, go, gwas, hgnc, hpa, hpo, intact, interpro, mim, mondo, msigdb, orphanet, ortholog, panther, pdb, pfam, reactome, refseq, scxa, scxa_expression, spliceai, string_interaction, transcript, uniprot, wormbase
Generated: 2026-05-25 — For the latest data, query BioBTree directly via MCP or API.
View API calls (173)