H2BW1

gene
On this page

Also known as TH2B-175

Summary

H2BW1 (H2B.W histone 1, HGNC:27252) is a protein-coding gene on chromosome Xq22.2, encoding Histone H2B type W-T (Q7Z2G1). Atypical histone H2B that can form nucleosomes structurally and dynamically indistinguishable from those containing conventional H2B.

Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene encodes a replication-independent histone that is a member of the H2B histone family that is specifically expressed in sperm nuclei. A polymorphism in the 5’ UTR of this gene is associated with male infertility.

Source: NCBI Gene 158983 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 57 total — 2 pathogenic
  • MANE Select transcript: NM_001002916

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:27252
Approved symbolH2BW1
NameH2B.W histone 1
LocationXq22.2
Locus typegene with protein product
StatusApproved
AliasesTH2B-175
Ensembl geneENSG00000123569
Ensembl biotypeprotein_coding
OMIM300507
Entrez158983

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000217926

RefSeq mRNA: 1 — MANE Select: NM_001002916 NM_001002916

CCDS: CCDS35362

Canonical transcript exons

ENST00000217926 — 3 exons

ExonStartEnd
ENSE00000674142104013170104013708
ENSE00001520302104011147104011463
ENSE00001632622104012690104012748

Expression profiles

Bgee: expression breadth broad, 13 present calls, max score 86.77.

Top tissues by expression

119 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047386.77gold quality
cortical plateUBERON:000534341.54gold quality
right testisUBERON:000453438.36gold quality
placentaUBERON:000198737.49gold quality
colonic epitheliumUBERON:000039737.20gold quality
ventricular zoneUBERON:000305336.48gold quality
bone marrow cellCL:000209236.16gold quality
ganglionic eminenceUBERON:000402335.49gold quality
testisUBERON:000047335.44gold quality
skeletal muscle tissueUBERON:000113434.98gold quality
left testisUBERON:000453334.95gold quality
right coronary arteryUBERON:000162533.01gold quality
muscle tissueUBERON:000238532.23gold quality
hindlimb stylopod muscleUBERON:000425232.15gold quality
bone marrowUBERON:000237131.74gold quality
prefrontal cortexUBERON:000045131.10silver quality
sural nerveUBERON:001548830.93gold quality
stromal cell of endometriumCL:000225529.87gold quality
tonsilUBERON:000237229.77silver quality
superior frontal gyrusUBERON:000266128.25silver quality
duodenumUBERON:000211428.14gold quality
liverUBERON:000210728.04gold quality
lymph nodeUBERON:000002927.57gold quality
vermiform appendixUBERON:000115426.42gold quality
leukocyteCL:000073826.35gold quality
monocyteCL:000057626.23gold quality
bloodUBERON:000017826.04silver quality
gall bladderUBERON:000211025.98gold quality
olfactory segment of nasal mucosaUBERON:000538625.89gold quality
cerebellumUBERON:000203725.88gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no1.63

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

18 targeting H2BW1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-9-5P100.0072.282361
HSA-MIR-576-5P99.8470.462582
HSA-MIR-6515-3P99.8268.191933
HSA-MIR-129999.7771.242389
HSA-MIR-6848-3P99.6466.49885
HSA-MIR-875-3P99.6369.472548
HSA-MIR-57899.4668.361787
HSA-MIR-103A-1-5P99.3967.781545
HSA-MIR-103A-2-5P99.3967.721577
HSA-MIR-6843-3P99.2666.42915
HSA-MIR-3191-5P99.2466.521722
HSA-MIR-4727-5P99.2367.551154
HSA-MIR-593-3P99.2267.281327
HSA-MIR-314998.7767.131639
HSA-MIR-518C-5P98.5369.201640
HSA-MIR-676-5P98.4968.871492
HSA-MIR-6834-3P98.1665.77551
HSA-MIR-4433B-5P95.9166.56727

Literature-anchored findings (GeneRIF, showing 6)

  • Data indicate that the polymorphism -9C>T and 368A>G in H2BFWT gene are associated with male infertility with idiopathic azoospermia or oligozoospermia, suggesting susceptibility of H2BFWT gene to spermatogenesis impairment in Chinese population. (PMID:22509975)
  • It is concluded that H2BFWT gene c.-9C>T and c.368A>G polymorphisms might be genetic risk factors for idiopathic male infertility. (PMID:28370107)
  • the association of -9C>T and 368A>G polymorphisms in H2BFWT gene with male infertility risk was assessed through a case-control and meta-analysis (PMID:29453813)
  • Tobacco smoking and its impact on the expression level of sperm nuclear protein genes: H2BFWT, TNP1, TNP2, PRM1 and PRM2. (PMID:33440036)
  • Impact of tobacco smoking in association with H2BFWT, PRM1 and PRM2 genes variants on male infertility. (PMID:36217675)
  • H2BFWT Variations in Sperm DNA and Its Correlation to Pregnancy. (PMID:38892236)

Cross-species orthologs

1 orthologs

OrganismSymbolGene ID
mus_musculusH2bw2ENSMUSG00000048155

Paralogs (21): H2BW2 (ENSG00000101812), H2BC11 (ENSG00000124635), H2BC1 (ENSG00000146047), H2BC5 (ENSG00000158373), H2BC4 (ENSG00000180596), H2BC21 (ENSG00000184678), H2BC13 (ENSG00000185130), H2BC26 (ENSG00000196890), H2BC12 (ENSG00000197903), H2BC18 (ENSG00000203814), H2BC15 (ENSG00000233822), H2BC12L (ENSG00000234289), H2BC14 (ENSG00000273703), H2BC8 (ENSG00000273802), H2BC6 (ENSG00000274290), H2BC17 (ENSG00000274641), H2BC9 (ENSG00000275713), H2BC3 (ENSG00000276410), H2BC7 (ENSG00000277224), H2BC10 (ENSG00000278588), H2BK1 (ENSG00000285480)

Protein

Protein identifiers

Histone H2B type W-TQ7Z2G1 (reviewed: Q7Z2G1)

Alternative names: H2B histone family member W testis-specific, H2B.W histone 1

All UniProt accessions (1): Q7Z2G1

UniProt curated annotations — full annotation on UniProt →

Function. Atypical histone H2B that can form nucleosomes structurally and dynamically indistinguishable from those containing conventional H2B. Nucleosomes wrap and compact DNA into chromatin, limiting DNA accessibility to the cellular machineries which require DNA as a template. Histones thereby play a central role in transcription regulation, DNA repair, DNA replication and chromosomal stability. DNA accessibility is regulated via a complex set of post-translational modifications of histones, also called histone code, and nucleosome remodeling. However, unlike conventional H2B, does not recruit chromosome condensation factors and does not participate in the assembly of mitotic chromosomes. May be important for telomere function and play a role in spermatogenesis.

Subunit / interactions. Can replace the conventional histone H2B in the nucleosome. The nucleosome is a histone octamer containing two molecules each of H2A, H2B, H3 and H4 assembled in one H3-H4 heterotetramer and two H2A-H2B heterodimers. The octamer wraps approximately 147 bp of DNA.

Subcellular location. Nucleus membrane. Chromosome. Telomere.

Tissue specificity. Testis-specific (at protein level).

Miscellaneous. In contrast to other H2B histones, it does not contain the conserved residue in C-terminus that is the target of monoubiquitination. Gene prediction based on conservation. Has orthologs in primates, but not in rodents.

Similarity. Belongs to the histone H2B family.

Isoforms (3)

UniProt IDNamesCanonical?
Q7Z2G1-22yes
Q7Z2G1-11
Q7Z2G1-33

RefSeq proteins (1): NP_001002916* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000558Histone_H2BFamily
IPR007125H2A/H2B/H3Domain
IPR009072Histone-foldHomologous_superfamily

Pfam: PF00125

UniProt features (19 total): modified residue 7, helix 4, splice variant 2, sequence variant 2, compositionally biased region 2, chain 1, region of interest 1

Structure

Experimental structures (PDB)

2 structures.

PDBMethodResolution (Å)
8JCDELECTRON MICROSCOPY3.14
8JCCELECTRON MICROSCOPY3.42

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q7Z2G1-F176.730.60

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (7): 138, 26, 26, 35, 107, 130, 130

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 32 (showing top): GOBP_CHROMOSOME_ORGANIZATION, GOBP_CHROMOSOME_CONDENSATION, GOBP_ORGANELLE_FISSION, GOBP_MITOTIC_NUCLEAR_DIVISION, GOBP_MITOTIC_CELL_CYCLE, GOBP_NUCLEAR_CHROMOSOME_SEGREGATION, GOBP_MITOTIC_CHROMOSOME_CONDENSATION, GOCC_NUCLEAR_ENVELOPE, GOCC_CHROMOSOMAL_REGION, GOCC_PROTEIN_DNA_COMPLEX, GOCC_CHROMOSOME_TELOMERIC_REGION, GOCC_NUCLEAR_MEMBRANE, GOBP_CELL_CYCLE_PROCESS, GOMF_PROTEIN_HETERODIMERIZATION_ACTIVITY, GOMF_PROTEIN_DIMERIZATION_ACTIVITY

GO Biological Process (0):

GO Molecular Function (4): DNA binding (GO:0003677), structural constituent of chromatin (GO:0030527), protein heterodimerization activity (GO:0046982), protein binding (GO:0005515)

GO Cellular Component (6): chromosome, telomeric region (GO:0000781), nucleosome (GO:0000786), nucleus (GO:0005634), nuclear membrane (GO:0031965), chromosome (GO:0005694), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
chromatin2
nucleic acid binding1
structural molecule activity1
protein dimerization activity1
binding1
chromosomal region1
protein-DNA complex1
intracellular membrane-bounded organelle1
nucleus1
nuclear envelope1
organelle membrane1
intracellular membraneless organelle1
cellular anatomical structure1

Protein interactions and networks

STRING

1724 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
H2BW1PLGP00747867
H2BW1RNF20Q5VTR2843
H2BW1H3-5Q6NXT2816
H2BW1H3C1P02295816
H2BW1H3-4Q16695816
H2BW1H3C14Q71DI3815
H2BW1H3-7Q5TEC6815
H2BW1H3-3AP06351813
H2BW1RNF40O75150772
H2BW1H4C7Q99525746
H2BW1H4C16P02304743
H2BW1H2AC20Q16777743
H2BW1H2AC19P20670743
H2BW1USP22Q9UPT9728
H2BW1DET1Q7L5Y6727

IntAct

7 interactions, top by confidence:

ABTypeScore
H2BW1PPP1R2Bpsi-mi:“MI:0915”(physical association)0.560
CEP78H2BW1psi-mi:“MI:0915”(physical association)0.400
H2BW1MACROH2A1psi-mi:“MI:0914”(association)0.350
H2BW1PPP1R2Bpsi-mi:“MI:0915”(physical association)0.000

BioGRID (15): PPP1R2P3 (Two-hybrid), HSPG2 (Affinity Capture-MS), TRO (Affinity Capture-MS), XPC (Affinity Capture-MS), KPNB1 (Affinity Capture-MS), ZC3H18 (Affinity Capture-MS), SUPT16H (Affinity Capture-MS), ARHGEF40 (Affinity Capture-MS), HIST1H2AG (Affinity Capture-MS), NAP1L3 (Affinity Capture-MS), H2AFY (Affinity Capture-MS), C18orf25 (Affinity Capture-MS), MDM2 (Affinity Capture-MS), MAP1S (Affinity Capture-MS), HIST2H2AB (Affinity Capture-MS)

ESM2 similar proteins: A0A097I1R9, A0A097I2B5, A0A097I2D0, A0A1W2PP81, A0A1W2PPE2, A0A1W2PPH5, A0A1W2PPL8, A0A1W2PPW3, A0A1W2PQ09, A0A1W2PR64, A0A1W2PRV1, A6NLC8, O04027, O15819, P06898, P06902, P0C1H6, P0DW11, P0DW12, P0DW13, P0DW14, P27795, P27893, P40284, P40285, P40287, P81195, P81196, P81197, P81198, P81199, P81200, P81201, P81202, Q06196, Q27490, Q2N2K6, Q5M8Q2, Q6CER9, Q6VEU3

Diamond homologs: A0A2R8Y619, A1CJ09, A1D8G9, A2QY49, A3LXE6, A3LZZ1, A5DWF0, O60814, P02281, P02283, P02284, P02285, P02286, P02290, P02293, P04255, P04913, P06899, P06900, P07794, P0C1H3, P0C1H4, P0C1H5, P0C1H6, P10853, P10854, P14001, P16888, P16889, P17271, P19374, P21897, P23527, P23754, P27326, P30757, P33778, P35068, P35069, P48557

SIGNOR signaling

2 interactions.

AEffectBMechanism
SLBP“up-regulates quantity by expression”H2BW1“translation regulation”
“MSL acetyltransferase”“down-regulates activity”H2BW1monoubiquitination

Disease & clinical

Clinical variants and AI predictions

ClinVar

57 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic2
Likely pathogenic0
Uncertain significance43
Likely benign8
Benign4

Top pathogenic / likely-pathogenic (2)

Variant IDHGVSClassification
150913GRCh38/hg38 Xq22.2(chrX:103772099-104048812)x3Pathogenic
4683032GRCh37/hg19 Xq22.2(chrX:102988399-103291789)x3Pathogenic

SpliceAI

511 predictions. Top by Δscore:

VariantEffectΔscore
X:104011899:T:TAdonor_gain1.0000
X:104012767:C:CTacceptor_gain1.0000
X:104012768:A:Tacceptor_gain1.0000
X:104012771:C:CTacceptor_gain1.0000
X:104012773:C:CTacceptor_gain1.0000
X:104012774:A:Tacceptor_gain1.0000
X:104012780:A:ACacceptor_gain1.0000
X:104012787:A:Tacceptor_gain1.0000
X:104011870:A:ACdonor_gain0.9900
X:104011890:A:ACdonor_gain0.9900
X:104011891:C:CCdonor_gain0.9900
X:104011895:AAATT:Adonor_gain0.9900
X:104012685:GTTA:Gdonor_loss0.9900
X:104012686:TTAC:Tdonor_loss0.9900
X:104012687:TACCT:Tdonor_loss0.9900
X:104012689:CCT:Cdonor_loss0.9900
X:104012689:CCTTG:Cdonor_gain0.9900
X:104012763:C:CTacceptor_gain0.9900
X:104012767:C:Tacceptor_gain0.9900
X:104012780:A:Cacceptor_gain0.9900
X:104012786:C:CTacceptor_gain0.9900
X:104013165:TGTAC:Tdonor_loss0.9900
X:104013166:GTAC:Gdonor_loss0.9900
X:104013168:ACC:Adonor_loss0.9900
X:104013169:CC:Cdonor_loss0.9900
X:104011464:C:CCacceptor_gain0.9800
X:104011837:T:Cdonor_gain0.9800
X:104011912:AATC:Adonor_gain0.9800
X:104011962:C:CCacceptor_gain0.9800
X:104012764:A:Tacceptor_gain0.9800

AlphaMissense

1120 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
X:104013418:G:CF81L0.964
X:104013418:G:TF81L0.964
X:104013420:A:GF81L0.964
X:104013406:G:CF85L0.877
X:104013406:G:TF85L0.877
X:104013408:A:GF85L0.877
X:104013300:C:GA121P0.819
X:104013357:C:GA102P0.794
X:104013263:A:GI133T0.760
X:104013419:A:CF81C0.755
X:104013287:G:TA125D0.744
X:104013421:G:CS80R0.741
X:104013421:G:TS80R0.741
X:104013423:T:GS80R0.741
X:104013251:T:AE137V0.727
X:104013314:C:GR116P0.725
X:104013365:G:AS99F0.724
X:104013239:G:TA141D0.722
X:104013309:C:GA118P0.719
X:104013343:C:AM106I0.713
X:104013343:C:GM106I0.713
X:104013343:C:TM106I0.713
X:104013323:A:TI113K0.703
X:104013200:G:TA154D0.700
X:104013263:A:CI133S0.699
X:104013308:G:TA118D0.690
X:104013240:C:GA141P0.684
X:104013299:G:TA121D0.680
X:104013419:A:GF81S0.678
X:104013332:A:TV110D0.676

dbSNP variants (sampled 300 via entrez): RS1000081745 (X:104014665 A>G), RS1003947912 (X:104015626 G>A,C), RS1003980744 (X:104015348 G>A), RS1014156760 (X:104014201 C>A), RS1015277200 (X:104014907 A>G), RS1029501237 (X:104015309 C>T), RS1032813108 (X:104014256 A>G), RS1033968159 (X:104015446 C>A), RS1042340075 (X:104015201 C>T), RS1042391951 (X:104014593 G>A), RS1051345105 (X:104015593 A>C), RS1054298436 (X:104015517 C>G,T), RS1056977142 (X:104014571 T>A,C), RS111263150 (X:104013041 C>A), RS113217714 (X:104011869 T>C)

Disease associations

OMIM: gene MIM:300507 | disease phenotypes: MIM:312080

GenCC curated gene-disease

Mondo (1): Pelizaeus-Merzbacher spectrum disorder (MONDO:0010714)

Orphanet (1): Pelizaeus-Merzbacher disease (Orphanet:702)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

MeSH disease descriptors (1)

DescriptorNameTree numbers
D020371Pelizaeus-Merzbacher DiseaseC10.228.140.163.100.362.775; C10.228.140.695.625.775; C10.314.400.775; C16.320.322.906; C16.320.565.189.362.775; C18.452.132.100.362.775; C18.452.648.189.362.775

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

7 total (human), top 7 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Aaffects cotreatment, decreases methylation1
Resveratrolaffects cotreatment, decreases expression1
Fulvestrantaffects cotreatment, decreases methylation1
Benzo(a)pyreneaffects methylation1
Endosulfandecreases expression1
Plant Extractsdecreases expression, affects cotreatment1
Valproic Acidincreases methylation1

Clinical trials (associated diseases)

9 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT07382739PHASE2RECRUITINGA Phase 2 Study of Radiotherapy-induced Immune Priming to Enhance Elranatamab (Elra) in Relapsed Refractory Multiple Myeloma (RRMM) With Extramedullary Disease (EMD) and Paramedullary Disease (PMD) PRIME-EMD-PMD
NCT01005004PHASE1COMPLETEDStudy of Human Central Nervous System (CNS) Stem Cells Transplantation in Pelizaeus-Merzbacher Disease (PMD) Subjects
NCT02254863PHASE1RECRUITINGUCB Transplant of Inherited Metabolic Diseases With Administration of Intrathecal UCB Derived Oligodendrocyte-Like Cells
NCT06150716PHASE1RECRUITINGOrbit Study: A Study to Evaluate the Safety, Pharmacokinetics, and Pharmacodynamics of Intrathecally Administered ION356 in Participants With Pelizaeus Merzbacher Disease (PMD)
NCT01391637Not specifiedCOMPLETEDLong-Term Follow-Up Study of Human Stem Cells Transplanted in Subjects With Connatal Pelizaeus-Merzbacher Disease (PMD)
NCT02699190Not specifiedCOMPLETEDLeukoSEQ: Whole Genome Sequencing as a First-Line Diagnostic Tool for Leukodystrophies
NCT03047369Not specifiedRECRUITINGThe Myelin Disorders Biorepository Project
NCT03333200Not specifiedRECRUITINGLongitudinal Study of Neurodegenerative Disorders
NCT05659901Not specifiedRECRUITINGRocket Study: A Study to Characterize Biomarkers and Disease Progression in Participants With Pelizaeus-Merzbacher Disease
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): Pelizaeus-Merzbacher spectrum disorder