H4C7

gene
On this page

Also known as H4/l

Summary

H4C7 (H4 clustered histone 7, HGNC:4792) is a protein-coding gene on chromosome 6p22.2, encoding Histone H4-like protein type G (Q99525). Core component of nucleosome.

Histones are basic nuclear proteins that are responsible for the nucleosome structure of the chromosomal fiber in eukaryotes. Two molecules of each of the four core histones (H2A, H2B, H3, and H4) form an octamer, around which approximately 146 bp of DNA is wrapped in repeating units, called nucleosomes. The linker histone, H1, interacts with linker DNA between nucleosomes and functions in the compaction of chromatin into higher order structures. This gene is intronless and encodes a replication-dependent histone that is a member of the histone H4 family. Transcripts from this gene lack polyA tails but instead contain a palindromic termination element. This gene is found in the large histone gene cluster on chromosome 6.

Source: NCBI Gene 8369 — RefSeq curated summary.

At a glance

  • GWAS associations: 17
  • Clinical variants (ClinVar): 29 total
  • MANE Select transcript: NM_003547

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:4792
Approved symbolH4C7
NameH4 clustered histone 7
Location6p22.2
Locus typegene with protein product
StatusApproved
AliasesH4/l
Ensembl geneENSG00000275663
Ensembl biotypeprotein_coding
OMIM602832
Entrez8369

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000611444

RefSeq mRNA: 1 — MANE Select: NM_003547 NM_003547

CCDS: CCDS4599

Canonical transcript exons

ENST00000611444 — 1 exons

ExonStartEnd
ENSE000037548622624661126246996

Expression profiles

Bgee: expression breadth broad, 11 present calls, max score 59.37.

Top tissues by expression

254 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
tibialis anteriorUBERON:000138559.37silver quality
deciduaUBERON:000245056.55gold quality
cranial nerve IIUBERON:000094155.99gold quality
ileal mucosaUBERON:000033154.26silver quality
deltoidUBERON:000147653.34silver quality
hair follicleUBERON:000207352.43gold quality
triceps brachiiUBERON:000150952.21gold quality
quadriceps femorisUBERON:000137749.37gold quality
Brodmann (1909) area 46UBERON:000648349.30gold quality
blood vessel layerUBERON:000479749.29gold quality
cervix squamous epitheliumUBERON:000692249.20gold quality
tendon of biceps brachiiUBERON:000818849.11gold quality
olfactory bulbUBERON:000226448.92gold quality
choroid plexus epitheliumUBERON:000391148.89gold quality
myocardiumUBERON:000234948.87gold quality
type B pancreatic cellCL:000016948.83gold quality
epithelial cell of pancreasCL:000008348.78gold quality
cardiac muscle of right atriumUBERON:000337948.55gold quality
CA1 field of hippocampusUBERON:000388148.50gold quality
vastus lateralisUBERON:000137948.25gold quality
left ventricle myocardiumUBERON:000656648.24gold quality
orbitofrontal cortexUBERON:000416748.20gold quality
upper arm skinUBERON:000426348.06gold quality
cervix epitheliumUBERON:000480148.04gold quality
oviduct epitheliumUBERON:000480448.00gold quality
tongue squamous epitheliumUBERON:000691947.92gold quality
mucosa of urinary bladderUBERON:000125947.80gold quality
nasal cavity epitheliumUBERON:000538447.70gold quality
thymusUBERON:000237047.42gold quality
periodontal ligamentUBERON:000826647.14gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no1.81

Regulation

Is transcription factor: no

Literature-anchored findings (GeneRIF, showing 1)

  • A novel histone H4 variant H4G regulates ribosomal DNA transcription in breast cancer. (PMID:31219579)

Cross-species orthologs

19 orthologs

OrganismSymbolGene ID
drosophila_melanogasterHis4:CG33875FBGN0053875
drosophila_melanogasterHis4:CG33889FBGN0053889
drosophila_melanogasterHis4:CG33903FBGN0053903
caenorhabditis_elegansWBGENE00001875
caenorhabditis_elegansWBGENE00001879
caenorhabditis_elegansWBGENE00001884
caenorhabditis_elegansWBGENE00001888
caenorhabditis_elegansWBGENE00001892
caenorhabditis_elegansWBGENE00001900
caenorhabditis_elegansWBGENE00001902
caenorhabditis_elegansWBGENE00001905
caenorhabditis_elegansWBGENE00001911
caenorhabditis_elegansWBGENE00001912
caenorhabditis_elegansWBGENE00001920
caenorhabditis_elegansWBGENE00001924
caenorhabditis_elegansWBGENE00001930
caenorhabditis_elegansWBGENE00001934
caenorhabditis_elegansWBGENE00001938
caenorhabditis_eleganshis-67WBGENE00001941

Paralogs (14): H4C8 (ENSG00000158406), H4C3 (ENSG00000197061), H4C11 (ENSG00000197238), H4C16 (ENSG00000197837), H4C15 (ENSG00000270276), H4C14 (ENSG00000270882), H4C12 (ENSG00000273542), H4C6 (ENSG00000274618), H4C13 (ENSG00000275126), H4C9 (ENSG00000276180), H4C5 (ENSG00000276966), H4C4 (ENSG00000277157), H4C1 (ENSG00000278637), H4C2 (ENSG00000278705)

Protein

Protein identifiers

Histone H4-like protein type GQ99525 (reviewed: Q99525)

Alternative names: H4-clustered histone 7

All UniProt accessions (1): Q99525

UniProt curated annotations — full annotation on UniProt →

Function. Core component of nucleosome. Nucleosomes wrap and compact DNA into chromatin, limiting DNA accessibility to the cellular machineries which require DNA as a template. Histones thereby play a central role in transcription regulation, DNA repair, DNA replication and chromosomal stability. DNA accessibility is regulated via a complex set of post-translational modifications of histones, also called histone code, and nucleosome remodeling.

Subunit / interactions. The nucleosome is a histone octamer containing two molecules each of H2A, H2B, H3 and H4 assembled in one H3-H4 heterotetramer and two H2A-H2B heterodimers. The octamer wraps approximately 147 bp of DNA.

Subcellular location. Nucleus. Chromosome.

Similarity. Belongs to the histone H4 family.

RefSeq proteins (1): NP_003538* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001951Histone_H4Family
IPR009072Histone-foldHomologous_superfamily

UniProt features (2 total): chain 1, DNA-binding region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q99525-F187.890.72

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 27 (showing top): GOBP_PROTEIN_DNA_COMPLEX_ORGANIZATION, GOBP_CHROMATIN_REMODELING, DEURIG_T_CELL_PROLYMPHOCYTIC_LEUKEMIA_UP, GOCC_PROTEIN_DNA_COMPLEX, IRF2_01, GOMF_PROTEIN_HETERODIMERIZATION_ACTIVITY, GOMF_PROTEIN_DIMERIZATION_ACTIVITY, GOMF_STRUCTURAL_MOLECULE_ACTIVITY, GOBP_NUCLEOSOME_ORGANIZATION, IRF1_01, WP_HISTONE_MODIFICATIONS, GAL_LEUKEMIC_STEM_CELL_DN, GOCC_NUCLEOSOME, GOMF_STRUCTURAL_CONSTITUENT_OF_CHROMATIN, GSE2706_LPS_VS_R848_AND_LPS_8H_STIM_DC_DN

GO Biological Process (1): nucleosome assembly (GO:0006334)

GO Molecular Function (4): DNA binding (GO:0003677), structural constituent of chromatin (GO:0030527), protein heterodimerization activity (GO:0046982), protein binding (GO:0005515)

GO Cellular Component (3): nucleosome (GO:0000786), nucleus (GO:0005634), chromosome (GO:0005694)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
chromatin2
chromatin organization1
nucleosome organization1
protein-DNA complex assembly1
nucleic acid binding1
structural molecule activity1
protein dimerization activity1
binding1
protein-DNA complex1
intracellular membrane-bounded organelle1
intracellular membraneless organelle1

Protein interactions and networks

STRING

3096 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
H4C7H2AC20Q16777999
H4C7H2AC19P20670999
H4C7H3-3AP06351998
H4C7H3C14Q71DI3998
H4C7H3-4Q16695998
H4C7H3-5Q6NXT2997
H4C7H3C1P02295997
H4C7H2BC21Q16778997
H4C7H3-7Q5TEC6997
H4C7BRD4O60885993
H4C7BRDTQ58F21986
H4C7RBBP4P31149985
H4C7TP53BP1Q12888979
H4C7BRD2P25440979
H4C7HJURPQ8NCD3973

IntAct

29 interactions, top by confidence:

ABTypeScore
AGTRAPH4C7psi-mi:“MI:0915”(physical association)0.560
H4C7AGTRAPpsi-mi:“MI:0915”(physical association)0.560
H4C7FKBP7psi-mi:“MI:0915”(physical association)0.560
MAIP1H4C7psi-mi:“MI:0915”(physical association)0.560
TMED8H4C7psi-mi:“MI:0915”(physical association)0.560
FGF14H4C7psi-mi:“MI:0915”(physical association)0.560
H4C7CIDEBpsi-mi:“MI:0915”(physical association)0.560
RHBDD2H4C7psi-mi:“MI:0915”(physical association)0.560
APOC1H4C7psi-mi:“MI:0915”(physical association)0.560
H4C7H1-4psi-mi:“MI:0915”(physical association)0.400
H4C7H3C13psi-mi:“MI:0915”(physical association)0.400
HMGN1H2AXpsi-mi:“MI:0914”(association)0.350
HMGN1IPO7psi-mi:“MI:0914”(association)0.350
TMED8H4C7psi-mi:“MI:0915”(physical association)0.000
MAIP1H4C7psi-mi:“MI:0915”(physical association)0.000
H4C7FGF14psi-mi:“MI:0915”(physical association)0.000
H4C7CIDEBpsi-mi:“MI:0915”(physical association)0.000
H4C7RHBDD2psi-mi:“MI:0915”(physical association)0.000
H4C7APOC1psi-mi:“MI:0915”(physical association)0.000

BioGRID (41): AGTRAP (Two-hybrid), HIST1H4G (Affinity Capture-MS), HIST1H4G (Two-hybrid), HIST1H4G (Two-hybrid), HIST1H4G (Two-hybrid), HIST1H4G (Two-hybrid), HIST1H4G (Two-hybrid), HIST1H4G (Two-hybrid), HIST1H4G (Two-hybrid), HIST1H4G (Proximity Label-MS), HIST1H4G (Proximity Label-MS), HIST1H4G (Affinity Capture-MS), HIST1H4G (Affinity Capture-MS), HIST1H4G (Affinity Capture-MS), HIST1H4G (Affinity Capture-MS)

ESM2 similar proteins: A0A097I1R9, A0A097I2D0, P02309, P02310, P04915, P08436, P09322, P23750, P23751, P35057, P35059, P40287, P50566, P62786, P62790, P62791, P62792, P62793, P69151, P69152, P70081, P81202, P91849, P91890, Q27443, Q27490, Q41811, Q54BC2, Q54LA5, Q54Z07, Q5M8Q2, Q6LAF1, Q6V9I2, Q6ZXX3, Q71V09, Q757K0, Q75AX1, Q76MU7, Q76NW2, Q8J1L3

Diamond homologs: P02309, P04914, P04915, P08436, P09322, P0CG89, P23750, P23751, P27996, P35057, P35059, P50566, P59259, P62776, P62777, P62778, P62779, P62780, P62781, P62782, P62783, P62784, P62785, P62786, P62787, P62788, P62789, P62792, P62793, P62794, P62795, P62796, P62797, P62798, P62799, P62800, P62801, P62802, P62803, P62804

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

29 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance29
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

48 predictions. Top by Δscore:

VariantEffectΔscore
6:26246694:C:Adonor_gain0.8600
6:26246693:T:TAdonor_gain0.6500
6:26246820:TC:Tacceptor_gain0.6500
6:26246683:A:ACdonor_gain0.5800
6:26246684:C:CCdonor_gain0.5800
6:26246685:A:Cdonor_gain0.5200
6:26246774:CCAGA:Cdonor_gain0.4700
6:26246851:CGCCA:Cacceptor_gain0.4500
6:26246817:TCCTC:Tacceptor_gain0.4400
6:26246929:TGGC:Tdonor_gain0.4400
6:26246712:TAG:Tdonor_gain0.4100
6:26246713:AGA:Adonor_gain0.4100
6:26246821:C:CTacceptor_gain0.4100
6:26246799:T:TGacceptor_gain0.3900
6:26246908:TCAG:Tdonor_gain0.3700
6:26246821:C:Aacceptor_gain0.3600
6:26246909:CAGTA:Cdonor_loss0.3600
6:26246910:AGTAC:Adonor_loss0.3600
6:26246911:GTA:Gdonor_loss0.3600
6:26246912:TA:Tdonor_loss0.3600
6:26246913:A:ATdonor_loss0.3600
6:26246914:CCTT:Cdonor_loss0.3600
6:26246915:C:Adonor_loss0.3500
6:26246822:A:Cacceptor_gain0.3300
6:26246928:TTGG:Tdonor_gain0.3300
6:26246684:CAGGG:Cdonor_gain0.3100
6:26246771:G:Tdonor_gain0.3100
6:26246800:TGAAC:Tdonor_gain0.3100
6:26246818:CCTCA:Cacceptor_gain0.3100
6:26246800:T:Aacceptor_gain0.2900

AlphaMissense

622 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:26246792:G:CF62L0.750
6:26246792:G:TF62L0.750
6:26246794:A:GF62L0.750

dbSNP variants (sampled 300 via entrez): RS1002316739 (6:26246276 G>A,T), RS1003351870 (6:26248016 G>C), RS1003486567 (6:26248303 A>G), RS1004131048 (6:26247994 G>A), RS1005159557 (6:26246301 A>G), RS1005368475 (6:26247134 A>G), RS1005834083 (6:26247394 C>A), RS1006785323 (6:26248346 G>T), RS1007821828 (6:26247041 AC>A), RS1011276476 (6:26246298 A>G), RS1011355118 (6:26246709 A>C), RS1013445555 (6:26248074 C>A), RS1015391874 (6:26247410 C>A,T), RS1015429380 (6:26247700 C>G), RS1015453772 (6:26247489 T>C,G)

Disease associations

OMIM: gene MIM:602832 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

17 associations (top):

StudyTraitp-value
GCST004521_113Autism spectrum disorder or schizophrenia3.000000e-19
GCST004521_142Autism spectrum disorder or schizophrenia2.000000e-09
GCST004521_169Autism spectrum disorder or schizophrenia4.000000e-14
GCST004521_69Autism spectrum disorder or schizophrenia8.000000e-24
GCST004521_83Autism spectrum disorder or schizophrenia1.000000e-13
GCST004571_19Iron status biomarkers (total iron binding capacity)2.000000e-08
GCST004572_27Iron status biomarkers (transferrin saturation)2.000000e-08
GCST007294_143Body fat distribution (trunk fat ratio)5.000000e-29
GCST007294_82Body fat distribution (trunk fat ratio)1.000000e-48
GCST007295_120Body fat distribution (leg fat ratio)2.000000e-46
GCST007295_91Body fat distribution (leg fat ratio)1.000000e-26
GCST010002_50Refractive error4.000000e-34
GCST010083_312Hemoglobin levels6.000000e-44
GCST010141_1Beef consumption7.000000e-13
GCST010143_19Meat-related diet5.000000e-13
GCST010143_31Meat-related diet7.000000e-09
GCST010143_5Meat-related diet4.000000e-09

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0006334total iron binding capacity
EFO:0004341body fat distribution
EFO:0004509hemoglobin measurement
EFO:0008111diet measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

13 total (human), top 13 by PubMed support.

ChemicalActions (top 5)PubMed papers
propionaldehydedecreases expression1
sodium arsenitedecreases expression1
butyraldehydedecreases expression1
pentanaldecreases expression1
Air Pollutantsincreases abundance, increases expression1
Aldehydesdecreases expression1
Arsenicaffects methylation1
Citrullinedecreases expression1
Coalincreases abundance, increases expression1
Diethylhexyl Phthalateincreases expression1
Leadincreases expression1
Smokeincreases abundance, increases expression1
Copper Sulfatedecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.