BARHL2

gene
On this page

Summary

BARHL2 (BarH like homeobox 2, HGNC:954) is a protein-coding gene on chromosome 1p22.2, encoding BarH-like 2 homeobox protein (Q9NY43). Potential regulator of neural basic helix-loop-helix genes.

Enables sequence-specific double-stranded DNA binding activity. Predicted to be involved in regulation of transcription by RNA polymerase II. Predicted to act upstream of or within several processes, including generation of neurons; positive regulation of macromolecule biosynthetic process; and regulation of axon extension. Predicted to be located in chromatin. Predicted to be active in nucleus. Implicated in oral squamous cell carcinoma. Biomarker of colorectal cancer; oral squamous cell carcinoma; and stomach cancer.

Source: NCBI Gene 343472 — RefSeq curated summary.

At a glance

  • GWAS associations: 26
  • Clinical variants (ClinVar): 61 total
  • MANE Select transcript: NM_020063

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:954
Approved symbolBARHL2
NameBarH like homeobox 2
Location1p22.2
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000143032
Ensembl biotypeprotein_coding
OMIM605212
Entrez343472

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000370445

RefSeq mRNA: 1 — MANE Select: NM_020063 NM_020063

CCDS: CCDS730

Canonical transcript exons

ENST00000370445 — 3 exons

ExonStartEnd
ENSE000013389609071453190714756
ENSE000014527359071153990712624
ENSE000014527369071657190717302

Expression profiles

Bgee: expression breadth broad, 26 present calls, max score 90.35.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0379 / max 16.2662, expressed in 18 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
132110.037918

Top tissues by expression

121 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
cerebellumUBERON:000203790.35gold quality
cerebellar cortexUBERON:000212990.27gold quality
cerebellar hemisphereUBERON:000224590.24gold quality
right hemisphere of cerebellumUBERON:001489088.58gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099185.06gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047374.47silver quality
right testisUBERON:000453457.90gold quality
testisUBERON:000047357.43gold quality
left testisUBERON:000453356.39gold quality
hypothalamusUBERON:000189848.22gold quality
C1 segment of cervical spinal cordUBERON:000646943.42gold quality
ventricular zoneUBERON:000305340.85silver quality
cortical plateUBERON:000534340.09gold quality
brainUBERON:000095539.26gold quality
sural nerveUBERON:001548838.78gold quality
substantia nigraUBERON:000203838.55gold quality
lower esophagus mucosaUBERON:003583437.49gold quality
amygdalaUBERON:000187637.25gold quality
colonic epitheliumUBERON:000039737.20gold quality
temporal lobeUBERON:000187137.16gold quality
bone marrow cellCL:000209236.16gold quality
ganglionic eminenceUBERON:000402335.49gold quality
apex of heartUBERON:000209835.42gold quality
right coronary arteryUBERON:000162534.51gold quality
mucosa of stomachUBERON:000119934.43gold quality
mucosa of transverse colonUBERON:000499134.34gold quality
frontal cortexUBERON:000187033.49gold quality
skeletal muscle tissueUBERON:000113433.38gold quality
bone marrowUBERON:000237132.88gold quality
hindlimb stylopod muscleUBERON:000425232.15gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-HCAD-56yes471.04
E-ANND-3no0.82

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

1 targets.

TargetRegulation
AVPR2

JASPAR motifs

MotifNameFamily
MA0635.1BARHL2NK
MA0635.2BARHL2NK

JASPAR matrix evidence (PMIDs): PMID:18585359

Upstream regulators (CollecTRI, top): ATOH1

miRNA regulators (miRDB)

36 targeting BARHL2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-10401-5P99.9965.79948
HSA-MIR-6870-5P99.9968.552115
HSA-MIR-453499.9966.581907
HSA-MIR-4723-5P99.9768.702034
HSA-MIR-569899.9768.492029
HSA-MIR-7111-5P99.9768.482062
HSA-MIR-4725-3P99.9669.532520
HSA-MIR-6780B-5P99.9669.602562
HSA-MIR-6721-5P99.9368.922981
HSA-MIR-427199.8868.322244
HSA-MIR-3151-5P99.8663.831069
HSA-MIR-444799.8567.812900
HSA-MIR-6756-5P99.8267.972466
HSA-MIR-3150A-3P99.7664.441640
HSA-MIR-6763-5P99.7664.681767
HSA-MIR-92A-2-5P99.7567.012164
HSA-MIR-6766-5P99.6867.702325
HSA-MIR-320299.6667.702737
HSA-MIR-317599.6566.302031
HSA-MIR-6512-3P99.6566.071468
HSA-MIR-6720-5P99.6566.221459
HSA-MIR-431099.5968.842527
HSA-MIR-447299.5666.081478
HSA-MIR-1207-5P99.4969.112983
HSA-MIR-520A-5P99.3566.721632
HSA-MIR-525-5P99.3566.851615
HSA-MIR-491-5P99.1365.981468
HSA-MIR-4763-3P99.1067.832649
HSA-MIR-429798.7766.952013
HSA-MIR-4704-3P98.2869.331300

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_reriobarhl2ENSDARG00000104361
mus_musculusBarhl2ENSMUSG00000034384
rattus_norvegicusBarhl2ENSRNOG00000002117
drosophila_melanogasterB-H2FBGN0004854
drosophila_melanogasterB-H1FBGN0011758
caenorhabditis_elegansWBGENE00000451
caenorhabditis_elegansWBGENE00000452

Paralogs (3): BARX2 (ENSG00000043039), BARHL1 (ENSG00000125492), BARX1 (ENSG00000131668)

Protein

Protein identifiers

BarH-like 2 homeobox proteinQ9NY43 (reviewed: Q9NY43)

All UniProt accessions (1): Q9NY43

UniProt curated annotations — full annotation on UniProt →

Function. Potential regulator of neural basic helix-loop-helix genes.

Subcellular location. Nucleus.

Similarity. Belongs to the BAR homeobox family.

RefSeq proteins (1): NP_064447* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001356HDDomain
IPR009057Homeodomain-like_sfHomologous_superfamily
IPR017970Homeobox_CSConserved_site
IPR020479HD_metazoaDomain
IPR052145Mediator/Homeobox_domainFamily

Pfam: PF00046

UniProt features (14 total): compositionally biased region 6, helix 3, region of interest 3, chain 1, DNA-binding region 1

Structure

Experimental structures (PDB)

11 structures.

PDBMethodResolution (Å)
8PMFX-RAY DIFFRACTION0.95
8PMNX-RAY DIFFRACTION1.3
8PNAX-RAY DIFFRACTION1.45
8PM7X-RAY DIFFRACTION1.7
8PMCX-RAY DIFFRACTION1.85
8R7FX-RAY DIFFRACTION1.98
8PNCX-RAY DIFFRACTION2.05
8PMVX-RAY DIFFRACTION2.1
8PM5X-RAY DIFFRACTION2.4
8PN4X-RAY DIFFRACTION2.6
8R7ZX-RAY DIFFRACTION3.26

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9NY43-F162.100.15

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 116 (showing top): BENPORATH_ES_WITH_H3K27ME3, GOBP_NEURON_PROJECTION_EXTENSION, GOBP_REGULATION_OF_DEVELOPMENTAL_GROWTH, GOBP_GROWTH, GOBP_NEUROGENESIS, CHX10_01, EFC_Q6, GOBP_TRANSLATION, GOBP_POST_TRANSCRIPTIONAL_REGULATION_OF_GENE_EXPRESSION, GOBP_REGULATION_OF_ANATOMICAL_STRUCTURE_SIZE, OCT1_03, WTGAAAT_UNKNOWN, GRE_C, RYTAAWNNNTGAY_UNKNOWN, GOBP_REGULATION_OF_CELL_SIZE

GO Biological Process (11): cell fate determination (GO:0001709), neuron migration (GO:0001764), regulation of transcription by RNA polymerase II (GO:0006357), regulation of axon extension (GO:0030516), amacrine cell differentiation (GO:0035881), positive regulation of translation (GO:0045727), regulation of DNA-templated transcription (GO:0006355), nervous system development (GO:0007399), neuron differentiation (GO:0030182), cell fate commitment (GO:0045165), positive regulation of transcription by RNA polymerase II (GO:0045944)

GO Molecular Function (7): RNA polymerase II transcription regulatory region sequence-specific DNA binding (GO:0000977), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA-binding transcription activator activity, RNA polymerase II-specific (GO:0001228), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677), DNA-binding transcription factor activity (GO:0003700), protein binding (GO:0005515)

GO Cellular Component (2): chromatin (GO:0000785), nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular developmental process2
generation of neurons2
regulation of DNA-templated transcription2
transcription by RNA polymerase II2
cell differentiation2
regulation of transcription by RNA polymerase II2
transcription cis-regulatory region binding2
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
cell fate commitment1
cell migration1
regulation of developmental growth1
axon extension1
regulation of extent of cell growth1
neural retina development1
central nervous system neuron differentiation1
translation1
regulation of translation1
positive regulation of gene expression1
positive regulation of protein metabolic process1
DNA-templated transcription1
regulation of gene expression1
regulation of RNA biosynthetic process1
system development1
positive regulation of DNA-templated transcription1
chromatin1
DNA-binding transcription factor activity1
DNA-binding transcription factor activity, RNA polymerase II-specific1
DNA-binding transcription activator activity1
positive regulation of transcription by RNA polymerase II1
double-stranded DNA binding1
sequence-specific DNA binding1
nucleic acid binding1
transcription regulator activity1
binding1
chromosome1
cellular anatomical structure1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

1140 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
BARHL2FOXN4Q96NZ1632
BARHL2PTF1AQ7RTS3624
BARHL2BHLHE22Q8NFJ8575
BARHL2NEUROD4Q9HD90508
BARHL2ISL1P20663500
BARHL2ATOH7Q8N100478
BARHL2LHX5Q9H2C1461
BARHL2POU4F2Q12837453
BARHL2PBX3P40426433
BARHL2LHX9Q9NQ69431
BARHL2ZNF608Q9ULD9418
BARHL2TFAP2AP05549412
BARHL2NFIAQ12857412
BARHL2LHX1P48742405
BARHL2NOVA2Q9UNW9404

IntAct

7 interactions, top by confidence:

ABTypeScore
RELBARHL2psi-mi:“MI:0915”(physical association)0.560
BARHL2RELpsi-mi:“MI:0915”(physical association)0.560
PLEKHF2BARHL2psi-mi:“MI:0915”(physical association)0.560
BARHL2PLEKHF2psi-mi:“MI:0915”(physical association)0.000

BioGRID (2): BARHL2 (Two-hybrid), PLEKHF2 (Two-hybrid)

ESM2 similar proteins: A0A8V0YY16, A0JPN1, A7MB54, A8MTJ6, O35762, O42115, O57601, O88181, O95096, P09065, P23683, P28356, P31311, P31315, P32443, P39020, P42581, P42586, P43697, P48031, P49640, P50222, P50476, P52951, P52954, P52955, P78426, P81067, P81068, P97334, Q14549, Q14774, Q1KKY1, Q1XID0, Q2NKI2, Q2VL76, Q2VL80, Q4V5A3, Q5SQQ9, Q60554

Diamond homologs: A0JPN1, A1YG85, A5PKG8, A6NJ46, A6NMT0, A7MB54, A9L937, B0VXK3, D2KQB0, E7FDX5, M0R6D8, O08686, O13023, O35762, O42365, O43364, O43711, O55144, O88181, O93366, O93367, O93590, P0C1T1, P10035, P14652, P14837, P20009, P28468, P31245, P31246, P31261, P31314, P42583, P42584, P43120, P43345, P43688, P50219, P52945, P52950

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

61 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance59
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

581 predictions. Top by Δscore:

VariantEffectΔscore
1:90714528:TA:Tdonor_loss1.0000
1:90714530:CCTG:Cdonor_gain1.0000
1:90716566:CTCA:Cdonor_loss1.0000
1:90716567:TCAC:Tdonor_loss1.0000
1:90716568:CACCG:Cdonor_loss1.0000
1:90716569:A:ACdonor_gain1.0000
1:90716569:AC:Adonor_gain1.0000
1:90716570:C:CCdonor_gain1.0000
1:90716570:C:Gdonor_loss1.0000
1:90716570:CC:Cdonor_gain1.0000
1:90716570:CCG:Cdonor_gain1.0000
1:90713570:CGTA:Cacceptor_gain0.9900
1:90713573:A:Cacceptor_gain0.9900
1:90714529:A:ACdonor_gain0.9900
1:90714530:C:CCdonor_gain0.9900
1:90714755:CC:Cacceptor_gain0.9900
1:90714756:CC:Cacceptor_gain0.9900
1:90714761:C:CTacceptor_gain0.9900
1:90716570:CCGT:Cdonor_gain0.9900
1:90716570:CCGTG:Cdonor_gain0.9900
1:90712623:TCCT:Tacceptor_loss0.9800
1:90712625:C:CCacceptor_gain0.9800
1:90712626:T:Cacceptor_loss0.9800
1:90714187:CTGG:Cdonor_gain0.9800
1:90714529:ACCTG:Adonor_gain0.9800
1:90714530:CCT:Cdonor_gain0.9800
1:90714530:CCTGC:Cdonor_gain0.9800
1:90714540:TGG:Tdonor_gain0.9800
1:90714754:TCCC:Tacceptor_loss0.9800
1:90714755:CCCT:Cacceptor_loss0.9800

AlphaMissense

2538 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
1:90712581:C:GA299P1.000
1:90712583:A:GL298P1.000
1:90712583:A:TL298Q1.000
1:90712586:A:GL297S1.000
1:90712589:T:AE296V1.000
1:90712590:C:TE296K1.000
1:90712592:A:GL295P1.000
1:90712595:C:TG294D1.000
1:90712596:C:GG294R1.000
1:90712602:C:GA292P1.000
1:90712606:C:AQ290H1.000
1:90712606:C:GQ290H1.000
1:90712607:T:GQ290P1.000
1:90712610:C:GR289P1.000
1:90712611:G:AR289W1.000
1:90712611:G:CR289G1.000
1:90712612:C:AK288N1.000
1:90712612:C:GK288N1.000
1:90712613:T:AK288M1.000
1:90712613:T:GK288T1.000
1:90712614:T:CK288E1.000
1:90712614:T:GK288Q1.000
1:90712615:C:AW287C1.000
1:90712615:C:GW287C1.000
1:90712616:C:GW287S1.000
1:90712617:A:GW287R1.000
1:90712617:A:TW287R1.000
1:90712618:C:AK286N1.000
1:90712618:C:GK286N1.000
1:90712619:T:AK286M1.000

dbSNP variants (sampled 300 via entrez): RS1000039509 (1:90711931 T>C), RS1000559086 (1:90711787 T>C), RS1001097037 (1:90715329 T>C), RS1001159235 (1:90712838 C>T), RS1001444998 (1:90713080 G>A), RS1001528534 (1:90715623 T>C), RS1001997116 (1:90712449 G>A), RS1002337210 (1:90719199 A>C,G,T), RS1002351437 (1:90712260 G>A,C,T), RS1003045950 (1:90715791 G>T), RS1003162277 (1:90715471 C>A), RS1003256238 (1:90714040 G>A), RS1003500299 (1:90718895 C>A,G), RS1003555677 (1:90711697 A>T), RS1003617798 (1:90711143 C>A)

Disease associations

OMIM: gene MIM:605212 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

26 associations (top):

StudyTraitp-value
GCST004077_3Cognitive function3.000000e-08
GCST005316_298Intelligence (MTAG)2.000000e-12
GCST005830_48Hand grip strength4.000000e-08
GCST006041_41Major depressive disorder3.000000e-06
GCST006920_3Regular attendance at a gym or sports club2.000000e-06
GCST006921_1Regular attendance at a pub or social club2.000000e-13
GCST006922_3Regular attendance at a religious group2.000000e-10
GCST007323_1Risk-taking tendency (4-domain principal component model)3.000000e-09
GCST007324_85Adventurousness4.000000e-09
GCST007325_117General risk tolerance (MTAG)2.000000e-11
GCST007326_101Number of sexual partners9.000000e-12
GCST007327_160Smoking status (ever vs never smokers)5.000000e-11
GCST007335_1Age at first sexual intercourse1.000000e-08
GCST007565_127Morning person4.000000e-21
GCST007576_21Chronotype4.000000e-21
GCST007603_25Smoking initiation1.000000e-08
GCST008595_8Cognitive ability, years of educational attainment or schizophrenia (pleiotropy)3.000000e-13
GCST008810_66Smoking initiation (ever regular vs never regular)7.000000e-10
GCST009523_2Household income2.000000e-09
GCST009524_155Household income (MTAG)3.000000e-12
GCST009524_237Household income (MTAG)2.000000e-15
GCST010002_362Refractive error1.000000e-30
GCST010988_245Adult body size2.000000e-09
GCST011703_38Smoking initiation7.000000e-11
GCST011704_2Smoking status (current vs never)2.000000e-08
GCST90000047_11Age at first sexual intercourse2.000000e-24

EFO canonical traits (11, from GWAS)

EFO IDTrait name
EFO:0004337intelligence
EFO:0006941grip strength measurement
EFO:0009592social interaction measurement
EFO:0008579risk-taking behaviour
EFO:0004318smoking behavior
EFO:0009749age at first sexual intercourse measurement
EFO:0008328chronotype measurement
EFO:0005670smoking initiation
EFO:0004784self reported educational attainment
EFO:0009695household income
EFO:0006527smoking status measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

11 total (human), top 11 by PubMed support.

ChemicalActions (top 5)PubMed papers
arseniteincreases methylation1
butyraldehydedecreases expression1
CGP 52608affects binding, increases reaction1
Resveratrolaffects cotreatment, decreases expression1
Benzo(a)pyreneaffects methylation1
Dimethyl Sulfoxideaffects expression1
Plant Extractsaffects cotreatment, decreases expression1
1-Methyl-4-phenylpyridiniumdecreases expression1
Aflatoxin B1decreases methylation1
Copper Sulfateincreases expression1
Acrylamidedecreases expression1

Cellosaurus cell lines

3 cell lines: 3 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A0G3SEES3-1V human BARHL2, clone1Embryonic stem cellMale
CVCL_A0G4SEES3-1V human BARHL2, clone2Embryonic stem cellMale
CVCL_A0G5SEES3-1V human BARHL2, clone3Embryonic stem cellMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.