BSX

gene
On this page

Also known as BSX1

Summary

BSX (brain specific homeobox, HGNC:20450) is a protein-coding gene on chromosome 11q24.1, encoding Brain-specific homeobox protein homolog (Q3C1V8). DNA binding protein that function as transcriptional activator.

Enables sequence-specific double-stranded DNA binding activity. Predicted to be involved in cell differentiation; regulation of transcription by RNA polymerase II; and stem cell population maintenance. Predicted to act upstream of or within several processes, including eating behavior; mammary gland involution; and positive regulation of transcription by RNA polymerase II. Predicted to be located in chromatin. Predicted to be part of transcription regulator complex.

Source: NCBI Gene 390259 — RefSeq curated summary.

At a glance

  • GWAS associations: 3
  • Clinical variants (ClinVar): 29 total
  • MANE Select transcript: NM_001098169

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:20450
Approved symbolBSX
Namebrain specific homeobox
Location11q24.1
Locus typegene with protein product
StatusApproved
AliasesBSX1
Ensembl geneENSG00000188909
Ensembl biotypeprotein_coding
OMIM611074
Entrez390259

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000343035

RefSeq mRNA: 1 — MANE Select: NM_001098169 NM_001098169

CCDS: CCDS41728

Canonical transcript exons

ENST00000343035 — 3 exons

ExonStartEnd
ENSE00001369670122979261122979457
ENSE00001379779122981410122981834
ENSE00001381774122977570122977891

Expression profiles

Bgee: expression breadth tissue_specific, 4 present calls, max score 44.54.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.5923 / max 356.0552, expressed in 37 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
1228740.464632
1228750.102115
1228760.02565

Top tissues by expression

125 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099144.54silver quality
sural nerveUBERON:001548844.53gold quality
hypothalamusUBERON:000189840.30gold quality
colonic epitheliumUBERON:000039737.20gold quality
ventricular zoneUBERON:000305336.48gold quality
cortical plateUBERON:000534336.47gold quality
bone marrow cellCL:000209236.16gold quality
ganglionic eminenceUBERON:000402335.49gold quality
skeletal muscle tissueUBERON:000113433.38gold quality
bone marrowUBERON:000237131.74gold quality
muscle tissueUBERON:000238531.06gold quality
prefrontal cortexUBERON:000045129.90gold quality
stromal cell of endometriumCL:000225529.87gold quality
lymph nodeUBERON:000002928.92gold quality
liverUBERON:000210728.81gold quality
tonsilUBERON:000237228.53gold quality
duodenumUBERON:000211428.14gold quality
urinary bladderUBERON:000125527.87gold quality
leukocyteCL:000073827.32gold quality
monocyteCL:000057627.29gold quality
bloodUBERON:000017826.48gold quality
vermiform appendixUBERON:000115426.42gold quality
gall bladderUBERON:000211025.98gold quality
olfactory segment of nasal mucosaUBERON:000538625.89gold quality
placentaUBERON:000198725.81gold quality
right adrenal gland cortexUBERON:003582725.31gold quality
frontal cortexUBERON:000187025.07gold quality
primary visual cortexUBERON:000243624.61gold quality
superior frontal gyrusUBERON:000266124.08gold quality
gastrocnemiusUBERON:000138823.80gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no0.98

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

4 targets.

TargetRegulation
AGRPUnknown
APP
MT-ATP6
NPYUnknown

JASPAR motifs

MotifNameFamily
MA0876.1BSXNK
MA0876.2BSXNK

JASPAR matrix evidence (PMIDs): PMID:18585359

Upstream regulators (CollecTRI, top): REST, SP1

Literature-anchored findings (GeneRIF, showing 2)

  • Data suggest that BSX is essential for global cognitive function and that haplo-insufficiency may cause severe mental retardation, and that deletion of Neurogranin (PMID:18855024)
  • Genetic ablation of the Bsx homeodomain transcription factor in zebrafish: Impact on mature pineal gland morphology and circadian behavior. (PMID:35249239)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
mus_musculusBsxENSMUSG00000054360
rattus_norvegicusBsxENSRNOG00000024438
drosophila_melanogasterDllFBGN0000157
caenorhabditis_elegansWBGENE00000463

Paralogs (9): DLX6 (ENSG00000006377), DLX3 (ENSG00000064195), DLX5 (ENSG00000105880), DLX4 (ENSG00000108813), NANOG (ENSG00000111704), DLX2 (ENSG00000115844), DLX1 (ENSG00000144355), VENTX (ENSG00000151650), NANOGP8 (ENSG00000255192)

Protein

Protein identifiers

Brain-specific homeobox protein homologQ3C1V8 (reviewed: Q3C1V8)

All UniProt accessions (1): Q3C1V8

UniProt curated annotations — full annotation on UniProt →

Function. DNA binding protein that function as transcriptional activator. Is essential for normal postnatal growth and nursing. Is an essential factor for neuronal neuropeptide Y and agouti-related peptide function and locomotory behavior in the control of energy balance.

Subcellular location. Nucleus.

Similarity. Belongs to the distal-less homeobox family.

RefSeq proteins (1): NP_001091639* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001356HDDomain
IPR009057Homeodomain-like_sfHomologous_superfamily
IPR017970Homeobox_CSConserved_site
IPR020479HD_metazoaDomain
IPR050848Homeobox_TFFamily

Pfam: PF00046

UniProt features (7 total): compositionally biased region 3, chain 1, DNA-binding region 1, region of interest 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q3C1V8-F168.920.27

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 38 (showing top): GOBP_MAMMARY_GLAND_MORPHOGENESIS, GOBP_GLAND_MORPHOGENESIS, GOBP_BEHAVIOR, GOBP_ANIMAL_ORGAN_MORPHOGENESIS, GOBP_EATING_BEHAVIOR, GOBP_MAMMARY_GLAND_DEVELOPMENT, GOBP_MAINTENANCE_OF_CELL_NUMBER, GOCC_TRANSCRIPTION_REGULATOR_COMPLEX, GOBP_FEEDING_BEHAVIOR, MATZUK_POSTIMPLANTATION_AND_POSTPARTUM, chr11q24, GOMF_SEQUENCE_SPECIFIC_DNA_BINDING, GOBP_TISSUE_REMODELING, MIKKELSEN_MCV6_ICP_WITH_H3K27ME3, MIKKELSEN_MEF_ICP_WITH_H3K27ME3

GO Biological Process (8): regulation of transcription by RNA polymerase II (GO:0006357), locomotory behavior (GO:0007626), stem cell population maintenance (GO:0019827), cell differentiation (GO:0030154), eating behavior (GO:0042755), positive regulation of transcription by RNA polymerase II (GO:0045944), mammary gland involution (GO:0060056), regulation of DNA-templated transcription (GO:0006355)

GO Molecular Function (5): RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677), sequence-specific DNA binding (GO:0043565)

GO Cellular Component (3): chromatin (GO:0000785), nucleus (GO:0005634), transcription regulator complex (GO:0005667)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
transcription by RNA polymerase II2
regulation of transcription by RNA polymerase II2
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
regulation of DNA-templated transcription1
behavior1
multicellular organismal process1
maintenance of cell number1
cellular developmental process1
feeding behavior1
positive regulation of DNA-templated transcription1
tissue remodeling1
mammary gland morphogenesis1
DNA-templated transcription1
regulation of gene expression1
regulation of RNA biosynthetic process1
cis-regulatory region sequence-specific DNA binding1
chromatin1
DNA-binding transcription factor activity1
double-stranded DNA binding1
sequence-specific DNA binding1
nucleic acid binding1
DNA binding1
chromosome1
cellular anatomical structure1
intracellular membrane-bounded organelle1
protein-containing complex1

Protein interactions and networks

STRING

332 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
BSXAGRPO00253791
BSXNPYP01303726
BSXLEPP41159549
BSXLHX9Q9NQ69430
BSXITPKAP23677425
BSXLHX2P50458421
BSXDMRTA1Q5VZB9412
BSXPMCHP20382408
BSXFEZF2Q8TBJ5401
BSXFEZF1A0PJY2399
BSXBANF1O75531397
BSXSIM1P81133382
BSXLHX4Q969G2376
BSXNHLH2Q02577376
BSXNR5A1Q13285362

IntAct

2 interactions, top by confidence:

ABTypeScore
BSXNPM1psi-mi:“MI:0915”(physical association)0.400

BioGRID (1): BSX (Proximity Label-MS)

ESM2 similar proteins: A1YF16, A1YG93, A2RU54, A2T764, A2T7P4, O02786, O42115, O43711, O57601, P13297, P23410, P28360, P28362, P35548, P35993, P42580, P43120, P43687, P50223, P63156, P63157, P70354, Q01703, Q03014, Q03356, Q03358, Q04281, Q05502, Q0P4W6, Q0P5C3, Q2VL76, Q2VL77, Q2VL78, Q2VL79, Q2VL80, Q2VL82, Q2VL83, Q2VL84, Q2VL85, Q2VL86

Diamond homologs: A0JPN1, A1YG85, A5PKG8, A6NJ46, A6NMT0, A7MB54, A9L937, B0VXK3, D2KQB0, E7FDX5, M0R6D8, O08686, O13023, O35762, O42365, O43364, O43711, O55144, O88181, O93366, O93367, O93590, P0C1T1, P10035, P14652, P14837, P20009, P28468, P31245, P31246, P31261, P31314, P42583, P42584, P43120, P43345, P43688, P50219, P52945, P52950

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

29 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance25
Likely benign1
Benign3

Top pathogenic / likely-pathogenic (0)

SpliceAI

343 predictions. Top by Δscore:

VariantEffectΔscore
11:122977899:C:CTacceptor_gain1.0000
11:122977892:C:CAacceptor_loss0.9900
11:122977892:C:CCacceptor_gain0.9900
11:122977900:G:Tacceptor_gain0.9900
11:122979243:C:CAdonor_gain0.9900
11:122979453:CATCC:Cacceptor_gain0.9900
11:122979455:TCC:Tacceptor_gain0.9900
11:122979456:CC:Cacceptor_gain0.9900
11:122979456:CCC:Cacceptor_gain0.9900
11:122979457:CC:Cacceptor_gain0.9900
11:122979458:C:CCacceptor_gain0.9900
11:122979552:T:TAdonor_gain0.9900
11:122979598:AAG:Adonor_gain0.9900
11:122980352:A:ACdonor_gain0.9900
11:122980352:AGGG:Adonor_gain0.9900
11:122980353:G:Cdonor_gain0.9900
11:122981405:CTT:Cdonor_loss0.9900
11:122981407:TACCC:Tdonor_loss0.9900
11:122981408:A:ACdonor_gain0.9900
11:122981408:AC:Adonor_gain0.9900
11:122981409:C:CCdonor_gain0.9900
11:122981409:CC:Cdonor_gain0.9900
11:122977887:TTCAC:Tacceptor_gain0.9800
11:122977889:CAC:Cacceptor_gain0.9800
11:122979255:GCCCA:Gdonor_loss0.9800
11:122979256:CCCAC:Cdonor_loss0.9800
11:122979257:CCA:Cdonor_loss0.9800
11:122979258:CACC:Cdonor_loss0.9800
11:122979259:AC:Adonor_loss0.9800
11:122979260:CCTGC:Cdonor_loss0.9800

AlphaMissense

1501 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
11:122977853:T:AK166N1.000
11:122977853:T:GK166N1.000
11:122977854:T:AK166I1.000
11:122977855:T:CK166E1.000
11:122977859:C:AK164N1.000
11:122977859:C:GK164N1.000
11:122977861:T:CK164E1.000
11:122977862:C:AM163I1.000
11:122977862:C:GM163I1.000
11:122977862:C:TM163I1.000
11:122977863:A:GM163T1.000
11:122977863:A:TM163K1.000
11:122977867:G:CR162G1.000
11:122977869:C:GR161P1.000
11:122977870:G:CR161G1.000
11:122977871:G:CN160K1.000
11:122977871:G:TN160K1.000
11:122977872:T:AN160I1.000
11:122977872:T:CN160S1.000
11:122977872:T:GN160T1.000
11:122977873:T:CN160D1.000
11:122977873:T:GN160H1.000
11:122977874:C:AQ159H1.000
11:122977874:C:GQ159H1.000
11:122977875:T:GQ159P1.000
11:122977876:G:TQ159K1.000
11:122977877:G:CF158L1.000
11:122977877:G:TF158L1.000
11:122977878:A:CF158C1.000
11:122977878:A:GF158S1.000

dbSNP variants (sampled 300 via entrez): RS1000285800 (11:122977977 G>A,T), RS1001055045 (11:122979896 A>G), RS1001486079 (11:122980157 C>T), RS1001543553 (11:122983176 A>G), RS1002021142 (11:122983328 G>A,C,T), RS1002121740 (11:122978403 G>A), RS1002472688 (11:122978183 C>T), RS1002869672 (11:122979619 A>G), RS1003612183 (11:122978059 A>C), RS1004823505 (11:122983360 C>A), RS1004895848 (11:122981942 G>A), RS1004981303 (11:122977290 G>T), RS1005076155 (11:122977481 G>A,C), RS1005208958 (11:122977249 G>A), RS1005238943 (11:122981878 C>T)

Disease associations

OMIM: gene MIM:611074 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

3 associations (top):

StudyTraitp-value
GCST000880_40Menarche (age at onset)2.000000e-12
GCST002541_91Menarche (age at onset)3.000000e-26
GCST012227_633Hip circumference adjusted for BMI6.000000e-10

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0004703age at menarche
EFO:0008039BMI-adjusted hip circumference

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

10 total (human), top 10 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, increases methylation2
Aflatoxin B1decreases methylation2
Cadmium Chloridedecreases expression, increases abundance, increases expression2
Resveratroldecreases expression, affects cotreatment1
Cadmiumdecreases expression, increases abundance1
Phthalic Acidsdecreases methylation1
Plant Extractsaffects cotreatment, decreases expression1
Cyclosporinedecreases methylation1
Asbestos, Crocidolitedecreases methylation1
Asbestos, Amositedecreases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.