NOTO

gene
On this page

Summary

NOTO (notochord homeobox, HGNC:31839) is a protein-coding gene on chromosome 2p13.2, encoding Homeobox protein notochord (A8MTQ0). Transcription regulator acting downstream of both FOXA2 and Brachyury (T) during notochord development.

Enables sequence-specific double-stranded DNA binding activity. Predicted to be involved in brain development; neuron differentiation; and regulation of transcription by RNA polymerase II. Predicted to act upstream of or within several processes, including embryonic organ development; motile cilium assembly; and regulation of cilium assembly. Predicted to be located in chromatin. Predicted to be active in nucleus.

Source: NCBI Gene 344022 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 49 total
  • Transcription factor: yes — 15 downstream targets (CollecTRI)
  • MANE Select transcript: NM_001134462

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:31839
Approved symbolNOTO
Namenotochord homeobox
Location2p13.2
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000214513
Ensembl biotypeprotein_coding
OMIM620695
Entrez344022

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000398468

RefSeq mRNA: 1 — MANE Select: NM_001134462 NM_001134462

CCDS: CCDS46335

Canonical transcript exons

ENST00000398468 — 3 exons

ExonStartEnd
ENSE000015332497321077173212513
ENSE000015332517320840073208614
ENSE000015332547320257473203048

Expression profiles

Bgee: expression breadth tissue_specific, 4 present calls, max score 48.77.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.2564 / max 45.7923, expressed in 60 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
209410.256460

Top tissues by expression

116 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
stromal cell of endometriumCL:000225548.77gold quality
duodenumUBERON:000211443.13silver quality
colonic epitheliumUBERON:000039741.54gold quality
mucosa of transverse colonUBERON:000499140.62silver quality
bone marrowUBERON:000237138.64gold quality
bone marrow cellCL:000209238.01gold quality
skeletal muscle tissueUBERON:000113437.69gold quality
muscle tissueUBERON:000238537.68gold quality
ganglionic eminenceUBERON:000402337.16gold quality
ventricular zoneUBERON:000305336.48gold quality
cortical plateUBERON:000534336.47gold quality
calcaneal tendonUBERON:000370136.38gold quality
rectumUBERON:000105235.74silver quality
right coronary arteryUBERON:000162535.44gold quality
sural nerveUBERON:001548835.30gold quality
apex of heartUBERON:000209835.17gold quality
leukocyteCL:000073834.80gold quality
bloodUBERON:000017834.54gold quality
monocyteCL:000057634.42gold quality
liverUBERON:000210733.97gold quality
mucosa of stomachUBERON:000119933.86gold quality
cerebellumUBERON:000203733.54gold quality
uterine cervixUBERON:000000233.44gold quality
cerebellar cortexUBERON:000212933.37gold quality
stomachUBERON:000094533.33gold quality
cerebellar hemisphereUBERON:000224533.22gold quality
smooth muscle tissueUBERON:000113533.08gold quality
right hemisphere of cerebellumUBERON:001489033.01gold quality
body of stomachUBERON:000116132.57gold quality
endometriumUBERON:000129532.36gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes2.63

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

15 targets.

TargetRegulation
ADAM2
ALG3
BTK
CAT
CCR4
CD74
ENO2
EOMES
LCK
NOTO
SUPT7L
TOP2B
TPM1
USP21
WT1

JASPAR motifs

MotifNameFamily
MA0710.1NOTONK
MA0710.2NOTONK

JASPAR matrix evidence (PMIDs): PMID:18585360

Upstream regulators (CollecTRI, top): AP1, NFKB, NOTO

miRNA regulators (miRDB)

10 targeting NOTO, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-LET-7F-2-3P99.9870.982588
HSA-MIR-1185-1-3P99.9871.042593
HSA-MIR-1185-2-3P99.9871.042593
HSA-MIR-145-5P99.9271.131836
HSA-MIR-5195-3P99.9270.921877
HSA-MIR-449399.9066.48977
HSA-MIR-808099.8267.521342
HSA-MIR-431699.3765.751360
HSA-MIR-10B-3P99.0466.98988
HSA-MIR-55495.2066.98341

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_rerionotoENSDARG00000021201
mus_musculusNotoENSMUSG00000068302
rattus_norvegicusNotoENSRNOG00000037863
drosophila_melanogasterCG18599FBGN0038592
caenorhabditis_elegansceh-5WBGENE00000430

Paralogs (1): GSX1 (ENSG00000169840)

Protein

Protein identifiers

Homeobox protein notochordA8MTQ0 (reviewed: A8MTQ0)

All UniProt accessions (1): A8MTQ0

UniProt curated annotations — full annotation on UniProt →

Function. Transcription regulator acting downstream of both FOXA2 and Brachyury (T) during notochord development. Required for node morphogenesis. Is essential for cilia formation in the posterior notochord (PNC) and for left-right patterning; acts upstream of FOXJ1 and RFX3 in this process and is required for the expression of various components important for axonemal assembly and function. Plays a role in regulating axial versus paraxial cell fate. Activates the transcription of ciliary proteins C11orf97 homolog, FAM183B and SPACA9 in the embryonic ventral node.

Subcellular location. Nucleus.

RefSeq proteins (1): NP_001127934* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001356HDDomain
IPR009057Homeodomain-like_sfHomologous_superfamily
IPR050877EMX-VAX-Noto_Homeobox_TFsFamily

Pfam: PF00046

UniProt features (9 total): compositionally biased region 3, region of interest 2, sequence variant 2, chain 1, DNA-binding region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-A8MTQ0-F168.690.26

Function

Pathways and Gene Ontology

Reactome pathways

1 pathways

IDPathway
R-HSA-9796292Formation of axial mesoderm

MSigDB gene sets: 54 (showing top): GOBP_MORPHOGENESIS_OF_AN_EPITHELIUM, GOBP_EPITHELIUM_DEVELOPMENT, GOBP_NEUROGENESIS, GOBP_REGULATION_OF_CELLULAR_COMPONENT_BIOGENESIS, GOBP_DORSAL_VENTRAL_PATTERN_FORMATION, GOBP_SPECIFICATION_OF_SYMMETRY, GOBP_ANIMAL_ORGAN_MORPHOGENESIS, GOBP_REGULATION_OF_CELL_PROJECTION_ORGANIZATION, GOBP_EMBRYONIC_HEART_TUBE_DEVELOPMENT, GOBP_EMBRYONIC_ORGAN_MORPHOGENESIS, GOBP_HEART_MORPHOGENESIS, GOBP_EMBRYONIC_PATTERN_SPECIFICATION, GOBP_CILIUM_ORGANIZATION, GOBP_ORGANELLE_ASSEMBLY, GOBP_HEAD_DEVELOPMENT

GO Biological Process (11): heart looping (GO:0001947), regulation of transcription by RNA polymerase II (GO:0006357), central nervous system development (GO:0007417), brain development (GO:0007420), embryonic pattern specification (GO:0009880), dorsal/ventral pattern formation (GO:0009953), neuron differentiation (GO:0030182), notochord development (GO:0030903), motile cilium assembly (GO:0044458), regulation of cilium assembly (GO:1902017), determination of left/right symmetry (GO:0007368)

GO Molecular Function (5): RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677), protein binding (GO:0005515)

GO Cellular Component (2): chromatin (GO:0000785), nucleus (GO:0005634)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
Gastrulation1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cilium assembly2
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
embryonic heart tube morphogenesis1
determination of heart left/right asymmetry1
regulation of DNA-templated transcription1
transcription by RNA polymerase II1
nervous system development1
system development1
central nervous system development1
animal organ development1
head development1
pattern specification process1
embryo development1
regionalization1
cell differentiation1
generation of neurons1
embryonic organ development1
regulation of plasma membrane bounded cell projection assembly1
regulation of organelle assembly1
determination of bilateral symmetry1
left/right pattern formation1
cis-regulatory region sequence-specific DNA binding1
chromatin1
DNA-binding transcription factor activity1
regulation of transcription by RNA polymerase II1
double-stranded DNA binding1
sequence-specific DNA binding1
nucleic acid binding1
binding1
chromosome1
cellular anatomical structure1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

502 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
NOTOPRAMEF18Q5VWM3447
NOTOFOXA2Q9Y261394
NOTOPRADC1Q9BSG0370
NOTOHECTD3Q5T447369
NOTOKLRG2A4D1S0347
NOTOMCIDASD6RGH6337
NOTOFOXJ1Q92949336
NOTOTBX6O95947325
NOTOCHRDQ9H2X0320
NOTOMRPL45Q9BRJ2320
NOTOPALD1Q9ULE6319
NOTOACSM4P0C7M7310
NOTORFX3P48380308
NOTOPTCD3Q96EY7298
NOTOMRPL46Q9H2W6298

IntAct

103 interactions, top by confidence:

ABTypeScore
NOTOKRTAP6-1psi-mi:“MI:0915”(physical association)0.560
NOTOKRTAP19-7psi-mi:“MI:0915”(physical association)0.560
NOTOFAM9Bpsi-mi:“MI:0915”(physical association)0.560
NOTOTTC23psi-mi:“MI:0915”(physical association)0.560
NOTOAPOBEC1psi-mi:“MI:0915”(physical association)0.560
NOTONKX2-4psi-mi:“MI:0915”(physical association)0.560
NOTOKRTAP5-6psi-mi:“MI:0915”(physical association)0.560
NOTOLDLRAP1psi-mi:“MI:0915”(physical association)0.560
NOTOIP6K3psi-mi:“MI:0915”(physical association)0.560
NOTOCHAF1Apsi-mi:“MI:0915”(physical association)0.560
NOTOLRP10psi-mi:“MI:0915”(physical association)0.560
NOTOBTF3psi-mi:“MI:0915”(physical association)0.560
NOTOPOU4F2psi-mi:“MI:0915”(physical association)0.560
NOTOCEP19psi-mi:“MI:0915”(physical association)0.560
NOTOFOXD2psi-mi:“MI:0915”(physical association)0.560
NOTOKRT34psi-mi:“MI:0915”(physical association)0.560
NOTOHNRNPKpsi-mi:“MI:0915”(physical association)0.560
KRTAP6-3NOTOpsi-mi:“MI:0915”(physical association)0.560
NOTOANKRD11psi-mi:“MI:0915”(physical association)0.560
NOTOMAGED1psi-mi:“MI:0915”(physical association)0.560
NOTOCCDC120psi-mi:“MI:0915”(physical association)0.560
NOTOGLYCTKpsi-mi:“MI:0915”(physical association)0.560
NOTOBAG3psi-mi:“MI:0915”(physical association)0.560
NOTOFOXH1psi-mi:“MI:0915”(physical association)0.560
NOTOEIPR1psi-mi:“MI:0915”(physical association)0.560
NOTOMEIS2psi-mi:“MI:0915”(physical association)0.560

BioGRID (35): NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid), NOTO (Two-hybrid)

ESM2 similar proteins: A1YEV8, A1YF08, A1YG25, A1YG85, A2RU54, A2T711, A2T756, A8MTQ0, O14813, O15522, O35160, O35602, O43763, O70218, P28360, P42580, P43687, P50223, P52945, P52946, P52947, P70118, P70354, P81062, Q06348, Q2VL79, Q2VL84, Q2VL85, Q2VL87, Q2VL88, Q61663, Q62066, Q62782, Q6XYB7, Q7YRX0, Q96IS3, Q99811, Q9DED6, Q9ER42, Q9GK08

Diamond homologs: A1A546, A1L2P5, A6NNA5, A6YP92, A8DT10, A8MTQ0, G5EC89, G5ECT8, O09113, O14813, O15266, O35085, O35137, O35690, O35750, O42115, O42201, O42370, O42567, O43365, O60902, O70137, O93353, O93590, O95076, O97039, P02831, P09027, P0DMV5, P18488, P20269, P23759, P23760, P24610, P31249, P31316, P31357, P47239, P50901, P56407

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 32 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Keratinization516.4×6e-04

Disease & clinical

Clinical variants and AI predictions

ClinVar

49 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance44
Likely benign4
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

438 predictions. Top by Δscore:

VariantEffectΔscore
2:73208618:G:GTdonor_gain1.0000
2:73205735:GCT:Gdonor_gain0.9900
2:73208398:AG:Aacceptor_gain0.9900
2:73208399:GG:Gacceptor_gain0.9900
2:73208562:TGGGG:Tdonor_gain0.9900
2:73210764:A:Gacceptor_gain0.9900
2:73210765:TTATA:Tacceptor_loss0.9900
2:73210766:TATA:Tacceptor_loss0.9900
2:73210768:TA:Tacceptor_loss0.9900
2:73210770:G:GTacceptor_loss0.9900
2:73210770:GGT:Gacceptor_gain0.9900
2:73210770:GGTGA:Gacceptor_gain0.9900
2:73208610:ACCAG:Adonor_loss0.9800
2:73208611:CCAG:Cdonor_loss0.9800
2:73208612:CAGGT:Cdonor_loss0.9800
2:73208613:AGGTG:Adonor_loss0.9800
2:73208614:GG:Gdonor_loss0.9800
2:73208615:G:GAdonor_loss0.9800
2:73208616:T:Adonor_loss0.9800
2:73210769:A:AGacceptor_gain0.9800
2:73210770:G:GGacceptor_gain0.9800
2:73208618:G:Tdonor_gain0.9700
2:73202977:GC:Gdonor_gain0.9600
2:73208397:TAGGG:Tacceptor_loss0.9600
2:73208398:A:Cacceptor_loss0.9600
2:73208399:GGGTT:Gacceptor_gain0.9600
2:73210763:A:AGacceptor_gain0.9600
2:73203626:GCTCT:Gdonor_gain0.9500
2:73208398:A:AGacceptor_gain0.9500
2:73208399:G:GGacceptor_gain0.9500

AlphaMissense

1589 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:73208504:T:CF163L0.999
2:73208506:T:AF163L0.999
2:73208506:T:GF163L0.999
2:73208540:T:CF175L0.999
2:73208542:T:AF175L0.999
2:73208542:T:GF175L0.999
2:73210783:T:CF204L0.999
2:73210785:C:AF204L0.999
2:73210785:C:GF204L0.999
2:73208529:T:CL171S0.998
2:73210784:T:CF204S0.997
2:73210789:A:GN206D0.997
2:73208505:T:CF163S0.996
2:73208505:T:GF163C0.996
2:73210782:G:CW203C0.996
2:73210782:G:TW203C0.996
2:73210790:A:GN206S0.996
2:73210793:G:CR207P0.996
2:73208541:T:CF175S0.995
2:73208541:T:GF175C0.995
2:73210780:T:AW203R0.995
2:73210780:T:CW203R0.995
2:73210792:C:AR207S0.995
2:73208601:T:CL195P0.994
2:73210784:T:GF204C0.994
2:73210790:A:CN206T0.994
2:73210791:C:AN206K0.994
2:73210791:C:GN206K0.994
2:73210797:G:CR208S0.994
2:73210797:G:TR208S0.994

dbSNP variants (sampled 300 via entrez): RS1000285024 (2:73206984 G>A), RS1000387681 (2:73207398 G>C,T), RS1000713925 (2:73208203 C>T), RS1000902692 (2:73211799 A>G), RS1001286465 (2:73205475 G>A), RS1001926724 (2:73208605 A>T), RS1001938402 (2:73210781 G>A,T), RS1001979042 (2:73208972 G>A), RS1002101800 (2:73202275 G>C), RS1002263111 (2:73201908 CTCAAATCAAA>C,CTCAAATCAAATCAAA,CTCAAATCAAATCAAATCAAA), RS1002288778 (2:73204161 C>G,T), RS1002307386 (2:73210543 C>T), RS1002424612 (2:73204400 C>T), RS1002907415 (2:73209202 G>A), RS1002929802 (2:73210104 A>G)

Disease associations

OMIM: gene MIM:620695 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST005316_336Intelligence (MTAG)2.000000e-10
GCST006269_1111General cognitive ability4.000000e-10

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004337intelligence

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

6 total (human), top 6 by PubMed support.

ChemicalActions (top 5)PubMed papers
CGP 52608affects binding, increases reaction1
nutlin 3affects cotreatment, increases expression1
Benzo(a)pyreneaffects methylation1
Dactinomycinaffects cotreatment, increases expression1
Asbestos, Crocidolitedecreases methylation1
Asbestos, Amositedecreases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.