MSGN1

gene
On this page

Also known as pMesogenin1

Summary

MSGN1 (mesogenin 1, HGNC:14907) is a protein-coding gene on chromosome 2p24.2, encoding Mesogenin-1 (A6NI15). Involved in specifying the paraxial, but not dorsal, mesoderm.

Enables sequence-specific double-stranded DNA binding activity. Predicted to be involved in mesoderm formation and regulation of transcription by RNA polymerase II. Predicted to act upstream of or within segment specification and somitogenesis. Predicted to be located in chromatin. Predicted to be active in nucleus.

Source: NCBI Gene 343930 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): skeletal dysplasia (Limited, GenCC)
  • GWAS associations: 3
  • Clinical variants (ClinVar): 38 total
  • MANE Select transcript: NM_001105569

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:14907
Approved symbolMSGN1
Namemesogenin 1
Location2p24.2
Locus typegene with protein product
StatusApproved
AliasespMesogenin1
Ensembl geneENSG00000151379
Ensembl biotypeprotein_coding
OMIM612209
Entrez343930

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000281047

RefSeq mRNA: 1 — MANE Select: NM_001105569 NM_001105569

CCDS: CCDS42657

Canonical transcript exons

ENST00000281047 — 1 exons

ExonStartEnd
ENSE000009992651781646017817798

Expression profiles

Bgee: expression breadth broad, 16 present calls, max score 69.29.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0977 / max 14.3132, expressed in 41 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
190960.097741

Top tissues by expression

121 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
bone marrow cellCL:000209269.29silver quality
colonic epitheliumUBERON:000039750.23gold quality
olfactory segment of nasal mucosaUBERON:000538648.85gold quality
bone marrowUBERON:000237148.67silver quality
esophagus mucosaUBERON:000246942.69gold quality
lower esophagus mucosaUBERON:003583441.42silver quality
prefrontal cortexUBERON:000045137.97gold quality
minor salivary glandUBERON:000183037.00gold quality
skeletal muscle tissueUBERON:000113436.58gold quality
ventricular zoneUBERON:000305336.48gold quality
cortical plateUBERON:000534336.47gold quality
saliva-secreting glandUBERON:000104436.30gold quality
ganglionic eminenceUBERON:000402335.49gold quality
right uterine tubeUBERON:000130234.49gold quality
tonsilUBERON:000237234.28gold quality
adrenal tissueUBERON:001830333.77gold quality
muscle tissueUBERON:000238533.41gold quality
esophagusUBERON:000104332.51silver quality
hindlimb stylopod muscleUBERON:000425232.15gold quality
frontal cortexUBERON:000187031.86silver quality
liverUBERON:000210731.48silver quality
sural nerveUBERON:001548830.93gold quality
uterine cervixUBERON:000000230.16silver quality
stromal cell of endometriumCL:000225529.87gold quality
vermiform appendixUBERON:000115429.50gold quality
lymph nodeUBERON:000002928.82gold quality
monocyteCL:000057628.66gold quality
leukocyteCL:000073828.56gold quality
ectocervixUBERON:001224928.49silver quality
duodenumUBERON:000211428.14gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-MTAB-9388yes383.64
E-ANND-3no1.15

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

1 targets.

TargetRegulation
WNT3A

JASPAR motifs

MotifNameFamily
MA1524.1MSGN1Tal-related

JASPAR matrix evidence (PMIDs): PMID:21750544

Upstream regulators (CollecTRI, top): TBX6

Literature-anchored findings (GeneRIF, showing 1)

  • Described the cloning, expression and function of the mouse and frog mesogenin 1. (PMID:10837126)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriomsgn1ENSDARG00000070546
mus_musculusMsgn1ENSMUSG00000047002
rattus_norvegicusMsgn1ENSRNOG00000028518
drosophila_melanogastersageFBGN0037672

Paralogs (2): MESP1 (ENSG00000166823), MESP2 (ENSG00000188095)

Protein

Protein identifiers

Mesogenin-1A6NI15 (reviewed: A6NI15)

Alternative names: Paraxial mesoderm-specific mesogenin1, pMesogenin1

All UniProt accessions (1): A6NI15

UniProt curated annotations — full annotation on UniProt →

Function. Involved in specifying the paraxial, but not dorsal, mesoderm. May regulate the expression of T-box transcription factors required for mesoderm formation and differentiation.

Subcellular location. Nucleus.

RefSeq proteins (1): NP_001099039* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR011598bHLH_domDomain
IPR036638HLH_DNA-bd_sfHomologous_superfamily
IPR040259Mesogenin/MesPFamily

Pfam: PF00010

UniProt features (6 total): sequence variant 2, chain 1, domain 1, region of interest 1, compositionally biased region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-A6NI15-F168.430.33

Function

Pathways and Gene Ontology

Reactome pathways

4 pathways

IDPathway
R-HSA-9793380Formation of paraxial mesoderm
R-HSA-9824272Somitogenesis
R-HSA-1266738Developmental Biology
R-HSA-9758941Gastrulation

MSigDB gene sets: 43 (showing top): GOBP_EMBRYO_DEVELOPMENT_ENDING_IN_BIRTH_OR_EGG_HATCHING, GOBP_EPITHELIUM_DEVELOPMENT, GOBP_FORMATION_OF_PRIMARY_GERM_LAYER, GOBP_ANTERIOR_POSTERIOR_PATTERN_SPECIFICATION, GOBP_SOMITOGENESIS, GOBP_SEGMENTATION, GOBP_GASTRULATION, GOBP_MESODERM_MORPHOGENESIS, GOBP_SEGMENT_SPECIFICATION, GOBP_EMBRYO_DEVELOPMENT, GOBP_MESODERM_DEVELOPMENT, GOBP_SOMITE_DEVELOPMENT, GOBP_TISSUE_MORPHOGENESIS, GOMF_CHROMATIN_BINDING, MARSON_BOUND_BY_FOXP3_UNSTIMULATED

GO Biological Process (6): mesoderm formation (GO:0001707), somitogenesis (GO:0001756), regulation of transcription by RNA polymerase II (GO:0006357), segment specification (GO:0007379), cell differentiation (GO:0030154), positive regulation of transcription by RNA polymerase II (GO:0045944)

GO Molecular Function (9): RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA-binding transcription activator activity, RNA polymerase II-specific (GO:0001228), chromatin binding (GO:0003682), protein dimerization activity (GO:0046983), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677), DNA-binding transcription factor activity (GO:0003700), protein binding (GO:0005515)

GO Cellular Component (2): chromatin (GO:0000785), nucleus (GO:0005634)

Reactome top-level categories

Rollup of top-3 pathways:

CategoryPathways
Gastrulation1
Formation of paraxial mesoderm1
Developmental Biology1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA polymerase II transcription regulatory region sequence-specific DNA binding3
segmentation2
regulation of DNA-templated transcription2
transcription by RNA polymerase II2
regulation of transcription by RNA polymerase II2
binding2
formation of primary germ layer1
mesoderm morphogenesis1
anterior/posterior pattern specification1
chordate embryonic development1
anatomical structure formation involved in morphogenesis1
somite development1
pattern specification process1
cellular developmental process1
positive regulation of DNA-templated transcription1
cis-regulatory region sequence-specific DNA binding1
chromatin1
DNA-binding transcription factor activity1
DNA-binding transcription factor activity, RNA polymerase II-specific1
DNA-binding transcription activator activity1
positive regulation of transcription by RNA polymerase II1
protein binding1
double-stranded DNA binding1
sequence-specific DNA binding1
nucleic acid binding1
transcription cis-regulatory region binding1
transcription regulator activity1
chromosome1
cellular anatomical structure1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

390 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
MSGN1TBX6O95947735
MSGN1RIPPLY2Q5TAB7590
MSGN1MEOX1P50221581
MSGN1UNCXA6NJT0521
MSGN1TCF15Q12870454
MSGN1SMC6Q96SB8426
MSGN1RIPPLY1Q0D2K3422
MSGN1WNT3AP56704404
MSGN1CDX1P47902394
MSGN1CCDC14Q49A88388
MSGN1GEN1Q17RS7384
MSGN1DPCDQ9BVM2379
MSGN1MIXL1Q9H2W2379
MSGN1LFNGQ8NES3378
MSGN1PAX3P23760375

IntAct

95 interactions, top by confidence:

ABTypeScore
MSGN1ITGB3BPpsi-mi:“MI:0915”(physical association)0.560
MSGN1LDOC1psi-mi:“MI:0915”(physical association)0.560
MSGN1NECAB2psi-mi:“MI:0915”(physical association)0.560
MSGN1AP1B1psi-mi:“MI:0915”(physical association)0.560
MSGN1TCF12psi-mi:“MI:0915”(physical association)0.560
MSGN1MYF5psi-mi:“MI:0915”(physical association)0.560
MSGN1ID1psi-mi:“MI:0915”(physical association)0.560
ATP11BMSGN1psi-mi:“MI:0915”(physical association)0.560
MSGN1MAGEA6psi-mi:“MI:0915”(physical association)0.560
MSGN1TRAF1psi-mi:“MI:0915”(physical association)0.560
MSGN1HAP1psi-mi:“MI:0915”(physical association)0.560
MSGN1EXOC7psi-mi:“MI:0915”(physical association)0.560
MSGN1OST4psi-mi:“MI:0915”(physical association)0.560
MSGN1IGFBP6psi-mi:“MI:0915”(physical association)0.560
MSGN1psi-mi:“MI:0915”(physical association)0.560
MSGN1CCDC172psi-mi:“MI:0915”(physical association)0.560
MSGN1DNALI1psi-mi:“MI:0915”(physical association)0.560
MSGN1CTAG1Apsi-mi:“MI:0915”(physical association)0.560
MSGN1BRAT1psi-mi:“MI:0915”(physical association)0.560
MSGN1SERTAD3psi-mi:“MI:0915”(physical association)0.560
MSGN1MYL9psi-mi:“MI:0915”(physical association)0.560
MSGN1ZMYND12psi-mi:“MI:0915”(physical association)0.560
MSGN1ID2psi-mi:“MI:0915”(physical association)0.560
MSGN1ID3psi-mi:“MI:0915”(physical association)0.560
MSGN1ZWINTpsi-mi:“MI:0915”(physical association)0.560
MSGN1TCF4psi-mi:“MI:0915”(physical association)0.560
MSGN1TEPSINpsi-mi:“MI:0915”(physical association)0.560
MSGN1TEX12psi-mi:“MI:0915”(physical association)0.560

BioGRID (37): MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid), MSGN1 (Two-hybrid)

ESM2 similar proteins: A3KNX5, A6NI15, A7YWL5, B0BN13, D3ZLB7, E9Q9M8, O02761, O35906, O70240, O94983, O97930, P01102, P11939, P13346, P23050, P53539, P70298, P70595, P70660, P97303, Q15742, Q28C89, Q3U1J1, Q5EBA3, Q61122, Q61127, Q62722, Q62912, Q62985, Q6NW59, Q80Y50, Q86UZ6, Q8CD60, Q8HZP6, Q90ZL1, Q91ZM2, Q92886, Q96JB3, Q99NA2, Q9BE45

Diamond homologs: A6NI15, O08574, O09105, O42202, P41894, P46581, P48986, P79765, P79766, P79920, P97309, Q08DI0, Q0VG99, Q13562, Q28C89, Q4R5G6, Q60430, Q60867, Q64289, Q90ZL1, Q91616, Q96NK8, Q9BRJ9, Q9DEQ9, Q9HD90, Q9JK54, Q9W690, Q9W6C7, Q9Y4Z2, A8E5T6, O35437, O43680, O60682, O88940, P79782, Q10574, Q20561, Q32PV5, Q5E9S3, Q6GNB7

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

38 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance36
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

106 predictions. Top by Δscore:

VariantEffectΔscore
2:17816605:G:GTdonor_gain0.8700
2:17816555:G:GTdonor_gain0.7800
2:17816609:G:GTdonor_gain0.7000
2:17816797:G:GAdonor_gain0.6700
2:17816790:TAG:Tdonor_gain0.6300
2:17816791:AGA:Adonor_gain0.6300
2:17816792:GAG:Gdonor_gain0.6300
2:17817097:G:GGdonor_gain0.6100
2:17816796:T:TAdonor_gain0.5700
2:17816926:GA:Gdonor_gain0.5600
2:17817093:GCGC:Gdonor_gain0.5500
2:17817095:GC:Gdonor_gain0.5300
2:17816518:C:Tdonor_gain0.4900
2:17816615:G:Tdonor_gain0.4900
2:17816729:C:Tdonor_gain0.4700
2:17817094:CGC:Cdonor_gain0.4500
2:17817095:GCG:Gdonor_gain0.4500
2:17817092:AGCGC:Adonor_gain0.4300
2:17817093:GCGCG:Gdonor_gain0.4300
2:17816610:A:Tdonor_gain0.4200
2:17816606:A:Tdonor_gain0.4100
2:17816616:C:Tdonor_gain0.4100
2:17816723:T:Gdonor_gain0.4000
2:17816878:G:GTdonor_gain0.4000
2:17816679:A:Tdonor_gain0.3800
2:17817036:A:Gdonor_gain0.3800
2:17817037:G:GGdonor_gain0.3800
2:17816724:G:GGdonor_gain0.3700
2:17816630:C:Gdonor_gain0.3600
2:17816676:TGG:Tacceptor_gain0.3600

AlphaMissense

1251 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:17816916:A:TE133V1.000
2:17816925:G:TR136M1.000
2:17816958:T:AL147H1.000
2:17817006:C:TT163I1.000
2:17817010:G:CK164N1.000
2:17817010:G:TK164N1.000
2:17817021:T:AL168H1.000
2:17817021:T:CL168P1.000
2:17817051:T:CL178P1.000
2:17816915:G:AE133K0.999
2:17816917:G:CE133D0.999
2:17816917:G:TE133D0.999
2:17816925:G:CR136T0.999
2:17816926:G:CR136S0.999
2:17816926:G:TR136S0.999
2:17816928:T:CM137T0.999
2:17816929:G:AM137I0.999
2:17816929:G:CM137I0.999
2:17816929:G:TM137I0.999
2:17816937:T:CL140S0.999
2:17816937:T:GL140W0.999
2:17816945:G:CA143P0.999
2:17816949:T:CL144P0.999
2:17816958:T:CL147P0.999
2:17816970:T:CL151P0.999
2:17817008:A:CK164Q0.999
2:17817008:A:GK164E0.999
2:17817012:T:AI165N0.999
2:17817012:T:CI165T0.999
2:17817012:T:GI165S0.999

dbSNP variants (sampled 300 via entrez): RS1002270443 (2:17816114 C>T), RS1002596874 (2:17815907 G>T), RS1003154969 (2:17817910 C>T), RS1003510698 (2:17818193 A>G), RS1003887196 (2:17814842 G>A,C,T), RS1004503766 (2:17815204 C>T), RS1004595252 (2:17816840 G>A,T), RS1004752341 (2:17817198 T>G), RS1006908764 (2:17817696 C>T), RS1006959752 (2:17817335 T>G), RS1008674286 (2:17816244 C>A,T), RS1009170948 (2:17814980 C>T), RS1011276853 (2:17817702 C>T), RS1011641662 (2:17817945 A>G), RS1014093070 (2:17816129 G>A)

Disease associations

OMIM: gene MIM:612209 | disease phenotypes:

GenCC curated gene-disease

DiseaseClassificationInheritance
skeletal dysplasiaLimitedAutosomal recessive

Mondo (1): skeletal dysplasia (MONDO:0018230)

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

3 associations (top):

StudyTraitp-value
GCST009391_1171Metabolite levels2.000000e-06
GCST009391_1631Metabolite levels9.000000e-06
GCST011584_6Metastatic colorectal cancer survival in treatment with chemotherapy plus biologics8.000000e-06

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0006524L-arginine measurement
EFO:0010465beta-hydroxybutyric acid measurement
EFO:0000714survival time
EFO:1001480metastatic colorectal cancer

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

7 total (human), top 7 by PubMed support.

ChemicalActions (top 5)PubMed papers
CGP 52608increases reaction, affects binding1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideincreases reaction, decreases reaction, increases expression1
Chir 99021increases reaction, decreases reaction, increases expression1
Benzo(a)pyrenedecreases methylation1
Endosulfandecreases expression1
Tretinoindecreases reaction, increases expression, increases reaction1
Valproic Aciddecreases expression1

Cellosaurus cell lines

3 cell lines: 3 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A4F2SEES3-1V human MSGN1, clone1Embryonic stem cellMale
CVCL_A4F3SEES3-1V human MSGN1, clone2Embryonic stem cellMale
CVCL_A4F4SEES3-1V human MSGN1, clone3Embryonic stem cellMale

Clinical trials (associated diseases)

7 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00001754Not specifiedCOMPLETEDStudy of Skeletal Disorders and Short Stature
NCT02762318Not specifiedTERMINATEDIdentification and Characterization of Bone-related Genetic Variants in Families
NCT03548779Not specifiedCOMPLETEDNorth Carolina Genomic Evaluation by Next-generation Exome Sequencing, 2
NCT05247645Not specifiedRECRUITINGData Collection of Patients With Rare Bone Diseases
NCT05876416Not specifiedRECRUITINGDecoding the Genetic Landscape of Skeletal Diseases
NCT05991609Not specifiedACTIVE_NOT_RECRUITINGExtreme Morphology and Metabolic Health
NCT06002373Not specifiedUNKNOWNAssessment of Artificial Intelligence for Treatment Decision Recommendation of Adult Skeletal Class III Patients
  • Associated diseases: skeletal dysplasia
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): skeletal dysplasia