MUCL1

gene
On this page

Also known as SBEM

Summary

MUCL1 (mucin like 1, HGNC:30588) is a protein-coding gene on chromosome 12q13.2, encoding Mucin-like protein 1 (Q96DR8). May play a role as marker for the diagnosis of metastatic breast cancer.

Predicted to be located in Golgi lumen and plasma membrane.

Source: NCBI Gene 118430 — RefSeq curated summary.

At a glance

  • GWAS associations: 1
  • Clinical variants (ClinVar): 22 total
  • MANE Select transcript: NM_058173

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:30588
Approved symbolMUCL1
Namemucin like 1
Location12q13.2
Locus typegene with protein product
StatusApproved
AliasesSBEM
Ensembl geneENSG00000172551
Ensembl biotypeprotein_coding
OMIM610857
Entrez118430

Gene structure

Transcript identifiers

Ensembl transcripts: 5 — 2 protein_coding, 2 protein_coding_CDS_not_defined, 1 retained_intron

ENST00000308796, ENST00000546809, ENST00000547958, ENST00000547990, ENST00000652289

RefSeq mRNA: 1 — MANE Select: NM_058173 NM_058173

CCDS: CCDS8885

Canonical transcript exons

ENST00000308796 — 4 exons

ExonStartEnd
ENSE000011870365485819354858387
ENSE000024129565485451654854640
ENSE000034647985485677054856892
ENSE000034700265485511654855157

Expression profiles

Bgee: expression breadth ubiquitous, 157 present calls, max score 99.90.

FANTOM5 (CAGE): breadth broad, TPM avg 9.2893 / max 3277.2558, expressed in 220 samples.

FANTOM5 promoters (10 alternative TSS)

Promoter IDTPM avgSamples expressed
1259447.1272210
1259380.61885
1259400.57396
1259410.37487
1259370.28165
1259420.15085
1259390.09564
1259360.03673
2067330.02436
1259430.00551

Top tissues by expression

242 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
upper leg skinUBERON:000426299.90gold quality
parotid glandUBERON:000183199.77gold quality
skin of hipUBERON:000155499.60gold quality
skin of legUBERON:000151198.28gold quality
mammary ductUBERON:000176597.89gold quality
nippleUBERON:000203097.81gold quality
zone of skinUBERON:000001497.63gold quality
skin of abdomenUBERON:000141696.69gold quality
thoracic mammary glandUBERON:000520092.39gold quality
mammary glandUBERON:000191192.31gold quality
nasal cavity epitheliumUBERON:000538491.84gold quality
mammalian vulvaUBERON:000099789.01gold quality
bronchial epithelial cellCL:000232877.83silver quality
bronchusUBERON:000218577.71silver quality
olfactory segment of nasal mucosaUBERON:000538677.63gold quality
tracheaUBERON:000312672.93gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047372.17gold quality
nasal cavity mucosaUBERON:000182671.91gold quality
saliva-secreting glandUBERON:000104470.11gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099169.73gold quality
right uterine tubeUBERON:000130266.84gold quality
epithelium of nasopharynxUBERON:000195164.91silver quality
minor salivary glandUBERON:000183062.39gold quality
right frontal lobeUBERON:000281061.58gold quality
anterior cingulate cortexUBERON:000983560.26gold quality
gastrocnemiusUBERON:000138860.17gold quality
mouth mucosaUBERON:000372959.82gold quality
granulocyteCL:000009459.75gold quality
right adrenal gland cortexUBERON:003582759.58gold quality
putamenUBERON:000187458.19gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-GEOD-75688yes1764.26
E-ANND-3no0.00

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): POU2F1, POU2F2

miRNA regulators (miRDB)

9 targeting MUCL1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-513A-5P100.0069.772465
HSA-MIR-130399.6569.771662
HSA-MIR-29899.6367.561916
HSA-MIR-488-3P99.6168.791731
HSA-MIR-6828-5P99.3169.211433
HSA-MIR-6809-5P99.1368.451223
HSA-MIR-10526-3P97.8664.971342
HSA-MIR-392097.7569.021168
HSA-MIR-6808-3P94.1365.24516

Literature-anchored findings (GeneRIF, showing 11)

  • identification as possible new breast tumor biomarker (PMID:15684711)
  • octamer-binding transcription factors participate in the strong expression of SBEM gene in breast tissues (PMID:16720387)
  • SBEM may identify a unique subset of breast cancers with poor prognosis and have implications for therapeutic management. (PMID:18269587)
  • results demonstrate that SBEM mRNA may be an important marker for targeting breast cancer micrometastasis (PMID:18497056)
  • The SBEM might a suitable marker for molecular detection of ITCs in BM in breast cancer patients. Analysis of prognostic value for SBEM mRNA-based assay should take into account the heterogeneity and different molecular subtypes of breast cancer. (PMID:19221791)
  • SBEM has the potential to be a specific marker for predicting hematogenous micrometastasis and response to neoadjuvant chemotherapy in breast cancer. (PMID:20364301)
  • an important role for MUCL1 in the proliferation of breast cancer cells (PMID:26725324)
  • Small breast epithelial mucin promotes the invasion and metastasis of breast cancer cells via promoting epithelialtomesenchymal transition. (PMID:32627029)
  • The mucin protein MUCL1 regulates melanogenesis and melanoma genes in a manner dependent on threonine content. (PMID:34545566)
  • Targeting MUCL1 protein inhibits cell proliferation and EMT by deregulating betacatenin and increases irinotecan sensitivity in colorectal cancer. (PMID:35059735)
  • miRNA-186-5p inhibits migration, invasion and proliferation of breast cancer cells by targeting SBEM. (PMID:37477531)

Cross-species orthologs

1 orthologs

OrganismSymbolGene ID
drosophila_melanogasterCG13965FBGN0032834

Protein

Protein identifiers

Mucin-like protein 1Q96DR8 (reviewed: Q96DR8)

Alternative names: Protein BS106, Small breast epithelial mucin

All UniProt accessions (2): Q96DR8, F8VV13

UniProt curated annotations — full annotation on UniProt →

Function. May play a role as marker for the diagnosis of metastatic breast cancer.

Subcellular location. Secreted. Membrane.

Tissue specificity. Expressed in mammary, salivary glands and prostate. Also detected in lung. Mainly expressed in cancer cell lines of breast origin. Highly expressed in lymph node-positive compared with node-negative tumors. Detected in all lymph node containing metastatic cells.

Post-translational modifications. O-glycosylated.

Polymorphism. The number of repeats varies from 3 to 2 in one variant form which is equally present in breast cancer tumors.

RefSeq proteins (1): NP_477521* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR054050MUCL1Family

Pfam: PF21823

UniProt features (27 total): glycosylation site 17, repeat 3, region of interest 2, compositionally biased region 2, signal peptide 1, chain 1, sequence variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q96DR8-F168.010.18

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Glycosylation sites (17): 24, 30, 34, 46, 47, 51, 52, 54, 55, 59, 60, 62, 63, 66, 67, 68, 23

Function

Pathways and Gene Ontology

Reactome pathways

16 pathways

IDPathway
R-HSA-5083625Defective GALNT3 causes HFTC
R-HSA-5083632Defective C1GALT1C1 causes TNPS
R-HSA-5083636Defective GALNT12 causes CRCS1
R-HSA-5621480Dectin-2 family
R-HSA-913709O-linked glycosylation of mucins
R-HSA-977068Termination of O-glycan biosynthesis
R-HSA-1643685Disease
R-HSA-168249Innate Immune System
R-HSA-168256Immune System
R-HSA-3781865Diseases of glycosylation
R-HSA-3906995Diseases associated with O-glycosylation of proteins
R-HSA-392499Metabolism of proteins
R-HSA-5173105O-linked glycosylation
R-HSA-5621481C-type lectin receptors (CLRs)
R-HSA-5668914Diseases of metabolism
R-HSA-597592Post-translational protein modification

MSigDB gene sets: 55 (showing top): REACTOME_INNATE_IMMUNE_SYSTEM, GOZGIT_ESR1_TARGETS_DN, TURASHVILI_BREAST_DUCTAL_CARCINOMA_VS_DUCTAL_NORMAL_DN, MODULE_48, MODULE_95, RICKMAN_HEAD_AND_NECK_CANCER_C, CHARAFE_BREAST_CANCER_LUMINAL_VS_MESENCHYMAL_UP, GOCC_GOLGI_LUMEN, GSE13762_CTRL_VS_125_VITAMIND_DAY12_DC_DN, DODD_NASOPHARYNGEAL_CARCINOMA_DN, REACTOME_POST_TRANSLATIONAL_PROTEIN_MODIFICATION, MODULE_163, REACTOME_DISEASES_OF_GLYCOSYLATION, REACTOME_DISEASES_ASSOCIATED_WITH_O_GLYCOSYLATION_OF_PROTEINS, REACTOME_DEFECTIVE_GALNT3_CAUSES_HFTC

GO Biological Process (0):

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (4): extracellular region (GO:0005576), Golgi lumen (GO:0005796), plasma membrane (GO:0005886), membrane (GO:0016020)

Reactome top-level categories

Rollup of top-11 pathways:

CategoryPathways
Diseases associated with O-glycosylation of proteins3
C-type lectin receptors (CLRs)1
O-linked glycosylation1
O-linked glycosylation of mucins1
Immune System1
Diseases of metabolism1
Diseases of glycosylation1
Post-translational protein modification1
Innate Immune System1
Disease1
Metabolism of proteins1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure2
binding1
Golgi apparatus1
intracellular organelle lumen1
membrane1
cell periphery1

Protein interactions and networks

STRING

652 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
MUCL1SCGB2A2Q13296944
MUCL1SCGB1D2O95969482
MUCL1GLYATL2Q8WU03477
MUCL1C5orf46Q6UWT4445
MUCL1IGF2BP3O00425416
MUCL1NEDD9Q14511389
MUCL1ZNF232Q9UNY5348
MUCL1MAP7Q14244341
MUCL1WFDC5Q8TCV5314
MUCL1CHID1Q9BWS9313
MUCL1ADGRF4Q8IZF3301
MUCL1TESPA1A2RU30296
MUCL1SUSD3Q96L08296
MUCL1PPP4R4Q6NUP7294
MUCL1PIPP12273290

IntAct

42 interactions, top by confidence:

ABTypeScore
KHDRBS2KHDRBS3psi-mi:“MI:0914”(association)0.800
HEXIM2AHCYL1psi-mi:“MI:0914”(association)0.740
MUCL1PDIA5psi-mi:“MI:0914”(association)0.640
CEP72AHCYL1psi-mi:“MI:0914”(association)0.640
ARL13BMUCL1psi-mi:“MI:0915”(physical association)0.560
GARRE1APODpsi-mi:“MI:0914”(association)0.530
DHX57APODpsi-mi:“MI:0914”(association)0.530
MTHFD2LMTHFD2psi-mi:“MI:0914”(association)0.530
KLHL10PXDNLpsi-mi:“MI:0914”(association)0.530
ERBB2MUCL1psi-mi:“MI:0915”(physical association)0.370
RNLSSTATHpsi-mi:“MI:0914”(association)0.350
ATP2B2GPR89Apsi-mi:“MI:0914”(association)0.350
ARHGAP36psi-mi:“MI:0914”(association)0.350
HVCN1DOK2psi-mi:“MI:0914”(association)0.350
SPACA9RIOX2psi-mi:“MI:0914”(association)0.350
SEPTIN8RIOK3psi-mi:“MI:0914”(association)0.350
ATP2B2ESYT2psi-mi:“MI:0914”(association)0.350
PTCH1PLXNB2psi-mi:“MI:0914”(association)0.350
ARHGAP36PJA2psi-mi:“MI:0914”(association)0.350
SLC38A6EDIL3psi-mi:“MI:0914”(association)0.350
TLR1LRP6psi-mi:“MI:0914”(association)0.350
ZNF451HERC2psi-mi:“MI:0914”(association)0.350
DHX57H2AC4psi-mi:“MI:0914”(association)0.350
GK3APODpsi-mi:“MI:0914”(association)0.350
CYP4F8APODpsi-mi:“MI:0914”(association)0.350
OR1E1EMC4psi-mi:“MI:0914”(association)0.350
SCGB1D1IGLL5psi-mi:“MI:0914”(association)0.350

BioGRID (43): MUCL1 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS), MUCL1 (Two-hybrid), PDIA5 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS), EMC4 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS), MUCL1 (Affinity Capture-MS)

ESM2 similar proteins: A0A024QYT3, A0A088SG62, A0A182RFI2, A0A6G9KHE4, A0A6M3Z9I6, A1L019, A7RJ12, A8WUV8, A8X7F0, A8XYS0, A8Y181, B0XTU6, B3EX01, B6DDU5, D0MSS2, K7NTD0, K9J977, O61704, P0DPX0, P0DRA3, P0DRA4, P0DSK3, P0DY55, P14199, P21250, P34472, P40148, P59704, P68408, P68409, P86146, P86718, P86719, P91859, Q09416, Q1EG28, Q21175, Q21771, Q24629, Q27913

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

22 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance17
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

322 predictions. Top by Δscore:

VariantEffectΔscore
12:54854613:G:GTdonor_gain1.0000
12:54854636:TGCCC:Tdonor_gain1.0000
12:54854637:GCCC:Gdonor_gain1.0000
12:54854637:GCCCG:Gdonor_gain1.0000
12:54854638:CCC:Cdonor_gain1.0000
12:54854639:CC:Cdonor_gain1.0000
12:54854641:G:GAdonor_loss1.0000
12:54854641:G:GGdonor_gain1.0000
12:54854642:T:TCdonor_loss1.0000
12:54854645:G:GGdonor_gain1.0000
12:54855154:GCTA:Gdonor_gain1.0000
12:54855158:G:GGdonor_gain1.0000
12:54856762:A:AGacceptor_gain1.0000
12:54856763:A:Gacceptor_gain1.0000
12:54856767:CAG:Cacceptor_loss1.0000
12:54856768:A:ACacceptor_loss1.0000
12:54856768:A:AGacceptor_gain1.0000
12:54856768:AGCT:Aacceptor_gain1.0000
12:54856769:G:GAacceptor_gain1.0000
12:54856769:GCT:Gacceptor_gain1.0000
12:54856769:GCTG:Gacceptor_gain1.0000
12:54854643:AA:Adonor_loss0.9900
12:54855110:TTTCA:Tacceptor_loss0.9900
12:54855112:TCAGA:Tacceptor_loss0.9900
12:54855113:CA:Cacceptor_loss0.9900
12:54855114:A:AGacceptor_gain0.9900
12:54855115:G:GGacceptor_gain0.9900
12:54855115:G:Tacceptor_loss0.9900
12:54855115:GA:Gacceptor_gain0.9900
12:54855115:GAGA:Gacceptor_gain0.9900

AlphaMissense

559 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
12:54854613:G:AG11R0.979
12:54854613:G:CG11R0.979
12:54854614:G:AG11E0.955
12:54854605:T:AV8E0.934
12:54854596:C:AA5E0.932
12:54854617:T:AV12D0.922
12:54854599:T:AV6D0.912
12:54854608:T:AL9H0.908
12:54854608:T:GL9R0.897
12:54854589:T:CF3L0.887
12:54854591:C:AF3L0.887
12:54854591:C:GF3L0.887
12:54854602:T:GL7R0.822
12:54854632:T:AV17D0.807
12:54854608:T:CL9P0.803
12:54858206:G:CW79C0.787
12:54858206:G:TW79C0.787
12:54854587:A:TK2M0.780
12:54854602:T:AL7Q0.780
12:54858204:T:AW79R0.774
12:54858204:T:CW79R0.774
12:54854586:A:GK2E0.765
12:54854588:G:CK2N0.761
12:54854588:G:TK2N0.761
12:54854629:T:CL16P0.701
12:54854593:T:CL4S0.677
12:54854625:T:CF15L0.666
12:54854627:T:AF15L0.666
12:54854627:T:GF15L0.666
12:54854629:T:GL16R0.666

dbSNP variants (sampled 300 via entrez): RS1000076458 (12:54848481 T>C), RS1000390893 (12:54836920 T>C), RS1000409203 (12:54845646 T>A,C), RS1000461001 (12:54845946 A>G,T), RS1000665805 (12:54841141 C>T), RS1000751520 (12:54852237 C>T), RS1000755911 (12:54840389 A>C), RS1000772365 (12:54836082 G>A,C), RS1000935613 (12:54830985 G>A,T), RS1000992889 (12:54835568 T>A), RS1001001712 (12:54830258 GTCTC>G,GTC,GTCTCTC), RS1001015388 (12:54840844 G>A), RS1001035111 (12:54845163 G>C,T), RS1001132759 (12:54848196 G>C), RS1001284306 (12:54830667 T>C)

Disease associations

OMIM: gene MIM:610857 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 associations (top):

StudyTraitp-value
GCST007995_38Asthma (childhood onset)1.000000e-10

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

13 total (human), top 13 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyrenedecreases expression, increases methylation2
methyleugenoldecreases expression1
sodium arsenatedecreases expression, increases abundance1
CGP 52608affects binding, increases reaction1
Resveratrolaffects cotreatment, decreases expression1
Arsenicdecreases expression, increases abundance1
Estradiolincreases expression1
Plant Extractsaffects cotreatment, decreases expression1
Rotenoneincreases expression1
Tobacco Smoke Pollutionincreases expression1
Cyclosporinedecreases expression1
Cadmium Chlorideincreases expression1
Lactic Aciddecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.