MUC20

gene
On this page

Also known as FLJ14408KIAA1359

Summary

MUC20 (mucin 20, cell surface associated, HGNC:23282) is a protein-coding gene on chromosome 3q29, encoding Mucin-20 (Q8N307). May regulate MET signaling cascade. It is a selective cancer dependency (DepMap: 16.6% of cell lines).

This gene encodes a member of the mucin protein family. Mucins are high molecular weight glycoproteins secreted by many epithelial tissues to form an insoluble mucous barrier. The C-terminus of this family member associates with the multifunctional docking site of the MET proto-oncogene and suppresses activation of some downstream MET signaling cascades. The protein features a mucin tandem repeat domain that varies between two and six copies in most individuals. Multiple variants encoding different isoforms have been found for this gene. A related pseudogene, which is also located on chromosome 3, has been identified.

Source: NCBI Gene 200958 — RefSeq curated summary.

At a glance

  • GWAS associations: 7
  • Clinical variants (ClinVar): 26 total
  • Cancer dependency (DepMap): dependent in 16.6% of screened cell lines
  • MANE Select transcript: NM_001282506

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:23282
Approved symbolMUC20
Namemucin 20, cell surface associated
Location3q29
Locus typegene with protein product
StatusApproved
AliasesFLJ14408, KIAA1359
Ensembl geneENSG00000176945
Ensembl biotypeprotein_coding
OMIM610360
Entrez200958

Gene structure

Transcript identifiers

Ensembl transcripts: 8 — 5 protein_coding, 2 protein_coding_CDS_not_defined, 1 retained_intron

ENST00000423938, ENST00000436408, ENST00000445522, ENST00000447234, ENST00000480350, ENST00000485430, ENST00000498018, ENST00000867368

RefSeq mRNA: 1 — MANE Select: NM_001282506 NM_001282506

CCDS: CCDS63877

Canonical transcript exons

ENST00000447234 — 4 exons

ExonStartEnd
ENSE00001297048195720978195721083
ENSE00001766186195724680195726572
ENSE00003518158195733150195733551
ENSE00003617822195729648195729739

Expression profiles

Bgee: expression breadth ubiquitous, 185 present calls, max score 98.01.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.5321 / max 46.7222, expressed in 151 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
406990.3659103
406980.166277

Top tissues by expression

249 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right uterine tubeUBERON:000130298.01gold quality
olfactory segment of nasal mucosaUBERON:000538697.54gold quality
metanephros cortexUBERON:001053397.46gold quality
lower esophagus mucosaUBERON:003583496.15gold quality
gall bladderUBERON:000211094.66gold quality
sural nerveUBERON:001548894.38gold quality
palpebral conjunctivaUBERON:000181293.80gold quality
minor salivary glandUBERON:000183093.36gold quality
rectumUBERON:000105293.35gold quality
mucosa of transverse colonUBERON:000499192.43gold quality
esophagus mucosaUBERON:000246992.28gold quality
adenohypophysisUBERON:000219691.24gold quality
transverse colonUBERON:000115790.80gold quality
mucosa of stomachUBERON:000119990.76gold quality
nasal cavity epitheliumUBERON:000538490.24gold quality
right lobe of liverUBERON:000111489.22gold quality
adult mammalian kidneyUBERON:000008288.36gold quality
islet of LangerhansUBERON:000000688.14gold quality
cortex of kidneyUBERON:000122587.79gold quality
mouth mucosaUBERON:000372987.53gold quality
saliva-secreting glandUBERON:000104487.32gold quality
left lobe of thyroid glandUBERON:000112087.31gold quality
hindlimb stylopod muscleUBERON:000425287.20gold quality
small intestine Peyer’s patchUBERON:000345487.11gold quality
pituitary glandUBERON:000000786.77gold quality
vaginaUBERON:000099686.75gold quality
body of pancreasUBERON:000115086.64gold quality
esophagusUBERON:000104386.08gold quality
right lobe of thyroid glandUBERON:000111986.03gold quality
left ovaryUBERON:000211985.51gold quality

Single-cell (SCXA)

Detected in 4 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-ANND-3yes14.85
E-CURD-114yes9.66
E-GEOD-100618no77.95
E-GEOD-99795no14.89

Regulation

Is transcription factor: no

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 16.6% of screened cell lines.

Literature-anchored findings (GeneRIF, showing 16)

  • Results suggest that MUC20 is a novel regulator of the Met signaling cascade which has a role in suppression of the Grb2-Ras pathway. (PMID:15314156)
  • Reflux laryngitis is associated with down-regulation of mucin gene expression. (PMID:18834073)
  • MUC20 is expressed in recurrent colorectal neoplasms and may have a role in poor outcome (PMID:23787019)
  • MUC20 is a novel transmembrane mucin expressed by the human corneal and conjunctival epithelia, and suggest that differential expression of MUC20 during differentiation has a role in maintaining ocular surface homeostasis. (PMID:25168902)
  • A combination of MUC13/MUC20 expression was a potential prognostic marker for patients with ESCC, who received neoadjuvant chemotherapy followed by surgery. (PMID:26323930)
  • Results suggest that MUC20 enhances aggressive behaviors of EOC cells by activating integrin beta1 signaling and provide novel insights into the role of MUC20 in ovarian cancer metastasis. (PMID:26616226)
  • High MUC20 expression is associated wit response to chemotherapy in esophageal squamous cell carcinoma. (PMID:26673820)
  • increased gene expression of MUC16 and MUC20 was found in patients with remission ulcerative colitis (PMID:26770020)
  • MUC20 knockdown suppresses the malignant phenotypes of PDAC cells at least partially through the inhibition of the HGF/MET pathway. (PMID:29993037)
  • the combination of MUC4, MUC16 and MUC20 signature is associated with statistically significant reduced overall survival and increased hazard ratio in pancreatic, colon and stomach cancer. (PMID:30236127)
  • MUC20 expression marks the receptive phase of the human endometrium. (PMID:31519421)
  • Lnc-MUC20-9 binds to ROCK1 and functions as a tumor suppressor in bladder cancer. (PMID:31898364)
  • Identification of oral squamous cell carcinoma markers MUC2 and SPRR1B downstream of TANGO. (PMID:33620575)
  • Mucin 20 modulates proteasome capacity through c-Met signalling to increase carfilzomib sensitivity in mantle cell lymphoma. (PMID:34651428)
  • A flexible summary statistics-based colocalization method with application to the mucin cystic fibrosis lung disease modifier locus. (PMID:35065708)
  • Intracellular MUC20 variant 2 maintains mitochondrial calcium homeostasis and enhances drug resistance in gastric cancer. (PMID:35166958)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusMuc20ENSMUSG00000035638
rattus_norvegicusMuc20ENSRNOG00000001776

Protein

Protein identifiers

Mucin-20Q8N307 (reviewed: Q8N307)

All UniProt accessions (3): Q8N307, C9JJE7, H7C102

UniProt curated annotations — full annotation on UniProt →

Function. May regulate MET signaling cascade. Seems to decrease hepatocyte growth factor (HGF)-induced transient MAPK activation. Blocks GRB2 recruitment to MET thus suppressing the GRB2-RAS pathway. Inhibits HGF-induced proliferation of MMP1 and MMP9 expression.

Subunit / interactions. Interacts with MET; oligomerization increases affinity for MET.

Subcellular location. Secreted. Apical cell membrane. Basolateral cell membrane. Cell projection. Microvillus membrane.

Tissue specificity. Highly expressed in kidney, moderately in placenta, lung, prostate, liver, and digestive system. In the kidney, localized in the proximal tubules but not in the glomerulus or distal tubules. Detected in most of the male urogenital tract epithelia, with the exception of epididymis.

Polymorphism. The region encoding the tandem repeats is highly polymorphic. Divergence of the number of tandem repeats was seen in different cDNA libraries.

Isoforms (2)

UniProt IDNamesCanonical?
Q8N307-11, hMUC20-Lyes
Q8N307-32, hMUC20-S

RefSeq proteins (1): NP_001269435* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR034551MUC20Family
IPR054051MUC-20_rptRepeat

Pfam: PF21824

UniProt features (66 total): compositionally biased region 16, sequence conflict 13, repeat 12, sequence variant 12, region of interest 7, glycosylation site 3, signal peptide 1, chain 1, splice variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8N307-F154.240.20

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Glycosylation sites (3): 423, 616, 622

Function

Pathways and Gene Ontology

Reactome pathways

20 pathways

IDPathway
R-HSA-5083625Defective GALNT3 causes HFTC
R-HSA-5083632Defective C1GALT1C1 causes TNPS
R-HSA-5083636Defective GALNT12 causes CRCS1
R-HSA-5621480Dectin-2 family
R-HSA-8851805MET activates RAS signaling
R-HSA-913709O-linked glycosylation of mucins
R-HSA-977068Termination of O-glycan biosynthesis
R-HSA-162582Signal Transduction
R-HSA-1643685Disease
R-HSA-168249Innate Immune System
R-HSA-168256Immune System
R-HSA-3781865Diseases of glycosylation
R-HSA-3906995Diseases associated with O-glycosylation of proteins
R-HSA-392499Metabolism of proteins
R-HSA-5173105O-linked glycosylation
R-HSA-5621481C-type lectin receptors (CLRs)
R-HSA-5668914Diseases of metabolism
R-HSA-597592Post-translational protein modification
R-HSA-6806834Signaling by MET
R-HSA-9006934Signaling by Receptor Tyrosine Kinases

MSigDB gene sets: 91 (showing top): REACTOME_INNATE_IMMUNE_SYSTEM, GOZGIT_ESR1_TARGETS_DN, SENGUPTA_NASOPHARYNGEAL_CARCINOMA_DN, GOCC_APICAL_PLASMA_MEMBRANE, OUYANG_PROSTATE_CANCER_PROGRESSION_UP, GOCC_CELL_PROJECTION_MEMBRANE, GOCC_APICAL_PART_OF_CELL, GOCC_MICROVILLUS, GOCC_MICROVILLUS_MEMBRANE, ZHENG_BOUND_BY_FOXP3, GOCC_PLASMA_MEMBRANE_REGION, GOCC_ACTIN_BASED_CELL_PROJECTION, GOBP_CELL_SURFACE_RECEPTOR_PROTEIN_TYROSINE_KINASE_SIGNALING_PATHWAY, GOCC_GOLGI_LUMEN, GOCC_BASAL_PART_OF_CELL

GO Biological Process (1): hepatocyte growth factor receptor signaling pathway (GO:0048012)

GO Molecular Function (1): identical protein binding (GO:0042802)

GO Cellular Component (9): extracellular region (GO:0005576), Golgi lumen (GO:0005796), plasma membrane (GO:0005886), basal plasma membrane (GO:0009925), basolateral plasma membrane (GO:0016323), apical plasma membrane (GO:0016324), microvillus membrane (GO:0031528), membrane (GO:0016020), cell projection (GO:0042995)

Reactome top-level categories

Rollup of top-14 pathways:

CategoryPathways
Diseases associated with O-glycosylation of proteins3
C-type lectin receptors (CLRs)1
Signaling by MET1
O-linked glycosylation1
O-linked glycosylation of mucins1
Immune System1
Diseases of metabolism1
Diseases of glycosylation1
Post-translational protein modification1
Innate Immune System1
Disease1
Metabolism of proteins1
Signaling by Receptor Tyrosine Kinases1
Signal Transduction1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure3
plasma membrane region3
cell surface receptor protein tyrosine kinase signaling pathway1
protein binding1
Golgi apparatus1
intracellular organelle lumen1
membrane1
cell periphery1
basal part of cell1
basal plasma membrane1
apical part of cell1
microvillus1
cell projection membrane1

Protein interactions and networks

STRING

1036 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
MUC20MUC4Q99102948
MUC20MUC13Q9H3R2901
MUC20MUC1P13931896
MUC20MUC12Q9UKN1881
MUC20MUC17Q685J3865
MUC20MUC2Q02817860
MUC20MUC3AQ02505847
MUC20MUC15Q8N387834
MUC20MUC5BQ9HC84833
MUC20MUC5ACP98088814
MUC20MUC21Q5SSG8801
MUC20MUC7Q8TAX7771
MUC20MUC19Q7Z5P9761
MUC20MUC6Q6W4X9730
MUC20MUC16Q8WXI7719

IntAct

5 interactions, top by confidence:

ABTypeScore
ALDOBMUC20psi-mi:“MI:0915”(physical association)0.370
MUC20ELOVL3psi-mi:“MI:0915”(physical association)0.370
MUC20RAD51Bpsi-mi:“MI:0914”(association)0.350

BioGRID (40): TUBB1 (Affinity Capture-MS), ARF6 (Affinity Capture-MS), TRMT44 (Affinity Capture-MS), RBM15 (Affinity Capture-MS), GTPBP2 (Affinity Capture-MS), AP3S1 (Affinity Capture-MS), FAM118B (Affinity Capture-MS), RAD51B (Affinity Capture-MS), RANGRF (Affinity Capture-MS), TUBA4A (Affinity Capture-MS), GK (Affinity Capture-MS), TUBB3 (Affinity Capture-MS), SLC25A14 (Affinity Capture-MS), CBWD1 (Affinity Capture-MS), PPM1F (Affinity Capture-MS)

ESM2 similar proteins: A0A0J9YWL9, A0A0J9YY54, A0A0U1RQI7, A0A494C071, A6NNC1, A6QL64, B3KS81, D3YZV8, E2RYF7, E9Q6E9, F1LWT0, F8W0I5, O60732, P0C8Z4, P0DKJ7, P0DKJ8, P0DKL2, P0DPF3, P18751, P20930, P43537, P53353, P59797, P62521, Q01456, Q08AG5, Q12816, Q3BBV2, Q5H9R4, Q5HY64, Q5JPF3, Q6P902, Q6ZQX7, Q86T75, Q86VE3, Q86VQ3, Q8BGJ3, Q8N307, Q8N7U7, Q8N7X1

Diamond homologs: Q8BUE7, Q8N307

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

26 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance3
Likely benign8
Benign2

Top pathogenic / likely-pathogenic (0)

SpliceAI

420 predictions. Top by Δscore:

VariantEffectΔscore
3:195729640:A:AGacceptor_gain1.0000
3:195729641:T:Gacceptor_gain1.0000
3:195729642:TTGCA:Tacceptor_loss1.0000
3:195729643:TGCAG:Tacceptor_loss1.0000
3:195729644:GCAG:Gacceptor_loss1.0000
3:195729645:CAGGT:Cacceptor_loss1.0000
3:195729646:A:AGacceptor_gain1.0000
3:195729646:AGGT:Aacceptor_gain1.0000
3:195729647:G:GCacceptor_gain1.0000
3:195729647:GGT:Gacceptor_gain1.0000
3:195729647:GGTG:Gacceptor_gain1.0000
3:195729647:GGTGA:Gacceptor_gain1.0000
3:195729736:GCAG:Gdonor_gain1.0000
3:195729737:CAGG:Cdonor_loss1.0000
3:195729739:GGTG:Gdonor_loss1.0000
3:195729740:GTGA:Gdonor_loss1.0000
3:195729741:T:Gdonor_loss1.0000
3:195726570:CAGGT:Cdonor_loss0.9900
3:195726571:AGGTA:Adonor_loss0.9900
3:195726573:GT:Gdonor_loss0.9900
3:195726574:T:Gdonor_loss0.9900
3:195729643:T:TAacceptor_gain0.9900
3:195729646:AG:Aacceptor_gain0.9900
3:195729647:GG:Gacceptor_gain0.9900
3:195729740:G:GGdonor_gain0.9900
3:195729640:ATTT:Aacceptor_gain0.9800
3:195726573:G:GGdonor_gain0.9500
3:195729735:AGCAG:Adonor_gain0.9500
3:195729736:GCAGG:Gdonor_gain0.9500
3:195720943:G:GTdonor_gain0.9300

AlphaMissense

4506 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
3:195725505:T:AI301N0.985
3:195725505:T:GI301S0.985
3:195725448:T:AI282N0.984
3:195725448:T:GI282S0.984
3:195725448:T:CI282T0.983
3:195725505:T:CI301T0.983
3:195725334:T:GI244S0.982
3:195725391:T:GI263S0.982
3:195725391:T:AI263N0.981
3:195725391:T:CI263T0.981
3:195725334:T:AI244N0.980
3:195725334:T:CI244T0.980
3:195725277:T:GI225S0.979
3:195725277:T:CI225T0.977
3:195725562:T:GI320S0.977
3:195725277:T:AI225N0.976
3:195725414:A:CS271R0.975
3:195725416:C:AS271R0.975
3:195725416:C:GS271R0.975
3:195725471:A:CS290R0.975
3:195725473:C:AS290R0.975
3:195725473:C:GS290R0.975
3:195725562:T:AI320N0.975
3:195725562:T:CI320T0.975
3:195725357:A:CS252R0.973
3:195725359:C:AS252R0.973
3:195725359:C:GS252R0.973
3:195725474:A:CS291R0.972
3:195725476:C:AS291R0.972
3:195725476:C:GS291R0.972

dbSNP variants (sampled 300 via entrez): RS1000061921 (3:195733832 C>G,T), RS1000122962 (3:195731950 C>T), RS1000259119 (3:195731763 A>C), RS1000457747 (3:195730843 T>C), RS1000552837 (3:195728381 G>A), RS1001809361 (3:195728175 G>A,T), RS1001945475 (3:195727999 A>G), RS1002449683 (3:195731437 C>T), RS1002468868 (3:195729379 C>T), RS1002604629 (3:195729189 C>T), RS1002674641 (3:195733060 C>A,G,T), RS1003161657 (3:195732949 A>G), RS1003220150 (3:195732850 G>T), RS1003382590 (3:195724618 G>A,T), RS1004005375 (3:195719754 A>G)

Disease associations

OMIM: gene MIM:610360 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

7 associations (top):

StudyTraitp-value
GCST003143_18Lung disease severity in cystic fibrosis8.000000e-11
GCST003143_19Lung disease severity in cystic fibrosis3.000000e-07
GCST003143_20Lung disease severity in cystic fibrosis6.000000e-06
GCST003143_21Lung disease severity in cystic fibrosis3.000000e-11
GCST003143_22Lung disease severity in cystic fibrosis2.000000e-07
GCST003143_23Lung disease severity in cystic fibrosis7.000000e-08
GCST007876_24Estimated glomerular filtration rate2.000000e-08

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0007744lung disease severity measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

23 total (human), top 23 by PubMed support.

ChemicalActions (top 5)PubMed papers
Tobacco Smoke Pollutiondecreases expression, increases expression3
bisphenol Adecreases expression, increases expression2
sodium arseniteincreases abundance, increases expression2
aristolochic acid Iincreases expression1
methylmercuric chloridedecreases expression1
cordycepinincreases expression1
entinostatdecreases expression1
licochalcone Bincreases expression1
Acetaminophendecreases expression1
Arsenicincreases abundance, increases expression1
Atrazinedecreases expression1
Azathioprineincreases expression1
Benzo(a)pyrenedecreases methylation1
Estradiolaffects cotreatment, decreases expression1
Folic Aciddecreases expression1
Nickeldecreases expression1
Smokedecreases expression1
Tretinoinincreases expression1
Urethaneincreases expression1
Valproic Acidincreases methylation1
Cyclosporineincreases expression1
Antirheumatic Agentsdecreases expression1
Okadaic Aciddecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): cystic fibrosis