MUC21

gene
On this page

Also known as bCX31G15.2

Summary

MUC21 (mucin 21, cell surface associated, HGNC:21661) is a protein-coding gene on chromosome 6p21.33, encoding Mucin-21 (Q5SSG8).

This gene encodes a large membrane-bound glycoprotein which is a member of the mucin family. Mucins are O-glycosylated proteins that play an essential role in forming protective mucous barriers on epithelial surfaces. These proteins also play a role in intracellular signaling. The encoded protein contains an N-terminal signal sequence, an extracellular mucin domain, a stem domain, a transmembrane domain, and a C-terminal cytoplasmic tail domain. The mucin domain contains O-glycosylation sites and is polymorphic with isoforms containing a variable number of nonidentical proline-, threonine-, and serine-rich tandem repeats of 15 amino acids each. The aberrent expression of this gene is associated with lung adenocarcinoma.

Source: NCBI Gene 394263 — RefSeq curated summary.

At a glance

  • GWAS associations: 32
  • Clinical variants (ClinVar): 58 total
  • MANE Select transcript: NM_001010909

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:21661
Approved symbolMUC21
Namemucin 21, cell surface associated
Location6p21.33
Locus typegene with protein product
StatusApproved
AliasesbCX31G15.2
Ensembl geneENSG00000204544
Ensembl biotypeprotein_coding
OMIM616991
Entrez394263

Gene structure

Transcript identifiers

Ensembl transcripts: 2 — 2 protein_coding

ENST00000376296, ENST00000486149

RefSeq mRNA: 1 — MANE Select: NM_001010909 NM_001010909

CCDS: CCDS34388

Canonical transcript exons

ENST00000376296 — 3 exons

ExonStartEnd
ENSE000018661783098371830984019
ENSE000019282943098800030989903
ENSE000035424613098623730987681

Expression profiles

Bgee: expression breadth ubiquitous, 103 present calls, max score 99.76.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 3.1148 / max 1776.1451, expressed in 40 samples.

FANTOM5 promoters (5 alternative TSS)

Promoter IDTPM avgSamples expressed
668473.072840
668490.01638
668510.01427
668500.00804
668480.00362

Top tissues by expression

129 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
lower esophagus mucosaUBERON:003583499.76gold quality
esophagus mucosaUBERON:000246998.60gold quality
vaginaUBERON:000099687.47gold quality
esophagusUBERON:000104378.80gold quality
tonsilUBERON:000237270.31gold quality
ectocervixUBERON:001224970.31gold quality
minor salivary glandUBERON:000183066.91gold quality
uterine cervixUBERON:000000266.77gold quality
saliva-secreting glandUBERON:000104464.13gold quality
right coronary arteryUBERON:000162561.80gold quality
body of stomachUBERON:000116160.34gold quality
fundus of stomachUBERON:000116059.60gold quality
olfactory segment of nasal mucosaUBERON:000538659.07gold quality
upper lobe of left lungUBERON:000895258.62gold quality
stomachUBERON:000094557.44gold quality
lower esophagusUBERON:001347356.64gold quality
lower esophagus muscularis layerUBERON:003583356.57gold quality
endocervixUBERON:000045854.11gold quality
lungUBERON:000204854.05gold quality
esophagogastric junction muscularis propriaUBERON:003584153.85gold quality
right uterine tubeUBERON:000130253.54gold quality
left uterine tubeUBERON:000130353.44gold quality
body of pancreasUBERON:000115051.75gold quality
islet of LangerhansUBERON:000000651.35gold quality
right lobe of liverUBERON:000111451.33gold quality
pancreasUBERON:000126451.21gold quality
urinary bladderUBERON:000125551.09gold quality
gastrocnemiusUBERON:000138850.92gold quality
left adrenal glandUBERON:000123450.50gold quality
left adrenal gland cortexUBERON:003582549.62gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes2.85

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

34 targeting MUC21, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6867-5P100.0082.213464
HSA-MIR-3667-3P99.9967.171636
HSA-MIR-548AN99.9770.912817
HSA-MIR-4753-3P99.9071.033786
HSA-MIR-130B-5P99.8368.501888
HSA-MIR-4756-3P99.6266.301319
HSA-MIR-488-3P99.6168.791731
HSA-MIR-129099.5969.902079
HSA-MIR-1252-3P99.5567.712862
HSA-MIR-5004-3P99.5468.271371
HSA-MIR-199A-5P99.5169.711107
HSA-MIR-199B-5P99.5169.741098
HSA-MIR-766-3P99.4765.241811
HSA-MIR-431899.3866.941505
HSA-MIR-1273H-3P99.2967.55980
HSA-MIR-520E-5P99.2768.901513
HSA-MIR-6852-5P99.1766.692073
HSA-MIR-3152-3P99.1066.35678
HSA-MIR-66199.0965.942062
HSA-MIR-371A-5P99.0866.511914
HSA-MIR-31-5P98.5868.351239
HSA-MIR-5187-5P98.5467.94952
HSA-MIR-449098.5168.47943
HSA-MIR-124898.4767.541314
HSA-MIR-541-5P98.2467.771181
HSA-MIR-316698.2466.631223
HSA-MIR-338-3P98.1467.381137
HSA-MIR-6730-5P98.0368.121299
HSA-MIR-6728-5P97.7966.33891
HSA-MIR-4724-3P97.5767.31785

Literature-anchored findings (GeneRIF, showing 8)

  • Identification and expression of human epiglycanin/MUC21: a novel transmembrane mucin (PMID:17977904)
  • Mucin 21/epiglycanin modulates cell adhesion (PMID:20388707)
  • CC genotype of rs886403 in MUC21 is associated with recurrence in colorectal cancer. (PMID:27803053)
  • MUC21 is significantly upregulated in human masticatory mucosa during wound healing (PMID:28005267)
  • MUC21 could be used as an additional, novel, negative immunohistochemical marker to differentiate mesothelioma from lung adenocarcinoma. (PMID:30329165)
  • MUC21 proteins with a specific glycosylation status may be involved in the progression of EGFR-mutated lung adenocarcinomas, particularly at the stage where tumors are transforming from pure lepidic to micropapillary through low papillary lepidic lesions. (PMID:30973916)
  • MUC21 controls melanoma progression via regulating SLITRK5 and hedgehog signaling pathway. (PMID:35579188)
  • A CRISPR activation screen identifies MUC-21 as critical for resistance to NK and T cell-mediated cytotoxicity. (PMID:37858248)

Cross-species orthologs

0 orthologs

Protein

Protein identifiers

Mucin-21Q5SSG8 (reviewed: Q5SSG8)

Alternative names: Epiglycanin

All UniProt accessions (2): A0A0C4DGM6, Q5SSG8

UniProt curated annotations — full annotation on UniProt →

Subcellular location. Cell membrane.

Tissue specificity. Expressed in lung, large intestine, thymus, and testis. Expressed in normal and malignant bronchial epithelial cells.

Post-translational modifications. O-glycosylated.

Miscellaneous. Could be considered as a marker for lung carcinomas.

Isoforms (3)

UniProt IDNamesCanonical?
Q5SSG8-11yes
Q5SSG8-22
Q5SSG8-33

RefSeq proteins (1): NP_001010909* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR008519Tandem-repeating_mucinDomain
IPR028199Mucin_domDomain

Pfam: PF05647, PF14654

UniProt features (75 total): repeat 28, sequence conflict 20, sequence variant 14, region of interest 4, topological domain 2, splice variant 2, signal peptide 1, chain 1, transmembrane region 1, site 1, glycosylation site 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q5SSG8-F174.300.26

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 502–503 (cleavage)

Glycosylation sites (1): 25

Function

Pathways and Gene Ontology

Reactome pathways

16 pathways

IDPathway
R-HSA-5083625Defective GALNT3 causes HFTC
R-HSA-5083632Defective C1GALT1C1 causes TNPS
R-HSA-5083636Defective GALNT12 causes CRCS1
R-HSA-5621480Dectin-2 family
R-HSA-913709O-linked glycosylation of mucins
R-HSA-977068Termination of O-glycan biosynthesis
R-HSA-1643685Disease
R-HSA-168249Innate Immune System
R-HSA-168256Immune System
R-HSA-3781865Diseases of glycosylation
R-HSA-3906995Diseases associated with O-glycosylation of proteins
R-HSA-392499Metabolism of proteins
R-HSA-5173105O-linked glycosylation
R-HSA-5621481C-type lectin receptors (CLRs)
R-HSA-5668914Diseases of metabolism
R-HSA-597592Post-translational protein modification

MSigDB gene sets: 38 (showing top): REACTOME_INNATE_IMMUNE_SYSTEM, GOBP_NEGATIVE_REGULATION_OF_CELL_CELL_ADHESION, GOBP_CELL_CELL_ADHESION, GOCC_GOLGI_LUMEN, GOBP_NEGATIVE_REGULATION_OF_CELL_ADHESION, REACTOME_POST_TRANSLATIONAL_PROTEIN_MODIFICATION, BOSCO_EPITHELIAL_DIFFERENTIATION_MODULE, REACTOME_DISEASES_OF_GLYCOSYLATION, REACTOME_DISEASES_ASSOCIATED_WITH_O_GLYCOSYLATION_OF_PROTEINS, REACTOME_DEFECTIVE_GALNT3_CAUSES_HFTC, REACTOME_DEFECTIVE_C1GALT1C1_CAUSES_TNPS, REACTOME_O_LINKED_GLYCOSYLATION, REACTOME_DECTIN_2_FAMILY, REACTOME_C_TYPE_LECTIN_RECEPTORS_CLRS, REACTOME_DISEASES_OF_METABOLISM

GO Biological Process (1): negative regulation of cell-cell adhesion (GO:0022408)

GO Molecular Function (0):

GO Cellular Component (3): Golgi lumen (GO:0005796), plasma membrane (GO:0005886), membrane (GO:0016020)

Reactome top-level categories

Rollup of top-11 pathways:

CategoryPathways
Diseases associated with O-glycosylation of proteins3
C-type lectin receptors (CLRs)1
O-linked glycosylation1
O-linked glycosylation of mucins1
Immune System1
Diseases of metabolism1
Diseases of glycosylation1
Post-translational protein modification1
Innate Immune System1
Disease1
Metabolism of proteins1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
negative regulation of cell adhesion1
regulation of cell-cell adhesion1
cell-cell adhesion1
Golgi apparatus1
intracellular organelle lumen1
membrane1
cell periphery1
cellular anatomical structure1

Protein interactions and networks

STRING

588 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
MUC21MUC13Q9H3R2837
MUC21MUC20Q8N307801
MUC21MUC15Q8N387792
MUC21MUC4Q99102781
MUC21MUC12Q9UKN1774
MUC21MUC3AQ02505728
MUC21MUC17Q685J3720
MUC21MUC19Q7Z5P9720
MUC21MUC7Q8TAX7720
MUC21MUC6Q6W4X9682
MUC21MUC5BQ9HC84671
MUC21MUC16Q8WXI7664
MUC21MUC5ACP98088625
MUC21MUC1P13931622
MUC21MUC2Q02817612

IntAct

0 interactions, top by confidence:

ESM2 similar proteins: A0A0H2URK1, A0A0H3HIJ5, A0A1D8PQ86, A1C3L3, A8AWU7, O86487, P05227, P08519, P08640, P0C727, P0C728, P0CG48, P0CG50, P0CG61, P0CG64, P0CG66, P0CG69, P0CG71, P0CH28, P12027, P13821, P15941, P19837, P24856, P32768, P39712, P46804, P62976, Q02192, Q27905, Q2FJ79, Q2G0L5, Q41406, Q49537, Q49538, Q4L9P0, Q54GV8, Q59SG9, Q5HIB4, Q5SSG8

Diamond homologs: E2RYF6, Q5SSG8

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

58 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance37
Likely benign18
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

227 predictions. Top by Δscore:

VariantEffectΔscore
6:30987994:TTTTA:Tacceptor_loss0.9900
6:30987998:A:AGacceptor_gain0.9900
6:30987998:A:ATacceptor_loss0.9900
6:30987999:G:GGacceptor_gain0.9900
6:30987999:GA:Gacceptor_gain0.9900
6:30983835:G:GTdonor_gain0.9800
6:30984016:GCTG:Gdonor_gain0.9800
6:30987999:GAGA:Gacceptor_gain0.9800
6:30983975:G:GTdonor_gain0.9700
6:30986224:AT:Aacceptor_gain0.9700
6:30986225:T:Gacceptor_gain0.9700
6:30986235:A:AGacceptor_gain0.9700
6:30986236:G:GGacceptor_gain0.9700
6:30987999:GAGAA:Gacceptor_gain0.9700
6:30983814:C:Gdonor_gain0.9600
6:30983976:A:Tdonor_gain0.9600
6:30986225:T:TAacceptor_gain0.9600
6:30986236:GC:Gacceptor_gain0.9600
6:30986236:GCAAC:Gacceptor_gain0.9400
6:30987677:GTGTG:Gdonor_gain0.9400
6:30987682:G:GGdonor_gain0.9400
6:30987997:TAGAG:Tacceptor_gain0.9400
6:30987998:AGAGA:Aacceptor_gain0.9400
6:30987571:G:GGdonor_gain0.9300
6:30987999:G:Cacceptor_gain0.9300
6:30984021:TGAG:Tdonor_loss0.9200
6:30984022:GAGT:Gdonor_loss0.9200
6:30987996:TTAG:Tacceptor_gain0.9200
6:30987997:TAG:Tacceptor_gain0.9200
6:30987998:AGA:Aacceptor_gain0.9200

AlphaMissense

3718 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:30987652:G:AG493R0.965
6:30987652:G:CG493R0.965
6:30987664:G:AG497R0.942
6:30987664:G:CG497R0.942
6:30987653:G:AG493E0.935
6:30987665:G:AG497E0.926
6:30987609:G:CW478C0.920
6:30987609:G:TW478C0.920
6:30987607:T:AW478R0.914
6:30987607:T:CW478R0.914
6:30987632:T:AV486D0.902
6:30987612:A:CE479D0.901
6:30987612:A:TE479D0.901
6:30987647:C:AA491D0.892
6:30987620:T:CL482P0.890
6:30987614:T:CI480T0.888
6:30987638:T:AV488D0.880
6:30987629:T:CL485P0.871
6:30987623:T:AI483N0.866
6:30987620:T:AL482H0.865
6:30987629:T:GL485R0.856
6:30987614:T:GI480S0.853
6:30987656:T:GL494R0.850
6:30987623:T:GI483S0.847
6:30987646:G:CA491P0.846
6:30987656:T:CL494P0.845
6:30987641:T:AV489E0.842
6:30987662:C:AA496D0.842
6:30987620:T:GL482R0.832
6:30987623:T:CI483T0.822

dbSNP variants (sampled 300 via entrez): RS1000175654 (6:30989774 T>C), RS1000583547 (6:30988759 C>T), RS1000765494 (6:30981975 C>T), RS1001478867 (6:30985188 C>T), RS1001610464 (6:30984115 A>G), RS1001805226 (6:30984907 G>A), RS1003342484 (6:30982256 A>G), RS1004166255 (6:30984937 T>C), RS1004505513 (6:30983364 A>C,G), RS1005787204 (6:30983674 C>T), RS1006127187 (6:30982201 G>C), RS1006178066 (6:30981835 G>C), RS1006381668 (6:30989146 T>A), RS1006741119 (6:30989492 T>C), RS1006948371 (6:30984169 A>G)

Disease associations

OMIM: gene MIM:616991 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

32 associations (top):

StudyTraitp-value
GCST001137_9White blood cell count4.000000e-13
GCST001181_1Drug-induced Stevens-Johnson syndrome or toxic epidermal necrolysis (SJS/TEN)3.000000e-07
GCST001200_5Graves’ disease4.000000e-51
GCST001779_4Hematology traits5.000000e-07
GCST001784_12Pulmonary function (smoking interaction)1.000000e-07
GCST001784_34Pulmonary function (smoking interaction)1.000000e-07
GCST001949_5Preeclampsia3.000000e-06
GCST002884_4Cutaneous lupus erythematosus2.000000e-11
GCST003123_16Severe influenza A (H1N1) infection2.000000e-14
GCST003156_41Systemic lupus erythematosus6.000000e-92
GCST004521_114Autism spectrum disorder or schizophrenia3.000000e-17
GCST004521_117Autism spectrum disorder or schizophrenia3.000000e-15
GCST004521_132Autism spectrum disorder or schizophrenia2.000000e-09
GCST004521_171Autism spectrum disorder or schizophrenia4.000000e-14
GCST004521_19Autism spectrum disorder or schizophrenia2.000000e-12
GCST004521_2Autism spectrum disorder or schizophrenia2.000000e-16
GCST004521_209Autism spectrum disorder or schizophrenia5.000000e-16
GCST004521_210Autism spectrum disorder or schizophrenia5.000000e-15
GCST004521_211Autism spectrum disorder or schizophrenia5.000000e-15
GCST004521_265Autism spectrum disorder or schizophrenia7.000000e-14
GCST004521_27Autism spectrum disorder or schizophrenia1.000000e-09
GCST004521_282Autism spectrum disorder or schizophrenia5.000000e-09
GCST004521_295Autism spectrum disorder or schizophrenia6.000000e-18
GCST004521_3Autism spectrum disorder or schizophrenia2.000000e-15
GCST004521_48Autism spectrum disorder or schizophrenia1.000000e-09
GCST004521_70Autism spectrum disorder or schizophrenia8.000000e-20
GCST007254_3Broad depression or major depressive disorder (self-reported)1.000000e-08
GCST010286_2Oropharynx cancer1.000000e-10
GCST010725_70Malaria5.000000e-06
GCST010725_9Malaria6.000000e-06

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0003892pulmonary function measurement
EFO:0004713FEV/FVC ratio
EFO:1001488influenza A (H1N1)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

10 total (human), top 10 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, decreases expression, increases methylation3
Resveratrolaffects cotreatment, decreases expression2
bisphenol Aaffects cotreatment, increases methylation, decreases methylation1
Fulvestrantaffects cotreatment, increases methylation1
Acetaminophenincreases expression1
Vehicle Emissionsdecreases methylation1
Copperaffects cotreatment, decreases expression1
Phthalic Acidsincreases methylation1
Plant Extractsaffects cotreatment, decreases expression1
Tobacco Smoke Pollutionaffects expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.