MS4A8

gene
On this page

Also known as MS4A4CD20L5

Summary

MS4A8 (membrane spanning 4-domains A8, HGNC:13380) is a protein-coding gene on chromosome 11q12.2, encoding Membrane-spanning 4-domains subfamily A member 8 (Q9BY19). May be involved in signal transduction as a component of a multimeric receptor complex.

This gene encodes a member of the membrane-spanning 4A gene family. Members of this protein family are characterized by common structural features and similar intron/exon splice boundaries and display unique expression patterns among hematopoietic cells and nonlymphoid tissues. The gene encoding this protein is localized to 11q12.3, among a cluster of family members.

Source: NCBI Gene 83661 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 34 total
  • MANE Select transcript: NM_031457

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:13380
Approved symbolMS4A8
Namemembrane spanning 4-domains A8
Location11q12.2
Locus typegene with protein product
StatusApproved
AliasesMS4A4, CD20L5
Ensembl geneENSG00000166959
Ensembl biotypeprotein_coding
OMIM606549
Entrez83661

Gene structure

Transcript identifiers

Ensembl transcripts: 11 — 5 protein_coding, 5 retained_intron, 1 nonsense_mediated_decay

ENST00000300226, ENST00000450141, ENST00000525458, ENST00000529752, ENST00000532816, ENST00000532953, ENST00000533354, ENST00000533691, ENST00000534316, ENST00000855861, ENST00000855862

RefSeq mRNA: 1 — MANE Select: NM_031457 NM_031457

CCDS: CCDS7990

Canonical transcript exons

ENST00000300226 — 7 exons

ExonStartEnd
ENSE000011081906069961260699775
ENSE000011081956070865060708781
ENSE000011081986071531060715807
ENSE000034849626070337860703500
ENSE000035518996070086060701079
ENSE000036114856070698860707047
ENSE000036549646071502160715134

Expression profiles

Bgee: expression breadth ubiquitous, 143 present calls, max score 99.55.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 1.4483 / max 500.7696, expressed in 92 samples.

FANTOM5 promoters (6 alternative TSS)

Promoter IDTPM avgSamples expressed
1144870.825454
1144880.185029
1144860.157638
1144900.133622
1144890.080618
1144850.066131

Top tissues by expression

249 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right uterine tubeUBERON:000130299.55gold quality
bronchial epithelial cellCL:000232899.40gold quality
ileal mucosaUBERON:000033199.02gold quality
bronchusUBERON:000218598.80gold quality
jejunal mucosaUBERON:000039997.76gold quality
duodenumUBERON:000211494.62gold quality
olfactory segment of nasal mucosaUBERON:000538693.71gold quality
mucosa of paranasal sinusUBERON:000503091.98gold quality
small intestine Peyer’s patchUBERON:000345490.69gold quality
small intestineUBERON:000210889.49gold quality
epithelial cell of pancreasCL:000008388.40silver quality
fallopian tubeUBERON:000388987.78gold quality
oviduct epitheliumUBERON:000480487.61gold quality
pancreatic ductal cellCL:000207987.55silver quality
rectumUBERON:000105286.98gold quality
right lungUBERON:000216786.15gold quality
tracheaUBERON:000312685.95gold quality
epithelium of nasopharynxUBERON:000195183.52gold quality
nasopharynxUBERON:000172883.51gold quality
gall bladderUBERON:000211083.29gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047381.45gold quality
mucosa of transverse colonUBERON:000499178.69gold quality
transverse colonUBERON:000115777.55gold quality
jejunumUBERON:000211576.57gold quality
pituitary glandUBERON:000000776.51gold quality
islet of LangerhansUBERON:000000676.44gold quality
colonic mucosaUBERON:000031776.35gold quality
mucosa of sigmoid colonUBERON:000499376.19gold quality
endometriumUBERON:000129575.43gold quality
left uterine tubeUBERON:000130375.17gold quality

Single-cell (SCXA)

Detected in 9 experiment(s), a significant marker in 8.

ExperimentMarker?Max mean expression
E-GEOD-125970yes68.79
E-CURD-114yes64.95
E-HCAD-1yes28.90
E-GEOD-130148yes14.84
E-MTAB-5061yes12.28
E-GEOD-81608yes9.94
E-ENAD-27yes4.78
E-GEOD-83139no3.30
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

21 targeting MS4A8, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6867-5P100.0082.213464
HSA-MIR-4778-3P99.9370.401818
HSA-MIR-3934-3P99.7665.511351
HSA-MIR-449999.6267.291470
HSA-MIR-6740-3P99.4868.491392
HSA-MIR-6731-5P99.2867.422375
HSA-MIR-808599.2867.562362
HSA-MIR-133A-3P99.2771.531270
HSA-MIR-133B99.2771.531270
HSA-MIR-10522-5P99.2668.502087
HSA-MIR-670-3P99.0368.882404
HSA-MIR-432698.9767.63962
HSA-MIR-138-2-3P98.9168.331643
HSA-MIR-4709-3P98.8868.041594
HSA-MIR-38498.7167.341229
HSA-MIR-446398.5666.051071
HSA-MIR-1233-5P98.1966.711201
HSA-MIR-6778-5P98.1966.591239
HSA-MIR-6742-3P97.9564.501490
HSA-MIR-194-3P97.3665.961027
HSA-MIR-5586-3P95.5167.00805

Literature-anchored findings (GeneRIF, showing 1)

  • MS4A8B is a novel differentiation marker of the intestinal epithelium that supports the maintenance of a physiological barrier function in the gut by modulating the transcriptome and by conferring an increased resistance to reactive oxygen species. (PMID:23348583)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusMs4a8aENSMUSG00000024730
rattus_norvegicusMs4a8ENSRNOG00000020939

Paralogs (16): MS4A12 (ENSG00000071203), MS4A6A (ENSG00000110077), MS4A4A (ENSG00000110079), MS4A3 (ENSG00000149516), MS4A2 (ENSG00000149534), MS4A1 (ENSG00000156738), MS4A6E (ENSG00000166926), MS4A7 (ENSG00000166927), MS4A14 (ENSG00000166928), MS4A5 (ENSG00000166930), MS4A15 (ENSG00000166961), MS4A10 (ENSG00000172689), TMEM212 (ENSG00000186329), MS4A13 (ENSG00000204979), MS4A18 (ENSG00000214782), MS4A4E (ENSG00000214787)

Protein

Protein identifiers

Membrane-spanning 4-domains subfamily A member 8Q9BY19 (reviewed: Q9BY19)

Alternative names: Four-span transmembrane protein 4, Membrane-spanning 4-domains subfamily A member 8B

All UniProt accessions (4): Q9BY19, E9PJN8, E9PQE1, H0YD94

UniProt curated annotations — full annotation on UniProt →

Function. May be involved in signal transduction as a component of a multimeric receptor complex.

Subcellular location. Membrane.

Tissue specificity. Expressed by hematopoietic tissues and cells lines.

Similarity. Belongs to the MS4A family.

RefSeq proteins (1): NP_113645* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR007237CD20-like_TMDomain
IPR030417MS4AFamily

Pfam: PF04103

UniProt features (12 total): topological domain 5, transmembrane region 4, chain 1, sequence variant 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9BY19-F171.550.37

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 65 (showing top): GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_DN, SENGUPTA_NASOPHARYNGEAL_CARCINOMA_DN, SABATES_COLORECTAL_ADENOMA_DN, DODD_NASOPHARYNGEAL_CARCINOMA_DN, VECCHI_GASTRIC_CANCER_ADVANCED_VS_EARLY_DN, FEVR_CTNNB1_TARGETS_UP, ZWANG_TRANSIENTLY_UP_BY_2ND_EGF_PULSE_ONLY, ZNF843_TARGET_GENES, GSE10094_LCMV_VS_LISTERIA_IND_EFF_CD4_TCELL_DN, MIR4778_3P, MIR4499, MIR384, GSE13306_RA_VS_UNTREATED_TCONV_DN, GSE13485_CTRL_VS_DAY3_YF17D_VACCINE_PBMC_UP, GSE17721_CTRL_VS_PAM3CSK4_24H_BMDC_UP

GO Biological Process (1): cell surface receptor signaling pathway (GO:0007166)

GO Molecular Function (0):

GO Cellular Component (2): plasma membrane (GO:0005886), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
signal transduction1
membrane1
cell periphery1
cellular anatomical structure1

Protein interactions and networks

STRING

406 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
MS4A8MS4A6EQ96DS6921
MS4A8MS4A4EQ96PG1713
MS4A8CD2APQ9Y5K6662
MS4A8ABCA7Q8IZY2621
MS4A8SORL1Q92673599
MS4A8EPHA1P21709599
MS4A8PICALMQ13492595
MS4A8CLUP10909569
MS4A8MS4A6AQ9H2W1544
MS4A8CD33P20138521
MS4A8ZCWPW1Q9H0M4513
MS4A8BIN1O00499512
MS4A8NME8Q8N427480
MS4A8CASS4Q9NQ75480
MS4A8SLC24A4Q8NFF2470

IntAct

0 interactions, top by confidence:

ESM2 similar proteins: A0A0A7EPL0, A2RV66, A4QP16, A6QPF4, C4QM85, F4JYG0, O35892, P11836, P13386, P19437, P20490, P41739, P53762, P56645, P70326, P79778, Q01362, Q148B6, Q29131, Q3C1V0, Q3C2E2, Q3SA47, Q3UNB8, Q3V3Q4, Q3YBM2, Q5I2P1, Q5JT82, Q5R8D6, Q5R8E0, Q60HE7, Q68FU0, Q6A058, Q8BVM2, Q8N1N2, Q8NDZ0, Q8QGQ8, Q8WUU8, Q920C4, Q92540, Q96HJ5

Diamond homologs: A6QPF4, P11836, P13386, P19437, P20490, Q01362, Q3C1V0, Q3C2E2, Q3UPL6, Q8N5U1, Q920C4, Q96JA4, Q96JQ5, Q99N05, Q99N07, Q99N09, Q99N10, Q9BY19, Q9H2W1, Q9H3V2, Q9NXJ0, Q96HJ5, Q99N08, Q5REZ6, Q96PG2, Q9GZW8, Q99N03, Q96PG1, Q96DS6, Q2YDM3

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

34 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance31
Likely benign3
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

0 predictions. Top by Δscore:

AlphaMissense

1605 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
11:60708680:A:CS145R0.993
11:60708682:T:AS145R0.993
11:60708682:T:GS145R0.993
11:60706988:T:CF115L0.988
11:60706990:T:AF115L0.988
11:60706990:T:GF115L0.988
11:60703397:G:AG80D0.984
11:60703477:T:CF107L0.983
11:60703479:T:AF107L0.983
11:60703479:T:GF107L0.983
11:60703486:T:AW110R0.983
11:60703486:T:CW110R0.983
11:60703396:G:CG80R0.982
11:60707001:G:AG119E0.982
11:60708689:T:CC148R0.982
11:60703417:G:CG87R0.980
11:60708701:G:AG152R0.980
11:60708701:G:CG152R0.980
11:60701077:G:TG73W0.978
11:60708673:C:AN142K0.975
11:60708673:C:GN142K0.975
11:60703418:G:AG87D0.974
11:60715060:T:CC192R0.972
11:60708702:G:AG152E0.971
11:60703492:G:CG112R0.970
11:60707000:G:AG119R0.970
11:60707000:G:CG119R0.970
11:60703498:T:AW114R0.969
11:60703498:T:CW114R0.969
11:60701077:G:AG73R0.968

dbSNP variants (sampled 300 via entrez): RS1000290058 (11:60710170 C>G), RS1000405815 (11:60710437 G>A), RS1000418375 (11:60704799 A>C,T), RS1000509730 (11:60709495 C>T), RS1000606226 (11:60703282 T>C,G), RS1000661361 (11:60714792 G>A), RS1000775540 (11:60714990 C>A,T), RS1000911788 (11:60698384 T>C), RS1001184002 (11:60704317 C>T), RS1001303908 (11:60705563 G>A), RS1001729848 (11:60697939 A>T), RS1001746177 (11:60710559 G>C), RS1001793222 (11:60714452 A>G), RS1002100143 (11:60710939 C>T), RS1002283635 (11:60704726 C>G)

Disease associations

OMIM: gene MIM:606549 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

5 total (human), top 5 by PubMed support.

ChemicalActions (top 5)PubMed papers
Air Pollutantsincreases abundance, increases expression1
Smokeincreases abundance, increases expression1
Tobacco Smoke Pollutiondecreases expression1
Aflatoxin B1increases methylation1
Antirheumatic Agentsincreases expression1

Cellosaurus cell lines

3 cell lines: 3 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_VD91300-19 GFP-MS4A8BCancer cell line
CVCL_VD96BJAB GFP-MS4A8BCancer cell lineFemale
CVCL_VE01BJAB HA-MS4A8BCancer cell lineFemale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.