CEACAM21

gene
On this page

Also known as R29124_1FLJ13540

Summary

CEACAM21 (CEA cell adhesion molecule 21, HGNC:28834) is a protein-coding gene on chromosome 19q13.2, encoding Cell adhesion molecule CEACAM21 (Q3KPI0).

Predicted to be involved in T cell activation and immune response. Predicted to be located in membrane. Predicted to be active in external side of plasma membrane.

Source: NCBI Gene 90273 — RefSeq curated summary.

At a glance

  • GWAS associations: 4
  • Clinical variants (ClinVar): 63 total
  • MANE Select transcript: NM_001098506

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:28834
Approved symbolCEACAM21
NameCEA cell adhesion molecule 21
Location19q13.2
Locus typegene with protein product
StatusApproved
AliasesR29124_1, FLJ13540
Ensembl geneENSG00000007129
Ensembl biotypeprotein_coding
OMIM618191
Entrez90273

Gene structure

Transcript identifiers

Ensembl transcripts: 10 — 7 protein_coding, 2 protein_coding_CDS_not_defined, 1 nonsense_mediated_decay

ENST00000187608, ENST00000401445, ENST00000407170, ENST00000457737, ENST00000482870, ENST00000618577, ENST00000632983, ENST00000890424, ENST00000890425, ENST00000967641

RefSeq mRNA: 4 — MANE Select: NM_001098506 NM_001098506, NM_001288773, NM_001290113, NM_033543

CCDS: CCDS46086, CCDS46087, CCDS74373

Canonical transcript exons

ENST00000401445 — 7 exons

ExonStartEnd
ENSE000014765874158434741584443
ENSE000018137514158646441586844
ENSE000018211844157616641576338
ENSE000035060044157935341579628
ENSE000035589864158544341585495
ENSE000035872324157720041577559
ENSE000036031254158584041585871

Expression profiles

Bgee: expression breadth ubiquitous, 132 present calls, max score 92.02.

FANTOM5 (CAGE): breadth broad, TPM avg 7.9132 / max 396.1095, expressed in 580 samples.

FANTOM5 promoters (5 alternative TSS)

Promoter IDTPM avgSamples expressed
1760406.8625414
1760440.7843290
2088290.119749
1760420.114325
1760430.03248

Top tissues by expression

135 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
lymph nodeUBERON:000002992.02gold quality
granulocyteCL:000009491.88gold quality
bloodUBERON:000017890.93gold quality
bone marrow cellCL:000209288.57gold quality
bone marrowUBERON:000237188.50gold quality
spleenUBERON:000210687.14gold quality
vermiform appendixUBERON:000115485.65gold quality
placentaUBERON:000198779.09gold quality
leukocyteCL:000073878.71gold quality
monocyteCL:000057677.66gold quality
hypothalamusUBERON:000189877.44gold quality
gall bladderUBERON:000211076.86gold quality
right lungUBERON:000216776.78gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047375.62silver quality
small intestine Peyer’s patchUBERON:000345475.46gold quality
rectumUBERON:000105275.18gold quality
substantia nigraUBERON:000203874.86gold quality
small intestineUBERON:000210874.23gold quality
upper lobe of left lungUBERON:000895273.55gold quality
duodenumUBERON:000211472.53gold quality
C1 segment of cervical spinal cordUBERON:000646972.22gold quality
lungUBERON:000204871.75gold quality
omental fat padUBERON:001041469.68gold quality
smooth muscle tissueUBERON:000113569.57gold quality
apex of heartUBERON:000209869.38gold quality
right coronary arteryUBERON:000162569.23gold quality
tonsilUBERON:000237268.99gold quality
mucosa of transverse colonUBERON:000499168.43gold quality
adipose tissueUBERON:000101368.07gold quality
subcutaneous adipose tissueUBERON:000219066.81gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-HCAD-5yes155.99
E-ANND-3no1.37

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

6 targeting CEACAM21, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-453099.6966.471509
HSA-MIR-429199.2068.882969
HSA-MIR-892C-5P99.1670.562116
HSA-MIR-510099.1167.521098
HSA-MIR-59998.3266.991037
HSA-MIR-59296.5967.59817

Literature-anchored findings (GeneRIF, showing 1)

  • This study revealed CEACAM21 as novel schizophrenia candidate genes in the Jewish population (PMID:21682944)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
mus_musculusCeacam9ENSMUSG00000007209
mus_musculusCeacam15ENSMUSG00000078795
rattus_norvegicusCeacam15ENSRNOG00000028617
rattus_norvegicusCeacam9ENSRNOG00000046915
rattus_norvegicusCeacam15l1ENSRNOG00000065825

Paralogs (24): CEACAM7 (ENSG00000007306), CEACAM1 (ENSG00000079385), CEACAM6 (ENSG00000086548), CEACAM4 (ENSG00000105352), CEACAM5 (ENSG00000105388), PSG8 (ENSG00000124467), CEACAM8 (ENSG00000124469), HEPACAM (ENSG00000165478), PSG6 (ENSG00000170848), CEACAM3 (ENSG00000170956), PSG9 (ENSG00000183668), CEACAM19 (ENSG00000186567), HEPACAM2 (ENSG00000188175), PSG5 (ENSG00000204941), CEACAM18 (ENSG00000213822), CEACAM16 (ENSG00000213892), VSTM5 (ENSG00000214376), PSG3 (ENSG00000221826), PSG7 (ENSG00000221878), PSG1 (ENSG00000231924), PSG2 (ENSG00000242221), PSG11 (ENSG00000243130), PSG4 (ENSG00000243137), CEACAM20 (ENSG00000273777)

Protein

Protein identifiers

Cell adhesion molecule CEACAM21Q3KPI0 (reviewed: Q3KPI0)

Alternative names: Carcinoembryonic antigen-related cell adhesion molecule 21

All UniProt accessions (3): Q3KPI0, A0A0B4J1W4, A0A0J9YX60

UniProt curated annotations — full annotation on UniProt →

Subcellular location. Membrane.

Similarity. Belongs to the immunoglobulin superfamily. CEA family.

Isoforms (3)

UniProt IDNamesCanonical?
Q3KPI0-11yes
Q3KPI0-22
Q3KPI0-33

RefSeq proteins (4): NP_001091976, NP_001275702, NP_001277042, NP_291021 (=MANE)

Domains & families (InterPro)

IDNameType
IPR003598Ig_sub2Domain
IPR003599Ig_subDomain
IPR007110Ig-like_domDomain
IPR013106Ig_V-setDomain
IPR013783Ig-like_foldHomologous_superfamily
IPR036179Ig-like_dom_sfHomologous_superfamily
IPR050831CEA_cell_adhesionFamily

Pfam: PF07686, PF13927

UniProt features (13 total): splice variant 2, sequence variant 2, topological domain 2, signal peptide 1, chain 1, transmembrane region 1, domain 1, region of interest 1, glycosylation site 1, disulfide bond 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q3KPI0-F185.540.72

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Disulfide bonds (1): 166–214

Glycosylation sites (1): 111

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 124 (showing top): GAUSSMANN_MLL_AF4_FUSION_TARGETS_A_UP, GOCC_CELL_SURFACE, DARWICHE_SKIN_TUMOR_PROMOTER_UP, DARWICHE_SKIN_TUMOR_PROMOTER_DN, DARWICHE_PAPILLOMA_RISK_LOW_UP, DARWICHE_PAPILLOMA_RISK_HIGH_UP, DARWICHE_SQUAMOUS_CELL_CARCINOMA_UP, BROWN_MYELOID_CELL_DEVELOPMENT_UP, AFFAR_YY1_TARGETS_UP, DOUGLAS_BMI1_TARGETS_UP, IVANOVA_HEMATOPOIESIS_STEM_CELL_LONG_TERM, GOCC_SIDE_OF_MEMBRANE, MATSUDA_NATURAL_KILLER_DIFFERENTIATION, YOSHIMURA_MAPK8_TARGETS_UP, MIKKELSEN_ES_ICP_WITH_H3K4ME3

GO Biological Process (2): immune response (GO:0006955), T cell activation (GO:0042110)

GO Molecular Function (0):

GO Cellular Component (2): external side of plasma membrane (GO:0009897), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
immune system process1
response to stimulus1
lymphocyte activation1
plasma membrane1
cell surface1
side of membrane1
cellular anatomical structure1

Protein interactions and networks

STRING

1220 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CEACAM21CEACAM19Q7Z692651
CEACAM21SYKP43405558
CEACAM21VAV1P15498490
CEACAM21CD55P08174458
CEACAM21CD79AP11912443
CEACAM21ERV3-1Q14264436
CEACAM21ERVFRD-1P60508427
CEACAM21ERVW-1Q9UQF0422
CEACAM21DMAC2Q9NW81418
CEACAM21EFHBQ8N7U6404
CEACAM21ESR1P03372397
CEACAM21CD40LGP29965387
CEACAM21HOXA2O43364370
CEACAM21SIGLEC5O15389367
CEACAM21CAPSLQ8WWF8336

IntAct

7 interactions, top by confidence:

ABTypeScore
DPPA4ALOX12Bpsi-mi:“MI:0914”(association)0.530
LILRA6CEACAM21psi-mi:“MI:0915”(physical association)0.400
LILRB1CEACAM21psi-mi:“MI:0915”(physical association)0.400
CEACAM21METpsi-mi:“MI:0914”(association)0.350
CEACAM21GAPDHSpsi-mi:“MI:0914”(association)0.350
CEACAM21VGLL4psi-mi:“MI:0914”(association)0.350

BioGRID (148): KIAA0319L (Affinity Capture-MS), HLA-F (Affinity Capture-MS), ABHD14A (Affinity Capture-MS), KIDINS220 (Affinity Capture-MS), FAT4 (Affinity Capture-MS), ATP6V0A1 (Affinity Capture-MS), ABCB6 (Affinity Capture-MS), ATR (Affinity Capture-MS), PTK7 (Affinity Capture-MS), NCR3LG1 (Affinity Capture-MS), PCDHGA11 (Affinity Capture-MS), PCDHGC3 (Affinity Capture-MS), KIAA1524 (Affinity Capture-MS), PCDH20 (Affinity Capture-MS), XPR1 (Affinity Capture-MS)

ESM2 similar proteins: A0A0E4BZH1, A4QPC6, A5D7V5, A7TZE6, A7TZF0, A7TZF3, A7XUX6, A7XV04, A7XV07, A8K4G0, A8MVZ5, O70355, P08508, P18892, P24071, P31994, P55803, P78410, P79391, Q13410, Q16653, Q29ZQ1, Q3KPI0, Q58DF9, Q5R7W8, Q5R960, Q5R996, Q61885, Q62556, Q63345, Q6Q8B3, Q6UXZ3, Q6XJV4, Q6XJV6, Q7KYR7, Q7TST0, Q7YR73, Q8BTP3, Q8K249, Q8TD46

Diamond homologs: A0A0B4J1L0, D3ZQE1, E9QA28, O75871, P06731, P11464, P11465, P13688, P16573, P31809, P31997, P40198, P40199, Q00887, Q00888, Q00889, Q13046, Q14002, Q15238, Q16557, Q2WEN9, Q3KPI0, Q3UKK2, Q61400, Q63111, Q810J1, Q925P2, Q9D2Z1, Q9UQ72, Q9UQ74, A4FUY1, A6QQC6, A8MVW5, D3YXG0, D3ZB51, E9PZ19, O08775, P13595, P13596, P35968

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

63 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance51
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1055 predictions. Top by Δscore:

VariantEffectΔscore
19:41579627:AT:Adonor_gain1.0000
19:41579627:ATG:Adonor_loss1.0000
19:41579628:TG:Tdonor_loss1.0000
19:41579629:G:GGdonor_gain1.0000
19:41579629:GTGA:Gdonor_loss1.0000
19:41579630:T:Gdonor_loss1.0000
19:41579631:GAGT:Gdonor_loss1.0000
19:41576335:ACAGG:Adonor_loss0.9900
19:41576336:CAGG:Cdonor_loss0.9900
19:41576337:AGGTG:Adonor_loss0.9900
19:41576339:G:GAdonor_loss0.9900
19:41577198:A:AGacceptor_gain0.9900
19:41577199:G:GGacceptor_gain0.9900
19:41577199:GCCTC:Gacceptor_gain0.9900
19:41577558:CGGTG:Cdonor_loss0.9900
19:41577559:GGT:Gdonor_loss0.9900
19:41577560:GTGA:Gdonor_loss0.9900
19:41579625:AAAT:Adonor_gain0.9900
19:41579626:AAT:Adonor_gain0.9900
19:41584345:A:AGacceptor_gain0.9900
19:41584346:G:GAacceptor_gain0.9900
19:41584346:GC:Gacceptor_gain0.9900
19:41584346:GCA:Gacceptor_gain0.9900
19:41584441:CAGGT:Cdonor_loss0.9900
19:41584442:AGG:Adonor_loss0.9900
19:41584443:GG:Gdonor_loss0.9900
19:41584444:GT:Gdonor_loss0.9900
19:41584445:T:Gdonor_loss0.9900
19:41585496:G:Tdonor_loss0.9900
19:41585497:T:Adonor_loss0.9900

AlphaMissense

1899 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
19:41579459:G:CW177C0.992
19:41579459:G:TW177C0.992
19:41579457:T:AW177R0.991
19:41579457:T:CW177R0.991
19:41577336:G:CW67C0.987
19:41577336:G:TW67C0.987
19:41577334:T:AW67R0.982
19:41577334:T:CW67R0.982
19:41579563:A:GY212C0.982
19:41579562:T:GY212D0.981
19:41579568:T:AC214S0.981
19:41579569:G:CC214S0.981
19:41579562:T:CY212H0.978
19:41579563:A:CY212S0.977
19:41579601:A:CS225R0.972
19:41579603:C:AS225R0.972
19:41579603:C:GS225R0.972
19:41579458:G:CW177S0.969
19:41579424:T:AC166S0.968
19:41579425:G:CC166S0.968
19:41579424:T:CC166R0.965
19:41579419:T:CL164P0.964
19:41577428:G:CR98P0.963
19:41577455:T:CL107P0.959
19:41577493:T:GY120D0.956
19:41579568:T:CC214R0.955
19:41577482:A:TD116V0.954
19:41577482:A:CD116A0.953
19:41577500:T:CL122P0.953
19:41579566:A:CQ213P0.953

dbSNP variants (sampled 300 via entrez): RS1000057280 (19:41575631 T>C), RS1000141865 (19:41584189 C>T), RS1000427941 (19:41551959 C>T), RS1000534793 (19:41575294 G>A), RS1000546555 (19:41550562 G>A), RS1000597904 (19:41556816 G>T), RS1000611675 (19:41551771 A>G), RS1000745915 (19:41585275 G>A), RS1000812150 (19:41586565 C>A,T), RS1000841246 (19:41569548 G>A), RS1000955451 (19:41569327 C>T), RS1001203955 (19:41552875 C>T), RS1001261990 (19:41560205 A>G), RS1001407522 (19:41563658 G>A), RS1001783423 (19:41565885 C>T)

Disease associations

OMIM: gene MIM:618191 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

4 associations (top):

StudyTraitp-value
GCST001119_2Schizophrenia1.000000e-07
GCST001714_1Prostate cancer1.000000e-08
GCST001714_2Prostate cancer2.000000e-12
GCST004744_58Lung adenocarcinoma3.000000e-07

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

12 total (human), top 12 by PubMed support.

ChemicalActions (top 5)PubMed papers
GSK-J4decreases expression1
triphenyl phosphateaffects expression1
pirinixic acidincreases activity, affects binding, decreases expression1
arseniteaffects expression1
perfluorooctanoic acidincreases expression1
S-(1,2-dichlorovinyl)cysteineaffects response to substance, increases expression1
di-n-butylphosphoric acidaffects expression1
CGP 52608increases reaction, affects binding1
Amphotericin Bincreases expression1
Benzo(a)pyreneaffects methylation1
Lipopolysaccharidesdecreases expression, affects response to substance, increases expression1
Antirheumatic Agentsdecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.