CEACAM4

gene
On this page

Summary

CEACAM4 (CEA cell adhesion molecule 4, HGNC:1816) is a protein-coding gene on chromosome 19q13.2, encoding Cell adhesion molecule CEACAM4 (O75871). Granulocyte orphan receptor that acts as an trigger efficient phagocytosis of attached particles.

Predicted to enable protein tyrosine kinase binding activity. Involved in phagocytosis. Predicted to be located in membrane. Predicted to be active in cell surface and plasma membrane.

Source: NCBI Gene 1089 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 47 total
  • MANE Select transcript: NM_001817

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:1816
Approved symbolCEACAM4
NameCEA cell adhesion molecule 4
Location19q13.2
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000105352
Ensembl biotypeprotein_coding
OMIM619159
Entrez1089

Gene structure

Transcript identifiers

Ensembl transcripts: 4 — 3 protein_coding, 1 retained_intron

ENST00000221954, ENST00000472081, ENST00000600925, ENST00000902906

RefSeq mRNA: 4 — MANE Select: NM_001817 NM_001362492, NM_001362493, NM_001362495, NM_001817

CCDS: CCDS33033

Canonical transcript exons

ENST00000221954 — 7 exons

ExonStartEnd
ENSE000007085454162560141625960
ENSE000008424604161967041619711
ENSE000008424614162021141620242
ENSE000013228064161901541619395
ENSE000024939264162165141621768
ENSE000025143444162057541620627
ENSE000035549114162690041627074

Expression profiles

Bgee: expression breadth ubiquitous, 126 present calls, max score 95.65.

FANTOM5 (CAGE): breadth broad, TPM avg 3.1596 / max 431.5855, expressed in 241 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
1810822.9506226
1810810.209059

Top tissues by expression

134 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
bloodUBERON:000017895.65gold quality
monocyteCL:000057689.71gold quality
spleenUBERON:000210689.46gold quality
leukocyteCL:000073889.36gold quality
granulocyteCL:000009487.54gold quality
bone marrowUBERON:000237185.94gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047382.26silver quality
bone marrow cellCL:000209281.48gold quality
vermiform appendixUBERON:000115481.21gold quality
right lungUBERON:000216780.83gold quality
upper lobe of left lungUBERON:000895273.50gold quality
lymph nodeUBERON:000002967.02gold quality
lungUBERON:000204866.74gold quality
smooth muscle tissueUBERON:000113566.17gold quality
gall bladderUBERON:000211066.02gold quality
right coronary arteryUBERON:000162565.92gold quality
left uterine tubeUBERON:000130365.33gold quality
descending thoracic aortaUBERON:000234564.63gold quality
placentaUBERON:000198763.10gold quality
small intestine Peyer’s patchUBERON:000345462.45gold quality
mucosa of stomachUBERON:000119962.24gold quality
omental fat padUBERON:001041461.84gold quality
small intestineUBERON:000210861.46gold quality
left coronary arteryUBERON:000162661.07gold quality
duodenumUBERON:000211460.83gold quality
esophagus mucosaUBERON:000246960.53gold quality
adipose tissueUBERON:000101360.39gold quality
heart left ventricleUBERON:000208460.35gold quality
tibial nerveUBERON:000132359.78gold quality
body of pancreasUBERON:000115059.30gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes6.34

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

6 targeting CEACAM4, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-488-5P99.2868.12821
HSA-MIR-429199.2068.882969
HSA-MIR-892C-5P99.1670.562116
HSA-MIR-510099.1167.521098
HSA-MIR-6852-3P98.5467.601468
HSA-MIR-446898.0166.851187

Literature-anchored findings (GeneRIF, showing 2)

  • CEACAM4 is expressed in tumor-derived cell lines, and this expression is specific to an medullary thyroid carcinoma derived cell line. (PMID:25152032)
  • CEACAM4 has phagocytic function. (PMID:25567962)

Cross-species orthologs

0 orthologs

Paralogs (24): CEACAM21 (ENSG00000007129), CEACAM7 (ENSG00000007306), CEACAM1 (ENSG00000079385), CEACAM6 (ENSG00000086548), CEACAM5 (ENSG00000105388), PSG8 (ENSG00000124467), CEACAM8 (ENSG00000124469), HEPACAM (ENSG00000165478), PSG6 (ENSG00000170848), CEACAM3 (ENSG00000170956), PSG9 (ENSG00000183668), CEACAM19 (ENSG00000186567), HEPACAM2 (ENSG00000188175), PSG5 (ENSG00000204941), CEACAM18 (ENSG00000213822), CEACAM16 (ENSG00000213892), VSTM5 (ENSG00000214376), PSG3 (ENSG00000221826), PSG7 (ENSG00000221878), PSG1 (ENSG00000231924), PSG2 (ENSG00000242221), PSG11 (ENSG00000243130), PSG4 (ENSG00000243137), CEACAM20 (ENSG00000273777)

Protein

Protein identifiers

Cell adhesion molecule CEACAM4O75871 (reviewed: O75871)

Alternative names: Carcinoembryonic antigen CGM7, Carcinoembryonic antigen-related cell adhesion molecule 4, Non-specific cross-reacting antigen W236

All UniProt accessions (2): O75871, M0R363

UniProt curated annotations — full annotation on UniProt →

Function. Granulocyte orphan receptor that acts as an trigger efficient phagocytosis of attached particles.

Subunit / interactions. Interacts through its phosphorylated ITAM domain with the SH2 domain-containing cytoplasmic proteins involved in signaling processes during phagocytosis.

Subcellular location. Membrane.

Tissue specificity. Granulocytes.

Post-translational modifications. N-glycosylated. The cytoplasmic ITAM-like sequence becomes tyrosine phosphorylated by SRC family PTKs upon ligand-mediated receptor clustering and allows to initiate phagocytosis of bound ligand.

Miscellaneous. To study the function of the orphan receptor CEACAM4 chimeric proteins containing the extracellular bacteria-binding domain of CEACAM3 and the transmembrane and cytoplasmic part of CEACAM4 has been used.

Similarity. Belongs to the immunoglobulin superfamily. CEA family.

RefSeq proteins (4): NP_001349421, NP_001349422, NP_001349424, NP_001808* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR007110Ig-like_domDomain
IPR013106Ig_V-setDomain
IPR013783Ig-like_foldHomologous_superfamily
IPR036179Ig-like_dom_sfHomologous_superfamily
IPR050831CEA_cell_adhesionFamily

Pfam: PF07686

UniProt features (18 total): glycosylation site 4, sequence variant 2, mutagenesis site 2, sequence conflict 2, topological domain 2, signal peptide 1, chain 1, transmembrane region 1, domain 1, region of interest 1, short sequence motif 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O75871-F176.990.51

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Glycosylation sites (4): 111, 126, 57, 104

Mutagenesis-validated functional residues (2):

PositionPhenotype
222no internalization of bacteria; when associated with f-233.
233no internalization of bacteria; when associated with f-222.

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 63 (showing top): VERHAAK_AML_WITH_NPM1_MUTATED_DN, TSENG_IRS1_TARGETS_UP, MODULE_45, MODULE_64, GOCC_CELL_SURFACE, GOBP_VESICLE_MEDIATED_TRANSPORT, SHETH_LIVER_CANCER_VS_TXNIP_LOSS_PAM5, CAR_MYST2, MODULE_157, CAR_MLANA, SANSOM_APC_TARGETS_DN, AFFAR_YY1_TARGETS_DN, MODULE_242, GOBP_IMPORT_INTO_CELL, MODULE_20

GO Biological Process (3): regulation of immune system process (GO:0002682), phagocytosis (GO:0006909), signal transduction (GO:0007165)

GO Molecular Function (1): protein tyrosine kinase binding (GO:1990782)

GO Cellular Component (3): plasma membrane (GO:0005886), cell surface (GO:0009986), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure2
immune system process1
regulation of biological process1
endocytosis1
cell communication1
cellular process1
signaling1
regulation of cellular process1
cellular response to stimulus1
protein kinase binding1
membrane1
cell periphery1

Protein interactions and networks

STRING

1588 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CEACAM4PLA2G7Q13093601
CEACAM4CEACAM20Q6UY09600
CEACAM4UNC79Q9P2D8575
CEACAM4GK5Q6ZS86572
CEACAM4NALCNQ8IZF0542
CEACAM4UNC80Q8N2C7542
CEACAM4CYP2A13Q16696521
CEACAM4A0A096LNM5A0A096LNM5501
CEACAM4CYP2A7P20853485
CEACAM4FLT4P35916472
CEACAM4CYP2A6P00190448
CEACAM4CEACAM19Q7Z692434
CEACAM4PDCD1LG2Q9BQ51420
CEACAM4PTPRN2Q92932416
CEACAM4IGDCC3Q8IVU1400

IntAct

4 interactions, top by confidence:

ABTypeScore
PDCD1LG2CEACAM4psi-mi:“MI:0915”(physical association)0.400
FLT4CEACAM4psi-mi:“MI:0915”(physical association)0.400
CEACAM4FLT4psi-mi:“MI:0915”(physical association)0.400

ESM2 similar proteins: A0A0K2S4Q6, A2A7V7, A6NI73, A8K4G0, O43699, O75019, O75022, O75023, O75871, O76036, P0C191, P20138, P24071, P40198, P59901, P80943, Q08708, Q13410, Q28110, Q3U497, Q496F6, Q64JA4, Q6GTX8, Q6ISS4, Q6PI73, Q6UXZ3, Q7TSN2, Q863H2, Q8C567, Q8K249, Q8MJZ2, Q8MJZ7, Q8N149, Q8N423, Q8N6C8, Q8NHJ6, Q8NHL6, Q8VBT3, Q8VCH2, Q95JB9

Diamond homologs: A0A0B4J1L0, D3ZQE1, E9QA28, O75871, P06731, P11464, P11465, P13688, P16573, P31809, P31997, P40198, P40199, Q00887, Q00888, Q00889, Q13046, Q14002, Q15238, Q16557, Q2WEN9, Q3KPI0, Q3UKK2, Q61400, Q63111, Q810J1, Q925P2, Q9D2Z1, Q9UQ72, Q9UQ74, A8MTB9, Q9D871, Q0E9H9, Q6UY09, A0A140LHF2, Q8BFR2, Q8N475, Q9PWR4, P35329, Q4VAH7

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

47 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance38
Likely benign3
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1063 predictions. Top by Δscore:

VariantEffectΔscore
19:41619946:A:ACdonor_gain0.9900
19:41619947:C:CCdonor_gain0.9900
19:41621653:T:Adonor_gain0.9900
19:41625599:A:ACdonor_gain0.9900
19:41625600:C:CCdonor_gain0.9900
19:41626894:CCTCA:Cdonor_loss0.9900
19:41626895:CTCAC:Cdonor_loss0.9900
19:41626896:TCAC:Tdonor_loss0.9900
19:41626897:CAC:Cdonor_loss0.9900
19:41619712:C:CCacceptor_gain0.9800
19:41620567:ACACT:Adonor_loss0.9800
19:41620568:CACT:Cdonor_loss0.9800
19:41620569:ACTC:Adonor_loss0.9800
19:41620570:CT:Cdonor_loss0.9800
19:41620571:T:TGdonor_loss0.9800
19:41620572:C:CCdonor_loss0.9800
19:41620574:C:CGdonor_loss0.9800
19:41620625:GCCC:Gacceptor_loss0.9800
19:41620627:CCTAG:Cacceptor_loss0.9800
19:41620628:C:CAacceptor_loss0.9800
19:41620629:T:Cacceptor_loss0.9800
19:41620650:A:Cacceptor_gain0.9800
19:41620709:C:CAdonor_gain0.9800
19:41619707:GGGGC:Gacceptor_gain0.9700
19:41619710:GCCT:Gacceptor_loss0.9700
19:41619712:C:CAacceptor_loss0.9700
19:41619713:T:Aacceptor_loss0.9700
19:41619923:AC:Adonor_gain0.9700
19:41619924:CC:Cdonor_gain0.9700
19:41620573:A:ACdonor_gain0.9700

AlphaMissense

1548 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
19:41625824:C:AW67C0.984
19:41625824:C:GW67C0.984
19:41625709:A:GS106P0.975
19:41625826:A:GW67R0.972
19:41625826:A:TW67R0.972
19:41625711:C:TG105E0.965
19:41625667:A:CY120D0.963
19:41625732:C:GR98P0.963
19:41625705:A:GL107P0.960
19:41625657:C:GR123P0.954
19:41625660:A:GL122P0.954
19:41625835:C:GA64P0.946
19:41625825:C:GW67S0.940
19:41625666:T:GY120S0.936
19:41625711:C:AG105V0.932
19:41625667:A:TY120N0.923
19:41625832:A:CY65D0.922
19:41625867:A:GL53P0.918
19:41625699:A:GF109S0.914
19:41625728:C:AE99D0.912
19:41625728:C:GE99D0.912
19:41625625:C:GA134P0.900
19:41625667:A:GY120H0.900
19:41625678:T:AD116V0.896
19:41625787:C:GA80P0.894
19:41625789:A:TI79N0.894
19:41625817:C:AG70W0.893
19:41625702:A:GL108P0.892
19:41625678:T:GD116A0.889
19:41625712:C:GG105R0.885

dbSNP variants (sampled 300 via entrez): RS1000199324 (19:41626321 C>T), RS1000331336 (19:41629036 G>T), RS1000354985 (19:41623424 T>G), RS1000686363 (19:41622399 C>T), RS1000870495 (19:41628868 T>A), RS1001070201 (19:41627727 C>T), RS1002392614 (19:41614999 G>C), RS1002971016 (19:41627191 C>A), RS1003048800 (19:41617295 G>A), RS1003109592 (19:41622324 C>T), RS1003236477 (19:41626094 G>A,T), RS1003392869 (19:41616093 G>A), RS1003977632 (19:41625654 G>A), RS1004018089 (19:41620851 G>A), RS1004992228 (19:41616087 T>A,C)

Disease associations

OMIM: gene MIM:619159 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST004744_58Lung adenocarcinoma3.000000e-07
GCST007160_30Refractive astigmatism8.000000e-06

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

11 total (human), top 11 by PubMed support.

ChemicalActions (top 5)PubMed papers
triphenyl phosphateaffects expression1
S-(1,2-dichlorovinyl)cysteineaffects cotreatment, decreases expression1
di-n-butylphosphoric acidaffects expression1
theaflavin-3,3’-digallateaffects expression1
Acetaminophendecreases expression1
Aldehydesincreases expression1
Atrazineincreases expression1
Benzo(a)pyreneincreases methylation1
Lipopolysaccharidesaffects cotreatment, decreases expression1
Tretinoinincreases expression1
Aflatoxin B1increases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.