CEACAM7

gene
On this page

Also known as CEA

Summary

CEACAM7 (CEA cell adhesion molecule 7, HGNC:1819) is a protein-coding gene on chromosome 19q13.2, encoding Cell adhesion molecule CEACAM7 (Q14002).

This gene encodes a cell surface glycoprotein and member of the carcinoembryonic antigen (CEA) family of proteins. Expression of this gene may be downregulated in colon and rectal cancer. Additionally, lower expression levels of this gene may be predictive of rectal cancer recurrence. This gene is present in a CEA family gene cluster on chromosome 19. Alternative splicing results in multiple transcript variants.

Source: NCBI Gene 1087 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 47 total
  • MANE Select transcript: NM_001291485

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:1819
Approved symbolCEACAM7
NameCEA cell adhesion molecule 7
Location19q13.2
Locus typegene with protein product
StatusApproved
AliasesCEA
Ensembl geneENSG00000007306
Ensembl biotypeprotein_coding
OMIM619160
Entrez1087

Gene structure

Transcript identifiers

Ensembl transcripts: 16 — 15 protein_coding, 1 protein_coding_CDS_not_defined

ENST00000006724, ENST00000401731, ENST00000599715, ENST00000602225, ENST00000869051, ENST00000869052, ENST00000869053, ENST00000869054, ENST00000869055, ENST00000869056, ENST00000869057, ENST00000869058, ENST00000869059, ENST00000869060, ENST00000869061, ENST00000948635

RefSeq mRNA: 2 — MANE Select: NM_001291485 NM_001291485, NM_006890

CCDS: CCDS12583

Canonical transcript exons

ENST00000401731 — 5 exons

ExonStartEnd
ENSE000008424664167737641677503
ENSE000015554724168810241688270
ENSE000015609134167330341674739
ENSE000025179164168378541684063
ENSE000034955674168685941687221

Expression profiles

Bgee: expression breadth ubiquitous, 137 present calls, max score 99.62.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.9852 / max 1201.3699, expressed in 35 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
1810850.857335
1810840.127912

Top tissues by expression

267 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
mucosa of transverse colonUBERON:000499199.62gold quality
rectumUBERON:000105299.61gold quality
colonic mucosaUBERON:000031799.32gold quality
mucosa of sigmoid colonUBERON:000499399.17gold quality
ileal mucosaUBERON:000033193.00gold quality
vermiform appendixUBERON:000115492.61gold quality
transverse colonUBERON:000115791.47gold quality
islet of LangerhansUBERON:000000689.42gold quality
lower esophagus mucosaUBERON:003583489.37gold quality
gall bladderUBERON:000211088.67gold quality
esophagus mucosaUBERON:000246985.96gold quality
caecumUBERON:000115385.29gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047384.95gold quality
diaphragmUBERON:000110379.56gold quality
palpebral conjunctivaUBERON:000181279.55gold quality
large intestineUBERON:000005979.13gold quality
colonUBERON:000115578.37gold quality
vaginaUBERON:000099677.96gold quality
cervix epitheliumUBERON:000480177.53gold quality
pancreatic ductal cellCL:000207977.32silver quality
epithelium of esophagusUBERON:000197676.26gold quality
oral cavityUBERON:000016775.90gold quality
esophagus squamous epitheliumUBERON:000692075.86gold quality
colonic epitheliumUBERON:000039775.23gold quality
cervix squamous epitheliumUBERON:000692273.79gold quality
epithelial cell of pancreasCL:000008371.26silver quality
olfactory bulbUBERON:000226471.09gold quality
pancreasUBERON:000126470.93gold quality
squamous epitheliumUBERON:000691470.58gold quality
intestineUBERON:000016070.33gold quality

Single-cell (SCXA)

Detected in 3 experiment(s), a significant marker in 3.

ExperimentMarker?Max mean expression
E-GEOD-83139yes3529.05
E-GEOD-125970yes2283.47
E-ANND-3yes13.54

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): GLI2

miRNA regulators (miRDB)

59 targeting CEACAM7, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-4481100.0066.421669
HSA-MIR-103A-3P99.9869.141595
HSA-MIR-10799.9869.141595
HSA-MIR-4745-5P99.9865.951028
HSA-MIR-1468-3P99.9672.743797
HSA-MIR-391099.9571.132227
HSA-MIR-311999.9271.342390
HSA-MIR-430299.8967.941187
HSA-MIR-544A99.8468.661965
HSA-MIR-4668-5P99.7970.583782
HSA-MIR-200A-5P99.7669.10949
HSA-MIR-200B-5P99.7669.05948
HSA-MIR-6512-3P99.6566.071468
HSA-MIR-6720-5P99.6566.221459
HSA-MIR-130399.6569.771662
HSA-MIR-205399.5769.151635
HSA-MIR-1252-3P99.5567.712862
HSA-MIR-154-3P99.5070.05831
HSA-MIR-487A-3P99.5069.95840
HSA-MIR-548G-3P99.4868.672159
HSA-MIR-312399.4767.152693
HSA-MIR-6839-3P99.3968.861301
HSA-MIR-94099.3766.142064
HSA-MIR-431199.3170.473041
HSA-MIR-6808-5P99.3166.232150
HSA-MIR-6893-5P99.3166.252119
HSA-MIR-329-5P99.2768.111597
HSA-MIR-10522-5P99.2668.502087
HSA-MIR-426399.1869.252236
HSA-MIR-450499.1069.141328

Literature-anchored findings (GeneRIF, showing 6)

  • CGM2 RT-PCR assay provides no specific prognostic information and cannot be used as a decision criterion for adjuvant therapy after colorectal cancer surgery. (PMID:16627067)
  • CEACAM-7 expression is significantly decreased in rectal cancer; a potential tumor marker (PMID:20004437)
  • High CEACAM7 is associated with gastric carcinoma. (PMID:22195770)
  • crystal structure of the N-terminal dimerization domain of CEACAM has been determined at 1.47 A resolution (PMID:26323304)
  • CEACAM7 expression is significantly upregulated in human masticatory mucosa during wound healing (PMID:28005267)
  • CEACAM7 polymorphisms predict genetic predisposition to mortality in post-surgical septic shock patients. (PMID:34657826)

Cross-species orthologs

0 orthologs

Paralogs (24): CEACAM21 (ENSG00000007129), CEACAM1 (ENSG00000079385), CEACAM6 (ENSG00000086548), CEACAM4 (ENSG00000105352), CEACAM5 (ENSG00000105388), PSG8 (ENSG00000124467), CEACAM8 (ENSG00000124469), HEPACAM (ENSG00000165478), PSG6 (ENSG00000170848), CEACAM3 (ENSG00000170956), PSG9 (ENSG00000183668), CEACAM19 (ENSG00000186567), HEPACAM2 (ENSG00000188175), PSG5 (ENSG00000204941), CEACAM18 (ENSG00000213822), CEACAM16 (ENSG00000213892), VSTM5 (ENSG00000214376), PSG3 (ENSG00000221826), PSG7 (ENSG00000221878), PSG1 (ENSG00000231924), PSG2 (ENSG00000242221), PSG11 (ENSG00000243130), PSG4 (ENSG00000243137), CEACAM20 (ENSG00000273777)

Protein

Protein identifiers

Cell adhesion molecule CEACAM7Q14002 (reviewed: Q14002)

Alternative names: Carcinoembryonic antigen CGM2, Carcinoembryonic antigen-related cell adhesion molecule 7

All UniProt accessions (2): Q14002, A0A0A0MTT6

UniProt curated annotations — full annotation on UniProt →

Subunit / interactions. Homodimer.

Subcellular location. Cell membrane. Apical cell membrane.

Tissue specificity. Expressed in columnar epithelial cells of the colon (at protein level). Strongly down-regulated in colonic adenocarcinomas.

Similarity. Belongs to the immunoglobulin superfamily. CEA family.

Isoforms (2)

UniProt IDNamesCanonical?
Q14002-12ayes
Q14002-22b

RefSeq proteins (2): NP_001278414, NP_008821 (=MANE)

Domains & families (InterPro)

IDNameType
IPR003599Ig_subDomain
IPR007110Ig-like_domDomain
IPR013106Ig_V-setDomain
IPR013783Ig-like_foldHomologous_superfamily
IPR036179Ig-like_dom_sfHomologous_superfamily
IPR050831CEA_cell_adhesionFamily

Pfam: PF07686, PF13927

UniProt features (32 total): strand 8, glycosylation site 7, sequence variant 3, sequence conflict 3, helix 2, domain 2, signal peptide 1, chain 1, disulfide bond 1, splice variant 1, propeptide 1, turn 1, lipid moiety-binding region 1

Structure

Experimental structures (PDB)

1 structures.

PDBMethodResolution (Å)
4Y89X-RAY DIFFRACTION1.47

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q14002-F184.920.69

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (1): 242

Disulfide bonds (1): 168–216

Glycosylation sites (7): 174, 183, 198, 57, 85, 105, 112

Function

Pathways and Gene Ontology

Reactome pathways

3 pathways

IDPathway
R-HSA-163125Post-translational modification: synthesis of GPI-anchored proteins
R-HSA-392499Metabolism of proteins
R-HSA-597592Post-translational protein modification

MSigDB gene sets: 88 (showing top): GOCC_CELL_SURFACE, CREIGHTON_ENDOCRINE_THERAPY_RESISTANCE_5, GOCC_APICAL_PLASMA_MEMBRANE, MODULE_113, SABATES_COLORECTAL_ADENOMA_DN, GOCC_APICAL_PART_OF_CELL, GOCC_SIDE_OF_MEMBRANE, GOCC_PLASMA_MEMBRANE_REGION, GOMF_KINASE_BINDING, GOMF_PROTEIN_TYROSINE_KINASE_BINDING, DODD_NASOPHARYNGEAL_CARCINOMA_DN, REACTOME_POST_TRANSLATIONAL_PROTEIN_MODIFICATION, RICKMAN_HEAD_AND_NECK_CANCER_E, BOSCO_EPITHELIAL_DIFFERENTIATION_MODULE, REACTOME_POST_TRANSLATIONAL_MODIFICATION_SYNTHESIS_OF_GPI_ANCHORED_PROTEINS

GO Biological Process (2): regulation of immune system process (GO:0002682), signal transduction (GO:0007165)

GO Molecular Function (1): protein tyrosine kinase binding (GO:1990782)

GO Cellular Component (7): extracellular region (GO:0005576), cytosol (GO:0005829), plasma membrane (GO:0005886), cell surface (GO:0009986), apical plasma membrane (GO:0016324), side of membrane (GO:0098552), membrane (GO:0016020)

Reactome top-level categories

Rollup of top-2 pathways:

CategoryPathways
Post-translational protein modification1
Metabolism of proteins1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure5
membrane2
immune system process1
regulation of biological process1
cell communication1
cellular process1
signaling1
regulation of cellular process1
cellular response to stimulus1
protein kinase binding1
cytoplasm1
cell periphery1
apical part of cell1
plasma membrane region1
leaflet of membrane bilayer1

Protein interactions and networks

STRING

1126 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CEACAM7CYP2A13Q16696544
CEACAM7ZNF574Q6ZN55526
CEACAM7CYP2A7P20853524
CEACAM7CYP2A6P00190469
CEACAM7IQANK1A8MXQ7408
CEACAM7GRB14Q14449395
CEACAM7LRP6O75581378
CEACAM7CD177Q8N6Q3372
CEACAM7GRIK5Q16478363
CEACAM7A1BGP04217359
CEACAM7NCK2O43639353
CEACAM7BCL10O95999353
CEACAM7NCK1P16333353
CEACAM7ZNF526Q8TF50353
CEACAM7CARD9Q9H257353

IntAct

5 interactions, top by confidence:

ABTypeScore
CEACAM1CEACAM7psi-mi:“MI:0915”(physical association)0.400
CEACAM7CEACAM6psi-mi:“MI:0915”(physical association)0.400
CEACAM7PBRM1psi-mi:“MI:0915”(physical association)0.400
CEACAM7GPC1psi-mi:“MI:0915”(physical association)0.400

BioGRID (7): CEACAM7 (Proximity Label-MS), PBRM1 (Affinity Capture-MS), GPC1 (Affinity Capture-MS), PSG3 (Negative Genetic), PSG6 (Negative Genetic), CEACAM7 (Reconstituted Complex), CEACAM7 (Reconstituted Complex)

ESM2 similar proteins: A0A0K2S4Q6, A2A7V7, A5D7V5, A6NI73, A8K4G0, A8MVS5, O43699, O75022, O75023, O75871, P09564, P0C191, P13688, P16573, P20138, P24071, P31809, P31994, P40198, P43630, P59901, Q08708, Q13410, Q14002, Q28110, Q3KPI0, Q496F6, Q6GTX8, Q6UX52, Q6UXZ3, Q7TSN2, Q7Z692, Q810J1, Q863H2, Q8C567, Q8K249, Q8MJZ7, Q8N423, Q8N743, Q8NHJ6

Diamond homologs: A0A0B4J1L0, D3ZQE1, E9QA28, O75871, P06731, P11464, P11465, P13688, P16573, P31809, P31997, P40198, P40199, Q00887, Q00888, Q00889, Q13046, Q14002, Q15238, Q16557, Q2WEN9, Q3KPI0, Q3UKK2, Q61400, Q63111, Q810J1, Q925P2, Q9D2Z1, Q9UQ72, Q9UQ74, A8MTB9, Q9D871, Q0E9H9, Q6UY09, A0A140LHF2, Q8BFR2, Q8N475, Q9PWR4, P35329, Q4VAH7

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

47 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance41
Likely benign3
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

704 predictions. Top by Δscore:

VariantEffectΔscore
19:41674761:T:Cacceptor_gain1.0000
19:41684811:C:CAdonor_gain1.0000
19:41686857:A:ACdonor_gain1.0000
19:41686858:C:CCdonor_gain1.0000
19:41677375:CCACT:Cdonor_gain0.9900
19:41683783:A:ACdonor_gain0.9900
19:41683784:C:CCdonor_gain0.9900
19:41683784:CAG:Cdonor_gain0.9900
19:41684062:CG:Cacceptor_gain0.9900
19:41684062:CGCTG:Cacceptor_gain0.9900
19:41684064:C:CCacceptor_gain0.9900
19:41684066:G:Cacceptor_gain0.9900
19:41686885:T:TAdonor_gain0.9900
19:41687217:CGAGG:Cacceptor_gain0.9900
19:41687222:C:CCacceptor_gain0.9900
19:41688097:CTCAC:Cdonor_loss0.9900
19:41688100:A:AGdonor_loss0.9900
19:41688101:C:CTdonor_loss0.9900
19:41674760:C:CCacceptor_gain0.9800
19:41683776:GATAC:Gdonor_loss0.9800
19:41683777:ATAC:Adonor_loss0.9800
19:41683778:TACTC:Tdonor_loss0.9800
19:41683779:ACT:Adonor_loss0.9800
19:41683780:C:Gdonor_loss0.9800
19:41683781:T:TAdonor_loss0.9800
19:41683782:C:CGdonor_loss0.9800
19:41683783:A:AGdonor_loss0.9800
19:41683783:ACAG:Adonor_gain0.9800
19:41683784:CA:Cdonor_gain0.9800
19:41683784:CAGC:Cdonor_gain0.9800

AlphaMissense

1724 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
19:41687085:C:AW67C0.992
19:41687085:C:GW67C0.992
19:41687087:A:GW67R0.989
19:41687087:A:TW67R0.989
19:41683844:C:GC216S0.983
19:41683845:A:TC216S0.983
19:41686963:A:GL108P0.983
19:41683954:C:AW179C0.982
19:41683954:C:GW179C0.982
19:41683988:C:GC168S0.982
19:41683989:A:TC168S0.982
19:41686925:A:CY121D0.980
19:41683810:A:CS227R0.979
19:41683810:A:TS227R0.979
19:41683812:T:GS227R0.979
19:41683989:A:GC168R0.978
19:41683987:A:CC168W0.975
19:41686918:A:GL123P0.974
19:41683956:A:GW179R0.973
19:41683956:A:TW179R0.973
19:41687009:C:GG93R0.970
19:41687009:C:TG93R0.970
19:41683845:A:GC216R0.969
19:41683988:C:TC168Y0.964
19:41687008:C:TG93E0.964
19:41687053:C:GR78P0.964
19:41683851:A:CY214D0.963
19:41686960:A:GL109P0.962
19:41687083:T:GY68S0.960
19:41687084:A:CY68D0.960

dbSNP variants (sampled 300 via entrez): RS1000248838 (19:41686221 A>G), RS1000303782 (19:41679458 G>A), RS1000525458 (19:41678287 T>A), RS1000582906 (19:41684832 A>G), RS1000592013 (19:41679799 C>T), RS1000971 (19:41676018 A>C), RS1001062 (19:41685546 A>C), RS1001200218 (19:41681832 G>A), RS1001254535 (19:41687460 C>A), RS1001307519 (19:41674531 G>T), RS1001704078 (19:41687173 A>G,T), RS1001713584 (19:41681369 C>T), RS1002036343 (19:41685917 C>T), RS1002789003 (19:41688441 G>A), RS1003086038 (19:41688201 G>C)

Disease associations

OMIM: gene MIM:619160 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

11 total (human), top 11 by PubMed support.

ChemicalActions (top 5)PubMed papers
Nickeldecreases expression2
ethyl-p-hydroxybenzoatedecreases expression1
sodium arsenitedecreases expression1
1,2-dielaidoylphosphatidylethanolaminedecreases expression1
Zoledronic Acidincreases expression1
Benzo(a)pyreneaffects methylation1
Estradioldecreases expression1
Malathiondecreases expression1
Silicon Dioxidedecreases expression1
Tobacco Smoke Pollutionaffects expression1
Tretinoinincreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.