VSIG1

gene
On this page

Also known as MGC44287GPA34

Summary

VSIG1 (V-set and immunoglobulin domain containing 1, HGNC:28675) is a protein-coding gene on chromosome Xq22.3, encoding V-set and immunoglobulin domain-containing protein 1 (Q86XK7).

This gene encodes a member of the junctional adhesion molecule (JAM) family. The encoded protein contains multiple glycosylation sites at the N-terminal region, and multiple phosphorylation sites and glutamic acid/proline (EP) repeats at the C-terminal region. The gene is expressed in normal stomach and testis, as well as in gastric, esophageal and ovarian cancers. Alternatively spliced transcript variants encoding different isoforms have been found for this gene.

Source: NCBI Gene 340547 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 70 total
  • MANE Select transcript: NM_182607

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:28675
Approved symbolVSIG1
NameV-set and immunoglobulin domain containing 1
LocationXq22.3
Locus typegene with protein product
StatusApproved
AliasesMGC44287, GPA34
Ensembl geneENSG00000101842
Ensembl biotypeprotein_coding
OMIM300620
Entrez340547

Gene structure

Transcript identifiers

Ensembl transcripts: 5 — 3 protein_coding, 1 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000217957, ENST00000415430, ENST00000458383, ENST00000479635, ENST00000485533

RefSeq mRNA: 2 — MANE Select: NM_182607 NM_001170553, NM_182607

CCDS: CCDS14535, CCDS55474

Canonical transcript exons

ENST00000217957 — 7 exons

ExonStartEnd
ENSE00000866544108058038108058201
ENSE00001661051108076077108076218
ENSE00001698717108045065108045179
ENSE00001765589108073250108073369
ENSE00001770932108077048108079184
ENSE00003519067108066936108067134
ENSE00003621426108072677108072832

Expression profiles

Bgee: expression breadth ubiquitous, 147 present calls, max score 93.65.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.8197 / max 503.0153, expressed in 24 samples.

FANTOM5 promoters (6 alternative TSS)

Promoter IDTPM avgSamples expressed
1971980.570320
1971950.08698
1971990.060810
1971970.04758
1971960.03278
1971940.02158

Top tissues by expression

241 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
pylorusUBERON:000116693.65gold quality
adult organismUBERON:000702392.99gold quality
right testisUBERON:000453490.16gold quality
left testisUBERON:000453388.46gold quality
epithelial cell of pancreasCL:000008387.04gold quality
testisUBERON:000047386.77gold quality
stomachUBERON:000094586.59gold quality
body of stomachUBERON:000116185.82gold quality
cardia of stomachUBERON:000116282.78gold quality
pancreatic ductal cellCL:000207982.68silver quality
gall bladderUBERON:000211082.38gold quality
mucosa of stomachUBERON:000119978.04gold quality
fundus of stomachUBERON:000116076.74gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047374.70gold quality
granulocyteCL:000009467.48gold quality
bloodUBERON:000017865.55gold quality
adenohypophysisUBERON:000219664.76gold quality
pituitary glandUBERON:000000764.43gold quality
vermiform appendixUBERON:000115463.91gold quality
lymph nodeUBERON:000002963.33gold quality
islet of LangerhansUBERON:000000661.91gold quality
duodenumUBERON:000211459.05gold quality
caecumUBERON:000115358.76gold quality
tibialis anteriorUBERON:000138558.44silver quality
ileal mucosaUBERON:000033157.81gold quality
bone marrow cellCL:000209257.31silver quality
urinary bladderUBERON:000125556.79gold quality
deltoidUBERON:000147656.31gold quality
colonic epitheliumUBERON:000039755.95silver quality
prefrontal cortexUBERON:000045155.59gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.41
E-GEOD-124858no1.71

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

115 targeting VSIG1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6748-5P100.0065.811057
HSA-MIR-1252-5P100.0069.802774
HSA-MIR-4533100.0069.482758
HSA-MIR-4692100.0067.322066
HSA-MIR-3646100.0073.565283
HSA-MIR-5692A100.0074.406850
HSA-MIR-366299.9973.825684
HSA-MIR-453199.9969.703181
HSA-MIR-318599.9968.121959
HSA-MIR-451499.9967.101870
HSA-MIR-23B-5P99.9866.07587
HSA-MIR-3692-3P99.9870.272139
HSA-MIR-806899.9873.852376
HSA-MIR-480399.9871.993117
HSA-MIR-807599.9767.20962
HSA-MIR-4725-3P99.9669.532520
HSA-MIR-6780B-5P99.9669.602562
HSA-MIR-6778-3P99.9667.292693
HSA-MIR-55999.9572.283609
HSA-MIR-548AB99.9571.313488
HSA-MIR-23A-5P99.9465.39468
HSA-MIR-548A-5P99.9471.273482
HSA-MIR-548AD-5P99.9471.233502
HSA-MIR-548AE-5P99.9471.233502
HSA-MIR-548AK99.9471.243488
HSA-MIR-548AM-5P99.9471.243488
HSA-MIR-548AP-5P99.9471.143489
HSA-MIR-548AQ-5P99.9471.343426
HSA-MIR-548AR-5P99.9471.283515
HSA-MIR-548AS-5P99.9471.223482

Literature-anchored findings (GeneRIF, showing 5)

  • expressed, both at the mRNA and protein level, in gastric, esophageal, and ovarian cancers, while expression in normal tissues was restricted to testis and stomach (PMID:16405301)
  • VSIG1 expression was significantly reduced at both the mRNA and protein levels in gastric cancer tissues. loss of VSIG1 expression was significantly correlated with poor overall survival and disease-free survival in gastric cancer patients. (PMID:22095633)
  • Study showed that VSIG1 is expressed tissue specifically in both non-cancerous and cancerous tissues, and that VSIG1 has suppressive functions in proliferation, migration, and invasion in cancer cells. (PMID:28603843)
  • Interaction between cadherins, vimentin, and V-set and immunoglobulin domain containing 1 in gastric-type hepatocellular carcinoma. (PMID:34170400)
  • V-set and immunoglobulin domain containing 1 (VSIG1) as an emerging target for epithelial-mesenchymal transition of gastric cancer. (PMID:36171238)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusVsig1ENSMUSG00000031430
rattus_norvegicusVsig1ENSRNOG00000054219

Paralogs (14): VSIG2 (ENSG00000019102), VSIR (ENSG00000107738), GPA33 (ENSG00000143167), IGSF11 (ENSG00000144847), ESAM (ENSG00000149564), CXADR (ENSG00000154639), JAM2 (ENSG00000154721), F11R (ENSG00000158769), MXRA8 (ENSG00000162576), JAM3 (ENSG00000166086), CLMP (ENSG00000166250), MUC15 (ENSG00000169550), VSTM2B (ENSG00000187135), VSIG8 (ENSG00000243284)

Protein

Protein identifiers

V-set and immunoglobulin domain-containing protein 1Q86XK7 (reviewed: Q86XK7)

Alternative names: Cell surface A33 antigen, Glycoprotein A34

All UniProt accessions (2): Q86XK7, C9JY48

UniProt curated annotations — full annotation on UniProt →

Subcellular location. Membrane.

Tissue specificity. Detected only in stomach mucosa and testis, and to a much lesser level in pancreas (at protein level). Detected in gastric cancers (31%), esophageal carcinomas (50%) and ovarian cancers (23%).

Post-translational modifications. Highly N-glycosylated. Appears not to contain significant amounts of O-linked carbohydrates or sialic acid in its sugar moieties.

Isoforms (2)

UniProt IDNamesCanonical?
Q86XK7-11yes
Q86XK7-22

RefSeq proteins (2): NP_001164024, NP_872413* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000920Myelin_P0-relFamily
IPR003598Ig_sub2Domain
IPR003599Ig_subDomain
IPR007110Ig-like_domDomain
IPR013106Ig_V-setDomain
IPR013783Ig-like_foldHomologous_superfamily
IPR029861VSIG1Family
IPR036179Ig-like_dom_sfHomologous_superfamily

Pfam: PF07686, PF13927

UniProt features (21 total): glycosylation site 5, compositionally biased region 4, disulfide bond 2, topological domain 2, domain 2, signal peptide 1, chain 1, splice variant 1, sequence variant 1, transmembrane region 1, region of interest 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q86XK7-F175.440.44

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Disulfide bonds (2): 43–116, 161–211

Glycosylation sites (5): 32, 38, 133, 200, 219

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 96 (showing top): GOBP_DIGESTION, GOBP_EPITHELIUM_DEVELOPMENT, GOBP_EPITHELIAL_CELL_DEVELOPMENT, HNF1_Q6, EFC_Q6, LEE_NAIVE_T_LYMPHOCYTE, GOBP_MAINTENANCE_OF_GASTROINTESTINAL_EPITHELIUM, GOBP_DIGESTIVE_SYSTEM_PROCESS, HNF1_C, GOBP_MULTICELLULAR_ORGANISMAL_LEVEL_HOMEOSTASIS, POU3F2_02, GOBP_EPITHELIAL_CELL_MORPHOGENESIS, HNF1_01, GOBP_EPITHELIAL_STRUCTURE_MAINTENANCE, SOX5_01

GO Biological Process (2): epithelial cell morphogenesis (GO:0003382), maintenance of gastrointestinal epithelium (GO:0030277)

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (3): basolateral plasma membrane (GO:0016323), plasma membrane (GO:0005886), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cell morphogenesis1
epithelial cell development1
epithelial structure maintenance1
digestive system process1
binding1
basal plasma membrane1
plasma membrane region1
membrane1
cell periphery1
cellular anatomical structure1

Protein interactions and networks

STRING

492 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
VSIG1CTSEP14091515
VSIG1CST9LQ9H4G1488
VSIG1ANXA10Q9UJ72479
VSIG1CLDN18P56856461
VSIG1IGSF11Q5DX21454
VSIG1IZUMO1Q8IYV9438
VSIG1CACNG4Q9UBN1434
VSIG1GJB5O95377434
VSIG1GKN1Q9NS71416
VSIG1TEX13BQ9BXU2394
VSIG1LGALS4P56470388
VSIG1SLC4A5Q9BY07384
VSIG1VPS35Q96QK1383
VSIG1MAPK15Q8TD08374
VSIG1FATE1Q969F0372

IntAct

19 interactions, top by confidence:

ABTypeScore
VSIG1TTI1psi-mi:“MI:0914”(association)0.640
EMP3VSIG1psi-mi:“MI:0915”(physical association)0.560
UPK1BVSIG1psi-mi:“MI:0915”(physical association)0.560
VSIG1NINJ2psi-mi:“MI:0915”(physical association)0.560
HGFACVSIG1psi-mi:“MI:0915”(physical association)0.560
VSIG1TNPO2psi-mi:“MI:0914”(association)0.530
VSIG1H1-5psi-mi:“MI:0915”(physical association)0.400
VSIG1H1-2psi-mi:“MI:0915”(physical association)0.400
VSIG1H1-1psi-mi:“MI:0915”(physical association)0.400
VSIG1RIMOC1psi-mi:“MI:0914”(association)0.350
EMP3VSIG1psi-mi:“MI:0915”(physical association)0.000
UPK1BVSIG1psi-mi:“MI:0915”(physical association)0.000
HGFACVSIG1psi-mi:“MI:0915”(physical association)0.000
NINJ2VSIG1psi-mi:“MI:0915”(physical association)0.000

BioGRID (115): UBE4A (Affinity Capture-MS), CDS1 (Affinity Capture-MS), ATP2A3 (Affinity Capture-MS), IP6K1 (Affinity Capture-MS), RRP1 (Affinity Capture-MS), SLC25A17 (Affinity Capture-MS), DEPDC1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-MS), EXOG (Affinity Capture-MS), TTI1 (Affinity Capture-MS), FANCL (Affinity Capture-MS), TMEM160 (Affinity Capture-MS), DAGLB (Affinity Capture-MS), NF1 (Affinity Capture-MS), C4A (Affinity Capture-MS)

ESM2 similar proteins: A0JM41, A2VD98, A6QQC6, A8MVW5, B0CLX4, B6ZK76, B6ZK77, O60487, O70255, O88324, O88775, O95976, P01832, P03228, P06907, P08920, P08921, P09619, P0C6B7, P0C6N0, P0CW72, P10522, P20938, P21995, P25189, P27573, P37301, P37998, P59823, P59824, P86176, Q01151, Q4VAH7, Q5EAB0, Q5R804, Q640U3, Q6PCB8, Q6WEB5, Q80UL9, Q86XK7

Diamond homologs: A0JM41, A2VD98, A5D7C3, O60487, O60939, O70255, O95297, P06907, P10522, P20938, P25189, P27573, P37301, P54900, Q29RR6, Q32PI9, Q3TEW6, Q3V3F6, Q4KLY3, Q5EAB0, Q5R804, Q6AYT8, Q6UWV2, Q6WEB5, Q7M729, Q7M730, Q86XK7, Q8AVM3, Q8IWT1, Q9D2J4, Q9PWR4, Q08E08, P78310, P97792, Q5R764, Q864L3, Q8WMV3, Q9R066, A2AJ76, D3YXG0

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

70 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance25
Likely benign11
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

1067 predictions. Top by Δscore:

VariantEffectΔscore
X:108058037:GGT:Gacceptor_gain1.0000
X:108058197:TTTCT:Tdonor_gain1.0000
X:108058198:TTCT:Tdonor_gain1.0000
X:108058199:TCT:Tdonor_gain1.0000
X:108058199:TCTGT:Tdonor_loss1.0000
X:108058200:CT:Cdonor_gain1.0000
X:108058200:CTGT:Cdonor_loss1.0000
X:108058201:TG:Tdonor_loss1.0000
X:108058202:G:GGdonor_gain1.0000
X:108058203:T:Gdonor_loss1.0000
X:108058207:G:GTdonor_gain1.0000
X:108066935:GATTT:Gacceptor_gain1.0000
X:108072675:A:AGacceptor_gain1.0000
X:108072675:A:Tacceptor_loss1.0000
X:108072675:AGT:Aacceptor_gain1.0000
X:108072676:G:GGacceptor_gain1.0000
X:108072676:GT:Gacceptor_gain1.0000
X:108072676:GTG:Gacceptor_gain1.0000
X:108072676:GTGA:Gacceptor_gain1.0000
X:108072800:G:Tdonor_gain1.0000
X:108072828:CTTCA:Cdonor_gain1.0000
X:108072829:TTCA:Tdonor_gain1.0000
X:108072833:G:GGdonor_gain1.0000
X:108073249:GACCC:Gacceptor_gain1.0000
X:108073366:TCACG:Tdonor_loss1.0000
X:108073367:CACG:Cdonor_loss1.0000
X:108073369:CG:Cdonor_loss1.0000
X:108073370:G:GGdonor_gain1.0000
X:108073370:GTGAG:Gdonor_loss1.0000
X:108073371:T:TCdonor_loss1.0000

AlphaMissense

2528 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
X:108058165:G:CW59C0.999
X:108058165:G:TW59C0.999
X:108067062:T:GY114D0.999
X:108067068:T:AC116S0.999
X:108067069:G:AC116Y0.999
X:108067069:G:CC116S0.999
X:108072745:T:CC161R0.999
X:108072747:T:GC161W0.999
X:108072786:G:CW174C0.999
X:108072786:G:TW174C0.999
X:108073312:T:AC211S0.999
X:108073312:T:CC211R0.999
X:108073313:G:CC211S0.999
X:108073326:C:AN215K0.999
X:108073326:C:GN215K0.999
X:108058115:T:CC43R0.998
X:108058116:G:AC43Y0.998
X:108058117:C:GC43W0.998
X:108058163:T:AW59R0.998
X:108058163:T:CW59R0.998
X:108067068:T:CC116R0.998
X:108067070:C:GC116W0.998
X:108072697:T:AC145S0.998
X:108072698:G:CC145S0.998
X:108072745:T:AC161S0.998
X:108072746:G:AC161Y0.998
X:108072746:G:CC161S0.998
X:108073261:G:TG194W0.998
X:108073314:T:GC211W0.998
X:108066988:G:CR89P0.997

dbSNP variants (sampled 300 via entrez): RS1000006215 (X:108063458 T>C), RS1000202294 (X:108046025 A>G), RS1000251717 (X:108028199 G>A), RS1000260646 (X:108036822 G>A), RS1000269279 (X:108034321 C>G,T), RS1000376902 (X:108056132 T>C), RS1000478373 (X:108023682 A>C), RS1000522224 (X:108057620 T>C), RS1000578919 (X:108034898 G>A), RS1000584395 (X:108068449 T>A), RS1000629304 (X:108046051 G>A,T), RS1000658865 (X:108046394 A>G), RS1000679342 (X:108020699 T>C), RS1000716255 (X:108065444 C>A,T), RS1000743274 (X:108045592 T>G)

Disease associations

OMIM: gene MIM:300620 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

26 total (human), top 26 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneincreases expression, increases methylation2
Copperaffects cotreatment, decreases expression, affects binding2
aristolochic acid Idecreases expression1
methyleugenolincreases expression1
propionaldehydedecreases expression1
bisphenol Aincreases methylation, affects cotreatment1
trichostatin Adecreases expression1
butyraldehydedecreases expression1
CGP 52608affects binding, increases reaction1
nutlin 3affects cotreatment, increases expression1
abrineincreases expression1
bisphenol Sdecreases expression1
jinfukangaffects cotreatment, increases expression1
NSC 689534affects binding, decreases expression1
Resveratrolaffects cotreatment, decreases expression1
Fulvestrantaffects cotreatment, increases methylation1
Camptothecinincreases expression1
Cisplatinaffects cotreatment, increases expression1
Dactinomycinaffects cotreatment, increases expression1
N-Nitrosopyrrolidineincreases expression1
Silicon Dioxidedecreases expression1
Smokeincreases expression1
Tobacco Smoke Pollutionincreases expression1
Valproic Aciddecreases expression1
Aflatoxin B1increases expression1
Okadaic Acidincreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.