XG

gene
On this page

Summary

XG (Xg glycoprotein (Xg blood group), HGNC:12806) is a protein-coding gene on chromosome Xp22.33, encoding Glycoprotein Xg (P55808).

This gene encodes the XG blood group antigen, and is located at the pseudoautosomal boundary on the short (p) arm of chromosome X. The three 5’ exons reside in the pseudoautosomal region and the remaining exons within the X-specific end. A truncated copy of this gene is found on the Y chromosome at the pseudoautosomal boundary. It is transcribed, but not expected to make a Y-chromosome specific gene product. Alternatively spliced transcript variants encoding different isoforms have been found for this gene.

Source: NCBI Gene 7499 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 59 total — 5 pathogenic
  • MANE Select transcript: NM_001141919

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:12806
Approved symbolXG
NameXg glycoprotein (Xg blood group)
LocationXp22.33
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000124343
Ensembl biotypeprotein_coding
OMIM300879
Entrez7499

Gene structure

Transcript identifiers

Ensembl transcripts: 6 — 4 protein_coding, 2 retained_intron

ENST00000381174, ENST00000419513, ENST00000503018, ENST00000509484, ENST00000519244, ENST00000644266

RefSeq mRNA: 3 — MANE Select: NM_001141919 NM_001141919, NM_001141920, NM_175569

CCDS: CCDS14120, CCDS48073

Canonical transcript exons

ENST00000644266 — 11 exons

ExonStartEnd
ENSE0000084642027820662782128
ENSE0000084642127896442789706
ENSE0000131186028113362811452
ENSE0000148770528067012806745
ENSE0000168962028081852808220
ENSE0000178971927747162774739
ENSE0000355040927705502770591
ENSE0000356099027973102797360
ENSE0000360989527945352794603
ENSE0000382965128143642816500
ENSE0000390212427520402752335

Expression profiles

Bgee: expression breadth ubiquitous, 189 present calls, max score 97.12.

FANTOM5 (CAGE): breadth broad, TPM avg 3.2824 / max 313.4883, expressed in 247 samples.

FANTOM5 promoters (12 alternative TSS)

Promoter IDTPM avgSamples expressed
1953792.5902229
1953810.3078103
1953820.113760
1953780.079150
1953830.053222
1953800.050326
1953770.044521
2095850.038621
1953850.03638
1953860.01293

Top tissues by expression

251 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
calcaneal tendonUBERON:000370197.12gold quality
tendonUBERON:000004395.17gold quality
skin of hipUBERON:000155493.49gold quality
skin of legUBERON:000151193.08gold quality
zone of skinUBERON:000001492.35gold quality
skin of abdomenUBERON:000141691.90gold quality
upper leg skinUBERON:000426290.38gold quality
tendon of biceps brachiiUBERON:000818889.67gold quality
gingival epitheliumUBERON:000194987.28gold quality
layer of synovial tissueUBERON:000761686.69gold quality
secondary oocyteCL:000065586.46gold quality
gingivaUBERON:000182886.12gold quality
esophagus squamous epitheliumUBERON:000692085.63gold quality
synovial jointUBERON:000221784.65gold quality
subcutaneous adipose tissueUBERON:000219084.46gold quality
mammalian vulvaUBERON:000099783.99gold quality
ascending aortaUBERON:000149683.20gold quality
thoracic aortaUBERON:000151582.86gold quality
esophagus mucosaUBERON:000246981.57gold quality
aortaUBERON:000094780.93gold quality
adipose tissueUBERON:000101380.35gold quality
tibial arteryUBERON:000761079.74gold quality
popliteal arteryUBERON:000225079.71gold quality
vaginaUBERON:000099679.48gold quality
ectocervixUBERON:001224978.60gold quality
lower esophagus mucosaUBERON:003583478.28gold quality
descending thoracic aortaUBERON:000234578.23gold quality
buccal mucosa cellCL:000233678.07gold quality
right coronary arteryUBERON:000162577.92gold quality
upper arm skinUBERON:000426377.81silver quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes15.85

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

80 targeting XG, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6867-5P100.0082.213464
HSA-MIR-200B-3P100.0073.312693
HSA-MIR-200C-3P100.0073.352685
HSA-MIR-429100.0073.442698
HSA-MIR-6748-5P100.0065.811057
HSA-MIR-4262100.0073.263931
HSA-MIR-9-5P100.0072.282361
HSA-MIR-4425100.0067.591049
HSA-MIR-366299.9973.825684
HSA-MIR-513B-5P99.9969.962150
HSA-MIR-450099.9972.722367
HSA-MIR-485-3P99.9870.681585
HSA-MIR-539-3P99.9870.741616
HSA-MIR-433-3P99.9869.371203
HSA-MIR-50799.9770.111915
HSA-MIR-55799.9670.011640
HSA-MIR-548AJ-3P99.9673.385345
HSA-MIR-548X-3P99.9673.385345
HSA-MIR-590-3P99.9674.346478
HSA-MIR-767-5P99.9570.85993
HSA-MIR-548J-3P99.9472.614881
HSA-MIR-335-3P99.9373.364958
HSA-MIR-548AE-3P99.9372.664867
HSA-MIR-548AH-3P99.9372.544872
HSA-MIR-548AM-3P99.9372.544872
HSA-MIR-548AQ-3P99.9372.664867
HSA-MIR-314399.9371.963104
HSA-MIR-367199.9073.043897
HSA-MIR-6715A-3P99.8368.051473
HSA-MIR-442099.8270.081624

Literature-anchored findings (GeneRIF, showing 3)

  • Gene frequency in France is similar to that reported in predominantly white populations. (PMID:18789743)
  • The pseudoautosomal boundary on Xp22.33/Yp11.31 may harbor male-specific genetic variants for autism spectrum disorders. (PMID:24132906)
  • The present findings identify the genetic basis of the erythroid-specific Xg(a)/CD99 blood group phenotypes and reveal the molecular background of their formation. (PMID:30061310)

Cross-species orthologs

0 orthologs

Protein

Protein identifiers

Glycoprotein XgP55808 (reviewed: P55808)

Alternative names: Protein PBDX

All UniProt accessions (3): A0A2U3U020, E5RH28, P55808

UniProt curated annotations — full annotation on UniProt →

Subcellular location. Cell membrane.

Tissue specificity. Expressed in erythroid tissues, including thymus, bone marrow and fetal liver, and in several nonerythroid tissues, such as heart, placenta, skeletal muscle, thyroid and trachea, as well as in skin fibroblasts. Expression is low or undetectable in other tissues.

Post-translational modifications. O-glycosylated.

Polymorphism. XG is responsible for the Xg blood group system.

Miscellaneous. The gene coding for this protein is located in the pseudoautosomal region 1 (PAR1) of X and Y chromosomes.

Similarity. Belongs to the CD99 family.

Isoforms (3)

UniProt IDNamesCanonical?
P55808-11yes
P55808-22
P55808-33

RefSeq proteins (3): NP_001135391, NP_001135392, NP_780778 (=MANE)

Domains & families (InterPro)

IDNameType
IPR022078CD99L2Family

Pfam: PF12301

UniProt features (11 total): topological domain 2, splice variant 2, signal peptide 1, chain 1, sequence conflict 1, transmembrane region 1, region of interest 1, compositionally biased region 1, sequence variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P55808-F158.590.00

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 57 (showing top): GOBP_MYELOID_LEUKOCYTE_MIGRATION, GOBP_POSITIVE_REGULATION_OF_CELLULAR_EXTRAVASATION, GOBP_REGULATION_OF_LEUKOCYTE_MIGRATION, GOBP_CELL_CELL_ADHESION, GOBP_CELLULAR_EXTRAVASATION, GOBP_LEUKOCYTE_MIGRATION, GOBP_POSITIVE_REGULATION_OF_NEUTROPHIL_MIGRATION, GOBP_POSITIVE_REGULATION_OF_LEUKOCYTE_MIGRATION, GOBP_GRANULOCYTE_MIGRATION, GOBP_REGULATION_OF_CELLULAR_EXTRAVASATION, GOBP_T_CELL_MIGRATION, DODD_NASOPHARYNGEAL_CARCINOMA_UP, GOBP_HOMOTYPIC_CELL_CELL_ADHESION, GOBP_LYMPHOCYTE_MIGRATION, GOBP_REGULATION_OF_NEUTROPHIL_MIGRATION

GO Biological Process (3): homotypic cell-cell adhesion (GO:0034109), T cell extravasation (GO:0072683), positive regulation of neutrophil extravasation (GO:2000391)

GO Molecular Function (0):

GO Cellular Component (2): plasma membrane (GO:0005886), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cell-cell adhesion1
cellular extravasation1
T cell migration1
positive regulation of cellular extravasation1
neutrophil extravasation1
positive regulation of neutrophil migration1
regulation of neutrophil extravasation1
membrane1
cell periphery1
cellular anatomical structure1

Protein interactions and networks

STRING

78 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
XGCD99P14209860
XGARSDP51689767
XGARSFP54793765
XGARSLP51690750
XGSTSP08842649
XGCD99L2Q8TCZ2348
XGSMPDL3BQ92485235
XGSLC43A1O75387224
XGGYG2O15488220
XGBIVMQ86UB2204
XGGYPBP06028201
XGBAATQ14032199
XGGATA1P15976185
XGXKP51811182
XGLANCL3Q6ZV70181

IntAct

0 interactions, top by confidence:

ESM2 similar proteins: A1A4K1, B1H3G4, E9PV24, O35988, O61704, O75167, O93383, P14209, P14599, P15514, P24338, P31431, P31955, P34741, P34900, P34901, P43322, P43407, P49414, P49416, P50605, P55808, P58239, Q02297, Q0VFF9, Q1RMT9, Q27913, Q56A20, Q58DD4, Q5RAT9, Q5RCS3, Q5RE35, Q5REP3, Q5XG99, Q6DBW9, Q6GR51, Q6PKG0, Q6ZQ58, Q7SXB3, Q7TMJ8

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

59 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic5
Likely pathogenic0
Uncertain significance29
Likely benign2
Benign2

Top pathogenic / likely-pathogenic (5)

Variant IDHGVSClassification
1341989GRCh37/hg19 Xp22.33(chrX:940688-2676609)x3Pathogenic
1808677GRCh37/hg19 Xp22.33(chrX:201705-2696762)x3Pathogenic
686939GRCh37/hg19 Xp22.33(chrX:168546-2696762)x3Pathogenic
816242GRCh37/hg19 Xp22.33(chrX:168546-3265521)x0Pathogenic
988943GRCh37/hg19 Xp22.33-22.32(chrX:60000-4857212)x1Pathogenic

SpliceAI

1744 predictions. Top by Δscore:

VariantEffectΔscore
X:2782124:TGG:Tdonor_gain1.0000
X:2782129:G:GGdonor_gain1.0000
X:2794602:AGGTA:Adonor_loss1.0000
X:2794603:GGTA:Gdonor_loss1.0000
X:2794604:G:Cdonor_loss1.0000
X:2794605:T:Gdonor_loss1.0000
X:2811330:CTGCA:Cacceptor_loss1.0000
X:2811331:TGCA:Tacceptor_loss1.0000
X:2811332:GCAG:Gacceptor_loss1.0000
X:2811333:CAG:Cacceptor_loss1.0000
X:2811334:A:AGacceptor_gain1.0000
X:2811335:G:GGacceptor_gain1.0000
X:2811335:GGCA:Gacceptor_gain1.0000
X:2811450:ATGG:Adonor_loss1.0000
X:2811451:TGGT:Tdonor_loss1.0000
X:2811452:GGTA:Gdonor_loss1.0000
X:2811453:G:GGdonor_gain1.0000
X:2811453:GT:Gdonor_loss1.0000
X:2811454:T:TCdonor_loss1.0000
X:2752332:CGAG:Cdonor_loss0.9900
X:2752336:GT:Gdonor_loss0.9900
X:2752337:T:Adonor_loss0.9900
X:2782125:GGA:Gdonor_gain0.9900
X:2782126:G:GTdonor_gain0.9900
X:2789642:A:AGacceptor_gain0.9900
X:2789642:A:Tacceptor_loss0.9900
X:2789643:G:GGacceptor_gain0.9900
X:2789643:GAT:Gacceptor_gain0.9900
X:2789705:AGGTA:Adonor_loss0.9900
X:2789706:GGTAA:Gdonor_gain0.9900

AlphaMissense

1270 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
X:2811366:T:AI147N0.992
X:2770568:T:CL27S0.990
X:2811369:T:AV148E0.987
X:2811384:T:AV153E0.984
X:2811363:C:AP146H0.983
X:2770561:T:CF25L0.981
X:2770563:T:AF25L0.981
X:2770563:T:GF25L0.981
X:2811363:C:GP146R0.981
X:2811381:T:AV152E0.981
X:2811378:T:AV151E0.979
X:2770562:T:GF25C0.978
X:2811390:T:GL155R0.977
X:2811395:G:AG157R0.977
X:2811395:G:CG157R0.977
X:2811375:T:AV150E0.976
X:2811396:G:AG157E0.976
X:2811390:T:CL155P0.970
X:2811357:T:AV144E0.968
X:2811401:G:CA159P0.965
X:2811402:C:AA159E0.965
X:2770562:T:CF25S0.963
X:2770568:T:GL27W0.959
X:2770576:G:CA30P0.958
X:2811363:C:TP146L0.958
X:2811387:C:AT154K0.958
X:2811390:T:AL155Q0.957
X:2811354:T:AI143N0.954
X:2770580:T:AL31H0.952
X:2811387:C:GT154R0.952

dbSNP variants (sampled 300 via entrez): RS1000046962 (X:2779376 A>G), RS1000304912 (X:2800326 C>G), RS1000323278 (X:2789289 A>G,T), RS1000376513 (X:2801131 A>G), RS1000409160 (X:2800644 C>T), RS1000481837 (X:2754149 A>C), RS1000515877 (X:2759056 C>T), RS1000572700 (X:2764956 G>A), RS1000579569 (X:2759511 T>C), RS1000763021 (X:2753970 T>C,G), RS1000848678 (X:2775322 G>A,T), RS1001029901 (X:2809901 C>G), RS1001053807 (X:2780992 T>TG), RS1001120027 (X:2774720 C>A,T), RS1001166143 (X:2769130 G>A)

Disease associations

OMIM: gene MIM:300879 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

12 total (human), top 12 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arsenitedecreases expression2
tebuconazoledecreases expression1
CGP 52608affects binding, increases reaction1
entinostatincreases expression1
incobotulinumtoxinAincreases expression1
Benzo(a)pyrenedecreases expression1
Dexamethasoneincreases expression1
Nickeldecreases expression1
Tobacco Smoke Pollutiondecreases expression1
Antirheumatic Agentsincreases expression1
Okadaic Aciddecreases expression1
Lactic Aciddecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.