SAGE1

gene
On this page

Also known as SAGECT14

Summary

SAGE1 (sarcoma antigen 1, HGNC:30369) is a protein-coding gene on chromosome Xq26.3, encoding Sarcoma antigen 1 (Q9NXZ1).

This gene belongs to a class of genes that are activated in tumors. These genes are expressed in tumors of different histologic types but not in normal tissues, except for spermatogenic cells and, for some, placenta. The proteins encoded by these genes appear to be strictly tumor specific, and hence may be excellent sources of antigens for cancer immunotherapy. This gene is expressed in sarcomas.

Source: NCBI Gene 55511 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 131 total — 1 pathogenic
  • MANE Select transcript: NM_001381902

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:30369
Approved symbolSAGE1
Namesarcoma antigen 1
LocationXq26.3
Locus typegene with protein product
StatusApproved
AliasesSAGE, CT14
Ensembl geneENSG00000181433
Ensembl biotypeprotein_coding
OMIM300359
Entrez55511

Gene structure

Transcript identifiers

Ensembl transcripts: 2 — 2 protein_coding

ENST00000324447, ENST00000370709

RefSeq mRNA: 2 — MANE Select: NM_001381902 NM_001381902, NM_018666

CCDS: CCDS14652

Canonical transcript exons

ENST00000370709 — 20 exons

ExonStartEnd
ENSE00001266175135911579135911953
ENSE00001266222135907701135907841
ENSE00001266238135906411135906551
ENSE00001266252135905252135905392
ENSE00001266261135904477135904569
ENSE00001597111135906024135906164
ENSE00001618285135908477135908617
ENSE00001630680135906926135907066
ENSE00001659041135911192135911332
ENSE00001678117135908864135909004
ENSE00001717583135901559135901691
ENSE00001722075135912321135912414
ENSE00001725800135910030135910170
ENSE00001747641135907313135907453
ENSE00001763122135908089135908229
ENSE00001767284135910415135910555
ENSE00001783969135909639135909779
ENSE00001913147135912798135913062
ENSE00003473089135896243135896329
ENSE00003919560135893716135893781

Expression profiles

Bgee: expression breadth broad, 39 present calls, max score 86.68.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.4656 / max 75.9113, expressed in 77 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
1976430.448576
1976420.01708

Top tissues by expression

245 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047386.68gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099179.26gold quality
hair follicleUBERON:000207371.68gold quality
right testisUBERON:000453466.40gold quality
testisUBERON:000047365.12gold quality
left testisUBERON:000453364.64gold quality
kidney epitheliumUBERON:000481964.00gold quality
diaphragmUBERON:000110363.59gold quality
nephron tubuleUBERON:000123162.17gold quality
metanephric glomerulusUBERON:000473662.02gold quality
cervix squamous epitheliumUBERON:000692261.79gold quality
tongue squamous epitheliumUBERON:000691961.29gold quality
mucosa of paranasal sinusUBERON:000503058.27gold quality
buccal mucosa cellCL:000233657.67silver quality
endothelial cellCL:000011557.63gold quality
deciduaUBERON:000245056.93gold quality
epithelium of nasopharynxUBERON:000195156.38gold quality
vastus lateralisUBERON:000137955.47gold quality
vena cavaUBERON:000408755.28gold quality
quadriceps femorisUBERON:000137754.70gold quality
lower lobe of lungUBERON:000894952.01silver quality
cervix epitheliumUBERON:000480151.75gold quality
left ventricle myocardiumUBERON:000656651.50gold quality
oviduct epitheliumUBERON:000480449.99gold quality
Brodmann (1909) area 46UBERON:000648349.95gold quality
myocardiumUBERON:000234949.49gold quality
cardiac muscle of right atriumUBERON:000337949.24gold quality
olfactory bulbUBERON:000226448.92gold quality
type B pancreatic cellCL:000016948.83gold quality
epithelial cell of pancreasCL:000008348.66gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes4.09

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

4 targeting SAGE1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6515-3P99.8268.191933
HSA-MIR-129999.7771.242389
HSA-MIR-320299.6667.702737
HSA-MIR-4756-3P99.6266.301319

Literature-anchored findings (GeneRIF, showing 4)

  • MAGE-A4(143-151) and SAGE(715-723) are HLA-A2402-restricted CTL epitopes (PMID:16061876)
  • ADAM2, CALR3 and SAGE1 cancer/testis antigens are not promising targets for immunotherapy of breast and lung cancer. (PMID:26252478)
  • MAGE-A4, NY-ESO-1 and SAGE mRNA expression rates and co-expression relationships in solid tumours. (PMID:32600281)
  • SAGE1: a Potential Target Antigen for Lung Cancer T-Cell Immunotherapy. (PMID:34465596)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
drosophila_melanogasterIntS6FBGN0261383
caenorhabditis_elegansWBGENE00000994

Paralogs (11): INTS6 (ENSG00000102786), INTS6L (ENSG00000165359), CT45A5 (ENSG00000228836), CT45A1 (ENSG00000268940), CT45A3 (ENSG00000269096), CT45A10 (ENSG00000269586), CT45A9 (ENSG00000270946), CT45A2 (ENSG00000271449), CT45A7 (ENSG00000273696), CT45A8 (ENSG00000278085), CT45A6 (ENSG00000278289)

Protein

Protein identifiers

Sarcoma antigen 1Q9NXZ1 (reviewed: Q9NXZ1)

Alternative names: Cancer/testis antigen 14

All UniProt accessions (1): Q9NXZ1

UniProt curated annotations — full annotation on UniProt →

Subcellular location. Nucleus.

Tissue specificity. Expressed mainly in bladder, lung, head and neck carcinomas. Not expressed in normal tissues except for testis.

RefSeq proteins (2): NP_001368831, NP_061136 (=MANE)

Domains & families (InterPro)

IDNameType
IPR029307INT_SG_DDX_CT_CDomain
IPR051113Integrator_subunit6Family

Pfam: PF15300

UniProt features (9 total): modified residue 3, sequence variant 2, chain 1, region of interest 1, compositionally biased region 1, cross-link 1

Structure

Experimental structures (PDB)

1 structures.

PDBMethodResolution (Å)
8HPPX-RAY DIFFRACTION3

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9NXZ1-F151.150.03

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (4): 45, 64, 238, 778

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 37 (showing top): GOBP_SNRNA_PROCESSING, MODULE_95, GOCC_NUCLEAR_BODY, GOCC_INTEGRATOR_COMPLEX, chrXq26, PURBEY_TARGETS_OF_CTBP1_NOT_SATB1_DN, MODULE_163, GOBP_SNRNA_METABOLIC_PROCESS, CHAMP1_TARGET_GENES, GSE15659_NAIVE_CD4_TCELL_VS_RESTING_TREG_DN, GSE15659_NAIVE_CD4_TCELL_VS_NONSUPPRESSIVE_TCELL_DN, GSE15659_NAIVE_CD4_TCELL_VS_ACTIVATED_TREG_DN, GSE15659_RESTING_TREG_VS_NONSUPPRESSIVE_TCELL_DN, GSE15659_RESTING_VS_ACTIVATED_TREG_DN, GSE15659_NONSUPPRESSIVE_TCELL_VS_ACTIVATED_TREG_DN

GO Biological Process (1): snRNA 3’-end processing (GO:0034472)

GO Molecular Function (0):

GO Cellular Component (4): nucleus (GO:0005634), nucleoplasm (GO:0005654), nuclear body (GO:0016604), integrator complex (GO:0032039)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
snRNA processing1
RNA 3’-end processing1
intracellular membrane-bounded organelle1
nuclear lumen1
cellular anatomical structure1
nucleoplasm1
intracellular membraneless organelle1
nuclear protein-containing complex1

Protein interactions and networks

STRING

516 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SAGE1DDX43Q9NXZ2869
SAGE1SPA17Q15506815
SAGE1MAGEC2Q9UBF1773
SAGE1PRSS50Q9UI38771
SAGE1PAGE5Q96GU1720
SAGE1GAGE4P0DSO3714
SAGE1ADAM2P78326690
SAGE1MAGEA1P43355655
SAGE1MAGEC1O60732625
SAGE1LIPIQ6XZB0606
SAGE1ACRBPQ8NEB7595
SAGE1PER2O15055584
SAGE1BMAL1O00327543
SAGE1NXF2BQ9GZY0524
SAGE1PABIR3Q6P4D5517

IntAct

4 interactions, top by confidence:

ABTypeScore
H2BC21SMCHD1psi-mi:“MI:0914”(association)0.350
TP53BP1PSMD14psi-mi:“MI:2364”(proximity)0.270
BRCA1SMCHD1psi-mi:“MI:2364”(proximity)0.270

BioGRID (7): SAGE1 (Proximity Label-MS), SAGE1 (Proximity Label-MS), SAGE1 (Affinity Capture-MS), SAGE1 (Affinity Capture-MS), SAGE1 (Proximity Label-MS), SAGE1 (Proximity Label-MS), SAGE1 (Cross-Linking-MS (XL-MS))

ESM2 similar proteins: A0A087WUL8, A0A0J9YWL9, A0A0J9YY54, A0A0U1RQI7, A6NJ88, A6QL64, B4DH59, D3ZVV1, E9Q6E9, F1LWT0, O04492, O88799, P0DKJ7, P0DKJ8, P0DKL2, P0DPF3, P18583, P53353, Q08AG5, Q0P6D6, Q2EG98, Q3BBV2, Q4ZJZ1, Q5HY64, Q5JPF3, Q5QGU6, Q5TAG4, Q5TI25, Q5XHX6, Q6P3W6, Q6P902, Q6XPR3, Q6ZQX7, Q86T75, Q86VE3, Q86VQ3, Q8N2N9, Q8N660, Q8N693, Q96EQ9

Diamond homologs: A6NJ88, P0DMU7, P0DMU8, P0DMU9, P0DMV0, P0DMV1, P0DMV2, Q2TAF4, Q5DJT8, Q5HYN5, Q5JSJ4, Q5U4W6, Q6PCM2, Q7SYD9, Q8BND4, Q8NHU0, Q9NXZ1, Q9UL03, Q54Z23, Q9W485

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

131 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance80
Likely benign27
Benign10

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
1703579GRCh37/hg19 Xp22.33-q28(chrX:1-155270560)Pathogenic

SpliceAI

3162 predictions. Top by Δscore:

VariantEffectΔscore
X:135904588:G:GTdonor_gain1.0000
X:135904599:A:AGdonor_gain1.0000
X:135904603:G:GGdonor_gain1.0000
X:135904618:A:AGdonor_gain1.0000
X:135904618:A:Gdonor_gain1.0000
X:135905359:GCTG:Gdonor_gain1.0000
X:135905374:A:AGdonor_gain1.0000
X:135905374:A:Gdonor_gain1.0000
X:135905385:GGATC:Gdonor_gain1.0000
X:135905386:GATC:Gdonor_gain1.0000
X:135905391:GC:Gdonor_gain1.0000
X:135905393:G:GGdonor_gain1.0000
X:135905764:C:CGdonor_gain1.0000
X:135905764:C:Gdonor_gain1.0000
X:135905775:GGA:Gdonor_gain1.0000
X:135906165:G:GGdonor_gain1.0000
X:135906401:T:Aacceptor_gain1.0000
X:135906402:G:Aacceptor_gain1.0000
X:135906409:A:AGacceptor_gain1.0000
X:135906409:AGAT:Aacceptor_gain1.0000
X:135906410:G:Aacceptor_loss1.0000
X:135906410:G:GAacceptor_gain1.0000
X:135906410:GA:Gacceptor_gain1.0000
X:135906410:GAT:Gacceptor_gain1.0000
X:135906410:GATG:Gacceptor_gain1.0000
X:135906410:GATGC:Gacceptor_gain1.0000
X:135906547:TCCAT:Tdonor_gain1.0000
X:135906549:CAT:Cdonor_gain1.0000
X:135906549:CATGT:Cdonor_loss1.0000
X:135906550:AT:Adonor_gain1.0000

AlphaMissense

6024 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
X:135912392:T:CF865L0.992
X:135912394:T:AF865L0.992
X:135912394:T:GF865L0.992
X:135912827:T:CL882P0.990
X:135912407:G:CA870P0.989
X:135911924:T:CL831S0.988
X:135912799:T:CF873L0.983
X:135912801:T:AF873L0.983
X:135912801:T:GF873L0.983
X:135912387:T:AV863D0.982
X:135912393:T:CF865S0.981
X:135911944:T:CF838L0.980
X:135911946:T:AF838L0.980
X:135911946:T:GF838L0.980
X:135912383:T:CF862L0.979
X:135912385:T:AF862L0.979
X:135912385:T:GF862L0.979
X:135912384:T:CF862S0.972
X:135912827:T:AL882H0.970
X:135912335:T:CF846L0.967
X:135912337:C:AF846L0.967
X:135912337:C:GF846L0.967
X:135912803:A:TK874I0.965
X:135912839:T:CL886P0.965
X:135911916:A:CK828N0.959
X:135911916:A:TK828N0.959
X:135912333:T:CI845T0.958
X:135912815:T:CL878S0.958
X:135912333:T:GI845S0.956
X:135912345:T:CL849P0.955

dbSNP variants (sampled 300 via entrez): RS1000302091 (X:135892862 C>T), RS1001478751 (X:135892319 C>A), RS1001845807 (X:135892059 G>A), RS1002444008 (X:135893572 C>G), RS1006139820 (X:135912205 G>A), RS1007267485 (X:135910585 A>G), RS1007706268 (X:135911027 C>T), RS1008973105 (X:135893140 T>C), RS1009355060 (X:135892865 G>A,T), RS1010058538 (X:135892407 G>A,C,T), RS1010278238 (X:135892141 C>G), RS1012296593 (X:135911824 C>G,T), RS1017260853 (X:135910652 T>C,G), RS1018969993 (X:135893141 G>A), RS1020072444 (X:135892438 C>T)

Disease associations

OMIM: gene MIM:300359 | disease phenotypes: MIM:400045

GenCC curated gene-disease

Mondo (2): trisomy X (MONDO:0018066), 46,XX sex reversal 1 (MONDO:0100250)

Orphanet (1): Trisomy X syndrome (Orphanet:3375)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

MeSH disease descriptors (1)

DescriptorNameTree numbers
C535318Triple X syndrome (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

9 total (human), top 9 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arseniteaffects splicing, decreases expression1
benzo(e)pyrenedecreases methylation1
Acetaminophenaffects expression1
Benzo(a)pyreneincreases methylation1
Methapyrilenedecreases methylation1
Silicon Dioxideincreases expression1
Cyclosporinedecreases expression1
Aflatoxin B1decreases methylation1
Okadaic Aciddecreases expression1

Clinical trials (associated diseases)

2 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT01678261Not specifiedCOMPLETEDX-chromosome Inactivation, Epigenetics and the Transcriptome
NCT03396562Not specifiedRECRUITINGThe eXtroardinarY Babies Study: Natural History of Health and Neurodevelopment in Infants and Young Children With Sex Chromosome Trisomy
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): 46,XX sex reversal 1, trisomy X