GSG1

gene

Also known as MGC3146

Summary

GSG1 (germ cell associated 1, HGNC:19716) is a protein-coding gene on chromosome 12p13.1, encoding Germ cell-specific gene 1 protein (Q2KHT4). May cause the redistribution of PAPOLB from the cytosol to the endoplasmic reticulum.

Predicted to enable RNA polymerase binding activity. Predicted to be located in endoplasmic reticulum membrane. Predicted to be active in plasma membrane.

Source: NCBI Gene 83445 — RefSeq curated summary.

At a glance

  • GWAS associations: 1
  • Clinical variants (ClinVar): 69 total
  • MANE Select transcript: NM_001080555

Identifiers

Gene identifiers

Field | Value
HGNC ID | HGNC:19716
Approved symbol | GSG1
Name | germ cell associated 1
Location | 12p13.1
Locus type | gene with protein product
Status | Approved
Aliases | MGC3146
Ensembl gene | ENSG00000111305
Ensembl biotype | protein_coding
Entrez | 83445

Gene structure

Transcript identifiers

Ensembl transcripts: 10 — 10 protein_coding

ENST00000337630, ENST00000351606, ENST00000396302, ENST00000432710, ENST00000457134, ENST00000537302, ENST00000542415, ENST00000545401, ENST00000545699, ENST00000651961

RefSeq mRNA: 15 (MANE Select: NM_001080555)

NM_001080554, NM_001080555, NM_001206842, NM_001206843, NM_001206845, NM_001367358, NM_001367359, NM_001367360, NM_001367361, NM_001367362, NM_001367363, NM_001367364, NM_001368007, NM_031289, NM_153823

CCDS: CCDS44835, CCDS44836, CCDS55806, CCDS55807, CCDS55808, CCDS8659, CCDS91657

Canonical transcript exons

ENST00000651961 — 7 exons

Exon | Start | End
ENSE00001095868 | 13089208 | 13089276
ENSE00001299582 | 13088862 | 13088909
ENSE00001683055 | 13103465 | 13103667
ENSE00001839096 | 13083532 | 13085243
ENSE00003471732 | 13090503 | 13090818
ENSE00003520474 | 13087152 | 13087263
ENSE00003674036 | 13087907 | 13088059
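
The exon coordinates above can be turned into per-exon lengths and a total exonic length with simple arithmetic. The following is a minimal sketch in Python, assuming the Start and End values are 1-based, inclusive genomic coordinates (the usual Ensembl convention, not stated on this page); the variable names are illustrative only.

```python
# Exon coordinates copied from the table above (ENST00000651961).
# Assumption: Start/End are 1-based and inclusive, so length = end - start + 1.
exons = {
    "ENSE00001095868": (13089208, 13089276),
    "ENSE00001299582": (13088862, 13088909),
    "ENSE00001683055": (13103465, 13103667),
    "ENSE00001839096": (13083532, 13085243),
    "ENSE00003471732": (13090503, 13090818),
    "ENSE00003520474": (13087152, 13087263),
    "ENSE00003674036": (13087907, 13088059),
}

# Length of each exon in base pairs.
lengths = {exon_id: end - start + 1 for exon_id, (start, end) in exons.items()}

# Print exons in ascending genomic order (the table itself is not sorted).
for exon_id, (start, end) in sorted(exons.items(), key=lambda kv: kv[1][0]):
    print(f"{exon_id}\t{start}\t{end}\t{lengths[exon_id]} bp")

total_exonic = sum(lengths.values())
print(f"Total exonic length: {total_exonic} bp")
```

Under the inclusive-coordinate assumption the seven exons sum to 2613 bp; with half-open coordinates each length would be one base shorter.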

Expression profiles

Bgee: expression breadth ubiquitous, 143 present calls, max score 98.82.

FANTOM5 (CAGE): breadth broad, TPM avg 2.9408 / max 1109.2139, expressed in 207 samples.

FANTOM5 promoters (7 alternative TSS)

Promoter ID | TPM avg | Samples expressed
129780 | 1.0729 | 87
129786 | 0.6848 | 41
129779 | 0.5952 | 61
129787 | 0.4780 | 7
129781 | 0.0538 | 6
129785 | 0.0340 | 17
129782 | 0.0221 | 6

Top tissues by expression

283 total, by Bgee expression score (0-100, higher = more expressed):

Tissue | Anatomy ID | Expression score | Quality
left testis | UBERON:0004533 | 98.82 | gold quality
right testis | UBERON:0004534 | 98.67 | gold quality
sperm | CL:0000019 | 98.45 | gold quality
male germ cell | CL:0000015 | 96.43 | gold quality
adult organism | UBERON:0007023 | 96.00 | gold quality
testis | UBERON:0000473 | 95.39 | gold quality
male germ line stem cell (sensu Vertebrata) in testis | CL:0000089 ∩ UBERON:0000473 | 86.00 | gold quality
body of pancreas | UBERON:0001150 | 69.95 | gold quality
pancreas | UBERON:0001264 | 62.02 | gold quality
prefrontal cortex | UBERON:0000451 | 61.59 | gold quality
stromal cell of endometrium | CL:0002255 | 60.55 | gold quality
anterior cingulate cortex | UBERON:0009835 | 57.35 | gold quality
cingulate cortex | UBERON:0003027 | 57.33 | gold quality
dorsolateral prefrontal cortex | UBERON:0009834 | 56.96 | gold quality
jejunal mucosa | UBERON:0000399 | 56.40 | gold quality
metanephros cortex | UBERON:0010533 | 56.38 | gold quality
frontal cortex | UBERON:0001870 | 55.72 | gold quality
neocortex | UBERON:0001950 | 55.54 | gold quality
cortical plate | UBERON:0005343 | 55.45 | gold quality
Brodmann (1909) area 9 | UBERON:0013540 | 55.25 | gold quality
choroid plexus epithelium | UBERON:0003911 | 55.15 | silver quality
right frontal lobe | UBERON:0002810 | 54.93 | gold quality
cerebral cortex | UBERON:0000956 | 53.40 | gold quality
duodenum | UBERON:0002114 | 52.29 | gold quality
metanephros | UBERON:0000081 | 52.07 | gold quality
cortex of kidney | UBERON:0001225 | 51.54 | gold quality
telencephalon | UBERON:0001893 | 51.37 | gold quality
hypothalamus | UBERON:0001898 | 50.70 | gold quality
forebrain | UBERON:0001890 | 50.22 | gold quality
nucleus accumbens | UBERON:0001882 | 50.16 | gold quality

Single-cell (SCXA)

Detected in 5 experiment(s), a significant marker in 3.

Experiment | Marker? | Max mean expression
E-MTAB-7316 | yes | 1278.68
E-MTAB-11121 | yes | 624.95
E-GEOD-134144 | yes | 32.98
E-MTAB-7249 | no | 15.29
E-ANND-3 | no | 3.40

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): ESR1

miRNA regulators (miRDB)

77 targeting GSG1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNA | Max score | Avg score | miRNA target_count
HSA-MIR-1252-5P | 100.00 | 69.80 | 2774
HSA-MIR-4713-3P | 100.00 | 65.92 | 505
HSA-MIR-371A-3P | 99.99 | 66.77 | 91
HSA-MIR-4534 | 99.99 | 66.58 | 1907
HSA-MIR-141-3P | 99.94 | 72.79 | 2421
HSA-MIR-200A-3P | 99.94 | 72.68 | 2420
HSA-MIR-515-5P | 99.92 | 69.82 | 2343
HSA-MIR-519E-5P | 99.92 | 69.62 | 2358
HSA-MIR-6499-3P | 99.90 | 66.38 | 1212
HSA-MIR-15A-5P | 99.90 | 72.80 | 2787
HSA-MIR-15B-5P | 99.90 | 72.78 | 2798
HSA-MIR-16-5P | 99.90 | 72.80 | 2780
HSA-MIR-195-5P | 99.90 | 72.81 | 2805
HSA-MIR-3180-5P | 99.82 | 69.12 | 2422
HSA-MIR-34B-5P | 99.78 | 67.56 | 1175
HSA-MIR-449C-5P | 99.78 | 67.63 | 1168
HSA-MIR-2682-5P | 99.73 | 67.38 | 1055
HSA-MIR-5580-3P | 99.70 | 69.41 | 2052
HSA-MIR-12124 | 99.68 | 69.17 | 2700
HSA-MIR-545-5P | 99.66 | 70.18 | 2308
HSA-MIR-1260A | 99.61 | 66.67 | 1098
HSA-MIR-1260B | 99.61 | 66.67 | 1098
HSA-MIR-4524A-5P | 99.57 | 71.73 | 1193
HSA-MIR-4524B-5P | 99.57 | 71.68 | 1195
HSA-MIR-3915 | 99.45 | 68.49 | 1905
HSA-MIR-889-5P | 99.41 | 68.75 | 1025
HSA-MIR-7515 | 99.31 | 68.22 | 1795
HSA-MIR-148A-5P | 99.30 | 68.27 | 1141
HSA-MIR-4505 | 99.27 | 67.81 | 2678
HSA-MIR-5787 | 99.22 | 67.86 | 2628
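
Because a lower target_count means a more specific miRNA, the table can be re-ranked by specificity instead of confidence. The sketch below does this in Python for the first five rows only; the tuple layout and the tie-breaking rule (higher max score first) are assumptions made for illustration, not something miRDB or this page prescribes.

```python
# First five rows of the table above: (miRNA, max score, avg score, target_count).
rows = [
    ("HSA-MIR-1252-5P", 100.00, 69.80, 2774),
    ("HSA-MIR-4713-3P", 100.00, 65.92, 505),
    ("HSA-MIR-371A-3P", 99.99, 66.77, 91),
    ("HSA-MIR-4534", 99.99, 66.58, 1907),
    ("HSA-MIR-141-3P", 99.94, 72.79, 2421),
]

# Fewest total targets first; ties broken by the higher max score.
by_specificity = sorted(rows, key=lambda r: (r[3], -r[1]))

for name, max_score, avg_score, target_count in by_specificity:
    print(f"{name}\ttargets={target_count}\tmax_score={max_score:.2f}")

most_specific = by_specificity[0][0]
```

Among these five rows, HSA-MIR-371A-3P (91 targets in total) ranks first even though its max score is marginally below 100.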

Cross-species orthologs

2 orthologs

Organism | Symbol | Gene ID
mus_musculus | Gsg1 | ENSMUSG00000030206
rattus_norvegicus | Gsg1 | ENSRNOG00000008518

Paralogs (10): LIM2 (ENSG00000105370), NKG7 (ENSG00000105374), PMP22 (ENSG00000109099), EMP1 (ENSG00000134531), EMP3 (ENSG00000142227), CLDND2 (ENSG00000160318), GSG1L (ENSG00000169181), TMEM202 (ENSG00000187806), EMP2 (ENSG00000213853), GSG1L2 (ENSG00000214978)

Protein

Protein identifiers

Germ cell-specific gene 1 protein: Q2KHT4 (reviewed: Q2KHT4)

All UniProt accessions (8): A0A494C0G6, Q2KHT4, F1T0A0, F1T0A1, F5GYH0, F5H0V9, F5H134, G3XAB9

UniProt curated annotations

Function. May cause the redistribution of PAPOLB from the cytosol to the endoplasmic reticulum.

Subunit / interactions. Interacts with PAPOLB.

Subcellular location. Endoplasmic reticulum membrane.

Similarity. Belongs to the GSG1 family.

Isoforms (8)

UniProt ID | Names | Canonical?
Q2KHT4-1 | 1 | yes
Q2KHT4-2 | 2 |
Q2KHT4-3 | 3 |
Q2KHT4-4 | 4 |
Q2KHT4-5 | 5 |
Q2KHT4-6 | 6 |
Q2KHT4-7 | 7 |
Q2KHT4-8 | 8 |

RefSeq proteins (15): NP_001074023, NP_001074024, NP_001193771, NP_001193772, NP_001193774, NP_001354287, NP_001354288, NP_001354289, NP_001354290, NP_001354291, NP_001354292, NP_001354293, NP_001354936, NP_112579, NP_722545 (=MANE)

Domains & families (InterPro)

ID | Name | Type
IPR012478 | GSG-1 | Family
IPR050579 | PMP-22/EMP/MP20-like | Family

Pfam: PF07803

UniProt features (15 total): splice variant 6, transmembrane region 4, sequence variant 2, sequence conflict 2, chain 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

Model | pLDDT | Fraction very-high
AF-Q2KHT4-F1 | 70.79 | 0.35
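
To put the pLDDT value in context, the sketch below maps a score onto the confidence bands used by the AlphaFold Protein Structure Database (very high above 90, confident 70 to 90, low 50 to 70, very low below 50). It assumes the 70.79 reported here is the model's mean per-residue pLDDT, which the page does not state; `plddt_band` is a hypothetical helper written for this example.

```python
def plddt_band(plddt: float) -> str:
    """Return the AlphaFold DB confidence band for a pLDDT score (0-100)."""
    if plddt > 90:
        return "very high"
    if plddt > 70:
        return "confident"
    if plddt > 50:
        return "low"
    return "very low"

# Values from the table above; treating 70.79 as a mean pLDDT is an assumption.
mean_plddt = 70.79
fraction_very_high = 0.35

print(f"AF-Q2KHT4-F1: pLDDT {mean_plddt} falls in the '{plddt_band(mean_plddt)}' band")
print(f"Share of residues reported as very high: {fraction_very_high:.0%}")
```

A summary value just above 70 alongside a 0.35 very-high fraction suggests that confidence varies considerably along the chain, so per-residue scores are worth checking before relying on any one region.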

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 77 (showing top): LI_CISPLATIN_RESISTANCE_DN, KORKOLA_CHORIOCARCINOMA_DN, SENESE_HDAC1_AND_HDAC2_TARGETS_DN, KORKOLA_EMBRYONAL_CARCINOMA_DN, GNF2_CCNA1, ACEVEDO_METHYLATED_IN_LIVER_CANCER_DN, SENESE_HDAC3_TARGETS_DN, GOCC_NUCLEAR_OUTER_MEMBRANE_ENDOPLASMIC_RETICULUM_MEMBRANE_NETWORK, GOCC_ORGANELLE_SUBCOMPARTMENT, WAKABAYASHI_ADIPOGENESIS_PPARG_RXRA_BOUND_36HR, WAKABAYASHI_ADIPOGENESIS_PPARG_BOUND_8D, ATF2_S_UP.V1_UP, PRC2_EED_DN.V1_UP, ALK_DN.V1_DN, KRAS.BREAST_UP.V1_UP

GO Biological Process (0):

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (4): endoplasmic reticulum membrane (GO:0005789), plasma membrane (GO:0005886), endoplasmic reticulum (GO:0005783), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

Category | Terms
binding | 1
organelle membrane | 1
nuclear outer membrane-endoplasmic reticulum membrane network | 1
endoplasmic reticulum subcompartment | 1
membrane | 1
cell periphery | 1
cytoplasm | 1
endomembrane system | 1
intracellular membrane-bounded organelle | 1
cellular anatomical structure | 1

Protein interactions and networks

STRING

354 interactions, top by confidence (×1000):

Protein A | Protein B | Partner UniProt | Score
GSG1 | RIMKLB | Q9ULI2 | 491
GSG1 | SCAPER | Q9BY12 | 477
GSG1 | TMEM39B | Q9GZU3 | 392
GSG1 | OR6A2 | O95222 | 369
GSG1 | CASKIN2 | Q8WXE0 | 348
GSG1 | VWC2 | Q2TAL6 | 335
GSG1 | PRRT1 | Q99946 | 330
GSG1 | PPP2R5B | Q15173 | 317
GSG1 | SLF2 | Q8IX21 | 316
GSG1 | SH3BGR | P55822 | 300
GSG1 | RHEBL1 | Q8TAI7 | 288
GSG1 | FAM124B | Q9H5Z6 | 288
GSG1 | NRN1 | Q9NPD7 | 284
GSG1 | SIGLEC11 | Q96RL6 | 282
GSG1 | TSSK2 | Q96PF2 | 281
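
Since the STRING scores are shown multiplied by 1000, they are easier to interpret after rescaling to the 0 to 1 range. The following Python sketch rescales the listed scores and keeps partners at or above 0.400, the cutoff STRING conventionally labels medium confidence; the choice of cutoff and the variable names are assumptions for this example.

```python
# Combined STRING scores (x1000) for GSG1 partners, copied from the table above.
string_scores = {
    "RIMKLB": 491, "SCAPER": 477, "TMEM39B": 392, "OR6A2": 369,
    "CASKIN2": 348, "VWC2": 335, "PRRT1": 330, "PPP2R5B": 317,
    "SLF2": 316, "SH3BGR": 300, "RHEBL1": 288, "FAM124B": 288,
    "NRN1": 284, "SIGLEC11": 282, "TSSK2": 281,
}

# Assumed threshold: STRING's conventional "medium confidence" level.
MEDIUM_CONFIDENCE = 0.400

# Rescale to the 0-1 range.
scaled = {partner: score / 1000 for partner, score in string_scores.items()}

# Partners that meet the threshold, in table order.
passing = [partner for partner, score in scaled.items() if score >= MEDIUM_CONFIDENCE]

print(f"{len(passing)} of {len(scaled)} listed partners reach {MEDIUM_CONFIDENCE:.3f}:")
for partner in passing:
    print(f"  GSG1 - {partner}: {scaled[partner]:.3f}")
```

Only the top two listed partners clear this cutoff, and none approach the 0.700 level usually treated as high confidence, so the listed associations are best read as weak predictions.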

IntAct

5 interactions, top by confidence:

A | B | Type | Score
TRAF2 | GSG1 | psi-mi:"MI:0915"(physical association) | 0.560
GSG1 | TRAF2 | psi-mi:"MI:0915"(physical association) | 0.560
GSG1 | IL37 | psi-mi:"MI:0915"(physical association) | 0.400

BioGRID (14): GSG1 (Two-hybrid), ASAH1 (Affinity Capture-MS), IL37 (Affinity Capture-MS), IL37 (Affinity Capture-MS), ASAH1 (Affinity Capture-MS), GSG1 (Two-hybrid), ARFIP2 (Two-hybrid), FTHL17 (Two-hybrid), SDCBP (Two-hybrid), SYT16 (Two-hybrid), PITPNC1 (Two-hybrid), MIEF2 (Two-hybrid), IL37 (Affinity Capture-MS), GSG1 (Protein-peptide)

ESM2 similar proteins: A0A494BZU4, A0A7H0DND7, A0JNG0, A2T345, A4IIV4, C4QM85, E7F594, G5EDX4, O02051, O45306, P0DP42, P0DST5, P0DST6, P21061, P24763, P34362, P34363, P53053, Q09282, Q0II41, Q10907, Q11071, Q11085, Q13571, Q20249, Q297K8, Q2KHT4, Q2KJA5, Q32KQ5, Q5DC12, Q5GH77, Q5PQM0, Q5RD28, Q5REZ0, Q5TAH2, Q61168, Q6AXT9, Q6GV27, Q6GV28, Q7K1V5

Diamond homologs: A4IIV4, A8MUP6, D3Z7H4, D3ZK93, Q2KHT4, Q3SZT1, Q4V922, Q6AYL2, Q6UXU4, Q8R1W2

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

69 variants total. Per-class counts are floors (≥ shown; pagination cap):

Classification | Count (floor)
Pathogenic | 0
Likely pathogenic | 0
Uncertain significance | 49
Likely benign | 13
Benign | 0
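
Because the per-class counts are floors, they need not add up to the 69-variant total. The short Python check below sums the floors and reports the remainder; how that remainder is distributed (undercounted classes, or classifications not listed in the table) is not stated on this page, so the sketch only quantifies the gap.

```python
# Totals and per-class floor counts copied from the ClinVar table above.
total_variants = 69
floor_counts = {
    "Pathogenic": 0,
    "Likely pathogenic": 0,
    "Uncertain significance": 49,
    "Likely benign": 13,
    "Benign": 0,
}

# Variants accounted for by the floors, and the remainder they leave open.
accounted = sum(floor_counts.values())
unaccounted = total_variants - accounted

print(f"Accounted for by floors: {accounted} of {total_variants}")
print(f"Not attributable to a listed class: {unaccounted}")
```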

Top pathogenic / likely-pathogenic (0)

SpliceAI

1002 predictions. Top by Δscore:

Variant | Effect | Δscore
12:13085239:CCATG:C | acceptor_gain | 1.0000
12:13085240:CATG:C | acceptor_gain | 1.0000
12:13085240:CATGC:C | acceptor_gain | 1.0000
12:13085242:TG:T | acceptor_gain | 1.0000
12:13085244:C:CC | acceptor_gain | 1.0000
12:13087150:A:AC | donor_gain | 1.0000
12:13087151:C:CC | donor_gain | 1.0000
12:13088794:C:A | donor_gain | 1.0000
12:13085241:ATG:A | acceptor_gain | 0.9900
12:13085243:GCTA:G | acceptor_loss | 0.9900
12:13085244:C:CA | acceptor_loss | 0.9900
12:13085245:T:C | acceptor_loss | 0.9900
12:13085254:A:AC | acceptor_gain | 0.9900
12:13085254:A:C | acceptor_gain | 0.9900
12:13087151:CT:C | donor_gain | 0.9900
12:13087151:CTA:C | donor_gain | 0.9900
12:13087151:CTAGA:C | donor_gain | 0.9900
12:13087261:GACCT:G | acceptor_loss | 0.9900
12:13087263:CCTAC:C | acceptor_loss | 0.9900
12:13087264:CTACC:C | acceptor_loss | 0.9900
12:13087265:T:A | acceptor_loss | 0.9900
12:13087900:AACTC:A | donor_loss | 0.9900
12:13087901:ACTCA:A | donor_loss | 0.9900
12:13087902:CTCA:C | donor_loss | 0.9900
12:13087903:TCAC:T | donor_loss | 0.9900
12:13087904:CA:C | donor_loss | 0.9900
12:13087905:A:C | donor_loss | 0.9900
12:13087906:C:CG | donor_loss | 0.9900
12:13088060:C:CC | acceptor_gain | 0.9900
12:13088064:C:CT | acceptor_gain | 0.9900
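
The variant keys in the SpliceAI table appear to follow a `chromosome:position:ref:alt` layout. The Python sketch below splits a key into its parts and labels it as an SNV, insertion, or deletion by comparing allele lengths. The key layout is inferred from the table, and `Variant`, `parse_variant`, and `variant_type` are hypothetical names introduced for this example, not part of SpliceAI.

```python
from typing import NamedTuple


class Variant(NamedTuple):
    chrom: str
    pos: int
    ref: str
    alt: str


def parse_variant(key: str) -> Variant:
    """Split a 'chrom:pos:ref:alt' key (layout assumed from the table)."""
    chrom, pos, ref, alt = key.split(":")
    return Variant(chrom, int(pos), ref, alt)


def variant_type(v: Variant) -> str:
    """Classify by allele length only; no normalisation is attempted."""
    if len(v.ref) == 1 and len(v.alt) == 1:
        return "SNV"
    if len(v.ref) > len(v.alt):
        return "deletion"
    if len(v.ref) < len(v.alt):
        return "insertion"
    return "MNV"  # equal-length multi-base substitution


# Three keys taken from the table above.
for key in ("12:13085239:CCATG:C", "12:13085244:C:CC", "12:13088794:C:A"):
    v = parse_variant(key)
    print(f"{key}\tpos={v.pos}\t{variant_type(v)}")
```

Most of the top-scoring entries are small indels clustered within a few bases of one another, which is consistent with them disrupting or creating the same splice sites.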

AlphaMissense

2366 scored. Top likely-pathogenic:

dbSNP variants (sampled 300 via entrez): RS1000063842 (12:13085984 C>G), RS1000869738 (12:13105310 A>G), RS1001014209 (12:13098521 A>C), RS1001142420 (12:13102045 G>C), RS1001225470 (12:13083928 CT>C), RS1001469319 (12:13102846 T>A), RS1001477195 (12:13102492 A>G), RS1001508746 (12:13098866 T>G), RS1001564253 (12:13102260 A>G), RS1002110345 (12:13097407 C>T), RS1002163892 (12:13090699 C>T), RS1002173096 (12:13096248 C>T), RS1002418698 (12:13097221 T>C), RS1002470662 (12:13100219 G>A,T), RS1002567089 (12:13100493 C>T)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 association (top):

Study | Trait | p-value
GCST005988_10 | Serum albumin levels | 1.000000e-08
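
GWAS p-values are often read on a negative log10 scale and compared against the conventional genome-wide significance threshold of 5e-8. The sketch below applies both to the single association listed; the threshold is the commonly used convention and is an assumption here, not a value given on this page.

```python
import math

# p-value copied from the table above (study GCST005988_10, serum albumin levels).
p_value = 1.0e-08

# Conventional genome-wide significance threshold (assumed, not from this page).
GENOME_WIDE_THRESHOLD = 5e-8

neg_log10_p = -math.log10(p_value)
is_significant = p_value < GENOME_WIDE_THRESHOLD

print(f"-log10(p) = {neg_log10_p:.2f}")
print(f"Below {GENOME_WIDE_THRESHOLD:g}: {is_significant}")
```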

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

18 total (human), top 18 by PubMed support.

Chemical | Actions (top 5) | PubMed papers
Resveratrol | affects cotreatment, decreases expression | 2
Aflatoxin B1 | decreases methylation, increases methylation | 2
butyraldehyde | increases expression | 1
fipronil | affects cotreatment, decreases expression | 1
CGP 52608 | affects binding, increases reaction | 1
2-palmitoylglycerol | increases expression | 1
jinfukang | decreases expression | 1
incobotulinumtoxinA | decreases expression | 1
Benzo(a)pyrene | affects methylation | 1
Cisplatin | decreases expression | 1
Copper | affects cotreatment, decreases expression | 1
DEET | affects cotreatment, decreases expression | 1
Plant Extracts | affects cotreatment, decreases expression | 1
Silicon Dioxide | increases expression | 1
Valproic Acid | decreases methylation | 1
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxide | increases expression | 1
Copper Sulfate | increases expression | 1
Particulate Matter | decreases expression | 1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.
