GSG1L

gene
On this page

Also known as MGC18079PRO19651KTSR5831

Summary

GSG1L (GSG1 like, HGNC:28283) is a protein-coding gene on chromosome 16p12.1, encoding Germ cell-specific gene 1-like protein (Q6UXU4). As a component of the inner core of AMPAR complex, modifies AMPA receptor (AMPAR) gating.

Predicted to be involved in regulation of postsynaptic neurotransmitter receptor internalization. Predicted to act upstream of or within several processes, including regulation of AMPA receptor activity; regulation of short-term neuronal synaptic plasticity; and transmission of nerve impulse. Predicted to be located in asymmetric synapse and membrane. Predicted to be active in Schaffer collateral - CA1 synapse; glutamatergic synapse; and postsynaptic density membrane.

Source: NCBI Gene 146395 — RefSeq curated summary.

At a glance

  • GWAS associations: 4
  • Clinical variants (ClinVar): 31 total — 1 pathogenic
  • MANE Select transcript: NM_001109763

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:28283
Approved symbolGSG1L
NameGSG1 like
Location16p12.1
Locus typegene with protein product
StatusApproved
AliasesMGC18079, PRO19651, KTSR5831
Ensembl geneENSG00000169181
Ensembl biotypeprotein_coding
OMIM617161
Entrez146395

Gene structure

Transcript identifiers

Ensembl transcripts: 8 — 7 protein_coding, 1 nonsense_mediated_decay

ENST00000380897, ENST00000395724, ENST00000447459, ENST00000562611, ENST00000569166, ENST00000898876, ENST00000951030, ENST00000951031

RefSeq mRNA: 4 — MANE Select: NM_001109763 NM_001109763, NM_001323900, NM_001323901, NM_144675

CCDS: CCDS10631, CCDS45450, CCDS81960

Canonical transcript exons

ENST00000447459 — 7 exons

ExonStartEnd
ENSE000011386092780748727807554
ENSE000025989822778752827791467
ENSE000035054182806307628063714
ENSE000035928372788448627884638
ENSE000036394622796315627963203
ENSE000036437592782878927828956
ENSE000036511172784495027845061

Expression profiles

Bgee: expression breadth ubiquitous, 144 present calls, max score 98.53.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.4752 / max 29.6106, expressed in 147 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
1568570.2840115
1568560.155081
1568580.036219

Top tissues by expression

237 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
sural nerveUBERON:001548898.53gold quality
pancreatic ductal cellCL:000207985.44silver quality
right atrium auricular regionUBERON:000663185.26gold quality
putamenUBERON:000187484.46gold quality
cardiac atriumUBERON:000208184.15gold quality
nucleus accumbensUBERON:000188283.98gold quality
caudate nucleusUBERON:000187383.80gold quality
ascending aortaUBERON:000149682.16gold quality
thoracic aortaUBERON:000151581.80gold quality
amygdalaUBERON:000187680.70gold quality
prefrontal cortexUBERON:000045180.69gold quality
hypothalamusUBERON:000189879.14gold quality
anterior cingulate cortexUBERON:000983578.00gold quality
substantia nigraUBERON:000203877.79gold quality
descending thoracic aortaUBERON:000234576.92gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047376.88silver quality
Brodmann (1909) area 9UBERON:001354075.90gold quality
midbrainUBERON:000189175.89gold quality
right frontal lobeUBERON:000281075.43gold quality
frontal cortexUBERON:000187075.23gold quality
tendon of biceps brachiiUBERON:000818875.00gold quality
dorsolateral prefrontal cortexUBERON:000983474.81gold quality
neocortexUBERON:000195074.72gold quality
C1 segment of cervical spinal cordUBERON:000646973.69gold quality
cerebral cortexUBERON:000095672.97gold quality
forebrainUBERON:000189072.97gold quality
upper arm skinUBERON:000426372.77gold quality
temporal lobeUBERON:000187172.66gold quality
spinal cordUBERON:000224072.49gold quality
myocardiumUBERON:000234972.09gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no2.28

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

121 targeting GSG1L, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-1193100.0065.93529
HSA-MIR-3134100.0066.43777
HSA-MIR-432-3P100.0067.86705
HSA-MIR-4533100.0069.482758
HSA-MIR-4481100.0066.421669
HSA-MIR-5193100.0067.261744
HSA-MIR-1252-5P100.0069.802774
HSA-MIR-4747-5P100.0067.902681
HSA-MIR-5196-5P100.0067.982761
HSA-MIR-453499.9966.581907
HSA-MIR-186-5P99.9970.833707
HSA-MIR-150-5P99.9966.691976
HSA-MIR-4745-5P99.9865.951028
HSA-MIR-6891-5P99.9866.531372
HSA-MIR-807599.9767.20962
HSA-MIR-3605-5P99.9667.12932
HSA-MIR-808299.9567.271170
HSA-MIR-6744-5P99.9366.82748
HSA-MIR-6721-5P99.9368.922981
HSA-MIR-6508-5P99.9270.672465
HSA-MIR-4753-3P99.9071.033786
HSA-MIR-10395-5P99.8667.35676
HSA-MIR-806799.8669.592260
HSA-MIR-130B-5P99.8368.501888
HSA-MIR-6756-5P99.8267.972466
HSA-MIR-3121-3P99.8271.963630
HSA-MIR-3681-5P99.8266.88387
HSA-MIR-4694-3P99.7969.532640
HSA-MIR-4713-5P99.7867.801794
HSA-MIR-465899.7764.94514

Literature-anchored findings (GeneRIF, showing 1)

  • gene expression experiment revealed that CYP2B6, SPON1, and GSG1L can be activated concomitantly through a constitutive androstane receptor (CAR) activation pathway (PMID:27010727)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
danio_reriogsg1lENSDARG00000037390
mus_musculusGsg1lENSMUSG00000046182
rattus_norvegicusGsg1lENSRNOG00000065255

Paralogs (10): LIM2 (ENSG00000105370), NKG7 (ENSG00000105374), PMP22 (ENSG00000109099), GSG1 (ENSG00000111305), EMP1 (ENSG00000134531), EMP3 (ENSG00000142227), CLDND2 (ENSG00000160318), TMEM202 (ENSG00000187806), EMP2 (ENSG00000213853), GSG1L2 (ENSG00000214978)

Protein

Protein identifiers

Germ cell-specific gene 1-like proteinQ6UXU4 (reviewed: Q6UXU4)

All UniProt accessions (2): Q6UXU4, H3BNP0

UniProt curated annotations — full annotation on UniProt →

Function. As a component of the inner core of AMPAR complex, modifies AMPA receptor (AMPAR) gating.

Subunit / interactions. Component of the inner core of AMPAR complex. AMPAR complex consists of an inner core made of 4 pore-forming GluA/GRIA proteins (GRIA1, GRIA2, GRIA3 and GRIA4) and 4 major auxiliary subunits arranged in a twofold symmetry. One of the two pairs of distinct binding sites is occupied either by CNIH2, CNIH3 or CACNG2, CACNG3. The other harbors CACNG2, CACNG3, CACNG4, CACNG8 or GSG1L. This inner core of AMPAR complex is complemented by outer core constituents binding directly to the GluA/GRIA proteins at sites distinct from the interaction sites of the inner core constituents. Outer core constituents include at least PRRT1, PRRT2, CKAMP44/SHISA9, FRRS1L and NRN1. The proteins of the inner and outer core serve as a platform for other, more peripherally associated AMPAR constituents. Alone or in combination, these auxiliary subunits control the gating and pharmacology of the AMPAR complex and profoundly impact their biogenesis and protein processing.

Subcellular location. Cell membrane. Synapse.

Similarity. Belongs to the GSG1 family.

Isoforms (4)

UniProt IDNamesCanonical?
Q6UXU4-11yes
Q6UXU4-32
Q6UXU4-43
Q6UXU4-24

RefSeq proteins (4): NP_001103233, NP_001310829, NP_001310830, NP_653276 (=MANE)

Domains & families (InterPro)

IDNameType
IPR012478GSG-1Family
IPR050579PMP-22/EMP/MP20-likeFamily

Pfam: PF07803

UniProt features (13 total): topological domain 5, transmembrane region 4, splice variant 3, chain 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q6UXU4-F172.230.35

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 98 (showing top): GOBP_REGULATION_OF_NEURONAL_SYNAPTIC_PLASTICITY, GOBP_REGULATION_OF_SHORT_TERM_NEURONAL_SYNAPTIC_PLASTICITY, GOBP_VESICLE_MEDIATED_TRANSPORT, GOBP_CELL_CELL_SIGNALING, GOBP_REGULATION_OF_VESICLE_MEDIATED_TRANSPORT, GOBP_CELL_JUNCTION_ORGANIZATION, GOBP_REGULATION_OF_SYNAPTIC_PLASTICITY, GOBP_REGULATION_OF_RECEPTOR_INTERNALIZATION, GOBP_REGULATION_OF_RECEPTOR_MEDIATED_ENDOCYTOSIS, SCHAEFFER_PROSTATE_DEVELOPMENT_6HR_DN, GOBP_RECEPTOR_INTERNALIZATION, GOBP_SYNAPTIC_SIGNALING, GOBP_REGULATION_OF_ENDOCYTOSIS, GOBP_REGULATION_OF_TRANSPORT, GOBP_IMPORT_INTO_CELL

GO Biological Process (7): transmission of nerve impulse (GO:0019226), regulation of short-term neuronal synaptic plasticity (GO:0048172), synapse organization (GO:0050808), regulation of postsynaptic neurotransmitter receptor internalization (GO:0099149), regulation of cell communication (GO:0010646), regulation of signaling (GO:0023051), regulation of transport (GO:0051049)

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (7): plasma membrane (GO:0005886), Schaffer collateral - CA1 synapse (GO:0098685), postsynaptic density membrane (GO:0098839), glutamatergic synapse (GO:0098978), membrane (GO:0016020), asymmetric synapse (GO:0032279), synapse (GO:0045202)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cell communication2
synapse2
postsynaptic density2
action potential1
chemical synaptic transmission1
nervous system process1
regulation of neuronal synaptic plasticity1
cell junction organization1
regulation of receptor internalization1
regulation of biological quality1
postsynaptic neurotransmitter receptor internalization1
regulation of cellular process1
signaling1
regulation of biological process1
transport1
regulation of localization1
binding1
membrane1
cell periphery1
postsynaptic membrane1
postsynaptic specialization membrane1
cellular anatomical structure1
neuron to neuron synapse1
cell junction1

Protein interactions and networks

STRING

696 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GSG1LSHISA9B4DS77860
GSG1LCNIH2Q6PI25759
GSG1LCNIH1O95406725
GSG1LCNIH4Q9P003723
GSG1LGRIA1P42261696
GSG1LSYNDIG1Q9H7V2689
GSG1LCACNG8Q8WXS5670
GSG1LCACNG2Q9Y698647
GSG1LCNIH3Q8TBE1646
GSG1LGRIA2P42262541
GSG1LVWC2Q2TAL6527
GSG1LCCDC196A0A1B0GTZ2519
GSG1LSHISA7A6NL88500
GSG1LFRRS1LQ9P0K9490
GSG1LABHD6Q9BV23458

IntAct

4 interactions, top by confidence:

ABTypeScore
GSG1LLXNpsi-mi:“MI:0915”(physical association)0.560

BioGRID (2): GSG1L (Two-hybrid), MAL (Two-hybrid)

ESM2 similar proteins: A0JPH4, A4IIV4, A6NGA9, A8MUP6, D3Z7H4, D3ZK93, O42282, O60478, O75204, O95859, P0DP42, P38551, P58418, Q08CE6, Q0II41, Q11085, Q29RH7, Q2KHT4, Q2M2E3, Q32KQ5, Q32LT7, Q3SZT1, Q4V922, Q504G0, Q569A2, Q5CZV0, Q5FWC3, Q5M962, Q5R8B5, Q5RCD5, Q5VW38, Q5XGU1, Q5ZIF5, Q6AXT9, Q6AYL2, Q6GV27, Q6GV28, Q6UXU4, Q7TQI0, Q8BGP5

Diamond homologs: A4IIV4, A8MUP6, D3Z7H4, D3ZK93, Q2KHT4, Q3SZT1, Q4V922, Q6AYL2, Q6UXU4, Q8R1W2

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

31 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance28
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
2423570NC_000016.9:g.(?27441393)(29001333_?)delPathogenic

SpliceAI

1536 predictions. Top by Δscore:

VariantEffectΔscore
16:27807485:A:ACdonor_gain1.0000
16:27807486:C:CCdonor_gain1.0000
16:27807486:CGGG:Cdonor_gain1.0000
16:27807553:TC:Tacceptor_gain1.0000
16:27807554:CC:Cacceptor_gain1.0000
16:27844944:ACTTA:Adonor_loss1.0000
16:27844947:T:TGdonor_loss1.0000
16:27844948:A:ACdonor_gain1.0000
16:27844948:ACCAG:Adonor_loss1.0000
16:27844949:C:CAdonor_loss1.0000
16:27844949:C:CCdonor_gain1.0000
16:27845057:GAGGC:Gacceptor_gain1.0000
16:27845059:GGC:Gacceptor_gain1.0000
16:27845062:C:CCacceptor_gain1.0000
16:27845062:CTGTG:Cacceptor_loss1.0000
16:27845063:T:Aacceptor_loss1.0000
16:27884480:CCTTA:Cdonor_loss1.0000
16:27884481:CTTAC:Cdonor_loss1.0000
16:27884482:TTACC:Tdonor_loss1.0000
16:27884483:TA:Tdonor_loss1.0000
16:27884484:A:ACdonor_gain1.0000
16:27884484:A:ATdonor_loss1.0000
16:27884485:C:CCdonor_gain1.0000
16:27884636:CCC:Cacceptor_gain1.0000
16:27884637:CCC:Cacceptor_gain1.0000
16:27884638:CCTGT:Cacceptor_loss1.0000
16:27884639:C:CCacceptor_gain1.0000
16:27884639:C:Gacceptor_loss1.0000
16:27888055:TTTAC:Tdonor_loss1.0000
16:27888056:TTACC:Tdonor_loss1.0000

AlphaMissense

2158 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
16:27828906:A:GL238P1.000
16:27828906:A:TL238H1.000
16:27828915:A:TV235D1.000
16:27828926:C:AM231I1.000
16:27828926:C:GM231I1.000
16:27828926:C:TM231I1.000
16:27828927:A:CM231R1.000
16:27828927:A:TM231K1.000
16:27828938:A:CF227L1.000
16:27828938:A:TF227L1.000
16:27828939:A:CF227C1.000
16:27828939:A:GF227S1.000
16:27828940:A:GF227L1.000
16:27828940:A:TF227I1.000
16:27828947:C:AW224C1.000
16:27828947:C:GW224C1.000
16:27828949:A:GW224R1.000
16:27828949:A:TW224R1.000
16:27844952:G:CF220L1.000
16:27844952:G:TF220L1.000
16:27844954:A:GF220L1.000
16:27844970:C:AW214C1.000
16:27844970:C:GW214C1.000
16:27844972:A:GW214R1.000
16:27844972:A:TW214R1.000
16:27844980:G:TP211H1.000
16:27844985:C:AW209C1.000
16:27844985:C:GW209C1.000
16:27844987:A:GW209R1.000
16:27844987:A:TW209R1.000

dbSNP variants (sampled 300 via entrez): RS1000021486 (16:28009599 C>T), RS1000024395 (16:28012197 C>A,T), RS1000080708 (16:28004656 T>C), RS1000081209 (16:27836907 A>G), RS1000090547 (16:27807106 A>G), RS1000098685 (16:28006271 C>T), RS1000105655 (16:28047255 A>G), RS1000108052 (16:27922278 C>T), RS1000137240 (16:27856900 T>C), RS1000142523 (16:27947241 C>T), RS1000144555 (16:27807341 C>A), RS1000146125 (16:27842543 T>A), RS1000151867 (16:27972791 T>G), RS1000167106 (16:27906049 C>T), RS1000172076 (16:28028991 C>T)

Disease associations

OMIM: gene MIM:617161 | disease phenotypes: MIM:256730

GenCC curated gene-disease

Mondo (1): neuronal ceroid lipofuscinosis (MONDO:0016295)

Orphanet (2): Neuronal ceroid lipofuscinosis (Orphanet:216), OBSOLETE: Infantile neuronal ceroid lipofuscinosis (Orphanet:79263)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

4 associations (top):

StudyTraitp-value
GCST001762_64Obesity-related traits7.000000e-06
GCST009218_31Lateral ventricle temporal horn volume4.000000e-08
GCST010002_111Refractive error3.000000e-09
GCST012440_2Hemoglobin levels in non-alcoholic fatty liver disease x mastiha supplementation interaction4.000000e-08

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0004509hemoglobin measurement
EFO:0600067mastiha supplement exposure measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

16 total (human), top 16 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, increases methylation2
GSK-J4increases expression1
benzo(e)pyreneincreases methylation1
aflatoxin B2increases methylation1
S-(1,2-dichlorovinyl)cysteineaffects cotreatment, decreases expression1
CGP 52608affects binding, increases reaction1
bisphenol Saffects cotreatment, decreases methylation1
Fulvestrantaffects cotreatment, decreases methylation1
Acetaminophendecreases expression1
Allergensaffects cotreatment, decreases expression, increases abundance1
Vehicle Emissionsaffects cotreatment, decreases expression, increases abundance1
Lipopolysaccharidesaffects cotreatment, decreases expression1
Methapyrileneincreases methylation1
Rotenonedecreases expression1
Valproic Acidincreases methylation1
Particulate Matterincreases abundance, affects cotreatment, decreases expression1

Clinical trials (associated diseases)

7 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00337636PHASE1COMPLETEDStudy of HuCNS-SC Cells in Patients With Infantile or Late Infantile Neuronal Ceroid Lipofuscinosis (NCL)
NCT01238315PHASE1WITHDRAWNSafety and Efficacy Study of HuCNS-SC in Subjects With Neuronal Ceroid Lipofuscinosis
NCT07582484PHASE1/PHASE2NOT_YET_RECRUITINGGene Therapy Trial for CLN6 Batten Disease
NCT01873924Not specifiedRECRUITINGClinical and Neuropsychological Investigations in Batten Disease
NCT01966757Not specifiedCOMPLETEDNeuronal Ceroid Lipofuscinosis and Associated Sleep Abnormalities
NCT04613089Not specifiedRECRUITINGNatural History and Longitudinal Clinical Assessments in NCL / Batten Disease, the International DEM-CHILD Database
NCT06844877Not specifiedRECRUITINGItalian NCL Registry: a Registry for NCL as an Integration Tool for Future Therapeutic Strategies
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): neuronal ceroid lipofuscinosis