GEMIN8

gene
On this page

Also known as FLJ20514

Summary

GEMIN8 (gem nuclear organelle associated protein 8, HGNC:26044) is a protein-coding gene on chromosome Xp22.2, encoding Gem-associated protein 8 (Q9NWZ8). The SMN complex catalyzes the assembly of small nuclear ribonucleoproteins (snRNPs), the building blocks of the spliceosome, and thereby plays an important role in the splicing of cellular pre-mRNAs. It is a selective cancer dependency (DepMap: 78.3% of cell lines).

The protein encoded by this gene is part of the SMN complex, which is necessary for spliceosomal snRNP assembly in the cytoplasm and pre-mRNA splicing in the nucleus. The encoded protein binds to both SMN1 and the GEMIN6/GEMIN7 heterodimer, mediating their interaction. This protein is found in nuclear Gemini of Cajal bodies (gems) and in the cytoplasm. Three transcript variants encoding the same protein have been found for this gene.

Source: NCBI Gene 54960 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 62 total
  • Cancer dependency (DepMap): dependent in 78.3% of screened cell lines
  • MANE Select transcript: NM_001042479

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26044
Approved symbolGEMIN8
Namegem nuclear organelle associated protein 8
LocationXp22.2
Locus typegene with protein product
StatusApproved
AliasesFLJ20514
Ensembl geneENSG00000046647
Ensembl biotypeprotein_coding
OMIM300962
Entrez54960

Gene structure

Transcript identifiers

Ensembl transcripts: 21 — 19 protein_coding, 1 protein_coding_CDS_not_defined, 1 nonsense_mediated_decay

ENST00000332885, ENST00000380523, ENST00000398355, ENST00000460203, ENST00000477386, ENST00000680255, ENST00000850642, ENST00000889493, ENST00000889496, ENST00000889498, ENST00000889501, ENST00000889503, ENST00000915202, ENST00000915203, ENST00000915204, ENST00000915205, ENST00000915206, ENST00000965135, ENST00000965136, ENST00000965137, ENST00000965138

RefSeq mRNA: 3 — MANE Select: NM_001042479 NM_001042479, NM_001042480, NM_017856

CCDS: CCDS14159

Canonical transcript exons

ENST00000680255 — 5 exons

ExonStartEnd
ENSE000014853251402614014026221
ENSE000042824341402146414021511
ENSE000042824351402977714029892
ENSE000042824361400672614009169
ENSE000042824391402007814020534

Expression profiles

Bgee: expression breadth ubiquitous, 232 present calls, max score 95.79.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 13.5007 / max 74.9323, expressed in 1799 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
1985097.65161753
1985085.00691675
1985070.8421576

Top tissues by expression

272 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right uterine tubeUBERON:000130295.79gold quality
left ovaryUBERON:000211992.63gold quality
C1 segment of cervical spinal cordUBERON:000646992.34gold quality
right ovaryUBERON:000211892.33gold quality
tibial nerveUBERON:000132391.93gold quality
body of uterusUBERON:000985391.86gold quality
descending thoracic aortaUBERON:000234590.66gold quality
left lobe of thyroid glandUBERON:000112090.58gold quality
right lobe of thyroid glandUBERON:000111990.52gold quality
left uterine tubeUBERON:000130390.34gold quality
endocervixUBERON:000045890.28gold quality
adenohypophysisUBERON:000219690.25gold quality
lower esophagus muscularis layerUBERON:003583389.97gold quality
lower esophagusUBERON:001347389.94gold quality
esophagogastric junction muscularis propriaUBERON:003584189.79gold quality
spinal cordUBERON:000224089.65gold quality
body of pancreasUBERON:000115089.58gold quality
left coronary arteryUBERON:000162689.47gold quality
thoracic aortaUBERON:000151589.46gold quality
mucosa of stomachUBERON:000119989.43gold quality
ascending aortaUBERON:000149689.43gold quality
thyroid glandUBERON:000204689.34gold quality
muscle layer of sigmoid colonUBERON:003580589.31gold quality
ectocervixUBERON:001224989.18gold quality
pituitary glandUBERON:000000789.07gold quality
aortaUBERON:000094789.06gold quality
tibial arteryUBERON:000761089.02gold quality
popliteal arteryUBERON:000225089.01gold quality
right coronary arteryUBERON:000162588.89gold quality
minor salivary glandUBERON:000183088.76gold quality

Single-cell (SCXA)

Detected in 4 experiment(s), a significant marker in 3.

ExperimentMarker?Max mean expression
E-GEOD-137537yes1053.44
E-MTAB-8142yes16.37
E-ANND-3yes4.70
E-MTAB-6524no84.37

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

78 targeting GEMIN8, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-8485100.0077.574731
HSA-MIR-450A-1-3P100.0069.331837
HSA-MIR-366299.9973.825684
HSA-MIR-548AW99.9972.573559
HSA-MIR-150-5P99.9966.691976
HSA-MIR-548AN99.9770.912817
HSA-MIR-1250-3P99.9670.044038
HSA-MIR-6778-3P99.9667.292693
HSA-MIR-1468-3P99.9672.743797
HSA-MIR-185-3P99.9567.011743
HSA-MIR-141-3P99.9472.792421
HSA-MIR-200A-3P99.9472.682420
HSA-MIR-311999.9271.342390
HSA-MIR-498-3P99.9171.271114
HSA-MIR-153-5P99.8973.866317
HSA-LET-7A-2-3P99.8770.531921
HSA-LET-7G-3P99.8570.431929
HSA-MIR-4728-5P99.8569.394718
HSA-MIR-544A99.8468.661965
HSA-MIR-6785-5P99.8268.684428
HSA-MIR-94499.8270.853042
HSA-MIR-548AJ-5P99.7871.123085
HSA-MIR-548F-5P99.7871.023093
HSA-MIR-548G-5P99.7871.123085
HSA-MIR-548X-5P99.7871.123085
HSA-MIR-7856-5P99.7569.992901
HSA-MIR-808499.7369.571760
HSA-MIR-149-3P99.7268.223963
HSA-MIR-430699.7270.503630
HSA-MIR-4677-5P99.7070.091940

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 78.3% of screened cell lines.

Literature-anchored findings (GeneRIF, showing 2)

  • Gemin8 is a novel integral component of the SMN complex (PMID:16434402)
  • Gemin8 has an essential role in the proper structural organization of the SMN complex and the involvement of the heteromeric subunit containing Gemin6, Gemin7, Gemin8, and Unrip in the recruitment of Sm proteins to the snRNP assembly pathway (PMID:17023415)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriogemin8ENSDARG00000053496
mus_musculusGemin8ENSMUSG00000040621
rattus_norvegicusGemin8ENSRNOG00000029038
rattus_norvegicusENSRNOG00000068876
caenorhabditis_elegansWBGENE00017357

Protein

Protein identifiers

Gem-associated protein 8Q9NWZ8 (reviewed: Q9NWZ8)

Alternative names: Protein FAM51A1

All UniProt accessions (2): Q9NWZ8, H7BYE2

UniProt curated annotations — full annotation on UniProt →

Function. The SMN complex catalyzes the assembly of small nuclear ribonucleoproteins (snRNPs), the building blocks of the spliceosome, and thereby plays an important role in the splicing of cellular pre-mRNAs. Most spliceosomal snRNPs contain a common set of Sm proteins SNRPB, SNRPD1, SNRPD2, SNRPD3, SNRPE, SNRPF and SNRPG that assemble in a heptameric protein ring on the Sm site of the small nuclear RNA to form the core snRNP (Sm core). In the cytosol, the Sm proteins SNRPD1, SNRPD2, SNRPE, SNRPF and SNRPG are trapped in an inactive 6S pICln-Sm complex by the chaperone CLNS1A that controls the assembly of the core snRNP. To assemble core snRNPs, the SMN complex accepts the trapped 5Sm proteins from CLNS1A forming an intermediate. Binding of snRNA inside 5Sm triggers eviction of the SMN complex, thereby allowing binding of SNRPD3 and SNRPB to complete assembly of the core snRNP.

Subunit / interactions. Part of the core SMN complex that contains SMN1, GEMIN2/SIP1, DDX20/GEMIN3, GEMIN4, GEMIN5, GEMIN6, GEMIN7, GEMIN8 and STRAP/UNRIP. Part of the SMN-Sm complex that contains SMN1, GEMIN2/SIP1, DDX20/GEMIN3, GEMIN4, GEMIN5, GEMIN6, GEMIN7, GEMIN8, STRAP/UNRIP and the Sm proteins SNRPB, SNRPD1, SNRPD2, SNRPD3, SNRPE, SNRPF and SNRPG. Interacts with GEMIN6; the interaction is direct. Interacts with GEMIN7; the interaction is direct. Interacts with SMN1; the interaction is direct. Interacts with GEMIN4; the interaction is direct.

Subcellular location. Nucleus. Gem. Cytoplasm.

RefSeq proteins (3): NP_001035944, NP_001035945, NP_060326 (=MANE)

Domains & families (InterPro)

IDNameType
IPR034754GEMIN8Family

Pfam: PF15348

UniProt features (12 total): helix 3, modified residue 3, compositionally biased region 2, chain 1, region of interest 1, coiled-coil region 1, sequence variant 1

Structure

Experimental structures (PDB)

1 structures.

PDBMethodResolution (Å)
7BBLX-RAY DIFFRACTION1.52

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9NWZ8-F169.280.33

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (3): 117, 124, 126

Function

Pathways and Gene Ontology

Reactome pathways

2 pathways

IDPathway
R-HSA-191859snRNP Assembly
R-HSA-9754678SARS-CoV-2 modulates host translation machinery

MSigDB gene sets: 114 (showing top): GOBP_PROTEIN_RNA_COMPLEX_ORGANIZATION, GOBP_RNA_SPLICING, P300_01, GOBP_RIBONUCLEOPROTEIN_COMPLEX_BIOGENESIS, GOBP_SPLICEOSOMAL_SNRNP_ASSEMBLY, REACTOME_METABOLISM_OF_RNA, GOCC_SMN_COMPLEX, GOCC_GEMINI_OF_CAJAL_BODIES, GOCC_NUCLEAR_BODY, GAVIN_FOXP3_TARGETS_CLUSTER_P6, LEE_RECENT_THYMIC_EMIGRANT, GOCC_SMN_SM_PROTEIN_COMPLEX, chrXp22, PURBEY_TARGETS_OF_CTBP1_NOT_SATB1_UP, GOBP_MRNA_PROCESSING

GO Biological Process (3): spliceosomal snRNP assembly (GO:0000387), mRNA processing (GO:0006397), RNA splicing (GO:0008380)

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (7): nucleus (GO:0005634), nucleoplasm (GO:0005654), cytoplasm (GO:0005737), cytosol (GO:0005829), SMN complex (GO:0032797), SMN-Sm protein complex (GO:0034719), Gemini of Cajal bodies (GO:0097504)

Reactome top-level categories

Rollup of top-2 pathways:

CategoryPathways
Metabolism of non-coding RNA1
SARS-CoV-2-host interactions1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure3
RNA processing2
cytoplasm2
Sm-like protein family complex2
mRNA splicing, via spliceosome1
protein-RNA complex assembly1
mRNA metabolic process1
binding1
intracellular membrane-bounded organelle1
nuclear lumen1
intracellular anatomical structure1
SMN-Sm protein complex1
SMN complex1
nuclear body1

Protein interactions and networks

STRING

618 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GEMIN8GEMIN7Q9H840968
GEMIN8GEMIN2O14893947
GEMIN8GEMIN5Q8TEQ6894
GEMIN8GEMIN6Q8WXD5880
GEMIN8STRAPQ9Y3F4862
GEMIN8DDX20Q9UHI6815
GEMIN8GEMIN4P57678773
GEMIN8TCEANCQ8N8B7716
GEMIN8RPS4XP12631533
GEMIN8LSM10Q969L4513
GEMIN8OR4X2Q8NGF9506
GEMIN8SMN1Q16637497
GEMIN8EGFL6Q8IUX8493
GEMIN8SNRPD1P13641478
GEMIN8TOE1Q96GM8462
GEMIN8EIF1AXP47813462

IntAct

85 interactions, top by confidence:

ABTypeScore
SMN1GEMIN2psi-mi:“MI:0914”(association)0.960
SNRPFGEMIN2psi-mi:“MI:0914”(association)0.910
SNRPEGEMIN2psi-mi:“MI:0914”(association)0.770
SNRPD2GEMIN2psi-mi:“MI:0914”(association)0.710
SNRPBPRMT5psi-mi:“MI:0914”(association)0.670
SNRNP70GEMIN2psi-mi:“MI:0914”(association)0.640
NCKIPSDGEMIN2psi-mi:“MI:0914”(association)0.640
SMN1PRMT5psi-mi:“MI:0914”(association)0.600
ELAVL2GEMIN8psi-mi:“MI:0915”(physical association)0.560
GEMIN8psi-mi:“MI:0915”(physical association)0.560
PDLIM5GEMIN8psi-mi:“MI:0915”(physical association)0.560
DDX20GAPDHSpsi-mi:“MI:0914”(association)0.530
SNRPEPRMT5psi-mi:“MI:0914”(association)0.530
SNRPNPRMT5psi-mi:“MI:0914”(association)0.530
SNRPESNRPGP15psi-mi:“MI:0914”(association)0.530
SNRPFSNRPGP15psi-mi:“MI:0914”(association)0.530
GEMIN7GEMIN2psi-mi:“MI:0914”(association)0.530

BioGRID (117): GEMIN8 (Affinity Capture-MS), GEMIN8 (Affinity Capture-MS), GEMIN8 (Affinity Capture-MS), GEMIN8 (Affinity Capture-MS), GEMIN8 (Affinity Capture-MS), GEMIN8 (Affinity Capture-Western), GEMIN8 (Affinity Capture-Western), GEMIN8 (Affinity Capture-Western), GEMIN8 (Affinity Capture-Western), GEMIN8 (Affinity Capture-MS), STRAP (Affinity Capture-MS), GEMIN7 (Affinity Capture-MS), SNRNP70 (Affinity Capture-MS), GEMIN8 (Affinity Capture-MS), CPNE7 (Affinity Capture-MS)

ESM2 similar proteins: A0JNN6, A2A9F4, A7MB40, A8MQ03, B5X7E4, D2HS03, E2QSX5, O60927, O95817, O95886, P07733, P97837, P97838, P97839, Q08DN6, Q13522, Q14209, Q2KJ10, Q5RD34, Q5REU9, Q5RFW2, Q5SYE7, Q5TM51, Q5VUB5, Q6AW68, Q6AY88, Q6BCB4, Q6DJE5, Q6MFY6, Q6PEI3, Q6PFD5, Q75NY9, Q7YR30, Q8BG87, Q8BHE1, Q8BJ42, Q8CAF4, Q8CBR1, Q8K3Q3, Q8NAX2

Diamond homologs: Q1LZ79, Q8BHE1, Q9NWZ8, P34399

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 78 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Metabolism of non-coding RNA783.8×6e-11
snRNP Assembly1455.9×2e-19
SARS-CoV-2 modulates host translation machinery1250.7×3e-16
mRNA Splicing - Minor Pathway521.1×5e-05
mRNA Splicing1020.7×1e-09
Processing of Capped Intron-Containing Pre-mRNA1117.1×1e-09
mRNA Polyadenylation1016.6×9e-09
SARS-CoV-2-host interactions613.5×7e-05

GO biological processes:

GO termPartnersFoldFDR
spliceosomal snRNP assembly16134.8×3e-29
spliceosomal complex assembly652.3×2e-07
U2-type prespliceosome assembly545.2×8e-06
mRNA splicing, via spliceosome1418.6×5e-12
RNA splicing78.9×1e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

62 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance28
Likely benign7
Benign3

Top pathogenic / likely-pathogenic (0)

SpliceAI

774 predictions. Top by Δscore:

VariantEffectΔscore
X:14009165:CCGCC:Cacceptor_gain1.0000
X:14009166:CGCC:Cacceptor_gain1.0000
X:14009166:CGCCC:Cacceptor_gain1.0000
X:14009167:GCC:Gacceptor_gain1.0000
X:14009168:CC:Cacceptor_gain1.0000
X:14009168:CCC:Cacceptor_gain1.0000
X:14009168:CCCTG:Cacceptor_loss1.0000
X:14009169:CC:Cacceptor_gain1.0000
X:14009170:C:CCacceptor_gain1.0000
X:14009170:C:Tacceptor_gain1.0000
X:14009170:CT:Cacceptor_loss1.0000
X:14009177:C:CTacceptor_gain1.0000
X:14020071:AACTT:Adonor_loss1.0000
X:14020072:ACTT:Adonor_loss1.0000
X:14020073:CTTAC:Cdonor_loss1.0000
X:14020074:TTAC:Tdonor_loss1.0000
X:14020075:T:TGdonor_loss1.0000
X:14020076:A:ACdonor_gain1.0000
X:14020077:C:CCdonor_gain1.0000
X:14020077:CG:Cdonor_gain1.0000
X:14020077:CGT:Cdonor_gain1.0000
X:14020077:CGTCG:Cdonor_gain1.0000
X:14009178:A:Tacceptor_gain0.9900
X:14009181:C:CTacceptor_gain0.9900
X:14020072:A:Cdonor_gain0.9900
X:14020076:ACGT:Adonor_gain0.9900
X:14020077:CGTC:Cdonor_gain0.9900
X:14020532:TGC:Tacceptor_gain0.9900
X:14020533:GCC:Gacceptor_loss0.9900
X:14020535:C:CCacceptor_gain0.9900

AlphaMissense

0 scored. Top likely-pathogenic:

dbSNP variants (sampled 300 via entrez): RS1000008808 (X:14025345 A>C), RS1000024501 (X:14015445 G>C), RS1000077990 (X:13993702 G>A), RS1000088241 (X:14013512 T>C), RS1000196480 (X:14006939 G>A), RS1000521588 (X:14016024 T>C), RS1000669353 (X:14026280 C>G,T), RS1000710537 (X:13991644 A>G), RS1000763042 (X:13998512 A>G), RS1000982555 (X:13999715 G>A), RS1000999378 (X:14001535 C>T), RS1001039806 (X:14025637 G>A), RS1001053613 (X:13990177 T>C), RS1001082744 (X:13991060 T>C), RS1001139378 (X:14006219 G>C)

Disease associations

OMIM: gene MIM:300962 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

26 total (human), top 26 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Aaffects cotreatment, increases methylation, increases expression2
Cisplatinaffects expression, increases expression2
Phenylmercuric Acetateaffects cotreatment, decreases expression2
Valproic Acidincreases expression, decreases expression2
aristolochic acid Idecreases expression1
FR900359affects phosphorylation1
methylmercuric chloridedecreases expression1
sodium arsenitedecreases expression1
monomethylarsonous acidincreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamidedecreases expression, affects cotreatment1
dorsomorphinaffects cotreatment, decreases expression1
bisphenol Saffects cotreatment, increases expression1
jinfukangincreases expression1
Decitabineaffects expression1
Fulvestrantaffects cotreatment, increases methylation1
Caffeinedecreases phosphorylation1
Dexamethasoneaffects cotreatment, increases expression1
Doxorubicindecreases expression1
Indomethacinaffects cotreatment, increases expression1
Methyl Methanesulfonatedecreases expression1
Dihydrotestosteroneincreases expression1
1-Methyl-3-isobutylxanthineaffects cotreatment, increases expression1
Aflatoxin B1decreases methylation1
Antirheumatic Agentsincreases expression1
Cadmium Chloridedecreases expression1
p-Chloromercuribenzoic Acidaffects cotreatment, decreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.