SPATA31G1

gene
On this page

Also known as MGC41945

Summary

SPATA31G1 (SPATA31 subfamily G member 1, HGNC:31418) is a protein-coding gene on chromosome 9p13.3, encoding Spermatogenesis-associated protein 31G1 (Q5VYM1). Dispensable for normal development and fertility.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 13 total
  • MANE Select transcript: NM_203299

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:31418
Approved symbolSPATA31G1
NameSPATA31 subfamily G member 1
Location9p13.3
Locus typegene with protein product
StatusApproved
AliasesMGC41945
Ensembl geneENSG00000174038
Ensembl biotypeprotein_coding
Entrez138724

Gene structure

Transcript identifiers

Ensembl transcripts: 7 — 6 protein_coding, 1 nonsense_mediated_decay

ENST00000312292, ENST00000354479, ENST00000378745, ENST00000416537, ENST00000421362, ENST00000534880, ENST00000537671

RefSeq mRNA: 5 — MANE Select: NM_203299 NM_001040410, NM_001040411, NM_001040412, NM_001287391, NM_203299

CCDS: CCDS47961, CCDS47962, CCDS6572

Canonical transcript exons

ENST00000312292 — 2 exons

ExonStartEnd
ENSE000022197873504224035042485
ENSE000022418033504286135045986

Expression profiles

Bgee: expression breadth ubiquitous, 153 present calls, max score 91.20.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0746 / max 69.4671, expressed in 3 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
965590.07463

Top tissues by expression

237 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
kidney epitheliumUBERON:000481991.20gold quality
cardiac muscle of right atriumUBERON:000337989.81gold quality
left ventricle myocardiumUBERON:000656689.19gold quality
left testisUBERON:000453387.99gold quality
right testisUBERON:000453486.98gold quality
testisUBERON:000047385.63gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047385.26gold quality
upper arm skinUBERON:000426383.64gold quality
vena cavaUBERON:000408782.73gold quality
cerebellar vermisUBERON:000472081.75gold quality
myocardiumUBERON:000234980.52gold quality
spermCL:000001980.03silver quality
nasal cavity epitheliumUBERON:000538479.02gold quality
epithelial cell of pancreasCL:000008377.10gold quality
cardia of stomachUBERON:000116276.88silver quality
adult organismUBERON:000702375.39gold quality
sural nerveUBERON:001548875.14gold quality
vastus lateralisUBERON:000137975.01gold quality
body of tongueUBERON:001187674.96gold quality
bone marrow cellCL:000209274.95gold quality
quadriceps femorisUBERON:000137774.88gold quality
skeletal muscle tissue of biceps brachiiUBERON:000450274.48gold quality
pharyngeal mucosaUBERON:000035573.90gold quality
lateral nuclear group of thalamusUBERON:000273673.81gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451173.22gold quality
heart right ventricleUBERON:000208073.20gold quality
substantia nigra pars compactaUBERON:000196572.30gold quality
substantia nigra pars reticulataUBERON:000196672.30gold quality
tracheaUBERON:000312672.16gold quality
pylorusUBERON:000116671.87silver quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no2.30

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

1 targeting SPATA31G1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-1139998.7165.69869

Literature-anchored findings (GeneRIF, showing 1)

  • C9orf131 and C10orf120 are not essential for male fertility in humans or mice. (PMID:36871790)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusSpata31g1ENSMUSG00000028451
rattus_norvegicusSpata31g1ENSRNOG00000022601

Protein

Protein identifiers

Spermatogenesis-associated protein 31G1Q5VYM1 (reviewed: Q5VYM1)

All UniProt accessions (5): Q5VYM1, A0A1D5RMQ0, F5H224, F5H4E3, F8W7H5

UniProt curated annotations — full annotation on UniProt →

Function. Dispensable for normal development and fertility.

Isoforms (3)

UniProt IDNamesCanonical?
Q5VYM1-11yes
Q5VYM1-22, D
Q5VYM1-33

RefSeq proteins (5): NP_001035500, NP_001035501, NP_001035502, NP_001274320, NP_976044* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR026677Spata31g1-likeFamily

UniProt features (25 total): region of interest 8, compositionally biased region 7, sequence variant 6, splice variant 2, chain 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q5VYM1-F139.710.00

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 11 (showing top): TGACCTY_ERR1_Q2, chr9p13, AR_03, SUPT16H_TARGET_GENES, BANG_VERTEPORFIN_ENDOMETRIAL_CANCER_CELLS_DN, GSE29614_CTRL_VS_TIV_FLU_VACCINE_PBMC_2007_UP, PR_02, GR_Q6, ZHOU_INFLAMMATORY_RESPONSE_FIMA_UP, GSE25502_WT_VS_KLF13_KO_THYMIC_MEMORY_LIKE_CD8_TCELL_UP, GR_01

GO Biological Process (0):

GO Molecular Function (0):

GO Cellular Component (0):

Protein interactions and networks

STRING

804 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SPATA31G1REDIC1Q86WS4667
SPATA31G1FAM209AQ5JX71647
SPATA31G1C9orf43Q8TAL5646
SPATA31G1TULP2O00295627
SPATA31G1C20orf173Q96LM9608
SPATA31G1SPATA16Q9BXB7598
SPATA31G1MAGEB16A2A368595
SPATA31G1TMCO5AQ8N6Q1584
SPATA31G1ZFAND4Q86XD8561
SPATA31G1PKDREJQ9NTG1534
SPATA31G1PLCZ1Q86YW0513
SPATA31G1CABS1Q96KC9507
SPATA31G1GARIN2Q8N9W8480
SPATA31G1CXorf65A6NEN9479
SPATA31G1CCDC187A0A096LP49476
SPATA31G1CATSPER1Q8NEC5476

IntAct

3 interactions, top by confidence:

ABTypeScore
rpoHSPATA31G1psi-mi:“MI:0915”(physical association)0.000
SPATA31G1thiD2psi-mi:“MI:0915”(physical association)0.000

BioGRID (4): C9orf131 (Affinity Capture-MS), C9orf131 (Affinity Capture-MS), C9orf131 (Affinity Capture-MS), C9orf131 (Affinity Capture-MS)

ESM2 similar proteins: A0A1B0GTH6, A0A1B0GUW6, A0A1D5RMD1, A2AQH4, A4FU49, A6NJ88, C4P6S0, D3YU32, E9PAV3, F1QU13, I3L273, J3KML8, P70670, Q32L62, Q3V0E1, Q3V3Q4, Q4R729, Q5H9F3, Q5QJ38, Q5SWP3, Q5U4C1, Q5VWK0, Q5VYM1, Q68DN1, Q6AZ54, Q7TSG5, Q810T2, Q8CH19, Q8K4E0, Q8N3K9, Q8N5Q1, Q8NDH2, Q8TCU4, Q8WNU4, Q8WWL7, Q920R4, Q921B4, Q923B3, Q96JA4, Q96M34

Diamond homologs: Q3V0E1, Q5VYM1

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

13 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance12
Likely benign1
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

344 predictions. Top by Δscore:

VariantEffectΔscore
9:35042384:A:AGdonor_gain0.9500
9:35042482:C:Tdonor_gain0.9400
9:35042855:TCCCA:Tacceptor_loss0.9300
9:35042856:CCCA:Cacceptor_loss0.9300
9:35042857:CCAG:Cacceptor_loss0.9300
9:35042858:CAGGA:Cacceptor_loss0.9300
9:35042859:A:AGacceptor_gain0.9300
9:35042860:G:GGacceptor_gain0.9300
9:35042374:A:Tdonor_gain0.9200
9:35042373:G:GTdonor_gain0.9100
9:35042245:GC:Gdonor_gain0.9000
9:35042859:AG:Aacceptor_gain0.8900
9:35042860:GG:Gacceptor_gain0.8900
9:35042791:AGTG:Adonor_gain0.8800
9:35042860:GGAA:Gacceptor_gain0.8700
9:35042285:GC:Gdonor_gain0.8600
9:35042291:GGGGA:Gdonor_gain0.8600
9:35042385:C:Gdonor_gain0.8600
9:35042284:GGC:Gdonor_gain0.8500
9:35042475:TGC:Tdonor_gain0.8400
9:35042283:GGGC:Gdonor_gain0.8300
9:35042860:GGA:Gacceptor_gain0.8100
9:35042860:GGAAC:Gacceptor_gain0.8100
9:35042252:G:Tdonor_gain0.8000
9:35042804:AAAC:Adonor_loss0.8000
9:35042805:AACC:Adonor_loss0.8000
9:35042806:ACCTT:Adonor_loss0.8000
9:35042807:C:Gdonor_loss0.8000
9:35042544:ATTG:Adonor_gain0.7900
9:35042803:CAAA:Cdonor_loss0.7900

AlphaMissense

6979 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
9:35042366:A:CS38R0.930
9:35042368:T:AS38R0.930
9:35042368:T:GS38R0.930
9:35042396:T:CF48L0.914
9:35042398:C:AF48L0.914
9:35042398:C:GF48L0.914
9:35042888:T:CF87L0.896
9:35042890:C:AF87L0.896
9:35042890:C:GF87L0.896
9:35043215:T:CF196L0.889
9:35043217:C:AF196L0.889
9:35043217:C:GF196L0.889
9:35044976:T:AW783R0.884
9:35044976:T:CW783R0.884
9:35043137:T:CF170L0.870
9:35043139:C:AF170L0.870
9:35043139:C:GF170L0.870
9:35043128:T:CF167L0.867
9:35043130:C:AF167L0.867
9:35043130:C:GF167L0.867
9:35044978:G:CW783C0.854
9:35044978:G:TW783C0.854
9:35042405:T:AW51R0.848
9:35042405:T:CW51R0.848
9:35042409:A:CQ52P0.835
9:35042359:C:GC35W0.834
9:35042357:T:CC35R0.831
9:35044989:T:CL787P0.830
9:35042345:T:AC31S0.819
9:35042346:G:CC31S0.819

dbSNP variants (sampled 300 via entrez): RS1000865706 (9:35046385 C>T), RS1002776281 (9:35042002 T>C,G), RS1004398847 (9:35040828 T>C), RS1004451330 (9:35040501 A>C), RS1004710569 (9:35046031 A>G), RS1005885581 (9:35042014 G>A), RS1007878836 (9:35045406 T>C), RS1008745446 (9:35040091 C>G,T), RS1008820092 (9:35041551 G>A,C), RS1008939970 (9:35040028 A>C,G,T), RS1009117345 (9:35040331 T>C), RS1009531273 (9:35046172 T>C), RS1010562041 (9:35043415 T>A), RS1010891569 (9:35046315 C>A), RS1011170199 (9:35039944 T>A)

Disease associations

OMIM: gene `` | disease phenotypes: MIM:167320, MIM:605726, MIM:613954, MIM:614373

GenCC curated gene-disease

Mondo (4): inclusion body myopathy with Paget disease of bone and frontotemporal dementia (MONDO:0000507), autosomal recessive distal spinal muscular atrophy 2 (MONDO:0011585), frontotemporal dementia and/or amyotrophic lateral sclerosis 6 (MONDO:0013501), amyotrophic lateral sclerosis type 16 (MONDO:0013715)

Orphanet (5): Distal hereditary motor neuropathy, Jerash type (Orphanet:139552), Frontotemporal dementia with motor neuron disease (Orphanet:275872), Juvenile amyotrophic lateral sclerosis (Orphanet:300605), Inclusion body myopathy with Paget disease of bone and frontotemporal dementia (Orphanet:52430), Amyotrophic lateral sclerosis (Orphanet:803)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST006295_2Response to quetiapine in schizophrenia8.000000e-07
GCST90002393_374Monocyte count6.000000e-11

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0005091monocyte count

MeSH disease descriptors (1)

DescriptorNameTree numbers
C535715Spinal muscular atrophy, Jerash type (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

8 total (human), top 8 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneincreases mutagenesis, affects methylation2
aristolochic acid Iincreases expression1
potassium chromate(VI)affects cotreatment, increases expression1
epigallocatechin gallateaffects cotreatment, increases expression1
jinfukangdecreases expression1
Folic Aciddecreases expression1
Tobacco Smoke Pollutionaffects expression1
Aflatoxin B1increases methylation1

Clinical trials (associated diseases)

3 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT01353430Not specifiedRECRUITINGCharacterization of Inclusion Body Myopathy Associated With Paget’s Disease of Bone and Frontotemporal Dementia (IBMPFD)
NCT01793168Not specifiedRECRUITINGRare Disease Patient Registry & Natural History Study - Coordination of Rare Diseases at Sanford
NCT04823143Not specifiedCOMPLETEDNatural History Study of Patients With VCP-related Disease