SPATA31H1

gene
On this page

Also known as DKFZp434G118

Summary

SPATA31H1 (SPATA31 subfamily H member 1, HGNC:25275) is a protein-coding gene on chromosome 2p23.3, encoding Spermatogenesis-associated protein 31H1 (Q68DN1).

Located in extracellular exosome and nucleus.

Source: NCBI Gene 84226 — RefSeq curated summary.

At a glance

  • GWAS associations: 22
  • Clinical variants (ClinVar): 29 total
  • MANE Select transcript: NM_032266

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:25275
Approved symbolSPATA31H1
NameSPATA31 subfamily H member 1
Location2p23.3
Locus typegene with protein product
StatusApproved
AliasesDKFZp434G118
Ensembl geneENSG00000221843
Ensembl biotypeprotein_coding
Entrez84226

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000447166

RefSeq mRNA: 1 — MANE Select: NM_032266 NM_032266

CCDS: CCDS42666

Canonical transcript exons

ENST00000447166 — 5 exons

ExonStartEnd
ENSE000016190072756533727565415
ENSE000016469722756604427566098
ENSE000017000932756673527582722
ENSE000017089232753738627537570
ENSE000017662402756628927566384

Expression profiles

Bgee: expression breadth ubiquitous, 138 present calls, max score 91.04.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0217 / max 17.1412, expressed in 3 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
193770.01753
193780.00423

Top tissues by expression

242 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
cardiac muscle of right atriumUBERON:000337991.04gold quality
left ventricle myocardiumUBERON:000656690.66gold quality
spermCL:000001988.19gold quality
tibialis anteriorUBERON:000138584.88silver quality
right testisUBERON:000453483.50gold quality
left testisUBERON:000453382.95gold quality
testisUBERON:000047380.02gold quality
myocardiumUBERON:000234977.46gold quality
ileal mucosaUBERON:000033176.21silver quality
epithelial cell of pancreasCL:000008375.13gold quality
nasal cavity epitheliumUBERON:000538475.05gold quality
kidney epitheliumUBERON:000481972.42gold quality
deltoidUBERON:000147670.14gold quality
upper arm skinUBERON:000426369.21gold quality
cortical plateUBERON:000534368.27gold quality
stromal cell of endometriumCL:000225565.40gold quality
quadriceps femorisUBERON:000137765.07gold quality
skeletal muscle tissue of biceps brachiiUBERON:000450264.42gold quality
vastus lateralisUBERON:000137963.28gold quality
muscle tissueUBERON:000238560.81gold quality
germinal epithelium of ovaryUBERON:000130460.79gold quality
ganglionic eminenceUBERON:000402360.67gold quality
biceps brachiiUBERON:000150760.63gold quality
islet of LangerhansUBERON:000000659.77gold quality
ventricular zoneUBERON:000305359.74silver quality
jejunal mucosaUBERON:000039959.08gold quality
colonic epitheliumUBERON:000039758.50gold quality
right lobe of liverUBERON:000111458.46gold quality
bone marrow cellCL:000209257.96gold quality
skeletal muscle tissueUBERON:000113457.77gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.33
E-MTAB-6058no46.23

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

16 targeting SPATA31H1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-10523-5P99.9169.222038
HSA-MIR-1212399.5271.792990
HSA-MIR-671-5P99.5267.111277
HSA-MIR-6504-3P99.1769.312891
HSA-MIR-445198.8268.171455
HSA-MIR-4436B-3P98.2565.261494
HSA-MIR-6735-5P98.2465.361488
HSA-MIR-7843-5P98.1265.261421
HSA-MIR-444398.0266.251928
HSA-MIR-4632-5P97.8265.381470
HSA-MIR-6879-5P97.7765.521521
HSA-MIR-191397.0766.201417
HSA-MIR-311697.0765.781324
HSA-MIR-1237-5P95.3862.21451
HSA-MIR-448895.3862.00443
HSA-MIR-4697-5P95.3861.72457

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
mus_musculusSpata31h1ENSMUSG00000044581
mus_musculusSpata31h-ps1ENSMUSG00000118586
rattus_norvegicusAABR07063276.1ENSRNOG00000049398

Protein

Protein identifiers

Spermatogenesis-associated protein 31H1Q68DN1 (reviewed: Q68DN1)

All UniProt accessions (1): C9JG08

UniProt curated annotations — full annotation on UniProt →

Tissue specificity. Expressed in sperm (at protein level).

Domain organisation. The P-S-E-R-S-H-H-S repeats give rise to an antiparallel beta-structure.

RefSeq proteins (1): NP_115642* (*=MANE)

Domains & families (InterPro)

UniProt features (67 total): repeat 27, compositionally biased region 20, sequence variant 11, region of interest 6, sequence conflict 2, chain 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q68DN1-F134.940.00

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 15 (showing top): CAGCTG_AP4_Q5, AACTTT_UNKNOWN, POU3F2_02, TGGAAA_NFAT_Q4_01, YATGNWAAT_OCT_C, EVI1_02, MIR12123, MIR4443, chr2p23, GSE15659_TREG_VS_TCONV_UP, COMP1_01, OCT1_05, GATAAGR_GATA_C, GSE21546_UNSTIM_VS_ANTI_CD3_STIM_SAP1A_KO_AND_ELK1_KO_DP_THYMOCYTES_DN, OCT1_01

GO Biological Process (0):

GO Molecular Function (0):

GO Cellular Component (2): nucleus (GO:0005634), extracellular exosome (GO:0070062)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
intracellular membrane-bounded organelle1
extracellular vesicle1

Protein interactions and networks

STRING

1762 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SPATA31H1CCDC121Q6ZUS5768
SPATA31H1ZNF512Q96ME7766
SPATA31H1GPN1Q9HCN4605
SPATA31H1FNDC4Q9H6D8570
SPATA31H1SLC4A1APQ9BWU0543
SPATA31H1GCKRQ14397540
SPATA31H1NRBP1Q9UHY1448
SPATA31H1KRTCAP3Q53RY4447
SPATA31H1BUD13Q9BRD0443
SPATA31H1SUPT7LO94864399
SPATA31H1COBLL1Q53SF7382
SPATA31H1IFT172Q9UG01372
SPATA31H1GTF3C2Q8WUA4370
SPATA31H1TRIB1Q96RU8366
SPATA31H1APOA5Q6Q788343

IntAct

2 interactions, top by confidence:

ABTypeScore
MYCpsi-mi:“MI:0914”(association)0.350

BioGRID (4): C2orf16 (Affinity Capture-MS), C2orf16 (Affinity Capture-RNA), C2orf16 (Affinity Capture-MS), C2orf16 (Cross-Linking-MS (XL-MS))

ESM2 similar proteins: A0A1B0GTH6, A0A1B0GUW6, A0A1D5RMD1, A2AQH4, A4FU49, A6NJ88, C4P6S0, D3YU32, E9PAV3, F1QU13, I3L273, J3KML8, P70670, Q32L62, Q3V0E1, Q3V3Q4, Q4R729, Q5H9F3, Q5QJ38, Q5SWP3, Q5U4C1, Q5VWK0, Q5VYM1, Q68DN1, Q6AZ54, Q7TSG5, Q810T2, Q8CH19, Q8K4E0, Q8N3K9, Q8N5Q1, Q8NDH2, Q8TCU4, Q8WNU4, Q8WWL7, Q920R4, Q921B4, Q923B3, Q96JA4, Q96M34

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

29 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance19
Likely benign10
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

924 predictions. Top by Δscore:

VariantEffectΔscore
2:27537568:GAG:Gdonor_gain1.0000
2:27537570:GGTA:Gdonor_loss1.0000
2:27566042:A:AGacceptor_gain1.0000
2:27566043:G:Cacceptor_loss1.0000
2:27566043:G:GAacceptor_gain1.0000
2:27566096:ATGG:Adonor_loss1.0000
2:27566097:TG:Tdonor_gain1.0000
2:27566098:GG:Gdonor_gain1.0000
2:27566098:GGT:Gdonor_loss1.0000
2:27566099:G:GCdonor_loss1.0000
2:27566099:G:GGdonor_gain1.0000
2:27566100:T:Gdonor_loss1.0000
2:27537571:G:GGdonor_gain0.9900
2:27537572:T:Adonor_loss0.9900
2:27565967:A:AGacceptor_gain0.9900
2:27565968:A:Gacceptor_gain0.9900
2:27566043:GA:Gacceptor_gain0.9900
2:27566043:GATCC:Gacceptor_gain0.9900
2:27566094:ACATG:Adonor_gain0.9900
2:27566095:CATG:Cdonor_gain0.9900
2:27566096:ATG:Adonor_gain0.9900
2:27566101:GAG:Gdonor_loss0.9900
2:27566733:A:AGacceptor_gain0.9900
2:27566734:G:GAacceptor_gain0.9900
2:27565335:A:AGacceptor_gain0.9800
2:27565336:G:GGacceptor_gain0.9800
2:27566043:GAT:Gacceptor_gain0.9800
2:27566043:GATC:Gacceptor_gain0.9800
2:27566237:A:AGacceptor_gain0.9800
2:27566237:ACT:Aacceptor_gain0.9800

AlphaMissense

35147 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:27580074:T:CF1168L0.976
2:27580076:T:AF1168L0.976
2:27580076:T:GF1168L0.976
2:27580075:T:CF1168S0.959
2:27579357:T:CF929L0.927
2:27579359:T:AF929L0.927
2:27579359:T:GF929L0.927
2:27579838:T:CL1089P0.923
2:27579504:T:AW978R0.919
2:27579504:T:CW978R0.919
2:27580617:T:CF1349L0.910
2:27580619:C:AF1349L0.910
2:27580619:C:GF1349L0.910
2:27579506:G:CW978C0.896
2:27579506:G:TW978C0.896
2:27579862:T:CL1097P0.886
2:27580338:T:CF1256L0.883
2:27580340:C:AF1256L0.883
2:27580340:C:GF1256L0.883
2:27579935:C:GC1121W0.875
2:27580003:T:CL1144P0.871
2:27580009:T:CI1146T0.868
2:27580009:T:AI1146N0.863
2:27580009:T:GI1146S0.863
2:27579865:T:CL1098S0.862
2:27578325:T:CF585L0.854
2:27578327:T:AF585L0.854
2:27578327:T:GF585L0.854
2:27579942:T:CC1124R0.852
2:27579919:T:CL1116P0.849

dbSNP variants (sampled 300 via entrez): RS1000011423 (2:27556023 T>A,C), RS1000096737 (2:27564460 C>G), RS1000156723 (2:27555960 C>A,T), RS1000197723 (2:27541519 G>C,T), RS1000257556 (2:27563342 A>G,T), RS1000328310 (2:27571483 G>A), RS1000330767 (2:27562930 T>G), RS1000435539 (2:27580063 T>C,G), RS1000561360 (2:27541327 T>C), RS1000576604 (2:27548113 C>T), RS1000614979 (2:27554097 G>A), RS1000733246 (2:27547658 G>A,T), RS1000803112 (2:27542621 T>C), RS1000906765 (2:27550002 GATTTTT>G), RS1000952218 (2:27576659 G>T)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

22 associations (top):

StudyTraitp-value
GCST000769_3Calcium levels7.000000e-06
GCST001006_6Waist Circumference - Triglycerides (WC-TG)2.000000e-09
GCST001277_11Liver enzyme levels (gamma-glutamyl transferase)4.000000e-13
GCST001841_15Palmitoleic acid (16:1n-7) levels1.000000e-09
GCST001905_4Hypertriglyceridemia2.000000e-13
GCST003264_205Post bronchodilator FEV1/FVC ratio4.000000e-06
GCST003858_1Oral cavity cancer4.000000e-08
GCST005308_1Nonalcoholic fatty liver disease2.000000e-08
GCST005985_4Creatinine levels2.000000e-21
GCST005987_24Albumin-globulin ratio4.000000e-10
GCST005999_3Aspartate aminotransferase levels1.000000e-12
GCST006014_3Creatine kinase levels3.000000e-08
GCST006017_2Prothrombin time3.000000e-10
GCST006231_16Mean arterial pressure9.000000e-06
GCST007437_11Triglyceride levels7.000000e-06
GCST007858_2Fasting blood glucose adjusted for BMI4.000000e-09
GCST008103_38Bipolar disorder1.000000e-07
GCST008985_2Triglycerides7.000000e-12
GCST010276_7Renal underexcretion gout5.000000e-07
GCST010660_10Triglyceride levels1.000000e-25
GCST011351_3Aspartate aminotransferase levels2.000000e-11
GCST90016673_8Percent liver fat2.000000e-08

EFO canonical traits (11, from GWAS)

EFO IDTrait name
EFO:0004838calcium measurement
EFO:0000195metabolic syndrome
EFO:0004530triglyceride measurement
EFO:0004532serum gamma-glutamyl transferase measurement
EFO:0004713FEV/FVC ratio
EFO:0005128albumin:globulin ratio measurement
EFO:0004736aspartate aminotransferase measurement
EFO:0004534creatine kinase measurement
EFO:0008390prothrombin time measurement
EFO:0006340mean arterial pressure
EFO:0010821liver fat measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

10 total (human), top 10 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyrenedecreases expression, affects methylation2
fluorene-9-bisphenolincreases expression1
tris(2-butoxyethyl) phosphateaffects expression1
licochalcone Bdecreases expression1
Cannabidiolincreases expression1
Endosulfanincreases expression1
Tobacco Smoke Pollutiondecreases expression1
Urethanedecreases expression1
Aflatoxin B1increases methylation1
Okadaic Aciddecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.