SPATA20

gene
On this page

Also known as FLJ21347SSP411Tisp78

Summary

SPATA20 (spermatogenesis associated 20, HGNC:26125) is a protein-coding gene on chromosome 17q21.33, encoding Spermatogenesis-associated protein 20 (Q8TB22). May play a role in fertility regulation.

Predicted to be involved in carbohydrate metabolic process; cell differentiation; and spermatogenesis. Located in mitochondrion.

Source: NCBI Gene 64847 — RefSeq curated summary.

At a glance

  • GWAS associations: 4
  • Clinical variants (ClinVar): 161 total
  • MANE Select transcript: NM_022827

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26125
Approved symbolSPATA20
Namespermatogenesis associated 20
Location17q21.33
Locus typegene with protein product
StatusApproved
AliasesFLJ21347, SSP411, Tisp78
Ensembl geneENSG00000006282
Ensembl biotypeprotein_coding
OMIM613939
Entrez64847

Gene structure

Transcript identifiers

Ensembl transcripts: 44 — 20 protein_coding, 14 retained_intron, 10 nonsense_mediated_decay

ENST00000006658, ENST00000356488, ENST00000502911, ENST00000503063, ENST00000503127, ENST00000504265, ENST00000504271, ENST00000504334, ENST00000505085, ENST00000505336, ENST00000505456, ENST00000505559, ENST00000505656, ENST00000508528, ENST00000508598, ENST00000510917, ENST00000511347, ENST00000511605, ENST00000511845, ENST00000511937, ENST00000512181, ENST00000512416, ENST00000513618, ENST00000515526, ENST00000515619, ENST00000634597, ENST00000635113, ENST00000860309, ENST00000860310, ENST00000860311, ENST00000860312, ENST00000860313, ENST00000860314, ENST00000860315, ENST00000860316, ENST00000860317, ENST00000860318, ENST00000860319, ENST00000860320, ENST00000912606, ENST00000953007, ENST00000953008, ENST00000953009, ENST00000953010

RefSeq mRNA: 3 — MANE Select: NM_022827 NM_001258372, NM_001258373, NM_022827

CCDS: CCDS11571, CCDS58563

Canonical transcript exons

ENST00000006658 — 17 exons

ExonStartEnd
ENSE000020836435054717450547285
ENSE000034586215055053650550610
ENSE000034658535054828350548453
ENSE000034751535054904350549186
ENSE000034964895054855450548618
ENSE000035166635055070850550917
ENSE000035334795054772050547767
ENSE000035385695055425150554450
ENSE000035488695054881050548964
ENSE000035740465055099850551190
ENSE000035957005055549250555852
ENSE000035971855055020850550312
ENSE000035987175055196950552180
ENSE000036401185054928650549487
ENSE000036634255054998550550115
ENSE000036657525055523250555312
ENSE000036728335055151150551679

Expression profiles

Bgee: expression breadth ubiquitous, 279 present calls, max score 98.42.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 40.5836 / max 228.2490, expressed in 1789 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
16168340.39251789
1616840.12205
1616860.05364
1616850.01553

Top tissues by expression

291 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
left testisUBERON:000453398.42gold quality
right testisUBERON:000453498.35gold quality
apex of heartUBERON:000209897.11gold quality
left adrenal gland cortexUBERON:003582596.87gold quality
right lobe of thyroid glandUBERON:000111996.80gold quality
adrenal cortexUBERON:000123596.76gold quality
left lobe of thyroid glandUBERON:000112096.73gold quality
right adrenal glandUBERON:000123396.72gold quality
left adrenal glandUBERON:000123496.68gold quality
right adrenal gland cortexUBERON:003582796.52gold quality
thyroid glandUBERON:000204696.48gold quality
right hemisphere of cerebellumUBERON:001489096.26gold quality
testisUBERON:000047396.03gold quality
metanephros cortexUBERON:001053395.96gold quality
left ovaryUBERON:000211995.67gold quality
cerebellar hemisphereUBERON:000224595.66gold quality
cerebellar cortexUBERON:000212995.63gold quality
adrenal glandUBERON:000236995.53gold quality
right ovaryUBERON:000211895.47gold quality
male germ cellCL:000001595.31gold quality
body of stomachUBERON:000116195.13gold quality
spermCL:000001995.11gold quality
cerebellumUBERON:000203794.83gold quality
endocervixUBERON:000045894.71gold quality
cardia of stomachUBERON:000116294.58gold quality
transverse colonUBERON:000115794.43gold quality
lateral nuclear group of thalamusUBERON:000273694.29gold quality
mucosa of transverse colonUBERON:000499194.24gold quality
fundus of stomachUBERON:000116094.16gold quality
body of pancreasUBERON:000115094.10gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

5 targeting SPATA20, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3934-3P99.7665.511351
HSA-MIR-120899.7068.281533
HSA-MIR-328-5P99.0864.651000
HSA-MIR-6885-5P98.7164.33902
HSA-MIR-6765-3P97.8364.591165

Literature-anchored findings (GeneRIF, showing 2)

  • SSP411 has potential as a biomarker for the diagnosis of Cholangiocarcinoma (PMID:23118872)
  • Identification of nonfunctional SPATA20 causing acephalic spermatozoa syndrome in humans. (PMID:36415156)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriospata20ENSDARG00000013880
mus_musculusSpata20ENSMUSG00000020867
rattus_norvegicusSpata20ENSRNOG00000003273
drosophila_melanogasterCG8613FBGN0033924
caenorhabditis_elegansWBGENE00015204

Protein

Protein identifiers

Spermatogenesis-associated protein 20Q8TB22 (reviewed: Q8TB22)

Alternative names: Sperm-specific protein 411

All UniProt accessions (7): Q8TB22, A0A0U1RRL8, D6R947, D6RC70, D6RIU6, H0Y9M1, H0Y9W3

UniProt curated annotations — full annotation on UniProt →

Function. May play a role in fertility regulation.

Subcellular location. Secreted.

Miscellaneous. May be produced at very low levels due to a premature stop codon in the mRNA, leading to nonsense-mediated mRNA decay.

Isoforms (4)

UniProt IDNamesCanonical?
Q8TB22-11yes
Q8TB22-22
Q8TB22-33
Q8TB22-44

RefSeq proteins (3): NP_001245301, NP_001245302, NP_073738* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR004879Ssp411-like_TRXDomain
IPR0089286-hairpin_glycosidase_sfHomologous_superfamily
IPR0123416hp_glycosidase-like_sfHomologous_superfamily
IPR024705Ssp411Family
IPR036249Thioredoxin-like_sfHomologous_superfamily

Pfam: PF03190

UniProt features (14 total): splice variant 5, sequence variant 3, sequence conflict 2, signal peptide 1, chain 1, region of interest 1, modified residue 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8TB22-F190.380.81

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (1): 649

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 71 (showing top): FXR_IR1_Q6, TGACCTY_ERR1_Q2, GOBP_MALE_GAMETE_GENERATION, YGACNNYACAR_UNKNOWN, ONKEN_UVEAL_MELANOMA_UP, ATF1_Q6, MODULE_301, GOBP_CARBOHYDRATE_METABOLIC_PROCESS, BASAKI_YBX1_TARGETS_DN, GOBP_DEVELOPMENTAL_PROCESS_INVOLVED_IN_REPRODUCTION, NIKOLSKY_BREAST_CANCER_17Q21_Q25_AMPLICON, DR3_Q4, GSE13762_CTRL_VS_125_VITAMIND_DAY12_DC_DN, MARTENS_TRETINOIN_RESPONSE_UP, VDR_Q6

GO Biological Process (3): carbohydrate metabolic process (GO:0005975), spermatogenesis (GO:0007283), cell differentiation (GO:0030154)

GO Molecular Function (0):

GO Cellular Component (2): extracellular region (GO:0005576), mitochondrion (GO:0005739)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
primary metabolic process1
developmental process involved in reproduction1
male gamete generation1
cellular developmental process1
cellular anatomical structure1
cytoplasm1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

704 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SPATA20SPATA6Q9NWH7595
SPATA20SPATA4Q8NEY3532
SPATA20SPATA46Q5T0L3527
SPATA20PUSL1Q8N0Z8496
SPATA20SPATA17Q96L03489
SPATA20SPATA3Q8NHX4458
SPATA20SPATA22Q8NHS9455
SPATA20SPATA25Q9BR10451
SPATA20CROCC2H7BZ55443
SPATA20SPATA7Q9P0W8432
SPATA20CFAP184Q2M329416
SPATA20HEATR9A2RTY3410
SPATA20IQCNQ9H0B3403
SPATA20COASYQ13057395
SPATA20AFG2AQ8NB90378

IntAct

42 interactions, top by confidence:

ABTypeScore
TADA3TADA2Apsi-mi:“MI:0914”(association)0.740
ETFRF1NDUFAB1psi-mi:“MI:0914”(association)0.640
COX5BCOX7A2Lpsi-mi:“MI:0914”(association)0.530
NIPSNAP3ACLUHpsi-mi:“MI:0914”(association)0.530
COLGALT2COL1A1psi-mi:“MI:0914”(association)0.530
CCL5C4Apsi-mi:“MI:0914”(association)0.530
ODF1TCP1psi-mi:“MI:0914”(association)0.530
GPSM3ATE1psi-mi:“MI:0914”(association)0.530
LYG2TRAF2psi-mi:“MI:0914”(association)0.530
UQCRFS1NDUFAB1psi-mi:“MI:0914”(association)0.530
SPATA20MYBPC3psi-mi:“MI:0915”(physical association)0.370
CORO1ASPATA20psi-mi:“MI:0915”(physical association)0.370
SPATA20ST6GALNAC4psi-mi:“MI:0915”(physical association)0.370
SPATA20PMPCBpsi-mi:“MI:0914”(association)0.350
TRUB2NME6psi-mi:“MI:0914”(association)0.350
TRUB2SPATA20psi-mi:“MI:0914”(association)0.350
SPATA20NDUFAB1psi-mi:“MI:0914”(association)0.350
MICOS13MTX2psi-mi:“MI:0914”(association)0.350
NDUFV3NDUFS8psi-mi:“MI:0914”(association)0.350
NDUFA10AURKApsi-mi:“MI:0914”(association)0.350
LYG2PLOD3psi-mi:“MI:0914”(association)0.350
HS1BP3TAF5Lpsi-mi:“MI:0914”(association)0.350
RAB20BCL10psi-mi:“MI:0914”(association)0.350

BioGRID (76): SPATA20 (Affinity Capture-MS), SPATA20 (Affinity Capture-MS), SPATA20 (Affinity Capture-MS), AGL (Co-fractionation), SPATA20 (Affinity Capture-MS), SPATA20 (Affinity Capture-MS), NUDT8 (Affinity Capture-MS), LYRM7 (Affinity Capture-MS), GLS (Affinity Capture-MS), PMPCA (Affinity Capture-MS), PMPCB (Affinity Capture-MS), CHCHD2 (Affinity Capture-MS), POLDIP2 (Affinity Capture-MS), NDUFA10 (Affinity Capture-MS), ECH1 (Affinity Capture-MS)

ESM2 similar proteins: A0A2B7YDW3, A0A2Z4HPY4, A0A455LN86, A0A455LRW3, A0L2D7, A1CVK0, A1RE94, A1VPR5, A5EBX1, A8ZNR6, A9AWD5, B0U5U4, B0Y565, B0Y5B4, B2FJE7, B2I916, B3Y522, B4SMR2, B6K412, E4V6I8, M2XHU6, O48929, O49187, O50406, O87455, O94265, P05990, P55610, P9WEH1, Q00706, Q07YM7, Q09214, Q0AEJ6, Q0AHQ0, Q0HPW7, Q255B1, Q3AQA7, Q4H4F8, Q4WR16, Q55107

Diamond homologs: P37509, P37512, Q09214, Q6T393, Q80YT5, Q8TB22

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 52 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Respiratory electron transport819.0×2e-06
Aerobic respiration and respiratory electron transport511.1×5e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

161 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance138
Likely benign6
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

3235 predictions. Top by Δscore:

VariantEffectΔscore
17:50548546:G:Aacceptor_gain1.0000
17:50548549:TCTA:Tacceptor_loss1.0000
17:50548550:CTAG:Cacceptor_loss1.0000
17:50548551:TA:Tacceptor_loss1.0000
17:50548615:TCAGG:Tdonor_loss1.0000
17:50548616:CAGG:Cdonor_loss1.0000
17:50548617:AGG:Adonor_loss1.0000
17:50548619:G:Cdonor_loss1.0000
17:50548620:T:Adonor_loss1.0000
17:50548796:A:AGacceptor_gain1.0000
17:50548796:AT:Aacceptor_gain1.0000
17:50548796:ATG:Aacceptor_gain1.0000
17:50548797:T:Aacceptor_gain1.0000
17:50548805:TTCAG:Tacceptor_loss1.0000
17:50548806:TCA:Tacceptor_loss1.0000
17:50548808:A:AGacceptor_gain1.0000
17:50548808:AGTC:Aacceptor_gain1.0000
17:50548808:AGTCG:Aacceptor_gain1.0000
17:50548809:G:GAacceptor_gain1.0000
17:50548809:GT:Gacceptor_gain1.0000
17:50548809:GTC:Gacceptor_gain1.0000
17:50548809:GTCG:Gacceptor_gain1.0000
17:50548809:GTCGG:Gacceptor_gain1.0000
17:50548830:T:Aacceptor_gain1.0000
17:50548961:GCAG:Gdonor_gain1.0000
17:50548965:G:Cdonor_loss1.0000
17:50548965:G:GGdonor_gain1.0000
17:50548966:T:Gdonor_loss1.0000
17:50549037:C:CAacceptor_gain1.0000
17:50549038:GCCA:Gacceptor_loss1.0000

AlphaMissense

5204 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
17:50548857:T:CF121L0.998
17:50548859:C:AF121L0.998
17:50548859:C:GF121L0.998
17:50549079:T:AW169R0.998
17:50549079:T:CW169R0.998
17:50549083:T:CL170P0.998
17:50550238:T:AW326R0.998
17:50550238:T:CW326R0.998
17:50550789:T:AW403R0.998
17:50550789:T:CW403R0.998
17:50551997:G:CD576H0.998
17:50548452:T:AW83R0.997
17:50548452:T:CW83R0.997
17:50548561:T:AW86R0.997
17:50548561:T:CW86R0.997
17:50549075:T:AN167K0.997
17:50549075:T:GN167K0.997
17:50549155:T:CF194S0.997
17:50549286:T:AW205R0.997
17:50549286:T:CW205R0.997
17:50552069:G:CA600P0.997
17:50554256:G:CD639H0.997
17:50548554:G:CW83C0.996
17:50548554:G:TW83C0.996
17:50548585:G:CA94P0.996
17:50548586:C:AA94D0.996
17:50548607:T:AI101N0.996
17:50548613:T:AL103H0.996
17:50548910:G:CK138N0.996
17:50548910:G:TK138N0.996

dbSNP variants (sampled 300 via entrez): RS1000060209 (17:50550508 T>C), RS1000443938 (17:50547841 T>C), RS1000473717 (17:50554178 A>C,G), RS1000475131 (17:50547570 C>T), RS1000697110 (17:50552736 T>C), RS1000725540 (17:50552728 A>G), RS1000761447 (17:50553973 G>A), RS1001217182 (17:50553035 C>T), RS1001316239 (17:50553884 C>T), RS1001378482 (17:50548700 T>G), RS1001532629 (17:50552020 C>A,T), RS1001600742 (17:50553641 G>A,T), RS1001670369 (17:50546999 A>C,G), RS1001826691 (17:50552343 T>A), RS1001983992 (17:50547192 C>A,T)

Disease associations

OMIM: gene MIM:613939 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

4 associations (top):

StudyTraitp-value
GCST006585_1912Blood protein levels4.000000e-81
GCST010244_389Triglyceride levels2.000000e-10
GCST90002405_384Reticulocyte count5.000000e-10
GCST90020026_459Hip index2.000000e-09

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0004530triglyceride measurement
EFO:0007986reticulocyte count
EFO:0008039BMI-adjusted hip circumference

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

47 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Cyclosporinedecreases expression3
bisphenol Aincreases expression, increases methylation2
Arsenicaffects methylation, increases abundance, increases expression2
Benzo(a)pyreneincreases expression, affects methylation2
Tetrachlorodibenzodioxinincreases expression2
aristolochic acid Iincreases expression1
alpha-pineneincreases oxidation, increases abundance, affects cotreatment1
glycidyl methacrylatedecreases expression1
sodium arseniteincreases abundance, increases expression1
perfluorooctanoic acidincreases expression1
ochratoxin Aincreases acetylation, increases expression1
methacrylaldehydeincreases oxidation, increases abundance, affects cotreatment1
CGP 52608increases reaction, affects binding1
entinostatincreases expression1
ICG 001increases expression1
bisphenol Sdecreases methylation1
jinfukangincreases expression1
(+)-JQ1 compounddecreases expression1
4-(4-((5-(4,5-dimethyl-2-nitrophenyl)-2-furanyl)methylene)-4,5-dihydro-3-methyl-5-oxo-1H-pyrazol-1-yl)benzoic acidincreases expression1
Resveratrolaffects cotreatment, decreases expression1
Decitabineaffects expression1
Arsenic Trioxidedecreases response to substance1
Acetaminophendecreases expression1
Acroleinincreases abundance, affects cotreatment, increases oxidation1
Air Pollutantsaffects cotreatment, increases abundance, increases oxidation1
Arbutindecreases expression1
Atrazineincreases expression1
Cadmiumincreases expression1
Camptothecinincreases expression1
Cisplatinaffects expression1

Cellosaurus cell lines

4 cell lines: 4 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_TQ14HAP1 SPATA20 (-) 1Cancer cell lineMale
CVCL_XT71HAP1 SPATA20 (-) 2Cancer cell lineMale
CVCL_XT72HAP1 SPATA20 (-) 3Cancer cell lineMale
CVCL_XT73HAP1 SPATA20 (-) 4Cancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.