SPATA1

gene
On this page

Also known as SP-2

Summary

SPATA1 (spermatogenesis associated 1, HGNC:14682) is a protein-coding gene on chromosome 1p22.3, encoding Spermatogenesis-associated protein 1 (Q5VX52).

Predicted to be located in acrosomal vesicle.

Source: NCBI Gene 100505741 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 3 total
  • MANE Select transcript: NM_001397487

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:14682
Approved symbolSPATA1
Namespermatogenesis associated 1
Location1p22.3
Locus typegene with protein product
StatusApproved
AliasesSP-2
Ensembl geneENSG00000122432
Ensembl biotypeprotein_coding
Entrez100505741

Gene structure

Transcript identifiers

Ensembl transcripts: 10 — 5 protein_coding_CDS_not_defined, 3 nonsense_mediated_decay, 1 retained_intron, 1 protein_coding

ENST00000460286, ENST00000468437, ENST00000473108, ENST00000484939, ENST00000485121, ENST00000490879, ENST00000697276, ENST00000697277, ENST00000699394, ENST00000699524

RefSeq mRNA: 2 — MANE Select: NM_001397487 NM_001310156, NM_001397487

CCDS: CCDS90993

Canonical transcript exons

ENST00000699524 — 14 exons

ExonStartEnd
ENSE000021705718450638684506418
ENSE000024480408452584584526073
ENSE000035370638451622384516395
ENSE000038109468452569684525749
ENSE000039768448454563484545759
ENSE000039768458455591084555979
ENSE000039768478453286084532974
ENSE000039768498456586184567379
ENSE000039768508452058584520691
ENSE000039768518454878684548964
ENSE000039768528454420284544304
ENSE000039768538452239084522507
ENSE000039768548453370984533766
ENSE000039768558455043284550530

Expression profiles

Bgee: expression breadth ubiquitous, 176 present calls, max score 91.23.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.1998 / max 11.4203, expressed in 68 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
38000.199868

Top tissues by expression

274 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
buccal mucosa cellCL:000233691.23silver quality
spermCL:000001989.65gold quality
male germ cellCL:000001586.26gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047384.49gold quality
left testisUBERON:000453379.05gold quality
right testisUBERON:000453478.37gold quality
testisUBERON:000047377.56gold quality
monocyteCL:000057677.20gold quality
mononuclear cellCL:000084276.95gold quality
calcaneal tendonUBERON:000370176.80gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099176.24gold quality
leukocyteCL:000073876.03gold quality
granulocyteCL:000009475.51gold quality
right lobe of liverUBERON:000111475.51gold quality
lower esophagus mucosaUBERON:003583475.18gold quality
apex of heartUBERON:000209873.92gold quality
adenohypophysisUBERON:000219673.74gold quality
spleenUBERON:000210673.01gold quality
tendonUBERON:000004372.45gold quality
metanephros cortexUBERON:001053372.11gold quality
mucosa of stomachUBERON:000119971.98gold quality
mucosa of transverse colonUBERON:000499171.90gold quality
right lungUBERON:000216771.38gold quality
descending thoracic aortaUBERON:000234571.36gold quality
body of uterusUBERON:000985371.18gold quality
thoracic aortaUBERON:000151570.98gold quality
right coronary arteryUBERON:000162570.97gold quality
ascending aortaUBERON:000149670.95gold quality
muscle layer of sigmoid colonUBERON:003580570.77gold quality
transverse colonUBERON:000115770.53gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no3.23

Regulation

Is transcription factor: no

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
danio_rerioSPATA1ENSDARG00000094327
mus_musculusSpata1ENSMUSG00000028188
rattus_norvegicusSpata1ENSRNOG00000015678

Protein

Protein identifiers

Spermatogenesis-associated protein 1Q5VX52 (reviewed: Q5VX52)

Alternative names: Sperm-specific protein SP-2

All UniProt accessions (4): A0A8V8TNC2, A0A8V8TNU4, A0A8V8TPQ3, A0A8V8TQ46

UniProt curated annotations — full annotation on UniProt →

Subunit / interactions. Interacts with IFT20.

Subcellular location. Cytoplasmic vesicle. Secretory vesicle. Acrosome.

Miscellaneous. May be produced at very low levels due to a premature stop codon in the mRNA, leading to nonsense-mediated mRNA decay.

Isoforms (3)

UniProt IDNamesCanonical?
Q5VX52-11yes
Q5VX52-22
Q5VX52-43

RefSeq proteins (2): NP_001297085, NP_001384416* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR031478SPATA1_CDomain
IPR039062SPAT1Family

Pfam: PF15743

UniProt features (11 total): splice variant 3, sequence conflict 2, coiled-coil region 2, chain 1, region of interest 1, compositionally biased region 1, sequence variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q5VX52-F169.580.45

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 58 (showing top): GOCC_SECRETORY_GRANULE, TCCAGAT_MIR5165P, chr1p22, GOCC_SECRETORY_VESICLE, GOCC_ACROSOMAL_VESICLE, KUMAR_PATHOGEN_LOAD_BY_MACROPHAGES, IL15_UP.V1_DN, IL2_UP.V1_DN, HHEX_TARGET_GENES, NAB2_TARGET_GENES, NKX2_3_TARGET_GENES, PAF1_TARGET_GENES, UBN1_TARGET_GENES, ZNF22_TARGET_GENES, ZNF274_TARGET_GENES

GO Biological Process (0):

GO Molecular Function (0):

GO Cellular Component (2): acrosomal vesicle (GO:0001669), cytoplasmic vesicle (GO:0031410)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
secretory granule1
cytoplasm1
intracellular vesicle1

Protein interactions and networks

STRING

204 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SPATA1SPATA6LQ8N4H0445
SPATA1SPATA6Q9NWH7396
SPATA1SPEF1Q9Y4P9370
SPATA1CTBSQ01459367
SPATA1SPATA20Q8TB22357
SPATA1SPATA17Q96L03355
SPATA1SPATA4Q8NEY3348
SPATA1GMCL1Q96IK5338
SPATA1SPATA7Q9P0W8317
SPATA1SPAG16Q8N0X2305
SPATA1SPATA22Q8NHS9302
SPATA1SRMP19623289
SPATA1LCA5Q86VQ0288
SPATA1ETNPPLQ8TBG4272
SPATA1GNG5P30670270

IntAct

2 interactions, top by confidence:

ABTypeScore
SPATA1PCK1psi-mi:“MI:0914”(association)0.350

BioGRID (33): SPATA1 (Two-hybrid), UBR4 (Affinity Capture-MS), SCYL2 (Affinity Capture-MS), TLK2 (Affinity Capture-MS), TLK1 (Affinity Capture-MS), MAP3K4 (Affinity Capture-MS), WNK2 (Affinity Capture-MS), SMG6 (Affinity Capture-MS), WNK1 (Affinity Capture-MS), LRRC49 (Affinity Capture-MS), VPS39 (Affinity Capture-MS), C2orf44 (Affinity Capture-MS), NECAP2 (Affinity Capture-MS), KDM5C (Affinity Capture-MS), CMTR1 (Affinity Capture-MS)

ESM2 similar proteins: A0A1W2P884, A2RUB6, A7E3D8, A8MT70, B0CM36, B2RYR0, F1PZQ5, O95447, Q0IIM1, Q0P5X1, Q2KHM9, Q2T9X8, Q4KLH6, Q4R3Q7, Q4R6Q9, Q5NVK0, Q5R7F8, Q5RBD6, Q5RBY6, Q5RC32, Q5RD75, Q5SZL2, Q5TB80, Q5TID7, Q5VX52, Q5XI03, Q6A000, Q6NS45, Q6NZK5, Q6ZPR1, Q6ZQ06, Q7Z4H7, Q80VP2, Q80XJ2, Q80ZU5, Q86T90, Q86YF9, Q8BMD2, Q8IYW5, Q8N0Z3

Diamond homologs: Q5VX52, Q6AY22, Q9D5R4

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

3 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance1
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

554 predictions. Top by Δscore:

VariantEffectΔscore
1:84563243:A:ACdonor_gain1.0000
1:84563244:C:CCdonor_gain1.0000
1:84563255:A:ACdonor_gain1.0000
1:84563256:C:CCdonor_gain1.0000
1:84563414:TGATC:Tacceptor_gain1.0000
1:84563419:C:Aacceptor_loss1.0000
1:84563419:C:CCacceptor_gain1.0000
1:84563420:T:Aacceptor_loss1.0000
1:84563729:TCCTA:Tdonor_loss1.0000
1:84563731:CTACC:Cdonor_loss1.0000
1:84563733:A:ATdonor_loss1.0000
1:84564206:A:Cdonor_gain1.0000
1:84565835:TCTTA:Tdonor_loss1.0000
1:84565836:CTTA:Cdonor_loss1.0000
1:84565837:TTA:Tdonor_loss1.0000
1:84565838:TA:Tdonor_loss1.0000
1:84565839:A:ACdonor_gain1.0000
1:84565840:C:Adonor_loss1.0000
1:84565840:C:CCdonor_gain1.0000
1:84555030:T:TAdonor_gain0.9900
1:84555031:C:Adonor_gain0.9900
1:84563417:TC:Tacceptor_gain0.9900
1:84563418:CC:Cacceptor_gain0.9900
1:84565834:GTCTT:Gdonor_loss0.9900
1:84565839:AC:Adonor_gain0.9900
1:84565840:CC:Cdonor_gain0.9900
1:84565840:CCA:Cdonor_gain0.9900
1:84565840:CCAG:Cdonor_gain0.9900
1:84565884:TTCTG:Tdonor_gain0.9900
1:84566009:TTACC:Tacceptor_gain0.9900

AlphaMissense

2860 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
1:84520610:T:AV43D0.996
1:84520624:T:AW48R0.996
1:84520624:T:CW48R0.996
1:84520601:T:AV40D0.995
1:84520626:G:CW48C0.994
1:84520626:G:TW48C0.994
1:84520684:T:CF68L0.994
1:84520686:T:AF68L0.994
1:84520686:T:GF68L0.994
1:84525861:T:CL133S0.994
1:84522476:T:CF99S0.992
1:84522479:T:CL100P0.992
1:84525735:T:CF123L0.992
1:84525737:T:AF123L0.992
1:84525737:T:GF123L0.992
1:84525867:T:CL135S0.992
1:84522469:T:CF97L0.991
1:84522471:T:AF97L0.991
1:84522471:T:GF97L0.991
1:84525863:T:GY134D0.991
1:84522410:T:CL77S0.990
1:84522419:T:CL80P0.990
1:84522475:T:CF99L0.990
1:84522477:T:AF99L0.990
1:84522477:T:GF99L0.990
1:84525721:T:CL118P0.990
1:84520595:T:CL38P0.989
1:84522500:T:CL107S0.989
1:84520625:G:CW48S0.988
1:84520669:T:CF63L0.988

dbSNP variants (sampled 300 via entrez): RS1000019693 (1:84530081 C>T), RS1000087909 (1:84528671 T>A,C), RS1000090045 (1:84510811 C>T), RS1000130325 (1:84558552 A>G), RS1000185189 (1:84549896 C>A,T), RS1000222999 (1:84508936 G>C,T), RS1000251317 (1:84558259 C>G), RS1000278089 (1:84551503 A>C), RS1000289753 (1:84551792 T>C), RS1000296687 (1:84508591 T>G), RS1000300596 (1:84514606 G>A), RS1000307211 (1:84518046 T>C), RS1000396000 (1:84565154 G>A), RS1000441163 (1:84544897 A>C), RS1000497511 (1:84529796 T>G)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

10 total (human), top 10 by PubMed support.

ChemicalActions (top 5)PubMed papers
hydroxyhydroquinoneincreases expression1
tris(2-butoxyethyl) phosphateaffects expression1
beta-lapachonedecreases expression1
sodium arsenitedecreases expression1
maleic acidincreases expression1
Doxorubicindecreases expression1
Urethaneincreases expression1
Aflatoxin B1decreases methylation1
Okadaic Aciddecreases expression1
S-Nitrosoglutathioneincreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.