SPAG4

gene
On this page

Also known as SUN4CT127

Summary

SPAG4 (sperm associated antigen 4, HGNC:11214) is a protein-coding gene on chromosome 20q11.22, encoding Sperm-associated antigen 4 protein (Q9NPE6). Involved in spermatogenesis.

The mammalian sperm flagellum contains two cytoskeletal structures associated with the axoneme: the outer dense fibers surrounding the axoneme in the midpiece and principal piece and the fibrous sheath surrounding the outer dense fibers in the principal piece of the tail. Defects in these structures are associated with abnormal tail morphology, reduced sperm motility, and infertility. In the rat, the protein encoded by this gene associates with an outer dense fiber protein via a leucine zipper motif and localizes to the microtubules of the manchette and axoneme during sperm tail development. Alternative splicing results in multiple transcript variants encoding different isoforms.

Source: NCBI Gene 6676 — RefSeq curated summary.

At a glance

  • GWAS associations: 5
  • Clinical variants (ClinVar): 46 total
  • MANE Select transcript: NM_003116

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:11214
Approved symbolSPAG4
Namesperm associated antigen 4
Location20q11.22
Locus typegene with protein product
StatusApproved
AliasesSUN4, CT127
Ensembl geneENSG00000061656
Ensembl biotypeprotein_coding
OMIM603038
Entrez6676

Gene structure

Transcript identifiers

Ensembl transcripts: 10 — 6 protein_coding, 3 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000374273, ENST00000430878, ENST00000454819, ENST00000462896, ENST00000463973, ENST00000468248, ENST00000498203, ENST00000679710, ENST00000861575, ENST00000957539

RefSeq mRNA: 2 — MANE Select: NM_003116 NM_001317931, NM_003116

CCDS: CCDS13259, CCDS93032

Canonical transcript exons

ENST00000374273 — 12 exons

ExonStartEnd
ENSE000006616643561713635617240
ENSE000006616683561861235618720
ENSE000014629913561582935616307
ENSE000034606543561752035617586
ENSE000035096393561808735618130
ENSE000035144473561892335618998
ENSE000035431013561957935619746
ENSE000035445713561919535619310
ENSE000035506343561845035618475
ENSE000035556903561777935617840
ENSE000035567553562068435620773
ENSE000038917473562087635621094

Expression profiles

Bgee: expression breadth ubiquitous, 186 present calls, max score 96.26.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 3.9955 / max 59.7410, expressed in 1320 samples.

FANTOM5 promoters (5 alternative TSS)

Promoter IDTPM avgSamples expressed
1843132.33191016
1843110.6065278
1843140.5075229
1843150.4345195
1843120.115251

Top tissues by expression

278 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
body of pancreasUBERON:000115096.26gold quality
right testisUBERON:000453493.87gold quality
left testisUBERON:000453393.68gold quality
testisUBERON:000047390.83gold quality
pancreasUBERON:000126486.58gold quality
adenohypophysisUBERON:000219685.34gold quality
left ovaryUBERON:000211985.19gold quality
right ovaryUBERON:000211884.30gold quality
left uterine tubeUBERON:000130384.12gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047383.79gold quality
spleenUBERON:000210683.16gold quality
pituitary glandUBERON:000000782.96gold quality
small intestine Peyer’s patchUBERON:000345481.95gold quality
transverse colonUBERON:000115781.45gold quality
right adrenal gland cortexUBERON:003582781.40gold quality
lower esophagus mucosaUBERON:003583481.37gold quality
body of stomachUBERON:000116181.06gold quality
endocervixUBERON:000045880.98gold quality
left adrenal gland cortexUBERON:003582580.86gold quality
minor salivary glandUBERON:000183080.56gold quality
right adrenal glandUBERON:000123380.05gold quality
adrenal tissueUBERON:001830380.04gold quality
stromal cell of endometriumCL:000225579.98gold quality
bone marrow cellCL:000209279.73gold quality
left adrenal glandUBERON:000123479.51gold quality
small intestineUBERON:000210879.38gold quality
mucosa of transverse colonUBERON:000499179.11gold quality
ovaryUBERON:000099278.78gold quality
gall bladderUBERON:000211078.60gold quality
apex of heartUBERON:000209878.49gold quality

Single-cell (SCXA)

Detected in 9 experiment(s), a significant marker in 8.

ExperimentMarker?Max mean expression
E-CURD-88yes107.84
E-HCAD-4yes39.46
E-ANND-3yes34.19
E-MTAB-8410yes32.41
E-CURD-46yes31.72
E-HCAD-11yes20.16
E-MTAB-10553yes11.02
E-HCAD-1yes10.45
E-MTAB-6108no223.43

Regulation

Is transcription factor: no

Literature-anchored findings (GeneRIF, showing 6)

  • SPAG4 is an independent prognostic factor in renal cell carcinoma and plays a crucial role in cytokinesis to defend against hypoxia-induced tetraploid formation. (PMID:23602831)
  • SPAG4 knockdown reduces the invasion capability of RCC cells. (PMID:23818324)
  • SPAG4, in cooperation with Nesprin3, has a fundamental pathological function in the migration of lung carcinoma cells. (PMID:29901114)
  • Our study revealed that SPAG4 was identified as a cancer biomarker for glioblastoma and might be a promising target for clinical diagnosis and intervention of glioblastoma. (PMID:30817682)
  • Results indicate that sperm associated antigen 4 (SPAG4L/SPAG4Lbeta) transcript isoform interacts with spectrin repeat containing nuclear envelope protein 2 (Nesprin2) in the meiotic process. (PMID:31144711)
  • Human sperm-associated antigen 4 as a potential prognostic biomarker of lung squamous cell carcinoma. (PMID:34311595)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
mus_musculusSpag4ENSMUSG00000038180
rattus_norvegicusSpag4ENSRNOG00000048056
caenorhabditis_elegansWBGENE00006816

Paralogs (4): SUN2 (ENSG00000100242), SUN3 (ENSG00000164744), SUN1 (ENSG00000164828), SUN5 (ENSG00000167098)

Protein

Protein identifiers

Sperm-associated antigen 4 proteinQ9NPE6 (reviewed: Q9NPE6)

Alternative names: Outer dense fiber-associated protein SPAG4, SUN domain-containing protein 4

All UniProt accessions (4): Q9NPE6, A0A7P0Z4G5, C9JJZ6, Q5JX49

UniProt curated annotations — full annotation on UniProt →

Function. Involved in spermatogenesis. Required for sperm head formation but not required to establish and maintain general polarity of the sperm head. Required for anchoring and organization of the manchette. Required for targeting of SUN3 and probably SYNE1 through a probable SUN1:SYNE3 LINC complex to the nuclear envelope and involved in accurate posterior sperm head localization of the complex. May anchor SUN3 the nuclear envelope. Involved in maintenance of the nuclear envelope integrity. May assist the organization and assembly of outer dense fibers (ODFs), a specific structure of the sperm tail.

Subunit / interactions. Homodimer. Interacts with ODF1. May associate with microtubules. Interacts with SUN3 and SYNE1; suggesting the formation of a spermatogenesis-specific LINC complex; a SUN domain-based heterotrimer with SUN3 may associate with SYNE1. Interacts with SEPT12 and LMNB1; during spermatogenesis.

Subcellular location. Membrane. Cytoplasm. Cytoskeleton. Flagellum axoneme. Nucleus envelope. Nucleus inner membrane.

Tissue specificity. Predominantly epressed in testis. Expressed in ejaculated spermatozoa (at protein level).

RefSeq proteins (2): NP_001304860, NP_003107* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR012919SUN_domDomain
IPR045119SUN1-5Family

Pfam: PF07738

UniProt features (9 total): compositionally biased region 3, transmembrane region 2, chain 1, domain 1, region of interest 1, coiled-coil region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9NPE6-F171.790.38

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 123 (showing top): NIKOLSKY_BREAST_CANCER_20Q11_AMPLICON, GOZGIT_ESR1_TARGETS_DN, MENSE_HYPOXIA_UP, GOBP_MALE_GAMETE_GENERATION, LIAO_METASTASIS, GOBP_DEVELOPMENTAL_PROCESS_INVOLVED_IN_REPRODUCTION, HAN_SATB1_TARGETS_DN, chr20q11, GOCC_NUCLEAR_ENVELOPE, GOCC_ORGANELLE_INNER_MEMBRANE, GOCC_NUCLEAR_INNER_MEMBRANE, GOCC_MEMBRANE_PROTEIN_COMPLEX, GOCC_NUCLEAR_MEMBRANE, ZHAN_MULTIPLE_MYELOMA_CD2_UP, GOMF_STRUCTURAL_MOLECULE_ACTIVITY

GO Biological Process (2): spermatogenesis (GO:0007283), cell differentiation (GO:0030154)

GO Molecular Function (3): structural molecule activity (GO:0005198), protein-membrane adaptor activity (GO:0043495), protein binding (GO:0005515)

GO Cellular Component (10): nuclear envelope (GO:0005635), nuclear inner membrane (GO:0005637), cytoskeleton (GO:0005856), meiotic nuclear membrane microtubule tethering complex (GO:0034993), nucleus (GO:0005634), cytoplasm (GO:0005737), cilium (GO:0005929), membrane (GO:0016020), motile cilium (GO:0031514), cell projection (GO:0042995)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure3
developmental process involved in reproduction1
male gamete generation1
cellular developmental process1
molecular_function1
protein-macromolecule adaptor activity1
binding1
nucleus1
endomembrane system1
organelle envelope1
organelle inner membrane1
nuclear membrane1
intracellular membraneless organelle1
microtubule organizing center attachment site1
nuclear membrane microtubule tethering complex1
intracellular membrane-bounded organelle1
intracellular anatomical structure1
intraciliary transport particle1
membrane-bounded organelle1
plasma membrane bounded cell projection1
cilium1

Protein interactions and networks

STRING

1212 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SPAG4ODF1Q14990998
SPAG4SPAG5Q96R06737
SPAG4SYNE1Q8NF91737
SPAG4SYNE3Q6ZMZ3636
SPAG4SYNE4Q8N205587
SPAG4SEPTIN12Q8IYM1543
SPAG4SYNE2Q8WXH0532
SPAG4KASH5Q8N6L0530
SPAG4SEPTIN1Q8WYJ6475
SPAG4SEPTIN6Q14141471
SPAG4EMDP50402447
SPAG4HOOK1Q9UJC3443
SPAG4TOR3AQ9H497442
SPAG4MRPS18AQ9NVS2441
SPAG4OTORQ9NRC9439

IntAct

88 interactions, top by confidence:

ABTypeScore
SEPTIN12SPAG4psi-mi:“MI:0915”(physical association)0.610
SPAG4SEPTIN12psi-mi:“MI:0915”(physical association)0.610
SEPTIN12SPAG4psi-mi:“MI:0403”(colocalization)0.610
TMEM120ASPAG4psi-mi:“MI:0915”(physical association)0.560
ZDHHC24SPAG4psi-mi:“MI:0915”(physical association)0.560
SLC30A8SPAG4psi-mi:“MI:0915”(physical association)0.560
CYP4F2SPAG4psi-mi:“MI:0915”(physical association)0.560
ABHD16ASPAG4psi-mi:“MI:0915”(physical association)0.560
TMEM60SPAG4psi-mi:“MI:0915”(physical association)0.560
PGA4SPAG4psi-mi:“MI:0915”(physical association)0.560
GOSR2SPAG4psi-mi:“MI:0915”(physical association)0.560
ASGR1SPAG4psi-mi:“MI:0915”(physical association)0.560
IGFBP5SPAG4psi-mi:“MI:0915”(physical association)0.560
TSPO2SPAG4psi-mi:“MI:0915”(physical association)0.560
MS4A13SPAG4psi-mi:“MI:0915”(physical association)0.560
TMEM187SPAG4psi-mi:“MI:0915”(physical association)0.560
PTPN9SPAG4psi-mi:“MI:0915”(physical association)0.560
SEC22ASPAG4psi-mi:“MI:0915”(physical association)0.560
BMP10SPAG4psi-mi:“MI:0915”(physical association)0.560
TNMDSPAG4psi-mi:“MI:0915”(physical association)0.560

BioGRID (31): SPAG4 (Two-hybrid), SCD (Two-hybrid), ERMP1 (Two-hybrid), GOSR2 (Two-hybrid), TSPO2 (Two-hybrid), IGFBP5 (Two-hybrid), SEC22A (Two-hybrid), TMEM60 (Two-hybrid), NRM (Two-hybrid), C16orf58 (Two-hybrid), ASGR1 (Two-hybrid), CSGALNACT1 (Two-hybrid), BMP10 (Two-hybrid), SUN5 (Two-hybrid), PTPN9 (Two-hybrid)

ESM2 similar proteins: A1L3T7, A2CI98, A2CJ06, A2VCK2, D3ZQL6, E1BBG2, M3WHG5, O15037, O43918, O70303, O75333, P48778, P59729, Q32PJ7, Q3KP66, Q3T191, Q5JYT7, Q5T124, Q5U4F0, Q5XIS1, Q6AXX1, Q6VYH9, Q7TSI1, Q80U38, Q8BGT6, Q8BHW9, Q8BWG4, Q8BWQ5, Q8BZ33, Q8BZW2, Q8C0J6, Q8C2K5, Q8CFK6, Q8IV53, Q8IY33, Q8IZW8, Q8K1S6, Q8K330, Q8N3F8, Q8TB24

Diamond homologs: A0A0B4KEE4, O55034, O94901, Q09825, Q0II64, Q20745, Q5SS91, Q8BJS4, Q8TAQ9, Q8TC36, Q95LV7, Q9D666, Q9DA32, Q9JJF2, Q9NPE6, Q9UH99, Q9SG79, Q558Z2, Q9FF75

SIGNOR signaling

1 interactions.

AEffectBMechanism
SPAG4“form complex”“LINC complex”binding

Disease & clinical

Clinical variants and AI predictions

ClinVar

46 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance36
Likely benign1
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1760 predictions. Top by Δscore:

VariantEffectΔscore
20:35618611:GTGA:Gacceptor_gain1.0000
20:35619193:A:AGacceptor_gain1.0000
20:35619194:G:GGacceptor_gain1.0000
20:35619307:GGAG:Gdonor_gain1.0000
20:35619308:GAG:Gdonor_gain1.0000
20:35619308:GAGG:Gdonor_gain1.0000
20:35619309:AGGT:Adonor_loss1.0000
20:35619310:GGTG:Gdonor_loss1.0000
20:35619311:G:Cdonor_loss1.0000
20:35620679:ACCAG:Aacceptor_gain1.0000
20:35620680:CCA:Cacceptor_loss1.0000
20:35620681:CA:Cacceptor_loss1.0000
20:35620682:A:AGacceptor_gain1.0000
20:35620683:G:GGacceptor_gain1.0000
20:35620683:G:GTacceptor_loss1.0000
20:35616224:G:GTdonor_gain0.9900
20:35618085:A:AGacceptor_gain0.9900
20:35618086:G:GGacceptor_gain0.9900
20:35618086:GC:Gacceptor_gain0.9900
20:35618131:G:GGdonor_gain0.9900
20:35618610:A:AGacceptor_gain0.9900
20:35618611:G:GGacceptor_gain0.9900
20:35618611:GT:Gacceptor_gain0.9900
20:35619039:A:Tdonor_gain0.9900
20:35619189:TTCCA:Tacceptor_loss0.9900
20:35619192:CA:Cacceptor_loss0.9900
20:35619193:A:ACacceptor_loss0.9900
20:35619194:GGA:Gacceptor_gain0.9900
20:35619306:TGGAG:Tdonor_gain0.9900
20:35619307:GGAGG:Gdonor_gain0.9900

AlphaMissense

2828 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:35619605:G:CW312C0.997
20:35619605:G:TW312C0.997
20:35620762:T:CF386L0.997
20:35620764:C:AF386L0.997
20:35620764:C:GF386L0.997
20:35620726:T:CF374L0.996
20:35620728:C:AF374L0.996
20:35620728:C:GF374L0.996
20:35619603:T:AW312R0.994
20:35619603:T:CW312R0.994
20:35620910:T:CI401T0.994
20:35619637:T:CI323T0.993
20:35620732:T:CF376L0.993
20:35620734:C:AF376L0.993
20:35620734:C:GF376L0.993
20:35619610:T:GF314C0.991
20:35620916:T:CI403T0.991
20:35620942:T:CF412L0.991
20:35620944:C:AF412L0.991
20:35620944:C:GF412L0.991
20:35620964:G:CR419P0.991
20:35619637:T:AI323N0.990
20:35620894:T:CF396L0.990
20:35620896:T:AF396L0.990
20:35620896:T:GF396L0.990
20:35619610:T:CF314S0.989
20:35620910:T:GI401S0.989
20:35620926:C:AN406K0.989
20:35620926:C:GN406K0.989
20:35620961:T:AV418D0.989

dbSNP variants (sampled 300 via entrez): RS1000461465 (20:35617942 C>T), RS1001009870 (20:35619406 G>A,T), RS1001093449 (20:35616488 A>G), RS1001619666 (20:35620096 G>A), RS1001634950 (20:35617450 G>T), RS1002904029 (20:35616105 C>T), RS1003281922 (20:35616379 G>A,C), RS1003293841 (20:35621304 T>C), RS1003429631 (20:35621117 A>T), RS1004081884 (20:35615727 C>A), RS1004184895 (20:35614125 C>T), RS1005606698 (20:35619131 GCCCCCGCGCCCCGACT>G), RS1006149514 (20:35620281 T>A), RS1006323140 (20:35620532 C>T), RS1006518851 (20:35618745 C>A,G,T)

Disease associations

OMIM: gene MIM:603038 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

5 associations (top):

StudyTraitp-value
GCST005956_31Waist-to-hip ratio adjusted for BMI8.000000e-08
GCST005958_16Waist-to-hip ratio adjusted for BMI (age >50)6.000000e-06
GCST005962_40Waist-to-hip ratio adjusted for BMI x sex x age interaction (4df test)3.000000e-08
GCST010002_66Refractive error2.000000e-20
GCST010703_112Brain morphology (MOSTest)4.000000e-19

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0007788BMI-adjusted waist-hip ratio
EFO:0008007age at assessment
EFO:0008343sex interaction measurement
EFO:0004346neuroimaging measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

40 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Cyclosporinedecreases expression3
mercuric bromidedecreases expression, affects cotreatment2
bisphenol Saffects cotreatment, increases expression, decreases expression2
Benzo(a)pyreneincreases expression, increases methylation2
Phenylmercuric Acetateaffects cotreatment, decreases expression2
Smokedecreases expression, increases abundance2
Valproic Acidincreases methylation, affects expression2
aristolochic acid Idecreases expression1
triphenyl phosphateaffects expression1
bisphenol Aaffects expression1
tris(2-butoxyethyl) phosphateaffects expression1
arseniteaffects binding, increases reaction1
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
zinc chlorideincreases expression, decreases reaction1
cobaltous chlorideincreases expression, decreases reaction1
nickel chlorideincreases expression1
di-n-butylphosphoric acidaffects expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression1
dorsomorphinaffects cotreatment, decreases expression1
NSC 689534increases expression1
Air Pollutantsincreases abundance, decreases expression1
Benzeneincreases expression1
Carbamazepineaffects expression1
Dexamethasoneaffects cotreatment, increases expression1
Diethylhexyl Phthalatedecreases expression1
Estradiolaffects cotreatment, increases expression1
Hydrogen Peroxideaffects expression1
Indomethacinincreases expression, affects cotreatment1
Oxygenincreases expression1
Quercetinaffects expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.