SPOCD1

gene
On this page

Also known as FLJ25348PPP1R146

Summary

SPOCD1 (SPOC domain containing 1, HGNC:26338) is a protein-coding gene on chromosome 1p35.2, encoding SPOC domain-containing protein 1 (Q6ZMY3). Protein adapter that acts as an essential executor of PIWIL4-piRNA pathway directed transposon DNA methylation and silencing in the male embryonic germ cells.

This gene encodes a protein that belongs to the TFIIS family of transcription factors. Alternate splicing results in multiple transcript variants.

Source: NCBI Gene 90853 — RefSeq curated summary.

At a glance

  • GWAS associations: 3
  • Clinical variants (ClinVar): 192 total — 3 likely-pathogenic
  • MANE Select transcript: NM_144569

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26338
Approved symbolSPOCD1
NameSPOC domain containing 1
Location1p35.2
Locus typegene with protein product
StatusApproved
AliasesFLJ25348, PPP1R146
Ensembl geneENSG00000134668
Ensembl biotypeprotein_coding
OMIM619038
Entrez90853

Gene structure

Transcript identifiers

Ensembl transcripts: 16 — 10 protein_coding, 6 retained_intron

ENST00000257100, ENST00000360482, ENST00000452755, ENST00000460061, ENST00000468720, ENST00000473361, ENST00000485944, ENST00000525930, ENST00000528579, ENST00000528791, ENST00000529396, ENST00000531039, ENST00000532604, ENST00000533231, ENST00000917879, ENST00000917880

RefSeq mRNA: 6 — MANE Select: NM_144569 NM_001281987, NM_001281988, NM_001394397, NM_001394398, NM_001394399, NM_144569

CCDS: CCDS347, CCDS60066, CCDS72748

Canonical transcript exons

ENST00000360482 — 16 exons

ExonStartEnd
ENSE000012797513181596131816022
ENSE000026991373181395131815372
ENSE000034785163179659031796715
ENSE000034873993180166431801705
ENSE000035503123179221531792401
ENSE000035532103179940131799485
ENSE000035587583180044131800617
ENSE000035607663179412431794235
ENSE000035906643179844231798601
ENSE000036149483179327831793428
ENSE000036263913180001631800141
ENSE000036264333179042231791291
ENSE000036495903179820731798323
ENSE000036691843179267831792767
ENSE000036933023179374731793897
ENSE000037857553179980931799863

Expression profiles

Bgee: expression breadth ubiquitous, 164 present calls, max score 93.05.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 17.3583 / max 195.0093, expressed in 1076 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
114989.6525977
114976.8555935
114990.7512440
114960.099139

Top tissues by expression

241 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
kidney epitheliumUBERON:000481993.05gold quality
stromal cell of endometriumCL:000225593.02gold quality
cardiac muscle of right atriumUBERON:000337987.54gold quality
left ventricle myocardiumUBERON:000656687.51gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047382.57gold quality
monocyteCL:000057680.27gold quality
right testisUBERON:000453479.96gold quality
upper lobe of left lungUBERON:000895279.58gold quality
leukocyteCL:000073879.38gold quality
left testisUBERON:000453379.37gold quality
upper arm skinUBERON:000426378.76gold quality
upper lobe of lungUBERON:000894878.62gold quality
testisUBERON:000047378.39gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099178.18gold quality
tibiaUBERON:000097977.24gold quality
urinary bladderUBERON:000125577.04gold quality
epithelial cell of pancreasCL:000008377.03gold quality
right lungUBERON:000216774.57gold quality
nasal cavity epitheliumUBERON:000538474.23gold quality
oocyteCL:000002374.18gold quality
myocardiumUBERON:000234974.03gold quality
cerebellar vermisUBERON:000472073.82gold quality
myometriumUBERON:000129673.64gold quality
amniotic fluidUBERON:000017372.95silver quality
vermiform appendixUBERON:000115472.58gold quality
right coronary arteryUBERON:000162572.14gold quality
lungUBERON:000204872.06gold quality
epithelium of nasopharynxUBERON:000195171.75gold quality
bloodUBERON:000017871.65gold quality
caecumUBERON:000115371.41gold quality

Single-cell (SCXA)

Detected in 4 experiment(s), a significant marker in 3.

ExperimentMarker?Max mean expression
E-MTAB-9435yes417.25
E-HCAD-13yes25.43
E-ANND-3yes14.22
E-MTAB-7249no45.36

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

11 targeting SPOCD1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-551B-5P99.9671.283493
HSA-MIR-6817-3P99.7968.352126
HSA-MIR-2681-5P99.7567.641655
HSA-MIR-42198.9067.041883
HSA-MIR-518C-5P98.5369.201640
HSA-MIR-473697.9665.891287
HSA-MIR-4723-3P97.6765.911017
HSA-MIR-445697.5064.881678
HSA-MIR-6769B-3P97.4165.531036
HSA-MIR-318397.4065.68978

Literature-anchored findings (GeneRIF, showing 5)

  • SPOCD1 expression is significantly upregulated in human masticatory mucosa during wound healing (PMID:28005267)
  • SPOCD1 genetic variants have role in gastric carcinogenesis. (PMID:28246015)
  • SPOCD1 promoted cell proliferation and inhibited cell apoptosis through regulation of VEGFA in osteosarcoma. (PMID:29257309)
  • LncRNA SPOCD1-AS from ovarian cancer extracellular vesicles remodels mesothelial cells to promote peritoneal metastasis via interacting with G3BP1. (PMID:33726799)
  • Haplotype phasing of a bipolar disorder pedigree revealed rare multiple mutations of SPOCD1 gene in the 1p36-35 susceptibility locus. (PMID:35504398)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
mus_musculusSpocd1ENSMUSG00000028784
rattus_norvegicusENSRNOG00000064683
drosophila_melanogasterppsFBGN0082831

Paralogs (2): DIDO1 (ENSG00000101191), PHF3 (ENSG00000118482)

Protein

Protein identifiers

SPOC domain-containing protein 1Q6ZMY3 (reviewed: Q6ZMY3)

All UniProt accessions (6): Q6ZMY3, E9PKC3, E9PMX0, E9PPM7, E9PQH7, H0YF46

UniProt curated annotations — full annotation on UniProt →

Function. Protein adapter that acts as an essential executor of PIWIL4-piRNA pathway directed transposon DNA methylation and silencing in the male embryonic germ cells. Recruited to young transposons, which are specifically marked with histone H3 trimethylated at both ‘Lys-4’ and ‘Lys-9’ (H3K4me3K9me3), via its association with SPIN1 chromatin reader, and associates with the de novo DNA methylation machinery and repressive chromatin remodeling complexes. Following this, PIWIL4 engages with nascent transposable element transcript to direct piRNA-directed DNA methylation. Not required for piRNA biosynthesis.

Subunit / interactions. Interacts with DNMT3A, DNMT3C and DNMT3L. Interacts with C19orf84. Interacts with SPIN1; promoting recruitment to transposons marked with histone H3 trimethylated at both ‘Lys-4’ and ‘Lys-9’ (H3K4me3K9me3).

Subcellular location. Nucleus. Chromosome.

Disease relevance. Spermatogenic failure. A male infertility disorder caused by spermatogenesis defects that result in non-obstructive azoospermia. The disease is caused by variants affecting the gene represented in this entry.

Isoforms (5)

UniProt IDNamesCanonical?
Q6ZMY3-11yes
Q6ZMY3-22
Q6ZMY3-33
Q6ZMY3-44
Q6ZMY3-55

RefSeq proteins (6): NP_001268916, NP_001268917, NP_001381326, NP_001381327, NP_001381328, NP_653170* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR003618TFIIS_cen_domDomain
IPR012921SPOC_CDomain
IPR036575TFIIS_cen_dom_sfHomologous_superfamily

Pfam: PF07500, PF07744

UniProt features (34 total): compositionally biased region 9, splice variant 9, region of interest 8, sequence variant 4, domain 2, chain 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q6ZMY3-F147.760.05

Function

Pathways and Gene Ontology

Reactome pathways

4 pathways

IDPathway
R-HSA-9845323Regulation of endogenous retroelements by Piwi-interacting RNAs (piRNAs)
R-HSA-212165Epigenetic regulation of gene expression
R-HSA-74160Gene expression (Transcription)
R-HSA-9842860Regulation of endogenous retroelements

MSigDB gene sets: 88 (showing top): GOBP_MALE_GAMETE_GENERATION, WEI_MYCN_TARGETS_WITH_E_BOX, GOBP_NEGATIVE_REGULATION_OF_GENE_EXPRESSION_EPIGENETIC, GOBP_DNA_METHYLATION_DEPENDENT_CONSTITUTIVE_HETEROCHROMATIN_FORMATION, GOBP_DEVELOPMENTAL_PROCESS_INVOLVED_IN_REPRODUCTION, HENDRICKS_SMARCA4_TARGETS_UP, GOBP_CHROMATIN_REMODELING, GOBP_HETEROCHROMATIN_ORGANIZATION, GOBP_EPIGENETIC_REGULATION_OF_GENE_EXPRESSION, chr1p35, WANG_SMARCE1_TARGETS_UP, CHICAS_RB1_TARGETS_CONFLUENT, MIYAGAWA_TARGETS_OF_EWSR1_ETS_FUSIONS_DN, GOBP_TRANSPOSITION, PEDERSEN_METASTASIS_BY_ERBB2_ISOFORM_7

GO Biological Process (7): DNA-templated transcription (GO:0006351), regulation of transcription by RNA polymerase II (GO:0006357), spermatogenesis (GO:0007283), cell differentiation (GO:0030154), transposable element silencing by piRNA-mediated heterochromatin formation (GO:0141006), transposable element silencing by piRNA-mediated DNA methylation (GO:0141196), regulatory ncRNA-mediated gene silencing (GO:0031047)

GO Molecular Function (2): protein-macromolecule adaptor activity (GO:0030674), protein binding (GO:0005515)

GO Cellular Component (2): nucleus (GO:0005634), chromosome (GO:0005694)

Reactome top-level categories

Rollup of top-3 pathways:

CategoryPathways
Regulation of endogenous retroelements1
Gene expression (Transcription)1
Epigenetic regulation of gene expression1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
transposable element silencing by heterochromatin formation2
gene expression1
RNA biosynthetic process1
regulation of DNA-templated transcription1
transcription by RNA polymerase II1
developmental process involved in reproduction1
male gamete generation1
cellular developmental process1
piRNA-mediated heterochromatin formation1
gene silencing by piRNA-directed DNA methylation1
negative regulation of gene expression1
protein binding1
molecular adaptor activity1
binding1
intracellular membrane-bounded organelle1
intracellular membraneless organelle1

Protein interactions and networks

STRING

706 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SPOCD1DNMT3LQ9UJW3739
SPOCD1BANF1O75531627
SPOCD1TEX15Q9BXT5545
SPOCD1ZKSCAN2Q63HK3543
SPOCD1SPENQ96T58528
SPOCD1DNMT3AQ9Y6K1506
SPOCD1TDRD9Q8NDG6487
SPOCD1G3BP1Q13283485
SPOCD1TCEA3O75764462
SPOCD1DNAJC9Q8WXX5458
SPOCD1SH2D5Q6ZV89456
SPOCD1TCEA1P23193445
SPOCD1TSPYL5Q86VY4432
SPOCD1TASORQ9UK61400
SPOCD1PIWIL4Q7Z3Z4392

IntAct

3 interactions, top by confidence:

ABTypeScore
PPP1CASPOCD1psi-mi:“MI:0407”(direct interaction)0.440
SPOCD1PPP1CCpsi-mi:“MI:0915”(physical association)0.370

BioGRID (5): SPOCD1 (Affinity Capture-RNA), SPOCD1 (Negative Genetic), SPOCD1 (Cross-Linking-MS (XL-MS)), SPOCD1 (Two-hybrid), SPOCD1 (Affinity Capture-Western)

ESM2 similar proteins: A0A096LP49, A0A8V8TNH8, A0A8V8TPE2, A2VE02, A5D7I0, A6NDY2, A6NGG8, A6NIJ5, A6NNJ1, A8MXJ8, A8MYA2, B1ASB6, B2RW88, D6RGX4, O60269, P0C7V4, P0C7W8, P0C7W9, P0C7X0, P0DV75, P0DV76, Q2KIS6, Q2NL68, Q3SY00, Q4R736, Q4V8B5, Q5RCQ2, Q5SZB4, Q5VZ46, Q5XIK6, Q658T7, Q66JV7, Q6NS69, Q6PAC4, Q6ZMY3, Q76N32, Q7TSA6, Q7Z591, Q80VW7, Q80X53

Diamond homologs: A6ZZW1, O44498, O75764, P10711, P20232, P23193, P36106, P52652, Q29RL9, Q2KI09, Q4KLL0, Q6FJ00, Q6ZMY3, Q9U263, B1ASB6, Q148K0, Q15560, Q92576, P23881, Q08923, Q1MTR4, Q63799, Q6C0K9, Q99MY8, Q9NR48, Q9QVN7, Q9ZVH8, B8AMA8, O74508, Q12830, Q5EA28, Q5EAW9, Q5F489, Q5HZG4, Q5VWG9, Q6BER5, Q84TV4, Q8C9B9, Q9BTC0, Q9CWW7

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

192 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic3
Uncertain significance150
Likely benign27
Benign2

Top pathogenic / likely-pathogenic (3)

Variant IDHGVSClassification
2626778NM_144569.7(SPOCD1):c.2912T>G (p.Leu971Arg)Likely pathogenic
2626779NM_144569.7(SPOCD1):c.3354_3355insA (p.Gln1119fs)Likely pathogenic
2626780NM_144569.7(SPOCD1):c.1991_1992del (p.Arg664fs)Likely pathogenic

SpliceAI

2640 predictions. Top by Δscore:

VariantEffectΔscore
1:31792398:CGTC:Cacceptor_gain1.0000
1:31792673:GGTAC:Gdonor_loss1.0000
1:31792674:GTACC:Gdonor_loss1.0000
1:31792675:TA:Tdonor_loss1.0000
1:31792676:A:AGdonor_loss1.0000
1:31792677:C:CTdonor_loss1.0000
1:31792767:CCT:Cacceptor_loss1.0000
1:31792769:T:Gacceptor_loss1.0000
1:31793750:T:TAdonor_gain1.0000
1:31793775:C:Adonor_gain1.0000
1:31793776:C:Adonor_gain1.0000
1:31796587:A:ACdonor_gain1.0000
1:31796587:AAC:Adonor_gain1.0000
1:31796588:A:Cdonor_gain1.0000
1:31798202:CTCA:Cdonor_loss1.0000
1:31798203:TCA:Tdonor_loss1.0000
1:31798204:CA:Cdonor_loss1.0000
1:31798205:A:ACdonor_gain1.0000
1:31798205:AC:Adonor_gain1.0000
1:31798205:ACC:Adonor_gain1.0000
1:31798206:C:CCdonor_gain1.0000
1:31798206:CC:Cdonor_gain1.0000
1:31798206:CCC:Cdonor_gain1.0000
1:31792400:TC:Tacceptor_gain0.9900
1:31792401:CC:Cacceptor_gain0.9900
1:31792402:C:CCacceptor_gain0.9900
1:31792402:CT:Cacceptor_loss0.9900
1:31792406:G:Tacceptor_gain0.9900
1:31792407:G:Cacceptor_gain0.9900
1:31792407:G:GCacceptor_gain0.9900

AlphaMissense

7839 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
1:31793326:G:CF879L0.990
1:31793326:G:TF879L0.990
1:31793327:A:GF879S0.990
1:31793328:A:GF879L0.990
1:31798466:G:CF668L0.989
1:31798466:G:TF668L0.989
1:31798468:A:GF668L0.989
1:31798461:A:GL670P0.986
1:31798277:A:GL692P0.985
1:31793341:G:CF874L0.983
1:31793341:G:TF874L0.983
1:31793343:A:GF874L0.983
1:31793362:C:AW867C0.981
1:31793362:C:GW867C0.981
1:31798473:A:GL666P0.981
1:31798483:A:CY663D0.981
1:31798479:C:GR664P0.979
1:31798475:G:CS665R0.978
1:31798475:G:TS665R0.978
1:31798477:T:GS665R0.978
1:31793364:A:GW867R0.976
1:31793364:A:TW867R0.976
1:31798226:C:GR709P0.975
1:31798253:A:GL700P0.974
1:31798463:G:CN669K0.973
1:31798463:G:TN669K0.973
1:31798535:C:AE645D0.973
1:31798535:C:GE645D0.973
1:31792255:G:CF974L0.972
1:31792255:G:TF974L0.972

dbSNP variants (sampled 300 via entrez): RS1000002055 (1:31803340 A>G,T), RS1000021286 (1:31799323 A>T), RS1000021424 (1:31796838 A>C,G), RS1000081534 (1:31798001 C>T), RS1000404494 (1:31816011 G>A), RS1000435551 (1:31815740 T>C), RS1000560817 (1:31799603 C>G), RS1000839941 (1:31790262 AC>A), RS1001141687 (1:31790549 A>G), RS1001197165 (1:31806965 T>C), RS1001302878 (1:31799647 GCCCCCACCCCTT>G), RS1001463743 (1:31813505 C>T), RS1001472182 (1:31817107 T>TATATATATAC), RS1001493654 (1:31794427 C>A), RS1001526042 (1:31793195 C>T)

Disease associations

OMIM: gene MIM:619038 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (1): Male infertility with azoospermia or oligozoospermia due to single gene mutation (Orphanet:399805)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

3 associations (top):

StudyTraitp-value
GCST005951_36Body mass index9.000000e-10
GCST009798_61Asthma3.000000e-08
GCST012227_1393Hip circumference adjusted for BMI7.000000e-09

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0004340body mass index
EFO:0008039BMI-adjusted hip circumference

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

57 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Estradiolaffects expression, affects cotreatment, increases expression, increases reaction4
bisphenol Adecreases expression, increases expression, affects cotreatment3
Benzo(a)pyreneincreases methylation, decreases expression, increases expression3
sodium arsenitedecreases expression, increases expression2
Cisplatinaffects expression, affects cotreatment, decreases expression2
Dexamethasoneincreases expression, affects cotreatment, decreases expression2
Valproic Acidaffects expression, increases expression2
Aflatoxin B1increases expression, increases methylation2
Particulate Matteraffects cotreatment, increases abundance, increases expression2
aristolochic acid Iincreases expression1
sotorasibaffects cotreatment, decreases expression1
triphenyl phosphateaffects expression1
propionaldehydeincreases expression1
lead acetateincreases expression1
trichostatin Aincreases expression1
tris(1,3-dichloro-2-propyl)phosphateincreases expression1
cobaltous chloridedecreases expression1
butyraldehydeincreases expression1
zinc chromatedecreases expression, increases abundance1
chromium hexavalent iondecreases expression, increases abundance1
perfluorooctane sulfonic acidincreases expression1
2,2’,4,4’,5-brominated diphenyl etherincreases expression1
abrineincreases expression1
ormosilincreases expression, affects binding1
2,2’,4,4’-tetrabromodiphenyl etherincreases expression1
bisphenol Sincreases expression1
jinfukangaffects cotreatment, decreases expression, increases expression1
trametinibdecreases expression, affects cotreatment1
(+)-JQ1 compounddecreases expression1
NVP-BKM120affects cotreatment, decreases expression1

Cellosaurus cell lines

1 cell lines: 1 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_D8W5Ubigene HCT 116 SPOCD1 KOCancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.