STPG4

gene
On this page

Also known as FLJ40172

Summary

STPG4 (sperm-tail PG-rich repeat containing 4, HGNC:26850) is a protein-coding gene on chromosome 2p21, encoding Protein STPG4 (Q8N801). Maternal factor that plays a role in epigenetic chromatin reprogramming during early development of the zygote.

Predicted to enable chromatin binding activity and histone binding activity. Predicted to be involved in epigenetic programing of male pronucleus. Predicted to act upstream of or within pyrimidine-containing compound catabolic process. Predicted to be located in cytoplasm and nucleus. Predicted to be active in female pronucleus; germinal vesicle; and male pronucleus.

Source: NCBI Gene 285051 — RefSeq curated summary.

At a glance

  • GWAS associations: 1
  • Clinical variants (ClinVar): 2 total
  • MANE Select transcript: NM_001163561

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26850
Approved symbolSTPG4
Namesperm-tail PG-rich repeat containing 4
Location2p21
Locus typegene with protein product
StatusApproved
AliasesFLJ40172
Ensembl geneENSG00000239605
Ensembl biotypeprotein_coding
Entrez285051

Gene structure

Transcript identifiers

Ensembl transcripts: 6 — 3 protein_coding, 3 protein_coding_CDS_not_defined

ENST00000294947, ENST00000445927, ENST00000449846, ENST00000464527, ENST00000482548, ENST00000496939

RefSeq mRNA: 2 — MANE Select: NM_001163561 NM_001163561, NM_173649

CCDS: CCDS1831, CCDS54356

Canonical transcript exons

ENST00000445927 — 7 exons

ExonStartEnd
ENSE000017362024715517147155308
ENSE000038910504712994147129995
ENSE000038912344715295747153016
ENSE000038914474713019647130260
ENSE000038937674709027047090374
ENSE000038941404708699147087130
ENSE000038948284715125847151515

Expression profiles

Bgee: expression breadth ubiquitous, 130 present calls, max score 81.47.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.1076 / max 34.7335, expressed in 33 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
282110.037518
282100.030816
282120.03033
282130.00903

Top tissues by expression

134 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099181.47gold quality
left testisUBERON:000453380.54gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047380.00gold quality
testisUBERON:000047379.88gold quality
right testisUBERON:000453479.04gold quality
monocyteCL:000057663.65gold quality
leukocyteCL:000073862.98gold quality
granulocyteCL:000009462.96gold quality
bloodUBERON:000017857.61gold quality
vaginaUBERON:000099657.61gold quality
smooth muscle tissueUBERON:000113556.78gold quality
ectocervixUBERON:001224955.59gold quality
skin of legUBERON:000151155.34gold quality
endocervixUBERON:000045854.97gold quality
zone of skinUBERON:000001454.78gold quality
uterine cervixUBERON:000000254.67gold quality
corpus callosumUBERON:000233654.65gold quality
urinary bladderUBERON:000125554.31gold quality
skin of abdomenUBERON:000141653.97gold quality
tonsilUBERON:000237253.38gold quality
mucosa of stomachUBERON:000119953.16gold quality
bone marrowUBERON:000237151.87gold quality
cortical plateUBERON:000534351.34gold quality
gall bladderUBERON:000211051.06gold quality
muscle layer of sigmoid colonUBERON:003580550.80gold quality
tibial nerveUBERON:000132350.65gold quality
spleenUBERON:000210649.36gold quality
right adrenal glandUBERON:000123349.29gold quality
sural nerveUBERON:001548849.09silver quality
bone marrow cellCL:000209248.96gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no1.37

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

42 targeting STPG4, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6873-3P100.0071.422626
HSA-MIR-7110-3P100.0073.182486
HSA-MIR-5193100.0067.261744
HSA-MIR-10401-5P99.9965.79948
HSA-MIR-4650-5P99.9864.69999
HSA-MIR-6845-3P99.9466.881439
HSA-MIR-4753-3P99.9071.033786
HSA-MIR-990299.8969.152250
HSA-MIR-659-3P99.8570.691620
HSA-MIR-6817-3P99.7968.352126
HSA-MIR-6892-3P99.6866.401178
HSA-MIR-6762-3P99.6666.941188
HSA-MIR-6512-3P99.6566.071468
HSA-MIR-6720-5P99.6566.221459
HSA-MIR-469699.4867.481040
HSA-MIR-127599.4767.902749
HSA-MIR-57899.4668.361787
HSA-MIR-391199.3866.951087
HSA-MIR-427999.1966.702437
HSA-MIR-66199.0965.942062
HSA-MIR-320A-5P98.8866.751248
HSA-MIR-3124-3P98.8768.952123
HSA-MIR-475298.7168.04833
HSA-MIR-126198.6268.10896
HSA-MIR-3691-5P98.6265.88552
HSA-MIR-376B-5P98.4666.40606
HSA-MIR-376C-5P98.4666.64589
HSA-MIR-138-5P98.4370.491292
HSA-MIR-58198.3967.42835
HSA-MIR-147A98.3366.40795

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusStpg4ENSMUSG00000036557
rattus_norvegicusStpg4ENSRNOG00000040136

Protein

Protein identifiers

Protein STPG4Q8N801 (reviewed: Q8N801)

Alternative names: Gonad-specific expression gene protein, Sperm-tail PG-rich repeat-containing protein 4

All UniProt accessions (2): Q8N801, H7C3W9

UniProt curated annotations — full annotation on UniProt →

Function. Maternal factor that plays a role in epigenetic chromatin reprogramming during early development of the zygote. Involved in the regulation of gametic DNA demethylation by inducing the conversion of the modified genomic base 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC).

Subunit / interactions. Interacts with histone H3. Interacts with histone H4.

Subcellular location. Cytoplasm. Nucleus.

Isoforms (2)

UniProt IDNamesCanonical?
Q8N801-11yes
Q8N801-22

RefSeq proteins (2): NP_001157033, NP_775920 (=MANE)

Domains & families (InterPro)

IDNameType
IPR010736SHIPPO-rptRepeat

Pfam: PF07004

UniProt features (4 total): sequence variant 2, chain 1, splice variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8N801-F167.290.01

Function

Pathways and Gene Ontology

Reactome pathways

3 pathways

IDPathway
R-HSA-9821002Chromatin modifications during the maternal to zygotic transition (MZT)
R-HSA-1266738Developmental Biology
R-HSA-9816359Maternal to zygotic transition (MZT)

MSigDB gene sets: 46 (showing top): GOBP_PYRIMIDINE_CONTAINING_COMPOUND_CATABOLIC_PROCESS, GOBP_PYRIMIDINE_CONTAINING_COMPOUND_METABOLIC_PROCESS, GOBP_CHROMATIN_REMODELING, GOBP_EPIGENETIC_REGULATION_OF_GENE_EXPRESSION, GOMF_CHROMATIN_BINDING, GOCC_GERM_CELL_NUCLEUS, GOCC_PRONUCLEUS, GOMF_HISTONE_BINDING, HEIDENBLAD_AMPLICON_8Q24_UP, GOCC_FEMALE_PRONUCLEUS, GOCC_MALE_PRONUCLEUS, ZWANG_TRANSIENTLY_UP_BY_2ND_EGF_PULSE_ONLY, FEV_TARGET_GENES, FOXD2_TARGET_GENES, LHX9_TARGET_GENES

GO Biological Process (3): epigenetic programing of male pronucleus (GO:0044727), pyrimidine-containing compound catabolic process (GO:0072529), chromatin organization (GO:0006325)

GO Molecular Function (3): chromatin binding (GO:0003682), histone binding (GO:0042393), protein binding (GO:0005515)

GO Cellular Component (7): male germ cell nucleus (GO:0001673), female pronucleus (GO:0001939), male pronucleus (GO:0001940), nucleus (GO:0005634), cytoplasm (GO:0005737), germinal vesicle (GO:0042585), female germ cell nucleus (GO:0001674)

Reactome top-level categories

Rollup of top-2 pathways:

CategoryPathways
Maternal to zygotic transition (MZT)1
Developmental Biology1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
binding2
germ cell nucleus2
pronucleus2
epigenetic programming in the zygotic pronuclei1
catabolic process1
pyrimidine-containing compound metabolic process1
cellular component organization1
protein binding1
intracellular membrane-bounded organelle1
intracellular anatomical structure1
cellular anatomical structure1
female germ cell nucleus1

Protein interactions and networks

STRING

120 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
STPG4TMEM247A6NEH6541
STPG4ANKRD30BLA7E2S9528
STPG4MANBALQ9NQG1506
STPG4PPP1R21Q6ZMI0477
STPG4CREG2Q8IUH2472
STPG4KCNK12Q9HB15472
STPG4GTF2A1LQ9UNN4472
STPG4STON1Q9Y6Q2447
STPG4Q53S48Q53S48445
STPG4MARCHF10Q8NA82430
STPG4ZBTB5O15062419
STPG4TLCD3AQ8TBR7417
STPG4ENTREP1Q15884413
STPG4SEC31BQ9NQW1400
STPG4NSMCE1Q8WV22391

IntAct

2 interactions, top by confidence:

ABTypeScore
TLK2PES1psi-mi:“MI:2364”(proximity)0.270

BioGRID (4): C2orf61 (Affinity Capture-MS), C2orf61 (Affinity Capture-RNA), C2orf61 (Two-hybrid), C2orf61 (Co-fractionation)

ESM2 similar proteins: A1A5R9, A2AVQ5, A2RRW4, A2RSX4, A5WUY6, A9JS51, B0BM24, B0S4Q5, B1H283, D4A039, F1MMV1, Q08BC4, Q0VB26, Q2T9Q3, Q2TA11, Q2TBS4, Q3KQ80, Q3SX64, Q3TZ65, Q3ZCV2, Q4KKZ1, Q4KLY8, Q4R5Y0, Q5PQN4, Q5RAF2, Q5XI56, Q6ZN84, Q810P2, Q8BUG5, Q8C008, Q8C8J0, Q8IXM7, Q8N412, Q8N6G2, Q8N7U6, Q8N801, Q8TCI5, Q8WW14, Q9D067, Q9D131

Diamond homologs: Q8N801, Q9DAG5

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

2 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance1
Likely benign1
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

2119 predictions. Top by Δscore:

VariantEffectΔscore
2:47046313:A:AGacceptor_gain1.0000
2:47046313:AGCCT:Aacceptor_gain1.0000
2:47046314:G:GAacceptor_gain1.0000
2:47046314:GC:Gacceptor_gain1.0000
2:47046314:GCC:Gacceptor_gain1.0000
2:47046314:GCCT:Gacceptor_gain1.0000
2:47046314:GCCTG:Gacceptor_gain1.0000
2:47046429:GGG:Gdonor_gain1.0000
2:47046430:GG:Gdonor_gain1.0000
2:47046430:GGG:Gdonor_gain1.0000
2:47046430:GGGT:Gdonor_loss1.0000
2:47046431:GG:Gdonor_gain1.0000
2:47046432:G:Adonor_loss1.0000
2:47046432:G:GGdonor_gain1.0000
2:47046433:T:Gdonor_loss1.0000
2:47051876:GGCTG:Gdonor_gain1.0000
2:47051877:GCTGG:Gdonor_gain1.0000
2:47060764:T:TAacceptor_gain1.0000
2:47060764:TGCA:Tacceptor_loss1.0000
2:47060765:GCA:Gacceptor_loss1.0000
2:47060766:CA:Cacceptor_loss1.0000
2:47060767:A:AGacceptor_gain1.0000
2:47060767:A:Tacceptor_loss1.0000
2:47060767:AGCT:Aacceptor_gain1.0000
2:47060768:G:GTacceptor_gain1.0000
2:47060768:GC:Gacceptor_gain1.0000
2:47060768:GCT:Gacceptor_gain1.0000
2:47060768:GCTG:Gacceptor_gain1.0000
2:47060768:GCTGA:Gacceptor_gain1.0000
2:47060967:GCCTG:Gdonor_gain1.0000

AlphaMissense

1606 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:47090306:A:CF196L0.976
2:47090306:A:TF196L0.976
2:47090308:A:GF196L0.976
2:47151432:A:CF75L0.976
2:47151432:A:TF75L0.976
2:47151434:A:GF75L0.976
2:47151417:C:AR80S0.975
2:47151417:C:GR80S0.975
2:47129983:A:CF159L0.974
2:47129983:A:TF159L0.974
2:47129985:A:GF159L0.974
2:47152974:A:GW42R0.971
2:47152974:A:TW42R0.971
2:47151306:G:CF117L0.966
2:47151306:G:TF117L0.966
2:47151308:A:GF117L0.966
2:47151433:A:GF75S0.962
2:47152977:A:GW41R0.956
2:47152977:A:TW41R0.956
2:47090307:A:GF196S0.952
2:47151477:A:CF60L0.952
2:47151477:A:TF60L0.952
2:47151479:A:GF60L0.952
2:47129984:A:GF159S0.950
2:47152972:C:AW42C0.950
2:47152972:C:GW42C0.950
2:47151418:C:GR80T0.947
2:47151429:T:AK76N0.946
2:47151429:T:GK76N0.946
2:47129965:T:AR165S0.945

dbSNP variants (sampled 300 via entrez): RS1000000482 (2:47087805 T>C,G), RS1000068033 (2:47129037 G>T), RS1000074800 (2:47089294 T>C), RS1000082160 (2:47149482 A>G), RS1000093617 (2:47095877 G>C,T), RS1000124891 (2:47096077 T>C), RS1000199774 (2:47109802 T>C), RS1000208697 (2:47154976 T>C,G), RS1000216965 (2:47143333 A>C), RS1000264552 (2:47091516 A>C), RS1000273684 (2:47087460 G>GC), RS1000331975 (2:47106005 T>C), RS1000428397 (2:47114650 G>A), RS1000440819 (2:47095482 C>G,T), RS1000486179 (2:47123668 C>T)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (1): Lynch syndrome (MONDO:0005835)

Orphanet (1): Lynch syndrome (Orphanet:144)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 associations (top):

StudyTraitp-value
GCST000635_23Response to statin therapy2.000000e-06

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

19 total (human), top 19 by PubMed support.

ChemicalActions (top 5)PubMed papers
aristolochic acid Iincreases expression1
propionaldehydeincreases methylation1
nonanalincreases methylation1
n-hexanalincreases methylation1
trichostatin Aaffects expression1
beta-lapachonedecreases expression1
sodium arseniteincreases expression1
butyraldehydeincreases methylation1
benzo(e)pyreneincreases methylation1
caprylic aldehydeincreases methylation1
pentanalincreases methylation1
heptanalincreases methylation1
Grape Seed Proanthocyanidinsaffects cotreatment, increases expression1
Resveratrolaffects cotreatment, decreases expression1
Catechinaffects cotreatment, increases expression1
Copperaffects cotreatment, decreases expression1
Methapyrileneincreases methylation1
Smokedecreases expression1
Aflatoxin B1increases methylation1

Clinical trials (associated diseases)

106 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00566644PHASE3TERMINATEDIntrauterine Levonorgestrel and Observation or Observation Alone in Preventing Atypical Endometrial Hyperplasia and Endometrial Cancer in Women With Hereditary Non-Polyposis Colorectal Cancer or Lynch Syndrome
NCT02000089PHASE3RECRUITINGThe Cancer of the Pancreas Screening-5 CAPS5)Study
NCT02813824PHASE3ACTIVE_NOT_RECRUITINGEffect of Chemoprevention by Low-dose Aspirin of New or Recurrent Colorectal Adenomas in Patients With Lynch Syndrome
NCT02912559PHASE3ACTIVE_NOT_RECRUITINGCombination Chemotherapy With or Without Atezolizumab in Treating Patients With Stage III Colon Cancer and Deficient DNA Mismatch Repair
NCT04711434PHASE3UNKNOWNPD-1 Antibody for The Prevention of Adenomatous Polyps and Second Primary Tumors in Lynch Syndrome Patients
NCT07609901PHASE3NOT_YET_RECRUITINGPreventive Dendritic Cell Vaccination for Lynch Syndrome Carriers
NCT03631641PHASE2TERMINATEDNivolumab in Preventing Colon Adenomas in Participants With Lynch Syndrome and a History of Partial Colectomy
NCT03831698PHASE2UNKNOWNOmega 3 Fatty Acids in Colorectal Cancer (CRC) Prevention in Patients With Lynch Syndrome (COLYNE)
NCT04920149PHASE2RECRUITINGMesalamine for Colorectal Cancer Prevention Program in Lynch Syndrome
NCT05411718PHASE2RECRUITINGA Phase IIa Randomized, Double-Blinded Clinical Trial of Naproxen or Aspirin for Cancer Immune Interception in Lynch Syndrome
NCT05419011PHASE2ACTIVE_NOT_RECRUITINGTesting a Combination of Vaccines for Cancer Prevention in Lynch Syndrome
NCT02052908PHASE1COMPLETEDNaproxen in Preventing DNA Mismatch Repair Deficient Colorectal Cancer in Patients With Lynch Syndrome
NCT02359565PHASE1ACTIVE_NOT_RECRUITINGPembrolizumab in Treating Younger Patients With Recurrent, Progressive, or Refractory High-Grade Gliomas, Diffuse Intrinsic Pontine Gliomas, Hypermutated Brain Tumors, Ependymoma or Medulloblastoma
NCT04500548PHASE1WITHDRAWNTesting the Combination of Two Immunotherapy Drugs (Nivolumab and Ipilimumab) in Children, Adolescent, and Young Adult Patients With Relapsed/Refractory Cancers That Have an Increased Number of Genetic Changes, The 3CI Study
NCT07163403PHASE1RECRUITINGFirst in Human Pilot Study to Assess the Safety and Efficacy of Dendritic Cells Loaded With Frameshift Derived Neopeptides for the Prevention of Cancer in of Lynch Syndrome Carriers
NCT05078866PHASE1/PHASE2ACTIVE_NOT_RECRUITINGCancer Preventive Vaccine Nous-209 for Lynch Syndrome Patients
NCT07412197PHASE1/PHASE2NOT_YET_RECRUITINGPreventive Dendritic Cell Vaccination for Lynch Syndrome
NCT00898768EARLY_PHASE1COMPLETEDCapsule Endoscopy to Screen for Small Bowel Neoplasia in Lynch Syndrome
NCT03832985EARLY_PHASE1COMPLETEDPediatric Reporting of Adult-Onset Genomic Results
NCT04379999EARLY_PHASE1COMPLETEDAtorvastatin ± Aspirin in Lynch Syndrome Syndrome
NCT04906382EARLY_PHASE1TERMINATEDTislelizumab for the Treatment of Recurrent Mismatch Repair Deficient Endometrial Cancer
NCT00508573Not specifiedCOMPLETEDRegistry for Women Who Are At Risk Or May Have Lynch Syndrome
NCT00582296Not specifiedACTIVE_NOT_RECRUITINGMulti-Organ Screening Recommendations in Patients With Lynch Syndrome
NCT00609505Not specifiedCOMPLETEDTelemedicine vs. Face-to-Face Cancer Genetic Counseling
NCT00633607Not specifiedCOMPLETEDHereditary Colorectal and Associated Tumor Registry Study
NCT00905710Not specifiedCOMPLETEDChromoendoscopy to Decrease the Risk of Colorectal Neoplasia in Lynch Syndrome
NCT00927680Not specifiedUNKNOWNFamilial Colorectal Cancer Registry in Hispanics
NCT01199250Not specifiedWITHDRAWNBiomarkers in Samples From Patients With Endometrial Cancer
NCT01216930Not specifiedCOMPLETEDMolecular Screening for Lynch Syndrome in Southern Denmark
NCT01582841Not specifiedCOMPLETEDIntegrating Genetic Testing for Lynch Syndrome in a Managed Care Setting
NCT01823471Not specifiedCOMPLETEDI-Scan For Colon Polyp Detection In HNPCC
NCT01845753Not specifiedCOMPLETEDMolecular Screening for Lynch Syndrome in Denmark
NCT01850654Not specifiedCOMPLETEDOhio Colorectal Cancer Prevention Initiative
NCT02012699Not specifiedRECRUITINGIntegrated Cancer Repository for Cancer Research
NCT02053805Not specifiedUNKNOWNProstate Cancer Screening Among Men With High Risk Genetic Predisposition
NCT02194387Not specifiedACTIVE_NOT_RECRUITINGEnergy Balance Interventions in Increasing Physical Activity in Breast Cancer Gene Positive Patients, Lynch Syndrome-Positive Patients, CLL Survivors or High-Risk Family Members
NCT02198092Not specifiedCOMPLETEDPreliminary Evaluation of Septin9 in Patients With Hereditary Colon Cancer Syndromes
NCT02206360Not specifiedACTIVE_NOT_RECRUITINGPancreatic Cancer Early Detection Program
NCT02371135Not specifiedACTIVE_NOT_RECRUITINGMetagenomic Evaluation of the Gut Microbiome in Patients With Lynch Syndrome and Other Hereditary Colonic Polyposis Syndromes
NCT02494791Not specifiedUNKNOWNUniversal Screening for Lynch Syndrome in Women With Endometrial and Non-Serous Ovarian Cancer
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): Lynch syndrome