CST8

gene
On this page

Also known as CRESCTES5

Summary

CST8 (cystatin 8, HGNC:2480) is a protein-coding gene on chromosome 20p11.21, encoding Cystatin-8 (O60676). Performs a specialized role during sperm development and maturation.

The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions. The cystatin locus on chromosome 20 contains the majority of the type 2 cystatin genes and pseudogenes. This gene is located in the cystatin locus and encodes a protein similar to type 2 cystatins. The encoded protein exhibits highly tissue-specific expression in the reproductive tract, suggesting implicit roles in reproduction. Alternative splicing results in multiple transcript variants.

Source: NCBI Gene 10047 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 33 total
  • MANE Select transcript: NM_005492

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:2480
Approved symbolCST8
Namecystatin 8
Location20p11.21
Locus typegene with protein product
StatusApproved
AliasesCRES, CTES5
Ensembl geneENSG00000125815
Ensembl biotypeprotein_coding
OMIM608683
Entrez10047

Gene structure

Transcript identifiers

Ensembl transcripts: 2 — 2 protein_coding

ENST00000246012, ENST00000449810

RefSeq mRNA: 2 — MANE Select: NM_005492 NM_001281730, NM_005492

CCDS: CCDS13156

Canonical transcript exons

ENST00000246012 — 4 exons

ExonStartEnd
ENSE000008594362349295823493071
ENSE000008594372349152523491898
ENSE000010257762349111723491342
ENSE000011313772349583123496010

Expression profiles

Bgee: expression breadth broad, 21 present calls, max score 86.34.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0383 / max 42.4020, expressed in 3 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
1838640.03833

Top tissues by expression

265 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047386.34gold quality
right testisUBERON:000453483.61gold quality
left testisUBERON:000453383.40gold quality
buccal mucosa cellCL:000233682.88silver quality
testisUBERON:000047380.54gold quality
spermCL:000001979.03silver quality
male germ cellCL:000001578.55silver quality
hair follicleUBERON:000207373.44gold quality
tongue squamous epitheliumUBERON:000691968.71gold quality
diaphragmUBERON:000110368.61gold quality
tibialis anteriorUBERON:000138566.59silver quality
olfactory bulbUBERON:000226465.42gold quality
type B pancreatic cellCL:000016965.34gold quality
cervix squamous epitheliumUBERON:000692264.96gold quality
gingival epitheliumUBERON:000194964.47gold quality
squamous epitheliumUBERON:000691464.12gold quality
amniotic fluidUBERON:000017363.66gold quality
epithelial cell of pancreasCL:000008362.63gold quality
CA1 field of hippocampusUBERON:000388162.56gold quality
deltoidUBERON:000147662.20gold quality
esophagus squamous epitheliumUBERON:000692061.83gold quality
choroid plexus epitheliumUBERON:000391161.69gold quality
gingivaUBERON:000182861.28gold quality
thymusUBERON:000237061.06gold quality
epithelium of esophagusUBERON:000197660.73gold quality
adult organismUBERON:000702360.45gold quality
mucosa of paranasal sinusUBERON:000503060.33gold quality
quadriceps femorisUBERON:000137760.23gold quality
cerebellar vermisUBERON:000472060.16gold quality
orbitofrontal cortexUBERON:000416759.88gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no2.30

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): CEBPB

miRNA regulators (miRDB)

8 targeting CST8, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-1250-3P99.9670.044038
HSA-MIR-6835-3P99.9370.492904
HSA-MIR-153-5P99.8973.866317
HSA-MIR-1212299.5669.331672
HSA-MIR-126198.6268.10896
HSA-MIR-397798.0068.171500
HSA-MIR-6515-5P97.0865.481219
HSA-MIR-490-5P96.7565.81661

Literature-anchored findings (GeneRIF, showing 2)

  • studies show that cystatin-related epididymal spermatogenic protein is present in the sperm equatorial segment and becomes accessible to the extracellular environment during fertilization (PMID:12193387)
  • We show with phylogenetic relations that the CTES genes are clustered into three original groups, a testatin, a Cres, and a CstL1 group (PMID:20565543)

Cross-species orthologs

6 orthologs

OrganismSymbolGene ID
danio_reriosi:busm1-57f23.1ENSDARG00000074425
mus_musculusCst8ENSMUSG00000027442
rattus_norvegicusCst8ENSRNOG00000004989
caenorhabditis_elegansWBGENE00000534
caenorhabditis_elegansWBGENE00000535
caenorhabditis_elegansWBGENE00023486

Paralogs (11): CST7 (ENSG00000077984), CST9L (ENSG00000101435), CST3 (ENSG00000101439), CST4 (ENSG00000101441), CSTL1 (ENSG00000125823), CST11 (ENSG00000125831), CST5 (ENSG00000170367), CST2 (ENSG00000170369), CST1 (ENSG00000170373), CST9 (ENSG00000173335), CST6 (ENSG00000175315)

Protein

Protein identifiers

Cystatin-8O60676 (reviewed: O60676)

Alternative names: Cystatin-related epididymal spermatogenic protein

All UniProt accessions (3): O60676, A0A384MDQ4, A2A2N0

UniProt curated annotations — full annotation on UniProt →

Function. Performs a specialized role during sperm development and maturation.

Subcellular location. Secreted.

Tissue specificity. Proximal caput region of the epididymis. Lower expression in the testis. Within the testis it is localized to the elongating spermatids, whereas within the epididymis it is exclusively synthesized by the proximal caput epithelium.

Similarity. Belongs to the cystatin family.

RefSeq proteins (2): NP_001268659, NP_005483* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000010Cystatin_domDomain
IPR046350Cystatin_sfHomologous_superfamily
IPR052691Sperm_Mat_CystatinFamily

Pfam: PF00031

UniProt features (9 total): glycosylation site 2, disulfide bond 2, sequence variant 2, signal peptide 1, chain 1, short sequence motif 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O60676-F187.120.70

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Disulfide bonds (2): 95–105, 119–139

Glycosylation sites (2): 27, 39

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 48 (showing top): GOCC_CELL_SURFACE, WEBER_METHYLATED_LCP_IN_SPERM_UP, GOMF_PEPTIDASE_REGULATOR_ACTIVITY, chr20p11, GOMF_ENZYME_INHIBITOR_ACTIVITY, GOMF_ENZYME_REGULATOR_ACTIVITY, YOSHIMURA_MAPK8_TARGETS_UP, GOMF_CYSTEINE_TYPE_ENDOPEPTIDASE_INHIBITOR_ACTIVITY, GOMF_ENDOPEPTIDASE_REGULATOR_ACTIVITY, FOXN3_TARGET_GENES, SIPA1_TARGET_GENES, MIR1250_3P, MIR3977, MIR490_5P, NABA_ECM_REGULATORS

GO Biological Process (0):

GO Molecular Function (2): cysteine-type endopeptidase inhibitor activity (GO:0004869), peptidase inhibitor activity (GO:0030414)

GO Cellular Component (3): extracellular region (GO:0005576), cytoplasm (GO:0005737), cell surface (GO:0009986)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure3
cysteine-type endopeptidase activity1
endopeptidase inhibitor activity1
enzyme inhibitor activity1
peptidase activity1
peptidase regulator activity1
intracellular anatomical structure1

Protein interactions and networks

STRING

737 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CST8CST9Q5W186753
CST8CTPS1P17812558
CST8GATA1P15976549
CST8CEBPAP49715492
CST8LCN8Q6JVE9483
CST8GZF1Q9H116456
CST8GHSRQ92847411
CST8NXT1Q9UKK6408
CST8TMEM167BQ9NRX6384
CST8ACSS1Q9NUB1372
CST8CST9LQ9H4G1371
CST8RILPL1Q5EBL4368
CST8TMEM225Q6GV28367
CST8GALNT9Q9HCQ5366
CST8ECSCRQ19T08364

IntAct

10 interactions, top by confidence:

ABTypeScore
FAM9CNDC80psi-mi:“MI:0914”(association)0.670
IQCF2TRAFD1psi-mi:“MI:0914”(association)0.350
SPO16CST8psi-mi:“MI:0914”(association)0.350
MGST2CST8psi-mi:“MI:0914”(association)0.350
RGS5CST8psi-mi:“MI:0914”(association)0.350
CDKN3CST8psi-mi:“MI:0914”(association)0.350
CST8ITIH2psi-mi:“MI:0914”(association)0.350
CST8HS3ST1psi-mi:“MI:0914”(association)0.350
CST8VGFpsi-mi:“MI:0914”(association)0.350

BioGRID (44): ARMC8 (Affinity Capture-MS), CST8 (Affinity Capture-MS), LAMA3 (Affinity Capture-MS), TUBB8 (Affinity Capture-MS), TUBA1B (Affinity Capture-MS), CST8 (Affinity Capture-MS), B4GALT6 (Affinity Capture-MS), MKLN1 (Affinity Capture-MS), CST8 (Affinity Capture-MS), SPCS3 (Affinity Capture-MS), HEATR6 (Affinity Capture-MS), DDX19B (Affinity Capture-MS), KHK (Affinity Capture-MS), MAEA (Affinity Capture-MS), PXDN (Affinity Capture-MS)

ESM2 similar proteins: A0A0K0IP23, A0A1S3PBB7, A0A224AHH8, A0A3S6I186, A0A5C1J0Z8, A0A6B9KZ52, A1L015, A1L017, B1P1J3, B2Z450, B7PKZ1, B7PKZ2, D0NBV1, D0NBV4, E3P6N5, E3P6N6, E3P6N7, E3P6N9, E3P6P0, E3P6P1, E3P6P2, E3P6P4, J3RYX9, O60676, O88969, P01048, P08932, P08935, P0DXA0, P19313, P22085, P23779, P28325, P32766, P35481, P81714, P90698, Q15828, Q2XXN5, Q331K1

Diamond homologs: B2Z450, E3P6N3, E3P6N4, E3P6N5, E3P6N6, E3P6N7, E3P6N8, E3P6N9, E3P6P0, E3P6P1, E3P6P2, E3P6P3, E3P6P4, J3RYX9, J3SE80, O19092, O19093, O60676, O76096, O88969, O89098, O97862, P01034, P01035, P01036, P01037, P01038, P08935, P09228, P14841, P19313, P21460, P22085, P28325, P32766, P35481, P81061, P81714, P90698, Q15828

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

33 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance29
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

405 predictions. Top by Δscore:

VariantEffectΔscore
20:23493070:GG:Gdonor_gain0.9900
20:23493071:GG:Gdonor_gain0.9900
20:23491897:AGGTA:Adonor_loss0.9800
20:23491900:T:Cdonor_loss0.9800
20:23495829:A:AGacceptor_gain0.9800
20:23495830:G:GGacceptor_gain0.9800
20:23495830:GA:Gacceptor_gain0.9800
20:23495830:GAA:Gacceptor_gain0.9800
20:23493071:GGTA:Gdonor_loss0.9700
20:23495827:TCA:Tacceptor_loss0.9700
20:23495829:A:Cacceptor_loss0.9700
20:23495830:G:GAacceptor_loss0.9700
20:23491856:G:GGdonor_gain0.9600
20:23492952:CAACA:Cacceptor_loss0.9500
20:23492953:AACAG:Aacceptor_loss0.9500
20:23492954:ACAG:Aacceptor_loss0.9500
20:23492955:CAGG:Cacceptor_loss0.9500
20:23492956:AGGT:Aacceptor_loss0.9500
20:23495757:G:GTdonor_gain0.9500
20:23495830:GAAA:Gacceptor_gain0.9400
20:23495830:GAAAT:Gacceptor_gain0.9300
20:23493076:G:Cdonor_loss0.9100
20:23491855:A:AGdonor_gain0.9000
20:23491871:G:GAdonor_gain0.9000
20:23491204:G:GTdonor_gain0.8800
20:23491340:GGG:Gdonor_gain0.8700
20:23491341:GGG:Gdonor_gain0.8700
20:23493075:G:GCdonor_loss0.8700
20:23491203:GGGA:Gdonor_gain0.8600
20:23491870:TGGTC:Tdonor_gain0.8600

AlphaMissense

933 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:23495879:T:CF132L0.978
20:23495881:C:AF132L0.978
20:23495881:C:GF132L0.978
20:23491863:T:CF66L0.977
20:23491865:C:AF66L0.977
20:23491865:C:GF66L0.977
20:23491811:T:GC48W0.975
20:23491818:T:CF51L0.974
20:23491820:T:AF51L0.974
20:23491820:T:GF51L0.974
20:23495869:G:CW128C0.972
20:23495869:G:TW128C0.972
20:23491822:C:AA52D0.968
20:23495847:T:GF121C0.963
20:23493003:A:CS93R0.961
20:23493005:C:AS93R0.961
20:23493005:C:GS93R0.961
20:23495846:T:CF121L0.960
20:23495847:T:CF121S0.960
20:23495848:T:AF121L0.960
20:23495848:T:GF121L0.960
20:23492977:A:CY84S0.955
20:23491809:T:CC48R0.951
20:23495840:T:AC119S0.951
20:23495841:G:CC119S0.951
20:23492976:T:GY84D0.947
20:23492997:G:CA91P0.947
20:23495867:T:AW128R0.947
20:23495867:T:CW128R0.947
20:23492976:T:CY84H0.943

dbSNP variants (sampled 300 via entrez): RS1000021993 (20:23500007 A>G,T), RS1000143913 (20:23498122 T>G), RS1000196691 (20:23498446 C>T), RS1000276092 (20:23505161 T>G), RS1000904377 (20:23494971 C>A,T), RS1001012619 (20:23489531 A>G), RS1001090710 (20:23501887 C>A,T), RS1001438598 (20:23501736 T>C), RS1001794715 (20:23507096 C>A,T), RS1001988261 (20:23504003 A>G), RS1002018846 (20:23497497 CT>C), RS1002069780 (20:23497751 T>C), RS1002354260 (20:23490654 G>A), RS1002708337 (20:23506722 C>T), RS1002762122 (20:23489939 C>T)

Disease associations

OMIM: gene MIM:608683 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

4 total (human), top 4 by PubMed support.

ChemicalActions (top 5)PubMed papers
CGP 52608affects binding, increases reaction1
Acetaminophenincreases expression1
Benzo(a)pyreneincreases methylation1
Fluorouracilaffects expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.