CST7

gene
On this page

Summary

CST7 (cystatin F, HGNC:2479) is a protein-coding gene on chromosome 20p11.21, encoding Cystatin-F (O76096). Inhibits papain and cathepsin L but with affinities lower than other cystatins.

The cystatin superfamily encompasses proteins that contain multiple cystatin-like sequences. Some of the members are active cysteine protease inhibitors, while others have lost or perhaps never acquired this inhibitory activity. There are three inhibitory families in the superfamily, including the type 1 cystatins (stefins), type 2 cystatins and the kininogens. The type 2 cystatin proteins are a class of cysteine proteinase inhibitors found in a variety of human fluids and secretions. This gene encodes a glycosylated cysteine protease inhibitor with a putative role in immune regulation through inhibition of a unique target in the hematopoietic system. Expression of the protein has been observed in various human cancer cell lines established from malignant tumors.

Source: NCBI Gene 8530 — RefSeq curated summary.

At a glance

  • GWAS associations: 4
  • Clinical variants (ClinVar): 22 total
  • MANE Select transcript: NM_003650

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:2479
Approved symbolCST7
Namecystatin F
Location20p11.21
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000077984
Ensembl biotypeprotein_coding
OMIM603253
Entrez8530

Gene structure

Transcript identifiers

Ensembl transcripts: 2 — 2 protein_coding

ENST00000480798, ENST00000874360

RefSeq mRNA: 1 — MANE Select: NM_003650 NM_003650

CCDS: CCDS13165

Canonical transcript exons

ENST00000480798 — 4 exons

ExonStartEnd
ENSE000006609592495728724957459
ENSE000006609602495892824959044
ENSE000014719132494926924949575
ENSE000018871472495963524959928

Expression profiles

Bgee: expression breadth ubiquitous, 178 present calls, max score 99.55.

FANTOM5 (CAGE): breadth broad, TPM avg 44.6158 / max 10994.0432, expressed in 489 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
18390931.5488467
18391013.0482358
1839110.01879

Top tissues by expression

283 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
granulocyteCL:000009499.55gold quality
bloodUBERON:000017897.97gold quality
bone marrowUBERON:000237197.91gold quality
bone marrow cellCL:000209297.46gold quality
trabecular bone tissueUBERON:000248397.35gold quality
spleenUBERON:000210696.53gold quality
right lungUBERON:000216793.69gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099190.86gold quality
upper lobe of left lungUBERON:000895290.66gold quality
upper lobe of lungUBERON:000894888.92gold quality
lymph nodeUBERON:000002988.53gold quality
gall bladderUBERON:000211086.80gold quality
vermiform appendixUBERON:000115485.24gold quality
omental fat padUBERON:001041484.88gold quality
periodontal ligamentUBERON:000826684.83gold quality
peritoneumUBERON:000235884.80gold quality
leukocyteCL:000073884.40gold quality
adipose tissue of abdominal regionUBERON:000780884.02gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047383.89gold quality
mononuclear cellCL:000084283.00gold quality
monocyteCL:000057682.57gold quality
left uterine tubeUBERON:000130381.97gold quality
right lobe of liverUBERON:000111481.63gold quality
lungUBERON:000204880.47gold quality
caecumUBERON:000115380.15gold quality
mucosa of transverse colonUBERON:000499179.83gold quality
superficial temporal arteryUBERON:000161479.45gold quality
small intestine Peyer’s patchUBERON:000345479.27gold quality
small intestineUBERON:000210877.10gold quality
right coronary arteryUBERON:000162576.45gold quality

Single-cell (SCXA)

Detected in 39 experiment(s), a significant marker in 34.

ExperimentMarker?Max mean expression
E-CURD-120yes2687.06
E-CURD-55yes2661.10
E-MTAB-8911yes2335.43
E-HCAD-15yes2297.99
E-CURD-122yes2279.15
E-GEOD-149689yes2187.66
E-MTAB-9467yes2093.90
E-MTAB-6653yes2088.54
E-MTAB-10553yes2073.86
E-CURD-77yes1977.24
E-GEOD-139324yes1791.04
E-MTAB-8207yes1773.96
E-HCAD-4yes1763.81
E-HCAD-1yes1627.28
E-CURD-95yes1626.96

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): TREM2

miRNA regulators (miRDB)

5 targeting CST7, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-590-3P99.9674.346478
HSA-MIR-10527-5P99.9172.283754
HSA-MIR-431699.3765.751360
HSA-MIR-426698.5367.291035
HSA-MIR-4793-5P96.8865.90872

Literature-anchored findings (GeneRIF, showing 13)

  • cystatin F is an intracellular cysteine peptidase inhibitor with readily regulated expression (PMID:12423348)
  • CST7 is secreted, but artificial modification of its C-terminus can induce its endocytic targeting. (PMID:15212960)
  • Cystatin F was not colocalized with cystatin C shich suggests distinct functions for these two cysteine protease inhibitors in U937 cells. (PMID:15752368)
  • analysis of reduction-dependent activation of human cystatin F (PMID:16601115)
  • A latent protease inhibitor which is itself regulated by proteolysis in the endocytic pathway. (PMID:18256700)
  • Data demonstrate the addition of N-linked sugars to an Asn-X-Cys motif in cystatin F and suggest that the mannose 6-phosphate sorting machinery is used to divert cystatin F from the secretory pathway and to mediate its uptake from extracellular pools. (PMID:19192250)
  • Transgenic cystatin F is expressed in model mice only during demyelination; in chronic demyelination, remyelinating ability is lost. (PMID:21344476)
  • Regulation of cathepsins S and L by cystatin F during maturation of dendritic cells. (PMID:22365146)
  • during monocyte to macrophage differentiation, the endosomal/lysosomal proteolytic activity can be regulated by cystatin F whose expression is under the control of transcriptional factor C/EBP alpha. (PMID:30033148)
  • Cystatin F was identified as the key subunit of family 2 cystatins in survival analysis. Pancreatic ductal adenocarcinoma (PDAC) patients who harbored a higher expression level of CST7 had a lower risk in overall survival. The prognostic nomogram indicated that the CST7 expression model effectively predicted the outcomes of patients with earlystage PDAC. (PMID:31059105)
  • Exploring the factors underlying remyelination arrest by studying the post-transcriptional regulatory mechanisms of cystatin F gene. (PMID:32947653)
  • Transcriptomic Profiling Identifies Neutrophil-Specific Upregulation of Cystatin F as a Marker of Acute Inflammation in Humans. (PMID:33868254)
  • Cystatin F acts as a mediator of immune suppression in glioblastoma. (PMID:34189679)

Cross-species orthologs

6 orthologs

OrganismSymbolGene ID
ENSDARG00000101340
mus_musculusCst7ENSMUSG00000068129
rattus_norvegicusCst7ENSRNOG00000006767
caenorhabditis_elegansWBGENE00000534
caenorhabditis_elegansWBGENE00000535
caenorhabditis_elegansWBGENE00023486

Paralogs (11): CST9L (ENSG00000101435), CST3 (ENSG00000101439), CST4 (ENSG00000101441), CST8 (ENSG00000125815), CSTL1 (ENSG00000125823), CST11 (ENSG00000125831), CST5 (ENSG00000170367), CST2 (ENSG00000170369), CST1 (ENSG00000170373), CST9 (ENSG00000173335), CST6 (ENSG00000175315)

Protein

Protein identifiers

Cystatin-FO76096 (reviewed: O76096)

Alternative names: Cystatin-7, Cystatin-like metastasis-associated protein, Leukocystatin

All UniProt accessions (1): O76096

UniProt curated annotations — full annotation on UniProt →

Function. Inhibits papain and cathepsin L but with affinities lower than other cystatins. May play a role in immune regulation through inhibition of a unique target in the hematopoietic system.

Subunit / interactions. Homodimer; disulfide-linked.

Subcellular location. Secreted. Cytoplasm.

Tissue specificity. Primarily expressed in peripheral blood cells and spleen.

Similarity. Belongs to the cystatin family.

RefSeq proteins (1): NP_003641* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000010Cystatin_domDomain
IPR042886Cystatin-FFamily
IPR046350Cystatin_sfHomologous_superfamily

Pfam: PF00031

UniProt features (17 total): disulfide bond 4, helix 3, strand 3, glycosylation site 2, signal peptide 1, chain 1, turn 1, short sequence motif 1, site 1

Structure

Experimental structures (PDB)

1 structures.

PDBMethodResolution (Å)
2CH9X-RAY DIFFRACTION2.1

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O76096-F189.780.77

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 37 (reactive site)

Disulfide bonds (4): 26, 63, 99–110, 124–144

Glycosylation sites (2): 62, 115

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 362 (showing top): GSE18804_BRAIN_VS_COLON_TUMORAL_MACROPHAGE_DN, GSE18804_SPLEEN_MACROPHAGE_VS_COLON_TUMORAL_MACROPHAGE_UP, GOBP_REGULATION_OF_CELL_ACTIVATION, VERHAAK_AML_WITH_NPM1_MUTATED_DN, MODULE_52, GOBP_INFLAMMATORY_RESPONSE, MODULE_151, MODULE_45, GOBP_MACROMOLECULE_CATABOLIC_PROCESS, GOBP_REGULATION_OF_MACROPHAGE_ACTIVATION, GOBP_REGULATION_OF_MYELINATION, GOBP_REGULATION_OF_NERVOUS_SYSTEM_DEVELOPMENT, MODULE_16, MODULE_118, MODULE_75

GO Biological Process (5): regulation of antigen processing and presentation (GO:0002577), immune response (GO:0006955), positive regulation of myelination (GO:0031643), negative regulation of microglial cell activation (GO:1903979), lysosomal protein catabolic process (GO:1905146)

GO Molecular Function (6): protease binding (GO:0002020), endopeptidase inhibitor activity (GO:0004866), cysteine-type endopeptidase inhibitor activity (GO:0004869), peptidase inhibitor activity (GO:0030414), protein homodimerization activity (GO:0042803), protein binding (GO:0005515)

GO Cellular Component (10): obsolete extracellular space (GO:0005615), lysosome (GO:0005764), endosome (GO:0005768), late endosome (GO:0005770), multivesicular body (GO:0005771), endoplasmic reticulum (GO:0005783), Golgi apparatus (GO:0005794), cytoplasmic vesicle (GO:0031410), extracellular region (GO:0005576), cytoplasm (GO:0005737)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
endomembrane system3
cytoplasm3
intracellular membrane-bounded organelle2
cellular anatomical structure2
regulation of immune system process1
antigen processing and presentation1
immune system process1
response to stimulus1
regulation of myelination1
positive regulation of nervous system process1
myelination1
positive regulation of cellular process1
microglial cell activation1
negative regulation of macrophage activation1
negative regulation of neuroinflammatory response1
regulation of microglial cell activation1
lysosome1
protein catabolic process in the vacuole1
enzyme binding1
endopeptidase activity1
peptidase inhibitor activity1
endopeptidase regulator activity1
cysteine-type endopeptidase activity1
endopeptidase inhibitor activity1
enzyme inhibitor activity1
peptidase activity1
peptidase regulator activity1
identical protein binding1
protein dimerization activity1
binding1
lytic vacuole1
cytoplasmic vesicle1
endosome1
late endosome1
intracellular vesicle1
intracellular anatomical structure1

Protein interactions and networks

STRING

1318 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CST7CTSLP07711913
CST7TMEM119Q4V9L6718
CST7P2RY12Q9H244701
CST7LGMNQ99538673
CST7CLEC7AQ9BXN2664
CST7TREM2Q9NZC2654
CST7TYROBPO43914651
CST7APOEP02649604
CST7CTSCP53634595
CST7PTPRCP08575524
CST7NKG7Q16617523
CST7CSTBP04080502
CST7FGL2Q14314491
CST7CTSSP25774487
CST7CX3CR1P49238475

IntAct

12 interactions, top by confidence:

ABTypeScore
TPT1CST7psi-mi:“MI:0914”(association)0.530
CST7CTSLpsi-mi:“MI:0914”(association)0.530
CST7H2BC14psi-mi:“MI:0915”(physical association)0.400
STK32ACST7psi-mi:“MI:0915”(physical association)0.400
CST7CTSCpsi-mi:“MI:0915”(physical association)0.400
CTSCCST7psi-mi:“MI:0915”(physical association)0.400
CST7psi-mi:“MI:0914”(association)0.350
ZCCHC17SHTN1psi-mi:“MI:0914”(association)0.350
CST7APODpsi-mi:“MI:0914”(association)0.350
CST7tktApsi-mi:“MI:0915”(physical association)0.000

BioGRID (19): POTEE (Affinity Capture-MS), TUBA3C (Affinity Capture-MS), ITIH3 (Affinity Capture-MS), GALC (Affinity Capture-MS), CTSL (Affinity Capture-MS), CST7 (Affinity Capture-MS), ATP2A3 (Affinity Capture-MS), CST7 (Affinity Capture-MS), HIST1H2BM (Proximity Label-MS), ITIH3 (Affinity Capture-MS), CST7 (Affinity Capture-MS), TUBA3C (Affinity Capture-MS), CST7 (Affinity Capture-MS), ATP2A3 (Affinity Capture-MS), CTSL (Affinity Capture-MS)

ESM2 similar proteins: A9QWQ1, O14625, O22969, O46675, O46676, O46677, O46678, O62657, O76096, O89098, O97919, P09341, P10147, P10855, P13236, P13501, P14097, P16619, P19875, P19876, P30782, P30882, P40224, P46632, P47854, P48061, P50229, P50230, P50231, P97272, Q17QA1, Q53S33, Q5EBF6, Q5I1Z0, Q68A92, Q68AZ0, Q711P4, Q8HYQ1, Q8HYQ2, Q8HYQ3

Diamond homologs: B1P1J3, B2Z450, E3P6N3, E3P6N4, E3P6N5, E3P6N6, E3P6N7, E3P6N8, E3P6N9, E3P6P0, E3P6P1, E3P6P2, E3P6P3, E3P6P4, J3RYX9, J3SE80, O19092, O19093, O76096, O97862, P01034, P01035, P01037, P01038, P08935, P09228, P0DXA0, P14841, P19313, P21460, P31726, P35481, P81061, P81714, P90698, Q10993, Q15828, Q2XXN5, Q6T6T4, Q7M429

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

22 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance18
Likely benign1
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

700 predictions. Top by Δscore:

VariantEffectΔscore
20:24957283:TCA:Tacceptor_loss1.0000
20:24957284:CA:Cacceptor_loss1.0000
20:24957285:A:AGacceptor_gain1.0000
20:24957285:AGATA:Aacceptor_loss1.0000
20:24957286:G:Aacceptor_loss1.0000
20:24957286:G:GAacceptor_gain1.0000
20:24957286:GAT:Gacceptor_gain1.0000
20:24957286:GATA:Gacceptor_gain1.0000
20:24957286:GATAC:Gacceptor_gain1.0000
20:24957456:TCAG:Tdonor_loss1.0000
20:24957457:CAGGT:Cdonor_loss1.0000
20:24957461:T:Gdonor_loss1.0000
20:24958924:CCA:Cacceptor_loss1.0000
20:24958925:CAG:Cacceptor_loss1.0000
20:24958926:A:AGacceptor_gain1.0000
20:24958926:AGAT:Aacceptor_loss1.0000
20:24958927:G:GAacceptor_gain1.0000
20:24958927:GA:Gacceptor_gain1.0000
20:24958927:GAT:Gacceptor_gain1.0000
20:24958927:GATA:Gacceptor_gain1.0000
20:24959041:GCAG:Gdonor_gain1.0000
20:24959042:CAG:Cdonor_loss1.0000
20:24959043:AGGTA:Adonor_loss1.0000
20:24959044:GG:Gdonor_loss1.0000
20:24959046:T:Adonor_loss1.0000
20:24957275:T:TAacceptor_gain0.9900
20:24957286:GA:Gacceptor_gain0.9900
20:24957386:TTGAA:Tdonor_gain0.9900
20:24959282:G:GTdonor_gain0.9900
20:24949573:CAGGT:Cdonor_loss0.9800

AlphaMissense

948 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:24957424:T:CF70L0.974
20:24957426:C:AF70L0.974
20:24957426:C:GF70L0.974
20:24957399:C:AN61K0.973
20:24957399:C:GN61K0.973
20:24957418:T:CF68L0.972
20:24957420:C:AF68L0.972
20:24957420:C:GF68L0.972
20:24957382:A:CS56R0.968
20:24957384:T:AS56R0.968
20:24957384:T:GS56R0.968
20:24959673:G:CW133C0.967
20:24959673:G:TW133C0.967
20:24959644:T:AC124S0.953
20:24959645:G:CC124S0.953
20:24958937:G:CG85R0.945
20:24959704:T:AC144S0.945
20:24959705:G:CC144S0.945
20:24958972:A:CR96S0.944
20:24958972:A:TR96S0.944
20:24958974:C:TT97I0.943
20:24957394:T:CF60L0.942
20:24957396:C:AF60L0.942
20:24957396:C:GF60L0.942
20:24959646:C:GC124W0.940
20:24959645:G:AC124Y0.938
20:24959012:T:AC110S0.937
20:24959013:G:CC110S0.937
20:24957398:A:TN61I0.936
20:24959644:T:CC124R0.935

dbSNP variants (sampled 300 via entrez): RS1000079209 (20:24947625 A>T), RS1000094451 (20:24954621 A>G), RS1000425101 (20:24952156 T>C), RS1000695595 (20:24955941 G>A,T), RS1000799465 (20:24957704 C>T), RS1001077965 (20:24948741 A>T), RS1001186907 (20:24957096 T>A,C,G), RS1001385050 (20:24947852 C>T), RS1001391292 (20:24952777 G>A), RS1001525728 (20:24949702 G>A,T), RS1001796428 (20:24958849 C>T), RS1001827566 (20:24959138 G>A,C), RS1001992390 (20:24953818 C>A,T), RS1002083702 (20:24949563 G>A,C), RS1002104406 (20:24956854 G>A)

Disease associations

OMIM: gene MIM:603253 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

4 associations (top):

StudyTraitp-value
GCST006585_375Blood protein levels5.000000e-302
GCST006585_509Blood protein levels8.000000e-28
GCST007328_59Alcohol consumption (drinks per week)5.000000e-08
GCST011741_63LDL cholesterol levels in HIV infection5.000000e-07

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004611low density lipoprotein cholesterol measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

19 total (human), top 19 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, increases expression, increases methylation3
Decitabineincreases expression2
Methotrexatedecreases expression, increases expression2
aristolochic acid Iincreases expression1
alpha phellandrenedecreases expression1
beta-lapachoneincreases expression1
di-n-butylphosphoric acidaffects expression1
CD 437decreases expression1
(+)-JQ1 compounddecreases expression1
Bortezomibincreases expression1
Acetaminophendecreases expression1
Amiodaroneincreases expression1
Benzoatesincreases expression1
Cyclophosphamidedecreases expression1
Fluorouracilaffects response to substance, decreases expression1
Nickelincreases expression1
Tretinoindecreases expression1
Mifepristonedecreases expression1
Antirheumatic Agentsdecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.