CTSO

gene
On this page

Summary

CTSO (cathepsin O, HGNC:2542) is a protein-coding gene on chromosome 4q32.1, encoding Cathepsin O (P43234). Proteolytic enzyme possibly involved in normal cellular protein degradation and turnover.

The protein encoded by the gene is a cysteine proteinase and a member of the papain superfamily. This proteolytic enzyme is involved in cellular protein degradation and turnover. The recombinant form of this enzyme was shown to degrade synthetic peptides typically used as substrates for cysteine proteinases and its proteolytic activity was abolished by an inhibitor of cyteine proteinase.

Source: NCBI Gene 1519 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 56 total
  • MANE Select transcript: NM_001334

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:2542
Approved symbolCTSO
Namecathepsin O
Location4q32.1
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000256043
Ensembl biotypeprotein_coding
OMIM600550
Entrez1519

Gene structure

Transcript identifiers

Ensembl transcripts: 13 — 9 protein_coding, 4 nonsense_mediated_decay

ENST00000433477, ENST00000676791, ENST00000678374, ENST00000679136, ENST00000679625, ENST00000679942, ENST00000679996, ENST00000680553, ENST00000680741, ENST00000681379, ENST00000859966, ENST00000859967, ENST00000952268

RefSeq mRNA: 1 — MANE Select: NM_001334 NM_001334

CCDS: CCDS3794

Canonical transcript exons

ENST00000433477 — 8 exons

ExonStartEnd
ENSE00001612008155924118155926070
ENSE00001751842155929542155929705
ENSE00001753351155928336155928428
ENSE00002080357155953713155953866
ENSE00002218314155937362155937483
ENSE00002435290155943156155943264
ENSE00002444303155942317155942456
ENSE00002524268155939371155939538

Expression profiles

Bgee: expression breadth ubiquitous, 134 present calls, max score 96.11.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 10.4204 / max 109.1830, expressed in 1543 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
5454210.04611527
545430.3743211

Top tissues by expression

134 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
calcaneal tendonUBERON:000370196.11gold quality
gall bladderUBERON:000211095.18gold quality
endometriumUBERON:000129594.19gold quality
right coronary arteryUBERON:000162593.20gold quality
smooth muscle tissueUBERON:000113593.07gold quality
placentaUBERON:000198792.81gold quality
mucosa of stomachUBERON:000119992.68gold quality
liverUBERON:000210792.62gold quality
duodenumUBERON:000211492.54gold quality
monocyteCL:000057692.35gold quality
right lobe of liverUBERON:000111492.31gold quality
leukocyteCL:000073892.23gold quality
fallopian tubeUBERON:000388992.00gold quality
descending thoracic aortaUBERON:000234591.89gold quality
left coronary arteryUBERON:000162691.56gold quality
right lungUBERON:000216791.45gold quality
colonic epitheliumUBERON:000039791.08gold quality
thoracic aortaUBERON:000151590.91gold quality
rectumUBERON:000105290.84gold quality
ascending aortaUBERON:000149690.83gold quality
endocervixUBERON:000045890.33gold quality
lungUBERON:000204889.91gold quality
body of uterusUBERON:000985389.91gold quality
metanephros cortexUBERON:001053389.91gold quality
left adrenal glandUBERON:000123489.76gold quality
left adrenal gland cortexUBERON:003582589.68gold quality
lymph nodeUBERON:000002989.46gold quality
tibial arteryUBERON:000761089.42gold quality
popliteal arteryUBERON:000225089.39gold quality
muscle layer of sigmoid colonUBERON:003580589.31gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

73 targeting CTSO, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6873-3P100.0071.422626
HSA-MIR-4510100.0066.602050
HSA-MIR-4747-5P100.0067.902681
HSA-MIR-5196-5P100.0067.982761
HSA-MIR-6127100.0066.762188
HSA-MIR-6129100.0066.462080
HSA-MIR-6130100.0066.692012
HSA-MIR-6133100.0066.482064
HSA-MIR-5692A100.0074.406850
HSA-MIR-513A-5P100.0069.772465
HSA-MIR-3667-3P99.9967.171636
HSA-MIR-569699.9872.364487
HSA-MIR-1213699.9872.815713
HSA-MIR-495-3P99.9672.814197
HSA-MIR-568899.9673.234504
HSA-MIR-590-3P99.9674.346478
HSA-MIR-218-5P99.9372.222103
HSA-MIR-498-3P99.9171.271114
HSA-MIR-7-1-3P99.9171.534384
HSA-MIR-7-2-3P99.9171.404394
HSA-MIR-568099.9169.833421
HSA-MIR-10527-5P99.9172.283754
HSA-MIR-153-5P99.8973.866317
HSA-MIR-579-3P99.8671.663628
HSA-MIR-664B-3P99.8471.653590
HSA-MIR-313399.8170.923506
HSA-MIR-494-3P99.7071.452795
HSA-MIR-130399.6569.771662
HSA-MIR-105-5P99.5469.242060
HSA-MIR-7853-5P99.5469.302055

Literature-anchored findings (GeneRIF, showing 3)

  • In this study, various molecular dynamics (MD) simulations of pro- and mature human cathepsins L and O were performed. (PMID:23009386)
  • a single-nucleotide polymorphism near the CTSO gene is a poor prognostic factor in breast cancer although further research might help to reveal the factors linking this genotype and prognosis. (PMID:26482374)
  • We report that CTSO reduces the protein levels of BRCA1 and ZNF423 through cysteine proteinase-mediated degradation. We also have identified a series of transcription factors of BRCA1 that are regulated by CTSO at the protein level. (PMID:28968398)

Cross-species orthologs

16 orthologs

OrganismSymbolGene ID
danio_rerioctsoENSDARG00000052855
mus_musculusCtsoENSMUSG00000028015
rattus_norvegicusCtsoENSRNOG00000066128
caenorhabditis_elegansWBGENE00000781
caenorhabditis_elegansWBGENE00000782
caenorhabditis_elegansWBGENE00000784
caenorhabditis_elegansWBGENE00000785
caenorhabditis_elegansWBGENE00013072
caenorhabditis_elegansWBGENE00013076
caenorhabditis_elegansWBGENE00013764
caenorhabditis_elegansWBGENE00016300
caenorhabditis_elegansWBGENE00016306
caenorhabditis_elegansWBGENE00019314
caenorhabditis_elegansWBGENE00019986
caenorhabditis_elegansWBGENE00022189
caenorhabditis_elegansWBGENE00044760

Paralogs (12): CTSZ (ENSG00000101160), CTSH (ENSG00000103811), CTSC (ENSG00000109861), CTSL (ENSG00000135047), CTSV (ENSG00000136943), TINAG (ENSG00000137251), TINAGL1 (ENSG00000142910), CTSK (ENSG00000143387), CTSS (ENSG00000163131), CTSB (ENSG00000164733), CTSW (ENSG00000172543), CTSF (ENSG00000174080)

Protein

Protein identifiers

Cathepsin OP43234 (reviewed: P43234)

All UniProt accessions (10): A0A7I2V4E3, A0A7I2V4S1, A0A7P0T898, A0A7P0T8T8, A0A7P0T996, A0A7P0Z4C4, A0A7P0Z4D8, A0A7P0Z4I5, A0A7P0Z4L5, P43234

UniProt curated annotations — full annotation on UniProt →

Function. Proteolytic enzyme possibly involved in normal cellular protein degradation and turnover.

Subcellular location. Lysosome.

Tissue specificity. Expressed in all tissues examined. High levels seen in the ovary, kidney and placenta while low levels seen in thymus and skeletal muscle.

Similarity. Belongs to the peptidase C1 family.

RefSeq proteins (1): NP_001325* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000169Pept_cys_ASActive_site
IPR000668Peptidase_C1A_CDomain
IPR013128Peptidase_C1AFamily
IPR025660Pept_his_ASActive_site
IPR038765Papain-like_cys_pep_sfHomologous_superfamily
IPR039417Peptidase_C1A_papain-likeDomain

Pfam: PF00112

UniProt features (11 total): disulfide bond 3, active site 3, glycosylation site 2, signal peptide 1, propeptide 1, chain 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P43234-F189.690.80

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (3): 132; 269; 289

Disulfide bonds (3): 262–310, 129–170, 163–204

Glycosylation sites (2): 62, 105

Function

Pathways and Gene Ontology

Reactome pathways

3 pathways

IDPathway
R-HSA-2132295MHC class II antigen presentation
R-HSA-1280218Adaptive Immune System
R-HSA-168256Immune System

MSigDB gene sets: 158 (showing top): GSE45365_CD8A_DC_VS_CD11B_DC_IFNAR_KO_DN, MODULE_172, chr4q32, REACTOME_ADAPTIVE_IMMUNE_SYSTEM, GOBP_MACROMOLECULE_CATABOLIC_PROCESS, KEGG_LYSOSOME, CHANDRAN_METASTASIS_DN, CAIRO_HEPATOBLASTOMA_CLASSES_DN, LI_WILMS_TUMOR_VS_FETAL_KIDNEY_1_UP, HOSHIDA_LIVER_CANCER_SUBCLASS_S3, GOMF_CYSTEINE_TYPE_PEPTIDASE_ACTIVITY, SCHAEFFER_PROSTATE_DEVELOPMENT_6HR_UP, SCHAEFFER_PROSTATE_DEVELOPMENT_48HR_UP, SCHAEFFER_PROSTATE_DEVELOPMENT_AND_CANCER_BOX4_DN, BROWNE_HCMV_INFECTION_24HR_DN

GO Biological Process (2): proteolysis (GO:0006508), obsolete proteolysis involved in protein catabolic process (GO:0051603)

GO Molecular Function (5): cysteine-type endopeptidase activity (GO:0004197), protein binding (GO:0005515), peptidase activity (GO:0008233), cysteine-type peptidase activity (GO:0008234), hydrolase activity (GO:0016787)

GO Cellular Component (2): obsolete extracellular space (GO:0005615), lysosome (GO:0005764)

Reactome top-level categories

Rollup of top-2 pathways:

CategoryPathways
Adaptive Immune System1
Immune System1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
protein metabolic process1
endopeptidase activity1
cysteine-type peptidase activity1
binding1
hydrolase activity1
catalytic activity, acting on a protein1
peptidase activity1
catalytic activity1
lytic vacuole1

Protein interactions and networks

STRING

994 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CTSOZNF423Q2M1K9541
CTSORBM46Q8TBY0460
CTSOKLHL2O95198431
CTSOLGMNQ99538412
CTSOTDO2P48775411
CTSOGUCY1B1Q02153407
CTSOESPL1Q14674390
CTSOCDCP1Q9H5V8388
CTSOMAEAQ7L5Y9373
CTSOPSMA1P25786365
CTSOCTSDP07339338
CTSOGNB1LQ9BYB4334
CTSODDX59Q5T1V6315
CTSOGUCY1A1Q02108314
CTSOBLMHQ13867313
CTSOBCS1LQ9Y276313

IntAct

27 interactions, top by confidence:

ABTypeScore
CTSOPRKCApsi-mi:“MI:0915”(physical association)0.560
CTSOYWHAGpsi-mi:“MI:0915”(physical association)0.560
CTSOSETDB1psi-mi:“MI:0915”(physical association)0.560
CTSOKAT5psi-mi:“MI:0915”(physical association)0.560
CTSOLMO3psi-mi:“MI:0915”(physical association)0.560
KNG1CTSVpsi-mi:“MI:0914”(association)0.530
MTFR1CTSOpsi-mi:“MI:0915”(physical association)0.400
KNG1CTSVpsi-mi:“MI:0914”(association)0.350
CTSOLYPLA1psi-mi:“MI:0914”(association)0.350
CSTACTSVpsi-mi:“MI:0914”(association)0.350
CST3CTSVpsi-mi:“MI:0914”(association)0.350
CSTBCTSHpsi-mi:“MI:0914”(association)0.350
CTSOhldEpsi-mi:“MI:0915”(physical association)0.000

BioGRID (20): LYPLA1 (Affinity Capture-MS), GRN (Affinity Capture-MS), AMZ2 (Affinity Capture-MS), CTSO (Affinity Capture-MS), CTSO (Affinity Capture-MS), LYPLA1 (Affinity Capture-MS), GRN (Affinity Capture-MS), CTSO (Synthetic Lethality), CTSO (Affinity Capture-MS), LYPLA1 (Affinity Capture-MS), CTSO (Affinity Capture-MS), GRN (Affinity Capture-MS), PGLYRP3 (Affinity Capture-MS), CTSO (Affinity Capture-MS), CTSO (Affinity Capture-MS)

ESM2 similar proteins: A1E295, O65493, O70370, P00787, P05689, P06797, P07154, P07688, P07711, P07858, P10605, P10619, P16675, P25326, P25773, P25774, P25775, P25804, P35591, P43233, P43234, P43295, P43296, P83205, Q02765, Q10716, Q13219, Q16819, Q26534, Q3MI05, Q3SZI1, Q3ZKN1, Q4R5M2, Q5R6D1, Q7TN16, Q8BM88, Q8HY81, Q8R4K8, Q8VYS0, Q920A5

Diamond homologs: A0A509APV9, A5YVK8, A8DS38, B2LSD2, D3ZZ07, F4JNL3, O10364, O23791, O35186, O45734, O60911, O65039, O70370, O91466, O97397, O97578, P04989, P05994, P06797, P07154, P09648, P12412, P14080, P14518, P20721, P25249, P25250, P25251, P25326, P25773, P25775, P25776, P25777, P25781, P25783, P25784, P25803, P25805, P35591, P36184

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

56 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance44
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

0 predictions. Top by Δscore:

AlphaMissense

2103 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
4:155928394:C:AW291C0.997
4:155928394:C:GW291C0.997
4:155928396:A:GW291R0.996
4:155928396:A:TW291R0.996
4:155929632:A:GW250R0.996
4:155929632:A:TW250R0.996
4:155928338:C:GC310S0.995
4:155928339:A:TC310S0.995
4:155928382:C:AW295C0.995
4:155928382:C:GW295C0.995
4:155939455:A:CS156R0.995
4:155939455:A:TS156R0.995
4:155939457:T:GS156R0.995
4:155939524:C:AW133C0.995
4:155939524:C:GW133C0.995
4:155943160:A:CF80L0.994
4:155943160:A:TF80L0.994
4:155943162:A:GF80L0.994
4:155928338:C:TC310Y0.993
4:155928400:A:CN289K0.993
4:155928400:A:TN289K0.993
4:155928384:A:GW295R0.992
4:155928384:A:TW295R0.992
4:155929594:G:CC262W0.992
4:155939414:C:GC170S0.992
4:155939415:A:TC170S0.992
4:155939515:G:CS136R0.992
4:155939515:G:TS136R0.992
4:155939517:T:GS136R0.992
4:155939526:A:GW133R0.992

dbSNP variants (sampled 300 via entrez): RS10000523 (4:155939953 C>A,G,T), RS1000091125 (4:155934438 G>A), RS1000174042 (4:155949685 G>C), RS10002427 (4:155951679 C>A,T), RS1000247612 (4:155949430 G>A,T), RS1000247920 (4:155955124 A>G), RS1000462546 (4:155950370 C>T), RS10005054 (4:155952026 C>G,T), RS10005559 (4:155952505 C>A,G,T), RS1000576932 (4:155950092 A>G), RS10006015 (4:155947016 C>T), RS10006268 (4:155947099 G>A,C,T), RS1000808558 (4:155954907 T>C), RS1000854154 (4:155945657 C>T), RS1000878040 (4:155929022 C>T)

Disease associations

OMIM: gene MIM:600550 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

46 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Aciddecreases expression, affects expression, decreases methylation, affects cotreatment7
Estradioldecreases reaction, increases expression, decreases expression2
GSK-J4decreases expression1
afuresertibincreases expression1
sotorasibaffects cotreatment, increases expression1
methylmercuric chloridedecreases expression1
triphenyl phosphateaffects expression1
bisphenol Aincreases expression1
deoxynivalenoldecreases expression1
2-methyl-4-isothiazolin-3-onedecreases expression1
arsenitedecreases reaction, increases expression1
methylparabendecreases expression1
sulforaphaneincreases expression1
sodium arseniteincreases expression1
butyraldehydedecreases expression1
zinc chromateincreases abundance, increases expression1
1-aminomethylphosphonic acidincreases expression1
phenethyl isothiocyanateincreases expression1
chromium hexavalent ionincreases abundance, increases expression1
perfluorooctane sulfonic acidincreases expression1
scriptaidaffects expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression, increases expression1
belinostatdecreases expression1
ICG 001increases expression1
abrinedecreases expression1
dorsomorphindecreases expression, increases expression, affects cotreatment1
NSC 689534increases expression1
trametinibaffects cotreatment, increases expression1
MT19c compoundincreases expression1
NVP-BKM120affects cotreatment, increases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.