CTSH

gene
On this page

Also known as ACC-4ACC-5ACC4ACC5

Summary

CTSH (cathepsin H, HGNC:2535) is a protein-coding gene on chromosome 15q25.1, encoding Pro-cathepsin H (P09668). Important for the overall degradation of proteins in lysosomes.

The protein encoded by this gene is a lysosomal cysteine proteinase important in the overall degradation of lysosomal proteins. It is composed of a dimer of disulfide-linked heavy and light chains, both produced from a single protein precursor. The encoded protein, which belongs to the peptidase C1 protein family, can act both as an aminopeptidase and as an endopeptidase. Increased expression of this gene has been correlated with malignant progression of prostate tumors. Alternate splicing of this gene results in multiple transcript variants encoding different isoforms.

Source: NCBI Gene 1512 — RefSeq curated summary.

At a glance

  • GWAS associations: 15
  • Clinical variants (ClinVar): 73 total
  • Phenotypes (HPO): 30
  • Druggable target: yes — 1 molecules with ChEMBL bioactivity
  • MANE Select transcript: NM_004390

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:2535
Approved symbolCTSH
Namecathepsin H
Location15q25.1
Locus typegene with protein product
StatusApproved
AliasesACC-4, ACC-5, ACC4, ACC5
Ensembl geneENSG00000103811
Ensembl biotypeprotein_coding
OMIM116820
Entrez1512

Gene structure

Transcript identifiers

Ensembl transcripts: 75 — 28 retained_intron, 26 nonsense_mediated_decay, 18 protein_coding, 2 protein_coding_CDS_not_defined, 1 non_stop_decay

ENST00000220166, ENST00000525807, ENST00000527138, ENST00000527715, ENST00000528191, ENST00000528436, ENST00000528741, ENST00000529263, ENST00000529612, ENST00000529861, ENST00000530010, ENST00000530929, ENST00000533777, ENST00000534038, ENST00000534237, ENST00000534268, ENST00000534533, ENST00000615999, ENST00000649928, ENST00000676486, ENST00000676510, ENST00000676588, ENST00000676596, ENST00000676639, ENST00000676668, ENST00000676671, ENST00000676738, ENST00000676763, ENST00000676808, ENST00000676850, ENST00000676865, ENST00000676880, ENST00000677011, ENST00000677049, ENST00000677102, ENST00000677207, ENST00000677238, ENST00000677254, ENST00000677316, ENST00000677320, ENST00000677367, ENST00000677448, ENST00000677534, ENST00000677789, ENST00000677810, ENST00000677874, ENST00000677921, ENST00000677936, ENST00000678031, ENST00000678033, ENST00000678114, ENST00000678281, ENST00000678283, ENST00000678397, ENST00000678415, ENST00000678487, ENST00000678644, ENST00000678727, ENST00000678799, ENST00000678817, ENST00000678841, ENST00000678886, ENST00000678940, ENST00000679017, ENST00000679047, ENST00000679125, ENST00000679172, ENST00000679211, ENST00000679334, ENST00000906226, ENST00000906227, ENST00000906228, ENST00000906229, ENST00000969896, ENST00000969897

RefSeq mRNA: 3 — MANE Select: NM_004390 NM_001319137, NM_001411095, NM_004390

CCDS: CCDS10308, CCDS92047

Canonical transcript exons

ENST00000220166 — 12 exons

ExonStartEnd
ENSE000018666627892105878922205
ENSE000034715937892299378923118
ENSE000034924907892533478925440
ENSE000035376647892771378927781
ENSE000035674217893497878935082
ENSE000036386337893145178931506
ENSE000036492917893914078939171
ENSE000036738627893568078935750
ENSE000036830257893731878937423
ENSE000036902257893237278932458
ENSE000037879337892941278929493
ENSE000039041447894489178945046

Expression profiles

Bgee: expression breadth ubiquitous, 290 present calls, max score 99.83.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 81.4082 / max 1418.7102, expressed in 1601 samples.

FANTOM5 promoters (5 alternative TSS)

Promoter IDTPM avgSamples expressed
15113877.90991581
1511392.24651012
1511360.7303189
1511370.3743208
1511350.147283

Top tissues by expression

296 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
cranial nerve IIUBERON:000094199.83gold quality
nephron tubuleUBERON:000123199.45gold quality
lower lobe of lungUBERON:000894999.32gold quality
bronchial epithelial cellCL:000232899.19gold quality
adult mammalian kidneyUBERON:000008299.15gold quality
renal medullaUBERON:000036299.05gold quality
visceral pleuraUBERON:000240199.05gold quality
epithelium of bronchusUBERON:000203199.03gold quality
monocyteCL:000057698.95gold quality
bronchusUBERON:000218598.94gold quality
mononuclear cellCL:000084298.88gold quality
lymph nodeUBERON:000002998.84gold quality
leukocyteCL:000073898.83gold quality
upper lobe of lungUBERON:000894898.77gold quality
adult organismUBERON:000702398.72gold quality
upper lobe of left lungUBERON:000895298.72gold quality
gall bladderUBERON:000211098.71gold quality
parotid glandUBERON:000183198.51gold quality
lungUBERON:000204898.50gold quality
kidney epitheliumUBERON:000481998.46gold quality
olfactory segment of nasal mucosaUBERON:000538698.45gold quality
nasal cavity epitheliumUBERON:000538498.41gold quality
upper leg skinUBERON:000426298.39gold quality
vermiform appendixUBERON:000115498.29gold quality
endocervixUBERON:000045898.21gold quality
right lungUBERON:000216798.07gold quality
granulocyteCL:000009497.98gold quality
metanephros cortexUBERON:001053397.94gold quality
kidneyUBERON:000211397.89gold quality
caecumUBERON:000115397.69gold quality

Single-cell (SCXA)

Detected in 27 experiment(s), a significant marker in 24.

ExperimentMarker?Max mean expression
E-HCAD-38yes889.35
E-MTAB-9801yes544.22
E-HCAD-1yes109.27
E-HCAD-4yes72.47
E-MTAB-8142yes41.16
E-MTAB-10553yes32.48
E-MTAB-5061yes28.38
E-GEOD-134144yes27.56
E-CURD-112yes27.45
E-HCAD-10yes26.78
E-CURD-122yes26.73
E-MTAB-9221yes26.26
E-GEOD-130148yes25.62
E-MTAB-6701yes24.17
E-GEOD-81547yes22.22

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

2 targets.

TargetRegulation
MMP2Activation
MMP3Activation

Upstream regulators (CollecTRI, top): GATA2

miRNA regulators (miRDB)

17 targeting CTSH, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-124-3P99.8973.743043
HSA-MIR-506-3P99.8973.553057
HSA-MIR-4685-5P99.2565.991563
HSA-MIR-6837-5P99.2565.471632
HSA-MIR-129-1-3P98.8668.41779
HSA-MIR-129-2-3P98.8668.41779
HSA-MIR-4436B-3P98.2565.261494
HSA-MIR-6735-5P98.2465.361488
HSA-MIR-7843-5P98.1265.261421
HSA-MIR-4638-3P97.9065.75905
HSA-MIR-4632-5P97.8265.381470
HSA-MIR-6879-5P97.7765.521521
HSA-MIR-3616-3P96.9665.45983
HSA-MIR-449196.5366.20935
HSA-MIR-465796.5366.57895
HSA-MIR-808196.4267.75738
HSA-MIR-627-5P95.5166.80509

Literature-anchored findings (GeneRIF, showing 25)

  • involvement in processing of hydrophobic surfactant-associated protein C in type II pneumocytes (PMID:12034564)
  • production of cathepsins B and H by tumor cells along with concomitant decrease of their inhibitor, cystatin C, in the CSF might contribute in the process of metastasis and spread of the cancer cells in the leptomeningeal tissues (PMID:12589965)
  • Cathepsin H could contribute to the transformation of LDL to an atherogenic moiety; the process might involve a self-sustaining amplifying circle. (PMID:12615673)
  • kinetic data on substrates hydrolysis and enzyme inhibition show the role of the mini-chain as a framework for transition state stabilization of free alpha-amino groups of substrates and as a structural barrier for endopeptidase-like substrate cleavage. (PMID:14515996)
  • AKR1B1 and CTSH may be good markers for prediction of sensitivity to certain drugs (PMID:14662023)
  • cathepsin H is involved in maturation of the biologically active surfactant associated protein B (PMID:14766755)
  • cathepsin H has a role in progression of colorectal cancer that is distinct from that of cathepsins B and L (PMID:15646835)
  • A general defect in napsin A or cathepsin H expression or activity was not the specific cause for abnormal surfactant accumulation in juvenile pulmonary alveolar proteinosis (PMID:18216060)
  • cathepsin B, cathepsin H, cathepsin X and cystatin C may have roles in inflammatory breast cancer (PMID:18949742)
  • In contrast to cathepsin S, cathepsin H values did not correlate with markers of inflammation, indicating a specific role for cathepsin H in the pleural host response. (PMID:19404922)
  • These data indicate that cathepsins B, L and S may act as cell-death mediators in in monocytic cells infected with ICP4 and Us3 deletion mutant herpes simplex virus type 1. (PMID:20881085)
  • CTSH overexpression in a subset hepatoma may be thyroid hormone receptor dependent and suggests that this overexpression has an important role in hepatoma progression. (PMID:21217776)
  • The processing of procathepsin H is an autocatalytic, multistep process proceeding from an inactive 41kDa pro-form, through a 30kDa intermediate form, to the 28kDa mature form. (PMID:22704610)
  • Quantification of immunohistochemistry showed that there is no difference in the global expression of CTSD, CTSH and CTSK between asthmatics and non-asthmatics. (PMID:23483898)
  • The data provide strong evidence that CTSH is an important regulator of beta-cell function during progression of T1D (PMID:24982147)
  • mutation in LRPAP1 is associated with high myopia. Further studies are expected to evaluate the pathogenicity of the variants in CTSH, LEPREL1, ZNF644, SLC39A5, and SCO2. (PMID:25525168)
  • In conclusion, CTSH/rs3825932 and ERBB3/rs2292239 SNPs were associated with reduced risk of progression to proliferative diabetic retinopathy and two-step progression of diabetic retinopathy on the ETDRS scale accordingly. (PMID:26245339)
  • Especially, rs3825932 in CTSH has integrative functional evidence supporting the association with type 1 diabetes mellitus. (PMID:26429317)
  • p41 fragment is also shown to reduce the secretion of interleukin-12 (IL-12/p70) during the subsequent maturation of treated dendritic cells. (PMID:26960148)
  • the up-regulation of MMP genes is mediated through degradation of class IIa histone deacetylases (HDACs) by certain cysteine cathepsins (Cts). (PMID:28157489)
  • Study presents the crystal structures of human procathepsin H at 2.00 A and 1.66 A resolution. These structures allow us to explore in detail the molecular basis for the inhibition of the mature domain by the prodomain. Comparison with cathepsin H structure reveals how mini-chain reorients upon activation. Results demonstrate that procathepsin H is not auto-activated but can be trans-activated by cathepsin L. (PMID:30044821)
  • The Rac2 GTPase contributes to cathepsin H-mediated protection against cytokine-induced apoptosis in insulin-secreting cells. (PMID:32814070)
  • Genetic and environmental factors regulate the type 1 diabetes gene CTSH via differential DNA methylation. (PMID:33992646)
  • Functional genomics identify causal variant underlying the protective CTSH locus for Alzheimer’s disease. (PMID:36739351)
  • Cathepsin H: Molecular characteristics and clues to function and mechanism. (PMID:37148981)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_rerioctshENSDARG00000041108
mus_musculusCtshENSMUSG00000032359
rattus_norvegicusCtshENSRNOG00000014064
caenorhabditis_elegansWBGENE00008861

Paralogs (12): CTSZ (ENSG00000101160), CTSC (ENSG00000109861), CTSL (ENSG00000135047), CTSV (ENSG00000136943), TINAG (ENSG00000137251), TINAGL1 (ENSG00000142910), CTSK (ENSG00000143387), CTSS (ENSG00000163131), CTSB (ENSG00000164733), CTSW (ENSG00000172543), CTSF (ENSG00000174080), CTSO (ENSG00000256043)

Protein

Protein identifiers

Pro-cathepsin HP09668 (reviewed: P09668)

All UniProt accessions (27): A0A087X0D5, A0A0B4J217, P09668, A0A3B3IRM7, A0A7I2V2E4, A0A7I2V2L7, A0A7I2V2T8, A0A7I2V314, A0A7I2V3E8, A0A7I2V3T9, A0A7I2V439, A0A7I2V4H2, A0A7I2V4N9, A0A7I2V4W0, A0A7I2V4X5, A0A7I2V532, A0A7I2V563, A0A7I2V673, A0A7I2YQ92, A0A7I2YQE6, A0A7I2YQH4, A0A7I2YQL1, A0A7I2YQL6, A0A7I2YQX6, E9PKT6, E9PN60, E9PN84

UniProt curated annotations — full annotation on UniProt →

Function. Important for the overall degradation of proteins in lysosomes.

Subunit / interactions. Composed of a mini chain and a large chain. The large chain may be split into heavy and light chain. All chains are held together by disulfide bonds.

Subcellular location. Lysosome.

Similarity. Belongs to the peptidase C1 family.

RefSeq proteins (3): NP_001306066, NP_001398024, NP_004381* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000169Pept_cys_ASActive_site
IPR000668Peptidase_C1A_CDomain
IPR013128Peptidase_C1AFamily
IPR013201Prot_inhib_I29Domain
IPR025660Pept_his_ASActive_site
IPR025661Pept_asp_ASActive_site
IPR038765Papain-like_cys_pep_sfHomologous_superfamily
IPR039417Peptidase_C1A_papain-likeDomain

Pfam: PF00112, PF08246

Enzyme classification (BRENDA):

  • EC 3.4.22.16 — cathepsin H (BRENDA: 16 organisms, 94 substrates, 306 inhibitors, 41 Km, 22 kcat entries)

Substrate kinetics (BRENDA)

32 substrates with measured Km, best-characterized 15. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
ARG-4-METHYLCOUMARYL-7-AMIDE0.097–723
BENZOYL-DL-ARG-2-NAPHTHYLAMIDE0.039–4.323
ARG-2-NAPHTHYLAMIDE0.097–0.1692
BENZOYL-L-PHE-L-VAL-L-ARG-4-METHYLCOUMARYL-7-AMI0.025–0.0382
H-L-ARG-NH-4-METHYLCOUMARYL-7-AMIDE0.05–3.62
L-LEU-BETA-NAPHTHYLAMIDE0.203–0.222
ALA-4-METHYLCOUMARYL-7-AMIDE2101
ALPHA-N-BENZOYL-DL-ARG-BETA-NAPHTHYLAMIDE0.3671
BENZOYL-ARG-2-NAPHTHYLAMIDE0.6371
BENZOYL-ARG-4-METHYLCOUMARYL-7-AMIDE0.3211
BENZOYL-L-ARG-2-NAPHTHYLAMIDE0.461
BENZOYL-L-PHE-L-ARG-7-AMIDO-4-METHYLCOUMARIN0.0821
BENZYLOXYCARBONYL-PRO-ALA-ALA-ALA-PRO7.41
BENZYLOXYCARBONYL-PRO-ALA-ALA-ALA-PRO-NH23.51
CITRULLINE-4-METHYLCOUMARYL-7-AMIDE0.0761

UniProt features (50 total): helix 14, strand 12, sequence variant 5, disulfide bond 4, chain 3, active site 3, propeptide 2, glycosylation site 2, turn 2, signal peptide 1, peptide 1, sequence conflict 1

Structure

Experimental structures (PDB)

2 structures.

PDBMethodResolution (Å)
6CZSX-RAY DIFFRACTION1.66
6CZKX-RAY DIFFRACTION2

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P09668-F194.300.91

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (3): 141; 281; 301

Disulfide bonds (4): 102–327, 138–181, 172–214, 272–322

Glycosylation sites (2): 101, 230

Function

Pathways and Gene Ontology

Reactome pathways

7 pathways

IDPathway
R-HSA-2132295MHC class II antigen presentation
R-HSA-5683826Surfactant metabolism
R-HSA-6798695Neutrophil degranulation
R-HSA-1280218Adaptive Immune System
R-HSA-168249Innate Immune System
R-HSA-168256Immune System
R-HSA-392499Metabolism of proteins

MSigDB gene sets: 501 (showing top): GSE45365_CD8A_DC_VS_CD11B_DC_IFNAR_KO_MCMV_INFECTION_UP, GOBP_MORPHOGENESIS_OF_AN_EPITHELIUM, GOBP_EPITHELIUM_DEVELOPMENT, KOBAYASHI_EGFR_SIGNALING_24HR_UP, REACTOME_INNATE_IMMUNE_SYSTEM, MCLACHLAN_DENTAL_CARIES_UP, GOBP_METANEPHROS_DEVELOPMENT, REACTOME_ADAPTIVE_IMMUNE_SYSTEM, BASSO_B_LYMPHOCYTE_NETWORK, GOCC_SECRETORY_GRANULE, GOBP_EPITHELIAL_TUBE_BRANCHING_INVOLVED_IN_LUNG_MORPHOGENESIS, GOBP_REGULATION_OF_EPITHELIAL_CELL_MIGRATION, GOBP_MACROMOLECULE_CATABOLIC_PROCESS, GOBP_LUNG_MORPHOGENESIS, IVANOVA_HEMATOPOIESIS_MATURE_CELL

GO Biological Process (26): metanephros development (GO:0001656), T cell mediated cytotoxicity (GO:0001913), adaptive immune response (GO:0002250), immune response-regulating signaling pathway (GO:0002764), proteolysis (GO:0006508), immune response (GO:0006955), spermatogenesis (GO:0007283), positive regulation of gene expression (GO:0010628), positive regulation of epithelial cell migration (GO:0010634), neuropeptide catabolic process (GO:0010813), bradykinin catabolic process (GO:0010815), antigen processing and presentation (GO:0019882), positive regulation of cell migration (GO:0030335), zymogen activation (GO:0031638), protein destabilization (GO:0031648), response to retinoic acid (GO:0032526), membrane protein proteolysis (GO:0033619), surfactant homeostasis (GO:0043129), obsolete proteolysis involved in protein catabolic process (GO:0051603), dichotomous subdivision of terminal units involved in lung branching (GO:0060448), ERK1 and ERK2 cascade (GO:0070371), cellular response to thyroid hormone stimulus (GO:0097067), lysosomal protein catabolic process (GO:1905146), response to odorant (GO:1990834), positive regulation of apoptotic signaling pathway (GO:2001235), apoptotic process (GO:0006915)

GO Molecular Function (15): endopeptidase activity (GO:0004175), aminopeptidase activity (GO:0004177), cysteine-type endopeptidase activity (GO:0004197), serine-type endopeptidase activity (GO:0004252), peptidase activity (GO:0008233), cysteine-type peptidase activity (GO:0008234), cysteine-type endopeptidase activator activity involved in apoptotic process (GO:0008656), HLA-A specific activating MHC class I receptor activity (GO:0030108), kininogen binding (GO:0030984), identical protein binding (GO:0042802), protein-containing complex binding (GO:0044877), thyroid hormone binding (GO:0070324), protein binding (GO:0005515), peptidase activator activity involved in apoptotic process (GO:0016505), hydrolase activity (GO:0016787)

GO Cellular Component (15): outer dense fiber (GO:0001520), acrosomal vesicle (GO:0001669), extracellular region (GO:0005576), obsolete extracellular space (GO:0005615), lysosome (GO:0005764), cytosol (GO:0005829), axoneme (GO:0005930), extracellular matrix (GO:0031012), secretory granule lumen (GO:0034774), cytoplasmic ribonucleoprotein granule (GO:0036464), extracellular exosome (GO:0070062), alveolar lamellar body (GO:0097208), multivesicular body lumen (GO:0097486), tertiary granule lumen (GO:1904724), ficolin-1-rich granule lumen (GO:1904813)

Reactome top-level categories

Rollup of top-4 pathways:

CategoryPathways
Immune System2
Adaptive Immune System1
Metabolism of proteins1
Innate Immune System1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure3
immune system process2
peptidase activity2
endopeptidase activity2
protein binding2
binding2
secretory granule2
cytoplasm2
intracellular organelle lumen2
kidney development1
leukocyte mediated cytotoxicity1
T cell mediated immunity1
immune response1
signal transduction1
regulation of immune response1
protein metabolic process1
response to stimulus1
developmental process involved in reproduction1
male gamete generation1
gene expression1
regulation of gene expression1
positive regulation of macromolecule biosynthetic process1
epithelial cell migration1
regulation of epithelial cell migration1
positive regulation of cell migration1
peptide catabolic process1
catabolic process1
cell migration1
regulation of cell migration1
positive regulation of cell motility1
protein processing1
regulation of protein stability1
response to lipid1
response to oxygen-containing compound1
proteolysis1
multicellular organismal-level chemical homeostasis1
epithelial tube branching involved in lung morphogenesis1
dichotomous subdivision of an epithelial terminal unit1
exopeptidase activity1
cysteine-type peptidase activity1

Protein interactions and networks

STRING

1921 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CTSHCSTAP01040884
CTSHCTSDP07339817
CTSHLGMNQ99538778
CTSHC1QTNF6Q9BXI9661
CTSHCLEC16AQ2KHT3621
CTSHBACH2Q9BYV9608
CTSHQ5Y7H0Q5Y7H0584
CTSHSH2B3Q9UQQ2564
CTSHPTPN2P17706561
CTSHHMSDA8MTL9560
CTSHTAGAPQ8N103542
CTSHSLC39A5Q6ZMH5532
CTSHCSTBP04080531
CTSHP3H2Q8IVL5526
CTSHSFTPBP07988522

IntAct

90 interactions, top by confidence:

ABTypeScore
POC5CETN3psi-mi:“MI:0914”(association)0.920
POC5CETN3psi-mi:“MI:0914”(association)0.770
HSPD1NUDT19psi-mi:“MI:0914”(association)0.710
CCNCMED19psi-mi:“MI:0914”(association)0.640
DYNC1H1CTSHpsi-mi:“MI:0915”(physical association)0.560
CTSHATN1psi-mi:“MI:0915”(physical association)0.560
CTSHKLK6psi-mi:“MI:0915”(physical association)0.560
BPNT1GTPBP10psi-mi:“MI:0914”(association)0.530
TBC1D22BA2ML1psi-mi:“MI:0914”(association)0.530
SLC25A41NUDT19psi-mi:“MI:0914”(association)0.530
DNAAF19KLK10psi-mi:“MI:0914”(association)0.530
CST1CTSVpsi-mi:“MI:0914”(association)0.530
MMRN1CTSVpsi-mi:“MI:0914”(association)0.530
AIREALOX12Bpsi-mi:“MI:0914”(association)0.530
CTSLCTSHpsi-mi:“MI:0914”(association)0.530
ZIC1CTSVpsi-mi:“MI:0914”(association)0.530
SLCO6A1PSMD11psi-mi:“MI:0914”(association)0.350

BioGRID (76): CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS), CTSH (Affinity Capture-MS)

ESM2 similar proteins: A0A068CNX1, A0A072UTP9, A0A0F7G352, A0E358, A8DS38, D3ZZ07, F4JNL3, O17473, O23791, O70370, O97397, P00786, P05167, P09668, P0DO76, P14080, P15242, P22895, P25775, P25778, P25804, P35591, P41715, P43295, P43296, P49935, P55097, P80884, Q01957, Q02765, Q10716, Q10717, Q40143, Q63088, Q6YD92, Q80UB0, Q8H166, Q8HY82, Q8RWQ9, Q8VYS0

Diamond homologs: A0A068CNX1, A0A072UTP9, A0A0F7G352, A0A1S4F2V5, A2XQE8, A5HII1, B2LSD2, F4JNL3, O35186, O45734, O46427, O60911, O65039, O65493, O70370, O97397, P00785, P00786, P04989, P05167, P06797, P07154, P07711, P09648, P09668, P0DO76, P12412, P13277, P15242, P25251, P25326, P25773, P25774, P25776, P25777, P25778, P25782, P25784, P25803, P25804

SIGNOR signaling

2 interactions.

AEffectBMechanism
CTSH“down-regulates quantity by destabilization”BGLAPcleavage

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 92 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Neutrophil degranulation124.7×2e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

73 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance41
Likely benign8
Benign4

Top pathogenic / likely-pathogenic (0)

SpliceAI

2052 predictions. Top by Δscore:

VariantEffectΔscore
15:78922987:GCTTA:Gdonor_loss1.0000
15:78922988:CTTA:Cdonor_loss1.0000
15:78922989:TTA:Tdonor_loss1.0000
15:78922990:TACCC:Tdonor_loss1.0000
15:78922991:AC:Adonor_gain1.0000
15:78922991:ACCC:Adonor_loss1.0000
15:78922992:C:Adonor_loss1.0000
15:78922992:CC:Cdonor_gain1.0000
15:78923114:AAGTA:Aacceptor_gain1.0000
15:78923115:AGTA:Aacceptor_gain1.0000
15:78923116:GTA:Gacceptor_gain1.0000
15:78923117:TA:Tacceptor_gain1.0000
15:78923119:C:CCacceptor_gain1.0000
15:78925330:TCACC:Tdonor_loss1.0000
15:78925331:CACCT:Cdonor_loss1.0000
15:78925332:A:ACdonor_gain1.0000
15:78925332:AC:Adonor_gain1.0000
15:78925333:C:CTdonor_gain1.0000
15:78925333:CC:Cdonor_gain1.0000
15:78925436:TCATA:Tacceptor_gain1.0000
15:78925437:CATA:Cacceptor_gain1.0000
15:78925437:CATAC:Cacceptor_gain1.0000
15:78925439:TA:Tacceptor_gain1.0000
15:78925440:AC:Aacceptor_loss1.0000
15:78925441:C:CCacceptor_gain1.0000
15:78925441:C:CGacceptor_loss1.0000
15:78925442:T:Cacceptor_loss1.0000
15:78925450:C:CTacceptor_gain1.0000
15:78925451:A:Tacceptor_gain1.0000
15:78925456:C:CTacceptor_gain1.0000

AlphaMissense

2220 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
15:78932432:G:CF144L0.999
15:78932432:G:TF144L0.999
15:78932434:A:GF144L0.999
15:78932438:C:AW142C0.999
15:78932438:C:GW142C0.999
15:78932440:A:GW142R0.999
15:78932440:A:TW142R0.999
15:78922173:C:GC322S0.998
15:78922174:A:TC322S0.998
15:78922994:C:AG311W0.998
15:78923004:C:AW307C0.998
15:78923004:C:GW307C0.998
15:78923022:G:CN301K0.998
15:78923022:G:TN301K0.998
15:78923025:T:AK300N0.998
15:78923025:T:GK300N0.998
15:78925390:G:CS250R0.998
15:78925390:G:TS250R0.998
15:78925392:T:GS250R0.998
15:78931457:C:GC181S0.998
15:78931458:A:TC181S0.998
15:78932430:G:AS145F0.998
15:78932442:C:TC141Y0.998
15:78922993:C:AG311V0.997
15:78922993:C:TG311E0.997
15:78923016:C:AW303C0.997
15:78923016:C:GW303C0.997
15:78923036:A:GW297R0.997
15:78923036:A:TW297R0.997
15:78923066:C:AG287W0.997

dbSNP variants (sampled 300 via entrez): RS1000054384 (15:78941519 G>A), RS1000105021 (15:78941191 C>G), RS1000268396 (15:78931797 A>G), RS1000344402 (15:78926999 A>C), RS1000409180 (15:78921540 C>T), RS1000521817 (15:78926149 G>C), RS1000566526 (15:78932642 A>G), RS1000676106 (15:78930763 G>A,C), RS1000706816 (15:78928110 A>G), RS1000761720 (15:78923140 G>A), RS1000779220 (15:78927155 G>A), RS1001132690 (15:78930102 G>C), RS1001241761 (15:78936791 G>A), RS1001245743 (15:78930283 T>C,G), RS1001542877 (15:78944440 C>G,T)

Disease associations

OMIM: gene MIM:116820 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

30 total (30 of 30 shown, HPO-id order):

HPOTerm
HP:0000017Nocturia
HP:0000709Psychosis
HP:0000711Restlessness
HP:0000716Depression
HP:0000738Hallucinations
HP:0000739Anxiety
HP:0000822Hypertension
HP:0000826Precocious puberty
HP:0000975Hyperhidrosis
HP:0001262Excessive daytime somnolence
HP:0001279Syncope
HP:0001350Slurred speech
HP:0001513Obesity
HP:0002019Constipation
HP:0002076Migraine
HP:0002307Drooling
HP:0002360Sleep disturbance
HP:0002494Abnormal rapid eye movement sleep
HP:0002519Hypnagogic hallucination
HP:0002524Cataplexy
HP:0006896Hypnopompic hallucination
HP:0007018Attention deficit hyperactivity disorder
HP:0007686Abnormal pupillary function
HP:0010534Transient global amnesia
HP:0010535Sleep apnea
HP:0012452Restless legs
HP:0025233Sleep paralysis
HP:0025235NREM parasomnia
HP:0030014Female sexual dysfunction
HP:0040307Male sexual dysfunction

GWAS associations

15 associations (top):

StudyTraitp-value
GCST000258_3Type 1 diabetes3.000000e-15
GCST000392_4Type 1 diabetes8.000000e-08
GCST001164_3Bipolar disorder9.000000e-06
GCST005522_2Narcolepsy1.000000e-09
GCST005531_8Multiple sclerosis1.000000e-08
GCST005536_33Type 1 diabetes9.000000e-14
GCST006585_1101Blood protein levels5.000000e-166
GCST006585_1123Blood protein levels4.000000e-212
GCST008870_57Keratinocyte cancer (MTAG)2.000000e-07
GCST008871_40Basal cell carcinoma3.000000e-09
GCST009597_304Multiple sclerosis9.000000e-17
GCST009873_8Autoimmune traits (pleiotropy)1.000000e-14
GCST009874_34Celiac disease2.000000e-06
GCST009875_11Type 1 diabetes1.000000e-11
GCST90011899_143Aspartate aminotransferase levels4.000000e-11

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0010176keratinocyte carcinoma
EFO:0004736aspartate aminotransferase measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (2): CHEMBL2111441 (SELECTIVITY GROUP), CHEMBL2225 (SINGLE PROTEIN)

Molecules with ChEMBL bioactivity

1 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 72 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL5095230ATUZAGINSTAT272

PharmGKB: 1 entry (VIP=true, CPIC=false)

GtoPdb / IUPHAR curated pharmacology

(IUPHAR/BPS Guide to Pharmacology — expert-curated)

Target class: enzyme — C1: Papain

Most potent curated ligand interactions (2 total), top 2:

LigandActionAffinityParameter
compound 1b [PMID: 16290936]Inhibition7.4pKi
compound 1e [PMID: 27285276]Inhibition6.36pIC50

ChEMBL bioactivities

37 potent at pChembl≥5 of 46 total, top 28 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
9.34Ki0.46nMCHEMBL371420
8.85Ki1.4nMCHEMBL190121
7.75IC5018nMCHEMBL113948
7.40IC5040nMCHEMBL183812
7.40IC5040nMCHEMBL114161
6.96IC50110nMCHEMBL5848986
6.90IC50125.9nMCHEMBL567341
6.86Ki137nMCHEMBL514348
6.84IC50145nMCHEMBL492175
6.72IC50190nMCHEMBL203663
6.72IC50190nME-64
6.64IC50230nMCHEMBL204605
6.25IC50560nMATUZAGINSTAT
6.25IC50560nMCHEMBL5783659
6.24IC50580nMCHEMBL180839
6.19IC50645.6nMCHEMBL567134
5.89Ki1300nMCHEMBL4287663
5.85Ki1400nMCHEMBL486232
5.81IC501550nMCHEMBL5870472
5.58IC502653nMCHEMBL390474
5.43Ki3700nMCHEMBL212521
5.30IC505000nMCHEMBL426611
5.18IC506591nMCHEMBL230474
5.16Ki7000nMCHEMBL4292287
5.08IC508300nMCHEMBL183660
5.04Ki9200nMLEUPEPTIN
5.01IC509800nMCHEMBL450997
5.00IC501e+04nMGRASSYSTATIN A

PubChem BioAssay actives

25 with measured affinity, of 180 total; 24 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
[(2S)-1-cyclohexylpropan-2-yl] N-[cyano(propan-2-yl)amino]carbamate238907: Inhibition constant against human cathepsin H using L-Arg-b-naphthalamideki0.0005uM
[(2S)-1-cyclohexylpropan-2-yl] N-[cyano(methyl)amino]carbamate238891: Inhibition constant against human cathepsin H using L-Arg-b-naphthalamideki0.0014uM
[(2R)-3-methyl-1-phenylbutan-2-yl] N-[(2S)-1-oxohexan-2-yl]carbamate260672: Inhibition of recombinant human cathepsin H in a fluorescence assayic500.0180uM
[(2S)-1-phenylpropan-2-yl] N-[(2S)-1-oxohexan-2-yl]carbamate260672: Inhibition of recombinant human cathepsin H in a fluorescence assayic500.0400uM
[(2S)-1-cyclohexylpropan-2-yl] 1-cyanoazetidine-2-carboxylate242157: Inhibitory concentration against human cathepsin H using 50 uM L-Arg-beta-naphthalamideic500.0400uM
6-(3,5-difluoroanilino)-9-ethylpurine-2-carbonitrile444705: Inhibition of human Cathepsin Hic500.1259uM
(2S)-2-amino-N-[(E,3S)-1-(benzenesulfonyl)-5-phenylpent-1-en-3-yl]pentanamide370548: Inhibition of human recombinant cathepsin H by fluorescence assayki0.1370uM
3-acetyl-8-chloro-2-(4-chloroanilino)-6-nitro-1H-quinolin-4-one350739: Inhibition of human liver cathepsin H after 30 mins by fluorometric end-point assayic500.1450uM
(2S,3S)-3-[[(2S)-1-[4-(diaminomethylideneamino)butylamino]-4-methyl-1-oxopentan-2-yl]carbamoyl]oxirane-2-carboxylic acid360106: Inhibition of human recombinant cathepsin H after 10 minsic500.1900uM
[(2R)-3-methyl-1-phenylbutan-2-yl] N-[(1E,2S)-1-(morpholine-4-carbonylhydrazinylidene)hexan-2-yl]carbamate260672: Inhibition of recombinant human cathepsin H in a fluorescence assayic500.1900uM
[(2S)-1-phenylpropan-2-yl] N-[(1E,2S)-1-(morpholine-4-carbonylhydrazinylidene)hexan-2-yl]carbamate260672: Inhibition of recombinant human cathepsin H in a fluorescence assayic500.2300uM
[(2S)-1-cyclohexylpropan-2-yl] 1-cyanopyrrolidine-2-carboxylate242157: Inhibitory concentration against human cathepsin H using 50 uM L-Arg-beta-naphthalamideic500.5800uM
9-(3,5-difluorophenyl)-6-(ethylamino)purine-2-carbonitrile444705: Inhibition of human Cathepsin Hic500.6456uM
(2S)-1-[(2S)-2-[[(2S)-2-[[[(2S)-1-[(2S)-1-[(2S)-2-[(4-aminobenzoyl)amino]-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]pyrrolidine-2-carbonyl]amino]carbamoylamino]-3-phenylpropanoyl]amino]-3-hydroxypropanoyl]-N-[(2S)-1-[[(2S)-1-[[(2S)-1-amino-3-(4-hydroxy-3-nitrophenyl)-1-oxopropan-2-yl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-1-oxo-3-phenylpropan-2-yl]pyrrolidine-2-carboxamide1402875: Reversible competitive inhibition of human cathepsin H using fluorogenic AMC-derived peptide substrate assessed as reduction in residual activity preincubated for 30 mins followed by substrate additionki1.3000uM
N-[(2S)-1-[[(E,3S)-1-(benzenesulfonyl)-5-phenylpent-1-en-3-yl]amino]-4-methyl-1-oxopentan-2-yl]morpholine-4-carboxamide370548: Inhibition of human recombinant cathepsin H by fluorescence assayki1.4000uM
(4S,5R)-4-(2,3-dimethylphenoxy)-6-oxa-1-azabicyclo[3.2.1]octan-7-one290156: Inhibition of human recombinant cathepsin Hic502.6530uM
(2S)-2-amino-N-[(1S)-1-cyano-2-(4-phenylphenyl)ethyl]butanamide370548: Inhibition of human recombinant cathepsin H by fluorescence assayki3.7000uM
(2-methyl-2-phenylpropyl) N-[(3S)-1,2-dioxo-1-[[(1R)-1-phenylethyl]amino]heptan-3-yl]carbamate242263: Inhibition of 50 uM L-Arg-beta-naphthalamide binding to human cathepsin H in fluorescence assayic505.0000uM
(4S,5R)-4-phenoxy-6-oxa-1-azabicyclo[3.2.1]octan-7-one290156: Inhibition of human recombinant cathepsin Hic506.5910uM
(4S)-4-[[(2S)-2-[[(2S)-1-[(2S)-2-[[[(2S)-2-[[(2S)-1-[(2S)-2-[[(2S)-2-[[(2S)-2-[(2-acetamidoacetyl)amino]-5-carbamimidamidopentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-3-(1H-imidazol-4-yl)propanoyl]pyrrolidine-2-carbonyl]amino]-4-methylsulfanylbutanoyl]amino]carbamoylamino]propanoyl]pyrrolidine-2-carbonyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-5-[[(2R)-1-[[(2R)-1-amino-5-carbamimidamido-1-oxopentan-2-yl]amino]-1-oxopropan-2-yl]amino]-5-oxopentanoic acid1402875: Reversible competitive inhibition of human cathepsin H using fluorogenic AMC-derived peptide substrate assessed as reduction in residual activity preincubated for 30 mins followed by substrate additionki7.0000uM
[(2S)-1-cyclohexylpropan-2-yl] N-[(3S)-1-cyanopyrrolidin-3-yl]carbamate242157: Inhibitory concentration against human cathepsin H using 50 uM L-Arg-beta-naphthalamideic508.3000uM
(2S)-2-acetamido-N-[(2S)-1-[[(2S)-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]-4-methylpentanamide1125415: Inhibition of human liver cathepsin Hki9.2000uM
2-amino-N-[(2S)-4-diazo-1-(4-iodophenyl)-3-oxobutan-2-yl]acetamide360106: Inhibition of human recombinant cathepsin H after 10 minsic509.8000uM
methyl (2S)-1-[(2R)-2-[[(2S)-2-[[(2S,3R)-2-[[(3S,4S)-4-[[(2S)-4-amino-2-[[(2S)-2-[[(2R)-2-[(2S)-2-[(2S)-2-(dimethylamino)-3-methylbutanoyl]oxy-3-methylbutanoyl]oxy-3-methylbutanoyl]amino]-4-methylpentanoyl]amino]-4-oxobutanoyl]amino]-3-hydroxy-6-methylheptanoyl]amino]-3-hydroxybutanoyl]amino]propanoyl]-methylamino]-3-phenylpropanoyl]pyrrolidine-2-carboxylate448875: Inhibition of cathepsin H after 10 to 15 mins by fluorescence assayic5010.0000uM

CTD chemical–gene interactions

81 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, increases expression, affects expression7
Estradiolaffects cotreatment, decreases expression4
bisphenol Aaffects cotreatment, increases expression, affects expression2
sodium arsenitedecreases expression2
Decitabineaffects cotreatment, affects expression, increases expression2
Arsenic Trioxidedecreases expression, affects cotreatment2
Leaddecreases expression, affects expression2
Smokedecreases expression2
Cyclosporinedecreases expression2
afuresertibincreases expression1
triphenyl phosphateaffects expression1
propionaldehydeincreases expression1
glycidyl methacrylatedecreases expression1
sodium arsenatedecreases expression1
tris(2-butoxyethyl) phosphateaffects expression1
arsenitedecreases expression1
sodium bichromatedecreases expression1
afimoxifeneincreases expression1
butyraldehydeincreases expression1
perfluorooctanoic acidaffects cotreatment, increases expression1
cupric chloridedecreases expression1
pentanalincreases expression1
perfluorooctane sulfonic acidincreases expression1
seocalcitoldecreases expression1
entinostatincreases expression1
K 7174decreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, increases expression1
nutlin 3increases expression, increases secretion, affects cotreatment1
abrinedecreases expression1
dorsomorphinaffects cotreatment, increases expression1

ChEMBL screening assays

58 unique, capped per target: 58 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL829917BindingSelectivity for human cathepsin H over human cathepsin KPotent and selective ketoamide-based inhibitors of cysteine protease, cathepsin K. — J Med Chem

Cellosaurus cell lines

1 cell lines: 1 transformed cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_B2VEAbcam HEK293T CTSH KOTransformed cell lineFemale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.