SCHEMBL2398747

SCHEMBL2398747

CSc1ccc(OC(F)(F)[C@H](NC(CSC(c2ccccc2)(c2ccccc2)c2ccccc2)C(=O)O)c2ccc(F)cc2)cc1

nearest known ligand 0.47

Predicted protein targets (top 8)

geneUniProtsupporting neighboursconfidence
KIF11 P52732 12/20 0.47
CTSL P07711 3/20 0.38
CTSB P07858 1/20 0.38
CTSS P25774 1/20 0.38
CTSK P43235 1/20 0.38
EDNRB P24530 1/20 0.37
EDNRA P25101 1/20 0.37
DNMT1 P26358 1/20 0.36

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL2398750 1.00 KIF11 (0.47) KIF11CTSLCTSBCTSSCTSK
SCHEMBL2396614 0.83 CTSL (0.42) CTSLCTSBCTSSCTSK
SCHEMBL2396618 0.83 CTSL (0.42) CTSLCTSBCTSSCTSK
SCHEMBL1272459 0.83 KIF11 (0.56) KIF11CTSLCTSBCTSSCTSK
SCHEMBL13725978 0.83 KIF11 (0.56) KIF11CTSLCTSBCTSSCTSK
SCHEMBL1331473 0.83 KIF11 (0.56) KIF11CTSLCTSBCTSSCTSK
SCHEMBL1272910 0.83 KIF11 (0.56) KIF11CTSLCTSBCTSSCTSK
SCHEMBL1272460 0.83 KIF11 (0.56) KIF11CTSLCTSBCTSSCTSK
SCHEMBL2401298 0.81 CTSL (0.37) CTSLCTSBCTSSCTSK
SCHEMBL1272934 0.80 KIF11 (0.53) KIF11CTSLDNMT1

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 2 patents. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-8013186-B2 Haloalkyl containing compounds as cysteine protease inhibitors VIROBAY, INC. (US) 2011-09-06 US disclosed
US-20090023781-A1 Haloalkyl Containing Compounds as Cysteine Protease Inhibitors VIROBAY, INC. (US) 2009-01-22 US disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20090023781-A1 Haloalkyl Containing Compounds as Cysteine Protease Inhibitors CTSF, CTSS, CTSE KIF11 2441/4885CTSL 9/4885CTSB 5/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.