SCHEMBL13661911

SCHEMBL13661911

COC(=O)[C@H]1C[C@H](O)CN1B(C)O

nearest known ligand 0.46

Predicted protein targets (top 12)

geneUniProtsupporting neighboursconfidence
ALDH1A1 P00352 2/20 0.46
MMP2 P08253 3/20 0.42
ANPEP P15144 3/20 0.42
ATM Q13315 1/20 0.36
CYP3A4 P08684 5/20 0.35
MAPT P10636 2/20 0.35
KDM4E B2RXH2 1/20 0.35
MEN1 O00255 1/20 0.34
KMT2A Q03164 1/20 0.34
PEPD P12955 1/20 0.33
SCN9A Q15858 1/20 0.31
POLB P06746 1/20 0.31

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL2608451 1.00 ALDH1A1 (0.46) ALDH1A1MMP2ANPEPATMCYP3A4
SCHEMBL2610758 1.00 ALDH1A1 (0.46) ALDH1A1MMP2ANPEPATMCYP3A4
SCHEMBL16376157 1.00 ALDH1A1 (0.46) ALDH1A1MMP2ANPEPATMCYP3A4
SCHEMBL18084393 1.00 ALDH1A1 (0.46) ALDH1A1MMP2ANPEPATMCYP3A4
SCHEMBL26379162 0.86 PEPD (0.41) ALDH1A1MMP2ANPEPCYP3A4MAPT
SCHEMBL9969991 0.83 ALDH1A1 (0.40) ALDH1A1MMP2ANPEPCYP3A4MAPT
SCHEMBL13920579 0.82 KMT2A (0.34) ALDH1A1ATMCYP3A4MAPTMEN1
SCHEMBL13920501 0.82 DPP4 (0.35) ALDH1A1ATM
SCHEMBL18084388 0.81 MMP2 (0.38) ALDH1A1MMP2ANPEPCYP3A4MAPT
SCHEMBL12795646 0.81 MMP2 (0.38) ALDH1A1MMP2ANPEPPOLB

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 1 patent. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-20090264431-A1 NOVEL CATHEPSIN C INHIBITORS AND THEIR USE GLAXO GROUP LIMITED (GB) 2009-10-22 US disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20090264431-A1 NOVEL CATHEPSIN C INHIBITORS AND THEIR USE CTSB, CTSS, CTSF ALDH1A1 2060/4885MMP2 57/4885ANPEP 129/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.