SCHEMBL3607217

SCHEMBL3607217

COC(=O)c1sc(-n2cnc3ccc(CO)cc32)cc1OO[C@H](C)c1ccccc1Cl

nearest known ligand 0.60

Predicted protein targets (top 12)

geneUniProtsupporting neighboursconfidence
PLK1 P53350 16/20 0.57
PLK3 Q9H4B4 14/20 0.57
IKBKE Q14164 2/20 0.54
CDK2 P24941 1/20 0.54
PIK3CD O00329 1/20 0.48
NUAK1 O60285 1/20 0.48
PDGFRB P09619 1/20 0.48
PIM1 P11309 1/20 0.48
KDR P35968 1/20 0.48
NEK2 P51955 1/20 0.48
CYP2C9 P11712 3/20 0.47
CYP3A4 P08684 3/20 0.47

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL2989088 0.92 PLK1 (0.68) PLK1PLK3IKBKECDK2PIK3CD
SCHEMBL2973858 0.85 PLK1 (0.68) PLK1PLK3IKBKECDK2PIK3CD
SCHEMBL3254468 0.84 PLK1 (0.64) PLK1PLK3IKBKECDK2PIK3CD
SCHEMBL2987942 0.84 PLK1 (0.66) PLK1PLK3IKBKECDK2PIK3CD
SCHEMBL4398001 0.84 PLK1 (0.67) PLK1PLK3IKBKECDK2PIK3CD
SCHEMBL2982059 0.84 PLK1 (0.67) PLK1PLK3IKBKECDK2PIK3CD
SCHEMBL4413716 0.83 PLK1 (0.65) PLK1PLK3IKBKECDK2PIK3CD
SCHEMBL4406434 0.83 PLK1 (0.79) PLK1PLK3IKBKECDK2PIK3CD
SCHEMBL4398661 0.83 PLK1 (0.64) PLK1PLK3IKBKECDK2PIK3CD
SCHEMBL4401420 0.82 PLK1 (0.65) PLK1PLK3IKBKECDK2PIK3CD

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 1 patent. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-20100075960-A1 BENZIMIDAZOLE THIOPHENE COMPOUNDS SMITHKLINE BEECHAM CORPORATION 2010-03-25 US disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20100075960-A1 BENZIMIDAZOLE THIOPHENE COMPOUNDS ABCB1, CYP3A5, CYP3A4 PLK1 450/4885PLK3 1025/4885IKBKE 1461/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.