SCHEMBL719808

CCOC(=O)c1sc2cc(C(F)(F)F)cnc2c1C

nearest known ligand 0.53

Predicted protein targets (top 20)

gene | UniProt | supporting neighbours | confidence
ALDH1A1 | P00352 | 7/20 | 0.53
CYP2C9 | P11712 | 1/20 | 0.49
CYP2C19 | P33261 | 1/20 | 0.49
LMNA | P02545 | 2/20 | 0.46
KDM4E | B2RXH2 | 8/20 | 0.46
MEN1 | O00255 | 3/20 | 0.46
KMT2A | Q03164 | 3/20 | 0.46
GAA | P10253 | 3/20 | 0.46
MAPT | P10636 | 2/20 | 0.46
HSP90AA1 | P07900 | 1/20 | 0.46
CRHBP | P24387 | 1/20 | 0.46
CRHR2 | Q13324 | 1/20 | 0.46
L3MBTL1 | Q9Y468 | 1/20 | 0.43
GABRP | O00591 | 1/20 | 0.42
GABRD | O14764 | 1/20 | 0.42
GABRA1 | P14867 | 1/20 | 0.42
GABRB1 | P18505 | 1/20 | 0.42
GABRG2 | P18507 | 1/20 | 0.42
GABRB3 | P28472 | 1/20 | 0.42
GABRA5 | P31644 | 1/20 | 0.42

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound | similarity | top predicted | shared targets
SCHEMBL27852359 | 0.84 | CYP2C9 (0.55) | ALDH1A1, CYP2C9, CYP2C19, LMNA, KDM4E
SCHEMBL28794517 | 0.83 | CYP2C9 (0.64) | ALDH1A1, CYP2C9, CYP2C19, LMNA, KDM4E
SCHEMBL3886292 | 0.76 | POLB (0.63) | ALDH1A1, KDM4E, MEN1, KMT2A, GAA
SCHEMBL720933 | 0.75 | CYP2C9 (0.46) | ALDH1A1, CYP2C9, CYP2C19, LMNA, KDM4E
SCHEMBL3683576 | 0.74 | LMNA (0.53) | ALDH1A1, CYP2C9, CYP2C19, LMNA, KDM4E
SCHEMBL4118771 | 0.74 | ALDH1A1 (0.67) | ALDH1A1, CYP2C9, CYP2C19, LMNA, KDM4E
SCHEMBL7787959 | 0.73 | AGBL2 (0.51) | ALDH1A1, KDM4E, MEN1, KMT2A, GAA
SCHEMBL716644 | 0.72 | LMNA (0.49) | ALDH1A1, LMNA, KDM4E, MEN1, KMT2A
SCHEMBL25294081 | 0.72 | KDM4E (0.51) | ALDH1A1, KDM4E, MEN1, KMT2A, GAA
SCHEMBL15738846 | 0.71 | POLB (0.44) | ALDH1A1, KDM4E, MEN1, KMT2A, GAA

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.
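
The note above treats cosine and Tanimoto as roughly interchangeable. A minimal sketch of both measures follows, assuming each fingerprint is held as a set of on-bit indices; that representation, the function names, and the toy bit indices are illustrative assumptions, not taken from this page. For binary fingerprints the cosine score is never smaller than the Tanimoto score, and both equal 1.00 when the fingerprints are identical.

```python
import math


def cosine_similarity(bits_a: set, bits_b: set) -> float:
    """Cosine similarity of two binary fingerprints given as sets of on-bit indices."""
    if not bits_a or not bits_b:
        return 0.0
    shared = len(bits_a & bits_b)
    # For 0/1 vectors the dot product is the shared-bit count and
    # each vector norm is the square root of its on-bit count.
    return shared / math.sqrt(len(bits_a) * len(bits_b))


def tanimoto_similarity(bits_a: set, bits_b: set) -> float:
    """Tanimoto (Jaccard) similarity: shared on-bits over the union of on-bits."""
    union = len(bits_a | bits_b)
    if union == 0:
        return 0.0
    return len(bits_a & bits_b) / union


# Toy fingerprints: made-up bit indices inside a 2,048-bit space.
fp_query = {3, 17, 256, 1024, 2047}
fp_neighbour = {3, 17, 256, 900, 2047}

cos = cosine_similarity(fp_query, fp_neighbour)
tan = tanimoto_similarity(fp_query, fp_neighbour)
print(f"cosine={cos:.2f} tanimoto={tan:.2f}")
```

With four shared bits out of five on each side, the cosine score is 0.80 while Tanimoto is about 0.67, which illustrates how loose the approximation can be away from 1.00.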

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 4 patents. "Claimed" means the compound appears in the patent's claims; "disclosed" means it appears in the body only.

Patent | Title | Assignee | Published | Priority | Filing | Country | Status
US-8436043-B2 | Heterocyclic compound | TAKEDA PHARMACEUTICAL COMPANY LIMITED (JP) | 2013-05-07 | | | US | disclosed
US-20120270865-A2 | HETEROCYCLIC COMPOUND | TAKEDA PHARMACEUTICAL COMPANY LIMITED (JP) | 2012-10-25 | | | US | disclosed
US-20120053173-A1 | HETEROCYCLIC COMPOUND | TAKEDA PHARMACEUTICAL COMPANY LIMITED (JP) | 2012-03-01 | | | US | disclosed
EP-2251326-A1 | HETEROCYCLIC COMPOUND | Takeda Pharmaceutical Company Limited (JP) | 2010-11-17 | | | EP | disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (2 of them; usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where each chemistry-predicted target ranks among 4885 human targets. A rank near the top (a small number, such as 1/4885) means the patent's own wording is consistent with the prediction. This is a weak, independent signal, not proof of activity.

Patent | Title | Text reads most about | Predicted target · text-rank
US-20120053173-A1 | HETEROCYCLIC COMPOUND | SLC5A2, SLC5A1, IAPP | ALDH1A1 504/4885, CYP2C9 896/4885, CYP2C19 946/4885
US-20120270865-A2 | HETEROCYCLIC COMPOUND | SLC5A2, SLC5A1, IAPP | ALDH1A1 504/4885, CYP2C9 896/4885, CYP2C19 946/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.
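
A text-rank such as 504/4885 can be read as the 1-based position of a chemistry-predicted target once all targets are sorted by how closely they match the patent text. A minimal sketch of that ranking step follows, assuming a plain dictionary of per-target similarity scores; the scores are made-up numbers, not MedCPT output, the `text_rank` helper is hypothetical, and the background-debiasing mentioned above is not reproduced.

```python
def text_rank(scores: dict, target: str) -> tuple:
    """Return (rank, total), where rank 1 is the target most similar to the text."""
    # Sort target names from highest to lowest similarity score.
    ordered = sorted(scores, key=scores.get, reverse=True)
    return ordered.index(target) + 1, len(ordered)


# Toy similarity scores between one patent abstract and five targets (illustrative only).
toy_scores = {
    "SLC5A2": 0.91,
    "SLC5A1": 0.88,
    "IAPP": 0.80,
    "ALDH1A1": 0.35,
    "CYP2C9": 0.21,
}

rank, total = text_rank(toy_scores, "ALDH1A1")
print(f"ALDH1A1 {rank}/{total}")  # prints "ALDH1A1 4/5"
```

In this toy case the predicted target sits fourth of five, so the text gives it little support; a rank of 1 would mean the text and the chemistry-based prediction point at the same protein.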