SCHEMBL6858367

SCHEMBL6858367

CCOC(=O)c1c(-c2ccc(OCC3CC3)cc2)c2cc(O)ccc2n1Cc1cccc(OC)c1

nearest known ligand 0.47

Predicted protein targets (top 15)

geneUniProtsupporting neighboursconfidence
KMT2A Q03164 2/20 0.47
KDM4E B2RXH2 3/20 0.44
MAPT P10636 3/20 0.44
LMNA P02545 2/20 0.44
PKM P14618 2/20 0.44
GAA P10253 1/20 0.44
PLA2G2A P14555 4/20 0.42
EDNRB P24530 3/20 0.42
EDNRA P25101 2/20 0.42
ALDH1A1 P00352 1/20 0.42
ESR1 P03372 1/20 0.42
MAPK8 P45983 2/20 0.41
MEN1 O00255 1/20 0.41
CD74 P04233 1/20 0.40
MIF P14174 1/20 0.40

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL6855787 0.94 KMT2A (0.46) KMT2AKDM4EMAPTLMNAPKM
SCHEMBL6855171 0.93 KMT2A (0.49) KMT2AKDM4EMAPTPLA2G2AEDNRB
SCHEMBL6851204 0.92 PLA2G2A (0.53) KMT2APLA2G2AEDNRBEDNRAMAPK8
SCHEMBL6856189 0.90 KMT2A (0.49) KMT2APLA2G2AEDNRBEDNRAMAPK8
SCHEMBL6855746 0.89 KMT2A (0.46) KMT2AKDM4EMAPTPLA2G2AEDNRB
SCHEMBL27598107 0.88 KMT2A (0.44) KMT2AKDM4EMAPTPLA2G2AEDNRB
SCHEMBL6851245 0.87 KMT2A (0.44) KMT2AKDM4EMAPTPLA2G2AEDNRB
SCHEMBL6858282 0.87 PLA2G2A (0.49) KMT2APLA2G2AEDNRBEDNRA
SCHEMBL27598029 0.86 KMT2A (0.47) KMT2AKDM4EMAPTPLA2G2AEDNRB
SCHEMBL6851390 0.86 KMT2A (0.52) KMT2AKDM4EMAPTLMNAPKM

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 4 patents. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-6787651-B2 2-CARBOXY-SUBSTITUTED; TREATING OSTEOPENIA, OSTEOPOROSIS, CANCER, DIABETES AND ATHEROSCLEROSIS. SMITHKLINE BEECHAM CORPORATION 2004-09-07 US disclosed
EP-1341761-A1 SUBSTITUTED INDOLES, PHARMACEUTICAL COMPOSITIONS CONTAINING SUCH INDOLES AND THEIR USE AS PPAR-$g(g) BINDING AGENTS SMITHKLINE BEECHAM CORPORATION (US) 2003-09-10 EP disclosed
US-20030087902-A1 Substituted indoles, pharmaceutical compounds containing such indoles and their use as PPAR-gamma binding agents SMITHKLINE BEECHAM CORPORATION 2003-05-08 US disclosed
WO-2002030895-A1 SUBSTITUTED INDOLES, PHARMACEUTICAL COMPOSITIONS CONTAINING SUCH INDOLES AND THEIR USE AS PPAR-η BINDING AGENTS SMITHKLINE BEECHAM CORPORATION (US) 2002-04-18 WO disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20030087902-A1 Substituted indoles, pharmaceutical compounds containing such indoles and their use as PPAR-gamma binding agents PPARG, PPARA, PPARD KMT2A 2753/4885KDM4E 1858/4885MAPT 3447/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.