SCHEMBL4797378

CCc1ccc(Cc2c(C)cc(OC(C)=O)cc2O)cc1

nearest known ligand 0.49

Predicted protein targets (top 20)

gene | UniProt | supporting neighbours | confidence
THRA | P10827 | 7/20 | 0.49
THRB | P10828 | 7/20 | 0.49
AKT1 | P31749 | 1/20 | 0.44
CYP3A4 | P08684 | 3/20 | 0.40
HPGD | P15428 | 2/20 | 0.38
ALDH1A1 | P00352 | 2/20 | 0.37
MAPK1 | P28482 | 2/20 | 0.37
HSD17B10 | Q99714 | 2/20 | 0.37
KMT2A | Q03164 | 2/20 | 0.37
MEN1 | O00255 | 1/20 | 0.37
ALOX15 | P16050 | 1/20 | 0.37
LMNA | P02545 | 3/20 | 0.37
KDM4E | B2RXH2 | 2/20 | 0.36
GLA | P06280 | 2/20 | 0.36
GAA | P10253 | 2/20 | 0.36
TP53 | P04637 | 1/20 | 0.36
MAPT | P10636 | 1/20 | 0.36
CASP1 | P29466 | 1/20 | 0.36
HTT | P42858 | 1/20 | 0.36
CASP7 | P55210 | 1/20 | 0.36

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound | similarity | top predicted | shared targets
SCHEMBL27767713 | 0.90 | THRA (0.50) | THRA, THRB, CYP3A4, LMNA, KDM4E
SCHEMBL27767721 | 0.88 | THRA (0.49) | THRA, THRB, HPGD, ALDH1A1, HSD17B10
SCHEMBL27746702 | 0.88 | THRA (0.48) | THRA, THRB, CYP3A4, MAPK1, HSD17B10
SCHEMBL27746660 | 0.88 | THRA (0.48) | THRA, THRB, CYP3A4, HPGD, ALDH1A1
SCHEMBL27767686 | 0.88 | THRA (0.48) | THRA, THRB, AKT1, CYP3A4, KMT2A
SCHEMBL27746659 | 0.88 | THRA (0.48) | THRA, THRB, CYP3A4, ALDH1A1, KMT2A
SCHEMBL6724702 | 0.86 | SLC5A1 (0.47) | THRA, THRB, ALDH1A1, MAPK1, HSD17B10
SCHEMBL27767700 | 0.83 | THRA (0.44) | THRA, THRB, CYP3A4, HPGD, ALDH1A1
SCHEMBL27746689 | 0.83 | THRA (0.44) | THRA, THRB, CYP3A4, ALDH1A1, KMT2A
SCHEMBL4288355 | 0.82 | THRA (0.42) | THRA, THRB, CYP3A4, ALDH1A1, MAPT

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.
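To make the similarity note concrete, here is a minimal sketch of cosine and Tanimoto on binary fingerprints. It assumes each fingerprint is represented as the set of its on-bit indices (0 to 2047). The function names and the example bit sets are illustrative only and are not taken from this page's pipeline.

```python
from math import sqrt


def cosine(a: set[int], b: set[int]) -> float:
    """Cosine similarity of two binary fingerprints given as sets of on-bit indices."""
    if not a or not b:
        return 0.0
    # For 0/1 vectors the dot product is the number of shared on-bits,
    # and each vector's norm is the square root of its on-bit count.
    return len(a & b) / sqrt(len(a) * len(b))


def tanimoto(a: set[int], b: set[int]) -> float:
    """Tanimoto (Jaccard) similarity: shared on-bits over the union of on-bits."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


# Hypothetical on-bit sets, chosen only to show the arithmetic.
fp_a = {3, 17, 256, 1024, 2047}
fp_b = {3, 17, 256, 999}

print(round(cosine(fp_a, fp_b), 3))    # 3 / sqrt(5 * 4)
print(round(tanimoto(fp_a, fp_b), 3))  # 3 / 6
```

For binary fingerprints, cosine is never smaller than Tanimoto, and both equal 1.00 when the two fingerprints are identical, so the cosine scores in the table are best read as an upper bound on the corresponding Tanimoto values.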

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 8 patents. claimed = in the patent's claims; disclosed = body only.

Patent | Title | Assignee | Published | Priority | Filing | Country | Status
CN-101472937-A | Benzylphenylglucopyranoside Derivatives | DAIICHI SANKYO CO LTD (JP) | 2009-07-01 | | | CN | disclosed
US-7465712-B2 | Glucopyranosyloxy benzylbenzene derivatives, medicinal compositions containing the same and intermediates for the preparation of the derivatives | KISSEI PHARMACEUTICAL CO., LTD. (JP) | 2008-12-16 | | | US | disclosed
CN-100341885-C | Process for selective production of aryl 5-thio-beta-d-aldohexopyranosides | TAISHO PHARMACEUTICAL CO LTD (JP) | 2007-10-10 | | | CN | disclosed
CN-1675233-A | Process for selective production of aryl 5-thio-beta-d-aldohexopyranosides | TAISHO PHARMACEUTICAL CO LTD (JP) | 2005-09-28 | | | CN | disclosed
CN-1177857-C | Glucopyranosyloxy benzylbenzene derivatives, medicinal compositions containing the same and intermediates for the preparation of the derivatives | Kissei Pharmaceutical Co., Ltd. | 2004-12-01 | | | CN | disclosed
US-20040053855-A1 | Glucopyranosyloxy benzylbenzene derivatives, medicinal compositions containing the same and intermediates for the preparation of the derivatives | KISSEI PHARMACEUTICAL CO & LTD. (JP) | 2004-03-18 | | | US | disclosed
CN-1418219-A | Glucopyranosyloxy benzylbenzene derivatives, medicinal compositions containing the same and intermediates for the preparation of the derivatives | KISSEI PHARMACEUTICAL (JP) | 2003-05-14 | | | CN | disclosed
EP-1270584-A1 | GLUCOPYRANOSYLOXY BENZYLBENZENE DERIVATIVES, MEDICINAL COMPOSITIONS CONTAINING THE SAME AND INTERMEDIATES FOR THE PREPARATION OF THE DERIVATIVES | Kissei Pharmaceutical Co., Ltd. (JP) | 2003-01-02 | | | EP | disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them; usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank (a position near 1 of 4885) means the patent's own wording is consistent with the prediction: a weak, independent signal, not proof of activity.

Patent | Title | Text reads most about | Predicted target · text-rank
US-20040053855-A1 | Glucopyranosyloxy benzylbenzene derivatives, medicinal compositions containing the same and intermediates for the preparation of the derivatives | SLC5A2, SLC5A1, SLC2A2 | THRA 3338/4885, THRB 1522/4885, AKT1 1952/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.
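The text-rank column can be read as the position of a predicted target when all candidate proteins are sorted by how closely the patent text matches them. The sketch below shows that ranking step only. It assumes a text-to-protein similarity score is already available for every candidate, and that rank 1 is the best match. The function name `text_rank` and the scores are invented for illustration; they are not MedCPT outputs and the function is not part of any MedCPT API.

```python
def text_rank(scores: dict[str, float], target: str) -> int:
    """Return the 1-based rank of `target` when candidates are sorted by descending score."""
    target_score = scores[target]
    # Rank = 1 + number of candidates that score strictly higher than the target.
    return 1 + sum(1 for s in scores.values() if s > target_score)


# Invented toy scores over six candidates (the real panel ranks 4885 targets).
toy_scores = {
    "SLC5A2": 0.91,
    "SLC5A1": 0.88,
    "SLC2A2": 0.80,
    "THRB": 0.35,
    "AKT1": 0.30,
    "THRA": 0.12,
}

for gene in ("SLC5A2", "THRB", "THRA"):
    print(f"{gene} {text_rank(toy_scores, gene)}/{len(toy_scores)}")
```

Under this reading, a value such as THRA 3338/4885 means the abstract matches several thousand other proteins more closely than it matches the chemistry-predicted target.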