SCHEMBL6157819

CC(=O)O[C@H]1CC(=O)N(Cc2ccc(Oc3ccc(C)cc3)cc2)C1=O

nearest known ligand 0.44

Predicted protein targets (top 20)

gene | UniProt | supporting neighbours | confidence
KMT2A | Q03164 | 2/20 | 0.44
ALDH1A1 | P00352 | 4/20 | 0.43
POLB | P06746 | 3/20 | 0.43
L3MBTL1 | Q9Y468 | 1/20 | 0.42
SMN1; SMN2 | Q16637 | 2/20 | 0.41
GAA | P10253 | 2/20 | 0.40
USP2 | O75604 | 1/20 | 0.38
PKM | P14618 | 1/20 | 0.38
MAPK1 | P28482 | 1/20 | 0.38
MAPT | P10636 | 2/20 | 0.38
KDM4E | B2RXH2 | 1/20 | 0.38
HTT | P42858 | 1/20 | 0.38
EGLN1 | Q9GZT9 | 1/20 | 0.38
LMNA | P02545 | 2/20 | 0.38
LPAR1 | Q92633 | 1/20 | 0.38
LPAR3 | Q9UBY5 | 1/20 | 0.38
FFAR1 | O14842 | 1/20 | 0.37
CYP1A2 | P05177 | 1/20 | 0.37
CYP3A4 | P08684 | 1/20 | 0.37
CYP2C9 | P11712 | 1/20 | 0.37

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound | similarity | top predicted | shared targets
SCHEMBL6162057 | 0.91 | S1PR5 (0.46) | KMT2A, ALDH1A1, POLB, SMN1; SMN2, GAA
SCHEMBL6158254 | 0.84 | ALDH1A1 (0.47) | KMT2A, ALDH1A1, POLB, L3MBTL1, GAA
SCHEMBL3593084 | 0.82 | ALDH1A1 (0.55) | KMT2A, ALDH1A1, POLB, SMN1; SMN2, GAA
SCHEMBL9636719 | 0.82 | ALDH1A1 (0.55) | KMT2A, ALDH1A1, POLB, SMN1; SMN2, GAA
SCHEMBL6159091 | 0.80 | ALDH1A1 (0.50) | KMT2A, ALDH1A1, POLB, SMN1; SMN2, GAA
SCHEMBL6162428 | 0.79 | EGLN1 (0.45) | EGLN1
SCHEMBL6162419 | 0.79 | EGLN1 (0.45) | EGLN1
SCHEMBL6164470 | 0.78 | EGLN1 (0.44) | ALDH1A1, L3MBTL1, SMN1; SMN2, EGLN1
SCHEMBL6164463 | 0.78 | EGLN1 (0.44) | ALDH1A1, L3MBTL1, SMN1; SMN2, EGLN1
SCHEMBL6161464 | 0.74 | HCRTR1 (0.36) | EGLN1
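
The "shared targets" column can be read as a list intersection between this molecule's predicted targets and a neighbour's. The sketch below is only an illustration of that idea, not the site's actual code: `query_targets` holds the first six genes from the table of predicted targets above, while `neighbour_targets` and the function name `shared_targets` are hypothetical.

```python
def shared_targets(query: list[str], neighbour: list[str]) -> list[str]:
    """Return targets predicted for both molecules, in the query's ranking order."""
    neighbour_set = set(neighbour)
    return [gene for gene in query if gene in neighbour_set]


# First six predicted targets of SCHEMBL6157819, as listed on this page.
query_targets = ["KMT2A", "ALDH1A1", "POLB", "L3MBTL1", "SMN1; SMN2", "GAA"]

# Hypothetical prediction list for a neighbour (invented for illustration).
neighbour_targets = ["S1PR5", "ALDH1A1", "GAA", "KMT2A", "HCRTR1"]

overlap = shared_targets(query_targets, neighbour_targets)
print(overlap)
```

Keeping the query's order means the shared list is ranked by this molecule's confidence rather than the neighbour's.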

Similarity is the cosine over the 2,048-bit Morgan fingerprint. It tracks Tanimoto similarity but, for binary fingerprints, is never lower than it. Identical fingerprints score 1.00.
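
To make the relation between cosine and Tanimoto concrete, here is a minimal sketch that treats each fingerprint as the set of its "on" bit positions. The two fingerprints are made-up toy examples, not real Morgan fingerprints, and the function names are illustrative. In practice the bits would come from a cheminformatics toolkit.

```python
from math import sqrt


def cosine(a: set[int], b: set[int]) -> float:
    """Cosine similarity of two binary fingerprints given as sets of on-bits."""
    if not a or not b:
        return 0.0
    return len(a & b) / sqrt(len(a) * len(b))


def tanimoto(a: set[int], b: set[int]) -> float:
    """Tanimoto (Jaccard) similarity of the same two fingerprints."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


# Toy on-bit positions within a 2,048-bit vector (hypothetical values).
fp_query = {3, 17, 42, 256, 1024, 2000}
fp_neighbour = {3, 17, 42, 256, 1999}

cos = cosine(fp_query, fp_neighbour)    # 4 / sqrt(6 * 5)
tan = tanimoto(fp_query, fp_neighbour)  # 4 / 7
print(f"cosine={cos:.2f} tanimoto={tan:.2f}")
```

Both measures reach 1.00 for identical fingerprints and 0 for disjoint ones, but in between the cosine is the larger of the two, so the scores in the table should not be compared directly with Tanimoto thresholds.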

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 6 patents. claimed = in the patent's claims; disclosed = body only.

Patent | Title | Assignee | Published | Priority | Filing | Country | Status
US-6953789-B2 | Thiol compounds, their production and use | TAKEDA CHEMICAL INDUSTRIES, LTD. (JP) | 2005-10-11 | | | US | disclosed
US-20040157894-A1 | Thiol compounds, their production and use | TAKEDA PHARMACEUTICAL COMPANY LIMITED (JP) | 2004-08-12 | | | US | disclosed
US-6699881-B2 | MATRIX METALLOPROTEASE INHIBITORS | TAKEDA CHEMICAL INDUSTRIES, LTD. (JP) | 2004-03-02 | | | US | disclosed
US-20030078253-A1 | Thiol compounds, their production and use | TAKEDA PHARMACEUTICAL COMPANY, LIMITED (JP) | 2003-04-24 | | | US | disclosed
US-6420415-B1 | Thiol compounds, their production and use | TAKEDA CHEMICAL INDUSTRIES, LTD. (JP) | 2002-07-16 | | | US | disclosed
EP-1132379-A1 | NOVEL THIOL DERIVATIVES, PROCESS FOR PRODUCING THE SAME AND UTILIZATION THEREOF | Takeda Chemical Industries, Ltd. (JP) | 2001-09-12 | | | EP | disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (2 of them; usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where each chemistry-predicted target lands among 4885 human targets. A rank close to 1 means the patent's own wording is consistent with the prediction. This is a weak, independent signal, not proof of activity.

Patent | Title | Text reads most about | Predicted target · text-rank
US-20030078253-A1 | Thiol compounds, their production and use | TST, QSOX1, SULT1E1 | KMT2A 1898/4885; ALDH1A1 905/4885; POLB 3567/4885
US-20040157894-A1 | Thiol compounds, their production and use | TST, SULT1E1, QSOX1 | KMT2A 1850/4885; ALDH1A1 877/4885; POLB 3489/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.
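
As a rough illustration of how a text-rank such as "905/4885" could be derived and read, the sketch below ranks one target among a set of text-to-protein similarity scores and converts a rank into a fraction of the target list. The scores in `scores` are invented for the example. The real MedCPT scoring and background debiasing are not reproduced here, and the function names are hypothetical.

```python
def text_rank(scores: dict[str, float], target: str) -> int:
    """1-based rank of `target` when proteins are sorted by descending score."""
    target_score = scores[target]
    return 1 + sum(1 for s in scores.values() if s > target_score)


def rank_fraction(rank: int, total: int) -> float:
    """Position of a rank within the list; smaller means nearer the top."""
    return rank / total


# Hypothetical similarity scores between one abstract and six proteins.
scores = {
    "TST": 0.81, "QSOX1": 0.79, "SULT1E1": 0.77,
    "ALDH1A1": 0.52, "KMT2A": 0.40, "POLB": 0.31,
}
aldh_rank = text_rank(scores, "ALDH1A1")

# Ranks reported on this page for US-20030078253-A1, out of 4885 targets.
reported = {"KMT2A": 1898, "ALDH1A1": 905, "POLB": 3567}
fractions = {gene: round(rank_fraction(r, 4885), 3) for gene, r in reported.items()}
print(aldh_rank, fractions)
```

Read this way, ALDH1A1 sits in roughly the top fifth of the target list for that abstract, while POLB falls in the lower half.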