SCHEMBL6158839

SCHEMBL6158839

Cc1ccc(S(=O)(=O)N(c2ccccc2)c2ccc(C=O)cc2)cc1

nearest known ligand 0.59

Predicted protein targets (top 20)

geneUniProtsupporting neighboursconfidence
ALDH1A1 P00352 3/20 0.59
PKM P14618 1/20 0.56
CA12 O43570 1/20 0.55
CA9 Q16790 1/20 0.55
ESR1 P03372 1/20 0.53
CYP2A6 P11509 1/20 0.48
CNR2 P34972 4/20 0.48
CNR1 P21554 3/20 0.48
MAPT P10636 2/20 0.47
MCOLN3 Q8TDD5 1/20 0.47
KDM4E B2RXH2 1/20 0.46
MEN1 O00255 1/20 0.46
KMT2A Q03164 1/20 0.46
C5AR1 P21730 1/20 0.45
NPSR1 Q6W5P4 2/20 0.44
HTT P42858 2/20 0.44
L3MBTL1 Q9Y468 1/20 0.44
ATM Q13315 1/20 0.44
KEAP1 Q14145 1/20 0.43
NFE2L2 Q16236 1/20 0.43

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL3697451 0.87 PKM (0.71) ALDH1A1PKMCA12CA9ESR1
SCHEMBL28634601 0.85 ALDH1A1 (0.66) ALDH1A1PKMCA12CA9ESR1
SCHEMBL3702995 0.84 ALDH1A1 (0.63) ALDH1A1PKMCA12CA9ESR1
SCHEMBL3701001 0.82 MAPT (0.67) ALDH1A1PKMCA12CA9ESR1
SCHEMBL3886338 0.81 ALDH1A1 (0.62) ALDH1A1PKMCA12CA9ESR1
SCHEMBL18892841 0.79 CYP2A6 (0.61) ALDH1A1CA12CA9CYP2A6CNR2
SCHEMBL5354776 0.79 CYP2A6 (0.61) ALDH1A1CA12CA9CYP2A6CNR2
SCHEMBL19222015 0.79 CYP2A6 (0.61) ALDH1A1CA12CA9CYP2A6CNR2
SCHEMBL11230768 0.79 CYP2A6 (0.61) ALDH1A1CA12CA9CYP2A6CNR2
SCHEMBL14047565 0.79 CYP2A6 (0.61) ALDH1A1CA12CA9CYP2A6CNR2

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 6 patents. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-6953789-B2 Thiol compounds, their production and use TAKEDA CHEMICAL INDUSTRIES, LTD. (JP) 2005-10-11 US disclosed
US-20040157894-A1 Thiol compounds, their production and use TAKEDA PHARMACEUTICAL COMPANY LIMITED (JP) 2004-08-12 US disclosed
US-6699881-B2 MATRIX METALLOPROTEASE INHIBITORS TAKEDA CHEMICAL INDUSTRIES, LTD. (JP) 2004-03-02 US disclosed
US-20030078253-A1 Thiol compounds, their production and use TAKEDA PHARMACEUTICAL COMPANY, LIMITED (JP) 2003-04-24 US disclosed
US-6420415-B1 Thiol compounds, their production and use TAKEDA CHEMICAL INDUSTRIES, LTD. (JP) 2002-07-16 US disclosed
EP-1132379-A1 NOVEL THIOL DERIVATIVES, PROCESS FOR PRODUCING THE SAME AND UTILIZATION THEREOF Takeda Chemical Industries, Ltd. (JP) 2001-09-12 EP disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (2 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20030078253-A1 Thiol compounds, their production and use TST, QSOX1, SULT1E1 ALDH1A1 905/4885PKM 4604/4885CA12 2940/4885
US-20040157894-A1 Thiol compounds, their production and use TST, SULT1E1, QSOX1 ALDH1A1 877/4885PKM 4525/4885CA12 3006/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.