SCHEMBL2935501

SCHEMBL2935501

CCCCOC(=O)c1sc(C(=O)OCCCC)c(O)c1O

nearest known ligand 0.54

Predicted protein targets (top 20)

geneUniProtsupporting neighboursconfidence
ESR1 P03372 10/20 0.54
TSHR P16473 5/20 0.51
LMNA P02545 2/20 0.51
CYP1A2 P05177 2/20 0.51
CYP2C19 P33261 2/20 0.51
MAPK1 P28482 2/20 0.51
CYP2D6 P10635 1/20 0.51
NR1H2 P55055 1/20 0.51
RNASEL Q05823 1/20 0.51
SMN1; SMN2 Q16637 1/20 0.51
L3MBTL1 Q9Y468 2/20 0.50
TDP1 Q9NUW8 1/20 0.50
AKR1C4 P17516 1/20 0.47
AKR1C3 P42330 1/20 0.47
AKR1C2 P52895 1/20 0.47
AKR1C1 Q04828 1/20 0.47
ALDH1A1 P00352 1/20 0.47
COMT P21964 1/20 0.46
CYP3A4 P08684 2/20 0.46
CYP2C9 P11712 1/20 0.46

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL6205874 0.93 ESR1 (0.49) ESR1TSHRLMNACYP1A2CYP2C19
SCHEMBL2935502 0.93 ESR1 (0.49) ESR1TSHRLMNACYP1A2CYP2C19
Protiofate SCHEMBL871311 0.88 MAPT (0.50) ESR1TSHRLMNACYP1A2CYP2C19
Protiofate SCHEMBL5378233 0.81 MAPT (0.45) ESR1TSHRLMNASMN1; SMN2TDP1
SCHEMBL1141274 0.79 GAA (0.56) TSHRLMNACYP1A2SMN1; SMN2ALDH1A1
SCHEMBL11099233 0.79 ALDH1A1 (0.48) TSHRCYP1A2SMN1; SMN2ALDH1A1KDM4E
SCHEMBL6231662 0.78 ESR1 (0.52) ESR1AKR1C4AKR1C3AKR1C2AKR1C1
SCHEMBL11095728 0.78 GAA (0.50) TSHRSMN1; SMN2ALDH1A1CYP3A4KDM4E
SCHEMBL10923446 0.78 MAPT (0.56) ESR1TSHRLMNATDP1ALDH1A1
SCHEMBL1726035 0.78 GAA (0.55) TSHRLMNACYP1A2SMN1; SMN2ALDH1A1

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 3 patents. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
EP-1461341-B1 PROCESS FOR PREPARING A HETEROAROMATIC COMPOUND SUBSTITUTED WITH ONE OR MORE ETHER GROUPS AGFA GEVAERT (BE) 2010-01-27 EP disclosed
US-6734312-B2 CONDENSING HYDROXY- GROUP OF A COMPOUND HAVING HETEROAROMATIC NUCLEUS, WITH AN ALCOHOL CONTAINING ONE OR MORE PRIMARY OR SECONDARY ALCOHOL GROUPS, OPTIONALLY USING THE REDOX COUPLE OF A TRIARYL- OR TRIALKYLPHOSPHINE AND AN AZODIOXO-COMPOUND AGFA-GEVAERT (BE) 2004-05-11 US disclosed
US-20030162958-A1 Process for preparing a compound containing a heteroaromatic group with one or more ether substituents AGFA-GEVAERT (BE) 2003-08-28 US disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20030162958-A1 Process for preparing a compound containing a heteroaromatic group with one or more ether substituents ADH1C, ADH1A, CYP2E1 ESR1 1160/4885TSHR 1485/4885LMNA 640/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.