SCHEMBL10024038

COC[C@H](C[C@@H](O)CC(C)=O)OC

nearest known ligand 0.33

Predicted protein targets (top 4)

gene | UniProt | supporting neighbours | confidence
ALDH1A1 | P00352 | 1/20 | 0.33
TDP1 | Q9NUW8 | 1/20 | 0.33
CYP2C9 | P11712 | 1/20 | 0.30
TSHR | P16473 | 1/20 | 0.30

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound | similarity | top predicted | shared targets
SCHEMBL10024033 | 1.00 | ALDH1A1 (0.33) | ALDH1A1, TDP1, CYP2C9, TSHR
SCHEMBL10024027 | 0.84 | ALDH1A1 (0.36) | ALDH1A1, TDP1, CYP2C9, TSHR
SCHEMBL8319170 | 0.81 | ALDH1A1 (0.40) | ALDH1A1, TDP1, CYP2C9, TSHR
SCHEMBL10024040 | 0.81 | ALDH1A1 (0.40) | ALDH1A1, TDP1, CYP2C9, TSHR
SCHEMBL8530152 | 0.79 | |
SCHEMBL12787259 | 0.74 | ALDH1A1 (0.42) | ALDH1A1, TDP1
SCHEMBL9067817 | 0.71 | CYP2C9 (0.37) | CYP2C9, TSHR
SCHEMBL10024039 | 0.71 | MGAM (0.38) | ALDH1A1, TDP1
SCHEMBL1772274 | 0.70 | |
SCHEMBL9224073 | 0.70 | TDP1 (0.36) | TDP1, CYP2C9, TSHR

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.
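
To make the "cosine ≈ Tanimoto" note concrete, here is a minimal sketch of both measures on binary fingerprints, represented as sets of on-bit indices. The function names and the two toy fingerprints are illustrative assumptions, not taken from this page's pipeline; real Morgan fingerprints would come from a cheminformatics toolkit.

```python
from math import sqrt


def cosine(a: set[int], b: set[int]) -> float:
    """Cosine similarity of two binary fingerprints given as on-bit index sets."""
    if not a or not b:
        return 0.0
    return len(a & b) / sqrt(len(a) * len(b))


def tanimoto(a: set[int], b: set[int]) -> float:
    """Tanimoto (Jaccard) similarity of the same two on-bit index sets."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


# Hypothetical on-bit positions within a 2,048-bit fingerprint (made-up values).
fp_a = {1, 5, 9, 200}
fp_b = {1, 5, 9, 1024}

print(f"cosine   = {cosine(fp_a, fp_b):.2f}")
print(f"tanimoto = {tanimoto(fp_a, fp_b):.2f}")
print(f"identical fingerprints: {cosine(fp_a, fp_a):.2f}")
```

For binary vectors the cosine score is never lower than the Tanimoto score, and both equal 1.00 only when the two fingerprints have exactly the same on-bits, so a 1.00 in the table means identical fingerprints rather than necessarily identical molecules.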

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 3 patents. claimed = in the patent's claims; disclosed = body only.

Patent | Title | Assignee | Published | Priority | Filing | Country | Status
US-8148550-B2 | selected from fluvastatin, rosuvastatin, cerivastatin, glenvastatin or atorvastatin, by reduction or lactonization of chemical intermediate compounds, then hydrogenation, decyclization and deprotection of hydroxy groups | RATIOPHARM GMBH (DE) | 2012-04-03 | | | US | disclosed
US-20080249306-A1 | Method for the Production of Statins | RATIOPHARM GMBH (DE) | 2008-10-09 | | | US | disclosed
US-20070093660-A1 | Method for the production of statins | RATIOPHARM GMBH (DE) | 2007-04-26 | | | US | disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (2 of them; usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where each chemistry-predicted target lands among 4885 human targets. A rank near 1 means the patent's own wording is consistent with the prediction: a weak, independent signal, not proof of activity.

Patent | Title | Text reads most about | Predicted target · text-rank
US-20070093660-A1 | Method for the production of statins | HMGCR, CYP11A1, CYP51A1 | ALDH1A1 673/4885, TDP1 3803/4885, CYP2C9 78/4885
US-20080249306-A1 | Method for the Production of Statins | HMGCR, COASY, PCSK9 | ALDH1A1 862/4885, TDP1 1656/4885, CYP2C9 220/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.
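
As a rough illustration of how a text-rank such as 78/4885 can be read, the sketch below ranks one target among all scored targets by descending text-to-protein similarity. The `text_rank` helper and the scores are hypothetical; how the page actually computes and debiases MedCPT similarities is not stated here, so this only shows the ranking step under that assumption.

```python
def text_rank(scores: dict[str, float], gene: str) -> int:
    """1-based rank of `gene` by descending score; tied genes share the best rank."""
    target = scores[gene]
    return 1 + sum(1 for s in scores.values() if s > target)


# Made-up similarity scores between one patent abstract and four proteins.
# A real run would hold one score for each of the 4885 human targets.
scores = {"HMGCR": 0.91, "CYP2C9": 0.62, "ALDH1A1": 0.40, "TDP1": 0.15}

for gene in ("ALDH1A1", "TDP1", "CYP2C9"):
    print(f"{gene} {text_rank(scores, gene)}/{len(scores)}")
```

Under this reading, rank 1 is the protein the text reads most about, so a smaller numerator means closer agreement between the patent text and the chemistry-based prediction.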