SCHEMBL7103922

CCCCCCCCCCCCCCCCOC[C@H]1CCO[C@H](c2ccccc2)O1

nearest known ligand: 0.53

Predicted protein targets (top 3)

gene | UniProt | supporting neighbours | confidence
ALDH1A1 | P00352 | 1/20 | 0.50
TDP1 | Q9NUW8 | 1/20 | 0.50
SMN1; SMN2 | Q16637 | 1/20 | 0.42

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound | similarity | top predicted | shared targets
SCHEMBL7103919 | 1.00 | ALDH1A1 (0.50) | ALDH1A1, TDP1, SMN1; SMN2
SCHEMBL7103917 | 1.00 | ALDH1A1 (0.50) | ALDH1A1, TDP1, SMN1; SMN2
SCHEMBL1100795 | 1.00 | ALDH1A1 (0.50) | ALDH1A1, TDP1, SMN1; SMN2
SCHEMBL11075645 | 0.88 | ALDH1A1 (0.59) | ALDH1A1, TDP1, SMN1; SMN2
SCHEMBL11074748 | 0.88 | ALDH1A1 (0.59) | ALDH1A1, TDP1, SMN1; SMN2
SCHEMBL24750411 | 0.82 | MEN1 (0.52) | TDP1, SMN1; SMN2
SCHEMBL24750713 | 0.81 | TDP1 (0.44) | ALDH1A1, TDP1, SMN1; SMN2
SCHEMBL9670184 | 0.77 | ALDH1A1 (0.50) | ALDH1A1, TDP1, SMN1; SMN2
SCHEMBL9027367 | 0.77 | ALDH1A1 (0.50) | ALDH1A1, TDP1, SMN1; SMN2
SCHEMBL12539970 | 0.77 | CYP2A6 (0.42) |

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.
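
To make the relationship between the two measures concrete, here is a minimal sketch that computes both on binary fingerprints. It assumes each fingerprint is represented as a set of on-bit indices; the two example fingerprints are invented for illustration and are not taken from any compound on this page.

```python
from math import sqrt

def cosine(a: set[int], b: set[int]) -> float:
    """Cosine similarity of two binary fingerprints given as sets of on-bit indices."""
    if not a or not b:
        return 0.0
    # For 0/1 vectors the dot product is the number of shared on-bits,
    # and each vector's norm is the square root of its on-bit count.
    return len(a & b) / sqrt(len(a) * len(b))

def tanimoto(a: set[int], b: set[int]) -> float:
    """Tanimoto (Jaccard) similarity: shared on-bits over the union of on-bits."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0

# Hypothetical on-bit indices within a 2,048-bit fingerprint.
fp_a = {3, 17, 101, 512, 1990}
fp_b = {3, 17, 101, 777, 1990, 2001}

print(round(cosine(fp_a, fp_b), 3))    # 4 / sqrt(5 * 6)
print(round(tanimoto(fp_a, fp_b), 3))  # 4 / 7
print(cosine(fp_a, fp_a))              # identical fingerprints
```

On binary vectors the cosine value is never lower than the Tanimoto value, and the two agree at 1.00 only for identical fingerprints, so the "≈" above is best read as a rough correspondence rather than an equality.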

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 4 patents. claimed = in the patent's claims; disclosed = body only.

Patent | Title | Assignee | Published | Priority | Filing | Country | Status
EP-1341800-A2 | C-GLUCOSYL ETHER LIPIDS | The Research Foundation Of the City university of New York (US) | 2003-09-10 | | | EP | disclosed
US-6613748-B2 | C-glycosides having 2-amino groups; anticancer agents | THE RESEARCH FOUNDATION OF THE CITY UNIVERSITY OF NEW YORK | 2003-09-02 | | | US | disclosed
US-20020128214-A1 | C-glucosyl ether lipids | RESEARCH FOUNDATION OF THE CITY UNIVERSITY OF NEW YORK, THE | 2002-09-12 | | | US | disclosed
WO-2002060911-A2 | C-GLUCOSYL ETHER LIPIDS | THE RESEARCH FOUNDATION OF THE CITY UNIVERSITY OF NEW YORK (US) | 2002-08-08 | | | WO | disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

Patent | Title | Text reads most about | Predicted target · text-rank
US-20020128214-A1 | C-glucosyl ether lipids | UGCG, LIPC, UGGT1 | ALDH1A1 2464/4885, TDP1 4603/4885, SMN1; SMN2 4645/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.
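
As a rough illustration of how a text-rank such as 2464/4885 can be read, the sketch below ranks one target among all scored targets. It assumes rank 1 is the target the text matches best and that each target has a single text-similarity score. The `text_rank` helper and every score in the example are hypothetical; the sketch does not call MedCPT or reproduce the background debiasing mentioned above.

```python
def text_rank(scores: dict[str, float], target: str) -> tuple[int, int]:
    """Return (rank, total) for `target`, where rank 1 is the best text match.

    `scores` maps each target name to an assumed text-similarity score
    (higher means the text reads more about that target).
    """
    target_score = scores[target]
    # Rank is one more than the number of targets that score strictly higher.
    rank = 1 + sum(1 for s in scores.values() if s > target_score)
    return rank, len(scores)

# Invented scores for a five-target toy universe (the real panel uses 4885 targets).
scores = {"UGCG": 0.91, "LIPC": 0.88, "UGGT1": 0.86, "ALDH1A1": 0.40, "TDP1": 0.12}

rank, total = text_rank(scores, "ALDH1A1")
print(f"ALDH1A1 {rank}/{total}")
```

Under this reading, the three predicted targets in the table sit in the lower half of the ranking for this abstract, so the text offers little independent support for them.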