SCHEMBL4594969

SCHEMBL4594969

CCCOC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O

nearest known ligand 0.40

Predicted protein targets (top 9)

geneUniProtsupporting neighboursconfidence
CA2 P00918 1/20 0.35
SELP P16109 1/20 0.34
SPHK2 Q9NRA0 1/20 0.33
AMDHD2 Q9Y303 1/20 0.33
FBP1 P09467 1/20 0.33
GJB2 P29033 4/20 0.33
PYGB P11216 3/20 0.33
TLR4 O00206 1/20 0.31
GAA P10253 1/20 0.31

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL4594966 1.00 CA2 (0.35) CA2SELPSPHK2AMDHD2FBP1
SCHEMBL14231358 1.00 CA2 (0.35) CA2SELPSPHK2AMDHD2FBP1
SCHEMBL10138503 0.96 SELP (0.38) CA2SELPSPHK2AMDHD2FBP1
SCHEMBL10138504 0.86 CA2 (0.44) CA2GJB2GAA
SCHEMBL17848806 0.86 ALDH1A1 (0.46) SELPSPHK2GJB2TLR4
SCHEMBL17848795 0.86 ALDH1A1 (0.46) SELPSPHK2GJB2TLR4
SCHEMBL17848813 0.86 ALDH1A1 (0.46) SELPSPHK2GJB2TLR4
SCHEMBL30610704 0.84 CA2 (0.36) CA2SELPAMDHD2FBP1PYGB
SCHEMBL26448837 0.84 CA2 (0.36) CA2SELPAMDHD2FBP1PYGB
SCHEMBL20932845 0.83 FBP1 (0.36) CA2SELPAMDHD2FBP1PYGB

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 3 patents. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-20080108557-A1 Modified Proteins NOVO NORDISK HEALTHCARE A/G (CH) 2008-05-08 US disclosed
EP-1797192-A1 MODIFIED PROTEINS Novo Nordisk Health Care AG (CH) 2007-06-20 EP disclosed
WO-2006035057-A1 MODIFIED PROTEINS NOVO NORDISK HEALTH CARE AG (DK) 2006-04-06 WO disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20080108557-A1 Modified Proteins PTMS, DNPEP, STT3A CA2 4529/4885SELP 578/4885SPHK2 3814/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.