SCHEMBL22490884

SCHEMBL22490884

COc1cc(C(=O)OO[C@@H]2[C@@H](OC)[C@H](O)[C@@H](CO)O[C@@H]2O)cc(Cl)c1OCc1cccc(OC(C)(C)C)c1

nearest known ligand 0.40

Predicted protein targets (top 20)

geneUniProtsupporting neighboursconfidence
LGALS8 O00214 2/20 0.35
LGALS9 O00182 1/20 0.35
LGALS1 P09382 1/20 0.35
LGALS3 P17931 1/20 0.35
LGALS7; LGALS7B P47929 1/20 0.35
ALDH1A1 P00352 1/20 0.33
KDM4E B2RXH2 1/20 0.33
SLC5A2 P31639 2/20 0.33
PTPN2 P17706 1/20 0.32
PTPN1 P18031 1/20 0.32
ACP1 P24666 1/20 0.32
MRGPRX4 Q96LA9 1/20 0.31
ADCYAP1R1 P41586 1/20 0.31
SLC5A1 P13866 1/20 0.31
CA12 O43570 2/20 0.31
CA1 P00915 2/20 0.31
CA9 Q16790 2/20 0.31
PTPN7 P35236 1/20 0.31
PTPN12 Q05209 1/20 0.31
PTPN22 Q9Y2R2 1/20 0.31

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL22490887 0.86 ALDH1A1 (0.35) LGALS8LGALS9LGALS1LGALS3LGALS7; LGALS7B
SCHEMBL23488588 0.84 LGALS3 (0.40) LGALS8LGALS9LGALS1LGALS3LGALS7; LGALS7B
SCHEMBL23488537 0.79 CA2 (0.44) LGALS8LGALS9LGALS1LGALS3LGALS7; LGALS7B
SCHEMBL23488633 0.76 LGALS3 (0.46) LGALS8LGALS9LGALS1LGALS3LGALS7; LGALS7B
SCHEMBL22490890 0.76 MRGPRX4 (0.50) KDM4EPTPN2PTPN1ACP1MRGPRX4
SCHEMBL23488703 0.75 LGALS8 (0.37) LGALS8LGALS9LGALS1LGALS3LGALS7; LGALS7B
SCHEMBL23488734 0.73 LGALS3 (0.40) LGALS8LGALS9LGALS1LGALS3LGALS7; LGALS7B
SCHEMBL23488294 0.73 LGALS8 (0.55) LGALS8LGALS9LGALS1LGALS3LGALS7; LGALS7B
SCHEMBL23488445 0.71 LGALS3 (0.40) LGALS8LGALS9LGALS1LGALS3LGALS7; LGALS7B
SCHEMBL23488707 0.71 LGALS8 (0.47) LGALS8LGALS9LGALS1LGALS3LGALS7; LGALS7B

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 2 patents. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-20210170033-A1 NOVEL CONJUGATES OF A PHARMACEUTICAL AGENT AND A MOIETY CAPABLE OF BINDING TO A GLUCOSE SENSING PROTEIN SANOFI (FR) 2021-06-10 US disclosed
EP-3717463-A1 NOVEL CONJUGATES OF A PHARMACEUTICAL AGENT AND A MOIETY CAPABLE OF BINDING TO A GLUCOSE SENSING PROTEIN Sanofi (FR) 2020-10-07 EP disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20210170033-A1 NOVEL CONJUGATES OF A PHARMACEUTICAL AGENT AND A MOIETY CAPABLE OF BINDING TO A GLUCOSE SENSING PROTEIN GPR119, SLC2A8, SLC2A4 LGALS8 61/4885LGALS9 106/4885LGALS1 50/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.