SCHEMBL5981982

CC(=O)O[C@H]1[C@@H](OC(=O)c2ccccc2)O[C@H](COC(=O)c2ccccc2)[C@H]1OC(=O)c1ccccc1

nearest known ligand 0.61

Predicted protein targets (top 20)

gene | UniProt | supporting neighbours | confidence
PTPN1 | P18031 | 4/20 | 0.61
TSHR | P16473 | 1/20 | 0.55
PTPN2 | P17706 | 1/20 | 0.55
PTPN11 | Q06124 | 1/20 | 0.55
USP2 | O75604 | 2/20 | 0.53
KMT2A | Q03164 | 2/20 | 0.53
MEN1 | O00255 | 1/20 | 0.53
POLB | P06746 | 1/20 | 0.53
F11 | P03951 | 2/20 | 0.53
TMPRSS2 | O15393 | 1/20 | 0.53
PDE5A | O76074 | 1/20 | 0.53
CA1 | P00915 | 1/20 | 0.53
CA2 | P00918 | 1/20 | 0.53
HMGCR | P04035 | 1/20 | 0.53
CYP1A2 | P05177 | 1/20 | 0.53
LCK | P06239 | 1/20 | 0.53
FYN | P06241 | 1/20 | 0.53
HSPD1 | P10809 | 1/20 | 0.53
CYP2C9 | P11712 | 1/20 | 0.53
CA6 | P23280 | 1/20 | 0.53

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound | similarity | top predicted | shared targets
SCHEMBL5982658 | 1.00 | PTPN1 (0.61) | PTPN1, TSHR, PTPN2, PTPN11, USP2
SCHEMBL5981979 | 1.00 | PTPN1 (0.61) | PTPN1, TSHR, PTPN2, PTPN11, USP2
SCHEMBL13871572 | 0.97 | PTPN1 (0.58) | PTPN1, TSHR, PTPN2, PTPN11, USP2
SCHEMBL4849836 | 0.97 | PTPN1 (0.58) | PTPN1, TSHR, PTPN2, PTPN11, USP2
SCHEMBL12352422 | 0.97 | PTPN1 (0.58) | PTPN1, TSHR, PTPN2, PTPN11, USP2
SCHEMBL4849838 | 0.97 | PTPN1 (0.58) | PTPN1, TSHR, PTPN2, PTPN11, USP2
SCHEMBL8829597 | 0.97 | PTPN1 (0.58) | PTPN1, TSHR, PTPN2, PTPN11, USP2
SCHEMBL14263475 | 0.97 | PTPN1 (0.58) | PTPN1, TSHR, PTPN2, PTPN11, USP2
SCHEMBL22022507 | 0.97 | PTPN1 (0.58) | PTPN1, TSHR, PTPN2, PTPN11, USP2
SCHEMBL30976660 | 0.97 | PTPN1 (0.58) | PTPN1, TSHR, PTPN2, PTPN11, USP2

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.
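
To make the "≈ Tanimoto" remark concrete, here is a minimal sketch that compares the two measures on bit fingerprints. It assumes a fingerprint is represented as the set of its "on" bit positions; the two example fingerprints are hypothetical, not taken from this compound, and the functions `cosine` and `tanimoto` are illustrative helpers rather than the page's actual implementation.

```python
from math import sqrt

def cosine(a: set, b: set) -> float:
    """Cosine similarity of two bit vectors given as sets of on-bit indices."""
    if not a or not b:
        return 0.0
    return len(a & b) / sqrt(len(a) * len(b))

def tanimoto(a: set, b: set) -> float:
    """Tanimoto (Jaccard) similarity of the same two bit vectors."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0

# Hypothetical fingerprints: 40 on-bits each, 36 of them shared.
fp_a = set(range(0, 40))
fp_b = set(range(4, 44))

print(f"cosine   = {cosine(fp_a, fp_b):.2f}")    # cosine   = 0.90
print(f"tanimoto = {tanimoto(fp_a, fp_b):.2f}")  # tanimoto = 0.82
print(f"self     = {cosine(fp_a, fp_a):.2f}")    # self     = 1.00
```

On bit vectors the cosine score is never lower than the Tanimoto score, and both equal 1.00 only when the fingerprints are identical, so a cosine of 0.97 corresponds to a somewhat lower Tanimoto value.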

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 11 patent records covering 6 distinct patents. claimed = in the patent's claims; disclosed = body only.

Patent | Title | Assignee | Published | Priority | Filing | Country | Status
EP-1634888-A2 | Synthesis of 2'-deoxy-L-nucleosides | Pharmasset Limited (US) | 2006-03-15 | | | EP | claimed
EP-1600451-A2 | Synthesis of 2'-deoxy-l-nucleosides | Pharmasset Limited (US) | 2005-11-30 | | | EP | claimed
US-20050090660-A1 | 2'-deoxy-L-nucleosides | PHARMASSET, INC. | 2005-04-28 | | | US | claimed
EP-1232166-A2 | SYNTHESIS OF 2'-DEOXY-L-NUCLEOSIDES | Pharmasset Limited (US) | 2002-08-21 | | | EP | claimed
WO-2001034618-A2 | SYNTHESIS OF 2'-DEOXY-L-NUCLEOSIDES | PHARMASSET LIMITED (US) | 2001-05-17 | | | WO | claimed
EP-1634888-A2 | Synthesis of 2'-deoxy-L-nucleosides | Pharmasset Limited (US) | 2006-03-15 | | | EP | disclosed
EP-1600451-A2 | Synthesis of 2'-deoxy-l-nucleosides | Pharmasset Limited (US) | 2005-11-30 | | | EP | disclosed
EP-1600452-A2 | Synthesis of 2'-deoxy-L-nucleosides | Pharmasset Limited (US) | 2005-11-30 | | | EP | disclosed
US-20050090660-A1 | 2'-deoxy-L-nucleosides | PHARMASSET, INC. | 2005-04-28 | | | US | disclosed
EP-1232166-A2 | SYNTHESIS OF 2'-DEOXY-L-NUCLEOSIDES | Pharmasset Limited (US) | 2002-08-21 | | | EP | disclosed
WO-2001034618-A2 | SYNTHESIS OF 2'-DEOXY-L-NUCLEOSIDES | PHARMASSET LIMITED (US) | 2001-05-17 | | | WO | disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them; usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where each chemistry-predicted target ranks among 4,885 human targets. A high rank (close to 1) means the patent's own wording is consistent with the prediction: a weak, independent signal, not proof of activity.

Patent | Title | Text reads most about | Predicted target · text-rank
US-20050090660-A1 | 2'-deoxy-L-nucleosides | CCNH, ADAR, NSUN2 | PTPN1 3267/4885, TSHR 1043/4885, PTPN2 2607/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.
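
As a reading aid for the text-rank column, the sketch below converts each `rank/total` entry into the fraction of the target list it falls within. It assumes rank 1 is the best text match, which the panel description implies but does not state outright; `rank_fraction` is a hypothetical helper name.

```python
from typing import Tuple

def rank_fraction(entry: str) -> Tuple[str, float]:
    """Split an entry like 'PTPN1 3267/4885' into (gene, rank / total)."""
    gene, rank = entry.split()
    position, total = (int(x) for x in rank.split("/"))
    return gene, position / total

# Entries copied from the text-rank table for US-20050090660-A1.
entries = ["PTPN1 3267/4885", "TSHR 1043/4885", "PTPN2 2607/4885"]

for entry in entries:
    gene, frac = rank_fraction(entry)
    # Assumption: a smaller fraction means the text matches the target better.
    print(f"{gene}: within the top {frac:.0%} of targets")
# PTPN1: within the top 67% of targets
# TSHR: within the top 21% of targets
# PTPN2: within the top 53% of targets
```

Under that assumption, none of the three predicted targets ranks near the top for this patent's text, so the abstract offers little independent support for the predictions here.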