SCHEMBL6840072

SCHEMBL6840072

COc1nc2c(N)ncnc2n1C1OC(CO)C(O)C1O

nearest known ligand 0.73

Predicted protein targets (top 12)

geneUniProtsupporting neighboursconfidence
HSPA8 P11142 4/20 0.73
HSPA5 P11021 2/20 0.73
RXFP1 Q9HBX9 1/20 0.73
LMNA P02545 1/20 0.66
TP53 P04637 1/20 0.66
HBB P68871 1/20 0.66
SMN1; SMN2 Q16637 1/20 0.66
SLC28A2 O43868 5/20 0.66
GAPDH P04406 2/20 0.64
HSD17B10 Q99714 1/20 0.64
FBP1 P09467 1/20 0.63
ADORA3 P0DMS8 2/20 0.61

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL1421824 1.00 HSPA8 (0.73) HSPA8HSPA5RXFP1LMNATP53
SCHEMBL1421820 1.00 HSPA8 (0.73) HSPA8HSPA5RXFP1LMNATP53
SCHEMBL2452214 0.96 HSPA8 (0.68) HSPA8HSPA5RXFP1LMNATP53
SCHEMBL30237271 0.90 HSPA8 (0.60) HSPA8HSPA5RXFP1LMNATP53
SCHEMBL6761939 0.90 HSPA8 (0.72) HSPA8HSPA5RXFP1LMNATP53
SCHEMBL6761931 0.90 HSPA8 (0.72) HSPA8HSPA5RXFP1LMNATP53
SCHEMBL17406892 0.88 HSPA8 (0.56) HSPA8HSPA5RXFP1LMNATP53
SCHEMBL18858928 0.86 HSPA8 (0.80) HSPA8HSPA5RXFP1LMNATP53
SCHEMBL15906323 0.86 HSPA8 (0.80) HSPA8HSPA5RXFP1LMNATP53
SCHEMBL1135195 0.86 HSPA8 (0.80) HSPA8HSPA5RXFP1LMNATP53

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 1 patent. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-20040063658-A1 Nucleoside derivatives for treating hepatitis C virus infection GENELABS TECHNOLOGIES, INC. 2004-04-01 US disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20040063658-A1 Nucleoside derivatives for treating hepatitis C virus infection HAVCR2, PNP, NTPCR HSPA8 1928/4885HSPA5 2092/4885RXFP1 4832/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.