SCHEMBL10229752

SCHEMBL10229752

CC1C[C@H](n2cc(COCc3ccccc3)c(=O)[nH]c2=O)O[C@@H]1CO

nearest known ligand 0.61

Predicted protein targets (top 6)

geneUniProtsupporting neighboursconfidence
TK1 P04183 5/20 0.52
TK2 O00142 4/20 0.52
RNASE1 P07998 1/20 0.50
TYMS P04818 1/20 0.47
LMNA P02545 1/20 0.45
SMN1; SMN2 Q16637 1/20 0.45

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL10178250 0.89 TK1 (0.63) TK1TK2TYMSLMNASMN1; SMN2
SCHEMBL1665228 0.89 TK1 (0.63) TK1TK2TYMSLMNASMN1; SMN2
SCHEMBL10231090 0.89 TK1 (0.63) TK1TK2TYMSLMNASMN1; SMN2
SCHEMBL1665230 0.89 TK1 (0.63) TK1TK2TYMSLMNASMN1; SMN2
SCHEMBL12150924 0.82 TK1 (0.66) TK1TK2LMNASMN1; SMN2
SCHEMBL9780916 0.81 RNASE1 (0.58) TK1TK2RNASE1
SCHEMBL9780910 0.81 RNASE1 (0.58) TK1TK2RNASE1
SCHEMBL17669967 0.81 TK1 (0.63) TK1TK2TYMSLMNASMN1; SMN2
SCHEMBL1666025 0.80 TK1 (0.65) TK1TK2LMNASMN1; SMN2
SCHEMBL13573919 0.80 TK1 (0.65) TK1TK2LMNASMN1; SMN2

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 2 patents. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-8148503-B2 Nucleotides and nucleosides and methods for their use in DNA sequencing LASERGEN, INC. (US) 2012-04-03 US disclosed
US-20100041041-A1 NUCLEOTIDES AND NUCLEOSIDES AND METHODS FOR THEIR USE IN DNA SEQUENCING AGILENT TECHNOLOGIES, INC. 2010-02-18 US disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20100041041-A1 NUCLEOTIDES AND NUCLEOSIDES AND METHODS FOR THEIR USE IN DNA SEQUENCING UNG, NT5C2, NT5E TK1 90/4885TK2 108/4885RNASE1 142/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.