SCHEMBL10229752 — predicted protein targets

Predicted protein targets (top 6)

	gene	UniProt	supporting neighbours	confidence
▸	TK1	P04183	5/20	0.52

▸	TK2	O00142	4/20	0.52

▸	RNASE1	P07998	1/20	0.50

▸	TYMS	P04818	1/20	0.47

▸	LMNA	P02545	1/20	0.45

▸	SMN1; SMN2	Q16637	1/20	0.45

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound	similarity	top predicted	shared targets
SCHEMBL10178250	0.89	TK1 (0.63)	TK1TK2TYMSLMNASMN1; SMN2
SCHEMBL1665228	0.89	TK1 (0.63)	TK1TK2TYMSLMNASMN1; SMN2
SCHEMBL10231090	0.89	TK1 (0.63)	TK1TK2TYMSLMNASMN1; SMN2
SCHEMBL1665230	0.89	TK1 (0.63)	TK1TK2TYMSLMNASMN1; SMN2
SCHEMBL12150924	0.82	TK1 (0.66)	TK1TK2LMNASMN1; SMN2
SCHEMBL9780916	0.81	RNASE1 (0.58)	TK1TK2RNASE1
SCHEMBL9780910	0.81	RNASE1 (0.58)	TK1TK2RNASE1
SCHEMBL17669967	0.81	TK1 (0.63)	TK1TK2TYMSLMNASMN1; SMN2
SCHEMBL1666025	0.80	TK1 (0.65)	TK1TK2LMNASMN1; SMN2
SCHEMBL13573919	0.80	TK1 (0.65)	TK1TK2LMNASMN1; SMN2

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 2 patents. claimed = in the patent's claims; disclosed = body only.

Patent	Title	Assignee	Published	Priority	Filing	Country	Status
US-8148503-B2	Nucleotides and nucleosides and methods for their use in DNA sequencing	LASERGEN, INC. (US)	2012-04-03	—	—	US	disclosed
US-20100041041-A1	NUCLEOTIDES AND NUCLEOSIDES AND METHODS FOR THEIR USE IN DNA SEQUENCING	AGILENT TECHNOLOGIES, INC.	2010-02-18	—	—	US	disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

Patent	Title	Text reads most about	Predicted target · text-rank
US-20100041041-A1	NUCLEOTIDES AND NUCLEOSIDES AND METHODS FOR THEIR USE IN DNA SEQUENCING	UNG, NT5C2, NT5E	TK1 90/4885TK2 108/4885RNASE1 142/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.