SCHEMBL5120954

SCHEMBL5120954

CCOCc1cc(O)cc(CO[Si](c2ccccc2)(c2ccccc2)C(C)(C)C)c1

nearest known ligand 0.36

Predicted protein targets (top 11)

geneUniProtsupporting neighboursconfidence
TACR1 P25103 2/20 0.32
ALDH1A1 P00352 1/20 0.32
RAB9A P51151 1/20 0.32
SMN1; SMN2 Q16637 1/20 0.32
KAT6A Q92794 2/20 0.32
TLR8 Q9NR97 2/20 0.32
TLR7 Q9NYK1 2/20 0.32
NUDT1 P36639 1/20 0.32
KDM4E B2RXH2 1/20 0.30
POLB P06746 1/20 0.30
TSHR P16473 1/20 0.30

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL31124766 0.81 TSHR (0.35) TACR1SMN1; SMN2TSHR
SCHEMBL8149467 0.80 GABRA1 (0.41) SMN1; SMN2KDM4E
SCHEMBL31124717 0.79 TRPA1 (0.35) TACR1ALDH1A1SMN1; SMN2TSHR
SCHEMBL9347235 0.78 ESR1 (0.43) ALDH1A1
SCHEMBL17803752 0.78 ACLY (0.34) SMN1; SMN2KDM4E
SCHEMBL1314001 0.77 AGXT (0.42) ALDH1A1SMN1; SMN2TSHR
SCHEMBL8024443 0.76 MRGPRX4 (0.38) RAB9ASMN1; SMN2TSHR
SCHEMBL5113620 0.76 MRGPRX4 (0.48) ALDH1A1RAB9ASMN1; SMN2POLB
SCHEMBL16818604 0.76 TACR1 (0.34) TACR1SMN1; SMN2TSHR
SCHEMBL22774158 0.76 SMYD2 (0.32) TACR1SMN1; SMN2

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 2 patents. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-20080051418-A1 Arylalkanoic Acid Derivative TAKEDA PHARMACEUTICAL COMPANY LIMITED (JP) 2008-02-28 US disclosed
EP-1829863-A1 ARYLALKANOIC ACID DERIVATIVE Takeda Pharmaceutical Company Limited (JP) 2007-09-05 EP disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20080051418-A1 Arylalkanoic Acid Derivative INSR, NPY1R, GPR119 TACR1 343/4885ALDH1A1 1527/4885RAB9A 3734/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.