SCHEMBL4890494

Cc1cccc(C=C(Br)Br)c1N

nearest known ligand 0.45

Predicted protein targets (top 20)

gene UniProt supporting neighbours confidence
CD44 P16070 1/20 0.45
ALDH1A1 P00352 4/20 0.34
TSHR P16473 3/20 0.34
CYP3A4 P08684 3/20 0.32
TDP1 Q9NUW8 1/20 0.32
CA1 P00915 3/20 0.32
CA2 P00918 3/20 0.32
CA9 Q16790 3/20 0.32
CA7 P43166 2/20 0.32
TRPA1 O75762 2/20 0.31
POLB P06746 2/20 0.31
ATM Q13315 1/20 0.31
L3MBTL1 Q9Y468 2/20 0.31
CYP1A2 P05177 2/20 0.31
KDM4E B2RXH2 2/20 0.31
CYP2A6 P11509 1/20 0.31
ANPEP P15144 1/20 0.31
DPP4 P27487 1/20 0.31
MAPT P10636 1/20 0.31
MPI P34949 1/20 0.31

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound similarity top predicted shared targets
SCHEMBL14269557 0.86 CD44 (0.43) CD44, ALDH1A1, TSHR, CYP3A4, TDP1
SCHEMBL14269555 0.79 ALDH1A1 (0.50) ALDH1A1, TSHR, TDP1, POLB, L3MBTL1
SCHEMBL3752472 0.78 ALDH1A1 (0.52) ALDH1A1, TSHR, CYP3A4, CA1, CA2
SCHEMBL4887591 0.76 CD44 (0.45) CD44, ALDH1A1, TSHR, CYP3A4, TDP1
SCHEMBL23707360 0.76 CD44 (0.41) CD44, ALDH1A1, TSHR
SCHEMBL5327152 0.76 TLR2 (0.43) ALDH1A1, CYP3A4, TDP1, CA1, CA2
SCHEMBL9939913 0.74 CD44 (0.39) CD44, ALDH1A1, TSHR, TRPA1, ATM
SCHEMBL20430118 0.72 CD44 (0.50) CD44, ALDH1A1, TSHR, CYP3A4, TDP1
SCHEMBL29750653 0.72 ALDH1A1 (0.60) CD44, ALDH1A1, TSHR, CYP3A4, TDP1
SCHEMBL3432662 0.72 CD44 (0.50) CD44, ALDH1A1, TSHR, CYP3A4, TDP1

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.
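
As a rough illustration of the similarity measure described above, the sketch below computes cosine and Tanimoto similarity for two binary fingerprints represented as sets of on-bit indices. The bit indices are made up for the example, and the function names `cosine` and `tanimoto` are hypothetical. In practice the fingerprints would come from a cheminformatics toolkit, which is not shown here.

```python
from math import sqrt


def cosine(a: set, b: set) -> float:
    """Cosine similarity of two binary fingerprints given as sets of on-bit indices."""
    if not a or not b:
        return 0.0
    return len(a & b) / sqrt(len(a) * len(b))


def tanimoto(a: set, b: set) -> float:
    """Tanimoto (Jaccard) similarity of the same two bit sets."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


# Hypothetical on-bit indices within a 2,048-bit fingerprint (illustrative only).
query = {3, 17, 42, 256, 901, 1024}
neighbour = {3, 17, 42, 256, 901, 2000}

print(round(cosine(query, neighbour), 2))    # 5 shared bits / sqrt(6 * 6) -> 0.83
print(round(tanimoto(query, neighbour), 2))  # 5 shared bits / 7 distinct bits -> 0.71
print(cosine(query, query))                  # identical fingerprints -> 1.0
```

For binary fingerprints the cosine value is never lower than the Tanimoto value, so the two agree on identical molecules but can differ noticeably at intermediate similarity.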

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 5 patents. claimed = in the patent's claims; disclosed = body only.

Patent Title Assignee Published Priority Filing Country Status
US-20080039625-A1 Screening Methods LAUTENS MARK 2008-02-14 US disclosed
US-20080039625-A1 Screening Methods LAUTENS MARK 2008-02-14 US disclosed
US-20080039625-A1 Screening Methods LAUTENS MARK 2008-02-14 US disclosed
EP-1817283-A1 2-SUBSTITUTED INDOLES, THEIR PRECURSORS AND NOVEL PROCESSES FOR THE PREPARATION THEREOF Lautens, Mark (CA) 2007-08-15 EP disclosed
WO-2006047888-A1 2-SUBSTITUTED INDOLES, THEIR PRECURSORS AND NOVEL PROCESSES FOR THE PREPARATION THEREOF LAUTENS MARK (CA) 2006-05-11 WO disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them here; usually the abstract, not the full specification), MedCPT is used to find which proteins the text reads most about, and where each chemistry-predicted target ranks among 4,885 human targets. A high rank means the patent's own wording is consistent with the prediction. This is a weak, independent signal, not proof of activity.

Patent Title Text reads most about Predicted target · text-rank
US-20080039625-A1 Screening Methods CBR3, ZKSCAN2, CRBN CD44 4745/4885, ALDH1A1 2175/4885, TSHR 409/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.
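
The text-rank idea can be sketched as follows: embed the patent text and every target, score each target by similarity to the text, and report the position of the predicted target in the sorted list. Everything in this sketch is an assumption made for illustration. The vectors are toy values rather than MedCPT embeddings, the target names are placeholders, rank 1 is taken to mean the closest match, and the background debiasing mentioned above is not modelled.

```python
from math import sqrt


def cosine(u, v):
    """Cosine similarity of two dense vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = sqrt(sum(x * x for x in u)) * sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def text_rank(target, text_vec, target_vecs):
    """1-based rank of `target` when all targets are sorted by similarity to the text."""
    scores = {name: cosine(text_vec, vec) for name, vec in target_vecs.items()}
    ordered = sorted(scores, key=scores.get, reverse=True)
    return ordered.index(target) + 1


# Toy 2-dimensional embeddings (hypothetical; real embeddings have many more dimensions).
text_vec = [1.0, 0.0]
target_vecs = {
    "TARGET_A": [0.9, 0.1],
    "TARGET_B": [0.2, 0.8],
    "TARGET_C": [0.5, 0.5],
}

for name in target_vecs:
    print(f"{name} {text_rank(name, text_vec, target_vecs)}/{len(target_vecs)}")
```

With these toy vectors, TARGET_A is closest to the text and TARGET_B is furthest, so the output has the same "rank/total" shape as the text-rank column above.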