SCHEMBL3884447

SCHEMBL3884447

Cn1c(C(N)=O)c(-c2ccc(N)cc2)c2ccccc21

nearest known ligand 0.46

Predicted protein targets (top 20)

geneUniProtsupporting neighboursconfidence
EGFR P00533 2/20 0.46
SRC P12931 1/20 0.46
KMT2A Q03164 4/20 0.45
ALDH1A1 P00352 4/20 0.45
LMNA P02545 3/20 0.45
HPGD P15428 3/20 0.45
MEN1 O00255 3/20 0.45
SMN1; SMN2 Q16637 2/20 0.45
MAPT P10636 1/20 0.45
HTT P42858 1/20 0.45
NPSR1 Q6W5P4 1/20 0.45
TSHR P16473 1/20 0.41
POLB P06746 1/20 0.41
KDM4E B2RXH2 4/20 0.41
HSD17B10 Q99714 3/20 0.41
GAA P10253 2/20 0.41
GLA P06280 1/20 0.41
PABPC1 P11940 1/20 0.41
EIF4H Q15056 1/20 0.41
NPC1 O15118 2/20 0.40

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL6017853 0.89 SMN1; SMN2 (0.53) EGFRSRCKMT2AALDH1A1LMNA
SCHEMBL3879261 0.82 MAPT (0.50) EGFRSRCKMT2AALDH1A1LMNA
SCHEMBL2662560 0.78 EGFR (0.53) EGFRSRCKMT2AALDH1A1LMNA
SCHEMBL7593404 0.76 KMT2A (0.55) EGFRKMT2AALDH1A1LMNAHPGD
SCHEMBL6595214 0.74 TSHR (0.64) EGFRSRCKMT2AALDH1A1LMNA
SCHEMBL35212204 0.74 TSHR (0.64) EGFRSRCKMT2AALDH1A1LMNA
SCHEMBL28835840 0.74 MAPT (0.47) EGFRSRCKMT2AALDH1A1LMNA
SCHEMBL12468631 0.74 LMNA (0.48) EGFRSRCKMT2AALDH1A1LMNA
SCHEMBL16144912 0.74 LMNA (0.53) EGFRKMT2AALDH1A1LMNAHPGD
SCHEMBL7671576 0.73 KMT2A (0.49) EGFRSRCKMT2AALDH1A1LMNA

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 5 patents. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-7566736-B2 Substituted indoles, compositions containing them, method for the production thereof and their use AVENTIS PHARMA S.A. (FR) 2009-07-28 US disclosed
CN-101098868-A Substituted indoles, compositions containing them, method for the production thereof and their use AVENTIS PHARMA SA (FR) 2008-01-02 CN disclosed
US-20070259910-A1 Substituted Indoles, Compositions Containing Them, Method for the Production Thereof and Their Use AVENTIS PHARMA S.A. (FR) 2007-11-08 US disclosed
EP-1841762-A1 SUBSTITUTED INDOLES, COMPOSITIONS CONTAINING THEM, METHOD FOR THE PRODUCTION THEREOF AND THEIR USE Aventis Pharma S.A. (FR) 2007-10-10 EP disclosed
WO-2006061493-A1 SUBSTITUTED INDOLES, COMPOSITIONS CONTAINING THEM, METHOD FOR THE PRODUCTION THEREOF AND THEIR USE AVENTIS PHARMA S.A. (FR) 2006-06-15 WO disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20070259910-A1 Substituted Indoles, Compositions Containing Them, Method for the Production Thereof and Their Use IDO1, RB1, IDO2 EGFR 877/4885SRC 1592/4885KMT2A 1237/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.