SCHEMBL16921052

Cc1cc(S(=O)(=O)O)ccc1NC(=O)c1cc(NC(=O)Nc2cc(C(=O)Nc3ccccc3)cc(C(=O)Nc3ccc(S(=O)(=O)O)cc3S(=O)(=O)O)c2)cc(C(=O)Nc2ccccc2)c1

nearest known ligand 0.75

Predicted protein targets (top 20)

gene | UniProt | supporting neighbours | confidence
TIMP3 | P35625 | 6/20 | 0.75
RECQL | P46063 | 7/20 | 0.56
KMT2A | Q03164 | 7/20 | 0.56
HSD17B10 | Q99714 | 7/20 | 0.56
TDP1 | Q9NUW8 | 7/20 | 0.56
MAPT | P10636 | 7/20 | 0.56
PKM | P14618 | 6/20 | 0.56
MEN1 | O00255 | 5/20 | 0.56
P2RX1 | P51575 | 5/20 | 0.56
LMNA | P02545 | 5/20 | 0.56
POLB | P06746 | 5/20 | 0.56
NFKB1 | P19838 | 5/20 | 0.56
APEX1 | P27695 | 5/20 | 0.56
BLM | P54132 | 5/20 | 0.56
BRCA1 | P38398 | 4/20 | 0.56
CYP3A4 | P08684 | 4/20 | 0.56
CYP1A2 | P05177 | 3/20 | 0.56
STAT1 | P42224 | 3/20 | 0.56
STAT5A | P42229 | 3/20 | 0.56
STAT5B | P51692 | 3/20 | 0.56
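
As a rough illustration of the "supporting neighbours" column, the sketch below counts how many of a compound's nearest known ligands are annotated with each target. It assumes that a value such as 6/20 means "6 of the 20 nearest known ligands carry this target"; the page does not state this explicitly. The function name `neighbour_support` and the example annotations are hypothetical, and the sketch does not attempt to reproduce the confidence column, whose derivation is not given here.

```python
from collections import Counter


def neighbour_support(neighbour_targets: list[set[str]]) -> dict[str, str]:
    """Return 'n/k' support strings: n neighbours (out of k) annotated with each gene.

    Assumption: each element of neighbour_targets is the set of known targets
    of one nearest-neighbour ligand.
    """
    k = len(neighbour_targets)
    counts = Counter(gene for targets in neighbour_targets for gene in targets)
    # most_common() orders genes by descending support
    return {gene: f"{n}/{k}" for gene, n in counts.most_common()}


# Hypothetical annotations for four neighbours (not taken from the page)
example = [
    {"TIMP3", "RECQL"},
    {"TIMP3"},
    {"RECQL", "KMT2A", "TIMP3"},
    {"MAPT"},
]
print(neighbour_support(example))
```

With real data, `k` would be 20 and the annotations would come from the known-ligand set used by the predictor.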

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound | similarity | top predicted | shared targets
SCHEMBL12410283 | 1.00 | TIMP3 (0.75) | TIMP3, RECQL, KMT2A, HSD17B10, TDP1
SCHEMBL12410284 | 0.94 | TIMP3 (0.86) | TIMP3, RECQL, KMT2A, HSD17B10, TDP1
SCHEMBL1902159 | 0.87 | TIMP3 (1.00) | TIMP3, RECQL, KMT2A, HSD17B10, TDP1
SCHEMBL12903191 | 0.86 | TIMP3 (0.72) | TIMP3, RECQL, KMT2A, HSD17B10, TDP1
SCHEMBL12903193 | 0.84 | TIMP3 (0.62) | TIMP3, RECQL, KMT2A, HSD17B10, TDP1
SCHEMBL12903190 | 0.80 | TIMP3 (0.78) | TIMP3, RECQL, KMT2A, MAPT, MEN1
SCHEMBL21645072 | 0.79 | RECQL (0.86) | TIMP3, RECQL, KMT2A, HSD17B10, TDP1
SCHEMBL7223972 | 0.78 | FGF1 (0.63) | TIMP3, RECQL, KMT2A, HSD17B10, TDP1
SCHEMBL7224031 | 0.76 | TIMP3 (0.57) | TIMP3, RECQL, KMT2A, HSD17B10, TDP1
SCHEMBL7234616 | 0.73 | FGF1 (0.62) | TIMP3, RECQL, KMT2A, HSD17B10, TDP1

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.
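
To make the relation between the two measures concrete, here is a minimal sketch that computes cosine and Tanimoto similarity on binary fingerprints represented as sets of on-bit indices. The bit indices are made up for illustration and are not real Morgan fingerprints; the function names are likewise illustrative.

```python
import math


def cosine(a: set[int], b: set[int]) -> float:
    """Cosine similarity of two binary fingerprints given as sets of on-bits."""
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))


def tanimoto(a: set[int], b: set[int]) -> float:
    """Tanimoto (Jaccard) similarity of the same two fingerprints."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


# Hypothetical on-bit indices within a 2,048-bit fingerprint
fp_a = {1, 5, 9, 200, 1024}
fp_b = {1, 5, 9, 200, 2047}

print(cosine(fp_a, fp_b))    # 4 shared bits / sqrt(5 * 5)
print(tanimoto(fp_a, fp_b))  # 4 shared bits / 6 bits in the union
print(cosine(fp_a, fp_a), tanimoto(fp_a, fp_a))  # identical fingerprints
```

For binary fingerprints the cosine value is never lower than the Tanimoto value, and the two agree exactly only for identical fingerprints (or when no bits are shared), so the "≈" above is best read loosely.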

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 4 patents. claimed = in the patent's claims; disclosed = body only.

Patent | Title | Assignee | Published | Priority | Filing | Country | Status
US-20230303985-A1 | FUSION PROTEINS AND METHODS THEREOF | THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK | 2023-09-28 | | | US | disclosed
US-20180030152-A1 | FUSION PROTEINS AND METHODS THEREOF | NATIONAL INSTITUTES OF HEALTH (NIH), U.S. DEPT. OF HEALTH AND HUMAN SERVICES (DHHS), U.S. GOVERNMENT | 2018-02-01 | | | US | disclosed
US-20160108380-A1 | FUSION PROTEINS AND METHODS THEREOF | NATIONAL INSTITUTES OF HEALTH (NIH), U.S. DEPT. OF HEALTH AND HUMAN SERVICES (DHHS), U.S. GOVERNMENT | 2016-04-21 | | | US | disclosed
US-20150203589-A1 | FUSION PROTEINS AND METHODS THEREOF | NATIONAL INSTITUTES OF HEALTH - DIRECTOR DEITR | 2015-07-23 | | | US | disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (4 of them; usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where each chemistry-predicted target ranks among 4885 human targets. A high rank (a position near 1) means the patent's own wording is consistent with the prediction. This is a weak, independent signal, not proof of activity.
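
The text-rank can be pictured with a small sketch: given a similarity score between the patent text and every candidate protein, sort the proteins by score and report the position of the predicted target. The scores below are invented for illustration, and the function name `text_rank` is hypothetical. The real pipeline presumably derives scores from MedCPT embeddings with background debiasing, which this sketch does not reproduce.

```python
def text_rank(scores: dict[str, float], target: str) -> tuple[int, int]:
    """Return (rank, total) for target, where rank 1 is the best-matching protein."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return ordered.index(target) + 1, len(ordered)


# Hypothetical text-to-protein similarity scores (not taken from the page)
scores = {"BCR": 0.91, "FUS": 0.88, "KMT2A": 0.40, "TIMP3": 0.22, "RECQL": 0.10}

rank, total = text_rank(scores, "TIMP3")
print(f"TIMP3 {rank}/{total}")
```

Under this reading, an entry such as 2963/4885 places the predicted target in the lower half of the list, meaning the text offers little support for it.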

Patent | Title | Text reads most about | Predicted target · text-rank
US-20180030152-A1 | FUSION PROTEINS AND METHODS THEREOF | BCR, RPS27A, FUS | TIMP3 2963/4885, RECQL 4353/4885, KMT2A 1365/4885
US-20160108380-A1 | FUSION PROTEINS AND METHODS THEREOF | BCR, RPS27A, FUS | TIMP3 2963/4885, RECQL 4353/4885, KMT2A 1365/4885
US-20230303985-A1 | FUSION PROTEINS AND METHODS THEREOF | BCR, RPS27A, FUS | TIMP3 2963/4885, RECQL 4353/4885, KMT2A 1365/4885
US-20150203589-A1 | FUSION PROTEINS AND METHODS THEREOF | BCR, RPS27A, FUS | TIMP3 2963/4885, RECQL 4353/4885, KMT2A 1365/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.