SCHEMBL6801528

SCHEMBL6801528

CC1(NC(=N)NS(=O)(=O)c2ccc(Cl)s2)CC1

nearest known ligand 0.45

Predicted protein targets (top 19)

geneUniProtsupporting neighboursconfidence
ALDH1A1 P00352 4/20 0.43
MAPT P10636 3/20 0.43
TDP1 Q9NUW8 1/20 0.43
KMT2A Q03164 4/20 0.42
HTT P42858 2/20 0.42
GAA P10253 1/20 0.42
HPGD P15428 1/20 0.42
PSEN1 P49768 6/20 0.40
PSEN2 P49810 6/20 0.40
APH1B Q8WW43 6/20 0.40
NCSTN Q92542 6/20 0.40
APH1A Q96BI3 6/20 0.40
PSENEN Q9NZ42 6/20 0.40
SMN1; SMN2 Q16637 2/20 0.40
MEN1 O00255 1/20 0.40
TSHR P16473 1/20 0.40
FBP1 P09467 1/20 0.40
HTR6 P50406 1/20 0.40
CCR5 P51681 1/20 0.40

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL7563524 0.77 MCL1 (0.36)
SCHEMBL6804436 0.76 KMT2A (0.49) ALDH1A1MAPTTDP1KMT2AHTT
SCHEMBL6802953 0.75 CCR4 (0.38)
SCHEMBL6800094 0.74
SCHEMBL6801536 0.72 PSEN1 (0.42) ALDH1A1MAPTTDP1KMT2AHTT
SCHEMBL682128 0.71 MAPT (0.63) ALDH1A1MAPTTDP1KMT2AHTT
SCHEMBL22115699 0.71 CYP1A2 (0.47) ALDH1A1MAPTTDP1KMT2AHTT
SCHEMBL6802725 0.70 KMT2A (0.66) ALDH1A1MAPTTDP1KMT2AHTT
SCHEMBL194745 0.70 MAPT (0.53) ALDH1A1MAPTTDP1KMT2AHTT
SCHEMBL6798127 0.69

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 1 patent. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-20040010142-A1 Novel process NOVO NORDISK A/S (DK) 2004-01-15 US disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (1 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20040010142-A1 Novel process UGT1A1, CYP4X1, CYP4B1 ALDH1A1 146/4885MAPT 3454/4885TDP1 3187/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.