SCHEMBL20958136

SCHEMBL20958136

Cc1ccc(S(=O)(=O)OCCOCCOCCOCCOC(c2ccccc2)(c2ccccc2)c2ccccc2)cc1

nearest known ligand 0.47

Predicted protein targets (top 14)

geneUniProtsupporting neighboursconfidence
TK2 O00142 8/20 0.47
ALDH1A1 P00352 1/20 0.42
KIF11 P52732 1/20 0.41
DUT P33316 1/20 0.40
CA12 O43570 1/20 0.39
CA1 P00915 1/20 0.39
CA9 Q16790 1/20 0.39
VDR P11473 1/20 0.38
CYP24A1 Q07973 1/20 0.38
RECQL P46063 1/20 0.38
TK1 P04183 1/20 0.38
STAT3 P40763 1/20 0.37
HTT P42858 1/20 0.37
SMN1; SMN2 Q16637 1/20 0.37

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL9524449 1.00 TK2 (0.47) TK2ALDH1A1KIF11DUTCA12
SCHEMBL27443036 1.00 TK2 (0.47) TK2ALDH1A1KIF11DUTCA12
SCHEMBL27443043 1.00 TK2 (0.47) TK2ALDH1A1KIF11DUTCA12
SCHEMBL9523426 1.00 TK2 (0.47) TK2ALDH1A1KIF11DUTCA12
SCHEMBL4303777 0.94 TK1 (0.43) TK2ALDH1A1KIF11DUTCA12
SCHEMBL4013278 0.94 TK2 (0.42) TK2ALDH1A1KIF11DUTCA12
SCHEMBL2381255 0.92 SLC6A11 (0.43) TK2ALDH1A1KIF11CA12CA1
SCHEMBL6046754 0.92 SLC6A11 (0.43) TK2ALDH1A1KIF11CA12CA1
SCHEMBL31516447 0.88 TK1 (0.46) TK2ALDH1A1KIF11VDRCYP24A1
SCHEMBL26103993 0.86 TK2 (0.47) TK2ALDH1A1KIF11VDRCYP24A1

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 5 patents. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-11945838-B2 Method for synthesis of protein amphiphiles INDIAN INSTITUTE OF SCIENCE EDUCATION AND RESEARCH (IN) 2024-04-02 US disclosed
US-11680137-B2 Method for purifying trityl group-containing monodispersed polyethylene glycol NOF CORPORATION (JP) 2023-06-20 US disclosed
US-11173212-B2 Supramolecular protein assemblies with advanced functions and synthesis thereof INDIAN INSTITUTE OF SCIENCE EDUCATION AND RESEARCH (IN) 2021-11-16 US disclosed
US-20200199175-A1 METHOD FOR SYNTHESIS OF PROTEIN AMPHIPHILES INDIAN INSTITUTE OF SCIENCE EDUCATION AND RESEARCH 2020-06-25 US disclosed
US-20190134212-A1 SUPRAMOLECULAR PROTEIN ASSEMBLIES WITH ADVANCED FUNCTIONS AND SYNTHESIS THEREOF INDIAN INSTITUTE OF SCIENCE EDUCATION AND RESEARCH (IN) 2019-05-09 US disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (4 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20200199175-A1 METHOD FOR SYNTHESIS OF PROTEIN AMPHIPHILES PTMS, VAC14, PDIA6 TK2 3618/4885ALDH1A1 4784/4885KIF11 3691/4885
US-20190134212-A1 SUPRAMOLECULAR PROTEIN ASSEMBLIES WITH ADVANCED FUNCTIONS AND SYNTHESIS THEREOF APC, CHLSN, MAX TK2 4417/4885ALDH1A1 4627/4885KIF11 2577/4885
US-11945838-B2 Method for synthesis of protein amphiphiles PTMS, VAC14, PDIA6 TK2 3618/4885ALDH1A1 4784/4885KIF11 3691/4885
US-11173212-B2 Supramolecular protein assemblies with advanced functions and synthesis thereof APC, CHLSN, MAX TK2 4417/4885ALDH1A1 4627/4885KIF11 2577/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.