Predicted protein targets (top 20)
| gene | UniProt | supporting neighbours | confidence | |
|---|---|---|---|---|
| ▸ | CA12 | O43570 | 1/20 | 0.53 |
| ▸ | CA14 | Q9ULX7 | 1/20 | 0.53 |
| ▸ | KMT2A | Q03164 | 1/20 | 0.46 |
| ▸ | CA1 | P00915 | 1/20 | 0.45 |
| ▸ | CA2 | P00918 | 1/20 | 0.45 |
| ▸ | CA7 | P43166 | 1/20 | 0.45 |
| ▸ | PSENEN | Q9NZ42 | 1/20 | 0.43 |
| ▸ | CTSK | P43235 | 5/20 | 0.42 |
| ▸ | CTSS | P25774 | 4/20 | 0.42 |
| ▸ | GLA | P06280 | 2/20 | 0.41 |
| ▸ | KDM4E | B2RXH2 | 1/20 | 0.41 |
| ▸ | GAA | P10253 | 1/20 | 0.41 |
| ▸ | MAPT | P10636 | 1/20 | 0.41 |
| ▸ | HPGD | P15428 | 1/20 | 0.41 |
| ▸ | GLS | O94925 | 1/20 | 0.41 |
| ▸ | CYP2D6 | P10635 | 1/20 | 0.41 |
| ▸ | CTSL | P07711 | 1/20 | 0.40 |
| ▸ | CTSB | P07858 | 1/20 | 0.40 |
| ▸ | EPHX2 | P34913 | 1/20 | 0.40 |
| ▸ | BIRC2 | Q13490 | 1/20 | 0.39 |
Click a target to see other patent compounds predicted against it — the reverse direction, in place.
Similar compounds — the chemically nearest patent molecules
Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.
| Compound | similarity | top predicted | shared targets | |
|---|---|---|---|---|
| SCHEMBL16667828 | 1.00 | CA12 (0.53) | CA12CA14KMT2ACA1CA2 | |
| SCHEMBL26030105 | 1.00 | CA12 (0.53) | CA12CA14KMT2ACA1CA2 | |
| SCHEMBL13112912 | 1.00 | CA12 (0.53) | CA12CA14KMT2ACA1CA2 | |
| SCHEMBL30134977 | 1.00 | CA12 (0.53) | CA12CA14KMT2ACA1CA2 | |
| SCHEMBL9137141 | 1.00 | CA12 (0.53) | CA12CA14KMT2ACA1CA2 | |
| SCHEMBL709981 | 0.93 | CA14 (0.59) | CA12CA14KMT2ACA1CA2 | |
| SCHEMBL241988 | 0.93 | CA14 (0.59) | CA12CA14KMT2ACA1CA2 | |
| SCHEMBL2380555 | 0.93 | CA14 (0.59) | CA12CA14KMT2ACA1CA2 | |
| SCHEMBL29918031 | 0.91 | KMT2A (0.47) | CA12CA14KMT2ACA1CA2 | |
| SCHEMBL29918107 | 0.91 | KMT2A (0.47) | CA12CA14KMT2ACA1CA2 |
Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.
Patent provenance — the patents this molecule appears in, and who filed them
Claimed or disclosed in 3 patents. claimed = in the patent's claims; disclosed = body only.
| Patent | Title | Assignee | Published | Priority | Filing | Country | Status |
|---|---|---|---|---|---|---|---|
| US-20220033372-A1 | METHODS AND COMPOSITIONS RELATING TO GENOTOXIN COLIBACTIN | PRESIDENT AND FELLOWS OF HARVARD COLLEGE (US) | 2022-02-03 | — | — | US | disclosed |
| US-11040951-B2 | Methods and compositions relating to genotoxin colibactin | PRESIDENT AND FELLOWS OF HARVARD COLLEGE (US) | 2021-06-22 | — | — | US | disclosed |
| US-20200055836-A1 | METHODS AND COMPOSITIONS RELATING TO GENOTOXIN COLIBACTIN | PRESIDENT AND FELLOWS OF HARVARD COLLEGE (US) | 2020-02-20 | — | — | US | disclosed |
Patent text — is the patent's own abstract consistent with the prediction?
For each of this compound's patents that has machine-readable text (3 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.
| Patent | Title | Text reads most about | Predicted target · text-rank |
|---|---|---|---|
| US-20220033372-A1 | METHODS AND COMPOSITIONS RELATING TO GENOTOXIN COLIBACTIN | CACYBP, MYCBP, CLSPN | CA12 3349/4885CA14 3567/4885KMT2A 4272/4885 |
| US-20200055836-A1 | METHODS AND COMPOSITIONS RELATING TO GENOTOXIN COLIBACTIN | CACYBP, MYCBP, CLSPN | CA12 3349/4885CA14 3567/4885KMT2A 4272/4885 |
| US-11040951-B2 | Methods and compositions relating to genotoxin colibactin | CACYBP, MYCBP, CLSPN | CA12 3349/4885CA14 3567/4885KMT2A 4272/4885 |
“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.