Predicted protein targets (top 20)
| gene | UniProt | supporting neighbours | confidence |
|---|---|---|---|
| ALDH1A1 | P00352 | 10/20 | 0.46 |
| MAPT | P10636 | 1/20 | 0.46 |
| CA12 | O43570 | 1/20 | 0.38 |
| CA1 | P00915 | 1/20 | 0.38 |
| CA9 | Q16790 | 1/20 | 0.38 |
| GAA | P10253 | 3/20 | 0.35 |
| MGLL | Q99685 | 1/20 | 0.34 |
| KDM4E | B2RXH2 | 7/20 | 0.33 |
| HPGD | P15428 | 3/20 | 0.33 |
| AKR1C3 | P42330 | 1/20 | 0.33 |
| AKR1C2 | P52895 | 1/20 | 0.33 |
| LMNA | P02545 | 1/20 | 0.33 |
| HSD17B10 | Q99714 | 2/20 | 0.33 |
| ATM | Q13315 | 1/20 | 0.33 |
| SMN1; SMN2 | Q16637 | 1/20 | 0.33 |
| CYP1A2 | P05177 | 1/20 | 0.33 |
| CYP3A4 | P08684 | 1/20 | 0.33 |
| CYP2D6 | P10635 | 1/20 | 0.33 |
| CYP2C9 | P11712 | 1/20 | 0.33 |
| CYP2C19 | P33261 | 1/20 | 0.33 |
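The page does not say how the "supporting neighbours" column is computed. One plausible reading, assumed here and not confirmed by the source, is that it counts how many of the 20 nearest neighbours carry each target. A minimal Python sketch of that counting, with a hypothetical function name and made-up neighbour annotations:

```python
from collections import Counter

def supporting_neighbours(neighbour_targets, k=20):
    """Count, per target, how many of the first k neighbours list it.

    neighbour_targets: list of target lists, one per neighbour, nearest first.
    Assumption: a neighbour "supports" a target if the target appears in its list.
    """
    counts = Counter()
    for targets in neighbour_targets[:k]:
        counts.update(set(targets))  # set() so a neighbour counts at most once per target
    return {target: f"{n}/{k}" for target, n in counts.most_common()}

# Hypothetical annotations for three neighbours (illustrative only).
example = [["ALDH1A1", "MAPT"], ["ALDH1A1"], ["GAA", "GAA"]]
support = supporting_neighbours(example, k=3)
```

Under this reading, a value such as 10/20 would mean half of the 20 neighbours carry the target. How the count feeds into the confidence column is not stated, so the sketch leaves it out.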
Similar compounds — the chemically nearest patent molecules
Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.
| Compound | similarity | top predicted | shared targets |
|---|---|---|---|
| SCHEMBL6512773 | 0.92 | KDM4E (0.37) | ALDH1A1, MAPT, GAA, KDM4E, HPGD |
| SCHEMBL7910494 | 0.84 | ALOX15 (0.39) | ALDH1A1, MAPT, CA12, CA1, CA9 |
| SCHEMBL7745228 | 0.81 | GAA (0.47) | ALDH1A1, CA12, CA1, CA9, GAA |
| Hydrochloric Acid SCHEMBL28842311 | 0.76 | KDM4E (0.46) | ALDH1A1, MAPT, GAA, MGLL, KDM4E |
| SCHEMBL27717830 | 0.75 | CA1 (0.55) | ALDH1A1, MAPT, CA12, CA1, CA9 |
| SCHEMBL9472568 | 0.75 | MAPT (0.65) | ALDH1A1, MAPT, CA12, CA1, CA9 |
| SCHEMBL12756281 | 0.74 | MAPK1 (0.35) | ALDH1A1, MAPT, CA12, CA1, CA9 |
| SCHEMBL1923453 | 0.73 | CA12 (0.38) | ALDH1A1, MAPT, CA12, CA1, CA9 |
| SCHEMBL20133945 | 0.73 | ALDH1A1 (0.43) | ALDH1A1, MAPT, CA12, CA1, CA9 |
| Hydrochloric Acid SCHEMBL7062212 | 0.72 | CA12 (0.37) | ALDH1A1, MAPT, CA12, CA1, CA9 |
Similarity is the cosine over the 2,048-bit Morgan fingerprint, which approximates Tanimoto similarity (for bit vectors, cosine is never lower than Tanimoto). Identical fingerprints score 1.00.
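To make the similarity note concrete, here is a minimal Python sketch of cosine and Tanimoto on bit fingerprints, plus a nearest-neighbour ranking by cosine. It assumes each fingerprint is represented as the set of its on-bit indices. The function names and the tiny library are hypothetical, and this is not the site's actual pipeline.

```python
import math

def cosine_bits(a: set, b: set) -> float:
    """Cosine similarity of two bit vectors given as sets of on-bit indices."""
    if not a or not b:
        return 0.0
    return len(a & b) / math.sqrt(len(a) * len(b))

def tanimoto_bits(a: set, b: set) -> float:
    """Tanimoto (Jaccard) similarity of the same two bit vectors."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0

def nearest(query: set, library: dict, top_n: int = 10) -> list:
    """Rank library compounds by cosine similarity to the query, best first."""
    scored = [(name, cosine_bits(query, fp)) for name, fp in library.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_n]

# Hypothetical fingerprints (real Morgan fingerprints here would have 2,048 bits).
query = {1, 5, 9, 12}
library = {
    "cmpd_A": {1, 5, 9, 12},   # identical to the query
    "cmpd_B": {1, 5, 9, 40},   # shares 3 of 4 bits
    "cmpd_C": {7, 8},          # shares nothing
}
ranked = nearest(query, library)
```

For `cmpd_B` the cosine is 3/4 = 0.75 while Tanimoto is 3/5 = 0.60, which illustrates why the two scores are close but not interchangeable.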
Patent provenance — the patents this molecule appears in, and who filed them
Claimed or disclosed in 2 patents. claimed = in the patent's claims; disclosed = body only.
| Patent | Title | Assignee | Published | Priority | Filing | Country | Status |
|---|---|---|---|---|---|---|---|
| US-20050003371-A1 | Modified nucleotides and methods of labeling nucleic acids | STRATAGENE | 2005-01-06 | — | — | US | disclosed |
| WO-2004037989-A2 | MODIFIED NUCLEOTIDES AND METHODS OF LABELING NUCLEIC ACIDS | STRATAGENE (US) | 2004-05-06 | — | — | WO | disclosed |
Patent text — is the patent's own abstract consistent with the prediction?
For each of this compound's patents that has machine-readable text (1 of 2 here; usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where each chemistry-predicted target lands among 4885 human targets. A high rank (close to 1) means the patent's own wording is consistent with the prediction: a weak, independent signal, not proof of activity.
| Patent | Title | Text reads most about | Predicted target · text-rank |
|---|---|---|---|
| US-20050003371-A1 | Modified nucleotides and methods of labeling nucleic acids | RNGTT, NT5C3B, NT5E | ALDH1A1 2177/4885; MAPT 4276/4885; CA12 2388/4885 |
“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.
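The text-rank values can be read as positions in a sorted list of per-target scores. The sketch below assumes that a higher score means the text is closer to the target and that rank 1 is the best match. The function names and the toy scores are hypothetical, and MedCPT itself is not called.

```python
def text_rank(target: str, scores: dict) -> int:
    """1-based position of `target` when targets are sorted by score, best first."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return ordered.index(target) + 1

def rank_fraction(rank: int, total: int) -> float:
    """Rank as a fraction of the list length; smaller means nearer the top."""
    return rank / total

# Toy scores for three made-up targets (illustrative only).
toy_scores = {"T1": 0.9, "T2": 0.5, "T3": 0.7}
toy_rank = text_rank("T3", toy_scores)

# Ranks reported in the table above, out of 4885 human targets.
reported = {"ALDH1A1": 2177, "MAPT": 4276, "CA12": 2388}
fractions = {gene: round(rank_fraction(r, 4885), 2) for gene, r in reported.items()}
```

Here `fractions` comes out as 0.45 for ALDH1A1, 0.88 for MAPT and 0.49 for CA12, so under these assumptions none of the three predicted targets sits near the top of the text ranking for this patent.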