SCHEMBL891512

SCHEMBL891512

O=[C]/C=C\C(=O)CC(=O)C(=O)O

nearest known ligand 0.44

Predicted protein targets (top 7)

geneUniProtsupporting neighboursconfidence
HAO1 Q9UJM8 2/20 0.44
OR51E2 Q9H255 1/20 0.32
PTPN1 P18031 1/20 0.31
ERCC5 P28715 1/20 0.31
FEN1 P39748 1/20 0.31
POLB P06746 1/20 0.31
DNTT P04053 1/20 0.31

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL4731952 1.00 HAO1 (0.44) HAO1OR51E2PTPN1ERCC5FEN1
SCHEMBL380204 0.77 LDHA (0.33)
SCHEMBL23361428 0.72 TSHR (0.50) HAO1OR51E2PTPN1ERCC5FEN1
SCHEMBL1332598 0.72 TSHR (0.50) HAO1OR51E2PTPN1ERCC5FEN1
SCHEMBL891513 0.72 TSHR (0.50) HAO1OR51E2PTPN1ERCC5FEN1
SCHEMBL5473789 0.70
SCHEMBL28089517 0.69 HAO1 (0.45) HAO1OR51E2PTPN1ERCC5FEN1
SCHEMBL2440721 0.68
SCHEMBL11803247 0.68
SCHEMBL536651 0.68

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 34 patents — showing the first 20. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
CN-108350472-A Process for producing aromatic compound and derivative thereof 国立研究开发法人理化学研究所 2018-07-31 CN claimed
CN-1594557-A Construction and use for gentisic acid 1,2-dioxygenase gene engineering bacterium CHINA ACADEMY OF SCIENCES WUHA (CN) 2005-03-16 CN claimed
US-12577526-B2 Massively parallel on-chip construction of synthetic microbial communities THE BROAD INSTITUTE, INC. (US) 2026-03-17 US disclosed
US-20250376656-A1 MASSIVELY PARALLEL ON-CHIP CONSTRUCTION OF SYNTHETIC MICROBIAL COMMUNITIES MASSACHUSETTS INSTITUTE OF TECHNOLOGY (US) 2025-12-11 US disclosed
CN-112375723-B Engineering bacteria for producing maleic acid and construction method and application thereof 北京化工大学 2023-06-27 CN disclosed
US-20220228190-A1 MASSIVELY PARALLEL ON-CHIP CONSTRUCTION OF SYNTHETIC MICROBIAL COMMUNITIES MASSACHUSETTS INSTITUTE OF TECHNOLOGY (US) 2022-07-21 US disclosed
US-11060079-B2 Methods and microorganisms for producing flavors and fragrance chemicals ARDRA INC. (CA) 2021-07-13 US disclosed
CN-112375723-A Engineering bacterium for producing maleic acid and construction method and application thereof 北京化工大学 2021-02-19 CN disclosed
US-10329575-B2 Regulatory sequence for plants CERES, INC. (US) 2019-06-25 US disclosed
US-20190177713-A1 METHODS AND MICROORGANISMS FOR PRODUCING FLAVORS AND FRAGRANCE CHEMICALS ARDRA INC. (CA) 2019-06-13 US disclosed
EP-3478848-A2 METHODS AND MICROORGANISMS FOR PRODUCING FLAVORS AND FRAGRANCE CHEMICALS Ardra Bio Inc. (US) 2019-05-08 EP disclosed
CN-101974615-A Method for qualitatively and quantitatively measuring mycothiol CHINESE ACAD INST MICROBIOLOGY 2011-02-16 CN disclosed
US-20100037355-A1 SEQUENCE-DETERMINED DNA FRAGMENTS AND CORRESPONDING POLYPEPTIDES ENCODED THEREBY CERES, INC (US) 2010-02-11 US disclosed
US-20090265815-A1 Sequence-determined DNA fragments and corresponding polypeptides encoded therapy ALEXANDROV NICKOLAI 2009-10-22 US disclosed
US-20070214517-A1 Sequence-determined DNA fragments and corresponding polypeptides encoded thereby CERES, INC. (US) 2007-09-13 US disclosed
US-20060150283-A1 Sequence-determined DNA fragments and corresponding polypeptides encoded thereby ALEXANDROV NICKOLAI 2006-07-06 US disclosed
US-20060048240-A1 Sequence-determined DNA fragments and corresponding polypeptides encoded thereby ALEXANDROV NICKOLAI 2006-03-02 US disclosed
CN-1594557-A Construction and use for gentisic acid 1,2-dioxygenase gene engineering bacterium CHINA ACADEMY OF SCIENCES WUHA (CN) 2005-03-16 CN disclosed
US-6594587-B2 Determination of associations among set of biological elements using algorithm that is capable of generating Steiner tree MONSANTO TECHNOLOGY LLC 2003-07-15 US disclosed
US-20020123847-A1 Method for analyzing biological elements TELEPHONICS CORPORATION 2002-09-05 US disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (4 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20070214517-A1 Sequence-determined DNA fragments and corresponding polypeptides encoded thereby RNGTT, CPSF1, CPSF6 HAO1 4100/4885OR51E2 2604/4885PTPN1 4727/4885
US-20100037355-A1 SEQUENCE-DETERMINED DNA FRAGMENTS AND CORRESPONDING POLYPEPTIDES ENCODED THEREBY RNGTT, CPSF1, CPSF6 HAO1 4100/4885OR51E2 2604/4885PTPN1 4727/4885
US-20060048240-A1 Sequence-determined DNA fragments and corresponding polypeptides encoded thereby RNGTT, CPSF1, CPSF6 HAO1 4100/4885OR51E2 2604/4885PTPN1 4727/4885
US-12577526-B2 Massively parallel on-chip construction of synthetic microbial communities STUB1, SAMM50, CD14 HAO1 4531/4885OR51E2 2730/4885PTPN1 85/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.