SCHEMBL213292

SCHEMBL213292

O=C1OC2(c3ccc(O)cc3Oc3cc(O)ccc32)c2cc(Cl)c(Cl)c(C(=O)O)c21

nearest known ligand 0.53

Predicted protein targets (top 20)

geneUniProtsupporting neighboursconfidence
HPGD P15428 3/20 0.53
ALDH1A1 P00352 3/20 0.53
ALOX12 P18054 3/20 0.53
CYP2C9 P11712 2/20 0.53
ALOX15 P16050 2/20 0.53
BRCA1 P38398 2/20 0.53
HSD17B10 Q99714 2/20 0.53
FTO Q9C0B1 10/20 0.52
MAPT P10636 3/20 0.52
MEN1 O00255 2/20 0.52
HTT P42858 2/20 0.52
KMT2A Q03164 2/20 0.52
SMAD3 P84022 2/20 0.52
CYP2C19 P33261 1/20 0.52
RECQL P46063 1/20 0.52
BLM P54132 1/20 0.52
PRMT1 Q99873 1/20 0.52
ACE2 Q9BYF1 1/20 0.52
TDP1 Q9NUW8 1/20 0.52
TTR P02766 1/20 0.37

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL3625220 0.87 FTO (0.58) HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL19207355 0.82 HPGD (0.54) HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL37698 0.82 HPGD (0.54) HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL6037012 0.82 ALDH1A1 (0.56) HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL6000384 0.81 FTO (0.55) HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL1032008 0.81 ALDH1A1 (0.59) HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL5148271 0.81 ALDH1A1 (0.59) HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL6535901 0.81 HPGD (0.55) HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL17515090 0.81 ALDH1A1 (0.55) HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL7095947 0.81 ALDH1A1 (0.50) HPGDALDH1A1ALOX12CYP2C9ALOX15

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 124 patents — showing the first 20. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-20160264612-A1 MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA UNIV COLUMBIA (US) 2016-09-15 US claimed
US-20130096015-A1 Massive Parallel Method For Decoding DNA And RNA THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US) 2013-04-18 US claimed
US-7345159-B2 Massive parallel method for decoding DNA and RNA THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US) 2008-03-18 US claimed
EP-1790736-A2 Massive parallel method for decoding DNA and RNA THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US) 2007-05-30 EP claimed
EP-1337541-B1 Massive parallel method for decoding DNA and RNA UNIV COLUMBIA (US) 2007-03-07 EP claimed
US-20040185466-A1 to detect single nucleotide polymorphisms, genetic mutation, gene expression, forensic identification, disease association, DNA/genomic sequencing and translation/transcription analysis; solid phase synthesis THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK. 2004-09-23 US claimed
EP-1337541-A4 MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA UNIV COLUMBIA (US) 2004-07-14 EP claimed
JP-2004510433-A 2004-04-08 JP claimed
US-6664079-B2 Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK 2003-12-16 US claimed
EP-1337541-A2 Massive parallel method for decoding DNA and RNA The Trustees of Columbia University in the City of New York (US) 2003-08-27 EP claimed
US-20020102586-A1 Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences NATIONAL SCIENCE FOUNDATION 2002-08-01 US claimed
WO-2002029003-A2 MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US) 2002-04-11 WO claimed
US-11940413-B2 Methods and devices for sequencing nucleic acids in smaller batches IsoPlexis Corporation (US) 2024-03-26 US disclosed
US-20240076731-A1 METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES QIAGEN SCIENCES, LLC 2024-03-07 US disclosed
US-20240060128-A1 METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES QIAGEN WALTHAM, INC. 2024-02-22 US disclosed
US-20210231609-A1 METHODS AND DEVICES FOR SEQUENCING NUCLEIC ACIDS IN SMALLER BATCHES IsoPlexis Corporation 2021-07-29 US disclosed
US-6664079-B2 Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK 2003-12-16 US disclosed
EP-1337541-A2 Massive parallel method for decoding DNA and RNA The Trustees of Columbia University in the City of New York (US) 2003-08-27 EP disclosed
US-20020102586-A1 Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences NATIONAL SCIENCE FOUNDATION 2002-08-01 US disclosed
WO-2002029003-A2 MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US) 2002-04-11 WO disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (3 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20160264612-A1 MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA RNGTT, NSUN2, RNMT HPGD 3255/4885ALDH1A1 3815/4885ALOX12 4422/4885
US-20240076731-A1 METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES NT5C2, NOP2, NT5C3B HPGD 4406/4885ALDH1A1 4837/4885ALOX12 3123/4885
US-20240060128-A1 METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES NT5C2, NOP2, NT5C3B HPGD 4406/4885ALDH1A1 4837/4885ALOX12 3123/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.