SCHEMBL214271

SCHEMBL214271

CCc1cccc(F)c1[N+](=O)[O-]

nearest known ligand 0.45

Predicted protein targets (top 20)

geneUniProtsupporting neighboursconfidence
ALDH1A1 P00352 4/20 0.45
TDP1 Q9NUW8 2/20 0.45
MGMT P16455 1/20 0.40
TSHR P16473 3/20 0.40
CYP1A2 P05177 1/20 0.40
CYP2C19 P33261 1/20 0.40
MEN1 O00255 3/20 0.39
KMT2A Q03164 3/20 0.39
HPGD P15428 1/20 0.39
GABRA1 P14867 2/20 0.39
GABRB2 P47870 2/20 0.39
LMNA P02545 2/20 0.39
GAA P10253 1/20 0.39
MAPT P10636 2/20 0.38
POLB P06746 2/20 0.38
SMN1; SMN2 Q16637 1/20 0.37
HTT P42858 1/20 0.36
GPR35 Q9HC97 1/20 0.36
KAT2B Q92831 1/20 0.36
ALDH3A1 P30838 1/20 0.36

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compoundsimilaritytop predictedshared targets
SCHEMBL10944668 0.86 TDP1 (0.54) ALDH1A1TDP1TSHRMEN1KMT2A
SCHEMBL29791278 0.84 ALDH1A1 (0.39) ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL28154174 0.84 ALDH1A1 (0.39) ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL29961217 0.83 ALDH1A1 (0.41) ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL1855977 0.83 MGMT (0.43) ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL16950621 0.83 ALDH1A1 (0.41) ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL20560654 0.83 ALDH1A1 (0.52) ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL962484 0.83 MGMT (0.40) ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL2565099 0.82 GAA (0.39) ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL22284116 0.81 GAA (0.40) ALDH1A1TDP1MGMTTSHRCYP1A2

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 155 patents — showing the first 20. claimed = in the patent's claims; disclosed = body only.

PatentTitleAssigneePublishedPriorityFilingCountryStatus
US-20160264612-A1 MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA UNIV COLUMBIA (US) 2016-09-15 US claimed
US-20130096015-A1 Massive Parallel Method For Decoding DNA And RNA THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US) 2013-04-18 US claimed
US-7345159-B2 Massive parallel method for decoding DNA and RNA THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US) 2008-03-18 US claimed
EP-1790736-A2 Massive parallel method for decoding DNA and RNA THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US) 2007-05-30 EP claimed
EP-1337541-B1 Massive parallel method for decoding DNA and RNA UNIV COLUMBIA (US) 2007-03-07 EP claimed
US-20040185466-A1 to detect single nucleotide polymorphisms, genetic mutation, gene expression, forensic identification, disease association, DNA/genomic sequencing and translation/transcription analysis; solid phase synthesis THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK. 2004-09-23 US claimed
EP-1337541-A4 MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA UNIV COLUMBIA (US) 2004-07-14 EP claimed
JP-2004510433-A 2004-04-08 JP claimed
US-6664079-B2 Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK 2003-12-16 US claimed
EP-1337541-A2 Massive parallel method for decoding DNA and RNA The Trustees of Columbia University in the City of New York (US) 2003-08-27 EP claimed
US-20020102586-A1 Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences NATIONAL SCIENCE FOUNDATION 2002-08-01 US claimed
WO-2002029003-A2 MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US) 2002-04-11 WO claimed
US-11940413-B2 Methods and devices for sequencing nucleic acids in smaller batches IsoPlexis Corporation (US) 2024-03-26 US disclosed
US-20240076731-A1 METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES QIAGEN SCIENCES, LLC 2024-03-07 US disclosed
US-20240060128-A1 METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES QIAGEN WALTHAM, INC. 2024-02-22 US disclosed
US-20230407384-A1 METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES QIAGEN WALTHAM, INC. 2023-12-21 US disclosed
CN-1309646-A Di- or tri-fluoromethanesulfonyl anilide derivs., process for preparation of them and herbicides contg. them as active ingredient IHARA CHEMICAL IND CO (JP) 2001-08-22 CN disclosed
EP-1101760-A1 DI- OR TRI-FLUOROMETHANESULFONYL ANILIDE DERIVATIVES, PROCESS FOR THE PREPARATION OF THEM AND HERBICIDES CONTAINING THEM AS THE ACTIVE INGREDIENT IHARA CHEMICAL INDUSTRY Co., Ltd. (JP) 2001-05-23 EP disclosed
US-4721790-A Nucleophilic substitution process for fluoronitroaralkyloxazoline ETHYL CORPORATION (US) 1988-01-26 US disclosed
US-4540795-A Fluoronitroaralkyloxazolines, derivatives thereof, and nucleophilic substitution processes for preparing them ETHYL CORPORATION (US) 1985-09-10 US disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (4 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

PatentTitleText reads most aboutPredicted target · text-rank
US-20160264612-A1 MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA RNGTT, NSUN2, RNMT ALDH1A1 3815/4885TDP1 299/4885MGMT 163/4885
US-20240076731-A1 METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES NT5C2, NOP2, NT5C3B ALDH1A1 4837/4885TDP1 116/4885MGMT 1701/4885
US-20230407384-A1 METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES NT5C2, NOP2, NT5C3B ALDH1A1 4837/4885TDP1 116/4885MGMT 1701/4885
US-20240060128-A1 METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES NT5C2, NOP2, NT5C3B ALDH1A1 4837/4885TDP1 116/4885MGMT 1701/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.