SCHEMBL214271 — predicted protein targets

Predicted protein targets (top 20)

	gene	UniProt	supporting neighbours	confidence
▸	ALDH1A1	P00352	4/20	0.45

▸	TDP1	Q9NUW8	2/20	0.45

▸	MGMT	P16455	1/20	0.40

▸	TSHR	P16473	3/20	0.40

▸	CYP1A2	P05177	1/20	0.40

▸	CYP2C19	P33261	1/20	0.40

▸	MEN1	O00255	3/20	0.39

▸	KMT2A	Q03164	3/20	0.39

▸	HPGD	P15428	1/20	0.39

▸	GABRA1	P14867	2/20	0.39

▸	GABRB2	P47870	2/20	0.39

▸	LMNA	P02545	2/20	0.39

▸	GAA	P10253	1/20	0.39

▸	MAPT	P10636	2/20	0.38

▸	POLB	P06746	2/20	0.38

▸	SMN1; SMN2	Q16637	1/20	0.37

▸	HTT	P42858	1/20	0.36

▸	GPR35	Q9HC97	1/20	0.36

▸	KAT2B	Q92831	1/20	0.36

▸	ALDH3A1	P30838	1/20	0.36

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound	similarity	top predicted	shared targets
SCHEMBL10944668	0.86	TDP1 (0.54)	ALDH1A1TDP1TSHRMEN1KMT2A
SCHEMBL29791278	0.84	ALDH1A1 (0.39)	ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL28154174	0.84	ALDH1A1 (0.39)	ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL29961217	0.83	ALDH1A1 (0.41)	ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL1855977	0.83	MGMT (0.43)	ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL16950621	0.83	ALDH1A1 (0.41)	ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL20560654	0.83	ALDH1A1 (0.52)	ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL962484	0.83	MGMT (0.40)	ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL2565099	0.82	GAA (0.39)	ALDH1A1TDP1MGMTTSHRCYP1A2
SCHEMBL22284116	0.81	GAA (0.40)	ALDH1A1TDP1MGMTTSHRCYP1A2

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 155 patents — showing the first 20. claimed = in the patent's claims; disclosed = body only.

Patent	Title	Assignee	Published	Priority	Filing	Country	Status
US-20160264612-A1	MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA	UNIV COLUMBIA (US)	2016-09-15	—	—	US	claimed
US-20130096015-A1	Massive Parallel Method For Decoding DNA And RNA	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US)	2013-04-18	—	—	US	claimed
US-7345159-B2	Massive parallel method for decoding DNA and RNA	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US)	2008-03-18	—	—	US	claimed
EP-1790736-A2	Massive parallel method for decoding DNA and RNA	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US)	2007-05-30	—	—	EP	claimed
EP-1337541-B1	Massive parallel method for decoding DNA and RNA	UNIV COLUMBIA (US)	2007-03-07	—	—	EP	claimed
US-20040185466-A1	to detect single nucleotide polymorphisms, genetic mutation, gene expression, forensic identification, disease association, DNA/genomic sequencing and translation/transcription analysis; solid phase synthesis	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK.	2004-09-23	—	—	US	claimed
EP-1337541-A4	MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA	UNIV COLUMBIA (US)	2004-07-14	—	—	EP	claimed
JP-2004510433-A	—	—	2004-04-08	—	—	JP	claimed
US-6664079-B2	Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK	2003-12-16	—	—	US	claimed
EP-1337541-A2	Massive parallel method for decoding DNA and RNA	The Trustees of Columbia University in the City of New York (US)	2003-08-27	—	—	EP	claimed
US-20020102586-A1	Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences	NATIONAL SCIENCE FOUNDATION	2002-08-01	—	—	US	claimed
WO-2002029003-A2	MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US)	2002-04-11	—	—	WO	claimed
US-11940413-B2	Methods and devices for sequencing nucleic acids in smaller batches	IsoPlexis Corporation (US)	2024-03-26	—	—	US	disclosed
US-20240076731-A1	METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES	QIAGEN SCIENCES, LLC	2024-03-07	—	—	US	disclosed
US-20240060128-A1	METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES	QIAGEN WALTHAM, INC.	2024-02-22	—	—	US	disclosed
US-20230407384-A1	METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES	QIAGEN WALTHAM, INC.	2023-12-21	—	—	US	disclosed
CN-1309646-A	Di- or tri-fluoromethanesulfonyl anilide derivs., process for preparation of them and herbicides contg. them as active ingredient	IHARA CHEMICAL IND CO (JP)	2001-08-22	—	—	CN	disclosed
EP-1101760-A1	DI- OR TRI-FLUOROMETHANESULFONYL ANILIDE DERIVATIVES, PROCESS FOR THE PREPARATION OF THEM AND HERBICIDES CONTAINING THEM AS THE ACTIVE INGREDIENT	IHARA CHEMICAL INDUSTRY Co., Ltd. (JP)	2001-05-23	—	—	EP	disclosed
US-4721790-A	Nucleophilic substitution process for fluoronitroaralkyloxazoline	ETHYL CORPORATION (US)	1988-01-26	—	—	US	disclosed
US-4540795-A	Fluoronitroaralkyloxazolines, derivatives thereof, and nucleophilic substitution processes for preparing them	ETHYL CORPORATION (US)	1985-09-10	—	—	US	disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (4 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

Patent	Title	Text reads most about	Predicted target · text-rank
US-20160264612-A1	MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA	RNGTT, NSUN2, RNMT	ALDH1A1 3815/4885TDP1 299/4885MGMT 163/4885
US-20240076731-A1	METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES	NT5C2, NOP2, NT5C3B	ALDH1A1 4837/4885TDP1 116/4885MGMT 1701/4885
US-20230407384-A1	METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES	NT5C2, NOP2, NT5C3B	ALDH1A1 4837/4885TDP1 116/4885MGMT 1701/4885
US-20240060128-A1	METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES	NT5C2, NOP2, NT5C3B	ALDH1A1 4837/4885TDP1 116/4885MGMT 1701/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.