SCHEMBL213292 — predicted protein targets

Predicted protein targets (top 20)

	gene	UniProt	supporting neighbours	confidence
▸	HPGD	P15428	3/20	0.53

▸	ALDH1A1	P00352	3/20	0.53

▸	ALOX12	P18054	3/20	0.53

▸	CYP2C9	P11712	2/20	0.53

▸	ALOX15	P16050	2/20	0.53

▸	BRCA1	P38398	2/20	0.53

▸	HSD17B10	Q99714	2/20	0.53

▸	FTO	Q9C0B1	10/20	0.52

▸	MAPT	P10636	3/20	0.52

▸	MEN1	O00255	2/20	0.52

▸	HTT	P42858	2/20	0.52

▸	KMT2A	Q03164	2/20	0.52

▸	SMAD3	P84022	2/20	0.52

▸	CYP2C19	P33261	1/20	0.52

▸	RECQL	P46063	1/20	0.52

▸	BLM	P54132	1/20	0.52

▸	PRMT1	Q99873	1/20	0.52

▸	ACE2	Q9BYF1	1/20	0.52

▸	TDP1	Q9NUW8	1/20	0.52

▸	TTR	P02766	1/20	0.37

Click a target to see other patent compounds predicted against it — the reverse direction, in place.

Similar compounds — the chemically nearest patent molecules

Nearest neighbours by Morgan-fingerprint cosine across the patent-compound collection, with each neighbour's top predicted target and the predicted targets it shares with this molecule.

Compound	similarity	top predicted	shared targets
SCHEMBL3625220	0.87	FTO (0.58)	HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL19207355	0.82	HPGD (0.54)	HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL37698	0.82	HPGD (0.54)	HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL6037012	0.82	ALDH1A1 (0.56)	HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL6000384	0.81	FTO (0.55)	HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL1032008	0.81	ALDH1A1 (0.59)	HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL5148271	0.81	ALDH1A1 (0.59)	HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL6535901	0.81	HPGD (0.55)	HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL17515090	0.81	ALDH1A1 (0.55)	HPGDALDH1A1ALOX12CYP2C9ALOX15
SCHEMBL7095947	0.81	ALDH1A1 (0.50)	HPGDALDH1A1ALOX12CYP2C9ALOX15

Similarity is cosine over the 2,048-bit Morgan fingerprint (≈ Tanimoto). Identical fingerprints score 1.00.

Patent provenance — the patents this molecule appears in, and who filed them

Claimed or disclosed in 124 patents — showing the first 20. claimed = in the patent's claims; disclosed = body only.

Patent	Title	Assignee	Published	Priority	Filing	Country	Status
US-20160264612-A1	MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA	UNIV COLUMBIA (US)	2016-09-15	—	—	US	claimed
US-20130096015-A1	Massive Parallel Method For Decoding DNA And RNA	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US)	2013-04-18	—	—	US	claimed
US-7345159-B2	Massive parallel method for decoding DNA and RNA	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US)	2008-03-18	—	—	US	claimed
EP-1790736-A2	Massive parallel method for decoding DNA and RNA	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US)	2007-05-30	—	—	EP	claimed
EP-1337541-B1	Massive parallel method for decoding DNA and RNA	UNIV COLUMBIA (US)	2007-03-07	—	—	EP	claimed
US-20040185466-A1	to detect single nucleotide polymorphisms, genetic mutation, gene expression, forensic identification, disease association, DNA/genomic sequencing and translation/transcription analysis; solid phase synthesis	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK.	2004-09-23	—	—	US	claimed
EP-1337541-A4	MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA	UNIV COLUMBIA (US)	2004-07-14	—	—	EP	claimed
JP-2004510433-A	—	—	2004-04-08	—	—	JP	claimed
US-6664079-B2	Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK	2003-12-16	—	—	US	claimed
EP-1337541-A2	Massive parallel method for decoding DNA and RNA	The Trustees of Columbia University in the City of New York (US)	2003-08-27	—	—	EP	claimed
US-20020102586-A1	Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences	NATIONAL SCIENCE FOUNDATION	2002-08-01	—	—	US	claimed
WO-2002029003-A2	MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US)	2002-04-11	—	—	WO	claimed
US-11940413-B2	Methods and devices for sequencing nucleic acids in smaller batches	IsoPlexis Corporation (US)	2024-03-26	—	—	US	disclosed
US-20240076731-A1	METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES	QIAGEN SCIENCES, LLC	2024-03-07	—	—	US	disclosed
US-20240060128-A1	METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES	QIAGEN WALTHAM, INC.	2024-02-22	—	—	US	disclosed
US-20210231609-A1	METHODS AND DEVICES FOR SEQUENCING NUCLEIC ACIDS IN SMALLER BATCHES	IsoPlexis Corporation	2021-07-29	—	—	US	disclosed
US-6664079-B2	Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK	2003-12-16	—	—	US	disclosed
EP-1337541-A2	Massive parallel method for decoding DNA and RNA	The Trustees of Columbia University in the City of New York (US)	2003-08-27	—	—	EP	disclosed
US-20020102586-A1	Sequencing nucleic acid; obtain nucleotide sequences, incubate in reaction mixture, apply to solid support, determine nucleotide sequences	NATIONAL SCIENCE FOUNDATION	2002-08-01	—	—	US	disclosed
WO-2002029003-A2	MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA	THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK (US)	2002-04-11	—	—	WO	disclosed

Patent text — is the patent's own abstract consistent with the prediction?

For each of this compound's patents that has machine-readable text (3 of them — usually the abstract, not the full specification), we ask MedCPT which protein the text reads most about, and where the chemistry-predicted target lands among 4885 human targets. A high rank means the patent's own wording is consistent with the prediction — a weak, independent signal, not proof of activity.

Patent	Title	Text reads most about	Predicted target · text-rank
US-20160264612-A1	MASSIVE PARALLEL METHOD FOR DECODING DNA AND RNA	RNGTT, NSUN2, RNMT	HPGD 3255/4885ALDH1A1 3815/4885ALOX12 4422/4885
US-20240076731-A1	METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES	NT5C2, NOP2, NT5C3B	HPGD 4406/4885ALDH1A1 4837/4885ALOX12 3123/4885
US-20240060128-A1	METHODS AND COMPOSITIONS FOR INCORPORATING NUCLEOTIDES	NT5C2, NOP2, NT5C3B	HPGD 4406/4885ALDH1A1 4837/4885ALOX12 3123/4885

“Text reads most about” is the patent abstract's nearest protein in MedCPT space (background-debiased). Only ~1.4% of patents have machine-readable text, so most compounds won't have this panel.