Provide a comprehensive cross-database identifier and functional mapping reference for human MYC — a definitive lookup resource covering:

### Section 1: Gene identifiers
For human gene MYC, list ALL gene-level database identifiers.

Required:
- HGNC ID and approved symbol
- Ensembl gene ID (ENSG...)
- NCBI Entrez Gene ID
- OMIM gene/locus ID
- Genomic location: chromosome, start position, end position, strand (GRCh38)

### Section 2: Transcript identifiers
For human gene MYC, list ALL transcript-level identifiers.

Required:
- Ensembl transcripts: ALL ENST IDs with biotype. Total count.
- RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select.
- CCDS IDs.
- For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count.

### Section 3: Protein identifiers
For human gene MYC protein product(s), list ALL protein-level identifiers.

Required:
- UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry.
- RefSeq protein: ALL NP_ accessions.
- Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID.
- Antibody availability: known antibody resources for the protein.

### Section 4: Structure
For human gene MYC protein, list ALL structural data.

Required:
- Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count.
- Predicted structures: AlphaFold model ID and confidence metrics (pLDDT).

### Section 5: Cross-species orthologs
For human gene MYC, list orthologous genes in key model organisms.

Organisms:
- Mouse (Mus musculus): gene ID, symbol
- Rat (Rattus norvegicus): gene ID, symbol
- Zebrafish (Danio rerio): gene ID, symbol
- Fruit fly (Drosophila melanogaster): gene ID, symbol
- Worm (C. elegans): gene ID, symbol
- Yeast (S. cerevisiae): gene ID, symbol

### Section 6: Clinical variants & AI predictions
For human gene MYC, summarize clinical variants and AI predictions.

Clinical variant annotations (ClinVar):
- Total variant count (approximate is fine)
- Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign
- TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition

AI-based variant effect predictions:
- Splice effect predictions: total count + TOP 30 with delta scores if known
- Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores.

### Section 7: Pathways & Gene Ontology
For human gene MYC, list biological pathways and Gene Ontology annotations.

Pathway membership:
- ALL biological pathways this gene participates in, with pathway IDs and names
- Total pathway count

Gene Ontology:
- Biological Process: count and TOP 20 terms with GO IDs
- Molecular Function: count and TOP 20 terms with GO IDs
- Cellular Component: count and TOP 20 terms with GO IDs

### Section 8: Protein interactions & networks
For human gene MYC protein, summarize protein interactions and networks.

Protein-protein interactions (STRING, IntAct, BioGRID, etc.):
- Total interaction count (approximate)
- TOP 30 highest-confidence interacting proteins with scores/evidence

Protein similarity:
- Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores
- Sequence homology: TOP 20 homologous proteins with identity/similarity

### Section 9: Transcription factor regulatory data
For human gene MYC, summarize transcription factor regulatory data.

If MYC is a transcription factor:
- Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence
- DNA binding motifs from JASPAR — all known motif IDs and motif family classification.

Regardless:
- Upstream regulators: TFs that regulate MYC — names with evidence type (ChIP-seq / predicted / experimentally validated)

If MYC is not a transcription factor, say so briefly and skip the downstream/motif sections.

### Section 10: Drug & pharmacology data
For human gene MYC protein as a drug target, summarize pharmacology data.

If MYC is a known drug target:
- Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase)
- Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention
- Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any

If MYC is not currently a drug target, say so briefly.

### Section 11: Expression profiles
For human gene MYC, summarize expression profiles.

Tissue expression (GTEx, HPA, Bgee, etc.):
- TOP 30 tissues with expression scores/levels (direction, units if known)
- Note tissue-specific or tissue-enriched patterns

Cell type expression (Tabula Sapiens, HCA, etc.):
- TOP 30 cell types with expression scores
- Note cell-type-specific patterns

Single-cell expression: notable datasets or cell populations of interest for this gene.

### Section 12: Disease associations
For human gene MYC, summarize disease associations.

Mendelian / monogenic disease:
- Diseases caused by mutations in MYC: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level
- Include all directly linked conditions

Phenotype associations:
- Clinical phenotypes associated with the gene (HPO terms where known)
- TOP 30 phenotype terms with HPO IDs

Complex-disease / GWAS:
- Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known
- TOP 30 GWAS associations

Question

Provide a comprehensive cross-database identifier and functional mapping reference for human MYC — a definitive lookup resource covering:

### Section 1: Gene identifiers
For human gene MYC, list ALL gene-level database identifiers.

Required:
- HGNC ID and approved symbol
- Ensembl gene ID (ENSG...)
- NCBI Entrez Gene ID
- OMIM gene/locus ID
- Genomic location: chromosome, start position, end position, strand (GRCh38)

### Section 2: Transcript identifiers
For human gene MYC, list ALL transcript-level identifiers.

Required:
- Ensembl transcripts: ALL ENST IDs with biotype. Total count.
- RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select.
- CCDS IDs.
- For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count.

### Section 3: Protein identifiers
For human gene MYC protein product(s), list ALL protein-level identifiers.

Required:
- UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry.
- RefSeq protein: ALL NP_ accessions.
- Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID.
- Antibody availability: known antibody resources for the protein.

### Section 4: Structure
For human gene MYC protein, list ALL structural data.

Required:
- Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count.
- Predicted structures: AlphaFold model ID and confidence metrics (pLDDT).

### Section 5: Cross-species orthologs
For human gene MYC, list orthologous genes in key model organisms.

Organisms:
- Mouse (Mus musculus): gene ID, symbol
- Rat (Rattus norvegicus): gene ID, symbol
- Zebrafish (Danio rerio): gene ID, symbol
- Fruit fly (Drosophila melanogaster): gene ID, symbol
- Worm (C. elegans): gene ID, symbol
- Yeast (S. cerevisiae): gene ID, symbol

### Section 6: Clinical variants & AI predictions
For human gene MYC, summarize clinical variants and AI predictions.

Clinical variant annotations (ClinVar):
- Total variant count (approximate is fine)
- Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign
- TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition

AI-based variant effect predictions:
- Splice effect predictions: total count + TOP 30 with delta scores if known
- Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores.

### Section 7: Pathways & Gene Ontology
For human gene MYC, list biological pathways and Gene Ontology annotations.

Pathway membership:
- ALL biological pathways this gene participates in, with pathway IDs and names
- Total pathway count

Gene Ontology:
- Biological Process: count and TOP 20 terms with GO IDs
- Molecular Function: count and TOP 20 terms with GO IDs
- Cellular Component: count and TOP 20 terms with GO IDs

### Section 8: Protein interactions & networks
For human gene MYC protein, summarize protein interactions and networks.

Protein-protein interactions (STRING, IntAct, BioGRID, etc.):
- Total interaction count (approximate)
- TOP 30 highest-confidence interacting proteins with scores/evidence

Protein similarity:
- Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores
- Sequence homology: TOP 20 homologous proteins with identity/similarity

### Section 9: Transcription factor regulatory data
For human gene MYC, summarize transcription factor regulatory data.

If MYC is a transcription factor:
- Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence
- DNA binding motifs from JASPAR — all known motif IDs and motif family classification.

Regardless:
- Upstream regulators: TFs that regulate MYC — names with evidence type (ChIP-seq / predicted / experimentally validated)

If MYC is not a transcription factor, say so briefly and skip the downstream/motif sections.

### Section 10: Drug & pharmacology data
For human gene MYC protein as a drug target, summarize pharmacology data.

If MYC is a known drug target:
- Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase)
- Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention
- Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any

If MYC is not currently a drug target, say so briefly.

### Section 11: Expression profiles
For human gene MYC, summarize expression profiles.

Tissue expression (GTEx, HPA, Bgee, etc.):
- TOP 30 tissues with expression scores/levels (direction, units if known)
- Note tissue-specific or tissue-enriched patterns

Cell type expression (Tabula Sapiens, HCA, etc.):
- TOP 30 cell types with expression scores
- Note cell-type-specific patterns

Single-cell expression: notable datasets or cell populations of interest for this gene.

### Section 12: Disease associations
For human gene MYC, summarize disease associations.

Mendelian / monogenic disease:
- Diseases caused by mutations in MYC: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level
- Include all directly linked conditions

Phenotype associations:
- Clinical phenotypes associated with the gene (HPO terms where known)
- TOP 30 phenotype terms with HPO IDs

Complex-disease / GWAS:
- Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known
- TOP 30 GWAS associations

Accepted Answer

Executive summary MYC (HGNC:7553; chr8q24.21) encodes the Myc proto-oncogene protein, a bHLH-ZIP transcription factor that is one of the most broadly important oncogenes in human biology, driving cell proliferation, differentiation, and apoptosis across virtually all tissue types. It is most strongly implicated in Burkitt lymphoma through direct mutation, and GWAS signals at the 8q24 locus associate the region with colorectal, prostate, breast, and bladder cancers as well as asthma. MYC …

Transcript ID	Biotype	Start	End
ENST00000259523	protein_coding	127,735,434	127,740,477
ENST00000377970	protein_coding	127,736,084	127,741,434
ENST00000517291	protein_coding	127,735,519	127,738,772
ENST00000520751	protein_coding_CDS_not_defined	127,736,231	127,738,475
ENST00000524013	protein_coding	127,736,220	127,741,372
ENST00000621592	protein_coding	127,736,231	127,742,951
ENST00000641036	protein_coding	127,736,046	127,736,612
ENST00000641252	protein_coding	127,735,473	127,735,817
ENST00000651626	protein_coding	127,735,659	127,741,434
ENST00000652288	protein_coding	127,737,621	127,741,434
ENST00000707113	protein_coding	127,736,958	127,741,434
ENST00000707114	protein_coding	127,737,724	127,741,434
ENST00000707115	protein_coding	127,737,737	127,741,434
ENST00000707116	protein_coding	127,737,745	127,741,434
ENST00000707117	protein_coding_CDS_not_defined	127,739,073	127,741,434

NM ID	MANE Select
NM_001177352	—
NM_001177353	—
NM_001177354	—
NM_001272275	—
NM_001354870	—
NM_002467	✓
NM_010849	—
NM_012603	—
NM_080323	—

Exon ID	Start	End	Length
ENSE00003746860	127,736,231	127,736,623	393 bp
ENSE00003475084	127,738,248	127,739,019	771 bp
ENSE00003847400	127,740,396	127,742,951	2,556 bp

Name	Type	ID
Transcription regulator Myc	Family	IPR002418
Myc family transcription factors	Family	IPR050433
Myc-type, basic helix-loop-helix (bHLH) domain	Domain	IPR011598
Transcription regulator Myc, N-terminal	Domain	IPR012682
Leucine zipper, Myc	Domain	IPR003327
Helix-loop-helix DNA-binding domain superfamily	Homologous superfamily	IPR036638
—	Domain (PFAM)	PF00010
—	Domain (PFAM)	PF01056
—	Domain (PFAM)	PF02344
—	Domain (SMART)	SM00353

PDB ID	Title	Method	Resolution
1A93	NMR solution structure of the c-Myc-Max heterodimeric leucine zipper	Solution NMR	N/A
1EE4	Crystal structure of yeast karyopherin (importin) α with c-Myc NLS peptide	X-ray Diffraction	2.1 Å
1MV0	NMR structure of tumor suppressor BIN1 with c-Myc interaction	Solution NMR	N/A
1NKP	Crystal structure of Myc-Max recognizing DNA	X-ray Diffraction	1.8 Å
2A93	NMR solution structure of c-Myc-Max heterodimeric leucine zipper (40 structures)	Solution NMR	N/A
2OR9	Anti-c-Myc antibody 9E10 Fab fragment/epitope peptide complex	X-ray Diffraction	2.7 Å
4Y7R	WDR5 in complex with Myc MbIIIb peptide	X-ray Diffraction	1.898 Å
5I4Z	Structure of apo OmoMYC	X-ray Diffraction	1.95 Å
5I50	OmoMYC bound to double-stranded DNA	X-ray Diffraction	2.701 Å
6C4U	Engineered FHA with Myc-pTBD peptide	X-ray Diffraction	2.6 Å
6E16	Ternary structure of c-Myc-TBP-TAF1	X-ray Diffraction	2.4 Å
6E24	Ternary structure of c-Myc-TBP-TAF1	X-ray Diffraction	3.001 Å
6G6J	Human MYC:MAX bHLHZip complex	X-ray Diffraction	2.25 Å
6G6K	Human MYC:MAX bHLHZip complex	X-ray Diffraction	1.35 Å
6G6L	Human MYC:MAX bHLHZip complex	X-ray Diffraction	2.2 Å
7T1Y	Fbw7-Skp1-MycC degron complex	X-ray Diffraction	2.55 Å
7T1Z	Fbw7-Skp1-MycN degron complex	X-ray Diffraction	2.77 Å
8J2Q	Cypovirus polyhedra mutant fused with c-Myc fragment	X-ray Diffraction	1.92 Å
8OTS	OCT4 and MYC-MAX co-bound to nucleosome	Cryo-EM	3.3 Å
8OTT	MYC-MAX bound to nucleosome at SHL+5.8	Cryo-EM	3.3 Å
8Q1N	Cyclic peptide binder of WDR5 WBM-site	X-ray Diffraction	1.843 Å
8WLG	Cypovirus polyhedra mutant fused with c-Myc fragment	X-ray Diffraction	2.55 Å
8X8S	Cypovirus polyhedra mutant fused with c-Myc fragment	X-ray Diffraction	2.04 Å
8X8V	Cypovirus polyhedra mutant fused with c-Myc fragment	X-ray Diffraction	2.0 Å
9QNH	Myc pS294 phosphopeptide binding to 14-3-3σ	X-ray Diffraction	1.3 Å

MYC Gene Complete Identifier and Functional Mapping Reference

Executive summary

Gene identifiers

Transcript identifiers

Protein identifiers

Structure

Experimental Structures

Predicted Structures

Cross-species orthologs

Clinical variants & AI predictions

ClinVar: Variant Classification (46 variants)

ClinVar: Top Pathogenic/Likely Pathogenic Variants

AlphaMissense: Likely Pathogenic Predictions (100+ variants)

SpliceAI: Splice Effect Predictions (522 variants)

Pathways & Gene Ontology

Reactome Pathways

MSigDB Gene Sets

Gene Ontology Annotations

Biological Process (BP): 59 terms

Molecular Function (MF): 15 terms

Cellular Component (CC): 12 terms

Protein interactions & networks

Protein-protein Interactions

Protein Similarity

Transcription factor regulatory data

Downstream targets

DNA binding motifs (JASPAR)

Upstream regulators

Drug & pharmacology data

Top molecules by development phase

Pharmacogenomics & dosing

Expression profiles

Tissue Expression (Bgee)

Single-Cell Expression (SCXA - Single Cell Expression Atlas)

Disease associations

Mendelian / Monogenic Diseases

Phenotype Associations (HPO Terms)

Complex-Disease / GWAS Associations (Top 30)

Organism	Gene ID	Symbol
Mouse (Mus musculus)	ENSMUSG00000022346	Myc
Rat (Rattus norvegicus)	ENSRNOG00000004500	Myc
Zebrafish (Danio rerio)	ENSDARG00000007241	mycb
Zebrafish (Danio rerio)	ENSDARG00000045695	myca
Fruit fly (Drosophila melanogaster)	FBGN0262656	Myc
Worm (C. elegans)	none	—
Yeast (S. cerevisiae)	none	—

Classification	Count
Pathogenic	4
Likely Pathogenic	1
Uncertain Significance (VUS)	11
Benign	7
Likely Benign	3
Other/Unclassified	20

Variant ID	HGVS	Type	Classification
12576	NM_002467.6(MYC):c.162G>C (p.Glu54Asp)	SNV	Pathogenic
12574	NM_002467.6(MYC):c.214C>T (p.Pro72Ser)	SNV	Pathogenic
12575	NM_002467.6(MYC):c.302A>C (p.Asn101Thr)	SNV	Pathogenic
2658815	NM_002467.6(MYC):c.217A>G (p.Thr73Ala)	SNV	Pathogenic
810340	chr8:128750494-128753204x3	CNV	Likely Pathogenic

Variant	Protein Change	Score	Class
8:127738410:T:A	W50R	1.000	likely_pathogenic
8:127738411:G:C	W50S	1.000	likely_pathogenic
8:127738412:G:C	W50C	1.000	likely_pathogenic
8:127738412:G:T	W50C	1.000	likely_pathogenic
8:127738410:T:G	W50G	0.999	likely_pathogenic
8:127738405:A:C	D48A	0.998	likely_pathogenic
8:127738405:A:G	D48G	0.996	likely_pathogenic
8:127738408:T:A	I49N	0.996	likely_pathogenic
8:127738400:C:A	S46R	0.996	likely_pathogenic
8:127738400:C:G	S46R	0.996	likely_pathogenic
8:127738401:G:A	E47K	0.982	likely_pathogenic
8:127738395:C:T	P45S	0.980	likely_pathogenic
8:127738396:C:G	P45R	0.980	likely_pathogenic
8:127738402:A:T	E47V	0.992	likely_pathogenic
8:127738404:G:C	D48H	0.998	likely_pathogenic
8:127738407:A:T	I49F	0.968	likely_pathogenic
8:127738397:G:A	S46A	0.904	likely_pathogenic
8:127738313:C:A	D17E	0.781	likely_pathogenic
8:127738312:A:C	D17A	0.914	likely_pathogenic
8:127738312:A:T	D17V	0.932	likely_pathogenic
8:127738311:G:C	D17H	0.937	likely_pathogenic
8:127738311:G:T	D17Y	0.869	likely_pathogenic
8:127738323:C:T	P21S	0.666	likely_pathogenic
8:127738324:C:T	P21L	0.799	likely_pathogenic
8:127738330:T:C	F23S	0.953	likely_pathogenic
8:127738329:T:C	F23L	0.972	likely_pathogenic
8:127738395:C:A	P45T	0.972	likely_pathogenic
8:127738393:C:A	A44E	0.957	likely_pathogenic
8:127738305:G:C	D15H	0.739	likely_pathogenic
8:127738312:A:G	D17G	0.867	likely_pathogenic

Variant	Gene	Effect	Score
8:127736622:AGGT:A	MYC	donor_loss	0.98
8:127736623:GGT:G	MYC	donor_loss	0.98
8:127736625:T:G	MYC	donor_loss	0.98
8:127736624:GTAA:G	MYC	donor_loss	0.98
8:127736610:T:TA	MYC	donor_gain	0.97
8:127736611:A:AA	MYC	donor_gain	0.97
8:127736620:CCAG:C	MYC	donor_gain	0.97
8:127736196:GC:G	MYC	donor_gain	0.77
8:127737196:G:GT	MYC	donor_gain	0.92
8:127737196:GAA:G	MYC	donor_gain	0.73
8:127737546:G:GT	MYC	donor_gain	0.82
8:127737547:G:T	MYC	donor_gain	0.85
8:127737641:G:GT	MYC	donor_gain	0.82
8:127737654:C:T	MYC	donor_gain	0.81
8:127737659:G:GT	MYC	donor_gain	0.89
8:127737659:GGAGT:G	MYC	donor_gain	0.72
8:127737660:G:GT	MYC	donor_gain	0.88
8:127737661:A:T	MYC	donor_gain	0.90
8:127736612:G:GG	MYC	donor_gain	0.85
8:127736619:ACCAG:A	MYC	donor_gain	0.93
8:127736621:CAG:C	MYC	donor_gain	0.87
8:127736622:AG:A	MYC	donor_gain	0.82
8:127736623:GG:G	MYC	donor_gain	0.82
8:127736624:G:GG	MYC	donor_gain	0.96
8:127737492:G:T	MYC	donor_gain	0.70
8:127737492:G:GT	MYC	donor_gain	0.64
8:127737533:ATTG:A	MYC	donor_gain	0.63
8:127737497:ATCGC:A	MYC	donor_gain	0.59
8:127737538:G:GG	MYC	donor_gain	0.50
8:127737537:A:AG	MYC	donor_gain	0.49

Pathway ID	Pathway Name
R-HSA-1362277	Transcription of E2F targets under negative control by DREAM complex
R-HSA-201556	Signaling by ALK
R-HSA-2122947	NOTCH1 Intracellular Domain Regulates Transcription
R-HSA-2173796	SMAD2/SMAD3:SMAD4 heterotrimer regulates transcription
R-HSA-2644606	Constitutive Signaling by NOTCH1 PEST Domain Mutants
R-HSA-2894862	Constitutive Signaling by NOTCH1 HD+PEST Domain Mutants
R-HSA-4411364	Binding of TCF/LEF:CTNNB1 to target gene promoters
R-HSA-5687128	MAPK6/MAPK4 signaling
R-HSA-5689880	Ub-specific processing proteases
R-HSA-6785807	Interleukin-4 and Interleukin-13 signaling
R-HSA-69202	Cyclin E associated events during G1/S transition
R-HSA-69656	Cyclin A:Cdk2-associated events at S phase entry
R-HSA-8866911	TFAP2 (AP-2) family regulates transcription of cell cycle factors
R-HSA-8951430	RUNX3 regulates WNT signaling
R-HSA-9018519	Estrogen-dependent gene expression
R-HSA-9616222	Transcriptional regulation of granulopoiesis
R-HSA-9764562	Regulation of CDH1 mRNA translation by microRNAs
R-HSA-9818749	Regulation of NFE2L2 gene expression
R-HSA-9909649	Regulation of PD-L1(CD274) transcription

Rank	GO ID	Term
1	GO:0045944	Positive regulation of transcription by RNA polymerase II
2	GO:0006357	Regulation of transcription by RNA polymerase II
3	GO:0010468	Regulation of gene expression
4	GO:0000122	Negative regulation of transcription by RNA polymerase II
5	GO:0010628	Positive regulation of gene expression
6	GO:0051276	Chromosome organization
7	GO:0008284	Positive regulation of cell population proliferation
8	GO:0006974	DNA damage response
9	GO:0006338	Chromatin remodeling
10	GO:0006366	Transcription by RNA polymerase II
11	GO:0000165	MAPK cascade
12	GO:0016055	Wnt signaling pathway
13	GO:0043066	Negative regulation of apoptotic process
14	GO:0008630	Intrinsic apoptotic signaling pathway in response to DNA damage
15	GO:0032206	Positive regulation of telomere maintenance
16	GO:0000082	G1/S transition of mitotic cell cycle
17	GO:0010564	Regulation of cell cycle process
18	GO:0045893	Positive regulation of DNA-templated transcription
19	GO:0060261	Positive regulation of transcription initiation by RNA polymerase II
20	GO:0070848	Response to growth factor

Rank	GO ID	Term
1	GO:0003677	DNA binding
2	GO:0000981	DNA-binding transcription factor activity, RNA polymerase II-specific
3	GO:0001228	DNA-binding transcription activator activity, RNA polymerase II-specific
4	GO:0000978	RNA polymerase II cis-regulatory region sequence-specific DNA binding
5	GO:0001227	DNA-binding transcription repressor activity, RNA polymerase II-specific
6	GO:0070888	E-box binding
7	GO:0001221	Transcription coregulator binding
8	GO:0046983	Protein dimerization activity
9	GO:0042802	Identical protein binding
10	GO:0140537	Transcription regulator activator activity
11	GO:0140297	DNA-binding transcription factor binding
12	GO:0044877	Protein-containing complex binding
13	GO:0001046	Core promoter sequence-specific DNA binding
14	GO:0031625	Ubiquitin protein ligase binding
15	GO:1905761	SCF ubiquitin ligase complex binding

Rank	GO ID	Term
1	GO:0005634	Nucleus
2	GO:0005654	Nucleoplasm
3	GO:0071943	Myc-Max complex
4	GO:0000785	Chromatin
5	GO:0016604	Nuclear body
6	GO:0032991	Protein-containing complex
7	GO:0005730	Nucleolus
8	GO:0005819	Spindle
9	GO:0000791	Euchromatin
10	GO:0090571	RNA polymerase II transcription repressor complex
11	GO:0048471	Perinuclear region of cytoplasm
12	GO:0030424	Axon

Rank	Gene	Protein Name	STRING Score	Score
1	TP53	Tumor protein p53	P04637	997
2	BRCA1	Breast cancer type 1 susceptibility	P38398	995
3	EP300	Histone acetyltransferase p300	Q09472	994
4	TRRAP	Transformation/transcription domain-associated	Q9Y4A5	993
5	CDCA7L	Cell division cycle-associated 7-like	Q96GN5	991
6	E2F1	Transcription factor E2F1	Q01094	990
7	BIN1	Myc box-dependent-interacting protein 1	O00499	989
8	MAX	Protein max	P25912	987
9	ZNF346	Zinc finger protein 346	Q9UL40	986
10	SMAD3	SMAD family member 3	P84022	983
11	DNMT3A	DNA (cytosine-5)-methyltransferase 3A	Q9Y6K1	983
12	HIF1A	Hypoxia-inducible factor 1-alpha	Q16665	982
13	IGF2BP1	Insulin-like growth factor 2 mRNA-binding 1	Q9NZI8	980
14	ZBTB17	Zinc finger BTB domain-containing 17	Q13105	976
15	AMOTL1	Angiomotin-like protein 1	Q8IY63	972
16	KAT2B	Histone acetyltransferase KAT2B	Q92831	972
17	KAT2A	Histone acetyltransferase KAT2A	Q92830	963
18	HDAC1	Histone deacetylase 1	Q13547	953
19	CCND1	Cyclin D1	P24385	951
20	CDKN1A	Cyclin-dependent kinase inhibitor 1	P38936	945
21	POU5F1	POU domain class 5 transcription factor 1	P31359	944
22	YY1	Yin and Yang 1 transcriptional repressor	P25490	943
23	KLF4	Kruppel-like factor 4	P78338	942
24	EZH2	Histone-lysine N-methyltransferase EZH2	Q15910	940
25	HSP90AA1	Heat shock protein HSP90-alpha	P07900	939
26	BCL2	Apoptosis regulator BCL-2	P10415	938
27	CTNNB1	Catenin beta-1	P35222	937
28	CUL1	Cullin-1	Q13616	935
29	FBXW7	F-box/WD repeat-containing protein 7	Q969H0	935
30	SOX2	Transcription factor SOX-2	P48431	933

Rank	UniProt ID	Protein	Top Similarity	Avg Similarity
1	A1YG22	MYC homolog	0.9999	0.9947
2	A2T7L5	MYC homolog	0.9999	0.9948
3	B8XIA5	MYC homolog	0.9999	0.9948
4	G3V909	MYC homolog	0.9998	0.9836
5	O09015	MYC homolog	0.9998	0.9862
6	P01102	N-myc proto-oncogene (MYCN)	0.9943	0.9867
7	P01108	MYCL proto-oncogene	0.9999	0.9942
8	P01109	MYC homolog	0.9996	0.9903
9	P01110	MYC homolog	0.9999	0.9923
10	P06171	MYC homolog	0.9995	0.9932
11	P06295	MYC homolog	0.9999	0.9923
12	P06646	MYC homolog	0.9990	0.9927
13	P09416	MYC homolog	0.9999	0.9942
14	P0C0N8	MYC homolog	1.0000	0.9921
15	P0C0N9	MYC homolog	1.0000	0.9921
16	P10166	MYC homolog	0.9993	0.9808
17	P10395	MYC homolog	0.9998	0.9918
18	P12523	MYC homolog	0.9998	0.9919
19	P12525	MYC homolog	0.9941	0.9858
20	P15171	MYC homolog	0.9995	0.9933

Rank	UniProt ID	Protein	Identity %	Bit Score
1	P68271	MYC homolog	100.00	796
2	P68272	MYC homolog	100.00	796
3	P61244	Protein MAX (co-factor)	100.00	290
4	P61245	MYC homolog	100.00	290
5	P23583	MYC homolog	99.80	815
6	A1YG22	MYC homolog	99.80	794
7	P01108	MYCL proto-oncogene	93.80	770
8	P01109	MYC homolog	98.80	662
9	P01110	MYC homolog	96.90	662
10	P03966	MYC homolog	95.20	757
11	P04198	MYC homolog	86.90	681
12	P06171	MYC homolog	93.60	676
13	P06295	MYC homolog	98.60	670
14	P06646	MYC homolog	73.30	477
15	P09416	MYC homolog	93.80	778
16	P0C0N8	MYC homolog	96.40	665
17	P0C0N9	MYC homolog	96.40	660
18	P10166	MYC homolog	89.10	600
19	P10395	MYC homolog	99.50	672
20	P12523	MYC homolog	99.50	672

Target	Regulation	Evidence/Confidence
CDKN1A	Activation	High (36 PubMed refs)
CDKN1B	Repression	High (22 PubMed refs)
CDKN2A	Activation	High (16 PubMed refs)
TP53	Activation	High (14 PubMed refs)
BRCA1	Activation	High (8 PubMed refs)
CCND1	Repression	High (12 PubMed refs)
CCND2	Activation	High (11 PubMed refs)
ENO1	Activation	High (5 PubMed refs)
LDHA	Activation	High (6 PubMed refs)
CDK4	Activation	High (9 PubMed refs)
CCNA2	Unknown	High (7 PubMed refs)
RCC1	Unknown	High (4 PubMed refs)
TOP1	Unknown	High (1 PubMed ref)
GAPDH	Unknown	High (1 PubMed ref)
CDK2	Unknown	High (3 PubMed refs)
TRRAP	Unknown	High (4 PubMed refs)
PRMT1	Unknown	High (1 PubMed ref)
EZH2	Unknown	High (2 PubMed refs)
BIN1	Unknown	High (3 PubMed refs)
BMI1	Activation	High (4 PubMed refs)
BRCA1	Activation	High (8 PubMed refs)
HSPD1	Activation	Low (6 PubMed refs)
RAD51	Activation	Low (2 PubMed refs)
CDK6	Activation	Low (3 PubMed refs)
RUVBL2	Activation	Low (1 PubMed ref)
PCNA	Unknown	Low (6 PubMed refs)
BAX	Unknown	Low (3 PubMed refs)
PTEN	Unknown	Low (3 PubMed refs)
SLC2A1	Activation	Curated (1 PubMed ref)
PKM	Unknown	Curated

Motif ID	Name	Collection	Family	Species
MA0059.1	MAX::MYC	CORE	bHLH-ZIP	Homo sapiens
MA0059.2	MAX::MYC	CORE	bHLH-ZIP	Homo sapiens
MA0147.3	MYC	CORE	bHLH-ZIP	Homo sapiens
MA0147.4	MYC	CORE	bHLH-ZIP	Homo sapiens

Rank	Tissue	Expression Score	Quality
1	Upper leg skin	99.12	Gold
2	Vena cava	99.08	Gold
3	Left uterine tube	98.80	Gold
4	Skin of abdomen	98.56	Gold
5	Mucosa of stomach	98.26	Gold
6	Penis	98.18	Gold
7	Peritoneum	97.68	Gold
8	Omental fat pad	97.68	Gold
9	Mucosa of urinary bladder	97.59	Gold
10	Skin of leg	97.54	Gold
11	Pericardium	97.34	Gold
12	Gall bladder	97.31	Gold
13	Adipose tissue (abdominal region)	96.85	Gold
14	Zone of skin	96.75	Gold
15	Cartilage tissue	96.67	Gold
16	Mammary duct	96.66	Gold
17	Nipple	96.19	Gold
18	Right ovary	96.08	Gold
19	Left ovary	95.95	Gold
20	Esophagus mucosa	95.81	Gold
21	Epithelium of mammary gland	95.65	Gold
22	Parotid gland	95.62	Gold
23	Trachea	95.53	Gold
24	Synovial joint	95.20	Gold
25	Skin of hip	95.18	Gold
26	Pharyngeal mucosa	94.93	Gold
27	Subcutaneous adipose tissue	94.77	Gold
28	Mammalian vulva	94.76	Gold
29	Body of pancreas	94.68	Gold
30	Vermiform appendix	94.67	Gold

Experiment ID	Dataset Description	Cell Count	Species
E-HCAD-6	CD34+ cells from bone marrow (hematopoiesis)	34,596	Homo sapiens
E-MTAB-6701	Fetal-maternal interface (first trimester)	135,071	Homo sapiens
E-MTAB-9841	Mammary epithelial cells (lactating vs non-lactating)	92,071	Homo sapiens
E-HCAD-29	GM-CSF-producing T helper cells	78,686	Homo sapiens
E-MTAB-10885	Milk tissue vs non-lactating breast	28,628	Homo sapiens
E-MTAB-8205	hPSC-derived endothelial-to-hematopoietic transition	25,764	Homo sapiens
E-MTAB-9067	Fetal liver and bone marrow hematopoiesis	5,865	Homo sapiens
E-MTAB-6524	Induced pluripotent stem cells (iPSCs)	10,926	Homo sapiens
E-MTAB-8271	Haemogenic endothelium (hPSC differentiation)	2,784	Homo sapiens
E-MTAB-8884	Chronic myelomonocytic leukemia stem cells	9,386	Homo sapiens
E-GEOD-75688	Primary breast cancer cells & metastases	549	Homo sapiens
E-CURD-6	Bone marrow with aneuploid cells	1,024	Homo sapiens
E-ENAD-20	PDX model (BRAF/MEK inhibitor response)	674	Homo sapiens
E-MTAB-8911	Chronic GVHD T-lymphocytes	19,075	Homo sapiens

Disease	ID	Inheritance	Evidence
Burkitt lymphoma	MONDO:0007243 / Orphanet:543 / OMIM:113970	Unknown	Curated (GenCC, ClinVar)
Classic Hodgkin lymphoma	MONDO:0009348 / Orphanet:391	Unknown	ClinVar
Cholesteatoma of middle ear	MONDO:0006533	Unknown	ClinVar
Primary ovarian failure	MONDO:0005387	Unknown	ClinVar

HPO ID	Phenotype
HP:0030080	Burkitt lymphoma
HP:0005561	Abnormal bone marrow cell morphology
HP:0002733	Abnormal lymph node morphology
HP:0002721	Immunodeficiency
HP:0025435	Increased circulating lactate dehydrogenase concentration
HP:0032218	Decreased CD4+ T cell proportion
HP:0001442	Typified by somatic mosaicism
HP:0005214	Intestinal obstruction
HP:0002239	Gastrointestinal hemorrhage
HP:0002149	Hyperuricemia
HP:0100649	Neoplasm of the oral cavity
HP:0002027	Abdominal pain
HP:0002017	Nausea and vomiting
HP:0001743	Abnormality of the spleen
HP:0001732	Abnormality of the pancreas
HP:0001392	Abnormality of the liver
HP:0000137	Abnormality of the ovary
HP:0003745	Sporadic

Trait/Disease	Associated Genes	p-value	Study
Non-glioblastoma glioma	CCDC26	8.0e-36	GCST005933_9
Non-glioblastoma glioma	CCDC26	2.0e-65	GCST005933_10
Colorectal cancer or advanced adenoma	PCAT1, CASC8, POU5F1B, CCAT2	3.0e-64	GCST007856_84
Colorectal cancer or advanced adenoma	POU5F1B, PCAT1, CASC8	2.0e-56	GCST007856_6
Glioma	CCDC26	1.0e-39	GCST005931_16
Asthma (childhood onset)	MYC - PVT1	1.0e-23	GCST007800_55
Prostate-specific antigen levels	PCAT1, CASC8, POU5F1B	7.0e-21	GCST004093_26
Bladder cancer	CASC11	2.0e-18	GCST000842_5
Prostate cancer	CASC8	6.0e-18	GCST002421_6
Nonsyndromic cleft lip with cleft palate	CCDC26	8.0e-16	GCST004166_56
Asthma	MYC - PVT1	3.0e-15	GCST010043_160
Breast cancer	PVT1	1.0e-13	GCST004988_564
Endometrial cancer	LINC00824 - CCDC26	8.0e-14	GCST006464_13
Colorectal cancer	PCAT1, CASC8, POU5F1B, CCAT2	4.0e-14	GCST003017_15
Glioma	CCDC26	1.0e-14	GCST005931_2
Allergic disease (asthma, hay fever or eczema)	PVT1	6.0e-14	GCST005038_17
Asthma	MYC - PVT1	1.0e-14	GCST010042_34
Endometrial endometrioid carcinoma	LINC00824 - CCDC26	1.0e-13	GCST003525_2
Diffuse large B cell lymphoma	PVT1	1.0e-12	GCST002636_5
Urinary bladder cancer	CASC11	9.0e-12	GCST000231_2
Colorectal cancer	PCAT1, CASC8, POU5F1B	8.0e-13	GCST002411_9
Ovarian cancer	LINC00824	3.0e-12	GCST001941_3
Urinary bladder cancer	CASC11	7.0e-12	GCST000639_1
Renal cell carcinoma	PVT1	5.0e-11	GCST002273_1
Breast cancer	PVT1	3.0e-11	GCST001937_34
Colorectal cancer	PCAT1, CASC8, POU5F1B	1.0e-11	GCST001787_1
Diffuse large B cell lymphoma	RN7SKP226 - LINC00824	4.0e-11	GCST002636_6
Asthma	MYC - PVT1	2.0e-11	GCST007798_112
Prostate cancer (early onset)	PCAT1, CASC8, POU5F1B	9.0e-09	GCST002413_8
Multiple sclerosis	PVT1	8.0e-09	GCST001198_62