MYC Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human MYC. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human MYC. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 36 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, biogrid_interaction, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, interpro, jaspar, mim, orphanet, ortholog, pdb, pfam, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uberon, uniprot
Generated: 2026-03-26 — For the latest data, query BioBTree directly via MCP or API.
View API calls (48)
MYC

COMPREHENSIVE CROSS-DATABASE IDENTIFIER AND FUNCTIONAL MAPPING REFERENCE Human MYC (c-Myc Proto-Oncogene)

Section 1: Gene Identifiers Primary Gene Identifiers

DatabaseIdentifierNotes
HGNC IDHGNC:7553Approved
HGNC SymbolMYCApproved symbol
Full NameMYC proto-oncogene, bHLH transcription factor
Ensembl GeneENSG00000136997GRCh38
NCBI Entrez Gene4609protein-coding
OMIM190080Gene/Locus
Locus Typegene with protein product
Locus Groupprotein-coding gene
StatusApproved
Genomic Location (GRCh38/hg38)
AttributeValue
Chromosome8
Cytogenetic Band8q24.21
Start Position127,735,434
End Position127,742,951
Strand+ (forward)
Gene Length7,518 bp
Biotypeprotein_coding
Gene Aliases & Previous Names
- Aliases- Previous Name- Gene GroupEnsembl Transcripts (Total
c-Myc, bHLHe39, MYCCv-myc avian myelocytomatosis viral oncogene homologBasic helix-loop-helix proteins
Transcript Identifiers**15)
Transcript IDBiotypeStartEndStrand
ENST00000259523protein_coding127,735,434127,740,477+
ENST00000377970protein_coding127,736,084127,741,434+
ENST00000517291protein_coding127,735,519127,738,772+
ENST00000520751protein_coding_CDS_not_defined127,736,231127,738,475+
ENST00000524013protein_coding127,736,220127,741,372+
ENST00000621592protein_coding127,736,231127,742,951+
ENST00000641036protein_coding127,736,046127,736,612+
ENST00000641252protein_coding127,735,473127,735,817+
ENST00000651626protein_coding127,735,659127,741,434+
ENST00000652288protein_coding127,737,621127,741,434+
ENST00000707113protein_coding127,736,958127,741,434+
ENST00000707114protein_coding127,737,724127,741,434+
ENST00000707115protein_coding127,737,737127,741,434+
ENST00000707116protein_coding127,737,745127,741,434+
ENST00000707117protein_coding_CDS_not_defined127,739,073127,741,434+
RefSeq Transcripts (Human)
AccessionTypeChromosomeStatusMANE Select
NM_002467mRNA8REVIEWEDYes (Canonical)
NM_001354870mRNA8REVIEWEDNo
RefSeq Proteins (Human)
AccessionChromosomeStatusMANE Select
NP_0024588REVIEWEDYes (Canonical)
NP_0013417998REVIEWEDNo
CCDS Identifiers (Total: 2)
CCDS ID
CCDS6359
CCDS87627
Canonical Transcript Exons (ENST00000259523 - 3 exons)
Exon IDStartEndLengthStrand
ENSE00001848036127,735,434127,736,6231,190 bp+
ENSE00003540079127,738,248127,739,019772 bp+
ENSE00001766895127,740,398127,740,47780 bp+
Total Exon Count (canonical)UniProt Accessions (Total
3
Protein Identifiers**6)
AccessionNameStatusLengthMass
P01106Myc proto-oncogene proteinReviewed (Canonical)454 aa50,565 Da
Q14899Unreviewed--
Q16591Unreviewed--
A0A0B4J1R1Unreviewed--
A0A494C1T8Unreviewed--
H0YBG3Unreviewed--
Canonical Protein Details (P01106)
AttributeValue
Primary NameMyc proto-oncogene protein
Alternative NamesClass E basic helix-loop-helix protein 39; Proto-oncogene c-Myc; Transcription factor p64
Length454 amino acids
Molecular Mass50,565 Da
Protein Domains and Families (InterPro)
InterPro IDNameType
IPR002418Tscrpt_reg_MycFamily
IPR003327Myc-LZDomain
IPR011598bHLH_domDomain
IPR012682Tscrpt_reg_Myc_NDomain
IPR036638HLH_DNA-bd_sfHomologous_superfamily
IPR050433Myc_transcription_factorsFamily
Pfam Domains (Total: 3)
Pfam IDDescription
PF00010Helix-loop-helix DNA-binding domain
PF01056Myc amino-terminal region
PF02344Myc-type, basic helix-loop-helix leucine zipper

Section 4: Structure Identifiers Experimental Structures (PDB) - Total: 25

PDB IDTitleMethodResolution
6G6KHuman MYC:MAX bHLHZip complexX-RAY DIFFRACTION1.35 Å
1NKPMyc-Max recognizing DNAX-RAY DIFFRACTION1.8 Å
4Y7RWDR5 with MYC MbIIIb peptideX-RAY DIFFRACTION1.898 Å
8Q1NCyclic peptide binder of WDR5X-RAY DIFFRACTION1.843 Å
8J2QCypovirus Polyhedra with c-MycX-RAY DIFFRACTION1.92 Å
5I4ZApo OmoMYCX-RAY DIFFRACTION1.95 Å
8X8VCypovirus Polyhedra with c-MycX-RAY DIFFRACTION2.0 Å
8X8SCypovirus Polyhedra with c-MycX-RAY DIFFRACTION2.04 Å
1EE4Karyopherin with c-Myc NLSX-RAY DIFFRACTION2.1 Å
6G6LHuman MYC:MAX bHLHZip complexX-RAY DIFFRACTION2.2 Å
6G6JHuman MYC:MAX bHLHZip complexX-RAY DIFFRACTION2.25 Å
6E16c-Myc-TBP-TAF1 ternaryX-RAY DIFFRACTION2.4 Å
7T1YFbw7-Skp1-MycCdegronX-RAY DIFFRACTION2.55 Å
8WLGCypovirus with c-Myc fragmentX-RAY DIFFRACTION2.55 Å
6C4UFHA with Myc-pTBD peptideX-RAY DIFFRACTION2.6 Å
5I50OmoMYC bound to DNAX-RAY DIFFRACTION2.701 Å
2OR9Anti-c-myc antibody 9E10 FabX-RAY DIFFRACTION2.7 Å
7T1ZFbw7-Skp1-MycNdegronX-RAY DIFFRACTION2.77 Å
6E24c-Myc-TBP-TAF1 ternaryX-RAY DIFFRACTION3.001 Å
9QNHMyc pS294 with 14-3-3sigmaX-RAY DIFFRACTION1.3 Å
1A93c-MYC-MAX leucine zipperSOLUTION NMR-
2A93c-MYC-MAX leucine zipper (40 structures)SOLUTION NMR-
1MV0BIN1 interaction with c-MycSOLUTION NMR-
8OTSOCT4 and MYC-MAX on nucleosomeELECTRON MICROSCOPY3.3 Å
8OTTMYC-MAX on nucleosome at SHL+5.8ELECTRON MICROSCOPY3.3 Å
Structure Summary:
  • X-ray crystallography: 20 structures
  • Solution NMR: 3 structures
  • Cryo-EM: 2 structures Predicted Structure (AlphaFold)
AttributeValue
AlphaFold IDP01106
Sequence Length3,551 (full entry)
Global pLDDT61.38
Fraction pLDDT Very High0.21 (21%)
Note
Low overall confidence reflects intrinsically disordered regions characteristic of transcription factors.
Cross-Species Orthologs**
Vertebrate Orthologs
OrganismGene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000022346Mycprotein_coding
Rat (Rattus norvegicus)ENSRNOG00000004500Mycprotein_coding
Zebrafish (Danio rerio)ENSDARG00000007241mycbprotein_coding
Zebrafish (Danio rerio)ENSDARG00000045695mycaprotein_coding
Invertebrate Orthologs
OrganismGene IDSymbol
Fruit fly (D. melanogaster)FBGN0262656Myc
Orthologs Not Found in Database
- Worm (C. elegans)- Yeast (S. cerevisiae)ClinVar Variants Summary (Total
No direct orthologNo direct ortholog
Clinical Variants & AI Predictions**46)
ClassificationCount
Pathogenic4
Likely Pathogenic1
Benign4
Likely Benign3
Benign/Likely Benign2
Uncertain Significance (VUS)10
Other/Not Classified22
Pathogenic/Likely Pathogenic Variants
ClinVar IDHGVSTypeClassification
12574NM_002467.6:c.214C>T (p.Pro72Ser)SNVPathogenic
12575NM_002467.6:c.302A>C (p.Asn101Thr)SNVPathogenic
12576NM_002467.6:c.162G>C (p.Glu54Asp)SNVPathogenic
2658815NM_002467.6:c.217A>G (p.Thr73Ala)SNVPathogenic
8103408q24.21 (128750494-128753204)x3CNV gainLikely Pathogenic
Variants of Uncertain Significance (VUS)
ClinVar IDHGVSType
1679939c.77A>G (p.Asn26Ser)SNV
2286886c.409G>C (p.Glu137Gln)SNV
2468497c.909C>G (p.Ser303Arg)SNV
2510151c.671C>T (p.Ala224Val)SNV
2658812c.30+4A>TSNV
3154445c.1228A>G (p.Ile410Val)SNV
3154448c.1297G>C (p.Asp433His)SNV
3297249c.544G>A (p.Gly182Ser)SNV
3400454c.814G>A (p.Glu272Lys)SNV
3915251c.688G>T (p.Ala230Ser)SNV
SpliceAI Predictions (Total: 522) Top 20 Predicted Splice-Altering Variants (by delta score)
Variant IDEffectScore
8:127736625:T:Gdonor_loss0.98
8:127736622:AGGT:Adonor_loss0.98
8:127736623:GGT:Gdonor_loss0.98
8:127736624:GTAA:Gdonor_loss0.98
8:127736610:T:TAdonor_gain0.97
8:127736611:A:AAdonor_gain0.97
8:127736620:CCAG:Cdonor_gain0.97
8:127736624:G:GGdonor_gain0.96
8:127736197:C:Gdonor_gain0.93
8:127736619:ACCAG:Adonor_gain0.93
8:127737196:G:GTdonor_gain0.92
8:127736621:CAG:Cdonor_gain0.87
8:127736612:G:GGdonor_gain0.85
8:127736622:AG:Adonor_gain0.82
8:127736623:GG:Gdonor_gain0.82
8:127736196:GC:Gdonor_gain0.77
8:127736633:G:GTdonor_gain0.75
8:127737196:GAA:Gdonor_gain0.73
8:127736197:C:CGdonor_gain0.70
8:127737198:A:Gdonor_gain0.68
AlphaMissense Predictions (Total: 2,886) Classification Distribution (sample of first 100)
  • Likely benign: ~70%
  • Ambiguous: ~20%
  • Likely pathogenic: ~10% Top 20 Predicted Pathogenic Missense Variants
Variant IDProtein ChangePathogenicity ScoreClass
8:127738305:G:CD15H0.739likely_pathogenic
8:127738299:G:CD13H0.664likely_pathogenic
8:127738300:A:TD13V0.650likely_pathogenic
8:127738306:A:TD15V0.650likely_pathogenic
8:127738300:A:CD13A0.572likely_pathogenic
8:127738305:G:TD15Y0.610likely_pathogenic

Section 7: Biological Pathways & Gene Ontology Reactome Pathways (Total: 19)

Pathway IDNameDisease?
R-HSA-1362277Transcription of E2F targets under negative control by DREAM complexNo
R-HSA-201556Signaling by ALKNo
R-HSA-2122947NOTCH1 Intracellular Domain Regulates TranscriptionNo
R-HSA-2173796SMAD2/SMAD3:SMAD4 heterotrimer regulates transcriptionNo
R-HSA-2644606Constitutive Signaling by NOTCH1 PEST Domain MutantsYes
R-HSA-2894862Constitutive Signaling by NOTCH1 HD+PEST Domain MutantsYes
R-HSA-4411364Binding of TCF/LEF:CTNNB1 to target gene promotersNo
R-HSA-5687128MAPK6/MAPK4 signalingNo
R-HSA-5689880Ub-specific processing proteasesNo
R-HSA-6785807Interleukin-4 and Interleukin-13 signalingNo
R-HSA-69202Cyclin E associated events during G1/S transitionNo
R-HSA-69656Cyclin A:Cdk2-associated events at S phase entryNo
R-HSA-8866911TFAP2 family regulates transcription of cell cycle factorsNo
R-HSA-8951430RUNX3 regulates WNT signalingNo
R-HSA-9018519Estrogen-dependent gene expressionNo
R-HSA-9616222Transcriptional regulation of granulopoiesisNo
R-HSA-9764562Regulation of CDH1 mRNA translation by microRNAsNo
R-HSA-9818749Regulation of NFE2L2 gene expressionNo
R-HSA-9909649Regulation of PD-L1(CD274) transcriptionNo
Gene Ontology Annotations (Total: 85) Molecular Function (20 terms)
GO IDTerm
GO:0000978RNA polymerase II cis-regulatory region sequence-specific DNA binding
GO:0000981DNA-binding transcription factor activity, RNA polymerase II-specific
GO:0001046core promoter sequence-specific DNA binding
GO:0001221transcription coregulator binding
GO:0001227DNA-binding transcription repressor activity, RNA polymerase II-specific
GO:0001228DNA-binding transcription activator activity, RNA polymerase II-specific
GO:0003677DNA binding
GO:0031625ubiquitin protein ligase binding
GO:0042802identical protein binding
GO:0044877protein-containing complex binding
GO:0046983protein dimerization activity
GO:0070888E-box binding
GO:0140297DNA-binding transcription factor binding
GO:0140537transcription regulator activator activity
GO:1905761SCF ubiquitin ligase complex binding
Biological Process (Top 20 of 53 terms)
GO IDTerm
GO:0000082G1/S transition of mitotic cell cycle
GO:0000122negative regulation of transcription by RNA polymerase II
GO:0000165MAPK cascade
GO:0006338chromatin remodeling
GO:0006357regulation of transcription by RNA polymerase II
GO:0006366transcription by RNA polymerase II
GO:0006974DNA damage response
GO:0008284positive regulation of cell population proliferation
GO:0008630intrinsic apoptotic signaling pathway in response to DNA damage
GO:0010468regulation of gene expression
GO:0010628positive regulation of gene expression
GO:0016055Wnt signaling pathway
GO:0016072rRNA metabolic process
GO:0032204regulation of telomere maintenance
GO:0043066negative regulation of apoptotic process
GO:0045893positive regulation of DNA-templated transcription
GO:0045944positive regulation of transcription by RNA polymerase II
GO:0051276chromosome organization
GO:0071456cellular response to hypoxia
GO:1902255positive regulation of intrinsic apoptotic signaling pathway by p53 class mediator
Cellular Component (12 terms)
GO IDTerm
GO:0000785chromatin
GO:0000791euchromatin
GO:0005634nucleus
GO:0005654nucleoplasm
GO:0005730nucleolus
GO:0005819spindle
GO:0016604nuclear body
GO:0030424axon
GO:0032991protein-containing complex
GO:0048471perinuclear region of cytoplasm
GO:0071943Myc-Max complex
GO:0090571RNA polymerase II transcription repressor complex

Section 8: Protein Interactions & Molecular Networks STRING Protein-Protein Interactions

  • Total Interactions: 14,230+
  • Highest Confidence Interactions (score ≥ 900):
UniProt BScoreProtein Name
P01106997MYC (self)
P04637997TP53
P38398995BRCA1
Q09472994EP300
Q9Y4A5993TRRAP
Q96GN5991USP28
Q01094990E2F1
O00499989BIN1
P25912987HIC1
Q9UL40986ZNF131
P84022983SMAD3
Q9Y6K1983DNMT3A
Q16665982HIF1A
Q9NZI8980IGF2BP1
Q13105976ZBTB17
BioGRID Interactions
  • Total Interactions: 5,810+
  • Experimental Methods: Affinity Capture-MS, Affinity Capture-Western, Two-hybrid, Reconstituted Complex, Biochemical Activity, Synthetic Lethality Top Interactors (by evidence)
Gene SymbolInteraction Types
MAXAffinity Capture-Western, Affinity Capture-MS
FBXW7Affinity Capture-Western
USP28Affinity Capture-Western
USP37Affinity Capture-Western
USP36Affinity Capture-Western
CREBBPAffinity Capture-Western, Reconstituted Complex
KAT2AAffinity Capture-Western, Reconstituted Complex
TRRAPReconstituted Complex
SKP2Affinity Capture-Western
HDAC1Affinity Capture-MS
Protein Structural Similarity (ESM2)
  • Total Similar Proteins: 63
  • Top 20 by Similarity Score:
UniProtTop SimilarityAvg Similarity
P011061.00000.9947
P235831.00000.9947
P682711.00000.9944
P682721.00000.9944
P0C0N81.00000.9921
P0C0N91.00000.9921
A1YG220.99990.9947
A2T7L50.99990.9948
B8XIA50.99990.9948
P011080.99990.9942
Protein Sequence Similarity (DIAMOND)
  • Total Homologs: 65
  • Top 20 by Identity:
UniProtIdentity %BitscoreDescription
P61244100.0290
P61245100.0290
P68271100.0796
P68272100.0796
A1YG2299.8794
P2358399.8815
Q6421099.1819
P2038999.1819
B8XIA599.1805
P0110699.1816

Section 9: Transcription Factor Regulatory Data MYC as Transcription Factor MYC is a bHLH-ZIP transcription factor that forms heterodimers with MAX to regulate gene expression. JASPAR DNA Binding Motifs (Total: 4)

Motif IDNameCollectionClassFamily
MA0147.4MYCCOREBasic helix-loop-helix factorsbHLH-ZIP
MA0147.3MYCCOREBasic helix-loop-helix factorsbHLH-ZIP
MA0059.2MAX::MYCCOREBasic helix-loop-helix factorsbHLH-ZIP
MA0059.1MAX::MYCCOREBasic helix-loop-helix factorsbHLH-ZIP
Downstream Targets (Genes Regulated BY MYC) From SIGNOR database (Total: 172+ interactions) Activated Targets (Top 30):
Target GeneMechanismScore
CCND1transcriptional regulation0.50
CDK4transcriptional regulation0.57
CDC25Atranscriptional regulation0.63
CCNA2transcriptional regulation0.42
CDK6transcriptional regulation0.45
CUL1transcriptional regulation0.48
DNMT3Abinding0.71
LDHAtranscriptional regulation0.56
HK2transcriptional regulation0.37
SLC2A1transcriptional regulation0.43
VEGFAtranscriptional regulation0.46
ENO1transcriptional regulation0.41
SIRT2transcriptional regulation0.47
Repressed Targets (Top 20):
Target GeneMechanismScore
CDKN1Atranscriptional regulation0.78
CDKN2Atranscriptional regulation0.77
CDKN2Btranscriptional regulation0.60
CDKN1Btranscriptional regulation0.53
SMAD20.68
SMAD3binding0.69
HLA-Atranscriptional regulation0.27
HLA-Btranscriptional regulation0.27
HLA-Ctranscriptional regulation0.26
DKK1transcriptional regulation0.39
SFRP1transcriptional regulation0.36
Upstream Regulators (TFs that Regulate MYC) From CollecTRI database (Total: 1,124+ regulatory relationships) Activators of MYC:
TFConfidence
E2F1High
CTNNB1 (β-catenin)High
CTCFHigh
DLX5High
HNF4AHigh
IKZF1High
NOTCH1(via SIGNOR)
STAT3(via SIGNOR)
LEF1High
FOXM1
ERG
Repressors of MYC:
TFConfidence
FOSHigh
JUNHigh
JUNDHigh
CEBPAHigh
CEBPBHigh
HOXB13High
IKZF3High
KLF11High
GLI1High
ENO1High
SMAD3/SMAD4(via SIGNOR)

Section 10: Drug & Pharmacology Data ChEMBL Targets (Total: 7)

Target IDNameType
CHEMBL1250348Myc proto-oncogene proteinSINGLE PROTEIN
CHEMBL3301395c-Myc/MaxPROTEIN COMPLEX
CHEMBL4106127c-Myc/c-MaxPROTEIN-PROTEIN INTERACTION
CHEMBL4296141VHL/Myc proto-oncogene proteinPROTEIN-PROTEIN INTERACTION
CHEMBL4888452MAP2K3/Myc proto-oncogene proteinPROTEIN-PROTEIN INTERACTION
CHEMBL5465202Cereblon/Myc proto-oncogene proteinPROTEIN-PROTEIN INTERACTION
CHEMBL5465553WDR5-MYCPROTEIN-PROTEIN INTERACTION
Targeting Molecules (from ChEMBL) Total Molecules: 100+ compounds tested Top Molecules by Development Phase:
ChEMBL IDNameTypeHighest Phase
CHEMBL165RESVERATROLSmall moleculePhase 3
CHEMBL4297458EZOBRESIBSmall moleculePhase 2
Note: MYC has historically been considered "undruggable" due to lack of deep binding pockets. Most compounds target MYC-MAX dimerization or upstream/downstream pathways. PharmGKB
AttributeValue
PharmGKB IDPA31353
SymbolMYC
VIP GeneYes
CPIC GuidelineNo
Chromosomechr8
Drug-Gene Interactions MYC is annotated as a Very Important Pharmacogene (VIP) in PharmGKB, indicating clinical relevance in drug response.

Section 11: Expression Profiles Bgee Expression Summary

AttributeValue
Gene IDENSG00000136997
SpeciesHomo sapiens
Expression BreadthUbiquitous
Total Present Calls256
Max Expression Score99.12
Tissue Expression (Top 50 tissues from Bgee via UBERON) MYC is expressed ubiquitously across human tissues. Top tissues include:
Tissue (UBERON)ID
SpleenUBERON:0002106
OvaryUBERON:0000992
Lymph nodeUBERON:0000029
Mammary glandUBERON:0001911
ColonUBERON:0001155
StomachUBERON:0000945
Small intestineUBERON:0000160
PancreasUBERON:0001264
UterusUBERON:0000995
Thyroid gland
Adrenal glandUBERON:0002369
Adipose tissueUBERON:0001013
SkinUBERON:0000014
Heart right ventricleUBERON:0002080
Lung (upper/lower lobes)UBERON:0008948/0008949
EsophagusUBERON:0001043
Liver
TongueUBERON:0001723
Fallopian tubeUBERON:0003889
Urinary bladderUBERON:0001255
Single-Cell Expression Data (Total: 14 datasets)
ExperimentDescriptionCells
E-HCAD-29GM-CSF-producing T helper cells78,686
E-MTAB-6701Fetal-maternal interface (10x)135,071
E-MTAB-9841Mammary epithelial cells92,071
E-HCAD-6CD34+ cells, bone marrow hematopoiesis34,596
E-MTAB-10885Human milk cells vs non-lactating breast28,628
E-MTAB-8205hPSC endothelial-to-haematopoietic transition25,764
E-MTAB-8911Graft-Versus-Host Disease T-lymphocytes19,075
E-MTAB-6524Human iPSCs10,926
E-MTAB-8884Chronic myelomonocytic leukemia stem cells9,386
E-MTAB-9067Foetal liver and bone marrow haematopoiesis5,865
E-MTAB-8271Haemogenic endothelium in hPSC differentiation2,784
E-CURD-6Human bone marrow, aneuploid cells1,024
E-ENAD-20Xenograft, BRAF/MEK inhibitor treatment674
E-GEOD-75688Primary breast cancer cells549

Section 12: Disease Associations Orphanet Disease Associations (Total: 3)

Orphanet IDDisease NameTypeGenes
543Burkitt lymphomaDisease1
480541High grade B-cell lymphoma with MYC and/or BCL2 and/or BCL6 rearrangementDisease3
99861Precursor T-cell acute lymphoblastic leukemiaDisease23
GenCC (Gene Curation Coalition)
GeneDiseaseClassification
MYCBurkitt lymphoma (OMIM:113970)No Known Disease Relationship*
Note: MYC translocations are a hallmark of Burkitt lymphoma; however, germline MYC mutations causing Mendelian disease have not been established. HPO Phenotype Associations (Total: 18)
HPO IDPhenotype
HP:0030080Burkitt lymphoma
HP:0002733Abnormal lymph node morphology
HP:0005561Abnormal bone marrow cell morphology
HP:0002721Immunodeficiency
HP:0000137Abnormality of the ovary
HP:0001392Abnormality of the liver
HP:0001732Abnormality of the pancreas
HP:0001743Abnormality of the spleen
HP:0002017Nausea and vomiting
HP:0002027Abdominal pain
HP:0002149Hyperuricemia
HP:0002239Gastrointestinal hemorrhage
HP:0005214Intestinal obstruction
HP:0025435Increased circulating lactate dehydrogenase
HP:0032218Decreased CD4+ T cell proportion
HP:0100649Neoplasm of the oral cavity
HP:0001442Typified by somatic mosaicism
HP:0003745Sporadic
GWAS Associations (Total: 51) Top 30 GWAS Associations by Significance:
Study IDTraitMapped Gene(s)P-value
GCST005933_10Non-glioblastoma gliomaCCDC262×10⁻⁶⁵
GCST007856_84Colorectal cancer or advanced adenomaPCAT1, CASC8, POU5F1B, CCAT23×10⁻⁶⁴
GCST007856_6Colorectal cancer or advanced adenomaPOU5F1B, PCAT1, CASC82×10⁻⁵⁶
GCST005931_16GliomaCCDC261×10⁻³⁹
GCST005933_9Non-glioblastoma gliomaCCDC268×10⁻³⁶
GCST007800_55Asthma (childhood onset)MYC - PVT11×10⁻²³
GCST004093_26Prostate-specific antigen levelsPCAT1, CASC8, POU5F1B7×10⁻²¹
GCST000842_5Bladder cancerCASC112×10⁻¹⁸
GCST002421_6Prostate cancerCASC86×10⁻¹⁸
GCST004166_56Nonsyndromic cleft lip with cleft palateCCDC268×10⁻¹⁶
GCST007010043_160AsthmaMYC - PVT13×10⁻¹⁵
GCST002919_8Colorectal cancerCASC8, PCAT1, POU5F1B6×10⁻¹⁵
GCST005931_15GliomaCCDC261×10⁻¹⁴
GCST010042_34AsthmaMYC - PVT11×10⁻¹⁴
GCST003017_15Colorectal cancerPCAT1, CASC8, POU5F1B, CCAT24×10⁻¹⁴
GCST004988_564Breast cancerPVT11×10⁻¹³
GCST006465_13Endometrial cancer (endometrioid)LINC00824 - CCDC261×10⁻¹³
GCST006464_13Endometrial cancerLINC00824 - CCDC268×10⁻¹⁴
GCST002411_9Colorectal cancerPCAT1, CASC8, POU5F1B8×10⁻¹³
GCST002636_5Diffuse large B cell lymphomaPVT11×10⁻¹²
Disease Categories with Strong GWAS Signal:
  • Cancer: Colorectal, Bladder, Breast, Prostate, Ovarian, Pancreatic, Endometrial, Glioma, Lymphoma, Renal cell carcinoma
  • Respiratory: Asthma, Allergic rhinitis, Allergic sensitization
  • Developmental: Cleft lip/palate
  • Other: Multiple sclerosis, Height SUMMARY STATISTICS
CategoryCount
Gene Identifiers6 databases
Ensembl Transcripts15
RefSeq Transcripts (human)2
CCDS IDs2
UniProt Accessions6
InterPro Domains6
Pfam Domains3
PDB Structures25
Vertebrate Orthologs4 species
ClinVar Variants46
SpliceAI Predictions522
AlphaMissense Predictions2,886
Reactome Pathways19
GO Annotations85
STRING Interactions14,230+
BioGRID Interactions5,810+
TF Target Genes (SIGNOR)172+
Upstream Regulators (CollecTRI)1,124+
JASPAR Binding Motifs4
ChEMBL Targets7
GWAS Associations51
HPO Phenotypes18
Orphanet Diseases3
Single-Cell Datasets14
Reference compiled from biobtree integrated database queries. Data current as of database version. I have compiled a comprehensive cross-database identifier and functional mapping reference for human MYC covering all 12 requested sections. Key highlights: Gene Summary:
  • MYC (HGNC:7553) encodes a bHLH-ZIP transcription factor at chromosome 8q24.21

  • Critical oncogene involved in cell cycle, proliferation, and apoptosis regulation

  • Forms heterodimers with MAX to regulate thousands of target genes Key Counts:

  • 15 Ensembl transcripts, 25 PDB structures, 6 UniProt entries

  • 46 ClinVar variants (4 pathogenic), 522 SpliceAI predictions, 2,886 AlphaMissense predictions

  • 14,230+ STRING interactions, 5,810+ BioGRID interactions

  • 172+ downstream target genes, 1,124+ upstream regulatory relationships

  • 51 GWAS associations linking to cancers (colorectal, bladder, breast, glioma, lymphoma) and other traits

  • Ubiquitous expression across 256 tissue/cell types Clinical Relevance:

  • Hallmark translocation gene in Burkitt lymphoma

  • Associated with multiple cancer types via GWAS

  • Designated as a Very Important Pharmacogene (VIP)

  • Historically considered “undruggable” but Phase 2/3 compounds exist (Ezobresib, Resveratrol)