ALK Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human ALK. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human ALK. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 34 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, biogrid_interaction, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, mim, mondo, orphanet, ortholog, pdb, pharmgkb_gene, reactome, refseq, scxa, spliceai, string_interaction, transcript, uniprot
Generated: 2026-03-26 — For the latest data, query BioBTree directly via MCP or API.
View API calls (45)
ALK

COMPREHENSIVE CROSS-DATABASE IDENTIFIER AND FUNCTIONAL MAPPING REFERENCE FOR HUMAN ALK

Section 1: Gene Identifiers

DatabaseIdentifierAdditional Info
HGNC IDHGNC:427Approved
Approved SymbolALK
Approved NameALK receptor tyrosine kinase
Ensembl GeneENSG00000171094
NCBI Entrez Gene238protein-coding
OMIM105590
Locus Typegene with protein product
Locus Groupprotein-coding gene
Aliases: CD246, ALK1 Previous Names: anaplastic lymphoma kinase (Ki-1), anaplastic lymphoma receptor tyrosine kinase Gene Groups: Receptor tyrosine kinases, CD molecules Genomic Location
AttributeValue
Chromosome2
Cytogenetic Band2p23.2-p23.1
Start Position29,192,774
End Position29,921,586
StrandMinus (-)
GenomeGRCh38 (homo_sapiens)
Gene Length728,813 bp

Section 2: Transcript Identifiers Ensembl Transcripts (Total: 7)

Transcript IDBiotypeStartEndStrand
ENST00000389048protein_coding29,192,77429,921,586-
ENST00000431873nonsense_mediated_decay29,192,79429,227,653-
ENST00000453137protein_coding29,220,87029,223,394-
ENST00000498037protein_coding_CDS_not_defined29,286,41029,319,211-
ENST00000618119protein_coding29,192,77729,717,700-
ENST00000638605protein_coding_CDS_not_defined29,193,20729,224,727-
ENST00000642122protein_coding29,192,77429,223,900-
RefSeq Transcripts (Human ALK)
AccessionTypeStatusMANE Select
NM_004304mRNAREVIEWED✓ YES (Canonical)
NM_001353765mRNAREVIEWEDNo
RefSeq Proteins (Human ALK)
AccessionStatusMANE Select
NP_004295REVIEWED✓ YES (Canonical)
NP_001340694REVIEWEDNo
CCDS Identifiers (Total: 2)
CCDS ID
CCDS33172
CCDS86828
Exons of Canonical Transcript ENST00000389048 (Total: 29)
Exon IDStartEndLength
ENSE0000179493729,919,99329,921,5861,594
ENSE0000131302329,717,57829,717,697120
ENSE0000129086529,694,85029,695,014165
ENSE0000131303329,531,91529,532,116202
ENSE0000125569329,383,73229,383,859128
ENSE0000125567329,328,35029,328,481132
ENSE0000125566829,320,75129,320,882132
ENSE0000358444129,318,30429,318,404101
ENSE0000363838229,296,88829,297,057170
ENSE0000125562529,275,09929,275,227129
ENSE0000125563529,275,40229,275,49695
ENSE0000132939929,251,10529,251,267163
ENSE0000115443329,239,68029,239,830151
ENSE0000115442829,233,56529,233,696132
ENSE0000115442629,232,30429,232,448145
ENSE0000125559129,228,88429,229,066183
ENSE0000115442029,227,57429,227,67299
ENSE0000115441729,226,92229,227,074153
ENSE0000115441129,225,46129,225,565105
ENSE0000115440729,223,34229,223,528187
ENSE0000115440329,222,51729,222,60791
ENSE0000115439929,222,34429,222,40865
ENSE0000115439529,220,70629,220,835130
ENSE0000115439029,213,98429,214,08198
ENSE0000115438129,209,78629,209,87893
ENSE0000168204329,207,17129,207,272102
ENSE0000115436729,197,54229,197,676135
ENSE0000115436129,196,77029,196,86091
ENSE0000114035429,192,77429,193,9221,149

Section 3: Protein Identifiers UniProt Accessions

AccessionNameStatusCanonical
Q9UM73ALK tyrosine kinase receptorReviewed (Swiss-Prot)✓ YES
Alternative Names: Anaplastic lymphoma kinase Protein Properties:
PropertyValue
Length1,620 amino acids
Mass176,442 Da
Protein Domains and Families (InterPro) - Total: 13
InterPro IDNameType
IPR000719Prot_kinase_domDomain
IPR000998MAM_domDomain
IPR001245Ser-Thr/Tyr_kinase_cat_domDomain
IPR002011Tyr_kinase_rcpt_2_CSConserved_site
IPR002172LDrepeatLR_classA_rptRepeat
IPR008266Tyr_kinase_ASActive_site
IPR011009Kinase-like_dom_sfHomologous_superfamily
IPR013320ConA-like_dom_sfHomologous_superfamily
IPR017441Protein_kinase_ATP_BSBinding_site
IPR020635Tyr_kinase_cat_domDomain
IPR036055LDL_receptor-like_sfHomologous_superfamily
IPR050122RTKFamily
IPR055163ALK/LTK-like_GRDDomain

Section 4: Structure Identifiers Experimental Structures (PDB) - Total: 79

PDB IDTitleMethodResolution (Å)
2KUPComplex of PTB domain of SNT-2 and HALK peptideNMR-
2KUQChimera of PTB domain of SNT-2 and HALK peptideNMR-
2XB7ALK in complex with NVP-TAE684X-ray2.50
2XBAALK in complex with PHA-E429X-ray1.95
2XP2ALK in complex with Crizotinib (PF-02341066)X-ray1.90
2YFXL1196M Mutant ALK with CrizotinibX-ray1.70
2YHVL1196M Mutant ALKX-ray1.90
2YJRF1174L Mutant ALKX-ray1.90
2YJSC1156Y Mutant ALKX-ray1.90
3AOXALK with CH5424802 (Alectinib precursor)X-ray1.75
3L9PALK Catalytic DomainX-ray1.80
4ANLG1269A Mutant ALKX-ray1.70
4CLIALK with PF-06463922 (Lorlatinib)X-ray2.05
4MKCALK with LDK378 (Ceritinib)X-ray2.01
4Z55ALK with pyrazolopyrimidine derivative of LDK378X-ray1.55
5FTOALK with EntrectinibX-ray2.22
6MX8ALK with BrigatinibX-ray1.96
7LRZHuman ALK GRDX-ray1.91
7LS0Human ALK GRD bound to AUGX-ray3.05
7MZWALK extracellular ligand binding region 673-1025NMR-
7MZYALK extracellular fragment 673-986X-ray1.50
7N00ALK extracellular 648-1025 with AUG-alphaCryo-EM2.27
7NWZALK:ALKAL2 complexX-ray4.17
9G5IALK:ALKAL2 complex (2:1)Cryo-EM3.20
9GBEALK G1202R/L1196M with NVL-655X-ray1.58
(Plus 54 additional structures - showing representative subset) Method Distribution: X-ray Diffraction (~70), NMR (~5), Cryo-EM (~4) Predicted Structure (AlphaFold)
Model IDSequence LengthGlobal pLDDTFraction Very High pLDDT
AF-Q9UM73-F112,39768.260.20 (20%)

Section 5: Cross-Species Orthologs

OrganismEnsembl IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000055471Alkprotein_coding
Rat (Rattus norvegicus)ENSRNOG00000008683Alkprotein_coding
Zebrafish (Danio rerio)ENSDARG00000095833alkprotein_coding
Fruit fly (Drosophila melanogaster)FBGN0040505Alkprotein_coding
Worm (C. elegans)No direct ortholog--
Yeast (S. cerevisiae)No ortholog--

Section 6: Clinical Variants & AI Predictions ClinVar Summary Total Variant Count: 6,175

ClassificationCount
Pathogenic19
Likely Pathogenic7
Uncertain Significance (VUS)~6,000+
Likely Benign~50+
Benign~100+
Pathogenic Variants (Total: 19)
ClinVar IDHGVSVariant TypeReview Status
18083c.3824G>A (p.Arg1275Gln)SNVMultiple submitters
18085c.3575G>C (p.Arg1192Pro)SNVSingle submitter
217849c.3520T>C (p.Phe1174Leu)SNVNo criteria
217850c.3520T>A (p.Phe1174Ile)SNVNo criteria
65670c.3520T>G (p.Phe1174Val)SNVNo criteria
217851c.3522C>A (p.Phe1174Leu)SNVNo criteria
217852c.3522C>G (p.Phe1174Leu)SNVNo criteria
545115c.3512T>C (p.Ile1171Thr)SNVNo criteria
217855c.3733T>A (p.Phe1245Ile)SNVNo criteria
217856c.3734T>G (p.Phe1245Cys)SNVNo criteria
217857c.3735C>A (p.Phe1245Leu)SNVNo criteria
217858c.3833A>C (p.Tyr1278Ser)SNVNo criteria
15278032p23.2-21 deletionCNVSingle submitter
3247472Exon deletionDeletionSingle submitter
3944662p23.2-21 gainCNVNo criteria
487484inv(2)(p23;p21)InversionNo criteria
5626602p23.2-22.3 lossCNVNo criteria
627530t(2;3)(p12;p14.3)TranslocationNo criteria
6873022p23.2-22.2 lossCNVNo criteria
Likely Pathogenic Variants (Total: 7)
ClinVar IDHGVSVariant Type
1705031c.3538G>C (p.Val1180Leu)SNV
217854c.3718T>G (p.Leu1240Val)SNV
2445401c.402G>C (p.Lys134Asn)SNV
375887c.3521T>G (p.Phe1174Cys)SNV
376135c.3586C>A (p.Leu1196Met)SNV
547174inv(2)(p21p23.2)Inversion
978809c.4298_4299del (p.Glu1433fs)Deletion
AlphaMissense Predictions Total Predictions: 10,546
ClassificationApproximate Count
Likely Pathogenic~3,000+
Ambiguous~4,000+
Likely Benign~3,500+
TOP 50 Predicted Pathogenic Missense Variants (Highest Scores):
VariantPositionScoreClass
W1510R15100.998likely_pathogenic
G1508D15080.996likely_pathogenic
C1579W15790.994likely_pathogenic
G1508V15080.993likely_pathogenic
C1579R15790.993likely_pathogenic
C1579Y15790.992likely_pathogenic
S1509P15090.991likely_pathogenic
W1510C15100.991likely_pathogenic
G1508R15080.990likely_pathogenic
F1577S15770.990likely_pathogenic
F1577L15770.988likely_pathogenic
F1572L15720.976likely_pathogenic
C1579F15790.974likely_pathogenic
W1510S15100.969likely_pathogenic
F1511L15110.968likely_pathogenic
V1582D15820.966likely_pathogenic
G1580E15800.965likely_pathogenic
W1510G15100.964likely_pathogenic
S1509F15090.964likely_pathogenic
G1580V15800.957likely_pathogenic
SpliceAI Predictions Total Predictions: 6,693 High-Impact Splice Variants (Score ≥0.5): 100+ variants TOP 50 High-Confidence Splice-Altering Variants:
VariantEffectDelta Score
2:29193928:G:Cacceptor_gain1.00
2:29196858:TAT:Tacceptor_gain1.00
2:29196859:AT:Aacceptor_gain1.00
2:29196764:TTTTA:Tdonor_loss1.00
2:29196765:TTTA:Tdonor_loss1.00
2:29196766:TTAC:Tdonor_loss1.00
2:29196767:TACCT:Tdonor_loss1.00
2:29196768:A:ATdonor_loss1.00
2:29196769:C:Adonor_loss1.00
2:29196861:CTAA:Cacceptor_loss1.00
2:29196859:ATCTA:Aacceptor_loss1.00
2:29193926:CCG:Cacceptor_gain0.99
2:29193927:C:Tacceptor_gain0.99
2:29196768:A:ACdonor_gain0.99
2:29196769:C:CCdonor_gain0.99
2:29196857:GTAT:Gacceptor_gain0.99
2:29193921:TCCTG:Tacceptor_loss0.98
2:29193923:C:CCacceptor_gain0.98
2:29193928:G:GCacceptor_gain0.98
2:29196856:GGTAT:Gacceptor_gain0.98

Section 7: Biological Pathways & Gene Ontology Reactome Pathways (Total: 11)

Pathway IDPathway NameDisease Pathway
R-HSA-201556Signaling by ALKNo
R-HSA-9851151MDK and PTN in ALK signalingNo
R-HSA-9725370Signaling by ALK fusions and activated point mutantsYes
R-HSA-9700645ALK mutants bind TKIsYes
R-HSA-9717326crizotinib-resistant ALK mutantsYes
R-HSA-9717316alectinib-resistant ALK mutantsYes
R-HSA-9717319brigatinib-resistant ALK mutantsYes
R-HSA-9717323ceritinib-resistant ALK mutantsYes
R-HSA-9717329lorlatinib-resistant ALK mutantsYes
R-HSA-9717264ASP-3026-resistant ALK mutantsYes
R-HSA-9717301NVP-TAE684-resistant ALK mutantsYes
Gene Ontology Annotations (Total: 29) Biological Process (17 terms):
GO IDTerm
GO:0007165signal transduction
GO:0007169cell surface receptor protein tyrosine kinase signaling pathway
GO:0016310phosphorylation
GO:0038083peptidyl-tyrosine autophosphorylation
GO:0046777protein autophosphorylation
GO:0042127regulation of cell population proliferation
GO:0042981regulation of apoptotic process
GO:0048666neuron development
GO:0045664regulation of neuron differentiation
GO:0021766hippocampus development
GO:0030534adult behavior
GO:0036269swimming behavior
GO:0006950response to stress
GO:0050995negative regulation of lipid catabolic process
GO:0060159regulation of dopamine receptor signaling pathway
GO:0097009energy homeostasis
GO:1900006positive regulation of dendrite development
Molecular Function (7 terms):
GO IDTerm
GO:0004713protein tyrosine kinase activity
GO:0004714transmembrane receptor protein tyrosine kinase activity
GO:0005524ATP binding
GO:0008201heparin binding
GO:0030298receptor signaling protein tyrosine kinase activator activity
GO:0042802identical protein binding
Cellular Component (5 terms):
GO IDTerm
GO:0005886plasma membrane
GO:0032991protein-containing complex
GO:0043235receptor complex
GO:0070062extracellular exosome

Section 8: Protein Interactions & Molecular Networks STRING Interactions (Total: 3,930+) TOP 50 Highest-Confidence Interacting Proteins:

UniProt IDGeneScoreDescription
Q9HC35EML4997Echinoderm microtubule-associated protein-like 4
P21246PTN991Pleiotrophin
P06748NPM1990Nucleophosmin
P21741MDK971Midkine
P37173TGFBR2947TGF-beta receptor type-2
P36896ACVR1B929Activin receptor type-1B
P01116KRAS887GTPase KRas
P36897SMAD1876Mothers against decapentaplegic homolog 1
P31939ATIC867Bifunctional purine biosynthesis protein
P01111NRAS854GTPase NRas
P27037ACVR2A840Activin receptor type-2A
P29353SHC1824SHC-transforming protein 1
P28908TNFRSF8823TNF receptor superfamily member 8 (CD30)
P33176KIF5B822Kinesin-1 heavy chain
Q9NZQ7ALK818ALK (self-interaction)
P06753TPM3816Tropomyosin alpha-3 chain
P42336PIK3CA801PI3K catalytic subunit alpha
P29354SHC2795SHC-transforming protein 2
P18827SDC1792Syndecan-1
P04637TP53772Cellular tumor antigen p53
BioGRID Interactions (Total: 741+) Key Interaction Partners by Method:
InteractorMethodFunction
SHC1Affinity Capture-WesternAdaptor protein
GRB2Affinity Capture-Western/MSAdaptor protein
SRCAffinity Capture-WesternKinase
BCAR1Affinity Capture-MSAdaptor protein
PTNReconstituted ComplexLigand
JAK3Affinity Capture-WesternKinase
PTPN11Proximity Label-MSPhosphatase
PIK3R1Physical AssociationPI3K regulatory
CDC37Proximity Label-MSHSP90 co-chaperone
NUMBProximity Label-MSEndocytosis
IntAct Interactions (Total: 120+) Key Validated Interactions:
Interactor AInteractor BTypeScore
ALKAL1ALKdirect interaction0.66
ALKALKAL2physical association0.56
SHC1ALKphysical association0.69
PIK3R1ALKphysical association0.59
CRKLALKphysical association0.52
NCK2ALKphysical association0.51
Protein Sequence Similarity (DIAMOND) - Total: 85 TOP 20 Homologous Proteins:
UniProt IDTop Identity (%)Top Bitscore
Q6313291.84356
Q78DX791.84351
Q61G9384.24012
Q8I7I584.23997
P0892279.53798
P9779386.62728
P0621399.52691
P0806997.72685
P0DV8489.12845
Q6075199.52783
Q1520897.82774
Q6471696.92562
Q9WTL496.92562
P1461789.82397
P1461689.82388
P0894198.72378
O7379877.92091
Q1330892.61998
Q5XJV684.51977
Q96Q0484.51974
Structural Similarity (ESM2) - Total: 50 TOP 20 Structurally Similar Proteins:
UniProt IDSimilarity ScoreAvg Similarity
O086280.99970.9698
P234700.99950.9712
Q059090.99950.9721
Q132190.99970.9778
Q613980.99970.9692
Q80XH40.99970.9728
O603830.99930.9807
P0DV840.99930.9751
P977930.99940.9751
Q628490.99960.9773
Q9QZL20.99960.9762
O881990.99960.9757
Q505J30.99970.9725

Section 9: Transcription Factor Regulatory Data Note: ALK is NOT a transcription factor. It is a receptor tyrosine kinase. Upstream Regulators (CollecTRI) - Total: 8 Transcription factors that regulate ALK expression:

TF GeneRegulationConfidence
PHOX2BActivationHigh
AP1-High
ESR1-High
JUNB-High
STAT5A-High
TXK-High
CEBPB-Low
TCF23-Low
Key Regulatory Insight
PHOX2B is a confirmed transcriptional activator of ALK, particularly relevant in neuroblastoma where PHOX2B mutations can affect ALK expression.
Drug & Pharmacology Data**
ChEMBL Target Information
Target IDTarget NameTarget Type
CHEMBL4247ALK tyrosine kinase receptorSINGLE PROTEIN
CHEMBL2111387NPM/ALKCHIMERIC PROTEIN
CHEMBL3883330EML4-ALKCHIMERIC PROTEIN
Approved ALK Inhibitors (Phase 4)
ChEMBL IDDrug NameDevelopment Phase
CHEMBL601719CRIZOTINIB4 (Approved)
CHEMBL1738797ALECTINIB4 (Approved)
CHEMBL2403108CERITINIB4 (Approved)
CHEMBL3545311BRIGATINIB4 (Approved)
CHEMBL3286830LORLATINIB4 (Approved)
CHEMBL1983268ENTRECTINIB4 (Approved)
CHEMBL4298138REPOTRECTINIB4 (Approved)
Other Kinase Inhibitors with ALK Activity (Phase 4)
ChEMBL IDDrug NamePrimary TargetPhase
CHEMBL535SUNITINIBMulti-kinase4
CHEMBL477772PAZOPANIBMulti-kinase4
CHEMBL24828VANDETANIBMulti-kinase4
CHEMBL288441BOSUTINIBBCR-ABL4
CHEMBL553ERLOTINIBEGFR4
CHEMBL3353410OSIMERTINIBEGFR4
CHEMBL189963PALBOCICLIBCDK4/64
CHEMBL1789941RUXOLITINIBJAK1/24
CHEMBL1287853FEDRATINIBJAK24
CHEMBL3622821UPADACITINIBJAK14
CHEMBL502835NINTEDANIBMulti-kinase4
CHEMBL3301622GILTERITINIBFLT3/AXL4
CHEMBL608533MIDOSTAURINMulti-kinase4
Phase 3 Inhibitors
ChEMBL IDDrug NamePhase
CHEMBL50QUERCETIN3
CHEMBL223360LINIFANIB3
CHEMBL276711SEMAXANIB3
CHEMBL491473CEDIRANIB3
CHEMBL522892DOVITINIB3
CHEMBL3545308ROCILETINIB3
CHEMBL1879463DACTOLISIB3
CHEMBL428690ALVOCIDIB3
CHEMBL603469LESTAURTINIB3
CHEMBL31965CANERTINIB3
PharmGKB
PharmGKB IDSymbolVIP GeneCPIC Guideline
PA24719ALK✓ YESNo

Section 11: Expression Profiles Bgee Expression Summary

PropertyValue
Expression BreadthUbiquitous
Total Present Calls181
Total Absent Calls86
Total Conditions267
Max Expression Score85.61
Average Expression Score54.69
Gold Quality Count214
TOP 30 Tissues/Cell Types by Expression
Tissue/Cell TypeExpressionScoreQuality
SpermPresent85.61Gold
Male germ cellPresent85.51Gold
Male germ line stem cell (testis)Present85.22Gold
Ventral tegmental areaPresent78.37Gold
Superior vestibular nucleusPresent77.39Gold
Medial globus pallidusPresent77.03Silver
Subthalamic nucleusPresent76.80Silver
Buccal mucosa cellPresent76.70Silver
Dorsal plus ventral thalamusPresent76.44Silver
Globus pallidusPresent76.35Silver
Brodmann area 23Present75.67Gold
Inferior vagus X ganglionPresent75.22Silver
Middle temporal gyrusPresent74.13Gold
Trigeminal ganglionPresent73.35Silver
Dorsal root ganglionPresent73.10Silver
Tissues with LOW/ABSENT Expression
TissueExpressionScore
Type B pancreatic cellAbsent83.34
Olfactory bulbAbsent83.23
Cervix squamous epitheliumAbsent77.67
DiaphragmAbsent77.39
ThymusAbsent76.19
Cerebellar vermisAbsent74.78
Single-Cell Expression Data
Dataset IDDescriptionSpeciesCell Count
E-ANND-2GTEx: snRNAseq atlasHomo sapiens209,126

Section 12: Disease Associations Mendelian/Monogenic Disease (GenCC)

DiseaseClassificationInheritanceEvidence
Neuroblastoma, susceptibility to, 3 (OMIM:613014)DefinitiveAutosomal dominantG2P
Neuroblastoma, susceptibility to, 3 (OMIM:613014)StrongAutosomal dominantAmbry, Labcorp
Orphanet Disease Associations (Total: 8)
Orphanet IDDisease NameType
635NeuroblastomaDisease
251877GanglioneuroblastomaDisease
251992GanglioneuromaDisease
300895ALK-positive anaplastic large cell lymphomaHistopathological subtype
364043ALK-positive large B-cell lymphomaDisease
178342Inflammatory myofibroblastic tumorDisease
146Differentiated thyroid carcinomaDisease
626Large/giant congenital melanocytic nevusDisease
MONDO Disease Associations
MONDO IDDisease Name
MONDO:0020325Anaplastic large cell lymphoma
HPO Phenotype Associations (Total: 68) Cancer/Tumor-Related Phenotypes:
HPO IDPhenotype
HP:0003006Neuroblastoma
HP:0003005Ganglioneuroma
HP:0100631Neoplasm of the adrenal gland
HP:0004375Neoplasm of the nervous system
HP:0012056Cutaneous melanoma
HP:0005600Congenital giant melanocytic nevus
Neurological Phenotypes:
HPO IDPhenotype
HP:0001250Seizure
HP:0001251Ataxia
HP:0001336Myoclonus
HP:0012758Neurodevelopmental delay
HP:0010543Opsoclonus
HP:0007110Central hypoventilation
HP:0002277Horner syndrome
Hematological/Systemic Phenotypes:
HPO IDPhenotype
HP:0001873Thrombocytopenia
HP:0001903Anemia
HP:0001892Abnormal bleeding
HP:0001945Fever
HP:0002716Lymphadenopathy
GWAS Associations (Total: 15)
Study IDTraitP-valueChr
GCST008758_64Pre-treatment viral load in HIV-1 infection7.0e-182
GCST008295_43Number of decayed/missing/filled tooth surfaces2.0e-092
GCST011346_32Total cholesterol levels4.0e-092
GCST011347_56Low density lipoprotein cholesterol levels4.0e-082
GCST010989_247Body size at age 103.0e-082
GCST008306_9Dentures6.0e-082
GCST008386_3Carbamazepine-induced serious cutaneous adverse reaction5.0e-082
GCST002726_14Glucose homeostasis traits3.0e-062
GCST002783_418Body mass index8.0e-062
GCST003095_3Adiponectin levels (BMI-adjusted)7.0e-062
GCST003670_7Systolic blood pressure1.0e-062
GCST008176_8Gestational age at birth4.0e-062
GCST008384_3Carbamazepine-induced DRESS1.0e-062
GCST008551_19Simvastatin-induced myopathy5.0e-062
GCST90000582_8Spontaneous coronary artery dissection8.0e-062
SUMMARY STATISTICS
CategoryCount
Gene Identifiers5 (HGNC, Ensembl, Entrez, OMIM, UniProt)
Transcripts (Ensembl)7
RefSeq Transcripts2 (human)
CCDS2
Exons (canonical)29
UniProt Entries1 (reviewed)
Protein Domains13
PDB Structures79
Orthologs4 (mouse, rat, zebrafish, fly)
ClinVar Variants6,175
Pathogenic Variants19
Likely Pathogenic Variants7
AlphaMissense Predictions10,546
SpliceAI Predictions6,693
Reactome Pathways11
GO Annotations29
STRING Interactions3,930+
BioGRID Interactions741+
IntAct Interactions120+
Sequence Homologs85
Structural Homologs50
Approved ALK Inhibitors7
Phase 3+ Drugs32
Orphanet Diseases8
HPO Phenotypes68
GWAS Associations15

This reference document was compiled using BioTree cross-database integration platform, covering 70+ biological databases.