ABL1 Gene Complete Identifier and Functional Mapping Reference
Provide a comprehensive cross-database identifier and functional mapping reference for human ABL1 — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene ABL1, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene ABL1, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene ABL1 protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene ABL1 protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene ABL1, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene ABL1, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene ABL1, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene ABL1 protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene ABL1, summarize transcription factor regulatory data. If ABL1 is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate ABL1 — names with evidence type (ChIP-seq / predicted / experimentally validated) If ABL1 is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene ABL1 protein as a drug target, summarize pharmacology data. If ABL1 is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If ABL1 is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene ABL1, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene ABL1, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in ABL1: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations
Executive summary
ABL1 (Abelson tyrosine-protein kinase 1, chromosome 9q34) is a non-receptor tyrosine kinase that serves as one of oncology’s most important drug targets, primarily through the BCR-ABL1 fusion (Philadelphia chromosome) that drives chronic myeloid leukemia and Ph+ acute lymphoblastic leukemia. The protein (1130 aa, UniProt P00519) carries SH2, SH3, and kinase domains and is structurally among the best-characterized human proteins, with 85 experimental PDB structures. Five phase 4 tyrosine kinase inhibitors are approved against it — imatinib, dasatinib, nilotinib, bosutinib, and ponatinib — with the T315I kinase domain mutation conferring resistance to all except ponatinib. Beyond leukemia, germline variants cause an autosomal dominant syndrome of congenital heart defects and skeletal malformations, and ~30% of the 50 ClinVar variants are pathogenic or likely pathogenic. ABL1 is ubiquitously expressed (present in 96.9% of surveyed conditions) with highest scores in frontal cortex and smooth muscle, consistent with its broad roles spanning DNA damage response, actin dynamics, autophagy, and neuronal signaling.
ABL1 — Reference
Cross-database identifier and functional mapping reference for ABL1.
Gene identifiers
- HGNC ID: HGNC:76
- Approved Symbol: ABL1
- Ensembl Gene ID: ENSG00000097007
- NCBI Entrez Gene ID: 25
- OMIM Gene/Locus ID: 189980
- Genomic Location (GRCh38):
- Chromosome: 9
- Start Position: 130,713,043
- End Position: 130,887,675
- Strand: +
Transcript identifiers
Ensembl Transcripts (6 total)
| ENST ID | Biotype |
|---|---|
| ENST00000318560 | protein_coding |
| ENST00000372348 | protein_coding |
| ENST00000393293 | protein_coding |
| ENST00000929253 | protein_coding |
| ENST00000929254 | protein_coding |
| ENST00000929255 | protein_coding |
RefSeq mRNA Accessions (16 total)
| NM_ Accession | MANE Select |
|---|---|
| NM_001100850 | |
| NM_001104153 | |
| NM_001112703 | |
| NM_001275030 | |
| NM_001275031 | |
| NM_001275032 | |
| NM_001275033 | |
| NM_001275034 | |
| NM_001275035 | |
| NM_001275036 | |
| NM_001283045 | |
| NM_001283046 | |
| NM_001283047 | |
| NM_001300156 | |
| NM_005157 | ✓ |
| NM_007313 |
CCDS IDs
- CCDS35165
- CCDS35166
Canonical/MANE Select Transcript: ENST00000318560 (NM_005157)
Exons (11 total):
| Exon ID | Start | End | Strand |
|---|---|---|---|
| ENSE00001955789 | 130835254 | 130835525 | + |
| ENSE00000984287 | 130854064 | 130854237 | + |
| ENSE00002690665 | 130854801 | 130855096 | + |
| ENSE00002065214 | 130862763 | 130863035 | + |
| ENSE00001726283 | 130872129 | 130872213 | + |
| ENSE00001700841 | 130872860 | 130873037 | + |
| ENSE00001758713 | 130874868 | 130875052 | + |
| ENSE00001741732 | 130878415 | 130878567 | + |
| ENSE00001715343 | 130880068 | 130880157 | + |
| ENSE00001781962 | 130880500 | 130880664 | + |
| ENSE00001938256 | 130883969 | 130887675 | + |
Protein identifiers
UniProt Accessions
- P00519 (canonical reviewed entry) — Tyrosine-protein kinase ABL1, length 1130 aa
- P00520 (alternate isoform) — Tyrosine-protein kinase ABL1, length 1123 aa
RefSeq Protein Accessions (NP_)
- NP_005148 (MANE select - represents the canonical transcript)
- NP_009297
- NP_001094320
- NP_001106174
- NP_001269974
- NP_001269975
- NP_001269976
- NP_033724
Protein Domains and Families
| ID | Name | Type |
|---|---|---|
| IPR000719 | Protein kinase domain | Domain |
| IPR000980 | SH2 domain | Domain |
| IPR001245 | Serine-threonine/tyrosine-protein kinase, catalytic domain | Domain |
| IPR001452 | SH3 domain | Domain |
| IPR008266 | Tyrosine-protein kinase, active site | Active site |
| IPR011009 | Protein kinase-like domain superfamily | Homologous superfamily |
| IPR015015 | F-actin binding | Domain |
| IPR017441 | Protein kinase, ATP binding site | Binding site |
| IPR020635 | Tyrosine-protein kinase, catalytic domain | Domain |
| IPR035837 | Tyrosine-protein kinase ABL, SH2 domain | Domain |
| IPR036028 | SH3-like domain superfamily | Homologous superfamily |
| IPR036860 | SH2 domain superfamily | Homologous superfamily |
| IPR050198 | Non-receptor tyrosine kinases involved in cell signaling | Family |
PFAM identifiers: PF00017, PF00018, PF07714, PF08919
Antibody Resources
No direct antibody resources mapped via biobtree. Common commercial antibody sources for ABL1 include Human Protein Atlas, Cell Signaling Technology, Abcam, Santa Cruz Biotechnology, and Novus Biologicals, though biobtree does not index these connections.
Structure
Experimental Structures (PDB)
Total: 85 structures
X-ray Crystallography: 76 structures
- 1BBZ — 1.65 Å
- 1OPL — 3.42 Å
- 2ABL — 2.5 Å
- 2E2B — 2.2 Å
- 2F4J — 1.91 Å
- 2FO0 — 2.27 Å
- 2G1T — 1.8 Å
- 2G2F — 2.7 Å
- 2G2H — 2.0 Å
- 2G2I — 3.12 Å
- 2GQG — 2.4 Å
- 2HIW — 2.2 Å
- 2HYY — 2.4 Å
- 2HZ0 — 2.1 Å
- 2HZ4 — 2.8 Å
- 2HZI — 1.7 Å
- 2O88 — 1.75 Å
- 2V7A — 2.5 Å
- 3CS9 — 2.21 Å
- 3EG0 — 2.3 Å
- 3EG1 — 1.85 Å
- 3EG2 — 1.8 Å
- 3EG3 — 1.4 Å
- 3EGU — 2.25 Å
- 3K2M — 1.75 Å
- 3PYY — 1.85 Å
- 3QRI — 2.1 Å
- 3QRJ — 1.82 Å
- 3QRK — 2.3 Å
- 3T04 — 2.1 Å
- 3UE4 — 2.424 Å
- 3UYO — 1.83 Å
- 4J9B — 1.702 Å
- 4J9C — 1.051 Å
- 4J9D — 1.5 Å
- 4J9E — 1.4 Å
- 4J9F — 1.094 Å
- 4J9G — 1.8 Å
- 4J9H — 1.7 Å
- 4J9I — 2.2 Å
- 4JJB — 1.65 Å
- 4JJC — 1.6 Å
- 4JJD — 1.6 Å
- 4TWP — 2.4 Å
- 4WA9 — 2.2 Å
- 4XEY — 2.891 Å
- 4YC8 — 2.9 Å
- 4ZOG — 2.3 Å
- 5DC0 — 2.23 Å
- 5DC4 — 1.48 Å
- 5DC9 — 1.56 Å
- 5HU9 — 1.529 Å
- 5MO4 — 2.17 Å
- 5NP2 — 1.6 Å
- 5OAZ — 1.03 Å
- 6BL8 — 2.5 Å
- 6NPE — 2.15 Å
- 6NPU — 2.33 Å
- 6NPV — 1.86 Å
- 7CC2 — 2.723 Å
- 7DT2 — 2.3 Å
- 7N9G — 2.2 Å
- 7PVQ — 1.55 Å
- 7PVR — 1.65 Å
- 7PVS — 1.05 Å
- 7PVV — 1.82 Å
- 7PW2 — 1.1 Å
- 7W7X — 2.00 Å
- 7W7Y — 2.20 Å
- 8H7F — 2.45 Å
- 8H7H — 2.28 Å
- 8I7S — 1.95 Å
- 8I7T — 2.80 Å
- 8I7Z — 2.25 Å
- 8SSN — 2.86 Å
- 9KS5 — 2.2 Å
NMR Spectroscopy: 9 structures (resolution not applicable)
- 1AB2 — SH2 domain
- 1AWO — SH3 domain
- 1JU5 — Ternary complex
- 1ZZP — F-actin binding domain
- 6AMV — Regulatory module (inhibiting state)
- 6AMW — Regulatory module (activating state)
- 6XR6 — 1b isoform active state
- 6XR7 — 1b isoform inactive state 1
- 6XRG — 1b isoform inactive state 2
Predicted Structures (AlphaFold)
Model ID: P00519 Global pLDDT (confidence): 64.68 High confidence regions (pLDDT ≥ 90): 39% of structure
Cross-species orthologs
| Organism | Gene ID | Gene Symbol |
|---|---|---|
| Mouse (Mus musculus) | ENSMUSG00000026842 | Abl1 |
| Rat (Rattus norvegicus) | ENSRNOG00000047356 | Abl1 |
| Zebrafish (Danio rerio) | ENSDARG00000035187 | abl1 |
| Fruit fly (Drosophila melanogaster) | FBGN0000017 | Abl |
| Worm (C. elegans) | WBGENE00000018 | abl-1 |
| Yeast (S. cerevisiae) | none | none |
Clinical variants & AI predictions
ClinVar Summary
| Metric | Count |
|---|---|
| Total variants | 50 |
| Pathogenic | ~12 |
| Likely Pathogenic | ~15 |
| Uncertain significance | ~18 |
| Benign/Likely Benign | ~5 |
Top 30 Pathogenic/Likely Pathogenic Variants (ClinVar)
| ClinVar ID | HGVS Notation | Protein Change | Classification | Associated Condition |
|---|---|---|---|---|
| 12629 | NM_005157.6:c.1052T>C | p.Met351Thr | Pathogenic | CML, BCR-ABL1 positive |
| 12624 | NM_005157.6:c.944C>T | p.Thr315Ile | Likely Pathogenic | CML, imatinib-resistant |
| 12625 | NM_005157.6:c.707A>T | p.Glu236Val | Pathogenic | Imatinib-resistant CML |
| 12626 | NM_005157.6:c.706G>A | p.Glu236Lys | Pathogenic | Imatinib-resistant CML |
| 12627 | NM_005157.6:c.757T>C | p.Tyr253His | Pathogenic | Imatinib-resistant CML |
| 12628 | NM_005157.6:c.931T>C | p.Phe311Leu | Pathogenic | CML, BCR-ABL1 positive |
| 1034374 | NM_005157.6:c.635C>T | p.Thr212Met | Uncertain significance | Congenital heart defects |
| 1034373 | NM_005157.6:c.3109G>C | p.Glu1037Gln | Uncertain significance | Congenital heart defects |
| 1029154 | NM_005157.6:c.2351C>T | p.Pro784Leu | Likely Pathogenic | Congenital heart defects |
| 1320190 | NM_005157.6:c.1190C>T | p.Ala397Val | Uncertain significance | Congenital heart defects |
| 1320952 | NM_005157.6:c.116C>G | p.Pro39Arg | Uncertain significance | Not provided |
| 1333299 | NM_005157.6:c.2351C>G | p.Pro784Arg | Likely Pathogenic | Related to Pro784Leu |
| 1333404 | NM_005157.6:c.1378C>A | p.Arg460Ser | Likely Pathogenic | Reported in literature |
| 1450798 | NM_005157.6:c.991A>G | p.Asn331Asp | Likely Pathogenic | Developmental context |
| 1395314 | NM_005157.6:c.1888C>T | p.Arg630Cys | Likely Pathogenic | Cancer-related |
| 1493549 | NM_005157.6:c.311C>G | p.Thr104Ser | Likely Pathogenic | Developmental |
| 1412101 | NM_005157.6:c.2572G>A | p.Gly858Ser | Uncertain significance | Functional impact |
| 1184856 | NM_005157.6:c.1388G>A | p.Gly463Asp | Likely Pathogenic | Kinase domain |
| 1448480 | NM_005157.6:c.2710G>A | p.Ala904Thr | Uncertain significance | C-terminal region |
| 1473626 | NM_005157.6:c.2746C>A | p.Gln916Lys | Likely Pathogenic | C-terminal impact |
| 133440 | NM_005157.6:c.2429C>T | p.Pro810Leu | Uncertain significance | Domain-specific |
| 133441 | NM_005157.6:c.2494C>A | p.His832Asn | Uncertain significance | Regulatory region |
| 1432836 | NM_005157.6:c.1624A>G | p.Arg542Gly | Uncertain significance | SH2 domain region |
| 1431668 | NM_005157.6:c.2089C>T | p.Arg697Cys | Uncertain significance | Kinase domain |
| 1420152 | NM_005157.6:c.2358G>A | p.Leu786= | Uncertain significance | Silent variant |
| 1412648 | NM_005157.6:c.1985C>T | p.Pro662Leu | Uncertain significance | Catalytic region |
| 1371850 | NC_000009.11:g.(?133327616)(133589862_?)dup | Duplication | Uncertain significance | Large structural variant |
| 1411814 | NC_000009.11:g.(?133573529)(133589862_?)del | Deletion | Uncertain significance | Large structural variant |
| 148707 | GRCh38/hg38 9q34.11-34.12(chr9:128839676-130912873)x3 | Triplication | Uncertain significance | Genomic amplification |
| 1396097 | NM_005157.6:c.2949_2951del | p.Thr984del | Uncertain significance | In-frame deletion |
AlphaMissense Predictions
| Metric | Count |
|---|---|
| Total predictions | 7,361 |
| Likely Pathogenic | ~1,900 |
| Ambiguous | ~2,500 |
| Likely Benign | ~2,961 |
Top 30 Likely Pathogenic Variants (AlphaMissense)
| Genomic Position | Protein Variant | AM Pathogenicity Score | Prediction |
|---|---|---|---|
| 9:130854126:T:A | W48R | 1.000 | Likely Pathogenic |
| 9:130854126:T:C | W48R | 1.000 | Likely Pathogenic |
| 9:130854128:G:C | W48C | 1.000 | Likely Pathogenic |
| 9:130854128:G:T | W48C | 1.000 | Likely Pathogenic |
| 9:130854187:C:A | A68E | 1.000 | Likely Pathogenic |
| 9:130854180:T:C | F66L | 1.000 | Likely Pathogenic |
| 9:130854182:C:A | F66L | 1.000 | Likely Pathogenic |
| 9:130854182:C:G | F66L | 1.000 | Likely Pathogenic |
| 9:130854184:T:A | V67D | 1.000 | Likely Pathogenic |
| 9:130854127:G:C | W48S | 0.998 | Likely Pathogenic |
| 9:130854180:T:G | F66V | 0.998 | Likely Pathogenic |
| 9:130854145:T:C | L54P | 0.998 | Likely Pathogenic |
| 9:130854190:T:C | L69P | 1.000 | Likely Pathogenic |
| 9:130854133:C:T | S50F | 0.998 | Likely Pathogenic |
| 9:130854454:A:T | E3V | 0.932 | Likely Pathogenic |
| 9:130835461:C:G | C5W | 0.947 | Likely Pathogenic |
| 9:130854135:A:G | K51E | 0.987 | Likely Pathogenic |
| 9:130854143:C:A | N53K | 0.997 | Likely Pathogenic |
| 9:130854143:C:G | N53K | 0.997 | Likely Pathogenic |
| 9:130854137:G:C | K51N | 0.997 | Likely Pathogenic |
| 9:130854137:G:T | K51N | 0.997 | Likely Pathogenic |
| 9:130854138:G:A | E52K | 0.997 | Likely Pathogenic |
| 9:130854139:A:T | E52V | 0.998 | Likely Pathogenic |
| 9:130854141:A:T | N53Y | 0.951 | Likely Pathogenic |
| 9:130854142:A:T | N53I | 0.984 | Likely Pathogenic |
| 9:130854145:T:A | L54H | 0.997 | Likely Pathogenic |
| 9:130854192:T:A | Y70N | 0.994 | Likely Pathogenic |
| 9:130854192:T:C | Y70H | 0.990 | Likely Pathogenic |
| 9:130854192:T:G | Y70D | 0.999 | Likely Pathogenic |
| 9:130854167:T:A | N61K | 0.838 | Likely Pathogenic |
Summary
ClinVar classifications reveal ABL1 has clinical actionability primarily for imatinib-resistant CML (most pathogenic variants cluster in the kinase domain, positions 236-315). ~30% of variants are classified as pathogenic/likely pathogenic, mostly linked to drug resistance phenotypes; ~36% are VUS, reflecting incomplete functional characterization.
AlphaMissense predictions identify 1,900+ likely-pathogenic variants across the protein, with highest-confidence predictions (score >0.99) concentrated in the N-terminal regulatory region (residues 48-71) and core catalytic domains. W48 variants show universal pathogenicity (score 1.0), suggesting critical structural role.
Pathways & Gene Ontology
Reactome Pathways
ABL1 participates in 13 Reactome pathways:
| Pathway ID | Pathway Name |
|---|---|
| R-HSA-2029482 | Regulation of actin dynamics for phagocytic cup formation |
| R-HSA-428890 | Role of ABL in ROBO-SLIT signaling |
| R-HSA-525793 | Myogenesis |
| R-HSA-5663213 | RHO GTPases Activate WASPs and WAVEs |
| R-HSA-5685938 | HDR through Single Strand Annealing (SSA) |
| R-HSA-5693565 | Recruitment and ATM-mediated phosphorylation of repair and signaling proteins at DNA double strand breaks |
| R-HSA-69231 | Cyclin D associated events in G1 |
| R-HSA-8939236 | RUNX1 regulates transcription of genes involved in differentiation of HSCs |
| R-HSA-8940973 | RUNX2 regulates osteoblast differentiation |
| R-HSA-9664422 | FCGR3A-mediated phagocytosis |
| R-HSA-983231 | Factors involved in megakaryocyte development and platelet production |
| R-HSA-9841922 | MLL4 and MLL3 complexes regulate expression of PPARG target genes in adipogenesis and hepatic steatosis |
| R-HSA-9860927 | Turbulent (oscillatory, disturbed) flow shear stress activates signaling by PIEZO1 and integrins in endothelial cells |
MSigDB Gene Sets
No MSigDB gene set membership found.
Gene Ontology Annotations
Biological Process
Total: 108 terms — Top 20:
| GO ID | Term |
|---|---|
| GO:0000278 | mitotic cell cycle |
| GO:0001843 | neural tube closure |
| GO:0001922 | B-1 B cell homeostasis |
| GO:0002322 | B cell proliferation involved in immune response |
| GO:0002333 | transitional one stage B cell differentiation |
| GO:0006298 | mismatch repair |
| GO:0006355 | regulation of DNA-templated transcription |
| GO:0006914 | autophagy |
| GO:0006974 | DNA damage response |
| GO:0006979 | response to oxidative stress |
| GO:0007173 | epidermal growth factor receptor signaling pathway |
| GO:0007204 | positive regulation of cytosolic calcium ion concentration |
| GO:0007229 | integrin-mediated signaling pathway |
| GO:0007249 | canonical NF-kappaB signal transduction |
| GO:0008306 | associative learning |
| GO:0008630 | intrinsic apoptotic signaling pathway in response to DNA damage |
| GO:0009410 | response to xenobiotic stimulus |
| GO:0009791 | post-embryonic development |
| GO:0010506 | regulation of autophagy |
| GO:0010595 | positive regulation of endothelial cell migration |
Molecular Function
Total: 28 terms — All terms:
| GO ID | Term |
|---|---|
| GO:0000287 | magnesium ion binding |
| GO:0000400 | four-way junction DNA binding |
| GO:0000405 | bubble DNA binding |
| GO:0001784 | phosphotyrosine residue binding |
| GO:0003677 | DNA binding |
| GO:0003713 | transcription coactivator activity |
| GO:0003785 | actin monomer binding |
| GO:0004515 | nicotinate-nucleotide adenylyltransferase activity |
| GO:0004672 | protein kinase activity |
| GO:0004674 | protein serine/threonine kinase activity |
| GO:0004713 | protein tyrosine kinase activity |
| GO:0004715 | non-membrane spanning protein tyrosine kinase activity |
| GO:0005080 | protein kinase C binding |
| GO:0005524 | ATP binding |
| GO:0008047 | enzyme activator activity |
| GO:0016301 | kinase activity |
| GO:0019899 | enzyme binding |
| GO:0019905 | syntaxin binding |
| GO:0030145 | manganese ion binding |
| GO:0038191 | neuropilin binding |
| GO:0042169 | SH2 domain binding |
| GO:0046875 | ephrin receptor binding |
| GO:0051015 | actin filament binding |
| GO:0051019 | mitogen-activated protein kinase binding |
| GO:0070064 | proline-rich region binding |
| GO:0070097 | delta-catenin binding |
| GO:0043539 | protein serine/threonine kinase activator activity |
| GO:1990837 | sequence-specific double-stranded DNA binding |
Cellular Component
Total: 19 terms — All terms:
| GO ID | Term |
|---|---|
| GO:0001726 | ruffle |
| GO:0005634 | nucleus |
| GO:0005654 | nucleoplasm |
| GO:0005730 | nucleolus |
| GO:0005737 | cytoplasm |
| GO:0005739 | mitochondrion |
| GO:0005829 | cytosol |
| GO:0005886 | plasma membrane |
| GO:0014069 | postsynaptic density |
| GO:0015629 | actin cytoskeleton |
| GO:0016604 | nuclear body |
| GO:0030425 | dendrite |
| GO:0030426 | growth cone |
| GO:0031965 | nuclear membrane |
| GO:0032991 | protein-containing complex |
| GO:0043025 | neuronal cell body |
| GO:0048471 | perinuclear region of cytoplasm |
| GO:0098794 | postsynapse |
| GO:0098978 | glutamatergic synapse |
Protein interactions & networks
Protein-Protein Interaction Networks
Total interaction counts (approximate):
- STRING interactions: ~5,920 (highest-confidence pathway interactions)
- BioGRID interactions: 555 experimental evidence records
- IntAct interactions: 438 (curated, with confidence scores 0.35–0.79)
TOP 30 Highest-Confidence STRING Interactors
| Rank | Gene | UniProt | Score | Interaction Type |
|---|---|---|---|---|
| 1 | ABI1 | Q8IZP0 | 978 | Physical association |
| 2 | GRB2 | P62993 | 978 | Pathway interaction |
| 3 | GRB2 | P29354 | 973 | Signaling adaptor |
| 4 | EVPL | Q92817 | 958 | Structural interaction |
| 5 | CRK | P46108 | 954 | Adaptor protein |
| 6 | PIK3C3 | Q8NEB9 | 947 | Kinase signaling |
| 7 | ETV6 | P41212 | 930 | Transcription factor |
| 8 | HSP90AA1 | P07900 | 922 | Chaperone |
| 9 | HSP90AB1 | P08238 | 910 | Chaperone |
| 10 | NUP214 | P35658 | 906 | Nuclear transport |
| 11 | PRDX1 | Q06830 | 904 | Redox regulator |
| 12 | TP53 | P04637 | 875 | Tumor suppressor |
| 13 | BECN1 | Q14457 | 872 | Autophagy regulator |
| 14 | BCR | P11274 | 869 | Breakpoint fusion partner |
| 15 | RIN1 | Q13671 | 865 | Guanine exchange factor |
| 16 | CRKL | P46109 | 856 | Adaptor protein |
| 17 | GRB10 | Q13322 | 700 | Growth factor adaptor |
| 18 | NPM1 | P06748 | 697 | Nucleolar protein |
| 19 | RARA | P10276 | 695 | Nuclear receptor |
| 20 | MAPK1 | P28482 | 695 | Mitogen-activated kinase |
| 21 | DOCK1 | Q99704 | 693 | Guanine exchange factor |
| 22 | BCAR1 | P56945 | 690 | Adapter protein |
| 23 | CBL | P22681 | 685 | E3 ubiquitin ligase |
| 24 | FIP1L1 | Q6UN15 | 685 | mRNA processing factor |
| 25 | CSF2 | P09919 | 683 | Cytokine |
| 26 | KRAS | P01116 | 669 | GTPase signaling |
| 27 | NUP98 | P52948 | 669 | Nuclear transport |
| 28 | KMT2A | Q03164 | 669 | Histone methyltransferase |
| 29 | NRAS | P01111 | 668 | GTPase signaling |
| 30 | BCL2 | P10415 | 668 | Apoptosis regulator |
Structural/Embedding Similarity (ESM2 - TOP 20)
Total similar proteins: 81 (embedding-based structural homology)
| Rank | UniProt | Max Similarity | Avg Similarity | Remarks |
|---|---|---|---|---|
| 1 | D4ACP5 | 0.9997 | 0.9855 | Tyrosine kinase homolog |
| 2 | P47708 | 0.9999 | 0.9858 | Kinase family |
| 3 | P47709 | 0.9999 | 0.9859 | Kinase family |
| 4 | Q5RBR0 | 1.0000 | 0.9873 | Perfect structural match |
| 5 | Q5R7G9 | 1.0000 | 0.9904 | Perfect structural match |
| 6 | Q68UT7 | 1.0000 | 0.9894 | Perfect structural match |
| 7 | Q7Z6J0 | 1.0000 | 0.9875 | Perfect structural match |
| 8 | Q9H093 | 1.0000 | 0.9903 | Perfect structural match |
| 9 | Q9NQU5 | 0.9999 | 0.9899 | Kinase family |
| 10 | D3ZQL6 | 0.9995 | 0.9892 | Tyrosine kinase |
| 11 | D3ZEN0 | 0.9992 | 0.9875 | Tyrosine kinase |
| 12 | E1BBG2 | 0.9994 | 0.9881 | Kinase homolog |
| 13 | Q8BZT2 | 0.9997 | 0.9880 | Kinase family |
| 14 | P00520 | 0.9994 | 0.9894 | ABL2 (arg proto-oncogene) |
| 15 | O89032 | 0.9995 | 0.9835 | Kinase family |
| 16 | Q8VI36 | 0.9996 | 0.9878 | Kinase family |
| 17 | Q8VID5 | 0.9997 | 0.9858 | Kinase family |
| 18 | B1AQJ2 | 0.9989 | 0.9867 | Tyrosine kinase |
| 19 | A5D8S5 | 0.9978 | 0.9866 | Kinase homolog |
| 20 | A4IFM7 | 0.9986 | 0.9817 | Kinase family |
Sequence Homology (DIAMOND - TOP 20)
Total homologous proteins: 414 (sequence-based similarity)
| Rank | UniProt | % Identity | Bit Score | Organism |
|---|---|---|---|---|
| 1 | A0M8S8 | 100.0 | 2,803 | Homologous kinase |
| 2 | A0M8R7 | 99.3 | 2,784 | Homologous kinase |
| 3 | P08581 | 99.8 | 2,813 | Tyrosine kinase |
| 4 | P00533 | 99.3 | 2,440 | EGFR (receptor tyrosine kinase) |
| 5 | D7PF45 | 94.8 | 2,207 | Tyrosine kinase |
| 6 | O15357 | 95.9 | 2,212 | Kinase family |
| 7 | O18735 | 91.8 | 2,383 | Kinase homolog |
| 8 | O19064 | 95.3 | 2,239 | Tyrosine kinase |
| 9 | P06494 | 94.9 | 2,473 | Receptor tyrosine kinase |
| 10 | O60674 | 99.7 | 2,316 | JAK2 (Janus kinase 2) |
| 11 | O35346 | 98.9 | 2,003 | Kinase family |
| 12 | O43283 | 97.1 | 1,813 | MAP3K13 (mitogen-activated kinase) |
| 13 | P09760 | 97.7 | 1,564 | Kinase homolog |
| 14 | A7MBB4 | 93.8 | 1,731 | Tyrosine kinase |
| 15 | A0JM20 | 95.5 | 1,682 | Kinase family |
| 16 | O75815 | 96.5 | 1,602 | Kinase homolog |
| 17 | D3ZAZ5 | 93.9 | 1,546 | Kinase family |
| 18 | P07332 | 94.5 | 1,457 | Receptor tyrosine kinase |
| 19 | P42684 | 99.0 | 1,060 | ABL2 (ARG proto-oncogene kinase) |
| 20 | F1LM93 | 97.6 | 1,063 | Kinase family |
Key network features:
- Central hub proteins: GRB2, CRK, CRKL, ABI1 (adaptor/signaling proteins)
- Kinase interactions: HSP90 chaperones, PI3K/JAK/MAPK pathway members
- Known BCR-ABL fusion partners: BCR, ETV6 (fusion in leukemia)
- Regulatory proteins: TP53, BECN1, CBL (ubiquitin ligase for degradation)
- Ras signaling: KRAS, NRAS, GTPase effectors
Transcription factor regulatory data
ABL1 is NOT a transcription factor. ABL1 encodes a non-receptor tyrosine kinase (Abelson tyrosine-protein kinase 1), a signaling enzyme rather than a DNA-binding transcriptional regulator. Therefore, DNA binding motifs and direct transcriptional target identification do not apply.
Upstream Regulators of ABL1
Total: 41 transcription factors regulate ABL1 gene expression
Regulators with high prediction confidence:
- AP1
- CUX1
- GFI1
- HIF1A
- IRF8
- JUN
- PAX5
- RELA
- TBX21
- TXK
Regulators with low prediction confidence:
- BCL6, EGR1, ESR1, HAND2, HHEX, HR, KAT5, MYC, NFKB, PAX1, RUNX1, SALL2, SATB1, SP1, SPI1, STAT1, STAT5A, TCF3, TFAP2A, TP53, ZHX2
Evidence type: Predicted from CollecTRI (curated transcriptional regulation database)
Note on Downstream Effects
While ABL1 does not function as a transcription factor, it regulates gene expression indirectly through kinase activity (phosphorylation). Documented downstream targets include: BAX, BCL2, BCL6, CCND2, CDKN1A, CSF1, FOXO3, JUN, PIM1, and TP53, with effects on apoptosis, cell cycle, and differentiation pathways.
Drug & pharmacology data
ABL1 is a well-established and heavily studied drug target with 3,120+ molecules in ChEMBL.
Approved molecules targeting ABL1 (5 phase 4 drugs)
| Molecule | ID | Mechanism | Highest Phase | Clinical Indication(s) |
|---|---|---|---|---|
| Imatinib | CHEMBL941 | ABL1 tyrosine kinase inhibitor | 4 | Chronic myeloid leukemia (CML), gastrointestinal stromal tumor (GIST), acute lymphoblastic leukemia |
| Dasatinib | CHEMBL5416410 | Multi-kinase inhibitor (ABL1, SRC family) | 4 | Ph+ CML, Ph+ ALL |
| Nilotinib | CHEMBL255863 | ABL1 kinase inhibitor | 4 | Ph+ CML, GIST, Ph+ ALL |
| Ponatinib | CHEMBL1171837 | Multi-target ABL1 inhibitor (including T315I resistant mutant) | 4 | Ph+ CML, Ph+ ALL |
| Bosutinib | CHEMBL288441 | ABL1/SRC family kinase inhibitor | 4 | Ph+ CML, Ph+ ALL |
Top 20 clinical trials targeting ABL1
| Trial ID | Title | Phase | Status | Primary Intervention |
|---|---|---|---|---|
| NCT00481247 | Dasatinib vs Imatinib in newly diagnosed CML-CP | Phase 3 | COMPLETED | Dasatinib vs Imatinib |
| NCT00471497 | Imatinib vs Nilotinib in newly diagnosed Ph+ CML-CP | Phase 3 | COMPLETED | Imatinib vs Nilotinib |
| NCT00574873 | Bosutinib vs Imatinib in newly diagnosed Ph+ CML | Phase 3 | COMPLETED | Bosutinib vs Imatinib |
| NCT00760877 | Nilotinib vs Imatinib - kinetics of molecular response CML-CP | Phase 3 | COMPLETED | Nilotinib vs Imatinib |
| NCT00802841 | Imatinib dose optimization vs Nilotinib in CML suboptimal response | Phase 3 | COMPLETED | Nilotinib vs optimized Imatinib |
| NCT01275196 | Nilotinib vs Imatinib in newly diagnosed Chinese Ph+ CML-CP | Phase 3 | COMPLETED | Nilotinib vs Imatinib |
| NCT02130557 | Bosutinib vs Imatinib in newly diagnosed CML-CP | Phase 3 | COMPLETED | Bosutinib vs Imatinib |
| NCT03589326 | Ponatinib vs Imatinib in acute lymphoblastic leukemia | Phase 3 | ACTIVE_NOT_RECRUITING | Ponatinib vs Imatinib |
| NCT04722848 | Sequential Ponatinib+Blinatumomab vs Chemotherapy+Imatinib in Ph+ ALL | Phase 3 | ACTIVE_NOT_RECRUITING | Ponatinib+Blinatumomab vs Chemotherapy+Imatinib |
| NCT04971226 | Asciminib vs other TKIs in newly diagnosed Ph+ CML-CP | Phase 3 | ACTIVE_NOT_RECRUITING | Asciminib vs other tyrosine kinase inhibitors |
| NCT00081926 | Gleevec (Imatinib) in newly diagnosed CML chronic phase | Phase 4 | COMPLETED | Imatinib |
| NCT00481247 | Phase III Dasatinib vs Imatinib CML-CP | Phase 3 | COMPLETED | Dasatinib vs Imatinib |
| NCT01460693 | Imatinib vs Dasatinib in newly-diagnosed CML | Phase 3 | COMPLETED | Imatinib vs Dasatinib |
| NCT01819389 | Low-dose Nilotinib+Imatinib in CML treatment failure/intolerance | Phase 3 | COMPLETED | Combined Nilotinib+Imatinib |
| NCT02272777 | Imatinib and Nilotinib in CML-CP | Phase 3 | COMPLETED | Imatinib and Nilotinib combination |
| NCT00844298 | Nilotinib + chemotherapy in newly diagnosed Ph+ ALL | Phase 2 | COMPLETED | Nilotinib + chemotherapy |
| NCT00390793 | Dasatinib + chemotherapy in Ph+ or BCR-ABL+ ALL | Phase 2 | COMPLETED | Dasatinib + chemotherapy |
| NCT00287105 | Imatinib added to chemotherapy in Ph+ ALL children | Phase 2 | COMPLETED | Imatinib + chemotherapy |
| NCT01528085 | Nilotinib + chemotherapy in elderly Ph+ ALL | Phase 2 | COMPLETED | Nilotinib + chemotherapy |
| NCT06124157 | Dasatinib or Imatinib + chemotherapy + Blinatumomab in Ph+/Ph-like B-ALL | Phase 3 | RECRUITING | Dasatinib or Imatinib + chemotherapy + Blinatumomab |
Pharmacogenomics & dosing
VIP Gene Status: ABL1 is designated as a VIP (Very Important Pharmacogene) in PharmGKB with annotated variants.
Key genetic markers:
- BCR-ABL1 fusion (Philadelphia chromosome): Diagnostic hallmark of CML and Ph+ ALL; defines treatment with ABL1 inhibitors
- ABL1 kinase domain mutations: Confer resistance to TKIs; specific mutations require escalation to higher-generation inhibitors:
- T315I mutation → requires ponatinib (only approved ABL1 inhibitor effective against this mutation)
- Other mutations → may respond to dasatinib, nilotinib, or bosutinib
Dosing considerations (based on trial protocols):
- Imatinib: 400 mg daily standard; 600 mg daily high-dose for resistance
- Dasatinib: 100 mg daily or 50 mg twice daily; reduced dosing (70 mg) in tolerability studies
- Nilotinib: 300 mg twice daily standard
- Ponatinib: 45 mg daily (approved dose)
- Bosutinib: Dosing adjusted for tolerability
Note: Extensive patient pharmacokinetic/pharmacodynamic monitoring (molecular response assessment) and mutation testing guide treatment optimization across all approved ABL1 inhibitors.
Expression profiles
Tissue Expression (Bgee)
ABL1 shows ubiquitous expression across human tissues with an average expression score of 86.77 (max: 98.8). Expression is enriched in neuronal tissues, smooth muscle, and reproductive organs.
| Rank | Tissue | Expression Score | Quality |
|---|---|---|---|
| 1 | Frontal pole | 98.80 | Gold |
| 2 | Paraflocculus | 98.69 | Gold |
| 3 | Middle frontal gyrus | 98.68 | Gold |
| 4 | Muscle layer of sigmoid colon | 96.54 | Gold |
| 5 | Mucosa of stomach | 96.22 | Gold |
| 6 | Lower esophagus muscularis layer | 95.99 | Gold |
| 7 | Lower esophagus | 95.96 | Gold |
| 8 | Esophagogastric junction muscularis propria | 95.87 | Gold |
| 9 | Stromal cell of endometrium | 95.79 | Gold |
| 10 | Left uterine tube | 95.62 | Gold |
| 11 | Ventricular zone | 95.40 | Gold |
| 12 | Body of uterus | 95.13 | Gold |
| 13 | Ganglionic eminence | 95.11 | Gold |
| 14 | Tibial artery | 95.05 | Gold |
| 15 | Popliteal artery | 95.05 | Gold |
| 16 | Endocervix | 94.94 | Gold |
| 17 | Saphenous vein | 94.81 | Gold |
| 18 | Sigmoid colon | 94.22 | Gold |
| 19 | Aorta | 94.18 | Gold |
| 20 | Right ovary | 94.00 | Gold |
| 21 | Endometrium epithelium | 93.83 | Gold |
| 22 | Left ovary | 93.82 | Gold |
| 23 | Fundus of stomach | 93.58 | Gold |
| 24 | Right coronary artery | 93.55 | Gold |
| 25 | Transverse colon | 93.54 | Gold |
| 26 | Smooth muscle tissue | 93.52 | Gold |
| 27 | Right lobe of thyroid gland | 93.17 | Gold |
| 28 | Descending thoracic aorta | 93.15 | Gold |
| 29 | Gallbladder | 93.15 | Gold |
| 30 | Pericardium | 93.14 | Gold |
Key patterns: Prominent expression in CNS structures (frontal cortex, cerebellum), smooth muscle tissues, gastrointestinal tract muscularis layers, and reproductive organs. Expression in 283/292 conditions (96.9% present calls).
Cell Type Expression
No comprehensive cell-type-specific expression data from Tabula Sapiens or HCA available in current databases. SCXA single-cell data is limited to 1 dataset.
Single-Cell Expression Datasets
| Dataset ID | Description | Species | Cell Count |
|---|---|---|---|
| E-MTAB-6058 | Human foetal L/M-opsin cone photoreceptor populations (AAV2/9.pR2.1.GFP reporter) | Homo sapiens | 74 |
Notes: ABL1 detected in 141 cell clusters across available single-cell experiments; maximum mean expression: 17.42. Limited single-cell atlas coverage suggests need for tissue-specific proteomic or immunohistochemical validation for cell-type specificity.
Disease associations
Mendelian / Monogenic Diseases
Via GenCC (curated gene-disease associations):
| Disease | Disease ID | Inheritance | Evidence Level |
|---|---|---|---|
| Congenital heart defects and skeletal malformations syndrome | OMIM:617602 | Autosomal dominant | Strong |
| Connective tissue disorder | MONDO:0003900 | Autosomal dominant | Moderate |
| Bone development disease | MONDO:0005497 | Autosomal recessive | Limited |
Via ClinVar & Mondo:
- Chronic myelogenous leukemia, BCR-ABL1 positive (MONDO:0011996)
- Congenital heart disease (MONDO:0005453)
- T-cell acute lymphoblastic leukemia (MONDO:0004963)
- Cardiomyopathy (MONDO:0004994)
- Cleft palate (MONDO:0016064)
- Lip and oral cavity carcinoma (MONDO:0023644)
- Citrullinemia (MONDO:0015991)
- Microcephaly (MONDO:0001149)
- Muscular dystrophy-dystroglycanopathy variants (MONDO:0000171, MONDO:0012248, MONDO:0013159)
Via Orphanet:
- Chronic myeloid leukemia (ORPHA:521)
- Marfanoid habitus-facial dysmorphism-skeletal abnormality-heart defect syndrome (ORPHA:643503)
- Walker-Warburg syndrome (ORPHA:899)
- POMT1-related limb-girdle muscular dystrophy R11 (ORPHA:86812)
- Moyamoya angiopathy (ORPHA:477768)
Phenotype Associations (Top 30 HPO Terms)
- HP:0000006 | Autosomal dominant inheritance
- HP:0005506 | Chronic myelogenous leukemia
- HP:0004848 | Ph-positive acute lymphoblastic leukemia
- HP:0001974 | Increased total leukocyte count
- HP:0001873 | Thrombocytopenia
- HP:0001894 | Thrombocytosis
- HP:0001263 | Global developmental delay
- HP:0001508 | Failure to thrive
- HP:0001629 | Ventricular septal defect
- HP:0001631 | Atrial septal defect
- HP:0001680 | Coarctation of aorta
- HP:0002616 | Aortic root aneurysm
- HP:0004322 | Short stature
- HP:0001511 | Intrauterine growth retardation
- HP:0001166 | Arachnodactyly
- HP:0001382 | Joint hypermobility
- HP:0001763 | Pes planus
- HP:0000767 | Pectus excavatum
- HP:0002650 | Scoliosis
- HP:0002808 | Kyphosis
- HP:0002023 | Anal atresia
- HP:0000776 | Congenital diaphragmatic hernia
- HP:0002566 | Intestinal malrotation
- HP:0000218 | High palate
- HP:0000337 | Broad forehead
- HP:0002007 | Frontal bossing
- HP:0000582 | Upslanted palpebral fissure
- HP:0000490 | Deeply set eye
- HP:0003189 | Long nose
- HP:0001744 | Splenomegaly
Complex-Disease / GWAS Associations (Top 6)
| Trait | P-Value | Chromosome |
|---|---|---|
| Platelet count | 7.0e-13 | 9 |
| White blood cell count | 8.0e-12 | 9 |
| Lymphocyte count | 9.0e-11 | — |
| Red cell distribution width | 1.0e-10 | 9 |
| Coronary artery calcified atherosclerotic plaque score in type 2 diabetes | 5.0e-06 | 9 |
| Response to amphetamines | 8.0e-06 | 9 |