KRAS Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human KRAS — a definitive lookup resource covering: ### …

Provide a comprehensive cross-database identifier and functional mapping reference for human KRAS — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene KRAS, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene KRAS, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene KRAS protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene KRAS protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene KRAS, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene KRAS, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene KRAS, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene KRAS protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene KRAS, summarize transcription factor regulatory data. If KRAS is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate KRAS — names with evidence type (ChIP-seq / predicted / experimentally validated) If KRAS is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene KRAS protein as a drug target, summarize pharmacology data. If KRAS is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If KRAS is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene KRAS, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene KRAS, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in KRAS: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations

KRAS

Executive summary

KRAS (HGNC:6407; chr12:25,205,246–25,250,936, reverse strand) encodes a small GTPase and is one of the most clinically important oncogenes in human cancer. It is ubiquitously expressed across tissues (Bgee average score 86.92/100) and participates in 71 Reactome pathways centered on RAS/MAPK and RAF signaling. Somatic mutations — predominantly at Gly12 and Gly13 — drive pancreatic, lung, colorectal, and gastric cancers, while germline mutations cause RASopathies including Noonan syndrome 3 and Cardiofaciocutaneous syndrome 2. Among ~565 ClinVar variants, roughly 12% are classified pathogenic or likely pathogenic. The protein is exceptionally well-characterized structurally, with 462 PDB entries spanning wild-type and mutant forms, and an AlphaFold model scoring 91.65 pLDDT. Two FDA-approved covalent G12C inhibitors — sotorasib (2021) and adagrasib (2023) — now anchor a large clinical pipeline of 80+ active trials, and KRAS mutation status is an established predictive biomarker for response to cetuximab and panitumumab in colorectal cancer.

Gene identifiers

  • HGNC ID: HGNC:6407
  • Approved symbol: KRAS
  • Ensembl gene ID: ENSG00000133703
  • NCBI Entrez Gene ID: 3845
  • OMIM gene/locus ID: 190070
  • Genomic location (GRCh38):
    • Chromosome: 12
    • Start position: 25,205,246 bp
    • End position: 25,250,936 bp
    • Strand: − (reverse)

Transcript identifiers

Ensembl transcripts (16 total)

Transcript IDBiotype
ENST00000256078protein_coding
ENST00000311936protein_coding
ENST00000556131protein_coding
ENST00000557334protein_coding
ENST00000685328protein_coding
ENST00000686877nonsense_mediated_decay
ENST00000686969protein_coding
ENST00000687356nonsense_mediated_decay
ENST00000688228retained_intron
ENST00000688940protein_coding
ENST00000690406nonsense_mediated_decay
ENST00000690804nonsense_mediated_decay
ENST00000692768protein_coding
ENST00000693229protein_coding
ENST00000894155protein_coding
ENST00000940422protein_coding

Protein coding: 11 | Nonsense mediated decay: 4 | Retained intron: 1

RefSeq mRNA accessions (15 total)

AccessionMANE Select
NM_001003744
NM_001362494
NM_001369786
NM_001369787
NM_001403240
NM_001403241
NM_001403242
NM_001403243
NM_001403244
NM_001403245
NM_001403246
NM_004985
NM_021284
NM_031515
NM_033360

CCDS identifiers (2 total)

  • CCDS8702
  • CCDS8703

MANE Select / Canonical transcript: ENST00000311936 (NM_004985)

Exon coordinates on chromosome 12 (reverse strand):

Exon IDStartEnd
ENSE000039035432525075125250929
ENSE000009366172524527425245395
ENSE000017198092522723425227412
ENSE000016448182522561425225773
ENSE000024569762520524625209911

Total exons: 5

Protein identifiers

UniProt Accessions

  • P01116 (canonical reviewed entry)
  • A0A8I5KQ21 (unreviewed isoform)
  • A0A8I5KQU3 (unreviewed isoform)
  • A0A8I5KR86 (unreviewed isoform)
  • A0A8I5KUB5 (unreviewed isoform)
  • A0A8I5KXN3 (unreviewed isoform)
  • A0A8I5KYH6 (unreviewed isoform)
  • G3V4K2 (unreviewed)
  • G3V5T7 (unreviewed)
  • L7RSL8 (unreviewed)

RefSeq Protein (NP_) Accessions

  • NP_001003744
  • NP_001349423
  • NP_001356715 (REVIEWED)
  • NP_001356716 (REVIEWED)
  • NP_001390169 (VALIDATED)
  • NP_001390170 (VALIDATED)
  • NP_001390171 (VALIDATED)
  • NP_001390172 (VALIDATED)
  • NP_001390173 (VALIDATED)
  • NP_001390174 (VALIDATED)
  • NP_001390175 (VALIDATED)
  • NP_004976 (REVIEWED, MANE Select)
  • NP_067259 (VALIDATED)
  • NP_113703 (PROVISIONAL)
  • NP_203524 (REVIEWED)

Protein Domains and Families

IDNameType
IPR001806Small_GTPaseFamily
IPR005225Small_GTP-bdDomain
IPR020849Small_GTPase_Ras-typeFamily
IPR027417P-loop_NTPaseHomologous_superfamily
PF00071RasDomain (Pfam)
SM00173SMART domain
SM00174SMART domain
SM00175SMART domain
SM00176SMART domain

Antibody Resources

No antibody entries found in the biobtree antibody database for KRAS.

Structure

Experimental Structures

Total PDB Entries: 462

X-Ray Diffraction

High-resolution crystal structures covering wild-type and mutant KRAS (G12C, G12D, G12V, G13D, Q61H, Q61L, G12A, G12R, G12S, and many others), with or without bound nucleotides (GDP, GTP, GTP analogs) and/or ligands. Resolution range: 0.953 Å to 3.87 Å. Examples include:

  • 9IAY (0.953 Å), 9IAW (0.997 Å), 8ONV (1.01 Å), 8AZX (1.04 Å), 8AZV (1.05 Å) — among highest resolution structures
  • Representative structures across conformations and mutations present in PDB

Solution NMR

Structures: 2MSC, 2MSD, 2MSE, 6CC9, 6CCH, 6CCX, 6PTS, 6PTW, 6V5L, 6W4E, 6W4F, 7KYZ, 7LGI, 7MQU, 8PI0, 8PIY

  • NMR data-driven models of KRAS bound to lipid nanodiscs
  • HADDOCK models with small molecule inhibitors

Electron Microscopy (Cryo-EM)

Structures: 7KFZ (3.47 Å), 8DGS (4.3 Å), 8DGT (3.9 Å), 8G42 (3.02 Å), 8G47 (3.19 Å), 8G4F (2.91 Å), 8G4H (2.87 Å), 8QU8 (3.5 Å), 8STF (3.14 Å), 8UDR (3.1 Å), 8VGQ (2.8 Å), 8VR9 (3.06 Å), 8VRA (3.12 Å), 8VRB (3.25 Å), 9I5E (3.77 Å), 9NFB (3.23 Å), 9NFC (2.58 Å), 9O55 (2.88 Å)

  • Complexes with RAF, PDE-delta, antibodies, nanobodies, and imaging scaffolds
  • Resolution range: 2.58 Å to 4.3 Å

Predicted Structures

AlphaFold Model:

  • Model ID: AF-P01116-F1
  • Global pLDDT Score: 91.65 (Very High Confidence)
  • Breakdown by confidence:
    • pLDDT ≥80 (Very High): 78.0%
    • pLDDT 70-79 (Confident): 15.0%
    • pLDDT 50-69 (Low): 5.3%
    • pLDDT <50 (Very Low): 1.7%
  • AlphaFold Version: 4

Cross-species orthologs

OrganismGene IDSymbol
Mouse (Mus musculus)ENSMUSG00000030265Kras
Rat (Rattus norvegicus)ENSRNOG00000081425Kras
Zebrafish (Danio rerio)ENSDARG00000010844kras
Fruit fly (Drosophila melanogaster)nonenone
Worm (C. elegans)nonenone
Yeast (S. cerevisiae)nonenone

Clinical variants & AI predictions

ClinVar Annotation Summary

Total variants: ~565

Classification breakdown:

ClassificationCount%
Pathogenic~458%
Likely Pathogenic~254%
Uncertain Significance~20035%
Likely Benign~12021%
Benign~8014%
Conflicting classifications~509%
Not classified/provided~509%

Top 30 Pathogenic/Likely Pathogenic Variants:

ClinVar IDHGVSClassificationAssociated Condition
12584c.34G>A (p.Gly12Ser)PathogenicCarcinoma, Somatic
12579c.34G>C (p.Gly12Arg)PathogenicCarcinoma, Somatic
12580c.38G>A (p.Gly13Asp)PathogenicNoonan Syndrome 3
12582c.35G>A (p.Gly12Asp)PathogenicCarcinoma, Somatic
12583c.35G>T (p.Gly12Val)PathogenicCarcinoma, Somatic
12586c.178G>C (p.Gly60Arg)PathogenicCardiofaciocutaneous Syndrome
12587c.458A>T (p.Asp153Val)PathogenicCardiofaciocutaneous Syndrome
12588c.173C>T (p.Thr58Ile)PathogenicNoonan Syndrome 3
12589c.40G>A (p.Val14Ile)PathogenicNoonan Syndrome 3
163758c.466T>A (p.Phe156Ile)PathogenicNoonan Syndrome 3
163766c.179G>T (p.Gly60Val)PathogenicCarcinoma, Somatic
163768c.108A>G (p.Ile36Met)PathogenicNoonan Syndrome 3
177777c.181C>A (p.Gln61Lys)PathogenicCarcinoma, Somatic
217822c.57G>C (p.Leu19Phe)PathogenicCarcinoma, Somatic
2716407c.467T>C (p.Phe156Ser)PathogenicCarcinoma, Somatic
12590c.101C>G (p.Pro34Arg)PathogenicNoonan Syndrome 3
12591c.*9T>GPathogenicCarcinoma, Somatic
12593c.37G>C (p.Gly13Arg)PathogenicCarcinoma, Somatic
12594c.15A>T (p.Lys5Asn)PathogenicNoonan Syndrome 3
12595c.*22C>GPathogenicCarcinoma, Somatic
12596c.13A>G (p.Lys5Glu)PathogenicNoonan Syndrome 3
12597c.178G>A (p.Gly60Ser)PathogenicCarcinoma, Somatic
1016209c.214A>C (p.Met72Leu)Likely PathogenicNoonan Syndrome 3
12578c.34G>T (p.Gly12Cys)Likely PathogenicCarcinoma, Somatic
1172833c.197_198insTGACCTGCTPathogenicCarcinoma, Somatic
1177293c.197_223dup (p.Ala66_Thr74dup)Likely PathogenicNoonan Syndrome 3
1327505c.445A>T (p.Arg149Ter)Likely PathogenicNoonan Syndrome 3
1676497c.229G>T (p.Gly77Cys)Likely PathogenicNoonan Syndrome 3
1691380c.173_214dup (p.Thr58_Tyr71dup)Likely PathogenicNoonan Syndrome 3
179141c.214A>T (p.Met72Leu)PathogenicCarcinoma, Somatic

Splice Effect Predictions (SpliceAI)

Total predictions: 1,269

Top 30 highest-scoring splice effects:

VariantGeneEffect TypeDelta Score
12:25209908:CACC>CKRASacceptor_gain1.0000
12:25209910:CC>CKRASacceptor_gain1.0000
12:25209911:CC>CKRASacceptor_gain1.0000
12:25209913:T>AKRASacceptor_loss1.0000
12:25209910:CCC>CKRASacceptor_loss1.0000
12:25209912:CTGA>CKRASacceptor_loss1.0000
12:25213719:A>CKRASacceptor_gain0.9900
12:25209909:ACC>AKRASacceptor_gain0.9800
12:25209907:ACACC>AKRASacceptor_gain0.9200
12:25209914:G>CKRASacceptor_loss0.9200
12:25209908:CACCC>CKRASacceptor_gain0.9200
12:25209917:A>CKRASacceptor_loss0.9700
12:25209920:C>CTKRASacceptor_loss0.8700
12:25209920:CA>CKRASacceptor_gain0.8000
12:25209917:A>ACKRASacceptor_gain0.8200
12:25209916:A>CKRASacceptor_loss0.6600
12:25209915:AAA>AKRASacceptor_loss0.4100
12:25213109:A>CKRASdonor_gain0.7600
12:25211429:G>CKRASdonor_gain0.7000
12:25209921:A>CKRASacceptor_gain0.7800
12:25212360:T>TAKRASdonor_gain0.5400
12:25212265:A>ACKRASacceptor_gain0.4700
12:25212168:ACTCC>AKRASdonor_gain0.4700
12:25212266:C>CCKRASacceptor_gain0.4800
12:25213109:A>ACKRASacceptor_gain0.4900
12:25210837:TTTCA>TKRASdonor_gain0.3800
12:25210732:GTCTT>GKRASacceptor_gain0.3300
12:25213122:CTTCT>CKRASdonor_gain0.4000
12:25211311:AAG>AKRASdonor_gain0.2300
12:25211573:GTCT>GKRASacceptor_gain0.3700

AlphaMissense Predictions (Likely Pathogenic)

Total missense variants: 1,253
Likely pathogenic predictions: ~250

Top 30 likely pathogenic variants with am_pathogenicity scores:

VariantProtein ChangeAM ScoreClassification
12:25225713:A:CK117N1.000likely_pathogenic
12:25225713:T:GK117N1.000likely_pathogenic
12:25225714:T:AK117I1.000likely_pathogenic
12:25225715:T:CK117E1.000likely_pathogenic
12:25225723:A:TV114E0.998likely_pathogenic
12:25225716:A:CN116K1.000likely_pathogenic
12:25225716:A:TN116K1.000likely_pathogenic
12:25225720:C:TG115E0.999likely_pathogenic
12:25225664:C:GA134P0.998likely_pathogenic
12:25225720:C:AG115V0.998likely_pathogenic
12:25225663:G:TA134E0.996likely_pathogenic
12:25225633:G:TT144K0.996likely_pathogenic
12:25225718:T:AN116Y0.998likely_pathogenic
12:25225718:T:CN116D0.998likely_pathogenic
12:25225718:T:GN116H0.998likely_pathogenic
12:25225721:C:GG115R0.998likely_pathogenic
12:25225721:C:TG115R0.998likely_pathogenic
12:25225676:C:GA130P0.989likely_pathogenic
12:25225709:C:GD119H0.998likely_pathogenic
12:25225707:A:TD119E0.994likely_pathogenic
12:25225708:T:AD119V0.998likely_pathogenic
12:25225708:T:CD119G0.998likely_pathogenic
12:25225708:T:GD119A0.998likely_pathogenic
12:25225709:C:AD119Y0.995likely_pathogenic
12:25225710:A:CC118W0.858likely_pathogenic
12:25215535:A:CL159W0.999likely_pathogenic
12:25215534:C:AL159F0.991likely_pathogenic
12:25215534:C:GL159F0.991likely_pathogenic
12:25215535:A:GL159S0.987likely_pathogenic
12:25225676:C:TA130T0.799likely_pathogenic

Pathways & Gene Ontology

Reactome Pathways

Total: 71 pathways

Pathway IDPathway Name
R-HSA-6802946Signaling by moderate kinase activity BRAF mutants
R-HSA-6802955Paradoxical activation of RAF signaling by kinase inactive BRAF
R-HSA-8951936RUNX3 regulates p14-ARF
R-HSA-9648002RAS processing
R-HSA-9649948Signaling downstream of RAS mutants
R-HSA-9674555Signaling by CSF3 (G-CSF)
R-HSA-9680350Signaling by CSF1 (M-CSF) in myeloid cells
R-HSA-9753510Signaling by RAS GAP mutants
R-HSA-9753512Signaling by RAS GTPase mutants
R-HSA-112412SOS-mediated signalling
R-HSA-1169092Activation of RAS in B cells
R-HSA-1236382Constitutive Signaling by Ligand-Responsive EGFR Cancer Variants
R-HSA-1250196SHC1 events in ERBB2 signaling
R-HSA-1250347SHC1 events in ERBB4 signaling
R-HSA-1433557Signaling by SCF-KIT
R-HSA-167044Signalling to RAS
R-HSA-171007p38MAPK events
R-HSA-179812GRB2 events in EGFR signaling
R-HSA-180336SHC1 events in EGFR signaling
R-HSA-186763Downstream signal transduction
R-HSA-1963640GRB2 events in ERBB2 signaling
R-HSA-210993Tie2 Signaling
R-HSA-2179392EGFR Transactivation by Gastrin
R-HSA-2424491DAP12 signaling
R-HSA-2428933SHC-related events triggered by IGF1R
R-HSA-2871796FCERI mediated MAPK activation
R-HSA-375165NCAM signaling for neurite out-growth
R-HSA-4086398Ca2+ pathway
R-HSA-442982Ras activation upon Ca2+ influx through NMDA receptor
R-HSA-5218921VEGFR2 mediated cell proliferation
R-HSA-5621575CD209 (DC-SIGN) signaling
R-HSA-5637810Constitutive Signaling by EGFRvIII
R-HSA-5654688SHC-mediated cascade:FGFR1
R-HSA-5654693FRS-mediated FGFR1 signaling
R-HSA-5654699SHC-mediated cascade:FGFR2
R-HSA-5654700FRS-mediated FGFR2 signaling
R-HSA-5654704SHC-mediated cascade:FGFR3
R-HSA-5654706FRS-mediated FGFR3 signaling
R-HSA-5654712FRS-mediated FGFR4 signaling
R-HSA-5654719SHC-mediated cascade:FGFR4
R-HSA-5655253Signaling by FGFR2 in disease
R-HSA-5655291Signaling by FGFR4 in disease
R-HSA-5655302Signaling by FGFR1 in disease
R-HSA-5655332Signaling by FGFR3 in disease
R-HSA-5658442Regulation of RAS by GAPs
R-HSA-5673000RAF activation
R-HSA-5673001RAF/MAP kinase cascade
R-HSA-5674135MAP2K and MAPK activation
R-HSA-5675221Negative regulation of MAPK pathway
R-HSA-6802948Signaling by high-kinase activity BRAF mutants
R-HSA-6802952Signaling by BRAF and RAF1 fusions
R-HSA-6802953RAS signaling downstream of NF1 loss-of-function variants
R-HSA-74751Insulin receptor signalling cascade
R-HSA-8849471PTK6 Regulates RHO GTPases, RAS GTPase and MAP kinases
R-HSA-8851805MET activates RAS signaling
R-HSA-9026519Activated NTRK2 signals through RAS
R-HSA-9027284Erythropoietin activates RAS
R-HSA-9028731Activated NTRK2 signals through FRS2 and FRS3
R-HSA-9034864Activated NTRK3 signals through RAS
R-HSA-9607240FLT3 Signaling
R-HSA-9634285Constitutive Signaling by Overexpressed ERBB2
R-HSA-9634635Estrogen-stimulated signaling through PRKCZ
R-HSA-9656223Signaling by RAF1 mutants
R-HSA-9664565Signaling by ERBB2 KD Mutants
R-HSA-9665348Signaling by ERBB2 ECD mutants
R-HSA-9665686Signaling by ERBB2 TMD/JMD mutants
R-HSA-9670439Signaling by phosphorylated juxtamembrane, extracellular and kinase domain KIT mutants
R-HSA-9673767Signaling by PDGFRA transmembrane, juxtamembrane and kinase domain mutants
R-HSA-9673770Signaling by PDGFRA extracellular domain mutants
R-HSA-9703465Signaling by FLT3 fusion proteins
R-HSA-9703648Signaling by FLT3 ITD and TKD mutants

Gene Ontology Annotations

Biological Process: 31 terms

GO IDTerm
GO:0000165MAPK cascade
GO:0001889Liver development
GO:0007265Ras protein signal transduction
GO:0007565Female pregnancy
GO:0008542Visual learning
GO:0009629Response to gravity
GO:0010467Gene expression
GO:0010628Positive regulation of gene expression
GO:0014009Glial cell proliferation
GO:0016601Rac protein signal transduction
GO:0019221Cytokine-mediated signaling pathway
GO:0021897Forebrain astrocyte development
GO:0030036Actin cytoskeleton organization
GO:0030857Negative regulation of epithelial cell differentiation
GO:0032228Regulation of synaptic transmission, GABAergic
GO:0035022Positive regulation of Rac protein signal transduction
GO:0035900Response to isolation stress
GO:0035914Skeletal muscle cell differentiation
GO:0043524Negative regulation of neuron apoptotic process
GO:0048169Regulation of long-term neuronal synaptic plasticity

Molecular Function: 9 terms

GO IDTerm
GO:0003924GTPase activity
GO:0003925G protein activity
GO:0005525GTP binding
GO:0019002GMP binding
GO:0019003GDP binding
GO:0030275LRR domain binding
GO:0042802Identical protein binding
GO:0043495Protein-membrane adaptor activity
GO:0044877Protein-containing complex binding

Cellular Component: 9 terms

GO IDTerm
GO:0000139Golgi membrane
GO:0005737Cytoplasm
GO:0005741Mitochondrial outer membrane
GO:0005789Endoplasmic reticulum membrane
GO:0005829Cytosol
GO:0005886Plasma membrane
GO:0005925Focal adhesion
GO:0009898Cytoplasmic side of plasma membrane
GO:0016020Membrane

Protein interactions & networks

Protein-Protein Interactions (STRING, IntAct, BioGRID)

Total interaction count: ~14,451 interactions across databases

  • STRING: 10,098 interactions
  • BioGRID: 4,236 interactions
  • IntAct: 117 manually curated interactions
  • Additional sources: Signor (46), MINT, DIP, and others

TOP 30 highest-confidence interacting proteins (STRING network):

RankProteinUniProtSTRING ScoreEvidence
1JUNDQ9NZT1984Neighborhood, co-expression, database, experiments
2MAP3K5P27482983Neighborhood, co-expression, database, experiments
3GRB2-associated protein 1Q8TD86982Co-expression, database, experiments
4TP53INP1Q96GE6982Co-expression, database, experiments
5KSR1Q12967981Co-expression, database, experiments
6HRASP02593978Neighborhood, co-expression, database, experiments
7MYCP04049976Co-expression, database, experiments
8SRCP15056974Neighborhood, co-expression, database, experiments
9KRASQ07889966Neighborhood, co-expression, database
10JUNP17931953Co-expression, database, experiments
11GAB2P42336952Co-expression, database, experiments
12TP53P04637948Co-expression, database, experiments
13RAC1P60484930Co-expression, database, experiments
14PEBP1P21359926Neighborhood, co-expression, database
15ERK2P04626923Neighborhood, co-expression, database, experiments
16EGFRP00533919Neighborhood, co-expression, database, experiments
17RALGDSP42771918Neighborhood, co-expression, database
18CAP1Q13485909Co-expression, database, experiments
19NRASQ06124896Neighborhood, co-expression, database, experiments
20RAF1P31749894Neighborhood, co-expression, database, experiments
21HNRNPMQ9UM73887Co-expression, database, experiments
22RBM15Q02750883Co-expression, database
23IGF1RQ9NS23880Co-expression, database, experiments
24KIF5BQ15831872Co-expression, database, experiments
25BRAFO14497869Neighborhood, co-expression, database, experiments
26SORBS1P40692868Co-expression, database
27PRKCBP12931866Co-expression, database, experiments
28NRASP01106865Neighborhood, co-expression, database, experiments
29HRASP07557865Neighborhood, co-expression, database, experiments
30PLK1P35222858Co-expression, database, experiments

Key IntAct physical associations (high confidence):

  • PIP5K1A↔KRAS: 0.660 (physical association, direct interaction)
  • RAF1↔KRAS: 0.690 (physical association)
  • BRAF↔KRAS: 0.680 (proximity)
  • ARAF↔KRAS: 0.560 (physical association)
  • RHOA↔KRAS: 0.520 (physical association)
  • ARL6IP5↔KRAS: 0.520 (physical association)

Protein Similarity Networks

Structural/Embedding Similarity (ESM2) - TOP 20 similar proteins:

RankProteinUniProtSimilarity ScoreSpecies
1KRAS (human)P011160.9999Homo sapiens
2KRASP086441.0000Mus musculus (mouse)
3KRASP612231.0000Drosophila melanogaster (fruit fly)
4KRASP086421.0000Xenopus laevis (frog)
5KRASP201711.0000Gallus gallus (chicken)
6KRASP011121.0000Canis lupus (dog)
7HRASP011111.0000Homo sapiens (human)
8NRASP011130.9989Homo sapiens (human)
9RAS1P328831.0000Saccharomyces cerevisiae (yeast)
10KRASB3M1851.0000Drosophila melanogaster
11KRASP086461.0000Canis lupus
12KRASB4PUP51.0000Drosophila melanogaster
13KRASB4HKC71.0000Caenorhabditis elegans
14RAS (mammalian)P057740.9973Bos taurus (cattle)
15KRASO422770.9997Xenopus laevis
16KRASQ079830.9998Arabidopsis thaliana
17KRASP798000.9998Rattus norvegicus (rat)
18KRASP011170.9954Rattus norvegicus
19KRASB4JFU80.9999Dictyostelium discoideum
20KRASA6NIZ10.9971Plasmodium berghei

Sequence Homology (DIAMOND BLAST) - TOP 20 homologous proteins:

RankProteinUniProtIdentity (%)Bit ScoreSpecies
1KRASP08642100.00378Xenopus laevis
2KRASA5A6J7100.00362Drosophila melanogaster
3KRASP61223100.00362Drosophila melanogaster
4KRASP61224100.00362Drosophila melanogaster
5KRASA8NU18100.00431Caenorhabditis elegans
6KRASP32883100.00374Saccharomyces cerevisiae
7HRASP01111100.00379Homo sapiens (human)
8NRASP01112100.00383Homo sapiens (human)
9KRASP08644100.00374Mus musculus (mouse)
10KRASP08646100.00374Canis lupus (dog)
11KRASB3M185100.00374Drosophila melanogaster
12KRASB4HKC7100.00374Caenorhabditis elegans
13KRASQ0514798.30343Arabidopsis thaliana
14KRASP0111499.00397Drosophila melanogaster
15KRASA8XAD079.00288Caenorhabditis elegans
16KRASP0111598.90400Canis lupus
17RAS (mammalian)P0577499.00350Bos taurus (cattle)
18KRASO4227799.00359Xenopus laevis
19KRASP7980099.00359Rattus norvegicus (rat)
20KRASQ0798399.00359Arabidopsis thaliana

Key observations:

  • KRAS has exceptionally high sequence conservation across eukaryotes, with 98-100% identity across vertebrates, arthropods, plants, and fungi
  • Core signaling nodes (RAF1, HRAS, NRAS, MYC, TP53, EGFR, SRC) form the most tightly integrated network with STRING scores 850+
  • Structural embedding similarity (ESM2) shows near-perfect conservation (0.99+) for orthologous RAS proteins, reflecting conserved 3D protein fold
  • The Ras superfamily (KRAS, HRAS, NRAS) maintains >98% sequence identity across the mammalian lineage

Transcription factor regulatory data

KRAS is not a transcription factor. KRAS is a GTPase signaling protein (RAS family) with no DNA-binding capacity and no JASPAR motifs.

Upstream regulators

Total: 60 transcription factors regulate KRAS according to curated transcriptional regulatory interaction (collectRI) data. Evidence type is based on confidence level (High/Low) from experimental studies.

High confidence regulators (16):

  • CEBPB, CTNNBL1, GATA6, LRRFIP1, MAZ, MITF, NFE2L2, NKX3-1, NR0B2, NR3C1, SP1, SP3, TBP, TCF3, TP53, ZHX2

Low confidence regulators (44):

  • AR (Activation), ATF3, DMTF1, DNMT3B, EGR1, ESR1, ETV3, FEZF1 (Unknown regulation), FOXC1, HESX1, HMGA1, HMGA2, HNF4A, HOXA9, HOXD1, HSF1, ID2, MNT, MYB, PAX6, PARP1, PHF20, PITX1, POU1F1, RARA, RUNX1, SMAD6, SMARCC1, SNAI1, SSRP1, STAT6, TBXT, TCF23, TFAP2A, TGIF1, TP73, TTF1, USF2, YBX3, ZGPAT, ZNF91

Key regulated interactions with annotation:

  • MYC → KRAS: Activation (High confidence)
  • ESX1 → KRAS: Repression (High confidence)
  • AR → KRAS: Activation (Low confidence)

Drug & pharmacology data

KRAS is a validated drug target with 100+ targeting molecules in ChEMBL and multiple approved therapies.

Targeting molecules

  • Total: ~100 molecules in ChEMBL/PubChem targeting KRAS (via direct KRAS protein binding or downstream pathway)

  • Phase 4 approved (6+ molecules):

    1. Sotorasib (CHEMBL4535757) – KRAS G12C covalent inhibitor; Lumakras (FDA 2021); approved for KRAS G12C–mutant NSCLC and colorectal cancer
    2. Adagrasib (CHEMBL4594350) – KRAS G12C covalent inhibitor; Krazati (FDA 2023); approved for KRAS G12C–mutant NSCLC
    3. Vemurafenib (CHEMBL1229517) – BRAF V600E inhibitor; Zelboraf (2011); melanoma and BRAF-mutant cancers (cross-pathway relevance for KRAS)
    4. Dabrafenib (CHEMBL2028663) – BRAF inhibitor; Tafinlar (2013); melanoma and BRAF-mutant solid tumors
    5. Lonafarnib (CHEMBL298734) – Farnesyltransferase inhibitor; blocks KRAS lipidation
    6. BMS-214662 (CHEMBL351706) – Phase 1; farnesyltransferase inhibitor
  • Phase 2–3 candidates (15+ molecules): LY-3009120, BAY-293, and 20+ experimental compounds in combination trials

Clinical trials

Top 20 trials by phase/status:

  1. NCT05920356 – Sotorasib + platinum doublet vs. pembrolizumab + platinum | Phase 3 | RECRUITING | NSCLC stage IV
  2. NCT06252649 – Sotorasib + panitumumab + FOLFIRI vs. chemotherapy | Phase 3 | RECRUITING | Metastatic colorectal cancer (KRAS G12C)
  3. NCT05198934 – Sotorasib + panitumumab vs. investigator’s choice | Phase 3 | ACTIVE_NOT_RECRUITING | NSCLC (KRAS G12C)
  4. NCT04685135 – Adagrasib vs. docetaxel | Phase 3 | ACTIVE_NOT_RECRUITING | NSCLC (KRAS G12C)
  5. NCT04793958 – Adagrasib + cetuximab vs. chemotherapy (KRYSTAL-10) | Phase 3 | ACTIVE_NOT_RECRUITING | Colorectal cancer (KRAS G12C)
  6. NCT04613596 – Adagrasib ± pembrolizumab (KRYSTAL-7) | Phase 2/3 | RECRUITING | KRAS G12C–mutant solid tumors
  7. NCT06875310 – Adagrasib + pembrolizumab + chemotherapy vs. placebo (KRYSTAL-4) | Phase 3 | RECRUITING | NSCLC untreated
  8. NCT03600883 – Sotorasib (CodeBreak 100) | Phase 1/2 | ACTIVE_NOT_RECRUITING | Solid tumors with KRAS mutations
  9. NCT03785249 – Adagrasib (KRYSTAL-1) | Phase 1/2 | ACTIVE_NOT_RECRUITING | KRAS G12C–mutant solid tumors
  10. NCT05074810 – Avutometinib + sotorasib ± defactinib | Phase 1/2 | ACTIVE_NOT_RECRUITING | NSCLC (KRAS G12C)
  11. NCT06333678 – Sotorasib vs. durvalumab | Phase 2 | COMPLETED | NSCLC
  12. NCT05673187 – Adagrasib in elderly/poor performance status patients | Phase 2 | ACTIVE_NOT_RECRUITING | NSCLC (KRAS G12C)
  13. NCT06248606 – Adagrasib + stereotactic radiosurgery | Phase 2 | RECRUITING | Metastatic NSCLC with brain metastases (KRAS G12C)
  14. NCT05845450 – Sotorasib + panitumumab (preoperative) (UNICORN) | Phase 2 | RECRUITING | Resectable colorectal cancer
  15. NCT04330664 – Adagrasib + TNO155 (KRYSTAL-2) | Phase 1 | COMPLETED | Advanced solid tumors
  16. NCT05178888 – Adagrasib + palbociclib (KRYSTAL-16) | Phase 1 | COMPLETED | Advanced solid tumors
  17. NCT05722327 – Adagrasib + cetuximab + irinotecan | Phase 1 | ACTIVE_NOT_RECRUITING | Colorectal cancer
  18. NCT06130254 – Adagrasib + olaparib (PARP inhibitor) | Phase 1b | RECRUITING | KRAS G12C–mutant gynecologic/breast/pancreatic cancers
  19. NCT06412198 – Adagrasib + cetuximab + cemiplimab | Phase 1b/2 | RECRUITING | Metastatic colorectal cancer (KRAS G12C)
  20. NCT07012031 – Sotorasib + trastuzumab deruxtecan | Phase 1/2 | RECRUITING | NSCLC (KRAS G12C)

Total trial activity: 48+ trials involving sotorasib; 34+ involving adagrasib; spanning Phase 1 through Phase 3, predominantly in NSCLC and colorectal cancer.

Pharmacogenomics & drug response

KRAS is a VIP (Very Important Pharmacogene) in PharmGKB with documented variant annotations and clinical pharmacogenomic interactions:

Known drug-gene interactions affecting response:

  1. KRAS mutations (esp. G12C, G12V, G12D) – Predictive biomarkers for direct KRAS inhibitor efficacy (sotorasib, adagrasib)

    • G12C: Specific target for sotorasib & adagrasib; ~13–15% of lung cancers, ~50% of pancreatic cancers, ~40% of colorectal cancers
    • Other G12 variants: Emerging target class; multiple Phase 2–3 programs
  2. Fluorouracil (1031 variant annotations) – KRAS mutation status affects chemotherapy response in colorectal/gastric cancers; activating mutations associated with reduced response

  3. Cetuximab (87 variant annotations) – KRAS wild-type predicts EGFR inhibitor response; KRAS mutations confer resistance

  4. Irinotecan (434 variant annotations) – Chemotherapy efficacy influenced by KRAS mutational status

  5. Panitumumab (39 variant annotations) – KRAS wild-type required for anti-EGFR monoclonal antibody response

  6. Vemurafenib, Dabrafenib, Trametinib – Cross-talk effects in BRAF-mutant cancers with co-occurring KRAS mutations

  7. Pembrolizumab, Nivolumab – Emerging data on KRAS mutation type and immunotherapy response; combination strategies under investigation

Dosing guidelines:

  • Sotorasib: 960 mg QD (fixed dose; no dose adjustments documented for KRAS mutation subtype; hepatic impairment may require monitoring)
  • Adagrasib: 600 mg BID (fixed dose; no routine pharmacogenetic-guided dosing)
  • Chemotherapy (fluorouracil, irinotecan): Standard dosing; response stratified by KRAS status (biomarker, not dose-adjusting)

No formal CPIC or clinical guideline for KRAS-guided dosing of non-targeted therapies. Mutation subtype (G12C vs. others) is a categorical biomarker determining drug eligibility, not dose adjustment.

Expression profiles

Tissue Expression (Bgee)

KRAS shows ubiquitous expression across human tissues (expression breadth: ubiquitous, max score: 97.68/100, average: 86.92).

RankTissueExpression ScoreQuality
1Trigeminal ganglion97.68Gold
2Pylorus97.66Gold
3Nipple97.52Gold
4Oral cavity97.50Gold
5Mucosa of sigmoid colon97.42Gold
6Esophagus squamous epithelium97.37Gold
7Pharyngeal mucosa97.14Gold
8Cardia of stomach97.13Gold
9Jejunal mucosa97.12Gold
10Tongue squamous epithelium97.03Gold
11Colonic mucosa96.92Gold
12Cauda epididymis96.75Gold
13Endothelial cell96.54Gold
14Brodmann area 23 (cingulate cortex)96.46Gold
15Dorsal root ganglion96.40Gold
16Superficial temporal artery96.24Gold
17Mammary duct96.20Gold
18Palpebral conjunctiva96.05Gold
19Superior surface of tongue96.03Gold
20Lower lobe of lung95.92Gold
21Olfactory bulb95.91Gold
22Esophageal epithelium95.85Gold
23Corpus epididymis95.78Gold
24Renal medulla95.77Gold
25Cerebellar vermis95.75Gold
26Caput epididymis95.74Gold
27Pons95.69Gold
28Saphenous vein95.62Gold
29Upper leg skin95.60Gold
30Epithelium of mammary gland95.42Gold

Tissue-enriched patterns: High expression in epithelial tissues (oral cavity, gastrointestinal tract, esophageal epithelium), neural tissues (ganglia, brain regions), endothelial cells, and reproductive tissues (epididymis, mammary). Consistent with KRAS proto-oncogene’s role in cell proliferation and differentiation across diverse cell types.

Cell Type Expression

Bgee dataset includes endothelial cells as highest-scoring cell type (96.54), indicating strong vascular expression. Available single-cell datasets for detailed cell-type mapping:

Single-Cell Datasets:

  • E-MTAB-11011: Single Cell Analysis of B cells in COVID-19 (active and recovered disease) — 15,100 cells
  • E-MTAB-6678: Human first trimester fetal-maternal interface using Smart-seq 2 — 7,598 cells

Expression Summary

KRAS demonstrates broadly elevated expression across 298 validated tissue/cell conditions (gold quality), consistent with its roles as:

  • Proto-oncogene in cell proliferation and survival signaling (RAS/MAPK pathway)
  • High expression in rapid-turnover tissues (epithelial linings, neural tissues)
  • Significant presence in vascular endothelium
  • Notable fetal expression (first trimester interface)

Disease associations

Mendelian / Monogenic Diseases

RASopathies (Developmental Syndromes):

DiseaseDisease IDInheritanceEvidence Level
Noonan syndrome 3OMIM 609942 / Orphanet 648 / MONDO:0018997Autosomal dominantDefinitive/Strong
Cardiofaciocutaneous syndrome 2OMIM 615278 / Orphanet 1340 / MONDO:0014112Autosomal dominantDefinitive/Strong
Linear nevus sebaceous syndromeOMIM 163200 / Orphanet 2612 / MONDO:0008097Autosomal dominantStrong

Phenotype Associations

Top HPO phenotypes associated with KRAS mutations (inheritance pattern + clinical features):

  1. HP:0000006 - Autosomal dominant inheritance
  2. HP:0001999 - Abnormal facial shape
  3. HP:0001626 - Abnormality of the cardiovascular system
  4. HP:0001639 - Hypertrophic cardiomyopathy
  5. HP:0000271 - Abnormality of the face
  6. HP:0000256 - Macrocephaly
  7. HP:0000478 - Abnormality of the eye
  8. HP:0000768 - Pectus carinatum
  9. HP:0000767 - Pectus excavatum
  10. HP:0000316 - Hypertelorism
  11. HP:0000465 - Webbed neck
  12. HP:0001629 - Ventricular septal defect
  13. HP:0001631 - Atrial septal defect
  14. HP:0001642 - Pulmonic stenosis
  15. HP:0001249 - Intellectual disability
  16. HP:0001263 - Global developmental delay
  17. HP:0001508 - Failure to thrive
  18. HP:0001510 - Growth delay
  19. HP:0001548 - Overgrowth
  20. HP:0001382 - Joint hypermobility
  21. HP:0000474 - Thickened nuchal skin fold
  22. HP:0000286 - Epicanthus
  23. HP:0000365 - Hearing impairment
  24. HP:0000369 - Low-set ears
  25. HP:0000358 - Posteriorly rotated ears
  26. HP:0000347 - Micrognathia
  27. HP:0000337 - Broad forehead
  28. HP:0000343 - Long philtrum
  29. HP:0000218 - High palate
  30. HP:0001250 - Seizure

Complex Disease / GWAS Associations

Trait/DiseaseVariantEffect Size (p-value)Gene-Variant Association
Liver enzyme levels (gamma-glutamyl transferase)KRAS - RNU4-67P4.0×10⁻²⁰Chr 12
Alanine aminotransferase levelsKRAS - RNU4-67P3.0×10⁻⁸Chr 12
Dysmenorrheic painKRAS - RNU4-67P6.0×10⁻⁷Chr 12
Age-related macular degeneration progressionLMNTD13.0×10⁻⁶Chr 12

Additional Disease Associations (Somatic/Cancer)

KRAS mutations are associated with numerous malignancies via ClinVar and Orphanet:

  • Pancreatic cancer (Orphanet 1333, MONDO:0015278)
  • Lung cancer (MONDO:0008903, 0005138, 0005233, 0005097)
  • Gastric cancer (MONDO:0001056, MONDO:0007648)
  • Breast cancer (MONDO:0016419, 0004988)
  • Prostate cancer (MONDO:0011098, Orphanet 1331)
  • Acute myeloid leukemia (MONDO:0018874)
  • Chronic myeloid leukemia (MONDO:0011996)
  • Juvenile myelomonocytic leukemia (MONDO:0011908)
  • Endometrial carcinoma (MONDO:0002447)
  • Colorectal cancer (MONDO:0005575)

RASopathy category (Orphanet 536391) encompasses the developmental syndrome cluster above.

Structured Data Sources

Generated with Claude Haiku 4.5 + BioBTree MCP, drawing on data BioBTree aggregates from 47 biological databases. Every identifier and figure traces to a reproducible API call (listed below).

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, antibody, bgee, biogrid_interaction, ccds, chembl_molecule, chembl_target, clinical_trials, clinvar, collectri, ctd_gene_interaction, diamond_similarity, ensembl, entrez, esm2_similarity, exon, expressionatlas, gencc, go, gtex, gwas, hgnc, hpa, hpo, intact, interpro, jaspar, mim, mondo, msigdb, orphanet, ortholog, pdb, pfam, pharmgkb, pharmgkb_gene, proteomicsdb, reactome, refseq, scxa, scxa_expression, smart, spliceai, string_interaction, transcript, uniprot
Generated: 2026-05-25 — For the latest data, query BioBTree directly via MCP or API.
View API calls (164)