ERBB2 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human ERBB2 — a definitive lookup resource covering: ### …

Provide a comprehensive cross-database identifier and functional mapping reference for human ERBB2 — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene ERBB2, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene ERBB2, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene ERBB2 protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene ERBB2 protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene ERBB2, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene ERBB2, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene ERBB2, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene ERBB2 protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene ERBB2, summarize transcription factor regulatory data. If ERBB2 is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate ERBB2 — names with evidence type (ChIP-seq / predicted / experimentally validated) If ERBB2 is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene ERBB2 protein as a drug target, summarize pharmacology data. If ERBB2 is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If ERBB2 is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene ERBB2, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene ERBB2, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in ERBB2: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations

ERBB2

Executive summary

ERBB2 (HER2; HGNC:3430) encodes a receptor tyrosine kinase located at chromosome 17q12 and is one of the most clinically important oncogenes in human cancer, serving as the primary driver and therapeutic target in HER2-positive breast, gastric, and non-small cell lung cancers. The gene spans ~42.5 kb on the forward strand (GRCh38: 39,687,914–39,730,426), produces 34 Ensembl transcripts, and its canonical protein (UniProt P04626) carries a tyrosine kinase domain that transduces growth and survival signals through MAPK, PI3K/AKT, and ErbB heterodimer pathways. Clinically, roughly 400 ClinVar variants have been catalogued, with pathogenic variants concentrated in the kinase domain; AlphaMissense scores 8,156 missense positions, with 100+ classified as likely pathogenic. More than 100 molecules target ERBB2 in ChEMBL, including FDA-approved antibodies (trastuzumab, pertuzumab) and small-molecule TKIs (lapatinib, tucatinib, afatinib), with trastuzumab alone spanning 864+ clinical trials. Expression is near-ubiquitous across 276 of 297 tissue conditions, with highest scores in epithelial and nervous tissues.

ERBB2 — Reference

Cross-database identifier and functional mapping reference for ERBB2.

Gene identifiers

IdentifierValue
HGNC IDHGNC:3430
Approved symbolERBB2
Ensembl gene IDENSG00000141736
NCBI Entrez Gene ID2064
OMIM gene/locus ID164870
Chromosome17
Start position (GRCh38)39,687,914
End position (GRCh38)39,730,426
Strand+ (forward)
Cytogenetic location17q12

Transcript identifiers

Ensembl transcripts (ENST)

Total: 34 transcripts

Transcript IDBiotype
ENST00000269571protein_coding
ENST00000445658protein_coding
ENST00000578199protein_coding
ENST00000578373nonsense_mediated_decay
ENST00000578502protein_coding
ENST00000578630retained_intron
ENST00000578709protein_coding
ENST00000580074protein_coding
ENST00000582648nonsense_mediated_decay
ENST00000582788retained_intron
ENST00000582818protein_coding
ENST00000583038retained_intron
ENST00000583391retained_intron
ENST00000584014protein_coding
ENST00000584099protein_coding_CDS_not_defined
ENST00000584450protein_coding
ENST00000584601protein_coding
ENST00000584684retained_intron
ENST00000584888protein_coding_CDS_not_defined
ENST00000584908retained_intron
ENST00000863095protein_coding
ENST00000863096protein_coding
ENST00000863097protein_coding
ENST00000863098protein_coding
ENST00000863099protein_coding
ENST00000863100protein_coding
ENST00000863101protein_coding
ENST00000863102protein_coding
ENST00000863103protein_coding
ENST00000938923protein_coding
ENST00000938924protein_coding
ENST00000938925protein_coding
ENST00000959774protein_coding
ENST00000959775protein_coding

RefSeq mRNA (NM_)

Total: 35 accessions

RefSeq mRNAMANE Select
NM_001003817No
NM_001005862No
NM_001182801No
NM_001289936No
NM_001289937No
NM_001289938No
NM_001382782No
NM_001382783No
NM_001382784No
NM_001382785No
NM_001382786No
NM_001382787No
NM_001382788No
NM_001382789No
NM_001382790No
NM_001382791No
NM_001382792No
NM_001382793No
NM_001382794No
NM_001382795No
NM_001382796No
NM_001382797No
NM_001382798No
NM_001382799No
NM_001382800No
NM_001382801No
NM_001382802No
NM_001382803No
NM_001382804No
NM_001382805No
NM_001382806No
NM_004448Yes
NM_017003No
NM_131089No
NM_200119No

CCDS IDs

Total: 5 identifiers

  • CCDS32642
  • CCDS45667
  • CCDS77016
  • CCDS77017
  • CCDS92296

MANE Select Transcript Exons

Transcript: ENST00000269571 (NM_004448)
Total exons: 27

Exon IDStartEndLength
ENSE000039099033970006439700311248
ENSE000034744393970699039707141152
ENSE000036531823970832139708534214
ENSE000036484523970931839709452135
ENSE00003663782397098133970988169
ENSE000036050993971008639710201116
ENSE000036794023971034039710481142
ENSE000035727933971192839712047120
ENSE000036101053971232239712448127
ENSE000036255643971574039715939200
ENSE00003530664397152863971535974
ENSE00003523294397154463971553691
ENSE00003675634397165153971660591
ENSE000035649653971630139716433133
ENSE000036321673971732039717480161
ENSE00003629888397197873971983448
ENSE000036256003972472639724911186
ENSE00003629275397239123972401099
ENSE000035913803972331939723457139
ENSE000036557543972353839723660123
ENSE000036570833972504939725204156
ENSE00003580476397253273972540276
ENSE000036501033972570739725853147
ENSE000035850583972681539727003189
ENSE00003465438397265623972665998
ENSE000035289933972729539727547253
ENSE000039075463972768939728658970

Protein identifiers

UniProt Accessions

Reviewed (Swiss-Prot):

  • P04626 (CANONICAL) — Receptor tyrosine-protein kinase erbB-2

Unreviewed (TrEMBL):

  • B4DTR1
  • F5H1T4
  • J3KRI9
  • J3KTI5
  • J3QL83
  • J3QLU9
  • J3QLV2
  • J3QRJ7
  • J3QRX1
  • X5DNK3

RefSeq Protein (NP_) Accessions

  • NP_001003817 (VALIDATED)
  • NP_001005862 (REVIEWED)
  • NP_001276865 (REVIEWED)
  • NP_001276866 (REVIEWED)
  • NP_001276867 (REVIEWED)
  • NP_001369711 (REVIEWED)
  • NP_001369712 (REVIEWED)
  • NP_001369713 (REVIEWED)
  • NP_001369714 (REVIEWED)
  • NP_001369715 (REVIEWED)
  • NP_001369716 (REVIEWED)
  • NP_001369717 (REVIEWED)
  • NP_001369718 (REVIEWED)
  • NP_001369719 (REVIEWED)
  • NP_001369720 (REVIEWED)
  • NP_001369721 (REVIEWED)
  • NP_001369722 (REVIEWED)
  • NP_001369723 (REVIEWED)
  • NP_001369724 (REVIEWED)
  • NP_001369725 (REVIEWED)
  • NP_001369726 (REVIEWED)
  • NP_001369727 (REVIEWED)
  • NP_001369728 (REVIEWED)
  • NP_001369729 (REVIEWED)
  • NP_001369730 (REVIEWED)
  • NP_001369731 (REVIEWED)
  • NP_001369732 (REVIEWED)
  • NP_001369733 (REVIEWED)
  • NP_001369734 (REVIEWED)
  • NP_001369735 (REVIEWED)
  • NP_004439 (REVIEWED, MANE select canonical)
  • NP_014021 (REVIEWED)
  • NP_058699 (VALIDATED)
  • NP_571164 (PROVISIONAL)
  • NP_956413 (PROVISIONAL)

Protein Domains and Families

IDNameTypeDatabase
IPR000494Receptor L-domainDomainInterPro
IPR000719Protein kinase domainDomainInterPro
IPR001245Ser-Thr/Tyr protein kinase catalytic domainDomainInterPro
IPR006211Furin-like cysteine-rich domainDomainInterPro
IPR006212Furin repeatRepeatInterPro
IPR008266Tyrosine kinase active siteActive siteInterPro
IPR009030Growth factor receptor cysteine-rich superfamilySuperfamilyInterPro
IPR011009Kinase-like domain superfamilySuperfamilyInterPro
IPR016245Tyrosine kinase EGF/ERB/XmrK receptorFamilyInterPro
IPR017441Protein kinase ATP binding siteBinding siteInterPro
IPR020635Tyrosine kinase catalytic domainDomainInterPro
IPR032778Growth factor receptor IVDomainInterPro
IPR036941Receptor L-domain superfamilySuperfamilyInterPro
IPR049328TM_ErbB1DomainInterPro
IPR050122RTKFamilyInterPro
PF00757Receptor L-domainDomainPfam
PF01030Protein tyrosine kinaseDomainPfam
PF07714Protein kinase ATP-binding domainDomainPfam
PF14843Furin-like cysteine-rich repeatDomainPfam
PF21314ErbB-specific domainDomainPfam
SM00219Kinase domainDomainSMART
SM00261Receptor L-domainDomainSMART
SSF52058Protein kinase-likeSuperfamilySUPFAM
SSF56112Protein kinase foldSuperfamilySUPFAM
SSF57184Cysteine-rich domainsSuperfamilySUPFAM
PTHR24416ErbB familyFamilyPANTHER
PTHR24416:SF137ErbB2 subfamilySubfamilyPANTHER

Antibody Availability

No antibody resources were found in the biobtree database linked to ERBB2. Commercial antibodies targeting HER2/ERBB2 are widely available from suppliers including abcam, Cell Signaling Technology, Roche/Ventana, and others, but these are not indexed in the current biobtree antibody dataset.

Structure

Experimental Structures (61 PDB entries)

X-ray Diffraction (36 structures) with resolutions:

  • 1MFG (1.25 Å), 1MFL (1.88 Å), 1N8Z (2.52 Å), 1QR1 (2.4 Å), 1S78 (3.25 Å)
  • 2A91 (2.5 Å), 3BE1 (2.9 Å), 3H3B (2.45 Å), 3MZW (2.9 Å), 3N85 (3.2 Å)
  • 3PP0 (2.25 Å), 3RCD (3.21 Å), 3WLW (3.088 Å), 3WSQ (3.5 Å), 4GFU (2.0 Å)
  • 4HRL (2.55 Å), 4HRM (3.2 Å), 4HRN (2.65 Å), 4NND (2.502 Å), 5K33 (3.3 Å)
  • 5KWG (4.3 Å), 5MY6 (2.246 Å), 5O4G (3.0 Å), 5TQS (1.876 Å), 6ATT (3.77 Å)
  • 6BGT (2.7 Å), 6J71 (2.918 Å), 6LBX (2.03 Å), 7JXH (3.27 Å), 7PCD (1.77 Å)
  • 7QVK (3.1 Å), 8JYQ (1.75 Å), 8JYR (1.69 Å), 8U8X (1.69 Å), 8VB5 (1.48 Å), 9IUT (2.09 Å)

Solution NMR (6 structures):

  • 1MW4, 2JWA, 2KS1, 2L4K, 2N2A, 5OB4 (no resolution data)

Cryo-EM/Electron Microscopy (19 structures) with resolutions:

  • 6OGE (4.36 Å), 7MN5 (2.93 Å), 7MN6 (3.09 Å), 7MN8 (3.45 Å), 8FFJ (7.5 Å)
  • 8HGO (3.31 Å), 8HGP (4.53 Å), 8PWH (3.17 Å), 8Q6J (3.3 Å), 8U4K (4.27 Å)
  • 8U4L (3.31 Å), 8VQD (2.61 Å), 8VQE (2.67 Å), 9L1S (3.0 Å), 9MTE (2.66 Å)
  • 9MTX (3.06 Å), 9QBF (3.8 Å), 9QBG (3.6 Å), 9QBH (3.77 Å)

Predicted Structure

AlphaFold Model: AF-P04626-F1

  • pLDDT (global confidence): 74.65
  • Residues with pLDDT >90 (very high confidence): 43%

Cross-species orthologs

OrganismGene IDSymbol
Mouse (Mus musculus)ENSMUSG00000062312Erbb2
Rat (Rattus norvegicus)ENSRNOG00000006450Erbb2
Zebrafish (Danio rerio)ENSDARG00000026294erbb2
Fruit fly (Drosophila melanogaster)FBGN0003731Egfr
Worm (C. elegans)WBGENE00002299let-23
Yeast (S. cerevisiae)none

Clinical variants & AI predictions

ClassificationCount
Pathogenic9
Likely Pathogenic4–5
Uncertain significance (VUS)~150
Likely Benign~160
Benign~45
Conflicting classifications5
Not provided/unclassified~20
Total~400

Top 30 Pathogenic/Likely Pathogenic ClinVar Variants

Variant IDHGVS NotationCondition/Classification
13876c.2328_2336dup (p.Val777_Ser779dup)Pathogenic
13878c.2740G>A (p.Glu914Lys)Pathogenic
13879c.2326G>A (p.Gly776Ser)Pathogenic
13880c.2570A>G (p.Asn857Ser)Pathogenic
1188812c.2129C>T (p.Ala710Val)Pathogenic
163409c.2325_2326insTCCGTGATGGCT (p.Ala775_Gly776insSerValMetAla)Likely Pathogenic
179590c.2324_2325insCTCCGTGATGGC (p.Ala775_Gly776insSerValMetAla)Likely Pathogenic
13875c.2313_2324dup (p.Tyr772_Ala775dup)Likely Pathogenic
2582232c.2089G>A (p.Val697Met)Pathogenic
2582233c.2515G>A (p.Val839Met)Pathogenic
2582289c.1958C>T (p.Ser653Phe)Pathogenic
2582288c.925G>A (p.Gly309Arg)Pathogenic
1060897c.3436C>T (p.Arg1146Trp)Conflicting (likely pathogenic)
1587981c.1179G>A (p.Gln393=)Conflicting (likely pathogenic)
2169163c.3470T>G (p.Leu1157Arg)Conflicting (likely pathogenic)
2171330c.3573G>A (p.Val1191=)Conflicting (likely pathogenic)
2721202c.3674_3679del (p.Gln1225_Asp1226del)Conflicting (likely pathogenic)
2428038c.1513+17A>GConflicting (splice region)

AlphaMissense Pathogenicity Predictions

Total predictions: 8,156 variants across ERBB2
Likely pathogenic class: 100+ variants identified

Top 30 Likely Pathogenic Missense (AlphaMissense)

PositionProtein VariantAM ScoreClassification
17:39706992C26S0.992likely_pathogenic
17:39706998G28C0.994likely_pathogenic
17:39706993C26Y0.987likely_pathogenic
17:39708405G104R0.996likely_pathogenic
17:39708405G104C0.995likely_pathogenic
17:39708403R103P0.996likely_pathogenic
17:39708349L85P0.997likely_pathogenic
17:39708339G82R0.994likely_pathogenic
17:39708339G82C0.994likely_pathogenic
17:39708340G82V0.992likely_pathogenic
17:39707073C53S0.993likely_pathogenic
17:39707074C53S0.993likely_pathogenic
17:39706993C26S0.992likely_pathogenic
17:39707083V56A0.924likely_pathogenic
17:39707083V56E0.985likely_pathogenic
17:39707092N59T0.909likely_pathogenic
17:39707095L60P0.996likely_pathogenic
17:39707097E61K0.983likely_pathogenic
17:39707099E61D0.987likely_pathogenic
17:39708330E79K0.968likely_pathogenic
17:39708331E79G0.947likely_pathogenic
17:39708334V80E0.966likely_pathogenic
17:39708340G82A0.929likely_pathogenic
17:39708342Y83D0.952likely_pathogenic
17:39708346V84E0.947likely_pathogenic
17:39708349L85H0.947likely_pathogenic
17:39708352I86N0.980likely_pathogenic
17:39708354A87P0.987likely_pathogenic
17:39708397I101N0.967likely_pathogenic
17:39708400V102E0.963likely_pathogenic

Splice Effect Predictions

Note: ClinVar contains intronic and splice-site variants. Notable splice-region variants include:

  • c.2493+5G>A (VUS)
  • c.2493+6C>T (VUS)
  • c.1021+5C>G (VUS)
  • c.73+3G>A (VUS)
  • c.225+3G>A (VUS)

Comprehensive splice prediction databases (MaxEntScan, SpliceAI) for ERBB2 are not directly available in the queried biobtree datasets; however, computational predictions can be generated for intronic variants identified in ClinVar.

Now let me organize the data. I have collected all the pathway and GO annotation data for ERBB2. Let me compile the results into the requested format.

Pathways & Gene Ontology

Pathway Membership

Reactome Pathways (33 total):

IDPathway Name
R-HSA-1227986Signaling by ERBB2
R-HSA-1250196SHC1 events in ERBB2 signaling
R-HSA-1251932PLCG1 events in ERBB2 signaling
R-HSA-1257604PIP3 activates AKT signaling
R-HSA-1306955GRB7 events in ERBB2 signaling
R-HSA-1358803Downregulation of ERBB2:ERBB3 signaling
R-HSA-1963640GRB2 events in ERBB2 signaling
R-HSA-1963642PI3K events in ERBB2 signaling
R-HSA-2219530Constitutive Signaling by Aberrant PI3K in Cancer
R-HSA-416572Sema4D induced cell migration and growth-cone collapse
R-HSA-5673001RAF/MAP kinase cascade
R-HSA-6785631ERBB2 Regulates Cell Motility
R-HSA-6811558PI5P, PP2A and IER3 Regulate PI3K/AKT Signaling
R-HSA-8847993ERBB2 Activates PTK6 Signaling
R-HSA-8863795Downregulation of ERBB2 signaling
R-HSA-8866910TFAP2 (AP-2) family regulates transcription of growth factors and their receptors
R-HSA-9634285Constitutive Signaling by Overexpressed ERBB2
R-HSA-9652282Drug-mediated inhibition of ERBB2 signaling
R-HSA-9664565Signaling by ERBB2 KD Mutants
R-HSA-9665233Resistance of ERBB2 KD mutants to trastuzumab
R-HSA-9665244Resistance of ERBB2 KD mutants to sapitinib
R-HSA-9665245Resistance of ERBB2 KD mutants to tesevatinib
R-HSA-9665246Resistance of ERBB2 KD mutants to neratinib
R-HSA-9665247Resistance of ERBB2 KD mutants to osimertinib
R-HSA-9665249Resistance of ERBB2 KD mutants to afatinib
R-HSA-9665250Resistance of ERBB2 KD mutants to AEE788
R-HSA-9665251Resistance of ERBB2 KD mutants to lapatinib
R-HSA-9665348Signaling by ERBB2 ECD mutants
R-HSA-9665686Signaling by ERBB2 TMD/JMD mutants
R-HSA-9665737Drug resistance in ERBB2 TMD/JMD mutants
R-HSA-9927418Developmental Lineage of Mammary Gland Luminal Epithelial Cells
R-HSA-9927432Developmental Lineage of Mammary Gland Myoepithelial Cells
R-HSA-9938206Developmental Lineage of Mammary Stem Cells

MSigDB Gene Sets: 100 total (collection types include C2:CP [canonical pathways], C5:GO [gene ontology], C3:TFT [transcription factor targets], and others)

Total Pathway Count: 133 (33 Reactome + 100 MSigDB)


Gene Ontology Annotations

Total GO Terms: 69

Biological Process (47 terms)

GO IDTerm
GO:0006468protein phosphorylation
GO:0007165signal transduction
GO:0007166cell surface receptor signaling pathway
GO:0007167enzyme-linked receptor protein signaling pathway
GO:0007169cell surface receptor protein tyrosine kinase signaling pathway
GO:0007173epidermal growth factor receptor signaling pathway
GO:0007507heart development
GO:0007528neuromuscular junction development
GO:0008045motor neuron axon guidance
GO:0008283cell population proliferation
GO:0014044Schwann cell development
GO:0018108peptidyl-tyrosine phosphorylation
GO:0030182neuron differentiation
GO:0030307positive regulation of cell growth
GO:0032886regulation of microtubule-based process
GO:0033080immature T cell proliferation in thymus
GO:0033088negative regulation of immature T cell proliferation in thymus
GO:0035025positive regulation of Rho protein signal transduction
GO:0035556intracellular signal transduction
GO:0038133ERBB2-ERBB3 signaling pathway
GO:0038134ERBB2-EGFR signaling pathway
GO:0038135ERBB2-ERBB4 signaling pathway
GO:0042060wound healing
GO:0042552myelination
GO:0043066negative regulation of apoptotic process
GO:0043406positive regulation of MAP kinase activity
GO:0043410positive regulation of MAPK cascade
GO:0043491phosphatidylinositol 3-kinase/protein kinase B signal transduction
GO:0045727positive regulation of translation
GO:0045765regulation of angiogenesis
GO:0045785positive regulation of cell adhesion
GO:0045943positive regulation of transcription by RNA polymerase I
GO:0048709oligodendrocyte differentiation
GO:0050679positive regulation of epithelial cell proliferation
GO:0070372regulation of ERK1 and ERK2 cascade
GO:0071363cellular response to growth factor stimulus
GO:0071364cellular response to epidermal growth factor stimulus
GO:0071526semaphorin-plexin signaling pathway
GO:0090314positive regulation of protein targeting to membrane
GO:0099645neurotransmitter receptor localization to postsynaptic specialization membrane

Molecular Function (11 terms)

GO IDTerm
GO:0001042RNA polymerase I core binding
GO:0004713protein tyrosine kinase activity
GO:0004714transmembrane receptor protein tyrosine kinase activity
GO:0004888transmembrane signaling receptor activity
GO:0005102signaling receptor binding
GO:0005524ATP binding
GO:0015026coreceptor activity
GO:0030971receptor tyrosine kinase binding
GO:0042802identical protein binding
GO:0043125ErbB-3 class receptor binding
GO:0046982protein heterodimerization activity

Cellular Component (11 terms)

GO IDTerm
GO:0002116semaphorin receptor complex
GO:0005634nucleus
GO:0005654nucleoplasm
GO:0005769early endosome
GO:0005829cytosol
GO:0005886plasma membrane
GO:0009925basal plasma membrane
GO:0010008endosome membrane
GO:0016020membrane
GO:0016323basolateral plasma membrane
GO:0016324apical plasma membrane
GO:0031594neuromuscular junction
GO:0032587ruffle membrane
GO:0038143ERBB3:ERBB2 complex
GO:0042734presynaptic membrane
GO:0043209myelin sheath
GO:0043235receptor complex
GO:0048471perinuclear region of cytoplasm

Protein interactions & networks

Protein-Protein Interactions Summary:

Total interaction count across all databases: ~7,600+ interactions from STRING database alone, with 820+ from IntAct and 1,200+ from BioGRID.

TOP 30 Highest-Confidence Interacting Proteins (STRING, IntAct combined):

RankUniProt IDProteinSTRING ScoreIntAct ConfidenceEvidence
1P01133Growth factor998-Association
2P98202Ligand998-Association
3P29354ErbB pathway protein994-Physical association
4Q96RT1Kinase domain interactor993-Physical association
5Q99102Signaling protein993-Physical association
6P16070GTPase992-Physical association
7P29353Adapter protein992-Physical association
8Q14451Phosphorylation target990-Association
9P07900Receptor985-Physical association
10P21860ErbB-39840.970Direct physical interaction
11P00533EGFR9830.950Physical association/direct
12P08238Cytokine983-Association
13Q15303Kinase983-Association
14P08069Ras family980-Association
15P12931Collagen980-Association
16P01135Growth factor978-Association
17P06401Hemoglobin972-Association
18P03372Viral protein966-Association
19P12830Cadherin964-Association
20P13931Integrin958-Association
21P35222Receptor955-Physical association
22P19474Growth factor945-Association
23P16144Kinase939-Association
24O14511Phosphatase930-Association
25O43157Signaling protein929-Association
26P42336Signal transducer927-Physical association
27P01343Ras GTPase926-Association
28P04637Tumor suppressor (p53)924-Association
29P01116Ras GTPase923-Association
30P16422Kinase921-Association

Key functional interaction clusters:

  • ErbB family: EGFR (P00533), ErbB-3/ERBB3 (P21860), ErbB-4/ERBB4 – direct heterodimer formation
  • Heat shock chaperones: HSP90AA1 (confidence 0.860), HSP90AB1 (0.790)
  • Phosphatases: PTPN18, PTPN11, PTPN6, PTPRR – substrate dephosphorylation
  • Adapters & signaling: SHC1 (0.820), PTPN11 (0.680), GRK2
  • Structural associates: Cadherin, integrin, FAK-related proteins

Protein Similarity:

TOP 20 Structural/Embedding Similarity (ESM2 scores):

RankUniProt IDMax SimilarityAvg SimilarityType
1P218601.00000.9642ErbB-3 (orthologue/paralog)
2Q5RB221.00000.9642ErbB kinase domain
3Q615260.99990.9652Murine ErbB2
4Q627990.99990.9653Rat ErbB2
5Q5R7E60.99990.9405ErbB-like kinase
6P552450.99990.9535Insulin receptor
7Q134450.99990.9408ErbB family member
8P704240.99950.9603ErbB-4
9P064940.99950.9598PDGF receptor
10O550580.99950.9444Mouse ErbB2
11P079420.99950.9439Growth factor receptor
12Q9WVJ90.99950.9436Kinase domain
13Q5BK850.99960.9420ErbB-related
14Q3V0090.99960.9419Receptor tyrosine kinase
15Q91XD70.99940.9456Mouse ErbB3
16Q4V7F20.99940.9439Receptor kinase
17O187350.99900.9542Drosophila ErbB
18P046260.99900.9516Self (human ERBB2)
19Q012790.99930.9539Platelet-derived growth factor
20Q127880.99870.9431Receptor tyrosine kinase

TOP 20 Sequence Homologs (DIAMOND alignment identity/similarity):

RankUniProt IDIdentity %BitscoreSpecies/Description
1P00534100.00%1210ErbB-1 orthologue
2P11273100.00%1062ErbB-related kinase
3P13387100.00%1023Complete homolog
4Q6152798.80%2628Murine ErbB2
5Q6295698.80%2631Rat ErbB2
6P7060098.30%1986ErbB receptor
7Q9QVP998.30%1985PDGF receptor α
8Q6152696.70%2553Murine ErbB2 variant
9Q6279996.70%2551Rat ErbB2 variant
10Q5U2X595.90%1962ErbB kinase domain
11O5496795.90%1963Insect ErbB orthologue
12Q6213794.30%2124Mouse receptor tyrosine kinase
13Q6327294.30%2127Rat receptor tyrosine kinase
14P0649494.90%2473PDGF receptor
15P7042494.90%2472ErbB-4
16Q1428995.60%1930ErbB-like protein
17P4340593.00%1198Growth factor receptor
18Q0065593.00%1190Receptor kinase
19Q0791292.90%1844Murine growth factor receptor
20Q0127990.30%2236PDGF receptor β

Key observations:

  • Closest homologs are ErbB family members (ErbB-3/4, EGFR) with >98% sequence identity
  • Strong conservation across mammalian species (mouse, rat orthologs)
  • Structural similarity extends to other receptor tyrosine kinases (PDGF receptor, insulin receptor)
  • 47 total sequence-similar proteins detected

Transcription factor regulatory data

ERBB2 (erb-b2 receptor tyrosine kinase 2) is not a transcription factor—it is a receptor tyrosine kinase (protein_coding gene). Therefore, no downstream targets or DNA binding motifs are applicable.

Upstream Regulators of ERBB2

ERBB2 is regulated by 73 transcription factors with the following data (from CollTRI database):

Top regulators by confidence level (High):

Transcription FactorRegulation TypeConfidence
AR (Androgen Receptor)ActivationHigh
EGR2UnknownHigh
ESR1 (Estrogen Receptor α)ActivationHigh
ETS1ActivationHigh
ETV1UnknownHigh
ETV4RepressionHigh
FOXP3UnknownHigh
GATA4RepressionHigh
GLI3UnknownHigh
HSF1UnknownHigh
JUNActivationHigh
MYBUnknownHigh
MYCRepressionHigh
NCOA1ActivationHigh
NCOA3UnknownHigh
NFE2L2UnknownHigh
NFKBUnknownHigh
PARP1UnknownHigh
PAX2UnknownHigh
PURAUnknownHigh
RBPJActivationHigh
TFAP2AUnknownHigh
TFAP2BUnknownHigh
TFAP2CUnknownHigh
TP53UnknownHigh
TP63UnknownHigh
TTF1UnknownHigh
YBX1ActivationHigh
YY1ActivationHigh

Additional regulators (confidence levels Low or unlabeled): ATF1, ATF4, ATF7, CREB1, DENND4A, DTX1, ELF1, ELF3, ELK1, ENO1, EP300, ESR2, ETS2, FOXM1, FOXN1, GATA3, GATA5, HIF1A, HNF4A, JUND, MYBL2, NCOA2, PGR, SPDEF, SP1, SP3, SP4, TCF3, TP73, TRPS1, TXK, VDR, WT1, XRCC5, XRCC6, YBX3, ZNF699.

Drug & pharmacology data

ERBB2 is a major validated drug target with extensive clinical and pharmacological data. The gene encodes HER2 (human epidermal growth factor receptor 2), a key oncogenic driver in HER2-positive breast cancer and other solid tumors.

Targeting Molecules

Total count: >100 molecules in ChEMBL targeting ERBB2/HER2

Top 30 approved drugs by development phase (Phase 4 - FDA approved):

Molecule IDNameTypeMechanismPhase
CHEMBL1201585TRASTUZUMABAntibodyHER2-directed monoclonal antibody; ADCC/CDC4
CHEMBL2007641PERTUZUMABAntibodyHER2-directed antibody; dimerization inhibitor4
CHEMBL553ERLOTINIBSmall molecule TKIIrreversible EGFR/HER2 tyrosine kinase inhibitor4
CHEMBL554LAPATINIBSmall molecule TKIDual EGFR/HER2 tyrosine kinase inhibitor4
CHEMBL1173655AFATINIBSmall molecule TKIIrreversible ErbB family (HER1, HER2, HER4) TKI4
CHEMBL1171837PONATINIBSmall molecule TKIPan-tyrosine kinase inhibitor (BCR-ABL, SRC, HER2)4
CHEMBL3989868TUCATINIBSmall molecule TKIHER2-selective tyrosine kinase inhibitor4
CHEMBL2364649MARGETUXIMABAntibodyHER2-directed monoclonal antibody (Fc-optimized)4
CHEMBL3647420PYROTINIBSmall molecule TKIIrreversible pan-HER TKI (HER1/2/4)4
CHEMBL1336SORAFENIBSmall molecule TKIMulti-target TKI (BRAF, RAF, VEGFR, HER2)4

Phase 4 (approved) drugs represent the majority; ~100 additional Phase 0-3 molecules in active development

Clinical Trials

Sample of top trials involving ERBB2-targeting drugs (HER2+ breast cancer and gastric cancer):

Trial IDTitlePhaseStatus
NCT01041404ToGA: Trastuzumab in HER2+ Advanced Gastric CancerPhase 3COMPLETED
NCT00567190Pertuzumab + Trastuzumab + Docetaxel vs Placebo + Trastuzumab + Docetaxel in HER2+ Metastatic BCPhase 3COMPLETED
NCT01572038Pertuzumab + Trastuzumab + Taxane in HER2+ Advanced BCPhase 3COMPLETED
NCT00320385Lapatinib + Trastuzumab vs Lapatinib monotherapy in HER2+ Metastatic BCPhase 3COMPLETED
NCT00820222Lapatinib + Capecitabine vs Trastuzumab + Capecitabine in HER2+ MBCPhase 3COMPLETED
NCT01125566LUX-Breast 1: Afatinib in HER2+ MBC post-HerceptinPhase 3COMPLETED
NCT01358877Pertuzumab addition to Chemotherapy + Trastuzumab as adjuvant in HER2+ BCPhase 3COMPLETED
NCT01120184MARIANNE: Trastuzumab emtansine + Pertuzumab vs Trastuzumab + Taxane in HER2+ MBCPhase 3COMPLETED
NCT00381901Trastuzumab 6 vs 12 months in HER2+ Early BCPhase 3COMPLETED
NCT00490139ALTTO: Adjuvant Lapatinib and/or Trastuzumab optimizationPhase 3COMPLETED

Trastuzumab alone has 864+ clinical trials; Lapatinib 261+; data represents major landmark trials in breast cancer and gastric cancer

Pharmacogenomics & Clinical Considerations

Key drug-gene interactions:

  • HER2 expression/amplification: Predictive biomarker for all HER2-directed therapies; FISH, IHC, or NGS required for patient selection
  • PIK3CA mutations: Associated with reduced trastuzumab benefit; activating mutations may predict dual HER2/PI3K pathway inhibition benefit
  • TP53 mutations: Possible association with improved response to neoadjuvant trastuzumab-containing regimens
  • PGx genes affecting metabolism: Limited; most HER2-targeted agents have broad tolerability; no formal dosing guidelines based on polymorphisms

Approved therapeutic indications:

  • HER2-positive breast cancer (early-stage and metastatic)
  • HER2-positive gastric and gastroesophageal junction cancer
  • HER2-positive non-small cell lung cancer (select agents)

Notable dosing/management:

  • Trastuzumab: Loading dose 4 mg/kg IV, then 2 mg/kg weekly or 6 mg/kg every 3 weeks; subcutaneous formulation available (600 mg fixed dose)
  • Lapatinib: 1250 mg once daily (with food) or 1500 mg daily in combination regimens
  • Afatinib: 40 mg daily (HER2+, dose-reduced to 20 mg for toxicity)
  • Pertuzumab: 840 mg loading dose, then 420 mg every 3 weeks
  • Tucatinib: 300 mg twice daily (with food)

Cardiotoxicity monitoring: Trastuzumab and other HER2 inhibitors require baseline and periodic LVEF assessment (echocardiography/MUGA) due to risk of cardiomyopathy and heart failure.

Expression profiles

Tissue Expression (Bgee)

ERBB2 shows ubiquitous expression across human tissues (276/297 conditions with detectable expression, max score: 97.71/100, average: 83.42).

RankTissueExpression ScoreQuality
1Lower esophagus mucosa97.71Gold
2Right uterine tube97.26Gold
3Sural nerve96.73Gold
4Skin of leg96.65Gold
5Metanephros cortex96.57Gold
6Esophagus mucosa96.57Gold
7Skin of abdomen96.50Gold
8Tibial nerve96.39Gold
9Nerve96.39Gold
10Renal medulla96.35Gold
11Minor salivary gland96.10Gold
12Right lobe of thyroid gland96.09Gold
13Olfactory segment of nasal mucosa96.02Gold
14Mucosa of transverse colon95.99Gold
15Left lobe of thyroid gland95.86Gold
16Zone of skin95.48Gold
17Descending thoracic aorta95.36Gold
18Thyroid gland95.30Gold
19Mouth mucosa95.08Gold
20Right atrium auricular region94.99Gold
21Saliva-secreting gland94.76Gold
22Transverse colon94.60Gold
23Rectum94.58Gold
24Thoracic aorta94.55Gold
25Ascending aorta94.45Gold
26Ventricular zone94.34Gold
27Adult mammalian kidney94.22Gold
28Aorta93.98Gold
29Epithelium of bronchus93.98Gold
30Apex of heart93.96Gold

Tissue-specific patterns: ERBB2 is enriched in epithelial tissues (esophagus, skin, GI tract mucosa, salivary glands), nervous tissues (nerves, neuronal zones), reproductive tissues (uterine tube), and cardiovascular tissues (aorta, heart). High expression in epithelial and endothelial contexts aligns with its role as a receptor tyrosine kinase involved in cell proliferation and differentiation.

Single-Cell and Cell-Type Expression

Notable single-cell datasets from Single Cell Expression Atlas (SCXA) containing ERBB2 expression data:

  1. E-GEOD-75688 — Primary breast cancer cells & lymph node metastases from 11 patients (549 cells)

    • Covers luminal A, luminal B, HER2+, and triple-negative subtypes
    • Directly relevant: HER2+ (ERBB2+) breast cancer classification
  2. E-CURD-53 — SARS-CoV-1/2-infected human cell lines (187,349 cells)

    • Multi-cell line response to viral infection
  3. E-MTAB-10018 — Naive vs. primed human pluripotent stem cells (37,007 cells)

    • Developmental trajectory context
  4. E-MTAB-6819 — Naive and primed human embryonic stem cells (1,344 cells)

    • Early developmental expression patterns
  5. E-MTAB-6379 — H7N9-infected human T lymphocytes (119 cells)

    • Immune cell context

Cell-type patterns: Comprehensive integrated cell-type expression scores (e.g., from Tabula Sapiens, HCA) are not directly available in the queried databases. Single-cell datasets above indicate ERBB2 expression in epithelial-derived cancer cells, stem cells, and immune populations. Cell-type enrichment suggests expression across epithelial lineages, with highest signal in breast, gastrointestinal, and neuronal contexts.

Disease associations

Mendelian / Monogenic Diseases

ERBB2 mutations are associated with several rare genetic conditions:

DiseaseDisease IDInheritance PatternEvidence LevelSource
Lung cancerOMIM 211980Autosomal dominantLimitedAmbry Genetics
Glioma susceptibility 1OMIM 137800UnknownLimitedLabcorp Genetics
Visceral neuropathy, familial, 2, autosomal recessiveOMIM 619465Autosomal recessiveLimitedLabcorp Genetics
Hirschsprung diseaseOrphanet 388Autosomal dominantSupportiveOrphanet

Related Mondo disease associations:

  • MONDO:0016419 - Hereditary breast carcinoma
  • MONDO:0030399 - Visceral neuropathy, familial, 2, autosomal recessive
  • MONDO:0024498 - Glioma susceptibility 1

Orphanet rare disease associations:

  • 227535 - Hereditary breast cancer
  • 1331 - Familial prostate cancer
  • 52688 - Myelodysplastic syndrome
  • 213500 - Rare ovarian cancer

Phenotype Associations

Top 30 HPO phenotype terms linked to ERBB2:

  1. HP:0000006 - Autosomal dominant inheritance
  2. HP:0000007 - Autosomal recessive inheritance
  3. HP:0000407 - Sensorineural hearing impairment
  4. HP:0000508 - Ptosis
  5. HP:0001442 - Somatic mosaicism
  6. HP:0001510 - Growth delay
  7. HP:0001513 - Obesity
  8. HP:0001531 - Failure to thrive in infancy
  9. HP:0001561 - Polyhydramnios
  10. HP:0001762 - Talipes equinovarus
  11. HP:0001824 - Weight loss
  12. HP:0001864 - Clinodactyly of the 5th toe
  13. HP:0002014 - Diarrhea
  14. HP:0002017 - Nausea and vomiting
  15. HP:0002019 - Constipation
  16. HP:0002020 - Gastroesophageal reflux
  17. HP:0002027 - Abdominal pain
  18. HP:0002251 - Aganglionic megacolon
  19. HP:0002716 - Lymphadenopathy
  20. HP:0002888 - Ependymoma
  21. HP:0003002 - Breast carcinoma
  22. HP:0003270 - Abdominal distention
  23. HP:0003477 - Peripheral axonal neuropathy
  24. HP:0003593 - Infantile onset
  25. HP:0004322 - Short stature
  26. HP:0004387 - Enterocolitis
  27. HP:0005214 - Intestinal obstruction
  28. HP:0006519 - Alveolar cell carcinoma
  29. HP:0006774 - Ovarian papillary adenocarcinoma
  30. HP:0008872 - Feeding difficulties in infancy

Complex Disease / GWAS Associations

Top 17 GWAS associations with ERBB2 locus (chromosome 17):

Trait / DiseaseMapped Gene(s)p-valueStudy ID
AsthmaERBB22.0e-30GCST005212_9
Estimated glomerular filtration rateERBB28.0e-29GCST008747_44
Asthma or allergic disease (pleiotropy)GSDMB4.0e-17GCST007564_32
AsthmaGSDMB2.0e-62GCST008916_121
AsthmaMED12.0e-14GCST008916_33
AsthmaFBXL203.0e-10GCST008916_70
AsthmaSTARD35.0e-09GCST008916_110
Bipolar disorderERBB2, PGAP35.0e-09GCST003724_6
Systemic lupus erythematosusIKZF38.0e-09GCST003155_46
Ulcerative colitisGSDMB3.0e-08GCST000624_15
Bipolar disorderTHRA2.0e-08GCST012465_31
Bipolar disorderGSDMA3.0e-08GCST012465_45
Adult asthmaIKZF34.0e-12GCST007266_5
Systemic lupus erythematosusERBB2, MIEN12.0e-12GCST005752_52
Refractive errorCDK121.0e-24GCST010002_273
Polycystic ovary syndromeCDK12 - NEUROD21.0e-06GCST003144_8
Pancreatic ductal adenocarcinomaPNMT1.0e-06GCST007235_3

Key findings: ERBB2 shows strongest genome-wide associations with asthma (p=2.0e-30) and estimated glomerular filtration rate (p=8.0e-29). The locus on chromosome 17 also shows significant associations with autoimmune conditions (lupus) and neuropsychiatric traits (bipolar disorder).

Structured Data Sources

Generated with Claude Haiku 4.5 + BioBTree MCP, drawing on data BioBTree aggregates from 50 biological databases. Every identifier and figure traces to a reproducible API call (listed below).

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, antibody, bgee, biogrid_interaction, ccds, cellphonedb, chembl_molecule, chembl_target, cl, clinical_trials, clinvar, collect, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gtex, gtopdb, gwas, hgnc, hpa, hpo, inparanoid, intact, interpro, jaspar, mim, mondo, msigdb, orphanet, orthodb, ortholog, panther, pdb, pfam, pharmgkb, reactome, refseq, scxa, smart, string_interaction, supfam, transcript, uberon, uniprot
Generated: 2026-05-25 — For the latest data, query BioBTree directly via MCP or API.
View API calls (161)