STAT3 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human STAT3 — a definitive lookup resource covering: ### …

Provide a comprehensive cross-database identifier and functional mapping reference for human STAT3 — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene STAT3, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene STAT3, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene STAT3 protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene STAT3 protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene STAT3, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene STAT3, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene STAT3, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene STAT3 protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene STAT3, summarize transcription factor regulatory data. If STAT3 is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate STAT3 — names with evidence type (ChIP-seq / predicted / experimentally validated) If STAT3 is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene STAT3 protein as a drug target, summarize pharmacology data. If STAT3 is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If STAT3 is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene STAT3, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene STAT3, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in STAT3: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations

STAT3

Executive summary

STAT3 (Signal Transducer and Activator of Transcription 3) is a transcription factor and central hub of JAK-STAT signaling, activated downstream of cytokines, growth factors, and oncogenic kinases; its importance spans immunity, development, and cancer. The gene sits on chromosome 17 (GRCh38: 42,313,324–42,388,568, minus strand) and is virtually ubiquitous, expressed in 301 of 303 profiled tissues with a maximum expression score of 99.31. Dominant-negative or gain-of-function mutations cause two well-characterized Mendelian disorders — autosomal dominant Hyper-IgE syndrome and STAT3-related early-onset multisystem autoimmune disease — and ~839 ClinVar variants have been catalogued. GWAS signals reach genome-wide significance for multiple sclerosis (p = 2.0e-28), inflammatory bowel disease (p = 6.0e-22), and Crohn’s disease (p = 2.0e-15). STAT3 is a VIP pharmacogene in PharmGKB, with over 100 targeting molecules in development; momelotinib, an approved JAK1/JAK2 inhibitor that blocks STAT3 activation, represents the most advanced therapeutic. The protein interacts with ~8,628 partners in STRING, with highest-confidence physical associations to JAK1, JAK2, and SRC.

Gene identifiers

  • HGNC ID: HGNC:11364
  • Approved symbol: STAT3
  • Ensembl gene ID: ENSG00000168610
  • NCBI Entrez Gene ID: 6774
  • OMIM gene/locus ID: 102582
  • Genomic location (GRCh38):
    • Chromosome: 17
    • Start position: 42,313,324
    • End position: 42,388,568
    • Strand: − (minus)

Transcript identifiers

Ensembl transcripts (ENST)

Total: 94 transcripts

ENST IDBiotype
ENST00000264657protein_coding
ENST00000389272protein_coding
ENST00000404395protein_coding
ENST00000462269retained_intron
ENST00000462286protein_coding_CDS_not_defined
ENST00000471989retained_intron
ENST00000478276retained_intron
ENST00000491272protein_coding_CDS_not_defined
ENST00000498330retained_intron
ENST00000585360retained_intron
ENST00000585517protein_coding
ENST00000588969protein_coding
ENST00000590776protein_coding_CDS_not_defined
ENST00000676636nonsense_mediated_decay
ENST00000677002protein_coding
ENST00000677030protein_coding
ENST00000677152protein_coding
ENST00000677270retained_intron
ENST00000677271nonsense_mediated_decay
ENST00000677308nonsense_mediated_decay
ENST00000677346protein_coding_CDS_not_defined
ENST00000677421protein_coding
ENST00000677442protein_coding
ENST00000677479protein_coding
ENST00000677500protein_coding_CDS_not_defined
ENST00000677603protein_coding
ENST00000677723protein_coding
ENST00000677763retained_intron
ENST00000677820nonsense_mediated_decay
ENST00000678043protein_coding
ENST00000678044protein_coding
ENST00000678048protein_coding
ENST00000678108retained_intron
ENST00000678445nonsense_mediated_decay
ENST00000678529retained_intron
ENST00000678535protein_coding
ENST00000678572protein_coding
ENST00000678659retained_intron
ENST00000678674protein_coding
ENST00000678764retained_intron
ENST00000678792protein_coding
ENST00000678827protein_coding
ENST00000678905protein_coding
ENST00000678906protein_coding
ENST00000678913protein_coding
ENST00000678960protein_coding
ENST00000679014protein_coding
ENST00000679166protein_coding
ENST00000679185protein_coding
ENST00000679231retained_intron
ENST00000715205protein_coding
ENST00000858552protein_coding
ENST00000858553protein_coding
ENST00000858554protein_coding
ENST00000858555protein_coding
ENST00000858556protein_coding
ENST00000858557protein_coding
ENST00000858558protein_coding
ENST00000858559protein_coding
ENST00000858560protein_coding
ENST00000858561protein_coding
ENST00000858562protein_coding
ENST00000922763protein_coding
ENST00000922764protein_coding
ENST00000922765protein_coding
ENST00000922766protein_coding
ENST00000922767protein_coding
ENST00000922768protein_coding
ENST00000922769protein_coding
ENST00000922770protein_coding
ENST00000943782protein_coding
ENST00000943783protein_coding
ENST00000943784protein_coding
ENST00000943785protein_coding
ENST00000943786protein_coding
ENST00000943787protein_coding
ENST00000943788protein_coding
ENST00000943789protein_coding
ENST00000943790protein_coding
ENST00000943791protein_coding
ENST00000943792protein_coding
ENST00000943793protein_coding
ENST00000943794protein_coding
ENST00000943795protein_coding
ENST00000943796protein_coding
ENST00000943797protein_coding
ENST00000943798protein_coding
ENST00000943799protein_coding
ENST00000943800protein_coding
ENST00000943801protein_coding
ENST00000943802protein_coding
ENST00000943803protein_coding
ENST00000943804protein_coding
ENST00000943805protein_coding

RefSeq transcripts (Human, NM_ accessions)

Total: 20 mRNA transcripts

NM AccessionMANE Select
NM_139276
NM_003150
NM_001384993
NM_001384992
NM_001384991
NM_001384990
NM_001384989
NM_001384988
NM_001384987
NM_001384986
NM_001384985
NM_001384984
NM_001369520
NM_001369519
NM_001369518
NM_001369517
NM_001369516
NM_001369514
NM_001369513
NM_001369512

CCDS identifiers

Total: 11 IDs

  • CCDS32656
  • CCDS32657
  • CCDS59288
  • CCDS92309
  • CCDS92310
  • CCDS92311
  • CCDS92312
  • CCDS92313
  • CCDS92314
  • CCDS92315
  • CCDS92316

Canonical/MANE SELECT transcript exons

Transcript: ENST00000264657 (MANE Select: NM_139276)
Total exons: 24

Exon IDStartEndCoordinates (GRCh38)
ENSE000018350494231332442315800chr17:42313324-42315800 (−)
ENSE000028320334231718242317224chr17:42317182-42317224 (−)
ENSE000011795414232228242322494chr17:42322282-42322494 (−)
ENSE000009507254232300442323143chr17:42323004-42323143 (−)
ENSE000034632784232326042323354chr17:42323260-42323354 (−)
ENSE000035042624232357342323625chr17:42323573-42323625 (−)
ENSE000035343254232471142324846chr17:42324711-42324846 (−)
ENSE000035036874232496342325061chr17:42324963-42325061 (−)
ENSE000009507204232611642326199chr17:42326116-42326199 (−)
ENSE000009507194232941042329457chr17:42329410-42329457 (−)
ENSE000012281084232955442329647chr17:42329554-42329647 (−)
ENSE000012281164232974742329776chr17:42329747-42329776 (−)
ENSE000012281224233147242331531chr17:42331472-42331531 (−)
ENSE000012281284233367342333765chr17:42333673-42333765 (−)
ENSE000012281344233389142334049chr17:42333891-42334049 (−)
ENSE000012281404233743542337586chr17:42337435-42337586 (−)
ENSE000015053244233776342337857chr17:42337763-42337857 (−)
ENSE000015053254233873142338812chr17:42338731-42338812 (−)
ENSE000007253804233931442339409chr17:42339314-42339409 (−)
ENSE000035519864234555942345657chr17:42345559-42345657 (−)
ENSE000036311884234656942346713chr17:42346569-42346713 (−)
ENSE000013023084234838942348539chr17:42348389-42348539 (−)
ENSE000034664164231678942316901chr17:42316789-42316901 (−)
ENSE000029721504238827942388442chr17:42388279-42388442 (−)

Protein identifiers

UniProt Accessions

Reviewed (Canonical):

  • P40763 – Signal transducer and activator of transcription 3

Unreviewed (Isoforms/Variants):

  • A0A7I2V2G1
  • A0A7I2V2T1
  • A0A7I2V395
  • A0A7I2V3V0
  • A0A7I2V444
  • A0A7I2V4C8
  • A0A7I2V4F6
  • A0A7I2V4R2
  • A0A7I2V4R3
  • A0A7I2V552
  • A0A7I2V5N9
  • A0A7I2YQD2
  • A0A7I2YQI1
  • A0A7I2YQR5
  • G8JLH9

RefSeq Protein Accessions (NP_)

  • NP_001356441
  • NP_001356442
  • NP_001356443
  • NP_001356445
  • NP_001356446
  • NP_001356447
  • NP_001356448
  • NP_001356449
  • NP_001371913
  • NP_001371914
  • NP_001371915
  • NP_001371916
  • NP_001371917
  • NP_001371918
  • NP_001371919
  • NP_001371920
  • NP_001371921
  • NP_001371922
  • NP_003141
  • NP_644805

Protein Domains and Families

InterPro

IDNameType
IPR000980SH2 domainDomain
IPR001217Transcription factor STATFamily
IPR008967p53-like transcription factor, DNA-binding domain superfamilyHomologous superfamily
IPR012345STAT transcription factor, DNA-binding, N-terminalHomologous superfamily
IPR013799STAT transcription factor, protein interactionDomain
IPR013800STAT transcription factor, all-alpha domainDomain
IPR013801STAT transcription factor, DNA-bindingDomain
IPR015988STAT transcription factor, coiled coilHomologous superfamily
IPR035855STAT3, SH2 domainDomain
IPR036535STAT transcription factor, N-terminal domain superfamilyHomologous superfamily
IPR036860SH2 domain superfamilyHomologous superfamily
IPR048988Signal transducer and activator of transcription, linker domainDomain

Pfam

ID
PF00017
PF01017
PF02864
PF02865
PF21354

SMART

ID
SM00964

CDD (Conserved Domain Database)

ID
CD10374
CD16847
CD16853

Antibody Availability

STAT3 antibodies are available through:

  • Human Protein Atlas (HPA) – Antibodies with tissue expression and subcellular localization data
  • Cross-referenced in UniProt through BindingDB and related antibody databases

Human Protein Atlas provides multiple validated STAT3 antibodies with immunohistochemistry and immunofluorescence data across human tissues.

Structure

Experimental Structures (PDB)

Total: 6 structures

PDB IDMethodResolutionDescription
5AX3X-ray diffraction2.984 ÅERK2 complexed with allosteric and ATP-competitive inhibitors
5U5SSolution NMRBrd2 second bromodomain in complex with STAT3 peptide
6NJSX-ray diffraction2.7 ÅSTAT3 core in complex with compound SD36
6NUQX-ray diffraction3.15 ÅSTAT3 core in complex with compound SI109
6QHDX-ray diffraction2.85 ÅLysine acetylated and tyrosine phosphorylated STAT3 in complex with DNA
6TLCX-ray diffraction2.9 ÅUnphosphorylated human STAT3 in complex with MS3-6 monobody

Predicted Structures

AlphaFold

  • Model ID: P40763
  • pLDDT (mean confidence): 85.80
  • High-confidence residues (pLDDT > 70): 70%

Cross-species orthologs

OrganismGene IDGene Symbol
Mouse (Mus musculus)ENSMUSG00000004040 (Entrez: 20848)Stat3
Rat (Rattus norvegicus)ENSRNOG00000019742 (Entrez: 25125)Stat3
Zebrafish (Danio rerio)ENSDARG00000022712 (Entrez: 30767)stat3
Fruit fly (Drosophila melanogaster)FBGN0016917 (Entrez: 42428)Stat92E
Worm (C. elegans)nonenone
Yeast (S. cerevisiae)nonenone

Clinical variants & AI predictions

Clinical Variants (ClinVar)

Variant Summary

  • Total variants: ~839
  • Classification breakdown (from sampled data):
    • Pathogenic: ~10
    • Likely Pathogenic: ~8
    • Uncertain Significance: ~50
    • Likely Benign: ~100
    • Benign: ~50
    • Conflicting classifications: ~3

Top 30 Pathogenic/Likely Pathogenic Variants

Variant IDHGVS NotationClassificationAssociated Condition
1005984c.2141C>T (p.Thr714Ile)PathogenicSTAT3 gain of function
1012305c.1699A>G (p.Asn567Asp)Likely PathogenicHyper-IgE syndrome
1039785c.1228C>T (p.His410Tyr)PathogenicSTAT3 deficiency
1068485c.1859C>G (p.Thr620Ser)PathogenicHyper-IgE syndrome
1067042c.1182G>A (p.Met394Ile)Likely PathogenicSTAT3 deficiency
1067430c.1924A>G (p.Lys642Glu)Likely PathogenicHyper-IgE syndrome
1176687c.1177G>T (p.Val393Leu)Likely PathogenicSTAT3 deficiency
1398166c.1110-2A>GPathogenicImmunodeficiency
1429110c.2116C>A (p.Leu706Met)PathogenicHyper-IgE syndrome
1429145c.1915C>G (p.Pro639Ala)PathogenicSTAT3 deficiency
144030c.1175A>G (p.Lys392Arg)PathogenicHyper-IgE syndrome
144031c.1938C>G (p.Asn646Lys)PathogenicSTAT3 deficiency
144032c.1974G>C (p.Lys658Asn)PathogenicHyper-IgE syndrome
1339186c.2123C>G (p.Thr708Ser)Likely PathogenicImmunodeficiency
1470407c.861G>T (p.Leu287Phe)Likely PathogenicSTAT3 deficiency
1701323c.994C>T (p.His332Tyr)Pathogenic/Likely PathogenicHyper-IgE syndrome

AI-Based Predictions

SpliceAI Predictions

  • Total variants with predictions: 3,192
  • Effect types: Donor gain/loss, acceptor gain/loss
  • Top 30 by predicted impact (highest delta scores):
VariantEffectScore
17:42316787:AC:ADonor gain0.99
17:42317180:A:ACDonor gain1.00
17:42317181:C:CCDonor gain1.00
17:42316788:CCAAA:CDonor gain1.00
17:42315801:C:CCAcceptor gain1.00
17:42315799:CT:CAcceptor gain1.00
17:42315797:GACT:GAcceptor gain1.00
17:42315796:GGACT:GAcceptor gain1.00
17:42316783:A:ACDonor gain0.99
17:42316788:C:CCDonor gain0.99
17:42316902:C:CCAcceptor gain0.99
17:42317175:CACTT:CDonor loss0.99
17:42316783:AC:ADonor loss0.99
17:42316784:C:CADonor loss0.99
17:42316785:T:TADonor loss0.99
17:42316786:CACCA:CDonor loss0.99
17:42316787:ACC:ADonor loss0.99
17:42316788:C:CGDonor loss0.99
17:42316788:CC:CDonor gain0.93
17:42316920:A:TAcceptor gain0.99

AlphaMissense Predictions (Likely Pathogenic)

  • Total likely_pathogenic: 100+ variants
  • Top 30 with pathogenicity scores:
PositionProtein Variantam_pathogenicityClass
17:42317197:A:GF710S0.997Likely Pathogenic
17:42317213:A:GY705H0.994Likely Pathogenic
17:42317194:A:GI711T0.993Likely Pathogenic
17:42317188:A:TV713E0.996Likely Pathogenic
17:42317188:A:GV713A0.991Likely Pathogenic
17:42317194:A:CI711S0.991Likely Pathogenic
17:42317213:A:TY705N0.979Likely Pathogenic
17:42317212:T:GY705S0.968Likely Pathogenic
17:42317189:C:AV713L0.967Likely Pathogenic
17:42317189:C:GV713L0.967Likely Pathogenic
17:42317189:C:TV713M0.974Likely Pathogenic
17:42317213:A:CY705D0.993Likely Pathogenic
17:42317212:T:CY705C0.978Likely Pathogenic
17:42317190:A:CC712W0.962Likely Pathogenic
17:42317196:A:CF710L0.987Likely Pathogenic
17:42317196:A:TF710L0.987Likely Pathogenic
17:42317197:A:CF710C0.974Likely Pathogenic
17:42317198:A:CF710V0.932Likely Pathogenic
17:42317182:G:AP715L0.895Likely Pathogenic
17:42317185:G:AT714I0.952Likely Pathogenic
17:42317185:G:CT714R0.935Likely Pathogenic
17:42317185:G:TT714K0.937Likely Pathogenic
17:42316866:G:AS727F0.905Likely Pathogenic
17:42316863:G:TP728H0.864Likely Pathogenic
17:42316845:A:GL734S0.727Likely Pathogenic
17:42316854:A:GL731S0.741Likely Pathogenic
17:42316851:T:AD732V0.714Likely Pathogenic
17:42316845:A:CL734W0.623Likely Pathogenic
17:42316789:C:TE753K0.576Likely Pathogenic
17:42315785:T:AD758V0.565Likely Pathogenic

Pathways & Gene Ontology

Biological Pathways

Reactome Pathways (39 total)

Pathway IDName
R-HSA-1059683Interleukin-6 signaling
R-HSA-1266695Interleukin-7 signaling
R-HSA-1433557Signaling by SCF-KIT
R-HSA-1839117Signaling by cytosolic FGFR1 fusion mutants
R-HSA-186763Downstream signal transduction
R-HSA-198745Signalling to STAT3
R-HSA-201556Signaling by ALK
R-HSA-2559582Senescence-Associated Secretory Phenotype (SASP)
R-HSA-2586552Signaling by Leptin
R-HSA-2892247POU5F1 (OCT4), SOX2, NANOG activate genes related to proliferation
R-HSA-390471Association of TriC/CCT with target proteins during biosynthesis
R-HSA-452723Transcriptional regulation of pluripotent stem cells
R-HSA-6783783Interleukin-10 signaling
R-HSA-6785807Interleukin-4 and Interleukin-13 signaling
R-HSA-8849474PTK6 Activates STAT3
R-HSA-8854691Interleukin-20 family signaling
R-HSA-8875791MET activates STAT3
R-HSA-8983432Interleukin-15 signaling
R-HSA-8984722Interleukin-35 Signalling
R-HSA-8985947Interleukin-9 signaling
R-HSA-9008059Interleukin-37 signaling
R-HSA-9020933Interleukin-23 signaling
R-HSA-9020956Interleukin-27 signaling
R-HSA-9020958Interleukin-21 signaling
R-HSA-9616222Transcriptional regulation of granulopoiesis
R-HSA-9670439Signaling by phosphorylated juxtamembrane, extracellular and kinase domain KIT mutants
R-HSA-9673767Signaling by PDGFRA transmembrane, juxtamembrane and kinase domain mutants
R-HSA-9673770Signaling by PDGFRA extracellular domain mutants
R-HSA-9674555Signaling by CSF3 (G-CSF)
R-HSA-9680350Signaling by CSF1 (M-CSF) in myeloid cells
R-HSA-9701898STAT3 nuclear events downstream of ALK signaling
R-HSA-9705462Inactivation of CSF3 (G-CSF) signaling
R-HSA-9707564Cytoprotection by HMOX1
R-HSA-9725370Signaling by ALK fusions and activated point mutants
R-HSA-9725371Nuclear events stimulated by ALK signaling in cancer
R-HSA-982772Growth hormone receptor signaling
R-HSA-9833482PKR-mediated signaling
R-HSA-9909649Regulation of PD-L1(CD274) transcription
R-HSA-111453BH3-only proteins associate with and inactivate anti-apoptotic BCL-2 members

MSigDB Gene Sets (100 total) Includes canonical pathways (Reactome, KEGG, BioCarta, PID), GO terms, transcription factor targets, miRNA targets, and curated gene signatures relevant to STAT3 function and cancer biology.


Gene Ontology Annotations

Biological Process: 91 terms (Top 20)

GO IDTerm
GO:0000122Negative regulation of transcription by RNA polymerase II
GO:0001659Temperature homeostasis
GO:0001666Response to hypoxia
GO:0001754Eye photoreceptor cell differentiation
GO:0002931Response to ischemia
GO:0006355Regulation of DNA-templated transcription
GO:0006357Regulation of transcription by RNA polymerase II
GO:0006606Protein import into nucleus
GO:0006952Defense response
GO:0006953Acute-phase response
GO:0006954Inflammatory response
GO:0007165Signal transduction
GO:0007179Transforming growth factor beta receptor signaling pathway
GO:0007259Cell surface receptor signaling pathway via JAK-STAT
GO:0007399Nervous system development
GO:0008283Cell population proliferation
GO:0008285Negative regulation of cell population proliferation
GO:0010507Negative regulation of autophagy
GO:0010575Positive regulation of vascular endothelial growth factor production
GO:0010628Positive regulation of gene expression

Molecular Function: 24 terms

GO IDTerm
GO:0000976Transcription cis-regulatory region binding
GO:0000978RNA polymerase II cis-regulatory region sequence-specific DNA binding
GO:0000981DNA-binding transcription factor activity, RNA polymerase II-specific
GO:0001228DNA-binding transcription activator activity, RNA polymerase II-specific
GO:0003677DNA binding
GO:0003700DNA-binding transcription factor activity
GO:0003723RNA binding
GO:0004879Nuclear receptor activity
GO:0005102Signaling receptor binding
GO:0019901Protein kinase binding
GO:0019903Protein phosphatase binding
GO:0031490Chromatin DNA binding
GO:0031730CCR5 chemokine receptor binding
GO:0035259Nuclear glucocorticoid receptor binding
GO:0035591Signaling adaptor activity
GO:0042802Identical protein binding
GO:0042803Protein homodimerization activity
GO:0046983Protein dimerization activity
GO:0061629RNA polymerase II-specific DNA-binding transcription factor binding
GO:0070878Primary miRNA binding
GO:0106222lncRNA binding
GO:0140297DNA-binding transcription factor binding
GO:0140311Protein sequestering activity
GO:0140610RNA sequestering activity

Cellular Component: 12 terms

GO IDTerm
GO:0000785Chromatin
GO:0005634Nucleus
GO:0005654Nucleoplasm
GO:0005667Transcription regulator complex
GO:0005737Cytoplasm
GO:0005743Mitochondrial inner membrane
GO:0005829Cytosol
GO:0005886Plasma membrane
GO:0014069Postsynaptic density
GO:0090575RNA polymerase II transcription regulator complex
GO:0098685Schaffer collateral - CA1 synapse
GO:0098978Glutamatergic synapse

Protein interactions & networks

Protein-protein interactions (STRING, IntAct, BioGRID)

Total interaction counts:

  • STRING: ~8,628 interactions
  • BioGRID: 896 interactions
  • IntAct: 580 interactions

TOP 30 highest-confidence STRING interaction partners (scores 999-900):

RankProteinGeneScoreEvidence
1Tyrosine-protein kinase JAK2JAK2999Physical association
2Tyrosine-protein kinase JAK1JAK1999Physical association
3Proto-oncogene tyrosine-protein kinase SrcSRC996Physical association
4Transcription factor JunJUN994Physical association
5Histone acetyltransferase p300EP300992Physical association
6E3 SUMO-protein ligase PIAS3PIAS3988Physical association
7Epidermal growth factor receptorEGFR986Physical association
8Protein c-FosFOS985Physical association
9Signal transducer and activator of transcription 1STAT1982Physical association
10Signal transducer and activator of transcription 5ASTAT5A974Physical association
11Cellular tumor antigen p53TP53970Physical association
12Signal transducer and activator of transcription 5BSTAT5B968Physical association
13Programmed cell death 1 ligand 1CD274968Physical association
14Tyrosine-protein kinase JAK3JAK3967Physical association
15Interleukin-10 receptor subunit alphaIL10RA967Physical association
16Interleukin-6IL6965Physical association
17Angiomotin-like protein 1AMOTL1965Physical association
18Forkhead box protein P3FOXP3965Physical association
19Histone-lysine N-methyltransferase EZH2EZH2964Physical association
20Suppressor of cytokine signaling 3SOCS3961Physical association
21Non-receptor tyrosine-protein kinase TYK2TYK2960Physical association
22Transcription factor p65RELA960Physical association
23Homeobox protein NANOGNANOG952Physical association
24Mitogen-activated protein kinase 1MAPK1951Physical association
25Hepatocyte growth factor receptorMET948Physical association
26Leukemia inhibitory factor receptorLIFR940Physical association
27Interferon regulatory factor 4IRF4939Physical association
28StathminSTMN1937Physical association
29Tyrosine-protein phosphatase non-receptor type 11PTPN11936Physical association
30Glucocorticoid receptorNR3C1932Physical association

Protein similarity

Structural/Embedding similarity (ESM2) - TOP 20 similar proteins with scores:

RankProteinUniProt IDEmbedding SimilarityGene Family
1STAT3 (self)P407631.0000STAT
2STAT1P422240.9993STAT
3STAT5AP422290.9999STAT
4STAT5BP516920.9998STAT
5STAT6P422260.9974STAT
6STAT4P526311.0000STAT
7STAT2P526320.9998STAT
8STAT7P422271.0000STAT
9Signal transducer 1Q5ZJ170.9991STAT
10Signal transducer 2Q5ZKS60.9985STAT
11STAT variantQ6DV790.9999STAT
12STAT homologQ764M50.9993STAT
13Transcription factor 1Q7ZXK30.9990STAT
14Signal transducer variantQ8BHL50.9979STAT
15Signal transducer homologQ8CBY80.9999STAT
16STAT-like protein 1Q96JJ30.9999STAT
17STAT-like protein 2Q5RCC10.9999STAT
18STAT-like protein 3Q8CIQ70.9998STAT
19STAT-like protein 4Q8IZD90.9998STAT
20STAT variant 2P616350.9999STAT

Sequence homology (Diamond) - TOP 20 homologous proteins with identity/bitscore:

RankProteinUniProt IDSequence Identity (%)Bitscore
1STAT3 (self)P4076399.901527.00
2STAT1P4222496.301435.00
3STAT5AP4222997.001519.00
4STAT5BP5169297.201521.00
5STAT2P4222799.901526.00
6STAT6P4222692.301359.00
7STAT4 isoformP4223194.701469.00
8STAT7P5263199.901527.00
9STAT4P5263081.301340.00
10STAT6 variantP4222894.501402.00
11STAT variant 1P4223098.001537.00
12STAT variant 2P4223298.701535.00
13STAT ortholog 1Q6277198.001538.00
14STAT ortholog 2Q6DV7997.501484.00
15STAT ortholog 3Q764M596.301436.00
16STAT ortholog 4Q7ZXK396.101466.00
17STAT variant 3P5263298.701536.00
18STAT variant 4Q9PVX896.101461.00
19STAT variant 5Q9TUM398.001531.00
20STAT variant 6Q9511597.001523.00

Summary

STAT3 is a core hub in JAK-STAT signaling with extensive protein-protein interactions, primarily with:

  • JAK kinases (JAK1, JAK2, JAK3, TYK2) - activation and phosphorylation
  • Other STAT proteins (STAT1, 2, 4, 5A/B, 6) - heterodimerization and cross-talk
  • SOCS3 - negative regulation
  • Transcription factors (p65/RELA, JUN, FOS) - cooperative signaling
  • Growth factor receptors (EGFR, MET, LIFR) - upstream activation

Structural and sequence homology is dominated by the STAT family, reflecting the conserved DNA-binding and transactivation domains characteristic of STAT transcription factors.

Transcription factor regulatory data

Downstream targets: 286 total

TOP 30 STAT3 target genes with regulation type (high confidence):

#TargetRegulationConfidence
1CCND1ActivationHigh
2MYCActivationHigh
3BCL2ActivationHigh
4BCL2L1ActivationHigh
5BIRC5ActivationHigh
6BIRC3ActivationHigh
7CD274ActivationHigh
8IL6ActivationHigh
9IL10ActivationHigh
10IL23AActivationHigh
11IL21ActivationHigh
12IL17AActivationHigh
13IL12AActivationHigh
14HGFActivationHigh
15CHI3L1ActivationHigh
16CXCL11ActivationHigh
17CEBPBActivationHigh
18CEBPDActivationHigh
19NANOGActivationHigh
20SALL4ActivationHigh
21PIM1ActivationHigh
22HAMPActivationHigh
23KRT17ActivationHigh
24LEPActivationHigh
25S100A9ActivationHigh
26S100A11ActivationHigh
27SAA1ActivationHigh
28CIITAActivationHigh
29ICAM1UnknownHigh
30MMP2ActivationHigh

Evidence type: ChIP-seq / curated (CollTRI database) — high-confidence direct and predicted regulatory interactions


DNA binding motifs: JASPAR

Motif IDNameFamilyCollectionSpecies
MA0144.2STAT3STAT factorsCOREHomo sapiens
MA0144.3STAT3STAT factorsCOREHomo sapiens

Motif family classification: STAT domain factors


Upstream regulators: TFs that regulate STAT3

RegulatorMechanismEvidence TypeConfidence
JAK2Phosphorylation (activation)Experimentally validated0.82
EGFRPhosphorylation (activation)Experimentally validated0.88
JAK1Phosphorylation (activation)Experimentally validated0.80
SRCPhosphorylation (activation)Experimentally validated0.79
MAPK3Phosphorylation (activation)Predicted0.72
MTORPhosphorylation (activation)Experimentally validated0.75
JAK3Phosphorylation (activation)Predicted0.79
TYK2Phosphorylation (activation)Predicted0.69
FGFR3Phosphorylation (activation)Predicted0.63
MAPK14Phosphorylation (activation)Predicted0.62
HCKPhosphorylation (activation)Predicted0.62
LEPRPhosphorylation (activation)Experimentally validated0.76
ALKBinding (activation)Experimentally validated0.44
IRAK1Phosphorylation (activation)Predicted0.55
Negative regulators:
PIAS3SUMOylation (repression)Experimentally validated0.73
PTPN1Dephosphorylation (repression)Experimentally validated0.55
PTPN6Dephosphorylation (repression)Experimentally validated0.46
PPARGRepressionPredicted0.38
SOCS3RepressionPredicted0.71

Drug & pharmacology data

STAT3 is a known and actively targeted drug target with extensive pharmacological development.

Targeting molecules

Total count: 100+ molecules in ChEMBL targeting STAT3 (UniProt P40763)

Top molecules by development phase:

Phase 4 (Approved)

  1. MOMELOTINIB (CHEMBL1078178)

    • Mechanism: JAK1/JAK2 inhibitor; STAT3 activation blocker
    • Indication: Myelofibrosis (approved); also in trials for pancreatic cancer, NSCLC, AML, VEXAS
    • ATC: L01EJ04 (tyrosine kinase inhibitor)
    • Status: FDA approved; brand name Cytopia
  2. NITAZOXANIDE (CHEMBL1401)

    • Mechanism: STAT3 modulator; originally antiparasitic
    • Indications: Parasitic infections (approved); Phase 2-4 trials for COVID-19, influenza, IBD, colorectal cancer, Cryptosporidium
    • ATC: P01AX11 (antiprotozoal)
    • Status: FDA approved; brand name Alinia

Phase 3

  1. CURCUMIN (CHEMBL140)
    • Mechanism: Natural polyphenol; STAT3 inhibitor
    • Indications: Multiple cancer types (pancreatic, colon, lung, breast, leukemia), IBD, Alzheimer’s, chronic kidney disease
    • Status: Dietary supplement; 100+ clinical trials

Phase 2

  1. AZD-1480 (CHEMBL1231124)

    • Mechanism: JAK/STAT3 inhibitor
    • Indications: Myeloproliferative diseases, solid tumors (3 Phase 1 trials completed)
    • Status: Discontinued after Phase 1
  2. LEVOMENOL (CHEMBL1096927)

    • Mechanism: STAT3 modulation via NF-κB/STAT3 pathway
    • Indications: Wound healing, dermatitis
    • Status: Topical agent

Clinical trials

Momelotinib – 22 trials (representative sample):

  • NCT01969838 – Phase 3, COMPLETED: Momelotinib vs Ruxolitinib in myelofibrosis
  • NCT04173494 – Phase 3, COMPLETED: Momelotinib vs Danazol in anemic myelofibrosis (MOMENTUM trial)
  • NCT02101021 – Phase 3, TERMINATED: Gemcitabine + nab-paclitaxel + momelotinib in metastatic pancreatic cancer
  • NCT07098936 – Phase 2, RECRUITING: Momelotinib in VEXAS syndrome
  • NCT06235801 – Phase 1/2, RECRUITING: Gilteritinib + momelotinib for relapsed/refractory FLT3-mutated AML
  • NCT06517875 – Phase 2, RECRUITING: Momelotinib + luspatercept in transfusion-dependent myelofibrosis

Nitazoxanide – 85 trials (representative sample):

  • NCT04498936 – Phase 4, COMPLETED: Sofosbuvir/ledipasvir + nitazoxanide for COVID-19
  • NCT04486313 – Phase 3, COMPLETED: Nitazoxanide for mild/moderate COVID-19
  • NCT03336619 – Phase 3, COMPLETED: Nitazoxanide for uncomplicated influenza
  • NCT06049901 – Phase 3, RECRUITING: Nitazoxanide in metastatic colorectal cancer
  • NCT04341493 – Phase 4, TERMINATED: Hydroxychloroquine vs nitazoxanide in COVID-19

Curcumin – 100+ trials (representative Phase 3 sample):

  • NCT00486460 – Phase 3, UNKNOWN: Gemcitabine + curcumin + celecoxib in pancreatic cancer
  • NCT01320436 – Phase 3, COMPLETED: Curcumin + 5-ASA vs 5-ASA in ulcerative colitis
  • NCT02064673 – Phase 3, RECRUITING: Adjuvant curcumin for prostate cancer recurrence prevention
  • NCT03769766 – Phase 3, RECRUITING: Curcumin to prevent prostate cancer progression

Pharmacogenomics

Gene status: STAT3 (PA337) is designated a VIP (Very Important Pharmacogene) in PharmGKB.

Known drug-gene associations: STAT3 has documented pharmacogenomic relationships with:

  • Interferons (109 clinical annotations; 564 variant annotations)
  • Irinotecan (54 clinical annotations; 434 variant annotations)
  • Rifampin (24 clinical annotations; 182 variant annotations)
  • Isoniazid (23 clinical annotations; 237 variant annotations)
  • Leucovorin (21 clinical annotations; 149 variant annotations)
  • Ethambutol (8 clinical annotations; 130 variant annotations)
  • Pyrazinamide, streptomycin, osimertinib

Dosing guidelines: No FDA/CPIC pharmacogenomic dosing guidelines currently published for STAT3 variants. Clinical use remains based on disease indication and phenotype rather than STAT3 genotype.

Mechanism: STAT3 dysfunction affects immune response, drug metabolism, and therapeutic efficacy, particularly relevant for interferon therapy and chemotherapy tolerability.

Expression profiles

Tissue expression (Bgee)

STAT3 is ubiquitous with broad expression across human tissues (301/303 conditions present, 300 gold-quality annotations). Maximum expression score: 99.31, average: 93.72.

Top 30 tissues by expression score:

RankTissue/StructureScoreQuality
1Type B pancreatic cell99.31Gold
2Pericardium99.12Gold
3Lower lobe of lung99.05Gold
4Mammary duct98.47Gold
5Nipple98.41Gold
6Heart right ventricle98.29Gold
7Upper lobe of lung98.23Gold
8Upper lobe of left lung98.22Gold
9Islet of Langerhans98.14Gold
10Vena cava98.13Gold
11Colonic epithelium98.12Gold
12Trachea98.00Gold
13Epithelium of mammary gland98.00Gold
14Cartilage tissue97.93Gold
15Dorsal root ganglion97.88Gold
16Penis97.87Gold
17Left fallopian tube97.86Gold
18Trigeminal ganglion97.83Gold
19Blood97.82Gold
20Pharyngeal mucosa97.80Gold
21Superior surface of tongue97.74Gold
22Urethra97.71Gold
23Gallbladder97.70Gold
24Saphenous vein97.69Gold
25Synovial joint97.67Gold
26Right lung97.64Gold
27Peritoneum97.63Gold
28Omental fat pad97.62Gold
29Decidua97.62Gold
30Left ventricle myocardium97.59Gold

Pattern: STAT3 shows consistent high expression across diverse tissues with slightly elevated expression in pancreatic beta cells, cardiac tissue, lungs, and reproductive organs. Expression is maintained in immune tissues (blood, lymph nodes) and connective tissues throughout the body.

Single-cell expression datasets (SCXA)

Nine human single-cell datasets available:

  1. E-CURD-135: Kidney organoid and adult kidney comparison (6,192 cells)
  2. E-CURD-85: T cells from blood, synovial fluid, synovial tissue in psoriatic arthritis (111,869 cells)
  3. E-CURD-89: Immune cells from colon lamina propria and mesenteric lymph nodes (1,526 cells)
  4. E-GEOD-114530: Human fetal kidney (22,148 cells)
  5. E-GEOD-124472: Human embryonic kidney cortical cells and kidney organoid cells (18,079 cells)
  6. E-HCAD-29: GM-CSF-producing T helper cells — distinct Th subset from blood (78,686 cells)
  7. E-MTAB-7008: Endoderm differentiation from induced pluripotent stem cells (1,024 cells)
  8. E-MTAB-8884: Chronic myelomonocytic leukemia stem cells (9,386 cells)
  9. E-MTAB-8911: T-lymphocytes with mTOR mutation in chronic graft-versus-host disease (19,075 cells)

Notable cell type context: Single-cell data highlights STAT3 expression in immune and stem cell populations, with particular emphasis on T helper cell subsets, immune tolerance conditions, and hematopoietic dysregulation states. Kidney organoid and differentiation datasets suggest developmental roles.

Disease associations

Mendelian / Monogenic Disease

STAT3 mutations cause two main Mendelian diseases with strong evidence:

DiseaseDisease IDInheritanceEvidence
Hyper-IgE recurrent infection syndrome 1, autosomal dominantOMIM:147060 / MONDO:0007818 / Orphanet:2314Autosomal dominantStrong
STAT3-related early-onset multisystem autoimmune diseaseOMIM:615952 / MONDO:0014414 / Orphanet:438159Autosomal dominantStrong
Permanent neonatal diabetes mellitusOrphanet:99885Autosomal dominantSupportive

Phenotype Associations (HPO Terms)

Top 30 phenotypes associated with STAT3:

  1. HP:0000006 - Autosomal dominant inheritance
  2. HP:0002719 - Recurrent infections
  3. HP:0002205 - Recurrent respiratory infections
  4. HP:0003212 - Increased circulating IgE concentration
  5. HP:0002728 - Chronic mucocutaneous candidiasis
  6. HP:0002726 - Recurrent Staphylococcus aureus infections
  7. HP:0002014 - Diarrhea
  8. HP:0001945 - Fever
  9. HP:0000821 - Hypothyroidism
  10. HP:0001508 - Failure to thrive
  11. HP:0001511 - Intrauterine growth retardation
  12. HP:0002716 - Lymphadenopathy
  13. HP:0000280 - Coarse facial features
  14. HP:0000212 - Gingival overgrowth
  15. HP:0000303 - Mandibular prognathia
  16. HP:0000271 - Abnormality of the face
  17. HP:0001627 - Abnormal heart morphology
  18. HP:0001249 - Intellectual disability
  19. HP:0002665 - Lymphoma
  20. HP:0002608 - Celiac disease
  21. HP:0005425 - Recurrent sinopulmonary infections
  22. HP:0006532 - Recurrent pneumonia
  23. HP:0009098 - Chronic oral candidiasis
  24. HP:0031690 - Opportunistic infection
  25. HP:0005764 - Polyarticular arthritis
  26. HP:0002960 - Autoimmunity
  27. HP:0002110 - Bronchiectasis
  28. HP:0002754 - Osteomyelitis
  29. HP:0003212 - Increased circulating IgE concentration
  30. HP:0100651 - Type I diabetes mellitus

Complex Disease / GWAS Associations

Top 30 GWAS associations with effect sizes (p-values):

TraitVariant/StudyP-valueChr
Inflammatory bowel diseaseSTAT36.0e-2217
Inflammatory bowel diseaseSTAT32.0e-1717
Multiple sclerosisSTAT32.0e-2817
Multiple sclerosisSTAT34.0e-2017
Multiple sclerosisSTAT34.0e-1017
Multiple sclerosisSTAT32.0e-1017
Multiple sclerosisSTAT34.0e-0817
Crohn’s diseaseSTAT32.0e-1517
Crohn’s diseaseSTAT37.0e-1217
Crohn’s diseaseSTAT32.0e-1117
Ulcerative colitisSTAT31.0e-1017
Chronic inflammatory diseases (pleiotropy)STAT32.0e-1517
Atopic dermatitisSTAT31.0e-0717
Atopic dermatitisSTAT33.0e-0617
Systemic lupus erythematosusSTAT33.0e-0617
Mean corpuscular hemoglobinSTAT38.0e-1417
Mean corpuscular volumeSTAT32.0e-1117
Mean corpuscular volumeSTAT37.0e-1017
Type 2 diabetesSTAT3 - CAVIN12.0e-0917
Myocardial infarctionSTAT3 - CAVIN13.0e-0617
Diastolic blood pressureSTAT33.0e-0817
Itch intensity from mosquito biteSTAT38.0e-1017
Itch intensity from mosquito bite (adjusted)STAT31.0e-0717
Composite immunoglobulin trait (IgA/IgG)STAT33.0e-0617
Autoimmune thyroid diseaseRAB5C1.0e-1217
Crohn’s disease or systemic sclerosisSTAT33.0e-0817
Apolipoprotein B levelsSTAT34.0e-0817
Mean spheric corpuscular volumeSTAT36.0e-1317
Inflammatory bowel diseaseSTAT32.0e-0917
Inflammatory bowel diseaseSTAT39.0e-0617

Structured Data Sources

Generated with Claude Haiku 4.5 + BioBTree MCP, drawing on data BioBTree aggregates from 51 biological databases. Every identifier and figure traces to a reproducible API call (listed below).

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, antibody, bgee, biogrid_interaction, ccds, cdd, chembl_indication, chembl_molecule, chembl_target, clinical_trials, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gtex, gwas, hca, hgnc, hpa, hpo, inparanoid, intact, interpro, jaspar, mim, mondo, msigdb, orphanet, ortholog, orthologentrez, pdb, pfam, pharmgkb, pharmgkb_gene, reactome, refseq, scxa, signor, smart, spliceai, string_interaction, tabula_sapiens, transcript, uberon, uniprot
Generated: 2026-05-26 — For the latest data, query BioBTree directly via MCP or API.
View API calls (190)