STAT3 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human STAT3. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human STAT3. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 35 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, biogrid_interaction, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, jaspar, mim, orphanet, ortholog, pdb, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-04-01 — For the latest data, query BioBTree directly via MCP or API.
View API calls (45)
STAT3

I now have all the data needed to compile the comprehensive STAT3 reference. Let me format it.


STAT3 - Comprehensive Cross-Database Identifier and Functional Mapping Reference Signal Transducer and Activator of Transcription 3


Section 1: Gene Identifiers Primary Gene Identifiers

DatabaseIdentifierDescription
HGNCHGNC:11364Approved symbol: STAT3
EnsemblENSG00000168610Ensembl gene ID
NCBI Entrez6774Entrez Gene ID
OMIM102582Gene/Locus MIM number
Gene Information
AttributeValue
Approved NameSignal transducer and activator of transcription 3
Previous NameSignal transducer and activator of transcription 3 (acute-phase response factor)
AliasesAPRF, ADMIO, ADMIO1, HIES
Locus TypeProtein-coding gene
Gene GroupSH2 domain containing
StatusApproved
Genomic Location (GRCh38/hg38)
AttributeValue
Chromosome17
Cytogenetic Band17q21.2
Start Position42,313,324
End Position42,388,568
StrandMinus (-)
Genomic AccessionNC_000017.11

Section 2: Transcript Identifiers Summary Statistics

CategoryCount
Total Ensembl Transcripts94
Protein-coding72
Nonsense-mediated decay5
Retained intron11
Other6
RefSeq Transcripts (human)20 (mRNA)
CCDS IDs11
MANE Select / Canonical Transcript
AttributeValue
RefSeq MANE SelectNM_139276 ★
RefSeq ProteinNP_644805 ★
Ensembl CanonicalENST00000264657
Ensembl Transcripts (Protein-Coding - First 50)
Transcript IDBiotypeStartEnd
ENST00000264657protein_coding4231332442388442
ENST00000389272protein_coding4231546842388431
ENST00000404395protein_coding4231555942388414
ENST00000585517protein_coding4231485442388439
ENST00000588969protein_coding4231549142388482
ENST00000677002protein_coding4231484842348516
ENST00000677030protein_coding4231344642388505
ENST00000677152protein_coding4231484842388477
ENST00000677421protein_coding4231342142388433
ENST00000677442protein_coding4231342142388474
ENST00000677479protein_coding4231347142388474
ENST00000677603protein_coding4231484842388477
ENST00000677723protein_coding4231342142388474
ENST00000678043protein_coding4231342142388476
ENST00000678044protein_coding4231341242388540
ENST00000678048protein_coding4231342142388442
ENST00000678535protein_coding4231490342388484
ENST00000678572protein_coding4231347142388442
ENST00000678674protein_coding4231342142388442
ENST00000678792protein_coding4231342142388467
ENST00000678827protein_coding4231347142388540
ENST00000678905protein_coding4231347142388474
ENST00000678906protein_coding4231341242388454
ENST00000678913protein_coding4231342142388442
ENST00000678960protein_coding4231341242388373
ENST00000679014protein_coding4231342142388511
ENST00000679166protein_coding4231342142388472
ENST00000679185protein_coding4231347142388503
ENST00000715205protein_coding4231341242388568
ENST00000858552protein_coding4231484842388503
RefSeq Transcripts (Human - All mRNA)
RefSeq IDSymbolStatusMANE Select
NM_139276STAT3REVIEWED★ YES
NM_003150STAT3REVIEWEDNo
NM_213662STAT3REVIEWEDNo
NM_001369512STAT3REVIEWEDNo
NM_001369513STAT3REVIEWEDNo
NM_001369514STAT3REVIEWEDNo
NM_001369516STAT3REVIEWEDNo
NM_001369517STAT3REVIEWEDNo
NM_001369518STAT3REVIEWEDNo
NM_001369519STAT3REVIEWEDNo
NM_001369520STAT3REVIEWEDNo
NM_001384984STAT3REVIEWEDNo
NM_001384985STAT3REVIEWEDNo
NM_001384986STAT3REVIEWEDNo
NM_001384987STAT3REVIEWEDNo
NM_001384988STAT3REVIEWEDNo
NM_001384989STAT3REVIEWEDNo
NM_001384990STAT3REVIEWEDNo
NM_001384991STAT3REVIEWEDNo
NM_001384992STAT3REVIEWEDNo
CCDS IDs (All 11)
CCDS ID
CCDS32656
CCDS32657
CCDS59288
CCDS92309
CCDS92310
CCDS92311
CCDS92312
CCDS92313
CCDS92314
CCDS92315
CCDS92316
Canonical Transcript Exons (ENST00000264657) Total Exon Count: 24
Exon IDStartEndStrandLength
ENSE000029721504238827942388442-164
ENSE000035519864234656942346713-145
ENSE000036311884234555942345657-99
ENSE000013023084234838942348539-151
ENSE000007253804233931442339409-96
ENSE000015053254233873142338812-82
ENSE000015053244233776342337857-95
ENSE000012281404233743542337586-152
ENSE000012281344233389142334049-159
ENSE000012281284233367342333765-93
ENSE000012281224233147242331531-60
ENSE000012281164232974742329776-30
ENSE000012281084232955442329647-94
ENSE000009507194232941042329457-48
ENSE000009507204232611642326199-84
ENSE000035343254232471142324846-136
ENSE000035036874232496342325061-99
ENSE000035042624232357342323625-53
ENSE000034632784232326042323354-95
ENSE000009507254232300442323143-140
ENSE000011795414232228242322494-213
ENSE000028320334231718242317224-43
ENSE000034664164231678942316901-113
ENSE000018350494231332442315800-2477

Section 3: Protein Identifiers UniProt Accessions

UniProt IDStatusName
P40763★ Reviewed (Swiss-Prot)Signal transducer and activator of transcription 3
G8JLH9Unreviewed (TrEMBL)STAT3 isoform
A0A7I2V2G1UnreviewedSTAT3 variant
A0A7I2V2T1UnreviewedSTAT3 variant
A0A7I2V395UnreviewedSTAT3 variant
A0A7I2V3V0UnreviewedSTAT3 variant
A0A7I2V444UnreviewedSTAT3 variant
A0A7I2V4C8UnreviewedSTAT3 variant
A0A7I2V4F6UnreviewedSTAT3 variant
A0A7I2V4R2UnreviewedSTAT3 variant
A0A7I2V4R3UnreviewedSTAT3 variant
A0A7I2V552UnreviewedSTAT3 variant
A0A7I2V5N9UnreviewedSTAT3 variant
A0A7I2YQD2UnreviewedSTAT3 variant
A0A7I2YQI1UnreviewedSTAT3 variant
A0A7I2YQR5UnreviewedSTAT3 variant
Canonical Protein Properties (P40763)
PropertyValue
Length770 amino acids
Molecular Mass88,068 Da
Alternative NameAcute-phase response factor (APRF)
RefSeq Protein Accessions (Human)
RefSeq ProteinSymbolStatusMANE
NP_644805STAT3REVIEWED★ YES
NP_003141STAT3REVIEWEDNo
NP_998827STAT3REVIEWEDNo
NP_001356441STAT3REVIEWEDNo
NP_001356442STAT3REVIEWEDNo
NP_001356443STAT3REVIEWEDNo
NP_001356445STAT3REVIEWEDNo
NP_001356446STAT3REVIEWEDNo
NP_001356447STAT3REVIEWEDNo
NP_001356448STAT3REVIEWEDNo
NP_001356449STAT3REVIEWEDNo
NP_001371913STAT3REVIEWEDNo
NP_001371914STAT3REVIEWEDNo
NP_001371915STAT3REVIEWEDNo
NP_001371916STAT3REVIEWEDNo
NP_001371917STAT3REVIEWEDNo
NP_001371918STAT3REVIEWEDNo
NP_001371919STAT3REVIEWEDNo
NP_001371920STAT3REVIEWEDNo
NP_001371921STAT3REVIEWEDNo
Protein Domains and Families (InterPro) Total Domain/Family Annotations: 12
InterPro IDNameType
IPR001217STATFamily
IPR000980SH2Domain
IPR035855STAT3_SH2Domain
IPR013799STAT_TF_prot_interactionDomain
IPR013800STAT_TF_alphaDomain
IPR013801STAT_TF_DNA-bdDomain
IPR048988STAT_linkerDomain
IPR008967p53-like_TF_DNA-bd_sfHomologous_superfamily
IPR012345STAT_TF_DNA-bd_NHomologous_superfamily
IPR015988STAT_TF_CCHomologous_superfamily
IPR036535STAT_N_sfHomologous_superfamily
IPR036860SH2_dom_sfHomologous_superfamily

Section 4: Structure Identifiers Experimental Structures (PDB) Total PDB Structures: 6

PDB IDMethodResolution (Å)Title
6NJSX-ray Diffraction2.70Stat3 Core in complex with compound SD36
6QHDX-ray Diffraction2.85Lysine acetylated and tyrosine phosphorylated STAT3 in complex with DNA
6TLCX-ray Diffraction2.90Unphosphorylated human STAT3 in complex with MS3-6 monobody
5AX3X-ray Diffraction2.98Crystal structure of ERK2 complexed with allosteric inhibitors
6NUQX-ray Diffraction3.15Stat3 Core in complex with compound SI109
5U5SSolution NMR-Brd2 second bromodomain in complex with STAT3 peptide
Predicted Structures (AlphaFold)
AlphaFold IDSequence LengthGlobal pLDDTFraction Very High Confidence
P40763618285.800.70 (70%)

Section 5: Cross-Species Orthologs

SpeciesEnsembl Gene IDSymbolRefSeq
Mouse (Mus musculus)ENSMUSG00000004040Stat3NM_011486
Rat (Rattus norvegicus)ENSRNOG00000019742Stat3NM_012747
Zebrafish (Danio rerio)ENSDARG00000022712stat3NM_131479
Worm (C. elegans)WBGENE00010251--
Worm (C. elegans)WBGENE00013111--
Fruit fly (D. melanogaster)No direct ortholog--
Yeast (S. cerevisiae)No ortholog--

Section 6: Clinical Variants & AI Predictions ClinVar Variant Summary Total ClinVar Variants: 839

ClassificationCount
Pathogenic43
Likely Pathogenic~5
Uncertain Significance (VUS)~50
Likely Benign~200
Benign~100
Conflicting~10
Pathogenic Variants (All 43)
ClinVar IDHGVS NotationProtein ChangeReview Status
18304c.1144C>Tp.Arg382TrpMultiple submitters
18305c.1145G>Ap.Arg382GlnMultiple submitters
18306c.1268G>Ap.Arg423GlnMultiple submitters
18308c.1909G>Ap.Val637MetMultiple submitters
18303c.1384GTG[1]p.Val463delMultiple submitters
224846c.454C>Tp.Arg152TrpMultiple submitters
224848c.2147C>Tp.Thr716MetMultiple submitters
636714c.1139+1G>TSplice siteMultiple submitters
418505c.1397A>Gp.Asn466SerMultiple submitters
224843c.1032G>Cp.Gln344HisSingle submitter
224847c.1057G>Tp.Val353PheSingle submitter
224849c.1260T>Gp.Asn420LysSingle submitter
224850c.2107G>Ap.Ala703ThrSingle submitter
476197c.986T>Ap.Met329LysSingle submitter
664121c.1311C>Ap.His437GlnSingle submitter
850251c.833G>Ap.Arg278HisSingle submitter
1005984c.2141C>Tp.Thr714IleSingle submitter
1039785c.1228C>Tp.His410TyrSingle submitter
1068485c.1859C>Gp.Thr620SerSingle submitter
1398166c.1110-2A>GSplice siteSingle submitter
1429110c.2116C>Ap.Leu706MetSingle submitter
1429145c.1915C>Gp.Pro639AlaSingle submitter
1800571c.988C>Tp.Pro330SerSingle submitter
2099076c.1145G>Cp.Arg382ProSingle submitter
2138013c.1915C>Ap.Pro639ThrSingle submitter
2138015c.1907C>Ap.Ser636TyrSingle submitter
2138016c.1865C>Tp.Thr622IleSingle submitter
2169593c.1934T>Ap.Leu645GlnSingle submitter
2308162c.995A>Tp.His332LeuSingle submitter
2925614c.1907C>Tp.Ser636PheSingle submitter
2925616c.1181T>Cp.Met394ThrSingle submitter
2940006c.2137G>Tp.Val713LeuSingle submitter
3391073c.1309C>Tp.His437TyrSingle submitter
3759433c.2137G>Cp.Val713LeuSingle submitter
144030c.1175A>Gp.Lys392ArgNo criteria
144031c.1938C>Gp.Asn646LysNo criteria
144032c.1974G>Cp.Lys658AsnNo criteria
1705748c.1910T>Cp.Val637AlaNo criteria
1804642c.985A>Gp.Met329ValNo criteria
1804643c.1858A>Gp.Thr620AlaNo criteria
18307c.1145G>Tp.Arg382LeuNo criteria
430904c.1853G>Ap.Gly618AspNo criteria
64689c.1166C>Tp.Thr389IleNo criteria
AlphaMissense Predictions Total Predictions: 5,104
ClassificationApproximate Count
Likely Pathogenic~600
Ambiguous~1,200
Likely Benign~3,300
Top 50 AlphaMissense Likely Pathogenic Variants (Highest Scores)
Variant IDProtein ChangePathogenicity Score
17:42317197:A:GF710S0.997
17:42317188:A:TV713E0.996
17:42317194:A:TI711N0.995
17:42317213:A:GY705H0.994
17:42317194:A:GI711T0.993
17:42317213:A:CY705D0.993
17:42317188:A:GV713A0.991
17:42317194:A:CI711S0.991
17:42317188:A:CV713G0.988
17:42317203:G:AT708I0.988
17:42317196:A:CF710L0.987
17:42317209:A:GL706P0.987
17:42322327:A:CY686D0.972
17:42317197:A:CF710C0.974
17:42317189:C:TV713M0.974
17:42317213:A:TY705N0.979
17:42317212:T:CY705C0.978
17:42317212:T:GY705S0.968
17:42317189:C:AV713L0.967
17:42317190:A:CC712W0.962
17:42322326:T:GY686S0.954
17:42317185:G:AT714I0.952
17:42317192:A:GC712R0.943
17:42317185:G:TT714K0.937
17:42317185:G:CT714R0.935
17:42317198:A:CF710V0.932
17:42317191:C:TC712Y0.929
17:42317198:A:TF710I0.907
17:42316866:G:AS727F0.905
17:42317182:G:AP715L0.895
17:42317199:C:AK709N0.882
17:42317209:A:TL706Q0.876
17:42316866:G:TS727Y0.874
17:42317195:T:AI711F0.859
17:42322326:T:CY686C0.855
17:42316863:G:TP728H0.864
17:42316863:G:AP728L0.849
17:42317209:A:CL706R0.846
17:42317200:T:AK709M0.838
17:42316863:G:CP728R0.835
17:42317182:G:CP715R0.815
17:42317193:G:CI711M0.813
17:42317191:C:AC712F0.801
17:42317206:T:AK707M0.795
17:42316864:G:TP728T0.792
17:42316872:G:CP725R0.779
17:42316864:G:AP728S0.771
17:42317201:T:CK709E0.764
17:42317205:C:AK707N0.755
17:42317192:A:CC712G0.745
SpliceAI Predictions Total Splice Effect Predictions: 3,192 Top 50 Predicted Splice-Altering Variants (Delta Score ≥0.8)
Variant IDEffectDelta Score
17:42315796:GGACT:Gacceptor_gain1.00
17:42315797:GACT:Gacceptor_gain1.00
17:42315799:CT:Cacceptor_gain1.00
17:42315801:C:CCacceptor_gain1.00
17:42315801:C:CGacceptor_loss1.00
17:42316788:CCAAA:Cdonor_gain1.00
17:42316780:AATAC:Adonor_loss0.99
17:42316781:ATACT:Adonor_loss0.99
17:42316782:TACT:Tdonor_loss0.99
17:42316783:AC:Adonor_loss0.99
17:42316784:C:CAdonor_loss0.99
17:42316785:T:TAdonor_loss0.99
17:42316786:CACCA:Cdonor_loss0.99
17:42316787:ACC:Adonor_loss0.99
17:42316788:C:CGdonor_loss0.99
17:42315797:GACTC:Gacceptor_gain0.98
17:42315798:ACT:Aacceptor_gain0.98
17:42315799:CTC:Cacceptor_gain0.98
17:42315800:TCT:Tacceptor_gain0.98
17:42315801:C:Gacceptor_gain0.98
17:42315810:C:CTacceptor_gain0.98
17:42315811:A:Tacceptor_gain0.98
17:42316787:A:ACdonor_gain0.99
17:42316788:C:CCdonor_gain0.99
17:42316788:CCAA:Cdonor_gain0.97
17:42316787:AC:Adonor_gain0.93
17:42316788:CC:Cdonor_gain0.93
17:42316799:T:TAdonor_gain0.92
17:42316788:CCA:Cdonor_gain0.90
17:42315814:C:CTacceptor_gain0.87
17:42315815:A:Tacceptor_gain0.83
17:42316820:TTC:Tdonor_gain0.83
17:42315717:T:Cdonor_gain0.81

Section 7: Biological Pathways & Gene Ontology Reactome Pathways Total Pathway Count: 39

Pathway IDPathway NameDisease?
R-HSA-1059683Interleukin-6 signalingNo
R-HSA-6783783Interleukin-10 signalingNo
R-HSA-6785807Interleukin-4 and Interleukin-13 signalingNo
R-HSA-8983432Interleukin-15 signalingNo
R-HSA-8985947Interleukin-9 signalingNo
R-HSA-9020933Interleukin-23 signalingNo
R-HSA-9020956Interleukin-27 signalingNo
R-HSA-9020958Interleukin-21 signalingNo
R-HSA-8854691Interleukin-20 family signalingNo
R-HSA-8984722Interleukin-35 SignallingNo
R-HSA-9008059Interleukin-37 signalingNo
R-HSA-1266695Interleukin-7 signalingNo
R-HSA-198745Signalling to STAT3No
R-HSA-982772Growth hormone receptor signalingNo
R-HSA-2586552Signaling by LeptinNo
R-HSA-1433557Signaling by SCF-KITNo
R-HSA-186763Downstream signal transductionNo
R-HSA-201556Signaling by ALKNo
R-HSA-8849474PTK6 Activates STAT3No
R-HSA-8875791MET activates STAT3No
R-HSA-9674555Signaling by CSF3 (G-CSF)No
R-HSA-9680350Signaling by CSF1 (M-CSF) in myeloid cellsNo
R-HSA-111453BH3-only proteins associate with anti-apoptotic BCL-2No
R-HSA-2559582Senescence-Associated Secretory Phenotype (SASP)No
R-HSA-390471Association of TriC/CCT with target proteinsNo
R-HSA-452723Transcriptional regulation of pluripotent stem cellsNo
R-HSA-2892247POU5F1, SOX2, NANOG activate proliferation genesNo
R-HSA-9616222Transcriptional regulation of granulopoiesisNo
R-HSA-9705462Inactivation of CSF3 (G-CSF) signalingNo
R-HSA-9707564Cytoprotection by HMOX1No
R-HSA-9833482PKR-mediated signalingNo
R-HSA-9909649Regulation of PD-L1(CD274) transcriptionNo
R-HSA-9701898STAT3 nuclear events downstream of ALK signalingNo
R-HSA-1839117Signaling by cytosolic FGFR1 fusion mutantsYes
R-HSA-9670439Signaling by phosphorylated KIT mutantsYes
R-HSA-9673767Signaling by PDGFRA transmembrane mutantsYes
R-HSA-9673770Signaling by PDGFRA extracellular domain mutantsYes
R-HSA-9725370Signaling by ALK fusions and activated point mutantsYes
R-HSA-9725371Nuclear events stimulated by ALK signaling in cancerYes
Gene Ontology Annotations Biological Process (Total: 89)
GO IDTerm
GO:0007259cell surface receptor signaling pathway via JAK-STAT
GO:0070102interleukin-6-mediated signaling pathway
GO:0019221cytokine-mediated signaling pathway
GO:0045944positive regulation of transcription by RNA polymerase II
GO:0006357regulation of transcription by RNA polymerase II
GO:0006954inflammatory response
GO:0006953acute-phase response
GO:0008283cell population proliferation
GO:0030154cell differentiation
GO:0045893positive regulation of DNA-templated transcription
GO:0010628positive regulation of gene expression
GO:0007165signal transduction
GO:0048708astrocyte differentiation
GO:0072538T-helper 17 type immune response
GO:0045766positive regulation of angiogenesis
GO:0060396growth hormone receptor signaling pathway
GO:0033210leptin-mediated signaling pathway
GO:0035019somatic stem cell population maintenance
GO:0001666response to hypoxia
GO:0042593glucose homeostasis
Molecular Function (Total: 24)
GO IDTerm
GO:0003700DNA-binding transcription factor activity
GO:0001228DNA-binding transcription activator activity, RNA polymerase II-specific
GO:0000981DNA-binding transcription factor activity, RNA polymerase II-specific
GO:0003677DNA binding
GO:0042803protein homodimerization activity
GO:0046983protein dimerization activity
GO:0042802identical protein binding
GO:0019901protein kinase binding
GO:0019903protein phosphatase binding
GO:0005102signaling receptor binding
GO:0003723RNA binding
GO:0070878primary miRNA binding
GO:0106222lncRNA binding
GO:0031490chromatin DNA binding
GO:0035591signaling adaptor activity
GO:0035259nuclear glucocorticoid receptor binding
GO:0031730CCR5 chemokine receptor binding
GO:0061629RNA polymerase II-specific DNA-binding transcription factor binding
GO:0140297DNA-binding transcription factor binding
GO:0000976transcription cis-regulatory region binding
Cellular Component (Total: 12)
GO IDTerm
GO:0005634nucleus
GO:0005654nucleoplasm
GO:0005737cytoplasm
GO:0005829cytosol
GO:0005886plasma membrane
GO:0000785chromatin
GO:0005667transcription regulator complex
GO:0005743mitochondrial inner membrane
GO:0014069postsynaptic density
GO:0090575RNA polymerase II transcription regulator complex
GO:0098685Schaffer collateral - CA1 synapse
GO:0098978glutamatergic synapse

Section 8: Protein Interactions & Molecular Networks Interaction Database Summary

DatabaseTotal Interactions
STRING8,628
BioGRID896
IntAct580
SIGNOR166
Top 50 STRING Interactions (Highest Confidence)
Interacting ProteinUniProtScore
JAK1O60674999
JAK2P23458999
SRCP12931996
JUNP05412994
EP300Q09472992
PIK3CAQ9Y6X2988
EGFRP00533986
FOSP01100985
STAT1P42224982
STAT5BP42229974
TP53P04637970
STAT2P51692968
CUEDC2Q9NZQ7968
JAK3P52333967
IL10Q13651967
IL6P05231965
SOCS3Q8IY63965
PIAS3Q9BZS1965
EZH2Q15910964
SOCS2O14543961
TYK2P29597960
RELAQ04206960
NANOGQ9H9S0952
MAPK1P28482951
METP08581948
LIFP42702940
IRF9Q15306939
STMN1P16949937
PTPN11Q06124936
NR3C1P04150932
AKT1P31749926
HSP90AA1P07900923
HIF1AQ16665923
CEBPBP17676920
IL10RAP22301916
DNMT1P26358916
CDKN1AQ9P0J0910
EGFP01133908
IFNGP01579905
IL6STP08887903
NFKB1P19838903
HDAC1Q13547903
ERBB2P21860902
IL6RP40189900
CCND1P24385899
SUMO1Q9UGK3899
FOXM1Q08050892
ANXA2Q7Z403891
CTNNB1P35222887
IL12AQ16552886
Protein Similarity (ESM2 Structural Embedding) Total Similar Proteins: 43
UniProtSimilarityDescription
P422271.000STAT3 (identical)
P526311.000STAT3 ortholog
Q19S501.000STAT3 ortholog
P616350.999STAT6
A4FUD60.999STAT3 ortholog
P407631.000STAT3 (self)
P422240.999STAT1
P422290.999STAT5B
P422300.999STAT5A
Q6DV790.999STAT3 ortholog
Q7ZXK30.999STAT3 ortholog
Q627710.999STAT3 (mouse)
Protein Sequence Similarity (DIAMOND) Total Homologous Proteins: 27
UniProtIdentity (%)Bitscore
P4076399.91527
P4222799.91526
P5263199.91527
Q6277198.01538
P4223098.01537
Q9TUM398.01531
Q9TUZ098.01531
P5263298.71536
P4223298.71535
P5169297.21521
Q9TUZ197.01528
P4222997.01519
Q9511597.01523
Q6DV7997.51484
P4222496.31435
Q764M596.31436
Q7ZXK396.11466
Q9PVX896.11461

Section 9: Transcription Factor Regulatory Data STAT3 is a transcription factor JASPAR Binding Motifs

Motif IDNameCollectionClassFamily
MA0144.2STAT3CORESTAT domain factorsSTAT factors
MA0144.3STAT3CORESTAT domain factorsSTAT factors
Downstream Targets (Genes Regulated BY STAT3) Total Target Genes (CollecTRI): 486+ Top 50 Target Genes with Regulation Type
Target GeneRegulationConfidence
BCL2ActivationHigh
BCL2L1ActivationHigh
BIRC5 (Survivin)ActivationHigh
CCND1 (Cyclin D1)ActivationHigh
MYCActivationHigh
VEGFAActivationHigh
CD274 (PD-L1)ActivationHigh
SOCS3ActivationHigh
IL10ActivationHigh
A2MUnknownHigh
ABCB1UnknownHigh
ADIPOQUnknownHigh
AGTUnknownHigh
AGRPUnknownHigh
BATFActivationHigh
BCL3ActivationHigh
BECN1UnknownHigh
BIRC3ActivationHigh
BMPR2UnknownHigh
BST2UnknownHigh
CAV1UnknownHigh
CCL2ActivationHigh
CCL11ActivationHigh
CD46ActivationHigh
CD74UnknownHigh
CDC25AUnknownHigh
CDKN1A (p21)ActivationHigh
CDKN1B (p27)UnknownHigh
HGFActivationMedium
TGFB1ActivationMedium
IL6ActivationMedium
RORCActivationMedium
CEBPDActivationMedium
S100A9ActivationMedium
CASP3ActivationLow
AHRActivationLow
AKAP12ActivationLow
AKT1UnknownLow
APLNActivationLow
ARID5AActivationLow
BACE1UnknownLow
BADActivationLow
BCL6ActivationLow
CCDC88AActivationLow
CCL20ActivationLow
CCNE1ActivationLow
CDH17UnknownLow
CDX2UnknownLow
CDKN2DActivationLow
FOXP3RepressionMedium
Upstream Regulators (TFs That Regulate STAT3)
RegulatorRegulationConfidence
CTNNB1 (β-catenin)ActivationHigh
HMGA1ActivationHigh
HIC1Activation-
JAK1Activation-
PLK1Activation-
POLR2HActivation-
KAT5Repression-
PPARGRepression-
SIN3ARepression-
ATF4UnknownLow
BCL6UnknownLow
CEBPAUnknown-
CEBPBUnknownLow
HIF1AUnknownLow
MYCUnknownLow
PIAS3Unknown-
RELAUnknown-
STAT1Unknown-
SPI1Unknown-
BRCA1Unknown-
SIGNOR Regulatory Network (Top 50 Interactions)
SourceTargetEffectMechanismScore
EGFRSTAT3up-regulatesphosphorylation0.88
JAK2STAT3up-regulatesphosphorylation0.82
JAK1STAT3up-regulatesphosphorylation0.80
SRCSTAT3up-regulatesphosphorylation0.79
JAK3STAT3up-regulatesphosphorylation0.79
LEPRSTAT3up-regulatesphosphorylation0.76
IL6STAT3up-regulates-0.76
MTORSTAT3up-regulatesphosphorylation0.75
PIAS3STAT3down-regulatessumoylation0.73
MAPK3STAT3up-regulatesphosphorylation0.72
SOCS3STAT3down-regulates-0.71
TYK2STAT3up-regulatesphosphorylation0.69
FGFR3STAT3up-regulatesphosphorylation0.63
HCKSTAT3up-regulatesphosphorylation0.62
MAPK14STAT3up-regulatesphosphorylation0.62
PRKCDSTAT3up-regulatesphosphorylation0.60
MAPK8STAT3up-regulatesphosphorylation0.57
IRAK1STAT3up-regulatesphosphorylation0.55
PTPN1STAT3down-regulatesdephosphorylation0.55
PTPRDSTAT3down-regulatesdephosphorylation0.52
MAPK9STAT3up-regulatesphosphorylation0.48
PTPN6STAT3down-regulatesdephosphorylation0.46
PKMSTAT3up-regulatesphosphorylation0.44
ALKSTAT3up-regulatesbinding0.44
CDK5STAT3up-regulatesphosphorylation0.40
FGFR4STAT3up-regulatesphosphorylation0.40
PRKCESTAT3up-regulatesphosphorylation0.40
RPS6KA5STAT3up-regulatesphosphorylation0.39
NLKSTAT3up-regulatesphosphorylation0.35
DAB2IPSTAT3down-regulatesbinding0.35

Section 10: Drug & Pharmacology Data ChEMBL Target Information

ChEMBL Target IDNameType
CHEMBL4026Signal transducer and activator of transcription 3SINGLE PROTEIN
CHEMBL4296101PTPN9/STAT3PROTEIN-PROTEIN INTERACTION
CHEMBL4523691Protein cereblon/STAT3PROTEIN-PROTEIN INTERACTION
CHEMBL5482983JAK2-STAT3PROTEIN COMPLEX
PharmGKB Information
AttributeValue
PharmGKB IDPA337
VIP Gene★ Yes (Very Important Pharmacogene)
Has Variant AnnotationYes
Has CPIC GuidelineNo
Top 30 Targeting Molecules by Development Phase
ChEMBL IDNameTypeHighest Phase
CHEMBL1078178MOMELOTINIBSmall moleculePhase 4 (Approved)
CHEMBL1401NITAZOXANIDESmall moleculePhase 4 (Approved)
CHEMBL140CURCUMINSmall moleculePhase 3
CHEMBL1096927LEVOMENOLSmall moleculePhase 2
CHEMBL1231124AZD-1480Small moleculePhase 2
CHEMBL1081584LOGANINSmall moleculePhase 0
CHEMBL1209803MORRONISIDESmall moleculePhase 0
CHEMBL128729-Small moleculePhase 0
CHEMBL1289974-Small moleculePhase 0
CHEMBL1299373-Small moleculePhase 0
Known STAT3 Inhibitors (SIGNOR)
CompoundEffectMechanism
S3I-201down-regulateschemical inhibition
5,15-Diphenyl-21H,23H-porphinedown-regulates activitychemical inhibition

Section 11: Expression Profiles Bgee Expression Summary

AttributeValue
Expression BreadthUbiquitous
Total Present Calls301
Total Absent Calls2
Max Expression Score99.31
Average Expression Score93.72
Gold Quality Conditions300
Top 30 Tissues by Expression Score
Tissue/Cell TypeExpressionScoreQuality
Type B pancreatic cellpresent99.31gold
Pericardiumpresent99.12gold
Lower lobe of lungpresent99.05gold
Mammary ductpresent98.47gold
Nipplepresent98.41gold
Heart right ventriclepresent98.29gold
Upper lobe of lungpresent98.23gold
Upper lobe of left lungpresent98.22gold
Islet of Langerhanspresent98.14gold
Vena cavapresent98.13gold
Colonic epitheliumpresent98.12gold
Tracheapresent98.00gold
Epithelium of mammary glandpresent98.00gold
Cartilage tissuepresent97.93gold
Dorsal root ganglionpresent97.88gold
Penispresent97.87gold
Left uterine tubepresent97.86gold
Trigeminal ganglionpresent97.83gold
Bloodpresent97.82gold
Pharyngeal mucosapresent97.80gold
Superior surface of tonguepresent97.74gold
Urethrapresent97.71gold
Gall bladderpresent97.70gold
Saphenous veinpresent97.69gold
Synovial jointpresent97.67gold
Right lungpresent97.64gold
Peritoneumpresent97.63gold
Omental fat padpresent97.62gold
Deciduapresent97.62gold
Left ventricle myocardiumpresent97.59gold
Single-Cell Expression Data (SCXA) Total Single-Cell Datasets: 9
Experiment IDDescriptionCells
E-CURD-85T cells from blood, synovial fluid and tissue in psoriatic arthritis111,869
E-HCAD-29GM-CSF-producing T helper cells78,686
E-GEOD-124472Human embryonic kidney outer/inner cortical and organoid cells18,079
E-GEOD-114530Human fetal kidneys22,148
E-MTAB-8911Clonally expanded T-lymphocytes in chronic GVHD19,075
E-MTAB-8884Chronic myelomonocytic leukemia stem cells9,386
E-CURD-135Kidney organoid and adult human kidney transcriptomes6,192
E-CURD-89Immune cells from colon lamina propria and mesenteric lymph nodes1,526
E-MTAB-7008Endoderm differentiation from iPSCs1,024

Section 12: Disease Associations Mendelian Disease Links (GenCC/Orphanet)

DiseaseDisease IDInheritanceEvidence
Hyper-IgE recurrent infection syndrome 1, ADOMIM:147060Autosomal dominantStrong
STAT3-related early-onset multisystem autoimmune diseaseOMIM:615952Autosomal dominantStrong
Permanent neonatal diabetes mellitusORPHANET:99885Autosomal dominantSupportive
Acute promyelocytic leukemiaORPHANET:520--
T-cell large granular lymphocyte leukemiaORPHANET:86872--
Chronic lymphoproliferative disorder of NK cellsORPHANET:512017--
Breast implant-associated anaplastic large cell lymphomaORPHANET:667662--
HPO Phenotype Associations Total Associated Phenotypes: 177 Top 50 Phenotypes
HPO IDPhenotype
HP:0000006Autosomal dominant inheritance
HP:0002719Recurrent infections
HP:0002726Recurrent Staphylococcus aureus infections
HP:0002728Chronic mucocutaneous candidiasis
HP:0001880Increased total eosinophil count
HP:0000964Eczematoid dermatitis
HP:0000988Skin rash
HP:0001508Failure to thrive
HP:0000280Coarse facial features
HP:0002757Recurrent fractures
HP:0000939Osteoporosis
HP:0002110Bronchiectasis
HP:0002205Recurrent respiratory infections
HP:0001873Thrombocytopenia
HP:0001890Autoimmune hemolytic anemia
HP:0001904Autoimmune neutropenia
HP:0001973Autoimmune thrombocytopenia
HP:0002014Diarrhea
HP:0001263Global developmental delay
HP:0001945Fever
HP:0002716Lymphadenopathy
HP:0001433Hepatosplenomegaly
HP:0000857Neonatal insulin-dependent diabetes mellitus
HP:0000821Hypothyroidism
HP:0001627Abnormal heart morphology
HP:0001738Exocrine pancreatic insufficiency
HP:0002608Celiac disease
HP:0002665Lymphoma
HP:0001903Anemia
HP:0002754Osteomyelitis
GWAS Associations Total GWAS Associations: 34
StudyTraitp-value
GCST009597Multiple sclerosis2.0e-28
GCST001725Inflammatory bowel disease6.0e-22
GCST005531Multiple sclerosis4.0e-20
GCST004131Inflammatory bowel disease2.0e-17
GCST001785Crohn's disease2.0e-15
GCST005537Chronic inflammatory diseases (pleiotropy)2.0e-15
GCST90002397Mean spheric corpuscular volume6.0e-13
GCST010571Autoimmune thyroid disease1.0e-12
GCST000207Crohn's disease7.0e-12
GCST004132Crohn's disease2.0e-11
GCST90002392Mean corpuscular volume2.0e-11
GCST004133Ulcerative colitis1.0e-10
GCST001198Multiple sclerosis2.0e-10
GCST000593Multiple sclerosis3.0e-10
GCST004861Itch intensity from mosquito bite8.0e-10
GCST004602Mean corpuscular volume7.0e-10
GCST006867Type 2 diabetes2.0e-09
GCST003602Inflammatory bowel disease2.0e-09
GCST011991Psoriasis or type 2 diabetes2.0e-09
GCST005527Psoriasis5.0e-09
GCST001341Multiple sclerosis4.0e-08
GCST000879Crohn's disease3.0e-08
GCST010124Crohn's disease or systemic sclerosis3.0e-08
GCST007268Diastolic blood pressure3.0e-08
GCST010243Apolipoprotein B levels4.0e-08
GCST003184Atopic dermatitis1.0e-07
GCST004862Itch intensity (adjusted)1.0e-07

SUMMARY STATISTICS

CategoryCount
Ensembl Transcripts94
RefSeq Transcripts (human)20
CCDS IDs11
UniProt Entries16
InterPro Domains12
PDB Structures6
ClinVar Variants839
Pathogenic Variants43
AlphaMissense Predictions5,104
SpliceAI Predictions3,192
Reactome Pathways39
GO Biological Process89
GO Molecular Function24
GO Cellular Component12
STRING Interactions8,628
IntAct Interactions580
BioGRID Interactions896
TF Targets (CollecTRI)486+
JASPAR Motifs2
GWAS Associations34
HPO Phenotypes177
Orphanet Diseases7

Reference generated using BioBTree database queries. Data current as of April 2026. I’ve compiled a comprehensive cross-database identifier and functional mapping reference for human STAT3 (Signal Transducer and Activator of Transcription 3). Key highlights from this reference: Gene Identity:

  • HGNC:11364 | Ensembl: ENSG00000168610 | Entrez: 6774 | OMIM: 102582

  • Location: chr17:42,313,324-42,388,568 (minus strand) Transcripts & Protein:

  • 94 Ensembl transcripts (72 protein-coding)

  • MANE Select: NM_139276 / NP_644805

  • Canonical protein P40763: 770 amino acids, 88 kDa Clinical Significance:

  • 839 ClinVar variants (43 pathogenic)

  • Causes Hyper-IgE syndrome (Job’s syndrome) and STAT3-related autoimmune disease

  • PharmGKB VIP (Very Important Pharmacogene)

  • 2 approved drugs target STAT3 pathways (Momelotinib, Nitazoxanide) Function:

  • Key transcription factor in JAK-STAT signaling

  • Regulates 486+ target genes including BCL2, MYC, CCND1, VEGFA, PD-L1

  • 39 Reactome pathways (interleukin signaling, growth hormone, etc.)

  • 8,628 STRING protein interactions Expression:

  • Ubiquitously expressed (301/303 conditions positive)

  • Highest in pancreatic beta cells, lung, heart, mammary tissue