TGFB1 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human TGFB1. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human TGFB1. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 33 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, bgee_evidence, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, mim, orphanet, ortholog, pdb, pharmgkb_gene, reactome, refseq, scxa, spliceai, string_interaction, transcript, uniprot
Generated: 2026-03-31 — For the latest data, query BioBTree directly via MCP or API.
View API calls (41)
TGFB1

Section 1: Gene Identifiers

DatabaseIdentifierNotes
HGNC IDHGNC:11766Approved
HGNC SymbolTGFB1Approved symbol
Full NameTransforming growth factor beta 1
Ensembl GeneENSG00000105329
NCBI Entrez Gene7040
OMIM190180Gene/locus
Locus Typeprotein-coding gene
Genomic Location
AttributeValue
Chromosome19
Cytogenetic Band19q13.2
Start Position41,301,587
End Position41,353,961
StrandNegative (-)
Genome AssemblyGRCh38
Genomic AccessionNC_000019.10
Gene Aliases & Previous Symbols
  • Current Aliases: CED, TGFbeta, LAP, TGF-beta1, CAEND1, IBDIMDE

  • Previous Symbols: TGFB, DPD1 Gene Family

  • Transforming growth factor beta family


Section 2: Transcript Identifiers Ensembl Transcripts Total Transcript Count: 8

Transcript IDBiotypeStartEndStrand
ENST00000221930protein_coding41,330,32341,353,922-
ENST00000597453retained_intron41,342,02141,348,341-
ENST00000598758protein_coding_CDS_not_defined41,301,58741,342,030-
ENST00000600196protein_coding41,330,91541,353,044-
ENST00000677934protein_coding41,330,91541,353,044-
ENST00000890114protein_coding41,329,72941,353,961-
ENST00000966383protein_coding41,330,90941,353,922-
ENST00000966384protein_coding41,330,91641,353,674-
RefSeq Transcripts
AccessionTypeStatusMANE Select
NM_000660mRNAREVIEWED✓ Yes (Canonical)
XM_011527242mRNAPREDICTEDNo
XM_054321897mRNAPREDICTEDNo
RefSeq Proteins
AccessionTypeStatusMANE Select
NP_000651proteinREVIEWED✓ Yes
XP_011525544predicted_proteinPREDICTEDNo
XP_054177872predicted_proteinPREDICTEDNo
CCDS Identifiers
CCDS ID
CCDS33031
Canonical Transcript Exons (ENST00000221930) Total Exon Count: 7
Exon IDStartEndStrandChromosome
ENSE0000113670341,352,69041,353,922-19
ENSE0000070841641,348,29541,348,455-19
ENSE0000346366141,344,74741,344,864-19
ENSE0000365079141,342,17041,342,247-19
ENSE0000084244141,341,88341,342,030-19
ENSE0000070841241,332,12841,332,281-19
ENSE0000119616441,330,32341,331,210-19

Section 3: Protein Identifiers UniProt Accessions

AccessionNameStatusCanonical
P01137Transforming growth factor beta-1 proproteinReviewed (Swiss-Prot)✓ Yes
Protein Properties
PropertyValue
Length390 amino acids
Molecular Mass44,325 Da
RefSeq Protein Accessions
AccessionType
NP_000651Reviewed (MANE Select)
XP_011525544Predicted
XP_054177872Predicted
Protein Domains and Families (InterPro) Total Domain/Family Annotations: 7
IDNameType
IPR001111TGF-b_propeptideDomain
IPR001839TGF-b_CDomain
IPR003939TGFb1Family
IPR015615TGF-beta-likeFamily
IPR016319TGF-betaFamily
IPR017948TGFb_CSConserved_site
IPR029034Cystine-knot_cytokineHomologous_superfamily

Section 4: Structure Identifiers Experimental Structures (PDB) Total PDB Structure Count: 20

PDB IDMethodResolution (Å)Title
1KLASOLUTION NMR-Solution structure of TGF-B1, models 1-17
1KLCSOLUTION NMR-Solution structure of TGF-B1, minimized average
1KLDSOLUTION NMR-Solution structure of TGF-B1, models 18-33
3KFDX-RAY DIFFRACTION2.995Ternary complex of TGF-b1, isoform-specific recognition
4KV5X-RAY DIFFRACTION3.0scFv GC1009 in complex with TGF-beta1
5FFOX-RAY DIFFRACTION3.49Integrin alpha V beta 6 with pro-TGF-beta
5VQPX-RAY DIFFRACTION2.9Crystal structure of human pro-TGF-beta1
6GFFX-RAY DIFFRACTION3.1GARP (LRRC32) with latent TGF-beta1 and MHG-8 Fab
6OM2X-RAY DIFFRACTION2.77Integrin alphaV beta8 with proTGF-beta1 ligand peptide
6P7JX-RAY DIFFRACTION3.501Latency Associated Peptide unbound to TGF-beta1
7Y1RELECTRON MICROSCOPY4.01Human L-TGF-beta1 with anchor protein LRRC33
7Y1TELECTRON MICROSCOPY3.24Integrin alphaV/beta8 and L-TGF-beta1 at 1:2 ratio
8C7HELECTRON MICROSCOPY2.7latTGF-beta LHG-10 Fab complex
8REWELECTRON MICROSCOPY2.98GARP-lTGFbeta1 with activating antibody Fab
8UDZX-RAY DIFFRACTION2.21LTBP-49247 Fab with TGFbeta1 Small Latent Complex
8VSCELECTRON MICROSCOPY3.0L-TGF-b1/GARP
8VSDELECTRON MICROSCOPY3.2avb8/L-TGF-b1/GARP
9FDYELECTRON MICROSCOPY3.4Betaglycan Orphan Domain with TGF-b1 and TGFBRII
9FKPELECTRON MICROSCOPY3.72Zebrafish Betaglycan with TGF-b1 and TGFBRII
9VJJX-RAY DIFFRACTION2.477Human Latent TGF-beta1 with SOF10
Predicted Structures (AlphaFold)
AlphaFold IDGlobal pLDDTSequence LengthFraction Very High Confidence
P0113780.113,1170.41 (41%)

Section 5: Cross-Species Orthologs

SpeciesGene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000002603Tgfb1protein_coding
Rat (Rattus norvegicus)ENSRNOG00000020652Tgfb1protein_coding
Zebrafish (Danio rerio)ENSDARG00000034895tgfb1bprotein_coding
Zebrafish (Danio rerio)ENSDARG00000041502tgfb1aprotein_coding
Fruit fly (Drosophila melanogaster)No direct ortholog--
Worm (C. elegans)No direct ortholog--
Yeast (S. cerevisiae)No data available--

Section 6: Clinical Variants & AI Predictions ClinVar Clinical Variants Total Variant Count: 408 Classification Breakdown (Estimated from sample)

ClassificationCount (Approx)
Pathogenic~5
Likely Pathogenic~2
Pathogenic/Likely Pathogenic~3
Uncertain Significance (VUS)~150
Likely Benign~150
Benign~50
Benign/Likely Benign~10
Conflicting~5
TOP 50 Pathogenic/Likely Pathogenic Variants
ClinVar IDHGVS NotationClassificationAssociated Condition
12528c.673T>C (p.Cys225Arg)PathogenicCamurati-Engelmann disease
12529c.653G>A (p.Arg218His)Pathogenic/Likely pathogenicCamurati-Engelmann disease
12531c.652C>T (p.Arg218Cys)PathogenicCamurati-Engelmann disease
12533c.667T>C (p.Cys223Arg)PathogenicCamurati-Engelmann disease
Note: Most variants in TGFB1 are classified as VUS or benign; pathogenic variants are primarily associated with Camurati-Engelmann disease. AI-Based Variant Effect Predictions SpliceAI Predictions Total Splice Predictions: 2,395 TOP 50 Predicted Splice-Altering Variants (Score ≥ 0.80)
VariantGeneEffectDelta Score
19:41301667:G:TTGFB1donor_gain1.00
19:41302643:T:ATGFB1acceptor_gain1.00
19:41302649:A:AGTGFB1acceptor_gain1.00
19:41301705:G:TTGFB1donor_loss1.00
19:41301706:T:ATGFB1donor_loss1.00
19:41301662:G:GTTGFB1donor_gain0.99
19:41301671:G:GTTGFB1donor_gain0.99
19:41302650:C:GTGFB1acceptor_gain0.99
19:41302660:CCTA:CTGFB1acceptor_loss0.99
19:41302662:TAGCC:TTGFB1acceptor_loss0.99
19:41302663:A:ACTGFB1acceptor_loss0.99
19:41302663:A:AGTGFB1acceptor_gain0.99
19:41302664:G:GCTGFB1acceptor_gain0.99
19:41302664:G:GTTGFB1acceptor_loss0.99
19:41302664:GC:GTGFB1acceptor_gain0.99
19:41302664:GCC:GTGFB1acceptor_gain0.99
19:41302664:GCCA:GTGFB1acceptor_gain0.99
19:41302766:G:GTTGFB1donor_gain0.99
19:41301700:GAAAG:GTGFB1donor_gain0.99
19:41302644:G:ATGFB1acceptor_gain1.00
19:41302645:G:ATGFB1acceptor_gain0.98
19:41302643:T:TATGFB1acceptor_loss0.98
19:41302664:GCCAA:GTGFB1acceptor_gain0.98
19:41301667:G:GTTGFB1donor_gain0.98
19:41301705:G:GGTGFB1donor_gain0.98
19:41301619:T:ATGFB1acceptor_gain0.89
19:41301619:TGACG:TTGFB1acceptor_gain0.97
19:41301663:A:TTGFB1donor_gain0.90
19:41302667:A:AGTGFB1acceptor_gain0.93
19:41302664:G:TTGFB1acceptor_gain0.92
19:41302659:TCCTA:TTGFB1acceptor_gain0.90
AlphaMissense Predictions Total Missense Predictions: 2,513 TOP 50 Predicted Pathogenic Missense Variants
VariantProtein ChangeAM PathogenicityClassification
19:41331058:G:CC389W0.999likely_pathogenic
19:41331059:C:AC389F0.999likely_pathogenic
19:41331059:C:GC389S1.000likely_pathogenic
19:41331059:C:TC389Y0.999likely_pathogenic
19:41331060:A:GC389R1.000likely_pathogenic
19:41331060:A:TC389S1.000likely_pathogenic
19:41331064:G:CC387W1.000likely_pathogenic
19:41331065:C:AC387F1.000likely_pathogenic
19:41331065:C:GC387S1.000likely_pathogenic
19:41331065:C:TC387Y1.000likely_pathogenic
19:41331066:A:GC387R0.999likely_pathogenic
19:41331066:A:TC387S1.000likely_pathogenic
19:41331079:C:AM382I0.999likely_pathogenic
19:41331079:C:GM382I0.999likely_pathogenic
19:41331079:C:TM382I0.999likely_pathogenic
19:41331080:A:CM382R0.999likely_pathogenic
19:41331080:A:GM382T0.998likely_pathogenic
19:41331080:A:TM382K0.999likely_pathogenic
19:41331055:G:CS390R0.996likely_pathogenic
19:41331055:G:TS390R0.996likely_pathogenic
19:41331056:C:AS390I0.975likely_pathogenic
19:41331056:C:TS390N0.948likely_pathogenic
19:41331057:T:AS390C0.943likely_pathogenic
19:41331057:T:GS390R0.996likely_pathogenic
19:41331060:A:CC389G0.992likely_pathogenic
19:41331066:A:CC387G0.995likely_pathogenic
19:41331069:A:GS386P0.995likely_pathogenic
19:41331082:G:CN381K0.983likely_pathogenic
19:41331082:G:TN381K0.983likely_pathogenic
19:41331083:T:AN381I0.996likely_pathogenic
19:41331074:A:CV384G0.982likely_pathogenic
19:41331074:A:GV384A0.974likely_pathogenic
19:41331074:A:TV384E0.997likely_pathogenic
19:41331075:C:AV384L0.988likely_pathogenic
19:41331075:C:GV384L0.988likely_pathogenic
19:41331075:C:TV384M0.997likely_pathogenic
19:41331086:G:AS380F0.989likely_pathogenic
19:41331086:G:CS380C0.977likely_pathogenic
19:41331086:G:TS380Y0.971likely_pathogenic
19:41331089:A:CL379R0.997likely_pathogenic
19:41331089:A:GL379P1.000likely_pathogenic
19:41331089:A:TL379Q0.999likely_pathogenic
19:41331094:C:AE377D0.924likely_pathogenic
19:41331096:C:TE377K0.980likely_pathogenic
19:41331071:C:GR385P0.953likely_pathogenic
19:41331081:T:AM382L0.944likely_pathogenic
19:41331081:T:CM382V0.979likely_pathogenic
19:41331061:C:AK388N0.968likely_pathogenic
19:41331062:T:AK388M0.913likely_pathogenic
19:41331063:T:CK388E0.936likely_pathogenic

Section 7: Biological Pathways & Gene Ontology Reactome Pathways Total Pathway Count: 21

Pathway IDNameDisease Pathway
R-HSA-114608Platelet degranulationNo
R-HSA-168277Influenza Virus Induced ApoptosisYes
R-HSA-202733Cell surface interactions at the vascular wallNo
R-HSA-2129379Molecules associated with elastic fibresNo
R-HSA-2173788Downregulation of TGF-beta receptor signalingNo
R-HSA-2173789TGF-beta receptor signaling activates SMADsNo
R-HSA-2173791TGF-beta receptor signaling in EMTNo
R-HSA-3000170Syndecan interactionsNo
R-HSA-3000178ECM proteoglycansNo
R-HSA-3304356SMAD2/3 Phosphorylation Motif Mutants in CancerYes
R-HSA-3642279TGFBR2 MSI Frameshift Mutants in CancerYes
R-HSA-3645790TGFBR2 Kinase Domain Mutants in CancerYes
R-HSA-3656532TGFBR1 KD Mutants in CancerYes
R-HSA-3656535TGFBR1 LBD Mutants in CancerYes
R-HSA-381340Transcriptional regulation of white adipocyte differentiationNo
R-HSA-5689603UCH proteinasesNo
R-HSA-6785807Interleukin-4 and Interleukin-13 signalingNo
R-HSA-8941855RUNX3 regulates CDKN1A transcriptionNo
R-HSA-8941858Regulation of RUNX3 expression and activityNo
R-HSA-8951936RUNX3 regulates p14-ARFNo
R-HSA-9839389TGFBR3 regulates TGF-beta signalingNo
Gene Ontology Annotations Total GO Terms: 196 Biological Process (TOP 20)
GO IDTerm Name
GO:0007179transforming growth factor beta receptor signaling pathway
GO:0001837epithelial to mesenchymal transition
GO:0008285negative regulation of cell population proliferation
GO:0008284positive regulation of cell population proliferation
GO:0030335positive regulation of cell migration
GO:0010628positive regulation of gene expression
GO:0010629negative regulation of gene expression
GO:0042130negative regulation of T cell proliferation
GO:0045591positive regulation of regulatory T cell differentiation
GO:0045944positive regulation of transcription by RNA polymerase II
GO:0032967positive regulation of collagen biosynthetic process
GO:0001570vasculogenesis
GO:0002040sprouting angiogenesis
GO:0030316osteoclast differentiation
GO:0048661positive regulation of smooth muscle cell proliferation
GO:0036446myofibroblast differentiation
GO:0050729positive regulation of inflammatory response
GO:0060391positive regulation of SMAD protein signal transduction
GO:0071560cellular response to transforming growth factor beta stimulus
GO:0045066regulatory T cell differentiation
Molecular Function (TOP 20)
GO IDTerm Name
GO:0005125cytokine activity
GO:0008083growth factor activity
GO:0005114type II transforming growth factor beta receptor binding
GO:0034713type I transforming growth factor beta receptor binding
GO:0034714type III transforming growth factor beta receptor binding
GO:0042802identical protein binding
GO:0019899enzyme binding
GO:0043539protein serine/threonine kinase activator activity
GO:0044877protein-containing complex binding
Cellular Component (TOP 20)
GO IDTerm Name
GO:0005576extracellular region
GO:0005615extracellular space
GO:0031012extracellular matrix
GO:0031093platelet alpha granule lumen
GO:0005886plasma membrane
GO:0009986cell surface
GO:0005634nucleus
GO:0005737cytoplasm
GO:0005796Golgi lumen
GO:0030424axon
GO:0043025neuronal cell body
GO:0072562blood microparticle

Section 8: Protein Interactions & Molecular Networks STRING Protein-Protein Interactions Total Interaction Count: 7,020 TOP 50 Highest-Confidence Interacting Proteins

UniProt IDGeneScoreDescription
P36897TGFBR1999TGF-beta receptor type I
P37173TGFBR2999TGF-beta receptor type II
P17813ENG997Endoglin
P22064LTBP1996Latent TGF-beta binding protein 1
P07585DCN993Decorin
Q03167TGFBR3992TGF-beta receptor type III
Q8N2S1LTBP4990Latent TGF-beta binding protein 4
O15105SMAD7978SMAD family member 7
P84022SMAD3978SMAD family member 3
P01343GDF3976Growth differentiation factor 3
Q9NS15LTBP3975Latent TGF-beta binding protein 3
P00533EGFR973Epidermal growth factor receptor
P07996TSP1973Thrombospondin 1
P09038FGF2971Fibroblast growth factor 2
P02751FN1969Fibronectin 1
P29279CTGF/CCN2967Cellular communication network factor 2
Q15796SMAD2966SMAD family member 2
Q14767LTBP2954Latent TGF-beta binding protein 2
O14625CYR61947Cysteine-rich angiogenic inducer 61
O14786NRP1944Neuropilin-1
P13247SERPINE1940Plasminogen activator inhibitor 1
P01375TNF936Tumor necrosis factor
P05231IL6932Interleukin-6
P01584IL1B924Interleukin-1 beta
Q13485SMAD4917SMAD family member 4
Q14392LTBP1913Latent TGF-beta binding protein 1
P01023A2M910Alpha-2-macroglobulin
P01133EGF904Epidermal growth factor
P22301IL10891Interleukin-10
P01579IFNG887Interferon gamma
P10600TGFB3885Transforming growth factor beta 3
P37023ACVRL1884Activin receptor-like kinase 1
Q6YHK3CD109884CD109 antigen
P35555FBN1877Fibrillin 1
P14780MMP9873Matrix metallopeptidase 9
P14210HGF871Hepatocyte growth factor
P01730CD4866CD4 antigen
Q9BZS1FOXP3855Forkhead box P3
P05112IL4850Interleukin-4
P08253MMP2850Matrix metallopeptidase 2
P31749AKT1846AKT serine/threonine kinase 1
P21781FGF7845Fibroblast growth factor 7
Q16552IL17A845Interleukin-17A
P13500CCL2844C-C motif chemokine ligand 2
P15502ELN840Elastin
O43541SMAD6837SMAD family member 6
Q9BXN1ASPORIN830Asporin
O76093FGF18825Fibroblast growth factor 18
O15520FGF10824Fibroblast growth factor 10
P10767FGF6824Fibroblast growth factor 6
IntAct Protein Interactions Total IntAct Entries: 232 Key Experimentally Validated Interactions:
Partner GeneInteraction TypeConfidence Score
LRRC32 (GARP)physical association0.850
LTBP1direct interaction0.640
TGFB1 (homodimer)direct interaction0.520
ITGAVphysical association0.490
MEOX2physical association0.560
PSG1direct interaction0.560
ENGdirect interaction0.440
TGFBR3direct interaction0.440
TGFBR1direct interaction0.440
Fstl1direct interaction0.440
LTBP4physical association0.520
COL5A1physical association0.370
LAMB1physical association-
Protein Similarity ESM2 Structural/Embedding Similarity Total Similar Proteins: 94 TOP 20 Similar Proteins:
UniProt IDTop SimilarityAverage Similarity
P183311.0000.982
P618111.0000.964
P618121.0000.964
Q049981.0000.982
P171250.9990.986
P172460.9990.983
P042020.9990.983
P183410.9990.983
P212140.9990.966
P504140.9990.983
P430320.9990.982
P079950.9990.982
O756100.9990.959
P972990.9990.959
Q863H10.9990.959
Q9ERR70.9990.955
Q68US50.9990.965
Q38L250.9990.964
O002920.9990.959
Q96HF10.9990.959
Diamond Sequence Similarity Total Homologous Proteins: 36 TOP 20 Homologous Proteins:
UniProt IDTop Identity (%)Top Bitscore
P61811100.0838
P61812100.0838
P1712599.3840
Q0725899.3840
Q38L2599.3834
P0985899.3832
P1833199.5811
P5041499.5792
P1834199.5792
Q0499899.5810
P0953399.0791
P0113799.0792
P2121498.8829
P0420298.7798
P1724698.7799
P1060097.8835
Q38HS296.9773
P2709096.6810
P0799596.5757
P4303296.5748

Section 9: Transcription Factor Regulatory Data Note on TGFB1 as a Transcription Factor TGFB1 is not a classical transcription factor; it is a secreted cytokine/growth factor. However, it functions as a signaling molecule that regulates transcription through the SMAD pathway. Below we provide both upstream regulators of TGFB1 expression and downstream targets regulated by TGFB1 signaling. Upstream Regulators (TFs that regulate TGFB1 expression) Total: 75 transcription factors TOP 50 Upstream Regulators:

TF GeneRegulationConfidence
SP1ActivationHigh
STAT3ActivationHigh
RELA (NFκB)ActivationHigh
JUNActivationHigh
FOSActivationHigh
HIF1AActivationHigh
SMAD2ActivationHigh
CEBPBActivationHigh
EGR1ActivationHigh
E2F1ActivationHigh
KLF6ActivationHigh
KLF10ActivationHigh
SREBF1ActivationHigh
TFE3ActivationHigh
TFAP4ActivationHigh
ATF2ActivationHigh
RXRAActivationHigh
PPARDActivationHigh
PPARGActivationHigh
ARActivationHigh
SMAD3RepressionHigh
AHRRepressionHigh
PPARARepressionHigh
MYCUnknownHigh
NFKB1UnknownHigh
SMAD4UnknownHigh
KLF4UnknownHigh
PURAUnknownHigh
FOXC1UnknownHigh
GLI2UnknownHigh
GLI3UnknownHigh
ELF3ActivationHigh
DLX2ActivationHigh
SMAD7ActivationHigh
ASCL1-High
GATA6-High
KLF2Repression-
NR3C1Repression-
LRP1Repression-
BCL11BRepression-
FOXC2Activation-
FOXO1Activation-
FOXP3Activation-
LMO2Activation-
CREB1Activation-
RELActivationLow
NFE2L2ActivationLow
NANOGActivationLow
STAT6ActivationLow
ID1ActivationLow
Downstream Targets (Genes regulated by TGFB1 signaling) Total: 61 target genes All Downstream Target Genes:
Target GeneRegulation
COL1A1Activation
COL1A2Activation
COL2A1Activation
COL3A1Activation
COL4A1Activation
ACTA2Activation
CCN2 (CTGF)Activation
ENGActivation
ELNActivation
FGFR1Activation
ITGA2Activation
CDKN2B (p15)Activation
ADAMTS4Activation
ANKHActivation
BGLAPRepression
CILPActivation
ENPP1Activation
HBA1Activation
HBBActivation
IDEActivation
CCNA2Repression
CDH1Repression
CDK4Repression
DNAH10Repression
ITGA3Repression

Section 10: Drug & Pharmacology Data ChEMBL Target Information

ChEMBL Target IDTarget NameType
CHEMBL1795178Transforming growth factor beta-1 proproteinSINGLE PROTEIN
CHEMBL3988637Transforming growth factor betaPROTEIN FAMILY
CHEMBL4296077TGF-beta1/SMAD3PROTEIN-PROTEIN INTERACTION
Targeting Molecules Total Molecules: 5
ChEMBL IDNameTypeHighest Dev. Phase
CHEMBL3260567VACTOSERTIBSmall moleculePhase 2
CHEMBL4448117-Unknown0
CHEMBL5078185-Unknown0
CHEMBL5272606--0
CHEMBL5280211--0
PharmGKB Information
PharmGKB IDSymbolVIP GeneCPIC Guideline
PA350TGFB1✓ Yes (Very Important Pharmacogene)No
Pharmacogenomics
  • TGFB1 is classified as a VIP (Very Important Pharmacogene) in PharmGKB
  • No CPIC clinical guidelines currently available
  • Variants may affect drug response in various therapeutic contexts including:
  • Cancer therapy
  • Immunosuppression
  • Fibrosis treatment

Section 11: Expression Profiles Overall Expression Pattern

PropertyValue
Expression BreadthUbiquitous
Total Present Calls204
Max Expression Score99.08
Tissue Expression (Bgee) TOP 30 Tissues by Expression Score:
RankTissueExpression ScoreCall Quality
1Granulocyte99.08Gold
2Monocyte98.51Gold
3Leukocyte98.47Gold
4Mononuclear cell98.43Gold
5Stromal cell of endometrium98.27Gold
6Ascending aorta96.77Gold
7Thoracic aorta96.75Gold
8Descending thoracic aorta96.68Gold
9Right coronary artery96.46Gold
10Spleen96.43Gold
11Lower esophagus mucosa96.32Gold
12Right lung95.95Gold
13Endocervix95.60Gold
14Blood95.46Gold
15Aorta95.39Gold
16Upper lobe of left lung95.20Gold
17Left coronary artery95.04Gold
18Bone marrow cell94.85Gold
19Ectocervix94.61Gold
20Coronary artery94.59Gold
21Upper lobe of lung94.55Gold
22Popliteal artery94.50Gold
23Tibial artery94.50Gold
24Mucosa of stomach93.90Gold
25Body of uterus93.65Gold
26Lymph node93.14Gold
27Omental fat pad92.73Gold
28Peritoneum92.67Gold
29Metanephros cortex92.50Gold
30Left uterine tube92.39Gold
Cell Type Expression TOP 30 Cell Types:
Cell TypeExpression Score
Granulocyte99.08
Monocyte98.51
Leukocyte98.47
Mononuclear cell98.43
Stromal cell of endometrium98.27
Bone marrow cell94.85
Note: TGFB1 shows notably ABSENT expression in Type B pancreatic cells (beta cells). Single-Cell Expression Data (Single Cell Expression Atlas) Total Single-Cell Datasets: 8
Experiment IDDescriptionSpeciesCell Count
E-ANND-5Mapping the developing human immune system across organsH. sapiens911,873
E-GEOD-139324Immune landscape of viral- and carcinogen-derived head and neck cancerH. sapiens204,315
E-GEOD-135922Single-cell transcriptomics of human retinal pigment epithelium and choroidH. sapiens55,571
E-MTAB-8205Single-cell RNA-seq of hPSC-derived endothelial-to-haematopoietic transitionH. sapiens25,764
E-MTAB-8911mTOR mutation in T-lymphocytes associated with chronic GVHDH. sapiens19,075
E-MTAB-11011Single Cell Analysis of B cells in COVID-19H. sapiens15,100
E-GEOD-106540Precursors of CD4+ cytotoxic T lymphocytesH. sapiens2,244
E-GEOD-75367Circulating breast cancer cellsH. sapiens74

Section 12: Disease Associations Mendelian/Monogenic Disease Links (GenCC) Total Disease Associations: 11

DiseaseOMIM/Orphanet IDInheritanceClassificationSubmitter
Camurati-Engelmann diseaseOMIM:131300Autosomal dominantDefinitiveAmbry Genetics, G2P
Camurati-Engelmann diseaseOMIM:131300Autosomal dominantStrongGenomics England PanelApp, Labcorp, PanelApp Australia
Camurati-Engelmann diseaseORPHANET:1328Autosomal dominantSupportiveOrphanet
Inflammatory bowel disease, immunodeficiency, and encephalopathyOMIM:618213Autosomal recessiveStrongLabcorp Genetics
Inflammatory bowel disease, immunodeficiency, and encephalopathyOMIM:618213Autosomal recessiveModeratePanelApp Australia
Inflammatory bowel disease, immunodeficiency, and encephalopathyOMIM:618213Autosomal recessiveLimitedAmbry Genetics
Inflammatory bowel disease, immunodeficiency, and encephalopathyORPHANET:565788Autosomal recessiveSupportiveOrphanet
Cystic fibrosis (modifier)ORPHANET:586Autosomal recessiveSupportiveOrphanet
Orphanet Disease Classifications
Orphanet IDDisease NameTypeGene CountPhenotype Count
1328Camurati-Engelmann diseaseMalformation syndrome154
565788Infantile inflammatory bowel disease with neurological involvementDisease10
586Cystic fibrosisDisease1935
Phenotype Associations (HPO) Total Phenotype Terms: 149 TOP 50 Associated Phenotypes:
HPO IDPhenotype Term
HP:0000006Autosomal dominant inheritance
HP:0000007Autosomal recessive inheritance
HP:0002653Bone pain
HP:0003034Diaphyseal sclerosis
HP:0000940Abnormal diaphysis morphology
HP:0002652Skeletal dysplasia
HP:0001324Muscle weakness
HP:0003388Easy fatigability
HP:0002515Waddling gait
HP:0001763Pes planus
HP:0002808Kyphosis
HP:0002650Scoliosis
HP:0003307Hyperlordosis
HP:0002857Genu valgum
HP:0002673Coxa valga
HP:0002694Sclerosis of skull base
HP:0000929Abnormal skull morphology
HP:0002007Frontal bossing
HP:0000303Mandibular prognathia
HP:0000520Proptosis
HP:0000648Optic atrophy
HP:0000651Diplopia
HP:0001293Cranial nerve compression
HP:0000365Hearing impairment
HP:0001298Encephalopathy
HP:0001263Global developmental delay
HP:0002059Cerebral atrophy
HP:0002079Hypoplasia of the corpus callosum
HP:0002188Delayed CNS myelination
HP:0001251Ataxia
HP:0001257Spasticity
HP:0002014Diarrhea
HP:0002024Malabsorption
HP:0002110Bronchiectasis
HP:0002205Recurrent respiratory infections
HP:0001639Hypertrophic cardiomyopathy
HP:0001508Failure to thrive
HP:0001533Slender build
HP:0001903Anemia
HP:0001882Decreased total leukocyte count
HP:0001974Increased total leukocyte count
HP:0003212Increased circulating IgE concentration
HP:0003237Increased circulating IgG concentration
HP:0002910Elevated hepatic transaminases
HP:0001392Abnormality of the liver
HP:0001394Cirrhosis
HP:0001733Pancreatitis
HP:0001738Exocrine pancreatic insufficiency
HP:0000716Depression
HP:0000739Anxiety
Complex Trait Associations (GWAS) Total GWAS Associations: 19
Study IDTrait/DiseaseMapped GeneP-value
GCST010866_163Coronary artery diseaseTGFB11×10⁻²⁶
GCST005195_133Coronary artery diseaseTGFB12×10⁻¹⁷
GCST005194_75Coronary artery diseaseTGFB14×10⁻¹⁷
GCST005196_249Coronary artery diseaseTGFB14×10⁻¹⁶
GCST005196_248Coronary artery diseaseTGFB17×10⁻¹⁵
GCST90013664_34Aspartate aminotransferase levelsCCDC97, TGFB12×10⁻²⁵
GCST008613_2HematuriaCCDC97, TGFB11×10⁻¹¹
GCST90002402_208Platelet countCCDC97, TGFB12×10⁻¹¹
GCST007269_321Pulse pressureTGFB18×10⁻¹¹
GCST008617_4Hematuria (moderate to severe)CCDC97, TGFB12×10⁻⁹
GCST90013663_33Alanine aminotransferase levelsCCDC97, TGFB14×10⁻⁹
GCST002454_12Colorectal cancerTMEM911×10⁻⁸
GCST004787_13Coronary artery disease (MI, PTCA, CABG, angina, IHD)TGFB14×10⁻⁸
GCST007990_15Coronary artery diseaseTGFB12×10⁻⁸
GCST008618_1Hematuria (mild)CCDC97, TGFB19×10⁻⁸
GCST004898_3Preterm birth (maternal effect)TGFB15×10⁻⁷
GCST003494_2Colorectal cancerTMEM914×10⁻⁷
GCST007856_118Colorectal cancer or advanced adenomaTMEM911×10⁻⁶
GCST008114_4Type 2 diabetesTGFB11×10⁻⁶

SUMMARY TGFB1 (Transforming Growth Factor Beta 1) is a pleiotropic cytokine that plays critical roles in:

  • Cell proliferation and differentiation
  • Immune regulation (T cell homeostasis, regulatory T cell differentiation)
  • Extracellular matrix production (collagen synthesis, fibrosis)
  • Epithelial-mesenchymal transition (EMT)
  • Wound healing and tissue repair Key Statistics
CategoryCount
Ensembl Transcripts8
PDB Structures20
ClinVar Variants408
SpliceAI Predictions2,395
AlphaMissense Predictions2,513
Reactome Pathways21
GO Terms196
STRING Interactions7,020
IntAct Interactions232
GWAS Associations19
HPO Phenotypes149
Disease Associations11
Primary Disease Associations
  1. Camurati-Engelmann disease (OMIM:131300) - Autosomal dominant, DEFINITIVE
  2. Inflammatory bowel disease, immunodeficiency, and encephalopathy (OMIM:618213) - Autosomal recessive
  3. Cystic fibrosis - Modifier gene

Reference compiled from BioBTree aggregated databases including: Ensembl, HGNC, UniProt, NCBI, ClinVar, PDB, AlphaFold, Reactome, GO, STRING, IntAct, Bgee, GWAS Catalog, HPO, GenCC, Orphanet, ChEMBL, PharmGKB, SpliceAI, AlphaMissense.