ESR1 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human ESR1. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human ESR1. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 31 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, interpro, jaspar, mim, ortholog, pdb, pharmgkb_gene, reactome, refseq, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-04-01 — For the latest data, query BioBTree directly via MCP or API.
View API calls (43)
ESR1

Section 1: Gene Identifiers Primary Gene Identifiers

DatabaseIdentifierDescription
HGNCHGNC:3467Approved symbol: ESR1
EnsemblENSG00000091831Ensembl gene ID
NCBI Entrez2099Entrez Gene ID
OMIM133430Gene/locus MIM number
Gene Information
AttributeValue
Approved NameEstrogen receptor 1
Locus TypeGene with protein product
Locus GroupProtein-coding gene
StatusApproved
Gene GroupEstrogen receptors
Previous SymbolsESR
AliasesER, NR3A1, Era, ER-alpha, ESRA, ESTRR
Genomic Location
AttributeValue
Chromosome6
Cytogenetic Band6q25.1-q25.2
Start Position151,651,284 (GRCh38)
End Position152,129,619 (GRCh38)
Strand+ (plus/forward)
Genome AssemblyGRCh38/hg38
Gene Span~478 kb

Section 2: Transcript Identifiers Ensembl Transcripts (Total: 18)

Transcript IDBiotypeStartEndStrand
ENST00000206249protein_coding151,807,682152,103,274+
ENST00000338799protein_coding151,804,431152,099,053+
ENST00000404742protein_coding151,690,496151,808,166+
ENST00000406599protein_coding151,807,679152,098,978+
ENST00000415488protein_coding151,880,657152,061,059+
ENST00000427531protein_coding151,809,107152,129,619+
ENST00000440973protein_coding151,690,496152,103,274+
ENST00000443427protein_coding151,807,319152,103,274+
ENST00000446550protein_coding151,807,196151,808,259+
ENST00000456483protein_coding151,807,679152,098,978+
ENST00000473497protein_coding_CDS_not_defined151,656,691151,807,857+
ENST00000482101protein_coding_CDS_not_defined151,944,172151,984,165+
ENST00000488573protein_coding_CDS_not_defined151,808,105151,811,278+
ENST00000641399protein_coding_CDS_not_defined151,936,130152,118,399+
ENST00000858333protein_coding151,690,530152,100,579+
ENST00000858334protein_coding151,807,168152,101,388+
ENST00000858335protein_coding151,807,663152,099,792+
ENST00000947759protein_coding151,651,284152,099,200+
RefSeq Transcripts - Human mRNAs (NM_ accessions)
RefSeq IDSymbolTypeStatusMANE Select
NM_000125ESR1mRNAREVIEWEDYes ✓
NM_001122740ESR1mRNAREVIEWEDNo
NM_001122741ESR1mRNAREVIEWEDNo
NM_001122742ESR1mRNAREVIEWEDNo
NM_001291230ESR1mRNAREVIEWEDNo
NM_001291241ESR1mRNAREVIEWEDNo
NM_001328100ESR1mRNAREVIEWEDNo
NM_001385568ESR1mRNAREVIEWEDNo
NM_001385569ESR1mRNAREVIEWEDNo
NM_001385570ESR1mRNAREVIEWEDNo
NM_001385571ESR1mRNAREVIEWEDNo
NM_001385572ESR1mRNAREVIEWEDNo
CCDS Identifiers (Total: 2)
CCDS ID
CCDS5234
CCDS87457
Canonical Transcript Exon Structure (ENST00000206249) Total Exons: 8
Exon IDStartEndLength
ENSE00001877305151,807,682151,808,364683 bp
ENSE00003705805151,842,597151,842,787191 bp
ENSE00003721187151,880,655151,880,771117 bp
ENSE00002178920151,944,173151,944,508336 bp
ENSE00000813753152,011,656152,011,794139 bp
ENSE00003743537152,060,991152,061,124134 bp
ENSE00003736522152,094,385152,094,568184 bp
ENSE00001128501152,098,732152,103,2744,543 bp

Section 3: Protein Identifiers UniProt Accessions

AccessionNameStatusLengthMass
P03372Estrogen receptorReviewed (Swiss-Prot) ✓595 aa66,216 Da
Alternative Names:
  • ER-alpha
  • Estradiol receptor
  • Nuclear receptor subfamily 3 group A member 1 RefSeq Protein Accessions (NP_ IDs)
RefSeq IDSymbolStatusCanonical
NP_000116ESR1REVIEWEDYes ✓
NP_001116212ESR1REVIEWEDNo
NP_001116213ESR1REVIEWEDNo
NP_001116214ESR1REVIEWEDNo
NP_001278159ESR1REVIEWEDNo
NP_001278170ESR1REVIEWEDNo
NP_001315029ESR1REVIEWEDNo
NP_001372497ESR1REVIEWEDNo
NP_001372498ESR1REVIEWEDNo
NP_001372499ESR1REVIEWEDNo
NP_001372500ESR1REVIEWEDNo
NP_001372501ESR1REVIEWEDNo
Protein Domains and Families (Total: 10)
InterPro IDNameType
IPR000536Nucl_hrmn_rcpt_lig-bdDomain
IPR001292Estr_rcptFamily
IPR001628Znf_hrmn_rcptDomain
IPR001723Nuclear_hrmn_rcptFamily
IPR013088Znf_NHR/GATAHomologous_superfamily
IPR024178Est_rcpt/est-rel_rcpFamily
IPR024736Oestrogen-typ_rcpt_final_C_domDomain
IPR035500NHR-like_dom_sfHomologous_superfamily
IPR046944Estr_rcpt_NDomain
IPR050200Nuclear_hormone_rcpt_NR3Family

Section 4: Structure Identifiers Experimental Structures Total PDB Structures: 475 ESR1 is one of the most extensively structurally characterized proteins in the human proteome. Representative PDB Structures (Top 50 by Resolution)

PDB IDResolutionMethodDescription
5DXE1.50 ÅX-rayLBD with Stapled Peptide SRC2-P4 and Estradiol
5UFX1.55 ÅX-rayLBD in Complex with OP1074
5UFW1.58 ÅX-rayLBD in Complex with OP1154
5KCT1.60 ÅX-rayLBD Y537S with OBHS-N derivative
1XPC1.60 ÅX-rayLBD with compound 19
3UUD1.60 ÅX-rayLBD Y537S with estradiol
5N101.60 ÅX-rayBinary bivalent supramolecular platform
5W9D1.65 ÅX-rayLBD C381S,C417S,C530S with Endoxifen
6CBZ1.65 ÅX-rayLBD Y537S with Estradiol and GRIP Peptide
3CBP1.42 ÅX-raySET7/9-ER-Sinefungin complex
1XP61.70 ÅX-rayLBD with compound 16
2QXS1.70 ÅX-rayLBD 536S mutant with Raloxifene
5KCC2.39 ÅX-rayLBD Y537S with OBHS-N
1XP91.80 ÅX-rayLBD with compound 18
1XP11.80 ÅX-rayLBD with compound 15
4MGA1.80 ÅX-rayLBD Y537S with 4-tert-octylphenol
2B1Z1.78 ÅX-rayLBD with 17methyl-17alpha-dihydroequilenin
5KCW1.91 ÅX-rayLBD Y537S with N-trifluoroethyl OBHS-N
3ERT1.90 ÅX-rayLBD with 4-hydroxytamoxifen
1SJ01.90 ÅX-rayLBD with antagonist ligand
6CHZ1.68 ÅX-rayLBD Y537S with antagonist H3B-9224
6CHW1.89 ÅX-rayLBD Y537S covalently bound to H3B-5942
1L2I1.95 ÅX-rayLBD with THC and GRIP1 peptide
1X7R2.00 ÅX-rayLBD with Genistein
2OUZ2.00 ÅX-rayLBD with Lasofoxifene
Methods Distribution:
  • X-ray Diffraction: ~470 structures
  • Solution NMR: ~5 structures (1HCP, 2LLO, 2LLQ, 5T0X) Predicted Structures
DatabaseIDMetrics
AlphaFoldP03372pLDDT: 67.14, Sequence Length: 4637, Fraction Very High Confidence: 0.45

Section 5: Cross-Species Orthologs

OrganismEnsembl Gene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000019768Esr1protein_coding
Rat (Rattus norvegicus)ENSRNOG00000019358Esr1protein_coding
Zebrafish (Danio rerio)ENSDARG00000004111esr1protein_coding
Fruit fly (Drosophila melanogaster)FBGN0035849ERRprotein_coding
Worm (C. elegans)No direct ortholog--
Yeast (S. cerevisiae)No ortholog--

Section 6: Clinical Variants & AI Predictions ClinVar Variant Summary Total Variants: 217

ClassificationCount
Pathogenic2
Likely Pathogenic0
Uncertain Significance (VUS)~25
Likely Benign~80
Benign~90
Risk Factor2
Conflicting~10
Pathogenic/Likely Pathogenic Variants
ClinVar IDHGVS NotationVariant TypeClassification
16590NM_000125.4:c.1339_1340delinsGC (p.Cys447Ala)IndelPathogenic
16592NM_000125.4:c.469C>T (p.Arg157Ter)SNVPathogenic
Risk Factor Variants
ClinVar IDHGVS NotationVariant Type
16594NM_000125.4:c.453-397T>CSNV
208354NM_000125.4:c.908A>G (p.Lys303Arg)SNV
SpliceAI Predictions Total Splice-Altering Predictions: 4,481 Top 50 High-Impact Splice Variants (Score ≥ 0.5)
VariantEffectDelta Score
6:151690665:G:GGdonor_gain1.00
6:151690663:AAG:Adonor_loss0.98
6:151690664:AG:Adonor_loss0.98
6:151690665:G:Adonor_loss0.98
6:151690666:TAG:Tdonor_loss0.98
6:151690668:GGTA:Gdonor_loss0.94
6:151690595:TCC:Tdonor_gain0.92
6:151690663:AA:Adonor_gain0.90
6:151690667:AG:Adonor_loss0.86
6:151690660:TTCAA:Tdonor_gain0.84
6:151699907:A:Gdonor_gain0.83
6:151690669:G:Cdonor_loss0.81
6:151690596:C:Adonor_gain0.79
6:151690649:A:AGdonor_gain0.77
6:151699959:GA:Gdonor_gain0.73
6:151690662:CAA:Cdonor_gain0.67
6:151690863:AC:Adonor_gain0.66
6:151697479:A:AGdonor_gain0.63
6:151690666:T:Cdonor_gain0.62
6:151697494:G:GTdonor_gain0.62
AlphaMissense Predictions Total Missense Predictions: 3,910 Predicted Pathogenic Missense Variants (Top 50)
VariantProtein ChangePathogenicity ScoreClassification
6:151807935:A:TK8I0.841likely_pathogenic
6:151807921:G:AM3I0.733likely_pathogenic
6:151807921:G:CM3I0.733likely_pathogenic
6:151807921:G:TM3I0.733likely_pathogenic
6:151807936:A:CK8N0.698likely_pathogenic
6:151807936:A:TK8N0.698likely_pathogenic
6:151807943:G:TG11W0.745likely_pathogenic
6:151807943:G:AG11R0.595likely_pathogenic
6:151807943:G:CG11R0.595likely_pathogenic
Ambiguous Variants (Top examples)
VariantProtein ChangePathogenicity ScoreClassification
6:151807920:T:AM3K0.514ambiguous
6:151807917:C:TT2I0.490ambiguous
6:151807920:T:GM3R0.481ambiguous
6:151807956:T:CL15P0.373ambiguous

Section 7: Biological Pathways & Gene Ontology Reactome Pathways (Total: 17)

Pathway IDPathway NameDisease Pathway
R-HSA-8939211ESR-mediated signalingNo
R-HSA-9018519Estrogen-dependent gene expressionNo
R-HSA-9009391Extra-nuclear estrogen signalingNo
R-HSA-383280Nuclear Receptor transcription pathwayNo
R-HSA-8931987RUNX1 regulates estrogen receptor mediated transcriptionNo
R-HSA-1257604PIP3 activates AKT signalingNo
R-HSA-1251985Nuclear signaling by ERBB4No
R-HSA-2219530Constitutive Signaling by Aberrant PI3K in CancerYes
R-HSA-6811558PI5P, PP2A and IER3 Regulate PI3K/AKT SignalingNo
R-HSA-4090294SUMOylation of intracellular receptorsNo
R-HSA-5689896Ovarian tumor domain proteasesNo
R-HSA-8866910TFAP2 family regulates transcription of growth factorsNo
R-HSA-8939256RUNX1 regulates transcription of genes involved in WNT signalingNo
R-HSA-8939902Regulation of RUNX2 expression and activityNo
R-HSA-9841251Mitochondrial unfolded protein response (UPRmt)No
R-HSA-9927418Developmental Lineage of Mammary Gland Luminal Epithelial CellsNo
R-HSA-9927426Developmental Lineage of Mammary Gland Alveolar CellsNo
Gene Ontology Annotations (Total: 80) Biological Process (Count: ~45)
GO IDTerm Name
GO:0030520estrogen receptor signaling pathway
GO:0030518nuclear receptor-mediated steroid hormone signaling pathway
GO:0043401steroid hormone receptor signaling pathway
GO:0043627response to estrogen
GO:0071391cellular response to estrogen stimulus
GO:0071392cellular response to estradiol stimulus
GO:0032355response to estradiol
GO:0045944positive regulation of transcription by RNA polymerase II
GO:0045893positive regulation of DNA-templated transcription
GO:0000122negative regulation of transcription by RNA polymerase II
GO:0006357regulation of transcription by RNA polymerase II
GO:0006355regulation of DNA-templated transcription
GO:0006338chromatin remodeling
GO:0007165signal transduction
GO:0060065uterus development
GO:0060068vagina development
GO:0060749mammary gland alveolus development
GO:0060745mammary gland branching involved in pregnancy
GO:0060750epithelial cell proliferation involved in mammary gland duct elongation
GO:0048863stem cell differentiation
Molecular Function (Count: ~25)
GO IDTerm Name
GO:0030284nuclear estrogen receptor activity
GO:0004879nuclear receptor activity
GO:0034056estrogen response element binding
GO:0003700DNA-binding transcription factor activity
GO:0001228DNA-binding transcription activator activity, RNA polymerase II-specific
GO:0000981DNA-binding transcription factor activity, RNA polymerase II-specific
GO:0000978RNA polymerase II cis-regulatory region sequence-specific DNA binding
GO:1990837sequence-specific double-stranded DNA binding
GO:0003682chromatin binding
GO:0005496steroid binding
GO:0030331nuclear estrogen receptor binding
GO:0001221transcription coregulator binding
GO:0001222transcription corepressor binding
GO:0001223transcription coactivator binding
GO:0042802identical protein binding
GO:0005516calmodulin binding
GO:0008013beta-catenin binding
GO:0008270zinc ion binding
GO:007188914-3-3 protein binding
GO:0019899enzyme binding
Cellular Component (Count: ~15)
GO IDTerm Name
GO:0005634nucleus
GO:0005654nucleoplasm
GO:0000785chromatin
GO:0000791euchromatin
GO:0005667transcription regulator complex
GO:0005737cytoplasm
GO:0005829cytosol
GO:0005886plasma membrane
GO:0005794Golgi apparatus
GO:0016020membrane
GO:0032991protein-containing complex

Section 8: Protein Interactions & Molecular Networks STRING Protein-Protein Interactions Total Interactions: 8,546 Top 50 Highest-Confidence Interacting Proteins

UniProt IDProtein NameScore
Q9Y6Q9NCOA3 (SRC-3)997
P03372ESR1 (homodimer)997
P12931SRC996
P05412JUN (c-Jun)995
P38398BRCA1995
P01100FOS (c-Fos)994
P08069IGF1R994
Q03135CAV1992
P55317FOXA1990
P07900HSP90AA1988
P48552NRIP1 (RIP140)988
O75376NCOR1987
Q13547HDAC1987
Q15788NCOA1 (SRC-1)986
Q14344GNA13985
P08238HSP90AB1983
P27482CALML3981
P08047SP1979
P10275AR (Androgen receptor)979
Q15596NCOA2 (SRC-2)979
Q8TD86DDX5979
Q92841DDX17979
Q96GE6MYBBP1A979
Q9NZT1CALML5979
P00533EGFR974
P04637TP53974
P24385CCND1972
P27986PIK3R1971
P04626ERBB2 (HER2)966
P01343PRL (Prolactin)964
P29353SHC1962
P35222CTNNB1 (beta-catenin)952
Protein Structural Similarity (ESM2 Embeddings) Total Similar Proteins: 82
UniProt IDTop SimilarityAvg Similarity
B3SV561.00000.9930
Q079171.00000.9935
Q08E021.00000.9931
P706621.00000.9823
Q86U701.00000.9822
A0P8Z40.99990.9943
O153500.99990.9879
O935110.99990.9928
P062110.99990.9953
P197850.99990.9953
Protein Sequence Similarity (DIAMOND) Total Similar Proteins: 148
UniProt IDTop IdentityBitScore
P0415099.4%1506
P0640198.6%1692
P0823598.7%1781
Q4JM2899.6%1812
Q3YC0499.2%1795

Section 9: Transcription Factor Regulatory Data ESR1 is a ligand-activated transcription factor and a member of the nuclear receptor superfamily. JASPAR Binding Motif Profiles (Total: 4)

Matrix IDCollectionClassFamily
MA0112.1CORENuclear receptors with C4 zinc fingersSteroid hormone receptors (NR3)
MA0112.2CORENuclear receptors with C4 zinc fingersSteroid hormone receptors (NR3)
MA0112.3CORENuclear receptors with C4 zinc fingersSteroid hormone receptors (NR3)
MA0112.4CORENuclear receptors with C4 zinc fingersSteroid hormone receptors (NR3)
Downstream Target Genes (CollecTRI) Total Target Genes: 840+ Top 50 ESR1 Target Genes with Regulatory Direction
Target GeneRegulationConfidence
CCND1ActivationHigh
BCL2ActivationHigh
BIRC5 (Survivin)ActivationHigh
MYCUnknownHigh
TFF1 (pS2)ActivationHigh
GREB1ActivationHigh
BRCA1UnknownHigh
BRCA2UnknownHigh
CDKN1A (p21)ActivationHigh
CDKN1B (p27)RepressionHigh
CDKN1C (p57)RepressionHigh
CDC25AActivationHigh
CYP19A1 (Aromatase)ActivationHigh
CAV1RepressionHigh
CTSD (Cathepsin D)UnknownHigh
AMHActivationHigh
AICDAActivationHigh
BMP2ActivationHigh
AURKAUnknownHigh
AP1ActivationHigh
CXCL8 (IL-8)ActivationHigh
CRHRepressionHigh
CRHBPActivationHigh
BTG2RepressionHigh
CD24RepressionHigh
CDH1 (E-cadherin)UnknownHigh
DACH1ActivationHigh
DCTActivationHigh
CYP1A1RepressionHigh
CYP2C19RepressionHigh
Upstream Regulators of ESR1
RegulatorRegulationConfidence
AHRUnknownHigh
AP1UnknownHigh
BRCA1Activation-
BARD1Activation-
DNMT1UnknownHigh
E2F4UnknownHigh
CEBPDRepression-
ARRepressionLow
AIPRepression-
SIGNOR Signaling Interactions (Total: 141+) Key Regulators of ESR1
EntityEffectMechanism
17beta-estradiolUp-regulatesChemical activation
EGFRUp-regulatesPhosphorylation
ERBB2Up-regulatesPhosphorylation
MAPK1/3 (ERK1/2)Up-regulatesPhosphorylation
AKTUp-regulatesPhosphorylation
PRKACA (PKA)Up-regulatesPhosphorylation
GSK3BUp-regulatesPhosphorylation
ABL1Up-regulatesPhosphorylation
CDK2Up-regulatesPhosphorylation
BRCA1Down-regulatesActivity
DUSP22Down-regulatesDephosphorylation
Drug Modulators
DrugEffectMechanism
TamoxifenDown-regulatesChemical inhibition
FulvestrantDown-regulatesChemical inhibition
RaloxifeneDown-regulatesChemical inhibition
EstradiolUp-regulatesChemical activation
EstroneUp-regulatesChemical activation
EstriolUp-regulatesChemical activation
TiboloneUp-regulatesChemical activation
DiethylstilbestrolUp-regulatesChemical activation
Bisphenol AUp-regulatesChemical activation

Section 10: Drug & Pharmacology Data ChEMBL Target Information

Target IDTarget Type
CHEMBL206SINGLE PROTEIN
CHEMBL2093866PROTEIN FAMILY
FDA-Approved Drugs Targeting ESR1 (Phase 4) Total Approved: 95 Selective Estrogen Receptor Modulators (SERMs)
ChEMBL IDDrug NameMechanism
CHEMBL83TAMOXIFENAntagonist
CHEMBL786TAMOXIFEN CITRATEAntagonist
CHEMBL81RALOXIFENEAntagonist
CHEMBL1116RALOXIFENE HYDROCHLORIDEAntagonist
CHEMBL46740BAZEDOXIFENEAntagonist
CHEMBL328190LASOFOXIFENEAntagonist
CHEMBL2355051CLOMIPHENEMixed agonist/antagonist
Selective Estrogen Receptor Degraders (SERDs)
ChEMBL IDDrug NameMechanism
CHEMBL1358FULVESTRANTPure antagonist/degrader
CHEMBL4297509ELACESTRANTOral SERD
Estrogens (Agonists)
ChEMBL IDDrug Name
CHEMBL135ESTRADIOL
CHEMBL691ETHINYL ESTRADIOL
CHEMBL1405ESTRONE
CHEMBL193482ESTRIOL
CHEMBL1511ESTRADIOL VALERATE
CHEMBL1200430ESTRADIOL ACETATE
CHEMBL1200973ESTRADIOL CYPIONATE
CHEMBL1201151MESTRANOL
CHEMBL411DIETHYLSTILBESTROL
CHEMBL1018DIENESTROL
CHEMBL9225HEXESTROL
CHEMBL1230314ESTETROL ANHYDROUS
CHEMBL141305CYCLOFENIL
CHEMBL1201165QUINESTROL
CHEMBL1200761CHLOROTRIANISENE
Progestins/Hormonal Agents
ChEMBL IDDrug Name
CHEMBL1162NORETHINDRONE
CHEMBL1389LEVONORGESTREL
CHEMBL1531ETONOGESTREL
CHEMBL1533DESOGESTREL
CHEMBL1387NORETHYNODREL
CHEMBL1200624ETHYNODIOL DIACETATE
CHEMBL1276308MIFEPRISTONE
CHEMBL260538ULIPRISTAL ACETATE
CHEMBL1479DANAZOL
CHEMBL1393SPIRONOLACTONE
Other Approved Drugs with ESR1 Activity
ChEMBL IDDrug NamePrimary Use
CHEMBL1756ESTRAMUSTINE PHOSPHATEProstate cancer
CHEMBL1023BEXAROTENECutaneous T-cell lymphoma
CHEMBL1289601LENVATINIBThyroid cancer
CHEMBL3545185SELINEXORMultiple myeloma
CHEMBL408513BELINOSTATPeripheral T-cell lymphoma
Drugs in Clinical Development (Phase 3)
ChEMBL IDDrug NameMechanism
CHEMBL1093458ENDOXIFENActive tamoxifen metabolite
CHEMBL489AFIMOXIFENETopical SERM
CHEMBL4475463AMCENESTRANTOral SERD
CHEMBL4650316GIREDESTRANTOral SERD
CHEMBL4650365CAMIZESTRANTOral SERD
CHEMBL5095183IMLUNESTRANTOral SERD
CHEMBL5095210VEPDEGESTRANTPROTAC degrader
PharmGKB Information
AttributeValue
PharmGKB IDPA156
VIP GeneYes
CPIC GuidelineNo

Section 11: Expression Profiles Bgee Expression Summary

AttributeValue
Expression BreadthUbiquitous
Total Present Calls216
Total Absent Calls70
Total Conditions286
Max Expression Score97.49
Average Expression Score68.11
Gold Quality Count255
Top 30 Expressing Tissues (Ranked by Score)
RankTissue (UBERON ID)Expression ScoreQuality
1Oviduct epithelium (UBERON:0004804)97.49Gold
2Cervix epithelium (UBERON:0004801)96.89Gold
3Mammalian vulva (UBERON:0000997)96.85Gold
4Endometrium (UBERON:0001295)96.64Gold
5Uterine cervix (UBERON:0000002)96.04Gold
6Cervix squamous epithelium (UBERON:0006922)95.98Gold
7Fallopian tube (UBERON:0003889)95.94Gold
8Endocervix (UBERON:0000458)95.16Gold
9Right uterine tube (UBERON:0001302)94.72Gold
10Uterus (UBERON:0000995)94.53Gold
11Ectocervix (UBERON:0012249)94.34Gold
12Epithelium of mammary gland (UBERON:0003244)94.10Gold
13Mammary duct (UBERON:0001765)93.85Gold
14Body of uterus (UBERON:0009853)93.61Gold
15Vagina (UBERON:0000996)93.35Gold
16Caput epididymis (UBERON:0004358)92.43Gold
17Germinal epithelium of ovary (UBERON:0001304)91.68Gold
18Calcaneal tendon (UBERON:0003701)91.13Gold
19Urethra (UBERON:0000057)90.87Gold
20Myometrium (UBERON:0001296)90.85Gold
21Tibia (UBERON:0000979)90.56Gold
22Mammary gland (UBERON:0001911)89.63Gold
23Thoracic mammary gland (UBERON:0005200)89.62Gold
24Left uterine tube (UBERON:0001303)89.21Gold
25Adult organism (UBERON:0007023)88.52Gold
26Female reproductive system (UBERON:0000474)88.11Gold
27Nipple (UBERON:0002030)86.20Gold
28Tendon (UBERON:0000043)85.42Gold
29Decidua (UBERON:0002450)83.72Gold
30Right lobe of liver (UBERON:0001114)83.44Gold
Expression Pattern: ESR1 shows highest expression in female reproductive tissues, particularly the oviduct, cervix, and uterus. Also notably expressed in mammary tissue, bone, and various other tissues.

Section 12: Disease Associations Mendelian Disease Associations (GenCC)

DiseaseOMIM IDClassificationInheritanceSubmitter
Estrogen Resistance SyndromeOMIM:615363LimitedAutosomal RecessiveAmbry Genetics
Estrogen Resistance SyndromeOMIM:615363LimitedAutosomal RecessiveLabcorp Genetics
Estrogen Resistance SyndromeORPHANET:785SupportiveAutosomal RecessiveOrphanet
Human Phenotype Ontology Associations (Total: 43) Reproductive System Phenotypes
HPO IDPhenotype
HP:0000013Hypoplasia of the uterus
HP:0000147Polycystic ovaries
HP:0000786Primary amenorrhea
HP:0008675Enlarged polycystic ovaries
HP:0003187Breast hypoplasia
HP:0100783Breast aplasia
HP:0008187Absence of secondary sex characteristics
HP:0008197Absence of pubertal development
HP:0100133Abnormality of the pubic hair
Skeletal/Growth Phenotypes
HPO IDPhenotype
HP:0000098Tall stature
HP:0001548Overgrowth
HP:0000938Osteopenia
HP:0000939Osteoporosis
HP:0002663Delayed epiphyseal ossification
HP:0002750Delayed skeletal maturation
HP:0003799Marked delay in bone age
HP:0010639Elevated alkaline phosphatase of bone origin
HP:0031428Increased circulating osteocalcin level
Endocrine Phenotypes
HPO IDPhenotype
HP:0000823Delayed puberty
HP:0000834Abnormality of the adrenal glands
HP:0000837Increased circulating gonadotropin level
HP:0000842Hyperinsulinemia
HP:0001952Glucose intolerance
HP:0040270Impaired glucose tolerance
HP:0003117Abnormal circulating hormone concentration
HP:0025134Increased serum estradiol
HP:0030087Abnormal circulating testosterone concentration
Cancer Association
HPO IDPhenotype
HP:0003002Breast carcinoma
GWAS Associations (Total: 126) Top 30 GWAS Traits by Significance
TraitStudy IDP-valueMapped Gene/Locus
Heel bone mineral densityGCST006433_14570 (highly significant)ESR1
Heel bone mineral densityGCST006433_14582×10⁻²¹²ESR1
Heel bone mineral densityGCST006288_7642×10⁻¹¹⁹ESR1
Heel bone mineral densityGCST006979_2821×10⁻¹³⁸ESR1
Heel bone mineral densityGCST006288_6891×10⁻⁵⁴ESR1
Breast cancerGCST004988_2745×10⁻⁵⁴CCDC170-ESR1
HeightGCST008839_589×10⁻⁴⁶ESR1
Breast cancerGCST004988_5243×10⁻⁴¹CCDC170
Heel bone mineral densityGCST006288_137×10⁻³⁵ESR1
Pulse pressureGCST007269_2276×10⁻³²ESR1
Breast cancer (ER-negative)GCST005076_93×10⁻³¹CCDC170
Birth weightGCST008362_801×10⁻²⁸ESR1
Pulse pressureGCST007096_1338×10⁻²⁸ESR1
Breast cancerGCST003845_104×10⁻²⁸CCDC170-ESR1
Breast cancerGCST004988_7561×10⁻²⁶ESR1
HeightGCST002647_1368×10⁻²⁴ESR1
Breast cancerGCST001937_92×10⁻²¹CCDC170
Heel bone mineral densityGCST006288_6905×10⁻²¹ESR1
Birth weightGCST005146_426×10⁻²⁰ESR1
FractureGCST006423_45×10⁻¹⁹ESR1
Breast cancer (ER-negative)GCST005076_22×10⁻¹⁸ESR1
Breast cancer (ER-negative)GCST003842_68×10⁻¹⁸CCDC170-ESR1
Pulse pressureGCST004278_192×10⁻¹⁸ESR1
HeightGCST000817_51×10⁻¹⁷ESR1
Breast sizeGCST006655_42×10⁻¹⁶CCDC170-ESR1
Heel bone mineral densityGCST006288_5474×10⁻¹⁶ESR1
Breast cancerGCST000343_12×10⁻¹⁵CCDC170-ESR1
Breast cancer (ER-negative)GCST003842_59×10⁻¹⁵CCDC170
Body fat distribution (trunk fat ratio)GCST007294_1041×10⁻¹⁴ESR1
Breast sizeGCST006655_82×10⁻¹⁴CCDC170-ESR1
Major GWAS Trait Categories:
  • Bone mineral density/Osteoporosis - Most significant associations
  • Breast cancer - Multiple highly significant loci
  • Height - Consistent associations
  • Cardiovascular traits - Blood pressure, pulse pressure
  • Reproductive traits - Age at menarche, breast size

SUMMARY STATISTICS

CategoryCount
Total database cross-references~250,000+
Ensembl transcripts18
RefSeq mRNA transcripts12
PDB structures475
ClinVar variants217
SpliceAI predictions4,481
AlphaMissense predictions3,910
GO annotations80
Reactome pathways17
STRING interactions8,546
Transcription factor targets840+
Approved drugs95
GWAS associations126
HPO phenotypes43

This reference document was generated using BioBTree cross-database mapping. Data sources include HGNC, Ensembl, NCBI, UniProt, PDB, ClinVar, PharmGKB, Reactome, GO, STRING, ChEMBL, Bgee, GWAS Catalog, and others.