AR Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human AR. This should serve as a definitive lookup resource for …

Provide a comprehensive cross-database identifier and functional mapping reference for human AR. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 36 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, bgee_evidence, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, jaspar, mim, orphanet, ortholog, pdb, pharmgkb, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-04-01 — For the latest data, query BioBTree directly via MCP or API.
View API calls (43)
AR

Section 1: Gene Identifiers

AttributeValue
HGNC IDHGNC:644
Approved SymbolAR
Approved Nameandrogen receptor
Locus Typegene with protein product
StatusApproved
Ensembl Gene IDENSG00000169083
NCBI Entrez Gene ID367
OMIM Gene/Locus ID313700
Gene GroupNuclear receptor subfamily 3 group C
Aliases: AIS, NR3C4, SMAX1, HUMARA Previous Symbols: DHTR, SBMA Previous Names: dihydrotestosterone receptor, spinal and bulbar muscular atrophy Genomic Location
AttributeValue
ChromosomeX
Cytogenetic BandXq12
Start Position67,543,353
End Position67,730,619
Strand+ (forward)
Genome BuildGRCh38/hg38
Biotypeprotein_coding

Section 2: Transcript Identifiers Ensembl Transcripts (Total: 10)

Transcript IDBiotypeStartEndStrand
ENST00000374690protein_coding67,544,02167,730,619+
ENST00000396043nonsense_mediated_decay67,544,02167,724,288+
ENST00000396044protein_coding67,545,14467,723,936+
ENST00000504326protein_coding67,544,82067,696,075+
ENST00000513847protein_coding_CDS_not_defined67,544,82067,689,763+
ENST00000514029nonsense_mediated_decay67,544,82067,696,075+
ENST00000612452nonsense_mediated_decay67,544,02167,730,619+
ENST00000613054nonsense_mediated_decay67,544,82067,696,075+
ENST00000853518protein_coding67,543,35367,724,392+
ENST00000853519protein_coding67,544,13467,724,340+
RefSeq Transcripts
RefSeq IDTypeStatusMANE Select
NM_000044mRNAREVIEWED✓ Yes (Canonical)
CCDS IDs (Total: 3)
CCDS ID
CCDS14387
CCDS87754
CCDS87755
Canonical Transcript Exons (ENST00000374690) Total Exon Count: 8
Exon IDStartEndStrand
ENSE0000390197167,544,02167,546,762+
ENSE0000360605267,643,25667,643,407+
ENSE0000362523567,686,01067,686,126+
ENSE0000116547667,711,40267,711,689+
ENSE0000128259767,717,47867,717,622+
ENSE0000116545867,721,83367,721,963+
ENSE0000131688167,722,82767,722,984+
ENSE0000193091167,723,68667,730,619+

Section 3: Protein Identifiers UniProt Accessions (Total: 6)

UniProt IDStatusNotes
P10275✓ Reviewed (Swiss-Prot)Canonical entry
A0A087WUX9Unreviewed (TrEMBL)Isoform
A0A087X1B6Unreviewed (TrEMBL)Isoform
A0A7I2PS51Unreviewed (TrEMBL)Isoform
E9PEG3Unreviewed (TrEMBL)Isoform
F5GZG9Unreviewed (TrEMBL)Isoform
Canonical Protein (P10275) Properties
AttributeValue
Protein NameAndrogen receptor
Alternative NamesDihydrotestosterone receptor; Nuclear receptor subfamily 3 group C member 4
Length920 amino acids
Molecular Mass99,188 Da
Protein Domains and Families (Total: 6)
InterPro IDNameType
IPR000536Nucl_hrmn_rcpt_lig-bdDomain
IPR001103Andrgn_rcptFamily
IPR001628Znf_hrmn_rcptDomain
IPR013088Znf_NHR/GATAHomologous_superfamily
IPR035500NHR-like_dom_sfHomologous_superfamily
IPR050200Nuclear_hormone_rcpt_NR3Family

Section 4: Structure Identifiers Experimental Structures Total PDB Structures: 94

PDB IDMethodResolution (Å)Title
8E1AX-ray1.20Novel AR inhibitors
7ZTZX-ray1.40AR-LBD (Y764C) with DHT
4OHAX-ray1.42T877A-AR-LBD with coactivator
5V8QX-ray1.44SARM Part III
2AX6X-ray1.50T877A mutant with hydroxyflutamide
3L3XX-ray1.55DHT-bound AR with SRC3 motif
8FH0X-ray1.59H875Y/F877L/T878A mutant
8FH2X-ray1.59L702H/H875Y mutant
2AM9X-ray1.64AR-LBD with testosterone
2AX9X-ray1.65AR-LBD with R-3
2PNUX-ray1.65AR-LBD with EM-5744
3B66X-ray1.65SARM S-21
1T65X-ray1.66AR-LBD with DHT and GRIP1
7ZU1X-ray1.68AR-LBD (V758A) with DHT
8FH1X-ray1.69F877L/T878A mutant
3RLLX-ray1.70T877A with SARM
3V49X-ray1.70AR-LBD with SARM inhibitor
7ZU2X-ray1.74AR-LBD (Q799E) with DHT
2AMBX-ray1.75AR-LBD with tetrahydrogestrinone
2PITX-ray1.76AR-LBD with small molecule
(and 74 more structures...)
AlphaFold Predicted Structure
AttributeValue
AlphaFold IDAF-P10275-F1
Global pLDDT Score58.85
Sequence Length6958 (full model)
Fraction Very High Confidence0.29 (29%)

Section 5: Cross-Species Orthologs

OrganismGene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000046532Arprotein_coding
Rat (Rattus norvegicus)ENSRNOG00000005639Arprotein_coding
Zebrafish (Danio rerio)ENSDARG00000067976arprotein_coding
Fruit fly (Drosophila melanogaster)FBGN0035849ERRprotein_coding
Worm (C. elegans)No data available
Yeast (S. cerevisiae)No data available

Section 6: Clinical Variants & AI Predictions ClinVar Clinical Variant Summary Total Variants in ClinVar: 1,053+

ClassificationCount
Pathogenic100+
Likely Pathogenic100+
Uncertain Significance (VUS)~700
Benign/Likely Benign~150
Conflicting~50
TOP 50 Pathogenic/Likely Pathogenic Variants
ClinVar IDHGVS NotationTypeClassification
1076800c.1063G>T (p.Glu355Ter)SNVPathogenic
1298367c.175C>T (p.Gln59Ter)SNVPathogenic
265460c.2257C>T (p.Arg753Ter)SNVPathogenic
279682c.1822C>T (p.Arg608Ter)SNVPathogenic
279684c.1847G>A (p.Arg616His)SNVPathogenic
279686c.2301del (p.Asp768fs)DeletionPathogenic
279687c.2566C>T (p.Arg856Cys)SNVPathogenic
279688c.2567G>T (p.Arg856Leu)SNVPathogenic
279690c.2668G>A (p.Val890Met)SNVPathogenic
1068775c.2225G>T (p.Trp742Leu)SNVPathogenic
1077100c.2170C>T (p.Pro724Ser)SNVPathogenic/Likely pathogenic
1196591c.1762G>C (p.Ala588Pro)SNVPathogenic/Likely pathogenic
1065434c.2237T>C (p.Met746Thr)SNVPathogenic/Likely pathogenic
1338497c.2197G>A (p.Asp733Asn)SNVPathogenic/Likely pathogenic
1333387c.2710G>T (p.Val904Leu)SNVPathogenic/Likely pathogenic
1368614c.1443C>A (p.Tyr481Ter)SNVPathogenic/Likely pathogenic
1298359c.2255G>A (p.Trp752Ter)SNVPathogenic
1342730c.2494C>T (p.Arg832Ter)SNVPathogenic
1344505c.2407dup (p.Gln803fs)DuplicationPathogenic
1685531c.208C>T (p.Gln70Ter)SNVPathogenic
1685535c.1195T>C (p.Trp399Arg)SNVPathogenic
1685537c.1208C>T (p.Ala403Val)SNVPathogenic
1685540c.2184C>G (p.Asn728Lys)SNVPathogenic
235415c.1804T>G (p.Cys602Gly)SNVPathogenic
1177442c.2255G>C (p.Trp752Ser)SNVLikely pathogenic
1298366c.1742A>C (p.Lys581Thr)SNVLikely pathogenic
1338633c.2668G>T (p.Val890Leu)SNVLikely pathogenic
1480317c.2318A>G (p.Glu773Gly)SNVLikely pathogenic
1506100c.2126G>T (p.Gly709Val)SNVLikely pathogenic
1687577c.1768+1G>ASNVLikely pathogenic
1805862c.1749C>G (p.Phe583Leu)SNVLikely pathogenic
2000750c.2174-2A>GSNVLikely pathogenic
2431800c.2221T>C (p.Ser741Pro)SNVLikely pathogenic
381643c.1720G>C (p.Ala574Pro)SNVLikely pathogenic
390372c.2258G>T (p.Arg753Leu)SNVLikely pathogenic
391803c.2137C>A (p.Leu713Ile)SNVLikely pathogenic
421614c.2740C>A (p.Pro914Thr)SNVLikely pathogenic
427107c.1888C>T (p.Arg630Trp)SNVLikely pathogenic
429930c.2723T>A (p.Leu908His)SNVLikely pathogenic
458359c.1805G>A (p.Cys602Tyr)SNVLikely pathogenic
458360c.1850T>C (p.Leu617Pro)SNVLikely pathogenic
458361c.2103G>T (p.Leu701Phe)SNVLikely pathogenic
464791c.1736G>C (p.Ser579Thr)SNVLikely pathogenic
464800c.2696T>C (p.Ile899Thr)SNVLikely pathogenic
492788c.2104C>T (p.Leu702Phe)SNVLikely pathogenic
533378c.2053G>A (p.Val685Ile)SNVLikely pathogenic
915284c.2342T>A (p.Met781Lys)SNVLikely pathogenic
2635610c.1531A>T (p.Arg511Ter)SNVPathogenic
2921352c.1339_1342dup (p.Gly448fs)DuplicationPathogenic
3061398c.2743del (p.Ile915fs)DeletionPathogenic
AI-Based Variant Effect Predictions AlphaMissense Predictions Total Predicted Variants: 5,978
ClassificationDescription
Likely PathogenicPathogenicity score > 0.564
Ambiguous0.340 - 0.564
Likely Benign< 0.340
TOP 50 Predicted Pathogenic Missense Variants (by score):
VariantPositionAM ScoreClass
G6WX:67545162:G:T0.927likely_pathogenic
G6RX:67545162:G:A0.923likely_pathogenic
E2VX:67545151:A:T0.890likely_pathogenic
E2KX:67545150:G:A0.888likely_pathogenic
G8RX:67545168:G:A0.863likely_pathogenic
G8EX:67545169:G:A0.770likely_pathogenic
G8VX:67545169:G:T0.780likely_pathogenic
E2GX:67545151:A:G0.749likely_pathogenic
R9SX:67545173:G:C0.748likely_pathogenic
R9MX:67545172:G:T0.742likely_pathogenic
G6VX:67545163:G:T0.714likely_pathogenic
K17MX:67545196:A:T0.700likely_pathogenic
L5SX:67545160:T:C0.696likely_pathogenic
E2AX:67545151:A:C0.647likely_pathogenic
R9WX:67545171:A:T0.640likely_pathogenic
R9TX:67545172:G:C0.620likely_pathogenic
R13WX:67545183:C:T0.608likely_pathogenic
V10DX:67545175:T:A0.606likely_pathogenic
V3LX:67545153:G:C0.604likely_pathogenic
E2QX:67545150:G:C0.601likely_pathogenic
G6AX:67545169:G:C0.599ambiguous
R13PX:67545184:G:C0.583likely_pathogenic
V3MX:67545153:G:A0.572likely_pathogenic
K17EX:67545195:A:G0.571likely_pathogenic
Y11DX:67545177:T:G0.823likely_pathogenic
G8VX:67545169:G:T0.780likely_pathogenic
K17NX:67545197:G:C0.779likely_pathogenic
(and more...)
SpliceAI Predictions No splice-altering variant predictions available in database

Section 7: Biological Pathways & Gene Ontology Reactome Pathways (Total: 6)

Pathway IDPathway Name
R-HSA-3371497HSP90 chaperone cycle for steroid hormone receptors (SHR) in the presence of ligand
R-HSA-383280Nuclear Receptor transcription pathway
R-HSA-4090294SUMOylation of intracellular receptors
R-HSA-5625886Activated PKN1 stimulates transcription of AR regulated genes KLK2 and KLK3
R-HSA-5689880Ub-specific processing proteases
R-HSA-8940973RUNX2 regulates osteoblast differentiation
Gene Ontology Annotations (Total: 100+) Molecular Function (TOP 20)
GO IDTerm
GO:0003707nuclear steroid receptor activity
GO:0004879nuclear receptor activity
GO:0005497androgen binding
GO:0005496steroid binding
GO:0003700DNA-binding transcription factor activity
GO:0000981DNA-binding transcription factor activity, RNA polymerase II-specific
GO:0001228DNA-binding transcription activator activity, RNA polymerase II-specific
GO:0003677DNA binding
GO:0003682chromatin binding
GO:0043565sequence-specific DNA binding
GO:0034056estrogen response element binding
GO:0008270zinc ion binding
GO:0005515protein binding
GO:0008013beta-catenin binding
GO:0019899enzyme binding
GO:0001223transcription coactivator binding
GO:0051117ATPase binding
GO:0061629RNA polymerase II-specific DNA-binding TF binding
GO:0070974POU domain binding
GO:0140693molecular condensate scaffold activity
Biological Process (TOP 20)
GO IDTerm
GO:0030521androgen receptor signaling pathway
GO:0030518nuclear receptor-mediated steroid hormone signaling pathway
GO:0008584male gonad development
GO:0007283spermatogenesis
GO:0048808male genitalia morphogenesis
GO:0060736prostate gland growth
GO:0060514prostate induction
GO:0045944positive regulation of transcription by RNA polymerase II
GO:0045893positive regulation of DNA-templated transcription
GO:0006357regulation of transcription by RNA polymerase II
GO:0008284positive regulation of cell population proliferation
GO:0033574response to testosterone
GO:0071394cellular response to testosterone stimulus
GO:0019102male somatic sex determination
GO:0033327Leydig cell differentiation
GO:0072520seminiferous tubule development
GO:0060749mammary gland alveolus development
GO:0060742epithelial cell differentiation involved in prostate gland development
GO:0001701in utero embryonic development
GO:0043410positive regulation of MAPK cascade
Cellular Component (TOP 20)
GO IDTerm
GO:0005634nucleus
GO:0005654nucleoplasm
GO:0000785chromatin
GO:0016607nuclear speck
GO:0005737cytoplasm
GO:0005829cytosol
GO:0005886plasma membrane
GO:0032991protein-containing complex

Section 8: Protein Interactions & Molecular Networks Protein-Protein Interactions Total STRING Interactions: 5,158 Total IntAct Interactions: 514+ Total BioGRID Interactions: 1,718 TOP 50 Highest-Confidence Interactors (STRING)

UniProt IDGeneScore
P07900HSP90AA1990
P08238HSP90AB1990
P10275AR (self)990
P55317FOXA1987
O60341KDM1A982
Q00987MDM2980
P03372ESR1979
P35222CTNNB1968
Q13772NCOA4964
P12931SRC944
Q15910EZH2942
P07288KLK3 (PSA)935
O60885BRD4929
Q99801NKX3-1917
Q15596NCOA2906
Q9NQU5PIAS4903
P34932HSPA4889
Q15788NCOA1888
O15393TMPRSS2886
P50549NFYA884
P84022SMAD3879
O75376NCOR1869
Q9Y261FOXA2857
P60484PTEN853
O43294TGFB1I1851
Q12778FOXO1844
Q9BY66KDM4C840
P04637TP53839
Q13547HDAC1836
P24385CCND1831
P20151NR4A1823
Q92826SMARCD3822
Q9Y6Q9NCOA3822
P31213GPRC6A816
Q92793CREBBP816
Q9UBS8RNF14810
Q13485SMAD4809
P05093CYP17A1794
P08069IGF1R793
O43791SPOP788
P11511CYP19A1785
Q96PM5ARID5B780
P01148GNRH1779
P49841GSK3B779
O75369FLNB779
Q14315FLNC777
P38398BRCA1773
P04626ERBB2772
Q02880TOP2B771
P00354GAPDH766
Protein Similarity (ESM2 Embedding-Based) Total Similar Proteins: 79
UniProt IDTop SimilarityAvg Similarity
A7X8B31.00000.9950
A7X8B51.00000.9949
A7X8B71.00000.9951
O979521.00000.9950
O979601.00000.9950
P06401 (PR)1.00000.9949
Q6QT551.00000.9950
Q9QZB61.00000.9919
P511791.00000.9920
O977750.99990.9948
P152070.99980.9952
P190910.99980.9949
P061860.99970.9949
Q00175 (GR)0.99960.9948
Q9GKL70.99960.9950
Q15464 (SHB)0.99940.9872
Q63449 (Ar rat)0.99960.9942
O43435 (NR3C4)0.99120.9865
(and more...)
Protein Sequence Similarity (Diamond BLAST) Total Homologous Proteins: 85
UniProt IDIdentity (%)BitScore
Q4JM2899.61812
Q9N0W899.61811
Q3YC0499.21795
P08235 (MR)98.71781
A7X8B399.51707
A7X8B599.51705
A7X8B799.11701
P06401 (PR)98.61692
A7X8C498.31685
P1520796.61677
P1909196.61682
P2219996.11666
O9795299.81665
O9796099.21655
O9777598.71642
Q63449 (Ar rat)94.41619
Q00175 (GR)95.01607
(and more...)

Section 9: Transcription Factor Regulatory Data AR is a transcription factor - Nuclear receptor subfamily 3 group C DNA Binding Profiles (JASPAR)

JASPAR IDNameCollectionClassFamily
MA0007.2ARCORENuclear receptors with C4 zinc fingersSteroid hormone receptors (NR3)
Downstream Targets (Genes Regulated BY AR) From SIGNOR Database (114+ regulatory relationships)
Target GeneEffectMechanismDirect
KLK3 (PSA)Up-regulatesTranscriptional regulationYes
TMPRSS2Up-regulatesTranscriptional regulationYes
NKX3-1Up-regulatesTranscriptional regulationYes
UBE2CUp-regulatesTranscriptional regulationYes
SERPINB5Down-regulatesTranscriptional regulationYes
SCN9AUp-regulatesTranscriptional regulationYes
ARG2Up-regulatesTranscriptional regulationNo
ARG1Up-regulatesTranscriptional regulationNo
NAT1Up-regulatesTranscriptional regulationNo
CLK3Down-regulatesTranscriptional regulationNo
BTG1Down-regulatesTranscriptional regulationNo
BTG2Down-regulatesTranscriptional regulationNo
CDKN1ADown-regulatesTranscriptional regulationNo
NRASDown-regulatesTranscriptional regulationNo
WEE1Down-regulatesTranscriptional regulationNo
CTNNB1Down-regulatesTranscriptional regulationNo
UCNDown-regulatesTranscriptional regulationYes
CRHDown-regulatesTranscriptional regulationYes
AKR1C3Down-regulatesTranscriptional regulationNo
CYP7B1Down-regulatesTranscriptional regulationNo
ZBTB46Down-regulatesTranscriptional regulationYes
Upstream Regulators of AR
RegulatorEffectMechanismScore
NCOA2Up-regulates activityBinding0.90
HSP90AA1Up-regulates activityBinding0.76
HSP90AB1Up-regulates activityBinding0.70
FKBP4Up-regulates activityBinding0.72
FOXA1Down-regulatesTranscriptional regulation0.77
SRCUp-regulates activityPhosphorylation0.76
ZMIZ1Up-regulates activityBinding0.71
EZH2Up-regulates activityBinding0.54
HOXB13Down-regulatesTranscriptional regulation0.49
KDM4BUp-regulatesTranscriptional regulation0.33
AKT1Down-regulates activityPhosphorylation0.59
PKN1Up-regulatesPhosphorylation0.59
CDK1Up-regulatesPhosphorylation0.53
MAPK1Down-regulatesPhosphorylation0.51
PIM1Up-regulates activityPhosphorylation0.38
PTENUp-regulates activityDephosphorylation0.58
TNK2Up-regulates activityPhosphorylation0.55

Section 10: Drug & Pharmacology Data ChEMBL Target Information

ChEMBL IDTarget TypeDescription
CHEMBL1871SINGLE PROTEINAndrogen receptor
CHEMBL4296118PROTEIN-PROTEINAR/BIRC2 interaction
CHEMBL4523653PROTEIN-PROTEINAR/PELP1 interaction
CHEMBL4523684PROTEIN-PROTEINCereblon/AR interaction
CHEMBL4523730PROTEIN-PROTEINVHL/AR interaction
CHEMBL5169084PROTEIN-PROTEINCereblon/CUL4A/AR
Approved Drugs Targeting AR (Phase 4, Total: 74)
ChEMBL IDDrug NameTypeMechanism
CHEMBL1082407ENZALUTAMIDESmall moleculeAR antagonist
CHEMBL3183409APALUTAMIDESmall moleculeAR antagonist
CHEMBL4297185DAROLUTAMIDESmall moleculeAR antagonist
CHEMBL409BICALUTAMIDESmall moleculeAR antagonist
CHEMBL806FLUTAMIDESmall moleculeAR antagonist
CHEMBL1274NILUTAMIDESmall moleculeAR antagonist
CHEMBL271227ABIRATERONE ACETATESmall moleculeCYP17A1 inhibitor (indirect)
CHEMBL3590187CLASCOTERONESmall moleculeTopical AR antagonist
CHEMBL386630TESTOSTERONESmall moleculeAR agonist
CHEMBL1170TESTOSTERONE PROPIONATESmall moleculeAR agonist
CHEMBL1445FLUOXYMESTERONESmall moleculeAR agonist
CHEMBL1200436OXANDROLONESmall moleculeAR agonist
CHEMBL1393SPIRONOLACTONESmall moleculeAR antagonist
CHEMBL103PROGESTERONESmall moleculeSteroid
CHEMBL135ESTRADIOLSmall moleculeER agonist/AR modulator
CHEMBL691ETHINYL ESTRADIOLSmall moleculeER agonist
CHEMBL1200430ESTRADIOL ACETATESmall moleculeER agonist
CHEMBL1511ESTRADIOL VALERATESmall moleculeER agonist
CHEMBL384467DEXAMETHASONESmall moleculeGR agonist
CHEMBL131PREDNISOLONESmall moleculeGR agonist
CHEMBL632BETAMETHASONESmall moleculeGR agonist
CHEMBL1370BUDESONIDESmall moleculeGR agonist
CHEMBL1473FLUTICASONE PROPIONATESmall moleculeGR agonist
CHEMBL1161MOMETASONE FUROATESmall moleculeGR agonist
CHEMBL717MEDROXYPROGESTERONE ACETATESmall moleculeProgestin
CHEMBL1389LEVONORGESTRELSmall moleculeProgestin
CHEMBL1509DROSPIRENONESmall moleculeProgestin/AR antagonist
CHEMBL2355051CLOMIPHENESmall moleculeSERM
CHEMBL1276308MIFEPRISTONESmall moleculePR/GR antagonist
CHEMBL260538ULIPRISTAL ACETATESmall moleculePR modulator
Pharmacogenomics (PharmGKB)
AttributeValue
PharmGKB IDPA57
VIP GeneYes (Very Important Pharmacogene)
CPIC GuidelinesNone currently available

Section 11: Expression Profiles Bgee Expression Summary

AttributeValue
Expression PatternUbiquitous
Total Present Calls250
Maximum Expression Score97.53
Tissue Expression (TOP 30 by Expression Score)
TissueScoreCall QualityRank
Seminal vesicle97.53Gold1,150
Urethra96.69Gold1,540
Nipple96.13Gold1,800
Mammalian vulva94.97Gold2,350
Corpus epididymis94.70Gold2,470
Penis93.15Gold3,200
Caput epididymis92.58Gold3,460
Cauda epididymis90.91Gold4,240
Mammary duct90.78Gold4,300
Liver89.24Gold5,020
Rectus abdominis (muscle)89.22Gold5,030
Buccal mucosa89.09Gold5,080
Saphenous vein88.92Gold5,160
Right lobe of liver87.87Gold5,650
Hip skin87.47Gold5,840
Synovial joint87.40Gold5,870
Mammary gland epithelium86.96Gold6,080
Mammary gland86.73Gold6,190
Calcaneal tendon86.66Gold6,220
Left ventricle myocardium85.78Gold6,630
Endocervix85.48Gold6,770
Heart right ventricle85.47Gold6,770
Prostate gland85.46Gold6,780
Tendon84.65Gold7,150
Fallopian tube84.36Gold7,290
Ectocervix83.88Gold7,520
Vagina83.84Gold7,530
Vena cava83.75Gold7,580
Biceps brachii83.67Gold7,610
Upper leg skin83.48Gold7,700
Single-Cell Expression Data (SCXA) Total Datasets: 3
Experiment IDDescriptionSpeciesCell Count
E-MTAB-10855Mammary gland lactation single-cellHomo sapiens74,404
E-MTAB-10885Human milk vs breast tissue scRNA-seqHomo sapiens28,628
E-MTAB-9841Mammary epithelial cells scRNA-seqHomo sapiens92,071

Section 12: Disease Associations Mendelian/Monogenic Diseases (Orphanet)

Orphanet IDDisease NameTypeGene CountPhenotype Count
ORPHA:481Kennedy disease (SBMA)Disease113
ORPHA:90797Partial androgen insensitivity syndromeDisease126
ORPHA:99429Complete androgen insensitivity syndromeDisease125
ORPHA:95706Non-syndromic posterior hypospadiasMorphological anomaly216
GenCC Gene-Disease Validity (Total: 11 curated associations)
DiseaseClassificationInheritanceSubmitter
Androgen insensitivity syndromeDefinitiveX-linkedAmbry Genetics
Androgen insensitivity syndromeDefinitiveX-linkedG2P
Androgen insensitivity syndromeStrongX-linkedLabcorp
Androgen insensitivity syndromeStrongX-linkedLMM
Kennedy diseaseDefinitiveX-linkedG2P
Kennedy diseaseStrongX-linkedAmbry Genetics
Kennedy diseaseStrongX-linkedLabcorp
Kennedy diseaseSupportiveX-linkedOrphanet
Partial androgen insensitivityStrongX-linkedGenomics England
Partial androgen insensitivitySupportiveX-linkedOrphanet
Complete androgen insensitivitySupportiveX-linkedOrphanet
Phenotype Associations (HPO) - Total: 93
HPO IDPhenotype
HP:0000027Azoospermia
HP:0000028Cryptorchidism
HP:0000029Testicular atrophy
HP:0000037Male pseudohermaphroditism
HP:0000047Hypospadias
HP:0000054Micropenis
HP:0000062Ambiguous genitalia
HP:0000135Hypogonadism
HP:0000771Gynecomastia
HP:0000786Primary amenorrhea
HP:0000789Infertility
HP:0003251Male infertility
HP:0008226Androgen insufficiency
HP:0008689Bilateral cryptorchidism
HP:0008730Female external genitalia in 46,XY
HP:0001252Hypotonia
HP:0001260Dysarthria
HP:0001265Hyporeflexia
HP:0001283Bulbar palsy
HP:0001288Gait disturbance
HP:0001337Tremor
HP:0002380Fasciculations
HP:0003202Skeletal muscle atrophy
HP:0003236Elevated CK
HP:0003690Limb muscle weakness
HP:0008981Calf muscle hypertrophy
HP:0009830Peripheral neuropathy
HP:0002215Sparse axillary hair
HP:0002225Sparse pubic hair
HP:0002550Absent facial hair
HP:0010788Testicular neoplasm
HP:0012125Prostate cancer
HP:0030088Increased serum testosterone
HP:0008232Elevated FSH
HP:0011969Elevated LH
HP:0025134Increased serum estradiol
HP:0001417X-linked inheritance
HP:0001419X-linked recessive inheritance
HP:0003581Adult onset
HP:0003577Congenital onset
HP:0003677Slowly progressive
HP:0100639Erectile dysfunction
(and 51 more phenotypes...)
Complex Trait Associations (GWAS) No direct GWAS associations found in database for HGNC:644 SUMMARY STATISTICS
CategoryCount
Total Transcripts10
Total UniProt Entries6
Total PDB Structures94
Total InterPro Domains6
Total Orthologs4
Total ClinVar Variants1,053+
Total AlphaMissense Predictions5,978
Total GO Annotations100+
Total Reactome Pathways6
Total STRING Interactions5,158
Total IntAct Interactions514+
Total SIGNOR Regulatory Entries114+
Total Approved Drugs (ChEMBL Phase 4)74
Total HPO Phenotypes93
Total Orphanet Diseases4
Total GenCC Curated Associations11
---Reference compiled from BioBTree aggregated databases including: HGNC, Ensembl, NCBI Entrez, OMIM, UniProt, PDB, AlphaFold, InterPro, ClinVar, AlphaMissense, GO, Reactome, STRING, IntAct, BioGRID, SIGNOR, JASPAR, ChEMBL, PharmGKB, Bgee, SCXA, HPO, Orphanet, and GenCC.