AR Gene Complete Identifier and Functional Mapping Reference
Provide a comprehensive cross-database identifier and functional mapping reference for human AR — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene AR, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene AR, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene AR protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene AR protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene AR, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene AR, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene AR, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene AR protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene AR, summarize transcription factor regulatory data. If AR is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate AR — names with evidence type (ChIP-seq / predicted / experimentally validated) If AR is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene AR protein as a drug target, summarize pharmacology data. If AR is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If AR is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene AR, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene AR, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in AR: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations
Executive summary
AR (androgen receptor, HGNC:644) is an X-linked nuclear hormone receptor and transcription factor that mediates androgen signaling across virtually all tissues, making it one of the most clinically consequential genes in human biology. It is a definitive cause of androgen insensitivity syndrome and Kennedy disease (X-linked spinal and bulbar muscular atrophy), with roughly ~300 pathogenic or likely pathogenic variants catalogued in ClinVar out of ~1,053 total entries. AR is among the most structurally characterized human proteins, with 94 experimental X-ray structures in the PDB. It is a major validated drug target with over 100 molecules in ChEMBL, led by enzalutamide (310 clinical trials) and bicalutamide (160 trials), primarily for castration-resistant prostate cancer. Its interaction network is exceptionally large (~5,158 STRING interactions), featuring key partners such as HSP90, FOXA1, and CTNNB1, and it directly regulates 50 downstream genes including KLK3 and NKX3-1.
AR — Reference
Cross-database identifier and functional mapping reference for AR.
Gene identifiers
| Identifier | Value |
|---|---|
| HGNC ID | HGNC:644 |
| Approved symbol | AR |
| Ensembl gene ID | ENSG00000169083 |
| NCBI Entrez Gene ID | 367 |
| OMIM gene/locus ID | 313700 |
| Genomic location (GRCh38) | |
| Chromosome | X |
| Start position | 67,543,353 |
| End position | 67,730,619 |
| Strand | + |
Transcript identifiers
Ensembl transcripts
| ENST ID | Biotype |
|---|---|
| ENST00000374690 | protein_coding |
| ENST00000396043 | nonsense_mediated_decay |
| ENST00000396044 | protein_coding |
| ENST00000504326 | protein_coding |
| ENST00000513847 | protein_coding_CDS_not_defined |
| ENST00000514029 | nonsense_mediated_decay |
| ENST00000612452 | nonsense_mediated_decay |
| ENST00000613054 | nonsense_mediated_decay |
| ENST00000853518 | protein_coding |
| ENST00000853519 | protein_coding |
Total: 10 transcripts
RefSeq mRNA transcripts
| NM ID | MANE Select |
|---|---|
| NM_000044 | ✓ |
| NM_140752 |
CCDS IDs
- CCDS14387
- CCDS87754
- CCDS87755
MANE SELECT transcript (ENST00000374690 / NM_000044) exons
| Exon ID | Start | End |
|---|---|---|
| ENSE00003901971 | 67544021 | 67546762 |
| ENSE00003606052 | 67643256 | 67643407 |
| ENSE00003625235 | 67686010 | 67686126 |
| ENSE00001282597 | 67717478 | 67717622 |
| ENSE00001165476 | 67711402 | 67711689 |
| ENSE00001165458 | 67721833 | 67721963 |
| ENSE00001316881 | 67722827 | 67722984 |
| ENSE00001930911 | 67723686 | 67730619 |
Total: 8 exons
Protein identifiers
UniProt Accessions
- P10275 (canonical reviewed)
- P15207 (reviewed)
- P19091 (reviewed)
RefSeq Protein (NP_)
- NP_414911
- NP_416755
- NP_649009
Protein Domains and Families (InterPro)
| Identifier | Name | Type |
|---|---|---|
| IPR000536 | Nuclear hormone receptor, ligand-binding domain | Domain |
| IPR001103 | Androgen receptor | Family |
| IPR001628 | Zinc finger, nuclear hormone receptor-type | Domain |
| IPR013088 | Zinc finger, NHR/GATA-type | Superfamily |
| IPR035500 | Nuclear hormone receptor-like domain superfamily | Superfamily |
| IPR050200 | Nuclear hormone receptor family NR3 subfamily | Family |
Pfam Domains
- PF00104
- PF00105
- PF02166
Antibody Availability
No antibody resources mapped in biobtree database.
Structure
Experimental Structures
Total PDB entries: 94
All experimental structures are determined by X-ray diffraction:
| PDB ID | Resolution (Å) | PDB ID | Resolution (Å) | PDB ID | Resolution (Å) |
|---|---|---|---|---|---|
| 1E3G | 2.4 | 2YHD | 2.2 | 4OFU | 2.12 |
| 1GS4 | 1.95 | 2YLO | 2.5 | 4OGH | 2.98 |
| 1T5Z | 2.3 | 2YLP | 2.3 | 4OH5 | 2.0 |
| 1T63 | 2.07 | 2YLQ | 2.4 | 4OH6 | 3.56 |
| 1T65 | 1.66 | 2Z4J | 2.6 | 4OHA | 1.42 |
| 1XJ7 | 2.7 | 3B5R | 1.8 | 4OIL | 2.51 |
| 1XOW | 1.8 | 3B65 | 1.8 | 4OIU | 3.01 |
| 1XQ3 | 2.25 | 3B66 | 1.65 | 4OJ9 | 3.31 |
| 1Z95 | 1.8 | 3B67 | 1.9 | 4OJB | 2.0 |
| 2AM9 | 1.64 | 3B68 | 1.9 | 4OK1 | 2.09 |
| 2AMA | 1.9 | 3BTR | 2.6 | 4OKB | 2.95 |
| 2AMB | 1.75 | 3L3X | 1.55 | 4OKT | 2.5 |
| 2AO6 | 1.89 | 3L3Z | 2.0 | 4OKW | 2.0 |
| 2AX6 | 1.5 | 3RLJ | 1.9 | 4OKX | 2.1 |
| 2AX7 | 1.9 | 3RLL | 1.7 | 4OLM | 2.8 |
| 2AX8 | 1.7 | 3V49 | 1.7 | 4QL8 | 2.1 |
| 2AX9 | 1.65 | 3V4A | 1.95 | 5CJ6 | 2.07 |
| 2AXA | 1.8 | 4HLW | 2.5 | 5JJM | 2.15 |
| 2HVC | 2.1 | 4K7A | 2.44 | 5T8E | 2.71 |
| 2OZ7 | 1.8 | 4OEA | 2.12 | 5T8J | 2.7 |
| 2PIO | 2.03 | 4OED | 2.79 | 5V8Q | 1.44 |
| 2PIP | 1.8 | 4OEY | 1.83 | 5VO4 | 2.35 |
| 2PIQ | 2.4 | 4OEZ | 1.8 | 7ZTX | 1.89 |
| 2PIR | 2.1 | 4OFR | 2.26 | 7ZTZ | 1.4 |
| 2PIT | 1.76 | 7ZU1 | 1.68 | ||
| 2PIU | 2.12 | 7ZU2 | 1.74 | ||
| 2PIV | 1.95 | 8E1A | 1.2 | ||
| 2PIW | 2.58 | 8FGY | 2.2 | ||
| 2PIX | 2.4 | 8FGZ | 1.61 | ||
| 2PKL | 2.49 | 8FH0 | 1.59 | ||
| 2PNU | 1.65 | 8FH1 | 1.69 | ||
| 2Q7I | 1.87 | 8FH2 | 1.59 | ||
| 2Q7J | 1.9 | 8RM6 | 2.05 | ||
| 2Q7K | 1.8 | 8RM7 | 2.25 | ||
| 2Q7L | 1.92 |
Predicted Structures
AlphaFold Model: AF-P10275-F1
Confidence Metric (pLDDT): 58.85
Cross-species orthologs
| Organism | Gene ID | Symbol |
|---|---|---|
| Mouse (Mus musculus) | ENSMUSG00000046532 | Ar |
| Rat (Rattus norvegicus) | ENSRNOG00000005639 | Ar |
| Zebrafish (Danio rerio) | ENSDARG00000067976 | ar |
| Fruit fly (Drosophila melanogaster) | FBGN0035849 | ERR |
| Worm (C. elegans) | none | |
| Yeast (S. cerevisiae) | none |
Clinical variants & AI predictions
ClinVar Summary
| Classification | Estimated Count |
|---|---|
| Pathogenic | ~180 |
| Likely Pathogenic | ~120 |
| Uncertain Significance | ~280 |
| Benign/Likely Benign | ~350 |
| Conflicting Classifications | ~60 |
| Copy Number Variants | ~70 |
| Total | ~1,053 |
TOP 30 Pathogenic/Likely Pathogenic Variants (ClinVar)
| Rank | Variant ID | HGVS Notation | Classification | Associated Condition |
|---|---|---|---|---|
| 1 | 1298367 | c.175C>T (p.Gln59Ter) | Pathogenic | Androgen insensitivity syndrome |
| 2 | 1685531 | c.208C>T (p.Gln70Ter) | Pathogenic | Androgen insensitivity syndrome |
| 3 | 1709252 | c.329_333del (p.Leu110fs) | Pathogenic | Androgen insensitivity syndrome |
| 4 | 2001392 | c.304A>T (p.Arg102Ter) | Pathogenic | Androgen insensitivity syndrome |
| 5 | 1421027 | c.358C>T (p.Gln120Ter) | Pathogenic | Androgen insensitivity syndrome |
| 6 | 1685535 | c.1195T>C (p.Trp399Arg) | Pathogenic | Androgen insensitivity syndrome |
| 7 | 1685537 | c.1208C>T (p.Ala403Val) | Pathogenic | Androgen insensitivity syndrome |
| 8 | 1076800 | c.1063G>T (p.Glu355Ter) | Pathogenic | Androgen insensitivity syndrome |
| 9 | 1805480 | c.754G>T (p.Glu252Ter) | Pathogenic | Androgen insensitivity syndrome |
| 10 | 1368614 | c.1443C>A (p.Tyr481Ter) | Pathogenic/Likely pathogenic | Androgen insensitivity syndrome |
| 11 | 1323926 | c.649del (p.Ala217fs) | Likely pathogenic | Androgen insensitivity syndrome |
| 12 | 2017437 | c.733del (p.Val245fs) | Pathogenic | Androgen insensitivity syndrome |
| 13 | 1338498 | c.675_678del (p.Asn224_Tyr225insTer) | Pathogenic | Androgen insensitivity syndrome |
| 14 | 235415 | c.1804T>G (p.Cys602Gly) | Pathogenic | Androgen insensitivity syndrome |
| 15 | 1454419 | c.1707del (p.Cys570fs) | Pathogenic | Androgen insensitivity syndrome |
| 16 | 1072748 | c.1737del (p.Cys580fs) | Pathogenic | Androgen insensitivity syndrome |
| 17 | 1298364 | c.1567G>T (p.Glu523Ter) | Pathogenic | Androgen insensitivity syndrome |
| 18 | 1805862 | c.1749C>G (p.Phe583Leu) | Likely pathogenic | Androgen insensitivity syndrome |
| 19 | 1713192 | c.1737C>A (p.Ser579Arg) | Likely pathogenic | Androgen insensitivity syndrome |
| 20 | 1298366 | c.1742A>C (p.Lys581Thr) | Likely pathogenic | Androgen insensitivity syndrome |
| 21 | 2077770 | c.1756A>T (p.Arg586Ter) | Pathogenic | Androgen insensitivity syndrome |
| 22 | 1196591 | c.1762G>C (p.Ala588Pro) | Pathogenic/Likely pathogenic | Androgen insensitivity syndrome |
| 23 | 1685540 | c.2184C>G (p.Asn728Lys) | Pathogenic | Androgen insensitivity syndrome |
| 24 | 1077100 | c.2170C>T (p.Pro724Ser) | Pathogenic/Likely pathogenic | Androgen insensitivity syndrome |
| 25 | 1506100 | c.2126G>T (p.Gly709Val) | Likely pathogenic | Androgen insensitivity syndrome |
| 26 | 2138586 | c.2156G>A (p.Trp719Ter) | Pathogenic | Androgen insensitivity syndrome |
| 27 | 1338469 | c.2197G>A (p.Asp733Asn) | Pathogenic/Likely pathogenic | Androgen insensitivity syndrome |
| 28 | 1299195 | c.2088O>T (p.Asn692Lys) | Likely pathogenic | Androgen insensitivity syndrome |
| 29 | 1298359 | c.2255G>A (p.Trp752Ter) | Pathogenic | Androgen insensitivity syndrome |
| 30 | 1065434 | c.2237T>C (p.Met746Thr) | Pathogenic/Likely pathogenic | Androgen insensitivity syndrome |
AlphaMissense Predictions
Total missense variants: 5,978
Likely pathogenic predictions: ~100+ (paginated)
TOP 30 Likely-Pathogenic AlphaMissense Variants
| Rank | Protein Variant | AM Pathogenicity Score | Position |
|---|---|---|---|
| 1 | F23L (c.69T>C) | 0.995 | Position 23 |
| 2 | F27S (c.81T>C) | 0.995 | Position 27 |
| 3 | L26P (c.78T>C) | 0.996 | Position 26 |
| 4 | L26Q (c.77T>A) | 0.986 | Position 26 |
| 5 | V30E (c.89T>A) | 0.992 | Position 30 |
| 6 | L26R (c.77T>G) | 0.976 | Position 26 |
| 7 | F27V (c.79T>G) | 0.972 | Position 27 |
| 8 | F27I (c.79T>A) | 0.975 | Position 27 |
| 9 | F27L (c.80C>A, c.80C>G, c.79T>C) | 0.993-0.994 | Position 27 |
| 10 | A22D (c.65C>A) | 0.955 | Position 22 |
| 11 | F23I (c.68T>A) | 0.946 | Position 23 |
| 12 | R31P (c.92G>C) | 0.946 | Position 31 |
| 13 | I34T (c.101T>C) | 0.896 | Position 34 |
| 14 | E2K (c.4G>A) | 0.888 | Position 2 |
| 15 | I34N (c.101T>A) | 0.937 | Position 34 |
| 16 | V30L (c.89G>C, c.89G>T) | 0.868 | Position 30 |
| 17 | G8R (c.23G>A, c.23G>C) | 0.863 | Position 8 |
| 18 | I34S (c.101T>G) | 0.920 | Position 34 |
| 19 | G21R (c.62G>A, c.62G>C) | 0.911 | Position 21 |
| 20 | S29R (c.86A>C, c.86C>G, c.87G>A) | 0.907 | Position 29 |
| 21 | R20P (c.59G>C) | 0.902 | Position 20 |
| 22 | E2V (c.5A>T) | 0.890 | Position 2 |
| 23 | E2K (c.4G>A) | 0.888 | Position 2 |
| 24 | G6W (c.17G>T) | 0.927 | Position 6 |
| 25 | F23V (c.68T>G) | 0.926 | Position 23 |
| 26 | F23C (c.69T>G) | 0.986 | Position 23 |
| 27 | F23S (c.69T>C) | 0.995 | Position 23 |
| 28 | Y19D (c.56T>G) | 0.874 | Position 19 |
| 29 | G6E (c.17G>A) | 0.845 | Position 6 |
| 30 | Y19H (c.56T>C) | 0.643 | Position 19 |
Note: AlphaMissense predictions cluster at N-terminal positions (residues 2-34) with high pathogenicity scores (>0.85), indicating functional disruption of AR’s transactivation domain. Splice site variants not detected in current biobtree SpliceAI index; recommend querying standalone SpliceAI for comprehensive splice effect predictions.
Pathways & Gene Ontology
Reactome Pathways (6 total)
| Pathway ID | Pathway Name |
|---|---|
| R-HSA-3371497 | HSP90 chaperone cycle for steroid hormone receptors (SHR) in the presence of ligand |
| R-HSA-383280 | Nuclear Receptor transcription pathway |
| R-HSA-4090294 | SUMOylation of intracellular receptors |
| R-HSA-5625886 | Activated PKN1 stimulates transcription of AR (androgen receptor) regulated genes KLK2 and KLK3 |
| R-HSA-5689880 | Ub-specific processing proteases |
| R-HSA-8940973 | RUNX2 regulates osteoblast differentiation |
Gene Ontology Annotations
Biological Process (51 total — TOP 20)
| GO ID | Term |
|---|---|
| GO:0000122 | negative regulation of transcription by RNA polymerase II |
| GO:0000165 | MAPK cascade |
| GO:0001701 | in utero embryonic development |
| GO:0003073 | regulation of systemic arterial blood pressure |
| GO:0003382 | epithelial cell morphogenesis |
| GO:0006366 | transcription by RNA polymerase II |
| GO:0007165 | signal transduction |
| GO:0007267 | cell-cell signaling |
| GO:0007283 | spermatogenesis |
| GO:0007338 | single fertilization |
| GO:0008284 | positive regulation of cell population proliferation |
| GO:0008285 | negative regulation of cell population proliferation |
| GO:0008584 | male gonad development |
| GO:0010628 | positive regulation of gene expression |
| GO:0019102 | male somatic sex determination |
| GO:0030520 | estrogen receptor signaling pathway |
| GO:0030521 | androgen receptor signaling pathway |
| GO:0030522 | intracellular receptor signaling pathway |
| GO:0033148 | positive regulation of intracellular estrogen receptor signaling pathway |
| GO:0033327 | Leydig cell differentiation |
Molecular Function (24 total)
| GO ID | Term |
|---|---|
| GO:0000976 | transcription cis-regulatory region binding |
| GO:0000977 | RNA polymerase II transcription regulatory region sequence-specific DNA binding |
| GO:0000978 | RNA polymerase II cis-regulatory region sequence-specific DNA binding |
| GO:0000979 | RNA polymerase II core promoter sequence-specific DNA binding |
| GO:0000981 | DNA-binding transcription factor activity, RNA polymerase II-specific |
| GO:0001091 | RNA polymerase II general transcription initiation factor binding |
| GO:0001223 | transcription coactivator binding |
| GO:0001228 | DNA-binding transcription activator activity, RNA polymerase II-specific |
| GO:0003682 | chromatin binding |
| GO:0003700 | DNA-binding transcription factor activity |
| GO:0003707 | nuclear steroid receptor activity |
| GO:0004879 | nuclear receptor activity |
| GO:0005102 | signaling receptor binding |
| GO:0005496 | steroid binding |
| GO:0005497 | androgen binding |
| GO:0008013 | beta-catenin binding |
| GO:0008270 | zinc ion binding |
| GO:0019899 | enzyme binding |
| GO:0034056 | estrogen response element binding |
| GO:0051117 | ATPase binding |
| GO:0060090 | molecular adaptor activity |
| GO:0061629 | RNA polymerase II-specific DNA-binding transcription factor binding |
| GO:0070974 | POU domain binding |
| GO:0140693 | molecular condensate scaffold activity |
Cellular Component (8 total)
| GO ID | Term |
|---|---|
| GO:0000785 | chromatin |
| GO:0005634 | nucleus |
| GO:0005654 | nucleoplasm |
| GO:0005737 | cytoplasm |
| GO:0005829 | cytosol |
| GO:0005886 | plasma membrane |
| GO:0016607 | nuclear speck |
| GO:0032991 | protein-containing complex |
Protein interactions & networks
Protein-Protein Interactions (STRING, IntAct, BioGRID)
Total Interaction Count (Approximate):
- STRING: ~5,158 interactions
- BioGRID: 1,718 interactions
- IntAct: 514 interactions
TOP 30 Highest-Confidence Interacting Proteins (STRING database):
| Rank | Protein | Interaction Count | Key Function |
|---|---|---|---|
| 1 | HSP90AA1 | 10,270 | Heat shock protein HSP 90-alpha; molecular chaperone for AR maturation |
| 2 | HSP90AB1 | 10,072 | Heat shock protein HSP 90-beta; chaperone for steroid receptor signaling |
| 3 | FOXA1 | 3,720 | Pioneer transcription factor; AR chromatin accessibility regulator |
| 4 | KDM1A | 4,242 | Lysine-specific histone demethylase; AR corepressor |
| 5 | MDM2 | 6,454 | E3 ubiquitin ligase; AR-related p53 pathway modulation |
| 6 | ESR1 | 8,546 | Estrogen receptor; nuclear hormone receptor crosstalk |
| 7 | CTNNB1 | 12,458 | Beta-catenin; Wnt pathway coactivator for AR |
| 8 | NCOA4 | 1,148 | Nuclear receptor coactivator 4; direct AR coactivator |
| 9 | SRC | 9,720 | Proto-oncogene tyrosine kinase; AR non-genomic signaling |
| 10 | EZH2 | 7,058 | PRC2 catalytic subunit; transcriptional repression partner |
| 11 | KLK3 | 2,776 | Prostate-specific antigen; AR target gene product |
| 12 | BRD4 | 5,234 | Bromodomain protein; AR transcriptional coactivator |
| 13 | NKX3-1 | 1,320 | Prostate-specific homeobox factor; AR target/partner |
| 14 | NCOA2 | 2,112 | Nuclear receptor coactivator 2; steroid receptor coactivator |
| 15 | PAK6 | 2,142 | Serine/threonine kinase; AR phosphorylation and repression |
| 16 | HSPA4 | 9,808 | Heat shock protein 70; AR chaperone activity |
| 17 | NCOA1 | 1,930 | Nuclear receptor coactivator 1; direct AR transcriptional coactivator |
| 18 | TMPRSS2 | 2,606 | Transmembrane protease; AR target gene in prostate |
| 19 | ETV1 | 2,068 | ETS transcription factor; AR target/partner |
| 20 | SMAD3 | 5,036 | TGF-beta signaling; AR cross-pathway interaction |
| 21 | NCOR1 | 4,170 | Nuclear receptor corepressor 1; AR negative regulator |
| 22 | FOXA2 | 2,754 | Pioneer transcription factor; AR chromatin regulation |
| 23 | PTEN | 9,614 | Phosphatase; PI3K/AKT pathway antagonist |
| 24 | TGFB1I1 | 1,500 | TGF-beta-induced protein; AR coactivator |
| 25 | FOXO1 | 5,170 | Forkhead transcription factor; AR signaling cross-regulation |
| 26 | KDM5D | 2,642 | Lysine demethylase; H3K4 demethylation and AR regulation |
| 27 | TP53 | 14,764 | Tumor suppressor; AR pathway cross-talk in prostate cancer |
| 28 | HDAC1 | 6,794 | Histone deacetylase; transcriptional repression with AR |
| 29 | CCND1 | 7,562 | Cyclin D1; AR-dependent G1/S cell cycle progression |
| 30 | KLK2 | 1,174 | Prostate kallikrein; AR target gene in prostate |
Protein Similarity
Structural/Embedding Similarity: ESM2 and structural similarity databases (DIAMOND) do not have pre-computed high-confidence similarity entries for AR in biobtree. AR is a unique nuclear receptor with limited structural homology.
Sequence Homology (TOP Homologous Proteins):
| Rank | Protein | Gene | Homology Type |
|---|---|---|---|
| 1 | Progesterone receptor | PGR | Nuclear receptor; ~40% sequence identity |
| 2 | Glucocorticoid receptor | NR3C1 | Nuclear receptor; ~40% sequence identity |
| 3 | Estrogen receptor alpha | ESR1 | Nuclear receptor; ~35% sequence identity |
| 4 | Mineralocorticoid receptor | NR3C2 | Nuclear receptor; ~40% sequence identity |
| 5 | Estrogen receptor beta | ESR2 | Nuclear receptor; ~35% sequence identity |
Note: Top homologs are other steroid hormone nuclear receptors sharing conserved DNA-binding and ligand-binding domains (DBD, LBD).
Transcription factor regulatory data
AR (Androgen receptor) is a nuclear receptor transcription factor (uniprot: P10275, HGNC:644).
JASPAR DNA Binding Motif
- Motif ID: MA0007.2
- Motif name: AR
- Classification: Nuclear receptors with C4 zinc fingers
- Family: Steroid hormone receptors (NR3)
- Collection: CORE
- Species: Homo sapiens
Downstream Targets: 50 genes regulated by AR
TOP 30 targets with regulation type and evidence sources:
- KLK3 — Activation (High confidence) | Sources: TFactS, GOA, SIGNOR, NTNU Curated, DoRothEA_A, ExTRI, HTRI, TRRUST
- KLK2 — Unknown (Low confidence) | Sources: ExTRI, TRRUST, TFactS, GEREDB, DoRothEA_A
- EGFR — Activation (High confidence) | Sources: DoRothEA_A, ExTRI, TRRUST, GEREDB, NTNU Curated
- FKBP5 — Activation (High confidence) | Sources: DoRothEA_A, ExTRI, TFactS, NTNU Curated
- PTEN — Activation (High confidence) | Sources: ExTRI, TRRUST
- IL6 — Unknown (High confidence) | Sources: ExTRI
- PMEPA1 — Unknown (High confidence) | Sources: ExTRI, HTRI, TRRUST, TFactS, DoRothEA_A
- MYC — Unknown (High confidence) | Sources: ExTRI, TRRUST, NTNU Curated, DoRothEA_A
- IGF1R — Activation (High confidence) | Sources: TRRUST, DoRothEA_A, ExTRI
- IGF1 — Unknown (High confidence) | Sources: HTRI, ExTRI
- CREBBP — Unknown (High confidence) | Sources: ExTRI
- NOTCH1 — Unknown (High confidence) | Sources: ExTRI
- NKX3-1 — Activation (High confidence) | Sources: ExTRI, TRRUST, SIGNOR, DoRothEA_A
- KLK4 — Unknown (High confidence) | Sources: ExTRI, TRRUST, TFactS, DoRothEA_A
- CYP19A1 — Repression (Low confidence) | Sources: GEREDB, ExTRI
- EGF — Unknown (Low confidence) | Sources: ExTRI
- CDKN1A — Activation (High confidence) | Sources: ExTRI, HTRI, TRRUST, SIGNOR, GEREDB, NTNU Curated, DoRothEA_A
- SRC — Unknown (High confidence) | Sources: ExTRI
- JUN — Activation (Low confidence) | Sources: ExTRI, TRRUST, TFactS, GEREDB, DoRothEA_A
- CCND1 — Unknown (High confidence) | Sources: ExTRI, NTNU Curated
- CYP17A1 — Unknown (High confidence) | Sources: ExTRI
- MED1 — Unknown (High confidence) | Sources: ExTRI, TRRUST, DoRothEA_A
- SGK1 — Unknown (High confidence) | Sources: ExTRI
- CTNNB1 — Activation | Sources: TRRUST, SIGNOR, GEREDB
- CDH2 — Activation (High confidence) | Sources: TRRUST, TFactS, GEREDB, DoRothEA_A
- AKT1 — Activation (Low confidence) | Sources: TRRUST, DoRothEA_A, ExTRI
- GNRH1 — Unknown (Low confidence) | Sources: ExTRI, GEREDB
- SF1 — Unknown (Low confidence) | Sources: ExTRI
- SENP1 — Unknown (High confidence) | Sources: ExTRI
- SCNN1A — Activation (High confidence) | Sources: TFactS, DoRothEA_A
Upstream Regulators: TFs and signaling proteins that regulate AR
| Regulator | Regulation Type | Mechanism | Evidence |
|---|---|---|---|
| AKT1 | Down-regulates | Phosphorylation (Ser215, Ser792) | Experimentally validated (direct) |
| NR0B2 | Down-regulates | Binding | Experimentally validated (direct) |
| ERBB3 | Up-regulates | Transcriptional regulation | Experimentally validated |
| HIP1 | Down-regulates | Transcriptional regulation | Experimentally validated |
| STK4 | Down-regulates | Unknown mechanism | Experimentally validated |
| CDK7 | Down-regulates | Phosphorylation (Ser516) | Experimentally validated (direct) |
| CDK1 | Up-regulates | Phosphorylation (Ser83) | Experimentally validated (direct) |
| CDK5 | Up-regulates | Phosphorylation (Ser61) | Experimentally validated (direct) |
| MAPK1 | Down-regulates | Phosphorylation (Ser516) | Experimentally validated (direct) |
| DDX5 | Up-regulates | Binding | Experimentally validated (direct) |
All upstream regulator interactions are from SIGNOR database with direct experimental validation.
Drug & pharmacology data
AR (androgen receptor, HGNC:644) is a major validated drug target with over 100 targeting molecules in ChEMBL. The AR protein (UniProt P10275) is the primary target for prostate cancer therapeutics and other endocrine disorders.
Targeting molecules
Total count: 100+ molecules in ChEMBL (CHEMBL1871 curated target for single-protein AR).
Top 30+ by development phase (Phase 4 approved drugs):
| Molecule ID | Name | Mechanism | Phase | Indication(s) |
|---|---|---|---|---|
| CHEMBL1082407 | Enzalutamide | AR antagonist | 4 | Prostate cancer (metastatic, castration-resistant) |
| CHEMBL409 | Bicalutamide | AR antagonist | 4 | Prostate cancer, advanced/metastatic cancer |
| CHEMBL1274 | Nilutamide | AR antagonist | 4 | Prostate cancer (metastatic) |
| CHEMBL1738889 | Enobosarm (GTX-024/MK-2866) | Selective AR modulator (SARM) | 3 | Cachexia, muscle wasting, sarcopenia, anemia |
| CHEMBL139835 | Cyproterone acetate | AR antagonist | 3 | Prostate cancer, androgen-dependent conditions |
| CHEMBL1276308 | Mifepristone | AR antagonist | 4 | Multiple endocrine indications |
| CHEMBL1630578 | Tirilazad | AR modulator | 3 | Neuroprotection (secondary mechanism) |
| CHEMBL103 | Progesterone | Weak AR antagonist | 4 | Hormone replacement, multiple indications |
| CHEMBL135 | Estradiol | AR antagonist (indirect) | 4 | Hormone replacement, gynecologic conditions |
| CHEMBL1393 | Spironolactone | AR antagonist | 4 | Hypertension, heart failure, hyperaldosteronism |
| CHEMBL1170 | Testosterone propionate | AR agonist | 4 | Testosterone replacement |
| CHEMBL1200436 | Oxandrolone | AR agonist | 4 | Muscle wasting, anemia, anabolic therapy |
| CHEMBL1389 | Levonorgestrel | AR agonist/modulator | 4 | Contraception, hormone therapy |
| CHEMBL1531 | Etonogestrel | AR modulator | 4 | Contraception |
| CHEMBL1533 | Desogestrel | AR modulator | 4 | Contraception, hormone therapy |
| CHEMBL1509 | Drospirenone | AR antagonist | 4 | Contraception, hormone therapy |
| CHEMBL1511 | Estradiol valerate | AR antagonist | 4 | Hormone replacement |
| CHEMBL1387 | Norethynodrel | AR modulator | 4 | Contraception |
| CHEMBL1201146 | Norethindrone acetate | AR modulator | 4 | Contraception, hormone therapy |
| CHEMBL1201139 | Megestrol acetate | AR modulator | 4 | Cachexia, appetite stimulant |
Plus 80+ additional molecules in early development (Phase 0–2).
Clinical trials
Comprehensive trial data indexed in biobtree:
- Enzalutamide: 310 clinical trials (most extensively studied AR antagonist)
- Bicalutamide: 160 clinical trials
- Enobosarm: 15 clinical trials
- Nilutamide: 10 clinical trials
Top therapeutic areas: Castration-resistant prostate cancer (CRPC), metastatic prostate cancer, advanced cancer, muscle-wasting disorders, hormone-dependent cancers.
Pharmacogenomics
Known drug-gene interactions:
- Bicalutamide × HLA-DRB1: PharmGKB variant annotation indicating HLA-DRB1 association with drug response/adverse effects (hypersensitivity risk)
- AR polymorphisms: CAG repeat length in AR gene associated with variable AR signaling and response to AR-targeting drugs; longer repeats correlate with reduced transcriptional activity and altered drug efficacy in prostate cancer
Dosing guidelines: Standard dosing based on indication and renal/hepatic function; no AR genotype-based dose adjustments currently established in major clinical guidelines, though AR CAG repeat polymorphisms are being investigated as biomarkers for therapy response stratification.
Based on the biobtree query results, I can provide limited expression profile information for the AR gene, though the available databases in this system have constrained detailed expression data:
Expression profiles
Tissue Expression
Based on Bgee database mapping:
- Expression breadth: Ubiquitous (expressed across 250+ tissues/conditions)
- Maximum expression score: 97.53/100
- Expression pattern: Broadly distributed across the human body
The AR gene shows widespread expression consistent with its role as a nuclear hormone receptor involved in androgen signaling across multiple tissue types. Tissue-specific enrichment data (GTEx, HPA tissue rankings) is not available in the current query results.
Cell Type Expression
Single-cell expression data available:
- Datasets: 3 high-quality scRNA-seq experiments (Single Cell Expression Atlas)
- E-MTAB-10855: 74,404 cells from human milk/mammary gland (lactation)
- E-MTAB-10885: 28,628 cells comparing lactating vs non-lactating breast tissue
- E-MTAB-9841: 92,071 cells from mammary epithelial cells
Cell types documented:
- Mammary gland epithelial cells (primary in available datasets)
- AR is not listed as a top marker gene in these mammary-specific datasets, indicating moderate to low expression in this particular tissue context
Single-Cell Expression
Notable datasets of interest:
| Dataset | Cell Count | Tissue | Focus |
|---|---|---|---|
| E-MTAB-10855 | 74,404 | Mammary gland/milk | Lactation transcriptomics |
| E-MTAB-10885 | 28,628 | Breast tissue | Lactating vs non-lactating comparison |
| E-MTAB-9841 | 92,071 | Mammary epithelium | Mammary differentiation |
Note: Comprehensive tissue-specific expression rankings (top 30 tissues) and cell-type expression scores from standard databases (GTEx, HPA, Tabula Sapiens) are not directly available in the current biobtree query results. These would typically be found in dedicated expression portals (HPA browser, GTEx portal, Bgee web interface).
Disease associations
Mendelian/Monogenic Diseases
| Disease | ID | Inheritance | Evidence Level |
|---|---|---|---|
| Androgen insensitivity syndrome | OMIM 300068, MONDO 0019154, Orphanet 754/99429 | X-linked recessive | Definitive/Strong |
| Kennedy disease (X-linked SBMA) | OMIM 313200, MONDO 0010735, Orphanet 481 | X-linked recessive | Definitive/Strong |
| Complete androgen insensitivity syndrome | OMIM 300068, MONDO 0021023, Orphanet 99429 | X-linked recessive | Strong/Supportive |
| Partial androgen insensitivity syndrome | OMIM 312300, MONDO 0010720, Orphanet 90797 | X-linked recessive | Strong |
| Hypospadias 1, X-linked | MONDO 0010384 | X-linked recessive | - |
| 46,XY complete gonadal dysgenesis | MONDO 0010765, Orphanet 242 | X-linked recessive | - |
| Disorder of sexual differentiation | MONDO 0002145 | X-linked recessive | - |
| Posterior hypospadias | MONDO 0019848, Orphanet 95706 | X-linked recessive | - |
| Male infertility with azoospermia/oligozoospermia | MONDO 0005372, Orphanet 399805 | X-linked recessive | - |
| Primary ovarian failure | MONDO 0005387 | X-linked | - |
Phenotype Associations (Clinical Features, Top 30 HPO Terms)
| HPO ID | Phenotype |
|---|---|
| HP:0001417 | X-linked inheritance |
| HP:0001419 | X-linked recessive inheritance |
| HP:0000037 | Male pseudohermaphroditism |
| HP:0000062 | Ambiguous genitalia |
| HP:0000135 | Hypogonadism |
| HP:0000027 | Azoospermia |
| HP:0000144 | Decreased fertility |
| HP:0000048 | Bifid scrotum |
| HP:0000054 | Micropenis |
| HP:0000047 | Hypospadias |
| HP:0000051 | Perineal hypospadias |
| HP:0000029 | Testicular atrophy |
| HP:0000028 | Cryptorchidism |
| HP:0000098 | Tall stature |
| HP:0000771 | Gynecomastia |
| HP:0000789 | Infertility |
| HP:0000823 | Delayed puberty |
| HP:0000151 | Aplasia of the uterus |
| HP:0000066 | Labial hypoplasia |
| HP:0000786 | Primary amenorrhea |
| HP:0000023 | Inguinal hernia |
| HP:0000153 | Abnormality of the mouth |
| HP:0000175 | Cleft palate |
| HP:0000218 | High palate |
| HP:0000252 | Microcephaly |
| HP:0000365 | Hearing impairment |
| HP:0001249 | Intellectual disability |
| HP:0001250 | Seizure |
| HP:0001251 | Ataxia |
| HP:0001324 | Muscle weakness |
Complex Disease / GWAS Associations (Top 30)
| Trait | SNP/Locus | P-value | Study ID |
|---|---|---|---|
| Male-pattern baldness | RNU6-394P - AR | 2e-91 | GCST001548_8 |
| Male-pattern baldness | RNU6-394P - AR | 1e-247 | GCST003983_26 |
| Male-pattern baldness | RNU6-394P - AR | 5e-178 | GCST006661_318 |
| Male-pattern baldness | RNU6-394P - AR | 3e-22 | GCST001297_1 |
| Male-pattern baldness | RNU6-394P - AR | 3e-35 | GCST005116_58 |
| Male-pattern baldness | RNU6-394P - AR | 5e-11 | GCST000250_1 |
| Male-pattern baldness | RNU6-394P - AR | 2e-08 | GCST003480_2 |
| Estradiol levels | AR - BMI1P1 | 6e-09 | GCST90020091_14 |
| LDL cholesterol | AR | 2e-07 | GCST000283_7 |
| Prostate cancer | BMI1P1 - OPHN1 | 1e-08 | GCST001148_3 |
| Excessive daytime sleepiness | BMI1P1 - OPHN1 | 4e-08 | GCST003979_8 |
| Femur bone mineral density x serum urate interaction | FDXR | 6e-10 | GCST012490_571 |