HTT Gene Complete Identifier and Functional Mapping Reference
Provide a comprehensive cross-database identifier and functional mapping reference for human HTT — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene HTT, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene HTT, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene HTT protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene HTT protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene HTT, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene HTT, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene HTT, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene HTT protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene HTT, summarize transcription factor regulatory data. If HTT is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate HTT — names with evidence type (ChIP-seq / predicted / experimentally validated) If HTT is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene HTT protein as a drug target, summarize pharmacology data. If HTT is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If HTT is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene HTT, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene HTT, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in HTT: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations
Executive summary
HTT (huntingtin, HGNC:4851) encodes a large scaffolding protein on chromosome 4 that is the causative gene for Huntington disease (OMIM:143100), an autosomal dominant neurodegenerative disorder driven by CAG trinucleotide repeat expansion rather than conventional point mutations — a mechanism that explains why only 3 definitively pathogenic coding variants appear in ClinVar despite ~780 mapped variants. The protein is expressed ubiquitously across human tissues (average expression score 80.9), with highest levels in peripheral nerve (95.04), pancreas (92.59), and multiple brain regions including prefrontal cortex and cerebellar hemisphere, consistent with its broad roles in vesicle transport, autophagy, apoptosis regulation, and neurogenesis. Structurally, 31 experimental PDB entries — including cryo-EM structures of the full-length HTT–HAP40 complex at 2.6 Å — define its HEAT-repeat architecture. The therapeutic pipeline has shifted toward HTT-lowering strategies: the antisense oligonucleotide tominersen (NCT05686551) and gene therapies AMT-130 and SPK-10001 are in active clinical trials, while tetrabenazine and deutetrabenazine remain approved for symptomatic chorea management.
Gene identifiers
HTT (huntingtin)
| Identifier | Value |
|---|---|
| HGNC ID | HGNC:4851 |
| Approved symbol | HTT |
| Ensembl gene ID | ENSG00000197386 |
| NCBI Entrez Gene ID | 3064 |
| OMIM ID | 613004 |
| Genomic location (GRCh38) | |
| Chromosome | 4 |
| Start position | 3,041,363 |
| End position | 3,243,957 |
| Strand | + (forward) |
Transcript identifiers
Ensembl Transcripts (23 total)
| ENST ID | Biotype |
|---|---|
| ENST00000355072 | protein_coding |
| ENST00000506137 | retained_intron |
| ENST00000508321 | retained_intron |
| ENST00000509043 | retained_intron |
| ENST00000509618 | protein_coding |
| ENST00000509751 | retained_intron |
| ENST00000510626 | retained_intron |
| ENST00000512068 | retained_intron |
| ENST00000512909 | retained_intron |
| ENST00000513326 | retained_intron |
| ENST00000513639 | retained_intron |
| ENST00000513806 | protein_coding_CDS_not_defined |
| ENST00000647962 | protein_coding_CDS_not_defined |
| ENST00000648150 | protein_coding_CDS_not_defined |
| ENST00000649131 | protein_coding |
| ENST00000649900 | protein_coding_CDS_not_defined |
| ENST00000650588 | nonsense_mediated_decay |
| ENST00000650595 | nonsense_mediated_decay |
| ENST00000680239 | nonsense_mediated_decay |
| ENST00000680291 | retained_intron |
| ENST00000680360 | nonsense_mediated_decay |
| ENST00000680956 | protein_coding |
| ENST00000681528 | protein_coding |
RefSeq mRNA (4 total)
| NM Accession | Symbol | MANE Select |
|---|---|---|
| NM_001316495 | HD | No |
| NM_141274 | HD | No |
| NM_001388492 | HTT | Yes |
| NM_002111 | HTT | No |
CCDS ID
| CCDS ID |
|---|
| CCDS43206 |
Canonical/MANE SELECT Transcript: ENST00000355072 (NM_001388492)
67 exons:
| Exon ID | Start | End | Strand | Chromosome |
|---|---|---|---|---|
| ENSE00001251499 | 3074681 | 3075088 | + | 4 |
| ENSE00000854943 | 3086939 | 3087022 | + | 4 |
| ENSE00000854944 | 3099274 | 3099394 | + | 4 |
| ENSE00000854945 | 3103824 | 3103883 | + | 4 |
| ENSE00000854946 | 3105357 | 3105436 | + | 4 |
| ENSE00000854947 | 3107285 | 3107423 | + | 4 |
| ENSE00000854948 | 3115304 | 3115445 | + | 4 |
| ENSE00003494967 | 3116085 | 3116263 | + | 4 |
| ENSE00001506819 | 3121228 | 3121432 | + | 4 |
| ENSE00001506818 | 3122889 | 3122936 | + | 4 |
| ENSE00000854953 | 3125549 | 3125629 | + | 4 |
| ENSE00000854954 | 3127264 | 3127604 | + | 4 |
| ENSE00000854955 | 3129924 | 3130047 | + | 4 |
| ENSE00000854956 | 3130305 | 3130423 | + | 4 |
| ENSE00003681950 | 3131286 | 3131397 | + | 4 |
| ENSE00003475088 | 3131638 | 3131775 | + | 4 |
| ENSE00003571103 | 3132562 | 3132720 | + | 4 |
| ENSE00003648724 | 3132814 | 3132911 | + | 4 |
| ENSE00003673769 | 3134401 | 3134540 | + | 4 |
| ENSE00003657059 | 3135904 | 3135967 | + | 4 |
| ENSE00003607438 | 3136226 | 3136326 | + | 4 |
| ENSE00003635707 | 3140510 | 3140656 | + | 4 |
| ENSE00003658909 | 3142766 | 3142886 | + | 4 |
| ENSE00003584863 | 3145152 | 3145228 | + | 4 |
| ENSE00003620583 | 3146797 | 3146948 | + | 4 |
| ENSE00003512853 | 3148005 | 3148207 | + | 4 |
| ENSE00003460831 | 3154293 | 3154419 | + | 4 |
| ENSE00003658160 | 3157072 | 3157199 | + | 4 |
| ENSE00003597650 | 3160282 | 3160392 | + | 4 |
| ENSE00003659509 | 3172908 | 3173131 | + | 4 |
| ENSE00003558932 | 3172320 | 3172397 | + | 4 |
| ENSE00003522087 | 3174721 | 3174799 | + | 4 |
| ENSE00003671014 | 3174946 | 3175107 | + | 4 |
| ENSE00003665848 | 3178298 | 3178446 | + | 4 |
| ENSE00003574493 | 3180515 | 3180651 | + | 4 |
| ENSE00003615800 | 3182354 | 3182470 | + | 4 |
| ENSE00003581669 | 3186597 | 3186719 | + | 4 |
| ENSE00000854981 | 3187651 | 3187886 | + | 4 |
| ENSE00000854982 | 3188951 | 3189093 | + | 4 |
| ENSE00003613725 | 3199732 | 3199939 | + | 4 |
| ENSE00003596776 | 3206496 | 3206675 | + | 4 |
| ENSE00003689364 | 3206807 | 3206983 | + | 4 |
| ENSE00000854987 | 3207281 | 3207357 | + | 4 |
| ENSE00003559384 | 3204007 | 3204148 | + | 4 |
| ENSE00003472889 | 3209827 | 3209949 | + | 4 |
| ENSE00000854988 | 3208773 | 3208911 | + | 4 |
| ENSE00003479441 | 3211929 | 3212142 | + | 4 |
| ENSE00003512984 | 3212564 | 3212709 | + | 4 |
| ENSE00003679310 | 3213958 | 3214135 | + | 4 |
| ENSE00003639044 | 3215110 | 3215211 | + | 4 |
| ENSE00003603422 | 3217765 | 3217952 | + | 4 |
| ENSE00003669528 | 3220182 | 3220308 | + | 4 |
| ENSE00003490959 | 3222387 | 3222487 | + | 4 |
| ENSE00003543993 | 3223406 | 3223560 | + | 4 |
| ENSE00003651248 | 3223992 | 3224131 | + | 4 |
| ENSE00003476697 | 3225661 | 3225743 | + | 4 |
| ENSE00003644432 | 3228615 | 3228745 | + | 4 |
| ENSE00003503216 | 3228880 | 3229009 | + | 4 |
| ENSE00003685801 | 3229887 | 3230042 | + | 4 |
| ENSE00003589182 | 3233163 | 3233353 | + | 4 |
| ENSE00003691918 | 3235284 | 3235398 | + | 4 |
| ENSE00003467481 | 3235565 | 3235778 | + | 4 |
| ENSE00003501297 | 3236149 | 3236254 | + | 4 |
| ENSE00003678629 | 3238447 | 3238609 | + | 4 |
| ENSE00003548126 | 3238818 | 3238978 | + | 4 |
| ENSE00002026783 | 3239846 | 3243957 | + | 4 |
Protein identifiers
UniProt accessions
- P42858 (Huntingtin) — canonical reviewed entry
- A0A3B3ISR3 (unreviewed)
- A0A3B3IU25 (unreviewed)
- A0A7P0TA78 (unreviewed)
- A0A7P0TAC5 (unreviewed)
- A0A7P0TAN5 (unreviewed)
- A0A7P0Z417 (unreviewed)
- H0YA07 (unreviewed)
RefSeq proteins (NP_ accessions)
- NP_001303424 (reviewed)
- NP_649531 (reviewed)
- NP_001375421 (reviewed, MANE SELECT)
- NP_002102 (reviewed)
Protein domains and families
InterPro entries:
| ID | Name | Type |
|---|---|---|
| IPR000091 | Huntingtin | Family |
| IPR028426 | Huntingtin family | Family |
| IPR011989 | Armadillo-like helical | Homologous superfamily |
| IPR016024 | Armadillo-type fold | Homologous superfamily |
| IPR048411 | Huntingtin, N-terminal, HEAT repeats 1 | Repeat |
| IPR024613 | Huntingtin, N-terminal, HEAT repeats 2 | Repeat |
| IPR048412 | Huntingtin, bridge | Repeat |
| IPR048413 | Huntingtin, C-terminal HEAT repeats | Repeat |
Pfam entries:
- PF12372
- PF20925
- PF20926
- PF20927
Antibody availability
Direct searches through biobtree antibody databases returned no indexed entries. However, HTT protein antibodies are available through:
- DNASU (ID: 3064) — DNA Sequencing and Genomics to Diagnose Genetic Disease in Newborns, provides recombinant antibodies and expression clones
- Human Protein Atlas (HPA) — ENSG00000197386 mapped; HPA maintains antibody resources for tissue-specific expression and localization
Structure
Experimental Structures from PDB
Total PDB Structures: 31
X-ray Crystallography (9 structures)
| PDB ID | Title | Resolution (Å) |
|---|---|---|
| 3IO4 | Huntingtin amino-terminal region with 17 Gln residues - Crystal C90 | 3.63 |
| 3IO6 | Huntingtin amino-terminal region with 17 Gln residues - crystal C92-a | 3.7 |
| 3IOR | Huntingtin amino-terminal region with 17 Gln residues - crystal C95 | 3.6 |
| 3IOT | Huntingtin amino-terminal region with 17 Gln residues - crystal C92-b | 3.5 |
| 3IOU | Huntingtin amino-terminal region with 17 Gln residues - crystal C94 | 3.7 |
| 3IOV | Huntingtin amino-terminal region with 17 Gln residues - crystal C99 | 3.7 |
| 3IOW | Huntingtin amino-terminal region with 17 Gln residues - crystal C99-Hg | 3.5 |
| 4FE8 | Crystal Structure of Htt36Q3H-EX1-X1-C1(Alpha) | 3.0 |
| 4FEB | Crystal Structure of Htt36Q3H-EX1-X1-C2(Beta) | 2.8 |
| 4FEC | Crystal Structure of Htt36Q3H | 3.0 |
| 4FED | Crystal Structure of Htt36Q3H | 2.807 |
| 8R2O | Huntingtin-Q17, 1-66, N-MBP fusion | 3.23 |
| 3LRH | Structure of anti-huntingtin VL domain in complex with huntingtin peptide | 2.6 |
| 4RAV | Crystal structure of scFvC4 in complex with the first 17 AA of huntingtin | 2.5 |
Solution NMR (3 structures)
| PDB ID | Title |
|---|---|
| 2LD0 | Solution structure of the N-terminal domain of huntingtin (htt17) in 50 % TFE |
| 2LD2 | Solution structure of the N-terminal domain of huntingtin (htt17) in presence of DPC micelles |
| 6N8C | Structure of the Huntingtin tetramer/dimer mixture determined by paramagnetic NMR |
Cryo-EM (15 structures)
| PDB ID | Title | Resolution (Å) |
|---|---|---|
| 6EZ8 | Human Huntingtin-HAP40 complex structure | 4.0 |
| 6RMH | The Rigid-body refined model of the normal Huntingtin. | 9.6 |
| 6X9O | High resolution cryoEM structure of huntingtin in complex with HAP40 | 2.6 |
| 6YEJ | Cryo-EM structure of the Full-length disease type human Huntingtin | 18.2 |
| 7DXJ | Human 46QHuntingtin-HAP40 complex structure | 3.6 |
| 7DXK | Human 128QHuntingtin-HAP40 complex structure | 4.1 |
| 8SAH | Huntingtin C-HEAT domain in complex with HAP40 | 3.2 |
| 8VLX | HTT in complex with HAP40 and a small molecule. | 2.6 |
| 8W15 | HTT in complex with HAP40 in the apo state. | 2.72 |
| 8YAE | Cryo-ET structure of huntingtin actin complex | 10.08 |
| 8YAO | Cryo-ET structure of huntingtin actin dimer complex | 20.8 |
| 9PMW | Structure of HTTQ23-HAP40 complex bound to macrocycles HHL1, HD4 and HL2 | 2.1 |
| 9PN0 | Structure of HTTQ23-HAP40 complex bound to macrocycles HHD3, HD4 and HL2 | 2.3 |
| 9YR6 | Structure of HTTQ23-HAP40 complex bound to a small molecule ligand | 2.3 |
Predicted Structures
AlphaFold: No database mapping found for P42858 in AlphaFold structural database. However, AlphaFold predictions are available directly from AlphaFold Database (https://alphafolddb.esmatlas.com) using UniProt ID P42858.
Cross-species orthologs
| Organism | Gene ID | Symbol |
|---|---|---|
| Mouse (Mus musculus) | ENSMUSG00000029104 | Htt |
| Rat (Rattus norvegicus) | ENSRNOG00000011073 | Htt |
| Zebrafish (Danio rerio) | ENSDARG00000052866 | htt |
| Fruit fly (Drosophila melanogaster) | FBGN0027655 | htt |
| Worm (C. elegans) | WBGENE00009027 | htt-1 |
| Yeast (S. cerevisiae) | none | none |
Based on the biobtree data I’ve gathered, here’s the summary:
Clinical variants & AI predictions
ClinVar Overview
| Classification | Count | Notes |
|---|---|---|
| Total variants | ~780 | Mapped to HTT gene |
| Uncertain significance (VUS) | ~650+ | Majority of HTT variants |
| Likely benign | ~50+ | Includes synonymous & intronic variants |
| Benign | ~15 | Multiple submitters with consensus |
| Likely pathogenic | 2 | c.8110-1G>A, c.1403-1G>C (splicing) |
| Pathogenic | 3 | c.2710C>T (p.Gln904Ter), c.2085del (p.Gly697fs), c.5821_5833del (p.Ser1941fs) |
| Conflicting | ~3+ | c.7010A>G, c.54GCA[22], c.1206059 |
TOP 30 Pathogenic/Likely Pathogenic Variants
| Variant ID | HGVS | Condition/Classification |
|---|---|---|
| 1393012 | c.2710C>T | p.Gln904Ter (Pathogenic) |
| 1464611 | c.2085del | p.Gly697fs (Pathogenic) |
| 1494729 | c.5821_5833del | p.Ser1941fs (Pathogenic) |
| 1357041 | c.8110-1G>A | Splice variant (Likely pathogenic) |
| 1375718 | c.1403-1G>C | Splice variant (Likely pathogenic) |
| — | — | Only 5 high-confidence P/LP variants in HTT ClinVar |
AlphaMissense Pathogenicity Predictions
| Metric | Count |
|---|---|
| Total predictions | ~3,000+ |
| Likely pathogenic | ~1,500+ |
TOP 30 Likely Pathogenic AlphaMissense Predictions
| Position | Amino Acid Change | AM Pathogenicity Score |
|---|---|---|
| 2 | A2T | 0.773 |
| 2 | A2P | 0.735 |
| 5 | E5K | 0.947 |
| 5 | E5V | 0.945 |
| 7 | L7P | 0.997 |
| 7 | L7Q | 0.975 |
| 10 | A10P | 0.985 |
| 10 | A10D | 0.981 |
| 11 | F11L | 0.988 |
| 11 | F11C | 0.944 |
| 13 | S13P | 0.955 |
| 14 | L14P | 0.987 |
| 14 | L14H | 0.977 |
| 17 | F17S | 0.823 |
| 96 | K96N | 0.974 |
| 102 | C103R | 0.998 |
| 103 | C103W | 0.994 |
| 106 | I106K | 0.999 |
| 107 | C107R | 0.982 |
| 121 | F121L | 0.994 |
| 125 | L125P | 0.997 |
| 126 | G126R | 0.992 |
| 128 | A128P | 0.998 |
| 128 | A128D | 0.999 |
| 130 | E130K | 0.994 |
| 131 | L131P | 0.998 |
| 132 | F132S | 0.995 |
| 133 | L133P | 0.998 |
| 134 | L134P | 0.989 |
| 135 | C135R | 0.995 |
Key Observations:
- HTT has very few definitive pathogenic variants (n=3) in ClinVar, reflecting its dominant gain-of-function disease mechanism via CAG repeat expansion
- Most coding variants are classified as VUS, indicating insufficient evidence of pathogenicity
- AlphaMissense predicts high pathogenicity for early N-terminal residues (positions 2-17), consistent with functional importance
- Splice site variants are rare but documented as likely pathogenic when affecting canonical splice sites (±1-2 positions)
Pathways & Gene Ontology
Biological Pathways
Reactome pathways: 1 total
| ID | Pathway Name |
|---|---|
| R-HSA-9022692 | Regulation of MECP2 expression and activity |
MSigDB gene sets: 479 total
Major categories in HTT membership:
- Gene Ontology (C5:GO): ~150+ gene-set pathways
- Human Phenotype (C5:HP): ~130+ phenotype terms (neurodegeneration, motor dysfunction, cognitive impairment, chorea, ataxia, dystonia, spasticity, gait disturbance, muscle weakness, etc.)
- Canonical Pathways (C2:CP): ~5 pathways including KEGG Huntington’s Disease and ERK pathway in Huntington’s disease
- Transcription factor targets (C3:TFT): ~40+ TF targets
- miRNA targets (C3:MIR): ~30+ miRNA targets
- Other: chromosome locations, gene neighborhoods, oncogenic/immunologic signatures
Gene Ontology Annotations
Total GO terms: 48
Biological Process (19 terms)
| GO ID | Term |
|---|---|
| GO:0006915 | Apoptotic process |
| GO:0043065 | Positive regulation of apoptotic process |
| GO:0007030 | Golgi organization |
| GO:0006890 | Retrograde vesicle-mediated transport, Golgi to ER |
| GO:0048489 | Synaptic vesicle transport |
| GO:0047496 | Vesicle transport along microtubule |
| GO:0007417 | Central nervous system development |
| GO:0022008 | Neurogenesis |
| GO:0042297 | Vocal learning |
| GO:0031648 | Protein destabilization |
| GO:0000132 | Establishment of mitotic spindle orientation |
| GO:0045724 | Positive regulation of cilium assembly |
| GO:1901526 | Positive regulation of mitophagy |
| GO:1904504 | Positive regulation of lipophagy |
| GO:1905337 | Positive regulation of aggrephagy |
| GO:1905289 | Regulation of CAMKK-AMPK signaling cascade |
| GO:1905291 | Positive regulation of CAMKK-AMPK signaling cascade |
| GO:2001237 | Negative regulation of extrinsic apoptotic signaling pathway |
| GO:0031587 | Obsolete: positive regulation of IP3-sensitive Ca²⁺ channel activity |
Molecular Function (10 terms)
| GO ID | Term |
|---|---|
| GO:0042802 | Identical protein binding |
| GO:0044325 | Transmembrane transporter binding |
| GO:0019900 | Kinase binding |
| GO:0034452 | Dynactin binding |
| GO:0045505 | Dynein intermediate chain binding |
| GO:0048487 | Beta-tubulin binding |
| GO:0002039 | p53 binding |
| GO:0004721 | Phosphoprotein phosphatase activity |
| GO:0005522 | Profilin binding |
| GO:0031072 | Heat shock protein binding |
Cellular Component (19 terms)
| GO ID | Term |
|---|---|
| GO:0032991 | Protein-containing complex |
| GO:0005634 | Nucleus |
| GO:0005654 | Nucleoplasm |
| GO:0005737 | Cytoplasm |
| GO:0005829 | Cytosol |
| GO:0048471 | Perinuclear region of cytoplasm |
| GO:0030424 | Axon |
| GO:0030425 | Dendrite |
| GO:0099523 | Presynaptic cytosol |
| GO:0099524 | Postsynaptic cytosol |
| GO:0005794 | Golgi apparatus |
| GO:0005783 | Endoplasmic reticulum |
| GO:0005769 | Early endosome |
| GO:0005770 | Late endosome |
| GO:0031410 | Cytoplasmic vesicle |
| GO:0030659 | Cytoplasmic vesicle membrane |
| GO:0005776 | Autophagosome |
| GO:0005814 | Centriole |
| GO:0016234 | Inclusion body |
I’m unable to retrieve comprehensive protein-protein interaction data for human HTT (P51112) from the available biobtree databases. The search and mapping tools show limited connectivity to mainstream interaction databases (STRING, BioGRID, IntAct) in this particular database instance.
The available data indicates:
- ESM2 structural similarity: 51 results, but predominantly orthologous HTT proteins from other species (similarity scores >0.98, indicating these are orthologs rather than distinct interaction partners)
- Diamond sequence similarity: Only 4 results (other HTT orthologs from different organisms)
- STRING/BioGRID/IntAct: No mapped interactions available in this biobtree instance
For a comprehensive analysis of human HTT protein interactions and networks, you would need to:
- Query STRING Database directly (string-db.org) for the human huntingtin protein
- Access BioGRID, IntAct, or MINT for curated binary interaction data
- Use AlphaFold Database or Foldseek for structural similarity comparisons
- Consult PPI prediction tools for computational interaction predictions
Would you like me to help you access these resources directly or try alternative biobtree mappings?
Transcription factor regulatory data
HTT is not a transcription factor. Huntingtin is a structural/regulatory protein involved in protein trafficking, apoptosis regulation, and neurogenesis. No DNA-binding motifs are recorded in JASPAR or other transcription factor databases.
Upstream regulators of HTT
18 transcription factors regulate HTT gene expression (CollecTRI database):
| TF Name | Regulation Type | Confidence | Evidence Sources |
|---|---|---|---|
| SP1 | Activates | High | ExTRI, TRRUST, DoRothEA_A |
| TP53 | Unknown | High | DoRothEA_A, ExTRI, GEREDB, NTNU Curated |
| TGIF1 | — | High | ExTRI |
| ZNF395 | — | High | ExTRI, NTNU Curated |
| YY1 | Represses | High | ExTRI, NTNU Curated |
| MBD2 | — | High | ExTRI |
| KDM5C | — | High | ExTRI |
| CREB1 | Activates | Low | ExTRI, NTNU Curated |
| REST | Represses | Low | ExTRI, NTNU Curated |
| TFAP2A | Unknown | — | DoRothEA_A, TRRUST |
| PAX6 | — | Low | ExTRI |
| VSX2 | — | Low | ExTRI |
| VSX1 | — | Low | ExTRI |
| POU4F2 | — | Low | ExTRI |
| NR1H3 | — | Low | ExTRI |
| AP1 | — | Low | ExTRI |
| NONO | — | Low | ExTRI |
| NFKB | — | Low | ExTRI |
Evidence types: ExTRI (computational transcriptional interaction prediction), TRRUST & GEREDB (literature-curated), DoRothEA_A (multi-source prediction), NTNU Curated (manual curation). No direct ChIP-seq evidence is currently catalogued for most HTT-TF relationships.
Drug & pharmacology data
HTT Status as a Drug Target: HTT (huntingtin) functions primarily as a genetic disease target rather than a traditional pharmacological target. Mutations (CAG trinucleotide repeats) in the HTT gene directly cause Huntington disease. Current therapeutics do not directly inhibit the huntingtin protein through conventional binding; instead, approved and investigational approaches target HTT through:
- RNA-based therapies: Antisense oligonucleotides reducing HTT mRNA/protein levels
- Gene therapies: Viral or non-viral vectors delivering HTT-targeting constructs
- Symptomatic agents: Drugs managing involuntary movements downstream of HTT pathology
Targeting Molecules: The 18,767 molecules in ChEMBL mapped to CHEMBL5514 (huntingtin target) represent primarily high-throughput screening compounds with minimal therapeutic development—the vast majority are phase 0 research compounds.
FDA-Approved/Advanced HTT-Targeting Drugs (Top 5):
- Tetrabenazine (CHEMBL117785) – Phase 4 approved (2008); VMAT2 inhibitor; symptomatic (chorea management), not HTT-lowering
- Deutetrabenazine – Phase 4 approved; improved tetrabenazine profile
- RO7234292/Tominersen (ISIS 443139) – Phase 2/3; antisense oligonucleotide directly targeting HTT mRNA; NCT05686551 (GENERATION HD2)
- WVE-120101 & WVE-120102 – Phase 1/2; allele-specific antisense oligonucleotides (terminated trials)
- AMT-130 – Phase 1/2; in vivo gene therapy delivering micro-RNA targeting mutant HTT; NCT04120493, NCT05243017
Clinical Trials (TOP 20 Active/Recent for HTT-Targeted or Huntington’s Therapies):
| Trial ID | Phase | Status | Intervention |
|---|---|---|---|
| NCT05686551 | Phase 2 | ACTIVE | Tominersen (RO7234292) – HTT-lowering antisense |
| NCT06826612 | Phase 1/2 | RECRUITING | SPK-10001 gene therapy – HTT-targeting |
| NCT07339514 | Phase 1/2 | ACTIVE | ER2001 gene therapy – early manifest HD |
| NCT04120493 | Phase 1/2 | ACTIVE | AMT-130 gene therapy |
| NCT05243017 | Phase 1/2 | ACTIVE | AMT-130 (European cohort) |
| NCT05541627 | Phase 1/2 | ACTIVE | AB-1001 striatal administration |
| NCT06254482 | Phase 2 | ACTIVE | Votoplam extension study |
| NCT06873334 | Phase 2/3 | RECRUITING | SKY-0515 pharmacodynamics study |
| NCT07378644 | Phase 2/3 | RECRUITING | SKY-0515 efficacy/safety |
| NCT07326709 | Phase 3 | NOT_YET_RECRUITING | Votoplam efficacy/safety |
| NCT04713982 | Phase 2/3 | ACTIVE | Deutetrabenazine functional effects |
| NCT04556656 | Phase 3 | COMPLETED | Pridopidine (PROOF-HD) |
| NCT03842969 | Phase 3 | COMPLETED | RO7234292/RG6042 extension |
| NCT05475483 | Phase 2 | COMPLETED | SOM3355 chorea treatment |
| NCT05358717 | Phase 2 | COMPLETED | PTC518 safety/efficacy |
| NCT04478734 | Phase 2 | RECRUITING | Thiamine + biotin combination |
| NCT05509153 | Phase 2 | RECRUITING | N-acetyl cysteine (premanifest) |
| NCT05107128 | Phase 2 | COMPLETED | SAGE-718 cognitive effects |
| NCT04514367 | Phase 2 | COMPLETED | ANX005 complement inhibitor |
| NCT06853743 | Phase 2 | RECRUITING | Nicotinamide riboside (NAD-HD) |
Pharmacogenomics: HTT-related pharmacogenomics centers on genetic risk rather than drug-response variation:
- Disease Causation: CAG repeat expansions (>39 repeats) in HTT cause Huntington disease in autosomal-dominant pattern
- Premanifest Identification: Genetic testing identifies at-risk carriers decades before symptom onset; informs enrollment in preventive trials
- Gene-Specific Drug Response: No established HTT variant-based dosing guidelines. Response to tetrabenazine varies clinically but not based on HTT genotype
- Therapeutic Targeting: Emerging therapies (antisense, gene therapy) reduce total or mutant HTT levels; efficacy monitored by CSF HTT levels, not genetic polymorphisms
Key Note: Unlike typical pharmacology, HTT genetics is the disease mechanism itself, not a modifier of drug response to other therapeutics.
Expression profiles
Tissue Expression (Bgee GTEx-derived)
HTT shows ubiquitous expression across human tissues with notably high expression in nervous system, sensory, and metabolic tissues. Expression scores range 0–100 with average 80.9.
| Rank | Tissue | Expression Score | Quality |
|---|---|---|---|
| 1 | Sural nerve | 95.04 | Gold |
| 2 | Body of pancreas | 92.59 | Gold |
| 3 | Colonic epithelium | 91.95 | Gold |
| 4 | Granulocyte | 91.03 | Gold |
| 5 | Adrenal tissue | 90.74 | Gold |
| 6 | Stromal cell of endometrium | 90.59 | Gold |
| 7 | Calcaneal tendon | 90.29 | Gold |
| 8 | Pancreas | 90.21 | Gold |
| 9 | Ventricular zone | 90.06 | Gold |
| 10 | Hindlimb stylopod muscle | 89.58 | Gold |
| 11 | Cortical plate | 89.53 | Gold |
| 12 | Skin of leg | 89.42 | Gold |
| 13 | Skin of abdomen | 88.69 | Gold |
| 14 | Right frontal lobe | 88.26 | Gold |
| 15 | Islet of Langerhans | 88.10 | Gold |
| 16 | Prefrontal cortex | 87.99 | Gold |
| 17 | Right cerebellar hemisphere | 87.86 | Gold |
| 18 | Gastrocnemius | 87.85 | Gold |
| 19 | Muscle of leg | 87.73 | Gold |
| 20 | Cerebellar hemisphere | 87.54 | Gold |
| 21 | Cerebellar cortex | 87.52 | Gold |
| 22 | Upper lobe of left lung | 87.21 | Gold |
| 23 | Anterior cingulate cortex | 87.07 | Gold |
| 24 | Ganglionic eminence | 87.02 | Gold |
| 25 | Cingulate cortex | 86.99 | Gold |
| 26 | Leukocyte | 86.96 | Gold |
| 27 | Bone marrow cell | 86.94 | Gold |
| 28 | Left lobe of thyroid gland | 86.90 | Gold |
| 29 | Monocyte | 86.80 | Gold |
| 30 | Right lobe of thyroid gland | 86.75 | Gold |
Key patterns:
- Nervous system enrichment: Highest expression in peripheral nerves (sural nerve, 95.04), developmental brain regions (ventricular zone, cortical plate), and multiple cortical areas
- Immune cell expression: Strong in granulocytes (91.03), monocytes (86.80), leukocytes (86.96), bone marrow cells (86.94)
- Metabolic tissues: High in pancreatic tissues (body 92.59, islets 88.10), adrenal tissue (90.74)
- Skeletal/connective tissue: Elevated in tendon, muscle (gastrocnemius, leg muscle ~87.7–87.9), skin
Cell Type Expression (Bgee Celltypes)
Cell type-specific expression within tissues:
| Cell Type | Expression Score | Context |
|---|---|---|
| Granulocyte | 91.03 | Circulating immune cell |
| Leukocyte | 86.96 | General immune marker |
| Monocyte | 86.80 | Circulating myeloid cell |
| Bone marrow cell | 86.94 | Hematopoietic progenitor context |
| Stromal cell of endometrium | 90.59 | Connective tissue in reproductive organ |
Pattern: HTT shows strong expression in immune and hematopoietic cell types, consistent with systemic distribution.
Single-Cell Expression (SCXA)
- 1 marker experiment with 141 distinct cell clusters
- Max mean expression: 9.89 (log-scale or normalized units)
- Average mean expression: 0.15 across all clusters
- Expression heterogeneity: Wide variation (0.15–9.89) suggests strong cell-type-specific regulation despite ubiquitous presence
Notable pattern: HTT has marked single-cell heterogeneity, indicating distinct populations with high vs. low expression despite ubiquitous tissue-level presence.
Disease associations
Mendelian / Monogenic Diseases
| Disease | Disease IDs | Inheritance Pattern | Evidence Level |
|---|---|---|---|
| Huntington disease | OMIM:143100, Mondo:0007739, Orphanet:399 | Autosomal dominant | Definitive, Strong, Supportive |
| Lopes-Maciel-Rodan syndrome | OMIM:617435, Mondo:0054573 | Autosomal recessive | Limited, Strong |
| Juvenile Huntington disease | Orphanet:248111 | Autosomal dominant | Supportive |
Phenotype Associations (HPO Terms)
Top 30 clinical phenotypes associated with HTT mutations:
- HP:0000006 – Autosomal dominant inheritance
- HP:0000007 – Autosomal recessive inheritance
- HP:0000496 – Abnormality of eye movement
- HP:0000545 – Myopia
- HP:0000708 – Atypical behavior
- HP:0000713 – Agitation
- HP:0000716 – Depression
- HP:0000718 – Aggressive behavior
- HP:0000722 – Compulsive behaviors
- HP:0000726 – Dementia
- HP:0000733 – Motor stereotypy
- HP:0000734 – Disinhibition
- HP:0000737 – Irritability
- HP:0000738 – Hallucinations
- HP:0000739 – Anxiety
- HP:0000741 – Apathy
- HP:0000746 – Delusion
- HP:0000751 – Personality changes
- HP:0000752 – Hyperactivity
- HP:0001250 – Seizure
- HP:0001251 – Ataxia
- HP:0001257 – Spasticity
- HP:0001262 – Excessive daytime somnolence
- HP:0001263 – Global developmental delay
- HP:0001268 – Mental deterioration
- HP:0001272 – Cerebellar atrophy
- HP:0001276 – Hypertonia
- HP:0001288 – Gait disturbance
- HP:0001332 – Dystonia
- HP:0001336 – Myoclonus
Total: 99 HPO phenotypes mapped
Complex Disease / GWAS Associations
Top 30 GWAS associations (26 mapped to HTT):
| Trait/Disease | Gene(s) | p-value |
|---|---|---|
| Monobrow | HTT | 1.0e-22 |
| Mean platelet volume | HTT | 2.0e-24, 3.0e-10, 2.0e-20 |
| White blood cell count | HTT | 8.0e-14 |
| Neutrophil count | HTT | 5.0e-13 |
| Estimated glomerular filtration rate | HTT | 1.0e-10 |
| Frailty index | HTT | 3.0e-10 |
| Platelet count | HTT | 4.0e-09 |
| Worry/vulnerability (neuroticism factor) | HTT | 1.0e-09 |
| Walking pace | HTT | 1.0e-09 |
| Type 2 diabetes | HTT | 1.0e-09 |
| Household income (MTAG) | HTT | 1.0e-09, 2.0e-09 |
| Worry too long after embarrassing experience | HTT | 9.0e-10 |
| Parental lifespan | HTT | 6.0e-09 |
| Aging traits (healthspan, parental lifespan, longevity) | HTT | 3.0e-08 |
| Hip circumference adjusted for BMI | HTT | 5.0e-08 |
| Serum sulfate level | HTT | 3.0e-07 |
| Itch intensity from mosquito bite adjusted by bite size | HTT | 8.0e-06 |
| Myocardial infarction | HTT, HTT-AS | 5.0e-06 |