MAPT Gene Complete Identifier and Functional Mapping Reference
Provide a comprehensive cross-database identifier and functional mapping reference for human MAPT — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene MAPT, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene MAPT, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene MAPT protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene MAPT protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene MAPT, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene MAPT, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene MAPT, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene MAPT protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene MAPT, summarize transcription factor regulatory data. If MAPT is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate MAPT — names with evidence type (ChIP-seq / predicted / experimentally validated) If MAPT is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene MAPT protein as a drug target, summarize pharmacology data. If MAPT is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If MAPT is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene MAPT, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene MAPT, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in MAPT: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations
Executive summary
MAPT encodes microtubule-associated protein tau, a neuronal cytoskeletal protein that is central to the pathogenesis of Alzheimer’s disease and a broad class of neurodegenerative disorders collectively termed tauopathies. Its highest expression is in cortical brain regions (score 99.52 in cortical plate), consistent with its prominent role in neuronal function. The gene spans chromosome 17q21 and produces at least 52 transcripts, reflecting extraordinary isoform complexity. Pathogenic MAPT variants — predominantly causing frontotemporal dementia with parkinsonism (FTDP-17) — number approximately 30 in ClinVar, and GWAS links the locus to Parkinson’s disease (p = 1×10⁻⁶⁸) and progressive supranuclear palsy (p = 2×10⁻¹²²). Structurally, tau is an intrinsically disordered protein (AlphaFold global pLDDT 49.56, no residues with pLDDT > 90), yet 274 experimental PDB structures capture its disease-relevant filament conformations across multiple tauopathies. Its interaction network (~7,283 documented interactions) is dominated by key phosphorylation kinases CDK5 and GSK3B and major neurodegeneration proteins including APP, APOE, and SNCA.
Gene identifiers
- HGNC ID: HGNC:6893
- Approved symbol: MAPT
- Ensembl gene ID: ENSG00000186868
- NCBI Entrez Gene ID: 4137
- OMIM gene/locus ID: 157140
- Genomic location (GRCh38):
- Chromosome: 17
- Start position: 45,894,358 bp
- End position: 46,028,334 bp
- Strand: +
Transcript identifiers
Ensembl Transcripts (52 total)
| ENST ID | Biotype |
|---|---|
| ENST00000262410 | protein_coding |
| ENST00000334239 | protein_coding |
| ENST00000344290 | protein_coding |
| ENST00000351559 | protein_coding |
| ENST00000415613 | protein_coding |
| ENST00000420682 | protein_coding |
| ENST00000431008 | protein_coding |
| ENST00000446361 | protein_coding |
| ENST00000535772 | protein_coding |
| ENST00000570299 | protein_coding_CDS_not_defined |
| ENST00000571311 | nonsense_mediated_decay |
| ENST00000571987 | protein_coding |
| ENST00000572440 | retained_intron |
| ENST00000574436 | protein_coding |
| ENST00000576518 | retained_intron |
| ENST00000577017 | protein_coding_CDS_not_defined |
| ENST00000680542 | nonsense_mediated_decay |
| ENST00000680674 | protein_coding |
| ENST00000703922 | nonsense_mediated_decay |
| ENST00000703923 | nonsense_mediated_decay |
| ENST00000703924 | nonsense_mediated_decay |
| ENST00000703974 | retained_intron |
| ENST00000703975 | retained_intron |
| ENST00000703976 | retained_intron |
| ENST00000703977 | protein_coding_CDS_not_defined |
| ENST00000703978 | nonsense_mediated_decay |
| ENST00000703979 | protein_coding_CDS_not_defined |
| ENST00000703980 | protein_coding_CDS_not_defined |
| ENST00000703981 | protein_coding_CDS_not_defined |
| ENST00000884777 | protein_coding |
| ENST00000884778 | protein_coding |
| ENST00000884779 | protein_coding |
| ENST00000884780 | protein_coding |
| ENST00000884781 | protein_coding |
| ENST00000884782 | protein_coding |
| ENST00000884783 | protein_coding |
| ENST00000884784 | protein_coding |
| ENST00000884785 | protein_coding |
| ENST00000884786 | protein_coding |
| ENST00000884787 | protein_coding |
| ENST00000884788 | protein_coding |
| ENST00000972111 | protein_coding |
| ENST00000972112 | protein_coding |
| ENST00000972113 | protein_coding |
| ENST00000972114 | protein_coding |
| ENST00000972115 | protein_coding |
| ENST00000972116 | protein_coding |
| ENST00000972117 | protein_coding |
| ENST00000972118 | protein_coding |
| ENST00000972119 | protein_coding |
| ENST00000972120 | protein_coding |
RefSeq mRNA Accessions (57 total)
| NM ID | MANE Select |
|---|---|
| NM_001038609 | |
| NM_001123066 | |
| NM_001123067 | |
| NM_001203251 | |
| NM_001203252 | |
| NM_001276088 | |
| NM_001276089 | |
| NM_001276090 | |
| NM_001276091 | |
| NM_001285454 | |
| NM_001285455 | |
| NM_001285456 | |
| NM_001300634 | |
| NM_001300635 | |
| NM_001300636 | |
| NM_001300637 | |
| NM_001300638 | |
| NM_001300639 | |
| NM_001316620 | |
| NM_001377265 | ✓ |
| NM_001377266 | |
| NM_001377267 | |
| NM_001377268 | |
| NM_001382126 | |
| NM_001403975 | |
| NM_001403976 | |
| NM_001403977 | |
| NM_001403978 | |
| NM_001403979 | |
| NM_001403980 | |
| NM_001403981 | |
| NM_001403982 | |
| NM_001403983 | |
| NM_001403984 | |
| NM_001403987 | |
| NM_001403990 | |
| NM_001403992 | |
| NM_001403994 | |
| NM_001403996 | |
| NM_001403998 | |
| NM_001403999 | |
| NM_001404002 | |
| NM_001404004 | |
| NM_001404005 | |
| NM_001404006 | |
| NM_001404010 | |
| NM_001404011 | |
| NM_001404012 | |
| NM_001404014 | |
| NM_001404015 | |
| NM_001404016 | |
| NM_005910 | |
| NM_010838 | |
| NM_016834 | |
| NM_016835 | |
| NM_016841 | |
| NM_017212 | |
| NM_143318 | |
| NM_170345 |
CCDS IDs (10 total)
| CCDS ID |
|---|
| CCDS11499 |
| CCDS11500 |
| CCDS11501 |
| CCDS11502 |
| CCDS45715 |
| CCDS45716 |
| CCDS56033 |
| CCDS92345 |
| CCDS92346 |
| CCDS92347 |
MANE Select Transcript: ENST00000262410 (NM_001377265)
Exons (13 total)
| Exon ID | Start | End | Strand | Chromosome |
|---|---|---|---|---|
| ENSE00003900976 | 45894554 | 45894686 | + | 17 |
| ENSE00003549966 | 45962321 | 45962470 | + | 17 |
| ENSE00003658894 | 45978375 | 45978440 | + | 17 |
| ENSE00003682122 | 45991460 | 45991586 | + | 17 |
| ENSE00002410720 | 45989878 | 45990075 | + | 17 |
| ENSE00003611335 | 45987040 | 45987095 | + | 17 |
| ENSE00003634058 | 45996399 | 45996664 | + | 17 |
| ENSE00003570976 | 46018618 | 46018730 | + | 17 |
| ENSE00002672395 | 45982866 | 45983930 | + | 17 |
| ENSE00002409189 | 46010310 | 46010402 | + | 17 |
| ENSE00002405491 | 46014243 | 46014324 | + | 17 |
| ENSE00001291240 | 46023956 | 46028334 | + | 17 |
Protein identifiers
UniProt accessions
- P10636 (canonical, reviewed) — Microtubule-associated protein tau
- A0A7I2PJZ2 (unreviewed)
- A0A7I2PLE3 (unreviewed)
- A0A7P0T936 (unreviewed)
- I3L2Z2 (unreviewed)
RefSeq protein (NP_ accessions)
NP_001033698, NP_001116538, NP_001116539, NP_001190180, NP_001190181, NP_001263017, NP_001263018, NP_001263019, NP_001263020, NP_001272383, NP_001272384, NP_001272385, NP_001287563, NP_001287564, NP_001287565, NP_001287566, NP_001287567, NP_001287568, NP_001303549, NP_001364194, NP_001364195, NP_001364196, NP_001364197, NP_001369032, NP_001390904, NP_001390905, NP_001390906, NP_001390907, NP_001390908, NP_001390909, NP_001390910, NP_001390911, NP_001390912, NP_001390913, NP_001390916, NP_001390919, NP_001390921, NP_001390923, NP_001390925, NP_001390927, NP_001390928
Protein domains and families
| ID | Name | Type |
|---|---|---|
| IPR001084 | Microtubule associated protein, tubulin-binding repeat | Repeat |
| IPR002955 | Microtubule-associated protein Tau | Family |
| IPR027324 | Microtubule associated protein MAP2/MAP4/Tau | Family |
| PF00418 | Protein of unknown function (Pfam domain) | Pfam domain |
Antibody availability
Limited records in public biobtree antibody databases. However, tau protein (P10636) has extensive cross-references to PDB structures (274 entries) and is well-characterized. Commercial antibody resources available through: Human Protein Atlas (ENSG00000186868), BindingDB (675 entries), and antibody vendor catalogs targeting epitopes across the protein sequence.
Structure
Experimental Structures
PDB Entries: 274 total
By Experimental Method:
| Method | Count | Resolution Range |
|---|---|---|
| Electron Microscopy (Cryo-EM) | ~220 | 1.9 – 4.3 Å |
| X-Ray Diffraction | ~45 | 1.0 – 3.1 Å |
| Solution NMR | ~5 | Not applicable |
| Electron Crystallography | ~3 | 1.1 – 2.0 Å |
| Solid-State NMR | ~3 | Not applicable |
Key Structure Types:
- Disease-associated filaments: Paired helical filaments (PHF), straight filaments (SF), and twisted filaments from Alzheimer’s disease, chronic traumatic encephalopathy (CTE), progressive supranuclear palsy (PSP), corticobasal degeneration (CBD), and other tauopathies
- Antibody-Tau complexes: Fab fragments in complex with tau peptides (e.g., AT8, PT3, BIIB092, CBTAU series)
- Protein binding complexes: 14-3-3 sigma with phosphorylated tau peptides; tau-microtubule complexes; Fyn kinase interaction
- Tau peptide structures: Individual tau repeat fragments, amyloid-forming VQIVYK segment
- In vitro assembled filaments: Various morphologies under different buffer conditions (PBS, heparin, ions, etc.)
Notable PDB entries: 5O3L, 6HRE (AD PHF); 6TJO, 6TJX (CBD); 7P65 (PSP); 8Q8R, 8Q8S (AD PHF/THF variants); 22JY (highest resolution cryo-EM at 2.2 Å)
Predicted Structures
AlphaFold Model:
- Model ID: P10636
- Global pLDDT: 49.56 (low confidence, consistent with intrinsically disordered protein)
- Fraction with pLDDT > 90: 0.00 (no highly confident regions)
- Protein Length: 758 amino acids (canonical isoform)
Note: Low confidence reflects tau’s nature as an intrinsically disordered protein with limited stable secondary structure in its native state.
Cross-species orthologs
| Organism | Gene ID | Symbol |
|---|---|---|
| Mouse (Mus musculus) | ENSMUSG00000018411 | Mapt |
| Rat (Rattus norvegicus) | ENSRNOG00000005133 | Mapt |
| Zebrafish (Danio rerio) | ENSDARG00000087616, ENSDARG00000089314 | maptb, mapta |
| Fruit fly (Drosophila melanogaster) | none | none |
| Worm (C. elegans) | none | none |
| Yeast (S. cerevisiae) | none | none |
Clinical variants & AI predictions
ClinVar Summary
| Classification | Count |
|---|---|
| Pathogenic | ~30 |
| Likely Pathogenic | ~5 |
| Uncertain Significance | ~100 |
| Likely Benign | ~220 |
| Benign | ~246 |
| Total | ~601 |
Top 30 Pathogenic/Likely Pathogenic Clinical Variants
| Variant ID | HGVS Notation | Classification | Associated Condition |
|---|---|---|---|
| 14245 | NM_001377265.1:c.2078C>T (p.Pro693Leu) | Pathogenic | FTDP-17 |
| 14246 | NM_001377265.1:c.1991G>T (p.Gly664Val) | Pathogenic | FTDP-17 |
| 14247 | NM_001377265.1:c.2392C>T (p.Arg798Trp) | Pathogenic | FTDP-17 |
| 14248 | NM_001377265.1:c.2091+14C>T | Pathogenic | FTDP-17 |
| 14251 | NM_001377265.1:c.2091+1G>A | Pathogenic | FTDP-17 (splice donor) |
| 14252 | NM_001377265.1:c.2185G>A (p.Val729Met) | Pathogenic | FTDP-17 |
| 14253 | NM_001377265.1:c.2013T>G (p.Asn671Lys) | Pathogenic | FTDP-17 |
| 14254 | NM_001377265.1:c.2090G>A (p.Ser697Asn) | Pathogenic | FTDP-17 |
| 14255 | NM_001377265.1:c.2341G>A (p.Gly781Arg) | Pathogenic/Likely | FTDP-17 |
| 14256 | NM_001377265.1:c.2077C>T (p.Pro693Ser) | Pathogenic | FTDP-17 |
| 14257 | NM_001377265.1:c.2064T>C (p.Asn688=) | Pathogenic | FTDP-17 |
| 14258 | NM_001377265.1:c.2201A>T (p.Glu734Val) | Pathogenic | FTDP-17 |
| 14259 | NM_001377265.1:c.1946A>C (p.Lys649Thr) | Pathogenic | FTDP-17 |
| 14260 | NM_001377265.1:c.2282A>T (p.Lys761Ile) | Pathogenic | FTDP-17 |
| 14262 | NM_001377265.1:c.2135C>T (p.Ser712Phe) | Pathogenic | FTDP-17 |
| 14263 | NM_001377265.1:c.14G>T (p.Arg5Leu) | Pathogenic | FTDP-17 |
| 14266 | NM_001377265.1:c.1972C>G (p.Leu658Val) | Pathogenic | FTDP-17 |
| 14267 | NM_001377265.1:c.2231C>T (p.Ser744Leu) | Pathogenic | FTDP-17 |
| 14268 | NM_001377265.1:c.2126A>T (p.Lys709Met) | Pathogenic | FTDP-17 |
| 14269 | NM_001377265.1:c.2084G>T (p.Gly695Val) | Pathogenic | FTDP-17 |
| 2049744 | NM_001377265.1:c.2091+16C>G | Likely Pathogenic | FTDP-17 |
Splice Effect Predictions (SpliceAI)
- Total predictions: ~3,502
- Donor gain events: ~2,800
- Donor loss events: ~400
- Acceptor gain/loss events: ~300
| Position | Variant | Gene | Effect Type | Delta Score |
|---|---|---|---|---|
| 17:45894684 | CAGGT:C | MAPT | Donor loss | 1.00 |
| 17:45894685 | AGGTA:A | MAPT | Donor loss | 1.00 |
| 17:45894686 | GGTA:G | MAPT | Donor loss | 1.00 |
| 17:45894687 | G:GA | MAPT | Donor loss | 1.00 |
| 17:45894688 | T:A | MAPT | Donor loss | 0.99 |
| 17:45894683 | TCAG:T | MAPT | Donor gain | 0.96 |
| 17:45894685 | AG:A | MAPT | Donor gain | 0.95 |
| 17:45894686 | GG:G | MAPT | Donor gain | 0.95 |
| 17:45894687 | G:GG | MAPT | Donor gain | 0.99 |
| 17:45895712 | G:GT | MAPT | Donor gain | 0.59 |
AlphaMissense Pathogenicity Predictions (P10636)
- Total variants assessed: ~700
- Likely pathogenic predictions: ~144
- Pathogenicity range: 0.042–0.997
| Position | Variant | Protein Change | am_pathogenicity | am_class |
|---|---|---|---|---|
| 17:45996437 | A:C | S516R | 0.997 | Likely pathogenic |
| 17:45996425 | A:C | S512R | 0.994 | Likely pathogenic |
| 17:45996436 | C:A | S515R | 0.991 | Likely pathogenic |
| 17:45996428 | G:C | G513R | 0.986 | Likely pathogenic |
| 17:45991542 | T:C | I488T | 0.988 | Likely pathogenic |
| 17:45991540 | G:C | R487S | 0.981 | Likely pathogenic |
| 17:45996429 | G:A | G513D | 0.990 | Likely pathogenic |
| 17:45991552 | A:C | K491N | 0.966 | Likely pathogenic |
| 17:45991552 | A:T | K491N | 0.966 | Likely pathogenic |
| 17:45996438 | G:A | S516N | 0.953 | Likely pathogenic |
| 17:45996426 | G:T | S512I | 0.954 | Likely pathogenic |
| 17:45991539 | G:T | R487M | 0.951 | Likely pathogenic |
| 17:45996435 | G:T | S515I | 0.933 | Likely pathogenic |
| 17:45996426 | G:A | S512N | 0.917 | Likely pathogenic |
| 17:45991538 | A:G | R487G | 0.916 | Likely pathogenic |
| 17:45996441 | C:G | P517R | 0.915 | Likely pathogenic |
| 17:45996421 | C:A | A490E | 0.910 | Likely pathogenic |
| 17:45996441 | C:A | P517H | 0.920 | Likely pathogenic |
| 17:45996444 | G:A | G518D | 0.969 | Likely pathogenic |
| 17:45991550 | A:G | K491E | 0.945 | Likely pathogenic |
Pathways & Gene Ontology
Biological Pathways
Reactome (3 pathways):
| Pathway ID | Pathway Name |
|---|---|
| R-HSA-264870 | Caspase-mediated cleavage of cytoskeletal proteins |
| R-HSA-9619483 | Activation of AMPK downstream of NMDARs |
| R-HSA-9833482 | PKR-mediated signaling |
MSigDB: No gene sets found
Total pathways: 3
Gene Ontology Annotations
Biological Process (51 terms):
| GO ID | Term |
|---|---|
| GO:0000226 | microtubule cytoskeleton organization |
| GO:0001774 | microglial cell activation |
| GO:0006974 | DNA damage response |
| GO:0007267 | cell-cell signaling |
| GO:0007416 | synapse assembly |
| GO:0007611 | learning or memory |
| GO:0007613 | memory |
| GO:0010288 | response to lead ion |
| GO:0010506 | regulation of autophagy |
| GO:0010629 | negative regulation of gene expression |
| GO:0010917 | negative regulation of mitochondrial membrane potential |
| GO:0016072 | rRNA metabolic process |
| GO:0019896 | axonal transport of mitochondrion |
| GO:0021954 | central nervous system neuron development |
| GO:0031110 | regulation of microtubule polymerization or depolymerization |
| GO:0031113 | regulation of microtubule polymerization |
| GO:0031116 | positive regulation of microtubule polymerization |
| GO:0031122 | cytoplasmic microtubule organization |
| GO:0031175 | neuron projection development |
| GO:0032930 | positive regulation of superoxide anion generation |
Molecular Function (25 terms):
| GO ID | Term |
|---|---|
| GO:0003677 | DNA binding |
| GO:0003680 | minor groove of adenine-thymine-rich DNA binding |
| GO:0003690 | double-stranded DNA binding |
| GO:0003697 | single-stranded DNA binding |
| GO:0003723 | RNA binding |
| GO:0003779 | actin binding |
| GO:0005515 | protein binding |
| GO:0008017 | microtubule binding |
| GO:0015631 | tubulin binding |
| GO:0017124 | SH3 domain binding |
| GO:0019899 | enzyme binding |
| GO:0019901 | protein kinase binding |
| GO:0030674 | protein-macromolecule adaptor activity |
| GO:0034185 | apolipoprotein binding |
| GO:0034452 | dynactin binding |
| GO:0035091 | phosphatidylinositol binding |
| GO:0042802 | identical protein binding |
| GO:0043565 | sequence-specific DNA binding |
| GO:0051087 | protein-folding chaperone binding |
| GO:0051721 | protein phosphatase 2A binding |
Cellular Component (28 terms):
| GO ID | Term |
|---|---|
| GO:0005576 | extracellular region |
| GO:0005634 | nucleus |
| GO:0005737 | cytoplasm |
| GO:0005739 | mitochondrion |
| GO:0005829 | cytosol |
| GO:0005856 | cytoskeleton |
| GO:0005874 | microtubule |
| GO:0005886 | plasma membrane |
| GO:0015630 | microtubule cytoskeleton |
| GO:0016020 | membrane |
| GO:0030424 | axon |
| GO:0030425 | dendrite |
| GO:0030426 | growth cone |
| GO:0030673 | axolemma |
| GO:0034399 | nuclear periphery |
| GO:0036464 | cytoplasmic ribonucleoprotein granule |
| GO:0036477 | somatodendritic compartment |
| GO:0042995 | cell projection |
| GO:0043005 | neuron projection |
| GO:0043025 | neuronal cell body |
Protein interactions & networks
Protein-Protein Interactions (STRING, BioGRID, IntAct)
Total interaction count:
- STRING: ~5,558 interactions
- BioGRID: 1,565 interactions
- IntAct: 160 curated interactions
- Combined: ~7,283 interactions documented
TOP 30 highest-confidence STRING interactors (combined score):
| Rank | Protein | Gene | UniProt | Score | Role/Function |
|---|---|---|---|---|---|
| 1 | MAPT (self) | MAPT | P10636 | 986 | Self-oligomerization |
| 2 | Amyloid-beta precursor protein | APP | P05067 | 986 | Neurodegeneration pathway |
| 3 | Non-selective voltage-gated ion channel VDAC1 | VDAC1 | P21796 | 974 | Mitochondrial transport |
| 4 | Apolipoprotein E | APOE | P02649 | 973 | Lipid metabolism, Alzheimer’s risk |
| 5 | Alpha-synuclein | SNCA | P37840 | 971 | Synucleinopathy pathway |
| 6 | TAR DNA-binding protein 43 | TARDBP | Q13148 | 970 | RNA binding, neurodegeneration |
| 7 | Tyrosine-protein kinase Fyn | FYN | P06241 | 958 | Tau phosphorylation |
| 8 | Cyclin-dependent kinase 5 | CDK5 | Q00535 | 948 | Tau phosphorylation kinase |
| 9 | Glycogen synthase kinase-3 beta | GSK3B | P49841 | 943 | Tau phosphorylation kinase |
| 10 | Presenilin-1 | PSEN1 | P49768 | 933 | γ-secretase, neurodegeneration |
| 11 | 14-3-3 protein theta | YWHAQ | P27348 | 903 | Tau binding/stabilization |
| 12 | RNA-binding protein FUS | FUS | P35637 | 889 | RNA binding, neurodegeneration |
| 13 | Progranulin | GRN | P23781 | 886 | Neuroinflammation |
| 14 | Presenilin-2 | PSEN2 | P49810 | 883 | γ-secretase, neurodegeneration |
| 15 | Guanine nucleotide exchange factor C9orf72 | C9orf72 | Q96LT7 | 852 | ALS/FTD pathway |
| 16 | Major prion protein | PRNP | P04156 | 834 | Protein aggregation |
| 17 | Synaptophysin | SYP | P08247 | 828 | Synaptic function |
| 18 | Synapsin-1 | SYN1 | P17600 | 826 | Synaptic regulation |
| 19 | Leucine-rich repeat serine/threonine-protein kinase 2 | LRRK2 | Q5S007 | 826 | Parkinson’s disease |
| 20 | Acetylcholinesterase | ACHE | P22303 | 811 | Neurotransmitter metabolism |
| 21 | Protein S100-B | S100B | P04271 | 802 | Glial protein, tau-associated |
| 22 | E3 ubiquitin-protein ligase parkin | PRKN | O60260 | 790 | Protein degradation, Parkinson’s |
| 23 | Saitohin | SAI1 | Q8IWL8 | 786 | MAPT-associated protein |
| 24 | KAT8 regulatory NSL complex subunit 1 | KANSL1 | Q7Z3B3 | 785 | Chromatin regulation |
| 25 | 14-3-3 protein zeta/delta | YWHAZ | P63104 | 775 | Tau binding/stabilization |
| 26 | Cytotoxic granule associated RNA binding protein TIA1 | TIA1 | P31483 | 775 | RNA binding |
| 27 | PHD finger protein 1 | PHF1 | O43189 | 772 | Tau phosphorylation |
| 28 | Heat shock protein HSP 90-alpha | HSP90AA1 | P07900 | 772 | Protein folding/stabilization |
| 29 | Parkinson disease protein 7 | DJ1 | Q99497 | 772 | Oxidative stress response |
| 30 | Beta-secretase 1 | BACE1 | P56817 | 764 | APP processing |
Protein Similarity
Structural/Embedding Similarity (ESM2 - top 20 by score):
| Rank | Protein | UniProt | Identity | Top Similarity Score | Species/Type |
|---|---|---|---|---|---|
| 1 | MAPT | P10636 | - | 1.0000 | Self |
| 2 | Microtubule-associated protein 2 (MAP2) | P11137 | 79.3% identity | 0.9995 | Human ortholog |
| 3 | Microtubule-associated protein 4 (MAP4) | P27816 | 77.8% identity | 0.9995+ | Human ortholog |
| 4 | Q5YCW0 | Q5YCW0 | 99% identity | 1.0000 | MAPT ortholog |
| 5 | Q5YCW1 | Q5YCW1 | 99% identity | 1.0000 | MAPT ortholog |
| 6 | P10637 | P10637 | 90.6% identity | 0.9997 | MAPT variant |
| 7 | Q5S6V2 | Q5S6V2 | 94.5% identity | 0.9994 | MAPT ortholog |
| 8 | O88778 | O88778 | - | 0.9996 | Mouse/rodent MAPT |
| 9 | O88737 | O88737 | - | 0.9996 | Mouse/rodent MAPT |
| 10 | O55103 | O55103 | - | 0.9995 | Mouse/rodent ortholog |
| 11 | Q5YCV9 | Q5YCV9 | 93.6% identity | 0.9999 | MAPT ortholog |
| 12 | P19332 | P19332 | 90.9% identity | 0.9997 | MAPT ortholog |
| 13 | P15146 | P15146 | 89% identity | 0.9995 | MAPT-related MAP |
| 14 | P29172 | P29172 | 88.6% identity | 0.9994 | MAPT ortholog |
| 15 | P06924 | P06924 | - | 0.9982 | MAP-like protein |
| 16 | E9PT23 | E9PT23 | - | 0.9993 | Ortholog (non-human) |
| 17 | Q5M7W5 | Q5M7W5 | 82.6% identity | 0.9994 | MAPT-like protein |
| 18 | Q13061 | Q13061 | - | 0.9990 | Related protein |
| 19 | Q16799 | Q16799 | - | 0.9999 | Related protein |
| 20 | P27546 | P27546 | 66.7% identity | 0.9994 | MAP family member |
Sequence Homology (DIAMOND - top 19 by identity %):
| Rank | Protein | Gene | UniProt | Identity | Bitscore |
|---|---|---|---|---|---|
| 1 | MAPT | MAPT | P10636 | 95.0% | 1070 |
| 2 | MAPT ortholog | - | Q5YCW0 | 99.0% | 1096 |
| 3 | MAPT ortholog | - | Q5YCW1 | 99.0% | 1093 |
| 4 | MAPT variant | - | P10637 | 90.6% | 966 |
| 5 | MAPT ortholog | - | Q5S6V2 | 94.5% | 1072 |
| 6 | MAPT ortholog | - | Q5YCV9 | 93.6% | 1067 |
| 7 | MAPT ortholog | - | P19332 | 90.9% | 968 |
| 8 | Microtubule-associated protein 2 | MAP2 | P11137 | 79.3% | 2385 |
| 9 | Microtubule-associated protein 4 | MAP4 | P27816 | 77.8% | 792 |
| 10 | MAPT ortholog | - | Q6TS35 | 84.4% | 464 |
| 11 | Apolipoprotein E | APOE | P15146 | 89.0% | 2796 |
| 12 | MAPT-related | - | Q5M7W5 | 82.6% | 896 |
| 13 | MAPT ortholog | - | P20357 | 89.0% | 2806 |
| 14 | MAPT ortholog | - | P36225 | 77.8% | 809 |
| 15 | MAPT ortholog | - | P29172 | 88.6% | 463 |
| 16 | MAPT ortholog | - | P27546 | 66.7% | 883 |
| 17 | MAPT ortholog | - | O02828 | 89.8% | 447 |
| 18 | MAPT ortholog | - | P57786 | 84.2% | 453 |
| 19 | MAPT ortholog | - | Q9MYX8 | 94.1% | 441 |
Key findings:
- MAPT shows extensive interactions with major neurodegeneration-associated proteins (APP, TARDBP, SNCA, PSEN1/2)
- Key tau phosphorylation kinases (CDK5, GSK3B, FYN) are highly scored interactors
- 14-3-3 proteins (YWHAQ, YWHAZ) are prominent tau-binding partners
- Structural similarity predominantly reflects MAPT orthologs and related microtubule-associated proteins (MAP2, MAP4)
- ESM2 embeddings identify highly similar neurodegenerative disease-associated proteins
Transcription factor regulatory data
MAPT is not a transcription factor. MAPT (microtubule-associated protein tau) is a microtubule-associated protein with 758 amino acids. It does not have known DNA-binding capability and is not classified as a transcription factor. Therefore, downstream targets and DNA-binding motifs are not applicable.
Upstream regulators
The following transcription factors regulate MAPT expression:
| TF Name | Evidence Type | Regulation |
|---|---|---|
| AR (Androgen Receptor) | CollecTRI (literature-curated) | Activation |
| DLX3 (Distal-less homeobox 3) | CollecTRI (literature-curated) | Unknown |
| TFAP2A (Transcription factor AP-2 alpha) | CollecTRI (literature-curated) | Repression |
Data source: CollecTRI—a curated database of transcriptional regulatory interactions combining literature-derived and computationally predicted TF-gene associations. The interactions listed above represent experimentally validated or literature-supported regulatory relationships, though specific evidence types (e.g., ChIP-seq experiments vs. computational predictions) are not differentiated in the current dataset.
Drug & pharmacology data
MAPT is an established drug target with active development programs in clinical trials, primarily focused on tau-related pathologies in Alzheimer’s disease and frontotemporal dementia.
Targeting molecules
Total count: ~100 molecules in ChEMBL testing against MAPT (CHEMBL1293224)
TOP molecules by highest development phase (Phase 4 = approved):
| Molecule ID | Name | Type | Highest Phase |
|---|---|---|---|
| CHEMBL4297245 | Donanemab | Antibody | 4 (Approved) |
| CHEMBL3833321 | Lecanemab | Antibody | 4 (Approved) |
| CHEMBL3039540 | Aducanumab | Antibody | 4 (Approved) |
| CHEMBL1008 | Bepridil | Small molecule | 4 |
| CHEMBL103 | Progesterone | Small molecule | 4 |
| CHEMBL11 | Imipramine | Small molecule | 4 |
| CHEMBL1112 | Aripiprazole | Small molecule | 4 |
| CHEMBL1116 | Raloxifene hydrochloride | Small molecule | 4 |
| CHEMBL1086440 | Triclabendazole | Small molecule | 4 |
Note: Most phase 4 compounds are approved for non-tau indications. The primary tau-targeting strategies in clinical development are monoclonal antibodies (anti-amyloid-beta agents that secondarily affect tau pathology).
Clinical trials
24 trials identified in tauopathies, with 20 trials in Alzheimer’s disease specifically:
Phase 3 trials:
- NCT05508789: Donanemab (LY3002813) in early symptomatic AD (TRAILBLAZER-ALZ 5) — RECRUITING
Phase 1-2 tau-targeting trials:
- NCT03538522: NA-831 — COMPLETED
- NCT04445831: Tau-targeted vaccines — COMPLETED
- NCT05318976 & NCT05321498: XPro1595 (TNF-α pathway modulation) — COMPLETED/WITHDRAWN
- NCT02294851: BMS-986168 (tau monoclonal antibody) — COMPLETED
- NCT05344989: APNmAb005 (tau antibody, first-in-human) — RECRUITING
Imaging/biomarker studies:
- NCT02103894: [18F]MNI-777 tau PET — COMPLETED
- NCT02676843: Tau PET with 18F-AV-1451 in MAPT mutation carriers — COMPLETED
Pharmacogenomics
Status: MAPT is a VIP (Very Important Pharmacogene) in PharmGKB but has NO CPIC (Clinical Pharmacogenetics Implementation Consortium) guidelines — no published dosing/response recommendations for MAPT genotypes.
Genetic variants: 601 variants documented in ClinVar, including:
- Pathogenic variants: Associated with frontotemporal dementia (e.g., c.2078C>T p.Pro693Leu, c.2392C>T p.Arg798Trp)
- Functionally relevant: MAPT H1/H2 haplotypes associated with tauopathy risk, but no specific drug-dosing guidelines established
Current evidence: MAPT genetic variants predict tau pathology burden and AD risk but do not yet inform personalized drug selection or dosing for tau-targeting therapies under clinical development.
Expression profiles
Tissue expression (Bgee)
MAPT shows ubiquitous expression (breadth: ubiquitous, average score: 74.48, max: 99.52) with marked enrichment in neural tissues.
| Rank | Tissue | Expression Score | Status |
|---|---|---|---|
| 1 | Cortical plate | 99.52 | Gold quality |
| 2 | Superior frontal gyrus | 98.76 | Gold quality |
| 3 | Prefrontal cortex | 98.51 | Gold quality |
| 4 | Frontal lobe | 98.45 | Gold quality |
| 5 | Frontal cortex | 98.45 | Gold quality |
| 6 | Right frontal lobe | 98.32 | Gold quality |
| 7 | Cerebellum | 98.29 | Gold quality |
| 8 | Right hemisphere of cerebellum | 98.29 | Gold quality |
| 9 | Cerebellar hemisphere | 98.28 | Gold quality |
| 10 | Cerebellar cortex | 98.28 | Gold quality |
| 11 | Brodmann area 9 | 98.08 | Gold quality |
| 12 | Primary visual cortex | 98.03 | Gold quality |
| 13 | Dorsolateral prefrontal cortex | 97.88 | Gold quality |
| 14 | Cerebral cortex | 97.87 | Gold quality |
| 15 | Anterior cingulate cortex | 97.39 | Gold quality |
| 16 | Hypothalamus | 97.34 | Gold quality |
| 17 | Ammon’s horn (hippocampus) | 97.33 | Gold quality |
| 18 | Temporal lobe | 97.32 | Gold quality |
| 19 | Amygdala | 97.29 | Gold quality |
| 20 | Brain | 96.91 | Gold quality |
| 21 | Putamen | 96.68 | Gold quality |
| 22 | Corpus callosum | 96.61 | Gold quality |
| 23 | Caudate nucleus | 96.58 | Gold quality |
| 24 | Substantia nigra | 96.45 | Gold quality |
| 25 | Nucleus accumbens | 96.36 | Gold quality |
| 26 | Embryo | 95.99 | Gold quality |
| 27 | Ganglionic eminence | 95.99 | Gold quality |
| 28 | Gastrocnemius | 95.59 | Gold quality |
| 29 | Skeletal muscle tissue | 95.52 | Gold quality |
| 30 | Skeletal muscle organ | 94.64 | Gold quality |
Tissue-specific patterns: MAPT demonstrates striking brain-specificity with highest expression in cortical regions (especially frontal and prefrontal cortex, cortical plate at 99.52). Expression remains very high across all major brain structures (cerebrum, cerebellum, basal ganglia, limbic system). Notable secondary expression in skeletal muscle and developing embryonic tissues reflects MAPT’s role in microtubule organization during development and neuronal growth.
Single-cell expression datasets
MAPT is expressed across multiple human single-cell/tissue datasets:
| Dataset | Tissue/Cell Population | Cells | Relevance |
|---|---|---|---|
| E-MTAB-8894 | Fetal lateral ganglionic eminence (PCW 7, 9, 11) | 150,129 | Neural progenitors & developing interneurons (MAPT score: 78.4 in cluster 1) |
| E-MTAB-9154 | iPSC-derived neurons ± genetic/toxic stressors | 36,117 | Neuronal differentiation & stress responses |
| E-MTAB-10485 | Fetal lateral ganglionic eminence extracellular signaling | 18,649 | Neural progenitor interactions |
| E-MTAB-5 (HCAD) | Organoid inter-individual variation assessment | 25,049 | Developmental variation in neural tissues |
| E-GEOD-93593 | H1 ESC → ventral differentiation (125-day protocol) | 1,733 | Neuronal lineage commitment |
| E-GEOD-75140 | Human cerebral organoids (neocortex development) | 734 | Neocortical development recapitulation |
Cell-type-specific patterns: MAPT is enriched in developing neural progenitor cells and differentiating neurons, consistent with its role as a neuronal cytoskeletal protein. Expression peaks in the lateral ganglionic eminence during critical developmental stages (PCW 7–11, corresponding to interneuron migration and specification). Strong expression in iPSC-derived neurons indicates MAPT is a marker of neuronal commitment and maturation. In organoid studies, MAPT expression tracks with neural cell identity and developmental progression.
Expression characteristics
- Developmental profile: High expression during fetal neurogenesis and brain development; maintained in adult neurons
- Neuronal enrichment: Expressed in neural progenitors, neurons, and glial progenitors; lower in non-neural tissues despite ubiquitous classification
- Adult brain regionalization: Highest in cortex, hippocampus, substantia nigra, and other regions implicated in cognition and motor function
- Tauopathy relevance: Widespread brain expression explains MAPT mutation involvement in tauopathies (Frontotemporal dementia, corticobasal degeneration, progressive supranuclear palsy)
Disease associations
Mendelian / Monogenic Diseases
MAPT mutations cause several neurodegenerative tauopathies with autosomal dominant (or recessive) inheritance:
| Disease | OMIM | Mondo | Orphanet | Inheritance | Evidence Level |
|---|---|---|---|---|---|
| Late-onset Parkinson disease | 168600 | MONDO:0008199 | — | Autosomal dominant | Strong (Genomics England PanelApp) |
| Pick disease | 172700 | MONDO:0008243 | 100069, 282 | Autosomal dominant | Strong |
| Semantic dementia | 600274 | MONDO:0010857 | 100069, 100070, 275864, 282 | Autosomal dominant | Strong |
| Supranuclear palsy, progressive, 1 (PSP-P) | 601104 | MONDO:0010997, MONDO:0019037 | 683, 240071 | Autosomal dominant | Strong |
| Progressive supranuclear palsy-parkinsonism syndrome | 260540 | MONDO:0009839 | 683, 240085, 240094, 240103, 240112, 99750 | Autosomal recessive | Moderate-Limited |
Phenotype Associations (HPO)
Top 30 clinical phenotypes associated with MAPT mutations:
- HP:0000605 — Supranuclear gaze palsy
- HP:0000726 — Dementia
- HP:0001300 — Parkinsonism
- HP:0002145 — Frontotemporal dementia
- HP:0002185 — Neurofibrillary tangles
- HP:0002354 — Memory impairment
- HP:0000511 — Vertical supranuclear gaze palsy
- HP:0002063 — Rigidity
- HP:0002067 — Bradykinesia
- HP:0006892 — Frontotemporal cerebral atrophy
- HP:0002141 — Gait imbalance
- HP:0002172 — Postural instability
- HP:0001337 — Tremor
- HP:0001288 — Gait disturbance
- HP:0001260 — Dysarthria
- HP:0002015 — Dysphagia
- HP:0002381 — Aphasia
- HP:0000727 — Frontal lobe dementia
- HP:0000738 — Hallucinations
- HP:0000726 — Dementia
- HP:0000737 — Irritability
- HP:0000741 — Apathy
- HP:0000743 — Frontal release signs
- HP:0000751 — Personality changes
- HP:0001268 — Mental deterioration
- HP:0012444 — Brain atrophy
- HP:0012671 — Abulia
- HP:0007164 — Slurred speech
- HP:0010522 — Dyslexia
- HP:0011098 — Speech apraxia
Complex Disease / GWAS Associations
Top 30 GWAS-associated traits with MAPT loci (p < 10⁻⁸):
Neurodegenerative / Cognitive (strongest)
- Parkinson’s disease (p = 1×10⁻⁶⁸, GCST004902)
- Progressive supranuclear palsy (p = 2×10⁻¹²² to 4×10⁻¹¹³, multiple studies)
- Corticobasal degeneration (p = 1×10⁻¹², GCST002971)
- Plasma t-tau levels (p = 1×10⁻¹², GCST004092)
- General cognitive ability (p = 1×10⁻¹³ to 3×10⁻⁸, multiple studies)
- Cognitive ability (p = 8×10⁻⁸, GCST005142)
- Cognitive performance (processing speed) (p = 9×10⁻⁶, GCST008891)
- Reaction time (p = 1×10⁻¹⁵ to 6×10⁻¹⁵, multiple studies)
Neuroticism / Psychiatric 9. General factor of neuroticism (p = 5×10⁻¹⁵ to 1×10⁻⁹, multiple studies) 10. Alcohol use disorder (consumption score) (p = 2×10⁻¹⁴, GCST012336) 11. Alcohol consumption (drinks per week) (p = 5×10⁻²³, GCST008757) 12. Experiencing mood swings (p = 2×10⁻¹⁰, with WNT3) 13. Feeling hurt (p = 1×10⁻¹¹, with WNT3) 14. Mood instability (p = 7×10⁻²⁰, GCST008357)
Morphometric / Hematologic 15. Red blood cell count (p = 9×10⁻⁶⁶, GCST90002403) 16. Hemoglobin levels (p = 1×10⁻⁶¹, GCST010083) 17. Mean spheric corpuscular volume (p = 2×10⁻⁵³, GCST90002397) 18. Mean reticulocyte volume (p = 5×10⁻²⁸, GCST90002396) 19. Waist-to-hip ratio adjusted for BMI (p = 5×10⁻¹³ to 2×10⁻⁸, multiple studies) 20. A body shape index (p = 5×10⁻¹² to 1×10⁻¹⁰, multiple studies) 21. Waist-hip index (p = 2×10⁻¹² to 2×10⁻⁸, multiple studies) 22. Hip circumference adjusted for BMI (p = 9×10⁻⁹, GCST90020028)
Respiratory / Pulmonary 23. Idiopathic pulmonary fibrosis (p = 6×10⁻⁹ to 3×10⁻¹⁶, MAPT-AS1, GCST009758) 24. Interstitial lung disease (p = 9×10⁻¹⁴, GCST001968)
Cardiovascular 25. Atrial fibrillation (p = 4×10⁻⁹, GCST006061) 26. QRS complex duration (p = 2×10⁻¹⁴, GCST003872)
Metabolic / Other 27. Primary biliary cirrhosis (p = 2×10⁻⁹, GCST005581) 28. Osteoarthrosis (p = 3×10⁻⁸, GCST010771) 29. Male-pattern baldness (p = 1×10⁻²⁶ to 8×10⁻²⁹, multiple studies) 30. Type 1 diabetes (p = 2×10⁻¹⁵, GCST90013445)