MAPT Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human MAPT — a definitive lookup resource covering: ### …

Provide a comprehensive cross-database identifier and functional mapping reference for human MAPT — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene MAPT, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene MAPT, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene MAPT protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene MAPT protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene MAPT, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene MAPT, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene MAPT, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene MAPT protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene MAPT, summarize transcription factor regulatory data. If MAPT is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate MAPT — names with evidence type (ChIP-seq / predicted / experimentally validated) If MAPT is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene MAPT protein as a drug target, summarize pharmacology data. If MAPT is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If MAPT is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene MAPT, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene MAPT, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in MAPT: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations

MAPT

Executive summary

MAPT encodes microtubule-associated protein tau, a neuronal cytoskeletal protein that is central to the pathogenesis of Alzheimer’s disease and a broad class of neurodegenerative disorders collectively termed tauopathies. Its highest expression is in cortical brain regions (score 99.52 in cortical plate), consistent with its prominent role in neuronal function. The gene spans chromosome 17q21 and produces at least 52 transcripts, reflecting extraordinary isoform complexity. Pathogenic MAPT variants — predominantly causing frontotemporal dementia with parkinsonism (FTDP-17) — number approximately 30 in ClinVar, and GWAS links the locus to Parkinson’s disease (p = 1×10⁻⁶⁸) and progressive supranuclear palsy (p = 2×10⁻¹²²). Structurally, tau is an intrinsically disordered protein (AlphaFold global pLDDT 49.56, no residues with pLDDT > 90), yet 274 experimental PDB structures capture its disease-relevant filament conformations across multiple tauopathies. Its interaction network (~7,283 documented interactions) is dominated by key phosphorylation kinases CDK5 and GSK3B and major neurodegeneration proteins including APP, APOE, and SNCA.

Gene identifiers

  • HGNC ID: HGNC:6893
  • Approved symbol: MAPT
  • Ensembl gene ID: ENSG00000186868
  • NCBI Entrez Gene ID: 4137
  • OMIM gene/locus ID: 157140
  • Genomic location (GRCh38):
    • Chromosome: 17
    • Start position: 45,894,358 bp
    • End position: 46,028,334 bp
    • Strand: +

Transcript identifiers

Ensembl Transcripts (52 total)

ENST IDBiotype
ENST00000262410protein_coding
ENST00000334239protein_coding
ENST00000344290protein_coding
ENST00000351559protein_coding
ENST00000415613protein_coding
ENST00000420682protein_coding
ENST00000431008protein_coding
ENST00000446361protein_coding
ENST00000535772protein_coding
ENST00000570299protein_coding_CDS_not_defined
ENST00000571311nonsense_mediated_decay
ENST00000571987protein_coding
ENST00000572440retained_intron
ENST00000574436protein_coding
ENST00000576518retained_intron
ENST00000577017protein_coding_CDS_not_defined
ENST00000680542nonsense_mediated_decay
ENST00000680674protein_coding
ENST00000703922nonsense_mediated_decay
ENST00000703923nonsense_mediated_decay
ENST00000703924nonsense_mediated_decay
ENST00000703974retained_intron
ENST00000703975retained_intron
ENST00000703976retained_intron
ENST00000703977protein_coding_CDS_not_defined
ENST00000703978nonsense_mediated_decay
ENST00000703979protein_coding_CDS_not_defined
ENST00000703980protein_coding_CDS_not_defined
ENST00000703981protein_coding_CDS_not_defined
ENST00000884777protein_coding
ENST00000884778protein_coding
ENST00000884779protein_coding
ENST00000884780protein_coding
ENST00000884781protein_coding
ENST00000884782protein_coding
ENST00000884783protein_coding
ENST00000884784protein_coding
ENST00000884785protein_coding
ENST00000884786protein_coding
ENST00000884787protein_coding
ENST00000884788protein_coding
ENST00000972111protein_coding
ENST00000972112protein_coding
ENST00000972113protein_coding
ENST00000972114protein_coding
ENST00000972115protein_coding
ENST00000972116protein_coding
ENST00000972117protein_coding
ENST00000972118protein_coding
ENST00000972119protein_coding
ENST00000972120protein_coding

RefSeq mRNA Accessions (57 total)

NM IDMANE Select
NM_001038609
NM_001123066
NM_001123067
NM_001203251
NM_001203252
NM_001276088
NM_001276089
NM_001276090
NM_001276091
NM_001285454
NM_001285455
NM_001285456
NM_001300634
NM_001300635
NM_001300636
NM_001300637
NM_001300638
NM_001300639
NM_001316620
NM_001377265
NM_001377266
NM_001377267
NM_001377268
NM_001382126
NM_001403975
NM_001403976
NM_001403977
NM_001403978
NM_001403979
NM_001403980
NM_001403981
NM_001403982
NM_001403983
NM_001403984
NM_001403987
NM_001403990
NM_001403992
NM_001403994
NM_001403996
NM_001403998
NM_001403999
NM_001404002
NM_001404004
NM_001404005
NM_001404006
NM_001404010
NM_001404011
NM_001404012
NM_001404014
NM_001404015
NM_001404016
NM_005910
NM_010838
NM_016834
NM_016835
NM_016841
NM_017212
NM_143318
NM_170345

CCDS IDs (10 total)

CCDS ID
CCDS11499
CCDS11500
CCDS11501
CCDS11502
CCDS45715
CCDS45716
CCDS56033
CCDS92345
CCDS92346
CCDS92347

MANE Select Transcript: ENST00000262410 (NM_001377265)

Exons (13 total)

Exon IDStartEndStrandChromosome
ENSE000039009764589455445894686+17
ENSE000035499664596232145962470+17
ENSE000036588944597837545978440+17
ENSE000036821224599146045991586+17
ENSE000024107204598987845990075+17
ENSE000036113354598704045987095+17
ENSE000036340584599639945996664+17
ENSE000035709764601861846018730+17
ENSE000026723954598286645983930+17
ENSE000024091894601031046010402+17
ENSE000024054914601424346014324+17
ENSE000012912404602395646028334+17

Protein identifiers

UniProt accessions

  • P10636 (canonical, reviewed) — Microtubule-associated protein tau
  • A0A7I2PJZ2 (unreviewed)
  • A0A7I2PLE3 (unreviewed)
  • A0A7P0T936 (unreviewed)
  • I3L2Z2 (unreviewed)

RefSeq protein (NP_ accessions)

NP_001033698, NP_001116538, NP_001116539, NP_001190180, NP_001190181, NP_001263017, NP_001263018, NP_001263019, NP_001263020, NP_001272383, NP_001272384, NP_001272385, NP_001287563, NP_001287564, NP_001287565, NP_001287566, NP_001287567, NP_001287568, NP_001303549, NP_001364194, NP_001364195, NP_001364196, NP_001364197, NP_001369032, NP_001390904, NP_001390905, NP_001390906, NP_001390907, NP_001390908, NP_001390909, NP_001390910, NP_001390911, NP_001390912, NP_001390913, NP_001390916, NP_001390919, NP_001390921, NP_001390923, NP_001390925, NP_001390927, NP_001390928

Protein domains and families

IDNameType
IPR001084Microtubule associated protein, tubulin-binding repeatRepeat
IPR002955Microtubule-associated protein TauFamily
IPR027324Microtubule associated protein MAP2/MAP4/TauFamily
PF00418Protein of unknown function (Pfam domain)Pfam domain

Antibody availability

Limited records in public biobtree antibody databases. However, tau protein (P10636) has extensive cross-references to PDB structures (274 entries) and is well-characterized. Commercial antibody resources available through: Human Protein Atlas (ENSG00000186868), BindingDB (675 entries), and antibody vendor catalogs targeting epitopes across the protein sequence.

Structure

Experimental Structures

PDB Entries: 274 total

By Experimental Method:

MethodCountResolution Range
Electron Microscopy (Cryo-EM)~2201.9 – 4.3 Å
X-Ray Diffraction~451.0 – 3.1 Å
Solution NMR~5Not applicable
Electron Crystallography~31.1 – 2.0 Å
Solid-State NMR~3Not applicable

Key Structure Types:

  • Disease-associated filaments: Paired helical filaments (PHF), straight filaments (SF), and twisted filaments from Alzheimer’s disease, chronic traumatic encephalopathy (CTE), progressive supranuclear palsy (PSP), corticobasal degeneration (CBD), and other tauopathies
  • Antibody-Tau complexes: Fab fragments in complex with tau peptides (e.g., AT8, PT3, BIIB092, CBTAU series)
  • Protein binding complexes: 14-3-3 sigma with phosphorylated tau peptides; tau-microtubule complexes; Fyn kinase interaction
  • Tau peptide structures: Individual tau repeat fragments, amyloid-forming VQIVYK segment
  • In vitro assembled filaments: Various morphologies under different buffer conditions (PBS, heparin, ions, etc.)

Notable PDB entries: 5O3L, 6HRE (AD PHF); 6TJO, 6TJX (CBD); 7P65 (PSP); 8Q8R, 8Q8S (AD PHF/THF variants); 22JY (highest resolution cryo-EM at 2.2 Å)


Predicted Structures

AlphaFold Model:

  • Model ID: P10636
  • Global pLDDT: 49.56 (low confidence, consistent with intrinsically disordered protein)
  • Fraction with pLDDT > 90: 0.00 (no highly confident regions)
  • Protein Length: 758 amino acids (canonical isoform)

Note: Low confidence reflects tau’s nature as an intrinsically disordered protein with limited stable secondary structure in its native state.

Cross-species orthologs

OrganismGene IDSymbol
Mouse (Mus musculus)ENSMUSG00000018411Mapt
Rat (Rattus norvegicus)ENSRNOG00000005133Mapt
Zebrafish (Danio rerio)ENSDARG00000087616, ENSDARG00000089314maptb, mapta
Fruit fly (Drosophila melanogaster)nonenone
Worm (C. elegans)nonenone
Yeast (S. cerevisiae)nonenone

Clinical variants & AI predictions

ClinVar Summary

ClassificationCount
Pathogenic~30
Likely Pathogenic~5
Uncertain Significance~100
Likely Benign~220
Benign~246
Total~601

Top 30 Pathogenic/Likely Pathogenic Clinical Variants

Variant IDHGVS NotationClassificationAssociated Condition
14245NM_001377265.1:c.2078C>T (p.Pro693Leu)PathogenicFTDP-17
14246NM_001377265.1:c.1991G>T (p.Gly664Val)PathogenicFTDP-17
14247NM_001377265.1:c.2392C>T (p.Arg798Trp)PathogenicFTDP-17
14248NM_001377265.1:c.2091+14C>TPathogenicFTDP-17
14251NM_001377265.1:c.2091+1G>APathogenicFTDP-17 (splice donor)
14252NM_001377265.1:c.2185G>A (p.Val729Met)PathogenicFTDP-17
14253NM_001377265.1:c.2013T>G (p.Asn671Lys)PathogenicFTDP-17
14254NM_001377265.1:c.2090G>A (p.Ser697Asn)PathogenicFTDP-17
14255NM_001377265.1:c.2341G>A (p.Gly781Arg)Pathogenic/LikelyFTDP-17
14256NM_001377265.1:c.2077C>T (p.Pro693Ser)PathogenicFTDP-17
14257NM_001377265.1:c.2064T>C (p.Asn688=)PathogenicFTDP-17
14258NM_001377265.1:c.2201A>T (p.Glu734Val)PathogenicFTDP-17
14259NM_001377265.1:c.1946A>C (p.Lys649Thr)PathogenicFTDP-17
14260NM_001377265.1:c.2282A>T (p.Lys761Ile)PathogenicFTDP-17
14262NM_001377265.1:c.2135C>T (p.Ser712Phe)PathogenicFTDP-17
14263NM_001377265.1:c.14G>T (p.Arg5Leu)PathogenicFTDP-17
14266NM_001377265.1:c.1972C>G (p.Leu658Val)PathogenicFTDP-17
14267NM_001377265.1:c.2231C>T (p.Ser744Leu)PathogenicFTDP-17
14268NM_001377265.1:c.2126A>T (p.Lys709Met)PathogenicFTDP-17
14269NM_001377265.1:c.2084G>T (p.Gly695Val)PathogenicFTDP-17
2049744NM_001377265.1:c.2091+16C>GLikely PathogenicFTDP-17

Splice Effect Predictions (SpliceAI)

  • Total predictions: ~3,502
  • Donor gain events: ~2,800
  • Donor loss events: ~400
  • Acceptor gain/loss events: ~300
PositionVariantGeneEffect TypeDelta Score
17:45894684CAGGT:CMAPTDonor loss1.00
17:45894685AGGTA:AMAPTDonor loss1.00
17:45894686GGTA:GMAPTDonor loss1.00
17:45894687G:GAMAPTDonor loss1.00
17:45894688T:AMAPTDonor loss0.99
17:45894683TCAG:TMAPTDonor gain0.96
17:45894685AG:AMAPTDonor gain0.95
17:45894686GG:GMAPTDonor gain0.95
17:45894687G:GGMAPTDonor gain0.99
17:45895712G:GTMAPTDonor gain0.59

AlphaMissense Pathogenicity Predictions (P10636)

  • Total variants assessed: ~700
  • Likely pathogenic predictions: ~144
  • Pathogenicity range: 0.042–0.997
PositionVariantProtein Changeam_pathogenicityam_class
17:45996437A:CS516R0.997Likely pathogenic
17:45996425A:CS512R0.994Likely pathogenic
17:45996436C:AS515R0.991Likely pathogenic
17:45996428G:CG513R0.986Likely pathogenic
17:45991542T:CI488T0.988Likely pathogenic
17:45991540G:CR487S0.981Likely pathogenic
17:45996429G:AG513D0.990Likely pathogenic
17:45991552A:CK491N0.966Likely pathogenic
17:45991552A:TK491N0.966Likely pathogenic
17:45996438G:AS516N0.953Likely pathogenic
17:45996426G:TS512I0.954Likely pathogenic
17:45991539G:TR487M0.951Likely pathogenic
17:45996435G:TS515I0.933Likely pathogenic
17:45996426G:AS512N0.917Likely pathogenic
17:45991538A:GR487G0.916Likely pathogenic
17:45996441C:GP517R0.915Likely pathogenic
17:45996421C:AA490E0.910Likely pathogenic
17:45996441C:AP517H0.920Likely pathogenic
17:45996444G:AG518D0.969Likely pathogenic
17:45991550A:GK491E0.945Likely pathogenic

Pathways & Gene Ontology

Biological Pathways

Reactome (3 pathways):

Pathway IDPathway Name
R-HSA-264870Caspase-mediated cleavage of cytoskeletal proteins
R-HSA-9619483Activation of AMPK downstream of NMDARs
R-HSA-9833482PKR-mediated signaling

MSigDB: No gene sets found

Total pathways: 3


Gene Ontology Annotations

Biological Process (51 terms):

GO IDTerm
GO:0000226microtubule cytoskeleton organization
GO:0001774microglial cell activation
GO:0006974DNA damage response
GO:0007267cell-cell signaling
GO:0007416synapse assembly
GO:0007611learning or memory
GO:0007613memory
GO:0010288response to lead ion
GO:0010506regulation of autophagy
GO:0010629negative regulation of gene expression
GO:0010917negative regulation of mitochondrial membrane potential
GO:0016072rRNA metabolic process
GO:0019896axonal transport of mitochondrion
GO:0021954central nervous system neuron development
GO:0031110regulation of microtubule polymerization or depolymerization
GO:0031113regulation of microtubule polymerization
GO:0031116positive regulation of microtubule polymerization
GO:0031122cytoplasmic microtubule organization
GO:0031175neuron projection development
GO:0032930positive regulation of superoxide anion generation

Molecular Function (25 terms):

GO IDTerm
GO:0003677DNA binding
GO:0003680minor groove of adenine-thymine-rich DNA binding
GO:0003690double-stranded DNA binding
GO:0003697single-stranded DNA binding
GO:0003723RNA binding
GO:0003779actin binding
GO:0005515protein binding
GO:0008017microtubule binding
GO:0015631tubulin binding
GO:0017124SH3 domain binding
GO:0019899enzyme binding
GO:0019901protein kinase binding
GO:0030674protein-macromolecule adaptor activity
GO:0034185apolipoprotein binding
GO:0034452dynactin binding
GO:0035091phosphatidylinositol binding
GO:0042802identical protein binding
GO:0043565sequence-specific DNA binding
GO:0051087protein-folding chaperone binding
GO:0051721protein phosphatase 2A binding

Cellular Component (28 terms):

GO IDTerm
GO:0005576extracellular region
GO:0005634nucleus
GO:0005737cytoplasm
GO:0005739mitochondrion
GO:0005829cytosol
GO:0005856cytoskeleton
GO:0005874microtubule
GO:0005886plasma membrane
GO:0015630microtubule cytoskeleton
GO:0016020membrane
GO:0030424axon
GO:0030425dendrite
GO:0030426growth cone
GO:0030673axolemma
GO:0034399nuclear periphery
GO:0036464cytoplasmic ribonucleoprotein granule
GO:0036477somatodendritic compartment
GO:0042995cell projection
GO:0043005neuron projection
GO:0043025neuronal cell body

Protein interactions & networks

Protein-Protein Interactions (STRING, BioGRID, IntAct)

Total interaction count:

  • STRING: ~5,558 interactions
  • BioGRID: 1,565 interactions
  • IntAct: 160 curated interactions
  • Combined: ~7,283 interactions documented

TOP 30 highest-confidence STRING interactors (combined score):

RankProteinGeneUniProtScoreRole/Function
1MAPT (self)MAPTP10636986Self-oligomerization
2Amyloid-beta precursor proteinAPPP05067986Neurodegeneration pathway
3Non-selective voltage-gated ion channel VDAC1VDAC1P21796974Mitochondrial transport
4Apolipoprotein EAPOEP02649973Lipid metabolism, Alzheimer’s risk
5Alpha-synucleinSNCAP37840971Synucleinopathy pathway
6TAR DNA-binding protein 43TARDBPQ13148970RNA binding, neurodegeneration
7Tyrosine-protein kinase FynFYNP06241958Tau phosphorylation
8Cyclin-dependent kinase 5CDK5Q00535948Tau phosphorylation kinase
9Glycogen synthase kinase-3 betaGSK3BP49841943Tau phosphorylation kinase
10Presenilin-1PSEN1P49768933γ-secretase, neurodegeneration
1114-3-3 protein thetaYWHAQP27348903Tau binding/stabilization
12RNA-binding protein FUSFUSP35637889RNA binding, neurodegeneration
13ProgranulinGRNP23781886Neuroinflammation
14Presenilin-2PSEN2P49810883γ-secretase, neurodegeneration
15Guanine nucleotide exchange factor C9orf72C9orf72Q96LT7852ALS/FTD pathway
16Major prion proteinPRNPP04156834Protein aggregation
17SynaptophysinSYPP08247828Synaptic function
18Synapsin-1SYN1P17600826Synaptic regulation
19Leucine-rich repeat serine/threonine-protein kinase 2LRRK2Q5S007826Parkinson’s disease
20AcetylcholinesteraseACHEP22303811Neurotransmitter metabolism
21Protein S100-BS100BP04271802Glial protein, tau-associated
22E3 ubiquitin-protein ligase parkinPRKNO60260790Protein degradation, Parkinson’s
23SaitohinSAI1Q8IWL8786MAPT-associated protein
24KAT8 regulatory NSL complex subunit 1KANSL1Q7Z3B3785Chromatin regulation
2514-3-3 protein zeta/deltaYWHAZP63104775Tau binding/stabilization
26Cytotoxic granule associated RNA binding protein TIA1TIA1P31483775RNA binding
27PHD finger protein 1PHF1O43189772Tau phosphorylation
28Heat shock protein HSP 90-alphaHSP90AA1P07900772Protein folding/stabilization
29Parkinson disease protein 7DJ1Q99497772Oxidative stress response
30Beta-secretase 1BACE1P56817764APP processing

Protein Similarity

Structural/Embedding Similarity (ESM2 - top 20 by score):

RankProteinUniProtIdentityTop Similarity ScoreSpecies/Type
1MAPTP10636-1.0000Self
2Microtubule-associated protein 2 (MAP2)P1113779.3% identity0.9995Human ortholog
3Microtubule-associated protein 4 (MAP4)P2781677.8% identity0.9995+Human ortholog
4Q5YCW0Q5YCW099% identity1.0000MAPT ortholog
5Q5YCW1Q5YCW199% identity1.0000MAPT ortholog
6P10637P1063790.6% identity0.9997MAPT variant
7Q5S6V2Q5S6V294.5% identity0.9994MAPT ortholog
8O88778O88778-0.9996Mouse/rodent MAPT
9O88737O88737-0.9996Mouse/rodent MAPT
10O55103O55103-0.9995Mouse/rodent ortholog
11Q5YCV9Q5YCV993.6% identity0.9999MAPT ortholog
12P19332P1933290.9% identity0.9997MAPT ortholog
13P15146P1514689% identity0.9995MAPT-related MAP
14P29172P2917288.6% identity0.9994MAPT ortholog
15P06924P06924-0.9982MAP-like protein
16E9PT23E9PT23-0.9993Ortholog (non-human)
17Q5M7W5Q5M7W582.6% identity0.9994MAPT-like protein
18Q13061Q13061-0.9990Related protein
19Q16799Q16799-0.9999Related protein
20P27546P2754666.7% identity0.9994MAP family member

Sequence Homology (DIAMOND - top 19 by identity %):

RankProteinGeneUniProtIdentityBitscore
1MAPTMAPTP1063695.0%1070
2MAPT ortholog-Q5YCW099.0%1096
3MAPT ortholog-Q5YCW199.0%1093
4MAPT variant-P1063790.6%966
5MAPT ortholog-Q5S6V294.5%1072
6MAPT ortholog-Q5YCV993.6%1067
7MAPT ortholog-P1933290.9%968
8Microtubule-associated protein 2MAP2P1113779.3%2385
9Microtubule-associated protein 4MAP4P2781677.8%792
10MAPT ortholog-Q6TS3584.4%464
11Apolipoprotein EAPOEP1514689.0%2796
12MAPT-related-Q5M7W582.6%896
13MAPT ortholog-P2035789.0%2806
14MAPT ortholog-P3622577.8%809
15MAPT ortholog-P2917288.6%463
16MAPT ortholog-P2754666.7%883
17MAPT ortholog-O0282889.8%447
18MAPT ortholog-P5778684.2%453
19MAPT ortholog-Q9MYX894.1%441

Key findings:

  • MAPT shows extensive interactions with major neurodegeneration-associated proteins (APP, TARDBP, SNCA, PSEN1/2)
  • Key tau phosphorylation kinases (CDK5, GSK3B, FYN) are highly scored interactors
  • 14-3-3 proteins (YWHAQ, YWHAZ) are prominent tau-binding partners
  • Structural similarity predominantly reflects MAPT orthologs and related microtubule-associated proteins (MAP2, MAP4)
  • ESM2 embeddings identify highly similar neurodegenerative disease-associated proteins

Transcription factor regulatory data

MAPT is not a transcription factor. MAPT (microtubule-associated protein tau) is a microtubule-associated protein with 758 amino acids. It does not have known DNA-binding capability and is not classified as a transcription factor. Therefore, downstream targets and DNA-binding motifs are not applicable.

Upstream regulators

The following transcription factors regulate MAPT expression:

TF NameEvidence TypeRegulation
AR (Androgen Receptor)CollecTRI (literature-curated)Activation
DLX3 (Distal-less homeobox 3)CollecTRI (literature-curated)Unknown
TFAP2A (Transcription factor AP-2 alpha)CollecTRI (literature-curated)Repression

Data source: CollecTRI—a curated database of transcriptional regulatory interactions combining literature-derived and computationally predicted TF-gene associations. The interactions listed above represent experimentally validated or literature-supported regulatory relationships, though specific evidence types (e.g., ChIP-seq experiments vs. computational predictions) are not differentiated in the current dataset.

Drug & pharmacology data

MAPT is an established drug target with active development programs in clinical trials, primarily focused on tau-related pathologies in Alzheimer’s disease and frontotemporal dementia.

Targeting molecules

Total count: ~100 molecules in ChEMBL testing against MAPT (CHEMBL1293224)

TOP molecules by highest development phase (Phase 4 = approved):

Molecule IDNameTypeHighest Phase
CHEMBL4297245DonanemabAntibody4 (Approved)
CHEMBL3833321LecanemabAntibody4 (Approved)
CHEMBL3039540AducanumabAntibody4 (Approved)
CHEMBL1008BepridilSmall molecule4
CHEMBL103ProgesteroneSmall molecule4
CHEMBL11ImipramineSmall molecule4
CHEMBL1112AripiprazoleSmall molecule4
CHEMBL1116Raloxifene hydrochlorideSmall molecule4
CHEMBL1086440TriclabendazoleSmall molecule4

Note: Most phase 4 compounds are approved for non-tau indications. The primary tau-targeting strategies in clinical development are monoclonal antibodies (anti-amyloid-beta agents that secondarily affect tau pathology).

Clinical trials

24 trials identified in tauopathies, with 20 trials in Alzheimer’s disease specifically:

Phase 3 trials:

  • NCT05508789: Donanemab (LY3002813) in early symptomatic AD (TRAILBLAZER-ALZ 5) — RECRUITING

Phase 1-2 tau-targeting trials:

  • NCT03538522: NA-831 — COMPLETED
  • NCT04445831: Tau-targeted vaccines — COMPLETED
  • NCT05318976 & NCT05321498: XPro1595 (TNF-α pathway modulation) — COMPLETED/WITHDRAWN
  • NCT02294851: BMS-986168 (tau monoclonal antibody) — COMPLETED
  • NCT05344989: APNmAb005 (tau antibody, first-in-human) — RECRUITING

Imaging/biomarker studies:

  • NCT02103894: [18F]MNI-777 tau PET — COMPLETED
  • NCT02676843: Tau PET with 18F-AV-1451 in MAPT mutation carriers — COMPLETED

Pharmacogenomics

Status: MAPT is a VIP (Very Important Pharmacogene) in PharmGKB but has NO CPIC (Clinical Pharmacogenetics Implementation Consortium) guidelines — no published dosing/response recommendations for MAPT genotypes.

Genetic variants: 601 variants documented in ClinVar, including:

  • Pathogenic variants: Associated with frontotemporal dementia (e.g., c.2078C>T p.Pro693Leu, c.2392C>T p.Arg798Trp)
  • Functionally relevant: MAPT H1/H2 haplotypes associated with tauopathy risk, but no specific drug-dosing guidelines established

Current evidence: MAPT genetic variants predict tau pathology burden and AD risk but do not yet inform personalized drug selection or dosing for tau-targeting therapies under clinical development.

Expression profiles

Tissue expression (Bgee)

MAPT shows ubiquitous expression (breadth: ubiquitous, average score: 74.48, max: 99.52) with marked enrichment in neural tissues.

RankTissueExpression ScoreStatus
1Cortical plate99.52Gold quality
2Superior frontal gyrus98.76Gold quality
3Prefrontal cortex98.51Gold quality
4Frontal lobe98.45Gold quality
5Frontal cortex98.45Gold quality
6Right frontal lobe98.32Gold quality
7Cerebellum98.29Gold quality
8Right hemisphere of cerebellum98.29Gold quality
9Cerebellar hemisphere98.28Gold quality
10Cerebellar cortex98.28Gold quality
11Brodmann area 998.08Gold quality
12Primary visual cortex98.03Gold quality
13Dorsolateral prefrontal cortex97.88Gold quality
14Cerebral cortex97.87Gold quality
15Anterior cingulate cortex97.39Gold quality
16Hypothalamus97.34Gold quality
17Ammon’s horn (hippocampus)97.33Gold quality
18Temporal lobe97.32Gold quality
19Amygdala97.29Gold quality
20Brain96.91Gold quality
21Putamen96.68Gold quality
22Corpus callosum96.61Gold quality
23Caudate nucleus96.58Gold quality
24Substantia nigra96.45Gold quality
25Nucleus accumbens96.36Gold quality
26Embryo95.99Gold quality
27Ganglionic eminence95.99Gold quality
28Gastrocnemius95.59Gold quality
29Skeletal muscle tissue95.52Gold quality
30Skeletal muscle organ94.64Gold quality

Tissue-specific patterns: MAPT demonstrates striking brain-specificity with highest expression in cortical regions (especially frontal and prefrontal cortex, cortical plate at 99.52). Expression remains very high across all major brain structures (cerebrum, cerebellum, basal ganglia, limbic system). Notable secondary expression in skeletal muscle and developing embryonic tissues reflects MAPT’s role in microtubule organization during development and neuronal growth.

Single-cell expression datasets

MAPT is expressed across multiple human single-cell/tissue datasets:

DatasetTissue/Cell PopulationCellsRelevance
E-MTAB-8894Fetal lateral ganglionic eminence (PCW 7, 9, 11)150,129Neural progenitors & developing interneurons (MAPT score: 78.4 in cluster 1)
E-MTAB-9154iPSC-derived neurons ± genetic/toxic stressors36,117Neuronal differentiation & stress responses
E-MTAB-10485Fetal lateral ganglionic eminence extracellular signaling18,649Neural progenitor interactions
E-MTAB-5 (HCAD)Organoid inter-individual variation assessment25,049Developmental variation in neural tissues
E-GEOD-93593H1 ESC → ventral differentiation (125-day protocol)1,733Neuronal lineage commitment
E-GEOD-75140Human cerebral organoids (neocortex development)734Neocortical development recapitulation

Cell-type-specific patterns: MAPT is enriched in developing neural progenitor cells and differentiating neurons, consistent with its role as a neuronal cytoskeletal protein. Expression peaks in the lateral ganglionic eminence during critical developmental stages (PCW 7–11, corresponding to interneuron migration and specification). Strong expression in iPSC-derived neurons indicates MAPT is a marker of neuronal commitment and maturation. In organoid studies, MAPT expression tracks with neural cell identity and developmental progression.

Expression characteristics

  • Developmental profile: High expression during fetal neurogenesis and brain development; maintained in adult neurons
  • Neuronal enrichment: Expressed in neural progenitors, neurons, and glial progenitors; lower in non-neural tissues despite ubiquitous classification
  • Adult brain regionalization: Highest in cortex, hippocampus, substantia nigra, and other regions implicated in cognition and motor function
  • Tauopathy relevance: Widespread brain expression explains MAPT mutation involvement in tauopathies (Frontotemporal dementia, corticobasal degeneration, progressive supranuclear palsy)

Disease associations

Mendelian / Monogenic Diseases

MAPT mutations cause several neurodegenerative tauopathies with autosomal dominant (or recessive) inheritance:

DiseaseOMIMMondoOrphanetInheritanceEvidence Level
Late-onset Parkinson disease168600MONDO:0008199Autosomal dominantStrong (Genomics England PanelApp)
Pick disease172700MONDO:0008243100069, 282Autosomal dominantStrong
Semantic dementia600274MONDO:0010857100069, 100070, 275864, 282Autosomal dominantStrong
Supranuclear palsy, progressive, 1 (PSP-P)601104MONDO:0010997, MONDO:0019037683, 240071Autosomal dominantStrong
Progressive supranuclear palsy-parkinsonism syndrome260540MONDO:0009839683, 240085, 240094, 240103, 240112, 99750Autosomal recessiveModerate-Limited

Phenotype Associations (HPO)

Top 30 clinical phenotypes associated with MAPT mutations:

  1. HP:0000605 — Supranuclear gaze palsy
  2. HP:0000726 — Dementia
  3. HP:0001300 — Parkinsonism
  4. HP:0002145 — Frontotemporal dementia
  5. HP:0002185 — Neurofibrillary tangles
  6. HP:0002354 — Memory impairment
  7. HP:0000511 — Vertical supranuclear gaze palsy
  8. HP:0002063 — Rigidity
  9. HP:0002067 — Bradykinesia
  10. HP:0006892 — Frontotemporal cerebral atrophy
  11. HP:0002141 — Gait imbalance
  12. HP:0002172 — Postural instability
  13. HP:0001337 — Tremor
  14. HP:0001288 — Gait disturbance
  15. HP:0001260 — Dysarthria
  16. HP:0002015 — Dysphagia
  17. HP:0002381 — Aphasia
  18. HP:0000727 — Frontal lobe dementia
  19. HP:0000738 — Hallucinations
  20. HP:0000726 — Dementia
  21. HP:0000737 — Irritability
  22. HP:0000741 — Apathy
  23. HP:0000743 — Frontal release signs
  24. HP:0000751 — Personality changes
  25. HP:0001268 — Mental deterioration
  26. HP:0012444 — Brain atrophy
  27. HP:0012671 — Abulia
  28. HP:0007164 — Slurred speech
  29. HP:0010522 — Dyslexia
  30. HP:0011098 — Speech apraxia

Complex Disease / GWAS Associations

Top 30 GWAS-associated traits with MAPT loci (p < 10⁻⁸):

Neurodegenerative / Cognitive (strongest)

  1. Parkinson’s disease (p = 1×10⁻⁶⁸, GCST004902)
  2. Progressive supranuclear palsy (p = 2×10⁻¹²² to 4×10⁻¹¹³, multiple studies)
  3. Corticobasal degeneration (p = 1×10⁻¹², GCST002971)
  4. Plasma t-tau levels (p = 1×10⁻¹², GCST004092)
  5. General cognitive ability (p = 1×10⁻¹³ to 3×10⁻⁸, multiple studies)
  6. Cognitive ability (p = 8×10⁻⁸, GCST005142)
  7. Cognitive performance (processing speed) (p = 9×10⁻⁶, GCST008891)
  8. Reaction time (p = 1×10⁻¹⁵ to 6×10⁻¹⁵, multiple studies)

Neuroticism / Psychiatric 9. General factor of neuroticism (p = 5×10⁻¹⁵ to 1×10⁻⁹, multiple studies) 10. Alcohol use disorder (consumption score) (p = 2×10⁻¹⁴, GCST012336) 11. Alcohol consumption (drinks per week) (p = 5×10⁻²³, GCST008757) 12. Experiencing mood swings (p = 2×10⁻¹⁰, with WNT3) 13. Feeling hurt (p = 1×10⁻¹¹, with WNT3) 14. Mood instability (p = 7×10⁻²⁰, GCST008357)

Morphometric / Hematologic 15. Red blood cell count (p = 9×10⁻⁶⁶, GCST90002403) 16. Hemoglobin levels (p = 1×10⁻⁶¹, GCST010083) 17. Mean spheric corpuscular volume (p = 2×10⁻⁵³, GCST90002397) 18. Mean reticulocyte volume (p = 5×10⁻²⁸, GCST90002396) 19. Waist-to-hip ratio adjusted for BMI (p = 5×10⁻¹³ to 2×10⁻⁸, multiple studies) 20. A body shape index (p = 5×10⁻¹² to 1×10⁻¹⁰, multiple studies) 21. Waist-hip index (p = 2×10⁻¹² to 2×10⁻⁸, multiple studies) 22. Hip circumference adjusted for BMI (p = 9×10⁻⁹, GCST90020028)

Respiratory / Pulmonary 23. Idiopathic pulmonary fibrosis (p = 6×10⁻⁹ to 3×10⁻¹⁶, MAPT-AS1, GCST009758) 24. Interstitial lung disease (p = 9×10⁻¹⁴, GCST001968)

Cardiovascular 25. Atrial fibrillation (p = 4×10⁻⁹, GCST006061) 26. QRS complex duration (p = 2×10⁻¹⁴, GCST003872)

Metabolic / Other 27. Primary biliary cirrhosis (p = 2×10⁻⁹, GCST005581) 28. Osteoarthrosis (p = 3×10⁻⁸, GCST010771) 29. Male-pattern baldness (p = 1×10⁻²⁶ to 8×10⁻²⁹, multiple studies) 30. Type 1 diabetes (p = 2×10⁻¹⁵, GCST90013445)

Structured Data Sources

Generated with Claude Haiku 4.5 + BioBTree MCP, drawing on data BioBTree aggregates from 44 biological databases. Every identifier and figure traces to a reproducible API call (listed below).

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, antibody, bgee, biogrid_interaction, ccds, chembl_molecule, chembl_target, clinical_trials, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, fantom5_promoter, gencc, go, gtex, gwas, hgnc, hpa, hpo, intact, interpro, jaspar, mim, mondo, msigdb, orphanet, ortholog, orthologentrez, pdb, pfam, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-05-25 — For the latest data, query BioBTree directly via MCP or API.
View API calls (156)