MAPT Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human MAPT. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human MAPT. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 35 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, bgee_evidence, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, fantom5_promoter, gencc, go, gwas, hgnc, hpo, intact, interpro, mim, orphanet, ortholog, pdb, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-03-27 — For the latest data, query BioBTree directly via MCP or API.
View API calls (47)
MAPT

MAPT - COMPREHENSIVE CROSS-DATABASE IDENTIFIER AND FUNCTIONAL MAPPING REFERENCE Microtubule Associated Protein Tau

Section 1: Gene Identifiers

DatabaseIdentifierNotes
HGNCHGNC:6893Approved symbol: MAPT
EnsemblENSG00000186868Human gene
NCBI Entrez4137
OMIM157140Gene/locus entry
RefSeq GenomicNC_000017.11
Full Name: microtubule associated protein tau Genomic Location:
AttributeValue
Chromosome17
Cytoband17q21.31
Start Position45,894,358
End Position46,028,334
Strand+ (positive)
Gene Length133,976 bp
Gene AliasesLocus TypeGene GroupsTotal Transcripts
MTBT1, tau, PPND, FTDP-17, TAU, MSTD, MTBT2, FLJ31424, MGC138549, PPP1R103, tau-40, Tau-PHF6, DDPAC, MAPTLprotein-coding geneProtein phosphatase 1 regulatory subunits
Transcript Identifiers**52
Ensembl Transcripts (52 total)
Transcript IDBiotypeStartEndLength
ENST00000262410protein_coding45,894,55446,028,334133,780 bp
ENST00000334239protein_coding45,894,66846,024,183129,515 bp
ENST00000344290protein_coding45,894,56046,024,425129,865 bp
ENST00000351559protein_coding45,894,55446,028,334133,780 bp
ENST00000415613protein_coding45,962,33846,024,17161,833 bp
ENST00000420682protein_coding45,894,56046,024,659130,099 bp
ENST00000431008protein_coding45,962,33846,024,17161,833 bp
ENST00000446361protein_coding45,894,67446,028,334133,660 bp
ENST00000535772protein_coding45,894,55146,024,225129,674 bp
ENST00000570299protein_coding_CDS_not_defined45,894,57646,018,730124,154 bp
ENST00000571311nonsense_mediated_decay45,894,56645,978,42583,859 bp
ENST00000571987protein_coding45,962,33846,024,17161,833 bp
ENST00000572440retained_intron45,975,49245,980,5065,014 bp
ENST00000574436protein_coding45,962,33846,024,17161,833 bp
ENST00000576518retained_intron45,972,78346,024,43151,648 bp
ENST00000577017protein_coding_CDS_not_defined45,985,65945,987,0951,436 bp
ENST00000680542nonsense_mediated_decay45,894,66346,025,873131,210 bp
ENST00000680674protein_coding45,894,52746,024,655130,128 bp
ENST00000703922-ENST00000972120variousvariousvarious-
Additional 33 protein_coding and alternative transcripts... RefSeq Transcripts - Human (mRNA)
AccessionSymbolStatusMANE Select
NM_001377265MAPTREVIEWED✓ MANE Select
NM_001123066MAPTREVIEWED
NM_001123067MAPTREVIEWED
NM_001203251MAPTREVIEWED
NM_001203252MAPTREVIEWED
NM_001377266MAPTREVIEWED
NM_001377267MAPTREVIEWED
NM_001377268MAPTREVIEWED
NM_005910MAPTREVIEWED
NM_016834MAPTREVIEWED
NM_016835MAPTREVIEWED
NM_016841MAPTREVIEWED
CCDS IDs (10 total)
CCDS ID
CCDS11499
CCDS11500
CCDS11501
CCDS11502
CCDS45715
CCDS45716
CCDS56033
CCDS92345
CCDS92346
CCDS92347
Exons for Representative Transcript (ENST00000344290) Total Exons: 11
Exon IDStartEndStrand
ENSE0000390319445,894,56045,894,686+
ENSE0000354996645,962,32145,962,470+
ENSE0000359889245,971,85945,971,945+
ENSE0000365889445,978,37545,978,440+
ENSE0000267239545,982,86645,983,930+
ENSE0000361133545,987,04045,987,095+
ENSE0000368212245,991,46045,991,586+
ENSE0000363405845,996,39945,996,664+
ENSE0000240549146,014,24346,014,324+
ENSE0000357097646,018,61846,018,730+
ENSE0000399031646,023,95646,024,425+

Section 3: Protein Identifiers UniProt Accessions (5 total)

AccessionStatusName
P10636✓ Reviewed (Swiss-Prot)Microtubule-associated protein tau
A0A7I2PJZ2Unreviewed
A0A7I2PLE3Unreviewed
A0A7P0T936Unreviewed
I3L2Z2Unreviewed
Canonical Protein (P10636):
  • Sequence Length: 758 amino acids
  • Molecular Mass: 78,928 Da
  • Alternative Names: Neurofibrillary tangle protein, Paired helical filament-tau (PHF-tau) RefSeq Protein Accessions - Human
AccessionSymbolStatusMANE Select
NP_001364194MAPTREVIEWED✓ MANE Select
NP_001116538MAPTREVIEWED
NP_001116539MAPTREVIEWED
NP_001190180MAPTREVIEWED
NP_001190181MAPTREVIEWED
NP_001364195MAPTREVIEWED
NP_001364196MAPTREVIEWED
NP_001364197MAPTREVIEWED
Protein Domains and Families
InterPro IDShort NameType
IPR001084MAP_tubulin-bd_rptRepeat
IPR002955TauFamily
IPR027324MAP2/MAP4/TauFamily
Pfam
PF00418 (Tau/MAP)
Structure Identifiers**
Experimental Structures Total PDB Structures: 274 Top 50 Representative PDB Structures
PDB IDMethodResolution (Å)Description
5O3LCryo-EM3.4Paired helical filament in Alzheimer's disease brain
5O3TCryo-EM3.4Straight filament in Alzheimer's disease brain
6GX5Cryo-EM3.2Narrow Pick Filament from Pick's disease brain
6NWPCryo-EM2.3Chronic traumatic encephalopathy Type I Tau filament
6NWQCryo-EM3.4CTE Type II Tau filament
6TJOCryo-EM3.2TypeI tau filaments - Corticobasal degeneration
6TJXCryo-EM3.0TypeII tau filaments - Corticobasal degeneration
7P65Cryo-EM2.7Progressive supranuclear palsy tau filament
7P66Cryo-EM3.0Globular glial tauopathy type 1 tau filament
7P6ACryo-EM1.9Limbic-predominant neuronal inclusion body 4R tauopathy
7P6DCryo-EM3.3Argyrophilic grain disease type 1 tau filament
8CAQCryo-EM2.3Tau filaments Type I - Subacute Sclerosing Panencephalitis
8OT6Cryo-EM2.0CTE typeI tau filament from Guam ALS/PDC
8P34Cryo-EM2.61Tau filaments - DeltaK281 mutation
2MZ7NMR-Structure of Tau(267-312) bound to Microtubules
5N5ANMR-Structure of Tau(254-290) bound to F-actin
2ON9X-ray1.51Amyloid forming peptide VQIVYK
5K7NMicroED1.1VQIVYK peptide at high resolution
6ODGX-ray1.0SVQIVY tau protein fragment
4FL5X-ray1.914-3-3 sigma with Tau pS214 peptide
5E2WX-ray1.5Anti-TAU AT8 FAB with triply phosphorylated TAU peptide
6CVJCryo-EM3.2Synthetic tau bound to microtubule
7PQCCryo-EM4.1tau-microtubule structural ensemble
8FYUX-ray1.85CHIP-TPR domain with acetylated tau peptide
1I8HNMR-PIN1 WW domain with phosphothreonine peptide
Structure Method Summary:
MethodCount
Cryo-EM (Electron Microscopy)~220
X-ray Diffraction~45
NMR~6
Electron Crystallography~3
Predicted Structures AlphaFold Model:
AttributeValue
AlphaFold IDAF-P10636-F1
Sequence Length5544 (all isoforms)
Global pLDDT49.56
Fraction Very High Confidence0.00
Note
MAPT is an intrinsically disordered protein, explaining the low AlphaFold confidence scores
Cross-Species Orthologs**
SpeciesEnsembl Gene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000018411Maptprotein_coding
Rat (Rattus norvegicus)ENSRNOG00000005133Maptprotein_coding
Zebrafish (Danio rerio)ENSDARG00000087616maptbprotein_coding
Zebrafish (Danio rerio)ENSDARG00000089314maptaprotein_coding
Additional Entrez orthologsNote
440 speciesNo direct orthologs found in Drosophila melanogaster, C. elegans, or S. cerevisiae (Tau is a vertebrate-specific protein)
Clinical Variants & AI Predictions**
ClinVar Variant Summary Total ClinVar Variants: 601
ClassificationCount
Pathogenic28
Likely Pathogenic7
Uncertain Significance (VUS)~100
Likely Benign~200
Benign~250
Conflicting~16
Pathogenic Variants (28 total)
Variant IDHGVS NotationConsequenceReview Status
14245c.2078C>T (p.Pro693Leu)MissenseMultiple submitters
14247c.2392C>T (p.Arg798Trp)MissenseMultiple submitters
14252c.2185G>A (p.Val729Met)MissenseMultiple submitters
14253c.2013T>G (p.Asn671Lys)MissenseMultiple submitters
14257c.2064T>C (p.Asn688=)Synonymous/splicingMultiple submitters
14266c.1972C>G (p.Leu658Val)MissenseMultiple submitters
98212c.1999-10G>TIntronic/spliceMultiple submitters
98218c.2091+3G>ASplice donorMultiple submitters
98222c.2091+16C>TIntronic/spliceMultiple submitters
14246c.1991G>T (p.Gly664Val)MissenseNo criteria
14248c.2091+14C>TSplice regionNo criteria
14251c.2091+1G>ASplice donorNo criteria
14254c.2090G>A (p.Ser697Asn)MissenseNo criteria
14256c.2077C>T (p.Pro693Ser)MissenseNo criteria
14258c.2201A>T (p.Glu734Val)MissenseNo criteria
14259c.1946A>C (p.Lys649Thr)MissenseNo criteria
14260c.2282A>T (p.Lys761Ile)MissenseNo criteria
14262c.2135C>T (p.Ser712Phe)MissenseSingle submitter
14263c.14G>T (p.Arg5Leu)MissenseNo criteria
14267c.2231C>T (p.Ser744Leu)MissenseNo criteria
14268c.2126A>T (p.Lys709Met)MissenseNo criteria
14269c.2084G>T (p.Gly695Val)MissenseNo criteria
2573094c.2266C>T (p.Pro756Ser)MissenseSingle submitter
98219c.2091+11T>CSplice regionNo criteria
98221c.2091+13A>GSplice regionNo criteria
98226c.2120T>G (p.Leu707Arg)MissenseNo criteria
98243c.2060ATA[1] (p.Asn688del)In-frame deletionSingle submitter
56444817q21.31 CNV deletionCopy number lossNo criteria
Likely Pathogenic Variants (7 total)
Variant IDHGVS NotationConsequence
2049744c.2091+16C>GSplice region
3075676c.2077C>G (p.Pro693Ala)Missense
3778831c.2027T>A (p.Leu676His)Missense
4056341c.2010TAA[1] (p.Asn671del)In-frame deletion
599620c.2260C>T (p.His754Tyr)Missense
807628c.2266C>A (p.Pro756Thr)Missense
986870c.2242T>A (p.Ser748Thr)Missense
SpliceAI Predictions Total Splice-Affecting Variants: 3,502 Top 50 Highest-Scoring Splice Variants
VariantEffectDelta Score
17:45894684:CAGGT:Cdonor_loss1.0000
17:45894685:AGGTA:Adonor_loss1.0000
17:45894686:GGTA:Gdonor_loss1.0000
17:45894687:G:GAdonor_loss1.0000
17:45894687:G:GGdonor_gain0.9900
17:45894688:T:Adonor_loss0.9900
17:45894683:TCAG:Tdonor_gain0.9600
17:45894685:AG:Adonor_gain0.9500
17:45894686:GG:Gdonor_gain0.9500
17:45894682:ATCAG:Adonor_gain0.9000
17:45894684:CAG:Cdonor_gain0.8900
17:45896223:G:GTdonor_gain0.8100
17:45896237:G:Tdonor_gain0.7600
17:45896223:G:Tdonor_gain0.7400
17:45895687:G:GGdonor_gain0.7000
17:45895686:A:AGdonor_gain0.6900
17:45895470:C:Tdonor_gain0.6700
17:45894956:G:GAdonor_gain0.6400
17:45894955:TGG:Tdonor_gain0.6000
17:45895891:GC:Gdonor_gain0.5900
AlphaMissense Predictions Total Missense Predictions: 4,906 Distribution by Class
ClassApproximate Count
Likely Benign~4,200
Ambiguous~550
Likely Pathogenic~156
Top 50 Likely Pathogenic Missense Predictions
VariantProtein ChangeAM ScoreClassification
17:45962359:T:CF8L0.892likely_pathogenic
17:45962361:C:AF8L0.892likely_pathogenic
17:45962361:C:GF8L0.892likely_pathogenic
17:45962350:C:AR5S0.686likely_pathogenic
17:45962351:G:CR5P0.569likely_pathogenic
Note
Most highly pathogenic mutations cluster in the microtubule-binding repeat region (exons 9-13)
Biological Pathways & Gene Ontology**
Reactome Pathways (3 pathways)
Pathway IDNameDisease Pathway
R-HSA-264870Caspase-mediated cleavage of cytoskeletal proteinsNo
R-HSA-9619483Activation of AMPK downstream of NMDARsNo
R-HSA-9833482PKR-mediated signalingNo
Gene Ontology Annotations Total GO Annotations: 100+ Biological Process (Top 20)
GO IDTerm
GO:0000226microtubule cytoskeleton organization
GO:0001774microglial cell activation
GO:0006974DNA damage response
GO:0007416synapse assembly
GO:0007611learning or memory
GO:0007613memory
GO:0010506regulation of autophagy
GO:0019896axonal transport of mitochondrion
GO:0021954central nervous system neuron development
GO:0031110regulation of microtubule polymerization or depolymerization
GO:0031116positive regulation of microtubule polymerization
GO:0031175neuron projection development
GO:0045773positive regulation of axon extension
GO:0046785microtubule polymerization
GO:0048167regulation of synaptic plasticity
GO:0050808synapse organization
GO:0051260protein homooligomerization
GO:0061564axon development
GO:0097435supramolecular fiber organization
GO:1990000amyloid fibril formation
Molecular Function (Top 20)
GO IDTerm
GO:0003677DNA binding
GO:0003680minor groove of adenine-thymine-rich DNA binding
GO:0003690double-stranded DNA binding
GO:0003697single-stranded DNA binding
GO:0003723RNA binding
GO:0003779actin binding
GO:0004857enzyme inhibitor activity
GO:0008017microtubule binding
GO:0017124SH3 domain binding
GO:0019899enzyme binding
GO:0019901protein kinase binding
GO:0030674protein-macromolecule adaptor activity
GO:0034185apolipoprotein binding
GO:0034452dynactin binding
GO:0035091phosphatidylinositol binding
GO:0042802identical protein binding
GO:0051087protein-folding chaperone binding
GO:0051721protein phosphatase 2A binding
GO:0051879Hsp90 protein binding
GO:0099609microtubule lateral binding
Cellular Component (Top 20)
GO IDTerm
GO:0005576extracellular region
GO:0005634nucleus
GO:0005737cytoplasm
GO:0005739mitochondrion
GO:0005829cytosol
GO:0005874microtubule
GO:0005886plasma membrane
GO:0015630microtubule cytoskeleton
GO:0030424axon
GO:0030425dendrite
GO:0030426growth cone
GO:0030673axolemma
GO:0043005neuron projection
GO:0043025neuronal cell body
GO:0043197dendritic spine
GO:0044304main axon
GO:0045121membrane raft
GO:0045298tubulin complex
GO:0097418neurofibrillary tangle
GO:1904115axon cytoplasm

Section 8: Protein Interactions & Molecular Networks Protein-Protein Interactions STRING Interactions: 5,558 total Top 50 Highest-Confidence Interactors

UniProt IDGeneConfidence Score
P10636MAPT (self)986
P05067APP (Amyloid precursor protein)986
P21796VDAC1974
P02649APOE973
P37840SNCA (Alpha-synuclein)971
Q13148TARDBP970
P06241FYN958
Q00535CDK5948
P49841GSK3B943
P49768PSEN1933
P27348YWHAQ (14-3-3 theta)903
P35637FUS889
P23781UCHL3886
P49810PSEN2883
Q96LT7TMEM106B852
P04156PRNP834
P08247SNAP25828
P17600SYN1826
Q5S007LRRK2826
P22303ACHE811
P04271S100B802
O60260PARK2 (Parkin)790
Q8IWL8SQSTM1786
Q7Z3B3WWOX785
P07900HSP90AA1772
O43189PHF1772
P56817BACE1764
P07196NEFL761
Q9BXM7PIAS4751
Q9NUM4TRIM13750
P34932HSPA4745
P78352DLG4735
Q13492PICALM726
Q9NQ11ATG4A721
O00499BIN1719
P34998CRHR1717
Q14114LRP8713
P06276BCHE711
P42858HTT (Huntingtin)698
P55072VCP696
P23246SFPQ694
P12036NEFH693
P09936UCHL1690
P12931SRC686
P07197NEFM683
Q13501SQSTM1683
P00354GAPDH680
IntAct Interactions Total IntAct Records: 160+ Key Direct Interactions (high confidence):
PartnerTypeScore
APPDirect interaction0.75
HSPA8Direct interaction0.67
MAPT (self)Direct interaction0.67
STUB1Physical association0.52
FYNPhysical association0.59
GSK3BPhysical association0.50
AKT1Physical association0.50
EGFRPhysical association0.55
Protein Similarity ESM2 Structural Similarity (76 proteins)
UniProt IDTop SimilarityAvg Similarity
Q5YCW01.00000.9892
Q5YCW11.00000.9891
P10636 (self)0.99990.9890
Q5S6V20.99990.9890
Q5YCV90.99990.9890
P10637 (MAP2)0.99970.9895
P193320.99970.9896
O887780.99960.9878
O887370.99960.9870
Q3ZB980.99960.9833
DIAMOND Sequence Homology (19 proteins)
UniProt IDTop Identity (%)Top Bitscore
P10636 (self)95.01070
Q5YCW099.01096
Q5YCW199.01093
Q5S6V294.51072
Q5YCV993.61067
P10637 (MAP2)90.6966
P1933290.9968
O0282889.8447
P11137 (MAP4)79.32385
P1514689.02796
P2035789.02806

Section 9: Transcription Factor Regulatory Data MAPT is NOT a transcription factor Upstream Regulators (TFs regulating MAPT)

TF GeneRegulationSource
ARActivates MAPTCollecTRI
DLX3UnknownCollecTRI
TFAP2ARepresses MAPTCollecTRI
Downstream Targets (Genes regulated BY MAPT)
Target GeneRegulation
FOSRepressed by MAPT
Note: MAPT's primary function is microtubule stabilization, not transcriptional regulation SIGNOR Kinase Regulation of MAPT (138 entries) Key Kinases Phosphorylating MAPT:
KinaseEffectMechanismScore
CDK5Down-regulates activityPhosphorylation0.76
GSK3BDown-regulates activityPhosphorylation0.74
CDK5/CDK5R1Down-regulates activityPhosphorylation0.70
CAMK2ADown-regulates activityPhosphorylation0.60
MAPK1Down-regulates activityPhosphorylation0.57
FYNDown-regulatesPhosphorylation0.54
LRRK2Down-regulatesPhosphorylation0.53
MAPK12Down-regulates activityPhosphorylation0.53
SYKDown-regulatesPhosphorylation0.47
TTBK1Down-regulatesPhosphorylation0.45
DYRK1ADown-regulatesPhosphorylation0.43
MARK1Down-regulates activityPhosphorylation0.43
PRKACADown-regulates activityPhosphorylation0.43
GSK3ADown-regulates activityPhosphorylation0.43
CSNK1DDown-regulatesPhosphorylation0.37
SGK1Down-regulatesPhosphorylation0.33
CAPN1Down-regulates activityCleavage0.33
MAPK14Up-regulatesPhosphorylation0.32
PHKG1Down-regulates activityPhosphorylation0.31
CHEK2Down-regulatesPhosphorylation0.20
Phosphatases Dephosphorylating MAPT:
PhosphataseEffectScore
PP2B (Calcineurin)Up-regulates0.20
PPP2R1A (PP2A)Down-regulates0.20

Section 10: Drug & Pharmacology Data ChEMBL Target Information

ChEMBL Target IDNameType
CHEMBL1293224Microtubule-associated protein tauSINGLE PROTEIN
CHEMBL4296122KEAP1/Microtubule-associated protein tauPPI
CHEMBL4296123KEAP1/Cullin-3/Microtubule-associated protein tauPPI
CHEMBL4296124Cullin-3/Microtubule-associated protein tauPPI
CHEMBL6066025Protein cereblon/Microtubule-associated protein tauPPI (PROTAC)
Total Molecules Targeting MAPT: 95,343+ activity records Molecules by Development Phase Phase 4 (Approved) Molecules with Tau Activity: 100+
ChEMBL IDNameType
CHEMBL1008BEPRIDILSmall molecule
CHEMBL101PHENYLBUTAZONESmall molecule
CHEMBL103PROGESTERONESmall molecule
CHEMBL104CLOTRIMAZOLESmall molecule
CHEMBL11IMIPRAMINESmall molecule
CHEMBL112ACETAMINOPHENSmall molecule
CHEMBL1112ARIPIPRAZOLESmall molecule
CHEMBL11359CISPLATINSmall molecule
CHEMBL117785TETRABENAZINESmall molecule
CHEMBL135ESTRADIOLSmall molecule
CHEMBL1351CARBOPLATINSmall molecule
CHEMBL1083993AMIODARONESmall molecule
CHEMBL1200503BROMOCRIPTINESmall molecule
CHEMBL1200517DIHYDROERGOTAMINESmall molecule
CHEMBL1200561DOXAZOSINSmall molecule
Phase 3 Molecules:
ChEMBL IDNameType
CHEMBL10316IDAZOXANSmall molecule
CHEMBL109480TANESPIMYCINSmall molecule
CHEMBL1130DOXIFLURIDINESmall molecule
CHEMBL119443ERGONOVINESmall molecule
CHEMBL1255653SEPIAPTERINSmall molecule
CHEMBL135400ZOPICLONESmall molecule
PharmGKB Information
AttributeValue
PharmGKB IDPA238
VIP GeneYes
CPIC GuidelineNo
VIP = Very Important Pharmacogene Binding Data
BindingDB Entries
675 binding records
Expression Profiles**
Bgee Expression Summary
AttributeValue
Expression BreadthUbiquitous
Total Present Calls141
Max Expression Score99.52
Top 30 Tissues by Expression Score
TissueExpression ScoreRank
Cortical plate99.52226
Superior frontal gyrus98.76580
Prefrontal cortex98.51694
Frontal cortex98.45724
Frontal lobe98.45724
Right frontal lobe98.32783
Cerebellum98.29797
Right hemisphere of cerebellum98.29796
Cerebellar cortex98.28802
Cerebellar hemisphere98.28804
Brodmann area 998.08898
Primary visual cortex98.03921
Dorsolateral prefrontal cortex97.88989
Cerebral cortex97.87995
Anterior cingulate cortex97.391220
Hypothalamus97.341240
Ammon's horn (Hippocampus)97.331240
Temporal lobe97.321250
Amygdala97.291270
Brain96.911440
Putamen96.681550
Corpus callosum96.611580
Caudate nucleus96.581600
Substantia nigra96.451660
Nucleus accumbens96.361700
Embryo95.991870
Ganglionic eminence95.991870
Gastrocnemius95.592050
Skeletal muscle tissue95.522090
Skeletal muscle organ94.642500
Expression pattern: Predominantly neuronal, highest in cortex, cerebellum, and basal ganglia FANTOM5 Promoter Data Total Promoters: 10
Promoter IDTPM AverageSamples Expressed
16130623.34717
1613055.18597
1613073.08334
1613091.63147
1613080.36101
1613040.2777
1613110.2190
1613170.0631
1613200.0421
1613100.0311
Single-Cell Expression Data (6 datasets)
Experiment IDDescriptionCells
E-MTAB-8894Human fetal lateral ganglionic eminence150,129
E-MTAB-9154iPSC derived neurons under genetic/toxic stressors36,117
E-HCAD-5Organoids modeling inter-individual variation25,049
E-MTAB-10485Neural progenitors during brain development18,649
E-GEOD-93593125-day H1 hESC differentiation to ventral cell types1,733
E-GEOD-75140Cerebral organoids recapitulating fetal neocortex734

Section 12: Disease Associations Mendelian/Monogenic Disease Links (GenCC)

DiseaseClassificationInheritanceSubmitter
Pick diseaseStrongAutosomal dominantGenomics England, Invitae
Supranuclear palsy, progressive, 1StrongAutosomal dominantGenomics England
Semantic dementiaStrongAutosomal dominantGenomics England
Late-onset Parkinson diseaseStrong/LimitedAutosomal dominantGenomics England
Progressive supranuclear palsy-parkinsonismModerate/LimitedAutosomal recessiveGenomics England
Orphanet Disease Associations (8 diseases)
Orphanet IDDisease NameTypePhenotypes
275864Behavioral variant of frontotemporal dementiaDisease40
100070Progressive non-fluent aphasiaDisease32
240071Classic progressive supranuclear palsy syndromeClinical subtype28
240103PSP-corticobasal syndromeClinical subtype30
240094PSP-pure akinesia with gait freezingClinical subtype21
240112PSP-progressive non-fluent aphasia syndromeClinical subtype21
240085PSP-predominant parkinsonism syndromeClinical subtype20
100069Semantic dementiaDisease10
HPO Phenotype Associations (160 phenotypes) Top 50 Phenotypes
HPO IDPhenotype
HP:0000006Autosomal dominant inheritance
HP:0000726Dementia
HP:0002145Frontotemporal dementia
HP:0001300Parkinsonism
HP:0002185Neurofibrillary tangles
HP:0000738Hallucinations
HP:0000716Depression
HP:0000741Apathy
HP:0001260Dysarthria
HP:0001288Gait disturbance
HP:0002063Rigidity
HP:0002067Bradykinesia
HP:0002172Postural instability
HP:0002381Aphasia
HP:0002354Memory impairment
HP:0002059Cerebral atrophy
HP:0001332Dystonia
HP:0001337Tremor
HP:0002015Dysphagia
HP:0000511Vertical supranuclear gaze palsy
HP:0000605Supranuclear gaze palsy
HP:0000514Slow saccadic eye movements
HP:0002300Mutism
HP:0002186Apraxia
HP:0000751Personality changes
HP:0000734Disinhibition
HP:0000719Inappropriate behavior
HP:0002359Frequent falls
HP:0002317Unsteady gait
HP:0002304Akinesia
HP:0007354Amyotrophic lateral sclerosis
HP:0002446Astrocytosis
HP:0002171Gliosis
HP:0002529Neuronal loss in central nervous system
HP:0003581Adult onset
HP:0003584Late onset
HP:0003596Middle age onset
HP:0003676Progressive
HP:0006892Frontotemporal cerebral atrophy
HP:0007112Temporal cortical atrophy
HP:0012444Brain atrophy
HP:0100256Senile plaques
HP:0100315Lewy bodies
HP:0011098Speech apraxia
HP:0002439Frontolimbic dementia
HP:0002427Expressive aphasia
HP:0002463Language impairment
HP:0002380Fasciculations
HP:0031825Freezing of gait
HP:0002322Resting tremor
GWAS Associations (124 associations) Top 30 GWAS Traits
Study IDTraitP-valueMapped Gene
GCST001116_8Progressive supranuclear palsy2×10⁻¹¹⁸MAPT
GCST004902_11Parkinson's disease1×10⁻⁶⁸MAPT
GCST001445_4Parkinson's disease8×10⁻⁵²-
GCST002544_5Parkinson's disease2×10⁻⁴⁸MAPT
GCST005116_49Male-pattern baldness1×10⁻²⁶MAPT
GCST005232_115Neuroticism8×10⁻²⁶LINC02210-CRHR1
GCST007328_48Alcohol consumption8×10⁻²⁵MAPT-AS1
GCST008757_26Alcohol consumption5×10⁻²³MAPT
GCST006418_5Progressive supranuclear palsy4×10⁻¹¹³-
GCST007845_1Progressive supranuclear palsy2×10⁻²²MAPT
GCST001116_6Progressive supranuclear palsy9×10⁻¹⁸MAPT
GCST000959_11Parkinson's disease1×10⁻²⁸LINC02210
GCST000528_6Parkinson's disease2×10⁻¹⁶LINC02210-CRHR1
GCST010083_316Hemoglobin levels1×10⁻⁶¹MAPT
GCST090002403_409Red blood cell count9×10⁻⁶⁶MAPT
GCST010703_137Brain morphology2×10⁻⁶⁵KANSL1
GCST001483_2Intracranial volume8×10⁻¹⁵KANSL1
GCST005337_8Headache6×10⁻¹⁵LINC02210-CRHR1
GCST006268_366Reaction time6×10⁻¹⁵MAPT
GCST001968_12Interstitial lung disease9×10⁻¹⁴MAPT
GCST003872_11QRS complex2×10⁻¹⁴MAPT
GCST006269_6General cognitive ability1×10⁻¹³MAPT
GCST007938_6Medication use (anilides)8×10⁻¹³MAPT
GCST002971_2Corticobasal degeneration1×10⁻¹²LINC02210
GCST004092_1Plasma t-tau levels1×10⁻¹²MAPT
GCST006269_191General cognitive ability1×10⁻¹¹MAPT
GCST001482_17Lumbar spine bone mineral density5×10⁻¹¹MAPT
GCST012228_609Waist-hip index6×10⁻¹¹MAPT
GCST000855_1Parkinson's disease7×10⁻¹²MAPT
GCST010427_22Left-right brain asymmetry1×10⁻²⁰MAPT
SUMMARY STATISTICS
CategoryCount
Total Transcripts52
RefSeq mRNA (Human)12
CCDS IDs10
UniProt Entries5 (1 reviewed)
PDB Structures274
InterPro Domains3
ClinVar Variants601
Pathogenic Variants28
Likely Pathogenic7
SpliceAI Predictions3,502
AlphaMissense Predictions4,906
GO Annotations100+
Reactome Pathways3
STRING Interactions5,558
IntAct Records160+
SIGNOR Regulations138
HPO Phenotypes160
GWAS Associations124
Orphanet Diseases8
ChEMBL Activities95,343+
FANTOM5 Promoters10
Single-Cell Datasets6
Orthologs4 (vertebrates only)
DATA SOURCES
  • HGNC - HUGO Gene Nomenclature Committee

  • Ensembl - Genome browser (release 113)

  • NCBI Entrez Gene - Gene database

  • OMIM - Online Mendelian Inheritance in Man

  • UniProt - Protein sequence and function

  • RefSeq - Reference sequences

  • CCDS - Consensus Coding Sequence

  • PDB - Protein Data Bank

  • AlphaFold - Protein structure predictions

  • ClinVar - Clinical variant interpretations

  • SpliceAI - Splice variant predictions

  • AlphaMissense - Missense pathogenicity predictions

  • Gene Ontology - Functional annotations

  • Reactome - Biological pathways

  • STRING - Protein-protein interactions

  • IntAct - Molecular interactions

  • SIGNOR - Signaling network

  • CollecTRI - Transcription factor interactions

  • ChEMBL - Drug and bioactivity data

  • PharmGKB - Pharmacogenomics

  • Bgee - Gene expression

  • FANTOM5 - Transcription start sites

  • HPO - Human Phenotype Ontology

  • GenCC - Gene-disease validity

  • Orphanet - Rare diseases

  • GWAS Catalog - Genome-wide association studies Reference compiled from Biobtree integrated biological databases Date: 2026-03-27 I have compiled a comprehensive cross-database identifier and functional mapping reference for human MAPT (Microtubule Associated Protein Tau). Key highlights:

  • Gene Location: Chromosome 17q21.31 (45,894,358-46,028,334, + strand)

  • Identifiers: HGNC:6893, ENSG00000186868, Entrez 4137, OMIM 157140

  • Transcripts: 52 Ensembl transcripts, 12 human RefSeq mRNAs, MANE Select: NM_001377265

  • Protein: P10636 (758 aa, 79 kDa), intrinsically disordered

  • Structures: 274 PDB entries (predominantly Cryo-EM of disease-associated fibrils)

  • Clinical Variants: 601 ClinVar entries (28 pathogenic, 7 likely pathogenic)

  • AI Predictions: 3,502 SpliceAI variants, 4,906 AlphaMissense predictions

  • Interactions: 5,558 STRING interactors, key partners include APP, GSK3B, CDK5, APOE

  • Drug Data: 95,343+ ChEMBL activity records, designated VIP gene in PharmGKB

  • Expression: Highest in brain (cortex, cerebellum, hippocampus, substantia nigra)

  • Diseases: Frontotemporal dementia, Pick disease, PSP, Parkinson’s disease, Alzheimer’s disease (via neurofibrillary tangles)

  • GWAS: 124 associations including PSP (p=2×10⁻¹¹⁸), Parkinson’s (p=1×10⁻⁶⁸)