MECP2 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human MECP2. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human MECP2. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 34 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, biogrid_interaction, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, jaspar, mim, orphanet, ortholog, pdb, pharmgkb_gene, reactome, refseq, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-03-27 — For the latest data, query BioBTree directly via MCP or API.
View API calls (46)
MECP2
Gene
Methyl-CpG Binding Protein 2
Gene Identifiers**
DatabaseIdentifierDetails
HGNCHGNC:6990Approved symbol: MECP2
Ensembl GeneENSG00000169057GRCh38
NCBI Entrez Gene4204protein-coding
OMIM300005Gene/locus
Full Gene Name methyl-CpG binding protein 2 Previous Symbols RTT, MRX16, MRX79 Previous Names
  • Rett syndrome
  • mental retardation, X-linked 16
  • mental retardation, X-linked 79 Gene Group Methyl-CpG binding domain containing Locus Information
AttributeValue
Cytogenetic BandXq28
ChromosomeX
Start Position154,021,573
End Position154,137,103
StrandMinus (-)
Gene Length115,531 bp
Genome AssemblyGRCh38

Section 2: Transcript Identifiers Ensembl Transcripts (Total: 30)

Transcript IDBiotypeStartEndStrand
ENST00000303391protein_coding154,021,573154,097,717-
ENST00000453960protein_coding154,021,573154,097,717-
ENST00000407218protein_coding154,030,605154,097,689-
ENST00000415944protein_coding154,030,234154,097,136-
ENST00000628176protein_coding154,030,239154,097,737-
ENST00000630151protein_coding154,021,573154,094,476-
ENST00000637917protein_coding154,030,303154,097,737-
ENST00000369957nonsense_mediated_decay154,031,370154,097,663-
ENST00000674996nonsense_mediated_decay154,031,315154,097,625-
ENST00000675526nonsense_mediated_decay154,031,315154,097,625-
ENST00000713611nonsense_mediated_decay154,031,315154,097,625-
ENST00000481807retained_intron154,032,434154,040,103-
ENST00000486506retained_intron154,031,269154,060,358-
ENST00000629277retained_intron154,094,966154,097,523-
ENST00000675841retained_intron154,056,397154,092,307-
ENST00000460227protein_coding_CDS_not_defined154,032,435154,049,481-
ENST00000463644protein_coding_CDS_not_defined154,032,435154,049,481-
ENST00000488293protein_coding_CDS_not_defined154,032,435154,049,479-
ENST00000496908protein_coding_CDS_not_defined154,032,373154,096,969-
ENST00000611468protein_coding_CDS_not_defined154,032,466154,092,307-
ENST00000625300protein_coding_CDS_not_defined154,032,459154,034,174-
ENST00000626422protein_coding_CDS_not_defined154,032,459154,034,174-
ENST00000627864protein_coding_CDS_not_defined154,090,802154,097,680-
ENST00000631210protein_coding_CDS_not_defined154,032,373154,137,103-
ENST00000637467protein_coding_CDS_not_defined154,092,232154,094,780-
ENST00000637533protein_coding_CDS_not_defined154,032,475154,097,025-
ENST00000637791protein_coding_CDS_not_defined154,033,994154,092,261-
ENST00000638041protein_coding_CDS_not_defined154,057,148154,057,810-
ENST00000676382protein_coding_CDS_not_defined154,032,464154,097,625-
ENST00000700484protein_coding_CDS_not_defined154,032,495154,097,625-
RefSeq Transcripts - Human mRNA (NM_) Accessions
RefSeq IDStatusMANE SelectChromosome
NM_001110792REVIEWEDYes (Canonical)X
NM_004992REVIEWEDNoX
NM_001316337REVIEWEDNoX
NM_001369391REVIEWEDNoX
NM_001369392REVIEWEDNoX
NM_001369393REVIEWEDNoX
NM_001369394REVIEWEDNoX
NM_001386137REVIEWEDNoX
NM_001386138REVIEWEDNoX
NM_001386139REVIEWEDNoX
CCDS Identifiers (Total: 2)
CCDS ID
CCDS14741
CCDS48193
Canonical Transcript Exons (ENST00000303391) Total Exon Count: 4
Exon IDStartEndStrandRegion
ENSE00001451321154,097,604154,097,717-X
ENSE00003539912154,092,184154,092,307-X
ENSE00003772050154,032,207154,032,557-X
ENSE00004020471154,021,573154,031,450-X

Section 3: Protein Identifiers UniProt Accessions

AccessionNameStatusMassLength
P51608Methyl-CpG-binding protein 2Reviewed (Swiss-Prot)52,441 Da486 aa
RefSeq Protein Accessions (Human NP_)
RefSeq IDStatusMANE Select
NP_001104262REVIEWEDYes
NP_004983REVIEWEDNo
NP_001303266REVIEWEDNo
NP_001356320REVIEWEDNo
NP_001356321REVIEWEDNo
NP_001356322REVIEWEDNo
NP_001356323REVIEWEDNo
NP_001373066REVIEWEDNo
NP_001373067REVIEWEDNo
NP_001373068REVIEWEDNo
Protein Domains and Families (Total: 4)
InterPro IDNameType
IPR001739Methyl_CpG_DNA-bdDomain
IPR016177DNA-bd_dom_sfHomologous Superfamily
IPR017353Me_CpG-bd_MeCP2Family
IPR045138MeCP2/MBD4Family

Section 4: Structure Identifiers Experimental Structures (Total: 9 PDB entries)

PDB IDMethodResolutionTitle
6OGKX-RAY DIFFRACTION1.65 ÅMeCP2 MBD in complex with DNA
6OGJX-RAY DIFFRACTION1.80 ÅMeCP2 MBD in complex with DNA
6YWWX-RAY DIFFRACTION2.10 ÅMeCP2 microsatellite binding protein protecting CA repeats
5BT2X-RAY DIFFRACTION2.20 ÅMeCP2 MBD domain (A140V) with methylated DNA
6C1YX-RAY DIFFRACTION2.30 ÅMBD of human MeCP2 with methylated DNA
3C2IX-RAY DIFFRACTION2.50 ÅMBD of human MeCP2 with methylated BDNF sequence
1QK9SOLUTION NMR-Solution structure of MeCP2 domain binding methylated DNA
8AJRSOLUTION NMR-Triple mutant MBD from MeCP2
8ALQSOLUTION NMR-Triple mutant MBD binding asymmetrically modified DNA
Predicted Structures
Model IDGlobal pLDDTSequence LengthFraction Very High Confidence
AF-P51608-F1 (AlphaFold)57.253,6830.09

Section 5: Cross-Species Orthologs

OrganismEnsembl Gene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000031393Mecp2protein_coding
Rat (Rattus norvegicus)ENSRNOG00000056659Mecp2protein_coding
Zebrafish (Danio rerio)ENSDARG00000014218mecp2protein_coding
Fruit fly (D. melanogaster)No ortholog found--
Worm (C. elegans)No ortholog found--
Yeast (S. cerevisiae)No ortholog found--

Section 6: Clinical Variants & AI Predictions ClinVar Variant Summary Total Variants: 2,081

ClassificationCount (approx.)
Pathogenic~200+
Likely Pathogenic~200+
Uncertain Significance (VUS)~300+
Likely Benign~400+
Benign~50+
Conflicting~100+
TOP 50 Pathogenic Variants
ClinVar IDHGVS NotationTypeReview Status
11809c.433C>T (p.Arg145Cys)SNVExpert panel
11819c.916C>T (p.Arg306Ter)SNVExpert panel
11824c.952C>T (p.Arg318Cys)SNVExpert panel
11835c.334C>G (p.Leu112Val)SNVExpert panel
143345c.1174_1199del (p.Val392fs)DeletionExpert panel
143406c.1200_1243del (p.Pro400_Pro401insTer)DeletionExpert panel
143485c.1490_1493del (p.Val497fs)DeletionExpert panel
143490c.1497A>G (p.Ter499Trp)SNVExpert panel
11815c.844C>T (p.Arg282Ter)SNVMultiple submitters
11828c.538C>T (p.Arg180Ter)SNVMultiple submitters
11833c.459C>G (p.Tyr153Ter)SNVMultiple submitters
1071768c.74C>G (p.Ser25Ter)SNVMultiple submitters
143304c.136_139del (p.Asp46fs)DeletionMultiple submitters
143307c.1065del (p.Arg356fs)DeletionMultiple submitters
143322c.1115C>A (p.Ser372Ter)SNVMultiple submitters
143328c.144_147del (p.Glu49fs)MicrosatelliteMultiple submitters
143334c.1154C>G (p.Ser385Ter)SNVMultiple submitters
143342c.1171_1178del (p.Pro391fs)DeletionMultiple submitters
143441c.1252C>T (p.Gln418Ter)SNVMultiple submitters
143457c.1344_1345del (p.Gln449fs)DeletionMultiple submitters
143467c.1393C>T (p.Arg465Ter)SNVMultiple submitters
143482c.1486_1489del (p.Arg496fs)MicrosatelliteMultiple submitters
11814c.352C>T (p.Arg118Trp)SNVMultiple submitters
11811c.509C>T (p.Thr170Met)SNVMultiple submitters
1027605c.507C>A (p.Phe169Leu)SNVMultiple submitters
11846c.746del (p.Gly249fs)DeletionMultiple submitters
1301354c.1209_1243del (p.Pro403_Glu404insTer)DeletionMultiple submitters
1419223c.1172_1215del (p.Pro391fs)DeletionMultiple submitters
143351c.1187_1224del (p.Pro396fs)DeletionMultiple submitters
143354c.1188_1231del (p.Pro397fs)DeletionMultiple submitters
143366c.1193_1224del (p.Leu398fs)DeletionMultiple submitters
143372c.1193_1236del (p.Leu398fs)DeletionMultiple submitters
143384c.1196_1202del (p.Pro399fs)DeletionMultiple submitters
143395c.1198_1199delinsTA (p.Pro400Ter)IndelMultiple submitters
143400c.1199_1215del (p.Pro400fs)DeletionMultiple submitters
143401c.1199_1224del (p.Pro400fs)DeletionMultiple submitters
143405c.1200_1242del (p.Pro401fs)DeletionMultiple submitters
143409c.1201_1226del (p.Pro401fs)DeletionMultiple submitters
143411c.1203_1236del (p.Pro402fs)DeletionMultiple submitters
143422c.1216_1241del (p.Glu406fs)DeletionMultiple submitters
1401512c.1199del (p.Pro400fs)DeletionMultiple submitters
1006548c.933_968del (p.Leu313_Val324del)DeletionSingle submitter
11817c.1216G>T (p.Glu406Ter)SNVSingle submitter
11818c.203_204del (p.Pro68fs)DeletionSingle submitter
11834c.1399G>T (p.Glu467Ter)SNVSingle submitter
1071811c.674dup (p.Glu226fs)DuplicationSingle submitter
1073280c.984_1023del (p.Lys329fs)DeletionSingle submitter
1076185c.231del (p.Glu78fs)DeletionSingle submitter
1076937c.954_962del (p.Lys319_Arg321del)DeletionSingle submitter
1206780c.62+2T>ASNV (splice)Single submitter
TOP 50 Likely Pathogenic Variants
ClinVar IDHGVS NotationType
11844c.490C>G (p.Pro164Ala)SNV
143578c.488A>G (p.Asp163Gly)SNV
143585c.505T>A (p.Phe169Ile)SNV
143710c.872C>T (p.Ala291Val)SNV
143742c.946A>G (p.Lys316Glu)SNV
11841c.710C>T (p.Pro237Leu)SNV
1066884c.215G>A (p.Gly72Asp)SNV
1098273c.398A>T (p.Asp133Val)SNV
11810c.500T>C (p.Phe167Ser)SNV
143302c.1040_1073del (p.Gly347fs)Deletion
143303c.1045_1063del (p.Lys349fs)Deletion
143512c.310G>T (p.Gly104Ter)SNV
143519c.64G>T (p.Glu22Ter)SNV
143525c.338C>A (p.Pro113His)SNV
143527c.338C>T (p.Pro113Leu)SNV
143533c.352C>G (p.Arg118Gly)SNV
143538c.367A>G (p.Arg123Gly)SNV
143545c.398A>G (p.Asp133Gly)SNV
143555c.422G>T (p.Gly141Val)SNV
143558c.433C>G (p.Arg145Gly)SNV
(+ many more frameshift deletions/insertions)
AI-Based Variant Predictions SpliceAI Predictions Total Variants: 1,190+
VariantGeneEffectDelta Score
X:154030322:G:GAMECP2donor_gain0.96
X:154030527:G:CMECP2donor_gain0.87
X:154029745:T:AMECP2donor_gain0.80
X:154030320:C:CTMECP2donor_gain0.79
X:154030321:T:TTMECP2donor_gain0.79
X:154030319:G:CMECP2donor_gain0.78
X:154028870:C:CTMECP2donor_gain0.77
X:154028871:T:TTMECP2donor_gain0.77
X:154030317:CAG:CMECP2donor_gain0.73
X:154027626:G:TMECP2donor_gain0.65
X:154030318:AGC:AMECP2donor_gain0.65
X:154025413:G:CMECP2donor_gain0.62
X:154030296:AC:AMECP2donor_gain0.60
X:154030297:CC:CMECP2donor_gain0.60
X:154030294:C:TMECP2donor_gain0.59
X:154030292:C:AMECP2donor_gain0.58
X:154030298:C:CTMECP2donor_gain0.57
X:154030288:AGAGC:AMECP2donor_gain0.57
X:154030290:AGC:AMECP2donor_gain0.54
X:154030940:C:CAMECP2acceptor_gain0.54
AlphaMissense Predictions Total Missense Variants: 3,133
VariantAA ChangePathogenicity ScoreClassification
X:154030370:G:CS486R0.989likely_pathogenic
X:154030370:G:TS486R0.989likely_pathogenic
X:154030381:C:TE483K0.981likely_pathogenic
X:154030380:T:AE483V0.979likely_pathogenic
X:154030397:G:CS477R0.970likely_pathogenic
X:154030399:T:GS477R0.970likely_pathogenic
X:154030374:A:TV485D0.959likely_pathogenic
X:154030402:C:GD476H0.953likely_pathogenic
X:154030380:T:GE483A0.947likely_pathogenic
X:154030383:G:AT482I0.938likely_pathogenic
X:154030401:T:AD476V0.931likely_pathogenic
X:154030371:C:AS486I0.929likely_pathogenic
X:154030411:C:TE473K0.929likely_pathogenic
X:154030380:T:CE483G0.928likely_pathogenic
X:154030374:A:GV485A0.927likely_pathogenic
X:154030402:C:AD476Y0.926likely_pathogenic
X:154030414:C:TE472K0.914likely_pathogenic
X:154030390:C:GE473Q0.887likely_pathogenic
X:154030395:C:GR478P0.890likely_pathogenic
X:154030401:T:GD476A0.890likely_pathogenic

Section 7: Biological Pathways & Gene Ontology Reactome Pathways (Total: 11)

Pathway IDPathway NameDisease Pathway
R-HSA-8986944Transcriptional Regulation by MECP2No
R-HSA-9022692Regulation of MECP2 expression and activityNo
R-HSA-9022699MECP2 regulates neuronal receptors and channelsNo
R-HSA-9022702MECP2 regulates transcription of neuronal ligandsNo
R-HSA-9022707MECP2 regulates transcription factorsNo
R-HSA-9022927MECP2 regulates genes involved in GABA signalingNo
R-HSA-9022534Loss of MECP2 binding ability to 5hmC-DNAYes
R-HSA-9022535Loss of phosphorylation of MECP2 at T308Yes
R-HSA-9022537Loss of MECP2 binding to NCoR/SMRT complexYes
R-HSA-9022538Loss of MECP2 binding ability to 5mC-DNAYes
R-HSA-9725371Nuclear events stimulated by ALK signaling in cancerYes
Gene Ontology Annotations (Total: 66) Biological Process (42 terms) - TOP 20
GO IDTerm
GO:0000122negative regulation of transcription by RNA polymerase II
GO:0045892negative regulation of DNA-templated transcription
GO:0045944positive regulation of transcription by RNA polymerase II
GO:0010629negative regulation of gene expression
GO:0010467gene expression
GO:0031507heterochromatin formation
GO:0044027negative regulation of gene expression via chromosomal CpG island methylation
GO:0071514genomic imprinting
GO:0042551neuron maturation
GO:0016358dendrite development
GO:0007416synapse assembly
GO:0060291long-term synaptic potentiation
GO:0060079excitatory postsynaptic potential
GO:0099191trans-synaptic signaling by BDNF
GO:0043524negative regulation of neuron apoptotic process
GO:0007219Notch signaling pathway
GO:0021549cerebellum development
GO:0035176social behavior
GO:0007616long-term memory
GO:0008344adult locomotory behavior
Molecular Function (15 terms)
GO IDTerm
GO:0008327methyl-CpG binding
GO:0010385double-stranded methylated DNA binding
GO:0003677DNA binding
GO:0003682chromatin binding
GO:1990841promoter-specific chromatin binding
GO:0003676nucleic acid binding
GO:0003723RNA binding
GO:0003729mRNA binding
GO:0035197siRNA binding
GO:0003714transcription corepressor activity
GO:0060090molecular adaptor activity
GO:0140566histone reader activity
GO:0140693molecular condensate scaffold activity
Cellular Component (9 terms)
GO IDTerm
GO:0005634nucleus
GO:0005654nucleoplasm
GO:0000792heterochromatin
GO:0005829cytosol
GO:0005615extracellular space
GO:0005813centrosome
GO:0098794postsynapse

Section 8: Protein Interactions & Molecular Networks Protein-Protein Interactions Summary

DatabaseTotal Interactions
STRING3,950+
IntAct239+
BioGRID1,482+
TOP 50 Highest-Confidence STRING Interactors
UniProt IDGene SymbolConfidence Score
P26358DNMT1997
Q96ST3SIN3A997
Q13547HDAC1996
Q92769HDAC2994
O75376NCOR1992
Q9UKL0RCOR1992
O43463HDAC3991
P16220CREB1988
P46100ATRX970
O76039CDKL5963
P16990PDHA1958
Q9Y6K1DNMT3A945
Q06787FMR1912
Q14739LBR896
Q8WYQ5DGCR8892
P23560BDNF869
Q9UBC3DNMT3B864
P78355CHD4843
Q99259TBL1X838
Q96LA8TBL1XR1824
P55315CBX5823
O75182NCOR2808
Q96T88UHRF1784
P28472GABRB3779
Q6NXT2H3C1774
Q16695H3C3774
P49711CTCF767
Q13127REST763
Q9NZ94NLGN1756
P56178MECP2 (self)752
IntAct Key Interactors (Selected High-Confidence)
Interaction IDPartner GeneInteraction TypeScore
EBI-25480668KPNA3association0.640
EBI-25875954KLHL20physical association0.670
EBI-16061300Ncor1direct interaction0.590
EBI-16061376Tbl1xdirect interaction0.590
EBI-16061633Tbl1xphysical association0.590
EBI-16061328HDAC3association0.530
EBI-21532962GTPBP10association0.530
EBI-10107233PARP1association0.530
EBI-1189084SMARCA2physical association0.520
EBI-16061605Ncor2direct interaction0.520
Protein Similarity ESM2 Structural/Embedding Similarity (29 proteins)
UniProt IDTop SimilarityAverage Similarity
P170961.0000.9347
Q6URC21.0000.9347
Q95LG81.0000.9691
P170950.99980.9333
Q9QXP30.99980.9348
Q9EPH20.99970.9512
P286670.99970.9496
Q0VBZ90.99950.9483
Q005660.99940.9701
Q625110.99940.9706
Q96JP50.99940.9713
Q9Z2D60.99940.9712
P355660.99930.9459
P490060.99930.9448
Q9Y6180.99920.9775
Q9WU420.99920.9772
Q8K5850.99920.9387
Q69Z610.99890.9781
Q96N640.99890.9763
Q9UBB50.99880.9700
DIAMOND Sequence Similarity (12 proteins)
UniProt IDTop IdentityTop Bitscore
P51608 (self)100.0%728
Q95LG8100.0%728
Q9UIS967.4%733
Q9Z2E267.4%734
O9524366.1%674
Q9Z2D766.1%677
O9598397.4%521
Q9Z2D897.4%520
Q9UBB594.7%588
Q9Z2E194.7%586
Q0056692.3%665
Q9Z2D692.3%669

Section 9: Transcription Factor Regulatory Data MECP2 functions as a transcriptional regulator (primarily repressor) Downstream Targets (Genes Regulated BY MECP2) - Total: 36

Target GeneRegulation TypeConfidence
BDNFRepression-
ESR1Repression-
CCND1Repression-
CRHRepression-
DLL1Repression-
DLX5Unknown-
EHMT2Repression-
FKBP5Repression-
FOXP3Unknown-
FXYD1Repression-
GAD1Repression-
GRIA2Repression-
GRIN2BRepression-
HDAC7Repression-
IGFBP3Repression-
MEF2CRepression-
MGMTUnknown-
MYOCDRepression-
NOTCH1Repression-
NOX4Repression-
PCDH7Repression-
PCDHB1Repression-
PLA2G7Repression-
PTPN1Unknown-
RBFOX1Repression-
RELNRepression-
SGK1Repression-
SIRT1Repression-
SLC25A4Repression-
SLC2A3Activation-
SRFRepression-
TFF1Repression-
ABCB1Unknown-
ALOX5Unknown-
CDKL5Unknown-
Upstream Regulators (TFs That Regulate MECP2) - Total: 8
Transcription FactorRegulation TypeConfidence
EGR2UnknownHigh
PITX2UnknownHigh
TFCP2UnknownHigh
MEF2CUnknown-
GATA2Activation-
SP1Unknown-
ESR1UnknownLow
MBD2UnknownLow
HOXB13UnknownLow
DNA Binding Motifs No JASPAR binding profiles available for MECP2. MECP2 binds methylated CpG dinucleotides through its methyl-CpG-binding domain (MBD) rather than sequence-specific DNA motifs.

Section 10: Drug & Pharmacology Data ChEMBL Target Information

ChEMBL IDTarget NameType
CHEMBL3638346Methyl-CpG-binding protein 2Single Protein
Targeting Molecules (Total: 6)
ChEMBL IDName/AliasesTypeHighest Dev. Phase
CHEMBL489770N-(4-(pyridin-2-yl)thiazol-2-yl)pyridin-2-amine; GNF-Pf-4773Small molecule0 (Preclinical)
CHEMBL5208511-(5-phenyl-2,3-dihydro-1,3,4-thiadiazol-2-yl)naphthalen-2-olSmall molecule0 (Preclinical)
CHEMBL1371175-Small molecule0 (Preclinical)
CHEMBL3687963-Small molecule0 (Preclinical)
CHEMBL3687966-Small molecule0 (Preclinical)
CHEMBL3687968-Small molecule0 (Preclinical)
PharmGKB Information
PharmGKB IDSymbolVIP GeneHas CPIC GuidelineHas Variant Annotation
PA30729MECP2YesNoYes
Clinical Trials No direct drug-targeting clinical trials found in databases. Research focuses on gene therapy and symptomatic treatment approaches for Rett syndrome. Pharmacogenomics MECP2 is designated as a Very Important Pharmacogene (VIP) in PharmGKB, indicating clinical relevance for pharmacogenomics research, though no specific dosing guidelines currently exist.

Section 11: Expression Profiles Expression Summary (Bgee)

MetricValue
Expression BreadthUbiquitous
Total Present Calls277
Total Absent Calls22
Total Conditions299
Max Expression Score97.92
Average Expression Score83.27
Gold Quality Measurements258
TOP 30 Highest-Expressing Tissues
Tissue (UBERON ID)Expression ScoreQuality
Paraflocculus (UBERON:0005351)97.92Gold
Brodmann area 10 (UBERON:0013541)97.01Gold
Sural nerve (UBERON:0015488)96.30Gold
Frontal pole (UBERON:0002795)95.89Gold
Colonic epithelium (UBERON:0000397)95.84Gold
Middle frontal gyrus (UBERON:0002702)95.66Gold
Cerebellar vermis (UBERON:0004720)95.09Gold
Calcaneal tendon (UBERON:0003701)94.44Gold
Postcentral gyrus (UBERON:0002581)92.41Gold
Tendon (UBERON:0000043)91.59Gold
Parietal lobe (UBERON:0001872)91.22Gold
Blood (UBERON:0000178)91.07Gold
Corpus callosum (UBERON:0002336)90.35Gold
Mucosa of stomach (UBERON:0001199)90.32Gold
Adrenal tissue (UBERON:0018303)90.22Gold
Gastrocnemius (UBERON:0001388)89.95Gold
Muscle of leg (UBERON:0001383)89.87Gold
Hindlimb stylopod muscle (UBERON:0004252)89.82Gold
Olfactory nasal mucosa (UBERON:0005386)89.38Gold
Right lung (UBERON:0002167)89.22Gold
Cerebellum (UBERON:0002037)88.88Gold
Parotid gland (UBERON:0001831)88.87Silver
Left ovary (UBERON:0002119)88.84Gold
Tibial artery (UBERON:0007610)88.73Gold
Popliteal artery (UBERON:0002250)88.73Gold
TOP Cell Type Expression
Cell Type (CL ID)Expression ScoreQuality
Bone marrow cell (CL:0002092)90.24Gold
Granulocyte (CL:0000094)90.17Gold
Monocyte (CL:0000576)89.49Gold
Mononuclear cell (CL:0000842)89.00Gold
Leukocyte (CL:0000738)88.98Gold
Expression Pattern Summary
- Tissue-specific pattern- Highest expression- Notable pattern
MECP2 shows ubiquitous expression across tissuesBrain regions (prefrontal cortex, cerebellum) and nervous systemStrong expression in neural tissues consistent with Rett syndrome neurological phenotype
Disease Associations**
Mendelian/Monogenic Disease Links (GenCC)
DiseaseDisease IDClassificationInheritanceSource
Rett syndromeOMIM:312750DefinitiveX-linkedG2P, Ambry, Labcorp
Syndromic X-linked intellectual disability Lubs typeOMIM:300260DefinitiveX-linkedG2P
Severe neonatal-onset encephalopathy with microcephalyOMIM:300673DefinitiveX-linkedG2P
Chromosome Xq28 duplication syndromeOMIM:300815DefinitiveX-linkedG2P
Atypical Rett syndromeORPHANET:3095SupportiveAutosomal dominantOrphanet
Non-syndromic X-linked intellectual disabilityORPHANET:777SupportiveX-linkedOrphanet
X-linked intellectual disability-psychosis-macroorchidismORPHANET:3077SupportiveX-linkedOrphanet
Systemic lupus erythematosusORPHANET:536SupportiveUnknownOrphanet
Orphanet Disease Associations (Total: 7)
Orphanet IDDisease NameDisorder TypePhenotype Count
778Rett syndromeDisease36
3095Atypical Rett syndromeDisease54
209370MECP2-related severe neonatal encephalopathyDisease19
1762Proximal Xq28 duplication syndromeMalformation syndrome18
3077X-linked ID-psychosis-macroorchidismMalformation syndrome27
777X-linked non-syndromic intellectual disabilityEtiological subtype-
536Systemic lupus erythematosusDisease44
Phenotype Associations (HPO) - TOP 50 of 255
HPO IDPhenotype Term
HP:0001249Intellectual disability
HP:0001250Seizure
HP:0001252Hypotonia
HP:0001263Global developmental delay
HP:0000252Microcephaly
HP:0000253Progressive microcephaly
HP:0001344Absent speech
HP:0000750Delayed speech and language development
HP:0000733Motor stereotypy
HP:0000729Autistic behavior
HP:0000717Autism
HP:0001332Dystonia
HP:0001251Ataxia
HP:0001257Spasticity
HP:0001347Hyperreflexia
HP:0001288Gait disturbance
HP:0002019Constipation
HP:0001508Failure to thrive
HP:0001510Growth delay
HP:0002015Dysphagia
HP:0001319Neonatal hypotonia
HP:0002059Cerebral atrophy
HP:0001298Encephalopathy
HP:0000712Emotional lability
HP:0000739Anxiety
HP:0000748Inappropriate laughter
HP:0001417X-linked inheritance
HP:0001419X-linked recessive inheritance
HP:0001423X-linked dominant inheritance
HP:0000711Restlessness
HP:0000718Aggressive behavior
HP:0000737Irritability
HP:0000752Hyperactivity
HP:0000709Psychosis
HP:0001337Tremor
HP:0001336Myoclonus
HP:0001266Choreoathetosis
HP:0002020Gastroesophageal reflux
HP:0002033Poor suck
HP:0000508Ptosis
HP:0000160Narrow mouth
HP:0000218High palate
HP:0001761Pes cavus
HP:0000470Short neck
HP:0000767Pectus excavatum
HP:0001371Flexion contracture
HP:0001239Wrist flexion contracture
HP:0001387Joint stiffness
HP:0001300Parkinsonism
HP:0007585Respiratory gaseous exchange
GWAS Associations (Total: 8)
Study IDTrait/DiseaseMapped Genep-value
GCST003155Systemic lupus erythematosusMECP22.0e-15
GCST005990Non-albumin protein levelsMECP25.0e-09
GCST003599Systemic lupus erythematosusTMEM187 - IRAK14.0e-10
GCST002190SchizophreniaRENBP4.0e-08
GCST004867Systemic lupus erythematosusARHGAP48.0e-07
GCST001873Red blood cell traitsG6PD2.0e-16
GCST001873Red blood cell traitsG6PD4.0e-19
GCST001873Red blood cell traitsG6PD4.0e-18
DATA SOURCES SUMMARY
DatabaseRecords/Entries
HGNC1 entry, 4,785 cross-references
Ensembl1 gene, 30 transcripts
UniProt1 reviewed entry, 9,371 cross-references
ClinVar2,081 variants
SpliceAI1,190+ predictions
AlphaMissense3,133 missense predictions
PDB9 structures
Reactome11 pathways
GO66 annotations
STRING3,950+ interactions
IntAct239+ interactions
BioGRID1,482+ interactions
SIGNOR49 regulatory relationships
CollecTRI44 TF-target relationships
Bgee277 expression conditions
HPO255 phenotype associations
GenCC12 gene-disease assertions
Orphanet7 disease associations
GWAS Catalog8 associations
ChEMBL6 targeting molecules
PharmGKBVIP gene status

This reference document was compiled from biobtree integrated biological databases. Data reflects current database versions as of the query date.