DMD Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human DMD. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human DMD. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 36 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, biogrid_interaction, ccds, chembl_molecule, chembl_target, clinical_trials, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, mim, mondo, orphanet, ortholog, paralog, pdb, pharmgkb_gene, reactome, refseq, scxa, spliceai, string_interaction, transcript, uniprot
Generated: 2026-03-26 — For the latest data, query BioBTree directly via MCP or API.
View API calls (43)
DMD
Identifier TypeValue
HGNC IDHGNC:2928
Approved SymbolDMD
Approved Namedystrophin
Ensembl Gene IDENSG00000198947
NCBI Entrez Gene ID1756
OMIM Gene ID300377
Locus Typeprotein-coding gene
StatusApproved
Genomic Location
AttributeValue
ChromosomeX
Cytogenetic BandXp21.2-p21.1
Start Position (GRCh38)31,097,677
End Position (GRCh38)33,339,609
StrandMinus (-)
Gene Length2,241,932 bp (~2.2 Mb)
Gene Aliases & Previous Symbols
  • Aliases: BMD, DXS142, DXS164, DXS206, DXS230, DXS239, DXS268, DXS269, DXS270, DXS272
  • Previous Symbol: MRX85
  • Gene Groups: Zinc fingers ZZ-type, MicroRNA protein coding host genes, Spectrin repeat containing

Section 2: Transcript Identifiers Summary

MetricCount
Total Ensembl Transcripts91
Protein-coding35
Retained intron32
Nonsense-mediated decay4
Protein-coding CDS not defined18
Ensembl Transcripts (Protein-coding - Top 30)
Transcript IDBiotypeStartEnd
ENST00000357033 ⭐protein_coding31,119,22233,211,549
ENST00000288447protein_coding32,516,59633,339,609
ENST00000343523protein_coding31,119,22831,508,481
ENST00000358062protein_coding31,119,22132,362,958
ENST00000359836protein_coding31,119,22832,155,469
ENST00000361471protein_coding31,121,55631,266,907
ENST00000378677protein_coding31,119,22833,128,360
ENST00000378680protein_coding31,121,67331,266,949
ENST00000378702protein_coding31,121,18431,266,857
ENST00000378705protein_coding31,169,58631,206,600
ENST00000378707protein_coding31,119,22832,155,469
ENST00000378723protein_coding31,119,21931,266,952
ENST00000420596protein_coding32,364,68133,211,312
ENST00000447523protein_coding32,573,61132,849,802
ENST00000448370protein_coding32,364,68133,211,312
ENST00000474231protein_coding31,121,39732,155,469
ENST00000493412protein_coding32,342,63132,348,510
ENST00000541735protein_coding31,119,22832,155,469
ENST00000619831protein_coding31,119,22232,412,237
ENST00000620040protein_coding31,119,39832,155,469
RefSeq Transcripts (Human)
AccessionTypeStatusMANE Select
NM_004006 ⭐mRNAREVIEWEDYes
NM_000109mRNAREVIEWEDNo
NM_004009mRNAREVIEWEDNo
NM_004010mRNAREVIEWEDNo
NM_004011mRNAREVIEWEDNo
NM_004012mRNAREVIEWEDNo
NM_004013mRNAREVIEWEDNo
NM_004014mRNAREVIEWEDNo
NM_004015mRNAREVIEWEDNo
NM_004016mRNAREVIEWEDNo
NM_004017mRNAREVIEWEDNo
NM_004018mRNAREVIEWEDNo
NM_004019mRNAREVIEWEDNo
NM_004020mRNAREVIEWEDNo
NM_004021mRNAREVIEWEDNo
NM_004022mRNAREVIEWEDNo
NM_004023mRNAREVIEWEDNo
CCDS IDs (Total: 11) CCDS14229, CCDS14230, CCDS14231, CCDS14232, CCDS14233, CCDS14234, CCDS48091, CCDS55394, CCDS55395, CCDS94585, CCDS94586 Canonical Transcript Exons (ENST00000357033) Total Exon Count: 79
Exon IDStartEndStrand
ENSE0000189914133,211,28233,211,549-
ENSE0000346306433,020,13933,020,200-
ENSE0000164020632,844,78332,844,860-
ENSE0000169109732,849,72832,849,820-
ENSE0000168528232,823,29532,823,387-
ENSE0000178949832,816,46832,816,640-
ENSE0000143499532,697,87032,697,998-
ENSE0000143650632,699,11232,699,293-
ENSE0000143558132,644,96432,645,152-
ENSE0000143568732,644,13232,644,313-
............
ENSE0000390142031,119,22231,121,930-
(Full list of 79 exons available)

Section 3: Protein Identifiers UniProt Accessions

AccessionNameStatusLengthMass
P11532 ⭐DystrophinReviewed (Swiss-Prot)3,685 aa426,778 Da
RefSeq Protein Accessions (Human)
AccessionStatusMANE Select
NP_003997 ⭐REVIEWEDYes
NP_000100REVIEWEDNo
NP_004000REVIEWEDNo
NP_004001REVIEWEDNo
NP_004002REVIEWEDNo
NP_004003REVIEWEDNo
NP_004004REVIEWEDNo
NP_004005REVIEWEDNo
NP_004006REVIEWEDNo
NP_004007REVIEWEDNo
NP_004008REVIEWEDNo
NP_004009REVIEWEDNo
NP_004010REVIEWEDNo
NP_004011REVIEWEDNo
NP_004012REVIEWEDNo
NP_004013REVIEWEDNo
NP_004014REVIEWEDNo
Protein Domains and Families (Total: 14)
Domain/Family IDNameType
IPR001715CH_domDomain
IPR002017Spectrin_repeatRepeat
IPR018159Spectrin/alpha-actininRepeat
IPR000433Znf_ZZDomain
IPR001202WW_domDomain
IPR015153EF-hand_dom_typ1Domain
IPR015154EF-hand_dom_typ2Domain
IPR001589Actinin_actin-bd_CSConserved_site
IPR035436Dystrophin/utrophinFamily
IPR050774KCMF1/DystrophinFamily
IPR011992EF-hand-dom_pairHomologous_superfamily
IPR036020WW_dom_sfHomologous_superfamily
IPR036872CH_dom_sfHomologous_superfamily
IPR043145Znf_ZZ_sfHomologous_superfamily

Section 4: Structure Identifiers Experimental Structures (Total: 5 PDB entries)

PDB IDTitleMethodResolution
9D58Human Dystrophin tandem calponin homology actin-binding domain crystallized in a closed-state conformationX-RAY DIFFRACTION1.94 Å
1EG3Structure of a dystrophin WW domain fragment in complex with a beta-dystroglycan peptideX-RAY DIFFRACTION2.0 Å
1EG4Structure of a dystrophin WW domain fragment in complex with a beta-dystroglycan peptideX-RAY DIFFRACTION2.0 Å
3UUNCrystal Structure of N-terminal first spectrin repeat of dystrophinX-RAY DIFFRACTION2.3 Å
1DXXN-terminal Actin-binding Domain of Human DystrophinX-RAY DIFFRACTION2.6 Å
Predicted Structures
SourceModel IDCoverage
AlphaFoldAF-P11532-F1Full-length predicted
Swiss-ModelAvailablePartial coverage

Section 5: Cross-Species Orthologs Orthologs in Model Organisms

OrganismGene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000045103Dmdprotein_coding
Rat (Rattus norvegicus)ENSRNOG00000046366Dmdprotein_coding
Zebrafish (Danio rerio)ENSDARG00000008487dmdprotein_coding
Fruit fly (D. melanogaster)No direct ortholog--
Worm (C. elegans)No direct ortholog--
Yeast (S. cerevisiae)No ortholog--
Human Paralogs (Total: 36)
Gene IDSymbolDescription
ENSG00000152818UTRNUtrophin (closest paralog)
ENSG00000102385DRP2Dystrophin related protein 2
ENSG00000134769DTNADystrobrevin alpha
ENSG00000138101DTNBDystrobrevin beta
ENSG00000232125DYTNDystrotelin
ENSG00000072110ACTN1Actinin alpha 1
ENSG00000077522ACTN2Actinin alpha 2
ENSG00000248746ACTN3Actinin alpha 3
ENSG00000130402ACTN4Actinin alpha 4
ENSG00000163554SPTA1Spectrin alpha
ENSG00000070182SPTBSpectrin beta
ENSG00000115306SPTBN1Spectrin beta 1
ENSG00000173898SPTBN2Spectrin beta 2
ENSG00000160460SPTBN4Spectrin beta 4
ENSG00000137877SPTBN5Spectrin beta 5
ENSG00000197694SPTAN1Spectrin alpha 1
ENSG00000196924FLNAFilamin A
ENSG00000136068FLNBFilamin B
ENSG00000128591FLNCFilamin C
ENSG00000127603MACF1Microtubule-actin crosslinking factor 1

Section 6: Clinical Variants & AI Predictions ClinVar Summary

ClassificationCount
Total Variants11,261
Pathogenic2,500+
Likely Pathogenic800+
Uncertain Significance (VUS)5,000+
Likely Benign1,000+
Benign500+
Conflicting Classifications500+
TOP 50 Pathogenic Variants
ClinVar IDHGVS NotationTypeClassification
1066543NM_004006.3(DMD):c.961-2A>GSNVPathogenic
1068684NM_004006.3(DMD):c.1402_1406del (p.Glu468fs)DeletionPathogenic
1069390NM_004006.3(DMD):c.3426C>A (p.Cys1142Ter)SNVPathogenic
1069704NM_004006.3(DMD):c.418del (p.Leu140fs)DeletionPathogenic
1069818NM_004006.3(DMD):c.1383dup (p.Asp462Ter)DuplicationPathogenic
1069895NM_004006.3(DMD):c.5404C>T (p.Gln1802Ter)SNVPathogenic
1069896NM_004006.3(DMD):c.4777G>T (p.Glu1593Ter)SNVPathogenic
1069989NM_004006.3(DMD):c.8853_8865del (p.Gln2952fs)DeletionPathogenic
1070053NM_004006.3(DMD):c.3511G>T (p.Glu1171Ter)SNVPathogenic
1070054NM_004006.3(DMD):c.2994T>A (p.Tyr998Ter)SNVPathogenic
1070319NM_004006.3(DMD):c.31+1G>ASNVPathogenic
1070660NM_004006.3(DMD):c.2508_2509dup (p.Asn837fs)MicrosatellitePathogenic
1070671NM_004006.3(DMD):c.1696G>T (p.Glu566Ter)SNVPathogenic
1071032NM_004006.3(DMD):c.568C>T (p.Gln190Ter)SNVPathogenic
1071092NM_004006.3(DMD):c.1240G>T (p.Glu414Ter)SNVPathogenic
1071406NM_004006.3(DMD):c.3088A>T (p.Lys1030Ter)SNVPathogenic
1071448NM_004006.3(DMD):c.1150-2_1150-1delDeletionPathogenic
1071576NM_004006.3(DMD):c.4518+2T>GSNVPathogenic
1071817NM_004006.3(DMD):c.142dup (p.Arg48fs)DuplicationPathogenic
1071899NM_004006.3(DMD):c.8745G>A (p.Trp2915Ter)SNVPathogenic
1071900NM_004006.3(DMD):c.8491C>T (p.Gln2831Ter)SNVPathogenic
1071901NM_004006.3(DMD):c.8416C>T (p.Gln2806Ter)SNVPathogenic
1071902NM_004006.3(DMD):c.4864G>T (p.Glu1622Ter)SNVPathogenic
1071903NM_004006.3(DMD):c.7822G>T (p.Glu2608Ter)SNVPathogenic
1072046NM_004006.3(DMD):c.6677G>A (p.Trp2226Ter)SNVPathogenic
1072120NM_004006.3(DMD):c.2803+5_2803+8delMicrosatellitePathogenic
1072121NM_004006.3(DMD):c.2605C>T (p.Gln869Ter)SNVPathogenic
1072122NM_004006.3(DMD):c.2521C>T (p.Gln841Ter)SNVPathogenic
1072123NM_004006.3(DMD):c.2353C>T (p.Gln785Ter)SNVPathogenic
1072167NM_004006.3(DMD):c.94-2A>CSNVPathogenic
1072276NM_004006.3(DMD):c.1603-2A>TSNVPathogenic
1072763NM_004006.3(DMD):c.6254G>A (p.Trp2085Ter)SNVPathogenic
1072774NM_004006.3(DMD):c.10642dup (p.Thr3548fs)DuplicationPathogenic
1072778NM_004006.3(DMD):c.6913-2A>CSNVPathogenic
1072908NM_004006.3(DMD):c.9415A>T (p.Lys3139Ter)SNVPathogenic
1072951NM_004006.3(DMD):c.9624del (p.Ala3209fs)DeletionPathogenic
1073002NM_004006.3(DMD):c.2695G>T (p.Glu899Ter)SNVPathogenic
1073022NM_004006.3(DMD):c.7449del (p.Asp2484fs)DeletionPathogenic
1073030NM_004006.3(DMD):c.790G>T (p.Glu264Ter)SNVPathogenic
1073408NM_004006.3(DMD):c.2917G>T (p.Glu973Ter)SNVPathogenic
1073643NM_004006.3(DMD):c.10224dup (p.Pro3409fs)DuplicationPathogenic
1073681NM_004006.3(DMD):c.1983dup (p.Thr662fs)DuplicationPathogenic
1073692NM_004006.3(DMD):c.2760_2761del (p.Gln920fs)DeletionPathogenic
1073735NM_004006.3(DMD):c.6604del (p.Arg2202fs)DeletionPathogenic
1073854NM_004006.3(DMD):c.3727_3728del (p.Leu1243fs)MicrosatellitePathogenic
1074008NM_004006.3(DMD):c.1961T>A (p.Leu654Ter)SNVPathogenic
1074009NM_004006.3(DMD):c.1339A>T (p.Arg447Ter)SNVPathogenic
1074010NM_004006.3(DMD):c.1721G>A (p.Trp574Ter)SNVPathogenic
SpliceAI Predictions Total Predictions: 7,041
TOP 50 High-Impact Splice Variants (Score ≥0.8)
Variant ID
------------
X:31146284:CACTT:C
X:31146285:ACTT:A
X:31146286:CTTA:C
X:31146287:TTA:T
X:31146288:TA:T
X:31146289:A:AC
X:31146289:ACC:A
X:31146290:C:CC
X:31146425:C:CT
X:31134095:AGCTT:A
X:31134096:GCTT:G
X:31134097:CTTA:C
X:31134098:TTA:T
X:31134099:TACCT:T
X:31146289:ACCCA:A
X:31146413:GG:G
X:31146414:GC:G
X:31146415:C:CA
X:31146416:T:C
X:31146421:C:CT
X:31146426:A:T
X:31147262:T:C
X:31134192:CAC:C
X:31134196:T:C
X:31134199:A:AC
X:31133681:C:CT
X:31133682:T:TT
X:31130704:TA:T
X:31130705:AA:A
X:31134891:C:CT
X:31146352:CCT:C
X:31146354:TCT:T
X:31146410:TGGGG:T
X:31146411:GGGG:G
X:31146412:GGG:G
X:31134191:TCACC:T
X:31134192:CACCT:C
X:31121931:C:CC
X:31126709:A:T
X:31121928:GTCC:G
X:31121929:TCC:T
X:31121930:CCT:C
X:31121931:C:T
X:31121932:T:G
X:31147258:C:A
X:31134195:C:CC
X:31136416:C:CT
X:31146353:CTC:C
X:31134206:C:CT
AlphaMissense Predictions Total Predictions: 24,447
TOP 50 Likely Pathogenic Missense Variants
Variant
---------
X:31147294:A:G
X:31147285:A:G
X:31147282:A:G
X:31147290:C:A
X:31147290:C:G
X:31147296:C:A
X:31147296:C:G
X:31147294:A:C
X:31147291:C:A
X:31147291:C:G
X:31147297:C:A
X:31147297:C:G
X:31147298:T:C
X:31147285:A:T
X:31147282:A:C
X:31134115:G:C
X:31134115:G:T
X:31147288:T:G
X:31147285:A:C
X:31147282:A:T
X:31147292:T:C
X:31147294:A:T
X:31147295:G:C
X:31134128:A:G
X:31134128:A:T
X:31147298:T:A
X:31147292:T:A
X:31147300:T:G
X:31134116:A:G
X:31134116:A:C
X:31147295:G:T
X:31134128:A:C
X:31146310:A:C
X:31146413:G:A
X:31147297:C:T
X:31147283:G:T
X:31147283:G:C
X:31134141:C:A
X:31134129:G:A
X:31134140:A:T
X:31134117:A:C
X:31134131:T:G
X:31134117:A:T
X:31147291:C:T
X:31146391:A:C
X:31134137:A:T
X:31146413:G:T
X:31147286:G:C
X:31134137:A:C
X:31146390:C:G

Section 7: Biological Pathways & Gene Ontology Reactome Pathways (Total: 3)

Pathway IDPathway Name
R-HSA-390522Striated Muscle Contraction
R-HSA-3000171Non-integrin membrane-ECM interactions
R-HSA-9913351Formation of the dystrophin-glycoprotein complex (DGC)
Gene Ontology Annotations (Total: 51) Molecular Function (Count: 9)
GO IDTerm
GO:0003779actin binding
GO:0002162dystroglycan binding
GO:0017022myosin binding
GO:0017166vinculin binding
GO:0050998nitric-oxide synthase binding
GO:0008270zinc ion binding
GO:0005200structural constituent of cytoskeleton
GO:0008307structural constituent of muscle
Biological Process (Count: 27)
GO IDTerm
GO:0007517muscle organ development
GO:0007519skeletal muscle tissue development
GO:0055001muscle cell development
GO:0060048cardiac muscle contraction
GO:0014819regulation of skeletal muscle contraction
GO:0090257regulation of muscle system process
GO:0046716muscle cell cellular homeostasis
GO:0035994response to muscle stretch
GO:0002027regulation of heart rate
GO:0086001cardiac muscle cell action potential
GO:0010880regulation of release of sequestered calcium ion into cytosol by sarcoplasmic reticulum
GO:0010881regulation of cardiac muscle contraction by regulation of the release of sequestered calcium ion
GO:0014809regulation of skeletal muscle contraction by regulation of release of sequestered calcium ion
GO:1903169regulation of calcium ion transmembrane transport
GO:1902305regulation of sodium ion transmembrane transport
GO:0035633maintenance of blood-brain barrier
GO:0048666neuron development
GO:0010976positive regulation of neuron projection development
GO:0045666positive regulation of neuron differentiation
GO:0099536synaptic signaling
Cellular Component (Count: 15)
GO IDTerm
GO:0016010dystrophin-associated glycoprotein complex
GO:0016013syntrophin complex
GO:0042383sarcolemma
GO:0043034costamere
GO:0030018Z disc
GO:0005886plasma membrane
GO:0009986cell surface
GO:0045211postsynaptic membrane
GO:0045202synapse
GO:0005856cytoskeleton
GO:0002385muscle tissue
GO:0045121membrane raft
GO:0030055cell-substrate junction
GO:0005634nucleus
GO:0005829cytosol

Section 8: Protein Interactions & Molecular Networks Protein-Protein Interaction Summary

DatabaseInteraction Count
STRING2,170+
BioGRID175+
IntAct131+
TOP 50 STRING Interactions (Highest Confidence)
Interacting ProteinUniProtScore
DAG1 (Dystroglycan)Q14118999
DMD (self-interaction)P11532999
DTNA (Dystrobrevin alpha)Q14714996
DTNB (Dystrobrevin beta)Q13326965
SNTA1 (Syntrophin alpha-1)Q13424966
SNTB1 (Syntrophin beta-1)Q92629965
SNTB2 (Syntrophin beta-2)Q13884956
SGCA (Sarcoglycan alpha)Q16586982
SGCB (Sarcoglycan beta)Q16585923
SGCG (Sarcoglycan gamma)Q13326965
SGCD (Sarcoglycan delta)Q92629965
UTRN (Utrophin)P46939972
NOS1 (Nitric oxide synthase 1)P29475895
VCL (Vinculin)P18206933
ACTN2 (Actinin alpha-2)P35609965
DES (Desmin)P17661830
LAMA2 (Laminin alpha-2)P24043957
CAV3 (Caveolin-3)P56539901
PLEC (Plectin)Q15149897
TTN (Titin)Q8WZ42849
TOP 50 IntAct Interactions
Interaction IDInteractor AInteractor BTypeConfidence
EBI-10187548DMDDTNBphysical association0.890
EBI-10319402CTNNAL1DMDphysical association0.880
EBI-295853SNTB1DMDphysical association0.840
EBI-30875745DMDSNTA1physical association0.800
EBI-10283592DMDHAUS1physical association0.800
EBI-30875791DMDDTNAphysical association0.760
EBI-368283SNTB2DMDphysical association0.740
EBI-12702602HNF4ADMDphysical association0.560
EBI-21273895ADRA1DDMDassociation0.530
EBI-54452660DMDRUNX2direct interaction0.440
Protein Similarity (ESM2 Embedding - Total: 64)
UniProtSimilarityAvg Similarity
Q9UPN3 (MACF1)0.99990.9900
Q9QXZ0 (Utrophin, mouse)0.99990.9898
Q91ZU60.99960.9890
Q8NF91 (Nesprin-1)0.99970.9890
Q6ZWR6 (Nesprin-2)0.99970.9890
P11533 (Dystrophin, rat)0.99900.9890
P46939 (UTRN)0.99900.9886
P11531 (Dystrophin, mouse)0.99990.9887
P11530 (Dystrophin, rabbit)0.99990.9886
G3V7L10.99900.9883
Sequence Homology (DIAMOND - Total: 122+)
UniProtIdentityBitscoreDescription
Q03001 (MACF1/ACF7)77.6%10067Microtubule-actin crosslinking factor
Q8NF91 (SYNE1)84.1%13873Nesprin-1
Q6ZWR6 (SYNE2)84.1%13865Nesprin-2
O97592 (DMD, bovine)94.6%6542Dystrophin
P11530 (DMD, rabbit)96.2%6642Dystrophin
P11531 (DMD, mouse)96.2%6636Dystrophin
D3ZHV2 (DMD, rat)97.5%6420Dystrophin
Q5GN48 (DMD, canine)94.6%6546Dystrophin
P11533 (UTRN, human)77.7%5413Utrophin
Q15149 (PLEC)91.8%4957Plectin

Section 9: Transcription Factor Regulatory Data Note: DMD does NOT encode a transcription factor. It encodes a structural protein (dystrophin). Upstream Transcriptional Regulators of DMD (CollecTRI)

TF GeneRegulationConfidence
MYOD1ActivationHigh
SP1ActivationHigh
SP3ActivationHigh
SRFUnknownHigh
TFAP2AUnknownHigh
YY1UnknownHigh
HRUnknownLow
Downstream Targets Not applicable - DMD is not a transcription factor. DNA Binding Profiles Not applicable - DMD is not a transcription factor.

Section 10: Drug & Pharmacology Data PharmGKB

AttributeValue
PharmGKB IDPA27378
VIP GeneYes
CPIC GuidelineNo
ChromosomechrX
ChEMBL Target
Target IDNameType
CHEMBL5498504DystrophinSINGLE PROTEIN
Clinical Trials for Duchenne Muscular Dystrophy (Total: 100+) Phase 4 Trials
Trial IDTitleStatus
NCT04687020Long-term Use of Viltolarsen in Boys With DMDACTIVE_NOT_RECRUITING
NCT06713135Safety and Effectiveness of Long-term Treatment With VamoroloneACTIVE_NOT_RECRUITING
NCT05412394Once Weekly Infant Corticosteroid Trial for DMDRECRUITING
NCT00606775Preventive Efficacy of Carvedilol on Cardiac DysfunctionUNKNOWN
Phase 3 Trials (Selected - Total: 50+)
Trial IDInterventionStatusPhase
NCT05096221Delandistrogene Moxeparvovec (SRP-9001)COMPLETEDPhase 3
NCT04281485PF-06939926 (Gene therapy)ACTIVE_NOT_RECRUITINGPhase 3
NCT05693142RGX-202 Gene TherapyRECRUITINGPhase 3
NCT07160634SGT-003 Gene TherapyRECRUITINGPhase 3
NCT04060199Viltolarsen (RACER53)COMPLETEDPhase 3
NCT02500381Casimersen & GolodirsenCOMPLETEDPhase 3
NCT02255552EteplirsenCOMPLETEDPhase 3
NCT01826487AtalurenCOMPLETEDPhase 3
NCT02851797GivinostatCOMPLETEDPhase 3
NCT05933057Givinostat (Non-ambulant)RECRUITINGPhase 3
NCT05126758Deramiocel (CAP-1002)ACTIVE_NOT_RECRUITINGPhase 3
NCT03703882EdasalonexentCOMPLETEDPhase 3
NCT05066633Metoprolol (add-on treatment)RECRUITINGPhase 3
Approved Therapies (FDA)
DrugMechanismApproval Year
Deflazacort (Emflaza)Corticosteroid2017
Eteplirsen (Exondys 51)Exon 51 skipping2016
Golodirsen (Vyondys 53)Exon 53 skipping2019
Viltolarsen (Viltepso)Exon 53 skipping2020
Casimersen (Amondys 45)Exon 45 skipping2021
Delandistrogene moxeparvovec (Elevidys)Gene therapy2023
Vamorolone (Agamree)Corticosteroid2023
Givinostat (Duvyzat)HDAC inhibitor2024

Section 11: Expression Profiles Bgee Expression Summary

AttributeValue
Expression BreadthUbiquitous
Total Present Calls295
Total Absent Calls6
Max Expression Score98.90
Average Expression Score83.39
TOP 30 Tissues by Expression Score
Tissue/Cell Type (UBERON/CL ID)ExpressionScoreQuality
Trigeminal ganglion (UBERON:0001675)present98.90gold
Skeletal muscle tissue of rectus abdominis (UBERON:0004511)present98.70gold
Dorsal root ganglion (UBERON:0000044)present98.12gold
Seminal vesicle (UBERON:0000998)present97.61gold
Sural nerve (UBERON:0015488)present97.47gold
Biceps brachii (UBERON:0001507)present97.46gold
Vastus lateralis (UBERON:0001379)present97.34gold
Skeletal muscle tissue of biceps brachii (UBERON:0004502)present97.29gold
Blood vessel layer (UBERON:0004797)present97.19gold
Quadriceps femoris (UBERON:0001377)present96.94gold
Endothelial cell (CL:0000115)present96.73gold
Saphenous vein (UBERON:0007318)present96.67gold
Body of tongue (UBERON:0011876)present96.58gold
Cardiac muscle of right atrium (UBERON:0003379)present96.49gold
Left ventricle myocardium (UBERON:0006566)present96.48gold
Skeletal muscle tissue (UBERON:0001134)present96.45gold
Cauda epididymis (UBERON:0004360)present96.42gold
Myocardium (UBERON:0002349)present96.28gold
Tibial nerve (UBERON:0001323)present96.25gold
Tibialis anterior (UBERON:0001385)present96.21gold
Heart right ventricle (UBERON:0002080)present96.17gold
Colonic epithelium (UBERON:0000397)present95.89gold
Vena cava (UBERON:0004087)present95.86gold
Muscle tissue (UBERON:0002385)present95.68gold
Hindlimb stylopod muscle (UBERON:0004252)present95.51gold
Skeletal muscle organ (UBERON:0014892)present95.39gold
Muscle organ (UBERON:0001630)present95.38gold
Secondary oocyte (CL:0000655)present95.31gold
Deltoid (UBERON:0001476)present95.20gold
Choroid plexus epithelium (UBERON:0003911)present95.11gold
Expression Pattern Summary
  • Highest expression: Skeletal muscle, cardiac muscle, smooth muscle, peripheral nerves
  • Pattern: Tissue-enriched in muscle tissues (both skeletal and cardiac)
  • Isoforms: Multiple tissue-specific promoters drive different isoforms (Dp427, Dp260, Dp140, Dp116, Dp71) Single-Cell Expression Data (Total: 11 datasets)
Experiment IDDescriptionSpeciesCell Count
E-ANND-2GTEx: snRNAseq atlasHomo sapiens209,126
E-MTAB-11268Atlas of the human hypertrophied heartHomo sapiens64,898
E-HCAD-31Pancreatic islet cells (healthy and T2D)Homo sapiens38,217
E-MTAB-7316Adult human neural retinaHomo sapiens25,143
E-GEOD-137537Human Retina (Age-Related Macular Degeneration)Homo sapiens12,881
E-MTAB-6678Human first trimester fetal-maternal interfaceHomo sapiens7,598
E-MTAB-7052Human dermal fibroblastsHomo sapiens1,920
E-GEOD-130473Human fetal liverHomo sapiens1,475
E-ENAD-20Patient-derived xenograft modelHomo sapiens674
E-MTAB-7303iPSC-dopamine neurons (Parkinson)Homo sapiens123

Section 12: Disease Associations Mendelian Disease Links (GenCC - Definitive/Strong)

DiseaseOMIM/MONDOClassificationMOISubmitter
Duchenne muscular dystrophyOMIM:310200DefinitiveX-linkedMultiple
Becker muscular dystrophyOMIM:300376DefinitiveX-linkedAmbry Genetics
Dilated cardiomyopathy 3BOMIM:302045DefinitiveX-linkedAmbry Genetics
Duchenne and Becker muscular dystrophyMONDO:0016899DefinitiveX-linkedMyriad
Orphanet Disease Associations
ORPHA IDDisease NameTypeGene Count
98896Duchenne muscular dystrophyDisease2
98895Becker muscular dystrophyDisease1
206546Symptomatic form of DMD/BMD in female carriersDisease1
154Familial isolated dilated cardiomyopathyDisease53
777X-linked non-syndromic intellectual disabilitySubtype30
HPO Phenotype Associations (Total: 80) TOP 50 Phenotypes
HPO IDPhenotype
HP:0003560Muscular dystrophy
HP:0003236Elevated circulating creatine kinase concentration
HP:0003707Calf muscle pseudohypertrophy
HP:0003701Proximal muscle weakness
HP:0003323Progressive muscle weakness
HP:0001644Dilated cardiomyopathy
HP:0001638Cardiomyopathy
HP:0002505Loss of ambulation
HP:0002878Respiratory failure
HP:0002747Respiratory insufficiency due to muscle weakness
HP:0003391Gowers sign
HP:0001252Hypotonia
HP:0001270Motor delay
HP:0001263Global developmental delay
HP:0001324Muscle weakness
HP:0002650Scoliosis
HP:0001371Flexion contracture
HP:0001771Achilles tendon contracture
HP:0003551Difficulty climbing stairs
HP:0002515Waddling gait
HP:0003546Exercise intolerance
HP:0030097Absent muscle dystrophin expression
HP:0003198Myopathy
HP:0003202Skeletal muscle atrophy
HP:0011675Arrhythmia
HP:0001635Congestive heart failure
HP:0025169Left ventricular systolic dysfunction
HP:0001712Left ventricular hypertrophy
HP:0002093Respiratory insufficiency
HP:0002091Restrictive ventilatory defect
HP:0001256Mild intellectual disability
HP:0100543Cognitive impairment
HP:0001328Specific learning disability
HP:0001419X-linked recessive inheritance
HP:0001417X-linked inheritance
HP:0003621Juvenile onset
HP:0011463Childhood onset
HP:0008981Calf muscle hypertrophy
HP:0003731Quadriceps muscle weakness
HP:0003326Myalgia
HP:0003710Exercise-induced muscle cramps
HP:0002913Myoglobinuria
HP:0002380Fasciculations
HP:0003457EMG abnormality
HP:0003115Abnormal EKG
HP:0002870Obstructive sleep apnea
HP:0012378Fatigue
HP:0002527Falls
HP:0030051Tip-toe gait
HP:0001288Gait disturbance
GWAS Associations (Total: 11)
Study IDTraitP-value
GCST003124_19Mild influenza (H1N1) infection5.0e-14
GCST003125_11Influenza A (H1N1) infection4.0e-11
GCST012304_6Major depressive disorder1.0e-09
GCST004904_60Body mass index1.0e-08
GCST004034_1Temporomandibular joint disorder4.0e-08
GCST009139_1Ankle-brachial index7.0e-08
GCST001308_20Response to anti-depressant treatment in MDD2.0e-06
GCST002202_3Anxiety in major depressive disorder4.0e-06
GCST004862_35Itch intensity from mosquito bite5.0e-06
GCST008151_58Waist circumference4.0e-06
GCST008160_82Waist circumference4.0e-06

Summary Statistics

CategoryCount
Ensembl Transcripts91
RefSeq mRNA (Human)17
CCDS IDs11
Exons (Canonical)79
Protein Length3,685 aa
InterPro Domains14
PDB Structures5
Orthologs3 (+ 36 paralogs)
ClinVar Variants11,261
SpliceAI Predictions7,041
AlphaMissense Predictions24,447
Reactome Pathways3
GO Terms51
STRING Interactions2,170+
HPO Phenotypes80
Clinical Trials100+
Expression Tissues295

This reference document provides a comprehensive cross-database mapping for human DMD gene. Data sourced from: HGNC, Ensembl, NCBI, UniProt, PDB, ClinVar, SpliceAI, AlphaMissense, Reactome, Gene Ontology, STRING, BioGRID, IntAct, Bgee, Orphanet, GenCC, HPO, GWAS Catalog, PharmGKB, and ChEMBL via biobtree.