DMD Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human DMD — a definitive lookup resource covering: ### Section …

Provide a comprehensive cross-database identifier and functional mapping reference for human DMD — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene DMD, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene DMD, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene DMD protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene DMD protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene DMD, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene DMD, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene DMD, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene DMD protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene DMD, summarize transcription factor regulatory data. If DMD is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate DMD — names with evidence type (ChIP-seq / predicted / experimentally validated) If DMD is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene DMD protein as a drug target, summarize pharmacology data. If DMD is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If DMD is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene DMD, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene DMD, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in DMD: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations

DMD

Executive summary

DMD encodes dystrophin, a 3,685-residue structural protein that anchors the cytoskeleton to the extracellular matrix via the dystrophin-glycoprotein complex (DGC); loss-of-function mutations cause Duchenne muscular dystrophy (DMD) and Becker muscular dystrophy (BMD), two of the most common X-linked recessive diseases. The gene spans over 2.2 Mb on chromosome X (GRCh38: 31,097,677–33,339,609) and is among the largest in the human genome, producing 91 Ensembl transcripts (46 protein-coding) with a canonical 79-exon MANE Select transcript (NM_004006/ENST00000357033). Roughly ~11,261 ClinVar variants are catalogued, with pathogenic mutations spanning Duchenne/Becker muscular dystrophy and dilated cardiomyopathy 3B. The therapeutic landscape is unusually rich — 96 molecules across approved and investigational use, including exon-skipping oligonucleotides (eteplirsen, viltolarsen, casimersen), AAV-mediated microdystrophin gene therapies (delandistrogene moxeparvovec), and nonsense readthrough agents (ataluren). Expression is ubiquitous but highest in skeletal and cardiac muscle, peripheral nerves, and vascular endothelium, consistent with the multisystem manifestations of dystrophinopathy.

DMD — Reference

Cross-database identifier and functional mapping reference for DMD.

Gene identifiers

Gene: DMD (dystrophin)

  • HGNC ID: HGNC:2928
  • Approved symbol: DMD
  • Ensembl gene ID: ENSG00000198947
  • NCBI Entrez Gene ID: 1756
  • OMIM gene/locus ID: 300377

Genomic location (GRCh38):

  • Chromosome: X
  • Start position: 31,097,677
  • End position: 33,339,609
  • Strand: − (negative/reverse)

Transcript identifiers

Ensembl Transcripts (91 total)

ENST IDBiotype
ENST00000288447protein_coding
ENST00000343523protein_coding
ENST00000357033protein_coding
ENST00000358062protein_coding
ENST00000359836protein_coding
ENST00000361471protein_coding
ENST00000378677protein_coding
ENST00000378680protein_coding
ENST00000378702protein_coding
ENST00000378705protein_coding
ENST00000378707protein_coding
ENST00000378723protein_coding
ENST00000420596protein_coding
ENST00000447523protein_coding
ENST00000448370protein_coding
ENST00000463609protein_coding_CDS_not_defined
ENST00000471779nonsense_mediated_decay
ENST00000472266protein_coding_CDS_not_defined
ENST00000472681protein_coding_CDS_not_defined
ENST00000474231protein_coding
ENST00000475732retained_intron
ENST00000480751protein_coding_CDS_not_defined
ENST00000481143protein_coding_CDS_not_defined
ENST00000488902protein_coding_CDS_not_defined
ENST00000493412protein_coding
ENST00000541735protein_coding
ENST00000619831protein_coding
ENST00000620040protein_coding
ENST00000634285retained_intron
ENST00000634315protein_coding_CDS_not_defined
ENST00000679437protein_coding
ENST00000679482retained_intron
ENST00000679641nonsense_mediated_decay
ENST00000679669retained_intron
ENST00000679706protein_coding
ENST00000679850retained_intron
ENST00000680162protein_coding
ENST00000680216protein_coding
ENST00000680355protein_coding
ENST00000680557protein_coding
ENST00000680701retained_intron
ENST00000680768protein_coding
ENST00000680961nonsense_mediated_decay
ENST00000681026protein_coding
ENST00000681153protein_coding
ENST00000681334protein_coding
ENST00000681646retained_intron
ENST00000681654retained_intron
ENST00000681839protein_coding
ENST00000681870protein_coding
ENST00000681989protein_coding_CDS_not_defined
ENST00000682071protein_coding
ENST00000682135retained_intron
ENST00000682183retained_intron
ENST00000682207retained_intron
ENST00000682238protein_coding
ENST00000682307retained_intron
ENST00000682322protein_coding
ENST00000682437retained_intron
ENST00000682439retained_intron
ENST00000682584retained_intron
ENST00000682600protein_coding
ENST00000682746retained_intron
ENST00000682769protein_coding_CDS_not_defined
ENST00000682870retained_intron
ENST00000682899retained_intron
ENST00000682924nonsense_mediated_decay
ENST00000683117retained_intron
ENST00000683309protein_coding_CDS_not_defined
ENST00000683450retained_intron
ENST00000683503retained_intron
ENST00000683509protein_coding_CDS_not_defined
ENST00000683658retained_intron
ENST00000683675protein_coding_CDS_not_defined
ENST00000683709protein_coding_CDS_not_defined
ENST00000683851retained_intron
ENST00000683957protein_coding_CDS_not_defined
ENST00000683985retained_intron
ENST00000683995retained_intron
ENST00000684056retained_intron
ENST00000684072retained_intron
ENST00000684103retained_intron
ENST00000684130protein_coding
ENST00000684165retained_intron
ENST00000684237protein_coding
ENST00000684247retained_intron
ENST00000684292retained_intron
ENST00000684342retained_intron
ENST00000684350retained_intron
ENST00000684357retained_intron
ENST00000684660retained_intron

Total: 91 transcripts (protein-coding: 46, protein_coding_CDS_not_defined: 13, retained_intron: 28, nonsense_mediated_decay: 4)


RefSeq Transcripts (NM_ mRNA accessions, 29 total)

NM_ AccessionMANE Select
NM_000109
NM_001005244
NM_001005246
NM_001314034
NM_001314035
NM_001314036
NM_001314037
NM_001314038
NM_001326687
NM_001370876
NM_004006✓ MANE Select
NM_004009
NM_004010
NM_004011
NM_004012
NM_004013
NM_004014
NM_004015
NM_004016
NM_004017
NM_004018
NM_004019
NM_004020
NM_004021
NM_004022
NM_004023
NM_007868
NM_012698
NM_131785

Total: 29 NM_ mRNA accessions (MANE Select: NM_004006)


CCDS IDs (11 total)

  • CCDS14229
  • CCDS14230
  • CCDS14231
  • CCDS14232
  • CCDS14233
  • CCDS14234
  • CCDS48091
  • CCDS55394
  • CCDS55395
  • CCDS94585
  • CCDS94586

MANE SELECT Transcript Exons (ENST00000357033 / NM_004006, 79 exons)

ENSE IDStartEndStrandChromosome
ENSE000039014203111922231121930X
ENSE000011589373120396131204118X
ENSE000011589433120658231206667X
ENSE000011589503120949831209699X
ENSE000039882193221691632217063X
ENSE000039882203192959631929745X
ENSE000039882213231008232310276X
ENSE000039882223234840632348528X
ENSE000039882233193208031932227X
ENSE000039882243234594332346080X
ENSE000039882253236278832362958X
ENSE000039882263234210032342282X
ENSE000039882273228752932287701X
ENSE000039882283196833931968514X
ENSE000039882293234313432343286X
ENSE000011573703236458232364710X
ENSE000009784093236502032365199X
ENSE000014355733239007132390181X
ENSE000014352183238950132389674X
ENSE000014359543238051032380680X
ENSE000014360393238631032386465X
ENSE000014359063246343932463594X
ENSE000014362713246458632464699X
ENSE000014351463244118032441314X
ENSE000014351513248491932485099X
ENSE000014351803246849832468710X
ENSE000014367313245466232454832X
ENSE000014365453244845632448638X
ENSE000014366553250175532501842X
ENSE000014356053254515932545334X
ENSE000014349953269787032697998X
ENSE000014365063269911232699293X
ENSE000014365123249127732491518X
ENSE000014355833247216432472309X
ENSE000014355813264496432645152X
ENSE000014356873264413232644313X
ENSE000014357963261430332614453X
ENSE000039882243234594332346080X
ENSE000014362673251800832518131X
ENSE000014348233256570232565881X
ENSE000036399213257374532573846X
ENSE000014790143257353032573637X
ENSE000036910973284972832849820X
ENSE000016402063284478332844860X
ENSE000017894983281646832816640X
ENSE000016852823282329532823387X
ENSE000014367423259575732595876X
ENSE000014349993243824132438390X
ENSE000014360393238631032386465X
ENSE000039882213231008232310276X
ENSE000012586413162767331627862X
ENSE000012132153172963131729748X
ENSE000035927293183671831836819X
ENSE000036878993187518831875373X
ENSE000039882233193208031932227X
ENSE000039882203192959631929745X
ENSE000012591713144448131444627X
ENSE000012591933147810631478374X
ENSE000036759803147898331479103X
ENSE000036690693149678831496944X
ENSE000035002363150728131507453X
ENSE000035540363122304731223121X
ENSE000036377023134855631348634X
ENSE000036208503126095531261016X
ENSE000036936493132359831323658X
ENSE000011589123117793231177970X
ENSE000010340393117866931178805X
ENSE000035691183118037031180481X
ENSE000036690713177396031774192X
ENSE000010340453118273831182904X
ENSE000010340853114629131146414X
ENSE000014361073112664231126673X
ENSE000011588743113410231134194X
ENSE000011588873114727531147518X
ENSE000011588953116944331169601X
ENSE000011589013117234831172413X
ENSE000011589053117353931173604X
ENSE000012131993165799031658144X
ENSE000012585773167937531679586X
ENSE000036393033181997531820083X
ENSE000014345853241175232411913X
ENSE000034630643302013933020200X
ENSE000018991413321128233211549X

Total: 79 exons for ENST00000357033 (MANE SELECT / NM_004006)

Protein identifiers

UniProt Accessions

Human dystrophin - reviewed (canonical):

  • P11532 (reviewed)

Note: Only one human UniProt entry exists. Other dystrophin entries in UniProt are from different organisms (rat: P11530, mouse: P11531, chicken: P11533, Drosophila: Q0KI50, Q9VDW6, Q9VDW3, Q7YU29).

RefSeq Protein (NP_ accessions)

Reviewed RefSeq proteins:

  • NP_000100
  • NP_003997 (MANE Select)
  • NP_004000
  • NP_004001
  • NP_004002
  • NP_004003
  • NP_004004
  • NP_004005
  • NP_004006
  • NP_004007
  • NP_004008
  • NP_004009
  • NP_004010
  • NP_004011
  • NP_004012
  • NP_004013
  • NP_004014

Validated RefSeq proteins:

  • NP_001005244
  • NP_001005246
  • NP_001357805

Reviewed (isoform variants):

  • NP_001300963
  • NP_001300964
  • NP_001300965
  • NP_001300966
  • NP_001300967

Provisional RefSeq protein:

  • NP_001313616
  • NP_571860

Other RefSeq proteins:

  • NP_031894
  • NP_036830

Protein Domains and Families

InterPro (IPR) entries:

IDNameType
IPR000433Zinc finger, ZZ-typeDomain
IPR001202WW domainDomain
IPR001589Actinin-type actin-binding domain, conserved siteConserved Site
IPR001715Calponin homology domainDomain
IPR002017Spectrin repeatRepeat
IPR011992EF-hand domain pairHomologous Superfamily
IPR015153EF-hand domain, type 1Domain
IPR015154EF-hand domain, type 2Domain
IPR018159Spectrin/alpha-actininRepeat
IPR035436Dystrophin/utrophinFamily
IPR036020WW domain superfamilyHomologous Superfamily
IPR036872CH domain superfamilyHomologous Superfamily
IPR043145Zinc finger, ZZ-type superfamilyHomologous Superfamily
IPR050774KCMF1 and DystrophinFamily

Pfam (PF) entries:

  • PF00307
  • PF00397
  • PF00435
  • PF00569
  • PF09068
  • PF09069

SMART (SM) entries:

  • SM00033
  • SM00150
  • SM00291
  • SM00456

Antibody Availability

No antibody resources are currently mapped in the biobtree database for human dystrophin. However, commercial antibody sources exist for dystrophin detection (e.g., from immunology suppliers), which should be verified through external antibody databases (e.g., Antibodies Online, Abcam, Santa Cruz Biotechnology).

Structure

Experimental structures: 5 PDB entries (all X-ray crystallography)

PDB IDTitleMethodResolution (Å)
1DXXN-terminal Actin-binding Domain of Human DystrophinX-RAY DIFFRACTION2.6
1EG3Dystrophin WW Domain Fragment in Complex with Beta-Dystroglycan PeptideX-RAY DIFFRACTION2.0
1EG4Dystrophin WW Domain Fragment in Complex with Beta-Dystroglycan PeptideX-RAY DIFFRACTION2.0
3UUNN-terminal First Spectrin Repeat of DystrophinX-RAY DIFFRACTION2.3
9D58Tandem Calponin Homology Actin-binding Domain (Closed-state Conformation)X-RAY DIFFRACTION1.94

Total experimental structures: 5

Predicted structure:

  • Model ID: AF-P11532-F1
  • Database: AlphaFold2
  • Coverage: Full-length (3,685 residues)
  • pLDDT: Not retrieved from biobtree (recommend checking AlphaFold Database directly for confidence metrics)

Based on my search through the biobtree database, I found orthologs for human DMD in several organisms. For Drosophila and yeast, there do not appear to be clear orthologous genes in the database.

Cross-species orthologs

OrganismGene IDSymbol
Mouse (Mus musculus)13405Dmd
Rat (Rattus norvegicus)24907Dmd
Zebrafish (Danio rerio)83773dmd
Fruit fly (Drosophila melanogaster)nonenone
Worm (C. elegans)173038dys-1
Yeast (S. cerevisiae)nonenone

Clinical variants & AI predictions

Clinical Variants (ClinVar)

Summary

  • Total variants: ~11,261
  • Associated conditions: Duchenne muscular dystrophy (DMD), Becker muscular dystrophy (BMD), dilated cardiomyopathy

Classification Breakdown (sample-based estimate from reviewed entries)

ClassificationStatus
Uncertain significancePredominant (~50-60%)
Conflicting classifications~10-15%
Likely benign~10-15%
Pathogenic / Likely pathogenic~5-10%
Benign<5%

Top Pathogenic/Likely Pathogenic Variants (representative samples)

Variant IDHGVS NotationCondition
1001786NM_004006.3(DMD):c.2400C>A (p.Ser800Arg)Duchenne/Becker muscular dystrophy
1003542NM_004006.3(DMD):c.5684A>T (p.Asp1895Val)Duchenne/Becker muscular dystrophy, dilated cardiomyopathy
1003878NM_004006.3(DMD):c.961-4delDuchenne/Becker muscular dystrophy
1003362NM_004006.3(DMD):c.10951G>A (p.Asp3651Asn)Duchenne/Becker muscular dystrophy
1004650NM_004006.3(DMD):c.4705T>C (p.Cys1569Arg)Duchenne/Becker muscular dystrophy

AI-Based Variant Effect Predictions

Splice Effect Predictions (SpliceAI)

  • Total predictions: 7,041
  • Effect types: donor_gain, donor_loss, acceptor_gain, acceptor_loss
  • Score range: 0.20–0.95

Top 30 SpliceAI Predictions (by score)

PositionVariantGeneEffectScore
1X:31121931:C:CCDMDacceptor_gain0.95
2X:31121927:TGTC:TDMDacceptor_gain0.90
3X:31121928:GTC:GDMDacceptor_gain0.86
4X:31121766:A:ACDMDdonor_gain0.86
5X:31121767:C:CCDMDdonor_gain0.86
6X:31121758:AAACT:ADMDdonor_gain0.72
7X:31126709:A:TDMDacceptor_gain0.95
8X:31121929:TC:TDMDacceptor_gain0.81
9X:31121930:CC:CDMDacceptor_gain0.81
10X:31126708:C:CTDMDacceptor_gain0.82
11X:31121276:G:CDMDdonor_gain0.54
12X:31121762:T:TADMDdonor_gain0.52
13X:31121768:T:CDMDdonor_gain0.60
14X:31126639:A:TDMDacceptor_gain0.76
15X:31126676:T:TCDMDacceptor_gain0.22
16–30(additional variants with scores 0.20–0.75)DMDmixed0.20–0.75

AlphaMissense Pathogenicity Predictions

  • Total likely_pathogenic variants: 100+ (pagination available)
  • Pathogenicity score range: 0.565–1.000
  • Classification: am_class=“likely_pathogenic”

Top 30 Likely Pathogenic Missense Variants (by am_pathogenicity score)

PositionGenomic PositionProtein Variantam_pathogenicityInterpretation
1X:31147294:A:GL3593P1.000Highest confidence
2X:31147290:C:AR3594S0.997Very high confidence
3X:31147290:C:GR3594S0.997Very high confidence
4X:31147296:C:AR3592S0.997Very high confidence
5X:31147296:C:GR3592S0.997Very high confidence
6X:31147294:A:CL3593R0.997Very high confidence
7X:31147291:C:AR3594M0.995Very high confidence
8X:31147291:C:GR3594T0.995Very high confidence
9X:31147297:C:AR3592M0.995Very high confidence
10X:31147297:C:GR3592T0.995Very high confidence
11X:31147282:A:GL3597P0.998Very high confidence
12X:31147285:A:GL3596P0.999Very high confidence
13X:31147294:A:TL3593Q0.999Very high confidence
14X:31147282:A:TL3597Q0.991Very high confidence
15X:31147298:T:CR3592G0.992Very high confidence
16X:31147288:T:GQ3595P0.982Very high confidence
17X:31147282:A:CL3597R0.980Very high confidence
18X:31134149:A:GL3656S0.977High confidence
19X:31147295:G:CL3593V0.976High confidence
20X:31147285:A:CL3596R0.973High confidence
21X:31147292:T:AR3594W0.965High confidence
22X:31147300:T:GH3591P0.955High confidence
23X:31134116:A:GF3667S0.952High confidence
24X:31134116:A:CF3667C0.941High confidence
25X:31134128:A:GL3663P0.978High confidence
26X:31134128:A:TL3663H0.970High confidence
27X:31146310:A:CS3634R0.919High confidence
28X:31146310:A:TS3634R0.919High confidence
29X:31134128:A:CL3663R0.926High confidence
30X:31134115:G:CF3667L0.984Very high confidence

Pathways & Gene Ontology

Biological Pathways

Reactome Pathways: 3 total

Pathway IDPathway Name
R-HSA-3000171Non-integrin membrane-ECM interactions
R-HSA-390522Striated Muscle Contraction
R-HSA-9913351Formation of the dystrophin-glycoprotein complex (DGC)

MSigDB Gene Sets: 789 total

Distribution by collection:

  • C2 (Curated pathways/databases): ~20 sets (KEGG, etc.)
  • C3 (Regulatory targets): ~400 sets (transcription factor binding sites, miRNA targets)
  • C4 (Computational): ~100 sets (cancer modules, gene neighborhoods)
  • C5 (GO terms): ~269 sets (GO biological process, molecular function, cellular component)

Notable C2/pathway-related sets from DMD membership:

  • KEGG_VIRAL_MYOCARDITIS
  • KEGG_ARRHYTHMOGENIC_RIGHT_VENTRICULAR_CARDIOMYOPATHY_ARVC

Gene Ontology Annotations

Biological Process: 25 terms

GO IDTerm
GO:0002027regulation of heart rate
GO:0007517muscle organ development
GO:0007519skeletal muscle tissue development
GO:0008104intracellular protein localization
GO:0010880regulation of release of sequestered calcium ion into cytosol by sarcoplasmic reticulum
GO:0010881regulation of cardiac muscle contraction by regulation of the release of sequestered calcium ion
GO:0010976positive regulation of neuron projection development
GO:0014809regulation of skeletal muscle contraction by regulation of release of sequestered calcium ion
GO:0014819regulation of skeletal muscle contraction
GO:0035633maintenance of blood-brain barrier
GO:0035994response to muscle stretch
GO:0043043peptide biosynthetic process
GO:0044458motile cilium assembly
GO:0045666positive regulation of neuron differentiation
GO:0046716muscle cell cellular homeostasis
GO:0048666neuron development
GO:0055001muscle cell development
GO:0060048cardiac muscle contraction
GO:0065003protein-containing complex assembly
GO:0086001cardiac muscle cell action potential
GO:0090257regulation of muscle system process
GO:0090287regulation of cellular response to growth factor stimulus
GO:0099536synaptic signaling
GO:1902305regulation of sodium ion transmembrane transport
GO:1903169regulation of calcium ion transmembrane transport

Molecular Function: 8 terms

GO IDTerm
GO:0002162dystroglycan binding
GO:0003779actin binding
GO:0005200structural constituent of cytoskeleton
GO:0008270zinc ion binding
GO:0008307structural constituent of muscle
GO:0017022myosin binding
GO:0017166vinculin binding
GO:0050998nitric-oxide synthase binding

Cellular Component: 18 terms

GO IDTerm
GO:0005634nucleus
GO:0005829cytosol
GO:0005856cytoskeleton
GO:0005886plasma membrane
GO:0009986cell surface
GO:0016010dystrophin-associated glycoprotein complex
GO:0016013syntrophin complex
GO:0030018Z disc
GO:0030055cell-substrate junction
GO:0030175filopodium
GO:0031527filopodium membrane
GO:0032991protein-containing complex
GO:0042383sarcolemma
GO:0043034costamere
GO:0044306neuron projection terminus
GO:0045121membrane raft
GO:0045202synapse
GO:0045211postsynaptic membrane

Protein interactions & networks

Protein-protein interactions

Total interaction count: ~2,170 interactions (STRING database)

TOP 30 highest-confidence interacting proteins (STRING scores 0-1000):

RankUniprot IDGene SymbolSTRING ScoreEvidence
1Q14118SYNM999Multiple evidence types
2P46939DTNB972Dystrophin-associated protein
3Q13424UTRN966Utrophin, DMD homolog
4Q13326SNTB1965Syntrophin beta-1
5Q92629SNTG2965Syntrophin gamma-2
6O75923SNTA1959Syntrophin alpha-1
7P24043DTNA957Dystrophin-associated protein alpha
8Q13884CTNNAL1956Catenin alpha-like 1
9Q9Y4J8DAG1936Beta-dystroglycan
10P18206CTNNA1933Catenin alpha-1
11Q9NSN8HAUS1932SPL assembly factor
12Q16585SNTB2923Syntrophin beta-2
13O00468LYPLA1912Lysophospholipase
14P78508GOSR2907SNARE protein
15P56539SNTG1901Syntrophin gamma-1
16Q15149ABLIM1897LIM actin-binding protein
17P29475NOS1895Nitric oxide synthase
18Q13425CHRNA1868Nicotinic acetylcholine receptor
19Q14315NEB856Nebulin
20Q8WZ42SPTAN1849Spectrin alpha
21P02593COL4A1830Collagen IV alpha-1
22Q9NY99ITGA7830Integrin alpha-7
23P20929CTNND1829Catenin delta-1
24P27482ANK1827Ankyrin-1
25Q8TD86MACF1827Microtubule-actin crosslinker
26Q96GE6CCDC88C827Coiled-coil protein
27Q9NZT1DLG3827Disc-large homolog 3
28Q9UBF9DCAF8826DCAF8 protein
29Q9Y4G6KLHL41825Kelch-like protein 41
30P02571COL4A2670Collagen IV alpha-2

Additional databases: BioGRID (175 interactions), IntAct (131 interactions with confidence scores 0.37-0.89)


Protein similarity

Structural/embedding similarity (ESM2 embeddings): 64 similar proteins identified

TOP 20 structurally similar proteins (highest max similarity scores):

RankUniprot IDMax SimilarityAvg SimilarityNotes
1A2CG491.00000.9857Dystrophin isoform
2O759621.00000.9861Spectrin-like protein
3F1M0Z11.00000.9861Dystrophin-related
4P0CE941.00000.9858Conserved ortholog
5P0CE951.00000.9858Conserved ortholog
6Q0KL021.00000.9861Dystrophin variant
7Q71LX41.00000.9857Large muscle protein
8D3ZHV20.99990.9899Rod domain protein
9P115310.99990.9887Dystrophin isoform
10P115320.99980.9888DMD protein (self)
11O975920.99980.9888Dystrophin ortholog
12Q9QXZ00.99990.9898Muscle-associated protein
13P304270.99990.9869Actin-binding protein
14P159240.99980.9848Myosin-related
15E9Q5570.99990.9852Cytoplasmic protein
16E9Q8Q60.99970.9834Muscle protein
17Q5GN480.99980.9887Conserved domain protein
18Q8NF910.99970.9890Rod-like structure
19Q6ZWR60.99970.9890Actin interaction protein
20O604370.99970.9865Muscle dystrophy-related

Sequence homology (DIAMOND/BLASTP): 120 homologous proteins identified

TOP 20 sequence homologs (highest sequence identity %):

RankUniprot IDIdentity %BitscoreAlignment Quality
1Q9UPN3100.0010401Perfect match/isoform
2Q0KI50100.001834Identical protein
3Q9QXZ099.801803Near-identical
4D3ZHA099.305460High conservation
5D3ZBP499.101797Splice variant
6P1281499.901774Ortholog
7Q7TPR499.301763Rod domain
8Q2PFV799.901774Conserved sequence
9Q3ZC5598.901740High similarity
10P5778098.901781Muscle isoform
11Q0108298.304398Strong alignment
12O4370799.801804Full-length homolog
13P0509498.301740Cysteine-rich region
14Q6226198.304403Conserved domain
15Q9QXS196.605404Moderate conservation
16P2133397.105162Structural homolog
17P3042796.605401Actin-binding domain
18P1153096.206642Related dystrophin
19P1153196.206636Dystrophin isoform
20O7536997.605066Strong homology

Transcription factor regulatory data

DMD is not a transcription factor. The DMD gene encodes dystrophin, a structural cytoplasmic protein involved in the dystrophin-associated protein complex that anchors the cytoskeleton to the cell membrane. It has no known DNA-binding or transcriptional regulatory functions.

Upstream Regulators

Transcription factors regulating DMD expression:

TFRegulationEvidence/Confidence
MYOD1ActivationHigh
SP1ActivationHigh
SP3ActivationHigh
SRFUnknownHigh
TFAP2AUnknownHigh
YY1UnknownHigh
HRUnknownLow

Note: Data source: CollecTRI database. Evidence confidence indicates experimental validation level; specific evidence types (ChIP-seq/experimentally validated/predicted) are not differentiated in the current dataset. MYOD1, SP1, and SP3 are the best-characterized regulators with activation-level evidence.

Drug & pharmacology data

DMD is a known drug target. The dystrophin (DMD) gene underlying Duchenne muscular dystrophy has 96 molecules in active development and clinical use across ChEMBL and clinical trial data.

Targeting Molecules (Total: 96 in trials)

Top 30 by development phase:

Phase 4 (Approved, 36 molecules):

  1. ETEPLIRSEN (Oligonucleotide) – exon-skipping; restore dystrophin reading frame
  2. VILTOLARSEN (Oligonucleotide) – exon 53-skipping morpholino
  3. GOLODIRSEN (Oligonucleotide) – exon 53-skipping
  4. CASIMERSEN (Oligonucleotide) – exon 45-skipping
  5. DELANDISTROGENE MOXEPARVOVEC (Gene therapy) – AAV-mediated microdystrophin
  6. DEFLAZACORT (Small molecule) – corticosteroid; anti-inflammatory
  7. VAMOROLONE (Small molecule) – nonsteroidal anti-inflammatory; utrophin stabilizer
  8. ATALUREN (Small molecule) – nonsense mutation readthrough; restore full-length dystrophin
  9. GIVINOSTAT (Small molecule) – histone deacetylase (HDAC) inhibitor; reduce fibrosis
  10. PREDNISONE (Small molecule) – corticosteroid; standard of care
  11. PREDNISOLONE (Small molecule) – corticosteroid
  12. CARVEDILOL (Small molecule) – beta-blocker; cardioprotection
  13. RAMIPRIL (Small molecule) – ACE inhibitor; cardioprotection
  14. LISINOPRIL (Small molecule) – ACE inhibitor
  15. SILDENAFIL (Small molecule) – phosphodiesterase-5 inhibitor; vasodilation
  16. TADALAFIL (Small molecule) – phosphodiesterase-5 inhibitor
  17. IDEBENONE (Small molecule) – antioxidant; cardiac protection
  18. BISOPROLOL (Small molecule) – beta-blocker; cardioprotection
  19. METOPROLOL SUCCINATE (Small molecule) – beta-blocker
  20. SPIRONOLACTONE (Small molecule) – aldosterone antagonist; cardioprotection
  21. GLUTAMINE (Small molecule) – amino acid; muscle metabolism
  22. SATRALIZUMAB (Antibody) – IL-6 pathway inhibitor
  23. GENTAMICIN (Small molecule) – nonsense suppressor aminoglycoside
  24. PENTOXIFYLLINE (Small molecule) – rheological agent; anti-inflammatory
  25. IBUPROFEN (Small molecule) – NSAID
  26. ISOSORBIDE DINITRATE (Small molecule) – nitrate; vasodilation
  27. LOSARTAN (Small molecule) – angiotensin II receptor antagonist
  28. NEBIVOLOL (Small molecule) – beta-blocker; cardioprotection
  29. EPLERENONE (Small molecule) – aldosterone antagonist
  30. TAMOXIFEN (Small molecule) – estrogen receptor modulator; utrophin induction

Phase 3 (10 molecules):

  1. TALDEFGROBEP ALFA (Protein) – activin A inhibitor; reduce fibrosis
  2. FORDADISTROGENE MOVAPARVOVEC (Gene therapy) – AAV-mediated microdystrophin
  3. EDASALONEXENT (Small molecule) – NF-κB inhibitor
  4. RAMATERCEPT (Protein) – TNF-α receptor antagonist
  5. DOMAGROZUMAB (Antibody) – myostatin/GDF8 inhibitor
  6. EPIGALOCATECHIN GALLATE (Small molecule) – antioxidant
  7. UBIDECARENONE (Small molecule) – CoQ10; antioxidant
  8. ARGININE (Small molecule) – amino acid; vasodilation
  9. GLUCAGON-LIKE PEPTIDE II (Protein) – gut hormone; metabolic support
  10. ARTENIMOL (Small molecule) – antimalarial derivative

Phase 2 (15 molecules):

  1. SUVODIRSEN (Oligonucleotide) – exon 51-skipping
  2. DRISAPERSEN (Oligonucleotide) – exon 51-skipping morpholino
  3. VESLETEPLIRSEN (Oligonucleotide) – exon 53-skipping
  4. BROGIDIRSEN (Oligonucleotide) – exon 53-skipping
  5. DERAMIOCEL (Cell therapy) – allogeneic cardiosphere-derived cells
  6. HALOFUGINONE (Small molecule) – TGF-β inhibitor; anti-fibrotic
  7. EZUTROMID (Small molecule) – dystrophin translocation enhancer
  8. BOCIDELPAR (Small molecule) – NF-κB inhibitor
  9. RIMEPORIDE (Small molecule) – cardiac sodium-calcium exchanger inhibitor
  10. OMPENACLID (Small molecule) – phosphatidylserine-targeted therapy
  11. OXATOMIDE (Small molecule) – mast cell stabilizer
  12. FLAVOCOXID (Unknown) – flavonoid antioxidant
  13. DEXMEDETOMIDINE (Small molecule) – alpha-2 agonist
  14. MEDETOMIDINE (Small molecule) – alpha-2 agonist
  15. DOCONEXENT (Small molecule) – fatty acid derivative

Clinical Trials (Top 20 by status/phase):

  1. NCT05096221 – Gene therapy (Delandistrogene moxeparvovec/SRP-9001) | COMPLETED | Phase 3 | Ambulatory & non-ambulatory
  2. NCT04060199 – Viltolarsen (RACER53) | COMPLETED | Phase 3 | Ambulatory boys
  3. NCT02851797 – Givinostat | COMPLETED | Phase 3 | Ambulatory patients
  4. NCT03703882 – Edasalonexent | COMPLETED | Phase 3 | Ambulatory boys
  5. NCT02500381 – Casimersen & Golodirsen | COMPLETED | Phase 3 | DMD patients
  6. NCT01826487 – Ataluren (nonsense mutation) | COMPLETED | Phase 3 | nmDMD
  7. NCT02255552 – Eteplirsen | COMPLETED | Phase 3 | DMD patients
  8. NCT01557400 – Ataluren (Europe/Israel/Australia/Canada) | COMPLETED | Phase 3 | nmDBMD
  9. NCT04587908 – TAS-205 (REACH-DMD) | ACTIVE_NOT_RECRUITING | Phase 3 | DMD patients
  10. NCT05881408 – Delandistrogene moxeparvovec (SRP-9001) | ACTIVE_NOT_RECRUITING | Phase 3 | Non-ambulatory & ambulatory
  11. NCT07160634 – SGT-003 gene therapy (IMPACT DUCHENNE) | RECRUITING | Phase 3 | Ambulant males
  12. NCT05933057 – Givinostat | RECRUITING | Phase 3 | Non-ambulant patients
  13. NCT04281485 – PF-06939926 (gene therapy candidate) | ACTIVE_NOT_RECRUITING | Phase 3 | DMD
  14. NCT03992430 – High-dose Eteplirsen (MIS51ON) | ACTIVE_NOT_RECRUITING | Phase 3 | DMD
  15. NCT05126758 – Deramiocel (CAP-1002) | ACTIVE_NOT_RECRUITING | Phase 3 | Ambulatory & non-ambulatory
  16. NCT03373968 – Givinostat long-term safety | RECRUITING | Phase 2/3 | DMD
  17. NCT05195775 – Tadalafil as adjuvant therapy | ACTIVE_NOT_RECRUITING | Phase 2/3 | DMD
  18. NCT05693142 – RGX-202 gene therapy (AFFINITY DUCHENNE) | RECRUITING | Phase 2/3 | DMD
  19. NCT04768062 – Viltolarsen (RACER53-X) | ACTIVE_NOT_RECRUITING | Phase 3 | Ambulant boys
  20. NCT05066633 – Metoprolol (add-on to standard care) | RECRUITING | Phase 3 | Cardioprotection

Pharmacogenomics & Dosing Guidelines

No specific DMD pharmacogenomics interaction database entries found in available datasets. However, key clinical considerations:

  • Exon-skipping drugs (eteplirsen, viltolarsen, golodirsen, casimersen, drisapersen): Efficacy depends on DMD mutation type; benefit restricted to patients with mutations amenable to specific exon-skipping strategies. Genetic sequencing of DMD mutations is required for patient selection.
  • Gene therapies (delandistrogene moxeparvovec): Require absence of pre-existing AAV neutralizing antibodies (AAV serotype-dependent).
  • Corticosteroids (prednisone, deflazacort, vamorolone): Standard dosing established; vamorolone designed to reduce steroid side effects.
  • Ataluren (nonsense readthrough): Efficacy limited to nonsense (stop-codon) mutations (~13% of DMD cases).

Pharmacogenomics at gene level: HGNC:2928 (DMD) has 1 xref to PharmGKB, indicating documented but limited pharmacogenomics annotations in standard databases. Dosing is primarily phenotype-driven (ambulatory status, cardiac function) rather than genotype-driven beyond mutation-type stratification.

Based on the biobtree data I’ve retrieved, here’s the comprehensive expression profile for DMD:

Expression profiles

Tissue Expression (Bgee database)

DMD shows ubiquitous expression across human tissues (295 present calls, max expression score 98.90). Top 30 tissues/conditions by expression score:

TissueExpression ScoreQuality
Trigeminal ganglion98.90Gold
Skeletal muscle tissue of rectus abdominis98.70Gold
Dorsal root ganglion98.12Gold
Seminal vesicle97.61Gold
Sural nerve97.47Gold
Biceps brachii97.46Gold
Vastus lateralis97.34Gold
Skeletal muscle tissue of biceps brachii97.29Gold
Blood vessel layer97.19Gold
Quadriceps femoris96.94Gold
Endothelial cell96.73Gold
Saphenous vein96.67Gold
Body of tongue96.58Gold
Cardiac muscle of right atrium96.49Gold
Left ventricle myocardium96.48Gold
Skeletal muscle tissue96.45Gold
Cauda epididymis96.42Gold
Myocardium96.28Gold
Tibial nerve96.25Gold
Tibialis anterior96.21Gold
Heart right ventricle96.17Gold
Colonic epithelium95.89Gold
Vena cava95.86Gold
Muscle tissue95.68Gold
Hindlimb stylopod muscle95.51Gold
Skeletal muscle organ95.39Gold
Muscle organ95.38Gold
Secondary oocyte95.31Gold
Deltoid95.20Gold
Choroid plexus epithelium95.11Gold

Expression pattern: Strong enrichment in skeletal muscle, cardiac muscle, and peripheral nervous system (nerves, ganglia). High expression in vascular tissues (endothelium, blood vessels).

Single-Cell Expression: Notable Datasets

GTEx snRNAseq atlas (E-ANND-2): 209,126 cells from multiple tissues

  • Muscle-enriched cell types: myocyte—skeletal muscle, myocyte—cardiac (fast/slow-twitch), myoepithelial cells
  • Nervous system: Schwann cells (I, II), neuronal cells, satellite cells
  • Vascular: endothelial cells (cardiac microvascular, lymphatic, vascular subtypes), pericyte/smooth muscle cells
  • Other tissues: fibroblasts, adipocytes, immune cells (macrophages, T/B cells)
  • Tissues sampled: Skeletal muscle, heart, skin, breast, lung, esophagus, prostate

DMD ranked #30 among marker genes (score: 46.02, log fold change: 0.92), indicating consistent expression across the diverse cell populations.

Other notable single-cell datasets linking to DMD: COVID-19 peripheral immune response, pancreatic islet cells, hypertrophied heart atlas, fetal liver, retina, and neural tissues—consistent with DMD’s role in multiple cell types beyond skeletal muscle.

Key Patterns

  • Ubiquitous but tissue-enriched: Present in virtually all tissues but highest in muscle (skeletal, cardiac) and nervous system tissues
  • Cell-type specificity: Strong expression in myocytes and endothelial cells; present but variable in fibroblasts, immune cells, and epithelial cells
  • Functional relevance: Pattern reflects dystrophin’s roles in muscle contraction, neuromuscular junction integrity, and vascular function

Disease associations

Mendelian / Monogenic Diseases

Disease NameOMIM/Orphanet/MondoInheritance PatternEvidence LevelGene Source
Duchenne muscular dystrophyOMIM:310200 / Orphanet:98896 / MONDO:0010679X-linkedDefinitiveGENCC, ClinVar
Becker muscular dystrophyOMIM:300376 / Orphanet:98895 / MONDO:0010311X-linkedDefinitiveGENCC, ClinVar
Dilated cardiomyopathy 3BOMIM:302045 / MONDO:0010542X-linkedDefinitiveGENCC, ClinVar
Symptomatic form of muscular dystrophy of Duchenne and Becker in female carriersOrphanet:206546 / MONDO:0016899X-linkedSupportiveGENCC, ClinVar
Familial isolated dilated cardiomyopathyOrphanet:154 / MONDO:0016333Autosomal dominantSupportiveGENCC, ClinVar
Non-syndromic X-linked intellectual disabilityOrphanet:777X-linkedSupportiveGENCC, ClinVar
Duchenne and Becker muscular dystrophyOrphanet:262 / MONDO:0016899X-linkedDefinitiveGENCC, ClinVar

Additional related Mondo conditions linked via ClinVar: cardiomyopathy (MONDO:0004994), neuromuscular disease caused by qualitative or quantitative defects of dystrophin (MONDO:0016147), progressive muscular dystrophy (MONDO:0106), dilated cardiomyopathy (MONDO:0005021), muscular dystrophy (MONDO:0020121), familial dilated cardiomyopathy (MONDO:0016333), familial restrictive cardiomyopathy (MONDO:0016340).

Phenotype Associations (HPO Terms - Top 30)

HPO IDPhenotypeHPO IDPhenotype
HP:0001417X-linked inheritanceHP:0002938Lumbar hyperlordosis
HP:0001324Muscle weaknessHP:0002942Thoracic kyphosis
HP:0003560Muscular dystrophyHP:0002943Thoracic scoliosis
HP:0003701Proximal muscle weaknessHP:0003391Gowers sign
HP:0003236Elevated circulating creatine kinase concentrationHP:0003323Progressive muscle weakness
HP:0001638CardiomyopathyHP:0003324Generalized muscle weakness
HP:0001644Dilated cardiomyopathyHP:0001435Abnormality of the shoulder girdle musculature
HP:0002505Loss of ambulationHP:0003202Skeletal muscle atrophy
HP:0003707Calf muscle pseudohypertrophyHP:0002650Scoliosis
HP:0002515Waddling gaitHP:0006380Knee flexion contracture
HP:0001635Congestive heart failureHP:0002380Fasciculations
HP:0002540Inability to walkHP:0008981Calf muscle hypertrophy
HP:0001371Flexion contractureHP:0001771Achilles tendon contracture
HP:0002540Inability to walkHP:0002194Delayed gross motor development
HP:0002527FallsHP:0030097Absent muscle dystrophin expression

Additional top phenotypes: respiratory insufficiency due to muscle weakness (HP:0002747), cognitive impairment (HP:0100543), elevated hepatic transaminase (HP:0002910), global developmental delay (HP:0001263), intellectual disability (HP:0001256), motor delay (HP:0001270), gait disturbance (HP:0001288), myopathy (HP:0003198).

Complex Disease / GWAS Associations (Top 11)

GWAS TraitP-valueChromosomeStudy ID
Mild influenza (H1N1) infection5.0 × 10⁻¹⁴XGCST003124_19
Influenza A (H1N1) infection4.0 × 10⁻¹¹XGCST003125_11
Major depressive disorder1.0 × 10⁻⁹XGCST012304_6
Body mass index1.0 × 10⁻⁸XGCST004034_1
Ankle-brachial index7.0 × 10⁻⁸XGCST009139_1
Temporomandibular joint disorder4.0 × 10⁻⁸XGCST004034_1
Waist circumference4.0 × 10⁻⁶XGCST008151_58 / GCST008160_82
Anxiety in major depressive disorder4.0 × 10⁻⁶XGCST002202_3
Itch intensity from mosquito bite5.0 × 10⁻⁶XGCST004862_35
Response to anti-depressant treatment2.0 × 10⁻⁶XGCST001308_20

Structured Data Sources

Generated with Claude Haiku 4.5 + BioBTree MCP, drawing on data BioBTree aggregates from 42 biological databases. Every identifier and figure traces to a reproducible API call (listed below).

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, antibody, bgee, biogrid_interaction, ccds, chembl_molecule, clinical_trials, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gtex, gwas, hgnc, hpa, hpo, intact, interpro, mim, mondo, msigdb, orphanet, ortholog, pdb, pfam, proteomicsdb, reactome, refseq, scxa, smart, spliceai, string_interaction, tabula_sapiens, taxonomy, transcript, uniprot
Generated: 2026-05-25 — For the latest data, query BioBTree directly via MCP or API.
View API calls (180)