MYH7 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human MYH7 — a definitive lookup resource covering: ### …

Provide a comprehensive cross-database identifier and functional mapping reference for human MYH7 — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene MYH7, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene MYH7, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene MYH7 protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene MYH7 protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene MYH7, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene MYH7, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene MYH7, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene MYH7 protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene MYH7, summarize transcription factor regulatory data. If MYH7 is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate MYH7 — names with evidence type (ChIP-seq / predicted / experimentally validated) If MYH7 is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene MYH7 protein as a drug target, summarize pharmacology data. If MYH7 is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If MYH7 is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene MYH7, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene MYH7, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in MYH7: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations

MYH7

You’ve hit your session limit · resets 10:10pm (Europe/Berlin)

Gene identifiers

MYH7 (myosin heavy chain 7)

IdentifierValue
HGNC IDHGNC:7577
HGNC approved symbolMYH7
Ensembl gene IDENSG00000092054
NCBI Entrez Gene ID4625
OMIM gene/locus ID160760
Chromosome14
Start position (GRCh38)23,412,733
End position (GRCh38)23,436,137
Strand− (minus)

Transcript identifiers

Ensembl Transcripts

Total: 41 transcripts (all protein_coding)

Transcript IDBiotype
ENST00000355349protein_coding
ENST00000713768protein_coding
ENST00000713769protein_coding
ENST00000858539protein_coding
ENST00000858540protein_coding
ENST00000858541protein_coding
ENST00000858542protein_coding
ENST00000858543protein_coding
ENST00000858544protein_coding
ENST00000858545protein_coding
ENST00000858546protein_coding
ENST00000858547protein_coding
ENST00000858548protein_coding
ENST00000858549protein_coding
ENST00000858550protein_coding
ENST00000858551protein_coding
ENST00000965935protein_coding
ENST00000965936protein_coding
ENST00000965937protein_coding
ENST00000965938protein_coding
ENST00000965939protein_coding
ENST00000965940protein_coding
ENST00000965941protein_coding
ENST00000965942protein_coding
ENST00000965943protein_coding
ENST00000965944protein_coding
ENST00000965945protein_coding
ENST00000965946protein_coding
ENST00000965947protein_coding
ENST00000965948protein_coding
ENST00000965949protein_coding
ENST00000965950protein_coding
ENST00000965951protein_coding
ENST00000965952protein_coding
ENST00000965953protein_coding
ENST00000965954protein_coding
ENST00000965955protein_coding
ENST00000965956protein_coding
ENST00000965957protein_coding
ENST00000965958protein_coding
ENST00000965959protein_coding

RefSeq mRNA Accessions

NM_ accessions (8 total)

AccessionMANE Select
NM_000257✓ Yes
NM_001112733No
NM_001183707No
NM_001361607No
NM_001407004No
NM_001425737No
NM_017240No
NM_080728No

CCDS IDs

CCDS ID
CCDS9601

Canonical/MANE Select Transcript Exons

Transcript: ENST00000355349 (maps to NM_000257)
Total exons: 40

Exon IDStartEndStrandChromosome
ENSE00004021211234127402341287114
ENSE00001666341234137592341389314
ENSE00001617095234140072341410214
ENSE00001758206234149952341527014
ENSE00001766234234153812341550614
ENSE00001690609234156292341583214
ENSE00001701082234160042341631214
ENSE00000654010234168682341699214
ENSE00001720186234171532341731814
ENSE00001661625234182102341840614
ENSE00000654012234175032341768614
ENSE00001711774234191772341929514
ENSE00000654015234194832341960914
ENSE00001776760234198452342023414
ENSE00000586050234209582342104814
ENSE00001767436234221802342232514
ENSE00001636102234235472342372314
ENSE00001625387234239072342414914
ENSE00000654022234247692342502414
ENSE00001618128234256952342581814
ENSE00000654023234252822342541814
ENSE00001643045234259642342608114
ENSE00001797702234267772342686414
ENSE00000654028234272402342730714
ENSE00000654029234275852342789414
ENSE00001747242234285002342867014
ENSE00000654031234289552342910414
ENSE00001802826234297752342991314
ENSE00000654032234292292342934714
ENSE00000654035234305602343066314
ENSE00001797612234309012343099914
ENSE00001805709234315852343167714
ENSE00001610903234314182343148114
ENSE00000654039234317612343186914
ENSE00001647421234326392343279514
ENSE00000654040234324792343250614
ENSE00002511061234335322343374014
ENSE00000654043234330842343322714
ENSE00001333121234341942343424914
ENSE00001910862234356202343566014

Protein identifiers

UniProt Accessions

  • P12883 (canonical, reviewed) — Myosin-7 / Myosin heavy chain 7
  • A0AAQ5BGU7 (unreviewed) — MYH7 variant

RefSeq Protein Accessions (NP_)

  • NP_000248 (MANE select, primary)
  • NP_001393933 (alternate isoform)

Protein Domains and Families

InterPro Domains:

IDNameType
IPR000048IQ motif, EF-hand binding siteBinding site
IPR001609Myosin head, motor domain-likeDomain
IPR002928Myosin tailDomain
IPR004009Myosin, SH3 domainDomain
IPR008989Myosin S1 fragment, N-terminalHomologous superfamily
IPR014751DNA repair protein XRCC4-like, C-terminalHomologous superfamily
IPR027417P-loop containing nucleoside triphosphate hydrolaseHomologous superfamily
IPR036961Kinesin motor domain superfamilyHomologous superfamily

Pfam Domains:

  • PF00063 (269 UniProt proteins annotated)
  • PF01576 (119 UniProt proteins annotated)
  • PF02736 (85 UniProt proteins annotated)

Antibody Availability

No antibody resources are currently indexed in biobtree for MYH7/P12883. However, antibody availability may be found through:

  • Human Protein Atlas (HPA) — referenced in UniProt xrefs (hpa|1)
  • Commercial antibody vendors (Abcam, Cell Signaling, Santa Cruz, etc.) — indexed separately outside biobtree

Structure

Experimental Structures (PDB)

Total: 37 structures

X-ray Crystallography (23 structures)

  1. 2FXM – 2.7 Å
  2. 2FXO – 2.5 Å
  3. 4DB1 – 2.6 Å
  4. 4P7H – 3.2 Å
  5. 4PA0 – 2.25 Å
  6. 4XA1 – 3.2 Å
  7. 4XA3 – 2.548 Å
  8. 4XA4 – 2.327 Å
  9. 4XA6 – 3.42 Å
  10. 5CHX – 2.3 Å
  11. 5CJ0 – 2.3 Å
  12. 5CJ1 – 2.1 Å
  13. 5CJ4 – 3.102 Å
  14. 5WJ7 – 2.5 Å
  15. 5WJB – 2.905 Å
  16. 5WLQ – 3.104 Å
  17. 5WLZ – 3.5 Å
  18. 5WME – 2.3 Å
  19. 6PF2 – 2.17 Å
  20. 6PFP – 2.2 Å
  21. 9HTF – 2.481 Å
  22. 9HTG – 2.596 Å
  23. 9I8P – 2.601 Å

Cryo-EM (14 structures)

  1. 3DTP – 20.0 Å
  2. 5TBY – 20.0 Å
  3. 8ACT – 3.6 Å
  4. 8EFD – 3.8 Å
  5. 8EFE – 3.8 Å
  6. 8EFH – 3.3 Å
  7. 8EFI – 3.4 Å
  8. 8ENC – 3.6 Å
  9. 8G4L – 6.4 Å
  10. 8ZB7 – 3.19 Å
  11. 8ZI9 – 3.08 Å
  12. 9GZ1 – 3.7 Å
  13. 9GZ2 – 2.9 Å
  14. 9GZ3 – 3.4 Å

Predicted Structures (AlphaFold)

Model ID: AF-P12883-F1 (Version 4)

Confidence Metrics (pLDDT):

  • Global pLDDT: 74.45
  • Very high confidence (pLDDT > 90): 10.17%
  • Confident (80–90): 54.75%
  • Low confidence (70–80): 30.79%
  • Very low confidence (pLDDT < 70): 4.29%

You’ve hit your session limit · resets 10:10pm (Europe/Berlin)

Clinical variants & AI predictions

ClinVar Variant Counts

ClassificationCount
Pathogenic76
Likely Pathogenic100+
Uncertain Significance (VUS)~3800+
Likely Benign~100+
Benign~30+
Total~4107

TOP 30 Pathogenic Variants

Variant IDHGVS NotationAssociated Conditions
14087c.1208G>A (p.Arg403Gln)Hypertrophic cardiomyopathy
14089c.1357C>T (p.Arg453Cys)Hypertrophic cardiomyopathy
14090c.1750G>C (p.Gly584Arg)Hypertrophic cardiomyopathy
14091c.1816G>A (p.Val606Met)Dilated cardiomyopathy
14095c.2167C>T (p.Arg723Cys)Hypertrophic cardiomyopathy
14097c.2722C>G (p.Leu908Val)Hypertrophic cardiomyopathy
14098c.2221G>C (p.Gly741Arg)Hypertrophic cardiomyopathy
14099c.767G>A (p.Gly256Glu)Dilated cardiomyopathy
14101c.1208G>T (p.Arg403Leu)Hypertrophic cardiomyopathy
14102c.1207C>T (p.Arg403Trp)Hypertrophic cardiomyopathy
14104c.2155C>T (p.Arg719Trp)Hypertrophic cardiomyopathy
14105c.2146G>A (p.Gly716Arg)Hypertrophic cardiomyopathy
14106c.2803G>A (p.Glu935Lys)Hypertrophic cardiomyopathy
14107c.2156G>A (p.Arg719Gln)Hypertrophic cardiomyopathy
14108c.1594T>C (p.Ser532Pro)Hypertrophic cardiomyopathy
14125c.2717A>G (p.Asp906Gly)Hypertrophic cardiomyopathy
14129c.1357C>A (p.Arg453Ser)Hypertrophic cardiomyopathy
42838c.1358G>A (p.Arg453His)Hypertrophic cardiomyopathy
42875c.1988G>A (p.Arg663His)Hypertrophic cardiomyopathy
42885c.2167C>G (p.Arg723Gly)Hypertrophic cardiomyopathy
42922c.2681A>G (p.Glu894Gly)Hypertrophic cardiomyopathy
42926c.2711G>A (p.Arg904His)Hypertrophic cardiomyopathy
42933c.2788G>A (p.Glu930Lys)Hypertrophic cardiomyopathy
42992c.4130C>T (p.Thr1377Met)Hypertrophic cardiomyopathy
43106c.788T>C (p.Ile263Thr)Hypertrophic cardiomyopathy
164316c.2710C>T (p.Arg904Cys)Hypertrophic cardiomyopathy
164342c.2207T>C (p.Ile736Thr)Hypertrophic cardiomyopathy
164401c.427C>T (p.Arg143Trp)Hypertrophic cardiomyopathy
177626c.1954A>G (p.Arg652Gly)Hypertrophic cardiomyopathy
177665c.2221G>T (p.Gly741Trp)Hypertrophic cardiomyopathy

Splice Effect Predictions (SpliceAI)

MetricCount
Total predictions~4030
Donor lossHigh delta (0.79–1.00)
Donor gainHigh delta (0.40–0.99)
Acceptor lossHigh delta (0.23–0.95)
Acceptor gainHigh delta (0.20–1.00)

TOP 30 Highest-Impact Splice Variants (delta score)

VariantEffectScoreType
14:23413754:CCCA:Cdonor_loss1.0000Loss
14:23413755:CCACC:Cdonor_loss1.0000Loss
14:23413756:CACC:Cdonor_loss1.0000Loss
14:23413889:TCCTC:Tacceptor_gain1.0000Gain
14:23413890:CCTCC:Cacceptor_gain1.0000Gain
14:23413891:CTC:Cacceptor_gain1.0000Gain
14:23413758:CCTT:Cdonor_gain1.0000Gain
14:23412872:C:Tacceptor_gain0.9900Gain
14:23412873:T:Cacceptor_gain0.9700Gain
14:23412875:T:Cacceptor_gain0.9800Gain
14:23412870:CCCTT:Cacceptor_gain0.9600Gain
14:23412870:CC:Cacceptor_gain0.9500Gain
14:23412871:CCT:Cacceptor_loss0.9500Loss
14:23412872:C:CCacceptor_gain0.9500Gain
14:23412869:GCC:Gacceptor_gain0.9000Gain
14:23413797:TG:Tdonor_gain0.9700Gain
14:23413796:TTG:Tdonor_gain0.9000Gain
14:23413794:TGTTG:Tdonor_gain0.9900Gain
14:23413757:A:ACdonor_gain0.9900Gain
14:23413758:C:CCdonor_gain0.9900Gain
14:23413751:GGAC:Gdonor_loss0.8700Loss
14:23413752:GACCC:Gdonor_loss0.8900Loss
14:23413753:ACCC:Adonor_loss0.9600Loss
14:23412875:T:TCacceptor_gain0.9900Gain
14:23413565:T:TGacceptor_gain0.7900Gain
14:23413566:C:Gacceptor_gain0.7800Gain
14:23412867:AAGCC:Aacceptor_gain0.7800Gain
14:23413750:GGGAC:Gdonor_loss0.7900Loss
14:23413761:T:Adonor_gain0.7700Gain
14:23413765:G:Adonor_gain0.8400Gain

AlphaMissense Predictions (Likely Pathogenic)

MetricCount
Total missense variants~2000+
Likely pathogenic~100+
Ambiguous~200+
Likely benign~1700+

TOP 30 Likely-Pathogenic AlphaMissense Variants (am_pathogenicity score)

VariantProtein ChangeScoreClass
14:23413821:C:GA1910P0.999likely_pathogenic
14:23413841:A:GL1903P0.997likely_pathogenic
14:23413823:C:GR1909P0.998likely_pathogenic
14:23413787:C:GR1921P0.991likely_pathogenic
14:23413812:C:GA1913P0.999likely_pathogenic
14:23413790:A:GL1920P0.989likely_pathogenic
14:23413775:C:GR1925P0.977likely_pathogenic
14:23413792:C:AK1919N0.977likely_pathogenic
14:23413776:G:TR1925S0.975likely_pathogenic
14:23413794:T:CK1919E0.975likely_pathogenic
14:23413799:A:TV1917D0.970likely_pathogenic
14:23413820:G:TA1910E0.970likely_pathogenic
14:23413802:T:GQ1916P0.962likely_pathogenic
14:23413824:G:CR1909G0.961likely_pathogenic
14:23413777:G:CS1924R0.953likely_pathogenic
14:23413780:C:AK1923N0.947likely_pathogenic
14:23413823:C:AR1909L0.944likely_pathogenic
14:23413785:C:GA1922P0.941likely_pathogenic
14:23413795:G:CN1918K0.931likely_pathogenic
14:23413788:G:CR1921G0.933likely_pathogenic
14:23413818:C:GD1911H0.927likely_pathogenic
14:23413784:G:TA1922D0.925likely_pathogenic
14:23413793:T:AK1919M0.912likely_pathogenic
14:23413782:T:CK1923E0.870likely_pathogenic
14:23413776:G:CR1925G0.863likely_pathogenic
14:23413787:C:AR1921L0.864likely_pathogenic
14:23413825:C:AE1908D0.858likely_pathogenic
14:23413826:T:AE1908V0.919likely_pathogenic
14:23413827:C:TE1908K0.945likely_pathogenic
14:23413839:C:GD1904H0.829likely_pathogenic

Key Findings:

  • MYH7 harbors 4,107 total ClinVar variants, predominantly variants of uncertain significance (~3,800)
  • 76 Pathogenic variants cause hypertrophic and dilated cardiomyopathy
  • SpliceAI predicts 4,030 splice-altering variants with high-confidence (scores 0.79–1.00) donor/acceptor disruptions
  • 100+ AlphaMissense variants classified as likely pathogenic in the C-terminal region (positions 1903–1926)
  • Highest pathogenic confidence: arginine substitutions (R1925, R1921, R1909) and proline insertions in rod domains

You’ve hit your session limit · resets 10:10pm (Europe/Berlin)

You’ve hit your session limit · resets 10:10pm (Europe/Berlin)

You’ve hit your session limit · resets 10:10pm (Europe/Berlin)

You’ve hit your session limit · resets 10:10pm (Europe/Berlin)

You’ve hit your session limit · resets 10:10pm (Europe/Berlin)

You’ve hit your session limit · resets 10:10pm (Europe/Berlin)

Structured Data Sources

Generated with Claude Haiku 4.5 + BioBTree MCP, drawing on data BioBTree aggregates from 22 biological databases. Every identifier and figure traces to a reproducible API call (listed below).

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, antibody, ccds, clinvar, ensembl, entrez, exon, go, hgnc, interpro, mim, msigdb, orthodb, ortholog, pdb, pfam, reactome, refseq, spliceai, transcript, uniprot
Generated: 2026-05-25 — For the latest data, query BioBTree directly via MCP or API.
View API calls (98)