MYBPC3 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human MYBPC3. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human MYBPC3. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 35 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, biogrid_interaction, ccds, chembl_target, clinvar, collectri, diamond_similarity, drugbank, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, interpro, mim, msigdb, orphanet, ortholog, paralog, pdb, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-03-29 — For the latest data, query BioBTree directly via MCP or API.
View API calls (44)
MYBPC3

Section 1: Gene Identifiers

DatabaseIdentifierNotes
HGNCHGNC:7551Approved symbol: MYBPC3
Ensembl GeneENSG00000134571GRCh38
NCBI Entrez Gene4607
OMIM Gene600958
CCDSCCDS53621Consensus CDS
Genomic Location (GRCh38/hg38)
AttributeValue
Chromosome11
Cytogenetic Band11p11.2
Start Position47,331,406
End Position47,352,702
StrandMinus (-)
Gene Length21,297 bp

Section 2: Transcript Identifiers Ensembl Transcripts (Total: 3)

Transcript IDBiotypeStartEndUTR5UTR3
ENST00000399249protein_coding47,331,40647,352,70247,352,648-47,352,70247,331,845-47,331,870
ENST00000545968protein_coding47,331,40647,352,70247,352,648-47,352,70247,331,845-47,331,870
ENST00000544791nonsense_mediated_decay47,335,57847,352,70247,352,648-47,352,70247,337,391-47,337,497
RefSeq Transcripts
AccessionTypeStatusMANE Select
NM_000256mRNAREVIEWED✓ Yes
NP_000247ProteinREVIEWED✓ Yes
NM_001044349mRNAVALIDATEDNo
NP_001037814ProteinVALIDATEDNo
CCDS Identifiers
  • CCDS53621 (Consensus Coding Sequence) Exon Information (ENST00000399249 - Canonical Transcript) Total Exon Count: 34
Exon IDStartEndStrand
ENSE0000130447447,352,62347,352,702-
ENSE0000109805347,351,23947,351,505-
ENSE0000113014747,350,50247,350,615-
ENSE0000109805747,350,01447,350,112-
ENSE0000113014247,349,77447,349,922-
ENSE0000113013647,348,42447,348,541-
ENSE0000109804447,347,85747,347,905-
ENSE0000109805547,347,65147,347,680-
ENSE0000109803947,347,42647,347,479-
ENSE0000109805047,346,62747,346,647-
ENSE0000109805447,346,20747,346,370-
ENSE0000109804947,343,49247,343,624-
ENSE0000179970047,343,26047,343,262-
ENSE0000109804047,343,02147,343,145-
ENSE0000109805247,342,83047,342,935-
ENSE0000109805947,342,57847,342,744-
ENSE0000109806247,341,99147,342,156-
ENSE0000113008247,341,13847,341,244-
ENSE0000109805847,341,00347,341,032-
ENSE0000113007347,339,65147,339,790-
ENSE0000113006647,339,32447,339,404-
ENSE0000109804747,338,52047,338,679-
ENSE0000113005447,337,69047,337,794-
ENSE0000113004647,337,39147,337,579-
ENSE0000369400147,335,87747,336,011-
ENSE0000092127647,335,04247,335,209-
ENSE0000092127547,333,92247,334,010-
ENSE0000092127447,333,55747,333,752-
ENSE0000092127347,333,19447,333,333-
ENSE0000092127247,332,81447,332,973-
ENSE0000092127147,332,56647,332,702-
ENSE0000092127047,332,07247,332,258-
ENSE0000092126947,331,84547,331,881-
ENSE0000112999047,331,40647,331,716-

Section 3: Protein Identifiers UniProt Accessions

AccessionStatusNameLengthMass
Q14896Reviewed (Swiss-Prot)Myosin-binding protein C, cardiac-type1,274 aa140,762 Da
Alternative Name: C-protein, cardiac muscle isoform RefSeq Protein
AccessionStatusMANE Select
NP_000247REVIEWED✓ Yes
NP_001037814VALIDATEDNo
Protein Domains and Families (Total: 10)
InterPro IDNameType
IPR003598Ig_sub2Domain
IPR003599Ig_subDomain
IPR003961FN3_dom (Fibronectin type III)Domain
IPR007110Ig-like_domDomain
IPR013098Ig_I-setDomain
IPR040849MyBP-C_THB (Myosin-binding protein C, tri-helix bundle)Domain
IPR013783Ig-like_foldHomologous superfamily
IPR036116FN3_sfHomologous superfamily
IPR036179Ig-like_dom_sfHomologous superfamily
IPR050964Striated_Muscle_RegulatoryFamily

Section 4: Structure Identifiers Experimental Structures (Total: 17 PDB entries)

PDB IDMethodResolutionTitle
2V6HX-ray1.55 ÅCrystal structure of C1 domain
3CX2X-ray1.30 ÅCrystal structure of C1 domain at 1.3Å
1GXENMR-Central domain structure
1PD6NMR-NMR structure of domain C2
2AVGNMR-NMR structure of cC1 domain
2K1MNMR-3D NMR structure of domain cC0
2MQ0NMR-NMR structure of c3 domain
2MQ3NMR-c3 domain with R502W mutation (HCM-related)
5K6PNMR-M domain tri-helix bundle and C2
6CXICryo-EM11.0 ÅCardiac thin filament with C0C1 fragment (mode 1)
6CXJCryo-EM11.0 ÅCardiac thin filament with C0C1 fragment (mode 2)
6G2TCryo-EM9.0 ÅC1 Ig-domain bound to cardiac thin filament
7LRGCryo-EM6.1 ÅF-actin decorated with C2 domain
7TIJCryo-EM8.0 ÅF-actin with regulatory M-domain
7TITCryo-EM8.0 ÅThin filament with M-domain
7TJ7Cryo-EM8.0 ÅThin filament with C1 Ig-domain and M-domain
8G4LCryo-EM6.4 ÅHuman cardiac myosin filament
Predicted Structures (AlphaFold)
AlphaFold IDGlobal pLDDTSequence LengthFraction Very High Confidence
Q1489679.4399060.37 (37%)

Section 5: Cross-Species Orthologs Key Model Organisms

SpeciesGene SymbolEnsembl Gene IDEntrez ID
Mouse (Mus musculus)Mybpc3ENSMUSG0000000210017868
Rat (Rattus norvegicus)Mybpc3ENSRNOG00000012307295929
Zebrafish (Danio rerio)mybpc3ENSDARG00000011615556489
Drosophila melanogaster-No ortholog-
C. elegans-No ortholog-
S. cerevisiae-No ortholog-
Human Paralogs (Total: 11)
GeneEnsembl IDDescription
MYBPC1ENSG00000196091Myosin binding protein C1 (slow skeletal)
MYBPC2ENSG00000086967Myosin binding protein C2 (fast skeletal)
MYBPHENSG00000133055Myosin binding protein H
MYBPHLENSG00000221986Myosin binding protein H-like
MYOM1ENSG00000101605Myomesin 1
MYOM2ENSG00000036448Myomesin 2
MYOM3ENSG00000142661Myomesin 3
OBSL1ENSG00000124006Obscurin-like 1
FNDC3AENSG00000102531Fibronectin type III domain containing 3A
FNDC3BENSG00000075420Fibronectin type III domain containing 3B
IGSF22ENSG00000179057Immunoglobulin superfamily member 22

Section 6: Clinical Variants & AI Predictions Clinical Variant Summary (ClinVar) Total Variants: 4,434

ClassificationCount (estimated)
Pathogenic>100
Likely Pathogenic>100
Uncertain Significance (VUS)~3,500+
Likely Benign>200
Benign>100
Conflicting Classifications>100
TOP 50 Pathogenic/Likely Pathogenic Variants
ClinVar IDHGVS NotationTypeClassification
1069890c.1730G>A (p.Trp577Ter)SNVPathogenic
1069891c.1546G>T (p.Glu516Ter)SNVPathogenic
1069893c.1269_1282del (p.Ser424fs)DeletionPathogenic
1074339c.2164G>T (p.Glu722Ter)SNVPathogenic
1076719c.1831G>T (p.Glu611Ter)SNVPathogenic
164042g.47333236delDeletionPathogenic
164047c.3166dup (p.Ala1056fs)DuplicationPathogenic
164048c.3129C>A (p.Tyr1043Ter)SNVPathogenic
164055c.2920C>T (p.Gln974Ter)SNVPathogenic
164059c.2875_2876del (p.Thr959fs)DeletionPathogenic
164071c.2556del (p.Ile852fs)DeletionPathogenic
164107c.1624+2T>CSNVPathogenic
164118c.1357_1358del (p.Pro453fs)DeletionPathogenic
164134c.993dup (p.Glu332Ter)DuplicationPathogenic
164135c.979C>T (p.Gln327Ter)SNVPathogenic
164136c.966G>A (p.Trp322Ter)SNVPathogenic
1195305c.1900del (p.Val634fs)DeletionPathogenic
1333262c.3313_3314insGG (p.Ala1105fs)InsertionPathogenic
1333510c.1609G>T (p.Glu537Ter)SNVPathogenic
1362183c.2761C>T (p.Gln921Ter)SNVPathogenic
1365234c.965G>A (p.Trp322Ter)SNVPathogenic
1424922c.178G>T (p.Glu60Ter)SNVPathogenic
1446713c.3702_3703del (p.Leu1236fs)DeletionPathogenic
1452999c.3773T>A (p.Leu1258Ter)SNVPathogenic
1457802c.3328del (p.Met1110fs)DeletionPathogenic
1459506c.2221dup (p.Ala741fs)DuplicationPathogenic
1460339c.787G>T (p.Gly263Ter)SNVPathogenic
1067323c.1624+1G>TSNVPathogenic/Likely pathogenic
1070911c.565del (p.Val189fs)DeletionPathogenic/Likely pathogenic
1729315c.3242dup (p.Asn1081fs)DuplicationPathogenic
1769560c.1308C>A (p.Cys436Ter)SNVPathogenic
1027654c.2994+1G>ASNVLikely pathogenic
1013320c.653_654insCTGGTGACCC (p.Lys218delinsAsnTrpTer)InsertionLikely pathogenic
1066750c.1449_1457+4delDeletionLikely pathogenic
1067412NC_000011.9:g.(?47362544)(47368616_?)delDeletionLikely pathogenic
1067413NC_000011.9:g.(?_47358756)_47367811delDeletionLikely pathogenic
1068244c.3191-1G>CSNVLikely pathogenic
1068427c.292+1delDeletionLikely pathogenic
AI-Based Variant Predictions SpliceAI Predictions (Total: 5,457) Variants with High Splice-Altering Scores (≥0.5): >500
VariantEffectScore
11:47331714:TACCT:Tacceptor_loss0.98
11:47331715:ACCT:Aacceptor_loss0.98
11:47331717:CTGCA:Cacceptor_loss0.98
11:47331718:T:Gacceptor_loss0.98
11:47332067:CTCA:Cdonor_loss0.97
11:47332068:TCA:Tdonor_loss0.97
11:47332069:CACCT:Cdonor_loss0.97
11:47332092:TCA:Tdonor_gain0.97
11:47332093:CAC:Cdonor_gain0.97
11:47332094:ACA:Adonor_gain0.97
11:47332095:CAC:Cdonor_gain0.97
11:47331719:G:Cacceptor_loss0.96
11:47332095:CA:Cdonor_gain0.96
11:47331880:CA:Cacceptor_gain0.96
11:47331882:C:CCacceptor_gain0.99
AlphaMissense Predictions (Total: 8,320) Likely Pathogenic Variants (am_class = "likely_pathogenic"): >3,000
VariantProtein ChangeAM ScoreClassification
11:47332115:G:CN1257K0.994likely_pathogenic
11:47332115:G:TN1257K0.994likely_pathogenic
11:47332127:G:CC1253W0.994likely_pathogenic
11:47332129:A:GC1253R0.994likely_pathogenic
11:47332083:A:GL1268P0.991likely_pathogenic
11:47332146:T:AD1247V0.991likely_pathogenic
11:47332167:A:GI1240T0.990likely_pathogenic
11:47332135:A:TY1251N0.990likely_pathogenic
11:47332167:A:CI1240S0.988likely_pathogenic
11:47332134:T:GY1251S0.987likely_pathogenic
11:47332146:T:GD1247A0.983likely_pathogenic
11:47332128:C:TC1253Y0.983likely_pathogenic
11:47332123:C:GA1255P0.980likely_pathogenic
11:47332167:A:TI1240N0.980likely_pathogenic
11:47332116:T:AN1257I0.976likely_pathogenic
11:47332088:G:CC1266W0.976likely_pathogenic
11:47332147:C:AD1247Y0.974likely_pathogenic
11:47332146:T:CD1247G0.972likely_pathogenic
11:47332135:A:CY1251D0.997likely_pathogenic
11:47332077:A:TV1270E0.966likely_pathogenic

Section 7: Biological Pathways & Gene Ontology Reactome Pathways (Total: 1)

Pathway IDNameDisease Pathway
R-HSA-390522Striated Muscle ContractionNo
Gene Ontology Annotations (Total: 23) Biological Process (9 terms)
GO IDTerm
GO:0003007heart morphogenesis
GO:0006942regulation of striated muscle contraction
GO:0007155cell adhesion
GO:0032971regulation of muscle filament sliding
GO:0045214sarcomere organization
GO:0055010ventricular cardiac muscle tissue morphogenesis
GO:0060048cardiac muscle contraction
GO:0086004regulation of cardiac muscle cell contraction
Molecular Function (9 terms)
GO IDTerm
GO:0001671ATPase activator activity
GO:0003779actin binding
GO:0008307structural constituent of muscle
GO:0017022myosin binding
GO:0031432titin binding
GO:0032036myosin heavy chain binding
GO:0042802identical protein binding
GO:0046872metal ion binding
Cellular Component (8 terms)
GO IDTerm
GO:0005829cytosol
GO:0005863striated muscle myosin thick filament
GO:0014705C zone
GO:0030017sarcomere
GO:0031430M band
GO:0031672A band
GO:0097512cardiac myofibril
MSigDB Gene Sets (Total: 223) Key gene sets include:
  • GOBP_CARDIAC_MUSCLE_CONTRACTION
  • GOBP_SARCOMERE_ORGANIZATION
  • GOBP_STRIATED_MUSCLE_CONTRACTION
  • REACTOME_STRIATED_MUSCLE_CONTRACTION
  • HP_HYPERTROPHIC_CARDIOMYOPATHY
  • HP_DILATED_CARDIOMYOPATHY

Section 8: Protein Interactions & Molecular Networks STRING Protein-Protein Interactions (Total: 1,760) TOP 50 Highest-Confidence Interacting Proteins:

UniProtGeneScore
Q8WZ42TTN (Titin)987
P12883MYH7 (Myosin heavy chain 7)976
P45379TNNT2 (Troponin T2)972
P19429TNNI3 (Troponin I3)966
P50461CSRP3 (Cysteine rich protein 3)934
P09493TPM1 (Tropomyosin 1)930
P08590MYL3 (Myosin light chain 3)923
P04270S100B919
P10916MYL2 (Myosin light chain 2)892
P13533MYH6 (Myosin heavy chain 6)886
Q9UGJ0CAPN3 (Calpain 3)867
O15273TRIM63 (Tripartite motif 63)862
P02593CALM1 (Calmodulin 1)860
P27482CALML3851
Q9NZT1CALML5851
Q8TD86MYL4 (Myosin light chain 4)848
Q96GE6MACS848
Q5T481MYOZ3825
P20929NEB (Nebulin)806
Q14524SCN5A (Sodium channel)792
BioGRID Interactions (Total: 14)
Interaction IDInteractorMethod
683024TRIM63Two-hybrid
683022MYBPC3Affinity Capture-Western
2934663MYBPC3Affinity Capture-MS
3800621MYBPC3Affinity Capture-MS
3801075MYBPC3Affinity Capture-MS
275551MYBPC3Far Western
SIGNOR Signaling Interactions (Total: 17)
RegulatorTargetEffectMechanism
PRKACAMYBPC3up-regulatesphosphorylation
PRKACBMYBPC3up-regulatesphosphorylation
PRKACGMYBPC3up-regulatesphosphorylation
PRKCDMYBPC3up-regulatesphosphorylation
PKAMYBPC3up-regulatesphosphorylation
PDE4DIPMYBPC3up-regulatesbinding
Protein Similarity ESM2 Structural/Embedding Similarity (Total: 26)
UniProtTop SimilarityAvg Similarity
O70468 (Mouse Mybpc3)0.99960.9663
P56741 (Rat Mybpc3)0.99960.9668
Q14324 (MYBPC2)0.99920.9543
A2RUH70.99800.9701
P521790.99790.9648
Q622340.99790.9667
Q5VTT50.99810.9707
DIAMOND Sequence Similarity (Total: 46)
UniProtTop IdentityTop Bitscore
Q3UH5389.3%3901
Q7Z5N489.3%3899
Q8AV5881.3%3495
P5217984.1%2801
Q62234 (Mouse Mybpc3)84.3%2809
E9PZ1999.5%2548
Q5VTT5 (MYBPC1)83.1%2448
Q9UPX0 (MYBPC2)92.9%2442

Section 9: Transcription Factor Regulatory Data MYBPC3 is NOT a transcription factor. Upstream Regulators (TFs that regulate MYBPC3) Based on MSigDB TF binding site predictions:

  • GATA1 (binding motifs in promoter)
  • GATA6 (binding motifs in promoter)
  • MAZ (binding motifs in promoter)
  • TFCP2 (CP2) (binding motifs in promoter)
  • HMGA1 (ChIP-seq evidence) No CollecTRI regulatory data available for this gene.

Section 10: Drug & Pharmacology Data DrugBank Entries (Total: 2)

DrugBank IDDrug NameDescription
DB11816Omecamtiv mecarbilCardiac myosin activator
DB18490AficamtenCardiac myosin inhibitor
PharmGKB
PharmGKB IDVIP GeneCPIC Guidelines
PA31351YesNo
ChEMBL Drug Targets No direct ChEMBL target entry for MYBPC3.

Section 11: Expression Profiles Bgee Expression Data

MetricValue
Expression BreadthUbiquitous
Max Expression Score99.90
Average Expression Score57.03
Total Conditions with Expression149 present, 139 absent
Gold Quality Annotations263
TOP 30 Tissues by Expression (Bgee)
RankTissue (UBERON ID)ExpressionScoreQuality
1Apex of heart (UBERON:0002098)present99.90gold
2Right atrium auricular region (UBERON:0006631)present99.65gold
3Cardiac atrium (UBERON:0002081)present99.63gold
4Left ventricle myocardium (UBERON:0006566)present99.45gold
5Heart left ventricle (UBERON:0002084)present99.34gold
6Cardiac ventricle (UBERON:0002082)present99.34gold
7Heart right ventricle (UBERON:0002080)present99.29gold
8Cardiac muscle of right atrium (UBERON:0003379)present99.25gold
9Myocardium (UBERON:0002349)present99.04gold
10Vena cava (UBERON:0004087)present96.51gold
11Heart (UBERON:0000948)present94.51gold
12Blood (UBERON:0000178)present75.95gold
13Pericardium (UBERON:0002407)present75.50silver
Tissue-Specific Pattern: MYBPC3 shows highly cardiac-specific expression, with the highest expression scores in heart tissues (apex, atria, ventricles, myocardium). Expression is notably absent in skeletal muscles (triceps, biceps, gluteal muscles, quadriceps). Single-Cell Expression Data
Dataset IDDescriptionCell Count
E-MTAB-11268The atlas of the human hypertrophied heart reveals impaired cell communication64,898 cells

Section 12: Disease Associations Mendelian/Monogenic Diseases (GenCC)

DiseaseDisease IDClassificationInheritanceEvidence
Hypertrophic cardiomyopathy 4OMIM:115197DefinitiveADAmbry, Invitae, G2P
Hypertrophic cardiomyopathy 4OMIM:115197StrongARAmbry
Left ventricular noncompaction 10OMIM:615396DefinitiveADAmbry
Left ventricular noncompaction 10OMIM:615396ModerateADAmbry
Left ventricular noncompaction 10OMIM:615396LimitedARAmbry
Atrial fibrillationMONDO:0004981LimitedADAmbry
Familial isolated dilated cardiomyopathyORPHANET:154SupportiveADOrphanet
Orphanet Disease Associations (Total: 2)
Orphanet IDDiseaseTypeGene Count
154Familial isolated dilated cardiomyopathyDisease53 genes
54260Left ventricular noncompactionDisease15 genes
HPO Phenotype Associations (Total: 48)
HPO IDPhenotype
HP:0001639Hypertrophic cardiomyopathy
HP:0001644Dilated cardiomyopathy
HP:0001645Sudden cardiac death
HP:0030682Left ventricular noncompaction
HP:0001714Ventricular hypertrophy
HP:0005144Ventricular septal hypertrophy
HP:0001663Ventricular fibrillation
HP:0001695Cardiac arrest
HP:0001635Congestive heart failure
HP:0001640Cardiomegaly
HP:0011675Arrhythmia
HP:0001678Atrioventricular block
HP:0001279Syncope
HP:0002875Exertional dyspnea
HP:0002094Dyspnea
HP:0012378Fatigue
HP:0100749Chest pain
HP:0001698Pericardial effusion
HP:0012664Reduced left ventricular ejection fraction
HP:0025169Left ventricular systolic dysfunction
HP:0000006Autosomal dominant inheritance
HP:0000007Autosomal recessive inheritance
HP:0003581Adult onset
HP:0003584Late onset
HP:0003621Juvenile onset
HP:0003623Neonatal onset
HP:0011462Young adult onset
GWAS Associations (Total: 64)
Study IDTraitGeneP-value
GCST010242_113HDL cholesterol levelsMYBPC31.0e-106
GCST008084_149HDL cholesterol x alcohol (drinkers vs non-drinkers)MYBPC35.0e-69
GCST010241_45Apolipoprotein A1 levelsMYBPC33.0e-63
GCST008084_66HDL cholesterol x alcohol interactionMYBPC31.0e-48
GCST008084_83HDL cholesterol x alcohol interactionMYBPC36.0e-31
GCST008075_214HDL cholesterol x alcohol interaction (2df)MYBPC34.0e-29
GCST002315_3Thrombin generation potential phenotypesMYBPC35.0e-22
GCST008085_105HDL cholesterol in current drinkersMYBPC34.0e-22
GCST006394_89Intraocular pressureMYBPC31.0e-18
GCST007269_195Pulse pressureMYBPC34.0e-14
GCST008757_31Alcohol consumptionMYBPC35.0e-14
GCST009597_128Multiple sclerosisMYBPC31.0e-13
GCST007930_47Medication use (renin-angiotensin agents)MYBPC35.0e-13
GCST010796_*Electrocardiogram morphology (multiple timepoints)MYBPC31.0e-08 to 1.0e-12

SUMMARY STATISTICS

CategoryCount
Gene Identifiers5 major databases
Transcripts (Ensembl)3
Transcripts (RefSeq)2 (human mRNA)
Exons (canonical)34
UniProt Entries1 (reviewed)
Protein Domains10
PDB Structures17
Orthologs3 model organisms
Paralogs11
ClinVar Variants4,434
AlphaMissense Predictions8,320
SpliceAI Predictions5,457
GO Annotations23
Reactome Pathways1
STRING Interactions1,760
BioGRID Interactions14
GWAS Associations64
HPO Phenotypes48
Disease Associations7+
Drug Associations2

Generated using BioBTree cross-database mapping. Data current as of database versions indexed. The comprehensive cross-database identifier and functional mapping for MYBPC3 is complete. Key highlights:

  • Clinical significance: MYBPC3 is a major disease gene with definitive evidence for hypertrophic cardiomyopathy (HCM-4) and left ventricular noncompaction (LVNC-10)
  • Variant burden: Over 4,400 ClinVar variants, with >100 classified as pathogenic (mostly truncating variants)
  • AI predictions: 8,320 AlphaMissense and 5,457 SpliceAI predictions available for variant interpretation
  • Expression: Highly cardiac-specific with maximal expression in heart apex, atria, and ventricles (score 99.9)
  • Structure: 17 experimental PDB structures covering multiple domains (C0-C3, M-domain) plus AlphaFold model
  • Drug targets: Associated with omecamtiv mecarbil (myosin activator) and aficamten (myosin inhibitor)
  • Protein function: Key sarcomeric protein regulating cardiac muscle contraction through myosin/actin/titin interactions, phosphorylated by PKA