FUS Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human FUS. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human FUS. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 38 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, biogrid_interaction, ccds, chembl_molecule, chembl_target, cl, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, fantom5_promoter, gencc, go, gwas, hgnc, hpo, intact, interpro, jaspar, mim, orphanet, ortholog, pdb, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uberon, uniprot
Generated: 2026-04-01 — For the latest data, query BioBTree directly via MCP or API.
View API calls (51)
FUS
DatabaseIdentifierDescription
HGNCHGNC:4010FUS RNA binding protein
Ensembl GeneENSG00000089280FUS
NCBI Entrez2521FUS RNA binding protein
OMIM137070Gene/Locus
PharmGKBPA28425VIP Gene (Very Important Pharmacogene)
Approved Nomenclature
FieldValue
Approved SymbolFUS
Approved NameFUS RNA binding protein
Locus TypeGene with protein product
Locus GroupProtein-coding gene
StatusApproved
Aliases and Previous Symbols
TypeValues
AliasesTLS, FUS1, hnRNP-P2, HNRNPP2
Previous SymbolsALS6
Previous NamesFused in sarcoma; Translocated in liposarcoma; Amyotrophic lateral sclerosis 6; Fusion (involved in t(12;16) in malignant liposarcoma)
Alternative NamesETM4, POMP75, altFUS
Gene Groups
  • Zinc fingers RANBP2-type
  • RNA binding motif containing
  • Heterogeneous nuclear ribonucleoproteins
  • FET protein family Genomic Location
FieldValue
Chromosome16
Cytogenetic Location16p11.2
Start Position (GRCh38)31,180,134
End Position (GRCh38)31,196,963
Strand+ (Plus/Forward)
Genomic AccessionNC_000016.10
Gene Span~16.8 kb

Section 2: Transcript Identifiers Transcript Summary Total Ensembl Transcripts: 47

BiotypeCount
Protein coding40
Retained intron5
Nonsense mediated decay2
Ensembl Transcripts (All 47)
Transcript IDBiotypeStartEndStrand
ENST00000254108protein_coding3118013931191605+
ENST00000380244protein_coding3118013931196963+
ENST00000474990protein_coding_CDS_not_defined3118549931191102+
ENST00000483853retained_intron3119037331191558+
ENST00000487045retained_intron3118013931183350+
ENST00000487509retained_intron3118015031191602+
ENST00000487974retained_intron3118355031184398+
ENST00000564766retained_intron3118846731189695+
ENST00000566605nonsense_mediated_decay3118013831191602+
ENST00000568685protein_coding3118015131191575+
ENST00000568901retained_intron3118815231190120+
ENST00000569760retained_intron3119021631191605+
ENST00000570090retained_intron3118508031187245+
ENST00000715541nonsense_mediated_decay3118013931191605+
ENST00000715542protein_coding3118013931191605+
ENST00000875021protein_coding3118013431191775+
ENST00000875022–ENST00000875033protein_codingvariousvarious+
ENST00000925795–ENST00000925808protein_codingvariousvarious+
ENST00000948616–ENST00000948621protein_codingvariousvarious+
RefSeq Transcripts (Human)
RefSeq IDTypeStatusMANE Select
NM_004960mRNAREVIEWED✓ YES (Canonical)
NM_001170634mRNAREVIEWEDNo
NM_001170937mRNAREVIEWEDNo
NR_028388ncRNAREVIEWEDNo
RefSeq Protein Accessions (Human)
RefSeq IDStatusMANE Select
NP_004951REVIEWED✓ YES
NP_001164105REVIEWEDNo
NP_001164408REVIEWEDNo
CCDS Identifiers
CCDS IDNotes
CCDS10707Primary
CCDS58454Alternative
Canonical Transcript Exon Details (ENST00000254108) Total Exons: 15
Exon IDStartEndLength (bp)
ENSE00003903481311801393118022789
ENSE00003475065311823983118242225
ENSE000036306773118251331182664152
ENSE000035995473118385831184002145
ENSE000035376173118420931184396188
ENSE000017918123118493931185179241
ENSE00004027052311868023118683635
ENSE00004027046311883253118835733
ENSE000035244363118912331189226104
ENSE000035344843118966531189794130
ENSE000036539713119004031190141102
ENSE000036046253119027531190398124
ENSE000034942953119074231190842101
ENSE000036375723119096331191110148
ENSE000040270473119139931191605207
UTR Regions (ENST00000254108):
- 5' UTR- 3' UTR
31180139–3118021431191439–31191605
Protein Identifiers**
UniProt Accessions Total: 6 entries
UniProt IDStatusName
P35637✓ Reviewed (Swiss-Prot)RNA-binding protein FUS (Canonical)
A0AAQ5BIG2Unreviewed-
A0AAQ5BIK3Unreviewed-
H3BNZ4Unreviewed-
H3BPE7Unreviewed-
Q6IBQ5Unreviewed-
Canonical Protein Properties (P35637)
PropertyValue
Length526 amino acids
Molecular Mass53,426 Da
Alternative Names75 kDa DNA-pairing protein; Oncogene FUS; Oncogene TLS; POMp75; Translocated in liposarcoma protein
Protein Domains and Families Total Domains: 6
InterPro IDNameType
IPR034870TET/FET familyFamily
IPR000504RRM domainDomain
IPR001876Zinc finger RanBP2-typeDomain
IPR012677Nucleotide-binding alpha-beta plait superfamilyHomologous superfamily
IPR035979RBD domain superfamilyHomologous superfamily
IPR036443Zinc finger RanBP2-type superfamilyHomologous superfamily
Additional Protein Databases
DatabaseIdentifier
ChEMBL TargetCHEMBL5724679
STRING9606.ENSP00000254108
Pfam2 domains
SMART2 domains
CATH Gene3D2 domains

Section 4: Structure Identifiers Experimental Structures Total PDB Structures: 23

PDB IDMethodResolutionDescription
2LA6NMR-RRM domain structure
2LCWNMR-FUS/TLS RRM domain
4FDDX-ray2.3 ÅKAP beta2-PY-NLS
4FQ3X-ray3.0 ÅTransportin/FUS-NLS
5W3NSolid-state NMR-FUS LC domain fibrils
5XRRX-ray1.5 ÅFUS (54-59) SYSSYG
5XSGMicroED0.73 ÅFUS (37-42) SYSGYS (ultra-high res)
5YVGX-ray4.05 ÅKaryopherin beta2 + FUS (full length)
5YVHX-ray3.15 ÅKaryopherin beta2 + FUS (371-526)
5YVIX-ray2.9 ÅKaryopherin beta2 + FUS (456-526)
6BWZX-ray1.1 ÅSYSGYS LC domain peptide
6BXVX-ray1.1 ÅSYSSYGQS LC domain peptide
6BZPMicroED1.1 ÅSTGGYG LC domain peptide
6G99NMR-FUS-ZnF bound to UGGUG RNA
6GBMNMR-FUS-RRM bound to stem-loop RNA
6KJ1MicroED0.65 ÅFUS (37-42) SYSGYS
6KJ2MicroED0.67 ÅFUS (37-42) SYSGYS
6KJ3MicroED0.60 ÅFUS (37-42) SYSGYS (highest resolution)
6KJ4MicroED0.65 ÅFUS (37-42) SYSGYS
6SNJNMR-FUS-RRM + U1 snRNA stem loop III
6XFMCryo-EM2.62 ÅAmyloid-like fibrils (111-214)
7CYLX-ray2.7 ÅKaryopherin-beta2 + FUS PY-NLS (P525L)
7VQQCryo-EM2.9 ÅFUS LC domain amyloid fibril
Methods Summary
MethodCount
X-ray Crystallography9
NMR6
Electron Crystallography (MicroED)6
Cryo-EM2
Predicted Structure (AlphaFold)
PropertyValue
AlphaFold IDP35637
Global pLDDT54.96
Fraction Very High Confidence9%
Sequence Length3762 (includes isoforms)
Note
Low overall confidence reflects the intrinsically disordered nature of the FUS low-complexity domain
Cross-Species Orthologs**
OrganismGene IDSymbolChromosomeLocation
Mouse (Mus musculus)ENSMUSG00000030795Fus7127,566,629–127,584,873
Rat (Rattus norvegicus)ENSRNOG00000023360Fus--
Zebrafish (Danio rerio)ENSDARG00000037968fus332,698,424–32,711,280
Fruit fly (Drosophila melanogaster)FBGN0285954caz (cabeza)X16,288,810–16,293,520
Worm (C. elegans)WBGENE00016173---
Yeast (S. cerevisiae)No ortholog---

Section 6: Clinical Variants & AI Predictions ClinVar Summary Total Variants: 657

ClassificationCount
Pathogenic30
Likely Pathogenic13
Uncertain Significance (VUS)Multiple
Likely BenignMultiple
BenignMultiple
ConflictingMultiple
Pathogenic Variants (All 30)
ClinVar IDHGVS NotationProtein ChangeVariant Type
16221c.1551C>Gp.His517GlnSNV
16222c.1561C>Gp.Arg521GlySNV
16223c.1553G>Ap.Arg518LysSNV
16224c.1561C>Tp.Arg521CysSNV
16225c.1562G>Ap.Arg521HisSNV
29707c.1483C>Tp.Arg495TerNonsense
29708c.616G>Ap.Gly206SerSNV
37070c.868C>Tp.Gln290TerNonsense
280109c.1572G>Cp.Arg524SerSNV
280110c.1574C>Tp.Pro525LeuSNV
280356c.1504_1505delp.Asp502fsFrameshift
280602c.1510_1514dupp.Phe506fsFrameshift
665141c.1509_1510dupp.Gly504fsFrameshift
817574c.1022_1023delp.Ser340_Phe341insTerFrameshift
847302c.1541+1G>ASplice donorSplice
873230c.1394-1G>TSplice acceptorSplice
873231c.1555C>Tp.Gln519TerNonsense
873234c.1577A>Gp.Tyr526CysSNV
933229c.1509dupp.Gly504fsFrameshift
1073222c.1554_1557delp.Gln519fsFrameshift
1423819c.1500dupp.Gly501fsFrameshift
1458206c.1509_1510delp.Gly504fsFrameshift
1485282c.1391_1392dupp.Gly465fsFrameshift
1918103c.253C>Tp.Gln85TerNonsense
2046744c.1573C>Ap.Pro525ThrSNV
2419096c.1449_1488delp.Tyr484fsFrameshift
2925592c.1540A>Gp.Arg514GlySNV
2945054c.1531dupp.Met511fsFrameshift
3759352c.1542G>Tp.Arg514SerSNV
3760171c.1496delp.Gly499fsFrameshift
Likely Pathogenic Variants (All 13)
ClinVar IDHGVS NotationProtein Change
427141c.760A>Gp.Met254Val
450106c.217C>Tp.Gln73Ter
803253c.1540A>Tp.Arg514Trp
873233c.1571G>Tp.Arg524Met
1333381c.1542-1G>CSplice acceptor
2444451c.1468dupp.Asp490fs
2498660c.1556A>Gp.Gln519Arg
2671949c.1573C>Tp.Pro525Ser
3344713c.1531delp.Met511fs
3571508c.1408delp.Asp470fs
3781327c.1496dupp.Gly500fs
4084959c.1526G>Ap.Gly509Asp
4533182c.1483dupp.Arg495fs
AlphaMissense Predictions Total Predictions: 3,410
ClassificationDescription
Likely Pathogenicam_pathogenicity ≥ 0.564
Ambiguous0.340 < am_pathogenicity < 0.564
Likely Benignam_pathogenicity ≤ 0.340
Top 50 Predicted Pathogenic Missense Variants:
Genomic PositionProtein ChangePathogenicity Score
16:31182517:G:TG15W0.955
16:31182421:A:CS13R0.936
16:31182517:G:AG15R0.937
16:31182583:A:CS37R0.944
16:31182634:A:CS54R0.931
16:31180227:G:CD5H0.942
16:31182398:A:TD5V0.930
16:31180227:G:TD5Y0.927
16:31182589:A:CS39R0.922
16:31180219:C:AA2D0.919
16:31180219:C:TA2V0.916
16:31182631:A:CS53R0.914
16:31182518:G:AG15E0.898
16:31182523:T:GY17D0.882
16:31182544:G:CG24R0.878
16:31182592:G:CG40R0.871
16:31182649:G:CG59R0.864
16:31182559:A:CS29R0.852
16:31182514:T:GY14D0.852
16:31182598:A:CS42R0.830
(and 30 more with scores 0.68-0.82)
SpliceAI Predictions Total Predictions: 1,976 Top 50 High-Confidence Splice-Altering Variants (Score ≥ 0.8):
VariantEffectDelta Score
16:31180225:ACGG:Adonor_loss1.00
16:31180226:CGG:Cdonor_loss1.00
16:31180228:G:GGdonor_gain1.00
16:31180229:T:Adonor_loss1.00
16:31182660:GAACA:Gdonor_gain1.00
16:31182665:G:GGdonor_gain1.00
16:31183849:T:Gacceptor_gain1.00
16:31183852:T:Aacceptor_gain1.00
16:31182396:A:AGacceptor_gain0.99
16:31182397:G:Aacceptor_loss0.99
16:31182420:AAGGT:Adonor_loss0.99
16:31182421:AGGT:Adonor_loss0.99
16:31182423:G:GGdonor_gain0.99
16:31182511:A:AGacceptor_gain0.99
16:31182661:AACAG:Adonor_loss0.99
16:31182665:G:Tdonor_loss0.99
16:31182669:G:Adonor_loss0.99
16:31183820:T:Aacceptor_gain0.99
16:31183848:A:AGacceptor_gain0.99
16:31183853:G:Aacceptor_gain0.99
16:31180430:G:Tdonor_gain0.98
16:31182663:CA:Cdonor_gain0.98
16:31183832:T:TAacceptor_gain0.98
16:31180225:ACG:Adonor_gain0.97
16:31180430:G:GTdonor_gain0.97
16:31181009:TTC:Tdonor_gain0.97
16:31181329:GAGGA:Gdonor_gain0.97
16:31182662:ACA:Adonor_gain0.97
16:31183852:TGGTA:Tacceptor_loss0.97
(and 21 more with scores 0.80-0.96)

Section 7: Biological Pathways & Gene Ontology Reactome Pathways Total Pathways: 4

Pathway IDNameDisease Pathway
R-HSA-72163mRNA Splicing - Major PathwayNo
R-HSA-72203Processing of Capped Intron-Containing Pre-mRNANo
R-HSA-9770562mRNA PolyadenylationNo
R-HSA-9918481Dengue Virus-Host InteractionsYes
Gene Ontology Annotations Total GO Terms: 24 Molecular Function (9 terms)
GO IDTerm
GO:0003723RNA binding
GO:0003677DNA binding
GO:0003682Chromatin binding
GO:0003712Transcription coregulator activity
GO:0003713Transcription coactivator activity
GO:0003730mRNA 3'-UTR binding
GO:0008270Zinc ion binding
GO:0042802Identical protein binding
GO:0140693Molecular condensate scaffold activity
Biological Process (11 terms)
GO IDTerm
GO:0006355Regulation of DNA-templated transcription
GO:0006357Regulation of transcription by RNA polymerase II
GO:0008380RNA splicing
GO:0043484Regulation of RNA splicing
GO:0048255mRNA stabilization
GO:0051260Protein homooligomerization
GO:0140694Membraneless organelle assembly
GO:1905168Positive regulation of double-strand break repair via homologous recombination
GO:1990000Amyloid fibril formation
Cellular Component (6 terms)
GO IDTerm
GO:0005634Nucleus
GO:0005654Nucleoplasm
GO:0098978Glutamatergic synapse
GO:0098982GABA-ergic synapse
GO:0099523Presynaptic cytosol
GO:0099524Postsynaptic cytosol
MSigDB Gene Sets
Total Gene Sets
527 (FUS is a member)
Protein Interactions & Molecular Networks**
Interaction Summary
DatabaseInteraction Count
STRING3,788 interactions
IntAct373+ interactions
BioGRID1,068 interactions
Top 50 STRING Interactions (Highest Confidence)
InteractorUniProtScore
FUS (self)P35637999
TARDBP (TDP-43)Q13148999
SOD1P00441992
L3MBTL3Q9NRR4988
NXF1A0A087WTZ4986
SRSF10O75494951
HNRNPA2B1P22626943
OPTNQ96CV9939
UBQLN2Q9UHD9934
SFPQP23246933
ILF3Q12906927
TRIM28Q96LT7919
ATXN2Q99700918
SRSF2Q01130899
CTBP1P30807893
MAPT (Tau)P10636889
CSDAQ70SY1888
SETXQ7Z333884
HNRNPCP07910882
MATR3P43243874
TNP1P92973867
HNRNPA1P09651856
EIF4A3P38919843
TRA2BQ13283839
TIA1P31483836
VAPBO95292835
SMARCC2Q8WXF1835
ATF1P18846829
GRNP16990821
ALS2Q96Q42820
SMN1Q16637811
HNRNPDLO14979801
PRPHQ99873799
NEFHP12036798
VCPP55072797
EWSR1Q01844796
(and 14 more with scores 722-781)
Top BioGRID Interactors by Experimental Evidence
InteractorExperimental Method
VCPAffinity Capture-MS/Western
SFPQAffinity Capture-MS/Western
YWHAZAffinity Capture-MS
PFN1Affinity Capture-MS
HNRNPDAffinity Capture-MS
HNRNPUAffinity Capture-MS
HNRNPKAffinity Capture-MS
DDX42Affinity Capture-MS
SF1Affinity Capture-MS
SF3A1Affinity Capture-MS
SF3B3Affinity Capture-MS
PRMT1Affinity Capture-MS
PCNAAffinity Capture-MS
ACTBAffinity Capture-MS
TUBBAffinity Capture-MS
Protein Similarity ESM2 Structural/Embedding Similarity Total Similar Proteins: 41
UniProtTop SimilarityAvg Similarity
A5A6M31.0000.976
P381591.0000.976
Q4R7F01.0000.975
Q6URK41.0000.987
Q8BG051.0000.987
P845860.99990.976
Q01844 (EWSR1)0.99990.975
Q280090.99980.977
P56959 (TAF15)0.99980.976
Q61545 (Mouse Ewsr1)0.99990.975
DIAMOND Sequence Similarity Total Homologs: 18
UniProtDescription% IdentityBit Score
Q27W01-100.0%344
Q3ZCE8-100.0%344
Q9CWZ3-100.0%344
Q9Y5S9-100.0%344
Q28BZ1-98.9%339
Q5D018-93.7%328
B5DGI7-92.6%316
Q01844 (EWSR1)-90.9%820
Q61545-90.9%823
Q28009-90.3%415
P56959 (TAF15)-86.4%396
P15771-68.5%479
Q92804-65.2%226

Section 9: Transcription Factor Regulatory Data FUS as a Transcriptional Regulator (CollecTRI) FUS Downstream Targets (Genes regulated BY FUS):

Target GeneRegulationConfidence
ARUnknown-
DDIT3Activation-
KLK3Unknown-
MDM2Repression-
Upstream Regulators (TFs that regulate FUS)
Transcription FactorRegulationConfidence
AR-Low
DDIT3-Low
GTF3AUnknown-
NFKB-Low
SP1-Low
TBPRepression-
TP53Unknown-
SIGNOR Regulatory Interactions Total SIGNOR Interactions: 33 Key Regulatory Relationships
Entity AEntity BEffectMechanism
TNPO1FUSUp-regulates activityRelocalization
PRMT1FUSDown-regulates activityMethylation
SAFBFUSUp-regulates activityRelocalization
EGFRFUSUp-regulates activityPhosphorylation
ABL1FUSDown-regulates activityPhosphorylation
SF1FUSDown-regulatesBinding
Exosome ComplexFUSUp-regulates activityRelocalization
mTORC2FUSDown-regulates activity-
D-glucitolFUSDown-regulates activityRelocalization
Osmotic stressFUSDown-regulates activityRelocalization
FUS Downstream Effects
TargetEffectMechanism
PA2G4Up-regulates activitySumoylation
DLG4Up-regulates quantityPost-transcriptional
SHANK1Up-regulates quantityPost-transcriptional
SMN1Up-regulates activityRelocalization
CSDE1Up-regulates by stabilizationPost-transcriptional
DHX9Down-regulates activityRelocalization
DDX3XDown-regulates activityRelocalization
MATR3Up-regulates activityRelocalization
SNRNP70Up-regulates activityBinding
Protein aggregatesUp-regulates quantity-
Protein synthesisDown-regulates-
DNA Binding Motifs
JASPAR Binding Profiles
No data available (FUS is primarily an RNA-binding protein)
Drug & Pharmacology Data**
ChEMBL Target Information
FieldValue
ChEMBL Target IDCHEMBL5724679
Target NameRNA-binding protein FUS
Target TypeSingle Protein
Targeting Molecules Total Molecules: 2 (all preclinical)
ChEMBL IDNameTypeHighest Phase
CHEMBL3752910-Small moleculePhase 0 (Preclinical)
CHEMBL5653589--Phase 0 (Preclinical)
PharmGKB Status
FieldValue
PharmGKB IDPA28425
VIP Gene✓ Yes (Very Important Pharmacogene)
CPIC GuidelineNo
Clinical AnnotationsLimited
Pharmacogenomics
  • No CPIC dosing guidelines currently exist
  • Gene marked as VIP indicating pharmacological importance
  • Limited drug-gene interaction annotations

Section 11: Expression Profiles Bgee Expression Summary

FieldValue
Expression PatternUbiquitous
Total Present Calls304
Max Expression Score99.63
Tissue Expression (Top 50 from Bgee/UBERON) FUS is expressed ubiquitously. Key tissues with confirmed expression: Category: Brain Tissues: Cerebellum, cerebellar cortex, prefrontal cortex, olfactory bulb, nucleus accumbens, putamen, caudate nucleus, amygdala, cingulate cortex, anterior cingulate cortex, Brodmann area 9, left/right frontal lobe Category: Spinal Cord Tissues: Spinal cord, C1 segment of cervical spinal cord Category: Nervous System Tissues: Nerve, tibial nerve Category: Endocrine Tissues: Pituitary gland, adenohypophysis, thyroid gland (left/right lobes), adrenal gland (left/right), adrenal cortex Category: Reproductive Tissues: Testis (left/right), ovary (left/right), uterus, endometrium, fallopian tubes, vagina Category: Gastrointestinal Tissues: Stomach (body/fundus), esophagus, small intestine, transverse colon, sigmoid colon, rectum, caecum, vermiform appendix Category: Cardiovascular Tissues: Heart apex, coronary arteries, aorta (ascending/thoracic), tibial artery, popliteal artery Category: Respiratory Tissues: Lung (right/left, upper/lower lobes) Category: Other Tissues: Liver, kidney (metanephros cortex), spleen, skin, lymph node, tonsil, gallbladder, pancreas, salivary glands Cell Type Expression (Bgee/Cell Ontology) Total Cell Types: 18
Cell TypeCell Ontology ID
GranulocyteCL:0000094
MonocyteCL:0000576
LeukocyteCL:0000738
Mononuclear leukocyteCL:0000842
Stromal cell of endometriumCL:0002255
Bone marrow cellCL:0002092
Type B pancreatic cellCL:0000169
Buccal mucosa cellCL:0002336
Bronchial epithelial cellCL:0002328
Secondary oocyteCL:0000655
Male germ cellCL:0000015
OocyteCL:0000023
SpermCL:0000019
Pancreatic ductal cellCL:0002079
Endothelial cellCL:0000115
Epithelial cell of pancreasCL:0000083
Single-Cell Expression Data (SCXA) Total Single-Cell Datasets: 11
Experiment IDDescriptionCells
E-CURD-79Human thymic development, T cell repertoire152,320
E-MTAB-10283Endometrial organoids with NOTCH/WNT inhibitors574,689
E-GEOD-149689COVID-19 and Influenza immunophenotyping166,852
E-GEOD-124263Neonatal and adult human testis64,206
E-GEOD-134144Testis development during human puberty150,071
E-MTAB-9435IDHwt glioblastoma (3 patient tumors)62,867
E-MTAB-10662Human fetal lung explants39,900
E-MTAB-10432Post-MI heart failure bone vascular niche8,228
E-MTAB-6819Naive and primed human ESCs1,344
E-MTAB-7037Dermal fibroblasts with dsRNA stimulation590
E-MTAB-4850Antigen-specific T cells post-HCV clearance63
FANTOM5 Promoter Data Total Active Promoters: 3
Promoter IDTPM AverageSamples Expressed
153775261.631,827
1537791.09640
1537851.991,044

Section 12: Disease Associations GenCC Disease-Gene Validity Total Curated Associations: 7

DiseaseDisease IDClassificationInheritanceSubmitter
Amyotrophic lateral sclerosisMONDO:0004976DefinitiveADIllumina
Amyotrophic lateral sclerosis type 6OMIM:608030StrongADLabcorp/Invitae
Amyotrophic lateral sclerosis type 6OMIM:608030StrongADGenomics England
Amyotrophic lateral sclerosisORPHANET:803SupportiveADOrphanet
Juvenile amyotrophic lateral sclerosisORPHANET:300605SupportiveAROrphanet
Tremor, hereditary essential, 4OMIM:614782LimitedADLabcorp/Invitae
Tremor, hereditary essential, 4OMIM:614782LimitedUnknownAmbry Genetics
Orphanet Disease Associations Total Orphanet Diseases: 5
Orphanet IDDisease NameTypeGenesPhenotypes
803Amyotrophic lateral sclerosisDisease3647
300605Juvenile amyotrophic lateral sclerosisDisease546
275872Frontotemporal dementia with motor neuron diseaseDisease735
79105MyxofibrosarcomaDisease30
99967Myxoid/round cell liposarcomaHistopathological subtype23
HPO Phenotype Associations Total HPO Terms: 118 Top 50 Associated Phenotypes:
HPO IDPhenotype
HP:0007354Amyotrophic lateral sclerosis
HP:0002145Frontotemporal dementia
HP:0000006Autosomal dominant inheritance
HP:0001260Dysarthria
HP:0002015Dysphagia
HP:0002380Fasciculations
HP:0001324Muscle weakness
HP:0003202Skeletal muscle atrophy
HP:0002878Respiratory failure
HP:0001283Bulbar palsy
HP:0001257Spasticity
HP:0001347Hyperreflexia
HP:0003487Babinski sign
HP:0002366Abnormal lower motor neuron morphology
HP:0002127Abnormal upper motor neuron morphology
HP:0002180Neurodegeneration
HP:0002529Neuronal loss in CNS
HP:0007190Neuronal loss in cerebral cortex
HP:0007373Motor neuron atrophy
HP:0002314Degeneration of lateral corticospinal tracts
HP:0001288Gait disturbance
HP:0003376Steppage gait
HP:0002540Inability to walk
HP:0002460Distal muscle weakness
HP:0003484Upper limb muscle weakness
HP:0007340Lower limb muscle weakness
HP:0003701Proximal muscle weakness
HP:0003324Generalized muscle weakness
HP:0003693Distal amyotrophy
HP:0007126Proximal amyotrophy
HP:0003700Generalized amyotrophy
HP:0008955Progressive distal muscular atrophy
HP:0002061Lower limb spasticity
HP:0006986Upper limb spasticity
HP:0002313Spastic paraparesis
HP:0001264Spastic diplegia
HP:0001308Tongue fasciculations
HP:0002483Bulbar signs
HP:0002425Anarthria
HP:0002300Mutism
HP:0002307Drooling
HP:0000217Xerostomia
HP:0001618Dysphonia
HP:0002167Abnormal speech pattern
HP:0002463Language impairment
HP:0001276Hypertonia
HP:0001265Hyporeflexia
HP:0001348Brisk reflexes
HP:0002169Clonus
HP:0002186Apraxia
GWAS Associations Total GWAS Associations: 3
Study IDTraitMapped GeneChromosomeP-value
GCST010242HDL cholesterol levelsPRSS36 - NDUFA3P6167.0 × 10⁻⁹
GCST008115Bipolar I disorderPRSS36165.0 × 10⁻⁷
GCST008103Bipolar disorderPRSS36168.0 × 10⁻⁶
Note: These associations are in the genomic region near FUS but the mapped gene listed is PRSS36 SUMMARY STATISTICS
CategoryCount
Ensembl Transcripts47
UniProt Entries6 (1 reviewed)
Exons (canonical transcript)15
PDB Structures23
ClinVar Variants657
Pathogenic Variants30
Likely Pathogenic Variants13
AlphaMissense Predictions3,410
SpliceAI Predictions1,976
GO Terms24
Reactome Pathways4
STRING Interactions3,788
IntAct Interactions373+
BioGRID Interactions1,068
HPO Phenotypes118
GenCC Disease Associations7
Orphanet Diseases5
Single-Cell Datasets11
FANTOM5 Promoters3
Orthologs5 species
Document Generated: 2026-03-29 Data Sources: HGNC, Ensembl, NCBI Entrez, UniProt, OMIM, PDB, AlphaFold, ClinVar, AlphaMissense, SpliceAI, Reactome, Gene Ontology, STRING, IntAct, BioGRID, CollecTRI, SIGNOR, PharmGKB, ChEMBL, Bgee, FANTOM5, SCXA, GenCC, Orphanet, HPO, GWAS Catalog