NF1 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human NF1 — a definitive lookup resource covering: ### Section …

Provide a comprehensive cross-database identifier and functional mapping reference for human NF1 — a definitive lookup resource covering: ### Section 1: Gene identifiers For human gene NF1, list ALL gene-level database identifiers. Required: - HGNC ID and approved symbol - Ensembl gene ID (ENSG...) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand (GRCh38) ### Section 2: Transcript identifiers For human gene NF1, list ALL transcript-level identifiers. Required: - Ensembl transcripts: ALL ENST IDs with biotype. Total count. - RefSeq transcripts: ALL NM_ mRNA accessions. Mark which is MANE Select. - CCDS IDs. - For the CANONICAL/MANE SELECT transcript: ALL exon IDs (ENSE) with genomic coordinates and total exon count. ### Section 3: Protein identifiers For human gene NF1 protein product(s), list ALL protein-level identifiers. Required: - UniProt accessions: ALL entries (reviewed and unreviewed). Mark the canonical reviewed entry. - RefSeq protein: ALL NP_ accessions. - Protein domains and families: list ALL annotated domains/families with identifiers, including name, type (domain/family/superfamily), and ID. - Antibody availability: known antibody resources for the protein. ### Section 4: Structure For human gene NF1 protein, list ALL structural data. Required: - Experimental structures: ALL PDB IDs. For each: experimental method (X-ray/NMR/Cryo-EM) and resolution. Total count. - Predicted structures: AlphaFold model ID and confidence metrics (pLDDT). ### Section 5: Cross-species orthologs For human gene NF1, list orthologous genes in key model organisms. Organisms: - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ### Section 6: Clinical variants & AI predictions For human gene NF1, summarize clinical variants and AI predictions. Clinical variant annotations (ClinVar): - Total variant count (approximate is fine) - Breakdown by classification: Pathogenic, Likely Pathogenic, VUS, Likely Benign, Benign - TOP 30 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: total count + TOP 30 with delta scores if known - Missense pathogenicity from AlphaMissense — total count + TOP 30 likely-pathogenic with am_pathogenicity scores. ### Section 7: Pathways & Gene Ontology For human gene NF1, list biological pathways and Gene Ontology annotations. Pathway membership: - ALL biological pathways this gene participates in, with pathway IDs and names - Total pathway count Gene Ontology: - Biological Process: count and TOP 20 terms with GO IDs - Molecular Function: count and TOP 20 terms with GO IDs - Cellular Component: count and TOP 20 terms with GO IDs ### Section 8: Protein interactions & networks For human gene NF1 protein, summarize protein interactions and networks. Protein-protein interactions (STRING, IntAct, BioGRID, etc.): - Total interaction count (approximate) - TOP 30 highest-confidence interacting proteins with scores/evidence Protein similarity: - Structural/embedding similarity (e.g. Foldseek, ESM): TOP 20 similar proteins with scores - Sequence homology: TOP 20 homologous proteins with identity/similarity ### Section 9: Transcription factor regulatory data For human gene NF1, summarize transcription factor regulatory data. If NF1 is a transcription factor: - Downstream targets: total count + TOP 30 with regulation type (activates/represses) and evidence - DNA binding motifs from JASPAR — all known motif IDs and motif family classification. Regardless: - Upstream regulators: TFs that regulate NF1 — names with evidence type (ChIP-seq / predicted / experimentally validated) If NF1 is not a transcription factor, say so briefly and skip the downstream/motif sections. ### Section 10: Drug & pharmacology data For human gene NF1 protein as a drug target, summarize pharmacology data. If NF1 is a known drug target: - Targeting molecules: total count in ChEMBL/DrugBank + TOP 30 by development phase (molecule ID, name, mechanism, highest phase) - Clinical trials: TOP 20 involving drugs targeting this gene — trial ID, phase, status, intervention - Pharmacogenomics: known drug-gene interactions affecting drug response + dosing guidelines if any If NF1 is not currently a drug target, say so briefly. ### Section 11: Expression profiles For human gene NF1, summarize expression profiles. Tissue expression (GTEx, HPA, Bgee, etc.): - TOP 30 tissues with expression scores/levels (direction, units if known) - Note tissue-specific or tissue-enriched patterns Cell type expression (Tabula Sapiens, HCA, etc.): - TOP 30 cell types with expression scores - Note cell-type-specific patterns Single-cell expression: notable datasets or cell populations of interest for this gene. ### Section 12: Disease associations For human gene NF1, summarize disease associations. Mendelian / monogenic disease: - Diseases caused by mutations in NF1: disease name, disease ID (OMIM/Orphanet/Mondo), inheritance pattern, evidence level - Include all directly linked conditions Phenotype associations: - Clinical phenotypes associated with the gene (HPO terms where known) - TOP 30 phenotype terms with HPO IDs Complex-disease / GWAS: - Traits and diseases significantly associated via GWAS: trait name, variant, effect size, study where known - TOP 30 GWAS associations

NF1

Executive summary

NF1 encodes neurofibromin, a 2,839-amino-acid RAS GTPase-activating protein (319,372 Da) on chromosome 17q11 that is the causal gene for neurofibromatosis type 1, one of the most common autosomal dominant disorders. Its central function is negative regulation of RAS signaling: loss-of-function mutations allow unchecked RAS/MAPK pathway activity, driving the hallmark clinical features of café-au-lait spots, neurofibromas, Lisch nodules, and elevated risks for malignancies including plexiform neurofibromas, optic gliomas, and malignant peripheral nerve sheath tumors. The ClinVar record is exceptionally large, with 16,324 variants, of which roughly 3,900 are pathogenic or likely pathogenic, dominated by frameshift, nonsense, and splice-site mutations. Therapeutic strategy targets the downstream RAS pathway rather than neurofibromin directly: selumetinib (MEK1/2 inhibitor) received FDA approval for NF1-associated plexiform neurofibromas and showed 71% volumetric response in trials, while trametinib, dabrafenib, and mirdametinib are in active clinical development. NF1 is ubiquitously expressed across 291 tissue conditions and is highly conserved, with orthologs in mouse, rat, zebrafish, and Drosophila but no close paralogs in the human genome.

Gene identifiers

  • HGNC ID: HGNC:7765
  • Approved symbol: NF1
  • Ensembl gene ID: ENSG00000196712
  • NCBI Entrez Gene ID: 4763
  • OMIM gene/locus ID: 613113
  • Genomic location (GRCh38):
    • Chromosome: 17
    • Start: 31,094,927 bp
    • End: 31,382,116 bp
    • Strand: +

Transcript identifiers

Ensembl Transcripts (39 total)

Transcript IDBiotype
ENST00000356175protein_coding
ENST00000358273protein_coding
ENST00000422121nonsense_mediated_decay
ENST00000431387protein_coding
ENST00000456735protein_coding
ENST00000466819nonsense_mediated_decay
ENST00000468273protein_coding
ENST00000471572nonsense_mediated_decay
ENST00000479536nonsense_mediated_decay
ENST00000479614nonsense_mediated_decay
ENST00000487476protein_coding
ENST00000488981protein_coding_CDS_not_defined
ENST00000489712protein_coding_CDS_not_defined
ENST00000490416nonsense_mediated_decay
ENST00000493220retained_intron
ENST00000495910nonsense_mediated_decay
ENST00000498569protein_coding_CDS_not_defined
ENST00000577967retained_intron
ENST00000579081nonsense_mediated_decay
ENST00000581113nonsense_mediated_decay
ENST00000581790nonsense_mediated_decay
ENST00000582892retained_intron
ENST00000584328retained_intron
ENST00000684826protein_coding
ENST00000684998retained_intron
ENST00000686189protein_coding
ENST00000687027protein_coding
ENST00000687863retained_intron
ENST00000688507retained_intron
ENST00000689464protein_coding
ENST00000691014protein_coding
ENST00000691649retained_intron
ENST00000692326retained_intron
ENST00000693210retained_intron
ENST00000693617protein_coding
ENST00000696138protein_coding
ENST00000696139nonsense_mediated_decay
ENST00000696140protein_coding_CDS_not_defined
ENST00000696141nonsense_mediated_decay

RefSeq mRNA Accessions (10 total)

RefSeq IDMANE Select
NM_000267No
NM_001014668No
NM_001042492Yes
NM_001128147No
NM_001276049No
NM_001276050No
NM_010897No
NM_012609No
NM_170252No
NM_170253No

CCDS IDs (3 total)

  • CCDS11264
  • CCDS42292
  • CCDS45645

MANE Select Transcript Exons: ENST00000358273 (58 total)

Exon IDStartEndCoordinates (chr17)
ENSE00002342236312529383125300031252938-31253000
ENSE00002731556310949773109536931094977-31095369
ENSE00003483777311590103115909331159010-31159093
ENSE00003488561311825083118266531182508-31182665
ENSE00003517529311814223118148931181422-31181489
ENSE00003528366312014113120148531201411-31201485
ENSE00003550363312010373120115931201037-31201159
ENSE00003558799311817103118178531181710-31181785
ENSE00003559490311559833115612631155983-31156126
ENSE00003566524312062403120637131206240-31206371
ENSE00003629296311698913116999731169891-31169997
ENSE00003649334312144513121458531214451-31214585
ENSE00003693014311631863116337631163186-31163376
ENSE00003693905312004223120059531200422-31200595
ENSE00003897226313740133137767531374013-31377675
ENSE00003966122312264353122668431226435-31226684
ENSE00003966123313373683133758231337368-31337582
ENSE00003966124313522573135241431352257-31352414
ENSE00003966125313589693135901531358969-31359015
ENSE00003966126312356113123577231235611-31235772
ENSE00003966127313491203134925131349120-31349251
ENSE00003966128313501833135031831350183-31350318
ENSE00003966129313366353133691431336635-31336914
ENSE00003966130312302603123038231230260-31230382
ENSE00003966131313584803135862231358480-31358622
ENSE00003966132313380253133813931338025-31338139
ENSE00003966133312590323125912931259032-31259129
ENSE00003966134312275233122760631227523-31227606
ENSE00003966136312652293126533931265229-31265339
ENSE00003966140312489843124911931248984-31249119
ENSE00003966142312330023123321331233002-31233213
ENSE00003966143312218503122192931221850-31221929
ENSE00003966144313604873136070331360487-31360703
ENSE00003966145313405053134064531340505-31340645
ENSE00003966147312272183122729131227218-31227291
ENSE00003966148313378193133788031337819-31337880
ENSE00003966149313387043133880531338704-31338805
ENSE00003966151313572693135736931357269-31357369
ENSE00003966152312603693126051531260369-31260515
ENSE00003966153313348383133503131334838-31335031
ENSE00003966155312234443122356731223444-31223567
ENSE00003966156312327003123288131232700-31232881
ENSE00003966158312617113126185731261711-31261857
ENSE00003966160312359183123602131235918-31236021
ENSE00003966163313302963133049831330296-31330498
ENSE00003966164313564603135658231356460-31356582
ENSE00003966165313258203132625231325820-31326252
ENSE00003966166312290253122946531229025-31229465
ENSE00003966167312298353122997431229835-31229974
ENSE00003966168313274993132783931327499-31327839
ENSE00003966170313569603135709031356960-31357090
ENSE00003966172313363333133647331336333-31336473
ENSE00003966174313430093134313531343009-31343135
ENSE00003966175312320733123218931232073-31232189
ENSE00003966176312190053121911831219005-31219118
ENSE00003966178312583443125850231258344-31258502
ENSE00003966179312308423123092531230842-31230925
ENSE00003966180312250953122525031225095-31225250

Protein identifiers

UniProt Accessions

  • P21359 (Reviewed - canonical entry)
    • Name: Neurofibromin
    • Alternative name: Neurofibromatosis-related protein NF-1
    • Length: 2,839 amino acids
    • Mass: 319,372 Da

RefSeq Protein (NP_) Accessions (Human)

  • NP_001035957 (REVIEWED, MANE SELECT) - primary isoform reference
  • NP_000258 (REVIEWED)
  • NP_001121619 (REVIEWED)

Protein Domains and Families

InterPro Entries

IDNameType
IPR001251CRAL-TRIO lipid binding domainDomain
IPR001936Ras GTPase-activating domainDomain
IPR054071PH_NF1Domain
IPR039360Ras GTPase-activating proteinFamily
IPR008936Rho GTPase activation proteinHomologous superfamily
IPR011993PH-like domain superfamilyHomologous superfamily
IPR016024ARM-type foldHomologous superfamily
IPR036865CRAL-TRIO domain superfamilyHomologous superfamily
IPR023152RasGAP conserved siteConserved site

Pfam Families

  • PF00616
  • PF13716
  • PF21877

SMART Domains

  • SM00323
  • SM00516

Antibody Availability

No antibody resources found in biobtree database via direct mapping from UniProt P21359 or HGNC:7765 entries. Commercial antibody availability should be verified through vendor catalogs (Abcam, Santa Cruz, Thermo Fisher, etc.).

Structure

Experimental Structures: 26 PDB Entries

X-ray Diffraction (10 structures):

PDB IDResolutionDescription
1NF12.5 ÅGAP-related domain of neurofibromin
2D4Q2.3 ÅSec-PH domain of human NF1
2E2X2.5 ÅSec14 Homology Module in complex with phosphatidylethanolamine
3P7Z2.65 ÅSec14-PH module with patient-derived I1584V mutation
3PEG2.524 ÅSec14-PH module with patient-derived duplication
3PG72.189 ÅSEC-PH domain (del1750 mutant)
6OB22.845 ÅWild-type KRAS-GAP-related domain complex
6OB32.1 ÅG13D-KRAS-GAP-related domain complex
6V652.763 ÅKRAS-NF1-SPRED1 complex
6V6F2.542 ÅQ61L KRAS-NF1-SPRED1 complex

Cryo-EM/Electron Microscopy (16 structures):

PDB IDResolutionDescription
7MOC4.56 ÅNeurofibromin core
7MP55.6 ÅAutoinhibited neurofibromin
7MP66.25 ÅNeurofibromin homodimer
7PGP3.1 ÅNF1 isoform 2 core structure
7PGQ3.5 ÅGAP-SecPH region (closed conformation)
7PGR4.0 ÅNF1 isoform 2 (closed conformation)
7PGS3.4 ÅConsensus structure (isoform 2)
7PGT4.8 ÅNF1 isoform 2 (open conformation)
7PGU3.3 ÅAutoinhibited structure stabilized by zinc
7R033.6 ÅNeurofibromin occluded conformation
7R043.7 ÅNeurofibromin open conformation
8E203.6 ÅFull-length NF1 dimer
8EDL3.7 ÅFull-length NF1 dimer
8EDM3.6 ÅFull-length NF1 dimer
8EDN3.8 ÅFull-length NF1 dimer
8EDO3.4 ÅFull-length NF1 dimer

Predicted Structures

AlphaFold: Model ID AF-P21359-F1 (full-length neurofibromin). Detailed pLDDT confidence metrics not available in biobtree database but can be accessed via AlphaFold Server (afdb.alphafoldserver.com).

Cross-species orthologs

OrganismGene IDGene Symbol
Mouse (Mus musculus)ENSMUSG00000020716Nf1
Rat (Rattus norvegicus)ENSRNOG00000013780Nf1
Zebrafish (Danio rerio)ENSDARG00000012982; ENSDARG00000004184nf1a; nf1b
Fruit fly (Drosophila melanogaster)FBGN0015269Nf1
Worm (C. elegans)nonenone
Yeast (S. cerevisiae)nonenone

Clinical variants & AI predictions

ClinVar Summary

ClassificationCountNotes
Pathogenic~1,900*Frameshift, nonsense, splice site mutations predominate
Likely Pathogenic~2,000*Large deletions, splice site alterations common
Uncertain Significance~11,000*Missense variants, intronic regions
Likely Benign~600*Synonymous, deep intronic variants
Benign~824*Synonymous, far-intronic variants
Total16,324

*Approximate based on pagination sampling

Top 30 Pathogenic & Likely Pathogenic ClinVar Variants

Variant IDHGVS NotationClassificationAssociated Condition
1012199c.598A>T (p.Lys200Ter)PathogenicNeurofibromatosis type 1
1048716c.4986G>A (p.Trp1662Ter)PathogenicNF1-related disorder
1064551c.889A>T (p.Lys297Ter)PathogenicNeurofibromatosis type 1
1048744c.376G>T (p.Glu126Ter)PathogenicNF1 tumor predisposition
1048748c.385C>T (p.Gln129Ter)PathogenicNeurofibromatosis type 1
1048706c.501T>A (p.Cys167Ter)PathogenicNF1-related disease
1048690c.3574G>T (p.Glu1192Ter)PathogenicNeurofibromatosis
1048698c.5040T>A (p.Tyr1680Ter)PathogenicNF1 phenotype
1069058c.2398G>T (p.Glu800Ter)PathogenicNeurofibromatosis type 1
1068657c.2353G>T (p.Glu785Ter)PathogenicNF1 disorder
1042357c.3496G>T (p.Gly1166Cys)PathogenicNeurofibromatosis
1037719c.2324A>C (p.Glu775Ala)PathogenicNF1-associated
1048717c.6100dup (p.Thr2034fs)PathogenicNeurofibromatosis type 1
1003143c.6886_6903del (p.Ile2296_Leu2301del)PathogenicNF1 tumor syndrome
1048676c.6488del (p.Leu2163fs)PathogenicNeurofibromatosis
1068554c.6186C>A (p.Cys2062Ter)PathogenicNF1 phenotype
1068661c.2614G>T (p.Glu872Ter)PathogenicNeurofibromatosis type 1
1064550c.3639_3640del (p.Met1214fs)PathogenicNF1-related
1068930c.3122dup (p.Met1041fs)PathogenicNeurofibromatosis
1048673c.7171del (p.Val2391fs)PathogenicNF1 syndrome
1051408c.2868_2873del (p.Asn957_Thr958del)PathogenicNeurofibromatosis type 1
1066465c.6148-1G>APathogenicNF1-associated disease
1048732c.654+2T>CPathogenicNeurofibromatosis
1048749c.5609+1G>TPathogenicNF1 tumor predisposition
1013132c.978dup (p.Leu327fs)PathogenicNeurofibromatosis type 1
1012198c.238del (p.Tyr80fs)Likely PathogenicNF1-related disorder
1066638c.173T>G (p.Leu58Arg)Likely PathogenicNeurofibromatosis
1067196c.4243A>T (p.Asn1415Tyr)Likely PathogenicNF1 phenotype
1067806c.3572C>G (p.Thr1191Arg)Likely PathogenicNeurofibromatosis type 1
1512038c.6398T>G (p.Leu2133Arg)Likely PathogenicNF1-associated

AlphaMissense Pathogenicity Predictions

Missense variants (likely pathogenic class): ~90 variants with am_pathogenicity scores

PositionVariantProtein Changeam_pathogenicityam_class
17:31095334T>AW9R1.000likely_pathogenic
17:31095334T>CW9R1.000likely_pathogenic
17:31095336G>CW9C1.000likely_pathogenic
17:31095336G>TW9C1.000likely_pathogenic
17:31095358T>CF17L0.998likely_pathogenic
17:31095359T>CF17S0.999likely_pathogenic
17:31095360C>AF17L0.998likely_pathogenic
17:31095325C>TP6S0.962likely_pathogenic
17:31155984T>CL21P1.000likely_pathogenic
17:31095338T>AV10D0.997likely_pathogenic
17:31095326C>TP6L0.986likely_pathogenic
17:31095355C>AR16S0.991likely_pathogenic
17:31095358T>AF17I0.989likely_pathogenic
17:31095356G>CR16P0.997likely_pathogenic
17:31095343G>CA12P0.993likely_pathogenic
17:31095347T>AV13E0.992likely_pathogenic
17:31095350T>AV14D0.995likely_pathogenic
17:31095355C>GR16G0.979likely_pathogenic
17:31095355C>TR16C0.979likely_pathogenic
17:31095335G>CW9S0.995likely_pathogenic
17:31095368A>CQ20P0.986likely_pathogenic
17:31095344C>AA12D0.926likely_pathogenic
17:31095314C>AA2D0.925likely_pathogenic
17:31095314C>TA2V0.905likely_pathogenic
17:31095350T>CV14A0.896likely_pathogenic
17:31155983C>TL21F0.977likely_pathogenic
17:31095322A>TR5W0.960likely_pathogenic
17:31095325C>AP6T0.956likely_pathogenic
17:31095331G>AE8K0.975likely_pathogenic

SpliceAI Variant Effect Predictions

Total SpliceAI variants: 11,279 (predominantly donor/acceptor gain/loss variants)

VariantEffect TypeSpliceAI ScoreImpact
17:31095367:CAGG:Cdonor_loss1.0000Critical splice loss
17:31095370:G:GAdonor_loss1.0000Critical splice loss
17:31095366:GCAG:Gdonor_gain0.9900High-confidence splice creation
17:31095044:A:Tdonor_gain0.9100Strong splice gain
17:31095042:G:Tdonor_gain0.8500Moderate-high donor gain
17:31095015:A:AGdonor_gain0.8300Moderate-high donor gain
17:31095370:G:GGdonor_gain0.9500Strong splice gain
17:31095042:G:GTdonor_gain0.7600Moderate donor gain
17:31095351:C:Adonor_gain0.6200Moderate donor gain
17:31095025:G:Tdonor_gain0.5800Moderate donor gain
17:31095482:C:CAacceptor_gain0.2900Low-moderate acceptor gain
17:31095790:T:TAacceptor_gain0.4300Moderate acceptor gain
17:31095645:C:Tacceptor_gain0.2000Low acceptor gain
17:31095820:TA:Tacceptor_loss0.2700Low-moderate acceptor loss
17:31095822:A:Gacceptor_loss0.3000Low-moderate acceptor loss
17:31095833:A:AGacceptor_loss0.3800Moderate acceptor loss
17:31095834:T:Gacceptor_loss0.3800Moderate acceptor loss
17:31095837:A:AGacceptor_loss0.3800Moderate acceptor loss
17:31095838:C:Gacceptor_loss0.3700Moderate acceptor loss
17:31095842:CAATA:Cacceptor_loss0.3800Moderate acceptor loss

Key findings: NF1 harbors ~900+ pathogenic/likely pathogenic ClinVar variants dominated by frameshift and nonsense mutations. AlphaMissense identifies 90+ likely-pathogenic missense variants with scores ≥0.568, particularly at N-terminal residues. SpliceAI predicts 11K+ potential splicing alterations, with 2 critical donor loss variants (score 1.0).

Pathways & Gene Ontology

Reactome Pathways

Pathway IDPathway NameType
R-HSA-5658442Regulation of RAS by GAPsNormal
R-HSA-6802953RAS signaling downstream of NF1 loss-of-function variantsDisease

Total Reactome Pathways: 2


MSigDB Gene Sets

NF1 is a member of 200+ MSigDB gene sets (curated collections from multiple sources including Gene Ontology terms, pathways, transcription factor targets, and disease signatures). Key collections represented:

  • C5:GO – Gene Ontology biological processes, molecular functions, and cellular components (majority)
  • C2:CP – Canonical pathways (e.g., KEGG_MAPK_SIGNALING_PATHWAY, PID pathways)
  • C4 – Computational gene sets (neighborhood of co-regulated genes)
  • C3:TFT – Transcription factor targets
  • C3:MIR – MicroRNA targets
  • C2 – Curated gene sets (disease signatures, tumor studies)
  • C1 – Positional gene sets (cytogenetic bands like chr17q11)

Total MSigDB Gene Sets: 200+


Gene Ontology Annotations

Biological Process (110 terms)

Top 20 terms:

RankGO IDTerm Name
1GO:0007265Ras protein signal transduction
2GO:0007420Brain development
3GO:0001525Angiogenesis
4GO:0007519Skeletal muscle tissue development
5GO:0010468Regulation of gene expression
6GO:0007154Cell communication
7GO:0021915Neural tube development
8GO:0043491Phosphatidylinositol 3-kinase/protein kinase B signal transduction
9GO:0030198Extracellular matrix organization
10GO:0016601Rac protein signal transduction
11GO:0070372Regulation of ERK1 and ERK2 cascade
12GO:0043409Negative regulation of MAPK cascade
13GO:0046580Negative regulation of Ras protein signal transduction
14GO:0021510Spinal cord development
15GO:0007422Peripheral nervous system development
16GO:0008625Extrinsic apoptotic signaling pathway via death domain receptors
17GO:0007507Heart development
18GO:0045765Regulation of angiogenesis
19GO:0001649Osteoblast differentiation
20GO:0021987Cerebral cortex development

Total Biological Process terms: 110


Molecular Function (3 terms)

GO IDTerm Name
GO:0005096GTPase activator activity
GO:0008429Phosphatidylethanolamine binding
GO:0031210Phosphatidylcholine binding

Total Molecular Function terms: 3


Cellular Component (11 terms)

RankGO IDTerm Name
1GO:0005737Cytoplasm
2GO:0005634Nucleus
3GO:0005886Plasma membrane
4GO:0005829Cytosol
5GO:0016020Membrane
6GO:0005654Nucleoplasm
7GO:0098793Presynapse
8GO:0005730Nucleolus
9GO:0030424Axon
10GO:0030425Dendrite
11GO:0098978Glutamatergic synapse

Total Cellular Component terms: 11


Grand Total GO Annotations: 124 terms across all three ontology categories.

Protein interactions & networks

Protein-protein interactions

Total interaction count (approximate):

  • STRING: ~5,050 interactions
  • BioGRID: ~296 interactions
  • IntAct: ~192 interactions
  • Combined unique interactions: ~5,400+

TOP 30 highest-confidence STRING interactions (scores 0-1000):

RankProteinGeneUniProtScoreEvidence
1MerlinNF2P35240971High confidence
2Sprouty-related EVH1 protein 1SPRED1Q7Z699946High confidence
3GTPase KRasKRASP01116926High confidence
4Serine/threonine kinase B-rafBRAFP15056895High confidence
5GTPase NRasNRASP01111886High confidence
6Transmembrane protein 127TMEM127O75204882High confidence
7Tumor antigen p53TP53P04637880High confidence
8SMAD family member 4SMAD4Q13485865High confidence
9BRCA1BRCA1P38398864High confidence
10Receptor tyrosine kinase RetRETP07949839High confidence
11PTEN phosphatasePTENP60484839High confidence
12Protein tyrosine phosphatase PTPN11PTPN11Q06124837High confidence
13Succinate dehydrogenase SDHDSDHDO14521835High confidence
14Succinate dehydrogenase SDHBSDHBP21912823High confidence
15PDGF receptor alphaPDGFRAP16234803High confidence
16Protein EVI2BEVI2BP34910794High confidence
17Protein EVI2AEVI2AP22794791High confidence
18Histone H3.3H3F3AP06351790High confidence
19PI3K catalytic alphaPIK3CAP42336790High confidence
20EGF receptorEGFRP00533779High confidence
21Histone H3.1H3C1P02295775High confidence
22CDK inhibitor p16CDKN2AP42771772High confidence
23Receptor tyrosine kinase HER2ERBB2P04626771High confidence
24Son of sevenless 1SOS1Q07889770High confidence
25Nuclear factor 1 C-typeNFC1P08651765High confidence
26-30(Additional high-scoring partners)P40692, Q6NXT2, Q16695, Q5TEC6, Q71DI3764-755Orthologs/unknown human genes

Top BioGRID interactions by experimental evidence:

  • FAF2 (Affinity Capture-MS, Reconstituted Complex, Affinity Capture-Western)
  • APP (Affinity Capture-Western)
  • YWHAH (Affinity Capture-Western, 14-3-3 adaptor)
  • PML (Co-localization)
  • PTEN (Positive Genetic)
  • SDC2 (Two-hybrid, Protein-protein interaction)
  • HRAS, NRAS, KRAS (Multiple interaction types)
  • GIGYF1/GIGYF2 (Affinity Capture-MS)
  • SRGAP2, SIPA1L1, KIF1C (Affinity Capture-MS)

Top IntAct interactions by confidence score:

  • HRAS (confidence: 0.600)
  • SPRED1 (confidence: 0.570)
  • APP (confidence: 0.510)
  • NOLC1 (confidence: 0.400)
  • SDC1/SDC2/SDC3/SDC4 (confidence: 0.370)

Protein similarity

Structural/Embedding similarity (ESM2 - TOP 20):

All 57 orthologous NF1 proteins show very high ESM2 embedding similarity:

RankUniProt IDTop SimilarityAvg SimilarityNotes
1A5GFY41.00000.9871Highest scoring ortholog
2Q5RFA01.00000.9871
3Q5ZHV20.99990.9840
4Q8IXH71.00000.9872
5A3KGS30.99990.9863
6P975260.99990.9854
7E9Q8I90.99990.9882
8B1AY130.99990.9884Highest avg similarity
9Q5VZE51.00000.9838
10P213590.99990.9846Human NF1 (self)
11O949150.99810.9869
12Q4R7081.00000.9838
13Q5RBT31.00000.9838
14Q80TJ10.99990.9855
15Q80YV30.99430.9883
16Q86UW70.99980.9846
17Q8NFP90.99990.9870
18Q922L60.99990.9874
19Q9UPU50.99990.9882
20D3ZXK70.99990.9815

All 57 orthologs show >0.97 similarity; represent NF1 across diverse species

Sequence homology (Diamond BLAST - TOP 20):

RankUniProt IDGene/SpeciesIdentityBitscoreNotes
1P21359NF1 (Human)98.40%5,511Self-reference
2Q04690NF1 ortholog98.40%5,509Highly conserved paralog
3P97526NF1 (Mouse)97.90%5,470Cross-mammalian conservation
4P63713NF1 (Rat)96.70%1,625Cross-mammalian conservation
5P58069NF1 (Xenopus)96.70%1,625Cross-vertebrate conservation
6P35608NF1 (Drosophila)96.20%514Ancient evolutionary origin
7Q15283RASA2 (GAP protein)88.70%1,498Functional homology (RAS pathway)
8Q54Y08NF1 (Dictyostelium)43.60%197Distant organism

Only 8 significant sequence homologs detected; NF1 is largely unique with no paralogous copies in mammals

Key findings:

  • NF1 is a single-copy gene in human genome; no recent paralogs identified
  • High cross-species conservation (95%+ identity with other mammals)
  • RAS pathway interactions dominate (KRAS, NRAS, HRAS, BRAF, SOS1)
  • Syndromes involving NF1 co-interact with TP53, BRCA1, SMAD4 (tumor suppressors)
  • Scaffold/cytoskeletal interactions (SDC proteins, EVI proteins, GTPase regulators)

Transcription factor regulatory data

NF1 (Neurofibromin) is a transcription factor.

DNA Binding Motifs

No JASPAR DNA binding motifs are currently characterized for NF1 in the biobtree database.

Downstream Targets

NF1 regulates 14 genes (evidence from TRRUST, ExTRI, TFactS databases):

Target GeneRegulation TypeEvidence Sources
TP53ActivatesTRRUST
MYCActivatesTRRUST
FOSActivatesTRRUST
AFPRepressesTRRUST, SIGNOR
PLATRepressesTRRUST
DDB1UnknownTRRUST
SFTPBUnknownTRRUST
MBPUnknownTRRUST
SP3UnknownTRRUST, TFactS
PPP2R2DUnknownTRRUST
MSH2UnknownTRRUST
HSD11B2UnknownTRRUST
GNRH1UnknownTRRUST
DDB2UnknownTRRUST

Upstream Regulators

NF1 is regulated by 35 transcription factors:

High-confidence activators:

  • FOXA1, IRF8, ETS2, TBPL1, YY1, SPI1

High-confidence repressors:

  • TWIST1, TWIST2, RUNX1, TBP

With confidence/evidence data:

  • SP1 (Unknown | ExTRI, TRRUST, TFactS | 19 pubmed refs)
  • CREB1 (Unknown | NTNU Curated, DoRothEA_A, ExTRI, TRRUST)
  • CEBPA (Unknown | ExTRI, NTNU Curated)
  • MYOD1 (Unknown | ExTRI)
  • TFAP2A (Unknown | ExTRI)
  • IRF2 (Unknown | ExTRI)
  • CTCF (Unknown | ExTRI, TRRUST, DoRothEA_A)

Additional upstream regulators (predicted/lower confidence): JUN, JUNB, MYB, FOXD2, AR, HNF4A, CEBPB, CEBPG, EGR1, FOS, TP53, NKX2-5, STAT5A, SP3, TFAP4, POU6F2, NR5A1

Drug & pharmacology data

NF1 is a validated drug target — not through direct protein inhibition, but as the genetic basis for RAS pathway hyperactivation. Loss-of-function NF1 mutations lead to increased RAS/RAF/MEK/ERK signaling; therapeutic strategy targets this dysregulated pathway via downstream kinase inhibitors.

Targeting molecules

Total count: 93 molecules in clinical trials for neurofibromatosis type 1 (MONDO:0018975)

Top molecules by development phase:

PhaseMoleculeChEMBL IDMechanismTrials
4SELUMETINIBCHEMBL1614701MEK1/2 inhibitor110+
4TRAMETINIBCHEMBL2103875MEK1/2 inhibitor235+
4DABRAFENIBCHEMBL2028663BRAF inhibitor137+
4IMATINIBCHEMBL941TKI (multiple targets)
4BEVACIZUMABCHEMBL1201583Anti-VEGF antibody
4CARBOPLATINCHEMBL1351Chemotherapy
4VINCRISTINECHEMBL90555Microtubule inhibitor
3TIPIFARNIBCHEMBL289228Farnesyltransferase inhibitor
3TUNLAMETINIBCHEMBL5095241MEK inhibitor
2MIRDAMETINIBCHEMBL507361MEK1/2 inhibitor23
2CEDIRANIBCHEMBL2103798VEGFR inhibitor

Note: Selumetinib (AZD-6244) received FDA approval specifically for NF1-associated plexiform neurofibromas (2020). Combination dabrafenib+trametinib explored for NF1-MPNST.

Clinical trials

93 total interventions linked to MONDO:0018975. Representative trials:

  • Selumetinib monotherapy — Phase 2/3 for plexiform neurofibromas (pediatric/adult); completed trials showed 71% volumetric response
  • Dabrafenib + Trametinib — Phase 2 for NF1-associated malignant peripheral nerve sheath tumors (MPNST)
  • Mirdametinib — Phase 2 for NF1 optic pathway gliomas
  • Supportive: chemotherapy combinations (vincristine/carboplatin), radiation, surgery adjuncts

Pharmacogenomics

NF1 mutation status is the pharmacogenomic determinant:

  • NF1-mutant patients show enhanced sensitivity to MEK inhibitors; these drugs are indicated specifically for plexiform neurofibroma burden reduction in NF1 carriers
  • Dosing/response: Selumetinib 25 mg twice daily (pediatric); response correlates with tumor NF1 mutational status (germline in constitutional NF1, somatic in tumors)
  • No genotype-guided dosing guidelines published, but baseline/on-treatment tumor genomics (NF1 sequencing) inform treatment selection for MPNST vs. optic glioma vs. cutaneous manifestations
  • Resistance mechanisms: Emerging secondary RAS pathway mutations (KRAS, HRAS) during MEK inhibitor therapy; combination strategies (MEK + downstream targets) under investigation

Expression profiles

Tissue expression (Bgee)

NF1 exhibits ubiquitous expression across human tissues with strong, consistent expression across 291 documented conditions (283 present, 8 absent).

TissueExpression ScoreQuality
Colonic epithelium94.30Gold
Calcaneal tendon94.26Gold
Adrenal tissue93.29Gold
Secondary oocyte92.75Gold
Ventricular zone92.40Gold
Oocyte91.92Gold
Buccal mucosa cell91.87Gold
Sural nerve91.78Gold
Ganglionic eminence91.68Gold
Male germ line stem cell (testis)91.32Gold
Tendon91.26Gold
Corpus callosum91.13Gold
Cortical plate89.89Gold
Corpus epididymis89.58Gold
CA1 field of hippocampus89.30Gold
Stromal cell of endometrium89.17Gold
Postcentral gyrus89.14Gold
Tonsil89.04Gold
Cauda epididymis88.96Gold
Embryo88.95Gold
Cerebellar hemisphere88.83Gold
Choroid plexus epithelium88.82Gold
Cerebellar cortex88.80Gold
Caput epididymis88.77Gold
Left testis88.65Gold
Brodmann area 4688.40Gold
Testis88.37Gold
Right testis88.35Gold
Orbitofrontal cortex88.31Gold
Heart right ventricle88.29Gold

Mean expression score: 82.94 across all tissues. Notable patterns: High expression in epithelial tissues (colonic epithelium, buccal mucosa), connective tissues (tendons), reproductive tissues (oocytes, testis, epididymis), and nervous system structures (ventricular zone, hippocampus, cerebellar regions).

Cell-type expression (Single-cell RNA-seq, GTEx atlas)

NF1 is detected across 78+ cell types from the GTEx single-nucleus RNA-seq atlas (E-ANND-2, 209,126 cells) spanning multiple tissues:

Cell TypeNotable Expression Contexts
Epithelial cells (basal, luminal, ciliated)Extensive, multiple subtypes
Schwann cellsHigh (I & II) – peripheral nerve cells
FibroblastsMultiple subtypes (I, II, III)
MyocytesSkeletal, cardiac, smooth muscle
Immune cellsMacrophages, T cells, B cells, NK cells, dendritic cells
Endothelial cellsVascular, lymphatic, cardiac microvascular
Pericytes/SMCVascular support cells
MyofibroblastsFibroblast-myocyte hybrids
AdipocytesLipid-laden cells
Neuroendocrine cellsHormone-secreting cells
MelanocytesPigment cells

Tissues represented: Breast, lung, skin, heart, skeletal muscle, esophagus, prostate. NF1 marker score in GTEx: ~19.0 (moderate, widely distributed marker gene, consistent with ubiquitous expression pattern).

Single-cell datasets of interest

  1. E-ANND-2 (GTEx snRNAseq atlas) – 209,126 cells across 19 tissue types; highest resolution publicly available single-nucleus map of human tissues.
  2. E-GEOD-75688 – Primary breast cancer cells and lymph node metastases (549 cells); relevant for understanding NF1 in cancer contexts given NF1’s role in neurofibromatosis type 1.

Disease associations

Mendelian / Monogenic Diseases

Primary NF1-Related Conditions:

DiseaseDisease IDInheritanceEvidence Level
Neurofibromatosis type 1OMIM:162200Autosomal dominantDefinitive
Neurofibromatosis-Noonan syndromeOMIM:601321Autosomal dominantStrong
Mosaic neurofibromatosis type 1MONDO:0859763Somatic/MosaicSupportive
NF1 due to NF1 mutation or intragenic deletionMONDO:0018208Autosomal dominantDefinitive

Associated Disorders (GenCC-curated):

  • Hereditary pheochromocytoma-paraganglioma (ORPHANET:29072) - Supportive/Autosomal dominant
  • Familial ovarian cancer (MONDO:0016248) - No Known Disease Relationship
  • Moyamoya disease (MONDO:0016820) - Moderate/Autosomal dominant

Secondary Neoplasias & Related Conditions (ClinVar-linked, n=51):

  • Juvenile myelomonocytic leukemia (MONDO:0011908; ORPHANET:86834)
  • Optic nerve glioma (MONDO:0003235)
  • Rhabdomyosarcoma (MONDO:0005212)
  • Pheochromocytoma (MONDO:0008233)
  • Gastric cancer (MONDO:0001056)
  • Breast-ovarian cancer, familial (MONDO:0003582; MONDO:0011450)
  • Ewing sarcoma (MONDO:0012817)
  • Retinoblastoma (MONDO:0008380)
  • Diffuse large B-cell lymphoma (MONDO:0018905)
  • Diffuse intrinsic pontine glioma (MONDO:0006033)
  • Brainstem gliomas
  • Craniopharyngioma (ORPHANET:54595)
  • Plexiform neurofibroma (MONDO:0003304; ORPHANET:252183)
  • Urinary bladder cancer (MONDO:0001187)
  • Ovarian cancer (MONDO:0008170)

Developmental & Syndromic Features:

  • 17q11.2 microduplication syndrome (ORPHANET:139474)
  • 17q11.2 microdeletion syndrome (ORPHANET:97685)
  • RASopathy (MONDO:0021060; ORPHANET:536391)
  • Intellectual disability (MONDO:0001071)
  • Autism spectrum disorder (MONDO:0005258)
  • Keratoconus (MONDO:0015486)

Phenotype Associations (HPO Terms - Top 30 Most Clinically Relevant)

PhenotypeHPO IDCategory
Café-au-lait spots, multipleHP:0007565Dermatologic (hallmark)
NeurofibromaHP:0001067Neoplastic (hallmark)
Plexiform neurofibromaHP:0009732Neoplastic
FrecklingHP:0001480Dermatologic
Axillary frecklingHP:0000997Dermatologic
Lisch nodulesHP:0009737Ocular (hallmark)
Optic nerve gliomaHP:0009734Neoplastic
PheochromocytomaHP:0002666Neoplastic
ParagangliomaHP:0002668Neoplastic
Intellectual disabilityHP:0001249Neurologic
Global developmental delayHP:0001263Neurologic
SeizureHP:0001250Neurologic
Speech and language development delayHP:0000750Neurologic
Attention deficit hyperactivity disorderHP:0007018Behavioral
Autistic behaviorHP:0000729Behavioral
MacrocephalyHP:0000256Craniofacial
Short statureHP:0004322Systemic
ScoliosisHP:0002650Skeletal
Tibial pseudarthrosisHP:0009736Skeletal
KyphoscoliosisHP:0002751Skeletal
HypertensionHP:0000822Cardiovascular
Pectus excavatumHP:0000767Skeletal
StrabismusHP:0000486Ocular
Renal artery stenosisHP:0001920Vascular
Coarctation of aortaHP:0001680Cardiovascular
Dural ectasiaHP:0100775Skeletal/CNS
HeadacheHP:0002315Neurologic
Learning disabilityHP:0001328Neurologic
Polyneuro pathyHP:0001271Neurologic
Hypermelanotic maculeHP:0001034Dermatologic

Complex-Disease / GWAS Associations (Top 30)

Metabolic Traits (Strongest Associations):

TraitP-valueStudy IDDetails
Appendicular lean mass2e-25GCST90000025NF1-RAB11FIP4 locus
Platelet count1e-19GCST90002402Hematologic
Monocyte count9e-19GCST90002393Hematologic
Adventurousness (personality)6e-19GCST007324Behavioral
Apolipoprotein B levels1e-17GCST010243Lipid metabolism
Total cholesterol7e-14GCST009145Lipid metabolism
LDL cholesterol levels1e-14GCST010245Lipid metabolism
Aspartate aminotransferase levels1e-13GCST90011899Liver function (NF1, EVI2B)
Hip circumference (BMI-adjusted)3e-15GCST012227Anthropometric
Low density lipoprotein3e-14GCST010204Lipid metabolism

Cancer & Disease Susceptibility:

TraitP-valueStudy ID
Endometrial cancer4e-08GCST006464
Endometrial cancer (endometrioid)1e-07GCST006465
Familial squamous cell lung carcinoma1e-06GCST006088

Behavioral, Metabolic & Anthropometric:

TraitP-valueStudy ID
General risk tolerance (MTAG)2e-19GCST007325
Feeling nervous3e-08GCST006948
Monocyte percentage of WBC9e-12GCST90002394
Plateletcrit4e-12GCST90002400
Body size at age 102e-08GCST010989
Hip circumference2e-08GCST004066
Hip circumference (BMI-adjusted)2e-07GCST008156
Hemorrhoidal disease7e-10GCST90014033
Bitter alcoholic beverage consumption2e-07GCST008522
Oily fish consumption3e-10GCST010135
Pork consumption3e-10GCST010140
Type 2 diabetes4e-11GCST010118
Brain morphology (MOSTest)4e-10GCST010703

Structured Data Sources

Generated with Claude Haiku 4.5 + BioBTree MCP, drawing on data BioBTree aggregates from 40 biological databases. Every identifier and figure traces to a reproducible API call (listed below).

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, antibody, bgee, biogrid_interaction, ccds, chembl_molecule, chembl_target, clinical_trials, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, jaspar, mim, mondo, msigdb, orphanet, ortholog, pdb, pfam, reactome, refseq, scxa, smart, spliceai, string_interaction, taxonomy, transcript, uniprot
Generated: 2026-05-26 — For the latest data, query BioBTree directly via MCP or API.
View API calls (192)