RB1 Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human RB1. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human RB1. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 34 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, bgee_evidence, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, mim, orphanet, ortholog, pdb, pharmgkb, pharmgkb_gene, reactome, refseq, scxa, spliceai, string_interaction, transcript, uniprot
Generated: 2026-03-27 — For the latest data, query BioBTree directly via MCP or API.
View API calls (45)
RB1

Section 1: Gene Identifiers

AttributeValue
HGNC IDHGNC:9884
Approved SymbolRB1
Approved NameRB transcriptional corepressor 1
Previous NamesRetinoblastoma 1, Osteosarcoma
Previous SymbolsOSRC
AliasesRB, PPP1R130, pRb
Locus TypeGene with protein product
Locus GroupProtein-coding gene
StatusApproved
Ensembl Gene IDENSG00000139687
NCBI Entrez Gene ID5925
OMIM ID614041
Chromosome13
Cytogenetic Location13q14.2
Start Position48,303,735 (GRCh38)
End Position48,599,436 (GRCh38)
Strand+ (Plus/Forward)
Gene Span295,701 bp

Section 2: Transcript Identifiers Ensembl Transcripts (Total: 17)

Transcript IDBiotypeStartEndUTR5UTR3
ENST00000267163protein_coding48,303,75148,481,89048,303,751-48,303,91248,480,072-48,481,890
ENST00000467505nonsense_mediated_decay48,303,77548,481,89048,303,775-48,303,91248,479,998-48,481,890
ENST00000480491protein_coding_CDS_not_defined48,444,82848,456,307--
ENST00000484879retained_intron48,476,44748,480,647--
ENST00000525036retained_intron48,303,75148,360,737--
ENST00000531171protein_coding_CDS_not_defined48,476,40048,480,339--
ENST00000643064protein_coding48,381,25048,599,436--
ENST00000646097protein_coding48,303,74448,333,21648,303,744-48,303,91248,333,065-48,333,216
ENST00000650461protein_coding48,303,74748,481,72848,303,747-48,303,91248,480,038-48,481,728
ENST00000713856protein_coding48,303,75548,481,42948,303,755-48,303,91248,480,034-48,481,429
ENST00000713857protein_coding48,303,84748,481,87548,303,847-48,303,91248,480,072-48,481,875
ENST00000713858protein_coding48,303,84748,481,87548,303,847-48,303,91248,480,072-48,481,875
ENST00000713859nonsense_mediated_decay48,303,87548,481,88348,303,875-48,303,91248,479,998-48,481,883
ENST00000859510protein_coding48,303,75148,481,89048,303,751-48,303,91248,480,072-48,481,890
ENST00000859511protein_coding48,303,77848,481,88648,303,778-48,303,91248,480,072-48,481,886
ENST00000924352protein_coding48,303,73548,481,88648,303,735-48,303,91248,480,072-48,481,886
ENST00000941076protein_coding48,303,76948,481,88548,303,769-48,303,91248,480,072-48,481,885
RefSeq Transcripts - Human (Chromosome 13)
RefSeq IDTypeStatusMANE Select
NM_000321mRNAREVIEWED✓ Yes
NM_001407165mRNAREVIEWEDNo
NM_001407166mRNAREVIEWEDNo
NM_001407167mRNAREVIEWEDNo
NM_001407168mRNAREVIEWEDNo
CCDS ID
IDDescription
CCDS31973Consensus CDS for RB1
Canonical Transcript (ENST00000267163) Exons - Total: 27 exons
Exon IDStartEndLength
ENSE0000381621548,303,75148,304,049298 bp
ENSE0000346927248,307,28048,307,406126 bp
ENSE0000358805948,342,59948,342,714115 bp
ENSE0000349433848,345,08048,345,199119 bp
ENSE0000362123748,347,82548,347,86338 bp
ENSE0000347473348,348,95648,349,02367 bp
ENSE0000346131548,360,01748,360,127110 bp
ENSE0000093967048,362,81548,362,957142 bp
ENSE0000093967148,364,89448,364,97177 bp
ENSE0000093967248,367,49448,367,603109 bp
ENSE0000093967348,368,52748,368,60477 bp
ENSE0000093967448,373,40548,373,49287 bp
ENSE0000093967548,376,91848,377,034116 bp
ENSE0000093967648,379,59448,379,65056 bp
ENSE0000093967748,380,05348,380,08431 bp
ENSE0000093967848,380,16548,380,24176 bp
ENSE0000093967948,381,24748,381,443196 bp
ENSE0000359942448,452,99348,453,111118 bp
ENSE0000093968148,456,20448,456,349145 bp
ENSE0000093968248,459,68848,459,833145 bp
ENSE0000093968348,463,73148,463,835104 bp
ENSE0000093968448,464,99848,465,111113 bp
ENSE0000093968548,465,20548,465,368163 bp
ENSE0000100396248,473,36048,473,39030 bp
ENSE0000354160048,476,70148,476,843142 bp
ENSE0000364182748,477,35548,477,40449 bp
ENSE0000124106748,479,99848,481,8901,892 bp

Section 3: Protein Identifiers UniProt Entries

UniProt IDNameStatusLengthMass
P06400Retinoblastoma-associated protein✓ Canonical/Reviewed928 aa106,159 Da
Alternative Names: p105-Rb, p110-RB1, pRb, pp110 RefSeq Protein Accessions (Human)
RefSeq Protein IDStatusMANE Select
NP_000312REVIEWED✓ Yes
NP_001394094REVIEWEDNo
NP_001394095REVIEWEDNo
NP_001394096REVIEWEDNo
NP_001394097REVIEWEDNo
Protein Domains and Families (Total: 7)
Domain/Family IDNameType
IPR002719RB_BDomain
IPR002720RB_ADomain
IPR013763Cyclin-like_domDomain
IPR015030RB_CDomain
IPR024599RB_NDomain
IPR028309RB_famFamily
IPR036915Cyclin-like_sfHomologous_superfamily

Section 4: Structure Identifiers Experimental Structures (Total: 19 PDB structures)

PDB IDTitleMethodResolution
1AD6Domain A of human retinoblastoma tumor suppressorX-ray2.30 Å
1GH6Retinoblastoma pocket complexed with SV40 large T antigenX-ray3.20 Å
1GUXRB pocket bound to E7 LXCXE motifX-ray1.85 Å
1H25CDK2/Cyclin A in complex with 11-residue recruitment peptide from RbX-ray2.50 Å
1N4MStructure of Rb bound to transactivation domain of E2F-2X-ray2.20 Å
1O9KCrystal structure of Rb bound to E2F peptideX-ray2.60 Å
1PJMMouse Importin alpha-bipartite NLS from human Rb complexX-ray2.50 Å
2AZEStructure of Rb C-terminal domain bound to E2F1-DP1 heterodimerX-ray2.55 Å
2QDJCrystal structure of Rb N-domainX-ray2.00 Å
2R7GStructure of Rb pocket domain with adenovirus E1A CR1 domainX-ray1.67 Å
3N5UCrystal structure of Rb C-terminal peptide bound to PP1X-ray3.20 Å
3POMCrystal structure of unliganded Rb pocket domainX-ray2.50 Å
4CRICrystal structure of 53BP1 tudor domains with methylated K810 Rb peptideX-ray2.35 Å
4ELJCrystal structure of inactive Rb phosphorylated at T373X-ray2.70 Å
4ELLStructure of inactive Rb pocket domainX-ray1.98 Å
9DGKRb with mutation M704VX-ray2.38 Å
9DHCRb with mutation S751YX-ray2.32 Å
9DHFRb with mutation E554KX-ray2.26 Å
9DHURb with mutation E533KX-ray2.16 Å
Predicted Structures
AlphaFold IDGlobal pLDDTSequence LengthFraction Very High Confidence
AF-P0640077.037,455 residues53%

Section 5: Cross-Species Orthologs

OrganismGene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000022105Rb1protein_coding
Rat (Rattus norvegicus)ENSRNOG00000016029Rb1protein_coding
Zebrafish (Danio rerio)ENSDARG00000006782rb1protein_coding
Fruit fly (Drosophila melanogaster)FBGN0015799Rbfprotein_coding
Fruit fly (D. melanogaster)FBGN0038390Rbf2protein_coding
Worm (C. elegans)WBGENE00003020lin-35protein_coding
Yeast (S. cerevisiae)No ortholog identified--

Section 6: Clinical Variants & AI Predictions ClinVar Variant Summary (Total: 4,340 variants)

ClassificationCount
Pathogenic>100
Likely Pathogenic91
Uncertain Significance (VUS)>100
Likely Benign>100
Benign>100
TOP 50 Pathogenic/Likely Pathogenic Variants
Variant IDHGVS NotationClassificationType
100808c.1960G>A (p.Val654Met)PathogenicSNV
126784c.1723C>T (p.Gln575Ter)PathogenicSNV
126785c.1735C>T (p.Arg579Ter)PathogenicSNV
126788c.1909C>T (p.Gln637Ter)PathogenicSNV
126820c.763C>T (p.Arg255Ter)PathogenicSNV
126824c.958C>T (p.Arg320Ter)PathogenicSNV
126832c.1215+1G>APathogenicSplice
126834c.1332+1G>APathogenicSplice
126837c.1363C>T (p.Arg455Ter)PathogenicSNV
126840c.1654C>T (p.Arg552Ter)PathogenicSNV
1069507c.2236G>T (p.Glu746Ter)PathogenicSNV
1071424c.1597G>T (p.Glu533Ter)PathogenicSNV
1071425c.1332G>A (p.Gln444=)PathogenicSilent/Splice
1071426c.1422-2A>GPathogenicSplice
1073217c.19del (p.Arg7fs)PathogenicFrameshift
1073218c.62dup (p.Ala22fs)PathogenicFrameshift
1074237c.2420C>G (p.Ser807Ter)PathogenicSNV
1075933c.191T>G (p.Leu64Ter)PathogenicSNV
1076189c.377del (p.Ile126fs)PathogenicFrameshift
126818c.103C>T (p.Gln35Ter)PathogenicSNV
126810c.409G>T (p.Glu137Ter)PathogenicSNV
126812c.446C>G (p.Ser149Ter)PathogenicSNV
126814c.508G>T (p.Glu170Ter)PathogenicSNV
126844c.219_220del (p.Arg73fs)PathogenicFrameshift
126828c.1060_1061del (p.Gln354fs)PathogenicFrameshift
13069c.2520+1delPathogenicSplice
126805c.2520+1G>APathogenicSplice
1037460c.1589A>G (p.Lys530Arg)Pathogenic/Likely pathogenicMissense
1004646c.264+5G>ALikely pathogenicSplice
13072c.1700C>T (p.Ser567Leu)Likely pathogenicMissense
13085c.-189G>TLikely pathogenic5'UTR
126791c.1960+5G>ALikely pathogenicSplice
126799c.2212-13T>ALikely pathogenicSplice
126836c.1346G>T (p.Gly449Val)Likely pathogenicMissense
3787240c.1A>G (p.Met1Val)Likely pathogenicStart loss
579207c.539+1G>ALikely pathogenicSplice
495820c.1421G>T (p.Ser474Ile)Likely pathogenicMissense
2736025c.1215+5G>ALikely pathogenicSplice
3237051c.1970T>C (p.Leu657Pro)Likely pathogenicMissense
3237052c.1985T>C (p.Leu662Pro)Likely pathogenicMissense
3237063c.2104C>A (p.Gln702Lys)Likely pathogenicMissense
1698763c.2279T>C (p.Phe760Ser)Likely pathogenicMissense
2858090c.2084T>A (p.Met695Lys)Likely pathogenicMissense
3237102c.2490-28T>CLikely pathogenicSplice
949856c.2520+4A>GLikely pathogenicSplice
572140c.652T>G (p.Leu218Val)Likely pathogenicMissense
635474c.1696-2A>GLikely pathogenicSplice
1067622c.1199T>C (p.Leu400Pro)Likely pathogenicMissense
1064436c.1331A>C (p.Gln444Pro)Likely pathogenicMissense
3663254c.2663G>C (p.Ser888Thr)Likely pathogenicMissense
AI-Based Splice Effect Predictions (SpliceAI) Total predictions: 6,247 variants TOP 50 Predicted Splice-Altering Variants:
VariantEffectDelta Score
13:48304045:GTCAG:Gdonor_gain1.00
13:48304050:GTG:Gdonor_loss1.00
13:48304050:G:GGdonor_gain0.99
13:48304046:TCAG:Tdonor_gain0.95
13:48304569:GGTCC:Gdonor_gain0.95
13:48304570:GTCC:Gdonor_gain0.94
13:48304571:TCCT:Tdonor_gain0.94
13:48304485:G:GTdonor_gain0.91
13:48304083:G:Tdonor_gain0.87
13:48304741:G:GTdonor_gain0.86
13:48304485:G:Tdonor_gain0.86
13:48304481:TTGGG:Tdonor_gain0.84
13:48304048:AG:Adonor_gain0.83
13:48304049:GG:Gdonor_gain0.83
13:48304671:T:TAdonor_gain0.82
13:48304672:A:AAdonor_gain0.82
13:48304083:G:GTdonor_gain0.78
13:48304218:A:AGdonor_gain0.78
13:48304222:GATTC:Gdonor_gain0.76
13:48304684:A:Tdonor_gain0.76
13:48304608:G:GTdonor_gain0.74
13:48304217:GAC:Gdonor_gain0.72
13:48304594:GA:Gdonor_gain0.71
13:48304441:G:GAdonor_gain0.71
13:48304470:C:CAacceptor_gain0.70
13:48304018:G:GTdonor_gain0.67
13:48304219:C:Gdonor_gain0.66
13:48304560:G:GTdonor_gain0.66
13:48304701:G:GTdonor_gain0.66
13:48304572:CCTG:Cdonor_gain0.65
AI-Based Missense Pathogenicity Predictions (AlphaMissense) Total predictions: 6,110 variants TOP 50 Predicted Pathogenic Missense Variants:
VariantProtein ChangeAM ScoreClassification
13:48303924:A:CK4N0.673likely_pathogenic
13:48303924:A:TK4N0.673likely_pathogenic
13:48303932:G:AR7Q0.584likely_pathogenic
Note: Most N-terminal variants show likely_benign or ambiguous classifications. Higher pathogenicity scores are observed in the pocket domain regions (positions 379-792).

Section 7: Biological Pathways & Gene Ontology Reactome Pathways (Total: 18)

Pathway IDPathway NameDisease Pathway
R-HSA-69231Cyclin D associated events in G1No
R-HSA-69200Phosphorylation of proteins involved in G1/S transition by active Cyclin E:Cdk2 complexesNo
R-HSA-69202Cyclin E associated events during G1/S transitionNo
R-HSA-69656Cyclin A:Cdk2-associated events at S phase entryNo
R-HSA-113501Inhibition of replication initiation of damaged DNA by RB1/E2F1No
R-HSA-174178APC/C:Cdh1 mediated degradation of Cdc20 and other APC/C:Cdh1 targeted proteins in late mitosis/early G1No
R-HSA-2299718Condensation of Prophase ChromosomesNo
R-HSA-2559584Formation of Senescence-Associated Heterochromatin Foci (SAHF)No
R-HSA-2559585Oncogene Induced SenescenceNo
R-HSA-8940973RUNX2 regulates osteoblast differentiationNo
R-HSA-9764790Positive Regulation of CDH1 Gene TranscriptionNo
R-HSA-9841922MLL4 and MLL3 complexes regulate expression of PPARG target genesNo
R-HSA-9661069Defective binding of RB1 mutants to E2F1,(E2F2, E2F3)Yes
R-HSA-9661070Defective translocation of RB1 mutants to the nucleusYes
R-HSA-9687136Aberrant regulation of mitotic exit in cancer due to RB1 defectsYes
R-HSA-9682706Replication of the SARS-CoV-1 genomeYes
R-HSA-9694686Replication of the SARS-CoV-2 genomeYes
R-HSA-9725371Nuclear events stimulated by ALK signaling in cancerYes
Gene Ontology Annotations (Total: 77) Biological Process (47 terms) - TOP 20:
GO IDTerm Name
GO:0000082G1/S transition of mitotic cell cycle
GO:0000122Negative regulation of transcription by RNA polymerase II
GO:0006338Chromatin remodeling
GO:0006355Regulation of DNA-templated transcription
GO:0006366Transcription by RNA polymerase II
GO:0006469Negative regulation of protein kinase activity
GO:0007346Regulation of mitotic cell cycle
GO:0030154Cell differentiation
GO:0030308Negative regulation of cell growth
GO:0031507Heterochromatin formation
GO:0034088Maintenance of mitotic sister chromatid cohesion
GO:0045786Negative regulation of cell cycle
GO:0045892Negative regulation of DNA-templated transcription
GO:0045944Positive regulation of transcription by RNA polymerase II
GO:0051276Chromosome organization
GO:0051301Cell division
GO:0051726Regulation of cell cycle
GO:0007283Spermatogenesis
GO:0002062Chondrocyte differentiation
GO:2000134Negative regulation of G1/S transition of mitotic cell cycle
Molecular Function (14 terms) - ALL:
GO IDTerm Name
GO:0000977RNA polymerase II transcription regulatory region sequence-specific DNA binding
GO:0003714Transcription corepressor activity
GO:0019900Kinase binding
GO:0031625Ubiquitin protein ligase binding
GO:0042802Identical protein binding
GO:0051219Phosphoprotein binding
GO:0060090Molecular adaptor activity
GO:0061629RNA polymerase II-specific DNA-binding transcription factor binding
GO:0061676Importin-alpha family protein binding
GO:0097718Disordered domain specific binding
GO:0140297DNA-binding transcription factor binding
Cellular Component (10 terms) - ALL:
GO IDTerm Name
GO:0000785Chromatin
GO:0005634Nucleus
GO:0005654Nucleoplasm
GO:0005737Cytoplasm
GO:0005819Spindle
GO:0005829Cytosol
GO:0016514SWI/SNF complex
GO:0016605PML body
GO:0035189Rb-E2F complex
GO:0061793Chromatin lock complex

Section 8: Protein Interactions & Molecular Networks Protein-Protein Interactions

DatabaseInteraction Count
STRING2,743 interactions
IntAct473 interactions
BioGRID1,154 interactions
TOP 50 Highest-Confidence Interacting Proteins (STRING)
UniProt IDGeneScoreFunction
Q01094E2F1996Transcription factor
P11802CDK4985Cyclin-dependent kinase 4
Q13547HDAC1962Histone deacetylase 1
Q00987MDM2925E3 ubiquitin-protein ligase
P06703S100A6921Calcium-binding protein
P20248CCNA2911Cyclin A2
P78396CCNA1908Cyclin A1
O00716E2F3897Transcription factor
Q16254E2F4884Transcription factor
P04637TP53873Tumor protein p53
Q96S94CCNC853Cyclin C
P42771CDKN2A842p16-INK4a/p14-ARF
Q00534CDK6841Cyclin-dependent kinase 6
P24385CCND1831Cyclin D1
P24941CDK2821Cyclin-dependent kinase 2
Q13950RUNX2816Runt-related transcription factor 2
Q04118PRB3809Proline-rich protein 3
P04280PRB1786Proline-rich protein 1
P02813STATH780Statherin
P06401PGR769Progesterone receptor
P29375LTF763Lactotransferrin
Q14209E2F2756Transcription factor
Q5TKA1LIN9740Protein lin-9 homolog
P01106MYC736Myc proto-oncogene protein
P06493CDK1731Cyclin-dependent kinase 1
P60484PTEN715Phosphatidylinositol 3,4,5-trisphosphate 3-phosphatase
P30281CCND3712Cyclin D3
P02810PRB2699Proline-rich protein 2
Q13309SKP2696S-phase kinase-associated protein 2
P49326CST4688Cystatin S
Q15910EZH2687Enhancer of zeste homolog 2
P29374RAR-related orphan receptor A680Nuclear receptor
Q9NYW8RBBP7662RB binding protein 7
P38936CDKN1A658p21/WAF1/CIP1
P39880CUX1644Homeobox protein cut-like 1
Q96PU4UHRF2639E3 ubiquitin-protein ligase
Q16666IFI16630Interferon gamma inducible protein 16
P46527CDKN1B627p27/KIP1
P31149RAG1611V(D)J recombination-activating protein 1
Q9BT92UHRF1607E3 ubiquitin-protein ligase
Q86Y97MRPL54604Mitochondrial ribosomal protein
P12931SRC591Proto-oncogene tyrosine-protein kinase Src
Q15022SUZ12587Polycomb protein SUZ12
Q15329E2F5587Transcription factor
P24864CCNE1584Cyclin E1
P03372ESR1577Estrogen receptor
P31749AKT1577RAC-alpha serine/threonine-protein kinase
O43463SUV39H1568Histone-lysine N-methyltransferase
Protein Structural/Embedding Similarity (ESM2) Total similar proteins: 46 TOP 20:
UniProt IDSimilarity ScoreOrganism
A2YXJ71.0000Mammal
Q84QM31.0000Plant
P134050.9999Mouse (Rb1)
P335680.9999Rat (Rb1)
A9UL140.9997Mammal
Q8H0J60.9997Plant
Q9LKX90.9997Plant
Q9SLZ40.9997Plant
B9GLX80.9996Mammal
B9SVG90.9996Mammal
A9UL130.9995Mammal
A7P5140.9995Mammal
Q32N220.9995Mammal
Q3LXA70.9995Mammal
F7BLM10.9995Mammal
Q4JF750.9994Mammal
Q66WV00.9994Mammal
Q9LKZ30.9994Plant
Q8H2520.9993Plant
Q9SXN60.9992Plant
Sequence Homology (DIAMOND)
UniProt IDProtein NameIdentityBitscore
P13405Mouse Rb195.3%1677
P33568Rat Rb195.3%1681
Q90600Chicken Rb74.8%1291

Section 9: Transcription Factor Regulatory Data RB1 as a Transcriptional Regulator RB1 functions as a transcriptional corepressor, primarily through interaction with E2F family transcription factors. Total CollecTRI regulatory interactions: 114 Downstream Targets (Genes Regulated BY RB1) - TOP 35:

Target GeneRegulationConfidence
E2F1RepressionHigh
E2F2RepressionHigh
E2F3RepressionHigh
MYCRepressionMedium
BRCA1RepressionMedium
DHFRRepressionMedium
DNMT3ARepressionMedium
FGFR1RepressionMedium
FOSRepressionMedium
PCNARepressionMedium
RBL1RepressionMedium
TFDP1RepressionMedium
RB1Repression (autoregulation)Medium
BIRC5UnknownMedium
CDH1UnknownMedium
CDK1UnknownMedium
CDKN1AActivationMedium
VEGFAActivationMedium
SFTPDActivationMedium
ITGA10ActivationMedium
EBNA1BP2ActivationMedium
H1-5ActivationHigh
Upstream Regulators (TFs that Regulate RB1) - TOP 50:
TF GeneRegulationConfidence
E2F1ActivationHigh
E2F3-High
E2F4UnknownHigh
SP1UnknownHigh
FOSActivationHigh
MYOD1ActivationHigh
CTCF-High
AP1UnknownHigh
EGR1UnknownHigh
CREB1UnknownHigh
BCL3-High
CEBPB-High
CXXC1-High
FLI1-High
GRHL3-High
GTF2IRD1-High
HBP1-High
HES1-High
HINFP-High
HMGA1-High
JARID2-High
NCOR1-High
PAX5UnknownHigh
PAX6-High
SON-High
SPI1-High
TAF1-High
TBP-High
DNMT1RepressionHigh
DNMT3A-High
ARID4ARepressionHigh
KAT2BActivationMedium
PGRActivationMedium
RB1CC1ActivationMedium
STAT3RepressionLow
SMAD6RepressionMedium
CTNNB1RepressionMedium
DNA Binding Profiles RB1 does not directly bind DNA; it functions as a transcriptional corepressor by binding to E2F family transcription factors and recruiting chromatin-modifying complexes (HDACs, SUV39H1, etc.).

Section 10: Drug & Pharmacology Data ChEMBL Targets

Target IDTarget NameType
CHEMBL5288Retinoblastoma-associated proteinSINGLE PROTEIN
CHEMBL4523682Protein cereblon/Retinoblastoma-associated proteinPROTEIN-PROTEIN INTERACTION
Targeting Molecules (Total: 25)
ChEMBL IDNameTypeHighest Phase
CHEMBL4446357EBVACICLIBSmall moleculePhase 2
CHEMBL388978STAUROSPORINESmall moleculePhase 0
CHEMBL490451-Small moleculePhase 0
CHEMBL504547-Small moleculePhase 0
CHEMBL515631-Small moleculePhase 0
CHEMBL4060594-CHEMBL4093413Various compoundsSmall moleculePhase 0
Note: Most compounds targeting RB1 are in preclinical development. Ebvaciclib (Phase 2) works by inhibiting CDK4/6, which indirectly affects RB1 phosphorylation status. PharmGKB
IDVIP GeneCPIC Guideline
PA295Yes (Very Important Pharmacogene)No
Pharmacogenomics RB1 is classified as a VIP (Very Important Pharmacogene) due to its role in cancer biology and response to CDK4/6 inhibitors (palbociclib, ribociclib, abemaciclib) which function by maintaining RB1 in its active (hypophosphorylated) tumor-suppressive state.

Section 11: Expression Profiles Expression Summary (Bgee)

AttributeValue
Expression BreadthUbiquitous
Total Present Calls287
Max Expression Score97.39
TOP 30 Tissues by Expression Score
Tissue/Anatomical EntityExpression ScoreQuality
Epithelium of nasopharynx97.39Gold
Choroid plexus epithelium97.16Gold
Visceral pleura96.50Gold
Germinal epithelium of ovary96.24Gold
Gingival epithelium96.09Gold
Pleura95.70Gold
Palpebral conjunctiva95.70Gold
Parietal pleura95.67Gold
Seminal vesicle95.58Gold
Eye94.91Gold
Mucosa of paranasal sinus94.83Gold
Brodmann area 2394.77Gold
Gingiva94.07Gold
Trabecular bone tissue94.05Gold
Jejunal mucosa94.03Gold
Jejunum94.03Gold
Tibia93.98Gold
Superficial temporal artery93.79Gold
Caput epididymis93.77Gold
Corpus epididymis93.77Gold
Tonsil93.64Gold
Endometrium93.42Gold
Cauda epididymis93.40Gold
Adrenal tissue93.36Gold
Trigeminal ganglion93.28Gold
Ventricular zone93.03Gold
Synovial joint92.98Gold
Lower lobe of lung92.75Gold
Penis92.72Gold
Mammary duct92.69Gold
TOP 30 Cell Types by Expression
Cell TypeExpression ScoreQuality
Monocyte93.95Gold
Mononuclear cell93.67Gold
Endothelial cell93.65Gold
Leukocyte93.19Gold
Bone marrow cell91.25Gold
Bronchial epithelial cell91.04Gold
Stromal cell of endometrium89.61Gold
Single-Cell Expression Data
Dataset IDDescriptionSpeciesCell Count
E-ANND-5Mapping the developing human immune system across organsHomo sapiens911,873
E-MTAB-9906scRNA-seq of human fetal intestinal ileum tissueHomo sapiens21,097

Section 12: Disease Associations Mendelian/Monogenic Disease Links (GenCC)

DiseaseDisease IDClassificationInheritanceSubmitter
Hereditary retinoblastomaOMIM:180200DefinitiveAutosomal dominantAmbry Genetics
Hereditary retinoblastomaOMIM:180200DefinitiveAutosomal dominantG2P
Hereditary retinoblastomaOMIM:180200StrongAutosomal dominantLabcorp Genetics
Hereditary retinoblastomaORPHANET:357027SupportiveAutosomal dominantOrphanet
MelanomaMONDO:0005105ModerateAutosomal dominantAmbry Genetics
Orphanet Disease Associations (Total: 5)
Orphanet IDDisease NameTypeGene Count
ORPHA:357027Hereditary retinoblastomaClinical subtype2
ORPHA:357034Non-hereditary retinoblastomaClinical subtype2
ORPHA:1587Monosomy 13q14 syndromeMalformation syndrome1
ORPHA:668OsteosarcomaDisease4
ORPHA:70573Small cell lung cancerDisease3
Phenotype Associations (HPO) - Total: 58 terms
HPO IDPhenotype
HP:0009919Retinoblastoma
HP:0002669Osteosarcoma
HP:0000555Leukocoria
HP:0000518Cataract
HP:0000568Microphthalmia
HP:0000612Iris coloboma
HP:0007862Retinal calcification
HP:0007902Vitreous hemorrhage
HP:0011531Vitritis
HP:0010799Pinealoma
HP:0006740Transitional cell carcinoma of the bladder
HP:0009725Bladder neoplasm
HP:0100526Neoplasm of the lung
HP:0012254Ewing sarcoma
HP:0001909Leukemia
HP:0002665Lymphoma
HP:0001249Intellectual disability
HP:0000252Microcephaly
HP:0001360Holoprosencephaly
HP:0002079Hypoplasia of the corpus callosum
HP:0000243Trigonocephaly
HP:0000175Cleft palate
HP:0000347Micrognathia
HP:0000316Hypertelorism
HP:0000286Epicanthus
HP:0000426Prominent nasal bridge
HP:0000431Wide nasal bridge
HP:0000508Ptosis
HP:0000369Low-set ears
HP:0000391Thickened helices
HP:0000411Protruding ear
HP:0000465Webbed neck
HP:0000470Short neck
HP:0001156Brachydactyly
HP:0004209Clinodactyly of the 5th finger
HP:0006101Finger syndactyly
HP:0009601Aplasia/Hypoplasia of the thumb
HP:0004322Short stature
HP:0001511Intrauterine growth retardation
HP:0001252Hypotonia
HP:0000944Abnormal metaphysis morphology
HP:0006489Abnormal femoral metaphysis morphology
HP:0006491Abnormal tibial metaphysis morphology
HP:0002756Pathologic fracture
HP:0002797Osteolysis
HP:0001386Joint swelling
HP:0000006Autosomal dominant inheritance
HP:0003745Sporadic
HP:0001442Typified by somatic mosaicism
GWAS Associations (Total: 8)
Study IDTrait/Diseasep-valueMapped Gene(s)
GCST005146_24Birth weight2×10⁻⁸RB1
GCST008362_41Birth weight7×10⁻¹¹ITM2B - RB1-DT
GCST90002388_164Lymphocyte count4×10⁻¹¹LPAR6, RB1
GCST008363_106Offspring birth weight2×10⁻⁷ITM2B - RB1-DT
GCST012050_1Diastolic blood pressure2×10⁻⁷RB1
GCST003140_5Chronic kidney disease4×10⁻⁶RB1
GCST003264_156Post bronchodilator FEV1/FVC ratio2×10⁻⁶RB1
GCST003264_7Post bronchodilator FEV1/FVC ratio2×10⁻⁶RB1

DATA SOURCES This reference document integrates data from:

  • HGNC - HUGO Gene Nomenclature Committee
  • Ensembl - Genome browser and annotation
  • NCBI Entrez Gene - Gene database
  • OMIM - Online Mendelian Inheritance in Man
  • UniProt - Protein sequence and functional information
  • RefSeq - Reference sequences
  • CCDS - Consensus CDS
  • PDB - Protein Data Bank
  • AlphaFold - Predicted protein structures
  • ClinVar - Clinical variant interpretations
  • SpliceAI - AI-based splice predictions
  • AlphaMissense - AI-based missense predictions
  • Reactome - Pathway database
  • Gene Ontology - Functional annotations
  • STRING - Protein-protein interactions
  • IntAct - Molecular interactions
  • BioGRID - Biological interactions
  • CollecTRI - Transcription factor-target interactions
  • ChEMBL - Drug/compound bioactivity
  • PharmGKB - Pharmacogenomics
  • Bgee - Gene expression
  • Single Cell Expression Atlas - scRNA-seq data
  • GenCC - Gene-disease clinical validity
  • Orphanet - Rare disease database
  • HPO - Human Phenotype Ontology
  • GWAS Catalog - Genome-wide association studies

Document generated via biobtree cross-database queries. Last updated: March 2026