SUB1

gene
On this page

Also known as PC4p15p14

Summary

SUB1 (SUB1 regulator of transcription, HGNC:19985) is a protein-coding gene on chromosome 5p13.3, encoding Activated RNA polymerase II transcriptional coactivator p15 (P53999). General coactivator that functions cooperatively with TAFs and mediates functional interactions between upstream activators and the general transcriptional machinery.

Enables several functions, including DNA binding activity; identical protein binding activity; and transcription coactivator activity. Involved in RNA polymerase II promoter clearance; protein homooligomerization; and regulation of nucleobase-containing compound metabolic process. Located in nucleolus and nucleoplasm. Part of transcription regulator complex.

Source: NCBI Gene 10923 — RefSeq curated summary.

At a glance

  • GWAS associations: 3
  • Clinical variants (ClinVar): 24 total
  • Druggable target: yes — 1 molecules with ChEMBL bioactivity
  • MANE Select transcript: NM_006713

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:19985
Approved symbolSUB1
NameSUB1 regulator of transcription
Location5p13.3
Locus typegene with protein product
StatusApproved
AliasesPC4, p15, p14
Ensembl geneENSG00000113387
Ensembl biotypeprotein_coding
OMIM600503
Entrez10923

Gene structure

Transcript identifiers

Ensembl transcripts: 32 — 25 protein_coding, 3 retained_intron, 3 protein_coding_CDS_not_defined, 1 nonsense_mediated_decay

ENST00000265073, ENST00000502453, ENST00000502897, ENST00000504016, ENST00000504789, ENST00000506237, ENST00000510442, ENST00000511175, ENST00000511615, ENST00000511988, ENST00000512913, ENST00000513013, ENST00000515355, ENST00000879054, ENST00000879055, ENST00000879056, ENST00000879057, ENST00000879058, ENST00000879059, ENST00000879060, ENST00000879061, ENST00000879062, ENST00000922096, ENST00000922097, ENST00000922098, ENST00000922099, ENST00000922100, ENST00000922101, ENST00000922102, ENST00000954957, ENST00000954958, ENST00000954959

RefSeq mRNA: 1 — MANE Select: NM_006713 NM_006713

CCDS: CCDS3897

Canonical transcript exons

ENST00000265073 — 5 exons

ExonStartEnd
ENSE000011644443258555732585625
ENSE000015630863260100532604079
ENSE000035220033259156332591685
ENSE000036188653259896132599069
ENSE000036284433258851232588584

Expression profiles

Bgee: expression breadth ubiquitous, 297 present calls, max score 99.43.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 398.9053 / max 20586.4502, expressed in 1826 samples.

FANTOM5 promoters (7 alternative TSS)

Promoter IDTPM avgSamples expressed
56007390.87481826
560053.0025483
560042.5599505
560121.3639656
560060.6685183
560080.3326143
560090.103133

Top tissues by expression

297 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
epithelium of nasopharynxUBERON:000195199.43gold quality
nasopharynxUBERON:000172899.41gold quality
cortical plateUBERON:000534399.32gold quality
mammary ductUBERON:000176599.24gold quality
colonic epitheliumUBERON:000039799.23gold quality
ganglionic eminenceUBERON:000402399.22gold quality
gingival epitheliumUBERON:000194999.20gold quality
parotid glandUBERON:000183199.18gold quality
cervix squamous epitheliumUBERON:000692299.17gold quality
adrenal tissueUBERON:001830399.16gold quality
bone marrow cellCL:000209299.10gold quality
germinal epithelium of ovaryUBERON:000130499.09gold quality
gingivaUBERON:000182899.08gold quality
endometriumUBERON:000129599.05gold quality
ventricular zoneUBERON:000305399.04gold quality
epithelium of mammary glandUBERON:000324499.03gold quality
oral cavityUBERON:000016799.02gold quality
upper leg skinUBERON:000426299.02gold quality
oocyteCL:000002399.01gold quality
endothelial cellCL:000011599.01gold quality
embryoUBERON:000092299.01gold quality
tonsilUBERON:000237299.00gold quality
lymph nodeUBERON:000002998.97gold quality
epithelial cell of pancreasCL:000008398.96gold quality
Brodmann (1909) area 23UBERON:001355498.96gold quality
tongue squamous epitheliumUBERON:000691998.95gold quality
penisUBERON:000098998.92gold quality
pylorusUBERON:000116698.92gold quality
skin of hipUBERON:000155498.88gold quality
mammary glandUBERON:000191198.88gold quality

Single-cell (SCXA)

Detected in 21 experiment(s), a significant marker in 15.

ExperimentMarker?Max mean expression
E-MTAB-9467yes3887.37
E-MTAB-9221yes2821.35
E-HCAD-1yes2250.60
E-CURD-55yes1613.52
E-MTAB-9841yes1587.77
E-CURD-77yes1491.64
E-HCAD-4yes45.16
E-HCAD-9yes27.28
E-CURD-122yes25.25
E-CURD-46yes24.26
E-HCAD-13yes21.51
E-MTAB-6701yes19.17
E-MTAB-10042yes17.42
E-CURD-88yes4.26
E-HCAD-32no2410.60

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): MEF2C, RCOR2, RELA, TBP, TP53

miRNA regulators (miRDB)

191 targeting SUB1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3163100.0077.238605
HSA-MIR-340-5P100.0072.504437
HSA-MIR-3646100.0073.565283
HSA-MIR-513A-5P100.0069.772465
HSA-MIR-656-3P100.0072.152788
HSA-MIR-3924100.0072.092394
HSA-MIR-200B-3P100.0073.312693
HSA-MIR-200C-3P100.0073.352685
HSA-MIR-429100.0073.442698
HSA-MIR-450099.9972.722367
HSA-MIR-607799.9968.042299
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-366299.9973.825684
HSA-MIR-186-5P99.9970.833707
HSA-MIR-428299.9975.366408
HSA-MIR-453199.9969.703181
HSA-MIR-3692-3P99.9870.272139
HSA-MIR-1213699.9872.815713
HSA-LET-7A-5P99.9872.291790
HSA-LET-7B-5P99.9872.311790
HSA-LET-7C-5P99.9872.291790
HSA-LET-7E-5P99.9872.291790
HSA-LET-7F-5P99.9872.561784
HSA-LET-7G-5P99.9872.371784
HSA-LET-7I-5P99.9872.371788
HSA-MIR-98-5P99.9872.331787
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-MIR-27A-3P99.9872.132955
HSA-MIR-27B-3P99.9872.132955

Literature-anchored findings (GeneRIF, showing 38)

  • General transcription factor IIH protects promoters from PC4-mediated repression by relieving the topological constraint imposed by PC4 through the ERCC3 helicase activity rather than by reducing the repressive activity of PC4 via its phosphorylation. (PMID:12590132)
  • These results establish the first physiological role of PC4 as a transcriptional coactivator. (PMID:14966284)
  • structure explains high-affinity binding of PC4 to the complementary strands of unwinding duplex DNA. (PMID:16415882)
  • These results establish PC4 as a new member of chromatin-associated protein family, which plays an important role in chromatin organization. (PMID:16982701)
  • p38 MAP kinase mediated HNF4 Ser158 phosphorylation (P-HNF4-S158), binding of PC4 to P-HNF4-S158 and characterize the functional domain of PC4 required for P-HNF4-S158 binding. (PMID:17317687)
  • PC4 interacts with the DNA binding and C-terminal domains of p53 through its DNA binding domain, which is essential for the stimulation of p53 DNA binding. (PMID:17785449)
  • PC4 activates double-strand break (DSB) repair activity through stimulation of DSB rejoining in vivo (PMID:19038270)
  • Report recruitment of RNA polymerase II cofactor PC4 to DNA damage sites. (PMID:19047459)
  • analysis of the interaction between the transactivation domain of p53 and PC4 (PMID:19525231)
  • PC4 interacts with heterochromatin protein 1alpha, REST/NRSF (RE1-silencing transcription factor/neuron-restrictive silencer factor) and CoREST to establish the repressed state of neural genes in nonneuronal cells. (PMID:20080105)
  • Sub1/PC4 a chromatin associated protein with multiple functions in transcription. (PMID:20305379)
  • findings demonstrated a critical aspect of LHR modulation whereby PC4 acts as a coactivator of Sp1 to contribute to the human of LHR transcription (PMID:21193408)
  • acetylation of lysine 382/381 of p53 may play an important role in modulating p53-PC4 interaction and as a consequence PC4 mediated activation of p53 target genes. (PMID:21586571)
  • expression of PC4 in astrocytoma was upregulated as assessed by western blot and immunohistochemical staining. Moreover, elevated PC4 expression was strongly correlated with the progression of astrocytoma and overall survival. (PMID:25262015)
  • PC4 protects esophageal squamous cell carcinoma cells from IR-induced death by enhancing the nonhomologous end joining-promoting activity of XLF. (PMID:25321468)
  • Sub1, and its human homolog PC4, bind preferentially to G-quadruplexes (PMID:25813861)
  • PC4 has similar functions to RPA in binding ssDNA to promote genome stability, especially at sites of replication-transcription collisions. (PMID:25961912)
  • SMYD3 interacts with the human positive coactivator 4 (PC4) and that such interaction potentiates a group of genes whose expression is linked to cell proliferation and invasion. (PMID:26350217)
  • PC4 associates with Aurora A and Aurora B and undergoes phosphorylation, following which PC4 activates both Aurora A and B to sustain optimal kinase activity to maintain the phosphorylation gradient for the proper functioning of the mitotic machinery. (PMID:26777301)
  • Results show elevated expression of SUB1 in aggressive prostate cancer. Knockdown of SUB1 in prostate cancer cells resulted in reduced cell proliferation, invasion and migration. (PMID:27270442)
  • Yeast Sub1 and human PC4 are G-quadruplex binding proteins that suppress genome instability at co-transcriptionally formed G4 DNA. (PMID:28369605)
  • these data provide a novel mode of PC4 binding to a DNA secondary structure that remains within the framework of the model for binding to ssDNA. Additionally, consideration of the PC4-G4DNA interaction could provide insight into the biological functions of PC4, which remain incompletely understood. (PMID:28416612)
  • Evolutionarily conserved role for Sub1 and PC4 in the maintenance of yeast and human genome stability has been summarized. (Review) (PMID:28567479)
  • REVIEW: many evidences showed that Sub1 influences processes downstream during mRNA biogenesis, such as elongation, termination, and RNAPII phosphorylation. The recent discovery that Sub1 directly interacts with the RNAPII stalk adds new insights into how it achieves all these tasks. (PMID:28853990)
  • PC4 expression correlated with radiosensitivity and was an independent prognostic factor of progression-free survival (PFS) in patients with NSCLC. (PMID:29522271)
  • Study shows that PC4 might repress RNA pol II recruitment and transcription of replication-dependent histone genes in order to maintain the balance between histone gene expression and DNA synthesis. Moreover, PC4 might promote the interaction of cleavage and polyadenylation complex with histone pre-mRNAs, that might impede with the recruitment of histone cleavage complex. (PMID:30053800)
  • this study reveals for the first time that PC4 promotes breast cancer progression by directly regulating c-Myc transcription to promote Warburg effect, implying a novel therapeutic target for breast cancer. (PMID:30992017)
  • PC4 is essential for genomic integrity and autophagy regulation. (PMID:31169983)
  • PC4 emerges as a global co-regulator of p53 and a therapeutic target against pathogeneses where the p53-dependent cell death process plays a crucial role. (PMID:31236588)
  • Chromatin Protein PC4 Orchestrates B Cell Differentiation by Collaborating with IKAROS and IRF4. (PMID:33357426)
  • Knockdown of PC4 increases chemosensitivity of Oxaliplatin in triple negative breast cancer by suppressing mTOR pathway. (PMID:33524870)
  • The Transcription Factor SUB1 Is a Master Regulator of the Macrophage TLR Response in Atherosclerosis. (PMID:34378353)
  • CircVCAN/SUB1 up-regulates MYC/HSP90beta to enhance the proliferation and migration of glioma cells. (PMID:34534637)
  • Phosphorylation-dependent association of human chromatin protein PC4 to linker histone H1 regulates genome organization and transcription. (PMID:35670677)
  • SUB1 promotes colorectal cancer metastasis by activating NF-kappaB signaling via UBR5-mediated ubiquitination of UBXN1. (PMID:38240906)
  • Periodic changes of cyclin D1 mRNA stability are regulated by PC4 modifications in the cell cycle. (PMID:38349334)
  • PC4 promotes bladder cancer progression and stemness by directly interacting with Sp1 to transcriptionally activate the Wnt5a/beta-catenin pathway. (PMID:38820928)
  • HMGB3 and SUB1 Bind to and Facilitate the Repair of N[2]-Alkylguanine Lesions in DNA. (PMID:39101269)

Cross-species orthologs

6 orthologs

OrganismSymbolGene ID
danio_reriosub1bENSDARG00000007720
danio_reriosub1aENSDARG00000053446
mus_musculusSub1ENSMUSG00000022205
rattus_norvegicusLOC103693281ENSRNOG00000063454
rattus_norvegicusSub1ENSRNOG00000067555
caenorhabditis_elegansWBGENE00011743

Protein

Protein identifiers

Activated RNA polymerase II transcriptional coactivator p15P53999 (reviewed: P53999)

Alternative names: Positive cofactor 4, SUB1 homolog, p14

All UniProt accessions (4): P53999, D6R970, D6RC37, Q6IBA2

UniProt curated annotations — full annotation on UniProt →

Function. General coactivator that functions cooperatively with TAFs and mediates functional interactions between upstream activators and the general transcriptional machinery. May be involved in stabilizing the multiprotein transcription complex. Binds single-stranded DNA. Also binds, in vitro, non-specifically to double-stranded DNA (ds DNA).

Subunit / interactions. Homodimer. Interacts with CSTF2.

Subcellular location. Nucleus.

Post-translational modifications. Activity is controlled by protein kinases that target the regulatory region. Phosphorylation inactivates both ds DNA-binding and cofactor function, but does not affect binding to ssDNA. Seems to be phosphorylated in vivo by CK2 in at least 7 sites in the N-terminal Ser-rich region.

Similarity. Belongs to the transcriptional coactivator PC4 family.

RefSeq proteins (1): NP_006704* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR003173PC4_CDomain
IPR009044ssDNA-bd_transcriptional_regHomologous_superfamily
IPR045125Sub1/Tcp4-likeFamily

Pfam: PF02229

UniProt features (39 total): modified residue 16, mutagenesis site 5, strand 5, region of interest 3, compositionally biased region 3, cross-link 2, initiator methionine 1, chain 1, sequence variant 1, helix 1, site 1

Structure

Experimental structures (PDB)

6 structures.

PDBMethodResolution (Å)
1PCFX-RAY DIFFRACTION1.74
2C62X-RAY DIFFRACTION1.74
4USGX-RAY DIFFRACTION1.97
7E4WX-RAY DIFFRACTION2.9
6YCSX-RAY DIFFRACTION3.05
2PHESOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P53999-F177.030.54

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 50–51 (cleavage)

Post-translational modifications (18): 9, 10, 11, 13, 15, 17, 19, 35, 53, 55, 56, 57, 58, 68, 118, 68, 68, 4

Mutagenesis-validated functional residues (5):

PositionPhenotype
68reduced ssdna binding.
75reduced ssdna binding.
77–80loss of ssdna binding.
86loss of ssdna binding.
101loss of ssdna binding.

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 301 (showing top): BUYTAERT_PHOTODYNAMIC_THERAPY_STRESS_DN, RODRIGUES_THYROID_CARCINOMA_POORLY_DIFFERENTIATED_UP, GOBP_POSITIVE_REGULATION_OF_DNA_TEMPLATED_TRANSCRIPTION_INITIATION, TACAATC_MIR508, SHAFFER_IRF4_TARGETS_IN_PLASMA_CELL_VS_MATURE_B_LYMPHOCYTE, GOBP_REGULATION_OF_DNA_TEMPLATED_TRANSCRIPTION_INITIATION, GOBP_DNA_TEMPLATED_TRANSCRIPTION_INITIATION, GOBP_TRANSCRIPTION_INITIATION_AT_RNA_POLYMERASE_II_PROMOTER, GOZGIT_ESR1_TARGETS_UP, SCHLOSSER_SERUM_RESPONSE_DN, DODD_NASOPHARYNGEAL_CARCINOMA_UP, NADLER_HYPERGLYCEMIA_AT_OBESITY, TIEN_INTESTINE_PROBIOTICS_24HR_UP, SHAFFER_IRF4_TARGETS_IN_ACTIVATED_DENDRITIC_CELL, ACEVEDO_LIVER_CANCER_UP

GO Biological Process (6): RNA polymerase II promoter clearance (GO:0001111), regulation of transcription by RNA polymerase II (GO:0006357), negative regulation of DNA metabolic process (GO:0051053), positive regulation of transcription initiation by RNA polymerase II (GO:0060261), regulation of DNA-templated transcription (GO:0006355), protein homooligomerization (GO:0051260)

GO Molecular Function (10): RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), double-stranded DNA binding (GO:0003690), single-stranded DNA binding (GO:0003697), transcription coactivator activity (GO:0003713), RNA binding (GO:0003723), identical protein binding (GO:0042802), DNA binding (GO:0003677), DNA helicase activity (GO:0003678), protein binding (GO:0005515), DNA-binding transcription factor binding (GO:0140297)

GO Cellular Component (5): nucleus (GO:0005634), nucleoplasm (GO:0005654), transcription regulator complex (GO:0005667), nucleolus (GO:0005730), extracellular exosome (GO:0070062)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
transcription by RNA polymerase II2
DNA binding2
nucleic acid binding2
nuclear lumen2
promoter clearance during DNA-templated transcription1
regulation of DNA-templated transcription1
DNA metabolic process1
negative regulation of macromolecule metabolic process1
negative regulation of nucleobase-containing compound metabolic process1
regulation of DNA metabolic process1
transcription initiation at RNA polymerase II promoter1
positive regulation of transcription by RNA polymerase II1
regulation of transcription initiation by RNA polymerase II1
positive regulation of DNA-templated transcription initiation1
DNA-templated transcription1
regulation of gene expression1
regulation of RNA biosynthetic process1
protein complex oligomerization1
RNA polymerase II transcription regulatory region sequence-specific DNA binding1
cis-regulatory region sequence-specific DNA binding1
transcription coregulator activity1
positive regulation of DNA-templated transcription1
protein binding1
helicase activity1
ATP-dependent activity, acting on DNA1
binding1
transcription factor binding1
intracellular membrane-bounded organelle1
cellular anatomical structure1
protein-containing complex1
intracellular membraneless organelle1
extracellular vesicle1

Protein interactions and networks

STRING

2314 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SUB1GTF2BQ00403786
SUB1TBPP20226746
SUB1CSTF2P33240716
SUB1TOP1P11387650
SUB1MED19A0JLT2640
SUB1SUPT5HO00267605
SUB1MED10Q9BTT4599
SUB1AFF1P51825584
SUB1MED11Q9P086581
SUB1LDHAP00338580
SUB1MED18Q9BUE0567
SUB1AFF3P51826556
SUB1LDHCP07864547
SUB1CSNK2A1P19138540
SUB1TP53P04637531

IntAct

211 interactions, top by confidence:

ABTypeScore
CDKN2ACDK4psi-mi:“MI:0914”(association)0.960
PRKAA1PRKAB2psi-mi:“MI:0914”(association)0.950
EMC7EMC8psi-mi:“MI:0914”(association)0.790
SUB1SUB1psi-mi:“MI:0915”(physical association)0.750
SUB1SUB1psi-mi:“MI:0407”(direct interaction)0.750
COMMD1VPS26Cpsi-mi:“MI:0914”(association)0.730
COMMD4VPS26Cpsi-mi:“MI:0914”(association)0.730
NHP2DKC1psi-mi:“MI:0914”(association)0.730
NCBP1KPNA3psi-mi:“MI:0914”(association)0.640
COMMD6VPS26Cpsi-mi:“MI:0914”(association)0.640
SUB1CSNK2A1psi-mi:“MI:0217”(phosphorylation reaction)0.620
CSNK2A1SUB1psi-mi:“MI:0217”(phosphorylation reaction)0.620
tatSUB1psi-mi:“MI:0915”(physical association)0.590
tatSUB1psi-mi:“MI:0407”(direct interaction)0.590
SDCBPSUB1psi-mi:“MI:0915”(physical association)0.560
ABCD3SUB1psi-mi:“MI:0915”(physical association)0.560
PABIR3SUB1psi-mi:“MI:0915”(physical association)0.560
SMC1APDS5Bpsi-mi:“MI:0914”(association)0.530
TMEM63BSLC19A2psi-mi:“MI:0914”(association)0.530
ERBB2HAX1psi-mi:“MI:0914”(association)0.530
CFTRCNOT1psi-mi:“MI:0914”(association)0.480
GNL3IPO5psi-mi:“MI:0914”(association)0.480
DDX21MED19psi-mi:“MI:2364”(proximity)0.480
Kif1cKIF1Cpsi-mi:“MI:0915”(physical association)0.400

BioGRID (295): SUB1 (Affinity Capture-MS), SUB1 (Two-hybrid), SUB1 (Biochemical Activity), SUB1 (Affinity Capture-MS), SUB1 (Affinity Capture-MS), SUB1 (Affinity Capture-RNA), ABCF1 (Co-fractionation), CHD5 (Co-fractionation), EIF1AX (Co-fractionation), EIF2S2 (Co-fractionation), EIF2S3 (Co-fractionation), NACA (Co-fractionation), NOLC1 (Co-fractionation), POLR3C (Co-fractionation), SART1 (Co-fractionation)

ESM2 similar proteins: A4IGK3, A6H6W9, O08609, O14639, O43741, O60519, O88508, P53999, P60762, P97875, Q0VD32, Q12800, Q1LZ53, Q32M00, Q3SZ60, Q4R947, Q4W5Z4, Q5BJU6, Q5NVP9, Q5R6D0, Q5RBB8, Q5RCB7, Q5SQY2, Q5XIT1, Q5XJK1, Q5ZJV7, Q5ZK63, Q642H2, Q6AYJ2, Q6AYU1, Q6I9Y2, Q6PAM0, Q6PER3, Q78E65, Q7Z422, Q8BR65, Q8C8M1, Q8CCI5, Q8N488, Q8VE65

Diamond homologs: O65154, O65155, P11031, P53999, P54000, P87294, Q4R947, Q5R6D0, Q5ZK63, Q63396, Q872F4, Q94045, Q9VLR5, Q9FHX8

SIGNOR signaling

1 interactions.

AEffectBMechanism
SUB1“up-regulates activity”REST-CoRESTbinding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 198 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
mRNA Polyadenylation95.5×9e-03
Processing of Capped Intron-Containing Pre-mRNA95.1×9e-03
mRNA Splicing - Major Pathway134.9×2e-03
Dengue Virus-Host Interactions123.8×9e-03

GO biological processes:

GO termPartnersFoldFDR
telomere maintenance via telomerase520.7×2e-03
amyloid fibril formation517.0×4e-03
positive regulation of interferon-beta production511.1×9e-03
mRNA export from nucleus610.0×6e-03
positive regulation of fibroblast proliferation610.0×6e-03
rhythmic process68.5×8e-03
cytoplasmic translation77.3×8e-03
rRNA processing86.4×7e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

24 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance8
Likely benign1
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

2140 predictions. Top by Δscore:

VariantEffectΔscore
5:32531872:G:GTdonor_gain1.0000
5:32588510:AG:Aacceptor_gain1.0000
5:32588511:GG:Gacceptor_gain1.0000
5:32591560:TAGTT:Tacceptor_loss1.0000
5:32591561:A:AGacceptor_gain1.0000
5:32591562:G:GTacceptor_gain1.0000
5:32591562:GTTAA:Gacceptor_gain1.0000
5:32599065:AAAAG:Adonor_loss1.0000
5:32599068:AGG:Adonor_loss1.0000
5:32599070:GTG:Gdonor_loss1.0000
5:32599071:T:Adonor_loss1.0000
5:32531912:TACAG:Tdonor_loss0.9900
5:32531913:ACAGG:Adonor_loss0.9900
5:32531914:CAGGT:Cdonor_loss0.9900
5:32531915:AG:Adonor_loss0.9900
5:32531916:G:GCdonor_loss0.9900
5:32531918:T:Cdonor_loss0.9900
5:32531919:GAG:Gdonor_loss0.9900
5:32538265:G:GTdonor_gain0.9900
5:32588506:TTTCA:Tacceptor_loss0.9900
5:32588507:TTCA:Tacceptor_loss0.9900
5:32588508:TCAG:Tacceptor_loss0.9900
5:32588509:CA:Cacceptor_loss0.9900
5:32588510:A:AGacceptor_gain0.9900
5:32588510:A:Cacceptor_loss0.9900
5:32588511:G:GGacceptor_gain0.9900
5:32588581:AAAGG:Adonor_loss0.9900
5:32588583:AG:Adonor_loss0.9900
5:32588584:GG:Gdonor_loss0.9900
5:32588585:GTGA:Gdonor_loss0.9900

AlphaMissense

0 scored. Top likely-pathogenic:

dbSNP variants (sampled 300 via entrez): RS1000147655 (5:32598821 C>T), RS1000148010 (5:32587835 C>A,G,T), RS1000186102 (5:32585166 T>C), RS1000345412 (5:32587383 A>G), RS1000415679 (5:32587645 T>C), RS1000498955 (5:32588021 A>T), RS1000573991 (5:32592411 G>A), RS1000676750 (5:32588371 C>T), RS1000748912 (5:32583889 T>C), RS1000925463 (5:32592668 A>G,T), RS1001026553 (5:32594268 T>C), RS1001061728 (5:32603759 C>G), RS1001305725 (5:32584102 G>T), RS1001407854 (5:32597772 T>A), RS1001465498 (5:32592413 A>T)

Disease associations

OMIM: gene MIM:600503 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

3 associations (top):

StudyTraitp-value
GCST009391_211Metabolite levels2.000000e-06
GCST010320_145PR interval2.000000e-11
GCST010321_214PR interval5.000000e-13

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004462PR interval

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL5725063 (SINGLE PROTEIN)

Molecules with ChEMBL bioactivity

1 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 1,538 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL1232461MOLIBRESIB21,538

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

2 potent at pChembl≥5 of 3 total, top 2 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
6.96Kd110nMMOLIBRESIB
6.51IC50310nMMOLIBRESIB

PubChem BioAssay actives

2 with measured affinity, of 9 total; 1 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
2-[(4S)-6-(4-chlorophenyl)-8-methoxy-1-methyl-4H-[1,2,4]triazolo[4,3-a][1,4]benzodiazepin-4-yl]-N-ethylacetamide2179111: Binding affinity against SUB1 (unknown origin) assessed as apparent dissociation constant incubated for 1 hr by colloidal coomassie staining based LC-MS/MS analysiskd0.1100uM

CTD chemical–gene interactions

62 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arseniteaffects methylation, decreases expression3
bisphenol Adecreases expression2
bisphenol Sincreases expression, affects cotreatment2
Benzo(a)pyreneaffects methylation, decreases expression2
Smokedecreases expression2
Valproic Acidaffects expression, increases expression2
Cyclosporinedecreases expression2
Particulate Matterdecreases expression, increases abundance2
aristolochic acid Idecreases expression1
FR900359affects phosphorylation1
dicrotophosdecreases expression1
bufotalinincreases expression1
methylmercuric chloridedecreases expression1
triphenyl phosphateaffects expression1
geranioldecreases expression1
pyrogallol 1,3-dimethyl etheraffects cotreatment, decreases expression1
beta-lapachoneincreases expression1
arseniteaffects binding, increases reaction1
methylparabendecreases expression1
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
coumarindecreases phosphorylation1
beta-methylcholineaffects expression1
CGP 52608affects binding, increases reaction1
CD 437increases expression1
K 7174decreases expression1
3-(4’-hydroxy-3’-adamantylbiphenyl-4-yl)acrylic acidincreases expression1
nutlin 3affects cotreatment, increases secretion1
jinfukangdecreases expression1
NSC 689534affects binding, decreases expression1
bisphenol AFincreases expression1

ChEMBL screening assays

8 unique, capped per target: 8 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5652564BindingBinding affinity to human SUB1 incubated for 45 mins by Kinobead based pull down assayNVP-BHG712: Effects of Regioisomers on the Affinity and Selectivity toward the EPHrin Family. — ChemMedChem

Cellosaurus cell lines

3 cell lines: 3 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A6V7SEES3-1V human SUB1, clone1Embryonic stem cellMale
CVCL_A6V8SEES3-1V human SUB1, clone2Embryonic stem cellMale
CVCL_A6V9SEES3-1V human SUB1, clone3Embryonic stem cellMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.