U2AF1

gene
On this page

Also known as U2AF35RNU2AF1RN

Summary

U2AF1 (U2 small nuclear RNA auxiliary factor 1, HGNC:12453) is a protein-coding gene on chromosome 21q22.3, encoding Splicing factor U2AF 35 kDa subunit (Q01081). Plays a critical role in both constitutive and enhancer-dependent splicing by mediating protein-protein interactions and protein-RNA interactions required for accurate 3’-splice site selection. It is a common-essential gene (DepMap: required in 99.9% of cancer cell lines).

This gene belongs to the splicing factor SR family of genes. U2 auxiliary factor, comprising a large and a small subunit, is a non-snRNP protein required for the binding of U2 snRNP to the pre-mRNA branch site. This gene encodes the small subunit which plays a critical role in both constitutive and enhancer-dependent RNA splicing by directly mediating interactions between the large subunit and proteins bound to the enhancers. Alternatively spliced transcript variants encoding different isoforms have been identified.

Source: NCBI Gene 7307 — RefSeq curated summary.

At a glance

  • GWAS associations: 3
  • Clinical variants (ClinVar): 23 total — 1 likely-pathogenic
  • Druggable target: yes — 1 molecules with ChEMBL bioactivity
  • Cancer driver (intOGen): activating (oncogene-like) across 9 cancer types
  • Cancer dependency (DepMap): dependent in 99.9% of screened cell lines (common-essential)
  • MANE Select transcript: NM_006758

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:12453
Approved symbolU2AF1
NameU2 small nuclear RNA auxiliary factor 1
Location21q22.3
Locus typegene with protein product
StatusApproved
AliasesU2AF35, RNU2AF1, RN
Ensembl geneENSG00000160201
Ensembl biotypeprotein_coding
OMIM191317
Entrez7307

Gene structure

Transcript identifiers

Ensembl transcripts: 22 — 14 protein_coding, 6 retained_intron, 2 nonsense_mediated_decay

ENST00000291552, ENST00000380276, ENST00000459639, ENST00000463599, ENST00000464750, ENST00000468039, ENST00000471250, ENST00000475639, ENST00000478282, ENST00000486519, ENST00000496462, ENST00000887978, ENST00000887979, ENST00000887980, ENST00000887981, ENST00000887982, ENST00000887983, ENST00000887984, ENST00000921413, ENST00000921414, ENST00000921415, ENST00000921416

RefSeq mRNA: 3 — MANE Select: NM_006758 NM_001025203, NM_001025204, NM_006758

CCDS: CCDS13694, CCDS33574, CCDS42948

Canonical transcript exons

ENST00000291552 — 8 exons

ExonStartEnd
ENSE000019029954310745143107570
ENSE000034774044309465543094788
ENSE000034937684309295643093249
ENSE000035379474309569443095743
ENSE000036220104309447143094563
ENSE000036534634310431543104402
ENSE000036732524310045343100519
ENSE000036945244309543843095536

Expression profiles

Bgee: expression breadth ubiquitous, 134 present calls, max score 96.18.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 9.5651 / max 148.0827, expressed in 1619 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
1906829.56511619
2092223.31861082

Top tissues by expression

134 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
adenohypophysisUBERON:000219696.18gold quality
left uterine tubeUBERON:000130396.11gold quality
bone marrowUBERON:000237195.96gold quality
olfactory segment of nasal mucosaUBERON:000538695.81gold quality
mucosa of transverse colonUBERON:000499195.30gold quality
vermiform appendixUBERON:000115495.25gold quality
stromal cell of endometriumCL:000225595.05gold quality
right lungUBERON:000216795.03gold quality
monocyteCL:000057694.89gold quality
granulocyteCL:000009494.88gold quality
primary visual cortexUBERON:000243694.44gold quality
hindlimb stylopod muscleUBERON:000425294.44gold quality
leukocyteCL:000073894.43gold quality
bone marrow cellCL:000209294.11gold quality
metanephros cortexUBERON:001053394.05gold quality
placentaUBERON:000198794.04gold quality
cortex of kidneyUBERON:000122593.99gold quality
ventricular zoneUBERON:000305393.95gold quality
right uterine tubeUBERON:000130293.73gold quality
endometriumUBERON:000129593.68gold quality
lymph nodeUBERON:000002993.55gold quality
islet of LangerhansUBERON:000000693.38gold quality
bloodUBERON:000017893.17gold quality
pituitary glandUBERON:000000793.12gold quality
gall bladderUBERON:000211092.68gold quality
sural nerveUBERON:001548892.44gold quality
left ovaryUBERON:000211992.34gold quality
left lobe of thyroid glandUBERON:000112092.32gold quality
ganglionic eminenceUBERON:000402392.25gold quality
thyroid glandUBERON:000204692.19gold quality

Single-cell (SCXA)

Detected in 3 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-MTAB-7249yes10.99
E-ANND-3yes4.66
E-MTAB-6142no84.36

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

1 targets.

TargetRegulation
TP53Repression

Upstream regulators (CollecTRI, top): CEBPB, CEBPD, CEBPG, ESR1, NFKB, RBPJ, ZNF362

miRNA regulators (miRDB)

14 targeting U2AF1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-684499.8270.692423
HSA-MIR-430799.8270.453374
HSA-MIR-548AG99.7769.251492
HSA-MIR-548AI99.6969.241494
HSA-MIR-548BA99.6969.141514
HSA-MIR-570-5P99.6969.241494
HSA-MIR-56799.6368.571219
HSA-MIR-29799.4069.581418
HSA-MIR-892C-5P99.1670.562116
HSA-MIR-4999-3P99.1165.55424
HSA-MIR-314998.7767.131639
HSA-MIR-316698.2466.631223
HSA-MIR-34C-3P98.1165.60858
HSA-MIR-582-3P96.6967.381019

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 99.9% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 40)

  • A misspliced form of the cholecystokinin-B/gastrin receptor in pancreatic carcinoma: role of reduced sellular U2AF35 and a suboptimal 3’-splicing site leading to retention of the fourth intron. (PMID:11830556)
  • U2AF35 RRM is unstructured in solution but its tertiary structure is induced upon binding to U2AF65. (PMID:12297299)
  • U2AF1 gene expression may provide a mechanism by which the relative cellular concentration and availability of U2AF(35) protein isoforms are modulated, thus contributing to the finely tuned control of splicing events in different tissues. (PMID:15096518)
  • U2AF35 appears to be completely dispensable for splicing in nuclear extracts prepared from adenovirus late-infected cells (PMID:15899895)
  • Binding assays revealed that IpaH9.8 has a specific affinity to U2AF(35), a mammalian splicing factor, which interferes with U2AF(35)-dependent splicing as assayed for IgM pre-mRNA (PMID:15950937)
  • identified and spatially localized sites of direct interaction between U2AF35 and U2AF65 in vivo in live cell nuclei. (PMID:16043505)
  • DEK enforces 3’ splice site discrimination by U2AF; DEK phosphorylated at serines 19 and 32 associates with U2AF35, facilitates the U2AF35-AG interaction and prevents binding of U2AF65 to pyrimidine tracts not followed by AG (PMID:16809543)
  • Taken together our results demonstrate that U2AF35a is essential for HeLa cell division and suggest a novel role for both U2AF35 protein isoforms as regulators of alternative splicing of a specific subset of genes. (PMID:16855028)
  • Results describe the roles of the two subunits of U2AF, U2AF65 and 35, in the selection between alternative 3’ splice sites associated with polypyrimidine tracts of different strengths. (PMID:16940179)
  • U2AF35 and hPrp3 interactions with SPF30 can occur simultaneously, thereby potentially linking 3’ splice site recognition with tri-small nuclear ribonucleoprotein addition (PMID:18211889)
  • SF1 and U2AF form extraspliceosomal complexes before and after taking part in the assembly of catalytic spliceosomes. (PMID:18285458)
  • Here we show that a missense mutation affecting the serine at codon 34 (Ser34) in U2AF1 was recurrently present in 13 out of 150 (8.7%) subjects with de novo MDS, and we found suggestive evidence of an increased risk of progression to sAML (PMID:22158538)
  • U2AF1 and SRSF2 mutations are frequent in chronic myelomonocytic leukemia and advanced forms of MDS. U2AF1 and SRSF2 mutations are predictive for shorter survival. (PMID:22323480)
  • hnRNP A1 forms a ternary complex with the U2AF heterodimer on AG-containing/uridine-rich RNAs, while it displaces U2AF from non-AG-containing/uridine-rich RNAs, an activity that requires the glycine-rich domain of hnRNP A1 (PMID:22325350)
  • In univariate analysis, mutated SRSF2 predicted shorter overall survival and more frequent acute myeloid leukemia progression compared with wild-type SRSF2, whereas mutated U2AF1, ZRSR2 had no impact on patient outcome. (PMID:22389253)
  • U2AF1 mutation is a recurrent event at a low frequency in acute myeloid leukemia and myelodysplastic syndrome and influences overall survival. (PMID:23029227)
  • data suggest that SF3B1, U2AF1 and SRSF2 mutations occur not only in myeloid lineage tumors but also in lymphoid lineage tumors; data suggest that the splicing gene mutations play important roles in the pathogenesis of hematologic tumors, but rarely in solid tumors (PMID:23280334)
  • SRSF2 is the most frequently mutated spliceosome gene in chronic myelomonocytic leukemia, but neither it nor SF3B1 or U2AF35 mutations are prognostically relevant. (PMID:23335386)
  • U2AF1 mutations play a significant role in myeloid leukemogenesis due to selective missplicing of tumor-associated genes (PMID:23775717)
  • genetic association studies in a population in Taiwan: Data suggest that U2AF1 mutations are associated with poor prognosis/survival in patients with myelodysplastic syndrome and shorter time-to-leukemia transformation in young patients. (PMID:23861105)
  • study describes incidence and phenotypic and prognostic relevance of U2AF1 mutations in primary myelofibrosis(PMF); U2AF1 mutations cluster with JAK2V617F, ASXL1 mutations and normal karyotype; U2AF1 mutations are strongly and inter-independently associated with anemia and thrombocytopenia (PMID:24097336)
  • Data indicate somatic mutations in the splicing factor U2AF1 across 12 cancer types. (PMID:24498085)
  • Molecular monitoring of patients having undergone AHSCT for PMF should not be restricted to JAK2, MPL or CALR, but all mutations present in primary fibrotic neoplastic myeloproliferation should be included to interpret abnormal blood values after AHSCT (PMID:25231745)
  • mutations influence the similarity of splicing programs in leukemias, but do not give rise to widespread splicing failure. (PMID:25267526)
  • A mutant U2AF1 (S34F) found in a variety of cancer types results in delayed splicing and disruption of kinetic competition during transcription. (PMID:25271374)
  • The S34F mutation alters U2AF1 function in the context of specific RNA sequences, leading to aberrant alternative splicing of target genes, some of which may be relevant for myelodysplastic syndromes pathogenesis. (PMID:25311244)
  • U2AF has the capacity to directly define ~88% of functional 3’ splice sites in the human genome; numerous U2AF binding events also occur in intronic locations. (PMID:25326705)
  • In multivariate analysis, U2AF1 and TP53 mutations retained independent prognostic significance across 93 cases of acute myeloid leukemia (PMID:25412851)
  • The mutational status of the SRSF2, U2AF1 and ZRSR2 did not affect the response rate or survival in MDS patients who had received first-line decitabine treatment. (PMID:25964599)
  • Mutations in ASXL1, U2AF1, and SF3B1 are common in Chinese patients with myelodysplastic syndromes. (PMID:26508027)
  • in primary myelofibrosis, anemia was significantly associated with U2AF1 mutation; study confirms previous observation regarding the association of mutant U2AF1 with anemia supporting its role in hematopoiesis and in the pathogenesis of PMF-associated anemia (PMID:27058230)
  • The U2AF35(S34F) mutation alters interaction with CFIm59, leading to increased use of a distal cleavage and polyadenylation site in the ATG7 pre-mRNA, decreasing levels of ATG7 protein and defective autophagy, ultimately leading to transformation. (PMID:27184077)
  • data confirm MLL-PTD and, to a lesser extent, FLT3-ITD as common events in +11 AML.6, 7, 8 However, the high mutation frequencies of U2AF1 and genes involved in methylation (DNMT3A, IDH2) have hitherto not been reported in +11 AML (PMID:27435003)
  • Alternative splicing of U2AF1 reveals a shared repression mechanism for duplicated exons controlled by SRF3. (PMID:27566151)
  • this study characterized novel candidate pediatric T-cell acute lymphoblastic leukemia driver mutation in splicesome factor U2AF1 (PMID:27602765)
  • The aberrantly spliced target genes and deregulated cellular pathways associated with the commonly mutated splicing factor genes in myelodysplastic syndromes (SF3B1, SRSF2 and U2AF1) are being identified, illuminating the molecular mechanisms underlying the disease. (Review) (PMID:27639445)
  • Our results provide mechanistic explanations of the magnitude of splicing changes observed in U2AF1-mutant cells and why tumors harboring U2AF1 mutations always retain an expressed copy of the wild-type allele (PMID:27776121)
  • In summary, we provide insight into the underlying molecular mechanism of U2AF binding to 3’ splice sites. (PMID:27799531)
  • The splicing effects of sudemycin and U2AF1 can be cumulative in cells exposed to both perturbations-drug and mutation, as compared with cells exposed to either alone. (PMID:28067246)
  • The frequently mutated SF3B1 residues contact the pre-mRNA splice site. Based on structural homology with other spliceosome subunits, and recent findings of altered RNA binding by mutant U2AF1 proteins, we suggest that affected U2AF1 residues also contact pre-mRNA. (PMID:28372848)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriou2af1ENSDARG00000015325
mus_musculusU2af1ENSMUSG00000061613
rattus_norvegicusU2af1ENSRNOG00000045860
drosophila_melanogasterU2af38FBGN0017457
caenorhabditis_elegansWBGENE00006698

Paralogs (2): U2AF1L4 (ENSG00000161265), ZRSR2 (ENSG00000169249)

Protein

Protein identifiers

Splicing factor U2AF 35 kDa subunitQ01081 (reviewed: Q01081)

Alternative names: U2 auxiliary factor 35 kDa subunit, U2 small nuclear RNA auxiliary factor 1, U2 snRNP auxiliary factor small subunit

All UniProt accessions (2): Q01081, A0A1B0GW87

UniProt curated annotations — full annotation on UniProt →

Function. Plays a critical role in both constitutive and enhancer-dependent splicing by mediating protein-protein interactions and protein-RNA interactions required for accurate 3’-splice site selection. Recruits U2 snRNP to the branch point. Directly mediates interactions between U2AF2 and proteins bound to the enhancers and thus may function as a bridge between U2AF2 and the enhancer complex to recruit it to the adjacent intron.

Subunit / interactions. Identified in the spliceosome C complex. Heterodimer with U2AF2. Interacts (via RS domain) with PHF5A (via N-terminus). Interacts with ZRANB2. Interacts with SDE2. Interacts with SF3B1.

Subcellular location. Nucleus. Nucleus speckle.

Disease relevance. Myelodysplastic syndrome (MDS) [MIM:614286] A heterogeneous group of closely related clonal hematopoietic disorders. All are characterized by a hypercellular or hypocellular bone marrow with impaired morphology and maturation, dysplasia of the myeloid, megakaryocytic and/or erythroid lineages, and peripheral blood cytopenias resulting from ineffective blood cell production. Included diseases are: refractory anemia (RA), refractory anemia with ringed sideroblasts (RARS), refractory anemia with excess blasts (RAEB), refractory cytopenia with multilineage dysplasia and ringed sideroblasts (RCMD-RS); chronic myelomonocytic leukemia (CMML) is a myelodysplastic/myeloproliferative disease. MDS is considered a premalignant condition in a subgroup of patients that often progresses to acute myeloid leukemia (AML). The gene represented in this entry may be involved in disease pathogenesis. Mutation altering U2AF1 function in the context of specific RNA sequences can lead to aberrant alternative splicing of target genes, some of which may be relevant for MDS pathogenesis.

Domain organisation. The C-terminal SR-rich domain is required for interactions with SR proteins and the splicing regulators TRA and TRA2, and the N-terminal domain is required for formation of the U2AF1/U2AF2 heterodimer.

Miscellaneous. Interacts with U2AF2 and stimulates U2AF splicing activity in vitro. Less efficient than isoform 1. Produced at very low levels due to a premature stop codon in the mRNA, leading to nonsense-mediated mRNA decay.

Similarity. Belongs to the splicing factor SR family.

Isoforms (4)

UniProt IDNamesCanonical?
Q01081-11, U2AF35ayes
Q01081-22, U2AF35b
Q01081-33, U2AF35c
Q01081-44

RefSeq proteins (6): NP_001020374, NP_001020375, NP_001307575, NP_001307577, NP_001307580, NP_006749* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000504RRM_domDomain
IPR000571Znf_CCCHDomain
IPR003954RRM_euk-typeDomain
IPR009145U2AF_smallFamily
IPR012677Nucleotide-bd_a/b_plait_sfHomologous_superfamily
IPR035979RBD_domain_sfHomologous_superfamily

Pfam: PF00076, PF00642

UniProt features (30 total): modified residue 7, splice variant 4, strand 4, sequence variant 3, compositionally biased region 3, helix 2, zinc finger region 2, initiator methionine 1, chain 1, domain 1, mutagenesis site 1, region of interest 1

Structure

Experimental structures (PDB)

1 structures.

PDBMethodResolution (Å)
1JMTX-RAY DIFFRACTION2.2

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q01081-F179.960.61

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (7): 39, 61, 145, 165, 39, 39, 2

Mutagenesis-validated functional residues (1):

PositionPhenotype
134decreases affinity for uaf2 by 3 orders of magnitude.

Function

Pathways and Gene Ontology

Reactome pathways

6 pathways

IDPathway
R-HSA-159236Transport of Mature mRNA derived from an Intron-Containing Transcript
R-HSA-72163mRNA Splicing - Major Pathway
R-HSA-72187mRNA 3’-end processing
R-HSA-9770562mRNA Polyadenylation
R-HSA-9918481Dengue Virus-Host Interactions
R-HSA-73856RNA Polymerase II Transcription Termination

MSigDB gene sets: 175 (showing top): TTTGTAG_MIR520D, MORF_SNRP70, MORF_UBE2I, MORF_HDAC1, MORF_UBE2N, MORF_RAD21, MORF_CDK2, MORF_HDAC2, PUJANA_CHEK2_PCC_NETWORK, MORF_TERF1, MORF_SKP1A, MUELLER_PLURINET, WANG_LMO4_TARGETS_DN, MORF_CCNI, MORF_CTBP1

GO Biological Process (3): mRNA splicing, via spliceosome (GO:0000398), mRNA processing (GO:0006397), RNA splicing (GO:0008380)

GO Molecular Function (7): RNA binding (GO:0003723), zinc ion binding (GO:0008270), pre-mRNA 3’-splice site binding (GO:0030628), RS domain binding (GO:0050733), nucleic acid binding (GO:0003676), protein binding (GO:0005515), metal ion binding (GO:0046872)

GO Cellular Component (7): nucleoplasm (GO:0005654), spliceosomal complex (GO:0005681), Cajal body (GO:0015030), nuclear speck (GO:0016607), catalytic step 2 spliceosome (GO:0071013), U2AF complex (GO:0089701), nucleus (GO:0005634)

Reactome top-level categories

Rollup of top-6 pathways:

CategoryPathways
Transport of Mature Transcript to Cytoplasm1
mRNA Splicing1
Processing of Capped Intron-Containing Pre-mRNA1
mRNA 3’-end processing1
Dengue Virus Infection1
RNA Polymerase II Transcription1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA processing2
binding2
nuclear protein-containing complex2
nuclear ribonucleoprotein granule2
RNA splicing, via transesterification reactions with bulged adenosine as nucleophile1
mRNA processing1
mRNA metabolic process1
nucleic acid binding1
transition metal ion binding1
pre-mRNA binding1
protein domain specific binding1
cation binding1
nuclear lumen1
cellular anatomical structure1
ribonucleoprotein complex1
Prp19 complex1
spliceosomal complex1
U5 snRNP1
catalytic complex1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

2238 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
U2AF1U2AF2P26368999
U2AF1SRSF2Q01130997
U2AF1SRSF1Q07955938
U2AF1SF1Q15637912
U2AF1DEKP35659898
U2AF1RBM39Q14498896
U2AF1ASXL1Q8IXJ9881
U2AF1SF3A1Q15459848
U2AF1RSRC1Q96IZ7846
U2AF1SF3B1O75533843
U2AF1SRSF5Q13243819
U2AF1TET2Q6N021800
U2AF1SETBP1Q9Y6X0778
U2AF1DNMT3AQ9Y6K1773
U2AF1IDH1O75874763

IntAct

216 interactions, top by confidence:

ABTypeScore
U2AF2U2AF1psi-mi:“MI:0915”(physical association)0.950
U2AF2U2AF1psi-mi:“MI:0407”(direct interaction)0.950
U2AF1U2AF2psi-mi:“MI:0915”(physical association)0.950
U2AF2U2AF1psi-mi:“MI:0914”(association)0.950
U2AF1SRPK2psi-mi:“MI:0915”(physical association)0.830
SRPK2U2AF1psi-mi:“MI:0915”(physical association)0.830
U2AF1CARNMT1psi-mi:“MI:0915”(physical association)0.740
U2AF1SRSF3psi-mi:“MI:2364”(proximity)0.730
U2AF1SRSF3psi-mi:“MI:0915”(physical association)0.730
SNW1U2AF2psi-mi:“MI:0914”(association)0.730
HNRNPKU2AF1psi-mi:“MI:0915”(physical association)0.670
U2AF1HNRNPKpsi-mi:“MI:0915”(physical association)0.670
U2AF1U2AF2psi-mi:“MI:0914”(association)0.670
U2AF1U2AF2psi-mi:“MI:0915”(physical association)0.670
ATXN1U2AF1psi-mi:“MI:0915”(physical association)0.670
NCBP2KPNA3psi-mi:“MI:0914”(association)0.640

BioGRID (399): U2AF1 (Two-hybrid), U2AF1 (Two-hybrid), U2AF1 (Two-hybrid), U2AF1 (Two-hybrid), RNPS1 (Two-hybrid), U2AF2 (Two-hybrid), U2AF1 (Affinity Capture-MS), U2AF1 (Affinity Capture-MS), U2AF1 (Affinity Capture-MS), U2AF1 (Affinity Capture-MS), U2AF1 (Affinity Capture-MS), U2AF2 (Two-hybrid), FTSJ3 (Co-fractionation), U2AF2 (Affinity Capture-Western), U2AF1 (Co-fractionation)

ESM2 similar proteins: A1A4K8, A1Z9L3, A2A4P0, B5DGI7, E0X9N4, O01159, O23212, O48534, P17133, P90727, P90978, Q01081, Q08C72, Q09176, Q09217, Q14498, Q21832, Q27W01, Q28BZ1, Q2QKB3, Q2QKB4, Q2QZL4, Q2R0Q1, Q3T127, Q3ZCE8, Q52KN9, Q5D018, Q5RC80, Q5ZKA3, Q6AUG0, Q6IDS6, Q6P616, Q6PH90, Q7TP17, Q7ZXB5, Q8BGJ9, Q8L716, Q8VH51, Q8WU68, Q94535

Diamond homologs: A1A4K8, O13845, Q01081, Q09176, Q15695, Q15696, Q29350, Q3T127, Q62377, Q64707, Q6AUG0, Q6YVX9, Q7TP17, Q8BGJ9, Q8WU68, Q94535, Q9D883, Q9FMY5, Q9S709, Q9SY74, Q9ZQW8, A1A5R1, A6NDE4, A6NEQ0, A6NFN3, A6QPR6, B0BNE4, O14369, O43251, O89086, P04147, P0C7P1, P0CB38, P0DJD3, P0DJD4, P19339, P41891, P42696, P57052, P60047

SIGNOR signaling

2 interactions.

AEffectBMechanism
U2AF1“down-regulates quantity by repression”TP53“transcriptional regulation”
U2AF1“form complex”U2AF1/U2AF2binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 127 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Transport of Mature Transcript to Cytoplasm727.2×3e-07
mRNA 3’-end processing1122.1×2e-10
mRNA Splicing - Minor Pathway920.6×3e-08
mRNA Splicing1719.1×2e-15
Processing of Capped Intron-Containing Pre-mRNA2218.4×1e-19
Transport of Mature mRNA derived from an Intron-Containing Transcript1117.1×3e-09
mRNA Polyadenylation1917.0×3e-16
mRNA Splicing - Major Pathway3016.7×3e-26

GO biological processes:

GO termPartnersFoldFDR
regulation of mRNA splicing, via spliceosome753.1×1e-08
spliceosomal complex assembly946.3×7e-11
negative regulation of mRNA splicing, via spliceosome639.3×1e-06
U2-type prespliceosome assembly526.7×2e-04
mRNA splicing, via spliceosome2519.6×2e-22
regulation of alternative mRNA splicing, via spliceosome816.7×4e-06
RNA splicing1914.3×4e-14
mRNA export from nucleus512.6×4e-03

Disease & clinical

Cancer significance

From CIViC — curated cancer-variant interpretation:

U2AF1 is one of several spliceosome complex genes frequently mutated in a variety of hematologic malignancies, particularly de novo myelodysplastic syndromes (MDS), as well as solid tumors such as lung and pancreatic cancers. Two hotspot mutations (S34 and Q157) occur within the two zinc-finger domains of the U2AF1 protein. These mutations have been associated with altered splicing patterns in vitro and in vivo. U2AF1 mutations in MDS have been associated with an increased risk of transformation to secondary acute myeloid leukemia, however, the impact of these mutations on overall survival has been an area of debate.

From intOGen — cancer-driver classification: activating (oncogene-like) across 9 cancer types — AML, CHOL, LUAD, NSCLC, PAAD, PRAD, STAD, UCEC, UCS.

Clinical variants and AI predictions

ClinVar

23 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic1
Uncertain significance6
Likely benign0
Benign1

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
376024NM_006758.3(U2AF1):c.470A>C (p.Gln157Pro)Likely pathogenic

SpliceAI

1775 predictions. Top by Δscore:

VariantEffectΔscore
21:43094465:A:ACdonor_gain1.0000
21:43094466:C:CCdonor_gain1.0000
21:43094466:CTCA:Cdonor_gain1.0000
21:43094467:TCA:Tdonor_loss1.0000
21:43094468:CACTT:Cdonor_loss1.0000
21:43094469:A:ACdonor_gain1.0000
21:43094469:AC:Adonor_loss1.0000
21:43094469:ACTT:Adonor_gain1.0000
21:43094470:C:CAdonor_gain1.0000
21:43094470:CT:Cdonor_gain1.0000
21:43094470:CTT:Cdonor_gain1.0000
21:43094470:CTTC:Cdonor_gain1.0000
21:43094470:CTTCT:Cdonor_gain1.0000
21:43094472:T:TAdonor_gain1.0000
21:43094559:ATTCT:Aacceptor_gain1.0000
21:43094560:TTCT:Tacceptor_gain1.0000
21:43094560:TTCTC:Tacceptor_loss1.0000
21:43094562:CT:Cacceptor_gain1.0000
21:43094563:TC:Tacceptor_loss1.0000
21:43094564:C:CAacceptor_loss1.0000
21:43094564:C:CCacceptor_gain1.0000
21:43094565:T:Gacceptor_loss1.0000
21:43094651:TCACC:Tdonor_loss1.0000
21:43094652:C:CGdonor_loss1.0000
21:43094652:CACC:Cdonor_loss1.0000
21:43094653:A:ACdonor_gain1.0000
21:43094653:AC:Adonor_gain1.0000
21:43094653:ACC:Adonor_gain1.0000
21:43094653:ACCC:Adonor_gain1.0000
21:43094654:C:CCdonor_gain1.0000

AlphaMissense

1571 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
21:43094513:G:AS178F1.000
21:43094521:C:AK175N1.000
21:43094521:C:GK175N1.000
21:43094527:A:CH173Q1.000
21:43094527:A:TH173Q1.000
21:43094528:T:CH173R1.000
21:43094529:G:CH173D1.000
21:43094529:G:TH173N1.000
21:43094531:A:CM172R1.000
21:43094531:A:GM172T1.000
21:43094531:A:TM172K1.000
21:43094533:G:CF171L1.000
21:43094533:G:TF171L1.000
21:43094534:A:CF171C1.000
21:43094534:A:GF171S1.000
21:43094535:A:CF171V1.000
21:43094535:A:GF171L1.000
21:43094535:A:TF171I1.000
21:43094536:G:CN170K1.000
21:43094536:G:TN170K1.000
21:43094537:T:AN170I1.000
21:43094538:T:CN170D1.000
21:43094538:T:GN170H1.000
21:43094539:G:CC169W1.000
21:43094540:C:AC169F1.000
21:43094540:C:GC169S1.000
21:43094540:C:TC169Y1.000
21:43094541:A:CC169G1.000
21:43094541:A:GC169R1.000
21:43094541:A:TC169S1.000

dbSNP variants (sampled 300 via entrez): RS1000371308 (21:43106997 A>G), RS1001817932 (21:43096098 C>A), RS1002945584 (21:43098306 T>C), RS1003261434 (21:43107799 C>T), RS1003387260 (21:43103551 G>A,C), RS1003540651 (21:43097070 A>T), RS1003945250 (21:43094192 C>T), RS1004757101 (21:43104039 G>T), RS1004837729 (21:43092804 G>A), RS1004892284 (21:43097889 C>T), RS1005560446 (21:43100417 C>G), RS1005858603 (21:43100115 G>A), RS1006276467 (21:43104987 G>A), RS1006433932 (21:43105309 AT>A,ATT), RS1007001246 (21:43106828 C>G)

Disease associations

OMIM: gene MIM:191317 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

3 associations (top):

StudyTraitp-value
GCST002670_11Blood and toenail selenium levels3.000000e-08
GCST002670_12Blood and toenail selenium levels5.000000e-09
GCST002670_4Blood and toenail selenium levels4.000000e-09

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL5725064 (SINGLE PROTEIN)

Molecules with ChEMBL bioactivity

1 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 1,538 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL1232461MOLIBRESIB21,538

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

6 potent at pChembl≥5 of 6 total, top 6 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
9.28Kd0.531nMCHEMBL3752910
9.28ED500.531nMCHEMBL3752910
6.95Kd112nMMOLIBRESIB
6.86Kd137.4nMCHEMBL5653589
6.86ED50137.4nMCHEMBL5653589
6.70IC50200nMMOLIBRESIB

PubChem BioAssay actives

4 with measured affinity, of 11 total; 3 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
4-methyl-3-[(1-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2149697: Binding affinity to human U2AF1 incubated for 45 mins by Kinobead based pull down assaykd0.0005uM
2-[(4S)-6-(4-chlorophenyl)-8-methoxy-1-methyl-4H-[1,2,4]triazolo[4,3-a][1,4]benzodiazepin-4-yl]-N-ethylacetamide2179137: Binding affinity against U2AF1 (unknown origin) assessed as apparent dissociation constant incubated for 1 hr by colloidal coomassie staining based LC-MS/MS analysiskd0.1120uM
4-methyl-3-[(2-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2149697: Binding affinity to human U2AF1 incubated for 45 mins by Kinobead based pull down assaykd0.1374uM

CTD chemical–gene interactions

36 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Acetaminophendecreases expression, increases expression2
TAK-243increases sumoylation1
triphenyl phosphateaffects expression1
alpha-pineneaffects cotreatment, decreases expression, increases abundance1
bisphenol Adecreases expression1
methylparabenincreases expression1
methacrylaldehydedecreases expression, increases abundance, affects cotreatment1
di-n-butylphosphoric acidaffects expression1
bisphenol Bincreases expression1
bisphenol Sincreases expression1
qinghuangaffects response to substance1
NSC668394increases expression1
Temozolomideincreases expression1
Acroleinaffects cotreatment, decreases expression, increases abundance1
Air Pollutantsaffects cotreatment, decreases expression, increases abundance1
Atrazinedecreases expression1
Benzo(a)pyreneaffects methylation1
Cytarabinedecreases expression1
Enzyme Inhibitorsincreases O-linked glycosylation, decreases activity1
Leadaffects expression, affects splicing1
Methotrexateincreases expression1
Ozoneaffects cotreatment, decreases expression, increases abundance1
Paraquatincreases expression1
Ribonucleotidesaffects binding1
Seleniumincreases expression1
Silicon Dioxideincreases expression1
Smokedecreases expression1
Tobacco Smoke Pollutionincreases expression1
Vitamin Eincreases expression1
Cyclosporineincreases expression1

ChEMBL screening assays

8 unique, capped per target: 8 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5652739BindingBinding affinity to human U2AF1 incubated for 45 mins by Kinobead based pull down assayNVP-BHG712: Effects of Regioisomers on the Affinity and Selectivity toward the EPHrin Family. — ChemMedChem

Cellosaurus cell lines

4 cell lines: 4 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_B8A7Abcam Raji U2AF1 KOCancer cell lineMale
CVCL_C0B1Abcam THP-1 U2AF1 KOCancer cell lineMale
CVCL_C7CPAbcam PC-3 U2AF1 KOCancer cell lineMale
CVCL_S607SKK-1Cancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.