U2AF1L4

gene
On this page

Also known as MGC33901U2af26

Summary

U2AF1L4 (U2 small nuclear RNA auxiliary factor 1 like 4, HGNC:23020) is a protein-coding gene on chromosome 19q13.12, encoding Splicing factor U2AF 26 kDa subunit (Q8WU68). RNA-binding protein that function as a pre-mRNA splicing factor.

Predicted to enable pre-mRNA 3’-splice site binding activity. Predicted to be involved in mRNA splicing, via spliceosome. Predicted to be located in nucleoplasm. Predicted to be part of U2AF complex and spliceosomal complex.

Source: NCBI Gene 199746 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 50 total
  • MANE Select transcript: NM_001040425

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:23020
Approved symbolU2AF1L4
NameU2 small nuclear RNA auxiliary factor 1 like 4
Location19q13.12
Locus typegene with protein product
StatusApproved
AliasesMGC33901, U2af26
Ensembl geneENSG00000161265
Ensembl biotypeprotein_coding
OMIM601080
Entrez199746

Gene structure

Transcript identifiers

Ensembl transcripts: 21 — 9 retained_intron, 7 nonsense_mediated_decay, 4 protein_coding, 1 protein_coding_CDS_not_defined

ENST00000292879, ENST00000378975, ENST00000412391, ENST00000585554, ENST00000585771, ENST00000586476, ENST00000587886, ENST00000587987, ENST00000588100, ENST00000588892, ENST00000588980, ENST00000589429, ENST00000590135, ENST00000590650, ENST00000591057, ENST00000591084, ENST00000591855, ENST00000592913, ENST00000594792, ENST00000600296, ENST00000601236

RefSeq mRNA: 3 — MANE Select: NM_001040425 NM_001040425, NM_001369824, NM_144987

CCDS: CCDS12473, CCDS42551

Canonical transcript exons

ENST00000378975 — 6 exons

ExonStartEnd
ENSE000029037423574534835745418
ENSE000035340243574512535745212
ENSE000035782263574380935743904
ENSE000035962923574401235744145
ENSE000036383073574432335744421
ENSE000036438443574252735742803

Expression profiles

Bgee: expression breadth ubiquitous, 134 present calls, max score 95.06.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 10.4293 / max 122.9063, expressed in 1765 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
1805517.48331736
1805492.6447490
1805500.3013149

Top tissues by expression

134 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
left lobe of thyroid glandUBERON:000112095.06gold quality
right lobe of thyroid glandUBERON:000111994.98gold quality
thyroid glandUBERON:000204694.85gold quality
pituitary glandUBERON:000000794.33gold quality
adenohypophysisUBERON:000219694.29gold quality
right uterine tubeUBERON:000130293.98gold quality
right adrenal gland cortexUBERON:003582793.81gold quality
metanephros cortexUBERON:001053393.26gold quality
right adrenal glandUBERON:000123393.24gold quality
right lobe of liverUBERON:000111493.16gold quality
hindlimb stylopod muscleUBERON:000425292.82gold quality
apex of heartUBERON:000209892.75gold quality
left adrenal gland cortexUBERON:003582592.72gold quality
spleenUBERON:000210692.66gold quality
primary visual cortexUBERON:000243692.64gold quality
granulocyteCL:000009492.61gold quality
esophagogastric junction muscularis propriaUBERON:003584192.54gold quality
left adrenal glandUBERON:000123492.52gold quality
lower esophagus muscularis layerUBERON:003583392.44gold quality
lower esophagusUBERON:001347392.41gold quality
mucosa of stomachUBERON:000119992.09gold quality
right testisUBERON:000453492.06gold quality
right hemisphere of cerebellumUBERON:001489091.99gold quality
left testisUBERON:000453391.92gold quality
muscle layer of sigmoid colonUBERON:003580591.80gold quality
tibial arteryUBERON:000761091.76gold quality
popliteal arteryUBERON:000225091.75gold quality
left uterine tubeUBERON:000130391.70gold quality
body of stomachUBERON:000116191.60gold quality
fundus of stomachUBERON:000116091.49gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes7.54

Regulation

Is transcription factor: no

Literature-anchored findings (GeneRIF, showing 5)

  • Differential isoform expression and interaction with the P32 regulatory protein controls the subcellular localization of the splicing factor U2AF26 (PMID:18460468)
  • One of the conservative regions is responsible for the enhancer activity and is able to interact with proteins of HeLa cell nuclear extract. (PMID:22022969)
  • a 269 bp region located between the PSENEN and U2AF1L4 human genes is a genuine bidirectional promoter regulating a concerted divergent transcription of these genes. (PMID:23246698)
  • U2AF1L4 gene expression is regulated by cytokines in activated T cells. (PMID:27143377)
  • The zinc finger domains in U2AF26 and U2AF35 have diverse functionalities including a role in controlling translation. (PMID:32116123)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
mus_musculusU2af1l4ENSMUSG00000078765
rattus_norvegicusU2af1l4ENSRNOG00000024497
drosophila_melanogasterU2af38FBGN0017457
caenorhabditis_elegansWBGENE00006698

Paralogs (2): U2AF1 (ENSG00000160201), ZRSR2 (ENSG00000169249)

Protein

Protein identifiers

Splicing factor U2AF 26 kDa subunitQ8WU68 (reviewed: Q8WU68)

Alternative names: U2 auxiliary factor 26, U2 small nuclear RNA auxiliary factor 1-like protein 4, U2(RNU2) small nuclear RNA auxiliary factor 1-like protein 3

All UniProt accessions (9): A0A1D5RMU4, Q8WU68, K7EJH3, K7EJM7, M0QYK5, M0R1E1, S4R353, S4R3L8, S4R443

UniProt curated annotations — full annotation on UniProt →

Function. RNA-binding protein that function as a pre-mRNA splicing factor. Plays a critical role in both constitutive and enhancer-dependent splicing by mediating protein-protein interactions and protein-RNA interactions required for accurate 3’-splice site selection. Acts by enhancing the binding of U2AF2 to weak pyrimidine tracts. Also participates in the regulation of alternative pre-mRNA splicing. Activates exon 5 skipping of PTPRC during T-cell activation; an event reversed by GFI1. Binds to RNA at the AG dinucleotide at the 3’-splice site. Shows a preference for AGC or AGA.

Subunit / interactions. Interacts with GFI1, U2AF2 and C1QBP.

Subcellular location. Nucleus. Nucleus speckle. Cytoplasm.

Tissue specificity. Isoform 2 is widely expressed. Isoform 3 is highly expressed in heart, brain and lung, lower expressed in thymus and much lower expressed in peripheral blood leukocytes.

Domain organisation. The second zinc finger in necessary for interaction with GFI1 and for alternative pre-mRNA splicing events. The region 162-220 is essential for the nuclear import of the protein in spite of the absence of a nuclear localization signal (NLS). This region is essential for the interaction with C1QBP, interaction which is required for the nuclear translocation. This region may be involved in the localization in nuclear dot-like structures and it also confers the ability of nucleo-cytoplasmic shuttling.

Similarity. Belongs to the splicing factor SR family.

Isoforms (3)

UniProt IDNamesCanonical?
Q8WU68-11yes
Q8WU68-22
Q8WU68-33

RefSeq proteins (3): NP_001035515, NP_001356753, NP_659424 (=MANE)

Domains & families (InterPro)

IDNameType
IPR000504RRM_domDomain
IPR000571Znf_CCCHDomain
IPR003954RRM_euk-typeDomain
IPR009145U2AF_smallFamily
IPR012677Nucleotide-bd_a/b_plait_sfHomologous_superfamily
IPR035979RBD_domain_sfHomologous_superfamily

Pfam: PF00076, PF00642

UniProt features (12 total): splice variant 2, zinc finger region 2, compositionally biased region 2, initiator methionine 1, chain 1, sequence conflict 1, domain 1, region of interest 1, modified residue 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8WU68-F183.530.67

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (1): 2

Function

Pathways and Gene Ontology

Reactome pathways

6 pathways

IDPathway
R-HSA-159236Transport of Mature mRNA derived from an Intron-Containing Transcript
R-HSA-72163mRNA Splicing - Major Pathway
R-HSA-72187mRNA 3’-end processing
R-HSA-9770562mRNA Polyadenylation
R-HSA-9918481Dengue Virus-Host Interactions
R-HSA-73856RNA Polymerase II Transcription Termination

MSigDB gene sets: 87 (showing top): STARK_PREFRONTAL_CORTEX_22Q11_DELETION_DN, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_DN, GRAESSMANN_RESPONSE_TO_MC_AND_DOXORUBICIN_DN, AACYNNNNTTCCS_UNKNOWN, CREBP1_Q2, CREB_Q4, GATA3_01, ATF1_Q6, REACTOME_MRNA_3_END_PROCESSING, REACTOME_PROCESSING_OF_CAPPED_INTRON_CONTAINING_PRE_MRNA, HNF4_DR1_Q3, E4F1_Q6, GOBP_RNA_SPLICING, REACTOME_MRNA_SPLICING, AACTTT_UNKNOWN

GO Biological Process (3): mRNA splicing, via spliceosome (GO:0000398), mRNA processing (GO:0006397), RNA splicing (GO:0008380)

GO Molecular Function (5): zinc ion binding (GO:0008270), pre-mRNA 3’-splice site binding (GO:0030628), nucleic acid binding (GO:0003676), RNA binding (GO:0003723), metal ion binding (GO:0046872)

GO Cellular Component (6): nucleoplasm (GO:0005654), spliceosomal complex (GO:0005681), cytoplasm (GO:0005737), nuclear speck (GO:0016607), U2AF complex (GO:0089701), nucleus (GO:0005634)

Reactome top-level categories

Rollup of top-6 pathways:

CategoryPathways
Transport of Mature Transcript to Cytoplasm1
mRNA Splicing1
Processing of Capped Intron-Containing Pre-mRNA1
mRNA 3’-end processing1
Dengue Virus Infection1
RNA Polymerase II Transcription1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA processing2
cellular anatomical structure2
nuclear protein-containing complex2
RNA splicing, via transesterification reactions with bulged adenosine as nucleophile1
mRNA processing1
mRNA metabolic process1
transition metal ion binding1
pre-mRNA binding1
binding1
nucleic acid binding1
cation binding1
nuclear lumen1
ribonucleoprotein complex1
intracellular anatomical structure1
nuclear ribonucleoprotein granule1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

0 interactions, top by confidence (×1000):

IntAct

12 interactions, top by confidence:

ABTypeScore
NNOP56psi-mi:“MI:0914”(association)0.530
NRBM47psi-mi:“MI:0914”(association)0.530
GFI1U2AF1L4psi-mi:“MI:0915”(physical association)0.400
Gfi1U2AF1L4psi-mi:“MI:0915”(physical association)0.400
NRBM47psi-mi:“MI:0914”(association)0.350
U2AF1RGPD2psi-mi:“MI:0914”(association)0.350

BioGRID (18): U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS), U2AF1L4 (Affinity Capture-MS)

ESM2 similar proteins: A1A4K8, A1Z9L3, A2A4P0, B5DGI7, E0X9N4, O01159, O23212, O48534, P17133, P90727, P90978, Q01081, Q08C72, Q09176, Q09217, Q14498, Q21832, Q27W01, Q28BZ1, Q2QKB3, Q2QKB4, Q2QZL4, Q2R0Q1, Q3T127, Q3ZCE8, Q52KN9, Q5D018, Q5RC80, Q5ZKA3, Q6AUG0, Q6IDS6, Q6P616, Q6PH90, Q7TP17, Q7ZXB5, Q8BGJ9, Q8L716, Q8VH51, Q8WU68, Q94535

Diamond homologs: A1A4K8, O13845, Q01081, Q09176, Q15695, Q15696, Q29350, Q3T127, Q62377, Q64707, Q6AUG0, Q6YVX9, Q7TP17, Q8BGJ9, Q8WU68, Q94535, Q9D883, Q9FMY5, Q9S709, Q9SY74, Q9ZQW8, Q59LX5, Q8BGC0, O43719, Q5RB63, P90727, P90978, P97343, Q24562, Q5RCY1, Q63285, Q8TAS1

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

50 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance30
Likely benign12
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1785 predictions. Top by Δscore:

VariantEffectΔscore
19:35744022:A:ACdonor_gain1.0000
19:35744023:C:CCdonor_gain1.0000
19:35744040:C:CAdonor_gain1.0000
19:35744146:C:CCacceptor_gain1.0000
19:35744318:AGCAC:Adonor_loss1.0000
19:35744319:GCAC:Gdonor_loss1.0000
19:35744320:CACC:Cdonor_loss1.0000
19:35744321:ACCTT:Adonor_loss1.0000
19:35744322:CCTT:Cdonor_loss1.0000
19:35744417:ACCTC:Aacceptor_gain1.0000
19:35744418:CCTCC:Cacceptor_gain1.0000
19:35744419:CTC:Cacceptor_gain1.0000
19:35744420:TCCTG:Tacceptor_loss1.0000
19:35744421:CCTG:Cacceptor_gain1.0000
19:35744422:C:CCacceptor_gain1.0000
19:35744423:T:Gacceptor_loss1.0000
19:35744620:A:ACdonor_gain1.0000
19:35744621:C:CCdonor_gain1.0000
19:35744624:A:ACdonor_gain1.0000
19:35744625:C:CCdonor_gain1.0000
19:35745120:CTCA:Cdonor_loss1.0000
19:35745121:TCAC:Tdonor_loss1.0000
19:35745122:CA:Cdonor_loss1.0000
19:35745123:ACCTG:Adonor_loss1.0000
19:35745124:C:Adonor_loss1.0000
19:35745124:CCTGG:Cdonor_gain1.0000
19:35745330:C:CAdonor_gain1.0000
19:35745346:A:ACdonor_gain1.0000
19:35745347:C:CCdonor_gain1.0000
19:35743798:T:TAdonor_gain0.9900

AlphaMissense

1200 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
19:35745197:A:CF20L0.981
19:35745197:A:TF20L0.981
19:35745199:A:GF20L0.981
19:35743874:G:CF171L0.978
19:35743874:G:TF171L0.978
19:35743876:A:GF171L0.978
19:35744143:G:CF117L0.974
19:35744143:G:TF117L0.974
19:35744145:A:GF117L0.974
19:35745365:G:CF9L0.969
19:35745365:G:TF9L0.969
19:35745367:A:GF9L0.969
19:35744413:G:CF86L0.966
19:35744413:G:TF86L0.966
19:35744415:A:GF86L0.966
19:35744044:G:CF150L0.965
19:35744044:G:TF150L0.965
19:35744046:A:GF150L0.965
19:35745131:G:CF42L0.955
19:35745131:G:TF42L0.955
19:35745133:A:GF42L0.955
19:35744144:A:GF117S0.954
19:35743883:G:CF168L0.945
19:35743883:G:TF168L0.945
19:35743885:A:GF168L0.945
19:35744089:G:CF135L0.943
19:35744089:G:TF135L0.943
19:35744091:A:GF135L0.943
19:35745128:G:CS43R0.936
19:35745128:G:TS43R0.936

dbSNP variants (sampled 300 via entrez): RS1001123270 (19:35746666 C>A), RS1001532039 (19:35746250 C>T), RS1002437820 (19:35746106 G>A), RS1003090496 (19:35743423 G>A), RS1003120808 (19:35744197 G>A,T), RS1003447402 (19:35744976 A>T), RS1003974626 (19:35745396 A>G), RS1004089174 (19:35745518 C>A,T), RS1005034610 (19:35744258 C>T), RS1006039121 (19:35742666 C>A,T), RS1006262926 (19:35746051 G>A), RS1007028135 (19:35744616 C>T), RS1007528852 (19:35745561 G>A,C,T), RS1008033533 (19:35743039 G>A,C), RS1008057979 (19:35745761 C>T)

Disease associations

OMIM: gene MIM:601080 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

31 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Sdecreases expression, affects cotreatment, increases expression2
Air Pollutantsincreases abundance, increases expression2
Tunicamycinincreases expression2
GSK-J4increases expression1
triphenyl phosphateaffects expression1
beta-lapachoneincreases expression1
tris(1,3-dichloro-2-propyl)phosphateincreases expression1
perfluorooctane sulfonic acidincreases expression1
pentabromodiphenyl etherincreases expression1
CGP 52608affects binding, increases reaction1
ICG 001increases expression1
abrineincreases expression1
2,2’,4,4’-tetrabromodiphenyl etherincreases expression1
Sunitinibincreases expression1
Atrazineincreases expression1
Coalincreases abundance, increases expression1
Dexamethasoneaffects cotreatment, increases expression1
Diazinonincreases methylation1
Diethylhexyl Phthalateincreases expression1
Dimethyl Sulfoxideaffects expression1
Doxorubicindecreases expression1
Indomethacinaffects cotreatment, increases expression1
Methyl Methanesulfonateincreases expression1
Smokeincreases abundance, increases expression1
Tetrachlorodibenzodioxinaffects expression1
Thiramincreases expression1
Valproic Acidaffects expression1
1-Methyl-3-isobutylxanthineaffects cotreatment, increases expression1
Cyclosporineincreases expression1
Thapsigarginincreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.