SF3A1

gene
On this page

Also known as SF3a120SAP114PRPF21Prp21

Summary

SF3A1 (splicing factor 3a subunit 1, HGNC:10765) is a protein-coding gene on chromosome 22q12.2, encoding Splicing factor 3A subunit 1 (Q15459). Component of the 17S U2 SnRNP complex of the spliceosome, a large ribonucleoprotein complex that removes introns from transcribed pre-mRNAs. It is a common-essential gene (DepMap: required in 99.8% of cancer cell lines).

This gene encodes a subunit of the splicing factor 3a protein complex. The splicing factor 3a heterotrimer is a component of the mature U2 small nuclear ribonucleoprotein particle (snRNP). U2 small nuclear ribonucleoproteins play a critical role in spliceosome assembly and pre-mRNA splicing.

Source: NCBI Gene 10291 — RefSeq curated summary.

At a glance

  • GWAS associations: 3
  • Clinical variants (ClinVar): 55 total — 1 pathogenic
  • Druggable target: yes
  • Cancer dependency (DepMap): dependent in 99.8% of screened cell lines (common-essential)
  • MANE Select transcript: NM_005877

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:10765
Approved symbolSF3A1
Namesplicing factor 3a subunit 1
Location22q12.2
Locus typegene with protein product
StatusApproved
AliasesSF3a120, SAP114, PRPF21, Prp21
Ensembl geneENSG00000099995
Ensembl biotypeprotein_coding
OMIM605595
Entrez10291

Gene structure

Transcript identifiers

Ensembl transcripts: 12 — 6 retained_intron, 4 protein_coding, 2 nonsense_mediated_decay

ENST00000215793, ENST00000411423, ENST00000444440, ENST00000447376, ENST00000463818, ENST00000471037, ENST00000471342, ENST00000485618, ENST00000496189, ENST00000498259, ENST00000872796, ENST00000872797

RefSeq mRNA: 1 — MANE Select: NM_005877 NM_005877

CCDS: CCDS13875

Canonical transcript exons

ENST00000215793 — 16 exons

ExonStartEnd
ENSE000006521873033565230335753
ENSE000006521883033702630337180
ENSE000006521933034069530340812
ENSE000006521943034169230341885
ENSE000006521973034493330345190
ENSE000006521983034631230346519
ENSE000013323263033769030337897
ENSE000013323353033546730335538
ENSE000013323383034019630340381
ENSE000013619293034220030342350
ENSE000017014903033198830334695
ENSE000018170753035673030356894
ENSE000034723273034280530342879
ENSE000034996753035295130353072
ENSE000035193683033913030339251
ENSE000036286713033878930339034

Expression profiles

Bgee: expression breadth ubiquitous, 300 present calls, max score 98.76.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 142.9451 / max 1006.6605, expressed in 1828 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
193623140.59351828
1936201.4059950
1936210.5234293
1936190.4222200

Top tissues by expression

301 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
spermCL:000001998.76gold quality
male germ cellCL:000001598.23gold quality
sural nerveUBERON:001548897.06gold quality
stromal cell of endometriumCL:000225596.82gold quality
ganglionic eminenceUBERON:000402396.66gold quality
ventricular zoneUBERON:000305396.61gold quality
colonic epitheliumUBERON:000039796.46gold quality
bone marrow cellCL:000209295.96gold quality
cortical plateUBERON:000534395.76gold quality
left ovaryUBERON:000211995.51gold quality
left testisUBERON:000453395.51gold quality
right testisUBERON:000453495.47gold quality
vermiform appendixUBERON:000115495.34gold quality
hindlimb stylopod muscleUBERON:000425295.34gold quality
bloodUBERON:000017895.16gold quality
right ovaryUBERON:000211895.05gold quality
tonsilUBERON:000237294.89gold quality
lymph nodeUBERON:000002994.85gold quality
granulocyteCL:000009494.82gold quality
islet of LangerhansUBERON:000000694.56gold quality
body of uterusUBERON:000985394.55gold quality
caecumUBERON:000115394.54gold quality
embryoUBERON:000092294.49gold quality
leukocyteCL:000073894.44gold quality
nippleUBERON:000203094.40gold quality
right lungUBERON:000216794.35gold quality
testisUBERON:000047394.34gold quality
mononuclear cellCL:000084294.24gold quality
popliteal arteryUBERON:000225094.21gold quality
tibial arteryUBERON:000761094.20gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes10.82

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

108 targeting SF3A1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3646100.0073.565283
HSA-MIR-126-5P100.0072.713180
HSA-MIR-6798-5P100.0065.77699
HSA-MIR-4795-3P100.0074.624024
HSA-MIR-513B-5P99.9969.962150
HSA-MIR-366299.9973.825684
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-1213699.9872.815713
HSA-MIR-477599.9875.006394
HSA-MIR-548N99.9871.944170
HSA-MIR-4789-5P99.9870.762721
HSA-MIR-50799.9770.111915
HSA-MIR-4666A-3P99.9671.713434
HSA-MIR-590-3P99.9674.346478
HSA-MIR-55799.9670.011640
HSA-MIR-55999.9572.283609
HSA-MIR-548AB99.9571.313488
HSA-MIR-548AD-5P99.9471.233502
HSA-MIR-548AE-5P99.9471.233502
HSA-MIR-548AY-5P99.9471.233502
HSA-MIR-548B-5P99.9471.233502
HSA-MIR-548BB-5P99.9471.273509
HSA-MIR-548D-5P99.9471.233502
HSA-MIR-548A-5P99.9471.273482
HSA-MIR-548AK99.9471.243488
HSA-MIR-548AM-5P99.9471.243488
HSA-MIR-548AP-5P99.9471.143489
HSA-MIR-548AQ-5P99.9471.343426
HSA-MIR-548AR-5P99.9471.283515
HSA-MIR-548AS-5P99.9471.223482

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 99.8% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 7)

  • SF3a60, 66, and 120, but not SF1, are essential for pre-mRNA splicing (PMID:15647371)
  • the SF3A mRNA splicing complex controls production of a negative regulator of TLR signaling that limits the extent of innate immune activation. (PMID:24204290)
  • show that U1-SL4 interacts with the SF3A1 protein of the U2 snRNP (PMID:25403181)
  • SF3A1 polymorphisms are not associated with Colorectal Cancer Risk. (PMID:26079486)
  • Polymorphisms in SF3A1 gene is associated with pancreatic cancer. (PMID:26498691)
  • data show that the UBL domain of SF3A1 can function as an RNA binding domain and that mutations in this region may interfere with U1-SL4 binding (PMID:31383795)
  • Structural insights into recognition of SL4, the UUCG stem-loop, of human U1 snRNA by the ubiquitin-like domain, including the C-terminal tail in the SF3A1 subunit of U2 snRNP. (PMID:37094335)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriosf3a1ENSDARG00000041887
mus_musculusSf3a1ENSMUSG00000002129
rattus_norvegicusSf3a1ENSRNOG00000005218
drosophila_melanogasterSf3a1FBGN0266917
caenorhabditis_elegansWBGENE00004188

Protein

Protein identifiers

Splicing factor 3A subunit 1Q15459 (reviewed: Q15459)

Alternative names: SF3a120, Spliceosome-associated protein 114

All UniProt accessions (4): Q15459, F8WB66, F8WC79, H7C1L2

UniProt curated annotations — full annotation on UniProt →

Function. Component of the 17S U2 SnRNP complex of the spliceosome, a large ribonucleoprotein complex that removes introns from transcribed pre-mRNAs. The 17S U2 SnRNP complex (1) directly participates in early spliceosome assembly and (2) mediates recognition of the intron branch site during pre-mRNA splicing by promoting the selection of the pre-mRNA branch-site adenosine, the nucleophile for the first step of splicing. Within the 17S U2 SnRNP complex, SF3A1 is part of the SF3A subcomplex that contributes to the assembly of the 17S U2 snRNP, and the subsequent assembly of the pre-spliceosome ‘E’ complex and the pre-catalytic spliceosome ‘A’ complex. Involved in pre-mRNA splicing as a component of pre-catalytic spliceosome ‘B’ complexes.

Subunit / interactions. Component of the 17S U2 SnRNP complex, a ribonucleoprotein complex that contains small nuclear RNA (snRNA) U2 and a number of specific proteins. Part of the SF3A subcomplex of the 17S U2 SnRNP complex which is composed of three subunits; SF3A3/SAP61, SF3A2/SAP62 and SF3A1/SAP114. SF3A associates with the splicing factor SF3B and a 12S RNA unit to form the mature 17S U2 small nuclear ribonucleoprotein complex (17S U2 snRNP). SF3A1 functions as a scaffold that interacts directly with both SF3A2 and SF3A3. Identified in the spliceosome ‘E’ complex, a precursor of the spliceosome ‘A’ complex. Identified in the spliceosome ‘A’ and ‘B’ complexes. Identified in the spliceosome ‘C’ complex. Interacts with P2RX6; resulting in a reduction of the splicing activity.

Subcellular location. Nucleus. Nucleus speckle.

Tissue specificity. Ubiquitously expressed.

Domain organisation. SURP motif 2 mediates direct binding to SF3A3.

Isoforms (2)

UniProt IDNamesCanonical?
Q15459-11yes
Q15459-22

RefSeq proteins (1): NP_005868* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000061SurpDomain
IPR000626Ubiquitin-like_domDomain
IPR022030SF3A1_domDomain
IPR029071Ubiquitin-like_domsfHomologous_superfamily
IPR035563SF3As1_ubiDomain
IPR035967SWAP/Surp_sfHomologous_superfamily
IPR045146SF3A1Family

Pfam: PF00240, PF01805, PF12230

UniProt features (74 total): helix 18, strand 12, compositionally biased region 9, modified residue 9, region of interest 6, cross-link 6, mutagenesis site 4, repeat 2, turn 2, initiator methionine 1, chain 1, site 1, splice variant 1, sequence variant 1, domain 1

Structure

Experimental structures (PDB)

51 structures, top 30 by resolution.

PDBMethodResolution (Å)
7P0VX-RAY DIFFRACTION1.56
8ID2X-RAY DIFFRACTION1.8
7EVOELECTRON MICROSCOPY2.5
8H6LELECTRON MICROSCOPY2.6
8H6KELECTRON MICROSCOPY2.7
8HK1ELECTRON MICROSCOPY2.7
7VPXELECTRON MICROSCOPY3
8I0RELECTRON MICROSCOPY3
8I0TELECTRON MICROSCOPY3
8Q7NELECTRON MICROSCOPY3.1
8QOZELECTRON MICROSCOPY3.1
8QPEELECTRON MICROSCOPY3.1
8H6EELECTRON MICROSCOPY3.2
8H6JELECTRON MICROSCOPY3.25
6QX9ELECTRON MICROSCOPY3.28
8I0PELECTRON MICROSCOPY3.4
9ZE0ELECTRON MICROSCOPY3.43
8QP8ELECTRON MICROSCOPY3.5
9ZECELECTRON MICROSCOPY3.61
8QPAELECTRON MICROSCOPY3.7
8QPBELECTRON MICROSCOPY3.7
6AHDELECTRON MICROSCOPY3.8
9ZE3ELECTRON MICROSCOPY3.93
9ZEDELECTRON MICROSCOPY3.94
8QP9ELECTRON MICROSCOPY4.1
8QZSELECTRON MICROSCOPY4.1
8I0SELECTRON MICROSCOPY4.2
8QPKELECTRON MICROSCOPY4.2
8R09ELECTRON MICROSCOPY4.3
8R0BELECTRON MICROSCOPY4.4

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q15459-F167.760.14

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 169 (critical for binding to sf3a3)

Post-translational modifications (15): 55, 320, 329, 359, 413, 451, 456, 508, 759, 20, 131, 424, 499, 542, 686

Mutagenesis-validated functional residues (4):

PositionPhenotype
48slurp 1 motif acquires binding to sf3a3; when associated with leu-55.
55slurp 1 motif acquires binding to sf3a3; when associated with phe-48.
162no effect on binding to sf3a3.
169abolishes binding to sf3a3.

Function

Pathways and Gene Ontology

Reactome pathways

7 pathways

IDPathway
R-HSA-72163mRNA Splicing - Major Pathway
R-HSA-9770562mRNA Polyadenylation
R-HSA-9918481Dengue Virus-Host Interactions
R-HSA-9943411CHD1 and CHD2 subfamily
R-HSA-72172mRNA Splicing
R-HSA-72203Processing of Capped Intron-Containing Pre-mRNA
R-HSA-8953854Metabolism of RNA

MSigDB gene sets: 197 (showing top): TSENG_IRS1_TARGETS_UP, ENK_UV_RESPONSE_KERATINOCYTE_UP, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_UP, MORF_HDAC1, MORF_UBE2N, GGAMTNNNNNTCCY_UNKNOWN, YY1_Q6, MORF_CCNI, REACTOME_MRNA_3_END_PROCESSING, GROSS_HYPOXIA_VIA_ELK3_UP, REACTOME_PROCESSING_OF_CAPPED_INTRON_CONTAINING_PRE_MRNA, GOBP_PROTEIN_RNA_COMPLEX_ORGANIZATION, GOBP_RNA_SPLICING, REACTOME_MRNA_SPLICING, GOBP_MRNA_SPLICE_SITE_RECOGNITION

GO Biological Process (7): mRNA 3’-splice site recognition (GO:0000389), mRNA splicing, via spliceosome (GO:0000398), mRNA processing (GO:0006397), mRNA cis splicing, via spliceosome (GO:0045292), U2-type prespliceosome assembly (GO:1903241), RNA processing (GO:0006396), RNA splicing (GO:0008380)

GO Molecular Function (2): RNA binding (GO:0003723), protein binding (GO:0005515)

GO Cellular Component (9): nucleus (GO:0005634), nucleoplasm (GO:0005654), spliceosomal complex (GO:0005681), U2-type spliceosomal complex (GO:0005684), U2 snRNP (GO:0005686), nuclear speck (GO:0016607), U2-type prespliceosome (GO:0071004), U2-type precatalytic spliceosome (GO:0071005), catalytic step 2 spliceosome (GO:0071013)

Reactome top-level categories

Rollup of top-6 pathways:

CategoryPathways
mRNA Splicing1
mRNA 3’-end processing1
Dengue Virus Infection1
CHD chromatin remodelers1
Processing of Capped Intron-Containing Pre-mRNA1
Metabolism of RNA1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA processing2
spliceosomal complex2
U2-type spliceosomal complex2
U1 snRNP2
U2 snRNP2
mRNA splice site recognition1
RNA splicing, via transesterification reactions with bulged adenosine as nucleophile1
mRNA processing1
mRNA metabolic process1
mRNA splicing, via spliceosome1
spliceosomal complex assembly1
gene expression1
RNA biosynthetic process1
primary metabolic process1
nucleic acid binding1
binding1
intracellular membrane-bounded organelle1
nuclear lumen1
cellular anatomical structure1
nuclear protein-containing complex1
ribonucleoprotein complex1
spliceosomal snRNP complex1
nuclear ribonucleoprotein granule1
prespliceosome1
U4/U6 x U5 tri-snRNP complex1
precatalytic spliceosome1
Prp19 complex1
U5 snRNP1
catalytic complex1

Protein interactions and networks

STRING

3057 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SF3A1SF3A3Q12874999
SF3A1SF3A2Q15428999
SF3A1SF3B2Q13435920
SF3A1SF3B3Q15393877
SF3A1U2AF1Q01081848
SF3A1SF3B1O75533826
SF3A1U2AF2P26368801
SF3A1DDX46Q7L014788
SF3A1SNRPA1P09661783
SF3A1PRPF40BQ6NWY9783
SF3A1SF3B4Q15427782
SF3A1PHF5AQ7RTV0780
SF3A1SF3B5Q9BWJ5778
SF3A1SF3B6Q9Y3B4751
SF3A1SNRNP200O75643723

IntAct

246 interactions, top by confidence:

ABTypeScore
SF1U2AF2psi-mi:“MI:0914”(association)0.950
SNRPFGEMIN2psi-mi:“MI:0914”(association)0.910
SF3A3SF3A1psi-mi:“MI:0407”(direct interaction)0.900
SF3A3SF3A1psi-mi:“MI:0915”(physical association)0.900
SF3A2SF3A1psi-mi:“MI:0915”(physical association)0.800
SMNDC1SF3B1psi-mi:“MI:0914”(association)0.790
SNRPEGEMIN2psi-mi:“MI:0914”(association)0.770
RBM17U2SURPpsi-mi:“MI:0914”(association)0.740
CCDC97SF3B1psi-mi:“MI:0914”(association)0.730
PHF5ASF3B1psi-mi:“MI:0914”(association)0.730
SF3A1SF1psi-mi:“MI:0407”(direct interaction)0.710
SNRPD2GEMIN2psi-mi:“MI:0914”(association)0.710
SNRPGGEMIN2psi-mi:“MI:0914”(association)0.710
SNRPBPRMT5psi-mi:“MI:0914”(association)0.670
QPRTPIK3C2Apsi-mi:“MI:0914”(association)0.640
SNRPA1HTATSF1psi-mi:“MI:0914”(association)0.640
NCBP1KPNA3psi-mi:“MI:0914”(association)0.640
DNAJC8SF3B1psi-mi:“MI:0914”(association)0.640
SF3B1SAP18psi-mi:“MI:0914”(association)0.640
SNRPA1U2SURPpsi-mi:“MI:0914”(association)0.640

BioGRID (735): SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-RNA), SF3A1 (Affinity Capture-RNA), SF3A1 (Affinity Capture-RNA), SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-MS), SF3A1 (Affinity Capture-MS)

ESM2 similar proteins: A0A0A2J9B3, A0A0K0QSV4, A0A9P4XWM4, A1CT57, A1DMG9, A7E8B6, A8JPF9, A8NYG2, A9LNK9, E5AD52, E7F1H9, G4MRQ6, G4NF05, M2SQ20, N1PJ97, O42632, P0C5H8, P0DPB0, P38093, P59326, P87253, Q05534, Q0U086, Q15459, Q1K6U0, Q2UNX4, Q2UPS5, Q4R1B9, Q4WN42, Q4WVG0, Q4WXX4, Q4X228, Q5B3I8, Q5RFL8, Q6DDU9, Q6L612, Q7RZQ3, Q7Z739, Q8BYK6, Q8K4Z5

Diamond homologs: A2VDN6, B9DHA6, G1SK22, O13900, O14399, O15042, P05759, P0C016, P0C030, P0C031, P0C032, P0C073, P0C8R3, P0CG71, P0CG73, P0CG81, P0CG82, P0CG83, P0CG84, P0CG85, P0CG86, P0CG87, P0CH04, P0CH05, P0CH10, P0CH11, P0CH27, P0CH32, P0CH33, P0CH34, P0CH35, P0DJ25, P12297, P14795, P14799, P15357, P19848, P23324, P23398, P27923

SIGNOR signaling

2 interactions.

AEffectBMechanism
SF3A1“form complex”SF3abinding
SF3A1“form complex”“U2 snRNP complex”binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 197 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Metabolism of non-coding RNA727.9×2e-07
mRNA Splicing - Minor Pathway1622.5×4e-16
mRNA Splicing2920.0×5e-28
mRNA Polyadenylation3418.8×6e-32
Processing of Capped Intron-Containing Pre-mRNA3116.0×7e-27
mRNA Splicing - Major Pathway4615.8×2e-40
CHD1 and CHD2 subfamily2315.7×2e-19
RNA Polymerase II Transcription Termination1013.8×2e-07

GO biological processes:

GO termPartnersFoldFDR
U2-type prespliceosome assembly1757.7×1e-24
spliceosomal complex assembly1032.7×6e-11
spliceosomal snRNP assembly928.4×4e-09
regulation of mRNA splicing, via spliceosome524.1×2e-04
RNA splicing, via transesterification reactions723.7×2e-06
mRNA splicing, via spliceosome4019.9×6e-38
RNA splicing2411.5×3e-16
mRNA processing187.7×4e-09

Disease & clinical

Clinical variants and AI predictions

ClinVar

55 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance41
Likely benign1
Benign1

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
2671887NM_005877.6(SF3A1):c.1641del (p.Ser548fs)Pathogenic

SpliceAI

1952 predictions. Top by Δscore:

VariantEffectΔscore
22:30334692:TACC:Tacceptor_gain1.0000
22:30334693:ACC:Aacceptor_gain1.0000
22:30334694:CC:Cacceptor_gain1.0000
22:30334694:CCC:Cacceptor_gain1.0000
22:30334695:CC:Cacceptor_gain1.0000
22:30334696:C:Aacceptor_loss1.0000
22:30334696:C:CCacceptor_gain1.0000
22:30334696:C:Tacceptor_gain1.0000
22:30334697:T:Aacceptor_loss1.0000
22:30335463:TCA:Tdonor_loss1.0000
22:30335464:CAC:Cdonor_loss1.0000
22:30335465:A:ACdonor_gain1.0000
22:30335465:ACC:Adonor_loss1.0000
22:30335466:C:CCdonor_gain1.0000
22:30335466:C:Tdonor_loss1.0000
22:30335466:CCT:Cdonor_gain1.0000
22:30335539:C:CGacceptor_loss1.0000
22:30335540:T:Aacceptor_loss1.0000
22:30335648:ATACC:Adonor_loss1.0000
22:30335650:A:Tdonor_loss1.0000
22:30335651:C:CAdonor_loss1.0000
22:30335752:CC:Cacceptor_gain1.0000
22:30335753:CC:Cacceptor_gain1.0000
22:30335754:C:CCacceptor_gain1.0000
22:30337021:CATA:Cdonor_gain1.0000
22:30337024:A:ACdonor_gain1.0000
22:30337024:AC:Adonor_gain1.0000
22:30337024:ACC:Adonor_loss1.0000
22:30337025:C:CAdonor_gain1.0000
22:30337025:CC:Cdonor_gain1.0000

AlphaMissense

5226 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
22:30334607:C:TG790E1.000
22:30334610:C:TG789D1.000
22:30334611:C:GG789R1.000
22:30334612:T:AR788S1.000
22:30334612:T:GR788S1.000
22:30334613:C:AR788I1.000
22:30334613:C:GR788T1.000
22:30334618:C:AK786N1.000
22:30334618:C:GK786N1.000
22:30334620:T:CK786E1.000
22:30334628:A:GL783P1.000
22:30334664:G:TA771D1.000
22:30334679:T:AD766V1.000
22:30334679:T:GD766A1.000
22:30334680:C:GD766H1.000
22:30334681:T:AK765N1.000
22:30334681:T:GK765N1.000
22:30334682:T:AK765I1.000
22:30334683:T:CK765E1.000
22:30334687:G:CF763L1.000
22:30334687:G:TF763L1.000
22:30334689:A:GF763L1.000
22:30335477:A:CL757R1.000
22:30335477:A:GL757P1.000
22:30335477:A:TL757Q1.000
22:30335479:C:AK756N1.000
22:30335479:C:GK756N1.000
22:30335481:T:CK756E1.000
22:30335482:C:AQ755H1.000
22:30335482:C:GQ755H1.000

dbSNP variants (sampled 300 via entrez): RS1000043391 (22:30345911 G>A), RS1000053334 (22:30346149 G>A,C), RS1000137910 (22:30346214 T>C), RS1000177740 (22:30339837 G>C), RS1000224841 (22:30332834 G>A,C), RS1000360049 (22:30333192 G>C), RS1000361607 (22:30333674 A>G), RS1000613153 (22:30339529 G>A), RS1000637672 (22:30339796 G>A,T), RS1000937942 (22:30356181 C>T), RS1000992619 (22:30350808 G>T), RS1001048914 (22:30344548 T>C), RS1001377847 (22:30358647 C>T), RS1001411936 (22:30358411 C>T), RS1001435280 (22:30350965 G>A,C)

Disease associations

OMIM: gene MIM:605595 | disease phenotypes:

GenCC curated gene-disease

Mondo (1): diffuse large B-cell lymphoma (MONDO:0018905)

Orphanet (1): Diffuse large B-cell lymphoma (Orphanet:544)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

3 associations (top):

StudyTraitp-value
GCST005851_15Delirium1.000000e-06
GCST008794_13Urinary albumin-to-creatinine ratio2.000000e-08
GCST90002382_502Eosinophil percentage of white cells5.000000e-10

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0007778urinary albumin to creatinine ratio
EFO:0007991eosinophil percentage of leukocytes

MeSH disease descriptors (1)

DescriptorNameTree numbers
D016403Lymphoma, Large B-Cell, DiffuseC04.557.386.480.150.585; C15.604.515.569.480.150.585; C20.683.515.761.480.150.585

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL5725108 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

2 potent at pChembl≥5 of 2 total, top 2 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
7.57Kd26.84nMCHEMBL5653589
7.57ED5026.84nMCHEMBL5653589

PubChem BioAssay actives

1 with measured affinity, of 8 total; 1 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
4-methyl-3-[(2-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2149382: Binding affinity to human SF3A1 incubated for 45 mins by Kinobead based pull down assaykd0.0268uM

CTD chemical–gene interactions

44 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
methylmercuric chlorideincreases expression, affects cotreatment3
sodium arseniteincreases abundance, increases expression3
Cyclosporineincreases expression, increases methylation2
Cadmium Chloridedecreases expression, increases abundance2
GSK-J4increases expression1
FR900359increases phosphorylation1
TAK-243decreases sumoylation1
dicrotophosincreases expression1
triphenyl phosphateaffects expression1
deoxynivalenolincreases expression1
pyrogallol 1,3-dimethyl etheraffects cotreatment, decreases expression, affects localization, increases expression1
butyraldehydedecreases expression1
coumarinincreases phosphorylation1
yessotoxindecreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideincreases expression, affects cotreatment1
nutlin 3increases secretion, affects cotreatment1
abrineincreases expression1
bromovaninincreases expression1
dorsomorphinaffects cotreatment, increases expression1
bisphenol Sincreases expression1
10-(octyloxy)decyl-2-(trimethylammonium)ethyl phosphatedecreases expression1
PP242increases expression1
bisphenol AFincreases expression1
Resveratrolaffects cotreatment, decreases expression1
Acetaminophenincreases expression1
Arsenicincreases abundance, increases expression1
Atrazinedecreases expression1
Benzo(a)pyreneaffects methylation1
Cadmiumdecreases expression, increases abundance1
Caffeineincreases phosphorylation1

ChEMBL screening assays

7 unique, capped per target: 7 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5652424BindingBinding affinity to human SF3A1 incubated for 45 mins by Kinobead based pull down assayNVP-BHG712: Effects of Regioisomers on the Affinity and Selectivity toward the EPHrin Family. — ChemMedChem

Clinical trials (associated diseases)

300 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00466258PHASE4COMPLETEDLINFOTARGAM: Treatment With Chemotherapy Plus Rituximab and Highly Active Antiretroviral Therapy in Patients With Diffuse Large B Cell Lymphoma and Infection With the Human Immunodeficiency Virus (HIV)
NCT01949818PHASE4UNKNOWNTreatment of Diffuse Large B Cell Lymphoma
NCT02752815PHASE4UNKNOWNReduced Chemotherapy in Low Risk DLBCL
NCT03376958PHASE4COMPLETEDApatinib for Relapsed and Refractory Diffuse Large B Cell Lymphoma
NCT03513601PHASE4UNKNOWNTreatment of Elderly Patients With Diffuse Large B-cell Lymphoma
NCT03579082PHASE4UNKNOWNA Clinical Trial of Decitabine in Relapse and Refractory Diffuse Large B Cell Lymphoma
NCT05108805PHASE4COMPLETEDChimeric Antigen Receptor (CAR) T Cell Therapy With YESCARTA in the Outpatient Setting
NCT05518383PHASE4RECRUITINGB-cell Mature Non-Hodgkin’s Lymphoma Treatment Protocol in Children and Adolescents 2021
NCT00075478PHASE3COMPLETEDTotal-Body Irradiation With or Without Fludarabine Phosphate Followed By Donor Stem Cell Transplant in Treating Patients With Hematologic Cancer
NCT00355199PHASE3COMPLETEDComparison of HD Chemotherapy Followed by Auto-transplant and R-CHOP in High Risk Patients With DLBCL.
NCT00400478PHASE3COMPLETEDA Multicentre, Randomized Phase III Study of Rituximab as Maintenance Treatment Versus Observation in Patients With Aggressive B-cell Lymphoma: NHL-13
NCT00499018PHASE3UNKNOWNDose Dense Chemotherapy + Rituximab +/-Intensified High Dose Chemoimmunotherapy With Support of Peripheral Autologous Stem Cell in Diffuse Large B-Cell Lymphoma
NCT00790036PHASE3COMPLETEDPhase III Study of RAD001 Adjuvant Therapy in Poor Risk Patients With Diffuse Large B-Cell Lymphoma (DLBCL) of RAD001 Versus Matching Placebo After Patients Have Achieved Complete Response With First-line Rituximab-chemotherapy
NCT00846157PHASE3UNKNOWNBiocell Natural Killer Mixture in Diffuse Large B Cell Lymphoma (DLBCL) Patients
NCT01122472PHASE3COMPLETEDStudy of Lenalidomide Maintenance Versus Placebo in Responding Elderly Patients With DLBCL and Treated With R-CHOP
NCT01148446PHASE3COMPLETEDR-CHOP Versus R-mini-CEOP in Elderly Patients(>65)With DLBCL
NCT01231412PHASE3COMPLETEDGraft-Versus-Host Disease Prophylaxis in Treating Patients With Hematologic Malignancies Undergoing Unrelated Donor Peripheral Blood Stem Cell Transplant
NCT01285765PHASE3COMPLETEDEvaluate a Treatment Adapted to the PET Response Compared to a Standard Treatment, for Low Risk DLBCL CD 20+ Patients
NCT01287741PHASE3TERMINATEDA Study of Obinutuzumab in Combination With CHOP Chemotherapy Versus Rituximab With CHOP in Participants With CD20-Positive Diffuse Large B-Cell Lymphoma (GOYA)
NCT01321541PHASE3COMPLETEDComparison of Pixantrone + Rituximab With Gemcitabine + Rituximab in Patients With Aggressive B-cell Non-Hodgkin Lymphoma or Follicular Grade 3 Lymphoma Who Have Relapsed After Therapy and Are Not Eligible for Stem Cell Transplant
NCT01459887PHASE3COMPLETEDStudy of Recombinant Human-Mouse Chimeric Anti-CD20 Monoclonal Antibody to Treat Non-hodgkin’s Lymphoma
NCT01510184PHASE3TERMINATEDStudy of Zevalin Versus Observation in Participants at Least 60 Years Old With Newly Diagnosed Diffuse Large B-cell Lymphoma in Positron Emission Tomography (PET)-Negative Complete Remission After Rituximab-Cyclophosphamide, Doxorubicin, Vincristine, and Prednisone (R-CHOP) or R-CHOP-like Therapy
NCT01804686PHASE3RECRUITINGA Long-term Extension Study of PCI-32765 (Ibrutinib)
NCT01852435PHASE3UNKNOWNR-CEOP-90/R-CEOP-70 Versus R-CHOP-50 in the Treatment of Diffuse Large B-cell Lymphoma and Follicular Lymphoma Grade 3B
NCT02054559PHASE3WITHDRAWNR-CHOP Alone vs. R-CHOP Plus Radiotherapy for Localized CD20+ DLBCL
NCT02128061PHASE3COMPLETEDEfficacy of Lenalidomide in Combination With Subcutaneous Rituximab + miniCHOP in DLBCL Patients of 80 y/o or+
NCT02268045PHASE3COMPLETEDStudy of RTXM83 Plus CHOP Chemotherapy Versus a Rituximab Plus CHOP Therapy in Patients With Non Hodgkin’s Lymphoma
NCT02366663PHASE3TERMINATEDBEAM vs. 90-Yttrium Ibritumomab Tiuxetan (Zevalin®)/BEAM With ASCT for Relapsed DLBCL
NCT02449265PHASE3UNKNOWNEfficacy of Consolidative Involved-site Radiotherapy for Patients With Limited-stage Diffuse Large B-cell Lymphoma
NCT02449278PHASE3UNKNOWNThe Palliative Benefit of Involved-site Radiotherapy for Patients With Advanced-stage Diffuse Large B-cell Lymphoma
NCT02531841PHASE3UNKNOWNHigh-dose Chemotherapy and ASCT or Consolidating Conventional Chemotherapy in Primary CNS Lymphoma
NCT02617485PHASE3COMPLETEDMabionCD20 Compared to MabThera in Lymphoma Patients
NCT02767674PHASE3UNKNOWNTrial of R-GemOx Versus R-miniCHOP Regimen in First-line Treatment of Elderly Diffuse Large B Cell Lymphoma
NCT02772822PHASE3UNKNOWNA Study Comparing the Efficiency and Safety of S-CHOP(Cyclophosphamide, Hydroxydaunomycin, Oncovin, and Prednisone) Versus R-CHOP in Untreated CD20(Cluster of Differentiation Antigen 20)-Positive DLBCL Patients
NCT02777736PHASE3UNKNOWNCNS Prophylaxis in Diffuse Large B-cell Lymphoma
NCT02842931PHASE3UNKNOWNR-Dose-adjusted (DA) - EPOCH-21 Versus R-modified Non-Hodgkin Lymphoma (NHL)-Berlin-Frankfurt-Munster (BFM)-90 Program (mNHL-BFM-90) and Autologous Stem Cells Transplantation (Auto-SCT) in DLBCL With Poor Prognosis
NCT02951156PHASE3TERMINATEDAvelumab In Combination Regimens That Include An Immune Agonist, Epigenetic Modulator, CD20 Antagonist and/or Conventional Chemotherapy in Patients With Relapsed or Refractory Diffuse Large B-cell Lymphoma (R/R DLBCL)
NCT03123718PHASE3UNKNOWNHigh-dose Intravenous Methotrexate Versus Intrathecal Methotrexate for Central Nervous System Prophylaxis in DLBCL
NCT03151044PHASE3UNKNOWNR±CEOP90 Versus R±CEOP75 in Newly Diagnosed Young Patients With Medium/High-risk DLBCL
NCT03213977PHASE3UNKNOWNR-DA-EDOCH Versus R-CEOP90, With/Without Upfront Auto-HSCT in Young Patients With High-risk DLBCL
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): delirium, diffuse large B-cell lymphoma