SCAF1

gene
On this page

Also known as SR-A1FLJ00034

Summary

SCAF1 (SR-related CTD associated factor 1, HGNC:30403) is a protein-coding gene on chromosome 19q13.3-q13.4, encoding Splicing factor, arginine/serine-rich 19 (Q9H7N4). May function in pre-mRNA splicing.

Enables RNA polymerase II C-terminal domain binding activity. Predicted to be involved in RNA splicing; mRNA processing; and transcription by RNA polymerase II. Predicted to be located in nucleus.

Source: NCBI Gene 58506 — RefSeq curated summary.

At a glance

  • GWAS associations: 5
  • Clinical variants (ClinVar): 276 total
  • MANE Select transcript: NM_021228

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:30403
Approved symbolSCAF1
NameSR-related CTD associated factor 1
Location19q13.3-q13.4
Locus typegene with protein product
StatusApproved
AliasesSR-A1, FLJ00034
Ensembl geneENSG00000126461
Ensembl biotypeprotein_coding
OMIM617264
Entrez58506

Gene structure

Transcript identifiers

Ensembl transcripts: 7 — 7 protein_coding

ENST00000360565, ENST00000595242, ENST00000598359, ENST00000601038, ENST00000892599, ENST00000892600, ENST00000892601

RefSeq mRNA: 1 — MANE Select: NM_021228 NM_021228

CCDS: CCDS33074

Canonical transcript exons

ENST00000360565 — 11 exons

ExonStartEnd
ENSE000003722444964535449645411
ENSE000007209144964610849646202
ENSE000007209194964652649646626
ENSE000007209254964671549646830
ENSE000007209344965434949654431
ENSE000007209394965465249654870
ENSE000010592354964220949642242
ENSE000013751964964502149645134
ENSE000013767014965820849658642
ENSE000013877554965086849653705
ENSE000016926564965776149657889

Expression profiles

Bgee: expression breadth ubiquitous, 182 present calls, max score 94.30.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 15.6290 / max 113.7149, expressed in 1799 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
17703914.52621797
1770410.3992175
1770380.3580177
1770400.3457156

Top tissues by expression

238 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
sural nerveUBERON:001548894.30gold quality
right frontal lobeUBERON:000281092.78gold quality
anterior cingulate cortexUBERON:000983592.28gold quality
gastrocnemiusUBERON:000138891.84gold quality
hindlimb stylopod muscleUBERON:000425291.47gold quality
prefrontal cortexUBERON:000045191.32gold quality
right hemisphere of cerebellumUBERON:001489091.24gold quality
Brodmann (1909) area 9UBERON:001354091.12gold quality
stromal cell of endometriumCL:000225590.97gold quality
apex of heartUBERON:000209890.92gold quality
popliteal arteryUBERON:000225090.81gold quality
tibial arteryUBERON:000761090.79gold quality
muscle of legUBERON:000138390.74gold quality
cortical plateUBERON:000534390.62gold quality
cerebellar hemisphereUBERON:000224590.57gold quality
cerebellar cortexUBERON:000212990.49gold quality
cardiac muscle of right atriumUBERON:000337990.37gold quality
ganglionic eminenceUBERON:000402390.33gold quality
left ventricle myocardiumUBERON:000656690.21silver quality
amygdalaUBERON:000187690.10gold quality
mucosa of stomachUBERON:000119990.01gold quality
lower esophagus muscularis layerUBERON:003583389.97gold quality
ventricular zoneUBERON:000305389.96gold quality
lower esophagusUBERON:001347389.96gold quality
aortaUBERON:000094789.93gold quality
esophagogastric junction muscularis propriaUBERON:003584189.86gold quality
body of uterusUBERON:000985389.83gold quality
adenohypophysisUBERON:000219689.82gold quality
muscle layer of sigmoid colonUBERON:003580589.80gold quality
monocyteCL:000057689.44gold quality

Single-cell (SCXA)

Detected in 3 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.22
E-GEOD-111727no505.03
E-MTAB-7606no217.75

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

27 targeting SCAF1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-433-3P99.9869.371203
HSA-MIR-6825-5P99.9669.813431
HSA-MIR-6721-5P99.9368.922981
HSA-MIR-4728-5P99.8569.394718
HSA-MIR-132199.8465.301811
HSA-MIR-473999.8465.251832
HSA-MIR-4756-5P99.8464.981809
HSA-MIR-6785-5P99.8268.684428
HSA-MIR-3150A-3P99.7664.441640
HSA-MIR-6763-5P99.7664.681767
HSA-MIR-3156-3P99.7666.72939
HSA-MIR-149-3P99.7268.223963
HSA-MIR-33A-3P99.7070.273362
HSA-MIR-7154-5P99.6970.521900
HSA-MIR-6883-5P99.6968.053785
HSA-MIR-613499.6365.681537
HSA-MIR-302B-5P99.5069.491857
HSA-MIR-302D-5P99.5069.341863
HSA-MIR-1207-5P99.4969.112983
HSA-MIR-468899.4864.68828
HSA-MIR-6743-5P99.4863.60721
HSA-MIR-361-3P99.1966.451381
HSA-MIR-6799-5P99.1465.722093
HSA-MIR-4763-3P99.1067.832649
HSA-MIR-423-5P98.6967.481522
HSA-MIR-3184-5P98.5667.131491
HSA-MIR-505-5P97.0165.54778

Literature-anchored findings (GeneRIF, showing 7)

  • This is the first study that examines the expression of the novel gene SR-A1 in colon cancer progression. (PMID:15493872)
  • Our results suggest that SRA1 is associated with cancer progression and may possibly be characterized as a new marker of unfavorable prognosis for ovarian cancer. (PMID:16631123)
  • This study unveils the presence of SR-A1, CD36, and LOX-1 in aortic valves and suggests potential mechanisms by which they may contribute to the pathological angiogenesis, inflammation, calcification, and lipid accumulation in AVS. (PMID:24929820)
  • Study describes the discovery and cloning of fifteen novel splice variants of SCAF1 gene in cancer cells comprising a total of nine novel alternative splicing events between the annotated exons of the gene, producing seven novel SCAF1 transcripts with open-reading frames, and eight novel SCAF1 transcripts with premature termination codons that are likely long non-coding RNAs. Additionally, a novel 3’ UTR was discovered. (PMID:29787824)
  • Analysis of the acrolein-modified sites of apolipoprotein B-100 in LDL. (PMID:32919080)
  • SCAF1 drives the compositional diversity of mammalian respirasomes. (PMID:38575788)
  • In vivo CRISPR screens reveal SCAF1 and USP15 as drivers of pancreatic cancer. (PMID:38902237)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_rerioscaf1ENSDARG00000054010
mus_musculusScaf1ENSMUSG00000038406
rattus_norvegicusScaf1ENSRNOG00000056946
drosophila_melanogasterCG2926FBGN0037344

Paralogs (2): PHRF1 (ENSG00000070047), SCAF11 (ENSG00000139218)

Protein

Protein identifiers

Splicing factor, arginine/serine-rich 19Q9H7N4 (reviewed: Q9H7N4)

Alternative names: SR-related C-terminal domain-associated factor 1, SR-related-CTD-associated factor, Serine arginine-rich pre-mRNA splicing factor SR-A1

All UniProt accessions (4): Q9H7N4, M0R232, M0R2L3, M0R3G4

UniProt curated annotations — full annotation on UniProt →

Function. May function in pre-mRNA splicing.

Subunit / interactions. Interacts with POLR2A.

Subcellular location. Nucleus.

Tissue specificity. Ubiquitous. Highly expressed in fetal brain and liver, poorly expressed in salivary gland, heart, skin and ovary. Expressed in colorectal carcinomas and ovarian cancers. Overexpressed in colorectal carcinomas as compared to normal colonic mucosa.

Induction. Up-regulated by estrogens, androgens and glucocorticoids.

Similarity. Belongs to the splicing factor SR family.

RefSeq proteins (1): NP_067051* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR042841SCAF1Family
IPR057031SFR19-like_CDomain

Pfam: PF23030

UniProt features (66 total): compositionally biased region 29, modified residue 26, region of interest 6, sequence variant 2, chain 1, cross-link 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9H7N4-F147.650.04

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (27): 239, 335, 448, 453, 498, 500, 526, 548, 612, 614, 706, 719, 725, 732, 734, 738, 872, 874, 929, 936 …

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 55 (showing top): TCCCCAC_MIR491, GOBP_RNA_SPLICING, GINESTIER_BREAST_CANCER_ZNF217_AMPLIFIED_DN, GOMF_RNA_POLYMERASE_BINDING, GOMF_RNA_POLYMERASE_CORE_ENZYME_BINDING, GOMF_BASAL_TRANSCRIPTION_MACHINERY_BINDING, WAKABAYASHI_ADIPOGENESIS_PPARG_RXRA_BOUND_8D, WAKABAYASHI_ADIPOGENESIS_PPARG_RXRA_BOUND_WITH_H4K20ME1_MARK, DELACROIX_RARG_BOUND_MEF, GOBP_MRNA_PROCESSING, GOMF_RNA_POLYMERASE_II_C_TERMINAL_DOMAIN_BINDING, GSE14415_NATURAL_TREG_VS_TCONV_DN, ARID5B_TARGET_GENES, BARX1_TARGET_GENES, DLX6_TARGET_GENES

GO Biological Process (3): transcription by RNA polymerase II (GO:0006366), mRNA processing (GO:0006397), RNA splicing (GO:0008380)

GO Molecular Function (4): RNA binding (GO:0003723), protein domain specific binding (GO:0019904), RNA polymerase II C-terminal domain binding (GO:0099122), protein binding (GO:0005515)

GO Cellular Component (1): nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA processing2
DNA-templated transcription1
mRNA metabolic process1
nucleic acid binding1
protein binding1
RNA polymerase II complex binding1
binding1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

1290 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SCAF1POLGP54098580
SCAF1SCAF8Q9UPN6553
SCAF1COX6A1P12074539
SCAF1COX7A2LO14548506
SCAF1SCAF4O95104500
SCAF1COX7A2P14406488
SCAF1ACADSP16219447
SCAF1PRR12Q9ULL5427
SCAF1INSCQ1MX18382
SCAF1SRSF7Q16629347
SCAF1SEPTIN7Q16181319
SCAF1CCNCP24863308
SCAF1SCAIQ8N9R8299
SCAF1LAMB4A4D0S4297
SCAF1INTS15Q96N11290

IntAct

134 interactions, top by confidence:

ABTypeScore
POLR2JPOLR1Cpsi-mi:“MI:0914”(association)0.830
POLR2GRECQL5psi-mi:“MI:0914”(association)0.730
POLR2DMED19psi-mi:“MI:0914”(association)0.730
POLR2EMED19psi-mi:“MI:0914”(association)0.730
RPRD1BPOLR2Dpsi-mi:“MI:0914”(association)0.730
POLR2JPOLR2Dpsi-mi:“MI:0914”(association)0.730
PNNCASC3psi-mi:“MI:0914”(association)0.640
RPL14RRP8psi-mi:“MI:0914”(association)0.640
MEOX2SCAF1psi-mi:“MI:0915”(physical association)0.560
CIB1SCAF1psi-mi:“MI:0915”(physical association)0.560
GSC2SCAF1psi-mi:“MI:0915”(physical association)0.560
MTSS2SCAF1psi-mi:“MI:0915”(physical association)0.560
RTN4IP1SCAF1psi-mi:“MI:0915”(physical association)0.560
H1-6ZNF724psi-mi:“MI:0914”(association)0.530
RRP8NVLpsi-mi:“MI:0914”(association)0.530
ZBTB48ZBTB24psi-mi:“MI:0914”(association)0.530
NRBM47psi-mi:“MI:0914”(association)0.530
RPS3ZNF316psi-mi:“MI:0914”(association)0.530
SNRNP70GTPBP1psi-mi:“MI:0914”(association)0.530
PAIP2BCASC3psi-mi:“MI:0914”(association)0.530
RBM4NVLpsi-mi:“MI:0914”(association)0.530
SRPK2RRP9psi-mi:“MI:0914”(association)0.530

BioGRID (165): SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Biochemical Activity), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS), SCAF1 (Affinity Capture-MS)

ESM2 similar proteins: A6H619, A6NFR6, A8MZF0, B1B0V2, H3BKT1, O08550, O08664, O15054, P08393, P0C674, P0DOD9, P0DPQ3, P14379, P18346, P28284, Q00587, Q0VBZ8, Q17QW1, Q32PF7, Q5NCY0, Q5U2Y8, Q5U4C3, Q5VV67, Q63003, Q63624, Q63625, Q66648, Q66HC8, Q6NZN1, Q6ZRT6, Q80U35, Q80WJ1, Q86UU5, Q86X51, Q8C1R3, Q8V7G4, Q8V7J2, Q8WUZ0, Q914N2, Q91W92

Diamond homologs: A1YVX4, A2A8L1, A2AUY4, A2BIL7, A6H619, A7E320, A8DZJ1, A9LMC0, B6CHA3, B7ZS37, B9RU15, C4QVX6, D3ZD32, E7EZF3, F4I240, F4JYC8, F4KE59, F6UA42, G5EBZ4, O43918, O88379, O94400, O97159, P29375, P41229, P41230, P46605, P47156, P48786, P56163, P58268, P58269, P58270, Q04996, Q09477, Q12830, Q12873, Q14839, Q23541, Q23590

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 131 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Transport of Mature Transcript to Cytoplasm1044.3×3e-13
mRNA 3’-end processing1432.0×4e-16
RNA Polymerase II Transcription Termination1025.5×6e-11
Transport of Mature mRNA derived from an Intron-Containing Transcript1424.8×1e-14
mRNA Splicing1924.3×9e-20
Influenza Viral RNA Transcription and Replication922.6×2e-09
Pausing and recovery of Tat-mediated HIV elongation521.4×3e-05
Tat-mediated HIV elongation arrest and recovery521.4×3e-05

GO biological processes:

GO termPartnersFoldFDR
negative regulation of mRNA splicing, via spliceosome744.7×2e-08
cytoplasmic translation1523.1×6e-14
regulation of alternative mRNA splicing, via spliceosome1122.4×5e-10
mRNA export from nucleus819.7×5e-07
ribosomal small subunit biogenesis713.3×6e-05
negative regulation of translation711.4×2e-04
translation1311.1×2e-08
RNA splicing1511.0×1e-09

Disease & clinical

Clinical variants and AI predictions

ClinVar

276 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance252
Likely benign10
Benign4

Top pathogenic / likely-pathogenic (0)

SpliceAI

0 predictions. Top by Δscore:

AlphaMissense

8316 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
19:49651080:T:CF231L1.000
19:49651082:C:AF231L1.000
19:49651082:C:GF231L1.000
19:49651279:T:CI297T1.000
19:49651291:T:AL301Q1.000
19:49651291:T:CL301P1.000
19:49652467:T:CL693P1.000
19:49652491:T:AI701N1.000
19:49652491:T:CI701T1.000
19:49652503:T:AI705N1.000
19:49652503:T:CI705T1.000
19:49652503:T:GI705S1.000
19:49652893:C:TS835F1.000
19:49652897:G:CK836N1.000
19:49652897:G:TK836N1.000
19:49652908:T:CL840P1.000
19:49652985:T:CF866L1.000
19:49652986:T:CF866S1.000
19:49652987:C:AF866L1.000
19:49652987:C:GF866L1.000
19:49654358:T:AL1109H1.000
19:49654358:T:CL1109P1.000
19:49654361:T:AL1110H1.000
19:49654361:T:CL1110P1.000
19:49654363:T:CF1111L1.000
19:49654364:T:CF1111S1.000
19:49654365:C:AF1111L1.000
19:49654365:C:GF1111L1.000
19:49654370:T:GM1113R1.000
19:49654378:G:CA1116P1.000

dbSNP variants (sampled 300 via entrez): RS1000166903 (19:49649242 G>C), RS1000194092 (19:49648832 A>G,T), RS1000241123 (19:49642436 G>A), RS1000378568 (19:49655662 G>A,C), RS1000539579 (19:49658533 C>G,T), RS1000629462 (19:49649099 A>C), RS1000689819 (19:49642763 T>C,G), RS1000722274 (19:49643519 A>G), RS1000795650 (19:49653899 G>A), RS1001097370 (19:49657426 C>T), RS1001166964 (19:49648184 T>C), RS1001325441 (19:49652665 C>G,T), RS1001511612 (19:49655280 T>A,C), RS1001901816 (19:49656480 A>C), RS1001928509 (19:49651781 G>A)

Disease associations

OMIM: gene MIM:617264 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

5 associations (top):

StudyTraitp-value
GCST002539_91Schizophrenia5.000000e-08
GCST004785_16Vitiligo2.000000e-09
GCST006611_62HDL cholesterol2.000000e-09
GCST006803_99Schizophrenia4.000000e-11
GCST008871_6Basal cell carcinoma3.000000e-08

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004612high density lipoprotein cholesterol measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

39 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arseniteincreases expression2
Air Pollutantsincreases abundance, increases expression, affects expression2
aristolochic acid Idecreases expression, increases expression1
FR900359affects phosphorylation1
bisphenol Faffects cotreatment, increases expression1
dicrotophosincreases expression1
triphenyl phosphateaffects expression1
pirinixic acidaffects binding, decreases expression, increases activity1
bisphenol Aaffects cotreatment, increases methylation1
lead acetateincreases expression1
trichostatin Aaffects expression1
coumarinaffects phosphorylation1
di-n-butylphosphoric acidaffects expression1
CGP 52608increases reaction, affects binding1
perfluoro-n-nonanoic aciddecreases expression1
abrineincreases expression1
Resveratrolaffects cotreatment, decreases expression1
Arsenic Trioxidedecreases expression1
Fulvestrantaffects cotreatment, increases methylation1
Atrazineincreases expression1
Caffeineaffects phosphorylation1
Cisplatindecreases expression1
Dexamethasoneincreases expression, affects cotreatment1
Doxorubicindecreases expression1
Enzyme Inhibitorsdecreases activity, increases O-linked glycosylation1
Indomethacinaffects cotreatment, increases expression1
Ivermectindecreases expression1
Mustard Gasincreases phosphorylation1
Ozoneaffects expression, increases abundance1
Plant Extractsaffects cotreatment, decreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): basal cell carcinoma, vitiligo