SUGP2

gene
On this page

Also known as KIAA0365

Summary

SUGP2 (SURP and G-patch domain containing 2, HGNC:18641) is a protein-coding gene on chromosome 19p13, encoding SURP and G-patch domain-containing protein 2 (Q8IX01). May play a role in mRNA splicing.

This gene encodes a member of the arginine/serine-rich family of splicing factors. The encoded protein functions in mRNA processing. Alternatively spliced transcript variants have been described.

Source: NCBI Gene 10147 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 115 total
  • MANE Select transcript: NM_001017392

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:18641
Approved symbolSUGP2
NameSURP and G-patch domain containing 2
Location19p13
Locus typegene with protein product
StatusApproved
AliasesKIAA0365
Ensembl geneENSG00000064607
Ensembl biotypeprotein_coding
OMIM607993
Entrez10147

Gene structure

Transcript identifiers

Ensembl transcripts: 38 — 28 protein_coding, 6 nonsense_mediated_decay, 4 protein_coding_CDS_not_defined

ENST00000330854, ENST00000337018, ENST00000452918, ENST00000593795, ENST00000594445, ENST00000594773, ENST00000597095, ENST00000597163, ENST00000597280, ENST00000598202, ENST00000598240, ENST00000598863, ENST00000600239, ENST00000600377, ENST00000601879, ENST00000867061, ENST00000867062, ENST00000867063, ENST00000867064, ENST00000867065, ENST00000867066, ENST00000867067, ENST00000867068, ENST00000867069, ENST00000867070, ENST00000928453, ENST00000928454, ENST00000928455, ENST00000928456, ENST00000928457, ENST00000953719, ENST00000953720, ENST00000953721, ENST00000953722, ENST00000953723, ENST00000953724, ENST00000953725, ENST00000953726

RefSeq mRNA: 6 — MANE Select: NM_001017392 NM_001017392, NM_001321697, NM_001321698, NM_001321699, NM_001352071, NM_014884

CCDS: CCDS12392, CCDS82322

Canonical transcript exons

ENST00000452918 — 11 exons

ExonStartEnd
ENSE000011226031902461919026226
ENSE000030917911899089318993740
ENSE000031916761903343719033517
ENSE000034748141899514418995280
ENSE000035696621901910919019229
ENSE000036083251900416819004646
ENSE000036190201900831719008428
ENSE000036269141903095119031082
ENSE000036428801900161319001674
ENSE000036591281900985519010342
ENSE000036818541899436618994486

Expression profiles

Bgee: expression breadth ubiquitous, 294 present calls, max score 98.50.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 20.9635 / max 227.7979, expressed in 1810 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
18011917.85711806
1801203.09021309
2087370.01626

Top tissues by expression

295 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
cerebellar hemisphereUBERON:000224598.50gold quality
left testisUBERON:000453398.50gold quality
cerebellar cortexUBERON:000212998.47gold quality
right testisUBERON:000453498.47gold quality
right hemisphere of cerebellumUBERON:001489098.47gold quality
cerebellumUBERON:000203798.28gold quality
primary visual cortexUBERON:000243698.10gold quality
middle temporal gyrusUBERON:000277197.66gold quality
adenohypophysisUBERON:000219697.60gold quality
occipital lobeUBERON:000202197.48gold quality
cortical plateUBERON:000534397.40gold quality
ventricular zoneUBERON:000305397.39gold quality
right frontal lobeUBERON:000281097.37gold quality
pituitary glandUBERON:000000797.30gold quality
left lobe of thyroid glandUBERON:000112097.30gold quality
Brodmann (1909) area 9UBERON:001354097.13gold quality
Brodmann (1909) area 23UBERON:001355497.11gold quality
testisUBERON:000047397.09gold quality
right lobe of thyroid glandUBERON:000111997.05gold quality
ganglionic eminenceUBERON:000402396.93gold quality
lateral nuclear group of thalamusUBERON:000273696.88gold quality
paraflocculusUBERON:000535196.86gold quality
thyroid glandUBERON:000204696.84gold quality
nucleus accumbensUBERON:000188296.83gold quality
dorsolateral prefrontal cortexUBERON:000983496.81gold quality
caudate nucleusUBERON:000187396.67gold quality
adult organismUBERON:000702396.65gold quality
putamenUBERON:000187496.58gold quality
Brodmann (1909) area 10UBERON:001354196.55gold quality
middle frontal gyrusUBERON:000270296.54gold quality

Single-cell (SCXA)

Detected in 5 experiment(s), a significant marker in 4.

ExperimentMarker?Max mean expression
E-CURD-79yes1154.65
E-MTAB-8060yes55.60
E-CURD-112yes7.16
E-ANND-3yes6.79
E-CURD-135no705.55

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

73 targeting SUGP2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-4476100.0068.182030
HSA-MIR-6876-5P100.0067.682126
HSA-MIR-4481100.0066.421669
HSA-MIR-366299.9973.825684
HSA-MIR-513B-5P99.9969.962150
HSA-MIR-33A-5P99.9968.621055
HSA-MIR-33B-5P99.9968.581062
HSA-MIR-196A-1-3P99.9972.152772
HSA-MIR-453199.9969.703181
HSA-MIR-9983-3P99.9471.483631
HSA-MIR-6835-3P99.9370.492904
HSA-MIR-329-3P99.9166.561234
HSA-MIR-362-3P99.9166.381267
HSA-MIR-498-3P99.9171.271114
HSA-MIR-129799.9173.413162
HSA-MIR-4731-5P99.8967.232537
HSA-MIR-548E-5P99.8972.734486
HSA-MIR-129-5P99.8870.263273
HSA-MIR-3180-5P99.8269.122422
HSA-MIR-432099.7565.80793
HSA-MIR-3680-3P99.7572.513095
HSA-MIR-442899.7366.411733
HSA-MIR-471999.7372.103329
HSA-MIR-4524A-3P99.7266.852406
HSA-MIR-442299.7272.072908
HSA-MIR-430699.7270.503630
HSA-MIR-3059-5P99.7069.932491
HSA-MIR-3158-5P99.6567.511763
HSA-MIR-451699.6167.783390
HSA-MIR-426199.5970.303415

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriosugp1ENSDARG00000075936
mus_musculusSugp2ENSMUSG00000036054
rattus_norvegicusSugp2ENSRNOG00000020269
drosophila_melanogasterCG31550FBGN0051550

Paralogs (1): SUGP1 (ENSG00000105705)

Protein

Protein identifiers

SURP and G-patch domain-containing protein 2Q8IX01 (reviewed: Q8IX01)

Alternative names: Arginine/serine-rich-splicing factor 14, Splicing factor, arginine/serine-rich 14

All UniProt accessions (7): Q8IX01, M0R065, M0R1Y3, M0R216, M0R2Y3, M0R2Z9, M0R3F6

UniProt curated annotations — full annotation on UniProt →

Function. May play a role in mRNA splicing.

Subcellular location. Nucleus.

Tissue specificity. Detected in adult testis, and in fetal brain and kidney.

Isoforms (3)

UniProt IDNamesCanonical?
Q8IX01-11yes
Q8IX01-33
Q8IX01-44

RefSeq proteins (6): NP_001017392, NP_001308626, NP_001308627, NP_001308628, NP_001339000, NP_055699 (=MANE)

Domains & families (InterPro)

IDNameType
IPR000061SurpDomain
IPR000467G_patch_domDomain
IPR035967SWAP/Surp_sfHomologous_superfamily
IPR040169SUGP1/2Family

Pfam: PF01585, PF01805

UniProt features (43 total): modified residue 11, compositionally biased region 6, sequence variant 5, region of interest 5, helix 4, cross-link 3, repeat 2, splice variant 2, chain 1, domain 1, turn 1, strand 1, short sequence motif 1

Structure

Experimental structures (PDB)

1 structures.

PDBMethodResolution (Å)
1X4PSOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8IX01-F153.860.06

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (14): 7, 96, 224, 275, 277, 315, 573, 603, 754, 757, 863, 228, 305, 650

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 139 (showing top): GGTGTGT_MIR329, MODULE_255, MORF_SNRP70, MODULE_317, MITSIADES_RESPONSE_TO_APLIDIN_DN, MYCMAX_01, GOBP_RNA_SPLICING, P300_01, BILBAN_B_CLL_LPL_DN, JAZAERI_BREAST_CANCER_BRCA1_VS_BRCA2_DN, GINESTIER_BREAST_CANCER_ZNF217_AMPLIFIED_DN, GOCC_NUCLEAR_BODY, MODULE_69, chr19p13, KIM_WT1_TARGETS_DN

GO Biological Process (3): mRNA processing (GO:0006397), RNA splicing (GO:0008380), RNA processing (GO:0006396)

GO Molecular Function (3): RNA binding (GO:0003723), nucleic acid binding (GO:0003676), protein binding (GO:0005515)

GO Cellular Component (3): nucleoplasm (GO:0005654), nuclear body (GO:0016604), nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA processing2
binding2
mRNA metabolic process1
gene expression1
RNA biosynthetic process1
primary metabolic process1
nucleic acid binding1
nuclear lumen1
cellular anatomical structure1
nucleoplasm1
intracellular membraneless organelle1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

898 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SUGP2MATR3P43243655
SUGP2HNRNPMP52272633
SUGP2SCAF11Q99590506
SUGP2GIN1Q9NXP7492
SUGP2SFSWAPQ12872479
SUGP2ZBED11P0CF97462
SUGP2ANKRD10Q9NXR5455
SUGP2UBXN8O00124444
SUGP2SUGP1Q8IWZ8436
SUGP2OR10S1Q8NGN2379
SUGP2SRSF7Q16629368
SUGP2DACT3Q96B18365
SUGP2C1orf50Q9BV19357
SUGP2SRSF11Q05519352
SUGP2GPALPP1Q8IXQ4348

IntAct

167 interactions, top by confidence:

ABTypeScore
CDK8MED19psi-mi:“MI:2364”(proximity)0.850
TUBG1TUBG1psi-mi:“MI:2364”(proximity)0.760
CFTRESYT2psi-mi:“MI:2364”(proximity)0.710
LIN28AIGF2BP3psi-mi:“MI:0914”(association)0.640
AURKBSEC16Apsi-mi:“MI:2364”(proximity)0.570
BCORCBX4psi-mi:“MI:0914”(association)0.530
CLEC11AVWA8psi-mi:“MI:0914”(association)0.530
ELAVL2IGF2BP3psi-mi:“MI:0914”(association)0.530
ILF2IGF2BP3psi-mi:“MI:0914”(association)0.530
GSPT2IGF2BP3psi-mi:“MI:0914”(association)0.530
HNRNPCCASC3psi-mi:“MI:0914”(association)0.530
PRR3MRPS14psi-mi:“MI:0914”(association)0.530
CPSF6DDX39Apsi-mi:“MI:0914”(association)0.480
PTBP3HNRNPCpsi-mi:“MI:0914”(association)0.480
DDX21MED19psi-mi:“MI:2364”(proximity)0.480
SUGP2HMGN1psi-mi:“MI:0915”(physical association)0.400
COL5A1SUGP2psi-mi:“MI:0915”(physical association)0.400
SUGP2BLKpsi-mi:“MI:0915”(physical association)0.400
GSK3ASUGP2psi-mi:“MI:0915”(physical association)0.370
LZTR1SUGP2psi-mi:“MI:0915”(physical association)0.370
SetZKSCAN1psi-mi:“MI:0914”(association)0.350
MATR3BCLAF3psi-mi:“MI:0914”(association)0.350
HNRNPA1MATR3psi-mi:“MI:0914”(association)0.350
Ptbp1MATR3psi-mi:“MI:0914”(association)0.350
Pcdh1SRIpsi-mi:“MI:0914”(association)0.350
DDX41DDX39Apsi-mi:“MI:0914”(association)0.350

BioGRID (244): SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Proximity Label-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS), SUGP2 (Affinity Capture-MS)

ESM2 similar proteins: A1KXM5, A6NFR6, A6QQS3, A7KBS4, F1MJR8, O14513, O35668, O60284, P0C671, P54256, P54257, Q05860, Q0P670, Q12774, Q1RMQ5, Q1RMX6, Q2HR82, Q32LN6, Q32MG2, Q3B8N5, Q3URK1, Q3V0M2, Q5BI31, Q5DU28, Q63449, Q68FQ8, Q6AXV6, Q6PEX7, Q6ZPK7, Q7Z2Y5, Q80YD3, Q8BII1, Q8BUY8, Q8IWB6, Q8IX01, Q8IYM0, Q8IZ16, Q8N4L4, Q8NA61, Q8NDB6

Diamond homologs: A0JMV4, A4IGK4, A4L691, A5DSB5, B2GV05, P15771, P52756, P70501, P98175, Q0IIX9, Q17784, Q1RMU5, Q5ZII9, Q66J74, Q68FU8, Q6C233, Q6DDU9, Q7TN31, Q7TQC7, Q8CH02, Q8CH09, Q8IWZ8, Q8IX01, Q8N302, Q91YE7, Q94C11, Q96BK5, Q99KG3, Q9CZX5, Q9NW75, Q9UTK6, P78332, A0A0R4IEW8, A4FV72, A4RHN3, A5A6M3, A5E1Z4, A6NDE4, A6NEQ0, A7SKE9

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 170 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
mRNA 3’-end processing713.8×7e-05
Transport of Mature mRNA derived from an Intron-Containing Transcript710.7×3e-04
mRNA Polyadenylation108.8×3e-05
mRNA Splicing77.7×2e-03
Processing of Capped Intron-Containing Pre-mRNA97.4×3e-04
mRNA Splicing - Major Pathway137.1×2e-05
Major pathway of rRNA processing in the nucleolus and cytosol106.2×3e-04
Dengue Virus-Host Interactions135.9×3e-05

GO biological processes:

GO termPartnersFoldFDR
negative regulation of mRNA splicing, via spliceosome527.0×2e-04
ribosomal large subunit biogenesis515.6×2e-03
mRNA stabilization615.5×3e-04
regulation of alternative mRNA splicing, via spliceosome712.0×3e-04
mRNA transport611.1×2e-03
mRNA processing147.8×3e-06
mRNA splicing, via spliceosome127.7×2e-05
RNA splicing127.5×2e-05

Disease & clinical

Clinical variants and AI predictions

ClinVar

115 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance93
Likely benign4
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

3029 predictions. Top by Δscore:

VariantEffectΔscore
19:18994361:CATA:Cdonor_loss1.0000
19:18994362:ATAC:Adonor_loss1.0000
19:18994363:TAC:Tdonor_loss1.0000
19:18994364:ACCT:Adonor_loss1.0000
19:18994484:CCC:Cacceptor_gain1.0000
19:18994485:CC:Cacceptor_gain1.0000
19:18994485:CCC:Cacceptor_gain1.0000
19:18994486:CC:Cacceptor_gain1.0000
19:18994487:C:CCacceptor_gain1.0000
19:18995131:G:GAdonor_gain1.0000
19:18995142:A:ACdonor_gain1.0000
19:18995143:C:CCdonor_gain1.0000
19:18995143:CA:Cdonor_gain1.0000
19:18995143:CACG:Cdonor_gain1.0000
19:18995157:C:CAdonor_gain1.0000
19:18995162:ATGC:Adonor_gain1.0000
19:18995170:T:Adonor_gain1.0000
19:18995195:T:TAdonor_gain1.0000
19:18995199:T:TAdonor_gain1.0000
19:19001675:C:CCacceptor_gain1.0000
19:19009853:A:ACdonor_gain1.0000
19:19009854:C:CCdonor_gain1.0000
19:19012395:T:Adonor_gain1.0000
19:19019227:CAG:Cacceptor_gain1.0000
19:19030980:ACAG:Adonor_gain1.0000
19:19030981:CAGC:Cdonor_gain1.0000
19:19031078:TTGCC:Tacceptor_gain1.0000
19:19031082:CCTA:Cacceptor_gain1.0000
19:19031083:C:CCacceptor_gain1.0000
19:19031083:CT:Cacceptor_loss1.0000

AlphaMissense

0 scored. Top likely-pathogenic:

dbSNP variants (sampled 300 via entrez): RS1000002236 (19:19031960 G>A), RS1000034789 (19:19032282 C>T), RS1000156347 (19:19026112 C>G), RS1000166045 (19:19014619 A>G), RS1000218289 (19:19008690 A>G), RS1000246592 (19:19032645 T>G), RS1000254854 (19:19026302 C>T), RS1000288148 (19:19011113 T>C), RS1000365332 (19:19020376 G>A), RS1000458164 (19:18992658 C>T), RS1000493282 (19:19005181 C>T), RS1000568828 (19:18999825 G>A), RS1000752689 (19:19033831 C>G,T), RS1000784911 (19:19002840 C>T), RS1000800274 (19:19033656 C>G,T)

Disease associations

OMIM: gene MIM:607993 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

49 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, decreases expression, decreases methylation, increases expression5
trichostatin Adecreases expression, affects cotreatment2
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamidedecreases expression, affects cotreatment, increases expression2
FR900359increases phosphorylation1
bisphenol Faffects cotreatment, decreases expression1
testosterone enanthateaffects expression1
methylmercuric chloridedecreases expression1
alpha-pineneaffects cotreatment, increases expression, increases abundance1
bisphenol Aaffects cotreatment, decreases expression1
beta-lapachonedecreases expression1
sodium arsenitedecreases expression1
cobaltous chloridedecreases expression1
butyraldehydedecreases expression1
methacrylaldehydeaffects cotreatment, increases expression, increases abundance1
beta-methylcholineaffects expression1
celastrolincreases expression1
di-n-butylphosphoric acidaffects expression1
AM 251decreases expression1
dorsomorphinaffects cotreatment, decreases expression1
bisphenol Saffects cotreatment, decreases expression1
LDN 193189increases expression, affects cotreatment1
NSC668394decreases expression1
4-(4-((5-(4,5-dimethyl-2-nitrophenyl)-2-furanyl)methylene)-4,5-dihydro-3-methyl-5-oxo-1H-pyrazol-1-yl)benzoic acidincreases expression1
Resveratrolaffects cotreatment, increases expression1
Temozolomideincreases expression1
Decitabineaffects expression1
Leflunomidedecreases expression1
Acroleinaffects cotreatment, increases expression, increases abundance1
Air Pollutantsaffects cotreatment, increases abundance, increases expression1
Benzo(a)pyrenedecreases expression1

Cellosaurus cell lines

1 cell lines: 1 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_B2HVAbcam HeLa SUGP2 KOCancer cell lineFemale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.