C9orf78

gene
On this page

Also known as HSPC220HCA59CSU2

Summary

C9orf78 (chromosome 9 open reading frame 78, HGNC:24932) is a protein-coding gene on chromosome 9q34.11, encoding Splicing factor C9orf78 (Q9NZ63). Plays a role in pre-mRNA splicing by promoting usage of the upstream 3’-splice site at alternative NAGNAG splice sites; these are sites featuring alternative acceptor motifs separated by only a few nucleotides. It is a selective cancer dependency (DepMap: 57.0% of cell lines).

Enables U5 snRNA binding activity. Involved in mRNA cis splicing, via spliceosome and regulation of homologous chromosome segregation. Located in chromosome, centromeric region; cytosol; and nucleoplasm.

Source: NCBI Gene 51759 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 21 total
  • Cancer dependency (DepMap): dependent in 57.0% of screened cell lines
  • MANE Select transcript: NM_016520

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:24932
Approved symbolC9orf78
Namechromosome 9 open reading frame 78
Location9q34.11
Locus typegene with protein product
StatusApproved
AliasesHSPC220, HCA59, CSU2
Ensembl geneENSG00000136819
Ensembl biotypeprotein_coding
OMIM619569
Entrez51759

Gene structure

Transcript identifiers

Ensembl transcripts: 21 — 15 protein_coding, 6 protein_coding_CDS_not_defined

ENST00000372447, ENST00000461349, ENST00000461539, ENST00000461762, ENST00000480023, ENST00000492991, ENST00000495934, ENST00000875653, ENST00000875654, ENST00000875655, ENST00000912502, ENST00000912503, ENST00000912504, ENST00000912505, ENST00000912506, ENST00000912507, ENST00000912508, ENST00000912509, ENST00000950227, ENST00000950228, ENST00000950229

RefSeq mRNA: 1 — MANE Select: NM_016520 NM_016520

CCDS: CCDS6931

Canonical transcript exons

ENST00000372447 — 9 exons

ExonStartEnd
ENSE00001688070129834707129834766
ENSE00001692714129827290129828252
ENSE00001932884129835139129835275
ENSE00003538793129833658129833709
ENSE00003564405129829405129829541
ENSE00003582250129831896129831973
ENSE00003586818129829205129829303
ENSE00003598913129833447129833517
ENSE00003686675129830871129831068

Expression profiles

Bgee: expression breadth ubiquitous, 281 present calls, max score 98.12.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 64.8932 / max 1228.0360, expressed in 1825 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
10276755.78261825
1027669.11061744

Top tissues by expression

287 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
monocyteCL:000057698.12gold quality
calcaneal tendonUBERON:000370197.94gold quality
mononuclear cellCL:000084297.81gold quality
leukocyteCL:000073897.67gold quality
cortical plateUBERON:000534396.41gold quality
C1 segment of cervical spinal cordUBERON:000646995.90gold quality
gastrocnemiusUBERON:000138895.68gold quality
tendonUBERON:000004395.62gold quality
muscle of legUBERON:000138395.47gold quality
anterior cingulate cortexUBERON:000983595.43gold quality
cingulate cortexUBERON:000302795.34gold quality
popliteal arteryUBERON:000225095.29gold quality
tibial arteryUBERON:000761095.29gold quality
descending thoracic aortaUBERON:000234595.17gold quality
aortaUBERON:000094795.09gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099195.07gold quality
stromal cell of endometriumCL:000225595.04gold quality
thoracic aortaUBERON:000151595.04gold quality
mucosa of stomachUBERON:000119995.03gold quality
bloodUBERON:000017894.96gold quality
granulocyteCL:000009494.95gold quality
ascending aortaUBERON:000149694.95gold quality
nucleus accumbensUBERON:000188294.92gold quality
rectumUBERON:000105294.85gold quality
islet of LangerhansUBERON:000000694.84gold quality
right coronary arteryUBERON:000162594.83gold quality
caudate nucleusUBERON:000187394.82gold quality
gall bladderUBERON:000211094.71gold quality
hindlimb stylopod muscleUBERON:000425294.67gold quality
amygdalaUBERON:000187694.59gold quality

Single-cell (SCXA)

Detected in 3 experiment(s), a significant marker in 3.

ExperimentMarker?Max mean expression
E-HCAD-4yes20.45
E-HCAD-9yes6.23
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

35 targeting C9orf78, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-570-3P99.9672.414910
HSA-MIR-495-3P99.9672.814197
HSA-MIR-568899.9673.234504
HSA-MIR-153-5P99.8973.866317
HSA-MIR-30A-3P99.8769.742928
HSA-MIR-30D-3P99.8769.922917
HSA-MIR-30E-3P99.8769.682942
HSA-MIR-4728-5P99.8569.394718
HSA-MIR-576-5P99.8470.462582
HSA-MIR-6785-5P99.8268.684428
HSA-MIR-520F-3P99.8271.321216
HSA-MIR-63699.8069.581500
HSA-MIR-149-3P99.7268.223963
HSA-MIR-4524A-3P99.7266.852406
HSA-MIR-6883-5P99.6968.053785
HSA-MIR-453099.6966.471509
HSA-MIR-46699.6770.852863
HSA-MIR-29899.6367.561916
HSA-MIR-4743-3P99.6268.122095
HSA-MIR-1249-5P99.6166.552049
HSA-MIR-6797-5P99.6166.552084
HSA-MIR-6832-5P99.5864.821132
HSA-MIR-7106-5P99.5367.473574
HSA-MIR-671-5P99.5267.111277
HSA-MIR-5571-5P99.4966.991764
HSA-MIR-6799-5P99.1465.722093
HSA-MIR-7151-3P99.0469.722370
HSA-MIR-474499.0169.911581
HSA-MIR-1207-3P98.9966.221532
HSA-MIR-447398.8969.10652

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 57.0% of screened cell lines.

Literature-anchored findings (GeneRIF, showing 1)

  • C9ORF78 partially localizes to centromeres and plays a role in chromosome segregation. (PMID:35167828)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_rerioC5H9orf78ENSDARG00000002220
mus_musculusBC005624ENSMUSG00000026851
rattus_norvegicusC3h9orf78ENSRNOG00000007532
drosophila_melanogasterCG7974FBGN0035254
caenorhabditis_elegansT23G11.4WBGENE00011969

Protein

Protein identifiers

Splicing factor C9orf78Q9NZ63 (reviewed: Q9NZ63)

Alternative names: Hepatocellular carcinoma-associated antigen 59

All UniProt accessions (1): Q9NZ63

UniProt curated annotations — full annotation on UniProt →

Function. Plays a role in pre-mRNA splicing by promoting usage of the upstream 3’-splice site at alternative NAGNAG splice sites; these are sites featuring alternative acceptor motifs separated by only a few nucleotides. May also modulate exon inclusion events. Plays a role in spliceosomal remodeling by displacing WBP4 from SNRNP200 and may act to inhibit SNRNP200 helicase activity. Binds U5 snRNA. Required for proper chromosome segregation. Not required for splicing of shelterin components.

Subunit / interactions. Component of the spliceosome. Interacts with SNRNP200; the interaction is direct. Interacts with PRPF8.

Subcellular location. Nucleus. Chromosome. Centromere.

Similarity. Belongs to the TLS1 family.

RefSeq proteins (1): NP_057604* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR010756Tls1-likeFamily

Pfam: PF07052

UniProt features (20 total): modified residue 5, region of interest 3, helix 3, mutagenesis site 2, sequence conflict 2, compositionally biased region 2, chain 1, sequence variant 1, strand 1

Structure

Experimental structures (PDB)

3 structures.

PDBMethodResolution (Å)
7OS2ELECTRON MICROSCOPY2.76
8C6JELECTRON MICROSCOPY2.8
9FMDELECTRON MICROSCOPY3.3

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9NZ63-F168.640.07

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (5): 261, 15, 17, 147, 253

Mutagenesis-validated functional residues (2):

PositionPhenotype
8decreases binding to snrnp200.
41abolishes binding to snrnp200.

Function

Pathways and Gene Ontology

Reactome pathways

1 pathways

IDPathway
R-HSA-72163mRNA Splicing - Major Pathway

MSigDB gene sets: 127 (showing top): GOBP_MEIOTIC_CHROMOSOME_SEGREGATION, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_UP, LINDGREN_BLADDER_CANCER_CLUSTER_2A_DN, FERREIRA_EWINGS_SARCOMA_UNSTABLE_VS_STABLE_DN, WEI_MYCN_TARGETS_WITH_E_BOX, GOBP_ORGANELLE_FISSION, GOBP_REGULATION_OF_CHROMOSOME_SEGREGATION, REACTOME_PROCESSING_OF_CAPPED_INTRON_CONTAINING_PRE_MRNA, GOBP_REGULATION_OF_CELL_CYCLE, GOBP_RNA_SPLICING, GOBP_REGULATION_OF_REPRODUCTIVE_PROCESS, YAO_TEMPORAL_RESPONSE_TO_PROGESTERONE_CLUSTER_13, REACTOME_MRNA_SPLICING, GOBP_HOMOLOGOUS_CHROMOSOME_SEGREGATION, GOBP_MEIOTIC_CELL_CYCLE_PROCESS

GO Biological Process (6): mRNA splicing, via spliceosome (GO:0000398), chromosome segregation (GO:0007059), mRNA cis splicing, via spliceosome (GO:0045292), regulation of homologous chromosome segregation (GO:0060629), mRNA processing (GO:0006397), RNA splicing (GO:0008380)

GO Molecular Function (3): U5 snRNA binding (GO:0030623), RNA binding (GO:0003723), protein binding (GO:0005515)

GO Cellular Component (6): chromosome, centromeric region (GO:0000775), nucleus (GO:0005634), nucleoplasm (GO:0005654), spliceosomal complex (GO:0005681), cytosol (GO:0005829), chromosome (GO:0005694)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
mRNA Splicing1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA processing2
cellular anatomical structure2
RNA splicing, via transesterification reactions with bulged adenosine as nucleophile1
mRNA processing1
cell cycle process1
mRNA splicing, via spliceosome1
homologous chromosome segregation1
regulation of chromosome segregation1
regulation of reproductive process1
mRNA metabolic process1
snRNA binding1
nucleic acid binding1
binding1
chromosomal region1
intracellular membrane-bounded organelle1
nuclear lumen1
nuclear protein-containing complex1
ribonucleoprotein complex1
cytoplasm1
intracellular membraneless organelle1

Protein interactions and networks

STRING

2133 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
C9orf78FNBP1Q96RU3491
C9orf78NOSIPQ9Y314455
C9orf78RAI1Q7Z5J4424
C9orf78SYKP43405424
C9orf78FSAF1Q8NDD1420
C9orf78MED22Q15528405
C9orf78TOR1BO14657396
C9orf78SEC63Q9UGP8393
C9orf78USP20Q9Y2K6392
C9orf78HPDLQ96IR7392
C9orf78CWC27Q6UX04392
C9orf78CHP2O43745390
C9orf78DISP3Q9P2K9388
C9orf78TOR1AO14656382
C9orf78DHX34Q14147375

IntAct

43 interactions, top by confidence:

ABTypeScore
PIPTBKBP1psi-mi:“MI:0914”(association)0.530
C9orf78TERF2IPpsi-mi:“MI:0915”(physical association)0.510
H3C1SMCHD1psi-mi:“MI:2364”(proximity)0.410
Prpf8psi-mi:“MI:0915”(physical association)0.400
C9orf78TNNC2psi-mi:“MI:0915”(physical association)0.400
TAB1C9orf78psi-mi:“MI:0915”(physical association)0.370
C9orf78MAPK11psi-mi:“MI:0915”(physical association)0.370
Klc3ZC3HAV1psi-mi:“MI:0914”(association)0.350
MAPTMEX3Apsi-mi:“MI:0914”(association)0.350
SERPINB13TTC4psi-mi:“MI:0914”(association)0.350
NKAPD1CASC3psi-mi:“MI:0914”(association)0.350
NAB2GRNpsi-mi:“MI:0914”(association)0.350
C9orf78CEP120psi-mi:“MI:0914”(association)0.350
ARHGEF19NUP42psi-mi:“MI:0914”(association)0.350
CCP110A2ML1psi-mi:“MI:0914”(association)0.350
CCP110KIF2Apsi-mi:“MI:0914”(association)0.350
C9orf78SNRNP200psi-mi:“MI:0914”(association)0.350
CD2BP2PRPF4psi-mi:“MI:0914”(association)0.350
FAM50ASNRNP200psi-mi:“MI:0914”(association)0.350
JPH3PLEKHG3psi-mi:“MI:0914”(association)0.350
H2AJWDR46psi-mi:“MI:0914”(association)0.350
SERF2WDR46psi-mi:“MI:0914”(association)0.350
ZC3H11ADDX3Ypsi-mi:“MI:0914”(association)0.350
IFI27L1KIF1Cpsi-mi:“MI:0914”(association)0.350
SF3B1RBM10psi-mi:“MI:0914”(association)0.350
SF3B1FAM83Gpsi-mi:“MI:0914”(association)0.350
LMNASMCHD1psi-mi:“MI:2364”(proximity)0.270
H2BC10SMCHD1psi-mi:“MI:2364”(proximity)0.270

BioGRID (662): OCRL (Affinity Capture-MS), CEP120 (Affinity Capture-MS), BTBD1 (Affinity Capture-MS), BTBD2 (Affinity Capture-MS), C9orf78 (Co-fractionation), C9orf78 (Co-fractionation), HARS (Co-fractionation), HNRNPH1 (Co-fractionation), C9orf78 (Proximity Label-MS), C9orf78 (Proximity Label-MS), C9orf78 (Affinity Capture-MS), C9orf78 (Biochemical Activity), C9orf78 (Affinity Capture-MS), EFTUD2 (Affinity Capture-MS), SNRNP200 (Affinity Capture-MS)

ESM2 similar proteins: A4IGY3, A7SD85, A8WMM4, E1C760, F4ICK8, O08837, O15541, O48713, P19351, P92948, Q0JHZ2, Q0VFP5, Q12000, Q16543, Q1RMM1, Q28Y69, Q2KJC1, Q2KJD3, Q3TIV5, Q3TQI7, Q54DA5, Q568A0, Q5BH88, Q5BJP2, Q5EAC6, Q5H7N8, Q5PQS7, Q5RC87, Q5RE65, Q67ER4, Q6A068, Q6DD06, Q6DKE6, Q6NUB2, Q6U6G5, Q7JWR9, Q803J8, Q8GX84, Q8WU90, Q93618

Diamond homologs: F4HVZ5, Q3TQI7, Q5RC87, Q9NZ63

SIGNOR signaling

1 interactions.

AEffectBMechanism
SMURF1unknownC9orf78ubiquitination

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 58 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
mRNA Splicing - Minor Pathway526.7×8e-05
Transport of Mature mRNA derived from an Intron-Containing Transcript518.1×4e-04
mRNA Splicing - Major Pathway1215.6×2e-09
Processing of Capped Intron-Containing Pre-mRNA713.7×6e-05
mRNA Splicing513.1×1e-03
CHD1 and CHD2 subfamily512.9×1e-03
mRNA Polyadenylation612.6×4e-04
Dengue Virus-Host Interactions1112.0×2e-07

GO biological processes:

GO termPartnersFoldFDR
mRNA splicing, via spliceosome915.6×2e-06

Disease & clinical

Clinical variants and AI predictions

ClinVar

21 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance1
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1174 predictions. Top by Δscore:

VariantEffectΔscore
9:129828251:CC:Cacceptor_gain1.0000
9:129828252:CC:Cacceptor_gain1.0000
9:129828252:CCTGA:Cacceptor_loss1.0000
9:129828253:C:CGacceptor_loss1.0000
9:129828254:T:Aacceptor_loss1.0000
9:129829203:A:ACdonor_gain1.0000
9:129829204:C:CCdonor_gain1.0000
9:129829204:CG:Cdonor_gain1.0000
9:129829204:CGCT:Cdonor_gain1.0000
9:129829235:CT:Cdonor_gain1.0000
9:129829299:ATAAA:Aacceptor_gain1.0000
9:129829300:TAAA:Tacceptor_gain1.0000
9:129829301:AAA:Aacceptor_gain1.0000
9:129829301:AAAC:Aacceptor_loss1.0000
9:129829302:AA:Aacceptor_gain1.0000
9:129829302:AAC:Aacceptor_loss1.0000
9:129829303:ACT:Aacceptor_loss1.0000
9:129829304:C:CCacceptor_gain1.0000
9:129829305:T:Cacceptor_loss1.0000
9:129829311:C:CTacceptor_gain1.0000
9:129829311:C:Tacceptor_gain1.0000
9:129829312:A:Tacceptor_gain1.0000
9:129829320:C:CTacceptor_gain1.0000
9:129829399:GCTTA:Gdonor_loss1.0000
9:129829400:CTTAC:Cdonor_loss1.0000
9:129829401:TTA:Tdonor_loss1.0000
9:129829402:TACA:Tdonor_loss1.0000
9:129829403:A:ACdonor_gain1.0000
9:129829403:A:AGdonor_loss1.0000
9:129829403:ACAT:Adonor_gain1.0000

AlphaMissense

1925 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
9:129828182:C:AK283N1.000
9:129828182:C:GK283N1.000
9:129828185:G:CF282L1.000
9:129828185:G:TF282L1.000
9:129828186:A:CF282C1.000
9:129828186:A:GF282S1.000
9:129828187:A:GF282L1.000
9:129828199:G:CH278D1.000
9:129828206:A:CD275E1.000
9:129828206:A:TD275E1.000
9:129828207:T:AD275V1.000
9:129828207:T:CD275G1.000
9:129828207:T:GD275A1.000
9:129828208:C:AD275Y1.000
9:129828208:C:GD275H1.000
9:129828210:G:AT274I1.000
9:129829302:A:CF227L1.000
9:129829302:A:TF227L1.000
9:129829405:A:GF227L1.000
9:129829407:C:GR226T1.000
9:129829424:A:CN220K1.000
9:129829424:A:TN220K1.000
9:129829431:G:TA218D1.000
9:129829436:G:CN216K1.000
9:129829436:G:TN216K1.000
9:129829494:A:GL197P1.000
9:129829502:C:AK194N1.000
9:129829502:C:GK194N1.000
9:129829507:C:GA193P1.000
9:129829524:A:CI187S1.000

dbSNP variants (sampled 300 via entrez): RS1000364254 (9:129835001 G>A,C), RS1000696501 (9:129836376 A>T), RS1000742270 (9:129834042 C>T), RS1000775996 (9:129835221 T>C,G), RS1000798366 (9:129827525 A>G), RS1001088343 (9:129832924 G>C), RS1001420260 (9:129834211 T>G), RS1001643209 (9:129834941 C>A,G,T), RS1002143290 (9:129833856 C>CTACA), RS1002856017 (9:129833223 C>A), RS1002871772 (9:129826879 G>A), RS1002970951 (9:129827211 T>G), RS1002993122 (9:129828805 A>G), RS1003242306 (9:129832288 A>G), RS1003474669 (9:129827437 G>C)

Disease associations

OMIM: gene MIM:619569 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

19 total (human), top 19 by PubMed support.

ChemicalActions (top 5)PubMed papers
aristolochic acid Iincreases expression1
FR900359affects phosphorylation1
dicrotophosdecreases expression1
sodium arseniteincreases expression, increases abundance1
manganese chlorideincreases abundance, increases expression1
abrinedecreases expression1
Sunitinibincreases expression1
Acetaminophenincreases expression1
Arsenicincreases abundance, increases expression1
Atrazinedecreases expression1
Caffeinedecreases phosphorylation1
Cisplatinincreases expression1
Ivermectindecreases expression1
Manganeseincreases abundance, increases expression1
Quercetindecreases phosphorylation1
Smokedecreases expression1
Urethaneincreases expression1
Zincdecreases expression1
Cyclosporineincreases expression1

Cellosaurus cell lines

1 cell lines: 1 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_B1LWAbcam HeLa C9orf78 KOCancer cell lineFemale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.