ZBTB8OS

gene
On this page

Also known as ARCH

Summary

ZBTB8OS (zinc finger and BTB domain containing 8 opposite strand, HGNC:24094) is a protein-coding gene on chromosome 1p35.1, encoding tRNA-splicing ligase-activating factor archease (Q8IWT0). Activation factor of RTCB, the catalytic subunit of the tRNA-splicing ligase complex (tRNA-LC), which is involved in the enzyme-dependent maturation of intron-containing pre-tRNAs. It is a common-essential gene (DepMap: required in 98.6% of cancer cell lines).

Predicted to enable metal ion binding activity. Involved in tRNA splicing, via endonucleolytic cleavage and ligation. Part of tRNA-splicing ligase complex.

Source: NCBI Gene 339487 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 29 total
  • Druggable target: yes
  • Cancer dependency (DepMap): dependent in 98.6% of screened cell lines (common-essential)
  • MANE Select transcript: NM_178547

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:24094
Approved symbolZBTB8OS
Namezinc finger and BTB domain containing 8 opposite strand
Location1p35.1
Locus typegene with protein product
StatusApproved
AliasesARCH
Ensembl geneENSG00000176261
Ensembl biotypeprotein_coding
OMIM615891
Entrez339487

Gene structure

Transcript identifiers

Ensembl transcripts: 27 — 19 protein_coding, 4 nonsense_mediated_decay, 3 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000341885, ENST00000373501, ENST00000373506, ENST00000436661, ENST00000465588, ENST00000466361, ENST00000467652, ENST00000468695, ENST00000473294, ENST00000473383, ENST00000476493, ENST00000479075, ENST00000483138, ENST00000492007, ENST00000498691, ENST00000899477, ENST00000899478, ENST00000899479, ENST00000899480, ENST00000933118, ENST00000933119, ENST00000933120, ENST00000933121, ENST00000933122, ENST00000933123, ENST00000933124, ENST00000933125

RefSeq mRNA: 24 — MANE Select: NM_178547 NM_001308135, NM_001308136, NM_001308137, NM_001308138, NM_001308139, NM_001308140, NM_001308141, NM_001330475, NM_001366255, NM_001366256, NM_001366257, NM_001366258, NM_001366259, NM_001366260, NM_001366263, NM_001366264, NM_001366265, NM_001366266, NM_001366267, NM_001366268, NM_001366269, NM_001366270, NM_001366271, NM_178547

CCDS: CCDS365, CCDS76134, CCDS81292

Canonical transcript exons

ENST00000468695 — 7 exons

ExonStartEnd
ENSE000013265213265043332650550
ENSE000018266603262082032621948
ENSE000034702293263364532633727
ENSE000035891023262750832627544
ENSE000036216513263395132634072
ENSE000036818473263182732631879
ENSE000037380423263476832634792

Expression profiles

Bgee: expression breadth ubiquitous, 255 present calls, max score 95.80.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 40.0131 / max 1595.5053, expressed in 1808 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
1154639.02971804
115470.9835578

Top tissues by expression

256 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
monocyteCL:000057695.80gold quality
mucosa of transverse colonUBERON:000499195.80gold quality
left ventricle myocardiumUBERON:000656695.74gold quality
leukocyteCL:000073895.67gold quality
bone marrow cellCL:000209295.12gold quality
islet of LangerhansUBERON:000000695.10gold quality
rectumUBERON:000105294.87gold quality
calcaneal tendonUBERON:000370194.80gold quality
adenohypophysisUBERON:000219694.77gold quality
body of pancreasUBERON:000115094.54gold quality
kidney epitheliumUBERON:000481994.40gold quality
ileal mucosaUBERON:000033194.26gold quality
colonic epitheliumUBERON:000039794.13gold quality
vermiform appendixUBERON:000115494.09gold quality
pancreasUBERON:000126494.07gold quality
granulocyteCL:000009494.04gold quality
hindlimb stylopod muscleUBERON:000425293.88gold quality
pituitary glandUBERON:000000793.79gold quality
gastrocnemiusUBERON:000138893.75gold quality
body of stomachUBERON:000116193.69gold quality
muscle of legUBERON:000138393.66gold quality
left coronary arteryUBERON:000162693.44gold quality
descending thoracic aortaUBERON:000234593.44gold quality
lower esophagusUBERON:001347393.41gold quality
lower esophagus muscularis layerUBERON:003583393.41gold quality
transverse colonUBERON:000115793.32gold quality
epithelial cell of pancreasCL:000008393.31gold quality
small intestine Peyer’s patchUBERON:000345493.23gold quality
thoracic aortaUBERON:000151593.22gold quality
ascending aortaUBERON:000149693.17gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes15.31

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

58 targeting ZBTB8OS, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-6867-5P100.0082.213464
HSA-MIR-371B-5P99.9975.344759
HSA-MIR-450099.9972.722367
HSA-MIR-607799.9968.042299
HSA-MIR-373-5P99.9875.364753
HSA-MIR-616-5P99.9875.584775
HSA-LET-7A-5P99.9872.291790
HSA-LET-7B-5P99.9872.311790
HSA-LET-7C-5P99.9872.291790
HSA-LET-7E-5P99.9872.291790
HSA-LET-7F-5P99.9872.561784
HSA-LET-7G-5P99.9872.371784
HSA-LET-7I-5P99.9872.371788
HSA-MIR-98-5P99.9872.331787
HSA-LET-7D-5P99.9671.761632
HSA-MIR-445899.9671.641650
HSA-MIR-4760-3P99.9370.502385
HSA-MIR-204-5P99.7971.622439
HSA-MIR-211-5P99.7971.652440
HSA-MIR-320A-3P99.7769.732107
HSA-MIR-320B99.7769.732107
HSA-MIR-320C99.7769.732107
HSA-MIR-320D99.7769.732107
HSA-MIR-442999.7769.622111
HSA-MIR-4446-5P99.7269.192544
HSA-MIR-4755-5P99.7170.342716
HSA-MIR-5006-3P99.7170.262728
HSA-MIR-128399.6972.423009
HSA-MIR-10393-5P99.6568.011368

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 98.6% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 3)

  • archease (also called ZBTB8OS), a protein of unknown function, is required for full activity of the human tRNA ligase complex and, in cooperation with DDX1, facilitates the formation of an RTCB-guanylate intermediate central to mammalian RNA ligation (PMID:24870230)
  • RTCB, the catalytic subunit of the tRNA ligase complex, and its co-factor archease mediate XBP1 mRNA splicing both in vitro and in vivo. (PMID:25378478)
  • ARCH mutation is associated with chronic myelomonocytic leukemia. (PMID:26648538)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriozbtb8osENSDARG00000016368
mus_musculusZbtb8osENSMUSG00000057572
rattus_norvegicusZbtb8osENSRNOG00000008226
drosophila_melanogasterArcheaseFBGN0289108
caenorhabditis_elegansarch-1WBGENE00016621

Protein

Protein identifiers

tRNA-splicing ligase-activating factor archeaseQ8IWT0 (reviewed: Q8IWT0)

Alternative names: Zinc finger and BTB domain-containing opposite strand protein 8

All UniProt accessions (12): A0A087WTK6, A0A087WUU3, A0A087WW18, A0A087X0V4, A0A087X1H4, A0A8C8MQ05, A8K0B5, D3DPQ2, Q8IWT0, F6SII6, H7C3R6, S4R426

UniProt curated annotations — full annotation on UniProt →

Function. Activation factor of RTCB, the catalytic subunit of the tRNA-splicing ligase complex (tRNA-LC), which is involved in the enzyme-dependent maturation of intron-containing pre-tRNAs. Functions downstream of the tRNA-splicing endonuclease that removes introns, ligating the two generated halves via phosphodiester bond formation. Catalyzes the GTP-dependent activation of RTCB, both proteins forming a composite active site coordinating GTP and metal ions to enable the guanylylation of RTCB a prerequisite to undergo multiple catalytic cycles. Also required for the activation of RTCB in non-canonical, spliceosome-independent, cytoplasmic splicing of XBP1 mRNAs during the unfolded protein response (UPR). Upon endoplasmic reticulum (ER) stress, the endoribonuclease IRE1/ERN1 excises a short intron, generating free exon ends that are aligned by RNA-intrinsic, zipper-like stem-loop structures. These exon ends are then recognized and ligated by RTCB. This splicing event yields the active XBP1s transcription factor, which induces genes required to resolve protein folding defects in the endoplasmic reticulum.

Subunit / interactions. Monomeric. Could also form homodimers. Associates, as a monomer, with the tRNA-splicing ligase complex composed of the catalytic subunit RTCB and the accessory proteins DDX1, C2orf49/Ashwin/ASW, FAM98B and RTRAF/CGI-99.

Subcellular location. Cytoplasm. Nucleus.

Cofactor. Binds 1 Mn(2+) ion per subunit. Three Mn(2+) ions are bound in the composite active site formed by RTCB and Archease. Two are exclusively coordinated by RTCB residues and the third one is solely coordinated by ZBTB8OS/Archease residues.

Similarity. Belongs to the archease family.

Isoforms (3)

UniProt IDNamesCanonical?
Q8IWT0-11yes
Q8IWT0-22
Q8IWT0-33

RefSeq proteins (24): NP_001295064, NP_001295065, NP_001295066, NP_001295067, NP_001295068, NP_001295069, NP_001295070, NP_001317404, NP_001353184, NP_001353185, NP_001353186, NP_001353187, NP_001353188, NP_001353189, NP_001353192, NP_001353193, NP_001353194, NP_001353195, NP_001353196, NP_001353197, NP_001353198, NP_001353199, NP_001353200, NP_848642* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR002804ArcheaseFamily
IPR023572Archease_domDomain
IPR036820Archease_dom_sfHomologous_superfamily

Pfam: PF01951

Catalyzed reactions (Rhea), 1 shown:

  • L-histidyl-[RNA ligase] + GTP = N(pros)-(5’-guanosyl-phosphono)-L-histidyl-[RNA ligase] + diphosphate (RHEA:68084)

UniProt features (15 total): sequence conflict 5, binding site 3, splice variant 2, mutagenesis site 2, initiator methionine 1, chain 1, modified residue 1

Structure

Experimental structures (PDB)

6 structures.

PDBMethodResolution (Å)
8BTXX-RAY DIFFRACTION1.84
5YZ1X-RAY DIFFRACTION1.97
8ODOX-RAY DIFFRACTION2.2
8ODPX-RAY DIFFRACTION2.3
5YZJX-RAY DIFFRACTION2.79
5YZLX-RAY DIFFRACTION3.4

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8IWT0-F192.310.84

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (3): 39; 166; 167

Post-translational modifications (1): 2

Mutagenesis-validated functional residues (2):

PositionPhenotype
39abolishes ability to activate trna ligase activity of rtcb.
144abolishes ability to activate trna ligase activity of rtcb.

Function

Pathways and Gene Ontology

Reactome pathways

3 pathways

IDPathway
R-HSA-6784531tRNA processing in the nucleus
R-HSA-72306tRNA processing
R-HSA-8953854Metabolism of RNA

MSigDB gene sets: 140 (showing top): E2F_Q4_01, GOBP_IRE1_MEDIATED_UNFOLDED_PROTEIN_RESPONSE, RODRIGUES_THYROID_CARCINOMA_POORLY_DIFFERENTIATED_UP, GOBP_RESPONSE_TO_ENDOPLASMIC_RETICULUM_STRESS, GOBP_TRNA_METABOLIC_PROCESS, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_UP, GOBP_CELLULAR_RESPONSE_TO_TOPOLOGICALLY_INCORRECT_PROTEIN, E2F_Q3, SOX9_B1, GOBP_RNA_SPLICING_VIA_ENDONUCLEOLYTIC_CLEAVAGE_AND_LIGATION, USF_01, GOBP_RNA_SPLICING, LASTOWSKA_NEUROBLASTOMA_COPY_NUMBER_DN, RYTTCCTG_ETS2_B, E2F1_Q3

GO Biological Process (4): RNA splicing, via endonucleolytic cleavage and ligation (GO:0000394), tRNA exon ligation (GO:0000968), tRNA splicing, via endonucleolytic cleavage and ligation (GO:0006388), tRNA processing (GO:0008033)

GO Molecular Function (3): protein guanylyltransferase activity (GO:0044600), metal ion binding (GO:0046872), protein binding (GO:0005515)

GO Cellular Component (5): nucleus (GO:0005634), nucleoplasm (GO:0005654), cytoplasm (GO:0005737), tRNA-splicing ligase complex (GO:0072669), catalytic complex (GO:1902494)

Reactome top-level categories

Rollup of top-2 pathways:

CategoryPathways
tRNA processing1
Metabolism of RNA1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure2
RNA splicing1
RNA exon ligation1
tRNA splicing, via endonucleolytic cleavage and ligation1
RNA splicing, via endonucleolytic cleavage and ligation1
tRNA processing1
RNA processing1
tRNA metabolic process1
protein guanylylation1
guanylyltransferase activity1
cation binding1
binding1
intracellular membrane-bounded organelle1
nuclear lumen1
intracellular anatomical structure1
intracellular protein-containing complex1
catalytic complex1
protein-containing complex1

Protein interactions and networks

STRING

604 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
ZBTB8OSRTCBQ9Y3I0874
ZBTB8OSDDX1Q92499748
ZBTB8OSC1orf94Q6P1W5646
ZBTB8OSFAM98BQ52LJ0640
ZBTB8OSRTRAFQ9Y224580
ZBTB8OSRTCAO00442547
ZBTB8OSTRMT1Q9NXH9502
ZBTB8OSTHUMPD2Q9BTF0496
ZBTB8OSC2orf49Q9BVC5479
ZBTB8OSASPHD1Q5U4P2431
ZBTB8OSCMPK1P30085408
ZBTB8OSTSEN54Q7Z6J9385
ZBTB8OSALDH18A1P54886379
ZBTB8OSTRPT1Q86TN4375
ZBTB8OSGCNT4Q9P109370
ZBTB8OSRSRC1Q96IZ7370

IntAct

10 interactions, top by confidence:

ABTypeScore
ZBTB8OSRTCBpsi-mi:“MI:0915”(physical association)0.500
ZBTB8OSpsi-mi:“MI:0915”(physical association)0.490
ZBTB8OSpsi-mi:“MI:0915”(physical association)0.490
ZBTB8OSPAGE2psi-mi:“MI:0915”(physical association)0.400
ZBTB8OSFAM98Bpsi-mi:“MI:0914”(association)0.350
psi-mi:“MI:0914”(association)0.350
ZBTB8OSgcvP2psi-mi:“MI:0915”(physical association)0.000
PRKCHZBTB8OSpsi-mi:“MI:0915”(physical association)0.000

BioGRID (16): ZBTB8OS (Affinity Capture-RNA), ZBTB8OS (Affinity Capture-MS), ATP8A2 (Co-fractionation), HSP90B1 (Co-fractionation), PRPF40A (Co-fractionation), ZBTB8OS (Co-fractionation), GPANK1 (Two-hybrid), ZBTB8OS (Affinity Capture-RNA), ZBTB8OS (Affinity Capture-RNA), ZBTB8OS (Affinity Capture-MS), PAGE2 (Affinity Capture-MS), ZBTB8OS (Negative Genetic), ZBTB8OS (Negative Genetic), ZBTB8OS (Affinity Capture-MS), ZBTB8OS (Negative Genetic)

ESM2 similar proteins: A0A0U1WZ18, A4FV08, A4IHW6, C7EXK4, O88958, O97555, O97556, P21856, P31150, P46926, P50395, P50396, P50397, P50398, P50399, P60028, P61221, P61222, P62497, P70704, Q07346, Q0VCX5, Q16HW7, Q17QL1, Q29449, Q2EGP9, Q2YDE7, Q42472, Q42521, Q505B7, Q566V0, Q5R4C7, Q5R8T8, Q5RCE1, Q5U2Q7, Q5ZIZ4, Q61598, Q64422, Q6PA43, Q6Q7J2

Diamond homologs: A1RV54, A3MTM9, B1YCY3, C3MQ89, C3MVY1, C3N5V7, C3NEH3, C3NH78, C4KHI6, C5A6Z1, O27635, O28777, O59205, Q2EGP9, Q2YDE7, Q46D27, Q4JB72, Q505B7, Q54BU9, Q566V0, Q5JCY8, Q60334, Q8IWT0, Q8PWN9, Q8TU10, Q8TYR4, Q8U0N4, Q8ZWP9, Q975L0, Q9UX79, Q9V105, Q9VD92, Q9YFV2, Q9TZN1, Q61DI9, Q9HKC1, Q97A49

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

29 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance16
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1505 predictions. Top by Δscore:

VariantEffectΔscore
1:32633693:C:CTacceptor_gain1.0000
1:32633694:A:Tacceptor_gain1.0000
1:32633964:TTC:Tdonor_gain1.0000
1:32633965:TCT:Tdonor_gain1.0000
1:32650427:A:ACdonor_gain1.0000
1:32650428:C:CCdonor_gain1.0000
1:32650428:CT:Cdonor_gain1.0000
1:32650428:CTCA:Cdonor_gain1.0000
1:32650431:A:ACdonor_gain1.0000
1:32650432:C:CCdonor_gain1.0000
1:32650432:CA:Cdonor_gain1.0000
1:32650432:CACT:Cdonor_gain1.0000
1:32650439:A:ACdonor_gain1.0000
1:32650440:C:CCdonor_gain1.0000
1:32650442:T:TAdonor_gain1.0000
1:32650443:C:Adonor_gain1.0000
1:32631876:CTTC:Cacceptor_gain0.9900
1:32633639:GCTTA:Gdonor_loss0.9900
1:32633640:CTT:Cdonor_loss0.9900
1:32633641:TTA:Tdonor_loss0.9900
1:32633642:T:TGdonor_loss0.9900
1:32633643:A:ACdonor_gain0.9900
1:32633643:A:Cdonor_loss0.9900
1:32633643:AC:Adonor_gain0.9900
1:32633644:C:CCdonor_gain0.9900
1:32633644:CC:Cdonor_gain0.9900
1:32633693:C:Tacceptor_gain0.9900
1:32633699:G:GCacceptor_gain0.9900
1:32633726:TCC:Tacceptor_loss0.9900
1:32633728:C:CCacceptor_gain0.9900

AlphaMissense

1188 scored. Top likely-pathogenic:

dbSNP variants (sampled 300 via entrez): RS1000032996 (1:32632413 T>G), RS1000069292 (1:32625924 G>A), RS1000078789 (1:32637539 G>C), RS1000235263 (1:32632198 G>A), RS1000339242 (1:32625733 T>C), RS1000476735 (1:32625424 C>T), RS1000840744 (1:32630577 T>C), RS1000908790 (1:32651102 G>A), RS1001080126 (1:32636356 G>C), RS1001100415 (1:32642823 G>A), RS1001243097 (1:32651566 C>A,T), RS1001265347 (1:32652440 C>T), RS1001378153 (1:32630248 C>T), RS1001422758 (1:32646540 A>G), RS1001487736 (1:32645418 C>G,T)

Disease associations

OMIM: gene MIM:615891 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL5069379 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

29 total (human), top 29 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyrenedecreases expression3
Valproic Acidincreases methylation, affects expression, increases expression3
Particulate Matteraffects cotreatment, increases abundance, decreases expression, decreases reaction2
aristolochic acid Iincreases expression1
dicrotophosdecreases expression1
triphenyl phosphateaffects expression1
terbufosincreases methylation1
sodium arseniteincreases expression1
2-methyl-2H-pyrazole-3-carboxylic acid (2-methyl-4-o-tolylazophenyl)amidedecreases expression, decreases reaction1
Sunitinibincreases expression1
Vorinostatincreases expression1
Vehicle Emissionsdecreases reaction, decreases expression1
Bilirubindecreases expression1
Coumestrolincreases expression1
Doxorubicindecreases expression1
Fonofosincreases methylation1
Estradioldecreases expression1
Gasolineaffects cotreatment, decreases expression, increases abundance1
Ivermectindecreases expression1
Leadaffects expression1
Parathionincreases methylation1
Phthalic Acidsincreases expression1
Polycyclic Aromatic Hydrocarbonsaffects cotreatment, decreases expression, increases abundance1
Rotenoneincreases expression1
Tretinoindecreases expression1
Urethanedecreases expression1
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxidedecreases expression1
1-Butanolaffects cotreatment, decreases expression, increases abundance1
Acrylamideincreases expression1

ChEMBL screening assays

2 unique, capped per target: 2 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5059512BindingProteomics fold change data (SUDHL4 cells, 1h)Data for DCP probe CCT369260

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.