THOC2

gene
On this page

Also known as THO2dJ506G2.1

Summary

THOC2 (THO complex subunit 2, HGNC:19073) is a protein-coding gene on chromosome Xq25, encoding THO complex subunit 2 (Q8NI27). Component of the THO subcomplex of the TREX complex which is thought to couple mRNA transcription, processing and nuclear export, and which specifically associates with spliced mRNA and not with unspliced pre-mRNA. It is a common-essential gene (DepMap: required in 99.1% of cancer cell lines).

The TREX multiprotein complex binds specifically to spliced mRNAs to facilitate mRNA export. The protein encoded by this gene is a member of the THO complex, a subset of the TREX complex. The encoded protein interacts with the THOC1 protein.

Source: NCBI Gene 57187 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): X-linked complex neurodevelopmental disorder (Definitive, ClinGen) — +1 more curated relationship
  • GWAS associations: 1
  • Clinical variants (ClinVar): 587 total — 8 pathogenic, 13 likely-pathogenic
  • Phenotypes (HPO): 83
  • Druggable target: yes
  • Cancer dependency (DepMap): dependent in 99.1% of screened cell lines (common-essential)
  • MANE Select transcript: NM_001081550

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:19073
Approved symbolTHOC2
NameTHO complex subunit 2
LocationXq25
Locus typegene with protein product
StatusApproved
AliasesTHO2, dJ506G2.1
Ensembl geneENSG00000125676
Ensembl biotypeprotein_coding
OMIM300395
Entrez57187

Gene structure

Transcript identifiers

Ensembl transcripts: 21 — 10 protein_coding, 6 retained_intron, 3 protein_coding_CDS_not_defined, 2 nonsense_mediated_decay

ENST00000245838, ENST00000355725, ENST00000416618, ENST00000419789, ENST00000432353, ENST00000433883, ENST00000438358, ENST00000441692, ENST00000448128, ENST00000455053, ENST00000459945, ENST00000464161, ENST00000464604, ENST00000464982, ENST00000464992, ENST00000491737, ENST00000492203, ENST00000496830, ENST00000497887, ENST00000618150, ENST00000931863

RefSeq mRNA: 1 — MANE Select: NM_001081550 NM_001081550

CCDS: CCDS43988

Canonical transcript exons

ENST00000245838 — 39 exons

ExonStartEnd
ENSE00001537798123600569123601338
ENSE00002700240123732952123733052
ENSE00003516348123671669123671761
ENSE00003520667123703454123703505
ENSE00003528531123613639123613708
ENSE00003528571123611440123611516
ENSE00003544996123610918123610963
ENSE00003556009123696021123696154
ENSE00003614298123613399123613556
ENSE00003631304123614052123614189
ENSE00003644252123622758123622860
ENSE00003655894123686548123686714
ENSE00003659811123619401123619436
ENSE00003674538123624060123624191
ENSE00003888891123638934123639027
ENSE00003888921123665642123665837
ENSE00003889126123625912123626069
ENSE00003889566123644779123644909
ENSE00003889679123638043123638123
ENSE00003890321123712850123712908
ENSE00003890379123623105123623283
ENSE00003890581123632861123633040
ENSE00003890683123697681123697751
ENSE00003891306123627693123627968
ENSE00003891615123667106123667278
ENSE00003892015123631688123631852
ENSE00003892209123668159123668314
ENSE00003892304123645334123645375
ENSE00003892371123636079123636175
ENSE00003893204123621157123621587
ENSE00003893645123624541123624669
ENSE00003893834123633953123634070
ENSE00003894240123623787123623971
ENSE00003894520123696721123696842
ENSE00003894623123640538123640622
ENSE00003894692123626521123626662
ENSE00003894918123644575123644676
ENSE00003895400123620907123620965
ENSE00003896240123706858123706949

Expression profiles

Bgee: expression breadth ubiquitous, 297 present calls, max score 99.20.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 33.8049 / max 818.1560, expressed in 1805 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
20038832.10361803
2003861.7013771

Top tissues by expression

299 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
secondary oocyteCL:000065599.20gold quality
calcaneal tendonUBERON:000370198.40gold quality
oocyteCL:000002398.23gold quality
sural nerveUBERON:001548897.92gold quality
ventricular zoneUBERON:000305396.56gold quality
colonic epitheliumUBERON:000039796.42gold quality
adrenal tissueUBERON:001830396.03gold quality
right lungUBERON:000216795.35gold quality
lower esophagus muscularis layerUBERON:003583395.31gold quality
lower esophagusUBERON:001347395.30gold quality
visceral pleuraUBERON:000240195.10gold quality
tendonUBERON:000004394.99gold quality
left ovaryUBERON:000211994.74gold quality
mucosa of stomachUBERON:000119994.72gold quality
body of pancreasUBERON:000115094.66gold quality
right ovaryUBERON:000211894.66gold quality
ganglionic eminenceUBERON:000402394.66gold quality
monocyteCL:000057694.60gold quality
mononuclear cellCL:000084294.55gold quality
tonsilUBERON:000237294.55gold quality
esophagogastric junction muscularis propriaUBERON:003584194.49gold quality
pylorusUBERON:000116694.46gold quality
body of uterusUBERON:000985394.37gold quality
muscle layer of sigmoid colonUBERON:003580594.37gold quality
skin of abdomenUBERON:000141694.30gold quality
leukocyteCL:000073894.27gold quality
esophagusUBERON:000104394.25gold quality
upper lobe of left lungUBERON:000895294.24gold quality
left lobe of thyroid glandUBERON:000112094.15gold quality
small intestine Peyer’s patchUBERON:000345494.13gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes12.96

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

110 targeting THOC2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-4673100.0066.641490
HSA-MIR-5692A100.0074.406850
HSA-MIR-3163100.0077.238605
HSA-MIR-450099.9972.722367
HSA-MIR-366299.9973.825684
HSA-MIR-4645-5P99.9865.811284
HSA-LET-7A-5P99.9872.291790
HSA-LET-7B-5P99.9872.311790
HSA-LET-7C-5P99.9872.291790
HSA-LET-7E-5P99.9872.291790
HSA-LET-7F-5P99.9872.561784
HSA-LET-7G-5P99.9872.371784
HSA-LET-7I-5P99.9872.371788
HSA-MIR-98-5P99.9872.331787
HSA-MIR-103A-3P99.9869.141595
HSA-MIR-10799.9869.141595
HSA-MIR-3692-3P99.9870.272139
HSA-MIR-480399.9871.993117
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-MIR-1213699.9872.815713
HSA-MIR-3688-3P99.9772.022834
HSA-MIR-60799.9773.625593
HSA-MIR-1229-3P99.9766.49906
HSA-MIR-7152-3P99.9767.47849
HSA-LET-7D-5P99.9671.761632
HSA-MIR-445899.9671.641650
HSA-MIR-548AJ-3P99.9673.385345
HSA-MIR-548X-3P99.9673.385345
HSA-MIR-493-5P99.9672.472382

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 99.1% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 4)

  • recruitment of the human TREX complex to spliced mRNA is not directly coupled to transcription, but is instead coupled to transcription indirectly through splicing (PMID:15998806)
  • THOC2 mutations implicate mRNA-export pathway in X-linked intellectual disability. (PMID:26166480)
  • Study presents detailed clinical assessment and functional studies on a de novo variant in a female with an epileptic encephalopathy and discuss an additional four families with rare variants in THOC2 with supportive evidence for pathogenicity. (PMID:29851191)
  • THOC2 reduction repressed melanoma cell proliferation and invasion, and induced cell apoptosis. (PMID:31680623)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_reriothoc2ENSDARG00000037503
mus_musculusThoc2ENSMUSG00000037475
mus_musculusThoc2lENSMUSG00000097392
rattus_norvegicusThoc2ENSRNOG00000007315
rattus_norvegicusThoc2lENSRNOG00000042340
drosophila_melanogastertho2FBGN0031390
caenorhabditis_elegansWBGENE00015813

Protein

Protein identifiers

THO complex subunit 2Q8NI27 (reviewed: Q8NI27)

Alternative names: hTREX120

All UniProt accessions (9): Q8NI27, A0A0C4DG98, B7ZB98, B7ZBA0, F2Z2V2, H0Y594, H0Y7U4, H0Y815, H7C477

UniProt curated annotations — full annotation on UniProt →

Function. Component of the THO subcomplex of the TREX complex which is thought to couple mRNA transcription, processing and nuclear export, and which specifically associates with spliced mRNA and not with unspliced pre-mRNA. Required for efficient export of polyadenylated RNA and spliced mRNA. The THOC1-THOC2-THOC3 core complex alone is sufficient to bind export factor NXF1-NXT1 and promote ATPase activity of DDX39B; in the complex THOC2 is the only component that directly interacts with DDX39B. TREX is recruited to spliced mRNAs by a transcription-independent mechanism, binds to mRNA upstream of the exon-junction complex (EJC) and is recruited in a splicing- and cap-dependent manner to a region near the 5’ end of the mRNA where it functions in mRNA export to the cytoplasm via the TAP/NXF1 pathway. Required for NXF1 localization to the nuclear rim. THOC2 (and probably the THO complex) is involved in releasing mRNA from nuclear speckle domains. (Microbial infection) The TREX complex is essential for the export of Kaposi’s sarcoma-associated herpesvirus (KSHV) intronless mRNAs and infectious virus production.

Subunit / interactions. Component of the THO subcomplex, which is composed of THOC1, THOC2, THOC3, THOC5, THOC6 and THOC7. The THO subcomplex interacts with DDX39B to form the THO-DDX39B complex which multimerizes into a 28-subunit tetrameric assembly. Component of the transcription/export (TREX) complex at least composed of ALYREF/THOC4, DDX39B, SARNP/CIP29, CHTOP and the THO subcomplex; in the complex interacts with THOC1, THOC3, THOC5, THOC7 and DDX39B. TREX seems to have a dynamic structure involving ATP-dependent remodeling. Interacts with POLDIP3 and ZC3H11A.

Subcellular location. Nucleus. Nucleus speckle. Cytoplasm.

Tissue specificity. Expressed in the hippocampus and the cerebral cortex.

Disease relevance. Intellectual developmental disorder, X-linked, syndromic, Kumar type (MRXSK) [MIM:300957] A form of intellectual disability, a disorder characterized by significantly below average general intellectual functioning associated with impairments in adaptive behavior and manifested during the developmental period. Intellectual deficiency is the only primary symptom of non-syndromic X-linked forms, while syndromic forms present with associated physical, neurological and/or psychiatric manifestations. MRXSK patients manifest variable degrees of intellectual disability. Commonly observed features included speech delay, elevated BMI, short stature, seizure disorders, gait disturbance, and tremors. The disease is caused by variants affecting the gene represented in this entry. Arthrogryposis multiplex congenita 7, X-linked (AMC7) [MIM:301127] A form of arthrogryposis multiplex congenita, a developmental condition characterized by multiple joint contractures resulting from reduced or absent fetal movements. AMC7 is an X-linked recessive, severe form with onset in utero. Affected fetuses may also have subcutaneous edema and dysmorphic facial features. The disease is caused by variants affecting the gene represented in this entry.

Similarity. Belongs to the THOC2 family.

Isoforms (2)

UniProt IDNamesCanonical?
Q8NI27-11yes
Q8NI27-22

RefSeq proteins (1): NP_001075019* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR021418THO_THOC2_CDomain
IPR021726THO_THOC2_NDomain
IPR032302THOC2_NDomain
IPR040007Tho2Family

Pfam: PF11262, PF11732, PF16134

UniProt features (121 total): helix 56, sequence variant 19, compositionally biased region 9, modified residue 9, region of interest 7, strand 7, sequence conflict 4, turn 4, mutagenesis site 2, chain 1, splice variant 1, coiled-coil region 1, short sequence motif 1

Structure

Experimental structures (PDB)

4 structures.

PDBMethodResolution (Å)
7APKELECTRON MICROSCOPY3.3
7ZNLELECTRON MICROSCOPY3.45
7ZNKELECTRON MICROSCOPY3.9
8R7LELECTRON MICROSCOPY4.12

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8NI27-F172.830.33

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (9): 1222, 1385, 1390, 1393, 1417, 1443, 1450, 1486, 1516

Mutagenesis-validated functional residues (2):

PositionPhenotype
551–558impairs interaction with ddx39b. abolishes interaction with ddx39b; when associated with 589-a–a-592.
589–592impairs interaction with ddx39b. abolishes interaction with ddx39b; when associated with 551-a–s-558.

Function

Pathways and Gene Ontology

Reactome pathways

8 pathways

IDPathway
R-HSA-159236Transport of Mature mRNA derived from an Intron-Containing Transcript
R-HSA-72187mRNA 3’-end processing
R-HSA-73856RNA Polymerase II Transcription Termination
R-HSA-72202Transport of Mature Transcript to Cytoplasm
R-HSA-72203Processing of Capped Intron-Containing Pre-mRNA
R-HSA-73857RNA Polymerase II Transcription
R-HSA-74160Gene expression (Transcription)
R-HSA-8953854Metabolism of RNA

MSigDB gene sets: 381 (showing top): RODRIGUES_THYROID_CARCINOMA_ANAPLASTIC_UP, GOBP_EMBRYO_DEVELOPMENT_ENDING_IN_BIRTH_OR_EGG_HATCHING, BROWNE_HCMV_INFECTION_6HR_DN, YAGI_AML_WITH_INV_16_TRANSLOCATION, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_DN, GOBP_NEUROGENESIS, MORF_HDAC2, GOBP_REGULATION_OF_NUCLEOBASE_CONTAINING_COMPOUND_TRANSPORT, GOBP_NEGATIVE_REGULATION_OF_CELLULAR_COMPONENT_ORGANIZATION, GOBP_NUCLEAR_TRANSPORT, MODULE_331, GOBP_STEM_CELL_DIVISION, GOBP_REGULATION_OF_CELL_PROJECTION_ORGANIZATION, GOBP_IN_UTERO_EMBRYONIC_DEVELOPMENT, GOBP_REGULATION_OF_NUCLEOCYTOPLASMIC_TRANSPORT

GO Biological Process (13): cell morphogenesis (GO:0000902), blastocyst development (GO:0001824), mRNA processing (GO:0006397), mRNA export from nucleus (GO:0006406), RNA splicing (GO:0008380), regulation of gene expression (GO:0010468), regulation of mRNA export from nucleus (GO:0010793), negative regulation of neuron projection development (GO:0010977), poly(A)+ mRNA export from nucleus (GO:0016973), stem cell division (GO:0017145), neuron development (GO:0048666), generation of neurons (GO:0048699), mRNA transport (GO:0051028)

GO Molecular Function (3): mRNA binding (GO:0003729), RNA binding (GO:0003723), protein binding (GO:0005515)

GO Cellular Component (8): transcription export complex (GO:0000346), THO complex (GO:0000347), THO complex part of transcription export complex (GO:0000445), nucleus (GO:0005634), nucleoplasm (GO:0005654), cytoplasm (GO:0005737), nuclear speck (GO:0016607), chromosome, telomeric region (GO:0000781)

Reactome top-level categories

Rollup of top-5 pathways:

CategoryPathways
Processing of Capped Intron-Containing Pre-mRNA2
Transport of Mature Transcript to Cytoplasm1
RNA Polymerase II Transcription1
Metabolism of RNA1
Gene expression (Transcription)1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA processing2
gene expression2
mRNA export from nucleus2
nuclear protein-containing complex2
cellular anatomical structure2
anatomical structure morphogenesis1
in utero embryonic development1
anatomical structure development1
mRNA metabolic process1
RNA export from nucleus1
mRNA transport1
regulation of macromolecule biosynthetic process1
regulation of RNA export from nucleus1
regulation of ribonucleoprotein complex localization1
regulation of neuron projection development1
neuron projection development1
negative regulation of cell projection organization1
cell division1
neuron differentiation1
cell development1
neurogenesis1
RNA transport1
RNA binding1
nucleic acid binding1
binding1
transcription export complex1
THO complex1
intracellular membrane-bounded organelle1
nuclear lumen1
intracellular anatomical structure1
nuclear ribonucleoprotein granule1
chromosomal region1

Protein interactions and networks

STRING

2800 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
THOC2THOC1Q96FV9999
THOC2THOC3Q96J01999
THOC2DDX39BQ13838998
THOC2THOC5Q13769997
THOC2THOC7Q6I9Y2997
THOC2THOC6Q86W42997
THOC2GLI2P10070992
THOC2ALYREFQ86V81992
THOC2CYLDQ9NQC7992
THOC2RAE1P78406860
THOC2NXF1Q9UBU9827
THOC2SARNPP82979763
THOC2FYTTD1Q96QD9733
THOC2ZC3H18Q86VM9713
THOC2NCBP3Q53F19688

IntAct

108 interactions, top by confidence:

ABTypeScore
THOC1THOC5psi-mi:“MI:0914”(association)0.930
MED4MED19psi-mi:“MI:0914”(association)0.900
MED20MED19psi-mi:“MI:0914”(association)0.840
DDX39BTHOC5psi-mi:“MI:0915”(physical association)0.800
DDX39BALYREFpsi-mi:“MI:0914”(association)0.770
THOC2THOC5psi-mi:“MI:0914”(association)0.730
CFTRESYT2psi-mi:“MI:2364”(proximity)0.710
ALYREFTHOC5psi-mi:“MI:0914”(association)0.710
THOC5ALYREFpsi-mi:“MI:0914”(association)0.710
MOB1ALATS1psi-mi:“MI:0914”(association)0.670
THOC1EIF4A3psi-mi:“MI:0914”(association)0.660
CHTOPTHOC5psi-mi:“MI:0914”(association)0.660
THOC1DDX39Apsi-mi:“MI:0914”(association)0.640
KPNB1POM121Cpsi-mi:“MI:0914”(association)0.530
THOC3CLUHpsi-mi:“MI:0914”(association)0.530
NCBP3SAP18psi-mi:“MI:0914”(association)0.530
THOC5EIF4A3psi-mi:“MI:0914”(association)0.530
VCAM1PSMD11psi-mi:“MI:0914”(association)0.530
EIF4A3psi-mi:“MI:0915”(physical association)0.490
THOC2CRKpsi-mi:“MI:0915”(physical association)0.400
HLCSTHOC2psi-mi:“MI:0915”(physical association)0.400

BioGRID (343): THOC2 (Affinity Capture-MS), THOC2 (Affinity Capture-MS), THOC2 (Affinity Capture-MS), THOC2 (Affinity Capture-MS), THOC2 (Affinity Capture-MS), THOC2 (Affinity Capture-MS), PDCD4 (Co-fractionation), THOC1 (Co-fractionation), THOC2 (Co-fractionation), THOC2 (Co-fractionation), THOC6 (Co-fractionation), THOC7 (Co-fractionation), USP7 (Co-fractionation), THOC2 (Affinity Capture-MS), THOC2 (Affinity Capture-MS)

ESM2 similar proteins: A0A3Q1LSX9, A2APV2, A2AT37, A2VD00, A4II09, B0KWH8, B1AZI6, B1MTK1, B2KI97, B3MS75, B3NU52, B4GW22, B4I0W6, B4JM29, B4L2J8, B4NC41, B4Q034, C0H906, C1FXW9, F4IUX6, Q09161, Q16UN6, Q1LUC1, Q1LXC9, Q29G82, Q2L4X1, Q3UYV9, Q4R6R4, Q56A27, Q5R7L4, Q5ZJZ6, Q5ZL42, Q5ZLT7, Q5ZMW3, Q6DDM4, Q6DIE2, Q6GQ80, Q6GQD0, Q6P2Z0, Q6P7P5

Diamond homologs: B0KWH8, B1AZI6, B1MTK1, B2KI97, C1FXW9, F4IAT2, Q09779, Q8NI27

SIGNOR signaling

1 interactions.

AEffectBMechanism
THOC2“form complex”“TREX complex”binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 109 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
mRNA 3’-end processing1232.4×3e-13
Transport of Mature Transcript to Cytoplasm631.3×2e-06
Transport of Mature mRNA derived from an Intron-Containing Transcript1327.1×3e-13
RNA Polymerase II Transcription Termination721.1×2e-06
Processing of Capped Intron-Containing Pre-mRNA1415.8×3e-11
mRNA Splicing913.5×2e-06
mRNA Splicing - Major Pathway1410.5×6e-09
Metabolism of RNA148.0×2e-07

GO biological processes:

GO termPartnersFoldFDR
RNA export from nucleus550.9×7e-06
negative regulation of mRNA splicing, via spliceosome541.6×1e-05
mRNA export from nucleus1238.6×1e-13
positive regulation of transcription elongation by RNA polymerase II619.6×6e-05
RNA splicing1413.4×8e-10
mRNA processing1210.3×5e-07
mRNA splicing, via spliceosome1010.0×1e-05

Disease & clinical

Clinical variants and AI predictions

ClinVar

587 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic8
Likely pathogenic13
Uncertain significance200
Likely benign31
Benign21

Top pathogenic / likely-pathogenic (21)

Variant IDHGVSClassification
208523NM_001081550.2(THOC2):c.1313T>C (p.Leu438Pro)Pathogenic
208524NM_001081550.2(THOC2):c.937C>T (p.Leu313Phe)Pathogenic
208525NM_001081550.2(THOC2):c.3034T>C (p.Ser1012Pro)Pathogenic
208526NM_001081550.2(THOC2):c.2399T>C (p.Ile800Thr)Pathogenic
3362897THOC2, 2.4-KB DEL, EX37-38DELPathogenic
3362900THOC2, 4-BP DEL, 2482GTCAPathogenic
488431NM_001081550.2(THOC2):c.2138G>A (p.Gly713Asp)Pathogenic
488434NM_001081550.2(THOC2):c.4450-2A>GPathogenic
1527937NM_001081550.2(THOC2):c.1844G>A (p.Cys615Tyr)Likely pathogenic
1700225NM_001081550.2(THOC2):c.34T>C (p.Trp12Arg)Likely pathogenic
1700228NM_001081550.2(THOC2):c.2695T>C (p.Tyr899His)Likely pathogenic
1801970NM_001081550.2(THOC2):c.2788C>T (p.Arg930Cys)Likely pathogenic
2314312NM_001081550.2(THOC2):c.2818G>A (p.Glu940Lys)Likely pathogenic
3342610NM_001081550.2(THOC2):c.2482-1_2484delLikely pathogenic
3349329NM_001081550.2(THOC2):c.3215C>T (p.Thr1072Ile)Likely pathogenic
488433NM_001081550.2(THOC2):c.3503+4A>CLikely pathogenic
488437NM_001081550.2(THOC2):c.3323C>T (p.Ser1108Leu)Likely pathogenic
804356NM_001081550.2(THOC2):c.2642A>G (p.Tyr881Cys)Likely pathogenic
807709NM_001081550.2(THOC2):c.149A>C (p.Tyr50Ser)Likely pathogenic
982367NM_001081550.2(THOC2):c.1942G>T (p.Ala648Ser)Likely pathogenic
984647NM_001081550.2(THOC2):c.3305A>G (p.Tyr1102Cys)Likely pathogenic

SpliceAI

6313 predictions. Top by Δscore:

VariantEffectΔscore
X:123611452:T:TAdonor_gain1.0000
X:123612217:ATAG:Adonor_gain1.0000
X:123612217:ATAGC:Adonor_gain1.0000
X:123613552:CCCCA:Cacceptor_gain1.0000
X:123613553:CCCA:Cacceptor_gain1.0000
X:123613553:CCCAC:Cacceptor_gain1.0000
X:123613554:CCA:Cacceptor_gain1.0000
X:123613554:CCAC:Cacceptor_gain1.0000
X:123613555:CAC:Cacceptor_gain1.0000
X:123613557:C:CCacceptor_gain1.0000
X:123613637:A:ACdonor_gain1.0000
X:123613638:C:CCdonor_gain1.0000
X:123614050:AC:Adonor_gain1.0000
X:123614051:CC:Cdonor_gain1.0000
X:123614066:T:Adonor_gain1.0000
X:123614190:C:CCacceptor_gain1.0000
X:123615309:A:ACdonor_gain1.0000
X:123615310:T:Cdonor_gain1.0000
X:123620966:C:CCacceptor_gain1.0000
X:123621589:T:Cacceptor_gain1.0000
X:123621589:T:TCacceptor_gain1.0000
X:123622752:TCCTA:Tdonor_loss1.0000
X:123622753:CCTAC:Cdonor_loss1.0000
X:123622754:CTA:Cdonor_loss1.0000
X:123622755:TAC:Tdonor_loss1.0000
X:123622756:ACC:Adonor_loss1.0000
X:123622757:CCTGT:Cdonor_loss1.0000
X:123622856:TTTAT:Tacceptor_gain1.0000
X:123622857:TTAT:Tacceptor_gain1.0000
X:123622858:TAT:Tacceptor_gain1.0000

AlphaMissense

10588 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
X:123623228:G:CH1187D1.000
X:123623229:A:CF1186L1.000
X:123623229:A:TF1186L1.000
X:123623230:A:GF1186S1.000
X:123623231:A:GF1186L1.000
X:123623275:C:TG1171E1.000
X:123623793:G:TA1166D1.000
X:123623794:C:GA1166P1.000
X:123623868:C:TG1141D1.000
X:123623886:G:TP1135Q1.000
X:123623887:G:AP1135S1.000
X:123623910:A:GL1127P1.000
X:123623955:A:GL1112P1.000
X:123624067:A:GL1104P1.000
X:123624080:A:GW1100R1.000
X:123624080:A:TW1100R1.000
X:123624081:T:AK1099N1.000
X:123624081:T:GK1099N1.000
X:123624082:T:AK1099I1.000
X:123624083:T:CK1099E1.000
X:123624086:G:CH1098D1.000
X:123624091:A:TV1096D1.000
X:123624097:C:GR1094P1.000
X:123624100:A:GF1093S1.000
X:123624115:A:GL1088S1.000
X:123624168:G:CF1070L1.000
X:123624168:G:TF1070L1.000
X:123624170:A:GF1070L1.000
X:123624172:C:TG1069E1.000
X:123624173:C:GG1069R1.000

dbSNP variants (sampled 300 via entrez): RS1000011782 (X:123656098 A>G), RS1000018433 (X:123728270 C>T), RS1000047919 (X:123654055 C>T), RS1000049051 (X:123720935 A>G), RS1000053335 (X:123640748 A>G), RS1000071868 (X:123658657 G>A), RS1000124839 (X:123639925 C>T), RS1000130298 (X:123718925 C>T), RS1000156599 (X:123682492 G>A), RS1000192990 (X:123733488 TTCTCCC>T,TTCTCCCTCTCCC), RS1000198446 (X:123688567 C>A), RS1000228637 (X:123719284 A>C), RS1000229470 (X:123614397 C>A), RS1000291732 (X:123663625 T>C,G), RS1000318879 (X:123646098 G>A)

Disease associations

OMIM: gene MIM:300395 | disease phenotypes: MIM:300957, MIM:620977, MIM:209850, MIM:301127

GenCC curated gene-disease

DiseaseClassificationInheritance
X-linked intellectual disability-short stature-overweight syndromeStrongX-linked

ClinGen Gene-Disease Validity (1)

Expert-panel classifications — Definitive > Strong > Moderate > Limited > Disputed > Refuted.

DiseaseClassificationInheritance
X-linked complex neurodevelopmental disorderDefinitiveXL

Mondo (6): intellectual disability (MONDO:0001071), X-linked intellectual disability-short stature-overweight syndrome (MONDO:0010496), neurodevelopmental disorder (MONDO:0700092), immunodeficiency 127 (MONDO:0975832), autism (MONDO:0005260), arthrogryposis multiplex congenita 7, X-linked (MONDO:0975826)

Orphanet (2): X-linked intellectual disability-short stature-overweight syndrome (Orphanet:457240), NON RARE IN EUROPE: Unexplained intellectual disability (Orphanet:319658)

HPO phenotypes

83 total (30 of 83 shown, HPO-id order):

HPOTerm
HP:0000028Cryptorchidism
HP:0000054Micropenis
HP:0000202Orofacial cleft
HP:0000218High palate
HP:0000252Microcephaly
HP:0000256Macrocephaly
HP:0000316Hypertelorism
HP:0000337Broad forehead
HP:0000347Micrognathia
HP:0000348High forehead
HP:0000400Macrotia
HP:0000407Sensorineural hearing impairment
HP:0000463Anteverted nares
HP:0000486Strabismus
HP:0000505Visual impairment
HP:0000577Exotropia
HP:0000639Nystagmus
HP:0000708Atypical behavior
HP:0000716Depression
HP:0000729Autistic behavior
HP:0000733Motor stereotypy
HP:0000739Anxiety
HP:0000742Self-mutilation
HP:0000750Delayed speech and language development
HP:0000824Decreased response to growth hormone stimulation test
HP:0000954Single transverse palmar crease
HP:0001188Hand clenching
HP:0001249Intellectual disability
HP:0001250Seizure
HP:0001252Hypotonia

GWAS associations

1 associations (top):

StudyTraitp-value
GCST90002381_503Eosinophil count2.000000e-10

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004842eosinophil count

MeSH disease descriptors (3)

DescriptorNameTree numbers
D001321Autistic DisorderF03.625.164.113.500
D008607Intellectual DisabilityC10.597.606.360; C23.888.592.604.646; F01.700.687; F03.625.539
D065886Neurodevelopmental DisordersF03.625

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL6066378 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

4 potent at pChembl≥5 of 4 total, top 4 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
5.69Kd2050nMCHEMBL5653589
5.69ED502050nMCHEMBL5653589
5.08Kd8402nMCHEMBL3752910
5.08ED508402nMCHEMBL3752910

PubChem BioAssay actives

2 with measured affinity, of 4 total; 2 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
4-methyl-3-[(2-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2149582: Binding affinity to human THOC2 incubated for 45 mins by Kinobead based pull down assaykd2.0497uM
4-methyl-3-[(1-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2149582: Binding affinity to human THOC2 incubated for 45 mins by Kinobead based pull down assaykd8.4024uM

CTD chemical–gene interactions

52 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Adecreases expression2
sodium arsenitedecreases expression, increases expression2
Valproic Aciddecreases expression, increases methylation2
aristolochic acid Idecreases expression1
FR900359affects phosphorylation1
bisphenol Faffects cotreatment, increases expression1
TAK-243decreases sumoylation1
methylmercuric chloridedecreases expression1
triphenyl phosphateaffects expression1
alpha-pineneaffects cotreatment, increases oxidation, increases abundance1
titanium dioxideincreases methylation1
decabromobiphenyl etherincreases expression1
beta-lapachonedecreases expression1
methylparabenincreases expression1
tetrabromobisphenol Adecreases expression1
potassium chromate(VI)affects cotreatment, decreases expression1
aflatoxin B2increases methylation1
coumarindecreases phosphorylation1
methacrylaldehydeincreases abundance, affects cotreatment, increases oxidation1
beta-methylcholineaffects expression1
epigallocatechin gallateaffects cotreatment, decreases expression1
di-n-butylphosphoric acidaffects expression1
perfluorooctane sulfonic aciddecreases expression1
CGP 52608affects binding, increases reaction1
2,2’,4,4’-tetrabromodiphenyl etherdecreases expression1
pentabrominated diphenyl ether 100decreases expression1
hexabrominated diphenyl ether 153decreases expression1
jinfukangdecreases expression1
Resveratrolaffects cotreatment, increases expression1
Sunitinibincreases expression1

ChEMBL screening assays

1 unique, capped per target: 1 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5652624BindingBinding affinity to human THOC2 incubated for 45 mins by Kinobead based pull down assayNVP-BHG712: Effects of Regioisomers on the Affinity and Selectivity toward the EPHrin Family. — ChemMedChem

Clinical trials (associated diseases)

298 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT05657860PHASE4COMPLETEDGuanfacine Extended Release for the Reduction of Aggression and Self-injurious Behavior Associated With Prader-Willi Syndrome
NCT05744479PHASE4RECRUITINGMetformin for Antipsychotic-induced Weight Gain in Adults With Intellectual Disability
NCT06107829PHASE4WITHDRAWNValbenazine Treatment of Tardive Dyskinesia in Adults With Intellectual/Developmental Disabilities
NCT06997198PHASE4NOT_YET_RECRUITINGDeutetrabenazine Treatment for Tardive Dyskinesia in Intellectual/Developmental Disabilities
NCT04586348PHASE4UNKNOWNPrenatal Iodine Supplementation and Early Childhood Neurodevelopment
NCT04873115PHASE4UNKNOWNDouble-blind, Placebo-controlled, Randomized Clinical Trial Comparing the Efficacy and Safety of Sialanar Plus orAl rehabiLitation Against Placebo Plus Oral Rehabilitation for chIldren and Adolescents With seVere Sialorrhoea and Neurodisabilties,
NCT02270736PHASE3COMPLETEDClinical Study to Investigate the Efficacy and Safety of NT 201 Compared to Placebo in the Treatment of Chronic Troublesome Drooling Associated With Neurological Disorders and/or Intellectual Disability
NCT02559102PHASE3COMPLETEDDexmedetomidine Sedation Versus General Anaesthesia for Inguinal Hernia Surgery in Infants
NCT02757079PHASE3COMPLETEDStudy of the Efficacy and Safety of NPC-15 for Sleep Disorders of Children With Neurodevelopmental Disorders
NCT06915480PHASE3RECRUITINGReducing Missed Appointments
NCT07377032PHASE3RECRUITINGTAP-GRIN: Interventional Study on Patients With GRIN-related Neurodevelopmental Disorders
NCT02304302PHASE2COMPLETEDDown Syndrome Memantine Follow-up Study
NCT03862950PHASE2COMPLETEDA Trial of Metformin in Individuals With Fragile X Syndrome (Met)
NCT04529226PHASE2UNKNOWNStudy to Compare Clozapine vs Treatment as Usual in People With Intellectual Disability & Treatment-resistant Psychosis
NCT04821856PHASE2COMPLETEDEvaluation of the Effectiveness of Cannabidiol in Treating Severe Behavioural Problems in Children and Adolescents With Intellectual Disability
NCT02909959PHASE2COMPLETEDSulforaphane for the Treatment of Young Men With Autism Spectrum Disorder
NCT06081348PHASE2RECRUITINGSertraline vs. Placebo in the Treatment of Anxiety in Children and AdoLescents With NeurodevelopMental Disorders
NCT06352372PHASE2COMPLETEDSafety and Efficacy of tPBM for Epileptiform Activity in Autism
NCT05273320PHASE1COMPLETEDClinical Trial of Nabilone for Aggression in Adults With Intellectual and Developmental Disabilities
NCT05301361PHASE1ENROLLING_BY_INVITATIONSensitivity of the NIH Toolbox to Stimulant Treatment in Intellectual Disabilities
NCT06016764PHASE1COMPLETEDUse of MRI and cTBS for Catatonia in Autism
NCT06586827PHASE1COMPLETEDImpact of Competency-Based Training and Technical Assistance Employment Outcomes of Individuals With ID/DD
NCT07531940PHASE1NOT_YET_RECRUITINGEscalating Doses of Memantine in Down Syndrome (MEDS-123)
NCT00503191PHASE1COMPLETEDNeuroModulation Technique Treatment of Autism
NCT04475848PHASE1COMPLETEDA Study to Investigate the Safety, Tolerability, Pharmacokinetics, Pharmacodynamics and Food Effect of RO6953958 in Healthy Participants
NCT06300398PHASE1COMPLETEDIAMA-6 Oral Dose Study in Healthy Adults
NCT03479476PHASE2/PHASE3COMPLETEDA Trial of Metformin in Individuals With Fragile X Syndrome
NCT02616796PHASE1/PHASE2COMPLETEDEffects of Social Gaze Training on Brain and Behavior in Fragile X Syndrome
NCT06860672EARLY_PHASE1RECRUITINGClinical Trial of the Dual Vector Base Editor for the Treatment of the CHD3-R1025W Mutation
NCT00597948Not specifiedCOMPLETEDHealthy Lifestyles for People With Intellectual Disabilities
NCT01087320Not specifiedRECRUITINGGenome Medical Sequencing for Gene Discovery
NCT01652963Not specifiedUNKNOWNPicture-based Computerised Assessment and Training of Cognitive Behaviour Therapy Skills
NCT01695395Not specifiedCOMPLETEDMental Health Care Provision for Adults With Intellectual Disability and a Mental Disorder
NCT01867554Not specifiedCOMPLETEDResearch and Characterization of New Genes Involved in Intellectual Disability
NCT01915381Not specifiedCOMPLETEDImproving Adherence Healthy Lifestyle With a Smartphone Application Based on Adults With Intellectual Disabilities
NCT01988623Not specifiedCOMPLETEDPivotal Response Treatment for Individuals With Intellectual Disabilities
NCT02099773Not specifiedCOMPLETEDSupport Staff-client Interactions With Augmentative and Alternative Communication
NCT02136849Not specifiedCOMPLETEDInter-regional Project of the Great Western Exploration Approach for Exome Molecular Causes Severe Intellectual Disability Isolated or Syndromic
NCT02225041Not specifiedCOMPLETEDSedation Strategy and Cognitive Outcome After Critical Illness in Early Childhood
NCT02414438Not specifiedCOMPLETEDEstablishing the Clinical Utility of First StepDx PLUS and NextStepDx PLUS Study