THEMIS

gene
On this page

Also known as bA325O24.4FLJ40584bA325O24.3THEMIS1

Summary

THEMIS (thymocyte selection associated, HGNC:21569) is a protein-coding gene on chromosome 6q22.33, encoding Protein THEMIS (Q8N1K5). Plays a central role in late thymocyte development by controlling both positive and negative T-cell selection.

This gene encodes a protein that plays a regulatory role in both positive and negative T-cell selection during late thymocyte development. The protein functions through T-cell antigen receptor signaling, and is necessary for proper lineage commitment and maturation of T-cells. Alternative splicing results in multiple transcript variants.

Source: NCBI Gene 387357 — RefSeq curated summary.

At a glance

  • GWAS associations: 17
  • Clinical variants (ClinVar): 106 total
  • MANE Select transcript: NM_001010923

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:21569
Approved symbolTHEMIS
Namethymocyte selection associated
Location6q22.33
Locus typegene with protein product
StatusApproved
AliasesbA325O24.4, FLJ40584, bA325O24.3, THEMIS1
Ensembl geneENSG00000172673
Ensembl biotypeprotein_coding
OMIM613607
Entrez387357

Gene structure

Transcript identifiers

Ensembl transcripts: 7 — 7 protein_coding

ENST00000368248, ENST00000368250, ENST00000434358, ENST00000537166, ENST00000626040, ENST00000630369, ENST00000852157

RefSeq mRNA: 7 — MANE Select: NM_001010923 NM_001010923, NM_001164685, NM_001164687, NM_001318531, NM_001394520, NM_001394521, NM_001394522

CCDS: CCDS34534, CCDS55055, CCDS55056

Canonical transcript exons

ENST00000368248 — 6 exons

ExonStartEnd
ENSE00001084554127719688127719823
ENSE00001179249127812883127813931
ENSE00001179254127829476127829934
ENSE00003599310127855030127855188
ENSE00003923719127708200127710016
ENSE00003936109127900842127901058

Expression profiles

Bgee: expression breadth ubiquitous, 120 present calls, max score 82.86.

FANTOM5 (CAGE): breadth broad, TPM avg 5.4701 / max 460.8167, expressed in 215 samples.

FANTOM5 promoters (7 alternative TSS)

Promoter IDTPM avgSamples expressed
754523.8293157
754550.604077
754540.374066
754560.245957
754500.237939
754530.173857
754490.00522

Top tissues by expression

130 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
lymph nodeUBERON:000002982.86gold quality
granulocyteCL:000009477.59gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047376.04gold quality
vermiform appendixUBERON:000115475.23gold quality
bloodUBERON:000017875.03gold quality
primary visual cortexUBERON:000243673.90gold quality
colonic epitheliumUBERON:000039771.93gold quality
superior frontal gyrusUBERON:000266170.73gold quality
prefrontal cortexUBERON:000045170.64gold quality
tonsilUBERON:000237270.27gold quality
duodenumUBERON:000211469.45gold quality
bone marrow cellCL:000209269.16gold quality
gall bladderUBERON:000211068.73gold quality
Brodmann (1909) area 9UBERON:001354067.19gold quality
bone marrowUBERON:000237165.93gold quality
rectumUBERON:000105265.53gold quality
frontal cortexUBERON:000187065.20gold quality
dorsolateral prefrontal cortexUBERON:000983464.94gold quality
spleenUBERON:000210664.17gold quality
cerebral cortexUBERON:000095662.01gold quality
anterior cingulate cortexUBERON:000983562.00gold quality
small intestineUBERON:000210858.65gold quality
smooth muscle tissueUBERON:000113558.13gold quality
small intestine Peyer’s patchUBERON:000345457.31gold quality
right frontal lobeUBERON:000281056.34gold quality
lungUBERON:000204855.91gold quality
corpus callosumUBERON:000233655.54gold quality
mucosa of transverse colonUBERON:000499155.07gold quality
endometriumUBERON:000129554.64gold quality
upper lobe of left lungUBERON:000895254.13gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes4.47

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

124 targeting THEMIS, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-5692A100.0074.406850
HSA-MIR-9-5P100.0072.282361
HSA-MIR-190A-3P100.0080.355520
HSA-MIR-186-5P99.9970.833707
HSA-MIR-477599.9875.006394
HSA-MIR-548N99.9871.944170
HSA-MIR-4482-3P99.9872.503147
HSA-MIR-480399.9871.993117
HSA-MIR-314899.9775.066478
HSA-MIR-3065-5P99.9771.563281
HSA-MIR-590-3P99.9674.346478
HSA-MIR-211099.9666.681930
HSA-MIR-551B-5P99.9671.283493
HSA-MIR-1468-3P99.9672.743797
HSA-MIR-4725-3P99.9669.532520
HSA-MIR-6780B-5P99.9669.602562
HSA-MIR-55999.9572.283609
HSA-MIR-548AB99.9571.313488
HSA-MIR-548A-5P99.9471.273482
HSA-MIR-548AD-5P99.9471.233502
HSA-MIR-548AE-5P99.9471.233502
HSA-MIR-548AK99.9471.243488
HSA-MIR-548AM-5P99.9471.243488
HSA-MIR-548AP-5P99.9471.143489
HSA-MIR-548AQ-5P99.9471.343426
HSA-MIR-548AR-5P99.9471.283515
HSA-MIR-548AS-5P99.9471.223482
HSA-MIR-548AU-5P99.9471.243488
HSA-MIR-548AY-5P99.9471.233502
HSA-MIR-548B-5P99.9471.233502

Literature-anchored findings (GeneRIF, showing 11)

  • Functional characterization of the homologous mouse gene. (PMID:19597498)
  • Functional analysis of the mouse gene by knockout, and comparison of the mouse and human proteins. (PMID:19597499)
  • specific role for THEMIS in early TCR signalosome function (PMID:21189249)
  • SPOP mutations may define a new molecular subtype of prostate cancer. (PMID:22610119)
  • Themis1 exhibits the same potential to restore T cell development and T cell receptor-mediated signaling in Themis1-deficient mice. (PMID:22732588)
  • THEMIS binding to GRB2 was required for thymocyte development. (PMID:23460737)
  • THEMIS showed higher mRNA expression in active celiac disease compared with treated patients and controls. (PMID:23820479)
  • the THEMIS:SHP complex dampens early TCR signalling by a previously unknown molecular mechanism that favours T-cell survival. (PMID:25535246)
  • we have demonstrated mutually exclusive use of exon 1 and 2 amongst 16 novel THEMIS isoforms. We also show inverse correlation between THEMIS expression in human CD4+ T-cells and dosage of the multiple sclerosis risk allele at rs13204742, driven by reduced expression of exon 1- containing isoforms (PMID:27438997)
  • we found that THEMIS directly regulated the catalytic activity of the tyrosine phosphatase SHP-1. (PMID:28250424)
  • THEMIS-SHP1 Recruitment by 4-1BB Tunes LCK-Mediated Priming of Chimeric Antigen Receptor-Redirected T Cells (PMID:32004441)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
danio_reriothemisENSDARG00000010619
mus_musculusThemisENSMUSG00000049109
rattus_norvegicusThemisENSRNOG00000014979

Paralogs (1): THEMIS2 (ENSG00000130775)

Protein

Protein identifiers

Protein THEMISQ8N1K5 (reviewed: Q8N1K5)

Alternative names: Thymocyte-expressed molecule involved in selection

All UniProt accessions (3): Q8N1K5, A0A0D9SFD2, X1WI26

UniProt curated annotations — full annotation on UniProt →

Function. Plays a central role in late thymocyte development by controlling both positive and negative T-cell selection. Required to sustain and/or integrate signals required for proper lineage commitment and maturation of T-cells. Regulates T-cell development through T-cell antigen receptor (TCR) signaling and in particular through the regulation of calcium influx and phosphorylation of Erk.

Subunit / interactions. Interacts with PLCG1, ITK, GRB2, and LAT.

Subcellular location. Cytoplasm. Nucleus.

Post-translational modifications. Phosphorylated on Tyr residues quickly after TCR stimulation.

Similarity. Belongs to the themis family.

Isoforms (4)

UniProt IDNamesCanonical?
Q8N1K5-11yes
Q8N1K5-22
Q8N1K5-33
Q8N1K5-44

RefSeq proteins (7): NP_001010923, NP_001158157, NP_001158159, NP_001305460, NP_001381449, NP_001381450, NP_001381451 (=MANE)

Domains & families (InterPro)

IDNameType
IPR025946CABIT_domDomain
IPR039671THEMISFamily

Pfam: PF12736

UniProt features (15 total): sequence conflict 5, region of interest 3, splice variant 3, sequence variant 2, chain 1, modified residue 1

Structure

Experimental structures (PDB)

3 structures.

PDBMethodResolution (Å)
9UOUX-RAY DIFFRACTION3.06
9IAZELECTRON MICROSCOPY3.2
9I3PELECTRON MICROSCOPY3.3

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8N1K5-F176.260.28

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (1): 584

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 175 (showing top): GOBP_ANTIGEN_RECEPTOR_MEDIATED_SIGNALING_PATHWAY, ACEVEDO_LIVER_CANCER_WITH_H3K27ME3_UP, GOBP_REGULATION_OF_IMMUNE_RESPONSE, GOBP_POSITIVE_T_CELL_SELECTION, GOBP_ADAPTIVE_IMMUNE_RESPONSE, GOBP_T_CELL_RECEPTOR_SIGNALING_PATHWAY, GOBP_NEGATIVE_T_CELL_SELECTION, GOCC_CELL_CELL_JUNCTION, GOBP_T_CELL_SELECTION, GAVIN_FOXP3_TARGETS_CLUSTER_P3, GAVIN_FOXP3_TARGETS_CLUSTER_P4, ZHENG_BOUND_BY_FOXP3, MARSON_BOUND_BY_FOXP3_STIMULATED, MARSON_BOUND_BY_FOXP3_UNSTIMULATED, GOCC_COP9_SIGNALOSOME

GO Biological Process (5): adaptive immune response (GO:0002250), positive T cell selection (GO:0043368), negative T cell selection (GO:0043383), T cell receptor signaling pathway (GO:0050852), immune system process (GO:0002376)

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (4): nucleus (GO:0005634), cytoplasm (GO:0005737), cell-cell junction (GO:0005911), COP9 signalosome (GO:0008180)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
T cell selection2
immune response1
antigen receptor-mediated signaling pathway1
biological_process1
binding1
intracellular membrane-bounded organelle1
intracellular anatomical structure1
cellular anatomical structure1
anchoring junction1
nuclear protein-containing complex1

Protein interactions and networks

STRING

516 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
THEMISGRB2P29354565
THEMISPTPN11Q06124552
THEMISHBZP02008544
THEMISPTPRKQ15262510
THEMISTESPA1A2RU30480
THEMISTIFAQ96CG3462
THEMISLCP2Q13094449
THEMISPLCG1P19174406
THEMISKIAA0408Q6ZU52404
THEMISSELLP14151401
THEMISRAG1P15918398
THEMISSRCP12931355
THEMISSEC63Q9UGP8348
THEMISGRAP2O75791334
THEMISMTCL3Q5TF21323

IntAct

20 interactions, top by confidence:

ABTypeScore
THEMISGRB2psi-mi:“MI:0914”(association)0.840
THEMISGRB2psi-mi:“MI:0915”(physical association)0.840
HBZTHEMISpsi-mi:“MI:0915”(physical association)0.540
THEMISHBZpsi-mi:“MI:0914”(association)0.540
HBZTHEMISpsi-mi:“MI:0403”(colocalization)0.540
THEMISPRMT6psi-mi:“MI:0915”(physical association)0.510
THEMISPTPN6psi-mi:“MI:0914”(association)0.500
PTPN6THEMISpsi-mi:“MI:0915”(physical association)0.500
THEMISPLCG1psi-mi:“MI:0914”(association)0.460
THEMISLATpsi-mi:“MI:0914”(association)0.460
LATGRB2psi-mi:“MI:0914”(association)0.350
THEMISPOTEFpsi-mi:“MI:0914”(association)0.350
THEMISALOX5psi-mi:“MI:0914”(association)0.350
cutATHEMISpsi-mi:“MI:0915”(physical association)0.000

BioGRID (8): PTPN11 (Affinity Capture-MS), GRB2 (Affinity Capture-MS), POTEF (Affinity Capture-MS), POTEF (Affinity Capture-MS), GRB2 (Affinity Capture-MS), PTPN11 (Affinity Capture-MS), PRMT6 (Affinity Capture-Luminescence), PRMT6 (Two-hybrid)

ESM2 similar proteins: A4D1B5, A5D789, A5PF62, B1ANS9, B2GV47, D3YVL2, D3ZSP7, O00443, O70167, O70173, O75747, O88196, P53804, Q14D04, Q1LVQ2, Q3TCV3, Q3TUL7, Q402B2, Q4R8X0, Q4R9E9, Q4VA55, Q5H9S7, Q5R4M2, Q5RAY1, Q5RB52, Q5RBH9, Q5T0N1, Q5U228, Q5ZKI7, Q61194, Q6NU22, Q6NU51, Q6P2C0, Q6ZUJ8, Q8BGW0, Q8C779, Q8CDN1, Q8IXR9, Q8N1K5, Q8NB91

Diamond homologs: A5D789, A5PF62, Q5TEJ8, Q8BGW0, Q8N1K5, Q91YX0

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

106 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance90
Likely benign6
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

2104 predictions. Top by Δscore:

VariantEffectΔscore
6:127719822:CG:Cacceptor_gain1.0000
6:127812877:CCTTA:Cdonor_loss1.0000
6:127812878:CTTA:Cdonor_loss1.0000
6:127812879:TTACC:Tdonor_loss1.0000
6:127812880:TACCT:Tdonor_loss1.0000
6:127812881:ACCTT:Adonor_loss1.0000
6:127812882:C:Gdonor_loss1.0000
6:127813932:C:CCacceptor_gain1.0000
6:127855024:CTGTA:Cdonor_loss1.0000
6:127855025:TGTA:Tdonor_loss1.0000
6:127855026:GTAC:Gdonor_loss1.0000
6:127855027:TA:Tdonor_loss1.0000
6:127855029:C:CGdonor_loss1.0000
6:127918462:T:Cdonor_gain1.0000
6:127710012:TGTTT:Tacceptor_gain0.9900
6:127710015:TT:Tacceptor_gain0.9900
6:127710015:TTCT:Tacceptor_loss0.9900
6:127710016:TCT:Tacceptor_loss0.9900
6:127710017:C:CCacceptor_gain0.9900
6:127710017:C:CGacceptor_loss0.9900
6:127710018:T:Aacceptor_loss0.9900
6:127719819:TGACG:Tacceptor_gain0.9900
6:127719820:GACG:Gacceptor_gain0.9900
6:127719821:ACGC:Aacceptor_loss0.9900
6:127719824:C:CCacceptor_gain0.9900
6:127719825:T:Gacceptor_loss0.9900
6:127782771:C:CTacceptor_gain0.9900
6:127813928:CGAA:Cacceptor_gain0.9900
6:127813930:AAC:Aacceptor_loss0.9900
6:127813931:ACTA:Aacceptor_loss0.9900

AlphaMissense

4234 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:127813623:C:GG340R0.993
6:127829591:T:AR198S0.991
6:127829591:T:GR198S0.991
6:127813622:C:TG340D0.990
6:127829511:A:GL225P0.990
6:127813205:A:GL479P0.989
6:127813577:A:GL355P0.987
6:127813682:G:TA320D0.986
6:127813340:A:GF434S0.985
6:127855098:G:TA61D0.983
6:127813616:A:GF342S0.981
6:127813230:C:GD471H0.980
6:127813612:C:AK343N0.980
6:127813612:C:GK343N0.980
6:127813645:G:CF332L0.980
6:127813645:G:TF332L0.980
6:127813647:A:GF332L0.980
6:127813172:A:GL490P0.979
6:127813541:A:TV367E0.979
6:127813622:C:AG340V0.979
6:127813683:C:GA320P0.979
6:127813339:A:CF434L0.978
6:127813339:A:TF434L0.978
6:127813341:A:GF434L0.978
6:127813607:C:GR345P0.978
6:127813646:A:GF332S0.977
6:127813886:A:TV252D0.977
6:127829511:A:TL225H0.977
6:127829586:A:TV200E0.977
6:127829592:C:GR198T0.977

dbSNP variants (sampled 300 via entrez): RS1000026405 (6:127828020 A>G), RS1000044676 (6:127806568 T>C,G), RS1000044907 (6:127738727 G>A,C), RS1000066 (6:127918267 C>A,G,T), RS1000066316 (6:127770940 A>C,G), RS1000070907 (6:127853259 T>TA), RS1000072260 (6:127728978 AC>A,ACC), RS1000095783 (6:127780768 G>T), RS1000113035 (6:127834191 T>C), RS1000118504 (6:127771293 G>A), RS1000151029 (6:127871887 C>T), RS1000153243 (6:127895590 C>G,T), RS1000156584 (6:127904750 C>T), RS1000167225 (6:127701308 G>A), RS1000188616 (6:127735401 A>C)

Disease associations

OMIM: gene MIM:613607 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

17 associations (top):

StudyTraitp-value
GCST000612_24Celiac disease3.000000e-14
GCST001198_54Multiple sclerosis6.000000e-09
GCST001532_6Immune response to smallpox vaccine (IL-6)2.000000e-07
GCST001920_5QRS duration5.000000e-06
GCST002880_2Recalcitrant atopic dermatitis7.000000e-07
GCST003264_900Post bronchodilator FEV1/FVC ratio5.000000e-06
GCST003264_969Post bronchodilator FEV1/FVC ratio2.000000e-06
GCST003944_14Hepcidin/ferritin ratio1.000000e-06
GCST003944_15Hepcidin/ferritin ratio4.000000e-06
GCST004132_4Crohn’s disease7.000000e-06
GCST006631_43Nicotine dependence and major depression (severity of comorbidity)4.000000e-06
GCST008489_19Celiac disease2.000000e-08
GCST008916_127Asthma1.000000e-11
GCST009597_161Multiple sclerosis1.000000e-09
GCST009798_84Asthma3.000000e-10
GCST012489_81Heel bone mineral density x serum urate levels interaction6.000000e-09
GCST90014325_3Asthma7.000000e-11

EFO canonical traits (9, from GWAS)

EFO IDTrait name
EFO:0004645response to vaccine
EFO:0004873cytokine measurement
EFO:1000651recalcitrant atopic dermatitis
EFO:0004713FEV/FVC ratio
EFO:0007901hepcidin:ferritin ratio
EFO:0007006depressive symptom measurement
EFO:0009262nicotine dependence symptom count
EFO:0004531urate measurement
EFO:0009270heel bone mineral density

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

11 total (human), top 11 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneincreases methylation, increases mutagenesis2
Nickelincreases expression2
triphenyl phosphateaffects expression1
bisphenol Adecreases methylation1
sodium arseniteincreases expression1
2,2’,4,4’,5-brominated diphenyl etherdecreases expression1
Arsenic Trioxidedecreases expression1
Diurondecreases expression1
Zincdecreases expression1
Aflatoxin B1increases methylation1
Antirheumatic Agentsdecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.