SIM2

gene
On this page

Also known as MGC119447bHLHe15

Summary

SIM2 (SIM bHLH transcription factor 2, HGNC:10883) is a protein-coding gene on chromosome 21q22.13, encoding Single-minded homolog 2 (Q14190). Transcription factor that may be a master gene of CNS development in cooperation with Arnt.

This gene represents a homolog of the Drosophila single-minded (sim) gene, which encodes a transcription factor that is a master regulator of neurogenesis. The encoded protein is ubiquitinated by RING-IBR-RING-type E3 ubiquitin ligases, including the parkin RBR E3 ubiquitin protein ligase. This gene maps within the so-called Down syndrome chromosomal region, and is thus thought to contribute to some specific Down syndrome phenotypes. Alternative splicing of this gene results in multiple transcript variants.

Source: NCBI Gene 6493 — RefSeq curated summary.

At a glance

  • GWAS associations: 10
  • Clinical variants (ClinVar): 104 total — 1 likely-pathogenic
  • Transcription factor: yes — 12 downstream targets (CollecTRI)
  • MANE Select transcript: NM_005069

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:10883
Approved symbolSIM2
NameSIM bHLH transcription factor 2
Location21q22.13
Locus typegene with protein product
StatusApproved
AliasesMGC119447, bHLHe15
Ensembl geneENSG00000159263
Ensembl biotypeprotein_coding
OMIM600892
Entrez6493

Gene structure

Transcript identifiers

Ensembl transcripts: 5 — 3 protein_coding, 1 protein_coding_CDS_not_defined, 1 retained_intron

ENST00000290399, ENST00000431229, ENST00000460783, ENST00000481185, ENST00000483178

RefSeq mRNA: 2 — MANE Select: NM_005069 NM_005069, NM_009586

CCDS: CCDS13646

Canonical transcript exons

ENST00000290399 — 11 exons

ExonStartEnd
ENSE000010443823674766536749917
ENSE000012845953674472836745136
ENSE000034589943674338736743555
ENSE000034743903674171736741864
ENSE000035586013672304536723130
ENSE000035759173671982136719929
ENSE000035799963669911536699921
ENSE000035989473672611936726318
ENSE000036214943673104536731151
ENSE000036326483671253336712622
ENSE000036353703670916836709250

Expression profiles

Bgee: expression breadth ubiquitous, 135 present calls, max score 95.91.

FANTOM5 (CAGE): breadth broad, TPM avg 4.4001 / max 278.0239, expressed in 643 samples.

FANTOM5 promoters (6 alternative TSS)

Promoter IDTPM avgSamples expressed
1890032.8303542
1890020.9623446
1890040.3629212
1890050.141576
1890060.068719
1890070.03448

Top tissues by expression

261 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
renal medullaUBERON:000036295.91gold quality
lower esophagus mucosaUBERON:003583493.17gold quality
esophagus squamous epitheliumUBERON:000692092.67gold quality
buccal mucosa cellCL:000233692.18gold quality
metanephros cortexUBERON:001053390.84gold quality
esophagus mucosaUBERON:000246990.60gold quality
epithelium of esophagusUBERON:000197690.25gold quality
oral cavityUBERON:000016780.15gold quality
amniotic fluidUBERON:000017379.50gold quality
squamous epitheliumUBERON:000691479.22gold quality
vastus lateralisUBERON:000137978.91silver quality
tongue squamous epitheliumUBERON:000691977.99silver quality
adult mammalian kidneyUBERON:000008277.94gold quality
superior surface of tongueUBERON:000737177.49gold quality
tibiaUBERON:000097977.14gold quality
quadriceps femorisUBERON:000137776.94silver quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047376.39gold quality
kidneyUBERON:000211375.08gold quality
triceps brachiiUBERON:000150974.55gold quality
tongueUBERON:000172374.47gold quality
gluteal muscleUBERON:000200073.63gold quality
nippleUBERON:000203073.03gold quality
fundus of stomachUBERON:000116072.61gold quality
tendon of biceps brachiiUBERON:000818871.70silver quality
body of stomachUBERON:000116171.34gold quality
stomachUBERON:000094571.17gold quality
cardia of stomachUBERON:000116270.94gold quality
muscle organUBERON:000163070.55gold quality
medial globus pallidusUBERON:000247770.22silver quality
body of tongueUBERON:001187669.63gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-CURD-119yes57.96
E-ANND-3yes10.20

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

12 targets.

TargetRegulation
ARNT
BCL2Repression
CASP3Activation
CSTB
EPO
HIF1ARepression
MYOD1
MYOM2Activation
PIAS1
PRKNUnknown
SIM2
SNAI2Unknown

Upstream regulators (CollecTRI, top): CTNNB1, MYB, MYC, SIM2

miRNA regulators (miRDB)

28 targeting SIM2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3692-3P99.9870.272139
HSA-MIR-449399.9066.48977
HSA-MIR-548AJ-5P99.7871.123085
HSA-MIR-548G-5P99.7871.123085
HSA-MIR-548X-5P99.7871.123085
HSA-MIR-548F-5P99.7871.023093
HSA-MIR-1260A99.6166.671098
HSA-MIR-1260B99.6166.671098
HSA-MIR-451699.6167.783390
HSA-MIR-372-5P99.4169.112299
HSA-MIR-6852-5P99.1766.692073
HSA-MIR-429299.1665.571767
HSA-MIR-6791-5P99.1665.921844
HSA-MIR-66199.0965.942062
HSA-MIR-6770-5P98.9766.761853
HSA-MIR-126798.2469.05837
HSA-MIR-66597.6065.641781
HSA-MIR-6802-3P97.2965.42613
HSA-MIR-428697.2064.371587
HSA-MIR-3189-3P96.8066.34896
HSA-MIR-552-3P96.6864.121026
HSA-MIR-7108-5P96.4266.17598
HSA-MIR-6726-5P95.9763.72841
HSA-MIR-92095.9763.95811
HSA-MIR-430095.8564.561003
HSA-MIR-5591-5P95.8564.761002
HSA-MIR-874-3P95.0265.66806
HSA-MIR-6774-3P89.1465.2068

Literature-anchored findings (GeneRIF, showing 32)

  • Stage-specific expression of the sim2-short-form protein was seen in normal matched paraffin sections of the colon tumors. In a matched set of tissues of Benign Prostatic Hyperplasia (BPH) and prostate carcinomas, sim2-s expression was detected in the BPH (PMID:12530058)
  • Isoform-specific expression of SIM2 short-form (SIM2-s) was seen selectively in colon, prostate, and pancreatic carcinomas. (PMID:12676991)
  • Tissue- and tumor-specific expression of both SIM2 isoforms (SIM2-s and SIM2-l), but not SIM1, were detected in the pancreatic tumor models. (PMID:14550949)
  • SIM1 and SIM2 have a novel nuclear localization signal (PMID:14697214)
  • a novel NXF signaling system on neural gene promoter may be a molecular target of the adverse effects of Sim2 in the mental retardation of Down’s syndrome (PMID:14701734)
  • Potential role of SIM2 in central nervous system development. SIM2 overexpression could participate in pathogenesis of mental retardation in Down syndrome. (PMID:15946822)
  • provide a direct link between SIM2-s and differentiation, and may provide a model to identify SIM2-s targets (PMID:16129820)
  • The expression of a splice variant of SIM2, SIM2s, is lost or reduced in human breast cancers. Inhibits growth and invasiveness of breast cancer cells upon ectopic expression. Represses MMP3 expression by direct binding to the MMP3 promoter. (PMID:16840439)
  • Knockdown of SIM2s in MCF-7 breast cancer cells contributed to an epithelial-mesenchymal transition associated with increased MMP2 and slug levels. (PMID:18160708)
  • Ha-Ras transformation of MCF10A cells leads to repression of Singleminded-2s through NOTCH and C/EBPbeta. (PMID:19169276)
  • SIM2s’ functional interference with HIF1alpha activities on BNIP3 may indicate a novel role for SIM2s in promoting tumourigenesis. (PMID:19668230)
  • The SIM2 is expressed in prostate cancer(PCa) and that anti-SIM2 antibodies are detectable in PCa patients’ sera implicate SIM2 as a PCa-associated antigen that is a suitable potential target for PCa immunotherapy. (PMID:19737960)
  • Genetic variants of SIM2 gene may not be associated with the susceptibility to congenital scoliosis and different clinical phenotypes of CS in Chinese Han population. (PMID:20137643)
  • Significantly lower frequency of SIM2 C-G haplotype (rs2073601-rs2073416) was noticed in individuals with Down syndrome (P value =0.01669) and their fathers (P value=0.01185) (PMID:22048266)
  • These results suggest an involvement of SIM2 in key traits of prostate tumor cell biology. (PMID:22174909)
  • role for SIM2s in promoting human breast tumor differentiation and maintaining epithelial integrity. (PMID:22777354)
  • Altered expression of SIM2 and ETS2 could be one of the reasons for variable occurrence of different malignant conditions in Down syndrome. (PMID:23343470)
  • miR-200a is downregulated in human glioma and inhibition of miR-200a caused upregulation of SIM2-s in T98G cells and promoted their motility. (PMID:24162743)
  • a single peptide containing multiple SIM2 epitopes can be used to induce both a CD4 and CD8 T cell response, providing a peptide-based vaccine formulation for potential use in immunotherapy of various cancers. (PMID:24690990)
  • Data suggest selected SIM1 variants exhibit poor dimerization with ARNT2 (aryl-hydrocarbon receptor nuclear translocator 2) and anomalous intracellular localization; data were used to predict spot in SIM1/SIM2 (residues 290-326) critical in function. (PMID:24814368)
  • Data showed that SIM2 was highly expressed in human glioma tumors and cell lines and its short form might increase the invasion of glioma cells through the alteration of epithelial-mesenchymal transition. (PMID:24909296)
  • In SH-SY5Y cells, ethanol exposure increased Sim2 expression in a manner similar to that of cleaved caspase 3. PKA activation was required for this. Sim2 might be involved in activating caspase 3 and ethanol-induced apoptosis in SH-SY5Y cells. (PMID:25319570)
  • Two genes, OR51E2 and SIM2, and two miRNAs, miR-200c and miR-200b, showed significant association with prostate cancer. (PMID:28910345)
  • attenuates resistance to hypoxia and tumor growth by transcriptional suppression of HIF1A in uterine cervical squamous cell carcinoma (PMID:29109451)
  • SIM2 increases CRT sensitivity through tumor differentiation by cooperation with ARNT. (PMID:29427302)
  • TMEM75 functions as an oncogene relying on activation of SIM2 in colorectal cancer. (PMID:29964097)
  • SIM2s interacts with ATM and is stabilized through ATM-dependent phosphorylation in response to IR. (PMID:30531838)
  • these results show a role for SIM2s in the resolution of replication stress and further characterize the necessity of SIM2s for effective RAD51 loading in response to DNA damage or stress, ultimately promoting genomic integrity and thus preventing the accumulation of cancer-promoting mutations. (PMID:31775907)
  • These findings support a role for SIM2s in the prevention of breast cancer progression through its ability to repress PTGS2 expression via modulating the NFkappaB signaling pathway. (PMID:31783895)
  • CD24 and CK4 are upregulated by SIM2, and are predictive biomarkers for chemoradiotherapy and surgery in esophageal cancer. (PMID:32124945)
  • Long noncoding RNA CASC15 facilitates esophageal squamous cell carcinoma tumorigenesis via decreasing SIM2 stability via FTOmediated demethylation. (PMID:33650646)
  • Characterization of functionally deficient SIM2 variants found in patients with neurological phenotypes. (PMID:35730699)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
mus_musculusSim2ENSMUSG00000062713
rattus_norvegicusSim2ENSRNOG00000054203
drosophila_melanogastertrhFBGN0262139
drosophila_melanogastersimaFBGN0266411
caenorhabditis_elegansWBGENE00001851

Paralogs (7): HIF1A (ENSG00000100644), SIM1 (ENSG00000112246), EPAS1 (ENSG00000116016), HIF3A (ENSG00000124440), NPAS1 (ENSG00000130751), NPAS3 (ENSG00000151322), NPAS4 (ENSG00000174576)

Protein

Protein identifiers

Single-minded homolog 2Q14190 (reviewed: Q14190)

Alternative names: Class E basic helix-loop-helix protein 15

All UniProt accessions (3): Q14190, H7BZX8, V9GY04

UniProt curated annotations — full annotation on UniProt →

Function. Transcription factor that may be a master gene of CNS development in cooperation with Arnt. It may have pleiotropic effects in the tissues expressed during development.

Subunit / interactions. Efficient DNA binding requires dimerization with another bHLH protein. Heterodimer of SIM2 and ARNT.

Subcellular location. Nucleus.

Isoforms (2)

UniProt IDNamesCanonical?
Q14190-1SIM2yes
Q14190-2SIM2S

RefSeq proteins (2): NP_005060, NP_033664 (=MANE)

Domains & families (InterPro)

IDNameType
IPR000014PASDomain
IPR001610PACRepeat
IPR010578SIM_CDomain
IPR011598bHLH_domDomain
IPR013655PAS_fold_3Domain
IPR013767PAS_foldDomain
IPR035965PAS-like_dom_sfHomologous_superfamily
IPR036638HLH_DNA-bd_sfHomologous_superfamily

Pfam: PF00989, PF06621, PF08447, PF23171

UniProt features (36 total): mutagenesis site 18, domain 5, sequence conflict 3, region of interest 3, compositionally biased region 2, splice variant 2, chain 1, sequence variant 1, short sequence motif 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q14190-F163.050.36

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Mutagenesis-validated functional residues (18):

PositionPhenotype
367reduced nuclear translocation.
368no effect on nuclear translocation.
369no effect on nuclear translocation.
370no effect on nuclear translocation.
371no effect on nuclear translocation.
372no effect on nuclear translocation.
373reduced nuclear translocation.
375no effect on nuclear translocation.
376no effect on nuclear translocation.
377no effect on nuclear translocation.
378no effect on nuclear translocation.
379no effect on nuclear translocation.
380no effect on nuclear translocation.
381no effect on nuclear translocation.
382no effect on nuclear translocation.
383no effect on nuclear translocation.
385reduced nuclear translocation.
386reduced nuclear translocation.

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 138 (showing top): GCACCTT_MIR18A_MIR18B, BENPORATH_ES_WITH_H3K27ME3, MORF_MSH3, GCANCTGNY_MYOD_Q6, MODULE_16, MORF_RAD51L3, TANG_SENESCENCE_TP53_TARGETS_UP, GOBP_EMBRYONIC_PATTERN_SPECIFICATION, CREIGHTON_ENDOCRINE_THERAPY_RESISTANCE_5, GOBP_RESPIRATORY_SYSTEM_DEVELOPMENT, GOBP_EMBRYO_DEVELOPMENT, MORF_ATF2, chr21q22, DANG_BOUND_BY_MYC, MORF_BCL2L11

GO Biological Process (9): negative regulation of transcription by RNA polymerase II (GO:0000122), regulation of transcription by RNA polymerase II (GO:0006357), nervous system development (GO:0007399), embryonic pattern specification (GO:0009880), cell differentiation (GO:0030154), lung development (GO:0030324), regulation of DNA-templated transcription (GO:0006355), negative regulation of DNA-templated transcription (GO:0045892), system development (GO:0048731)

GO Molecular Function (7): RNA polymerase II transcription regulatory region sequence-specific DNA binding (GO:0000977), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA binding (GO:0003677), DNA-binding transcription factor activity (GO:0003700), protein heterodimerization activity (GO:0046982), protein binding (GO:0005515), protein dimerization activity (GO:0046983)

GO Cellular Component (4): chromatin (GO:0000785), nucleus (GO:0005634), nucleoplasm (GO:0005654), nuclear body (GO:0016604)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
regulation of DNA-templated transcription3
regulation of transcription by RNA polymerase II2
transcription by RNA polymerase II2
DNA-templated transcription2
transcription cis-regulatory region binding2
cellular anatomical structure2
negative regulation of DNA-templated transcription1
system development1
pattern specification process1
embryo development1
cellular developmental process1
respiratory tube development1
animal organ development1
respiratory system development1
regulation of gene expression1
regulation of RNA biosynthetic process1
negative regulation of RNA biosynthetic process1
multicellular organism development1
anatomical structure development1
chromatin1
RNA polymerase II transcription regulatory region sequence-specific DNA binding1
DNA-binding transcription factor activity1
nucleic acid binding1
transcription regulator activity1
protein dimerization activity1
binding1
protein binding1
chromosome1
intracellular membrane-bounded organelle1
nuclear lumen1
nucleoplasm1
intracellular membraneless organelle1

Protein interactions and networks

STRING

778 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SIM2ARNTP27540601
SIM2OTPQ5XKR4568
SIM2THSD4Q6ZMP0529
SIM2NPAS4Q8IUM7512
SIM2PIAS1O75925508
SIM2SIX3O95343502
SIM2ACTBP02570490
SIM2ARNT2Q9HBZ2485
SIM2TRHP20396467
SIM2CTSEP14091460
SIM2SUMO1P55856448
SIM2SUMO2P55855444
SIM2HIF1AQ16665444
SIM2TRIM5Q9C035443
SIM2ROS1P08922422

IntAct

38 interactions, top by confidence:

ABTypeScore
SIM2PTGES3psi-mi:“MI:0915”(physical association)0.570
FOXR2NME2P1psi-mi:“MI:0914”(association)0.530
SIM2Dlg4psi-mi:“MI:0407”(direct interaction)0.440
HSP90AB1SIM2psi-mi:“MI:0915”(physical association)0.400
SIM2FKBP5psi-mi:“MI:0915”(physical association)0.400
MLF1SIM2psi-mi:“MI:0915”(physical association)0.400
Hacd3SIM2psi-mi:“MI:0915”(physical association)0.400
MLF2SIM2psi-mi:“MI:0915”(physical association)0.400
SIM2HSP90AB1psi-mi:“MI:0915”(physical association)0.400
SIM2BAG4psi-mi:“MI:0915”(physical association)0.400
SIM2NUDCpsi-mi:“MI:0915”(physical association)0.400
BAG2SIM2psi-mi:“MI:0915”(physical association)0.400
SIM2psi-mi:“MI:0915”(physical association)0.400
HSPB1SIM2psi-mi:“MI:0915”(physical association)0.400
NUDCD3SIM2psi-mi:“MI:0915”(physical association)0.400
SIM2psi-mi:“MI:0915”(physical association)0.400
SIM2PSMD2psi-mi:“MI:0915”(physical association)0.400
SGTASIM2psi-mi:“MI:0915”(physical association)0.400
SIM2STIP1psi-mi:“MI:0915”(physical association)0.400
FKBPLSIM2psi-mi:“MI:0915”(physical association)0.400
SIM2STUB1psi-mi:“MI:0915”(physical association)0.400
CACYBPSIM2psi-mi:“MI:0915”(physical association)0.400
SIM2AIPpsi-mi:“MI:0915”(physical association)0.400
AARSD1SIM2psi-mi:“MI:0915”(physical association)0.400
SIM2PPP5Cpsi-mi:“MI:0915”(physical association)0.400
SIM2TTC1psi-mi:“MI:0915”(physical association)0.400
FOXN1FOXN1psi-mi:“MI:0914”(association)0.350
SIM2HSPA8psi-mi:“MI:0914”(association)0.350
ARNT2FNTApsi-mi:“MI:0914”(association)0.350

BioGRID (52): SIM2 (Affinity Capture-MS), SIM2 (Affinity Capture-MS), SIM2 (Affinity Capture-MS), SIM2 (Affinity Capture-MS), ARNT (Affinity Capture-MS), ARNT2 (Affinity Capture-MS), MAGED1 (Affinity Capture-MS), MATR3 (Affinity Capture-MS), MAGED1 (Affinity Capture-Western), SIM2 (Affinity Capture-Western), SIM2 (Affinity Capture-Western), ARNT (FRET), MAGED1 (FRET), ARNT (Reconstituted Complex), ARNT2 (Reconstituted Complex)

ESM2 similar proteins: A1YFY6, A2T6X9, A6H7I8, B2RUJ5, F1M5F3, F1N2W9, O35430, O35431, O95487, O95628, O95644, P0C6S7, P14316, P17863, P22681, P22682, P23798, P23906, P35227, P81133, P98084, Q02410, Q0IHY4, Q13469, Q14190, Q14432, Q1L994, Q3UR85, Q52L14, Q5CD77, Q5RD33, Q60591, Q61045, Q61079, Q66JB6, Q69ZT9, Q6NRE7, Q6QB00, Q8BIZ1, Q8BT14

Diamond homologs: A1YFY6, A2T6X9, A9YTQ3, O09000, O35800, P05709, P81133, P97459, P97481, Q0PGG7, Q0VBL6, Q14190, Q16665, Q24119, Q24167, Q309Z6, Q61045, Q61079, Q61221, Q8IXF0, Q98SJ5, Q98SW2, Q99742, Q99814, Q9I8A9, Q9JHS1, Q9JHS2, Q9QZQ0, Q9XTA5, Q9Y2N7, Q9YIB9, E7FFX1, O02747, O44712, O57539, O61734, P30561, P35869, P41738, P70365

SIGNOR signaling

1 interactions.

AEffectBMechanism
SIM2“up-regulates activity”ARNTbinding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 33 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
HSP90 chaperone cycle for steroid hormone receptors (SHR) in the presence of ligand544.0×8e-06
Cellular responses to stress58.4×4e-03
Cellular responses to stimuli57.2×7e-03

GO biological processes:

GO termPartnersFoldFDR
protein folding1035.6×3e-11
protein stabilization511.5×4e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

104 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic1
Uncertain significance84
Likely benign5
Benign3

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
1334899NM_005069.6(SIM2):c.461A>G (p.Tyr154Cys)Likely pathogenic

SpliceAI

2379 predictions. Top by Δscore:

VariantEffectΔscore
21:36699919:AAG:Adonor_loss1.0000
21:36699921:GG:Gdonor_loss1.0000
21:36699923:T:Adonor_loss1.0000
21:36712832:T:Aacceptor_gain1.0000
21:36718661:G:GTdonor_gain1.0000
21:36726117:A:AGacceptor_gain1.0000
21:36726117:AGG:Aacceptor_loss1.0000
21:36726118:G:Cacceptor_loss1.0000
21:36726118:G:GGacceptor_gain1.0000
21:36726314:TCCAG:Tdonor_loss1.0000
21:36726315:CCAG:Cdonor_loss1.0000
21:36726316:CAGG:Cdonor_loss1.0000
21:36726318:GGTG:Gdonor_loss1.0000
21:36726320:T:Adonor_loss1.0000
21:36731147:CCTCC:Cdonor_gain1.0000
21:36731152:G:GGdonor_gain1.0000
21:36741715:A:AGacceptor_gain1.0000
21:36741716:G:GGacceptor_gain1.0000
21:36741716:GT:Gacceptor_gain1.0000
21:36741837:GCATC:Gdonor_gain1.0000
21:36741842:G:GGdonor_gain1.0000
21:36747663:A:AGacceptor_gain1.0000
21:36747664:G:GGacceptor_gain1.0000
21:36747664:GC:Gacceptor_gain1.0000
21:36712833:G:Aacceptor_gain0.9900
21:36723043:A:AGacceptor_gain0.9900
21:36723044:G:GGacceptor_gain0.9900
21:36723044:GA:Gacceptor_gain0.9900
21:36726118:GGTC:Gacceptor_gain0.9900
21:36726258:AC:Adonor_gain0.9900

AlphaMissense

4311 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
21:36699764:G:CK6N1.000
21:36699764:G:TK6N1.000
21:36699772:C:AA9D1.000
21:36699780:A:GR12G1.000
21:36699781:G:CR12T1.000
21:36699781:G:TR12M1.000
21:36699782:G:CR12S1.000
21:36699782:G:TR12S1.000
21:36699783:A:GR13G1.000
21:36699784:G:CR13T1.000
21:36699784:G:TR13M1.000
21:36699785:G:CR13S1.000
21:36699785:G:TR13S1.000
21:36699792:G:AE16K1.000
21:36699794:A:CE16D1.000
21:36699794:A:TE16D1.000
21:36699797:T:AN17K1.000
21:36699797:T:GN17K1.000
21:36699804:T:CF20L1.000
21:36699805:T:CF20S1.000
21:36699805:T:GF20C1.000
21:36699806:T:AF20L1.000
21:36699806:T:GF20L1.000
21:36699814:T:AL23H1.000
21:36699814:T:CL23P1.000
21:36699816:G:CA24P1.000
21:36699817:C:AA24D1.000
21:36699823:T:CL26P1.000
21:36699826:T:AL27H1.000
21:36699826:T:CL27P1.000

dbSNP variants (sampled 300 via entrez): RS1000030674 (21:36706578 G>A,T), RS1000120358 (21:36733228 C>A), RS1000134758 (21:36701851 C>T), RS1000159034 (21:36737822 G>A), RS1000209563 (21:36722257 A>G), RS1000212076 (21:36743916 C>T), RS1000285147 (21:36705742 C>A,T), RS1000314859 (21:36738086 G>A), RS1000321781 (21:36701354 T>C), RS1000387091 (21:36711566 G>A,T), RS1000408836 (21:36733001 C>T), RS1000441853 (21:36705942 T>C,G), RS1000538423 (21:36747280 T>C), RS1000588679 (21:36701161 T>A), RS1000607600 (21:36741994 T>C)

Disease associations

OMIM: gene MIM:600892 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

10 associations (top):

StudyTraitp-value
GCST002322_19Asthma and hay fever2.000000e-06
GCST003542_70Night sleep phenotypes1.000000e-06
GCST004601_205Red blood cell count2.000000e-11
GCST004604_2Hematocrit7.000000e-13
GCST004615_134Hemoglobin concentration2.000000e-11
GCST009203_14Cerebellum cortex volume9.000000e-06
GCST010083_63Hemoglobin levels2.000000e-21
GCST90000025_698Appendicular lean mass8.000000e-21
GCST90002383_321Hematocrit2.000000e-10
GCST90002384_490Hemoglobin1.000000e-09

EFO canonical traits (5, from GWAS)

EFO IDTrait name
EFO:0007827nighttime rest measurement
EFO:0004305erythrocyte count
EFO:0004348hematocrit
EFO:0004509hemoglobin measurement
EFO:0004980appendicular lean mass

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

30 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arseniteaffects methylation, decreases expression, increases abundance, increases expression4
bisphenol Aaffects cotreatment, decreases methylation, increases expression2
Benzo(a)pyreneaffects methylation, increases methylation2
Nickeldecreases expression2
Valproic Acidaffects cotreatment, decreases expression2
geraniolincreases expression1
ethyl-p-hydroxybenzoateincreases expression1
trichostatin Adecreases expression1
arseniteincreases methylation1
benzo(e)pyreneincreases methylation1
potassium chromate(VI)increases expression1
aflatoxin B2increases methylation1
cupric oxidedecreases expression1
S-(1,2-dichlorovinyl)cysteineaffects response to substance, increases expression1
2-palmitoylglycerolincreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression1
abrineincreases expression1
dorsomorphinaffects cotreatment, decreases expression1
jinfukangaffects cotreatment, decreases expression1
Fulvestrantaffects cotreatment, decreases methylation1
Arsenicincreases abundance, increases expression1
Cisplatinaffects cotreatment, decreases expression1
Clozapinedecreases expression1
Estradiolincreases expression1
Lipopolysaccharidesaffects response to substance, increases expression1
Methapyrileneincreases methylation1
Silicon Dioxidedecreases expression1
Smokeincreases expression1
Vanadatesdecreases expression1
Particulate Matterincreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): asthma, seasonal allergic rhinitis