SMC2

gene
On this page

Also known as hCAP-ECAP-E

Summary

SMC2 (structural maintenance of chromosomes 2, HGNC:14011) is a protein-coding gene on chromosome 9q31.1, encoding Structural maintenance of chromosomes protein 2 (O95347). Central component of the condensin complex, a complex required for conversion of interphase chromatin into mitotic-like condense chromosomes. It is a common-essential gene (DepMap: required in 99.8% of cancer cell lines).

Predicted to enable chromatin binding activity. Involved in mitotic chromosome condensation and positive regulation of chromosome condensation. Located in condensed chromosome; cytoplasm; and nuclear lumen. Part of condensin complex. Implicated in colon adenocarcinoma. Biomarker of colorectal cancer.

Source: NCBI Gene 10592 — RefSeq curated summary.

At a glance

  • GWAS associations: 14
  • Clinical variants (ClinVar): 191 total
  • Druggable target: yes — 2 molecules with ChEMBL bioactivity
  • Cancer dependency (DepMap): dependent in 99.8% of screened cell lines (common-essential)
  • MANE Select transcript: NM_006444

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:14011
Approved symbolSMC2
Namestructural maintenance of chromosomes 2
Location9q31.1
Locus typegene with protein product
StatusApproved
AliaseshCAP-E, CAP-E
Ensembl geneENSG00000136824
Ensembl biotypeprotein_coding
OMIM605576
Entrez10592

Gene structure

Transcript identifiers

Ensembl transcripts: 15 — 14 protein_coding, 1 nonsense_mediated_decay

ENST00000286398, ENST00000374787, ENST00000374793, ENST00000440179, ENST00000493955, ENST00000886042, ENST00000886043, ENST00000926170, ENST00000926171, ENST00000926172, ENST00000926173, ENST00000926174, ENST00000956306, ENST00000956307, ENST00000956308

RefSeq mRNA: 4 — MANE Select: NM_006444 NM_001042550, NM_001042551, NM_001265602, NM_006444

CCDS: CCDS35086

Canonical transcript exons

ENST00000374793 — 25 exons

ExonStartEnd
ENSE00000926624104132009104132125
ENSE00000926629104126641104126784
ENSE00000926634104116200104116319
ENSE00000926638104111581104111814
ENSE00000926641104100389104100433
ENSE00000926645104096148104096297
ENSE00000926646104095324104095552
ENSE00001138020104124912104125105
ENSE00001138026104123108104123232
ENSE00001138038104114691104114829
ENSE00001144192104134415104134575
ENSE00001304760104118171104118375
ENSE00001307115104099644104099682
ENSE00001307256104113316104113475
ENSE00001310886104120027104120162
ENSE00001322979104100093104100203
ENSE00001324511104113964104114081
ENSE00001330797104102424104102573
ENSE00001421187104094309104094477
ENSE00001464668104127286104127480
ENSE00001897927104139139104141419
ENSE00003499790104098446104098568
ENSE00003534259104129645104129845
ENSE00003583373104138018104138165
ENSE00003789764104101960104102193

Expression profiles

Bgee: expression breadth ubiquitous, 265 present calls, max score 95.92.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 13.0780 / max 354.5567, expressed in 1649 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
9777910.65581585
977802.3581909
977810.064120

Top tissues by expression

289 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
ventricular zoneUBERON:000305395.92gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047395.02gold quality
ganglionic eminenceUBERON:000402391.77gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099190.34gold quality
embryoUBERON:000092287.56gold quality
calcaneal tendonUBERON:000370186.48gold quality
secondary oocyteCL:000065585.32gold quality
spermCL:000001984.27gold quality
trabecular bone tissueUBERON:000248383.39gold quality
bone marrowUBERON:000237182.50gold quality
testisUBERON:000047382.38gold quality
bone marrow cellCL:000209282.02gold quality
adrenal tissueUBERON:001830382.00gold quality
bone elementUBERON:000147481.98gold quality
rectumUBERON:000105281.85gold quality
vermiform appendixUBERON:000115481.53gold quality
right testisUBERON:000453481.52gold quality
male germ cellCL:000001581.38gold quality
left testisUBERON:000453380.94gold quality
stromal cell of endometriumCL:000225580.69gold quality
lymph nodeUBERON:000002980.05gold quality
oocyteCL:000002379.41gold quality
hair follicleUBERON:000207378.92silver quality
tonsilUBERON:000237278.65gold quality
caecumUBERON:000115378.33gold quality
islet of LangerhansUBERON:000000677.93gold quality
endometriumUBERON:000129577.82gold quality
esophagus squamous epitheliumUBERON:000692077.67gold quality
gingival epitheliumUBERON:000194977.63gold quality
esophagus mucosaUBERON:000246977.16gold quality

Single-cell (SCXA)

Detected in 15 experiment(s), a significant marker in 15.

ExperimentMarker?Max mean expression
E-MTAB-8559yes492.85
E-MTAB-8530yes259.77
E-CURD-114yes64.31
E-CURD-112yes48.36
E-MTAB-9467yes30.42
E-HCAD-10yes24.01
E-CURD-122yes23.61
E-GEOD-125970yes23.16
E-HCAD-13yes22.82
E-HCAD-5yes20.62
E-HCAD-1yes18.25
E-MTAB-6678yes9.72
E-ANND-3yes8.66
E-MTAB-10553yes7.56
E-CURD-88yes6.61

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): E2F4, TCF7L2

miRNA regulators (miRDB)

110 targeting SMC2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-190A-3P100.0080.355520
HSA-MIR-5011-5P100.0083.465820
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-126-5P100.0072.713180
HSA-MIR-3163100.0077.238605
HSA-LET-7A-3P100.0074.033932
HSA-LET-7B-3P100.0074.083913
HSA-LET-7F-1-3P100.0074.023928
HSA-MIR-98-3P100.0074.083907
HSA-MIR-340-5P100.0072.504437
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-150-5P99.9966.691976
HSA-MIR-477599.9875.006394
HSA-MIR-56899.9869.862084
HSA-MIR-548AJ-3P99.9673.385345
HSA-MIR-548X-3P99.9673.385345
HSA-MIR-551B-5P99.9671.283493
HSA-MIR-570-3P99.9672.414910
HSA-MIR-568899.9673.234504
HSA-MIR-495-3P99.9672.814197
HSA-MIR-96-5P99.9572.802140
HSA-MIR-651-3P99.9473.485177
HSA-MIR-548J-3P99.9472.614881
HSA-MIR-548AE-3P99.9372.664867
HSA-MIR-548AH-3P99.9372.544872
HSA-MIR-548AM-3P99.9372.544872
HSA-MIR-548AQ-3P99.9372.664867
HSA-MIR-381-3P99.9371.872854
HSA-MIR-30099.9271.762856
HSA-MIR-1213399.9271.822006

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 99.8% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 10)

  • mutation analysis of hCAP-C & hCAP-E in leukemia-lymphoma cell lines including 8 pyothorax-associated lymphoma (PAL); 3 of 8 PAL cell lines have hCAP-C or hCAP-E heterozygous mutations, indicating condensin mutations are relatively frequent in PAL (PMID:17488335)
  • human SMC2 hinge domain with short coiled coils was cloned, expressed, purified and crystallized in the orthorhombic space group C222 in native and SeMet-derivatized forms (PMID:20823528)
  • Studies indicate that tagged SMC2 subunit of the condensin complex with the streptavidin-Binding Peptide optimising efficacious use of this tag for protein analysis. (PMID:21194474)
  • WNT signaling can directly activate SMC2 transcription as a key player in the mitotic cell division machinery. (PMID:23095742)
  • SMC2 gene is altered by both frameshift mutation and loss of expression in gastric and colorectal cancers, suggesting that SMC2 gene alterations might be involved in pathogenesis of these cancers (PMID:24483990)
  • SMC2 (or the condensin complex) is a novel molecular target for the treatment of MYCN-amplified neuroblastoma. (PMID:24553121)
  • analysis of functional differences of SMC2 and SMC4 hinge domain between condensins and cohesin in DNA recognition (PMID:26491021)
  • Rs3818626 and rs9895829 were significantly associated with SMC2 and TP53 messenger RNA expression. (PMID:30794721)
  • Structural maintenance of chromosomes 2 is identified as an oncogene in bladder cancer in vitro and in vivo. (PMID:31986889)
  • SMC2 knockdown inhibits malignant progression of lung adenocarcinoma by upregulating BTG2 expression. (PMID:38729325)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriosmc2ENSDARG00000017744
mus_musculusSmc2ENSMUSG00000028312
rattus_norvegicusSmc2ENSRNOG00000022325
drosophila_melanogasterSMC2FBGN0027783
caenorhabditis_elegansWBGENE00003367

Paralogs (7): SMC1A (ENSG00000072501), SMC1B (ENSG00000077935), SMC3 (ENSG00000108055), SMC4 (ENSG00000113810), CKAP4 (ENSG00000136026), CCDC122 (ENSG00000151773), CCDC157 (ENSG00000187860)

Protein

Protein identifiers

Structural maintenance of chromosomes protein 2O95347 (reviewed: O95347)

Alternative names: Chromosome-associated protein E, XCAP-E homolog

All UniProt accessions (3): A0A0C4DGE5, O95347, Q5T821

UniProt curated annotations — full annotation on UniProt →

Function. Central component of the condensin complex, a complex required for conversion of interphase chromatin into mitotic-like condense chromosomes. The condensin complex probably introduces positive supercoils into relaxed DNA in the presence of type I topoisomerases and converts nicked DNA into positive knotted forms in the presence of type II topoisomerases.

Subunit / interactions. Forms a heterodimer with SMC4. Component of the condensin complex, which contains the SMC2 and SMC4 heterodimer, and three non SMC subunits that probably regulate the complex: BRRN1/CAPH, CNAP1/CAPD2 and CAPG. Interacts with BRD4 (isoform B), leading to insulate chromatin from DNA damage response pathway.

Subcellular location. Nucleus. Cytoplasm. Chromosome.

Domain organisation. The SMC hinge domain, which separates the large intramolecular coiled coil regions, allows the heterodimerization with SMC4, forming a V-shaped heterodimer.

Similarity. Belongs to the SMC family. SMC2 subfamily.

Isoforms (2)

UniProt IDNamesCanonical?
O95347-11yes
O95347-22

RefSeq proteins (4): NP_001036015, NP_001036016, NP_001252531, NP_006435* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR003395RecF/RecN/SMC_NDomain
IPR010935SMC_hingeDomain
IPR024704SMCFamily
IPR027120Smc2_ABCDomain
IPR027417P-loop_NTPaseHomologous_superfamily
IPR036277SMC_hinge_sfHomologous_superfamily

Pfam: PF02463, PF06470

UniProt features (43 total): helix 14, strand 9, modified residue 5, sequence conflict 3, coiled-coil region 3, turn 3, splice variant 2, chain 1, domain 1, sequence variant 1, binding site 1

Structure

Experimental structures (PDB)

2 structures.

PDBMethodResolution (Å)
4U4PX-RAY DIFFRACTION1.89
9F5WELECTRON MICROSCOPY7.5

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O95347-F183.420.21

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (1): 32–39

Post-translational modifications (5): 1160, 114, 222, 677, 1158

Function

Pathways and Gene Ontology

Reactome pathways

7 pathways

IDPathway
R-HSA-2299718Condensation of Prophase Chromosomes
R-HSA-2514853Condensation of Prometaphase Chromosomes
R-HSA-1640170Cell Cycle
R-HSA-68875Mitotic Prophase
R-HSA-68877Mitotic Prometaphase
R-HSA-68886M Phase
R-HSA-69278Cell Cycle, Mitotic

MSigDB gene sets: 376 (showing top): GNF2_CKS1B, GOBP_MEIOTIC_CHROMOSOME_SEGREGATION, GOBP_CHROMOSOME_ORGANIZATION, WU_APOPTOSIS_BY_CDKN1A_VIA_TP53, E2F_Q4_01, RODRIGUES_THYROID_CARCINOMA_ANAPLASTIC_UP, GNF2_CENPF, RODRIGUES_THYROID_CARCINOMA_POORLY_DIFFERENTIATED_UP, CROONQUIST_NRAS_SIGNALING_DN, MITSIADES_RESPONSE_TO_APLIDIN_DN, GOBP_CHROMOSOME_SEPARATION, GOBP_POSITIVE_REGULATION_OF_ORGANELLE_ORGANIZATION, KAUFFMANN_DNA_REPAIR_GENES, KONG_E2F3_TARGETS, GOBP_CHROMOSOME_CONDENSATION

GO Biological Process (12): mitotic chromosome condensation (GO:0007076), meiotic chromosome condensation (GO:0010032), meiotic chromosome segregation (GO:0045132), cell division (GO:0051301), kinetochore organization (GO:0051383), positive regulation of chromosome segregation (GO:0051984), positive regulation of chromosome separation (GO:1905820), positive regulation of chromosome condensation (GO:1905821), nuclear division (GO:0000280), chromosome condensation (GO:0030261), chromosome organization (GO:0051276), nuclear chromosome segregation (GO:0098813)

GO Molecular Function (6): chromatin binding (GO:0003682), single-stranded DNA binding (GO:0003697), ATP binding (GO:0005524), ATP hydrolysis activity (GO:0016887), nucleotide binding (GO:0000166), protein binding (GO:0005515)

GO Cellular Component (12): nuclear chromosome (GO:0000228), chromatin (GO:0000785), condensed chromosome (GO:0000793), condensed nuclear chromosome (GO:0000794), condensin complex (GO:0000796), nucleus (GO:0005634), nucleoplasm (GO:0005654), nucleolus (GO:0005730), cytoplasm (GO:0005737), cytosol (GO:0005829), extracellular exosome (GO:0070062), chromosome (GO:0005694)

Reactome top-level categories

Rollup of top-5 pathways:

CategoryPathways
M Phase2
Mitotic Prophase1
Mitotic Prometaphase1
Cell Cycle, Mitotic1
Cell Cycle1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
chromosome4
cellular anatomical structure4
chromosome condensation3
nuclear lumen3
chromosome organization2
chromosome segregation2
positive regulation of cell cycle process2
binding2
nucleus2
intracellular membraneless organelle2
mitotic sister chromatid segregation1
mitotic cell cycle1
mitotic cell cycle process1
meiotic cell cycle1
chromosome organization involved in meiotic cell cycle1
nuclear chromosome segregation1
meiotic nuclear division1
meiotic cell cycle process1
cellular process1
regulation of chromosome segregation1
chromosome separation1
regulation of chromosome separation1
regulation of chromosome condensation1
positive regulation of chromosome organization1
organelle fission1
organelle organization1
DNA binding1
adenyl ribonucleotide binding1
purine ribonucleoside triphosphate binding1
ribonucleoside triphosphate phosphatase activity1
ATP-dependent activity1
nucleoside phosphate binding1
heterocyclic compound binding1
nuclear chromosome1
condensed chromosome1
protein-containing complex1
intracellular membrane-bounded organelle1
intracellular anatomical structure1
cytoplasm1
extracellular vesicle1

Protein interactions and networks

STRING

3351 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SMC2SMC4Q9NTJ3978
SMC2NCAPD2Q15021965
SMC2NCAPHQ15003911
SMC2NCAPGQ9BPX3874
SMC2NCAPG2Q86XI2649
SMC2TOP2AP11388643
SMC2SMC5Q8IY18626
SMC2SMC1AQ14683619
SMC2NCAPD3P42695607
SMC2NCAPH2Q6IBW4592
SMC2SMC3Q9UQE7572
SMC2RAD21O60216540
SMC2SMC6Q96SB8536
SMC2BRD4O60885515
SMC2KIF4AO95239449

IntAct

188 interactions, top by confidence:

ABTypeScore
NCAPHSMC2psi-mi:“MI:0915”(physical association)0.810
RFXANKRFXAPpsi-mi:“MI:0914”(association)0.780
GATAD2ACDK2AP1psi-mi:“MI:0914”(association)0.730
NCAPH2SMC2psi-mi:“MI:0915”(physical association)0.720
NCAPH2SMC2psi-mi:“MI:0914”(association)0.720
CFTRESYT2psi-mi:“MI:2364”(proximity)0.710
HLA-ATAPBPpsi-mi:“MI:0915”(physical association)0.690
SMC4SMC2psi-mi:“MI:0915”(physical association)0.690
SMC2SMC4psi-mi:“MI:0915”(physical association)0.690
IFT22IFT56psi-mi:“MI:0914”(association)0.640
CHEK2PPM1Gpsi-mi:“MI:0914”(association)0.560
FOXP3FOXP2psi-mi:“MI:0914”(association)0.530
TUBB3POTEFpsi-mi:“MI:0914”(association)0.530
XPO1psi-mi:“MI:0914”(association)0.530
ARRDC4WWP2psi-mi:“MI:0914”(association)0.530
CXCR4TMEM120Bpsi-mi:“MI:0914”(association)0.530
EPHA1EXOC5psi-mi:“MI:0914”(association)0.530
BMP1TLL1psi-mi:“MI:0914”(association)0.530
FAM174ABLTP3Bpsi-mi:“MI:0914”(association)0.530
DPEP1ILVBLpsi-mi:“MI:0914”(association)0.530
FOXM1PES1psi-mi:“MI:0914”(association)0.500
DDX21MED19psi-mi:“MI:2364”(proximity)0.480
HLA-ASMC2psi-mi:“MI:0915”(physical association)0.400
KIF4BSMC2psi-mi:“MI:0915”(physical association)0.400
MAP7D2SMC2psi-mi:“MI:0915”(physical association)0.400
JMJD1CSMC2psi-mi:“MI:0915”(physical association)0.400

BioGRID (465): SMC2 (Affinity Capture-MS), SMC2 (Affinity Capture-MS), SMC2 (Affinity Capture-MS), SMC2 (Affinity Capture-MS), SMC2 (Affinity Capture-MS), SMC2 (Affinity Capture-MS), SMC2 (Affinity Capture-MS), ADAR (Co-fractionation), SMC2 (Co-fractionation), SMC2 (Co-fractionation), SMC2 (Co-fractionation), SMC2 (Co-fractionation), SMC2 (Co-fractionation), SMC2 (Co-fractionation), SMC2 (Co-fractionation)

ESM2 similar proteins: A0A1L8GVF0, A0A1L8GXM0, A0A8M9PQ61, A2ZAC2, G0SHW7, G5EG17, O14777, O44199, O94383, O95347, P12753, P38989, P41003, P48996, P53692, P70388, Q09591, Q10173, Q12267, Q12749, Q196W6, Q336R3, Q4R630, Q503N2, Q54PK4, Q5U4X5, Q6DRJ7, Q6GQ71, Q6P9I7, Q6Q1P4, Q76I89, Q76I90, Q7ZW63, Q802R8, Q8AWF4, Q8AWF5, Q8CG48, Q90988, Q924W5, Q92878

Diamond homologs: A3PMS2, A9II65, B8CW13, B8GZ28, B9E1H0, C4ZJU1, D4GUK1, E1X022, O66878, O95347, P15016, P32908, P41508, P47540, P48996, P50532, P50533, P75361, Q12267, Q1INB1, Q20060, Q24U48, Q54PK4, Q59037, Q5H054, Q5N0D2, Q604U6, Q69GZ5, Q6N1B7, Q6Q1P4, Q7NG51, Q7ZAK1, Q81ZL2, Q8CG47, Q8CG48, Q8KBS6, Q8NDV3, Q8REH4, Q8TZY2, Q90988

SIGNOR signaling

2 interactions.

AEffectBMechanism
SMC2“form complex”“Condensin I”binding
SMC2“form complex”“Condensin II”binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 223 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

GO biological processes:

GO termPartnersFoldFDR
mitotic chromosome condensation525.4×3e-04
cell surface receptor protein tyrosine kinase signaling pathway98.0×4e-04

Disease & clinical

Clinical variants and AI predictions

ClinVar

191 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance155
Likely benign4
Benign5

Top pathogenic / likely-pathogenic (0)

SpliceAI

3200 predictions. Top by Δscore:

VariantEffectΔscore
9:104098564:TGCAG:Tdonor_loss1.0000
9:104098565:GCAGG:Gdonor_loss1.0000
9:104098566:CAG:Cdonor_loss1.0000
9:104098567:AGGT:Adonor_loss1.0000
9:104099681:AG:Adonor_loss1.0000
9:104099683:G:Adonor_loss1.0000
9:104100088:TCCA:Tacceptor_loss1.0000
9:104100089:CCA:Cacceptor_loss1.0000
9:104100090:CA:Cacceptor_loss1.0000
9:104100091:A:AGacceptor_gain1.0000
9:104100091:A:Tacceptor_loss1.0000
9:104100092:G:GAacceptor_gain1.0000
9:104100092:G:GTacceptor_loss1.0000
9:104100092:GA:Gacceptor_gain1.0000
9:104100092:GAT:Gacceptor_gain1.0000
9:104100092:GATT:Gacceptor_gain1.0000
9:104100092:GATTT:Gacceptor_gain1.0000
9:104100177:G:GTdonor_gain1.0000
9:104100177:GAGGC:Gdonor_loss1.0000
9:104100180:GC:Gdonor_gain1.0000
9:104100200:GACG:Gdonor_gain1.0000
9:104100203:GGT:Gdonor_loss1.0000
9:104100205:T:Cdonor_loss1.0000
9:104100431:G:GTdonor_gain1.0000
9:104100431:GAG:Gdonor_gain1.0000
9:104101956:TCA:Tacceptor_loss1.0000
9:104101958:A:ACacceptor_loss1.0000
9:104101958:A:AGacceptor_gain1.0000
9:104101958:AG:Aacceptor_gain1.0000
9:104101959:G:GCacceptor_loss1.0000

AlphaMissense

7960 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
9:104095415:T:CF11L1.000
9:104095417:C:AF11L1.000
9:104095417:C:GF11L1.000
9:104095418:A:GK12E1.000
9:104095420:G:CK12N1.000
9:104095420:G:TK12N1.000
9:104095478:G:CG32R1.000
9:104095479:G:AG32D1.000
9:104095484:A:GN34D1.000
9:104095486:T:AN34K1.000
9:104095486:T:GN34K1.000
9:104095487:G:CG35R1.000
9:104095487:G:TG35C1.000
9:104095488:G:AG35D1.000
9:104095493:G:AG37R1.000
9:104095493:G:CG37R1.000
9:104095493:G:TG37W1.000
9:104095494:G:AG37E1.000
9:104095494:G:TG37V1.000
9:104095496:A:CK38Q1.000
9:104095497:A:TK38I1.000
9:104095499:T:CS39P1.000
9:104095500:C:AS39Y1.000
9:104095500:C:TS39F1.000
9:104095504:C:AN40K1.000
9:104095504:C:GN40K1.000
9:104098568:G:CQ147H1.000
9:104098568:G:TQ147H1.000
9:104138083:T:CL1112P1.000
9:104138085:G:CD1113H1.000

dbSNP variants (sampled 300 via entrez): RS1000090236 (9:104088160 A>C,G), RS1000101023 (9:104086703 C>T), RS1000177297 (9:104102340 G>A), RS1000182516 (9:104093953 G>A,C), RS1000205059 (9:104131694 A>G), RS1000259871 (9:104130674 G>A), RS1000297002 (9:104107844 C>G,T), RS1000347323 (9:104136857 G>A), RS1000356440 (9:104137289 A>G), RS1000399123 (9:104102683 G>A,C), RS1000408055 (9:104099125 A>G), RS1000409269 (9:104102973 G>A,C), RS1000429436 (9:104136936 T>A), RS1000561053 (9:104125802 C>G,T), RS1000608862 (9:104104696 T>C)

Disease associations

OMIM: gene MIM:605576 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

14 associations (top):

StudyTraitp-value
GCST001082_2Orofacial clefts (maternal alcohol consumption interaction)3.000000e-07
GCST002345_9Response to cytadine analogues (cytosine arabinoside)5.000000e-07
GCST004250_21Alanine aminotransferase (ALT) levels after remission induction therapy in actute lymphoblastic leukemia (ALL)2.000000e-06
GCST005434_23Pancreatic cancer7.000000e-06
GCST005434_28Pancreatic cancer6.000000e-07
GCST007235_8Pancreatic ductal adenocarcinoma3.000000e-07
GCST008870_35Keratinocyte cancer (MTAG)4.000000e-08
GCST008871_30Basal cell carcinoma9.000000e-11
GCST009310_28Sensorimotor dexterity2.000000e-07
GCST010135_30Oily fish consumption4.000000e-09
GCST010140_20Pork consumption4.000000e-09
GCST010241_100Apolipoprotein A1 levels1.000000e-12
GCST010242_439HDL cholesterol levels2.000000e-11
GCST90013410_48Basal cell carcinoma2.000000e-09

EFO canonical traits (7, from GWAS)

EFO IDTrait name
EFO:0003959cleft lip
EFO:0007965response to combination chemotherapy
EFO:0010176keratinocyte carcinoma
EFO:0008354cognitive function measurement
EFO:0008111diet measurement
EFO:0004614apolipoprotein A 1 measurement
EFO:0004612high density lipoprotein cholesterol measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL4105890 (SINGLE PROTEIN)

Molecules with ChEMBL bioactivity

2 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 12,282 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL1614701SELUMETINIB410,221
CHEMBL494089GSK-69069312,061

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

6 potent at pChembl≥5 of 6 total, top 6 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
7.95Kd11.26nMCHEMBL5653589
7.95ED5011.26nMCHEMBL5653589
6.68Kd209nMSELUMETINIB
5.96Kd1106nMGSK-690693
5.25Kd5620nMCHEMBL3752910
5.25ED505620nMCHEMBL3752910

PubChem BioAssay actives

4 with measured affinity, of 243 total; 4 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
4-methyl-3-[(2-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2149445: Binding affinity to human SMC2 incubated for 45 mins by Kinobead based pull down assaykd0.0113uM
Selumetinib1425173: Kinobeads (epsilon), multiple immobilized ATP-competitive broad spectrum kinase inhibitors, used to assess residual binding of ~300 proteins simultaneously from cell lysate in the presence of a compound. Quantitative readout performed by mass spectrometry.kd0.2090uM
4-[2-(4-amino-1,2,5-oxadiazol-3-yl)-1-ethyl-7-[[(3S)-piperidin-3-yl]methoxy]imidazo[4,5-c]pyridin-4-yl]-2-methylbut-3-yn-2-ol1425173: Kinobeads (epsilon), multiple immobilized ATP-competitive broad spectrum kinase inhibitors, used to assess residual binding of ~300 proteins simultaneously from cell lysate in the presence of a compound. Quantitative readout performed by mass spectrometry.kd1.1060uM
4-methyl-3-[(1-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2149445: Binding affinity to human SMC2 incubated for 45 mins by Kinobead based pull down assaykd5.6201uM

CTD chemical–gene interactions

59 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects expression, decreases expression, decreases methylation, increases expression4
Benzo(a)pyrenedecreases expression, increases expression, increases methylation3
Estradiolincreases expression3
Cyclosporinedecreases expression3
sodium arsenitedecreases expression, affects cotreatment, increases abundance2
perfluorooctane sulfonic aciddecreases expression2
Resveratrolaffects cotreatment, increases expression2
Acetaminophendecreases expression, increases expression2
aristolochic acid Idecreases expression1
dicrotophosdecreases expression1
testosterone enanthateaffects expression1
triphenyl phosphateaffects expression1
propionaldehydedecreases expression1
bisphenol Aincreases expression1
sodium arsenatedecreases expression1
cobaltous chloridedecreases expression1
perfluorooctanoic aciddecreases expression1
manganese chlorideaffects cotreatment, decreases expression, increases abundance1
potassium chromate(VI)affects cotreatment, decreases expression1
2,3-bis(3’-hydroxybenzyl)butyrolactoneincreases expression, affects cotreatment1
epigallocatechin gallateaffects cotreatment, decreases expression1
chromium hexavalent iondecreases expression1
abrinedecreases expression1
incobotulinumtoxinAdecreases expression1
Dasatinibdecreases expression1
Sunitinibdecreases expression1
Troglitazonedecreases expression1
Air Pollutantsdecreases expression, increases abundance1
Arsenicdecreases expression, increases abundance, affects cotreatment1
Azathioprinedecreases expression1

ChEMBL screening assays

2 unique, capped per target: 2 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL3991886BindingKinobeads (epsilon), multiple immobilized ATP-competitive broad spectrum kinase inhibitors, used to assess residual binding of ~300 proteins simultaneously from cell lysate in the presence of a compound. Quantitative readout performed by maThe target landscape of clinical kinase drugs. — Science

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.