SCML2

gene
On this page

Summary

SCML2 (Scm polycomb group protein like 2, HGNC:10581) is a protein-coding gene on chromosome Xp22.13, encoding Sex comb on midleg-like protein 2 (Q9UQR0). Putative Polycomb group (PcG) protein.

This gene encodes a member of the Polycomb group proteins. These proteins form the Polycomb repressive complexes which are involved in transcriptional repression. The encoded protein binds histone peptides that are monomethylated at lysine residues and may be involved in regulating homeotic gene expression during development.

Source: NCBI Gene 10389 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 180 total — 1 likely-pathogenic
  • MANE Select transcript: NM_006089

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:10581
Approved symbolSCML2
NameScm polycomb group protein like 2
LocationXp22.13
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000102098
Ensembl biotypeprotein_coding
OMIM300208
Entrez10389

Gene structure

Transcript identifiers

Ensembl transcripts: 8 — 7 protein_coding, 1 protein_coding_CDS_not_defined

ENST00000251900, ENST00000398048, ENST00000420857, ENST00000491988, ENST00000665583, ENST00000926833, ENST00000926834, ENST00000926835

RefSeq mRNA: 1 — MANE Select: NM_006089 NM_006089

CCDS: CCDS14185

Canonical transcript exons

ENST00000251900 — 15 exons

ExonStartEnd
ENSE000006661371824243918242590
ENSE000013415441835459218354688
ENSE000016320321824657718246828
ENSE000016924591826017118260291
ENSE000017209431825684818257030
ENSE000017220151832490718324977
ENSE000017408721833058718330655
ENSE000017615901832385918324093
ENSE000017768911824776918247882
ENSE000017809511830497218305215
ENSE000018010551825804418258247
ENSE000018029081832033218320420
ENSE000022856301833405018334095
ENSE000035454771823931318241379
ENSE000036641531826558518265802

Expression profiles

Bgee: expression breadth ubiquitous, 228 present calls, max score 96.57.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 4.9471 / max 165.9154, expressed in 1093 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
1985894.94711093

Top tissues by expression

251 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
olfactory bulbUBERON:000226496.57silver quality
diaphragmUBERON:000110393.79silver quality
type B pancreatic cellCL:000016993.77silver quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099191.43gold quality
parotid glandUBERON:000183188.65gold quality
pancreatic ductal cellCL:000207988.25silver quality
biceps brachiiUBERON:000150788.10silver quality
oocyteCL:000002388.06gold quality
adrenal tissueUBERON:001830387.29gold quality
mucosa of urinary bladderUBERON:000125987.24silver quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047387.18gold quality
heart right ventricleUBERON:000208085.90silver quality
skeletal muscle tissue of biceps brachiiUBERON:000450285.84silver quality
upper leg skinUBERON:000426285.70gold quality
tibialis anteriorUBERON:000138585.44silver quality
male germ cellCL:000001585.14silver quality
middle temporal gyrusUBERON:000277184.84silver quality
nephron tubuleUBERON:000123184.69silver quality
endothelial cellCL:000011584.58silver quality
adult organismUBERON:000702384.01gold quality
spermCL:000001983.85silver quality
tendon of biceps brachiiUBERON:000818883.73silver quality
seminal vesicleUBERON:000099883.63silver quality
esophagus squamous epitheliumUBERON:000692083.62silver quality
jejunal mucosaUBERON:000039983.45silver quality
epithelium of esophagusUBERON:000197683.19silver quality
secondary oocyteCL:000065583.00gold quality
pigmented layer of retinaUBERON:000178282.92silver quality
Brodmann (1909) area 23UBERON:001355482.83silver quality
colonic mucosaUBERON:000031782.82gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no5.25

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

161 targeting SCML2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-5692A100.0074.406850
HSA-MIR-5011-5P100.0083.465820
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-3163100.0077.238605
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-29A-3P100.0073.111835
HSA-MIR-29B-3P100.0073.181833
HSA-MIR-29C-3P100.0073.151833
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-340-5P100.0072.504437
HSA-MIR-366299.9973.825684
HSA-MIR-371B-5P99.9975.344759
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-34A-5P99.9971.211784
HSA-MIR-449A99.9971.051776
HSA-MIR-186-5P99.9970.833707
HSA-MIR-428299.9975.366408
HSA-MIR-480399.9871.993117
HSA-MIR-373-5P99.9875.364753
HSA-MIR-616-5P99.9875.584775
HSA-MIR-485-3P99.9870.681585
HSA-MIR-539-3P99.9870.741616
HSA-MIR-3688-3P99.9772.022834
HSA-MIR-34C-5P99.9770.451577
HSA-MIR-449B-5P99.9770.261580
HSA-MIR-314899.9775.066478

Literature-anchored findings (GeneRIF, showing 6)

  • 1.78-A crystal structure of the two MBT repeats of SCML2 show significant structural similarities to the Tudor, PWWP, and chromo domains, suggesting probable evolutionary relationships and functional similarities between the MBT repeats and these domains. (PMID:12952983)
  • The malignant brain tumor repeats of SCML2 preferentially bind histone peptides monomethylated at lysine residues. The crystal structure of the complex reveals that the modified amino acid binds to an aromatic rich pocket. (PMID:18706910)
  • Because PRC1 complexes localize to the promoters of a specific subset of developmental genes in vivo, the SLED domain of Scml2 may provide an important link connecting the PRC1 complexes to their target genes. (PMID:24727478)
  • SCML2A, an SCML2 isoform tightly associated to chromatin, contributes to PRC1 localization and also directly enforces repression of certain Polycomb target genes. (PMID:24986859)
  • SCML2 directly interacts with USP7 and drives its localization. SCML2 connects USP7 to PRC1.4, allowing for the stabilization of BMI1, and USP7 is found on SCML2 and BMI1 target genes. (PMID:25605328)
  • Sex Comb on Midleg Like-2 Accelerates Hepatocellular Carcinoma Cell Proliferation and Metastasis by Activating Wnt/beta-Catenin/EMT Signaling. (PMID:34816637)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_rerioscml2ENSDARG00000012949
mus_musculusScml2ENSMUSG00000000037
rattus_norvegicusScml2ENSRNOG00000003726
drosophila_melanogasterl(3)mbtFBGN0002441
drosophila_melanogasterSfmbtFBGN0032475
caenorhabditis_eleganslin-61WBGENE00003041
caenorhabditis_elegansmbtr-1WBGENE00021661

Paralogs (18): SCMH1 (ENSG00000010803), MBTD1 (ENSG00000011258), SCML1 (ENSG00000047634), L3MBTL2 (ENSG00000100395), PHC1 (ENSG00000111752), THAP10 (ENSG00000129028), PHC2 (ENSG00000134686), SAMD1 (ENSG00000141858), SCML4 (ENSG00000146285), L3MBTL4 (ENSG00000154655), SFMBT1 (ENSG00000163935), PHC3 (ENSG00000173889), L3MBTL1 (ENSG00000185513), SAMD7 (ENSG00000187033), SAMD11 (ENSG00000187634), SFMBT2 (ENSG00000198879), L3MBTL3 (ENSG00000198945), SAMD13 (ENSG00000203943)

Protein

Protein identifiers

Sex comb on midleg-like protein 2Q9UQR0 (reviewed: Q9UQR0)

All UniProt accessions (3): Q9UQR0, B4DRC2, H0Y6S1

UniProt curated annotations — full annotation on UniProt →

Function. Putative Polycomb group (PcG) protein. PcG proteins act by forming multiprotein complexes, which are required to maintain the transcriptionally repressive state of homeotic genes throughout development.

Subcellular location. Nucleus.

Tissue specificity. Highly expressed in placenta, thymus and testis. Detected at lower levels in brain, liver, skeletal muscle, pancreas and ovary.

Similarity. Belongs to the SCM family.

Isoforms (2)

UniProt IDNamesCanonical?
Q9UQR0-11yes
Q9UQR0-22

RefSeq proteins (1): NP_006080* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001660SAMDomain
IPR004092MbtDomain
IPR013761SAM/pointed_sfHomologous_superfamily
IPR021987SLEDDomain
IPR033763SCML2_RBRDomain
IPR038348SLED_sfHomologous_superfamily
IPR047531SAM_Scm-likeDomain
IPR050548PcG_chromatin_remod_factorsFamily

Pfam: PF00536, PF02820, PF12140, PF17208

UniProt features (60 total): strand 16, modified residue 14, helix 12, compositionally biased region 5, cross-link 4, region of interest 4, repeat 2, chain 1, domain 1, splice variant 1

Structure

Experimental structures (PDB)

5 structures.

PDBMethodResolution (Å)
2BIVX-RAY DIFFRACTION1.7
1OI1X-RAY DIFFRACTION1.78
2VYTX-RAY DIFFRACTION1.9
4EDUX-RAY DIFFRACTION2.58
2MEMSOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9UQR0-F169.430.44

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (18): 256, 261, 267, 299, 300, 305, 499, 503, 511, 522, 570, 583, 590, 594, 518, 536, 599, 605

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 145 (showing top): TGGTGCT_MIR29A_MIR29B_MIR29C, CMYB_01, CTATGCA_MIR153, DODD_NASOPHARYNGEAL_CARCINOMA_UP, DANG_BOUND_BY_MYC, AP2GAMMA_01, MODULE_204, GOMF_CHROMATIN_BINDING, GOCC_PCG_PROTEIN_COMPLEX, CTGAGCC_MIR24, BENPORATH_MYC_MAX_TARGETS, GOMF_HISTONE_BINDING, GCACTTT_MIR175P_MIR20A_MIR106A_MIR106B_MIR20B_MIR519D, ATAAGCT_MIR21, MIKKELSEN_ES_ICP_WITH_H3K4ME3

GO Biological Process (3): anatomical structure morphogenesis (GO:0009653), negative regulation of DNA-templated transcription (GO:0045892), regulation of DNA-templated transcription (GO:0006355)

GO Molecular Function (3): chromatin binding (GO:0003682), histone binding (GO:0042393), protein binding (GO:0005515)

GO Cellular Component (2): nucleus (GO:0005634), PcG protein complex (GO:0031519)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
DNA-templated transcription2
binding2
developmental process1
anatomical structure development1
regulation of DNA-templated transcription1
negative regulation of RNA biosynthetic process1
regulation of gene expression1
regulation of RNA biosynthetic process1
protein binding1
intracellular membrane-bounded organelle1
nuclear protein-containing complex1

Protein interactions and networks

STRING

995 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SCML2PPEF1O14829845
SCML2MRPL3P09001769
SCML2BMI1P35226765
SCML2R4GMX3R4GMX3765
SCML2PRTN3P15637743
SCML2CBX4O00257740
SCML2ARSFP54793719
SCML2PCGF2P35227689
SCML2CDKL5O76039679
SCML2PCGF1Q9BSM1651
SCML2RING1Q06587640
SCML2HORMAD2Q8N7B1627
SCML2HORMAD1Q86X24616
SCML2PCGF6Q9BYE7604
SCML2RYBPQ8N488597

IntAct

57 interactions, top by confidence:

ABTypeScore
CDK2CDKN1Apsi-mi:“MI:0914”(association)0.980
CCNE2CDK2psi-mi:“MI:0915”(physical association)0.940
CFTRESYT2psi-mi:“MI:2364”(proximity)0.710
USP7BCORpsi-mi:“MI:0914”(association)0.660
NCK2SCML2psi-mi:“MI:0915”(physical association)0.560
LNX1SCML2psi-mi:“MI:0915”(physical association)0.560
CAB39LSCML2psi-mi:“MI:0915”(physical association)0.560
SFMBT2SCML2psi-mi:“MI:0915”(physical association)0.560
SRGAP2BSCML2psi-mi:“MI:0915”(physical association)0.560
RNF4SCML2psi-mi:“MI:0915”(physical association)0.560
USP7MCM4psi-mi:“MI:0914”(association)0.530
POTEB3POTEFpsi-mi:“MI:0914”(association)0.530
CBX6IGF2BP3psi-mi:“MI:0914”(association)0.530
SCML2CDK2psi-mi:“MI:0915”(physical association)0.500
AURKACCNB2psi-mi:“MI:0915”(physical association)0.400
CCNB2CCNE2psi-mi:“MI:0915”(physical association)0.400
RNF2AKIP1psi-mi:“MI:0914”(association)0.350
CBX8CMSS1psi-mi:“MI:0914”(association)0.350
Cbx7E2F6psi-mi:“MI:0914”(association)0.350
USP7STILpsi-mi:“MI:0914”(association)0.350
Ring1CENPXpsi-mi:“MI:0914”(association)0.350
ESR1ESYT2psi-mi:“MI:0914”(association)0.350
USP7psi-mi:“MI:0914”(association)0.350
SCMH1CBX4psi-mi:“MI:0914”(association)0.350
SCML2CSTApsi-mi:“MI:0914”(association)0.350
H2BC21SMCHD1psi-mi:“MI:0914”(association)0.350
POTEB3POTEBpsi-mi:“MI:0914”(association)0.350

BioGRID (104): SCML2 (Affinity Capture-MS), BAG1 (Co-fractionation), USP7 (Affinity Capture-Western), SCML2 (Affinity Capture-Western), SCML2 (Reconstituted Complex), USP7 (Reconstituted Complex), SCML2 (Affinity Capture-Western), SCML2 (Affinity Capture-Western), SCML2 (Affinity Capture-MS), SCML2 (Affinity Capture-MS), SCML2 (Affinity Capture-MS), SCML2 (Affinity Capture-MS), SCML2 (Affinity Capture-MS), SCML2 (Affinity Capture-MS), SCML2 (Affinity Capture-MS)

ESM2 similar proteins: A8MYV0, D3Z8X7, D3ZR10, D3ZUI5, F8VPQ2, M0R2J8, O35867, O54916, O75128, O94988, O95810, Q08D35, Q13625, Q149C2, Q3U1C4, Q3UH68, Q3UHI4, Q3UMF0, Q53SF7, Q5DU00, Q5JSH3, Q5JV73, Q5NBX1, Q5R9S0, Q5XIN3, Q63679, Q6AY22, Q6IRU7, Q6NVE8, Q6P1H6, Q6PGZ3, Q6PL24, Q7TP65, Q7TQ95, Q80U16, Q8BGI4, Q8CG79, Q8HYW0, Q8TDR0, Q92833

Diamond homologs: A2A5N8, B1B1A0, D3YUG0, D3YXK1, D3ZWK4, E1C2V1, O02274, O60284, O95251, P39769, P59178, P70047, P70475, P78364, P97500, Q01538, Q05BQ5, Q1JQD9, Q1RNF8, Q29L50, Q32N90, Q3MIF2, Q4V7W5, Q5DTW2, Q5R737, Q5SVQ0, Q5VUG0, Q5VXD3, Q64028, Q6DIN3, Q6P5G3, Q6SPE9, Q6SPF0, Q7Z3H4, Q80TY4, Q810T5, Q8BLB7, Q8C8Y5, Q8CFC2, Q8CHP6

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 58 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
RUNX1 interacts with co-factors whose precise effect on RUNX1 targets is not known541.7×2e-05
Regulation of PTEN gene transcription629.7×2e-05
Cellular Senescence622.9×2e-05
Differentiation of naive CD4+ T cells to T helper 2 cells (Th2 cells)520.3×2e-04
Intracellular signaling by second messengers615.2×1e-04
Oxidative Stress Induced Senescence615.1×1e-04
PIP3 activates AKT signaling713.0×6e-05
Cell Cycle Checkpoints512.3×7e-04

GO biological processes:

GO termPartnersFoldFDR
G1/S transition of mitotic cell cycle521.3×1e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

180 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic1
Uncertain significance39
Likely benign5
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
564795GRCh37/hg19 Xp22.13(chrX:18275593-18653790)x1Likely pathogenic

SpliceAI

3406 predictions. Top by Δscore:

VariantEffectΔscore
X:18241376:TTTC:Tacceptor_gain1.0000
X:18241379:CCTA:Cacceptor_loss1.0000
X:18241381:T:Aacceptor_loss1.0000
X:18242437:A:ACdonor_gain1.0000
X:18242438:C:CCdonor_gain1.0000
X:18246743:T:TAdonor_gain1.0000
X:18247880:TTG:Tacceptor_gain1.0000
X:18247881:TG:Tacceptor_gain1.0000
X:18247883:C:CCacceptor_gain1.0000
X:18247890:C:CTacceptor_gain1.0000
X:18247891:A:Tacceptor_gain1.0000
X:18256842:TCTCA:Tdonor_loss1.0000
X:18256843:CTCA:Cdonor_loss1.0000
X:18256845:CA:Cdonor_loss1.0000
X:18256846:A:ATdonor_loss1.0000
X:18256847:C:Tdonor_loss1.0000
X:18256847:CCTGA:Cdonor_gain1.0000
X:18257026:GGAGG:Gacceptor_gain1.0000
X:18257027:GAGG:Gacceptor_gain1.0000
X:18257028:AGG:Aacceptor_gain1.0000
X:18257029:GG:Gacceptor_gain1.0000
X:18257029:GGCTA:Gacceptor_loss1.0000
X:18257030:GC:Gacceptor_loss1.0000
X:18257031:C:CAacceptor_loss1.0000
X:18257031:C:CCacceptor_gain1.0000
X:18257034:T:TCacceptor_gain1.0000
X:18258055:T:TAdonor_gain1.0000
X:18258204:C:CTacceptor_gain1.0000
X:18258224:CA:Cacceptor_gain1.0000
X:18258225:A:Cacceptor_gain1.0000

AlphaMissense

4602 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
X:18256997:A:GL436P0.999
X:18305127:G:TA192E0.999
X:18305164:C:GA180P0.999
X:18305169:A:GL178P0.999
X:18242522:A:GW631R0.998
X:18242522:A:TW631R0.998
X:18256997:A:TL436H0.998
X:18258049:A:TI423K0.998
X:18258142:A:GL392P0.998
X:18258205:A:GL371P0.998
X:18258241:A:TV359D0.998
X:18305017:A:GW229R0.998
X:18305017:A:TW229R0.998
X:18305128:C:GA192P0.998
X:18320395:C:AW141C0.998
X:18320395:C:GW141C0.998
X:18320397:A:GW141R0.998
X:18320397:A:TW141R0.998
X:18323969:C:GR96P0.998
X:18324002:A:TV85D0.998
X:18324008:G:TA83D0.998
X:18242449:A:GF655S0.997
X:18256997:A:CL436R0.997
X:18258091:A:TV409D0.997
X:18258166:C:TG384D0.997
X:18258167:C:GG384R0.997
X:18258181:A:GL379P0.997
X:18305012:A:CC230W0.997
X:18305097:A:TV202D0.997
X:18305163:G:TA180D0.997

dbSNP variants (sampled 300 via entrez): RS1000013675 (X:18247530 A>G), RS1000075187 (X:18298441 C>G), RS1000083478 (X:18247138 G>A), RS1000144884 (X:18310326 C>T), RS1000170210 (X:18282291 C>T), RS1000187737 (X:18288480 G>C), RS1000211313 (X:18301688 A>G), RS1000221114 (X:18281145 C>G,T), RS1000295223 (X:18283427 T>C), RS1000305660 (X:18256471 C>A,T), RS1000307596 (X:18301370 G>C), RS1000377213 (X:18292930 A>T), RS1000439925 (X:18310558 G>A), RS1000558558 (X:18283993 A>G), RS1000568408 (X:18335347 T>C)

Disease associations

OMIM: gene MIM:300208 | disease phenotypes: MIM:300672

GenCC curated gene-disease

Mondo (1): developmental and epileptic encephalopathy, 2 (MONDO:0010396)

Orphanet (3): Early infantile developmental and epileptic encephalopathy (Orphanet:1934), West syndrome (Orphanet:3451), CDKL5-deficiency disorder (Orphanet:505652)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

MeSH disease descriptors (1)

DescriptorNameTree numbers
C564064CDKL5 deficiency disorder (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

28 total (human), top 28 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arsenitedecreases expression, increases expression2
Air Pollutantsincreases abundance, increases expression2
Benzo(a)pyreneaffects methylation2
Tetrachlorodibenzodioxinaffects expression, decreases expression2
aristolochic acid Idecreases expression1
FR900359affects phosphorylation1
dicrotophosincreases expression1
beauvericinaffects cotreatment, decreases expression1
potassium chromate(VI)increases expression1
perfluorooctane sulfonic aciddecreases expression1
enniatinsaffects cotreatment, decreases expression1
Leflunomidedecreases expression1
Acetaminophenincreases expression1
Caffeineaffects phosphorylation1
Calcitrioldecreases expression, affects cotreatment1
Coalincreases expression, increases abundance1
Enzyme Inhibitorsdecreases activity, increases O-linked glycosylation1
Quercetindecreases expression1
Seleniumdecreases expression1
Smokeincreases abundance, increases expression1
Testosteroneaffects cotreatment, decreases expression1
Tobacco Smoke Pollutiondecreases expression1
Tretinoindecreases expression1
Valproic Acidincreases methylation1
Vanadatesdecreases expression1
Cyclosporinedecreases expression1
Gold Compoundsdecreases expression1
Particulate Matterincreases abundance, increases expression1

Cellosaurus cell lines

2 cell lines: 2 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_TK14HAP1 SCML2 (-) 1Cancer cell lineMale
CVCL_XS50HAP1 SCML2 (-) 2Cancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.