SCML1

gene
On this page

Summary

SCML1 (Scm polycomb group protein like 1, HGNC:10580) is a protein-coding gene on chromosome Xp22.13, encoding Sex comb on midleg-like protein 1 (Q9UN30). Putative Polycomb group (PcG) protein.

Predicted to enable chromatin binding activity and histone binding activity. Predicted to be involved in negative regulation of DNA-templated transcription. Predicted to be located in female germ cell nucleus and male germ cell nucleus. Predicted to be active in nucleus.

Source: NCBI Gene 6322 — RefSeq curated summary.

At a glance

  • GWAS associations: 1
  • Clinical variants (ClinVar): 45 total — 1 pathogenic
  • MANE Select transcript: NM_001037540

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:10580
Approved symbolSCML1
NameScm polycomb group protein like 1
LocationXp22.13
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000047634
Ensembl biotypeprotein_coding
OMIM300227
Entrez6322

Gene structure

Transcript identifiers

Ensembl transcripts: 13 — 11 protein_coding, 2 retained_intron

ENST00000380041, ENST00000380043, ENST00000380045, ENST00000398080, ENST00000419185, ENST00000427362, ENST00000487842, ENST00000895837, ENST00000895838, ENST00000895839, ENST00000895840, ENST00000918259, ENST00000918260

RefSeq mRNA: 4 — MANE Select: NM_001037540 NM_001037535, NM_001037536, NM_001037540, NM_006746

CCDS: CCDS14182, CCDS35210, CCDS35211

Canonical transcript exons

ENST00000380041 — 8 exons

ExonStartEnd
ENSE000022726581774407117744219
ENSE000035323061775325817754985
ENSE000035524611774940017749504
ENSE000036089441774601817746098
ENSE000036367421774989417750293
ENSE000036704801774545617745539
ENSE000036876381775181517751966
ENSE000038414861773749317737680

Expression profiles

Bgee: expression breadth ubiquitous, 275 present calls, max score 96.61.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 11.4092 / max 130.8094, expressed in 1609 samples.

FANTOM5 promoters (5 alternative TSS)

Promoter IDTPM avgSamples expressed
1956744.47851352
1956753.69721304
1956763.06911101
1956730.126439
1956770.038016

Top tissues by expression

286 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
secondary oocyteCL:000065596.61gold quality
oocyteCL:000002395.38gold quality
left ovaryUBERON:000211992.69gold quality
adrenal tissueUBERON:001830391.90gold quality
right testisUBERON:000453491.55gold quality
right ovaryUBERON:000211891.25gold quality
left testisUBERON:000453390.84gold quality
testisUBERON:000047390.44gold quality
ovaryUBERON:000099290.43gold quality
right lobe of liverUBERON:000111490.26gold quality
spermCL:000001989.68gold quality
male germ cellCL:000001589.20gold quality
biceps brachiiUBERON:000150789.18gold quality
liverUBERON:000210788.98gold quality
gastrocnemiusUBERON:000138888.29gold quality
mucosa of stomachUBERON:000119988.23gold quality
endocervixUBERON:000045887.75gold quality
right adrenal gland cortexUBERON:003582787.73gold quality
tibial nerveUBERON:000132387.65gold quality
muscle of legUBERON:000138387.45gold quality
right lungUBERON:000216787.19gold quality
left uterine tubeUBERON:000130387.16gold quality
hindlimb stylopod muscleUBERON:000425286.98gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451186.91gold quality
pigmented layer of retinaUBERON:000178286.77gold quality
muscle organUBERON:000163086.70gold quality
seminal vesicleUBERON:000099886.67gold quality
triceps brachiiUBERON:000150986.66silver quality
subcutaneous adipose tissueUBERON:000219086.60gold quality
left adrenal gland cortexUBERON:003582586.40gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-GEOD-36552yes141.69
E-ANND-3yes7.61

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

128 targeting SCML1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-3163100.0077.238605
HSA-MIR-5692B100.0071.322622
HSA-MIR-5692C100.0071.322622
HSA-MIR-5692A100.0074.406850
HSA-MIR-29A-3P100.0073.111835
HSA-MIR-29B-3P100.0073.181833
HSA-MIR-29C-3P100.0073.151833
HSA-MIR-340-5P100.0072.504437
HSA-MIR-186-5P99.9970.833707
HSA-MIR-4789-3P99.9970.752484
HSA-MIR-548N99.9871.944170
HSA-MIR-1213699.9872.815713
HSA-MIR-539-3P99.9870.741616
HSA-MIR-485-3P99.9870.681585
HSA-MIR-50799.9770.111915
HSA-MIR-60799.9773.625593
HSA-MIR-55799.9670.011640
HSA-MIR-568899.9673.234504
HSA-MIR-495-3P99.9672.814197
HSA-MIR-548AB99.9571.313488
HSA-MIR-55999.9572.283609
HSA-MIR-651-3P99.9473.485177
HSA-MIR-548A-5P99.9471.273482
HSA-MIR-548AD-5P99.9471.233502

Literature-anchored findings (GeneRIF, showing 2)

  • the molecular evolutionary pattern of SCML1 in diverse primate species showed a strong signature of adaptive evolution which is caused by Darwinian positive selection (PMID:18601738)
  • Here, we report a microdeletion of 170,6 Kb at Xp22.13 (17.733.948-17.904.576) (GRCh37/hg19).The microdeletion harbors the NHS, SCLML1, and RAI2 genes and results in a phenotype consistent with Nance-Horan syndromesyndrome and developmental delay (PMID:28464487)

Cross-species orthologs

6 orthologs

OrganismSymbolGene ID
danio_reriophc1ENSDARG00000002971
drosophila_melanogasterl(3)mbtFBGN0002441
drosophila_melanogasterScmFBGN0003334
drosophila_melanogasterSfmbtFBGN0032475
caenorhabditis_eleganslin-61WBGENE00003041
caenorhabditis_elegansmbtr-1WBGENE00021661

Paralogs (18): SCMH1 (ENSG00000010803), MBTD1 (ENSG00000011258), L3MBTL2 (ENSG00000100395), SCML2 (ENSG00000102098), PHC1 (ENSG00000111752), THAP10 (ENSG00000129028), PHC2 (ENSG00000134686), SAMD1 (ENSG00000141858), SCML4 (ENSG00000146285), L3MBTL4 (ENSG00000154655), SFMBT1 (ENSG00000163935), PHC3 (ENSG00000173889), L3MBTL1 (ENSG00000185513), SAMD7 (ENSG00000187033), SAMD11 (ENSG00000187634), SFMBT2 (ENSG00000198879), L3MBTL3 (ENSG00000198945), SAMD13 (ENSG00000203943)

Protein

Protein identifiers

Sex comb on midleg-like protein 1Q9UN30 (reviewed: Q9UN30)

All UniProt accessions (2): Q9UN30, Q5H966

UniProt curated annotations — full annotation on UniProt →

Function. Putative Polycomb group (PcG) protein. PcG proteins act by forming multiprotein complexes, which are required to maintain the transcriptionally repressive state of homeotic genes throughout development. May be involved in spermatogenesis during sexual maturation.

Subcellular location. Nucleus.

Tissue specificity. Ubiquitous. Expressed in fetal and adult tissues.

Similarity. Belongs to the SCM family.

Isoforms (3)

UniProt IDNamesCanonical?
Q9UN30-33yes
Q9UN30-11
Q9UN30-22

RefSeq proteins (4): NP_001032624, NP_001032625, NP_001032629, NP_006737 (=MANE)

Domains & families (InterPro)

IDNameType
IPR001660SAMDomain
IPR013761SAM/pointed_sfHomologous_superfamily
IPR047531SAM_Scm-likeDomain

Pfam: PF00536

UniProt features (7 total): modified residue 2, splice variant 2, chain 1, domain 1, region of interest 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9UN30-F167.080.31

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (2): 138, 238

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 183 (showing top): BEGUM_TARGETS_OF_PAX3_FOXO1_FUSION_UP, HNF3ALPHA_Q6, FOXO1_01, RODWELL_AGING_KIDNEY_NO_BLOOD_DN, LINDGREN_BLADDER_CANCER_CLUSTER_2A_DN, NKX62_Q2, DEURIG_T_CELL_PROLYMPHOCYTIC_LEUKEMIA_DN, FOXJ2_01, HP1SITEFACTOR_Q6, HFH3_01, PYEON_CANCER_HEAD_AND_NECK_VS_CERVICAL_UP, LEE_EARLY_T_LYMPHOCYTE_DN, SCHLOSSER_SERUM_RESPONSE_DN, DODD_NASOPHARYNGEAL_CARCINOMA_UP, BERTUCCI_MEDULLARY_VS_DUCTAL_BREAST_CANCER_UP

GO Biological Process (1): negative regulation of DNA-templated transcription (GO:0045892)

GO Molecular Function (3): chromatin binding (GO:0003682), histone binding (GO:0042393), protein binding (GO:0005515)

GO Cellular Component (1): nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
binding2
DNA-templated transcription1
regulation of DNA-templated transcription1
negative regulation of RNA biosynthetic process1
protein binding1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

519 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SCML1ZNF394Q53GI3402
SCML1NOL10Q9BSC4365
SCML1GSAPA4D1B5334
SCML1FLRT1Q9NZU1311
SCML1WDR41Q9HAD4310
SCML1APOLD1Q96LR9301
SCML1ACOT13Q9NPJ3300
SCML1PDZRN3Q9UPQ7300
SCML1SNTB2Q13425297
SCML1ADIRFQ15847290
SCML1DBPQ10586287
SCML1TFAP4Q01664283
SCML1ESYT1Q9BSJ8275
SCML1HLFQ16534271
SCML1GPR6P46095271

IntAct

38 interactions, top by confidence:

ABTypeScore
CBX8BMI1psi-mi:“MI:0914”(association)0.970
BMI1CBX4psi-mi:“MI:0914”(association)0.900
PTPN3YWHAQpsi-mi:“MI:2364”(proximity)0.850
PHC1CBX4psi-mi:“MI:0914”(association)0.790
PTPN3MCCpsi-mi:“MI:0914”(association)0.660
DYNLL2BLTP3Bpsi-mi:“MI:0914”(association)0.640
CCDC43PNKPpsi-mi:“MI:0914”(association)0.640
PLK1C1orf226psi-mi:“MI:0914”(association)0.560
SCML1ARVCFpsi-mi:“MI:0914”(association)0.530
EPB41L1AP3B1psi-mi:“MI:0914”(association)0.530
EPB41L3AP3B1psi-mi:“MI:0914”(association)0.530
RNPS1CASC3psi-mi:“MI:0914”(association)0.530
CBX6RPS3psi-mi:“MI:0914”(association)0.530
SCML1POTEKPpsi-mi:“MI:0915”(physical association)0.400
PTPN4TPRpsi-mi:“MI:0914”(association)0.350
NPM3KPNA3psi-mi:“MI:0914”(association)0.350
PSKH1AIPpsi-mi:“MI:0914”(association)0.350
AURKBTARS3psi-mi:“MI:0914”(association)0.350
NPM1RPS3Apsi-mi:“MI:0914”(association)0.350
ZCCHC10C1orf226psi-mi:“MI:0914”(association)0.350
RNPS1C1orf226psi-mi:“MI:0914”(association)0.350
CDH5NBASpsi-mi:“MI:0914”(association)0.350
MRFAP1L1MYO9Apsi-mi:“MI:0914”(association)0.350
CBX4DPM1psi-mi:“MI:0914”(association)0.350
PCGF1HSPD1psi-mi:“MI:0914”(association)0.350
PHC2CBX4psi-mi:“MI:0914”(association)0.350
PHC3CBX4psi-mi:“MI:0914”(association)0.350

BioGRID (76): SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), CDADC1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS), SCML1 (Affinity Capture-MS)

ESM2 similar proteins: B0FZN7, B0FZN8, B0FZN9, B0FZP0, B0FZP1, B0FZP2, B0FZP3, O08750, O60381, P01105, P04150, P08235, P10157, P19102, P32314, P46200, Q08D88, Q0P4X6, Q16649, Q1LXZ9, Q29131, Q2KJ34, Q3UPW2, Q3YC04, Q4JM28, Q4V7E1, Q5DTV4, Q5FW38, Q5HYM0, Q5R7I3, Q5R9P5, Q5WM45, Q62661, Q66J36, Q66J77, Q68EL6, Q6NYU3, Q6XLJ0, Q8AYI2, Q8K402

Diamond homologs: B0FZN7, B0FZN8, B0FZN9, B0FZP0, B0FZP1, B0FZP2, B0FZP3, P78364, Q5DTW2, Q5VUG0, Q64028, Q80VG1, Q8CHP6, Q8K214, Q8N228, Q8NDX5, Q8QHL5, Q96GD3, Q9JMD1, Q9JMD2, Q9UHJ3, Q9UN30, Q9UQR0, Q9VHA0, Q9VK33, Q29L50, Q4V7W5, Q8IXK0, Q9QWH1, A2A5N8, B1B1A0, D3YUG0, D3YXK1, D3ZWK4, E1C2V1, I3L5V6, P39769, Q1RNF8, Q5VXD3, Q6SPE9

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 48 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
SUMOylation of DNA methylation proteins6122.1×1e-09
RUNX1 interacts with co-factors whose precise effect on RUNX1 targets is not known763.8×1e-09
SUMOylation of transcription cofactors751.5×3e-09
SUMOylation of RNA binding proteins750.5×3e-09
Regulation of PTEN gene transcription737.9×2e-08
SUMOylation of chromatin organization proteins733.6×4e-08
SUMOylation of DNA damage response and repair proteins731.1×6e-08
Differentiation of naive CD4+ T cells to T helper 2 cells (Th2 cells)731.1×6e-08

Disease & clinical

Clinical variants and AI predictions

ClinVar

45 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance6
Likely benign2
Benign1

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
2424639NC_000023.10:g.(?17393881)(18525300_?)delPathogenic

SpliceAI

1090 predictions. Top by Δscore:

VariantEffectΔscore
X:17737663:GC:Gdonor_gain1.0000
X:17737677:A:AGdonor_gain1.0000
X:17745540:G:GGdonor_gain1.0000
X:17749394:TAATA:Tacceptor_loss1.0000
X:17749397:TAGGT:Tacceptor_loss1.0000
X:17749398:A:AGacceptor_gain1.0000
X:17749398:A:Cacceptor_loss1.0000
X:17749399:G:Aacceptor_loss1.0000
X:17749399:G:GGacceptor_gain1.0000
X:17749502:CAT:Cdonor_gain1.0000
X:17749503:ATGTA:Adonor_loss1.0000
X:17749504:TG:Tdonor_loss1.0000
X:17749505:G:GAdonor_loss1.0000
X:17749506:TAAGT:Tdonor_loss1.0000
X:17749883:T:TAacceptor_gain1.0000
X:17749885:T:TAacceptor_gain1.0000
X:17749890:TCA:Tacceptor_loss1.0000
X:17749891:CAG:Cacceptor_loss1.0000
X:17749892:A:AGacceptor_gain1.0000
X:17749892:A:Tacceptor_loss1.0000
X:17749893:G:Aacceptor_loss1.0000
X:17749893:G:GAacceptor_gain1.0000
X:17749893:GA:Gacceptor_gain1.0000
X:17749893:GAA:Gacceptor_gain1.0000
X:17749893:GAAGC:Gacceptor_gain1.0000
X:17750289:TTCAG:Tdonor_loss1.0000
X:17750290:TCAGG:Tdonor_loss1.0000
X:17750291:CAGG:Cdonor_loss1.0000
X:17750292:AGGTA:Adonor_loss1.0000
X:17750293:GG:Gdonor_loss1.0000

AlphaMissense

2154 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
X:17751883:T:AW258R0.997
X:17751883:T:CW258R0.997
X:17753267:G:TG289W0.997
X:17753268:G:TG289V0.997
X:17753268:G:AG289E0.996
X:17751885:G:CW258C0.995
X:17751885:G:TW258C0.995
X:17751908:T:CF266S0.995
X:17753277:T:CL292P0.995
X:17751907:T:CF266L0.993
X:17751909:T:AF266L0.993
X:17751909:T:GF266L0.993
X:17753364:T:CL321P0.993
X:17751956:T:CF282S0.992
X:17753343:T:CL314P0.991
X:17753265:A:TD288V0.990
X:17753267:G:AG289R0.990
X:17753267:G:CG289R0.990
X:17753286:T:CL295P0.990
X:17753341:G:CK313N0.990
X:17753341:G:TK313N0.990
X:17751911:T:CL267P0.988
X:17753277:T:AL292Q0.988
X:17751884:G:CW258S0.987
X:17753339:A:GK313E0.987
X:17753266:C:AD288E0.985
X:17753266:C:GD288E0.985
X:17753277:T:GL292R0.985
X:17753283:T:CL294P0.985
X:17753286:T:AL295H0.985

dbSNP variants (sampled 300 via entrez): RS1000006818 (X:17740475 A>G), RS1000071516 (X:17738518 C>T), RS1000122345 (X:17751614 A>G), RS1000474142 (X:17748004 A>G,T), RS1000547783 (X:17740829 G>A,T), RS1000593221 (X:17736337 A>AC), RS1000608083 (X:17748486 A>G), RS1000705215 (X:17748919 A>G), RS1001144258 (X:17752889 A>G), RS1001279931 (X:17753604 A>G), RS1001380220 (X:17743165 C>T), RS1001817482 (X:17742963 G>A), RS1001895109 (X:17742762 G>A), RS1002081433 (X:17742565 T>G), RS1002224132 (X:17754960 T>G)

Disease associations

OMIM: gene MIM:300227 | disease phenotypes: MIM:300672

GenCC curated gene-disease

Mondo (1): developmental and epileptic encephalopathy, 2 (MONDO:0010396)

Orphanet (3): Early infantile developmental and epileptic encephalopathy (Orphanet:1934), West syndrome (Orphanet:3451), CDKL5-deficiency disorder (Orphanet:505652)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 associations (top):

StudyTraitp-value
GCST009391_1520Metabolite levels2.000000e-06

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0010418triacylglycerol 52:6 measurement

MeSH disease descriptors (1)

DescriptorNameTree numbers
C564064CDKL5 deficiency disorder (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

53 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Aciddecreases expression, increases expression, affects expression, increases methylation, affects cotreatment10
trichostatin Aincreases expression, affects cotreatment4
belinostatincreases expression, affects cotreatment2
Panobinostataffects cotreatment, increases expression2
Air Pollutantsincreases abundance, increases expression, decreases expression2
Tobacco Smoke Pollutionincreases expression, affects expression2
bisphenol Faffects cotreatment, decreases expression1
dicrotophosdecreases expression1
methylmercuric chloridedecreases expression1
sodium arseniteincreases expression1
potassium chromate(VI)affects cotreatment, increases expression1
cupric oxideincreases expression1
epigallocatechin gallateaffects cotreatment, increases expression1
avobenzoneincreases expression1
di-n-butylphosphoric acidaffects expression1
perfluorooctane sulfonic aciddecreases expression1
CGP 52608affects binding, increases reaction1
entinostatincreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, increases expression1
dorsomorphinincreases expression, affects cotreatment1
(4-amino-1,4-dihydro-3-(2-pyridyl)-5-thioxo-1,2,4-triazole)copper(II)increases expression1
jinfukangdecreases expression1
MT19c compoundincreases expression1
PCI 5002affects cotreatment, increases expression1
Sunitinibdecreases expression1
Vorinostatincreases expression1
Azathioprinedecreases expression1
Benzo(a)pyreneaffects methylation, increases methylation1
Coaldecreases expression, increases abundance1
Dexamethasoneaffects cotreatment, decreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.