SCML4

gene
On this page

Also known as dJ47M23.1

Summary

SCML4 (Scm polycomb group protein like 4, HGNC:21397) is a protein-coding gene on chromosome 6q21, encoding Sex comb on midleg-like protein 4 (Q8N228). Putative Polycomb group (PcG) protein.

Predicted to enable chromatin binding activity and histone binding activity. Predicted to be involved in negative regulation of DNA-templated transcription. Predicted to be active in nucleus.

Source: NCBI Gene 256380 — RefSeq curated summary.

At a glance

  • GWAS associations: 11
  • Clinical variants (ClinVar): 75 total
  • MANE Select transcript: NM_198081

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:21397
Approved symbolSCML4
NameScm polycomb group protein like 4
Location6q21
Locus typegene with protein product
StatusApproved
AliasesdJ47M23.1
Ensembl geneENSG00000146285
Ensembl biotypeprotein_coding
Entrez256380

Gene structure

Transcript identifiers

Ensembl transcripts: 15 — 9 protein_coding, 4 retained_intron, 2 protein_coding_CDS_not_defined

ENST00000369020, ENST00000369022, ENST00000369025, ENST00000426221, ENST00000440927, ENST00000459992, ENST00000463507, ENST00000473515, ENST00000476798, ENST00000479803, ENST00000909485, ENST00000909486, ENST00000909487, ENST00000909488, ENST00000909489

RefSeq mRNA: 3 — MANE Select: NM_198081 NM_001286408, NM_001286409, NM_198081

CCDS: CCDS5060, CCDS69163, CCDS75500

Canonical transcript exons

ENST00000369020 — 8 exons

ExonStartEnd
ENSE00001448615107772172107772386
ENSE00001448631107707866107708011
ENSE00001894257107702154107705325
ENSE00003475930107746689107746889
ENSE00003493212107720703107720993
ENSE00003633436107749684107749813
ENSE00003786544107744949107745143
ENSE00003844249107824126107824313

Expression profiles

Bgee: expression breadth ubiquitous, 173 present calls, max score 87.43.

FANTOM5 (CAGE): breadth broad, TPM avg 3.6941 / max 269.0466, expressed in 216 samples.

FANTOM5 promoters (10 alternative TSS)

Promoter IDTPM avgSamples expressed
749362.0118164
749390.582660
749330.319564
749290.258472
749370.176368
749350.082345
749380.082353
749400.072928
749320.071422
749340.036522

Top tissues by expression

241 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
granulocyteCL:000009487.43gold quality
bloodUBERON:000017886.95gold quality
jejunal mucosaUBERON:000039983.02gold quality
lymph nodeUBERON:000002982.71gold quality
thymusUBERON:000237082.51gold quality
duodenumUBERON:000211480.57gold quality
vermiform appendixUBERON:000115479.33gold quality
spleenUBERON:000210678.91gold quality
small intestine Peyer’s patchUBERON:000345477.06gold quality
small intestineUBERON:000210876.17gold quality
caecumUBERON:000115375.97gold quality
mucosa of transverse colonUBERON:000499173.59gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047372.37gold quality
bone marrow cellCL:000209272.16gold quality
bone marrowUBERON:000237170.57gold quality
superficial temporal arteryUBERON:000161470.37gold quality
colonic epitheliumUBERON:000039770.12gold quality
tonsilUBERON:000237269.15gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099168.89gold quality
rectumUBERON:000105268.64gold quality
left testisUBERON:000453368.57gold quality
right lobe of liverUBERON:000111468.29gold quality
jejunumUBERON:000211568.19gold quality
gall bladderUBERON:000211067.60gold quality
right testisUBERON:000453467.57gold quality
epithelium of nasopharynxUBERON:000195166.74gold quality
amniotic fluidUBERON:000017366.70gold quality
testisUBERON:000047366.16gold quality
upper lobe of left lungUBERON:000895264.29gold quality
upper lobe of lungUBERON:000894864.08gold quality

Single-cell (SCXA)

Detected in 5 experiment(s), a significant marker in 4.

ExperimentMarker?Max mean expression
E-CURD-46yes29.83
E-CURD-122yes28.82
E-ANND-3yes13.07
E-MTAB-9067yes5.45
E-MTAB-7606no893.91

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

96 targeting SCML4, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3120-5P100.0065.56965
HSA-MIR-513A-5P100.0069.772465
HSA-MIR-4533100.0069.482758
HSA-MIR-513B-5P99.9969.962150
HSA-MIR-607799.9968.042299
HSA-MIR-318599.9968.121959
HSA-MIR-548AW99.9972.573559
HSA-MIR-477599.9875.006394
HSA-MIR-548AN99.9770.912817
HSA-MIR-6888-3P99.9765.951170
HSA-MIR-590-3P99.9674.346478
HSA-MIR-1468-3P99.9672.743797
HSA-MIR-365899.9673.874379
HSA-MIR-185-3P99.9567.011743
HSA-MIR-96-5P99.9572.802140
HSA-MIR-9983-3P99.9471.483631
HSA-MIR-1213399.9271.822006
HSA-MIR-515-5P99.9269.822343
HSA-MIR-519E-5P99.9269.622358
HSA-MIR-1271-5P99.9171.991972
HSA-MIR-430299.8967.941187
HSA-MIR-129-5P99.8870.263273
HSA-MIR-182-5P99.8774.032589
HSA-MIR-391999.8769.452489
HSA-MIR-10395-5P99.8667.35676
HSA-MIR-5003-3P99.8569.292517
HSA-MIR-3663-3P99.8470.39798
HSA-MIR-3180-5P99.8269.122422
HSA-MIR-6875-3P99.8270.262983
HSA-MIR-313399.8170.923506

Literature-anchored findings (GeneRIF, showing 1)

  • SCML4 and THSD7A are identified as novel susceptibility genes for coronary artery disease. (PMID:29472232)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_rerioscml4ENSDARG00000017266
mus_musculusScml4ENSMUSG00000044770
rattus_norvegicusScml4ENSRNOG00000026110
drosophila_melanogasterl(3)mbtFBGN0002441
drosophila_melanogasterSfmbtFBGN0032475
caenorhabditis_eleganslin-61WBGENE00003041
caenorhabditis_elegansmbtr-1WBGENE00021661

Paralogs (18): SCMH1 (ENSG00000010803), MBTD1 (ENSG00000011258), SCML1 (ENSG00000047634), L3MBTL2 (ENSG00000100395), SCML2 (ENSG00000102098), PHC1 (ENSG00000111752), THAP10 (ENSG00000129028), PHC2 (ENSG00000134686), SAMD1 (ENSG00000141858), L3MBTL4 (ENSG00000154655), SFMBT1 (ENSG00000163935), PHC3 (ENSG00000173889), L3MBTL1 (ENSG00000185513), SAMD7 (ENSG00000187033), SAMD11 (ENSG00000187634), SFMBT2 (ENSG00000198879), L3MBTL3 (ENSG00000198945), SAMD13 (ENSG00000203943)

Protein

Protein identifiers

Sex comb on midleg-like protein 4Q8N228 (reviewed: Q8N228)

All UniProt accessions (3): Q8N228, A0A0B4J1S9, Q5T0T5

UniProt curated annotations — full annotation on UniProt →

Function. Putative Polycomb group (PcG) protein. PcG proteins act by forming multiprotein complexes, which are required to maintain the transcriptionally repressive state of homeotic genes throughout development.

Subcellular location. Nucleus.

Similarity. Belongs to the SCM family.

Isoforms (3)

UniProt IDNamesCanonical?
Q8N228-11yes
Q8N228-22
Q8N228-33

RefSeq proteins (3): NP_001273337, NP_001273338, NP_932347* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001660SAMDomain
IPR013761SAM/pointed_sfHomologous_superfamily
IPR021987SLEDDomain
IPR033763SCML2_RBRDomain
IPR038348SLED_sfHomologous_superfamily
IPR047531SAM_Scm-likeDomain
IPR050548PcG_chromatin_remod_factorsFamily

Pfam: PF00536, PF12140, PF17208

UniProt features (13 total): splice variant 4, compositionally biased region 3, modified residue 2, chain 1, domain 1, sequence variant 1, region of interest 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8N228-F169.850.43

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (2): 55, 65

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 189 (showing top): NKX25_02, TGCTGAY_UNKNOWN, GATA1_01, HP1SITEFACTOR_Q6, OCT1_07, ZHOU_INFLAMMATORY_RESPONSE_LIVE_DN, YY1_01, IK3_01, OCT1_B, GOMF_CHROMATIN_BINDING, ZHENG_BOUND_BY_FOXP3, MARSON_BOUND_BY_FOXP3_UNSTIMULATED, ZHENG_FOXP3_TARGETS_IN_T_LYMPHOCYTE_DN, GOMF_HISTONE_BINDING, E4BP4_01

GO Biological Process (1): negative regulation of DNA-templated transcription (GO:0045892)

GO Molecular Function (3): chromatin binding (GO:0003682), histone binding (GO:0042393), protein binding (GO:0005515)

GO Cellular Component (1): nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
binding2
DNA-templated transcription1
regulation of DNA-templated transcription1
negative regulation of RNA biosynthetic process1
protein binding1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

506 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SCML4LRRC61Q9BV99475
SCML4MCCP23508438
SCML4NWD2Q9ULI1428
SCML4GGA3Q9NZ52419
SCML4QRSL1Q9H0R6405
SCML4RTN4IP1Q8WWV3404
SCML4PXYLP1Q8TE99397
SCML4BEND3Q5T5X7395
SCML4TSC22D2O75157376
SCML4ZNF367Q7RTV3372
SCML4AP5M1Q9H0R1371
SCML4NEK10Q6ZWH5371
SCML4ATAD2BQ9ULI0369
SCML4MEX3AA1L020364
SCML4OSTM1Q86WC4362

IntAct

2 interactions, top by confidence:

ABTypeScore
SCML4KDM5Cpsi-mi:“MI:0914”(association)0.350

BioGRID (12): UBR1 (Affinity Capture-MS), FBXO11 (Affinity Capture-MS), KDM5C (Affinity Capture-MS), UBR1 (Affinity Capture-MS), KDM5C (Affinity Capture-MS), FBXO11 (Affinity Capture-MS), SCML4 (Two-hybrid), CLIC3 (Two-hybrid), FBXO11 (Affinity Capture-MS), UBR1 (Affinity Capture-MS), KDM5C (Affinity Capture-MS), CHCHD3 (Cross-Linking-MS (XL-MS))

ESM2 similar proteins: A0A0G2JTY4, A2VD01, A5PMU4, A8E4V2, D2HNW6, E1BEQ5, O54972, O95644, P16236, P59281, P70365, P97305, Q12968, Q13191, Q13469, Q13905, Q15788, Q1LY51, Q2VPU4, Q3LRZ1, Q3TTA7, Q3U182, Q4PJW2, Q4VCS5, Q60591, Q61122, Q66IV1, Q68FF7, Q6DFR2, Q6GQL0, Q6NYU6, Q6ZNC4, Q80TM6, Q80VG1, Q8HWS3, Q8IXK0, Q8IY63, Q8K4S7, Q8N228, Q8VHG2

Diamond homologs: B0FZN7, B0FZN8, B0FZN9, B0FZP0, B0FZP1, B0FZP2, B0FZP3, P78364, Q5DTW2, Q5VUG0, Q64028, Q80VG1, Q8CHP6, Q8K214, Q8N228, Q8NDX5, Q8QHL5, Q96GD3, Q9JMD1, Q9JMD2, Q9UHJ3, Q9UN30, Q9UQR0, Q9VHA0, Q9VK33, Q29L50, Q4V7W5, Q8IXK0, Q9QWH1, A2A5N8, B1B1A0, D3YUG0, D3YXK1, D3ZWK4, E1C2V1, P39769, P59178, P70047, Q05BQ5, Q1JQD9

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

75 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance59
Likely benign1
Benign2

Top pathogenic / likely-pathogenic (0)

SpliceAI

2361 predictions. Top by Δscore:

VariantEffectΔscore
6:107705341:C:CTacceptor_gain1.0000
6:107705342:A:Tacceptor_gain1.0000
6:107707854:CCA:Cdonor_gain1.0000
6:107707864:A:ACdonor_gain1.0000
6:107707865:C:CCdonor_gain1.0000
6:107707865:CGTG:Cdonor_gain1.0000
6:107708007:TGAGG:Tacceptor_gain1.0000
6:107708010:GG:Gacceptor_gain1.0000
6:107708011:GC:Gacceptor_loss1.0000
6:107708012:C:CAacceptor_loss1.0000
6:107708012:C:CCacceptor_gain1.0000
6:107708013:T:Cacceptor_loss1.0000
6:107744943:GCCTA:Gdonor_loss1.0000
6:107744944:CCTA:Cdonor_loss1.0000
6:107744945:CTA:Cdonor_loss1.0000
6:107744946:TA:Tdonor_loss1.0000
6:107744947:ACC:Adonor_loss1.0000
6:107744948:CCT:Cdonor_loss1.0000
6:107744966:T:TAdonor_gain1.0000
6:107745139:CGAGA:Cacceptor_gain1.0000
6:107745143:AC:Aacceptor_loss1.0000
6:107745144:C:CCacceptor_gain1.0000
6:107745144:C:CGacceptor_loss1.0000
6:107746703:C:CTdonor_gain1.0000
6:107746710:AGC:Adonor_gain1.0000
6:107746719:TG:Tdonor_gain1.0000
6:107789432:T:Adonor_gain1.0000
6:107705321:ATCTC:Aacceptor_gain0.9900
6:107705323:CTC:Cacceptor_gain0.9900
6:107705324:TC:Tacceptor_gain0.9900

AlphaMissense

2676 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:107705315:C:TG377D1.000
6:107705240:A:GL402P0.999
6:107705242:T:AK401N0.999
6:107705242:T:GK401N0.999
6:107705246:A:GL400P0.999
6:107705264:A:GL394P0.999
6:107705297:A:GL383P0.999
6:107705306:A:GL380P0.999
6:107705315:C:AG377V0.999
6:107705316:C:AG377C0.999
6:107705321:A:CI375S0.999
6:107705321:A:TI375N0.999
6:107707876:A:GF370S0.999
6:107707949:A:GW346R0.999
6:107707949:A:TW346R0.999
6:107746772:A:TV135D0.999
6:107705240:A:TL402H0.998
6:107705256:C:GG397R0.998
6:107705256:C:TG397R0.998
6:107705306:A:TL380Q0.998
6:107705309:G:TA379D0.998
6:107705316:C:GG377R0.998
6:107705318:T:AD376V0.998
6:107705321:A:GI375T0.998
6:107707923:A:CF354L0.998
6:107707923:A:TF354L0.998
6:107707925:A:GF354L0.998
6:107707947:C:AW346C0.998
6:107707947:C:GW346C0.998
6:107705219:A:GL409P0.997

dbSNP variants (sampled 300 via entrez): RS1000016891 (6:107724991 T>C), RS1000023065 (6:107745769 C>T), RS1000037291 (6:107767252 G>A), RS1000044536 (6:107772651 G>A), RS1000047788 (6:107836528 C>A,T), RS1000062463 (6:107830355 C>T), RS1000137320 (6:107817906 T>G), RS1000155766 (6:107702529 G>A), RS1000160889 (6:107750943 A>C), RS1000177747 (6:107708365 G>A,C), RS1000196184 (6:107733662 C>T), RS1000197444 (6:107817302 A>G), RS1000212740 (6:107751206 G>A,T), RS1000236297 (6:107792028 G>A), RS1000245776 (6:107823818 T>C)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

11 associations (top):

StudyTraitp-value
GCST004627_31Lymphocyte count3.000000e-09
GCST004632_52Lymphocyte percentage of white cells2.000000e-10
GCST004633_122Neutrophil percentage of white cells4.000000e-09
GCST005094_8Iris color (L* coordinate)7.000000e-06
GCST005582_2Coronary artery disease4.000000e-11
GCST90002388_1Lymphocyte count3.000000e-32
GCST90002388_100Lymphocyte count2.000000e-09
GCST90002389_330Lymphocyte percentage of white cells1.000000e-24
GCST90002389_331Lymphocyte percentage of white cells1.000000e-10
GCST90002399_301Neutrophil percentage of white cells3.000000e-13
GCST90002399_302Neutrophil percentage of white cells4.000000e-09

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0004587lymphocyte count
EFO:0007993lymphocyte percentage of leukocytes
EFO:0007990neutrophil percentage of leukocytes
EFO:0003949eye color

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

18 total (human), top 18 by PubMed support.

ChemicalActions (top 5)PubMed papers
(+)-JQ1 compounddecreases expression2
Benzo(a)pyreneaffects methylation, decreases expression, decreases methylation2
Aflatoxin B1decreases methylation, increases methylation2
triphenyl phosphateaffects expression1
benzo(e)pyreneincreases methylation1
cylindrospermopsindecreases expression1
entinostatdecreases expression1
Vorinostatdecreases expression1
Acetaminophenincreases expression1
Leadincreases expression1
Methapyrileneincreases methylation1
Nickelincreases expression1
Rotenonedecreases expression1
Tretinoindecreases expression1
1-Methyl-4-phenylpyridiniumdecreases expression1
Cyclosporinedecreases expression1
Antirheumatic Agentsdecreases expression1
Okadaic Aciddecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.