ZSCAN5A

gene
On this page

Also known as MGC4161

Summary

ZSCAN5A (zinc finger and SCAN domain containing 5A, HGNC:23710) is a protein-coding gene on chromosome 19q13.43, encoding Zinc finger and SCAN domain-containing protein 5A (Q9BUG6). May be involved in transcriptional regulation.

Enables sequence-specific double-stranded DNA binding activity. Predicted to be involved in regulation of transcription by RNA polymerase II. Predicted to be located in nucleus.

Source: NCBI Gene 79149 — RefSeq curated summary.

At a glance

  • GWAS associations: 4
  • Clinical variants (ClinVar): 106 total
  • MANE Select transcript: NM_001322064

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:23710
Approved symbolZSCAN5A
Namezinc finger and SCAN domain containing 5A
Location19q13.43
Locus typegene with protein product
StatusApproved
AliasesMGC4161
Ensembl geneENSG00000131848
Ensembl biotypeprotein_coding
OMIM620916
Entrez79149

Gene structure

Transcript identifiers

Ensembl transcripts: 37 — 29 protein_coding, 7 protein_coding_CDS_not_defined, 1 retained_intron

ENST00000391713, ENST00000585670, ENST00000586031, ENST00000587075, ENST00000587340, ENST00000587492, ENST00000587614, ENST00000588442, ENST00000588955, ENST00000589279, ENST00000590675, ENST00000591078, ENST00000592101, ENST00000592355, ENST00000592509, ENST00000592679, ENST00000593106, ENST00000683990, ENST00000891612, ENST00000891613, ENST00000891614, ENST00000891615, ENST00000891616, ENST00000891617, ENST00000891618, ENST00000928141, ENST00000928142, ENST00000928143, ENST00000928144, ENST00000928145, ENST00000948522, ENST00000948523, ENST00000948524, ENST00000948525, ENST00000948526, ENST00000948527, ENST00000948528

RefSeq mRNA: 23 — MANE Select: NM_001322064 NM_001322061, NM_001322062, NM_001322064, NM_001322065, NM_001322066, NM_001322067, NM_001322068, NM_001322070, NM_001322072, NM_001322073, NM_001322074, NM_001322075, NM_001322076, NM_001322077, NM_001322078, NM_001387853, NM_001387854, NM_001387855, NM_001387856, NM_001387857, NM_001387858, NM_001387859, NM_024303

CCDS: CCDS12941, CCDS82403

Canonical transcript exons

ENST00000683990 — 6 exons

ExonStartEnd
ENSE000024438335622466356225173
ENSE000024881285622259156222741
ENSE000029665215631468156314839
ENSE000035390745622363156223834
ENSE000036908015631328356313384
ENSE000039217975622130356222326

Expression profiles

Bgee: expression breadth ubiquitous, 186 present calls, max score 89.69.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 5.8159 / max 54.1666, expressed in 1750 samples.

FANTOM5 promoters (5 alternative TSS)

Promoter IDTPM avgSamples expressed
1828775.59641750
1828760.083923
1828740.063320
1828750.051721
1828730.020612

Top tissues by expression

276 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099189.69gold quality
right testisUBERON:000453489.19gold quality
spermCL:000001988.55silver quality
left testisUBERON:000453388.19gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047387.63gold quality
hindlimb stylopod muscleUBERON:000425287.62gold quality
testisUBERON:000047386.40gold quality
apex of heartUBERON:000209886.24gold quality
male germ cellCL:000001586.23silver quality
right uterine tubeUBERON:000130285.94gold quality
right atrium auricular regionUBERON:000663184.65gold quality
heart left ventricleUBERON:000208483.39gold quality
cardiac ventricleUBERON:000208282.70gold quality
cardiac atriumUBERON:000208182.07gold quality
gastrocnemiusUBERON:000138881.98gold quality
nucleus accumbensUBERON:000188281.77gold quality
muscle of legUBERON:000138381.76gold quality
adenohypophysisUBERON:000219681.38gold quality
heartUBERON:000094880.92gold quality
caudate nucleusUBERON:000187380.41gold quality
monocyteCL:000057680.39gold quality
prefrontal cortexUBERON:000045180.27gold quality
mononuclear cellCL:000084280.22gold quality
leukocyteCL:000073880.12gold quality
olfactory segment of nasal mucosaUBERON:000538680.09gold quality
right ovaryUBERON:000211879.93gold quality
putamenUBERON:000187479.89gold quality
right lobe of thyroid glandUBERON:000111979.74gold quality
left ovaryUBERON:000211979.31gold quality
pituitary glandUBERON:000000779.17gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no4.74

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

12 targeting ZSCAN5A, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-570-3P99.9672.414910
HSA-MIR-6825-5P99.9669.813431
HSA-MIR-4778-3P99.9370.401818
HSA-MIR-605-3P99.8869.221833
HSA-MIR-221-5P99.8665.451052
HSA-MIR-807399.8665.211118
HSA-MIR-6715A-3P99.8368.051473
HSA-MIR-119799.7067.751027
HSA-MIR-4666B99.6468.691282
HSA-MIR-3129-3P97.8567.631246
HSA-MIR-5583-5P97.8567.611243
HSA-MIR-59296.5967.59817

Cross-species orthologs

11 orthologs

OrganismSymbolGene ID
danio_rerioplagxENSDARG00000036855
danio_rerioovol1aENSDARG00000076472
danio_rerioplagl2ENSDARG00000076657
danio_rerioovol1bENSDARG00000078256
mus_musculusZscan5bENSMUSG00000058028
rattus_norvegicusZscan5bENSRNOG00000049014
drosophila_melanogasterhbFBGN0001180
drosophila_melanogasterCG12391FBGN0033581
caenorhabditis_elegansWBGENE00001824
caenorhabditis_elegansWBGENE00003033
caenorhabditis_elegansWBGENE00012385

Paralogs (29): ZNF446 (ENSG00000083838), REST (ENSG00000084093), ZNF174 (ENSG00000103343), OVOL3 (ENSG00000105261), PLAGL1 (ENSG00000118495), ZSCAN18 (ENSG00000121413), ZNF576 (ENSG00000124444), OVOL2 (ENSG00000125850), PLAGL2 (ENSG00000126003), ZSCAN29 (ENSG00000140265), ZSCAN32 (ENSG00000140987), ZSCAN1 (ENSG00000152467), ZNF18 (ENSG00000154957), ZKSCAN2 (ENSG00000155592), ZNF496 (ENSG00000162714), ZNF202 (ENSG00000166261), ZNF641 (ENSG00000167528), ZNF444 (ENSG00000167685), SCAND1 (ENSG00000171222), ZNF274 (ENSG00000171606), ZNF131 (ENSG00000172262), OVOL1 (ENSG00000172818), ZNF518A (ENSG00000177853), ZNF518B (ENSG00000178163), PLAG1 (ENSG00000181690), ZSCAN5B (ENSG00000197213), ZNF770 (ENSG00000198146), PEG3 (ENSG00000198300), ZSCAN5C (ENSG00000204532)

Protein

Protein identifiers

Zinc finger and SCAN domain-containing protein 5AQ9BUG6 (reviewed: Q9BUG6)

Alternative names: Zinc finger protein 495

All UniProt accessions (8): A0A0C4DGQ1, Q9BUG6, K7EIX9, K7EMC4, K7EN85, K7EQW6, K7ESE4, K7ESN6

UniProt curated annotations — full annotation on UniProt →

Function. May be involved in transcriptional regulation.

Subcellular location. Nucleus.

Isoforms (2)

UniProt IDNamesCanonical?
Q9BUG6-11yes
Q9BUG6-22

RefSeq proteins (23): NP_001308990, NP_001308991, NP_001308993, NP_001308994, NP_001308995, NP_001308996, NP_001308997, NP_001308999, NP_001309001, NP_001309002, NP_001309003, NP_001309004, NP_001309005, NP_001309006, NP_001309007, NP_001374782, NP_001374783, NP_001374784, NP_001374785, NP_001374786, NP_001374787, NP_001374788, NP_077279 (=MANE)

Domains & families (InterPro)

IDNameType
IPR003309SCAN_domDomain
IPR013087Znf_C2H2_typeDomain
IPR036236Znf_C2H2_sfHomologous_superfamily
IPR038269SCAN_sfHomologous_superfamily
IPR050331Zinc_finger_PRDM4/PRDM1/PRDM14Family

Pfam: PF00096, PF02023, PF12874

UniProt features (20 total): zinc finger region 5, compositionally biased region 3, sequence conflict 3, splice variant 2, sequence variant 2, region of interest 2, chain 1, domain 1, cross-link 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9BUG6-F158.500.14

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (1): 159

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 55 (showing top): GOMF_SEQUENCE_SPECIFIC_DNA_BINDING, KRIGE_RESPONSE_TO_TOSEDOSTAT_6HR_DN, WHITFIELD_CELL_CYCLE_M_G1, ZWANG_EGF_INTERVAL_UP, NOTCH_DN.V1_DN, GOMF_TRANSCRIPTION_REGULATOR_ACTIVITY, CEBPZ_TARGET_GENES, DLX6_TARGET_GENES, ELF2_TARGET_GENES, NAB2_TARGET_GENES, ZFP91_TARGET_GENES, ZNF33A_TARGET_GENES, ZNF528_TARGET_GENES, ZNF592_TARGET_GENES, ZSCAN2_TARGET_GENES

GO Biological Process (1): regulation of transcription by RNA polymerase II (GO:0006357)

GO Molecular Function (7): RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), zinc ion binding (GO:0008270), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677), protein binding (GO:0005515), metal ion binding (GO:0046872)

GO Cellular Component (1): nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
regulation of DNA-templated transcription1
transcription by RNA polymerase II1
cis-regulatory region sequence-specific DNA binding1
chromatin1
DNA-binding transcription factor activity1
regulation of transcription by RNA polymerase II1
transition metal ion binding1
double-stranded DNA binding1
sequence-specific DNA binding1
nucleic acid binding1
binding1
cation binding1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

494 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
ZSCAN5AEOLA2Q96DE9540
ZSCAN5AKHDC1Q4VXA5433
ZSCAN5ATRIM43Q96BQ3405
ZSCAN5AHHIPL2Q6UWX4379
ZSCAN5ARFPL4AA6NLU0371
ZSCAN5APRAMEF10O60809356
ZSCAN5ADMRTB1Q96MA1331
ZSCAN5AGPR45Q9Y5Y3322
ZSCAN5ARUFY2Q8WXA3314
ZSCAN5AC21orf91Q9NYK6312
ZSCAN5ACCSAPQ6IQ19311
ZSCAN5AMGAT4CQ9UBM8311
ZSCAN5AMALRD1Q5VYJ5310
ZSCAN5APRAMEF11O60813302
ZSCAN5AHMGN3Q15651297

IntAct

19 interactions, top by confidence:

ABTypeScore
DYNLL2BLTP3Bpsi-mi:“MI:0914”(association)0.640
NFU1ZSCAN5Apsi-mi:“MI:0915”(physical association)0.560
CIB3ZSCAN5Apsi-mi:“MI:0915”(physical association)0.560
ZSCAN5ANFU1psi-mi:“MI:0915”(physical association)0.560
CYSRT1ZSCAN5Apsi-mi:“MI:0915”(physical association)0.560
ZSCAN5AKDM1Apsi-mi:“MI:0914”(association)0.530
ZMYM4ILVBLpsi-mi:“MI:0914”(association)0.530
ZSCAN5Apsi-mi:“MI:0915”(physical association)0.400
ZSCAN5AMRPS12psi-mi:“MI:0914”(association)0.350
ZSCAN5AHDAC3psi-mi:“MI:0914”(association)0.350
ZSCAN5AGAPDHSpsi-mi:“MI:0914”(association)0.350
ZMYM2ZBTB5psi-mi:“MI:2364”(proximity)0.270
ZSCAN5ACYSRT1psi-mi:“MI:0915”(physical association)0.000

BioGRID (60): EPHB4 (Affinity Capture-MS), HMGA1 (Affinity Capture-MS), MRPS12 (Affinity Capture-MS), ZMYM2 (Affinity Capture-MS), ZMYM4 (Affinity Capture-MS), IPO8 (Affinity Capture-MS), ZNF616 (Affinity Capture-MS), CDYL2 (Affinity Capture-MS), RCOR2 (Affinity Capture-MS), IPO8 (Affinity Capture-MS), ZMYM2 (Affinity Capture-MS), CSNK2A2 (Affinity Capture-MS), SCAND1 (Affinity Capture-MS), RCOR3 (Affinity Capture-MS), CSNK2A1 (Affinity Capture-MS)

ESM2 similar proteins: A0A1D5NS60, A0JN76, A1YFX5, A2T7G6, A6NJL1, D2HQI1, F1MJR8, O14901, P0CG00, P10754, P22227, P98182, Q0IJ29, Q1L8W0, Q3SWU4, Q5DW34, Q5EAC5, Q5EXX3, Q5RHB5, Q5SXI5, Q5T619, Q66H04, Q6NRM8, Q6NV66, Q6ZSB9, Q7M6U3, Q7TS63, Q7TSH3, Q7ZWZ4, Q801P1, Q86VK4, Q8BKX7, Q8BXX2, Q8NAM6, Q8NAP3, Q8NCP5, Q8R0A2, Q91VW9, Q96IT1, Q96N77

Diamond homologs: A1YEP8, A1YFW2, A1YFX5, A1YG60, A1YGJ4, A2T6E3, A2T6V8, A2T736, A2T7D7, A2T7F4, A2T7G6, A6QNZ0, A7KBS4, D2HQI1, O14978, O15535, O43309, O60304, P0CG00, P10073, P17028, P49910, Q15697, Q15776, Q16670, Q1LZ87, Q3MJ62, Q3URS2, Q4R8H9, Q5RAE6, Q5RJ54, Q5SXI5, Q6NSZ9, Q6R2W3, Q80VJ6, Q86W11, Q8BGS3, Q8CF60, Q8IWY8, Q8N0Y2

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

106 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance90
Likely benign10
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

2212 predictions. Top by Δscore:

VariantEffectΔscore
19:56222324:CTG:Cacceptor_gain1.0000
19:56222327:C:CCacceptor_gain1.0000
19:56222586:CTCA:Cdonor_gain1.0000
19:56222589:A:ACdonor_gain1.0000
19:56222590:C:CTdonor_gain1.0000
19:56222590:CTGGG:Cdonor_gain1.0000
19:56222642:C:Adonor_gain1.0000
19:56222737:TCTCC:Tacceptor_gain1.0000
19:56222738:CTCC:Cacceptor_gain1.0000
19:56222738:CTCCC:Cacceptor_gain1.0000
19:56222739:TCC:Tacceptor_gain1.0000
19:56222739:TCCC:Tacceptor_loss1.0000
19:56222739:TCCCT:Tacceptor_gain1.0000
19:56222740:CC:Cacceptor_gain1.0000
19:56222740:CCC:Cacceptor_gain1.0000
19:56222741:CC:Cacceptor_gain1.0000
19:56222742:C:CCacceptor_gain1.0000
19:56222742:C:Tacceptor_gain1.0000
19:56223629:A:ACdonor_gain1.0000
19:56223629:ACCTG:Adonor_gain1.0000
19:56223630:C:CGdonor_loss1.0000
19:56223630:C:CTdonor_gain1.0000
19:56223630:CCTG:Cdonor_gain1.0000
19:56223630:CCTGC:Cdonor_gain1.0000
19:56223830:ACAGA:Aacceptor_gain1.0000
19:56223831:CAGA:Cacceptor_gain1.0000
19:56223831:CAGAC:Cacceptor_gain1.0000
19:56223832:AGA:Aacceptor_gain1.0000
19:56223833:GA:Gacceptor_gain1.0000
19:56223834:ACTG:Aacceptor_loss1.0000

AlphaMissense

0 scored. Top likely-pathogenic:

dbSNP variants (sampled 300 via entrez): RS1000019743 (19:56302105 A>C), RS1000031411 (19:56253687 G>A), RS1000046315 (19:56259761 T>C), RS1000055361 (19:56255462 C>T), RS1000059660 (19:56337355 G>A), RS1000077553 (19:56260029 A>G), RS1000087570 (19:56298044 G>A), RS1000132283 (19:56245640 G>A,C,T), RS1000160610 (19:56296174 G>A), RS1000163622 (19:56258642 C>A), RS1000187378 (19:56225677 A>C), RS1000207621 (19:56240152 ACT>A), RS1000253258 (19:56331524 T>C), RS1000256183 (19:56265313 A>G), RS1000258949 (19:56258317 G>A)

Disease associations

OMIM: gene MIM:620916 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

4 associations (top):

StudyTraitp-value
GCST003253_2Microalbuminuria5.000000e-06
GCST006464_11Endometrial cancer3.000000e-07
GCST90002393_675Monocyte count1.000000e-12
GCST90002394_569Monocyte percentage of white cells6.000000e-12

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0005091monocyte count
EFO:0007989monocyte percentage of leukocytes

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

36 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation2
FR900359increases phosphorylation1
triphenyl phosphateaffects expression1
bisphenol Aincreases expression1
geraniolincreases expression1
beta-lapachonedecreases expression1
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
benzo(e)pyreneincreases methylation1
ferrous chloridedecreases expression1
aflatoxin B2increases methylation1
CGP 52608affects binding, increases reaction1
ICG 001decreases expression1
abrinedecreases expression1
jinfukangaffects cotreatment, decreases expression1
PCI 5002affects cotreatment, increases expression1
Sunitinibdecreases expression1
Vorinostatincreases expression1
Cadmiumincreases abundance, increases expression1
Caffeinedecreases phosphorylation1
Carbamazepineaffects expression1
Cisplatinaffects cotreatment, decreases expression1
Formaldehydedecreases expression1
Hydrogen Peroxideaffects expression1
Methapyrileneincreases methylation1
Smokedecreases expression1
Testosteronedecreases expression1
Tobacco Smoke Pollutiondecreases expression1
Tretinoindecreases expression1
Zincaffects cotreatment, increases expression1
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxidedecreases expression1

Cellosaurus cell lines

4 cell lines: 3 embryonic stem cell, 1 transformed cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A8L2SEES3-1V human ZSCAN5A, clone1Embryonic stem cellMale
CVCL_A8L3SEES3-1V human ZSCAN5A, clone2Embryonic stem cellMale
CVCL_A8L4SEES3-1V human ZSCAN5A, clone3Embryonic stem cellMale
CVCL_HD50HEK293 eGFP-ZSCAN5ATransformed cell lineFemale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): endometrial carcinoma