C5orf52

gene
On this page

Summary

C5orf52 (chromosome 5 open reading frame 52, HGNC:35121) is a protein-coding gene on chromosome 5q33.3, encoding Uncharacterized protein C5orf52 (A6NGY3).

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 5 total — 1 pathogenic
  • MANE Select transcript: NM_001145132

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:35121
Approved symbolC5orf52
Namechromosome 5 open reading frame 52
Location5q33.3
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000187658
Ensembl biotypeprotein_coding
Entrez100190949

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000409999

RefSeq mRNA: 1 — MANE Select: NM_001145132 NM_001145132

CCDS: CCDS47329

Canonical transcript exons

ENST00000409999 — 3 exons

ExonStartEnd
ENSE00001365239157675092157675200
ENSE00001577869157679841157680158
ENSE00001580984157671533157671826

Expression profiles

Bgee: expression breadth broad, 44 present calls, max score 89.54.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.1097 / max 83.9018, expressed in 10 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
598340.06625
598360.02434
598330.01533
598350.00392

Top tissues by expression

107 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
left testisUBERON:000453389.54gold quality
right testisUBERON:000453489.48gold quality
testisUBERON:000047388.80gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047384.55gold quality
mucosa of transverse colonUBERON:000499171.56gold quality
transverse colonUBERON:000115760.80gold quality
rectumUBERON:000105260.47gold quality
duodenumUBERON:000211457.43gold quality
right uterine tubeUBERON:000130250.82gold quality
colonic epitheliumUBERON:000039749.24silver quality
small intestineUBERON:000210847.13gold quality
small intestine Peyer’s patchUBERON:000345447.00gold quality
intestineUBERON:000016046.40gold quality
colonUBERON:000115546.33gold quality
granulocyteCL:000009442.64silver quality
fallopian tubeUBERON:000388942.58gold quality
prefrontal cortexUBERON:000045137.29gold quality
ventricular zoneUBERON:000305336.48gold quality
cortical plateUBERON:000534336.47gold quality
bone marrow cellCL:000209236.16gold quality
hypothalamusUBERON:000189835.96gold quality
ganglionic eminenceUBERON:000402335.49gold quality
frontal cortexUBERON:000187034.07silver quality
skeletal muscle tissueUBERON:000113433.38gold quality
metanephros cortexUBERON:001053332.66gold quality
anterior cingulate cortexUBERON:000983532.25gold quality
hindlimb stylopod muscleUBERON:000425232.15gold quality
cerebral cortexUBERON:000095632.03silver quality
liverUBERON:000210731.99gold quality
bone marrowUBERON:000237131.74gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no1.14

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

5 targeting C5orf52, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-139-5P99.8069.501399
HSA-MIR-486-3P99.5166.821901
HSA-MIR-4652-3P99.3370.022742
HSA-MIR-3692-5P99.2967.041421
HSA-MIR-6739-3P99.2268.841843

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculus4921536K21RikENSMUSG00000020434
rattus_norvegicusC14h5orf52ENSRNOG00000004112

Protein

Protein identifiers

Uncharacterized protein C5orf52A6NGY3 (reviewed: A6NGY3)

All UniProt accessions (1): A6NGY3

RefSeq proteins (1): NP_001138604* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR038935C5orf52Family

Pfam: PF17666

UniProt features (5 total): compositionally biased region 3, chain 1, region of interest 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-A6NGY3-F175.490.36

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 24 (showing top): GSE13522_CTRL_VS_T_CRUZI_Y_STRAIN_INF_SKIN_129_MOUSE_DN, ATF5_TARGET_GENES, DMRT1_TARGET_GENES, LMX1B_TARGET_GENES, ZFHX3_TARGET_GENES, ZNF407_TARGET_GENES, GSE13493_CD4INTCD8POS_VS_CD8POS_THYMOCYTE_UP, GSE17721_CTRL_VS_LPS_4H_BMDC_DN, chr5q33, GSE17721_PAM3CSK4_VS_GADIQUIMOD_12H_BMDC_DN, DESCARTES_FETAL_INTESTINE_INTESTINAL_EPITHELIAL_CELLS, GSE17721_LPS_VS_GARDIQUIMOD_12H_BMDC_DN, TOX4_TARGET_GENES, LHX2_TARGET_GENES, ZNF260_TARGET_GENES

GO Biological Process (0):

GO Molecular Function (0):

GO Cellular Component (0):

Protein interactions and networks

STRING

132 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
C5orf52CFAP97D1B2RV13610
C5orf52TMEM42Q69YG0541
C5orf52CCDC24Q8N4L8515
C5orf52SPMIP9Q96LM6507
C5orf52TLCD5Q6ZRR5507
C5orf52CCDC116Q8IYX3506
C5orf52KIAA0586Q9BVV6506
C5orf52DNAJC5GQ8N7S2476
C5orf52FBXW12Q6X9E4448
C5orf52DDX59Q5T1V6448
C5orf52TMEM67Q5HYA8419
C5orf52SNX29Q8TEQ0398
C5orf52NIBAN2Q96TA1397
C5orf52KIAA0753Q2KHM9366
C5orf52STK10O94804323

IntAct

0 interactions, top by confidence:

ESM2 similar proteins: A0A1B0GU33, A0A1B0GUV7, A6NGY3, A6ZT44, F5HHT4, G2TRR6, H3BU77, O13894, O54835, O89085, P0C2W9, P0C6G1, P0C6G2, P0C6G3, P0C6G4, P0C6G5, P0C733, P0DON5, P0DON6, P14976, P16722, P27262, P27271, P28979, P29887, P38612, P38839, Q05105, Q08588, Q1KN14, Q1KN19, Q1X6Y3, Q1X6Y4, Q1X6Z0, Q2YDL7, Q5R5R7, Q5ZK14, Q67621, Q6W0C5, Q88891

Diamond homologs: A6NGY3, Q9CR34

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

5 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance1
Likely benign1
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
684880GRCh37/hg19 5q33.3-35.1(chr5:156597181-171166353)x3Pathogenic

SpliceAI

495 predictions. Top by Δscore:

VariantEffectΔscore
5:157679838:AAGGT:Aacceptor_gain1.0000
5:157671876:G:Tdonor_gain0.9900
5:157679835:T:TAacceptor_gain0.9900
5:157679838:A:AGacceptor_gain0.9900
5:157679838:AAG:Aacceptor_gain0.9900
5:157679839:A:AGacceptor_loss0.9900
5:157679839:A:Gacceptor_gain0.9900
5:157671766:G:GTdonor_gain0.9800
5:157679839:AGGTG:Aacceptor_gain0.9800
5:157679840:GGT:Gacceptor_gain0.9800
5:157679839:AGGT:Aacceptor_gain0.9700
5:157679840:G:Aacceptor_gain0.9700
5:157679840:GGTG:Gacceptor_gain0.9700
5:157679840:GGTGA:Gacceptor_gain0.9700
5:157671845:GAGC:Gdonor_gain0.9600
5:157672160:GGGGA:Gdonor_gain0.9600
5:157679840:G:GGacceptor_gain0.9600
5:157671771:C:Adonor_gain0.9400
5:157671823:TCAG:Tdonor_loss0.9400
5:157671824:CAG:Cdonor_loss0.9400
5:157671825:AGG:Adonor_loss0.9400
5:157671826:GG:Gdonor_loss0.9400
5:157671827:G:Adonor_loss0.9400
5:157671828:T:Gdonor_loss0.9400
5:157671941:A:Tdonor_gain0.9400
5:157672257:G:GAdonor_gain0.9300
5:157671819:C:CGdonor_gain0.9200
5:157679839:AG:Aacceptor_gain0.9200
5:157671759:GGC:Gdonor_gain0.9100
5:157671760:GCG:Gdonor_gain0.9100

AlphaMissense

1034 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
5:157679904:T:CF129L0.979
5:157679906:C:AF129L0.979
5:157679906:C:GF129L0.979
5:157679897:A:CK126N0.964
5:157679897:A:TK126N0.964
5:157679893:T:CL125P0.953
5:157679905:T:CF129S0.947
5:157671786:T:CF58L0.943
5:157671788:T:AF58L0.943
5:157671788:T:GF58L0.943
5:157679939:G:CW140C0.933
5:157679939:G:TW140C0.933
5:157679937:T:AW140R0.926
5:157679937:T:CW140R0.926
5:157671822:T:CF70L0.923
5:157671824:C:AF70L0.923
5:157671824:C:GF70L0.923
5:157675166:A:TD96V0.920
5:157679927:G:CK136N0.919
5:157679927:G:TK136N0.919
5:157679896:A:TK126I0.918
5:157675138:A:CS87R0.911
5:157675140:C:AS87R0.911
5:157675140:C:GS87R0.911
5:157679917:A:CQ133P0.907
5:157675145:T:CL89S0.906
5:157675157:T:CI93T0.897
5:157671825:A:CS71R0.892
5:157675092:C:AS71R0.892
5:157675092:C:GS71R0.892

dbSNP variants (sampled 300 via entrez): RS1000146346 (5:157673952 A>G,T), RS1000272919 (5:157672241 C>T), RS1000397292 (5:157677595 G>A,C,T), RS1000874291 (5:157673452 A>G), RS1001005869 (5:157679009 A>C,G), RS1002712321 (5:157670543 G>A), RS1002851424 (5:157675845 A>G), RS1003093027 (5:157678130 C>T), RS1003499353 (5:157676951 C>T), RS1003684615 (5:157671615 A>G), RS1003834596 (5:157676520 A>G,T), RS1004113047 (5:157680002 T>A,C), RS1004360709 (5:157674078 A>G), RS1004674191 (5:157669727 G>C,T), RS1004874379 (5:157676486 A>G)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST011066_3Motor fluctuations in levodopa treated Parkinson’s disease9.000000e-06
GCST011766_9Chronic obstructive pulmonary disease1.000000e-07

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0010747response to levodopa
EFO:0010749motor function measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

2 total (human), top 2 by PubMed support.

ChemicalActions (top 5)PubMed papers
Arsenicaffects methylation1
Benzo(a)pyreneincreases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.