C6orf52

gene
On this page

Summary

C6orf52 (chromosome 6 open reading frame 52, HGNC:20881) is a protein-coding gene on chromosome 6p24.2, encoding Putative uncharacterized protein C6orf52 (Q5T4I8).

At a glance

  • Clinical variants (ClinVar): 18 total — 1 pathogenic
  • MANE Select transcript: NM_001145020

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:20881
Approved symbolC6orf52
Namechromosome 6 open reading frame 52
Location6p24.2
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000137434
Ensembl biotypeprotein_coding
OMIM621281
Entrez347744

Gene structure

Transcript identifiers

Ensembl transcripts: 7 — 6 protein_coding, 1 protein_coding_CDS_not_defined

ENST00000259983, ENST00000379586, ENST00000426700, ENST00000460742, ENST00000467832, ENST00000503680, ENST00000896013

RefSeq mRNA: 4 — MANE Select: NM_001145020 NM_001145020, NM_001354357, NM_001388310, NM_001388311

CCDS: CCDS47371, CCDS87364

Canonical transcript exons

ENST00000259983 — 5 exons

ExonStartEnd
ENSE000017344821069449410694616
ENSE000034847521068748010687561
ENSE000034910861067142210671598
ENSE000035652281068318710683232
ENSE000036197611068696610687164

Expression profiles

Bgee: expression breadth ubiquitous, 132 present calls, max score 90.60.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 2.5596 / max 54.1666, expressed in 1090 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
716942.55961090

Top tissues by expression

133 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047390.60gold quality
right testisUBERON:000453483.37gold quality
testisUBERON:000047382.89gold quality
left testisUBERON:000453382.60gold quality
right uterine tubeUBERON:000130281.41gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099180.63gold quality
olfactory segment of nasal mucosaUBERON:000538680.37gold quality
duodenumUBERON:000211477.61gold quality
islet of LangerhansUBERON:000000675.72gold quality
ganglionic eminenceUBERON:000402372.81gold quality
hypothalamusUBERON:000189872.70gold quality
amygdalaUBERON:000187671.56gold quality
substantia nigraUBERON:000203871.53gold quality
temporal lobeUBERON:000187171.44gold quality
C1 segment of cervical spinal cordUBERON:000646971.40gold quality
Ammon’s hornUBERON:000195470.60gold quality
prostate glandUBERON:000236770.07gold quality
pancreasUBERON:000126469.66gold quality
right adrenal gland cortexUBERON:003582769.52gold quality
stromal cell of endometriumCL:000225569.27gold quality
anterior cingulate cortexUBERON:000983569.18gold quality
prefrontal cortexUBERON:000045169.16gold quality
primary visual cortexUBERON:000243669.10gold quality
right adrenal glandUBERON:000123368.83gold quality
endometriumUBERON:000129568.83gold quality
nucleus accumbensUBERON:000188268.63gold quality
ventricular zoneUBERON:000305368.55gold quality
left adrenal gland cortexUBERON:003582568.48gold quality
cerebral cortexUBERON:000095668.47gold quality
caudate nucleusUBERON:000187368.20gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-GEOD-100618yes22.05
E-ANND-3no2.51

Regulation

Is transcription factor: no

Cross-species orthologs

1 orthologs

OrganismSymbolGene ID
rattus_norvegicusC17h6orf52ENSRNOG00000039379

Protein

Protein identifiers

Putative uncharacterized protein C6orf52Q5T4I8 (reviewed: Q5T4I8)

All UniProt accessions (3): Q5T4I8, D6RAP8, D6RGE8

Isoforms (2)

UniProt IDNamesCanonical?
Q5T4I8-11yes
Q5T4I8-22

RefSeq proteins (4): NP_001138492, NP_001341286, NP_001375239, NP_001375240 (=MANE)

Domains & families (InterPro)

IDNameType
IPR040434TSAP1Family
IPR041085TSAP1_CDomain

Pfam: PF17654

UniProt features (3 total): chain 1, splice variant 1, sequence variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q5T4I8-F158.150.07

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 44 (showing top): GSE45365_CD8A_DC_VS_CD11B_DC_IFNAR_KO_MCMV_INFECTION_DN, GSE45365_NK_CELL_VS_CD8A_DC_MCMV_INFECTION_UP, chr6p24, KOYAMA_SEMA3B_TARGETS_DN, STAMBOLSKY_RESPONSE_TO_VITAMIN_D3_DN, JOHNSTONE_PARVB_TARGETS_3_DN, BARX1_TARGET_GENES, DACH1_TARGET_GENES, DIDO1_TARGET_GENES, E2F2_TARGET_GENES, FEV_TARGET_GENES, IRF5_TARGET_GENES, KLF7_TARGET_GENES, NCOA2_TARGET_GENES, NFE2L1_TARGET_GENES

GO Biological Process (0):

GO Molecular Function (0):

GO Cellular Component (0):

Protein interactions and networks

STRING

1219 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
C6orf52CENATACQ86UT8613
C6orf52RBM44Q6ZP01507
C6orf52IMPG1Q17R60504
C6orf52TOR1AIP2Q8NFQ8484
C6orf52WACQ9BTA9431
C6orf52ASMTLO95671411
C6orf52GABRR1P24046411
C6orf52TMSB15AP0CG34377
C6orf52LCA5Q86VQ0366
C6orf52EEFSECP57772348
C6orf52SEPSECSQ9HD40348
C6orf52AMPD3Q01432346
C6orf52PRPF39Q86UA1340
C6orf52ELOVL4Q9GZR5334
C6orf52MYO6Q9UM54329

IntAct

0 interactions, top by confidence:

ESM2 similar proteins: A0A345BJN6, B3FWS0, C9K7C3, G1XTZ6, G4NB33, I1S490, K0DZ91, K3VDP7, O09102, O15198, O36398, O75603, O93958, P09265, P09286, P0CK38, P13622, P17924, P17926, P21944, P24433, P28937, P28989, P35192, P36280, P68349, P68948, P68949, P70255, P70348, P80074, P9WEJ9, Q00028, Q06658, Q08427, Q08589, Q2U9L6, Q4JQW5, Q4QFD3, Q5T4I8

Diamond homologs: A2VDB3, F4I3B3, O13759, O22173, O57437, O60176, P08199, P09405, P0CB38, P13383, P19338, P32588, P39697, P41891, P70318, Q00539, Q01085, Q0J9Y2, Q0P5L0, Q0WW84, Q10355, Q15427, Q1RMJ7, Q24207, Q4KM14, Q4R4J7, Q4V7Y4, Q503H1, Q5R462, Q5RF26, Q5T4I8, Q64368, Q66J99, Q66KL9, Q6AXT7, Q6AYL5, Q6DRG1, Q6YZW2, Q76CY5, Q804A9

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

18 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance13
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
146906GRCh38/hg38 6p25.2-24.1(chr6:4068792-13267799)x1Pathogenic

SpliceAI

877 predictions. Top by Δscore:

VariantEffectΔscore
6:10671609:C:CTacceptor_gain1.0000
6:10671610:A:Tacceptor_gain1.0000
6:10671608:CCAA:Cacceptor_gain0.9900
6:10671609:C:Tacceptor_gain0.9900
6:10671611:A:ACacceptor_gain0.9900
6:10671596:GATC:Gacceptor_loss0.9800
6:10671597:ATCT:Aacceptor_loss0.9800
6:10671598:TC:Tacceptor_loss0.9800
6:10671599:C:CCacceptor_gain0.9800
6:10671611:A:Cacceptor_gain0.9800
6:10671614:A:Cacceptor_gain0.9800
6:10671620:A:Cacceptor_gain0.9800
6:10694492:AC:Adonor_gain0.9800
6:10694493:CC:Cdonor_gain0.9800
6:10671594:TGGAT:Tacceptor_gain0.9700
6:10671609:CAA:Cacceptor_loss0.9700
6:10671614:A:ACacceptor_gain0.9700
6:10684924:C:CCacceptor_gain0.9700
6:10671596:GAT:Gacceptor_gain0.9600
6:10683229:TTTCC:Tacceptor_loss0.9600
6:10683230:TTCCT:Tacceptor_loss0.9600
6:10683233:C:Aacceptor_loss0.9600
6:10683234:TG:Tacceptor_loss0.9600
6:10683235:G:Cacceptor_loss0.9600
6:10683239:CAAAA:Cacceptor_loss0.9600
6:10684931:C:CTacceptor_gain0.9600
6:10671595:GGAT:Gacceptor_gain0.9500
6:10671618:A:Cacceptor_gain0.9400
6:10683181:TGTTA:Tdonor_loss0.9400
6:10683183:TTA:Tdonor_loss0.9400

AlphaMissense

1014 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:10671507:C:AW136C0.990
6:10671507:C:GW136C0.990
6:10671543:A:CS124R0.987
6:10671543:A:TS124R0.987
6:10671545:T:GS124R0.987
6:10671509:A:GW136R0.986
6:10671509:A:TW136R0.986
6:10671523:A:GL131P0.975
6:10671527:A:GS130P0.972
6:10671555:A:CF120L0.964
6:10671555:A:TF120L0.964
6:10671556:A:GF120S0.964
6:10671557:A:GF120L0.964
6:10671508:C:GW136S0.962
6:10671535:A:GL127P0.961
6:10671523:A:TL131H0.956
6:10671508:C:AW136L0.950
6:10671544:C:AS124I0.949
6:10671509:A:CW136G0.929
6:10671515:A:GC134R0.929
6:10671556:A:CF120C0.920
6:10671513:G:CC134W0.919
6:10671478:A:GI146T0.917
6:10671523:A:CL131R0.906
6:10671564:G:CN117K0.902
6:10671564:G:TN117K0.902
6:10671496:T:AD140V0.901
6:10671497:C:GD140H0.899
6:10671544:C:TS124N0.898
6:10671541:T:AE125V0.893

dbSNP variants (sampled 300 via entrez): RS1000038898 (6:10694166 AG>A), RS1000236696 (6:10684016 GA>G), RS1000252202 (6:10676941 G>A), RS1000403623 (6:10675516 C>A,T), RS1000459014 (6:10688858 C>A,T), RS1000524099 (6:10679630 A>G), RS1000556617 (6:10679983 T>C), RS1000763656 (6:10687299 G>A,T), RS1000828050 (6:10688572 T>C), RS1000972574 (6:10685823 G>C), RS1001034243 (6:10678830 C>T), RS1001319000 (6:10680762 G>A,C), RS1001463349 (6:10695389 A>G), RS1001470653 (6:10674773 T>A), RS1001490726 (6:10677865 T>C)

Disease associations

OMIM: gene MIM:621281 | disease phenotypes: MIM:116700

GenCC curated gene-disease

Mondo (1): cataract 13 with adult I phenotype (MONDO:0007289)

Orphanet (1): Early onset non-syndromic cataract (Orphanet:91492)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

17 total (human), top 17 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects expression, decreases methylation, increases expression4
aristolochic acid Iincreases expression1
sodium arsenatedecreases expression, increases abundance1
Resveratrolaffects cotreatment, decreases expression1
Temozolomidedecreases expression1
Vorinostatincreases expression1
Air Pollutantsincreases abundance, increases expression1
Arsenicdecreases expression, increases abundance1
Formaldehydeincreases expression1
Indomethacinincreases expression1
Plant Extractsaffects cotreatment, decreases expression1
Quercetinincreases expression1
Smokeincreases abundance, increases expression1
Tobacco Smoke Pollutionaffects expression1
Aflatoxin B1increases expression1
Cadmium Chloridedecreases expression1
Acrylamideincreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): cataract 13 with adult I phenotype