C20orf203

gene
On this page

Also known as FLJ33706

Summary

C20orf203 (chromosome 20 open reading frame 203, HGNC:26592) is a protein-coding gene on chromosome 20q11.21, encoding Uncharacterized protein C20orf203 (Q8NBC4).

The protein encoded by this gene is thought to be a human-specific protein. Currently available evidence suggests that orthologous regions in other organisms contain sequence differences that would not support production of a protein product. Genome-wide association studies have suggested the possibility that a SNP in the 3’ UTR, rs17123507, could be associated with nicotine addiction. Expression of this gene may be elevated in some individuals with Alzheimer’s disease.

Source: NCBI Gene 284805 — RefSeq curated summary.

At a glance

  • GWAS associations: 14
  • Clinical variants (ClinVar): 8 total
  • MANE Select transcript: NM_182584

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26592
Approved symbolC20orf203
Namechromosome 20 open reading frame 203
Location20q11.21
Locus typegene with protein product
StatusApproved
AliasesFLJ33706
Ensembl geneENSG00000198547
Ensembl biotypeprotein_coding
Entrez284805

Gene structure

Transcript identifiers

Ensembl transcripts: 2 — 2 protein_coding

ENST00000360785, ENST00000608990

RefSeq mRNA: 1 — MANE Select: NM_182584 NM_182584

CCDS: CCDS13203

Canonical transcript exons

ENST00000608990 — 6 exons

ExonStartEnd
ENSE000015433593264056632640687
ENSE000015433643265173332651981
ENSE000037040073267363232673941
ENSE000037095743264925532650881
ENSE000037096153263162532634270
ENSE000037299103265101832651166

Expression profiles

Bgee: expression breadth ubiquitous, 102 present calls, max score 76.43.

Top tissues by expression

114 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047376.43gold quality
cerebellar hemisphereUBERON:000224560.66gold quality
cerebellar cortexUBERON:000212960.50gold quality
cerebellumUBERON:000203760.34gold quality
right hemisphere of cerebellumUBERON:001489059.80gold quality
sural nerveUBERON:001548858.89silver quality
cortical plateUBERON:000534358.60gold quality
primary visual cortexUBERON:000243656.75gold quality
ventricular zoneUBERON:000305356.37gold quality
right frontal lobeUBERON:000281055.97gold quality
putamenUBERON:000187455.76gold quality
nucleus accumbensUBERON:000188255.23gold quality
caudate nucleusUBERON:000187354.90gold quality
Brodmann (1909) area 9UBERON:001354054.09gold quality
dorsolateral prefrontal cortexUBERON:000983453.79gold quality
frontal cortexUBERON:000187053.77gold quality
brainUBERON:000095553.24gold quality
anterior cingulate cortexUBERON:000983553.24gold quality
hypothalamusUBERON:000189853.07gold quality
cerebral cortexUBERON:000095652.91gold quality
prefrontal cortexUBERON:000045152.53gold quality
ganglionic eminenceUBERON:000402352.13silver quality
superior frontal gyrusUBERON:000266152.10gold quality
uterine cervixUBERON:000000251.74gold quality
left ovaryUBERON:000211951.51gold quality
ovaryUBERON:000099251.07gold quality
temporal lobeUBERON:000187150.66gold quality
amygdalaUBERON:000187650.62gold quality
bone marrow cellCL:000209250.58gold quality
lymph nodeUBERON:000002950.40gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no1.80

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

112 targeting C20orf203, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-340-5P100.0072.504437
HSA-MIR-4283100.0066.422097
HSA-MIR-4481100.0066.421669
HSA-MIR-607799.9968.042299
HSA-MIR-6870-5P99.9968.552115
HSA-MIR-453499.9966.581907
HSA-MIR-4650-5P99.9864.69999
HSA-MIR-4723-5P99.9768.702034
HSA-MIR-569899.9768.492029
HSA-MIR-7111-5P99.9768.482062
HSA-MIR-570-3P99.9672.414910
HSA-MIR-6825-5P99.9669.813431
HSA-MIR-10523-5P99.9169.222038
HSA-MIR-3529-3P99.9073.553045
HSA-MIR-4731-5P99.8967.232537
HSA-MIR-444799.8567.812900
HSA-MIR-6756-5P99.8267.972466
HSA-MIR-313399.8170.923506
HSA-MIR-6842-5P99.8067.541587
HSA-MIR-7110-5P99.8067.841712
HSA-MIR-4668-5P99.7970.583782
HSA-MIR-3150A-3P99.7664.441640
HSA-MIR-6763-5P99.7664.681767
HSA-MIR-431999.7669.832586
HSA-MIR-378G99.7164.901106
HSA-MIR-33A-3P99.7070.273362
HSA-MIR-64699.6867.841645
HSA-MIR-6766-5P99.6867.702325
HSA-MIR-580-3P99.6769.231841
HSA-MIR-7-5P99.6770.531809

Cross-species orthologs

0 orthologs

Protein

Protein identifiers

Uncharacterized protein C20orf203Q8NBC4 (reviewed: Q8NBC4)

All UniProt accessions (1): Q8NBC4

UniProt curated annotations — full annotation on UniProt →

Subcellular location. Cytoplasm.

Tissue specificity. Expressed most abundantly in the brain at protein level. Present in cortex, cerebellum and midbrain. Found in neurons. Elevated expressions detected in Alzheimer brain samples. Also expressed in testis.

Miscellaneous. Originated from non-coding DNA sequences (insertion of repeat elements especially Alu). Seems to exist only in human.

RefSeq proteins (1): NP_872390* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR040965DUF5559Family

Pfam: PF17714

UniProt features (3 total): chain 1, region of interest 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8NBC4-F153.810.00

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 25 (showing top): chr20q11, PEDRIOLI_MIR31_TARGETS_UP, IRF5_TARGET_GENES, MIR570_3P, MIR340_5P, MIR4447, MIR1343_5P, MIR939_5P, MIR514A_3P_MIR514B_3P, MIR6823_3P, MIR9903, MIR2114_3P, GSE13485_CTRL_VS_DAY21_YF17D_VACCINE_PBMC_DN, LIU_TARGETS_OF_VMYB_VS_CMYB_UP, DESCARTES_MAIN_FETAL_PDE1C_ACSM3_POSITIVE_CELLS

GO Biological Process (0):

GO Molecular Function (0):

GO Cellular Component (1): cytoplasm (GO:0005737)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
intracellular anatomical structure1
cellular anatomical structure1

Protein interactions and networks

STRING

44 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
C20orf203ECHDC1Q9NTX5348
C20orf203HNF1AP20823300
C20orf203ANKHQ9HCJ1275
C20orf203MCPH1Q8NEM0272
C20orf203SFTA3P0C7M3245
C20orf203ST6GAL1P15907218
C20orf203ASPMQ8IZT6202
C20orf203CCNT1O60563188
C20orf203SIGLEC11Q96RL6175
C20orf203MYEOVQ96EZ4167
C20orf203GCN1Q92616166
C20orf203CNRIP1Q96F850
C20orf203SPATA19Q7Z5L40
C20orf203CNBD2Q96M200
C20orf203OPN5Q6U7360
C20orf203MROH2BQ7Z7450
C20orf203POTEIP0CG380
C20orf203WDR87Q6ZQQ60
C20orf203TAAR9Q96RI90

IntAct

3 interactions, top by confidence:

ABTypeScore
C20orf203POTEIpsi-mi:“MI:0914”(association)0.530

BioGRID (7): POTEI (Affinity Capture-MS), CNRIP1 (Affinity Capture-MS), POTEI (Affinity Capture-MS), CNRIP1 (Affinity Capture-MS), CNRIP1 (Affinity Capture-MS), POTEI (Affinity Capture-MS), C20orf203 (Cross-Linking-MS (XL-MS))

ESM2 similar proteins: A0A1B0GUT2, A0A3Q1LFG5, A1L4Q6, A2RUQ5, A8MQB3, A8MU10, B1ANY3, C0HM98, H3BQW9, J3KSC0, P0C092, P0DMU3, P0DPA3, P24026, P59020, P59021, P59052, P87743, Q06250, Q0IIN9, Q0VFX4, Q14695, Q4R3X9, Q4VX62, Q52M75, Q5SR53, Q6ZUF6, Q6ZWC4, Q71F78, Q7Z4H9, Q8JMY5, Q8JMZ5, Q8JN06, Q8N2C9, Q8N2X6, Q8N3U1, Q8N9X3, Q8NAA6, Q8NBC4, Q8NDY4

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

8 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance3
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

0 predictions. Top by Δscore:

AlphaMissense

1234 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:32650438:A:CF193L0.935
20:32650438:A:TF193L0.935
20:32650440:A:GF193L0.935
20:32650727:A:GI97T0.909
20:32650573:A:CF148L0.869
20:32650573:A:TF148L0.869
20:32650575:A:GF148L0.869
20:32650675:C:AK114N0.849
20:32650675:C:GK114N0.849
20:32650723:C:AW98C0.845
20:32650723:C:GW98C0.845
20:32650733:T:AE95V0.823
20:32650531:G:CF162L0.816
20:32650531:G:TF162L0.816
20:32650533:A:GF162L0.816
20:32650663:T:AR118S0.804
20:32650663:T:GR118S0.804
20:32650727:A:CI97S0.804
20:32650657:C:AR120S0.801
20:32650657:C:GR120S0.801
20:32651048:A:CF35L0.799
20:32651048:A:TF35L0.799
20:32651050:A:GF35L0.799
20:32650661:T:AD119V0.784
20:32650481:A:GI179T0.764
20:32650662:C:GD119H0.752
20:32650661:T:GD119A0.750
20:32650674:C:AV115L0.746
20:32650674:C:GV115L0.746
20:32650660:A:CD119E0.741

dbSNP variants (sampled 300 via entrez): RS1000185456 (20:32671882 G>A), RS1000438071 (20:32669007 G>T), RS1000460368 (20:32665077 C>A), RS1000515922 (20:32664116 A>C), RS1000519740 (20:32670237 G>C,T), RS1000573376 (20:32670057 C>A,T), RS1000752410 (20:32664064 T>C), RS1000758435 (20:32651159 C>A,G,T), RS1000762946 (20:32658872 C>T), RS1000851781 (20:32651388 G>A,C), RS1000924822 (20:32633928 G>A), RS1001040823 (20:32639128 T>A), RS1001108812 (20:32638163 A>G), RS1001147091 (20:32657968 C>A,T), RS1001152822 (20:32639883 C>T)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

14 associations (top):

StudyTraitp-value
GCST004618_51White blood cell count (basophil)5.000000e-16
GCST004631_16Basophil percentage of white cells9.000000e-17
GCST004634_14Basophil percentage of granulocytes1.000000e-14
GCST005146_20Birth weight8.000000e-11
GCST005976_25White blood cell count (basophil)4.000000e-08
GCST007267_157Systolic blood pressure6.000000e-09
GCST007327_179Smoking status (ever vs never smokers)3.000000e-12
GCST007928_74Medication use (diuretics)9.000000e-09
GCST008839_110Height6.000000e-09
GCST90002379_160Basophil count1.000000e-21
GCST90002380_21Basophil percentage of white cells2.000000e-22
GCST90002396_54Mean reticulocyte volume8.000000e-16
GCST90002397_450Mean spheric corpuscular volume2.000000e-13
GCST90002403_316Red blood cell count1.000000e-13

EFO canonical traits (9, from GWAS)

EFO IDTrait name
EFO:0005090basophil count
EFO:0007992basophil percentage of leukocytes
EFO:0007995basophil percentage of granulocytes
EFO:0004344birth weight
EFO:0006335systolic blood pressure
EFO:0004318smoking behavior
EFO:0009928Diuretic use measurement
EFO:0010701mean reticulocyte volume
EFO:0004305erythrocyte count

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

16 total (human), top 16 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation2
bisphenol Adecreases methylation1
ethyl-p-hydroxybenzoatedecreases expression1
perfluorooctanoic acidincreases expression1
benzo(e)pyreneincreases methylation1
aflatoxin B2increases methylation1
perfluorooctane sulfonic acidincreases expression1
perfluoro-n-nonanoic acidincreases expression1
bisphenol Sdecreases expression1
Diethylhexyl Phthalatedecreases expression1
Leadaffects expression1
Methapyrileneincreases methylation1
Rotenoneincreases expression1
Valproic Acidincreases methylation1
1-Methyl-4-phenylpyridiniumincreases expression1
Aflatoxin B1increases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.