C4orf33

gene
Also known as FLJ33703

Summary

C4orf33 (chromosome 4 open reading frame 33, HGNC:27025) is a protein-coding gene on chromosome 4q28.2, encoding UPF0462 protein C4orf33 (Q8N1A6).

At a glance

  • GWAS associations: 15
  • Clinical variants (ClinVar): 20 total — 1 pathogenic
  • MANE Select transcript: NM_001099783

Identifiers

Gene identifiers

Field | Value
HGNC ID | HGNC:27025
Approved symbol | C4orf33
Name | chromosome 4 open reading frame 33
Location | 4q28.2
Locus type | gene with protein product
Status | Approved
Aliases | FLJ33703
Ensembl gene | ENSG00000151470
Ensembl biotype | protein_coding
Entrez | 132321

Gene structure

Transcript identifiers

Ensembl transcripts: 19 — 18 protein_coding, 1 retained_intron

ENST00000281146, ENST00000425929, ENST00000502360, ENST00000502887, ENST00000508622, ENST00000508673, ENST00000859483, ENST00000859484, ENST00000859485, ENST00000859486, ENST00000859487, ENST00000859488, ENST00000911334, ENST00000911335, ENST00000911336, ENST00000911337, ENST00000911338, ENST00000911339, ENST00000956407

RefSeq mRNA (2): NM_001099783, NM_173487 (MANE Select: NM_001099783)

CCDS: CCDS3741

Canonical transcript exons

ENST00000425929 — 6 exons

Exon | Start | End
ENSE00001205230 | 129111686 | 129116637
ENSE00003842813 | 129096152 | 129096209
ENSE00003889393 | 129109473 | 129109672
ENSE00003892319 | 129102602 | 129102791
ENSE00003893662 | 129106587 | 129106647
ENSE00003896241 | 129109307 | 129109358
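
The exon table lists exons out of genomic order and gives no lengths. As a minimal sketch, the snippet below sorts the six exons by start coordinate and computes their lengths. It assumes the coordinates are 1-based and inclusive (the usual Ensembl convention; the page itself does not state this), and the function names are illustrative only.

```python
# Exon coordinates copied from the table above: (exon ID, start, end).
EXONS = [
    ("ENSE00001205230", 129111686, 129116637),
    ("ENSE00003842813", 129096152, 129096209),
    ("ENSE00003889393", 129109473, 129109672),
    ("ENSE00003892319", 129102602, 129102791),
    ("ENSE00003893662", 129106587, 129106647),
    ("ENSE00003896241", 129109307, 129109358),
]


def exon_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both ends are counted.
    return end - start + 1


def exons_in_genomic_order(exons):
    # Sort by start coordinate (ascending position on chromosome 4).
    return sorted(exons, key=lambda exon: exon[1])


ordered = exons_in_genomic_order(EXONS)
lengths = {exon_id: exon_length(start, end) for exon_id, start, end in ordered}
total_exonic_bp = sum(lengths.values())

for exon_id, start, end in ordered:
    print(exon_id, start, end, lengths[exon_id])
print("total exonic bp:", total_exonic_bp)
```

Under that coordinate assumption, the total is the summed length of the listed exons, not the coding sequence length, since untranslated regions are included.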

Expression profiles

Bgee: expression breadth ubiquitous, 213 present calls, max score 88.99.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 10.4952 / max 162.8225, expressed in 1675 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter ID | TPM avg | Samples expressed
49648 | 6.0228 | 1238
49649 | 4.3738 | 1510
49650 | 0.0986 | 41

Top tissues by expression

234 total, by Bgee expression score (0-100, higher = more expressed):

Tissue | Anatomy ID | Expression score | Quality
monocyte | CL:0000576 | 88.99 | gold quality
leukocyte | CL:0000738 | 88.23 | gold quality
rectum | UBERON:0001052 | 86.58 | gold quality
male germ line stem cell (sensu Vertebrata) in testis | CL:0000089 ∩ UBERON:0000473 | 85.32 | gold quality
olfactory segment of nasal mucosa | UBERON:0005386 | 84.09 | gold quality
adrenal tissue | UBERON:0018303 | 83.69 | gold quality
duodenum | UBERON:0002114 | 83.49 | gold quality
islet of Langerhans | UBERON:0000006 | 83.33 | gold quality
gall bladder | UBERON:0002110 | 81.25 | gold quality
cerebellar cortex | UBERON:0002129 | 81.12 | gold quality
cerebellar hemisphere | UBERON:0002245 | 81.12 | gold quality
right hemisphere of cerebellum | UBERON:0014890 | 80.78 | gold quality
C1 segment of cervical spinal cord | UBERON:0006469 | 80.38 | gold quality
right adrenal gland cortex | UBERON:0035827 | 80.22 | gold quality
cerebellum | UBERON:0002037 | 80.03 | gold quality
colonic mucosa | UBERON:0000317 | 79.97 | gold quality
bone marrow cell | CL:0002092 | 79.85 | gold quality
right adrenal gland | UBERON:0001233 | 79.48 | gold quality
left adrenal gland cortex | UBERON:0035825 | 79.47 | gold quality
left adrenal gland | UBERON:0001234 | 79.40 | gold quality
mucosa of sigmoid colon | UBERON:0004993 | 79.18 | gold quality
granulocyte | CL:0000094 | 79.02 | gold quality
pancreas | UBERON:0001264 | 78.69 | gold quality
jejunal mucosa | UBERON:0000399 | 78.54 | gold quality
spinal cord | UBERON:0002240 | 78.22 | gold quality
left ovary | UBERON:0002119 | 78.19 | gold quality
adrenal gland | UBERON:0002369 | 78.18 | gold quality
bronchial epithelial cell | CL:0002328 | 78.10 | gold quality
adrenal cortex | UBERON:0001235 | 78.09 | gold quality
mucosa of transverse colon | UBERON:0004991 | 77.86 | gold quality

Single-cell (SCXA)

Detected in 1 experiment; a significant marker in 1.

Experiment | Marker? | Max mean expression
E-ANND-3 | yes | 4.72

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

94 targeting C4orf33, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNA | Max score | Avg score | miRNA target_count
HSA-MIR-432-3P | 100.00 | 67.86 | 705
HSA-MIR-3646 | 100.00 | 73.56 | 5283
HSA-MIR-548N | 99.98 | 71.94 | 4170
HSA-MIR-485-3P | 99.98 | 70.68 | 1585
HSA-MIR-539-3P | 99.98 | 70.74 | 1616
HSA-MIR-3148 | 99.97 | 75.06 | 6478
HSA-MIR-4666A-3P | 99.96 | 71.71 | 3434
HSA-MIR-548AB | 99.95 | 71.31 | 3488
HSA-MIR-559 | 99.95 | 72.28 | 3609
HSA-MIR-548A-5P | 99.94 | 71.27 | 3482
HSA-MIR-548AD-5P | 99.94 | 71.23 | 3502
HSA-MIR-548AE-5P | 99.94 | 71.23 | 3502
HSA-MIR-548AK | 99.94 | 71.24 | 3488
HSA-MIR-548AM-5P | 99.94 | 71.24 | 3488
HSA-MIR-548AP-5P | 99.94 | 71.14 | 3489
HSA-MIR-548AQ-5P | 99.94 | 71.34 | 3426
HSA-MIR-548AR-5P | 99.94 | 71.28 | 3515
HSA-MIR-548AS-5P | 99.94 | 71.22 | 3482
HSA-MIR-548AU-5P | 99.94 | 71.24 | 3488
HSA-MIR-548AY-5P | 99.94 | 71.23 | 3502
HSA-MIR-548B-5P | 99.94 | 71.23 | 3502
HSA-MIR-548BB-5P | 99.94 | 71.27 | 3509
HSA-MIR-548C-5P | 99.94 | 71.24 | 3488
HSA-MIR-548D-5P | 99.94 | 71.23 | 3502
HSA-MIR-548H-5P | 99.94 | 71.24 | 3488
HSA-MIR-548I | 99.94 | 71.25 | 3481
HSA-MIR-548J-5P | 99.94 | 71.14 | 3489
HSA-MIR-548O-5P | 99.94 | 71.24 | 3488
HSA-MIR-548W | 99.94 | 71.24 | 3488
HSA-MIR-548Y | 99.94 | 71.28 | 3514

Cross-species orthologs

4 orthologs

Organism | Symbol | Gene ID
danio_rerio | C1H4orf33 | ENSDARG00000017985
mus_musculus | D3Ertd751e | ENSMUSG00000025766
rattus_norvegicus | C2h4orf33 | ENSRNOG00000038330
caenorhabditis_elegans | (no symbol listed) | WBGENE00007882

Protein

Protein identifiers

UPF0462 protein C4orf33, Q8N1A6 (reviewed: Q8N1A6)

All UniProt accessions (4): D6RD26, D6RIS3, D6RIT3, Q8N1A6

UniProt curated annotations

Similarity. Belongs to the UPF0462 family.

RefSeq proteins (2): NP_001093253, NP_775758 (=MANE)

Domains & families (InterPro)

UniProt features (7 total): sequence variant 4, sequence conflict 2, chain 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

Model | pLDDT | Fraction very-high
AF-Q8N1A6-F1 | 95.50 | 0.95

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 71 (showing top): KOYAMA_SEMA3B_TARGETS_UP, NF1_Q6_01, CREB_Q3, ACEVEDO_LIVER_CANCER_UP, KRIGE_RESPONSE_TO_TOSEDOSTAT_24HR_UP, KRIGE_RESPONSE_TO_TOSEDOSTAT_6HR_DN, chr4q28, MTF1_Q4, ACEVEDO_NORMAL_TISSUE_ADJACENT_TO_LIVER_TUMOR_UP, ELF2_TARGET_GENES, EMX1_TARGET_GENES, HES2_TARGET_GENES, ID2_TARGET_GENES, NR1D1_TARGET_GENES, MIR153_5P

GO Biological Process (0):

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (0):

GO top-level categories

Rollup of top GO terms by namespace:

Category | Terms
binding | 1

Protein interactions and networks

STRING

222 interactions, top by confidence (×1000):

Protein A | Protein B | Partner UniProt | Score
C4orf33 | SYNRG | Q9UMZ2 | 496
C4orf33 | TAF11 | Q15544 | 493
C4orf33 | CCDC185 | Q8N715 | 479
C4orf33 | KHDRBS2 | Q5VWX1 | 460
C4orf33 | GPATCH11 | Q8N954 | 446
C4orf33 | PCDH10 | Q9P2E7 | 427
C4orf33 | ZFP37 | Q9Y6Q3 | 418
C4orf33 | BTAF1 | O14981 | 389
C4orf33 | CDH18 | Q13634 | 383
C4orf33 | SLC31A2 | O15432 | 378
C4orf33 | SFT2D1 | Q8WV19 | 377
C4orf33 | PCDH17 | O14917 | 374
C4orf33 | ZFAND1 | Q8TCF1 | 370
C4orf33 | TMEM60 | Q9H2L4 | 370
C4orf33 | BASP1 | P80723 | 368
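
STRING scores here are stored multiplied by 1000. The sketch below converts them back to the 0 to 1 scale and labels each partner with STRING's commonly used confidence bands (0.15 low, 0.40 medium, 0.70 high, 0.90 highest). The band cutoffs are STRING's conventional ones, not something this page states, and the helper names are illustrative.

```python
# Combined scores (x1000) copied from the STRING table above.
STRING_SCORES = {
    "SYNRG": 496, "TAF11": 493, "CCDC185": 479, "KHDRBS2": 460,
    "GPATCH11": 446, "PCDH10": 427, "ZFP37": 418, "BTAF1": 389,
    "CDH18": 383, "SLC31A2": 378, "SFT2D1": 377, "PCDH17": 374,
    "ZFAND1": 370, "TMEM60": 370, "BASP1": 368,
}


def to_unit_scale(score_x1000):
    # Undo the x1000 scaling used for display.
    return score_x1000 / 1000


def confidence_band(score_x1000):
    # Assumption: conventional STRING cutoffs, compared on the x1000 scale.
    if score_x1000 >= 900:
        return "highest"
    if score_x1000 >= 700:
        return "high"
    if score_x1000 >= 400:
        return "medium"
    if score_x1000 >= 150:
        return "low"
    return "below low"


bands = {partner: confidence_band(s) for partner, s in STRING_SCORES.items()}
medium_partners = sorted(p for p, b in bands.items() if b == "medium")

print(to_unit_scale(STRING_SCORES["SYNRG"]))
print(medium_partners)
```

By these cutoffs none of the listed partners reaches the high-confidence band, so the associations are best read as tentative.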

IntAct

26 interactions, top by confidence:

A | B | Type | Score
C4orf33 | VAC14 | psi-mi:“MI:0915”(physical association) | 0.830
VAC14 | C4orf33 | psi-mi:“MI:0915”(physical association) | 0.830
C4orf33 | TRIP13 | psi-mi:“MI:0915”(physical association) | 0.780
TRIP13 | C4orf33 | psi-mi:“MI:0915”(physical association) | 0.780
C4orf33 | CASP6 | psi-mi:“MI:0915”(physical association) | 0.560
C4orf33 | LAMP2 | psi-mi:“MI:0915”(physical association) | 0.560
C4orf33 | SH3GLB1 | psi-mi:“MI:0915”(physical association) | 0.560
ECE1 | C4orf33 | psi-mi:“MI:0915”(physical association) | 0.370
C4orf33 | VAC14 | psi-mi:“MI:0915”(physical association) | 0.000
C4orf33 | TRIP13 | psi-mi:“MI:0915”(physical association) | 0.000

BioGRID (9): C4orf33 (Two-hybrid), C4orf33 (Two-hybrid), DCXR (Co-fractionation), C4orf33 (Positive Genetic), C4orf33 (Two-hybrid), C4orf33 (Two-hybrid), C4orf33 (Two-hybrid), C4orf33 (Affinity Capture-RNA), EEA1 (Cross-Linking-MS (XL-MS))

ESM2 similar proteins: A2AV36, A3AZW5, A4IF69, A4IG42, A6QQV6, E1BVR9, F1RET2, O02791, P52019, P54803, P57075, Q08BV2, Q28GP0, Q32PY6, Q3U3W5, Q3UDE2, Q3V3E1, Q4R3W5, Q5F204, Q5IH14, Q5M845, Q5RGQ2, Q5T8I9, Q5U4E8, Q5ZI67, Q5ZIB9, Q6DIZ0, Q6EE23, Q6GML1, Q6P2P2, Q6P2T7, Q6P7I0, Q6PCI6, Q75W54, Q7Z3D6, Q7ZU91, Q8BGG7, Q8BGN2, Q8CFC1, Q8N1A6

Diamond homologs: Q28GP0, Q5M845, Q6P2T7, Q8BGN2, Q8N1A6

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

20 variants total. Per-class counts are lower bounds (the true count is at least the value shown, because of a pagination cap):

Classification | Count (floor)
Pathogenic | 1
Likely pathogenic | 0
Uncertain significance | 9
Likely benign | 1
Benign | 0

Top pathogenic / likely-pathogenic (1)

Variant ID | HGVS | Classification
57367 | GRCh38/hg38 4q28.2-31.21(chr4:128119872-142431375)x1 | Pathogenic

SpliceAI

1139 predictions. Top by Δscore:

Variant | Effect | Δscore
4:129102788:GAAG:G | donor_gain | 1.0000
4:129102789:AAGGT:A | donor_loss | 1.0000
4:129102792:G:GA | donor_loss | 1.0000
4:129102793:T:G | donor_loss | 1.0000
4:129106582:A:AG | acceptor_gain | 1.0000
4:129106585:A:G | acceptor_gain | 1.0000
4:129109668:GATTT:G | donor_gain | 1.0000
4:129100308:G:GG | donor_gain | 0.9900
4:129101633:A:AG | donor_gain | 0.9900
4:129102596:TTTTA:T | acceptor_loss | 0.9900
4:129102597:TTTA:T | acceptor_loss | 0.9900
4:129102599:TA:T | acceptor_loss | 0.9900
4:129102599:TAGA:T | acceptor_gain | 0.9900
4:129102600:A:AG | acceptor_gain | 0.9900
4:129102600:A:AT | acceptor_loss | 0.9900
4:129102600:AGAGA:A | acceptor_gain | 0.9900
4:129102601:G:GG | acceptor_gain | 0.9900
4:129102601:GA:G | acceptor_gain | 0.9900
4:129102601:GAGA:G | acceptor_gain | 0.9900
4:129102790:AG:A | donor_gain | 0.9900
4:129102791:GG:G | donor_gain | 0.9900
4:129102792:G:GG | donor_gain | 0.9900
4:129106583:A:G | acceptor_gain | 0.9900
4:129106586:G:GA | acceptor_gain | 0.9900
4:129106586:GTT:G | acceptor_gain | 0.9900
4:129106648:G:GG | donor_gain | 0.9900
4:129109456:A:AG | acceptor_gain | 0.9900
4:129109457:C:G | acceptor_gain | 0.9900
4:129109459:T:G | acceptor_gain | 0.9900
4:129109472:GCAA:G | acceptor_gain | 0.9900
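
The SpliceAI variants are written as colon-separated strings that appear to follow a chromosome:position:ref:alt layout. As a minimal sketch under that assumption, the snippet below splits such a string into typed fields and classifies each variant by comparing allele lengths. The function names are hypothetical and not part of any SpliceAI tooling.

```python
def parse_variant(variant):
    """Split a 'chrom:pos:ref:alt' string into (chrom, pos, ref, alt)."""
    chrom, pos, ref, alt = variant.split(":")
    return chrom, int(pos), ref, alt


def variant_type(ref, alt):
    # Classify by allele length only; no sequence context is consulted.
    if len(ref) == 1 and len(alt) == 1:
        return "SNV"
    if len(alt) > len(ref):
        return "insertion"
    if len(alt) < len(ref):
        return "deletion"
    return "MNV"  # equal-length alleles longer than one base


# A few variants copied from the table above.
examples = [
    "4:129102788:GAAG:G",
    "4:129102792:G:GA",
    "4:129102793:T:G",
]

for text in examples:
    chrom, pos, ref, alt = parse_variant(text)
    print(text, "->", variant_type(ref, alt), "at", f"chr{chrom}:{pos}")
```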

AlphaMissense

1317 scored. Top likely-pathogenic:

Variant | Protein change | am_pathogenicity
4:129106598:T:C | F65L | 0.933
4:129106600:T:A | F65L | 0.933
4:129106600:T:G | F65L | 0.933
4:129109518:T:A | W114R | 0.932
4:129109518:T:C | W114R | 0.932
4:129109581:T:C | F135L | 0.925
4:129109583:T:A | F135L | 0.925
4:129109583:T:G | F135L | 0.925
4:129109620:G:C | A148P | 0.905
4:129111702:T:C | F171L | 0.897
4:129111704:C:A | F171L | 0.897
4:129111704:C:G | F171L | 0.897
4:129106601:T:C | F66L | 0.892
4:129106603:C:A | F66L | 0.892
4:129106603:C:G | F66L | 0.892
4:129109577:T:A | N133K | 0.887
4:129109577:T:G | N133K | 0.887
4:129109520:G:C | W114C | 0.885
4:129109520:G:T | W114C | 0.885
4:129106633:A:C | E76D | 0.877
4:129106633:A:T | E76D | 0.877
4:129109572:T:C | F132L | 0.875
4:129109574:C:A | F132L | 0.875
4:129109574:C:G | F132L | 0.875
4:129109671:T:C | F165L | 0.861
4:129111686:C:A | F165L | 0.861
4:129111686:C:G | F165L | 0.861
4:129106594:A:C | E63D | 0.859
4:129106594:A:T | E63D | 0.859
4:129102635:T:A | W9R | 0.858

dbSNP variants (sampled 300 via entrez): RS1000044094 (4:129107095 C>T), RS1000513379 (4:129112270 A>G), RS1000599268 (4:129116052 A>AG), RS1000671430 (4:129115814 A>ATG), RS1000857991 (4:129093234 C>T), RS10008622 (4:129114596 G>A), RS1000911638 (4:129093447 G>A,C), RS1000916102 (4:129109128 A>G), RS10009430 (4:129109608 C>T), RS1001027976 (4:129105629 T>G), RS1001103428 (4:129098809 T>C), RS1001238116 (4:129106107 A>G), RS1001442593 (4:129105813 T>C), RS1001525421 (4:129098623 G>A), RS1001870757 (4:129103101 G>A,T)

Disease associations

OMIM: gene (none) | disease phenotypes: (none)

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

15 associations (top):

Study | Trait | p-value
GCST002783_177 | Body mass index | 1.000000e-06
GCST002783_281 | Body mass index | 1.000000e-06
GCST002783_75 | Body mass index | 3.000000e-07
GCST004735_22 | Epstein-Barr virus copy number in lymphoblastoid cell lines | 1.000000e-06
GCST004863_140 | Mosquito bite size | 4.000000e-08
GCST005833_2 | Remission after SSRI treatment in MDD or openness | 4.000000e-07
GCST007326_38 | Number of sexual partners | 4.000000e-10
GCST007576_262 | Chronotype | 1.000000e-10
GCST008830_14 | Neurofibrillary tangles | 3.000000e-06
GCST009733_169 | Urinary metabolite levels in chronic kidney disease | 6.000000e-18
GCST010922_3 | Hip bone mineral density and total body fat mass (bivariate analysis) | 6.000000e-09
GCST010988_77 | Adult body size | 2.000000e-10
GCST010989_244 | Body size at age 10 | 1.000000e-11
GCST012020_3 | Serum metabolite levels | 7.000000e-22
GCST90002409_36 | Childhood body mass index | 3.000000e-06
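
The listed p-values span many orders of magnitude, and several are weaker than the threshold usually applied in GWAS. As a rough sketch, the snippet below separates associations that pass the conventional genome-wide significance cutoff of 5e-8 from the suggestive ones. The cutoff is a common convention, not a value this page states, and the variable names are illustrative.

```python
# (study, trait, p-value) rows copied from the GWAS table above.
GWAS_HITS = [
    ("GCST002783_177", "Body mass index", 1e-06),
    ("GCST002783_281", "Body mass index", 1e-06),
    ("GCST002783_75", "Body mass index", 3e-07),
    ("GCST004735_22", "Epstein-Barr virus copy number in lymphoblastoid cell lines", 1e-06),
    ("GCST004863_140", "Mosquito bite size", 4e-08),
    ("GCST005833_2", "Remission after SSRI treatment in MDD or openness", 4e-07),
    ("GCST007326_38", "Number of sexual partners", 4e-10),
    ("GCST007576_262", "Chronotype", 1e-10),
    ("GCST008830_14", "Neurofibrillary tangles", 3e-06),
    ("GCST009733_169", "Urinary metabolite levels in chronic kidney disease", 6e-18),
    ("GCST010922_3", "Hip bone mineral density and total body fat mass (bivariate analysis)", 6e-09),
    ("GCST010988_77", "Adult body size", 2e-10),
    ("GCST010989_244", "Body size at age 10", 1e-11),
    ("GCST012020_3", "Serum metabolite levels", 7e-22),
    ("GCST90002409_36", "Childhood body mass index", 3e-06),
]

# Assumption: the conventional genome-wide significance threshold.
GENOME_WIDE_P = 5e-8

significant = sorted(
    (hit for hit in GWAS_HITS if hit[2] < GENOME_WIDE_P),
    key=lambda hit: hit[2],
)
suggestive = [hit for hit in GWAS_HITS if hit[2] >= GENOME_WIDE_P]

for study, trait, p in significant:
    print(f"{p:.0e}  {study}  {trait}")
```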

EFO canonical traits (9, from GWAS)

EFO ID | Trait name
EFO:0004340 | body mass index
EFO:0008378 | mosquito bite reaction size measurement
EFO:0005658 | response to selective serotonin reuptake inhibitor
EFO:0007914 | openness measurement
EFO:0008328 | chronotype measurement
EFO:0006797 | neurofibrillary tangles measurement
EFO:0005116 | urinary metabolite measurement
EFO:0007702 | hip bone mineral density
EFO:0009819 | comparative body size at age 10, self-reported

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

35 total (human), top 30 by PubMed support.

Chemical | Actions (top 5) | PubMed papers
Valproic Acid | affects cotreatment, increases expression, affects expression, decreases methylation | 5
methylmercuric chloride | decreases expression | 3
Acetaminophen | decreases expression, increases expression | 2
Cyclosporine | decreases expression | 2
dicrotophos | decreases expression | 1
triphenyl phosphate | affects expression | 1
bisphenol A | decreases methylation | 1
trichostatin A | increases expression | 1
potassium chromate(VI) | decreases expression | 1
di-n-butylphosphoric acid | affects expression | 1
perfluorooctane sulfonic acid | decreases expression | 1
entinostat | increases expression | 1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamide | affects cotreatment, increases expression | 1
belinostat | increases expression | 1
2-methyl-2H-pyrazole-3-carboxylic acid (2-methyl-4-o-tolylazophenyl)amide | decreases expression, decreases reaction | 1
dorsomorphin | affects cotreatment, increases expression | 1
bisphenol S | increases methylation | 1
jinfukang | decreases expression | 1
2,3,5-trichloro-6-phenyl-(1,4)benzoquinone | increases expression | 1
Sunitinib | decreases expression | 1
Fulvestrant | decreases methylation | 1
Vorinostat | increases expression | 1
Vehicle Emissions | decreases reaction, decreases expression | 1
Cisplatin | increases expression | 1
Drugs, Chinese Herbal | increases expression | 1
Estradiol | decreases expression | 1
Ethyl Methanesulfonate | increases expression | 1
Methyl Methanesulfonate | increases expression | 1
Naphthoquinones | increases expression | 1
Silicon Dioxide | decreases expression | 1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): Epstein-Barr virus infection