C3orf49

gene
On this page

Summary

C3orf49 (chromosome 3 open reading frame 49, HGNC:25190) is a protein-coding gene on chromosome 3p14.1, encoding Putative uncharacterized protein C3orf49 (Q96BT1). It is a selective cancer dependency (DepMap: 13.7% of cell lines).

At a glance

  • GWAS associations: 10
  • Clinical variants (ClinVar): 35 total
  • Cancer dependency (DepMap): dependent in 13.7% of screened cell lines
  • MANE Select transcript: NM_001355236

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:25190
Approved symbolC3orf49
Namechromosome 3 open reading frame 49
Location3p14.1
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000163632
Ensembl biotypeprotein_coding
Entrez132200

Gene structure

Transcript identifiers

Ensembl transcripts: 4 — 4 protein_coding

ENST00000295896, ENST00000491896, ENST00000647022, ENST00000673037

RefSeq mRNA: 1 — MANE Select: NM_001355236 NM_001355236

CCDS: CCDS87102

Canonical transcript exons

ENST00000295896 — 7 exons

ExonStartEnd
ENSE000010767866383168063831844
ENSE000011254346382325063823569
ENSE000013662306384502363845082
ENSE000015338126384836463848636
ENSE000036673956383111063831223
ENSE000036782196382760163827725
ENSE000038896906381929963819596

Expression profiles

Bgee: expression breadth ubiquitous, 157 present calls, max score 80.61.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0561 / max 99.9489, expressed in 2 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
371430.02511
371420.01972
371410.01141

Top tissues by expression

241 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047380.61gold quality
secondary oocyteCL:000065573.05silver quality
prefrontal cortexUBERON:000045166.59gold quality
cerebellar hemisphereUBERON:000224565.04gold quality
cerebellar cortexUBERON:000212965.03gold quality
oocyteCL:000002364.86gold quality
superficial temporal arteryUBERON:000161464.52gold quality
right hemisphere of cerebellumUBERON:001489064.00gold quality
cerebellumUBERON:000203763.70gold quality
Brodmann (1909) area 9UBERON:001354063.12gold quality
calcaneal tendonUBERON:000370163.03gold quality
muscle of legUBERON:000138362.28gold quality
gastrocnemiusUBERON:000138862.26gold quality
spermCL:000001961.59silver quality
frontal cortexUBERON:000187059.84gold quality
adrenal tissueUBERON:001830359.78gold quality
neocortexUBERON:000195059.00gold quality
dorsolateral prefrontal cortexUBERON:000983458.87gold quality
right frontal lobeUBERON:000281058.54gold quality
anterior cingulate cortexUBERON:000983558.36gold quality
primary visual cortexUBERON:000243658.13gold quality
smooth muscle tissueUBERON:000113557.68gold quality
esophagus squamous epitheliumUBERON:000692057.68gold quality
corpus callosumUBERON:000233656.98gold quality
epithelium of nasopharynxUBERON:000195156.95gold quality
vena cavaUBERON:000408756.85gold quality
occipital lobeUBERON:000202156.81gold quality
gingival epitheliumUBERON:000194956.68gold quality
skin of hipUBERON:000155455.91gold quality
descending thoracic aortaUBERON:000234555.90gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no4.68

Regulation

Is transcription factor: no

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 13.7% of screened cell lines.

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
mus_musculusGm11100ENSMUSG00000079357
rattus_norvegicusC15h3orf49ENSRNOG00000027633
drosophila_melanogasterCG11929FBGN0031620

Protein

Protein identifiers

Putative uncharacterized protein C3orf49Q96BT1 (reviewed: Q96BT1)

All UniProt accessions (4): Q96BT1, A0A2R8Y7G4, A0A2R8YD18, A0A5F9ZI70

RefSeq proteins (1): NP_001342165* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR055322C3orf49-likeFamily

UniProt features (2 total): chain 1, region of interest 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q96BT1-F161.010.21

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 11 (showing top): GSE37336_LY6C_POS_VS_NEG_NAIVE_CD4_TCELL_UP, CCANNAGRKGGC_UNKNOWN, chr3p14, TGGAAA_NFAT_Q4_01, SOX10_TARGET_GENES, HBZ_TARGET_GENES, MXD1_TARGET_GENES, GSE17974_IL4_AND_ANTI_IL12_VS_UNTREATED_0.5H_ACT_CD4_TCELL_DN, RYCACNNRNNRNCAG_UNKNOWN, GSE40666_UNTREATED_VS_IFNA_STIM_EFFECTOR_CD8_TCELL_90MIN_UP, GSE20727_DNFB_ALLERGEN_VS_ROS_INH_AND_DNFB_ALLERGEN_TREATED_DC_UP

GO Biological Process (0):

GO Molecular Function (0):

GO Cellular Component (0):

Protein interactions and networks

STRING

48 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
C3orf49ZNF823P16415521
C3orf49THOC7Q6I9Y2433
C3orf49MT1FP04733305
C3orf49NRGNQ92686287
C3orf49ENPP1P22413227
C3orf49NDNFQ8TB73178
C3orf49COL2A1P02458167
C3orf49EPYCQ99645165
C3orf49SPRR2AP35326159
C3orf49SPRR3Q9UBC9159
C3orf49S100A2P29034153
C3orf49FARP2O948870
C3orf49A0A0A6YYI9A0A0A6YYI90
C3orf49CENPBP071990
C3orf49UBTFP174800
C3orf49BPTFQ128300
C3orf49STX3Q132770
C3orf49DPYSL3Q141950

IntAct

3 interactions, top by confidence:

ABTypeScore
C3orf49DPYSL3psi-mi:“MI:0915”(physical association)0.400
C3orf49FARP2psi-mi:“MI:0914”(association)0.350

ESM2 similar proteins: A0A1B0GUA6, A2AHC3, A2APA5, A2ARZ3, A3KP40, A5WUN7, A6H5Y1, D3Z8E6, D3ZJ47, D4AEC2, E9Q309, F5H4B4, O60303, Q08AD1, Q0P5X5, Q32LN6, Q3KQW7, Q3TCJ8, Q3V036, Q3ZAV0, Q4R815, Q5CZC0, Q5JX69, Q5T5Y3, Q5VT06, Q60664, Q66H35, Q6IRN6, Q6ZVD7, Q7PCK7, Q8BZJ8, Q8C4J0, Q8C636, Q8CB14, Q8CCG1, Q8IYD9, Q8K3V7, Q8R2H7, Q8WWI1, Q96BT1

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

35 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance22
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1958 predictions. Top by Δscore:

VariantEffectΔscore
3:63819582:G:GTdonor_gain1.0000
3:63819583:A:Tdonor_gain1.0000
3:63819593:AAAG:Adonor_loss1.0000
3:63819595:AGGT:Adonor_loss1.0000
3:63819596:GGT:Gdonor_loss1.0000
3:63819598:T:Gdonor_loss1.0000
3:63827596:T:Aacceptor_gain1.0000
3:63831203:G:GGdonor_gain1.0000
3:63831754:GT:Gdonor_gain1.0000
3:63831826:G:GTdonor_gain1.0000
3:63831826:G:Tdonor_gain1.0000
3:63834200:C:CCacceptor_gain1.0000
3:63835152:A:ACdonor_gain1.0000
3:63835152:ACTTT:Adonor_gain1.0000
3:63835153:C:CCdonor_gain1.0000
3:63835153:CTTTC:Cdonor_gain1.0000
3:63835156:T:Adonor_gain1.0000
3:63835220:CCAG:Cacceptor_gain1.0000
3:63835221:CAG:Cacceptor_gain1.0000
3:63835221:CAGC:Cacceptor_gain1.0000
3:63835222:AGC:Aacceptor_loss1.0000
3:63835224:C:CCacceptor_gain1.0000
3:63835225:T:Aacceptor_loss1.0000
3:63835226:G:Cacceptor_gain1.0000
3:63835226:G:GCacceptor_gain1.0000
3:63835229:T:TCacceptor_gain1.0000
3:63835328:T:Cdonor_gain1.0000
3:63835388:TCCC:Tacceptor_loss1.0000
3:63835389:CC:Cacceptor_gain1.0000
3:63835390:CC:Cacceptor_gain1.0000

AlphaMissense

1911 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
3:63831690:T:CL232P0.973
3:63831747:T:CL251P0.972
3:63831797:T:CF268L0.969
3:63831799:C:AF268L0.969
3:63831799:C:GF268L0.969
3:63831723:T:CL243P0.962
3:63831736:A:CR247S0.958
3:63831736:A:TR247S0.958
3:63831780:T:CL262P0.952
3:63831740:G:CA249P0.950
3:63831798:T:CF268S0.949
3:63831713:T:CS240P0.948
3:63831777:T:CI261T0.948
3:63831767:G:TG258W0.943
3:63831213:T:CL225P0.935
3:63831767:G:AG258R0.934
3:63831767:G:CG258R0.934
3:63831777:T:GI261S0.934
3:63823251:T:AW43R0.926
3:63823251:T:CW43R0.926
3:63831793:A:CK266N0.921
3:63831793:A:TK266N0.921
3:63831755:T:CC254R0.917
3:63831714:C:TS240F0.916
3:63831768:G:AG258E0.914
3:63831768:G:TG258V0.914
3:63831798:T:GF268C0.908
3:63831726:T:CL244P0.906
3:63831788:T:CS265P0.905
3:63831789:C:TS265F0.897

dbSNP variants (sampled 300 via entrez): RS1000074512 (3:63835748 A>C), RS1000096696 (3:63806162 T>G), RS1000120340 (3:63848844 T>C), RS1000126641 (3:63836040 T>G), RS1000161965 (3:63830783 TCTTA>T), RS1000183844 (3:63822255 G>A,C), RS1000207184 (3:63779517 A>G,T), RS1000234474 (3:63831006 A>C,G), RS1000291921 (3:63783341 C>T), RS1000374389 (3:63843151 C>T), RS1000387014 (3:63824381 C>T), RS1000387201 (3:63815529 G>A), RS1000400993 (3:63783726 A>G,T), RS1000487293 (3:63807871 A>C,G), RS1000630319 (3:63844028 G>A)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

10 associations (top):

StudyTraitp-value
GCST002539_49Schizophrenia1.000000e-08
GCST006803_106Schizophrenia1.000000e-10
GCST008595_34Cognitive ability, years of educational attainment or schizophrenia (pleiotropy)2.000000e-08
GCST010698_66Subcortical volume (min-P)1.000000e-16
GCST010699_21Brain morphology (min-P)9.000000e-10
GCST010701_75Cortical surface area (MOSTest)4.000000e-09
GCST010702_127Subcortical volume (MOSTest)4.000000e-09
GCST010703_71Brain morphology (MOSTest)3.000000e-08
GCST90002398_124Neutrophil count2.000000e-09
GCST90002407_49White blood cell count5.000000e-11

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0004337intelligence
EFO:0004784self reported educational attainment
EFO:0004346neuroimaging measurement
EFO:0004833neutrophil count

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

3 total (human), top 3 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneincreases methylation1
Tobacco Smoke Pollutiondecreases expression1
Aflatoxin B1increases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.