C1orf116

gene
On this page

Also known as SARGFLJ36507MGC2742MGC4309

Summary

C1orf116 (chromosome 1 open reading frame 116, HGNC:28667) is a protein-coding gene on chromosome 1q32.1, encoding Specifically androgen-regulated gene protein (Q9BW04). Putative androgen-specific receptor.

Located in cytosol and plasma membrane.

Source: NCBI Gene 79098 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 17 total
  • MANE Select transcript: NM_023938

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:28667
Approved symbolC1orf116
Namechromosome 1 open reading frame 116
Location1q32.1
Locus typegene with protein product
StatusApproved
AliasesSARG, FLJ36507, MGC2742, MGC4309
Ensembl geneENSG00000182795
Ensembl biotypeprotein_coding
OMIM611680
Entrez79098

Gene structure

Transcript identifiers

Ensembl transcripts: 5 — 5 protein_coding

ENST00000359470, ENST00000461135, ENST00000895722, ENST00000938289, ENST00000945944

RefSeq mRNA: 2 — MANE Select: NM_023938 NM_001083924, NM_023938

CCDS: CCDS1475, CCDS44306

Canonical transcript exons

ENST00000359470 — 4 exons

ExonStartEnd
ENSE00001443447207027494207027679
ENSE00001686775207018522207023480
ENSE00001879330207032579207032756
ENSE00003649328207024887207025064

Expression profiles

Bgee: expression breadth ubiquitous, 181 present calls, max score 98.64.

FANTOM5 (CAGE): breadth broad, TPM avg 2.9809 / max 105.3374, expressed in 292 samples.

FANTOM5 promoters (6 alternative TSS)

Promoter IDTPM avgSamples expressed
171501.3562210
171520.7864187
171510.6125193
171490.099472
171480.088963
171530.037615

Top tissues by expression

271 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
pancreatic ductal cellCL:000207998.64gold quality
esophagus squamous epitheliumUBERON:000692098.36gold quality
amniotic fluidUBERON:000017398.29gold quality
tongue squamous epitheliumUBERON:000691998.01gold quality
lower esophagus mucosaUBERON:003583497.97gold quality
epithelium of esophagusUBERON:000197697.33gold quality
squamous epitheliumUBERON:000691497.27gold quality
cervix squamous epitheliumUBERON:000692296.87gold quality
palpebral conjunctivaUBERON:000181295.65gold quality
oral cavityUBERON:000016795.61gold quality
esophagus mucosaUBERON:000246994.86gold quality
pharyngeal mucosaUBERON:000035594.71gold quality
gingivaUBERON:000182894.68gold quality
gingival epitheliumUBERON:000194994.54gold quality
lower lobe of lungUBERON:000894993.48gold quality
upper leg skinUBERON:000426291.84gold quality
visceral pleuraUBERON:000240191.37gold quality
lungUBERON:000204891.00gold quality
upper arm skinUBERON:000426390.95gold quality
hair follicleUBERON:000207390.35gold quality
upper lobe of lungUBERON:000894890.15gold quality
upper lobe of left lungUBERON:000895289.98gold quality
cervix epitheliumUBERON:000480189.87gold quality
skin of legUBERON:000151188.77gold quality
zone of skinUBERON:000001488.66gold quality
epithelial cell of pancreasCL:000008388.15gold quality
skin of abdomenUBERON:000141688.15gold quality
right lungUBERON:000216788.13gold quality
islet of LangerhansUBERON:000000687.91gold quality
mouth mucosaUBERON:000372987.65gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3no0.00

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): AR

Literature-anchored findings (GeneRIF, showing 1)

  • SARG mRNA expression is high in prostate tissue. SARG is composed of four exons and spans a region of 14.5 kbp on chromosome 1q32.2. (PMID:15525603)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriozgc:158258ENSDARG00000070229
danio_reriosi:ch73-184c24.1ENSDARG00000074827
mus_musculusAA986860ENSMUSG00000042510
rattus_norvegicusC13h1orf116ENSRNOG00000004341

Protein

Protein identifiers

Specifically androgen-regulated gene proteinQ9BW04 (reviewed: Q9BW04)

All UniProt accessions (1): Q9BW04

UniProt curated annotations — full annotation on UniProt →

Function. Putative androgen-specific receptor.

Subcellular location. Cytoplasm.

Tissue specificity. Highly expressed in prostate.

Induction. Expression is up-regulated by androgen, but not by glucocorticoids.

Similarity. Belongs to the SARG family.

Isoforms (2)

UniProt IDNamesCanonical?
Q9BW04-11yes
Q9BW04-22

RefSeq proteins (2): NP_001077393, NP_076427* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR026152SARGFamily

Pfam: PF15385

UniProt features (27 total): compositionally biased region 12, sequence variant 7, region of interest 3, modified residue 3, chain 1, splice variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9BW04-F154.110.07

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (3): 131, 133, 519

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 100 (showing top): BERTUCCI_MEDULLARY_VS_DUCTAL_BREAST_CANCER_DN, MODULE_255, JAEGER_METASTASIS_DN, MODULE_317, AP4_Q6, TGACCTY_ERR1_Q2, HNF1_Q6, CAGCTG_AP4_Q5, ONDER_CDH1_TARGETS_3_DN, RICKMAN_TUMOR_DIFFERENTIATED_WELL_VS_POORLY_DN, FREAC3_01, SENGUPTA_NASOPHARYNGEAL_CARCINOMA_DN, CHARAFE_BREAST_CANCER_BASAL_VS_MESENCHYMAL_UP, SHEDDEN_LUNG_CANCER_GOOD_SURVIVAL_A4, SCHAEFFER_PROSTATE_DEVELOPMENT_6HR_DN

GO Biological Process (0):

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (4): cytoplasm (GO:0005737), cytosol (GO:0005829), plasma membrane (GO:0005886), extracellular exosome (GO:0070062)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure2
binding1
intracellular anatomical structure1
cytoplasm1
membrane1
cell periphery1
extracellular vesicle1

Protein interactions and networks

STRING

524 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
C1orf116NKX3-1Q99801547
C1orf116OS9Q13438543
C1orf116INIPQ9NRY2460
C1orf116ZNF888P0CJ79448
C1orf116PRR15LQ9BU68427
C1orf116C9orf152Q5JTZ5411
C1orf116PAK1IP1Q9NWT1395
C1orf116H0Y8G9H0Y8G9370
C1orf116HERC3Q15034370
C1orf116C1orf74Q96LT6367
C1orf116MED28Q9H204359
C1orf116PMEPA1Q969W9357
C1orf116KIAA0040Q15053354
C1orf116KLK2P20151349
C1orf116TRIM7Q9C029346

IntAct

20 interactions, top by confidence:

ABTypeScore
SARGHOMER1psi-mi:“MI:0915”(physical association)0.780
HOMER1SARGpsi-mi:“MI:0915”(physical association)0.780
SARGHOMER3psi-mi:“MI:0915”(physical association)0.560
SARGKRT14psi-mi:“MI:0915”(physical association)0.400
SMAD2FAM83Gpsi-mi:“MI:0915”(physical association)0.400
SMAD3FAM83Gpsi-mi:“MI:0915”(physical association)0.400
SARGECE1psi-mi:“MI:0915”(physical association)0.370
ECE1SARGpsi-mi:“MI:0915”(physical association)0.370
CCR1UBA6psi-mi:“MI:0914”(association)0.350
SSUH2IGLC7psi-mi:“MI:0914”(association)0.350
SMPD2A2ML1psi-mi:“MI:0914”(association)0.350
CDH1ESYT2psi-mi:“MI:2364”(proximity)0.270
HOMER1SARGpsi-mi:“MI:0915”(physical association)0.000
HOMER3SARGpsi-mi:“MI:0915”(physical association)0.000

BioGRID (27): C1orf116 (Two-hybrid), C1orf116 (Two-hybrid), C1orf116 (Reconstituted Complex), HOMER1 (Two-hybrid), C1orf116 (Proximity Label-MS), C1orf116 (Affinity Capture-MS), C1orf116 (Proximity Label-MS), C1orf116 (Two-hybrid), C1orf116 (Two-hybrid), C1orf116 (Proximity Label-MS), C1orf116 (Proximity Label-MS), C1orf116 (Affinity Capture-MS), C1orf116 (Affinity Capture-MS), C1orf116 (Affinity Capture-MS), C1orf116 (Co-fractionation)

ESM2 similar proteins: A1L170, A4IFJ0, A5D7K1, A6H7B4, A6NGG8, A6X8Z5, B1AXH1, D3ZMK9, O08696, O14513, O43151, P01099, P10637, P19103, P19332, Q08DN6, Q13522, Q2M1Z3, Q2TBN9, Q571I4, Q58CU6, Q5HYW2, Q5JSZ5, Q5M831, Q5M865, Q60664, Q640N3, Q68DA7, Q6DJE5, Q6PAC4, Q6ZW13, Q7LBC6, Q80U35, Q80U49, Q86YV5, Q8BG87, Q8C3W1, Q8C5R2, Q8C5W0, Q8WYL5

Diamond homologs: A5D7K1, Q499V8, Q8BI29, Q9BW04

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

17 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance8
Likely benign3
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

561 predictions. Top by Δscore:

VariantEffectΔscore
1:207027493:CAGAT:Cdonor_gain1.0000
1:207024927:T:TAdonor_gain0.9900
1:207025060:TCACT:Tacceptor_gain0.9900
1:207025061:CACT:Cacceptor_gain0.9900
1:207025061:CACTC:Cacceptor_gain0.9900
1:207025063:CT:Cacceptor_gain0.9900
1:207025064:TC:Tacceptor_loss0.9900
1:207025065:C:CCacceptor_gain0.9900
1:207025065:C:Tacceptor_loss0.9900
1:207025074:T:Cacceptor_gain0.9900
1:207025074:T:TCacceptor_gain0.9900
1:207027492:A:ACdonor_gain0.9900
1:207027493:C:CCdonor_gain0.9900
1:207023492:A:Tacceptor_gain0.9800
1:207024885:AC:Adonor_gain0.9800
1:207024885:ACC:Adonor_gain0.9800
1:207024886:CC:Cdonor_gain0.9800
1:207024886:CCC:Cdonor_gain0.9800
1:207025068:T:TCacceptor_gain0.9800
1:207027485:A:ACdonor_gain0.9800
1:207027489:CGTA:Cdonor_gain0.9800
1:207027493:CA:Cdonor_gain0.9800
1:207027493:CAG:Cdonor_gain0.9800
1:207025062:ACT:Aacceptor_gain0.9700
1:207025063:CTC:Cacceptor_gain0.9700
1:207025064:TCT:Tacceptor_gain0.9700
1:207025068:T:Cacceptor_gain0.9700
1:207025080:T:TCacceptor_gain0.9700
1:207025114:G:Tacceptor_gain0.9700
1:207023477:CCTC:Cacceptor_gain0.9600

AlphaMissense

3864 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
1:207021971:A:GL598P0.993
1:207022034:A:TV577D0.993
1:207022697:A:GL356P0.993
1:207022716:C:GA350P0.992
1:207025018:A:GL51P0.990
1:207022703:A:GL354P0.988
1:207021986:A:GL593P0.987
1:207025011:G:CF53L0.987
1:207025011:G:TF53L0.987
1:207025013:A:GF53L0.987
1:207025009:A:GL54P0.986
1:207022724:C:GR347P0.985
1:207021977:A:GL596P0.983
1:207021971:A:TL598Q0.982
1:207022725:G:TR347S0.982
1:207022705:C:AK353N0.981
1:207022705:C:GK353N0.981
1:207025001:T:GT57P0.980
1:207021990:C:GA592P0.979
1:207022028:A:GI579T0.979
1:207022712:A:GL351P0.979
1:207022703:A:TL354Q0.975
1:207025009:A:TL54Q0.974
1:207025012:A:GF53S0.974
1:207022028:A:CI579S0.973
1:207022700:C:AG355V0.972
1:207023152:G:CF204L0.972
1:207023152:G:TF204L0.972
1:207023154:A:GF204L0.972
1:207022697:A:TL356Q0.971

dbSNP variants (sampled 300 via entrez): RS1000029117 (1:207028869 T>C), RS1000776646 (1:207022065 C>A), RS1001413815 (1:207025171 C>T), RS1001538025 (1:207026206 A>T), RS1001616031 (1:207030762 G>A,C), RS1001866850 (1:207023667 A>G), RS1001977493 (1:207029145 T>C), RS1002045476 (1:207030587 T>G), RS1002148906 (1:207034561 A>G), RS1002346483 (1:207029458 T>C), RS1002361975 (1:207030735 G>A), RS1002455634 (1:207026298 C>T), RS1002645275 (1:207031602 C>A), RS1002774570 (1:207019564 A>G), RS1002823019 (1:207030940 G>A)

Disease associations

OMIM: gene MIM:611680 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

49 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyrenedecreases expression, decreases methylation, increases expression4
bisphenol Adecreases expression, affects cotreatment, increases methylation2
sodium arsenitedecreases expression, increases expression2
Resveratrolaffects cotreatment, decreases expression, increases expression2
Tobacco Smoke Pollutionaffects expression, increases expression2
Valproic Acidaffects expression, decreases expression2
aristolochic acid Iincreases expression1
FR900359affects phosphorylation1
sotorasibdecreases expression, affects cotreatment1
propionaldehydeincreases expression1
pyrogallol 1,3-dimethyl etheraffects localization, decreases expression, affects cotreatment1
2-methyl-4-isothiazolin-3-oneincreases expression1
ethyl-p-hydroxybenzoateincreases expression1
tris(2-butoxyethyl) phosphateaffects expression1
beta-lapachoneincreases expression1
S-(1,2-dichlorovinyl)cysteineaffects response to substance, increases expression, affects cotreatment1
enzalutamidedecreases expression1
jinfukangincreases expression, affects cotreatment1
NSC 689534increases expression1
trametinibaffects cotreatment, decreases expression1
NVP-BKM120affects cotreatment, decreases expression1
theaflavin-3,3’-digallateaffects expression1
Fulvestrantaffects cotreatment, increases methylation1
Acetaminophendecreases expression1
Ethanolaffects cotreatment, increases abundance, increases expression1
Calcitriolincreases expression1
Camptothecinincreases expression1
Cisplatinaffects cotreatment, increases expression1
Dichlorodiphenyl Dichloroethylenedecreases expression1
Estradiolaffects cotreatment, decreases expression1

Cellosaurus cell lines

3 cell lines: 3 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_C9B3CLF_PEDS0005_T1Cancer cell line
CVCL_C9B4CLF_PEDS0005_T2ACancer cell line
CVCL_C9B5CLF_PEDS0005_T2BCancer cell line

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.