C20orf173

gene
On this page

Also known as dJ477O4.4

Summary

C20orf173 (chromosome 20 open reading frame 173, HGNC:16166) is a protein-coding gene on chromosome 20q11.22, encoding Uncharacterized protein C20orf173 (Q96LM9).

Predicted to enable beta-galactoside (CMP) alpha-2,3-sialyltransferase activity. Predicted to be involved in ganglioside biosynthetic process via lactosylceramide; protein glycosylation; and sialylation. Predicted to be active in membrane.

Source: NCBI Gene 140873 — RefSeq curated summary.

At a glance

  • GWAS associations: 5
  • Clinical variants (ClinVar): 3 total — 1 pathogenic
  • MANE Select transcript: NM_001145350

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:16166
Approved symbolC20orf173
Namechromosome 20 open reading frame 173
Location20q11.22
Locus typegene with protein product
StatusApproved
AliasesdJ477O4.4
Ensembl geneENSG00000125975
Ensembl biotypeprotein_coding
Entrez140873

Gene structure

Transcript identifiers

Ensembl transcripts: 4 — 4 protein_coding

ENST00000246199, ENST00000374345, ENST00000424444, ENST00000444723

RefSeq mRNA: 1 — MANE Select: NM_001145350 NM_001145350

CCDS: CCDS46594

Canonical transcript exons

ENST00000444723 — 6 exons

ExonStartEnd
ENSE000034626723552845135528544
ENSE000035271223552955935529652
ENSE000035617733552906535529424
ENSE000035801283552870135528879
ENSE000036548783552823335528284
ENSE000038987323552697935527250

Expression profiles

Bgee: expression breadth broad, 27 present calls, max score 98.77.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0313 / max 32.9292, expressed in 3 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
1870600.03133

Top tissues by expression

91 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047398.77gold quality
right testisUBERON:000453489.04gold quality
left testisUBERON:000453388.80gold quality
testisUBERON:000047388.11gold quality
granulocyteCL:000009442.62silver quality
bone marrow cellCL:000209241.49gold quality
bone marrowUBERON:000237137.62silver quality
colonic epitheliumUBERON:000039737.20gold quality
sural nerveUBERON:001548836.87gold quality
ventricular zoneUBERON:000305336.48gold quality
cortical plateUBERON:000534336.47gold quality
hindlimb stylopod muscleUBERON:000425236.25silver quality
mucosa of stomachUBERON:000119936.17silver quality
ganglionic eminenceUBERON:000402335.49gold quality
superior frontal gyrusUBERON:000266135.21gold quality
tonsilUBERON:000237233.14gold quality
bloodUBERON:000017832.30gold quality
lymph nodeUBERON:000002931.47silver quality
primary visual cortexUBERON:000243630.88gold quality
leukocyteCL:000073829.91silver quality
stromal cell of endometriumCL:000225529.87gold quality
uterine cervixUBERON:000000229.70gold quality
monocyteCL:000057629.30silver quality
olfactory segment of nasal mucosaUBERON:000538629.24gold quality
endocervixUBERON:000045829.19gold quality
urinary bladderUBERON:000125529.11gold quality
gall bladderUBERON:000211028.99silver quality
muscle of legUBERON:000138328.47gold quality
duodenumUBERON:000211428.14gold quality
myometriumUBERON:000129628.08silver quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no1.31

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

11 targeting C20orf173, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-5193100.0067.261744
HSA-MIR-6744-5P99.9366.82748
HSA-MIR-466399.6265.33957
HSA-MIR-129099.5969.902079
HSA-MIR-502-5P98.7766.51906
HSA-MIR-3192-3P98.6265.80970
HSA-MIR-64098.4466.93644
HSA-MIR-3151-3P97.8066.16479
HSA-MIR-197297.6767.381172
HSA-MIR-6818-5P97.5067.101167
HSA-MIR-500B-3P96.4965.401087

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriost3gal8ENSDARG00000007494
danio_reriost3gal7ENSDARG00000039064
danio_rerioENSDARG00000112644
mus_musculus6430550D23RikENSMUSG00000074646
rattus_norvegicusC3h20orf173ENSRNOG00000047596

Paralogs (14): ST3GAL1 (ENSG00000008513), ST3GAL6 (ENSG00000064225), ST6GALNAC1 (ENSG00000070526), ST6GALNAC2 (ENSG00000070731), ST6GAL1 (ENSG00000073849), ST3GAL4 (ENSG00000110080), ST3GAL5 (ENSG00000115525), ST6GALNAC5 (ENSG00000117069), ST3GAL3 (ENSG00000126091), ST6GALNAC4 (ENSG00000136840), ST6GAL2 (ENSG00000144057), ST3GAL2 (ENSG00000157350), ST6GALNAC6 (ENSG00000160408), ST6GALNAC3 (ENSG00000184005)

Protein

Protein identifiers

Uncharacterized protein C20orf173Q96LM9 (reviewed: Q96LM9)

All UniProt accessions (2): Q96LM9, Q4VXR9

Isoforms (2)

UniProt IDNamesCanonical?
Q96LM9-11yes
Q96LM9-22

RefSeq proteins (1): NP_001138822* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR038578GT29-like_sfHomologous_superfamily
IPR051757Beta-gal_alpha2-3_sialyltransFamily

UniProt features (3 total): chain 1, splice variant 1, sequence variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q96LM9-F147.960.00

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 19 (showing top): GOBP_CARBOHYDRATE_DERIVATIVE_METABOLIC_PROCESS, GOBP_CARBOHYDRATE_DERIVATIVE_BIOSYNTHETIC_PROCESS, ZIC1_01, chr20q11, GOBP_SIALYLATION, GOBP_GLYCOPROTEIN_METABOLIC_PROCESS, GOMF_GLYCOSYLTRANSFERASE_ACTIVITY, GOBP_GLYCOPROTEIN_BIOSYNTHETIC_PROCESS, GOMF_SIALYLTRANSFERASE_ACTIVITY, HMGA1_TARGET_GENES, MIR1290, MIR6818_5P, MIR3192_3P, MIR6744_5P, MIR3151_3P

GO Biological Process (3): sialylation (GO:0097503), obsolete protein glycosylation (GO:0006486), obsolete ganglioside biosynthetic process via lactosylceramide (GO:0010706)

GO Molecular Function (3): beta-galactoside (CMP) alpha-2,3-sialyltransferase activity (GO:0003836), protein binding (GO:0005515), sialyltransferase activity (GO:0008373)

GO Cellular Component (1): membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
macromolecule modification1
sialyltransferase activity1
binding1
glycosyltransferase activity1
cellular anatomical structure1

Protein interactions and networks

STRING

232 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
C20orf173SPATA31G1Q5VYM1608
C20orf173CABS1Q96KC9581
C20orf173TMCO5AQ8N6Q1578
C20orf173CXorf65A6NEN9544
C20orf173CCDC187A0A096LP49541
C20orf173TMCO2Q7Z6W1528
C20orf173RIMBP3Q9UFD9519
C20orf173CCDC54Q8NEL0507
C20orf173ZC3H11DQ8NA57507
C20orf173FAM209AQ5JX71507
C20orf173LYZL2Q7Z4W2507
C20orf173CFAP184Q2M329506
C20orf173CIMIP4O43247506
C20orf173ROPN1LQ96C74479
C20orf173C22orf23Q9BZE7478

IntAct

4 interactions, top by confidence:

ABTypeScore
HGSC20orf173psi-mi:“MI:0915”(physical association)0.560
HGSC20orf173psi-mi:“MI:0915”(physical association)0.000

BioGRID (2): HGS (Two-hybrid), C20orf173 (Negative Genetic)

ESM2 similar proteins: A2APT9, A5PJC7, E1BDF2, O73895, P0C6B2, P0C6B3, P32927, P40238, P51172, Q01114, Q08351, Q0VCS0, Q27969, Q2T9K0, Q3UIW8, Q496M5, Q5F267, Q5FW56, Q5MJ68, Q5R866, Q5R8H1, Q5SZI1, Q5VTJ3, Q5W186, Q68UT4, Q6AZ51, Q6DKI7, Q6IPT2, Q6ISU1, Q6JVE6, Q6PL45, Q6UX52, Q6ZST4, Q70Z44, Q7L513, Q7Z5J1, Q8BH06, Q8K558, Q8NFR9, Q8TCY5

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

3 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance1
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
152798GRCh38/hg38 20q11.22(chr20:35285724-35744837)x1Pathogenic

SpliceAI

1049 predictions. Top by Δscore:

VariantEffectΔscore
20:35528455:T:Cdonor_gain1.0000
20:35527072:G:Cdonor_gain0.9900
20:35528450:CCTTA:Cdonor_gain0.9900
20:35528454:A:ACdonor_gain0.9900
20:35528461:A:ACdonor_gain0.9900
20:35528461:AGAG:Adonor_gain0.9900
20:35528462:G:Cdonor_gain0.9900
20:35527082:G:Cdonor_gain0.9800
20:35527129:T:TAdonor_gain0.9800
20:35528448:CACCT:Cdonor_loss0.9800
20:35528449:ACCT:Adonor_loss0.9800
20:35528450:C:Adonor_loss0.9800
20:35528452:TTATC:Tdonor_gain0.9800
20:35528786:C:Tacceptor_gain0.9800
20:35529130:T:TAdonor_gain0.9800
20:35529557:AC:Adonor_gain0.9800
20:35529558:CC:Cdonor_gain0.9800
20:35527102:TGTCC:Tdonor_gain0.9700
20:35529068:C:CTdonor_gain0.9700
20:35529217:T:TAdonor_gain0.9700
20:35527071:A:ACdonor_gain0.9600
20:35527115:G:Cdonor_gain0.9600
20:35528453:TATC:Tdonor_gain0.9600
20:35528786:C:CTacceptor_gain0.9600
20:35529218:C:Adonor_gain0.9600
20:35528443:T:TAdonor_gain0.9500
20:35528545:CT:Cacceptor_loss0.9500
20:35528546:T:Gacceptor_loss0.9500
20:35529069:C:CTdonor_gain0.9500
20:35528545:C:CCacceptor_gain0.9400

AlphaMissense

1334 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:35528703:G:CF109L0.875
20:35528703:G:TF109L0.875
20:35528705:A:GF109L0.875
20:35528844:C:AW62C0.806
20:35528844:C:GW62C0.806
20:35528799:A:CF77L0.717
20:35528799:A:TF77L0.717
20:35528801:A:GF77L0.717
20:35528846:A:GW62R0.703
20:35528846:A:TW62R0.703
20:35528520:C:AW118C0.686
20:35528520:C:GW118C0.686
20:35528793:G:CF79L0.681
20:35528793:G:TF79L0.681
20:35528795:A:GF79L0.681
20:35528780:A:GC84R0.675
20:35528778:A:CC84W0.671
20:35528824:A:GI69T0.642
20:35528475:C:AW133C0.630
20:35528475:C:GW133C0.630
20:35528845:C:GW62S0.614
20:35528779:C:TC84Y0.613
20:35528787:A:CC81W0.606
20:35528283:A:GI142T0.567

dbSNP variants (sampled 300 via entrez): RS1000056503 (20:35528118 T>C), RS1000062499 (20:35525850 C>A,T), RS1000665507 (20:35526247 C>G), RS1001014007 (20:35529131 C>A,T), RS1001132696 (20:35527145 C>T), RS1001179914 (20:35527575 T>C), RS1001930367 (20:35527814 C>G), RS1002140284 (20:35527487 G>A,T), RS1002408687 (20:35521766 G>A), RS1002550738 (20:35530538 C>A), RS1002856899 (20:35525646 A>G), RS1002988096 (20:35523743 G>A), RS1003354500 (20:35528938 G>A,C,T), RS1003531470 (20:35522342 T>C), RS1003678076 (20:35520485 A>C)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

5 associations (top):

StudyTraitp-value
GCST002337_32Amyotrophic lateral sclerosis (sporadic)3.000000e-07
GCST005956_31Waist-to-hip ratio adjusted for BMI8.000000e-08
GCST005958_16Waist-to-hip ratio adjusted for BMI (age >50)6.000000e-06
GCST005962_40Waist-to-hip ratio adjusted for BMI x sex x age interaction (4df test)3.000000e-08
GCST010703_112Brain morphology (MOSTest)4.000000e-19

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0007788BMI-adjusted waist-hip ratio
EFO:0008007age at assessment
EFO:0008343sex interaction measurement
EFO:0004346neuroimaging measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

6 total (human), top 6 by PubMed support.

ChemicalActions (top 5)PubMed papers
maleic acidincreases expression1
Cadmiumdecreases expression, increases abundance1
Phthalic Acidsdecreases methylation1
Valproic Acidincreases methylation1
Aflatoxin B1increases methylation1
Cadmium Chloridedecreases expression, increases abundance1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): sporadic amyotrophic lateral sclerosis