COL23A1

gene
On this page

Also known as DKFZp434K0621

Summary

COL23A1 (collagen type XXIII alpha 1 chain, HGNC:22990) is a protein-coding gene on chromosome 5q35.3, encoding Collagen alpha-1(XXIII) chain (Q86Y22).

COL23A1 is a member of the transmembrane collagens, a subfamily of the nonfibrillar collagens that contain a single pass hydrophobic transmembrane domain (Banyard et al., 2003 [PubMed 12644459]).

Source: NCBI Gene 91522 — RefSeq curated summary.

At a glance

  • GWAS associations: 8
  • Clinical variants (ClinVar): 125 total
  • MANE Select transcript: NM_173465

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:22990
Approved symbolCOL23A1
Namecollagen type XXIII alpha 1 chain
Location5q35.3
Locus typegene with protein product
StatusApproved
AliasesDKFZp434K0621
Ensembl geneENSG00000050767
Ensembl biotypeprotein_coding
OMIM610043
Entrez91522

Gene structure

Transcript identifiers

Ensembl transcripts: 9 — 4 nonsense_mediated_decay, 3 protein_coding, 2 protein_coding_CDS_not_defined

ENST00000390654, ENST00000407622, ENST00000484750, ENST00000646779, ENST00000679888, ENST00000679896, ENST00000680268, ENST00000680889, ENST00000681261

RefSeq mRNA: 1 — MANE Select: NM_173465 NM_173465

CCDS: CCDS4436

Canonical transcript exons

ENST00000390654 — 29 exons

ExonStartEnd
ENSE00000770283178239141178239179
ENSE00000770284178242042178242128
ENSE00000973022178249117178249206
ENSE00000973024178247775178247831
ENSE00000973025178247526178247552
ENSE00000973026178246391178246453
ENSE00000973027178246254178246307
ENSE00000973029178242341178242394
ENSE00001290409178256866178256928
ENSE00001293360178250061178250105
ENSE00001295338178248192178248254
ENSE00001300755178262217178262252
ENSE00001307117178267307178267333
ENSE00001309173178306875178306919
ENSE00001323944178257523178257567
ENSE00001325296178263208178263324
ENSE00001325792178256353178256397
ENSE00001356640178252544178252597
ENSE00001356641178254949178255026
ENSE00001356695178589904178590393
ENSE00001399827178270337178270363
ENSE00001404070178261722178261748
ENSE00001407776178268730178268756
ENSE00001419363178245942178245968
ENSE00001424779178288324178288350
ENSE00001425372178259721178259747
ENSE00001508549178237618178238700
ENSE00002483769178290362178290369
ENSE00003483384178560682178560748

Expression profiles

Bgee: expression breadth ubiquitous, 166 present calls, max score 95.52.

FANTOM5 (CAGE): breadth broad, TPM avg 1.8376 / max 125.4838, expressed in 404 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
651761.3070351
651750.4860170
651740.044719

Top tissues by expression

247 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right lobe of thyroid glandUBERON:000111995.52gold quality
left lobe of thyroid glandUBERON:000112095.06gold quality
thyroid glandUBERON:000204694.89gold quality
apex of heartUBERON:000209886.76gold quality
heart left ventricleUBERON:000208486.01gold quality
cardiac ventricleUBERON:000208285.30gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099184.78gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047383.40gold quality
mucosa of stomachUBERON:000119979.91gold quality
ileal mucosaUBERON:000033179.04gold quality
heartUBERON:000094878.02gold quality
skin of abdomenUBERON:000141676.35gold quality
right frontal lobeUBERON:000281076.12gold quality
sural nerveUBERON:001548876.04gold quality
right atrium auricular regionUBERON:000663175.39gold quality
skin of legUBERON:000151175.00gold quality
anterior cingulate cortexUBERON:000983574.76gold quality
cardiac atriumUBERON:000208174.72gold quality
tibialis anteriorUBERON:000138574.42silver quality
seminal vesicleUBERON:000099874.06gold quality
zone of skinUBERON:000001473.69gold quality
Brodmann (1909) area 9UBERON:001354073.68gold quality
pancreatic ductal cellCL:000207973.63silver quality
tibial nerveUBERON:000132373.21gold quality
metanephros cortexUBERON:001053372.88gold quality
right lungUBERON:000216772.86gold quality
prefrontal cortexUBERON:000045172.45gold quality
right hemisphere of cerebellumUBERON:001489072.02gold quality
dorsolateral prefrontal cortexUBERON:000983471.87gold quality
small intestine Peyer’s patchUBERON:000345471.82gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no3.37

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

64 targeting COL23A1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-8485100.0077.574731
HSA-MIR-656-3P100.0072.152788
HSA-MIR-448799.9664.581252
HSA-MIR-6721-5P99.9368.922981
HSA-LET-7A-2-3P99.8770.531921
HSA-MIR-449299.8768.253611
HSA-MIR-394199.8670.542735
HSA-LET-7G-3P99.8570.431929
HSA-MIR-6756-5P99.8267.972466
HSA-MIR-6817-3P99.7968.352126
HSA-MIR-3934-3P99.7665.511351
HSA-MIR-2681-5P99.7567.641655
HSA-MIR-674599.7465.331321
HSA-MIR-6766-5P99.6867.702325
HSA-MIR-3934-5P99.6764.04846
HSA-MIR-452-5P99.6569.631762
HSA-MIR-4676-3P99.6569.311733
HSA-MIR-892C-3P99.6569.381745
HSA-MIR-4753-5P99.5468.511356
HSA-MIR-1213299.4768.901341
HSA-MIR-449899.4767.422360
HSA-MIR-363-5P99.4664.511015
HSA-MIR-16-2-3P99.2970.601954
HSA-MIR-195-3P99.2970.611954
HSA-MIR-450599.2767.812678
HSA-MIR-578799.2267.862628
HSA-MIR-10399-5P99.1769.872610
HSA-MIR-6504-3P99.1769.312891
HSA-MIR-429299.1665.571767

Literature-anchored findings (GeneRIF, showing 5)

  • identification and cloning; a new member of the transmembrane collagen family, showing structural homology with the transmembrane collagens XIII and XXV (PMID:12644459)
  • analysis of collagen XXIII mRNA and protein (PMID:16728390)
  • newly synthesized collagen XXIII either is cleaved inside the Golgi/trans-Golgi network or reaches the cell surface, where it becomes protected from processing by being localized in lipid rafts. (PMID:17627939)
  • High COL23A1 expression is associated with recurrent non-small cell lung cancer. (PMID:20447926)
  • data suggest that extracellular membrane-bound CAIV, but not cytosolic CAII, augments transport activity of MCT2 in a non-catalytic manner, possibly by facilitating a proton pathway other than His-88 (PMID:21652699)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriocol7a1lENSDARG00000069692
danio_reriosi:dkey-117n7.4ENSDARG00000092064
ENSDARG00000105274
mus_musculusCol23a1ENSMUSG00000063564
rattus_norvegicusCol23a1ENSRNOG00000003349

Paralogs (37): COL9A2 (ENSG00000049089), COL11A1 (ENSG00000060718), COL17A1 (ENSG00000065618), COL5A3 (ENSG00000080573), COL4A4 (ENSG00000081052), COL16A1 (ENSG00000084636), COL9A3 (ENSG00000092758), COL20A1 (ENSG00000101203), COL1A1 (ENSG00000108821), COL9A1 (ENSG00000112280), COL7A1 (ENSG00000114270), COL21A1 (ENSG00000124749), COL5A1 (ENSG00000130635), COL4A2 (ENSG00000134871), COL2A1 (ENSG00000139219), COL6A1 (ENSG00000142156), COL6A2 (ENSG00000142173), EDA (ENSG00000158813), COL26A1 (ENSG00000160963), COL1A2 (ENSG00000164692), COL3A1 (ENSG00000168542), COL4A3 (ENSG00000169031), COL22A1 (ENSG00000169436), COL24A1 (ENSG00000171502), COL18A1 (ENSG00000182871), EMID1 (ENSG00000186998), COL4A1 (ENSG00000187498), COL4A5 (ENSG00000188153), COL25A1 (ENSG00000188517), COL27A1 (ENSG00000196739), COL13A1 (ENSG00000197467), COL4A6 (ENSG00000197565), COL11A2 (ENSG00000204248), COL5A2 (ENSG00000204262), COL15A1 (ENSG00000204291), COLQ (ENSG00000206561), COL28A1 (ENSG00000215018)

Protein

Protein identifiers

Collagen alpha-1(XXIII) chainQ86Y22 (reviewed: Q86Y22)

All UniProt accessions (5): A0A0A0MSD3, A0A2R8Y887, A0A7P0T8B8, Q86Y22, L8EAS4

UniProt curated annotations — full annotation on UniProt →

Subunit / interactions. Homotrimer.

Subcellular location. Cell membrane.

Post-translational modifications. Undergoes proteolytic cleavage by furin protease to yield a 60 kDa soluble form that forms a homotrimer and exhibits a low affinity interaction with heparin.

Isoforms (2)

UniProt IDNamesCanonical?
Q86Y22-11yes
Q86Y22-22

RefSeq proteins (1): NP_775736* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR008160CollagenRepeat
IPR050938Collagen_Structural_ProteinsFamily

Pfam: PF01391

UniProt features (27 total): compositionally biased region 9, splice variant 5, domain 5, region of interest 3, topological domain 2, chain 1, transmembrane region 1, sequence variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q86Y22-F159.010.05

Function

Pathways and Gene Ontology

Reactome pathways

4 pathways

IDPathway
R-HSA-1442490Collagen degradation
R-HSA-1650814Collagen biosynthesis and modifying enzymes
R-HSA-216083Integrin cell surface interactions
R-HSA-8948216Collagen chain trimerization

MSigDB gene sets: 82 (showing top): GSE45365_NK_CELL_VS_CD8A_DC_UP, GOCC_COLLAGEN_TRIMER, GOCC_CELL_SURFACE, SRF_Q5_01, AAACCAC_MIR140, GOMF_EXTRACELLULAR_MATRIX_STRUCTURAL_CONSTITUENT, SCHAEFFER_PROSTATE_DEVELOPMENT_6HR_DN, SCHAEFFER_PROSTATE_DEVELOPMENT_12HR_UP, GOMF_GLYCOSAMINOGLYCAN_BINDING, OCT1_B, CCCNNGGGAR_OLF1_01, GOMF_HEPARIN_BINDING, REACTOME_INTEGRIN_CELL_SURFACE_INTERACTIONS, GOCC_ENDOPLASMIC_RETICULUM_LUMEN, GOMF_SULFUR_COMPOUND_BINDING

GO Biological Process (0):

GO Molecular Function (4): heparin binding (GO:0008201), extracellular matrix structural constituent conferring tensile strength (GO:0030020), identical protein binding (GO:0042802), protein binding (GO:0005515)

GO Cellular Component (7): collagen trimer (GO:0005581), obsolete extracellular space (GO:0005615), endoplasmic reticulum lumen (GO:0005788), plasma membrane (GO:0005886), cell surface (GO:0009986), extracellular matrix (GO:0031012), membrane (GO:0016020)

Reactome top-level categories

Rollup of top-4 pathways:

CategoryPathways
Degradation of the extracellular matrix1
Collagen formation1
Extracellular matrix organization1
Collagen biosynthesis and modifying enzymes1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure2
glycosaminoglycan binding1
sulfur compound binding1
extracellular matrix structural constituent1
protein binding1
binding1
protein-containing complex1
endoplasmic reticulum1
intracellular organelle lumen1
membrane1
cell periphery1
external encapsulating structure1

Protein interactions and networks

STRING

1050 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
COL23A1FURINP09958568
COL23A1C2CD4DB7Z1M9480
COL23A1MMP9P14780451
COL23A1COL26A1Q96A83438
COL23A1SLC16A2P36021428
COL23A1PHYKPLQ8IUZ5404
COL23A1TMEM184CQ9NVA4400
COL23A1SPATA17Q96L03378
COL23A1CPLX3Q8WVH0337
COL23A1COL8A1P27658332
COL23A1PCOLCE2Q9UKZ9332
COL23A1COL25A1Q9BXS0318
COL23A1LAMB3Q13751311
COL23A1ANKS1BQ7Z6G8302
COL23A1ANKFY1Q9P2R3300

IntAct

12 interactions, top by confidence:

ABTypeScore
EXOSC8COL23A1psi-mi:“MI:0915”(physical association)0.560
COL23A1EXOSC8psi-mi:“MI:0915”(physical association)0.560
COL23A1RPL10Apsi-mi:“MI:0915”(physical association)0.400
COL23A1SYNE2psi-mi:“MI:0915”(physical association)0.400
COL23A1H4C16psi-mi:“MI:0915”(physical association)0.400
COL23A1H1-4psi-mi:“MI:0915”(physical association)0.400
COL23A1LMNApsi-mi:“MI:0915”(physical association)0.400
COL23A1CFTRpsi-mi:“MI:0915”(physical association)0.370
COL23A1PLOD2psi-mi:“MI:0914”(association)0.350
LSM8COL23A1psi-mi:“MI:0915”(physical association)0.000

BioGRID (16): COL23A1 (Two-hybrid), COL23A1 (Affinity Capture-RNA), COL23A1 (Proximity Label-MS), SYNE2 (Proximity Label-MS), COL23A1 (Proximity Label-MS), COL23A1 (Proximity Label-MS), COL23A1 (Proximity Label-MS), COL23A1 (Two-hybrid), PLOD2 (Affinity Capture-MS), PLOD1 (Affinity Capture-MS), COLGALT1 (Affinity Capture-MS), PLOD3 (Affinity Capture-MS), COL23A1 (PCA), COL23A1 (Co-fractionation), COL23A1 (Co-fractionation)

ESM2 similar proteins: C0HLH0, C0HLH4, C0HLI6, C0HLN2, P02460, P02462, P02463, P05997, P08120, P08122, P08125, P08572, P12106, P12107, P12108, P13942, P20849, P20850, P20909, P23206, P25318, P27393, P29400, P30754, P32017, P70560, Q01955, Q03692, Q05306, Q05722, Q07643, Q0VF58, Q14031, Q14050, Q14055, Q14993, Q28083, Q28247, Q32S24, Q3U962

Diamond homologs: Q810Y4, Q86Y22, Q8K4G2, Q99MQ5, Q9BXS0, Q9R1N9, Q5TAT6

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

125 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance93
Likely benign3
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

8041 predictions. Top by Δscore:

VariantEffectΔscore
5:178239127:T:TAdonor_gain1.0000
5:178239128:C:Adonor_gain1.0000
5:178246389:A:ACdonor_gain1.0000
5:178246390:C:CCdonor_gain1.0000
5:178248191:CCAGG:Cdonor_gain1.0000
5:178248199:T:Adonor_gain1.0000
5:178256804:C:CAdonor_gain1.0000
5:178261718:TCA:Tdonor_loss1.0000
5:178261719:CACCT:Cdonor_loss1.0000
5:178261750:T:Aacceptor_loss1.0000
5:178263202:TCTTA:Tdonor_loss1.0000
5:178263203:CTTAC:Cdonor_loss1.0000
5:178263204:TTACC:Tdonor_loss1.0000
5:178263205:TA:Tdonor_loss1.0000
5:178263206:A:ACdonor_gain1.0000
5:178263207:C:CCdonor_gain1.0000
5:178263207:CCG:Cdonor_gain1.0000
5:178270335:A:ACdonor_gain1.0000
5:178270336:C:CCdonor_gain1.0000
5:178270336:CGGG:Cdonor_gain1.0000
5:178238606:CCA:Cacceptor_gain0.9900
5:178238607:C:CTacceptor_gain0.9900
5:178238607:C:Tacceptor_gain0.9900
5:178238608:A:ACacceptor_gain0.9900
5:178238608:A:Cacceptor_gain0.9900
5:178238699:CA:Cacceptor_gain0.9900
5:178238701:C:CCacceptor_gain0.9900
5:178239138:TA:Tdonor_loss0.9900
5:178239139:ACC:Adonor_loss0.9900
5:178239140:CCTTA:Cdonor_loss0.9900

AlphaMissense

3345 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
5:178242050:A:GC525R0.996
5:178242049:C:GC525S0.994
5:178242049:C:TC525Y0.994
5:178242050:A:TC525S0.994
5:178239152:A:GC537R0.993
5:178239151:C:GC537S0.992
5:178239152:A:TC537S0.992
5:178242048:A:CC525W0.992
5:178246279:C:TG463E0.990
5:178263251:C:TG199D0.990
5:178270362:C:TG148E0.990
5:178290369:C:TG136D0.990
5:178306901:C:TG127D0.990
5:178239151:C:TC537Y0.988
5:178242109:C:TG505D0.988
5:178242393:C:TG481D0.988
5:178288331:C:TG145D0.988
5:178306892:C:TG130E0.988
5:178246270:C:TG466E0.987
5:178288340:C:TG142E0.987
5:178306883:C:TG133D0.987
5:178239150:G:CC537W0.986
5:178263233:C:TG205D0.986
5:178288349:C:TG139E0.985
5:178263260:C:TG196D0.984
5:178306910:C:TG124E0.984
5:178242049:C:AC525F0.983
5:178242101:C:GG508R0.983
5:178242101:C:TG508R0.983
5:178263242:C:TG202E0.982

dbSNP variants (sampled 300 via entrez): RS1000006999 (5:178366215 C>T), RS1000010297 (5:178455080 A>C), RS1000022030 (5:178561888 G>A), RS1000031708 (5:178328348 G>A,C), RS1000041509 (5:178355224 T>C), RS1000042797 (5:178476197 G>A), RS1000044695 (5:178532717 C>T), RS1000050207 (5:178285112 G>C), RS1000062309 (5:178292672 T>C,G), RS1000070475 (5:178417822 C>T), RS1000071684 (5:178465713 C>T), RS1000074115 (5:178355371 C>G), RS1000092318 (5:178587992 A>C,G,T), RS1000138917 (5:178379061 G>A,T), RS1000150563 (5:178449892 C>T)

Disease associations

OMIM: gene MIM:610043 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

8 associations (top):

StudyTraitp-value
GCST001762_247Obesity-related traits6.000000e-06
GCST001958_7Bulimia nervosa6.000000e-06
GCST003465_17Cannabis dependence symptom count2.000000e-07
GCST003563_10Presence of antiphospholipid antibodies3.000000e-06
GCST006085_81Prostate cancer7.000000e-09
GCST006499_1Nose length3.000000e-10
GCST006585_2990Blood protein levels4.000000e-07
GCST012320_1HDL levels x SSRI levels (escitalopram or citalopram) interaction in schizophrenia or bipolar disorder2.000000e-07

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0005106body composition measurement
EFO:0008457cannabis dependence measurement
EFO:0004612high density lipoprotein cholesterol measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

32 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, increases expression, affects expression6
trichostatin Aaffects cotreatment, increases expression3
Panobinostataffects cotreatment, increases expression2
Benzo(a)pyreneincreases methylation, affects methylation, decreases methylation2
Aflatoxin B1increases methylation2
aristolochic acid Iincreases expression1
bisphenol Aaffects cotreatment, decreases methylation1
benzo(e)pyreneaffects methylation1
aflatoxin B2affects methylation1
CGP 52608affects binding, increases reaction1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, increases expression1
2,2’,4,4’-tetrabromodiphenyl etherdecreases expression1
dorsomorphinaffects cotreatment, increases expression1
Resveratrolaffects cotreatment, decreases expression1
Fulvestrantaffects cotreatment, decreases methylation1
Vorinostataffects cotreatment, increases expression1
Air Pollutantsaffects methylation1
Arsenicaffects methylation1
Cadmiumincreases abundance, increases expression1
Cisplatindecreases expression1
Copperaffects cotreatment, decreases expression1
Diazinonincreases methylation1
Diethylhexyl Phthalatedecreases expression1
Methapyrileneaffects methylation1
Rotenoneincreases expression1
Dronabinoldecreases expression1
Thiramincreases expression1
Tretinoinincreases expression1
Cadmium Chlorideincreases abundance, increases expression1
Okadaic Aciddecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): bulimia nervosa