COL20A1

gene
On this page

Also known as KIAA1510

Summary

COL20A1 (collagen type XX alpha 1 chain, HGNC:14670) is a protein-coding gene on chromosome 20q13.33, encoding Collagen alpha-1(XX) chain (Q9P218). Probable collagen protein.

Predicted to be located in collagen-containing extracellular matrix; endoplasmic reticulum lumen; and extracellular region. Predicted to be part of collagen trimer.

Source: NCBI Gene 57642 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): hereditary palmoplantar keratoderma (Limited, GenCC)
  • Clinical variants (ClinVar): 301 total — 1 pathogenic, 1 likely-pathogenic
  • MANE Select transcript: NM_020882

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:14670
Approved symbolCOL20A1
Namecollagen type XX alpha 1 chain
Location20q13.33
Locus typegene with protein product
StatusApproved
AliasesKIAA1510
Ensembl geneENSG00000101203
Ensembl biotypeprotein_coding
OMIM619390
Entrez57642

Gene structure

Transcript identifiers

Ensembl transcripts: 10 — 6 protein_coding, 3 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000358894, ENST00000415763, ENST00000422202, ENST00000455906, ENST00000471582, ENST00000479501, ENST00000494913, ENST00000496810, ENST00000894509, ENST00000938907

RefSeq mRNA: 1 — MANE Select: NM_020882 NM_020882

CCDS: CCDS46628

Canonical transcript exons

ENST00000358894 — 36 exons

ExonStartEnd
ENSE000006633936331242063312549
ENSE000006633996331905863319200
ENSE000006634006331948763319596
ENSE000008565696330975863309915
ENSE000008565706331540463315439
ENSE000014104786329318663293275
ENSE000015633636332807963328127
ENSE000034705286332544163325494
ENSE000034740996332675263326823
ENSE000034746376329791063298020
ENSE000034805616332003963320197
ENSE000034870826331139463311539
ENSE000034881416332101363321099
ENSE000034953296331374363313891
ENSE000034954446330797163308090
ENSE000035059246331038163310510
ENSE000035211486332205863322111
ENSE000035441466329509863295189
ENSE000035527006331655363316691
ENSE000035557826330541763305560
ENSE000035649296330749063307648
ENSE000035687166332795263327987
ENSE000035748276331407263314201
ENSE000035779186331279263312934
ENSE000035884276331191663312055
ENSE000035941186330933363309497
ENSE000035950326331162563311748
ENSE000036025306330588163306039
ENSE000036026786332609663326149
ENSE000036224306332958563329661
ENSE000036340446330854263308706
ENSE000036788766332566863325721
ENSE000036902246332029163320368
ENSE000036940956331311763313249
ENSE000037869356332833163328498
ENSE000039366486333072063334806

Expression profiles

Bgee: expression breadth ubiquitous, 133 present calls, max score 87.70.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.4232 / max 48.4954, expressed in 107 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
1858060.4232107

Top tissues by expression

230 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right testisUBERON:000453487.70gold quality
left testisUBERON:000453386.85gold quality
anterior cingulate cortexUBERON:000983586.41gold quality
right frontal lobeUBERON:000281085.57gold quality
amygdalaUBERON:000187684.91gold quality
pancreatic ductal cellCL:000207983.79silver quality
testisUBERON:000047383.59gold quality
Brodmann (1909) area 9UBERON:001354082.75gold quality
endothelial cellCL:000011582.05silver quality
nucleus accumbensUBERON:000188280.45gold quality
dorsolateral prefrontal cortexUBERON:000983480.33gold quality
C1 segment of cervical spinal cordUBERON:000646979.75gold quality
substantia nigraUBERON:000203879.01gold quality
tibialis anteriorUBERON:000138578.64silver quality
spinal cordUBERON:000224078.50gold quality
neocortexUBERON:000195078.17gold quality
putamenUBERON:000187478.09gold quality
temporal lobeUBERON:000187177.98gold quality
Ammon’s hornUBERON:000195477.88gold quality
cerebral cortexUBERON:000095677.74gold quality
right hemisphere of cerebellumUBERON:001489077.56gold quality
midbrainUBERON:000189177.50gold quality
frontal cortexUBERON:000187077.48gold quality
hypothalamusUBERON:000189877.00gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047376.84gold quality
buccal mucosa cellCL:000233676.81gold quality
forebrainUBERON:000189076.65gold quality
brainUBERON:000095576.28gold quality
cerebellar hemisphereUBERON:000224575.68gold quality
cerebellar cortexUBERON:000212975.60gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-GEOD-93593yes5.58
E-ANND-3yes3.76

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

6 targeting COL20A1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6892-3P99.6866.401178
HSA-MIR-1211399.3267.541072
HSA-MIR-548V99.2969.471157
HSA-MIR-6815-3P99.1368.981530
HSA-MIR-453998.7867.18888
HSA-MIR-193B-5P97.9165.88837

Literature-anchored findings (GeneRIF, showing 2)

  • Our study proposes COL20A1 gene as another potential candidate gene for striate palmoplantar keratoderma (PPKS)which expand the spectrum of collagen proteins in the pathogenicity of Palmoplantar keratoderma. (PMID:29934816)
  • Type XX Collagen Is Elevated in Circulation of Patients with Solid Tumors. (PMID:35456962)

Cross-species orthologs

1 orthologs

OrganismSymbolGene ID
mus_musculusCol20a1ENSMUSG00000016356

Paralogs (37): COL9A2 (ENSG00000049089), COL23A1 (ENSG00000050767), COL11A1 (ENSG00000060718), COL17A1 (ENSG00000065618), COL5A3 (ENSG00000080573), COL4A4 (ENSG00000081052), COL16A1 (ENSG00000084636), COL9A3 (ENSG00000092758), COL1A1 (ENSG00000108821), COL9A1 (ENSG00000112280), COL7A1 (ENSG00000114270), COL21A1 (ENSG00000124749), COL5A1 (ENSG00000130635), COL4A2 (ENSG00000134871), COL2A1 (ENSG00000139219), COL6A1 (ENSG00000142156), COL6A2 (ENSG00000142173), EDA (ENSG00000158813), COL26A1 (ENSG00000160963), COL1A2 (ENSG00000164692), COL3A1 (ENSG00000168542), COL4A3 (ENSG00000169031), COL22A1 (ENSG00000169436), COL24A1 (ENSG00000171502), COL18A1 (ENSG00000182871), EMID1 (ENSG00000186998), COL4A1 (ENSG00000187498), COL4A5 (ENSG00000188153), COL25A1 (ENSG00000188517), COL27A1 (ENSG00000196739), COL13A1 (ENSG00000197467), COL4A6 (ENSG00000197565), COL11A2 (ENSG00000204248), COL5A2 (ENSG00000204262), COL15A1 (ENSG00000204291), COLQ (ENSG00000206561), COL28A1 (ENSG00000215018)

Protein

Protein identifiers

Collagen alpha-1(XX) chainQ9P218 (reviewed: Q9P218)

All UniProt accessions (3): Q9P218, B7ZBI4, B7ZBI5

UniProt curated annotations — full annotation on UniProt →

Function. Probable collagen protein.

Subcellular location. Secreted. Extracellular space.

Tissue specificity. High expression in heart, lung, liver, skeletal muscle, kidney, pancreas, spleen, testis, ovary, subthalamic nucleus and fetal liver. Weak expression in other tissues tested.

Isoforms (3)

UniProt IDNamesCanonical?
Q9P218-11yes
Q9P218-22
Q9P218-33

RefSeq proteins (1): NP_065933* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR002035VWF_ADomain
IPR003961FN3_domDomain
IPR008160CollagenRepeat
IPR013320ConA-like_dom_sfHomologous_superfamily
IPR013783Ig-like_foldHomologous_superfamily
IPR036116FN3_sfHomologous_superfamily
IPR036465vWFA_dom_sfHomologous_superfamily
IPR048287TSPN-like_NDomain
IPR050991ECM_Regulatory_ProteinsFamily

Pfam: PF00041, PF00092, PF01391

UniProt features (57 total): strand 27, domain 10, sequence conflict 5, compositionally biased region 4, region of interest 3, splice variant 3, signal peptide 1, chain 1, glycosylation site 1, sequence variant 1, turn 1

Structure

Experimental structures (PDB)

4 structures.

PDBMethodResolution (Å)
5KF4X-RAY DIFFRACTION2.5
2DKMSOLUTION NMR
2EE3SOLUTION NMR
2EKJSOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9P218-F173.880.24

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Glycosylation sites (1): 607

Function

Pathways and Gene Ontology

Reactome pathways

2 pathways

IDPathway
R-HSA-1650814Collagen biosynthesis and modifying enzymes
R-HSA-8948216Collagen chain trimerization

MSigDB gene sets: 59 (showing top): GOCC_COLLAGEN_TRIMER, DARWICHE_SKIN_TUMOR_PROMOTER_DN, DARWICHE_PAPILLOMA_RISK_LOW_UP, DARWICHE_PAPILLOMA_RISK_HIGH_DN, DARWICHE_SQUAMOUS_CELL_CARCINOMA_DN, DARWICHE_PAPILLOMA_PROGRESSION_RISK, COATES_MACROPHAGE_M1_VS_M2_DN, MODULE_48, MODULE_95, GOCC_ENDOPLASMIC_RETICULUM_LUMEN, NIKOLSKY_BREAST_CANCER_20Q12_Q13_AMPLICON, MARTENS_TRETINOIN_RESPONSE_UP, CERIBELLI_PROMOTERS_INACTIVE_AND_BOUND_BY_NFY, GOBERT_OLIGODENDROCYTE_DIFFERENTIATION_DN, MODULE_163

GO Biological Process (0):

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (4): extracellular region (GO:0005576), collagen trimer (GO:0005581), endoplasmic reticulum lumen (GO:0005788), extracellular matrix (GO:0031012)

Reactome top-level categories

Rollup of top-2 pathways:

CategoryPathways
Collagen formation1
Collagen biosynthesis and modifying enzymes1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
binding1
cellular anatomical structure1
protein-containing complex1
endoplasmic reticulum1
intracellular organelle lumen1
external encapsulating structure1

Protein interactions and networks

STRING

930 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
COL20A1CNIH1O95406495
COL20A1CNIH3Q8TBE1490
COL20A1ARSJQ5FYB0448
COL20A1PALMDQ9NP74439
COL20A1RIC3Q7Z5B4429
COL20A1IFT38Q96AJ1427
COL20A1KLHL31Q9H511427
COL20A1SEC16BQ96JE7423
COL20A1ACAD10Q6JQN1418
COL20A1TDRD7Q8NHU6405
COL20A1UBE2HP37286400
COL20A1MKNK1Q9BUB5385
COL20A1AUTS2Q8WXX7383
COL20A1COL26A1Q96A83382
COL20A1ADAMTS14Q8WXS8350

IntAct

14 interactions, top by confidence:

ABTypeScore
GPC3CLGNpsi-mi:“MI:0914”(association)0.530
ATP5MC3ATP5F1Bpsi-mi:“MI:0914”(association)0.530
JPT1COL20A1psi-mi:“MI:0915”(physical association)0.400
UBE2BCOL20A1psi-mi:“MI:0915”(physical association)0.400
USP47COL20A1psi-mi:“MI:0915”(physical association)0.400
COL20A1FAM9Bpsi-mi:“MI:0915”(physical association)0.370
ECE1COL20A1psi-mi:“MI:0915”(physical association)0.370
TEX11COL20A1psi-mi:“MI:0915”(physical association)0.370
COL20A1MANBApsi-mi:“MI:0914”(association)0.350
NUDT16BCKDKpsi-mi:“MI:0914”(association)0.350
COL20A1TSC2psi-mi:“MI:0914”(association)0.350
CPLX3ATE1psi-mi:“MI:0914”(association)0.350
KLHL22TRAV18psi-mi:“MI:0914”(association)0.350

BioGRID (31): COL20A1 (Two-hybrid), COL20A1 (Two-hybrid), FAM9B (Two-hybrid), COL20A1 (Two-hybrid), GGACT (Affinity Capture-MS), TSC2 (Affinity Capture-MS), COL14A1 (Affinity Capture-MS), COL20A1 (Affinity Capture-MS), MEIS1 (Affinity Capture-MS), CDKN2C (Affinity Capture-MS), PBX2 (Affinity Capture-MS), GLS (Affinity Capture-MS), PPP2CA (Affinity Capture-MS), HSPA5 (Affinity Capture-MS), COL20A1 (Affinity Capture-MS)

ESM2 similar proteins: A0A140LHF2, A0JPB1, A2AJ76, A7LCJ3, A8E0Y8, E7FF10, O00241, O60500, P01874, P03988, P04221, P0DOX2, P0DOX3, P0DOX4, P0DOX6, P0DP72, P32507, P35590, P43121, P50895, Q06805, Q06806, Q148M6, Q15109, Q5TFQ8, Q5U5A3, Q5XI43, Q62230, Q62786, Q8HW98, Q8NDA2, Q8R2Y2, Q92154, Q923P0, Q93033, Q95KI3, Q96MS0, Q9BRK3, Q9BZZ2, Q9DBV4

Diamond homologs: A0A1D5NSM8, A0JNA2, A2AVA0, A2AX52, D3YXF5, O02839, O19063, O35764, O43405, O70340, O76536, O89029, O95502, O96530, P02741, P02743, P06205, P06206, P06207, P06681, P07202, P07629, P08607, P09871, P0C6B8, P10643, P12246, P13944, P14151, P14847, P15697, P18337, P23680, P32018, P47970, P47971, P47972, P48199, P49254, P49262

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

301 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic1
Uncertain significance259
Likely benign28
Benign2

Top pathogenic / likely-pathogenic (2)

Variant IDHGVSClassification
564629GRCh37/hg19 20q13.33(chr20:61815143-62318983)x1Pathogenic
1326328GRCh37/hg19 20q13.33(chr20:61944468-62104030)Likely pathogenic

SpliceAI

7330 predictions. Top by Δscore:

VariantEffectΔscore
20:63294752:G:GTdonor_gain1.0000
20:63297908:A:AGacceptor_gain1.0000
20:63297909:G:GGacceptor_gain1.0000
20:63297909:GCA:Gacceptor_gain1.0000
20:63297909:GCAA:Gacceptor_gain1.0000
20:63305553:GTTT:Gdonor_gain1.0000
20:63305557:GTGA:Gdonor_gain1.0000
20:63305559:GA:Gdonor_gain1.0000
20:63305561:G:GGdonor_gain1.0000
20:63305879:A:AGacceptor_gain1.0000
20:63305880:G:GCacceptor_gain1.0000
20:63305880:GTT:Gacceptor_gain1.0000
20:63305880:GTTGA:Gacceptor_gain1.0000
20:63305988:GCTTC:Gdonor_gain1.0000
20:63306263:G:GTdonor_gain1.0000
20:63306311:G:GTdonor_gain1.0000
20:63306359:G:GTdonor_gain1.0000
20:63307645:GTAG:Gdonor_gain1.0000
20:63309330:CA:Cacceptor_loss1.0000
20:63309331:AGGTG:Aacceptor_loss1.0000
20:63309332:G:Tacceptor_loss1.0000
20:63311392:A:AGacceptor_gain1.0000
20:63311393:G:GAacceptor_gain1.0000
20:63311537:GAG:Gdonor_gain1.0000
20:63311744:GACCC:Gdonor_gain1.0000
20:63311749:G:GGdonor_gain1.0000
20:63312922:G:Tdonor_gain1.0000
20:63313207:G:Tdonor_gain1.0000
20:63316551:AG:Aacceptor_gain1.0000
20:63316552:GG:Gacceptor_gain1.0000

AlphaMissense

8163 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:63313816:G:CW761C0.983
20:63313816:G:TW761C0.983
20:63313814:T:AW761R0.980
20:63313814:T:CW761R0.980
20:63307594:T:CF201L0.977
20:63307596:C:AF201L0.977
20:63307596:C:GF201L0.977
20:63312487:T:AV624D0.976
20:63307538:T:CF182S0.975
20:63297965:G:CW46C0.974
20:63297965:G:TW46C0.974
20:63305554:T:CF111L0.974
20:63305556:T:AF111L0.974
20:63305556:T:GF111L0.974
20:63307547:A:TD185V0.973
20:63311461:G:CW487C0.973
20:63311461:G:TW487C0.973
20:63307595:T:CF201S0.972
20:63307558:A:CS189R0.969
20:63307560:C:AS189R0.969
20:63307560:C:GS189R0.969
20:63309841:T:AW397R0.969
20:63309841:T:CW397R0.969
20:63311459:T:AW487R0.969
20:63311459:T:CW487R0.969
20:63308008:G:CW231C0.966
20:63308008:G:TW231C0.966
20:63314145:T:AV811D0.964
20:63309843:G:CW397C0.963
20:63309843:G:TW397C0.963

dbSNP variants (sampled 300 via entrez): RS1000003062 (20:63299787 T>C), RS1000019703 (20:63309036 G>A), RS1000022222 (20:63293779 CCT>C), RS1000073974 (20:63305109 G>A), RS1000196661 (20:63312883 G>A,C), RS1000360006 (20:63321420 C>A,G,T), RS1000366283 (20:63298905 G>A), RS1000412398 (20:63321111 G>A), RS1000542168 (20:63312223 G>A), RS1000735505 (20:63317076 G>A,T), RS1000785414 (20:63312258 T>A), RS1000836985 (20:63320470 G>A), RS1000993305 (20:63294437 C>T), RS1001001067 (20:63308283 G>A), RS1001003355 (20:63302379 G>T)

Disease associations

OMIM: gene MIM:619390 | disease phenotypes: MIM:148700, MIM:121200, MIM:613720

GenCC curated gene-disease

DiseaseClassificationInheritance
hereditary palmoplantar keratodermaLimitedAutosomal dominant

Mondo (4): palmoplantar keratoderma i, striate, focal, or diffuse (MONDO:0007859), seizures, benign familial neonatal, 1 (MONDO:0007365), developmental and epileptic encephalopathy, 7 (MONDO:0013387), hereditary palmoplantar keratoderma (MONDO:0019272)

Orphanet (4): Diffuse palmoplantar keratoderma with painful fissures (Orphanet:369999), Focal palmoplantar keratoderma with joint keratoses (Orphanet:370002), Self-limited neonatal epilepsy (Orphanet:1949), KCNQ2-related developmental and epileptic encephalopathy (Orphanet:439218)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

MeSH disease descriptors (2)

DescriptorNameTree numbers
C567743Epilepsy, Benign Neonatal, 1, And-Or Myokymia (supp.)
C536162Keratosis palmoplantaris striata 1 (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

20 total (human), top 20 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arseniteincreases expression2
Benzo(a)pyreneaffects methylation2
Cisplatinaffects cotreatment, increases expression, decreases expression2
aristolochic acid Iincreases expression1
benzo(e)pyrenedecreases methylation1
aflatoxin B2decreases methylation1
calfactantincreases expression, affects cotreatment1
bisphenol Sdecreases methylation1
Resveratrolaffects cotreatment, increases expression1
Copperaffects cotreatment, increases expression1
Methapyrilenedecreases methylation1
Silicon Dioxidedecreases expression1
Smokedecreases expression1
Dronabinolincreases expression1
Tobacco Smoke Pollutionincreases expression1
Valproic Acidincreases methylation1
Aflatoxin B1increases methylation1
Paclitaxelaffects cotreatment, increases expression1
Okadaic Acidincreases expression1
Nanotubes, Carbonaffects cotreatment, increases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.