COL19A1

gene
On this page

Summary

COL19A1 (collagen type XIX alpha 1 chain, HGNC:2196) is a protein-coding gene on chromosome 6q13, encoding Collagen alpha-1(XIX) chain (Q14993). May act as a cross-bridge between fibrils and other extracellular matrix molecules.

This gene encodes the alpha chain of type XIX collagen, a member of the FACIT collagen family (fibril-associated collagens with interrupted helices). Although the function of this collagen is not known, other members of this collagen family are found in association with fibril-forming collagens such as type I and II, and serve to maintain the integrity of the extracellular matrix. The transcript produced from this gene has an unusually large 3’ UTR which has not been completely sequenced.

Source: NCBI Gene 1310 — RefSeq curated summary.

At a glance

  • GWAS associations: 5
  • Clinical variants (ClinVar): 306 total — 1 pathogenic
  • MANE Select transcript: NM_001858

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:2196
Approved symbolCOL19A1
Namecollagen type XIX alpha 1 chain
Location6q13
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000082293
Ensembl biotypeprotein_coding
OMIM120165
Entrez1310

Gene structure

Transcript identifiers

Ensembl transcripts: 5 — 3 protein_coding, 2 protein_coding_CDS_not_defined

ENST00000455415, ENST00000476656, ENST00000478620, ENST00000483745, ENST00000620364

RefSeq mRNA: 1 — MANE Select: NM_001858 NM_001858

CCDS: CCDS4970

Canonical transcript exons

ENST00000620364 — 51 exons

ExonStartEnd
ENSE000016229167013018270130223
ENSE000016340427012188070121942
ENSE000037123507017652070176564
ENSE000037130996993803869938100
ENSE000037131866990023969900338
ENSE000037132637014202370142076
ENSE000037142976993678569936910
ENSE000037146217003424570034298
ENSE000037168807018470370184738
ENSE000037171607014276770142820
ENSE000037174147018487170184915
ENSE000037188857014421070144263
ENSE000037202017020690170206978
ENSE000037215587010216970102222
ENSE000037221516993278369932863
ENSE000037236247016334370163396
ENSE000037240907016594170165985
ENSE000037244807017196470172017
ENSE000037263917016817170168215
ENSE000037265376992942569929700
ENSE000037292707016802570168075
ENSE000037300996996282669962870
ENSE000037305587014189370141928
ENSE000037307147018031270180356
ENSE000037314257015612770156231
ENSE000037339157013768570137747
ENSE000037343067014665970146703
ENSE000037354617014491870145007
ENSE000037361286989894869899022
ENSE000037365087015631670156369
ENSE000037367917003590470035939
ENSE000037374687019960870199736
ENSE000037376597002362770023680
ENSE000037385767018046170180523
ENSE000037389506992790969928032
ENSE000037396057014970470149739
ENSE000037407956987953669879658
ENSE000037415467016190070161953
ENSE000037432767018807570188245
ENSE000037442757014681270146889
ENSE000037462537006842370068476
ENSE000037464167015667070156723
ENSE000037470097014999270150045
ENSE000037471167019031570190381
ENSE000037472447020714770212468
ENSE000037497936986655669866640
ENSE000037501277016865570168681
ENSE000037501877015139770151438
ENSE000037520647014985170149904
ENSE000037521636995999669960040
ENSE000037526717014095470140989

Expression profiles

Bgee: expression breadth ubiquitous, 153 present calls, max score 77.48.

FANTOM5 (CAGE): breadth broad, TPM avg 3.4837 / max 592.8185, expressed in 387 samples.

FANTOM5 promoters (6 alternative TSS)

Promoter IDTPM avgSamples expressed
684592.3230321
684600.7978224
684610.188769
684570.080523
2040490.060622
684580.033122

Top tissues by expression

251 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047377.48gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099176.60gold quality
mucosa of stomachUBERON:000119975.65gold quality
endothelial cellCL:000011570.23silver quality
epithelium of nasopharynxUBERON:000195169.92gold quality
cerebellar vermisUBERON:000472069.91silver quality
cortical plateUBERON:000534369.84gold quality
right hemisphere of cerebellumUBERON:001489069.17gold quality
epithelial cell of pancreasCL:000008368.73gold quality
cerebellar cortexUBERON:000212968.62gold quality
cerebellumUBERON:000203768.59gold quality
cerebellar hemisphereUBERON:000224568.50gold quality
cartilage tissueUBERON:000241867.93silver quality
esophagogastric junction muscularis propriaUBERON:003584165.98gold quality
tibialis anteriorUBERON:000138565.80silver quality
lymph nodeUBERON:000002965.62gold quality
popliteal arteryUBERON:000225065.04gold quality
tibial arteryUBERON:000761065.02gold quality
Brodmann (1909) area 23UBERON:001355464.76gold quality
middle temporal gyrusUBERON:000277164.47silver quality
urinary bladderUBERON:000125564.35gold quality
lower esophagus muscularis layerUBERON:003583363.74gold quality
lower esophagusUBERON:001347363.72gold quality
primary visual cortexUBERON:000243663.60gold quality
vermiform appendixUBERON:000115462.59gold quality
right atrium auricular regionUBERON:000663162.55gold quality
tonsilUBERON:000237262.52gold quality
prefrontal cortexUBERON:000045162.32gold quality
granulocyteCL:000009462.16gold quality
cardiac atriumUBERON:000208162.03gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes7.36
E-GEOD-111727no125.89

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

340 targeting COL19A1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3163100.0077.238605
HSA-MIR-29A-3P100.0073.111835
HSA-MIR-29B-3P100.0073.181833
HSA-MIR-29C-3P100.0073.151833
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-5692A100.0074.406850
HSA-LET-7A-3P100.0074.033932
HSA-MIR-513A-5P100.0069.772465
HSA-LET-7B-3P100.0074.083913
HSA-LET-7F-1-3P100.0074.023928
HSA-MIR-98-3P100.0074.083907
HSA-MIR-3646100.0073.565283
HSA-MIR-450A-1-3P100.0069.331837
HSA-MIR-518D-5P100.0067.51979
HSA-MIR-518E-5P100.0067.66954
HSA-MIR-518F-5P100.0067.51979
HSA-MIR-519A-5P100.0067.66954
HSA-MIR-519B-5P100.0067.66954
HSA-MIR-519C-5P100.0067.66954
HSA-MIR-520C-5P100.0067.51979
HSA-MIR-522-5P100.0067.66954
HSA-MIR-523-5P100.0067.66954
HSA-MIR-526A-5P100.0067.51979
HSA-MIR-8485100.0077.574731
HSA-MIR-4262100.0073.263931
HSA-MIR-4481100.0066.421669
HSA-MIR-4682100.0068.891258
HSA-MIR-340-5P100.0072.504437
HSA-MIR-3689D100.0066.141181
HSA-MIR-366299.9973.825684

Literature-anchored findings (GeneRIF, showing 4)

  • collagen IX has unique cell adhesion properties when compared with other collagens, and it provides a novel mechanism for cell adhesion to cartilaginous matrix (PMID:15383545)
  • The NC2 domain of collagen XIX and probably of other FACITs is responsible for chain selection and trimerization (PMID:18845531)
  • Type XIX collagen is a new partner in the interactions between tumor cells and their microenvironment. (Review) (PMID:27491275)
  • Naturally occurring interruptions in nonfibrillar collagen play key roles in molecular flexibility, collagen degradation, and ligand binding. This study focused on a G5G type natural interruption sequence G-POALO-G from human type XIX collagen, a homotrimer collagen, as this sequence possesses distinct properties compared with those of a pathological similar Gly mutation sequence in collagen mimic peptides. (PMID:29376320)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusCol19a1ENSMUSG00000026141
rattus_norvegicusCol19a1ENSRNOG00000012759

Paralogs (23): C1QTNF3 (ENSG00000082196), PDCD7 (ENSG00000090470), COL10A1 (ENSG00000123500), C1QL1 (ENSG00000131094), C1QTNF6 (ENSG00000133466), C1QL2 (ENSG00000144119), COL8A1 (ENSG00000144810), C1QTNF2 (ENSG00000145861), C1QC (ENSG00000159189), C1QTNF7 (ENSG00000163145), C1QL3 (ENSG00000165985), COL8A2 (ENSG00000171812), C1QTNF4 (ENSG00000172247), C1QB (ENSG00000173369), C1QA (ENSG00000173372), C1QTNF1 (ENSG00000173918), ADIPOQ (ENSG00000181092), OTOL1 (ENSG00000182447), C1QTNF8 (ENSG00000184471), C1QL4 (ENSG00000186897), C1QTNF9B (ENSG00000205863), C1QTNF5 (ENSG00000223953), C1QTNF9 (ENSG00000240654)

Protein

Protein identifiers

Collagen alpha-1(XIX) chainQ14993 (reviewed: Q14993)

Alternative names: Collagen alpha-1(Y) chain

All UniProt accessions (3): Q14993, A0A087WVJ7, Q5JVU1

UniProt curated annotations — full annotation on UniProt →

Function. May act as a cross-bridge between fibrils and other extracellular matrix molecules. Involved in skeletal myogenesis in the developing esophagus. May play a role in organization of the pericellular matrix or the sphinteric smooth muscle.

Subunit / interactions. Oligomer; disulfide-linked.

Subcellular location. Secreted. Extracellular space. Extracellular matrix.

Tissue specificity. Localized to vascular, neuronal, mesenchymal, and some epithelial basement membrane zones in umbilical cord.

Post-translational modifications. Prolines at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains.

Domain organisation. The numerous interruptions in the triple helix may make this molecule either elastic or flexible.

Similarity. Belongs to the fibril-associated collagens with interrupted helices (FACIT) family.

RefSeq proteins (1): NP_001849* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR008160CollagenRepeat
IPR013320ConA-like_dom_sfHomologous_superfamily
IPR048287TSPN-like_NDomain
IPR050149Collagen_superfamilyFamily

Pfam: PF01391

UniProt features (52 total): compositionally biased region 13, domain 12, sequence conflict 10, region of interest 9, sequence variant 5, signal peptide 1, chain 1, short sequence motif 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q14993-F155.830.15

Function

Pathways and Gene Ontology

Reactome pathways

3 pathways

IDPathway
R-HSA-1442490Collagen degradation
R-HSA-1650814Collagen biosynthesis and modifying enzymes
R-HSA-8948216Collagen chain trimerization

MSigDB gene sets: 273 (showing top): MORF_ITGA2, TGGTGCT_MIR29A_MIR29B_MIR29C, GOBP_MUSCLE_TISSUE_DEVELOPMENT, MORF_MSH3, GOBP_SKELETAL_SYSTEM_DEVELOPMENT, TGCACTT_MIR519C_MIR519B_MIR519A, GOCC_COLLAGEN_TRIMER, GCANCTGNY_MYOD_Q6, ATACCTC_MIR202, MORF_BRCA1, GAUSSMANN_MLL_AF4_FUSION_TARGETS_A_DN, ACEVEDO_LIVER_CANCER_WITH_H3K27ME3_UP, MORF_RAD51L3, GOBP_CELL_CELL_ADHESION, GOBP_MUSCLE_STRUCTURE_DEVELOPMENT

GO Biological Process (7): skeletal system development (GO:0001501), cell adhesion (GO:0007155), skeletal muscle tissue development (GO:0007519), cell differentiation (GO:0030154), extracellular matrix organization (GO:0030198), cell-cell adhesion (GO:0098609), muscle organ development (GO:0007517)

GO Molecular Function (3): extracellular matrix structural constituent (GO:0005201), extracellular matrix structural constituent conferring tensile strength (GO:0030020), protein-macromolecule adaptor activity (GO:0030674)

GO Cellular Component (4): extracellular region (GO:0005576), collagen trimer (GO:0005581), endoplasmic reticulum lumen (GO:0005788), extracellular matrix (GO:0031012)

Reactome top-level categories

Rollup of top-3 pathways:

CategoryPathways
Degradation of the extracellular matrix1
Collagen formation1
Collagen biosynthesis and modifying enzymes1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
system development1
cellular process1
striated muscle tissue development1
skeletal muscle organ development1
cellular developmental process1
extracellular structure organization1
external encapsulating structure organization1
cell adhesion1
animal organ development1
muscle structure development1
structural molecule activity1
extracellular matrix1
extracellular matrix structural constituent1
protein binding1
molecular adaptor activity1
cellular anatomical structure1
protein-containing complex1
endoplasmic reticulum1
intracellular organelle lumen1
external encapsulating structure1

Protein interactions and networks

STRING

978 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
COL19A1VWA1Q6PCB0840
COL19A1CNIH3Q8TBE1539
COL19A1CNIH1O95406532
COL19A1COL18A1P39060473
COL19A1ZNF573Q86YE8469
COL19A1COL15A1P39059457
COL19A1SEC16BQ96JE7445
COL19A1ABCA13Q86UQ4445
COL19A1APOBP04114439
COL19A1HSPG2P98160427
COL19A1MCTP2Q6DN12418
COL19A1PRDM15P57071417
COL19A1UNC80Q8N2C7417
COL19A1CMC2Q9NRP2410
COL19A1MAST4O15021410
COL19A1TP53I11O14683410

IntAct

2 interactions, top by confidence:

ABTypeScore
CAND1GTPBP10psi-mi:“MI:0914”(association)0.350

BioGRID (4): COL19A1 (Co-fractionation), COL19A1 (Co-fractionation), COL19A1 (Co-fractionation), COL19A1 (Affinity Capture-MS)

ESM2 similar proteins: A0A060WQA3, A0MSJ1, A5PN28, A6NHN0, A8WR59, C0HLN2, C7DZK3, O35167, O35348, O76368, O88207, P08122, P08572, P12106, P12107, P12108, P13942, P20849, P20850, P20908, P20909, P25067, P25940, P53420, P83371, P98085, Q01955, Q03637, Q05722, Q07092, Q07643, Q0VF58, Q14031, Q14055, Q14993, Q17RW2, Q28083, Q30D77, Q32S24, Q4ZJM7

Diamond homologs: P12106, P13944, P20849, P20850, Q05722, Q07092, Q0VF58, Q14993, Q60847, Q641F3, Q8BLX7, Q8NFW1, Q96P44, Q99715, Q9JI03, A0A1D5NSM8, A2AX52, A6H584, A6NMZ7, A6QLN9, A8TX70, E1BMV3, O00339, O08746, O15232, O35701, O42163, O42401, O43405, O89029, O95460, P05099, P12111, P15989, P18614, P20785, P21941, P32018, P34576, P51942

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

306 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance182
Likely benign15
Benign92

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
1456629NC_000006.11:g.(?70386050)(71012627_?)delPathogenic

SpliceAI

7835 predictions. Top by Δscore:

VariantEffectΔscore
6:69898942:A:AGacceptor_gain1.0000
6:69898943:A:Gacceptor_gain1.0000
6:69898947:GAA:Gacceptor_gain1.0000
6:69898947:GAAGA:Gacceptor_gain1.0000
6:69900159:A:Gdonor_gain1.0000
6:69900237:A:AGacceptor_gain1.0000
6:69900237:A:Cacceptor_loss1.0000
6:69900237:AG:Aacceptor_gain1.0000
6:69900238:G:GGacceptor_gain1.0000
6:69900238:GG:Gacceptor_gain1.0000
6:69900238:GGT:Gacceptor_gain1.0000
6:69900334:ACTAT:Adonor_gain1.0000
6:69900335:CTAT:Cdonor_gain1.0000
6:69900336:TAT:Tdonor_gain1.0000
6:69900336:TATG:Tdonor_loss1.0000
6:69900337:AT:Adonor_gain1.0000
6:69900337:ATG:Adonor_loss1.0000
6:69900338:TG:Tdonor_loss1.0000
6:69900339:G:GGdonor_gain1.0000
6:69900339:GTAA:Gdonor_loss1.0000
6:69900340:T:TCdonor_loss1.0000
6:69900341:AA:Adonor_loss1.0000
6:69927907:A:AGacceptor_gain1.0000
6:69927908:G:GTacceptor_gain1.0000
6:69927908:GT:Gacceptor_gain1.0000
6:69927908:GTA:Gacceptor_gain1.0000
6:69927908:GTAA:Gacceptor_gain1.0000
6:69929697:GGAT:Gdonor_gain1.0000
6:69929698:GAT:Gdonor_gain1.0000
6:69929698:GATG:Gdonor_gain1.0000

AlphaMissense

7247 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:69927991:T:AW117R0.991
6:69927991:T:CW117R0.991
6:69928000:T:AW120R0.990
6:69928000:T:CW120R0.990
6:70184907:G:AG950R0.988
6:70184907:G:CG950R0.988
6:69932790:T:CL225P0.986
6:70188076:G:AG953D0.986
6:70184899:G:AG947D0.985
6:69927962:G:CR107P0.983
6:69929547:G:CW171C0.983
6:69929547:G:TW171C0.983
6:70184908:G:AG950E0.983
6:70184898:G:CG947R0.982
6:70188084:G:AG956R0.982
6:70188084:G:CG956R0.982
6:70207212:T:AC1123S0.981
6:70207213:G:CC1123S0.981
6:70171982:G:AG863R0.980
6:70171982:G:CG863R0.980
6:70172000:G:AG869R0.980
6:70172000:G:CG869R0.980
6:70199736:G:CG1075R0.980
6:70150029:G:AG674D0.979
6:70206910:G:AG1078E0.979
6:70206918:G:AG1081R0.978
6:70206918:G:CG1081R0.978
6:70188084:G:TG956W0.977
6:70199691:G:TG1060W0.976
6:70156679:G:AG750R0.975

dbSNP variants (sampled 300 via entrez): RS1000002560 (6:70123322 G>T), RS1000006792 (6:70023494 T>C), RS1000014791 (6:69926570 G>C), RS1000018552 (6:70064092 A>G), RS1000022834 (6:69989024 T>C), RS1000031392 (6:70183197 G>T), RS1000043910 (6:70036737 T>A,C), RS1000062486 (6:70019847 A>C), RS1000074521 (6:69923299 A>G), RS1000109004 (6:70129605 C>G,T), RS1000112674 (6:70170897 G>A), RS1000120562 (6:69879968 A>G), RS1000124824 (6:69923523 T>C), RS1000153949 (6:70166169 A>G), RS1000177635 (6:70178435 A>G,T)

Disease associations

OMIM: gene MIM:120165 | disease phenotypes: MIM:277380

GenCC curated gene-disease

Mondo (1): methylmalonic aciduria and homocystinuria type cblF (MONDO:0010183)

Orphanet (1): Methylmalonic acidemia with homocystinuria type cblF (Orphanet:79284)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

5 associations (top):

StudyTraitp-value
GCST000635_7Response to statin therapy4.000000e-06
GCST006665_4Social science traits (pleiotropy) (HIPO component 1)2.000000e-09
GCST008157_16Body fat mass7.000000e-06
GCST008158_150Body mass index7.000000e-06
GCST010676_6Leukoderma in response to rhododendrol2.000000e-06

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0007006depressive symptom measurement
EFO:0007660neuroticism measurement
EFO:0007869wellbeing measurement
EFO:0004340body mass index

MeSH disease descriptors (1)

DescriptorNameTree numbers
C564747Methylmalonic Aciduria and Homocystinuria, CblF Type (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

8 total (human), top 8 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, increases methylation2
Aflatoxin B1decreases methylation, increases methylation2
FR900359decreases phosphorylation1
Sunitinibincreases expression1
Phthalic Acidsaffects methylation1
Valproic Aciddecreases methylation1
Asbestos, Serpentineincreases methylation1
Coal Ashincreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.