COL1A1

gene
On this page

Also known as OI4

Summary

COL1A1 (collagen type I alpha 1 chain, HGNC:2197) is a protein-coding gene on chromosome 17q21.33, encoding Collagen alpha-1(I) chain (P02452). Type I collagen is a member of group I collagen (fibrillar forming collagen). It is haploinsufficient (ClinGen: sufficient evidence).

This gene encodes the pro-alpha1 chains of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutations in this gene are associated with osteogenesis imperfecta types I-IV, Ehlers-Danlos syndrome type VIIA, Ehlers-Danlos syndrome Classical type, Caffey Disease and idiopathic osteoporosis. Reciprocal translocations between chromosomes 17 and 22, where this gene and the gene for platelet-derived growth factor beta are located, are associated with a particular type of skin tumor called dermatofibrosarcoma protuberans, resulting from unregulated expression of the growth factor. Two transcripts, resulting from the use of alternate polyadenylation signals, have been identified for this gene.

Source: NCBI Gene 1277 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): Caffey disease (Definitive, ClinGen) — +11 more curated relationships
  • GWAS associations: 5
  • Clinical variants (ClinVar): 3,793 total — 1061 pathogenic, 318 likely-pathogenic
  • Phenotypes (HPO): 186
  • Druggable target: yes
  • Cancer driver (intOGen): loss-of-function (tumor-suppressor-like) across 5 cancer types
  • Dosage sensitivity (ClinGen): haploinsufficiency sufficient evidence, triplosensitivity no evidence
  • MANE Select transcript: NM_000088

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:2197
Approved symbolCOL1A1
Namecollagen type I alpha 1 chain
Location17q21.33
Locus typegene with protein product
StatusApproved
AliasesOI4
Ensembl geneENSG00000108821
Ensembl biotypeprotein_coding
OMIM120150
Entrez1277

Gene structure

Transcript identifiers

Ensembl transcripts: 27 — 16 protein_coding, 11 retained_intron

ENST00000225964, ENST00000463440, ENST00000471344, ENST00000474644, ENST00000476387, ENST00000485870, ENST00000486572, ENST00000494334, ENST00000495677, ENST00000504289, ENST00000507689, ENST00000510710, ENST00000511732, ENST00000861333, ENST00000861334, ENST00000861335, ENST00000861336, ENST00000861337, ENST00000861338, ENST00000861339, ENST00000861340, ENST00000861341, ENST00000861342, ENST00000861343, ENST00000861344, ENST00000919350, ENST00000963828

RefSeq mRNA: 1 — MANE Select: NM_000088 NM_000088

CCDS: CCDS11561

Canonical transcript exons

ENST00000225964 — 51 exons

ExonStartEnd
ENSE000007367135019843350198504
ENSE000007367435019648450196528
ENSE000007367475019631450196367
ENSE000007367515019615550196199
ENSE000007367865019394350194041
ENSE000007368145019054350190596
ENSE000007368195019032750190380
ENSE000007368235019000150190108
ENSE000007368275018985950189912
ENSE000007368305018967950189732
ENSE000007368325018937750189538
ENSE000007368345018916850189275
ENSE000007368365018890350189010
ENSE000007368385018874250188795
ENSE000007368405018853050188637
ENSE000007368425018809650188149
ENSE000007368515018664050186922
ENSE000007368535018631750186507
ENSE000007368555018577850186020
ENSE000008198775019922650199327
ENSE000011320885018410150185648
ENSE000015948595018701550187122
ENSE000022144985019592350195976
ENSE000025113935019955650199590
ENSE000025151225019941850199453
ENSE000032844765018787650187983
ENSE000034320125018748450187537
ENSE000034585165019661750196670
ENSE000034658305019198050192024
ENSE000034680215019413050194183
ENSE000034716905019434950194447
ENSE000034724085019472150194828
ENSE000034975225019299450193047
ENSE000035000235020141150201631
ENSE000035016965019264050192693
ENSE000035024015019816150198205
ENSE000035205335019718050197233
ENSE000035276095019556750195665
ENSE000035280105019457350194626
ENSE000035411535019701050197063
ENSE000035817725019178850191886
ENSE000035861345019773250197785
ENSE000036033145019523250195330
ENSE000036149035019504750195100
ENSE000036218665019794950198002
ENSE000036266855019247550192528
ENSE000036342355019081750190924
ENSE000036395855019543450195478
ENSE000036651825019138350191490
ENSE000036848205019975350199947
ENSE000036857235019279750192850

Expression profiles

Bgee: expression breadth ubiquitous, 298 present calls, max score 99.99.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 1383.3737 / max 36784.7674, expressed in 1468 samples.

FANTOM5 promoters (54 alternative TSS)

Promoter IDTPM avgSamples expressed
1669731271.35481233
20825816.2230843
1669727.54231109
1669006.5092737
1669086.3696721
1669065.5118735
1668865.1671676
1669294.8966691
1669244.4311657
1669043.3619650

Top tissues by expression

306 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
stromal cell of endometriumCL:000225599.99gold quality
skin of hipUBERON:000155499.96gold quality
periodontal ligamentUBERON:000826699.94gold quality
gall bladderUBERON:000211099.90gold quality
tibiaUBERON:000097999.86gold quality
cartilage tissueUBERON:000241899.86gold quality
mucosa of paranasal sinusUBERON:000503099.86gold quality
tendon of biceps brachiiUBERON:000818899.86gold quality
upper leg skinUBERON:000426299.85gold quality
parietal pleuraUBERON:000240099.83gold quality
visceral pleuraUBERON:000240199.82gold quality
upper arm skinUBERON:000426399.79gold quality
trabecular bone tissueUBERON:000248399.76gold quality
pylorusUBERON:000116699.75gold quality
cardia of stomachUBERON:000116299.74gold quality
saphenous veinUBERON:000731899.73gold quality
mammalian vulvaUBERON:000099799.72gold quality
myometriumUBERON:000129699.66gold quality
smooth muscle tissueUBERON:000113599.63gold quality
cauda epididymisUBERON:000436099.63gold quality
endocervixUBERON:000045899.62gold quality
adult organismUBERON:000702399.61gold quality
urethraUBERON:000005799.59gold quality
nippleUBERON:000203099.58gold quality
vena cavaUBERON:000408799.55gold quality
pleuraUBERON:000097799.54gold quality
mammary ductUBERON:000176599.54gold quality
body of uterusUBERON:000985399.54gold quality
seminal vesicleUBERON:000099899.53gold quality
right coronary arteryUBERON:000162599.53gold quality

Single-cell (SCXA)

Detected in 58 experiment(s), a significant marker in 57.

ExperimentMarker?Max mean expression
E-CURD-112yes48432.62
E-MTAB-10596yes42250.84
E-MTAB-8221yes28494.09
E-HCAD-24yes28388.60
E-MTAB-6701yes21099.15
E-MTAB-9906yes20090.44
E-GEOD-75688yes18621.46
E-MTAB-8410yes18053.62
E-MTAB-7407yes17802.80
E-HCAD-23yes17548.49
E-MTAB-5061yes13628.07
E-MTAB-10662yes11151.95
E-CURD-79yes10565.85
E-HCAD-10yes10395.39
E-CURD-126yes9557.13

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): CDK9, CEBPB, CEBPD, CIITA, CREB3L1, DLX5, EGR1, ETS1, HES1, HEXIM1, LEF1, MAPK14, MEF2A, MRTFA, MSX2, MYB, MYBL2, NFATC1, NFATC4, NFIC, NFKB1, NOTCH1, NR1H4, NR4A2, RELA, ROCK1, ROCK2, RUNX2, SCX, SMAD3, SMAD4, SMAD7, SMOC1, SP1, SP3, SP7, SPOP, SRSF1, SRSF2, STAT1

miRNA regulators (miRDB)

154 targeting COL1A1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-29A-3P100.0073.111835
HSA-MIR-29B-3P100.0073.181833
HSA-MIR-29C-3P100.0073.151833
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-5196-5P100.0067.982761
HSA-MIR-6740-5P100.0065.64932
HSA-MIR-4510100.0066.602050
HSA-MIR-6127100.0066.762188
HSA-MIR-6133100.0066.482064
HSA-MIR-450A-1-3P100.0069.331837
HSA-MIR-6129100.0066.462080
HSA-MIR-6130100.0066.692012
HSA-MIR-4747-5P100.0067.902681
HSA-MIR-9-5P100.0072.282361
HSA-MIR-4789-3P99.9970.752484
HSA-MIR-450099.9972.722367
HSA-MIR-453199.9969.703181
HSA-MIR-4789-5P99.9870.762721
HSA-MIR-1213699.9872.815713
HSA-MIR-3692-3P99.9870.272139
HSA-LET-7A-5P99.9872.291790
HSA-LET-7B-5P99.9872.311790
HSA-LET-7C-5P99.9872.291790
HSA-LET-7E-5P99.9872.291790
HSA-LET-7F-5P99.9872.561784
HSA-LET-7G-5P99.9872.371784
HSA-LET-7I-5P99.9872.371788
HSA-MIR-98-5P99.9872.331787
HSA-MIR-314899.9775.066478
HSA-MIR-548AJ-3P99.9673.385345

Functional genomics

ClinGen dosage: haploinsufficiency 3 (sufficient evidence), triplosensitivity 0 (no evidence). ClinGen Gene Dosage Map

Literature-anchored findings (GeneRIF, showing 40)

  • disulfide disruption permits slow assembly and secretion of overmodified, but stable procollagen trimers and results in mild osteogenesis imperfecta (PMID:11432962)
  • Deletions and duplications of Gly-Xaa-Yaa triplet repeats in the triple helical domains of type I collagen chains disrupt helix formation and result in several types of osteogenesis imperfecta (PMID:11668615)
  • A member of the Y-box protein family interacts with an upstream element in the alpha1(I) collagen gene. (PMID:11731270)
  • COLIA1 Sp1 TT genotype is associated with an increased fracture risk in postmenopausal women (PMID:11792589)
  • two new SNPs in the COL1A1 promoter, which may affect bone mass determination (PMID:11874231)
  • keratinocyte growth factor (KGF), a key stimulator of epithelial cell proliferation during wound healing, preferentially binds to collagens I, III, and VI. (PMID:11973338)
  • The collagen type I, alpha 1 polymorphism in the Sp1 binding site is associated with differences in ultrasound transmission velocity in the calcaneus of postmenopausal women. (PMID:12016466)
  • Fusion of COL1A1 exon 29 with PDGFB exon 2 in a der(22)t(17;22) in a pediatric giant cell fibroblastoma with a pigmented Bednar tumor component. Evidence for age-related chromosomal pattern in dermatofibrosarcoma protuberans and related tumors. (PMID:12034531)
  • A 30-residue peptide model of the alpha 1(I) chain of type I collagen is designed which demonstrates sequence preference of the C-terminal repeating (Gly-Pro-Hyp)4 subdomain for initiation of triple-helix formation–an all-or-none third-order reaction. (PMID:12069607)
  • Interaction of human breast fibroblasts with collagen I increases total cathepsin B protein levels (PMID:12072442)
  • investigation of the genetic influence of the Sp1 polymorphism on bone density in Irish women (PMID:12073153)
  • C-propeptide region of human pro alpha 1 type 1 collagen interacts with thioredoxin (PMID:12099690)
  • A fusion of COLIA1 exon 24 in frame with PDGFB exon 2 was found in a complex marker chromosome from dermatofibrosarcoma protuberans with sequences from chromosomes 7,8,17,21, & 22. (PMID:12127408)
  • These data indicate that mitoxantrone and WP631 are very potent inhibitors of basal and TGF-beta-stimulated COL1A1 expression. (PMID:12138160)
  • in the presence of high molecular weight fragments of type I collagen, type I procollagen synthesis is inhibited. (PMID:12164934)
  • Upstream elements present in the 3’-untranslated region of the gene influence the processing efficiency of overlapping polyadenylation signals. (PMID:12200454)
  • Allele frequency of the G–>T mutation of this gene is analyzed by an ARMS-PCR in osteoporotic subjects with femoral neck fractures. (PMID:12211646)
  • The COLIA1 polymorphism in children and young adults is associated with several bone characteristics. (PMID:12232678)
  • This protein is regulated by human basic fibroblast growth factor. (PMID:12393937)
  • Analysis of the Collal alleles provides early detection of the individuals with hereditary predisposition to osteoporosis and prophylaxis of the disease at the presymptomatic stage. (PMID:12575457)
  • A degradation fragment originating from the helical part of type I collagen consisting of residues 620-633 of the alpha 1 chain sequence is useful clinically as a bone resorption marker in patients with osteoporosis. (PMID:12584032)
  • mutational analysis in Lithuanian patients with osteogenesis imperfecta (PMID:12590186)
  • in men, the “ss” genotype of COLIA1 polymorphism could be the best osteoporotic fracture risk genetic predictor, independent of bone mass values (PMID:12753258)
  • COL7A1 mutation is characteristic in patients with dystrophic epidermolysis bullosa. (PMID:12787275)
  • Polymorphism analysis in the COLIA1 gene of patients with thalassemia major and intermedia. (PMID:12803121)
  • The Sp1 transcription factor binding site polymorphism of COL1A1 is associated with a modest reduction in bone mineral density and a significant increase in risk of osteoporotic fracture, particularly vertebral fracture. (PMID:12810179)
  • estrogen receptor alpha Px haplotype and collagen IA1 s allele may be involved in causing the phenotypic expression of higher circulating levels of parathyroid hormone and higher bone turnover, which may lead to bone loss (PMID:14506618)
  • genetic polymorphisms of the vitamin D receptor, estrogen receptor, and collagen Ialpha1 genes were characterized in 72 osteosarcoma and 53 Ewing sarcomas and in a group of 143 healthy matched children. (PMID:14528100)
  • The “s” allele of COLIA1 gene in combination with the Px haplotype of the ER alpha gene contributes to reduced BMD in females. (PMID:14562991)
  • In a cohort of 75-year-old Swedish women, there was an association among the Sp1 COLIA1 polymorphism, bone mass, and fracture. (PMID:14595528)
  • COL1A1 gene expression is downregulated by B-Myb in fibroblasts from patients with systemic sclerosis. (PMID:14613485)
  • Although connective tissue growth factor alone had no effect on collagen secretion, combined stimulation with IGF-I enhanced collagen accumulation. (PMID:14633859)
  • This study identified a novel COL1A1 breakpoint, namely, exon 42 of the COL1A1 gene. (PMID:14643521)
  • COL1A1 gene may have some effects on bone size variation at the wrist, but not at the spine or hip in Caucasian nuclear families. (PMID:14722584)
  • The genotype frequency of COLIA1 polymorphism was also not different between PBC patients and controls, however the “s” allele was significantly less frequent in patients with PBC (p = 0.038). (PMID:15049048)
  • Osteogenesis imperfecta Type 1: diagnosis could be verified by moleculargenetic analysis, a newly recognized heterozygous point mutation (Arg420Stop) in the COL1A1-gene was found. (PMID:15106082)
  • Results characterize promoters from the human alpha1(I) procollagen gene. (PMID:15133851)
  • The lower collagen content in the endopelvic fascia and skin of women with SUI is not due to reduced collagen synthesis or selective reduction in synthesis of either collagen I or collagen III. (PMID:15227656)
  • This study showed a significant association between otosclerosis and the COL1A1 first intron Sp1 site. The allelic frequency of the Sp1 site is very similar between otosclerosis and osteoporosis. (PMID:15241219)
  • Predicted rates of AA substitution for Gly were compared with missense mutations known to cause disease. The most destabilizing residues, Val, Glu, & Asp, & the least destabilizing residue, Ala, were underrepresented. (PMID:15365990)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
danio_reriocol1a1bENSDARG00000035809
mus_musculusCol1a1ENSMUSG00000001506
rattus_norvegicusCol1a1ENSRNOG00000003897

Paralogs (37): COL9A2 (ENSG00000049089), COL23A1 (ENSG00000050767), COL11A1 (ENSG00000060718), COL17A1 (ENSG00000065618), COL5A3 (ENSG00000080573), COL4A4 (ENSG00000081052), COL16A1 (ENSG00000084636), COL9A3 (ENSG00000092758), COL20A1 (ENSG00000101203), COL9A1 (ENSG00000112280), COL7A1 (ENSG00000114270), COL21A1 (ENSG00000124749), COL5A1 (ENSG00000130635), COL4A2 (ENSG00000134871), COL2A1 (ENSG00000139219), COL6A1 (ENSG00000142156), COL6A2 (ENSG00000142173), EDA (ENSG00000158813), COL26A1 (ENSG00000160963), COL1A2 (ENSG00000164692), COL3A1 (ENSG00000168542), COL4A3 (ENSG00000169031), COL22A1 (ENSG00000169436), COL24A1 (ENSG00000171502), COL18A1 (ENSG00000182871), EMID1 (ENSG00000186998), COL4A1 (ENSG00000187498), COL4A5 (ENSG00000188153), COL25A1 (ENSG00000188517), COL27A1 (ENSG00000196739), COL13A1 (ENSG00000197467), COL4A6 (ENSG00000197565), COL11A2 (ENSG00000204248), COL5A2 (ENSG00000204262), COL15A1 (ENSG00000204291), COLQ (ENSG00000206561), COL28A1 (ENSG00000215018)

Protein

Protein identifiers

Collagen alpha-1(I) chainP02452 (reviewed: P02452)

Alternative names: Alpha-1 type I collagen

All UniProt accessions (2): P02452, I3L3H7

UniProt curated annotations — full annotation on UniProt →

Function. Type I collagen is a member of group I collagen (fibrillar forming collagen).

Subunit / interactions. Trimers of one alpha 2(I) and two alpha 1(I) chains. Interacts with MRC2. Interacts with TRAM2. Interacts with MFAP4 in a Ca (2+)-dependent manner.

Subcellular location. Secreted. Extracellular space. Extracellular matrix.

Tissue specificity. Forms the fibrils of tendon, ligaments and bones. In bones the fibrils are mineralized with calcium hydroxyapatite.

Post-translational modifications. Contains mostly 4-hydroxyproline. Proline residues at the third position of the tripeptide repeating unit (G-X-Y) are hydroxylated in some or all of the chains. Contains 3-hydroxyproline at a few sites. This modification occurs on the first proline residue in the sequence motif Gly-Pro-Hyp, where Hyp is 4-hydroxyproline. Lysine residues at the third position of the tripeptide repeating unit (G-X-Y) are 5-hydroxylated in some or all of the chains. O-glycosylated on hydroxylated lysine residues. The O-linked glycan consists of a Glc-Gal disaccharide.

Disease relevance. Caffey disease (CAFYD) [MIM:114000] An autosomal dominant disorder characterized by an infantile episode of massive subperiosteal new bone formation that typically involves the diaphyses of the long bones, mandible, and clavicles. The involved bones may also appear inflamed, with painful swelling and systemic fever often accompanying the illness. The bone changes usually begin before 5 months of age and resolve before 2 years of age. The disease is caused by variants affecting the gene represented in this entry. Ehlers-Danlos syndrome, classic type, 1 (EDSCL1) [MIM:130000] A form of Ehlers-Danlos syndrome, a group of connective tissue disorders characterized by skin hyperextensibility, articular hypermobility, and tissue fragility. The main features of classic Ehlers-Danlos syndrome are joint hypermobility and dislocation, and fragile, bruisable skin. EDSCL1 inheritance is autosomal dominant. The disease may be caused by variants affecting the gene represented in this entry. Ehlers-Danlos syndrome, arthrochalasia type, 1 (EDSARTH1) [MIM:130060] A form of Ehlers-Danlos syndrome, a connective tissue disorder characterized by hyperextensible skin, atrophic cutaneous scars due to tissue fragility and joint hyperlaxity. EDSARTH1 is an autosomal dominant form characterized by frequent congenital hip dislocation and extreme joint laxity with recurrent joint subluxations and minimal skin involvement. The disease is caused by variants affecting the gene represented in this entry. Osteogenesis imperfecta 1 (OI1) [MIM:166200] An autosomal dominant form of osteogenesis imperfecta, a disorder of bone formation characterized by bone low bone mass, bone fragility and susceptibility to fractures after minimal trauma. Disease severity ranges from very mild forms without fractures to intrauterine fractures and perinatal lethality. Extraskeletal manifestations, which affect a variable number of patients, are dentinogenesis imperfecta, hearing loss, and blue sclerae. OI1 is a non-deforming form with normal height or mild short stature, and no dentinogenesis imperfecta. The disease is caused by variants affecting the gene represented in this entry. Osteogenesis imperfecta 2 (OI2) [MIM:166210] An autosomal dominant form of osteogenesis imperfecta, a disorder of bone formation characterized by low bone mass, bone fragility and susceptibility to fractures after minimal trauma. Disease severity ranges from very mild forms without fractures to intrauterine fractures and perinatal lethality. Extraskeletal manifestations, which affect a variable number of patients, are dentinogenesis imperfecta, hearing loss, and blue sclerae. OI2 is characterized by bone fragility, with many perinatal fractures, severe bowing of long bones, undermineralization, and death in the perinatal period due to respiratory insufficiency. The disease is caused by variants affecting the gene represented in this entry. Osteogenesis imperfecta 3 (OI3) [MIM:259420] An autosomal dominant form of osteogenesis imperfecta, a disorder of bone formation characterized by low bone mass, bone fragility and susceptibility to fractures after minimal trauma. Disease severity ranges from very mild forms without fractures to intrauterine fractures and perinatal lethality. Extraskeletal manifestations, which affect a variable number of patients, are dentinogenesis imperfecta, hearing loss, and blue sclerae. OI3 is characterized by progressively deforming bones, very short stature, a triangular face, severe scoliosis, grayish sclera and dentinogenesis imperfecta. The disease is caused by variants affecting the gene represented in this entry. Osteogenesis imperfecta 4 (OI4) [MIM:166220] An autosomal dominant form of osteogenesis imperfecta, a disorder of bone formation characterized by low bone mass, bone fragility and susceptibility to fractures after minimal trauma. Disease severity ranges from very mild forms without fractures to intrauterine fractures and perinatal lethality. Extraskeletal manifestations, which affect a variable number of patients, are dentinogenesis imperfecta, hearing loss, and blue sclerae. OI4 is characterized by moderately short stature, mild to moderate scoliosis, grayish or white sclera and dentinogenesis imperfecta. The disease is caused by variants affecting the gene represented in this entry. Combined osteogenesis imperfecta and Ehlers-Danlos syndrome 1 (OIEDS1) [MIM:619115] An autosomal dominant connective tissue disorder characterized by osteopenia, bone fragility, long bone fractures, blue sclerae, joint hyperextensibility, soft and hyperextensible skin, abnormal wound healing, easy bruising, and vascular fragility. The disease is caused by variants affecting the gene represented in this entry. Osteoporosis (OSTEOP) [MIM:166710] A systemic skeletal disorder characterized by decreased bone mass and deterioration of bone microarchitecture without alteration in the composition of bone. The result is fragile bones and an increased risk of fractures, even after minimal trauma. Osteoporosis is a chronic condition of multifactorial etiology and is usually clinically silent until a fracture occurs. Disease susceptibility is associated with variants affecting the gene represented in this entry. A chromosomal aberration involving COL1A1 is found in dermatofibrosarcoma protuberans. Translocation t(17;22)(q22;q13) with PDGF.

Domain organisation. The C-terminal propeptide, also known as COLFI domain, have crucial roles in tissue growth and repair by controlling both the intracellular assembly of procollagen molecules and the extracellular assembly of collagen fibrils. It binds a calcium ion which is essential for its function.

Similarity. Belongs to the fibrillar collagen family.

RefSeq proteins (1): NP_000079* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000885Fib_collagen_CDomain
IPR001007VWF_domDomain
IPR008160CollagenRepeat
IPR050149Collagen_superfamilyFamily

Pfam: PF00093, PF01391, PF01410

UniProt features (386 total): sequence variant 175, modified residue 125, compositionally biased region 25, strand 14, sequence conflict 11, helix 6, disulfide bond 5, binding site 5, region of interest 4, glycosylation site 3, site 3, propeptide 2, short sequence motif 2, turn 2, domain 2, signal peptide 1, chain 1

Structure

Experimental structures (PDB)

14 structures.

PDBMethodResolution (Å)
5CTDX-RAY DIFFRACTION1.6
8YV3X-RAY DIFFRACTION1.68
1Q7DX-RAY DIFFRACTION1.8
5CTIX-RAY DIFFRACTION1.9
5CVAX-RAY DIFFRACTION2.1
3EJHX-RAY DIFFRACTION2.1
5K31X-RAY DIFFRACTION2.2
5CVBX-RAY DIFFRACTION2.25
5OU8X-RAY DIFFRACTION2.5
5OU9X-RAY DIFFRACTION2.5
3GXEX-RAY DIFFRACTION2.6
7E7BELECTRON MICROSCOPY2.6
7E7DELECTRON MICROSCOPY3.2
2LLPSOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P02452-F154.260.18

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (3): 161–162 (cleavage; by procollagen n-endopeptidase); 953–954 (cleavage; by collagenase); 1218–1219 (cleavage; by procollagen c-endopeptidase)

Ligand- & substrate-binding residues (5): 1277; 1279; 1280; 1282; 1285

Post-translational modifications (125): 622, 640, 646, 652, 658, 664, 670, 682, 691, 703, 715, 718, 724, 730, 739, 751, 757, 772, 778, 787 …

Disulfide bonds (5): 1259–1291, 1265, 1282, 1299–1462, 1370–1415

Glycosylation sites (3): 265, 1108, 1365

Function

Pathways and Gene Ontology

Reactome pathways

26 pathways

IDPathway
R-HSA-114604GPVI-mediated activation cascade
R-HSA-1442490Collagen degradation
R-HSA-1566977Fibronectin matrix formation
R-HSA-1650814Collagen biosynthesis and modifying enzymes
R-HSA-198933Immunoregulatory interactions between a Lymphoid and a non-Lymphoid cell
R-HSA-2022090Assembly of collagen fibrils and other multimeric structures
R-HSA-202733Cell surface interactions at the vascular wall
R-HSA-216083Integrin cell surface interactions
R-HSA-2214320Anchoring fibril formation
R-HSA-2243919Crosslinking of collagen fibrils
R-HSA-3000170Syndecan interactions
R-HSA-3000171Non-integrin membrane-ECM interactions
R-HSA-3000178ECM proteoglycans
R-HSA-3000480Scavenging by Class A Receptors
R-HSA-430116GP1b-IX-V activation signalling
R-HSA-75892Platelet Adhesion to exposed collagen
R-HSA-76009Platelet Aggregation (Plug Formation)
R-HSA-8874081MET activates PTK2 signaling
R-HSA-8940973RUNX2 regulates osteoblast differentiation
R-HSA-8948216Collagen chain trimerization
R-HSA-9845619Enhanced cleavage of VWF variant by ADAMTS13
R-HSA-9845620Enhanced binding of GP1BA variant to VWF multimer:collagen
R-HSA-9845621Defective VWF cleavage by ADAMTS13 variant
R-HSA-9845622Defective VWF binding to collagen type I
R-HSA-9846298Defective binding of VWF variant to GPIb:IX:V
R-HSA-9925563Developmental Lineage of Pancreatic Ductal Cells

MSigDB gene sets: 895 (showing top): GSE18804_SPLEEN_MACROPHAGE_VS_COLON_TUMORAL_MACROPHAGE_UP, TGGTGCT_MIR29A_MIR29B_MIR29C, AGGAAGC_MIR5163P, TURASHVILI_BREAST_LOBULAR_CARCINOMA_VS_DUCTAL_NORMAL_UP, GOBP_RESPONSE_TO_NITROGEN_COMPOUND, GOBP_EMBRYO_DEVELOPMENT_ENDING_IN_BIRTH_OR_EGG_HATCHING, ACTACCT_MIR196A_MIR196B, GOBP_BODY_MORPHOGENESIS, TAATAAT_MIR126, GOBP_BONE_TRABECULA_MORPHOGENESIS, CHIARADONNA_NEOPLASTIC_TRANSFORMATION_KRAS_DN, MYOGENIN_Q6, GOBP_COLLAGEN_FIBRIL_ORGANIZATION, GOBP_RESPONSE_TO_ESTRADIOL, GOBP_CARTILAGE_DEVELOPMENT

GO Biological Process (51): skeletal system development (GO:0001501), blood vessel development (GO:0001568), osteoblast differentiation (GO:0001649), intramembranous ossification (GO:0001957), endochondral ossification (GO:0001958), visual perception (GO:0007601), sensory perception of sound (GO:0007605), response to xenobiotic stimulus (GO:0009410), positive regulation of epithelial to mesenchymal transition (GO:0010718), negative regulation of cell-substrate adhesion (GO:0010812), protein transport (GO:0015031), collagen fibril organization (GO:0030199), positive regulation of cell migration (GO:0030335), response to estradiol (GO:0032355), response to insulin (GO:0032868), collagen biosynthetic process (GO:0032964), protein localization to nucleus (GO:0034504), tooth mineralization (GO:0034505), collagen-activated tyrosine kinase receptor signaling pathway (GO:0038063), response to hydrogen peroxide (GO:0042542), skin morphogenesis (GO:0043589), cellular response to fibroblast growth factor stimulus (GO:0044344), positive regulation of DNA-templated transcription (GO:0045893), response to steroid hormone (GO:0048545), embryonic skeletal system development (GO:0048706), response to cAMP (GO:0051591), response to hyperoxia (GO:0055093), face morphogenesis (GO:0060325), bone trabecula formation (GO:0060346), cartilage development involved in endochondral bone morphogenesis (GO:0060351), cellular response to amino acid stimulus (GO:0071230), cellular response to mechanical stimulus (GO:0071260), cellular response to retinoic acid (GO:0071300), cellular response to vitamin E (GO:0071306), cellular response to glucose stimulus (GO:0071333), cellular response to tumor necrosis factor (GO:0071356), cellular response to epidermal growth factor stimulus (GO:0071364), cellular response to transforming growth factor beta stimulus (GO:0071560), positive regulation of canonical Wnt signaling pathway (GO:0090263), cellular response to fluoride (GO:1902618)

GO Molecular Function (7): protease binding (GO:0002020), extracellular matrix structural constituent conferring tensile strength (GO:0030020), identical protein binding (GO:0042802), metal ion binding (GO:0046872), platelet-derived growth factor binding (GO:0048407), extracellular matrix structural constituent (GO:0005201), protein binding (GO:0005515)

GO Cellular Component (9): extracellular region (GO:0005576), collagen type I trimer (GO:0005584), obsolete extracellular space (GO:0005615), endoplasmic reticulum lumen (GO:0005788), secretory granule (GO:0030141), extracellular matrix (GO:0031012), collagen trimer (GO:0005581), fibrillar collagen trimer (GO:0005583), cytoplasm (GO:0005737)

Reactome top-level categories

Rollup of top-12 pathways:

CategoryPathways
Extracellular matrix organization4
Platelet activation, signaling and aggregation3
Collagen formation2
Hemostasis2
Assembly of collagen fibrils and other multimeric structures2
Degradation of the extracellular matrix1
Adaptive Immune System1
Non-integrin membrane-ECM interactions1
Binding and Uptake of Ligands by Scavenger Receptors1
MET promotes cell motility1
RUNX2 regulates bone development1
Collagen biosynthesis and modifying enzymes1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure2
system development1
vasculature development1
anatomical structure development1
ossification1
cell differentiation1
direct ossification1
replacement ossification1
endochondral bone morphogenesis1
sensory perception of light stimulus1
sensory perception of mechanical stimulus1
response to chemical1
epithelial to mesenchymal transition1
regulation of epithelial to mesenchymal transition1
positive regulation of cell differentiation1
positive regulation of multicellular organismal process1
negative regulation of cell adhesion1
regulation of cell-substrate adhesion1
cell-substrate adhesion1
transport1
intracellular protein localization1
establishment of protein localization1
extracellular matrix organization1
cell migration1
regulation of cell migration1
positive regulation of cell motility1
response to lipid1
response to oxygen-containing compound1
response to peptide hormone1
biosynthetic process1
collagen metabolic process1
protein localization to organelle1
biomineral tissue development1
odontogenesis1
cell surface receptor protein tyrosine kinase signaling pathway1
collagen-activated signaling pathway1
response to reactive oxygen species1
enzyme binding1
extracellular matrix structural constituent1
protein binding1

Protein interactions and networks

STRING

5086 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
COL1A1COL1A2P02464981
COL1A1COL3A1P02461965
COL1A1CRTAPO75718925
COL1A1BGNP13247916
COL1A1FN1P02751912
COL1A1COL5A2P05997912
COL1A1ACTA2P03996900
COL1A1BGLAPP02818889
COL1A1CD44P16070882
COL1A1P3H1Q32P28881
COL1A1COL5A1P20908880
COL1A1SP7Q8TDD2879
COL1A1COL6A3P12111875
COL1A1LUMP51884854
COL1A1FBN1P35555852

IntAct

114 interactions, top by confidence:

ABTypeScore
FN1COL1A1psi-mi:“MI:0407”(direct interaction)0.710
COL1A1P4HA2psi-mi:“MI:0915”(physical association)0.710
COL1A1FN1psi-mi:“MI:0407”(direct interaction)0.710
PEG10RTL8Cpsi-mi:“MI:0914”(association)0.670
YAF2E2F6psi-mi:“MI:0914”(association)0.640
COL1A1psi-mi:“MI:0407”(direct interaction)0.600
COL1A1psi-mi:“MI:0407”(direct interaction)0.600
COL1A1COL1A2psi-mi:“MI:0915”(physical association)0.580
COL1A1CRTAPpsi-mi:“MI:0915”(physical association)0.580
COL1A1KEAP1psi-mi:“MI:0915”(physical association)0.560
COL1A1PDIA4psi-mi:“MI:0914”(association)0.560
LPAR1TMEM223psi-mi:“MI:0914”(association)0.530
TIMM44COL1A1psi-mi:“MI:0914”(association)0.530
ERAL1COL1A1psi-mi:“MI:0914”(association)0.530
UBXN11COL1A1psi-mi:“MI:0914”(association)0.530
CAMKMTCOL1A1psi-mi:“MI:0914”(association)0.530
COLGALT2COL1A1psi-mi:“MI:0914”(association)0.530
TMTC4CLGNpsi-mi:“MI:0914”(association)0.530
LAIR2LAMA5psi-mi:“MI:0914”(association)0.530
COL1A1GOLIM4psi-mi:“MI:0914”(association)0.500

BioGRID (141): COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS), COL1A1 (Affinity Capture-MS)

ESM2 similar proteins: C0HJN3, C0HJN4, C0HJN5, C0HJN7, C0HJN9, C0HJP0, C0HJP1, C0HJP3, C0HJP5, C0HJP6, C0HJP7, C0HJP8, C0HLG7, C0HLG9, C0HLH1, C0HLH3, C0HLH5, C0HLH9, C0HLI1, C0HLI3, C0HLI5, C0HLI7, C0HLI9, C0HLJ1, C0HLJ3, C0HLJ5, C0HLJ7, C0HLJ9, C0HM84, C0HM85, C0HM93, C0HM95, O46392, P02452, P02453, P02454, P02457, P02461, P02465, P02466

Diamond homologs: A0MSJ1, C7DZK3, O42350, O88207, P02452, P02457, P02458, P02459, P02460, P02461, P02466, P05997, P08121, P12105, P12107, P13941, P13942, P20908, P20909, Q17RW2, Q30D77, Q32S24, Q3U962, Q5QNQ9, Q60467, Q61245, Q64739, Q6P4Z2, Q80ZF0, Q8IZC6, Q91717, Q9JI03, Q9YIB4, B8V7R6, O46392, O93484, P02453, P02454, P02465, P02467

SIGNOR signaling

16 interactions.

AEffectBMechanism
RUNX2“up-regulates quantity by expression”COL1A1“transcriptional regulation”
MAPK14“up-regulates quantity by expression”COL1A1“transcriptional regulation”
COL1A1“up-regulates activity”“A1/b1 integrin”binding
COL1A1up-regulatesECM_synthesis
SMOC1“up-regulates quantity by expression”COL1A1“transcriptional regulation”
COL1A1“up-regulates activity”DDR1binding
COL1A1“up-regulates activity”DDR2binding
TGFB1“up-regulates quantity by expression”COL1A1“transcriptional regulation”
COL1A1up-regulates“A2/b1 integrin”binding
COL1A1up-regulatesDDR1binding
COL1A1“up-regulates activity”“A11/b1 integrin”binding
BMP1“up-regulates activity”COL1A1cleavage
COLGALT1“up-regulates activity”COL1A1glycosylation
COLGALT2“up-regulates activity”COL1A1glycosylation
MMP1“down-regulates quantity by destabilization”COL1A1cleavage
COL1A1“up-regulates activity”“A2/b1 integrin”binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 114 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Collagen biosynthesis and modifying enzymes818.4×5e-06
Developmental Lineage of Pancreatic Ductal Cells515.4×4e-03

GO biological processes:

GO termPartnersFoldFDR
collagen fibril organization716.7×2e-04

Disease & clinical

Cancer significance

From intOGen — cancer-driver classification: loss-of-function (tumor-suppressor-like) across 5 cancer types — ANSC, GBM, MEL, PAAD, PCM.

Clinical variants and AI predictions

ClinVar

3793 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1061
Likely pathogenic318
Uncertain significance779
Likely benign988
Benign103

Top pathogenic / likely-pathogenic (30)

Variant IDHGVSClassification
1065492NM_000088.4(COL1A1):c.4291del (p.Thr1431fs)Pathogenic
1069647NM_000088.4(COL1A1):c.2678G>C (p.Gly893Ala)Pathogenic
1069928NM_000088.4(COL1A1):c.2235+1G>CPathogenic
1069954NM_000088.4(COL1A1):c.394_395del (p.Asp132fs)Pathogenic
1070187NM_000088.4(COL1A1):c.3589_3590del (p.Asp1197fs)Pathogenic
1070188NM_000088.4(COL1A1):c.3569G>A (p.Gly1190Asp)Pathogenic
1070189NM_000088.4(COL1A1):c.3531+1G>CPathogenic
1070241NM_000088.4(COL1A1):c.3020del (p.Gly1007fs)Pathogenic
1070818NM_000088.4(COL1A1):c.3051del (p.Pro1018fs)Pathogenic
1070914NM_000088.4(COL1A1):c.2907_2908delinsAT (p.Arg970Ter)Pathogenic
1071489NM_000088.4(COL1A1):c.814G>A (p.Gly272Ser)Pathogenic
1072196NM_000088.4(COL1A1):c.2585_2586del (p.Lys862fs)Pathogenic
1072430NM_000088.4(COL1A1):c.1981C>T (p.Gln661Ter)Pathogenic
1072431NM_000088.4(COL1A1):c.1886del (p.Gly629fs)Pathogenic
1073168NM_000088.4(COL1A1):c.595C>T (p.Gln199Ter)Pathogenic
1073803NM_000088.4(COL1A1):c.3061_3068del (p.Glu1021fs)Pathogenic
1074006NM_000088.4(COL1A1):c.3910C>T (p.Gln1304Ter)Pathogenic
1074007NM_000088.4(COL1A1):c.3385C>T (p.Gln1129Ter)Pathogenic
1074064NM_000088.4(COL1A1):c.1797del (p.Val600fs)Pathogenic
1074183NM_000088.4(COL1A1):c.1821del (p.Gly608fs)Pathogenic
1074312NM_000088.4(COL1A1):c.1614+1G>APathogenic
1074344NM_000088.4(COL1A1):c.1804G>T (p.Gly602Ter)Pathogenic
1074370NM_000088.4(COL1A1):c.642+2T>APathogenic
1074716NM_000088.4(COL1A1):c.1451del (p.Pro484fs)Pathogenic
1074717NM_000088.4(COL1A1):c.1003G>A (p.Gly335Ser)Pathogenic
1074867NM_000088.4(COL1A1):c.3649del (p.Arg1217fs)Pathogenic
1075011NM_000088.4(COL1A1):c.2584A>T (p.Lys862Ter)Pathogenic
1076006NM_000088.4(COL1A1):c.288del (p.Asp97fs)Pathogenic
1076168NM_000088.4(COL1A1):c.3470G>A (p.Gly1157Asp)Pathogenic
1076244NM_000088.4(COL1A1):c.3302_3303insTTGTAACTATTATGAGTCCTAGTTGACTTGAAGTGGAGAAGGCTACGATTTTTTTGAAGCCGCCTAGTTTTAAGAGTACTGCGGCAAGTACTATTGACCCAGCGATGGGGGCTTCGACATGGGCTTTAGGGAGTCATAAGTGGAGTCCGTAAGGCGA (p.Glu1101delinsAspCysAsnTyrTyrGluSerTer)Pathogenic

SpliceAI

4269 predictions. Top by Δscore:

VariantEffectΔscore
17:50185647:CT:Cacceptor_gain1.0000
17:50185770:GCAC:Gdonor_loss1.0000
17:50185771:CACT:Cdonor_loss1.0000
17:50185772:ACTC:Adonor_loss1.0000
17:50185773:CTCAC:Cdonor_loss1.0000
17:50185774:TCAC:Tdonor_loss1.0000
17:50185775:C:CGdonor_loss1.0000
17:50185776:A:ACdonor_gain1.0000
17:50185776:A:AGdonor_loss1.0000
17:50185777:C:CCdonor_gain1.0000
17:50185777:CCGTG:Cdonor_gain1.0000
17:50186016:TCGAA:Tacceptor_gain1.0000
17:50186017:CGAA:Cacceptor_gain1.0000
17:50186017:CGAAC:Cacceptor_gain1.0000
17:50186018:GAA:Gacceptor_gain1.0000
17:50186019:AA:Aacceptor_gain1.0000
17:50186021:C:CCacceptor_gain1.0000
17:50186024:C:CTacceptor_gain1.0000
17:50186025:A:Tacceptor_gain1.0000
17:50186312:CGCA:Cdonor_gain1.0000
17:50186314:CA:Cdonor_loss1.0000
17:50186315:A:ACdonor_gain1.0000
17:50186315:A:ATdonor_loss1.0000
17:50186316:C:CCdonor_gain1.0000
17:50186503:CTCTC:Cacceptor_gain1.0000
17:50186505:CTC:Cacceptor_gain1.0000
17:50186506:TC:Tacceptor_gain1.0000
17:50186506:TCCTG:Tacceptor_loss1.0000
17:50186507:CC:Cacceptor_gain1.0000
17:50186507:CCTGT:Cacceptor_loss1.0000

AlphaMissense

9226 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
17:50185587:A:GL1437P1.000
17:50185782:C:GC1415S1.000
17:50185783:A:GC1415R1.000
17:50185783:A:TC1415S1.000
17:50185916:G:CC1370W1.000
17:50185917:C:GC1370S1.000
17:50185918:A:GC1370R1.000
17:50185918:A:TC1370S1.000
17:50185956:A:GL1357P1.000
17:50186449:G:CC1291W1.000
17:50186450:C:GC1291S1.000
17:50186450:C:TC1291Y1.000
17:50186451:A:GC1291R1.000
17:50186451:A:TC1291S1.000
17:50186669:A:GL1262P1.000
17:50186677:G:CC1259W1.000
17:50186678:C:GC1259S1.000
17:50186678:C:TC1259Y1.000
17:50186679:A:GC1259R1.000
17:50186679:A:TC1259S1.000
17:50187058:C:TG1163E1.000
17:50193959:C:TG584E1.000
17:50193968:C:TG581E1.000
17:50193969:C:GG581R1.000
17:50193969:C:TG581R1.000
17:50199863:C:GC63S1.000
17:50199864:A:TC63S1.000
17:50199869:C:GC61S1.000
17:50199870:A:GC61R1.000
17:50199870:A:TC61S1.000

dbSNP variants (sampled 300 via entrez): RS1000231009 (17:50184915 A>G), RS1000386116 (17:50189513 G>T), RS1000616928 (17:50193756 C>T), RS1000683455 (17:50195127 T>C,G), RS1000721897 (17:50188125 C>T), RS1001015192 (17:50200461 C>T), RS1001018768 (17:50200705 C>A,T), RS1001533491 (17:50200955 AC>A), RS1001554485 (17:50184485 A>G), RS1001739152 (17:50194704 C>G,T), RS1001925288 (17:50199035 A>G), RS1002020181 (17:50187555 A>G), RS1002076072 (17:50193379 T>A), RS1002741196 (17:50194471 C>G,T), RS1002752803 (17:50203026 T>G)

Disease associations

OMIM: gene MIM:120150 | disease phenotypes: MIM:166200, MIM:130000, MIM:166710, MIM:114000, MIM:166210, MIM:166220, MIM:259420, MIM:619115, MIM:261600, MIM:132400, MIM:608099, MIM:148300, MIM:607086, MIM:604841, MIM:617821, MIM:225320

GenCC curated gene-disease

DiseaseClassificationInheritance
Caffey diseaseDefinitiveAutosomal dominant
Ehlers-Danlos syndrome, arthrochalasia typeDefinitiveAutosomal dominant
osteogenesis imperfecta type 4DefinitiveAutosomal dominant
Ehlers-Danlos syndrome, classic typeDefinitiveAutosomal dominant
osteogenesis imperfecta type 3StrongAutosomal dominant
combined osteogenesis imperfecta and Ehlers-Danlos syndrome 1ModerateAutosomal dominant
osteogenesis imperfecta type 1SupportiveAutosomal dominant
osteogenesis imperfecta type 2SupportiveAutosomal dominant
Ehlers-Danlos/osteogenesis imperfecta syndromeSupportiveAutosomal dominant
high bone mass osteogenesis imperfectaSupportiveAutosomal dominant

ClinGen Gene-Disease Validity (3)

Expert-panel classifications — Definitive > Strong > Moderate > Limited > Disputed > Refuted.

DiseaseClassificationInheritance
Caffey diseaseDefinitiveAD
osteogenesis imperfectaDefinitiveAD
COL1A1-related Ehlers-Danlos syndromeDefinitiveAD

Mondo (31): osteogenesis imperfecta type 1 (MONDO:0008146), Ehlers-Danlos syndrome, classic type, 1 (MONDO:0019567), osteoporosis (MONDO:0005298), Caffey disease (MONDO:0007244), Ehlers-Danlos syndrome, arthrochalasia type (MONDO:0007525), osteogenesis imperfecta type 2 (MONDO:0008147), osteogenesis imperfecta type 4 (MONDO:0008148), osteogenesis imperfecta type 3 (MONDO:0009804), combined osteogenesis imperfecta and Ehlers-Danlos syndrome 1 (MONDO:0030854), Ehlers-Danlos syndrome (MONDO:0020066), osteogenesis imperfecta (MONDO:0019019), postmenopausal osteoporosis (MONDO:0008159), congenital heart disease (MONDO:0005453), Ehlers-Danlos syndrome, classic type (MONDO:0007522), phenylketonuria (MONDO:0009861)

Orphanet (27): Osteogenesis imperfecta type 1 (Orphanet:216796), Osteogenesis imperfecta (Orphanet:666), Caffey disease (Orphanet:1310), Arthrochalasia Ehlers-Danlos syndrome (Orphanet:1899), Osteogenesis imperfecta type 2 (Orphanet:216804), Osteogenesis imperfecta type 3 (Orphanet:216812), Osteogenesis imperfecta type 4 (Orphanet:216820), OBSOLETE: Ehlers-Danlos syndrome type 7A (Orphanet:99875), OBSOLETE: Ehlers-Danlos syndrome type 7B (Orphanet:99876), Ehlers-Danlos syndrome (Orphanet:98249), Classical Ehlers-Danlos syndrome (Orphanet:287), Phenylketonuria (Orphanet:716), Multiple epiphyseal dysplasia type 1 (Orphanet:93308), Alpha-sarcoglycan-related limb-girdle muscular dystrophy R3 (Orphanet:62), Familial prostate cancer (Orphanet:1331)

HPO phenotypes

186 total (30 of 186 shown, HPO-id order):

HPOTerm
HP:0000006Autosomal dominant inheritance
HP:0000015Bladder diverticulum
HP:0000023Inguinal hernia
HP:0000139Uterine prolapse
HP:0000239Large fontanelles
HP:0000260Wide anterior fontanel
HP:0000272Malar flattening
HP:0000278Retrognathia
HP:0000286Epicanthus
HP:0000316Hypertelorism
HP:0000324Facial asymmetry
HP:0000325Triangular face
HP:0000347Micrognathia
HP:0000362Otosclerosis
HP:0000365Hearing impairment
HP:0000444Convex nasal ridge
HP:0000481Abnormal cornea morphology
HP:0000520Proptosis
HP:0000592Blue sclerae
HP:0000703Dentinogenesis imperfecta
HP:0000708Atypical behavior
HP:0000883Thin ribs
HP:0000926Platyspondyly
HP:0000938Osteopenia
HP:0000939Osteoporosis
HP:0000963Thin skin
HP:0000974Hyperextensible skin
HP:0000978Bruising susceptibility
HP:0000993Molluscoid pseudotumors
HP:0001001Abnormality of subcutaneous fat tissue

GWAS associations

5 associations (top):

StudyTraitp-value
GCST000079_6Breast cancer8.000000e-08
GCST001692_9Response to taxane treatment (docetaxel)1.000000e-06
GCST006979_501Heel bone mineral density7.000000e-24
GCST008338_7Blood cell traits (multivariate analysis)1.000000e-08
GCST90013442_31Keratoconus3.000000e-09

EFO canonical traits (8, from GWAS)

EFO IDTrait name
EFO:0009270heel bone mineral density
EFO:0004305erythrocyte count
EFO:0004309platelet count
EFO:0004528mean corpuscular hemoglobin concentration
EFO:0004833neutrophil count
EFO:0004842eosinophil count
EFO:0005090basophil count
EFO:0005091monocyte count

MeSH disease descriptors (21)

DescriptorNameTree numbers
D001238Asphyxia NeonatorumC16.614.092
D002311Cardiomyopathy, DilatedC14.280.195.160; C14.280.238.070; C16.320.488.750
D002312Cardiomyopathy, HypertrophicC14.280.238.100; C14.280.484.048.750.070.160
D003240Connective Tissue DiseasesC17.300
D004535Ehlers-Danlos SyndromeC14.907.454.240; C15.378.463.515.240; C16.131.831.428; C16.320.850.260; C17.300.200.310; C17.800.804.428; C17.800.827.260
D006330Heart Defects, CongenitalC14.240.400; C14.280.400; C16.131.240.400
D006958Hyperostosis, Cortical, CongenitalC05.116.099.708.479; C05.116.540.400; C16.614.465
D007640KeratoconusC11.204.627
D010013Osteogenesis ImperfectaC05.116.099.708.685; C16.320.737; C17.300.200.540
D010024OsteoporosisC05.116.198.579; C18.452.104.579
D015663Osteoporosis, PostmenopausalC05.116.198.579.610; C18.452.104.579.610
D010661PhenylketonuriasC10.228.140.163.100.687; C16.320.565.100.766; C16.320.565.189.687; C18.452.132.100.687; C18.452.648.100.766; C18.452.648.189.687
D011471Prostatic NeoplasmsC04.588.945.440.770; C12.100.500.260.750; C12.100.500.565.625; C12.200.294.260.750; C12.200.294.565.625; C12.200.758.409.750; C12.900.619.750
C565061EDS VIIB (supp.)
C562625Ehlers-Danlos Syndrome, Type VII, Autosomal Dominant (supp.)
C536194Ehlers-Danlos syndrome type 1 (supp.)
C536200Ehlers-Danlos syndrome, cardiac valvular form (supp.)
C565178OI-EDS Combined Syndrome (supp.)
C536042Osteogenesis imperfecta, type 2A (supp.)
C536044Osteogenesis imperfecta, type 3 (supp.)
C537493Stickler syndrome, type 2 (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (2): CHEMBL2364188 (PROTEIN COMPLEX GROUP), CHEMBL3030 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

PharmGKB clinical annotations

2 annotations.

VariantTypeLevelDrugsPhenotypes
rs1800012Dosage3somatropin recombinant
rs1800012Toxicity3valproic acidEpilepsy

PharmGKB variants

1 variants.

VariantGenesLevelScore#Clin annotsDrugs
rs1800012COL1A132.002valproic acid;somatropin recombinant

ChEMBL bioactivities

5 potent at pChembl≥5 of 19 total, top 5 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
5.70IC502000nMCHEMBL5415605
5.11IC507830nMCHEMBL5194526
5.10IC508010nMCHEMBL5398209
5.09IC508190nMCHEMBL5415777
5.00IC501e+04nMCHEMBL5440427

PubChem BioAssay actives

5 with measured affinity, of 136 total; 5 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
2-[(5R,6R,9S,13S)-6-butyl-1,7-diazatricyclo[7.3.1.05,13]tridecan-7-yl]-N-(3-phenylphenyl)acetamide1991399: Inhibition of COL1A1 (unknown origin) expressed in human LX2 cellsic502.0000uM
2,3,10-trimethoxy-9-[(4-propan-2-ylphenyl)methoxy]-5,6-dihydroisoquinolino[2,1-b]isoquinolin-7-ium bromide2032035: Inhibition of COL1A1 (unknown origin) expressed in human LX2 cells incubated for 48 hrs by Bright-Glo luciferase assayic507.8300uM
2-[(5R,6R,9S,13S)-6-butyl-1,7-diazatricyclo[7.3.1.05,13]tridecan-7-yl]-N-(2,4-dichlorophenyl)acetamide1991399: Inhibition of COL1A1 (unknown origin) expressed in human LX2 cellsic508.0100uM
2,3,10-trimethoxy-9-[(3-methylphenyl)methoxy]-5,6-dihydroisoquinolino[2,1-b]isoquinolin-7-ium bromide2032035: Inhibition of COL1A1 (unknown origin) expressed in human LX2 cells incubated for 48 hrs by Bright-Glo luciferase assayic508.1900uM
(5R,6R,9S,13S)-6-butyl-N-(4-butylphenyl)-1,7-diazatricyclo[7.3.1.05,13]tridecane-7-carboxamide1991399: Inhibition of COL1A1 (unknown origin) expressed in human LX2 cellsic5010.0000uM

CTD chemical–gene interactions

326 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Resveratroldecreases expression, affects secretion, increases reaction, increases oxidation, increases phosphorylation (+5 more)9
sodium arsenitedecreases reaction, affects reaction, affects methylation, decreases expression, affects cotreatment (+2 more)8
Valproic Acidaffects cotreatment, increases expression, affects expression, decreases expression8
Particulate Matterdecreases expression, increases abundance, increases expression7
Dexamethasonedecreases reaction, increases expression, affects reaction, increases reaction, affects cotreatment6
methylmercuric chlorideaffects cotreatment, increases expression4
trichostatin Aaffects expression, affects cotreatment, increases expression4
Arsenic Trioxidedecreases expression, decreases reaction, increases expression4
Benzo(a)pyrenedecreases expression, increases expression, increases methylation4
Glucoseaffects secretion, affects reaction, decreases expression, decreases reaction, increases expression4
Quercetinincreases expression, increases secretion, decreases secretion, decreases expression, decreases reaction4
Tretinoindecreases reaction, increases expression, decreases expression4
ascorbate-2-phosphateincreases expression, affects cotreatment, decreases reaction3
beta-glycerophosphoric acidaffects cotreatment, affects reaction, decreases reaction, increases expression3
2-(2-amino-3-methoxyphenyl)-4H-1-benzopyran-4-onedecreases activity, decreases reaction, increases expression, increases phosphorylation, decreases expression3
Y 27632decreases reaction, increases expression3
Air Pollutantsaffects cotreatment, increases abundance, increases expression, decreases expression3
Ascorbic Acidincreases expression, affects cotreatment, affects reaction, decreases reaction3
Calcitriolincreases expression, decreases reaction3
Hydrogen Peroxidedecreases expression, increases expression, decreases reaction3
Methotrexatedecreases expression, increases expression3
Oxygendecreases reaction, increases expression, affects reaction3
Ozoneaffects cotreatment, increases expression, increases abundance3
Silicon Dioxidedecreases reaction, increases expression, decreases expression, increases response to substance3
Tetrachlorodibenzodioxinincreases expression, affects expression3
Tobacco Smoke Pollutionaffects expression, decreases expression3
Asbestos, Crocidoliteincreases expression, decreases expression3
Cadmium Chloridedecreases reaction, increases abundance, increases expression, decreases expression3
baicaleinaffects cotreatment, affects reaction, decreases reaction, increases expression2
bisphenol Aincreases abundance, increases expression, affects cotreatment, affects expression2

ChEMBL screening assays

8 unique, capped per target: 8 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5338438BindingBinding affinity to Col1a1 (unknown origin) at 200 uM preincubated for 2 hrs followed by pronase addition and measured after 30 mins by coomassie blue staining based SDS-PAGE gel analysisStructurally Diverse Alkaloids with Anti-Renal-Fibrosis Activity from the Centipede Scolopendra subspinipes mutilans. — J Nat Prod

Cellosaurus cell lines

22 cell lines: 12 cancer cell line, 8 induced pluripotent stem cell, 1 finite cell line, 1 embryonic stem cell

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_B7E2SMBCi014-AInduced pluripotent stem cellFemale
CVCL_B8DYAbcam HCT 116 COL1A1 KOCancer cell lineMale
CVCL_B8U9Abcam MCF-7 COL1A1 KOCancer cell lineFemale
CVCL_B9G6Abcam A-549 COL1A1 KOCancer cell lineMale
CVCL_C0BNAbcam U2OS COL1A1 KOCancer cell lineFemale
CVCL_D4A8MDFSP-S1Cancer cell lineFemale
CVCL_D4ZCOI-Pt1 iPSCInduced pluripotent stem cellFemale
CVCL_D4ZDOI-Pt2 iPSCInduced pluripotent stem cellMale
CVCL_D4ZEOI-Pt1 iPSC correctedInduced pluripotent stem cellFemale
CVCL_D4ZKNCC-DFSP5-C1Cancer cell lineMale

Clinical trials (associated diseases)

305 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00006180PHASE4COMPLETEDBone Loss in Premenopausal Women With Depression
NCT00035256PHASE4COMPLETEDSequential Use of Teriparatide and Raloxifene HCl in the Treatment of Postmenopausal Women With Osteoporosis
NCT00035971PHASE4COMPLETEDEVA: Evista Alendronate Comparison
NCT00114556PHASE4COMPLETEDThe Effect of Zoledronic Acid on Bone Density in Liver Transplant Patients
NCT00130403PHASE4COMPLETEDOPTAMISE: Clinical Effectiveness of Teriparatide After Alendronate or Risedronate Therapy in Osteoporotic Postmenopausal Women
NCT00148915PHASE4COMPLETEDA Study To Assess the Quality and Strength of Bone in Women Participants With Osteoporosis Taking Oral Ibandronate Versus Placebo
NCT00157690PHASE4COMPLETEDStudy of Alendronate to Prevent and Treat Osteoporosis in Cystic Fibrosis Patients
NCT00159419PHASE4COMPLETEDBisphosphonate Therapy for Osteogenesis Imperfecta
NCT00165607PHASE4COMPLETEDRandomized, Open, Parallel, Active Controlled Study on Fracture Prevention in Antiosteoporosis Treatment (OF Study)
NCT00168909PHASE4COMPLETEDInfluence of Alfacalcidol on Falls in Osteopenic/Osteoporotic Postmenopausal Women (ALFA Study)
NCT00182871PHASE4COMPLETEDTestosterone Effects on Bone and Frailty
NCT00191425PHASE4COMPLETED2-Year Therapy With Teriparatide vs 1-yr Therapy Followed by 1-Year of Raloxifene or Calcium/Vit D in Severe Postmenopausal Osteoporosis
NCT00211211PHASE4COMPLETEDFREE Study - Fracture Reduction Evaluation
NCT00221299PHASE4COMPLETEDRisedronate and Parathyroid Hormone to Reverse Osteoporosis Caused by Chronic Steroid Use
NCT00252408PHASE4COMPLETEDDanish Osteoporosis Prevention Study
NCT00259298PHASE4COMPLETEDEvaluation of the Effects of Teriparatide on Skeleton Images in Postmenopausal Women With Osteoporosis
NCT00261625PHASE4COMPLETEDCan Alendronate Suppress Calcification and Improve Bone Density in Chronic Peritoneal Dialysis Patients?
NCT00271713PHASE4COMPLETEDImpact of Oral Ibandronate 150 mg Monthly on Structural Properties of Bone in Postmenopausal Osteoporosis (SPIMOS-3D)
NCT00294463PHASE4COMPLETEDEffects of Tibolone Treatment on the Endometrium
NCT00327990PHASE4COMPLETEDEvaluation Of Missed Osteoporosis Diagnoses, And Preference Between Once Monthly Ibandronate And Once Weekly Alendronate
NCT00357331PHASE4COMPLETEDThe Effects of Potassium Citrate on Bone Metabolism
NCT00371956PHASE4COMPLETEDRaloxifene for Prevention of Bone Loss in Postmenopausal Patients Receiving Chronic Corticosteroid Therapy
NCT00372372PHASE4COMPLETEDThe Efficacy of Risedronate in Prevention of Bone Loss in Patients Receiving High Dose Corticosteroid Treatment
NCT00376662PHASE4COMPLETEDHRT Versus Etidronate for Osteoporosis and Fractures in Asthmatics Receiving Glucocorticoids.
NCT00402441PHASE4COMPLETEDRisedronate in the Prevention of Osteoporosis in Postmenopausal Women
NCT00405392PHASE4COMPLETEDStudy To Investigate Patient Preference On Dosing In Ibandronate And Risedronate In Korean Women With Postmenopausal Osteoporosis
NCT00431444PHASE4COMPLETEDEffects of Zoledronic Acid and Raloxifene on Bone Turnover Markers in Postmenopausal Women With Low Bone Mineral Density
NCT00446589PHASE4TERMINATEDThe Effects of Ibandronate or Teriparatide Therapy on Bone Histology and Biochemical Indices in Patients on Hemodialysis With Low Bone Mineral Density
NCT00453492PHASE4COMPLETEDRisedronate Sodium in Post Menopausal Osteoporosis
NCT00460057PHASE4COMPLETEDThe Change of Bone Markers After Low Dose Alendronate in Postmenopausal Women With Bone Loss
NCT00479037PHASE4COMPLETEDEffect of Full Length Parathyroid Hormone, PTH(1-84) or Strontium Ranelate on Bone Markers in Postmenopausal Women With Primary Osteoporosis (FP-006-IM)
NCT00485953PHASE4COMPLETEDEffect of Bisphosphonate on Bone Loss in Postmenopausal Women With Breast Cancer Initiating Aromatase Inhibitor Therapy
NCT00489424PHASE4COMPLETEDAcetaminophen or Fluvastatin Compared to Placebo on the Transient Post-Dose Symptoms (PDS) Following an Intravenous (i.v.) Infusion of a Single Dose of Zoledronic Acid 5mg, in Post-menopausal Women With Low Bone Mass
NCT00491920PHASE4COMPLETEDHigh Dosage Vitamin D and Osteoporosis
NCT00504166PHASE4COMPLETEDAlendronate Prevents Microarchitectural Deterioration of Trabecular Bone in Early Postmenopausal Women
NCT00544180PHASE4TERMINATEDROSPA - Record on Satisfaction of Patients With Actonel 35 mg Once a Week
NCT00548509PHASE4COMPLETEDThe Effect of Vitamin K2 on Bone Turnover
NCT00549068PHASE4COMPLETEDPOWER Point of Care Effect on Satisfaction of Treatment
NCT00549965PHASE4COMPLETEDSatisfaction and Compliance of Risedronate in PMO
NCT00557310PHASE4COMPLETEDA Study Using Imaging Technology to Measure Changes in Bone Structure After Treatment With Teriparatide