P4HA1

gene
On this page

Also known as C-P4Halpha(I)

Summary

P4HA1 (prolyl 4-hydroxylase subunit alpha 1, HGNC:8546) is a protein-coding gene on chromosome 10q22.1, encoding Prolyl 4-hydroxylase subunit alpha-1 (P13674). Catalyzes the post-translational formation of 4-hydroxyproline in -Xaa-Pro-Gly- sequences in collagens and other proteins.

This gene encodes a component of prolyl 4-hydroxylase, a key enzyme in collagen synthesis composed of two identical alpha subunits and two beta subunits. The encoded protein is one of several different types of alpha subunits and provides the major part of the catalytic site of the active enzyme. In collagen and related proteins, prolyl 4-hydroxylase catalyzes the formation of 4-hydroxyproline that is essential to the proper three-dimensional folding of newly synthesized procollagen chains. Alternatively spliced transcript variants encoding different isoforms have been described.

Source: NCBI Gene 5033 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): connective tissue disorder (Limited, GenCC)
  • GWAS associations: 1
  • Clinical variants (ClinVar): 85 total — 1 likely-pathogenic
  • Druggable target: yes
  • MANE Select transcript: NM_001017962

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:8546
Approved symbolP4HA1
Nameprolyl 4-hydroxylase subunit alpha 1
Location10q22.1
Locus typegene with protein product
StatusApproved
AliasesC-P4Halpha(I)
Ensembl geneENSG00000122884
Ensembl biotypeprotein_coding
OMIM176710
Entrez5033

Gene structure

Transcript identifiers

Ensembl transcripts: 28 — 27 protein_coding, 1 retained_intron

ENST00000263556, ENST00000307116, ENST00000373008, ENST00000394890, ENST00000440381, ENST00000464310, ENST00000886200, ENST00000886201, ENST00000886202, ENST00000886203, ENST00000886204, ENST00000886205, ENST00000886206, ENST00000886207, ENST00000886208, ENST00000886209, ENST00000886210, ENST00000924332, ENST00000924333, ENST00000924334, ENST00000951388, ENST00000951389, ENST00000951390, ENST00000951391, ENST00000951392, ENST00000951393, ENST00000951394, ENST00000951395

RefSeq mRNA: 4 — MANE Select: NM_001017962 NM_000917, NM_001017962, NM_001142595, NM_001142596

CCDS: CCDS41537, CCDS44432, CCDS7320

Canonical transcript exons

ENST00000394890 — 15 exons

ExonStartEnd
ENSE000009871247306884673068983
ENSE000011020187307202973072180
ENSE000011020257307373173073827
ENSE000011699447300980773009903
ENSE000011699657301684673016899
ENSE000011699727303027173030370
ENSE000011699917305105373051249
ENSE000011699987305335173053590
ENSE000011700367300721773008292
ENSE000011700447307480873074915
ENSE000033116667304498173045051
ENSE000034409837304692573047101
ENSE000035138887301422473014289
ENSE000036525557301096973011037
ENSE000039981597309676673096866

Expression profiles

Bgee: expression breadth ubiquitous, 293 present calls, max score 98.82.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 84.0859 / max 1828.8941, expressed in 1822 samples.

FANTOM5 promoters (6 alternative TSS)

Promoter IDTPM avgSamples expressed
11003482.69911822
2059000.7381386
2058990.3197116
1100350.203444
1100320.095829
1100330.02975

Top tissues by expression

298 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
cartilage tissueUBERON:000241898.82gold quality
calcaneal tendonUBERON:000370197.66gold quality
tibiaUBERON:000097997.41gold quality
ventricular zoneUBERON:000305397.23gold quality
pericardiumUBERON:000240797.20gold quality
stromal cell of endometriumCL:000225597.14gold quality
periodontal ligamentUBERON:000826696.92gold quality
tendonUBERON:000004396.21gold quality
gastrocnemiusUBERON:000138895.93gold quality
muscle of legUBERON:000138395.46gold quality
tibialis anteriorUBERON:000138595.19gold quality
triceps brachiiUBERON:000150995.16gold quality
gluteal muscleUBERON:000200095.12gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451195.07gold quality
adrenal tissueUBERON:001830394.95gold quality
muscle organUBERON:000163094.89gold quality
skeletal muscle organUBERON:001489294.89gold quality
biceps brachiiUBERON:000150794.76gold quality
deltoidUBERON:000147694.74gold quality
heart right ventricleUBERON:000208094.66gold quality
skeletal muscle tissue of biceps brachiiUBERON:000450294.45gold quality
skeletal muscle tissueUBERON:000113494.06gold quality
choroid plexus epitheliumUBERON:000391193.95gold quality
tendon of biceps brachiiUBERON:000818893.81gold quality
endometriumUBERON:000129593.70gold quality
muscle tissueUBERON:000238593.36gold quality
caput epididymisUBERON:000435893.24gold quality
vastus lateralisUBERON:000137992.84gold quality
smooth muscle tissueUBERON:000113592.77gold quality
nephron tubuleUBERON:000123192.74gold quality

Single-cell (SCXA)

Detected in 7 experiment(s), a significant marker in 7.

ExperimentMarker?Max mean expression
E-GEOD-180759yes4059.16
E-MTAB-11121yes510.39
E-HCAD-5yes31.89
E-CURD-112yes15.17
E-HCAD-13yes12.43
E-MTAB-9388yes11.02
E-ANND-3no0.00

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): SP1, USF1, USF2

miRNA regulators (miRDB)

107 targeting P4HA1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-511-3P99.9968.851467
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-4789-3P99.9970.752484
HSA-MIR-499A-5P99.9870.791323
HSA-MIR-551B-5P99.9671.283493
HSA-MIR-365899.9673.874379
HSA-MIR-493-5P99.9672.472382
HSA-MIR-1250-3P99.9670.044038
HSA-MIR-3912-5P99.9566.11925
HSA-MIR-391099.9571.132227
HSA-MIR-144-3P99.9473.982698
HSA-MIR-311999.9271.342390
HSA-MIR-3529-3P99.9073.553045
HSA-MIR-808799.9069.551351
HSA-MIR-124-3P99.8973.743043
HSA-MIR-506-3P99.8973.553057
HSA-MIR-990299.8969.152250
HSA-MIR-153-5P99.8973.866317
HSA-MIR-6780A-5P99.8866.692776
HSA-MIR-3121-3P99.8271.963630
HSA-MIR-442099.8270.081624
HSA-MIR-181B-2-3P99.8170.061646
HSA-MIR-181B-3P99.8170.061646
HSA-MIR-1273H-5P99.7766.322471
HSA-MIR-467999.7669.191229

Literature-anchored findings (GeneRIF, showing 33)

  • These in vivo data demonstrated that smokers had thinner atherosclerotic cap thickness and lower levels of P4Halpha and collagen. (PMID:15369792)
  • Collagen prolyl 4-hydroxylase-alpha (I)mRNA is stabilized by interation of RNA-binding proteins hnRNP-A2/B1 with a U(16) element within the 3’-UTR (PMID:16464861)
  • positive (transforming growth factor beta1) and negative (cigarette smoking extract) regulators appear to influence the USF-E-box interaction and affect P4Halpha(I) expression (PMID:16488890)
  • Results suggest that the alteration of translational efficiency by nucleolin, which occurs through a hypoxia inducible factor independent pathway, is an important step in collagen prolyl 4-hydroxylase-alpha(I) regulation under hypoxia. (PMID:16837461)
  • In comparison with healthy cartilage, Osteoarthritis articular chondrocytes exhibit increased in vivo synthesis of collagen prolyl-4-hydroxylase type II, a pivotal enzyme in collagen triple helix formation. (PMID:16877351)
  • HIF-P4H, HIF-1alpha and HIF-2alpha are effective oxygen sensors (PMID:16885164)
  • analysis of the 2-His-1-Asp active-site motif in prolyl 4-hydroxylase (PMID:19890397)
  • IL-6 significantly downregulated P4Halpha1 expression in aortic smooth muscle cells (PMID:23022409)
  • Overexpression of miR-122 markedly attenuated the expression of P4HA1 via targeting a binding site located at 3’-UTR of P4HA1 mRNA (PMID:23178710)
  • Studies indicate P4HA1 copy number gain in a subset of metastatic prostate tumors and its expression is also regulated by microRNA-124. (PMID:25115393)
  • Thus, we conclude that miR-30e suppresses proliferation of hepatoma cells through targeting P4HA1 mRNA. (PMID:26966067)
  • Findings suggest that the catalytic domain of collagen prolyl 4-hydroxylases (CP4Hs) recognizes the cis conformation of the prolyl peptide bond. (PMID:28001367)
  • Our study indicates that P4HA1 plays a pivotal role in the process of GSC-EC transdifferentiation and the structural formation of vascular BMs. (PMID:28415787)
  • we report compound heterozygous frameshift and splice site mutations in P4HA1 that impair but not abolish C-P4H alpha(I) activity. The maternal P4HA1 exon 12 splice donor site mutation causes an internally deleted C-P4H alpha(I) predicted to completely lack catalytic activity. two nucleic acid insertion in exon 9 results in a premature stop in the exon 9 P4HA1 splice form. (PMID:28419360)
  • High expression of P4HA1 was correlated with the malignancy of gliomas and could serve as a prognostic indicator for patients with high-grade gliomas (PMID:28964577)
  • miR-122 inhibited migration, invasion, epithelial mesenchymal transition, and metastasis in peritoneal cavity of ovarian cancer cells by targeting P4HA1. (PMID:30136751)
  • Collagen prolyl 4-hydroxylase alpha 1 subunit (P4HA1) protein expression is induced in triple-negative breast cancer (TNBC) and HER2 positive breast cancer. Inhibition of P4HA1 sensitizes TNBC to the chemotherapeutic agent docetaxel and doxorubicin in xenografts and patient-derived models. Increased P4HA1 expression correlates with short relapse-free survival in TNBC patients who received chemotherapy. (PMID:30367042)
  • Findings show that prolyl 4-hydroxylase subunit alpha 1 (P4HA1) can enhance hypoxia-inducible factor 1alpha (HIF1alpha) stability, indicating a positive feedback loop between HIF1alpha and P4HA1 in pancreatic ductal adenocarcinoma (PDAC). (PMID:31239153)
  • High P4HA1 expression is an independent prognostic factor for poor overall survival and recurrent-free survival in head and neck squamous cell carcinoma. (PMID:31782831)
  • Prolyl 4-hydroxylase subunit alpha 1 (P4HA1) is a biomarker of poor prognosis in primary melanomas, and its depletion inhibits melanoma cell invasion and disrupts tumor blood vessel walls. (PMID:32053263)
  • Diagnostic and Prognostic Values of P4HA1 Expression in Lung Cancer, Breast Cancer, and Head and Neck Cancer (PMID:32150689)
  • Overexpression of P4HA1 Is Correlated with Poor Survival and Immune Infiltrates in Lung Adenocarcinoma. (PMID:33299876)
  • Systematic Analysis of the Expression and Prognostic Significance of P4HA1 in Pancreatic Cancer and Construction of a lncRNA-miRNA-P4HA1 Regulatory Axis. (PMID:33415167)
  • Targeting HIF-activated collagen prolyl 4-hydroxylase expression disrupts collagen deposition and blocks primary and metastatic uveal melanoma growth. (PMID:34218269)
  • Prognostic and diagnostic roles of prolyl 4-hydroxylase subunit alpha members in breast cancer. (PMID:34387118)
  • Comprehensive Analysis of Gene Expression Profiles Identifies a P4HA1-Related Gene Panel as a Prognostic Model in Colorectal Cancer Patients. (PMID:34520234)
  • Prolyl 4-hydroxylase P4HA1 Mediates the Interplay Between Glucose Metabolism and Stemness in Pancreatic Cancer Cells. (PMID:36043766)
  • P4HA1 activates HMGCS1 to promote nasopharyngeal carcinoma ferroptosis resistance and progression. (PMID:36702290)
  • P4HA1, transcriptionally activated by STAT1, promotes esophageal cancer progression. (PMID:36734588)
  • Identification and verification of FN1, P4HA1 and CREBBP as potential biomarkers in human atrial fibrillation. (PMID:37161136)
  • P4HA1 Promotes Cell Migration and Colonization in Hypopharyngeal Squamous Cell Carcinoma. (PMID:37247900)
  • Identification and validation of INHBE and P4HA1 as hub genes in non-alcoholic fatty liver disease. (PMID:37922570)
  • P4HA1 expression and function in esophageal squamous cell carcinoma. (PMID:38134053)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_reriop4ha1bENSDARG00000071082
mus_musculusP4ha1ENSMUSG00000019916
rattus_norvegicusP4ha1ENSRNOG00000050655
drosophila_melanogasterCG31021FBGN0051021
caenorhabditis_elegansWBGENE00004026
caenorhabditis_elegansWBGENE00015773
caenorhabditis_elegansWBGENE00077688

Paralogs (3): P4HA2 (ENSG00000072682), P4HA3 (ENSG00000149380), P4HTM (ENSG00000178467)

Protein

Protein identifiers

Prolyl 4-hydroxylase subunit alpha-1P13674 (reviewed: P13674)

Alternative names: Procollagen-proline,2-oxoglutarate-4-dioxygenase subunit alpha-1

All UniProt accessions (2): P13674, Q5VSQ6

UniProt curated annotations — full annotation on UniProt →

Function. Catalyzes the post-translational formation of 4-hydroxyproline in -Xaa-Pro-Gly- sequences in collagens and other proteins.

Subunit / interactions. Heterotetramer of two alpha-1 chains and two beta chains (P4HB)(the beta chain is the multi-functional PDI), where P4HB plays the role of a structural subunit; this tetramer catalyzes the formation of 4-hydroxyproline in collagen.

Subcellular location. Endoplasmic reticulum lumen.

Tissue specificity. Expressed in the heart, liver, skeletal muscle, kidney, placenta, lung and pancreas.

Activity regulation. Inhibited by poly(L-proline).

Cofactor. Binds 1 Fe(2+) ion per subunit.

Similarity. Belongs to the P4HA family.

Isoforms (3)

UniProt IDNamesCanonical?
P13674-11yes
P13674-22
P13674-33

RefSeq proteins (4): NP_000908, NP_001017962, NP_001136067, NP_001136068 (=MANE)

Domains & families (InterPro)

IDNameType
IPR005123Oxoglu/Fe-dep_dioxygenase_domDomain
IPR006620Pro_4_hyd_alphDomain
IPR011990TPR-like_helical_dom_sfHomologous_superfamily
IPR013547P4H_NDomain
IPR019734TPR_rptRepeat
IPR044862Pro_4_hyd_alph_FE2OG_OXYDomain
IPR045054P4HA-likeFamily
IPR059068TPR_P4HDomain

Pfam: PF08336, PF13640, PF23558

Enzyme classification (BRENDA):

  • EC 1.14.11.2 — procollagen-proline 4-dioxygenase (BRENDA: 37 organisms, 182 substrates, 264 inhibitors, 251 Km, 44 kcat entries)

Substrate kinetics (BRENDA)

77 substrates with measured Km, best-characterized 15. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
2-OXOGLUTARATE0.001–137
POLY(L-PROLINE)0.0002–1.5926
(L-PRO-L-PRO-GLY)100.015–2.915
(PRO-PRO-GLY)100.011–1.2715
POLY(L-PRO)0.002–0.6515
FE2+0.0004–0.0169
O20.03–1.59
ASCORBATE0.14–0.378
(L-PRO-L-PRO-GLY)50.12–1.86
PROTOCOLLAGEN6
(L-PRO-L-ALA-L-PRO-L-LYS)30.09–0.314
(SER-PRO)50.14–0.784
DLDLEMLAPYIPMDD0.009–0.13
DLDLEMLAPYIPMDDDF0.007–0.073
EMLAPYIPMDDDFQL0.007–0.083

Catalyzed reactions (Rhea), 1 shown:

  • L-prolyl-[collagen] + 2-oxoglutarate + O2 = trans-4-hydroxy-L-prolyl-[collagen] + succinate + CO2 (RHEA:18945)

UniProt features (29 total): helix 12, binding site 4, mutagenesis site 3, splice variant 2, glycosylation site 2, signal peptide 1, chain 1, sequence conflict 1, repeat 1, strand 1, domain 1

Structure

Experimental structures (PDB)

10 structures.

PDBMethodResolution (Å)
4BTBX-RAY DIFFRACTION1.9
4BT9X-RAY DIFFRACTION1.9
2V5FX-RAY DIFFRACTION2.03
9HREX-RAY DIFFRACTION2.05
9HT8X-RAY DIFFRACTION2.15
9HPQX-RAY DIFFRACTION2.17
4BT8X-RAY DIFFRACTION2.2
1TJCX-RAY DIFFRACTION2.3
4BTAX-RAY DIFFRACTION2.95
2YQ8X-RAY DIFFRACTION2.99

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P13674-F190.230.79

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (4): 429; 431; 500; 510

Glycosylation sites (2): 113, 259

Mutagenesis-validated functional residues (3):

PositionPhenotype
210strongly reduced affinity for peptide substrate.
213strongly reduced affinity for peptide substrate.
247strongly reduced affinity for peptide substrate.

Function

Pathways and Gene Ontology

Reactome pathways

2 pathways

IDPathway
R-HSA-1650814Collagen biosynthesis and modifying enzymes
R-HSA-9918432Maturation of DENV proteins

MSigDB gene sets: 282 (showing top): WU_APOPTOSIS_BY_CDKN1A_VIA_TP53, BORCZUK_MALIGNANT_MESOTHELIOMA_UP, ZHAN_LATE_DIFFERENTIATION_GENES_UP, TGCGCANK_UNKNOWN, RODRIGUES_THYROID_CARCINOMA_POORLY_DIFFERENTIATED_UP, GOBP_COLLAGEN_FIBRIL_ORGANIZATION, HARRIS_HYPOXIA, GOMF_OXIDOREDUCTASE_ACTIVITY_ACTING_ON_PAIRED_DONORS_WITH_INCORPORATION_OR_REDUCTION_OF_MOLECULAR_OXYGEN, MAZ_Q6, MENSE_HYPOXIA_UP, AP2_Q3, MODULE_16, GTGCCTT_MIR506, CAGCAGG_MIR370, CADWELL_ATG16L1_TARGETS_DN

GO Biological Process (1): collagen fibril organization (GO:0030199)

GO Molecular Function (9): procollagen-proline 4-dioxygenase activity (GO:0004656), iron ion binding (GO:0005506), L-ascorbic acid binding (GO:0031418), identical protein binding (GO:0042802), protein binding (GO:0005515), oxidoreductase activity (GO:0016491), oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (GO:0016705), metal ion binding (GO:0046872), dioxygenase activity (GO:0051213)

GO Cellular Component (6): mitochondrion (GO:0005739), endoplasmic reticulum (GO:0005783), endoplasmic reticulum lumen (GO:0005788), membrane (GO:0016020), procollagen-proline 4-dioxygenase complex (GO:0016222), collagen trimer (GO:0005581)

Reactome top-level categories

Rollup of top-2 pathways:

CategoryPathways
Collagen formation1
Dengue Virus Genome Translation and Replication1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
oxidoreductase activity2
cytoplasm2
intracellular membrane-bounded organelle2
extracellular matrix organization1
procollagen-proline dioxygenase activity1
peptidyl-proline 4-dioxygenase activity1
transition metal ion binding1
vitamin binding1
carboxylic acid binding1
monosaccharide binding1
heterocyclic compound binding1
protein binding1
binding1
catalytic activity1
cation binding1
endomembrane system1
endoplasmic reticulum1
intracellular organelle lumen1
cellular anatomical structure1
endoplasmic reticulum protein-containing complex1
oxidoreductase complex1
protein-containing complex1

Protein interactions and networks

STRING

2276 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
P4HA1P4HBP07237979
P4HA1PLOD1Q02809871
P4HA1PLOD2O00469824
P4HA1P3H2Q8IVL5779
P4HA1P3H1Q32P28766
P4HA1EGLN1Q9GZT9730
P4HA1HIF1AQ16665708
P4HA1HIF3AQ9Y2N7685
P4HA1PDIA3P30101675
P4HA1PLOD3O60568671
P4HA1EGLN3Q9H6Z9671
P4HA1PDIA4P13667671
P4HA1COL13A1Q5TAT6658
P4HA1ARNTP27540633
P4HA1AGO4Q9HCK5623

IntAct

193 interactions, top by confidence:

ABTypeScore
RBM8ACASC3psi-mi:“MI:0914”(association)0.900
MED9MED19psi-mi:“MI:0914”(association)0.790
NSPIK3R2psi-mi:“MI:0914”(association)0.750
WASLWIPF3psi-mi:“MI:0914”(association)0.740
P4HBP4HA2psi-mi:“MI:0914”(association)0.740
COL1A1P4HA2psi-mi:“MI:0915”(physical association)0.710
CFTRESYT2psi-mi:“MI:2364”(proximity)0.710
SNRPD2GEMIN2psi-mi:“MI:0914”(association)0.710
PTK2TGFB1I1psi-mi:“MI:0914”(association)0.680
P4HA1P4HA1psi-mi:“MI:0407”(direct interaction)0.650
DNAJC8SF3B1psi-mi:“MI:0914”(association)0.640
SF3B1SAP18psi-mi:“MI:0914”(association)0.640
IRS4PIK3R2psi-mi:“MI:0914”(association)0.640
SF3B4SF3B1psi-mi:“MI:0914”(association)0.610
IGF1RPIK3R2psi-mi:“MI:2364”(proximity)0.590
HRGPLSCR1psi-mi:“MI:0914”(association)0.590
P4HA1P4HA2psi-mi:“MI:0915”(physical association)0.560
FOXP3FOXP2psi-mi:“MI:0914”(association)0.530
EBNA-LPHAX1psi-mi:“MI:0914”(association)0.530
EDAAP3B1psi-mi:“MI:0914”(association)0.530
ZNF397ZNF197psi-mi:“MI:0914”(association)0.530
TIGD5P4HA2psi-mi:“MI:0914”(association)0.530
L3MBTL3P4HA1psi-mi:“MI:0914”(association)0.530
COLQPLOD3psi-mi:“MI:0914”(association)0.530

BioGRID (363): P4HA1 (Affinity Capture-MS), P4HA1 (Affinity Capture-MS), EVPL (Co-fractionation), P4HA1 (Co-fractionation), P4HA1 (Co-fractionation), P4HA1 (Proximity Label-MS), P4HA1 (Proximity Label-MS), P4HA1 (Affinity Capture-MS), P4HA1 (Affinity Capture-MS), P4HA1 (Affinity Capture-MS), P4HA1 (Affinity Capture-MS), P4HA1 (Affinity Capture-MS), P4HA1 (Affinity Capture-MS), P4HA1 (Affinity Capture-MS), P4HA1 (Affinity Capture-MS)

ESM2 similar proteins: A0A0B5GR44, A0A166YZY4, A7XDQ9, F4HSU3, F4ILF8, F4KF16, O04084, O15460, O23364, P13674, P16924, P43295, P54001, Q0WRX3, Q0WUG6, Q10576, Q16820, Q1PFD9, Q1RMU3, Q4V398, Q5RAG8, Q5UP57, Q5ZLK5, Q60715, Q60716, Q6E279, Q6W3E9, Q84LR6, Q84WF0, Q8GXT7, Q8L7B2, Q8LED3, Q8LPF8, Q8RXQ1, Q8VZJ7, Q8VZU3, Q93ZR8, Q9C7Z9, Q9CAU0, Q9LSV8

Diamond homologs: F4ILF8, F4J0A8, F4JAU3, F4JNU8, F4JZ24, O15460, P0DUB0, P13674, P16924, P54001, Q10576, Q19673, Q1RMU3, Q20065, Q24JN5, Q5RAG8, Q5UP57, Q5ZLK5, Q60715, Q60716, Q6W3E9, Q6W3F0, Q75UG4, Q7Z4N8, Q8BG58, Q8GXT7, Q8L970, Q8LAN3, Q8VZJ7, Q9LN20, Q9NXG6, Q9SZT0, Q9ZW86

SIGNOR signaling

6 interactions.

AEffectBMechanism
HIF1A“up-regulates quantity by expression”P4HA1“transcriptional regulation”
P4HA1“up-regulates quantity by expression”HMGCS1“transcriptional regulation”
P4HA1“up-regulates quantity by expression”HMGCR“transcriptional regulation”
P4HA1“up-regulates quantity”Collagen“chemical modification”
P4HA1“up-regulates activity”HIF1A
P4HA1up-regulatesGlycolysis

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 201 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
mRNA Splicing - Minor Pathway710.9×3e-04
mRNA Splicing1410.7×2e-08
RNA Polymerase II Transcription Termination710.7×3e-04
mRNA Polyadenylation1710.4×4e-10
mRNA 3’-end processing79.6×6e-04
Collagen biosynthesis and modifying enzymes89.5×2e-04
SARS-CoV-2 modulates host translation machinery69.3×3e-03
snRNP Assembly68.8×3e-03

GO biological processes:

GO termPartnersFoldFDR
U2-type prespliceosome assembly932.3×3e-09
spliceosomal snRNP assembly723.4×5e-06
peptidyl-tyrosine phosphorylation819.4×3e-06
vascular endothelial growth factor receptor signaling pathway513.8×5e-03
mRNA splicing, via spliceosome1910.0×7e-11
cell surface receptor protein tyrosine kinase signaling pathway1010.0×2e-05
protein autophosphorylation86.7×5e-03
RNA splicing115.6×1e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

85 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic1
Uncertain significance52
Likely benign8
Benign3

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
3024166NM_001017962.3(P4HA1):c.1148+1030C>TLikely pathogenic

SpliceAI

2467 predictions. Top by Δscore:

VariantEffectΔscore
10:73008288:GGATA:Gacceptor_gain1.0000
10:73008289:GATA:Gacceptor_gain1.0000
10:73008289:GATAC:Gacceptor_loss1.0000
10:73008290:ATAC:Aacceptor_loss1.0000
10:73008291:TA:Tacceptor_gain1.0000
10:73008291:TACT:Tacceptor_loss1.0000
10:73008292:ACTG:Aacceptor_loss1.0000
10:73008293:C:CCacceptor_gain1.0000
10:73008293:C:CGacceptor_loss1.0000
10:73008294:T:Aacceptor_loss1.0000
10:73010963:GCTTA:Gdonor_loss1.0000
10:73010964:CTTAC:Cdonor_loss1.0000
10:73010965:TTA:Tdonor_loss1.0000
10:73010966:TACT:Tdonor_loss1.0000
10:73010967:A:ACdonor_gain1.0000
10:73010967:A:Tdonor_loss1.0000
10:73010968:C:CCdonor_gain1.0000
10:73010968:CT:Cdonor_gain1.0000
10:73010968:CTT:Cdonor_gain1.0000
10:73010968:CTTT:Cdonor_gain1.0000
10:73010968:CTTTT:Cdonor_gain1.0000
10:73011033:CTCAT:Cacceptor_gain1.0000
10:73011034:TCAT:Tacceptor_gain1.0000
10:73011035:CAT:Cacceptor_gain1.0000
10:73011035:CATC:Cacceptor_gain1.0000
10:73011036:ATCT:Aacceptor_loss1.0000
10:73011037:TCTAT:Tacceptor_loss1.0000
10:73011038:C:CCacceptor_gain1.0000
10:73011038:C:CGacceptor_loss1.0000
10:73013185:CATG:Cdonor_gain1.0000

AlphaMissense

3508 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
10:73014245:T:AR449S1.000
10:73014245:T:GR449S1.000
10:73008279:C:AW516C0.999
10:73008279:C:GW516C0.999
10:73008281:A:GW516R0.999
10:73008281:A:TW516R0.999
10:73009808:C:AW511C0.999
10:73009808:C:GW511C0.999
10:73009810:A:GW511R0.999
10:73009810:A:TW511R0.999
10:73009811:T:AK510N0.999
10:73009811:T:GK510N0.999
10:73009832:A:CC503W0.999
10:73009833:C:TC503Y0.999
10:73009834:A:GC503R0.999
10:73009843:G:CH500D0.999
10:73009878:A:GL488P0.999
10:73009880:A:CN487K0.999
10:73009880:A:TN487K0.999
10:73009886:C:AW485C0.999
10:73009886:C:GW485C0.999
10:73009888:A:GW485R0.999
10:73009888:A:TW485R0.999
10:73009896:G:TA482D0.999
10:73011002:A:CF468L0.999
10:73011002:A:TF468L0.999
10:73011003:A:CF468C0.999
10:73011003:A:GF468S0.999
10:73011004:A:GF468L0.999
10:73014240:G:TA451D0.999

dbSNP variants (sampled 300 via entrez): RS1000021879 (10:73097663 A>C), RS1000052993 (10:73097197 T>C), RS1000056416 (10:73059181 T>C), RS1000080456 (10:73021320 A>G), RS1000112418 (10:73013270 T>C), RS1000115429 (10:73053376 C>A,T), RS1000142494 (10:73072858 A>G), RS1000184791 (10:73055025 G>A), RS1000198341 (10:73072430 T>C), RS1000210352 (10:73040516 G>A), RS1000215167 (10:73091356 G>A), RS1000264702 (10:73027300 G>C,T), RS1000297046 (10:73079934 T>C), RS1000347442 (10:73033681 A>C), RS1000368861 (10:73078904 G>A)

Disease associations

OMIM: gene MIM:176710 | disease phenotypes:

GenCC curated gene-disease

DiseaseClassificationInheritance
connective tissue disorderLimitedAutosomal recessive

Mondo (1): connective tissue disorder (MONDO:0003900)

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 associations (top):

StudyTraitp-value
GCST90011898_75Alanine aminotransferase levels2.000000e-09

MeSH disease descriptors (1)

DescriptorNameTree numbers
D003240Connective Tissue DiseasesC17.300

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL1250350 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

9 potent at pChembl≥5 of 10 total, top 9 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
9.88Kd0.132nMCHEMBL5653589
9.88ED500.132nMCHEMBL5653589
6.85Ki140nMCHEMBL332005
6.70Ki200nMCHEMBL3586252
6.29Kd510.8nMCHEMBL3752910
6.29ED50510.8nMCHEMBL3752910
6.08IC50840nMCHEMBL332005
5.92IC501200nMCHEMBL3586252
5.58Ki2600nMPYRIDINE-2,5-DICARBOXYLIC ACID

PubChem BioAssay actives

7 with measured affinity, of 20 total; 5 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
4-methyl-3-[(2-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2148923: Binding affinity to human P4HA1 incubated for 45 mins by Kinobead based pull down assaykd0.0001uM
6-(5-carboxy-2-pyridinyl)pyridine-3-carboxylic acid1230196: Inhibition of human human recombinant CP4H1 expressed in Escherichia coli Origami B(DE3) pre-incubated for 2 mins followed by alpha-ketoglutarate addition using dansylGlyProProGlyOEt substrate in Tris-HCl buffer at pH 7.8 at 30 degC by HPLC based Cheng-Prussoff equation methodki0.1400uM
6-(4-carboxy-2-pyridinyl)pyridine-3-carboxylic acid1230196: Inhibition of human human recombinant CP4H1 expressed in Escherichia coli Origami B(DE3) pre-incubated for 2 mins followed by alpha-ketoglutarate addition using dansylGlyProProGlyOEt substrate in Tris-HCl buffer at pH 7.8 at 30 degC by HPLC based Cheng-Prussoff equation methodki0.2000uM
4-methyl-3-[(1-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2148923: Binding affinity to human P4HA1 incubated for 45 mins by Kinobead based pull down assaykd0.5108uM
pyridine-2,5-dicarboxylic acid1230196: Inhibition of human human recombinant CP4H1 expressed in Escherichia coli Origami B(DE3) pre-incubated for 2 mins followed by alpha-ketoglutarate addition using dansylGlyProProGlyOEt substrate in Tris-HCl buffer at pH 7.8 at 30 degC by HPLC based Cheng-Prussoff equation methodki2.6000uM

CTD chemical–gene interactions

85 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Oxygenincreases reaction, increases expression, affects binding6
bisphenol Aaffects expression, decreases expression, increases expression4
Valproic Aciddecreases methylation, affects expression, decreases expression4
Cyclosporinedecreases expression3
Cadmium Chlorideincreases expression, decreases reaction, increases abundance, increases palmitoylation3
methylmercuric chloridedecreases expression2
sodium arsenitedecreases expression, affects expression2
cobaltous chlorideincreases expression, decreases reaction2
Tetrachlorodibenzodioxindecreases expression2
Aflatoxin B1affects expression, increases methylation2
Particulate Matterdecreases expression, increases abundance2
aristolochic acid Idecreases expression1
GSK-J4increases expression1
bisphenol Fincreases expression1
dicrotophosdecreases expression1
selenomethylselenocysteineincreases expression1
2,4,6-tribromophenolincreases expression1
chloroacetaldehydeincreases expression1
triphenyl phosphateaffects expression1
diethyl phthalatedecreases expression, increases abundance1
decabromobiphenyl etherincreases expression1
ferric ammonium citratedecreases reaction, increases expression1
beta-lapachoneincreases expression1
arseniteaffects binding, increases reaction1
zinc chloridedecreases reaction, increases expression1
tetrabromobisphenol Aincreases expression1
2-bromopalmitatedecreases reaction, increases abundance, increases palmitoylation1
nickel chlorideincreases expression1
perfluorooctanoic aciddecreases expression1
zinc chromateincreases abundance, increases expression1

ChEMBL screening assays

6 unique, capped per target: 6 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL1243628BindingActivity of collagen prolyl 4 hydroxylase-1Non-heme dioxygenases: cellular sensors and regulators jelly rolled into one? — Nat Chem Biol

Clinical trials (associated diseases)

83 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT01042158PHASE4COMPLETEDA Clinical Trial of Ambrisentan and Tadalafil in Pulmonary Arterial Hypertension Associated With Systemic Sclerosis
NCT03688191PHASE4UNKNOWNStudy of Sirolimus in CTD-TP in China
NCT04169100PHASE4UNKNOWNNovel Form of Acquired Long QT Syndrome
NCT04197050PHASE4UNKNOWNEffect of Sacubitril/Valsartan on Reduced Right Ventricular Ejection Fraction in Patients With CTD
NCT04928586PHASE4UNKNOWNImmunosuppressant Combined With Pirfenidone in CTD-ILD
NCT05440240PHASE4RECRUITINGPercutaneous Needle Fasciotomy +/- Corticosteroid Injection for Dupuytren’s Contracture
NCT05505409PHASE4UNKNOWNEfficacy and Safety of Pirfenidone in CTD-ILD
NCT06499233PHASE4RECRUITINGEfficacy and Safety of Prophylactic Treatment for Pneumocystis Jirovecii Pneumonia in Patients With Autoimmune Inflammatory Rheumatic Disease
NCT00864201PHASE3UNKNOWNA Study to Evaluate the Use of Bosentan in Patients With Exercise Induced Pulmonary Arterial Hypertension Associated With Connective Tissue Disease
NCT01196091PHASE3COMPLETEDA Study of LY2127399 in Participants With Systemic Lupus Erythematosus
NCT01205438PHASE3COMPLETEDA Study of LY2127399 in Participants With Systemic Lupus Erythematosus
NCT01488708PHASE3TERMINATEDOn Open-Label Study in Participants With Systemic Lupus Erythematosus
NCT03626688PHASE3COMPLETEDA Study Evaluating the Efficacy and Safety of Ralinepag to Improve Treatment Outcomes in PAH Patients
NCT03683186PHASE3ENROLLING_BY_INVITATIONA Study Evaluating the Long-Term Efficacy and Safety of Ralinepag in Subjects With PAH Via an Open-Label Extension
NCT04084678PHASE3TERMINATEDA Study of Ralinepag to Evaluate Effects on Exercise Capacity by CPET in Subjects With WHO Group 1 PH
NCT06716606PHASE3RECRUITINGA Study to Investigate the Long-term Safety and Efficacy of Belimumab in Adults With Interstitial Lung Disease (ILD) Associated With Systemic Sclerosis (SSc) and Other Connective Tissue Diseases (CTD) (BLISSconneCTD-OLE)
NCT06917690PHASE3RECRUITINGA Study to Learn About the Safety and Efficacy of the Drug Oleogel-S10 in Japanese Patients With Epidermolysis Bullosa
NCT00004357PHASE2COMPLETEDAbsorption of Corticosteroids in Children With Juvenile Dermatomyositis
NCT00005675PHASE2COMPLETEDOral Type I Collagen for Relieving Scleroderma
NCT01808196PHASE2COMPLETEDTesting Effectiveness of Losartan in Patients With EoE With or Without a CTD
NCT02682511PHASE2ACTIVE_NOT_RECRUITINGOral Ifetroban to Treat Diffuse Cutaneous Systemic Sclerosis (SSc) or SSc-associated Pulmonary Arterial Hypertension
NCT04993885PHASE2RECRUITINGAvatrombopag in the Treatment of Adult Immune Thrombocytopenia With Autoantibodies
NCT05516758PHASE2TERMINATEDA Study of Peresolimab (LY3462817) in Participants With Moderately-to-Severely Active Rheumatoid Arthritis
NCT05998759PHASE2RECRUITINGTelitacicept for the Treatment of Connective Tissue Disease-associated Thrombocytopenia
NCT06104228PHASE2RECRUITING129 Xenon MRI as a Biomarker for Diagnosis and Response to Therapy in Pulmonary Arterial Hypertension (PAH)
NCT01093911PHASE1COMPLETEDSafety Study of CDP7657 in Healthy Volunteers and Patients With Systemic Lupus Erythematosus (SLE)
NCT01764594PHASE1COMPLETEDSafety Study of CDP7657 in Patients With Systemic Lupus Erythematosus
NCT02392130PHASE1COMPLETEDA Clinical Trial to Assess the Potential of LEO 130852A Gel to Reduce Steroid Induced Skin Atrophy on Healthy Skin
NCT03337165PHASE1COMPLETEDAutologous Tolerogenic Dendritic Cells for Treatment of Patients With Rheumatoid Arthritis
NCT03929120PHASE1COMPLETEDAllogeneic Bone Marrow Mesenchymal Stem Cells for Patients With Interstitial Lung Disease (ILD) & Connective Tissue Disorders (CTD)
NCT01424033PHASE2/PHASE3TERMINATEDA Clinical Trial for CTD-ILD Treatment
NCT04915482PHASE2/PHASE3UNKNOWNTPO-RAs Combined With Anti-CD20 Antibody in the Treatment of Adult Immune Thrombocytopenia With Autoantibodies
NCT06574581PHASE1/PHASE2RECRUITINGADSCs Therapy in Patients With CTD-ILD
NCT00001330Not specifiedCOMPLETEDStudy of Silicone-Associated Connective Tissue Diseases
NCT00001641Not specifiedCOMPLETEDStudy of Heritable Connective Tissue Disorders
NCT00001978Not specifiedTERMINATEDDetermination of Kidney Function
NCT00076830Not specifiedCOMPLETEDEvaluation and Treatment of Patients With Connective Tissue Disease
NCT00341679Not specifiedCOMPLETEDStudies of the Natural History and Pathogenesis of Autoimmune/Connective Tissue Diseases
NCT00470327Not specifiedRECRUITINGA Study of the Natural Progression of Interstitial Lung Disease (ILD)
NCT00491309Not specifiedUNKNOWNExercise and Respiratory Therapy in Patients With Rheumatoid Arthritis / Collagenosis and Pulmonary Hypertension