PGA3

gene
On this page

Summary

PGA3 (pepsinogen A3, HGNC:8885) is a protein-coding gene on chromosome 11q12.2, encoding Pepsin A-3 (P0DJD8). Shows particularly broad specificity; although bonds involving phenylalanine and leucine are preferred, many others are also cleaved to some extent.

This gene encodes a protein precursor of the digestive enzyme pepsin, a member of the peptidase A1 family of endopeptidases. The encoded precursor is secreted by gastric chief cells and undergoes autocatalytic cleavage in acidic conditions to form the active enzyme, which functions in the digestion of dietary proteins. This gene is found in a cluster of related genes on chromosome 11, each of which encodes one of multiple pepsinogens. Pepsinogen levels in serum may serve as a biomarker for atrophic gastritis and gastric cancer.

Source: NCBI Gene 643834 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 24 total
  • MANE Select transcript: NM_001079807

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:8885
Approved symbolPGA3
Namepepsinogen A3
Location11q12.2
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000229859
Ensembl biotypeprotein_coding
OMIM169710
Entrez643834

Gene structure

Transcript identifiers

Ensembl transcripts: 8 — 4 protein_coding, 4 retained_intron

ENST00000325558, ENST00000535551, ENST00000537954, ENST00000538258, ENST00000539649, ENST00000541699, ENST00000543125, ENST00000543505

RefSeq mRNA: 1 — MANE Select: NM_001079807 NM_001079807

CCDS: CCDS31574

Canonical transcript exons

ENST00000325558 — 9 exons

ExonStartEnd
ENSE000024781956120829161208490
ENSE000025048526120749761207615
ENSE000025057286121255361213098
ENSE000025178676120351561203620
ENSE000034587466120410761204269
ENSE000034934546121133761211435
ENSE000036518656121109061211234
ENSE000037581006120651061206627
ENSE000037867066120984261209958

Expression profiles

Bgee: expression breadth ubiquitous, 123 present calls, max score 96.24.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 1.1118 / max 1557.1047, expressed in 55 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
1145521.111855

Top tissues by expression

131 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
body of stomachUBERON:000116196.24gold quality
stomachUBERON:000094593.66gold quality
fundus of stomachUBERON:000116092.67gold quality
right uterine tubeUBERON:000130288.90gold quality
right coronary arteryUBERON:000162588.20gold quality
endocervixUBERON:000045886.74gold quality
lower esophagus mucosaUBERON:003583486.39gold quality
ectocervixUBERON:001224985.89gold quality
left uterine tubeUBERON:000130383.47gold quality
metanephros cortexUBERON:001053381.79gold quality
uterine cervixUBERON:000000281.20gold quality
right adrenal glandUBERON:000123380.35gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099179.81gold quality
body of pancreasUBERON:000115079.80gold quality
spleenUBERON:000210677.61gold quality
transverse colonUBERON:000115777.39gold quality
right ovaryUBERON:000211877.32gold quality
fallopian tubeUBERON:000388977.11gold quality
right lungUBERON:000216776.82gold quality
omental fat padUBERON:001041476.72gold quality
subcutaneous adipose tissueUBERON:000219076.67gold quality
adipose tissueUBERON:000101376.41gold quality
left adrenal glandUBERON:000123476.39gold quality
left testisUBERON:000453375.19gold quality
thoracic mammary glandUBERON:000520074.99gold quality
esophagus mucosaUBERON:000246974.94gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047374.82gold quality
right lobe of liverUBERON:000111474.71gold quality
right adrenal gland cortexUBERON:003582774.70gold quality
mucosa of transverse colonUBERON:000499174.66gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-MTAB-6058no22.22
E-ANND-3no2.12

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

5 targeting PGA3, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6793-5P99.9765.95758
HSA-MIR-6783-3P99.8967.922059
HSA-MIR-430799.8270.453374
HSA-MIR-4768-3P98.1666.022330
HSA-MIR-194-3P97.3665.961027

Literature-anchored findings (GeneRIF, showing 10)

  • Higher levels of Pepsinogen I is associated with early gastric cancer (PMID:15688378)
  • Helicobacter pylori eradication improves gastric histology and decreases serum gastrin, pepsinogen I and pepsinogen II levels in patients with duodenal ulcer. (PMID:17559360)
  • pepsinogens may have a role in esophageal squamous dysplasia (PMID:18844222)
  • Barrett’s esophagus is characterized by the absence of Helicobacter pylori infection and high levels of serum pepsinogen I concentration in Japan. (PMID:19196398)
  • A significant negative correlation was found between the degree of corpus atrophy and the level of serum pepsinogen-I in atrophic gastritis patients with gastroesophageal reflux. (PMID:21175799)
  • Significantly higher AUC was observed for the PGI/PGII ratio and is associated with atrophy and intestinal metaplasia. (PMID:23886209)
  • High serum Pepsinogen I level is associ9ated with the progression of gastric precancerous lesions. (PMID:24895149)
  • Low Pepsinogen I is associated with precancerous gastric lesions. (PMID:25292040)
  • Serum pepsinogen (PG), gastrin, and Helicobacter pylori (H. pylori) IgG antibodies have become indicators for “serological biopsy” of the gastric mucosa lesion. (PMID:31414587)
  • Diagnostic value of MRI-DWI signal intensity value combined with serum PGI, PGII and CA199 in early gastric cancer. (PMID:34817333)

Cross-species orthologs

14 orthologs

OrganismSymbolGene ID
drosophila_melanogasterBaceFBGN0032049
drosophila_melanogasterCG6508FBGN0032303
drosophila_melanogasterCG17134FBGN0032304
drosophila_melanogasterCG33128FBGN0053128
caenorhabditis_elegansWBGENE00000214
caenorhabditis_elegansWBGENE00000218
caenorhabditis_elegansWBGENE00012681
caenorhabditis_elegansWBGENE00012682
caenorhabditis_elegansWBGENE00012683
caenorhabditis_elegansWBGENE00013973
caenorhabditis_elegansWBGENE00017678
caenorhabditis_elegansWBGENE00019104
caenorhabditis_elegansWBGENE00019105
caenorhabditis_elegansWBGENE00077655

Paralogs (9): PGC (ENSG00000096088), CTSD (ENSG00000117984), NAPSA (ENSG00000131400), REN (ENSG00000143839), BACE2 (ENSG00000182240), BACE1 (ENSG00000186318), CTSE (ENSG00000196188), PGA4 (ENSG00000229183), PGA5 (ENSG00000256713)

Protein

Protein identifiers

Pepsin A-3P0DJD8 (reviewed: P0DJD8)

Alternative names: Pepsinogen-3

All UniProt accessions (4): P0DJD8, F5GXL4, F5H0H6, F5H842

UniProt curated annotations — full annotation on UniProt →

Function. Shows particularly broad specificity; although bonds involving phenylalanine and leucine are preferred, many others are also cleaved to some extent.

Subcellular location. Secreted.

Similarity. Belongs to the peptidase A1 family.

RefSeq proteins (1): NP_001073275* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001461Aspartic_peptidase_A1Family
IPR001969Aspartic_peptidase_ASActive_site
IPR012848Aspartic_peptidase_NDomain
IPR021109Peptidase_aspartic_dom_sfHomologous_superfamily
IPR033121PEPTIDASE_A1Domain
IPR034162Pepsin_ADomain

Pfam: PF00026, PF07966

UniProt features (11 total): disulfide bond 3, sequence conflict 2, active site 2, signal peptide 1, propeptide 1, chain 1, domain 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P0DJD8-F191.150.75

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (2): 94; 277

Disulfide bonds (3): 107–112, 268–272, 311–344

Function

Pathways and Gene Ontology

Reactome pathways

1 pathways

IDPathway
R-HSA-5683826Surfactant metabolism

MSigDB gene sets: 30 (showing top): GOBP_DIGESTION, NIKOLSKY_BREAST_CANCER_11Q12_Q14_AMPLICON, GOCC_MULTIVESICULAR_BODY, GOCC_ENDOSOME_LUMEN, GOBP_PROTEOLYSIS, GOMF_PEPTIDASE_ACTIVITY, GOMF_ASPARTIC_TYPE_PEPTIDASE_ACTIVITY, GOCC_LATE_ENDOSOME_LUMEN, GOCC_MULTIVESICULAR_BODY_LUMEN, REACTOME_SURFACTANT_METABOLISM, ZNF184_TARGET_GENES, ZNF512_TARGET_GENES, MIR4768_3P, MIR194_3P, MIR6793_5P

GO Biological Process (2): proteolysis (GO:0006508), digestion (GO:0007586)

GO Molecular Function (3): aspartic-type endopeptidase activity (GO:0004190), peptidase activity (GO:0008233), hydrolase activity (GO:0016787)

GO Cellular Component (3): extracellular exosome (GO:0070062), multivesicular body lumen (GO:0097486), extracellular region (GO:0005576)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
Metabolism of proteins1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
protein metabolic process1
multicellular organismal process1
endopeptidase activity1
aspartic-type peptidase activity1
hydrolase activity1
catalytic activity, acting on a protein1
catalytic activity1
extracellular vesicle1
multivesicular body1
late endosome lumen1
cellular anatomical structure1

Protein interactions and networks

STRING

0 interactions, top by confidence (×1000):

IntAct

3 interactions, top by confidence:

ABTypeScore
PGA3RMND1psi-mi:“MI:0915”(physical association)0.400
cagAPGA3psi-mi:“MI:0915”(physical association)0.370

BioGRID (2): RMND1 (Affinity Capture-MS), PGA3 (Negative Genetic)

ESM2 similar proteins: O93428, P00791, P00792, P00793, P00794, P00795, P03954, P03955, P04073, P0DJD7, P0DJD8, P0DJD9, P11489, P14091, P16228, P16476, P18276, P20142, P25796, P27677, P27678, P27821, P27822, P27823, P28712, P28713, P43159, P56272, P70269, P81497, P81498, Q03168, Q28389, Q64411, Q689Z7, Q800A0, Q805F2, Q805F3, Q8SQ41, Q9D7R7

Diamond homologs: A0A146F0J0, A1CBR4, A1DDK1, A2R3L3, B0Y1V8, B6HL60, B8MF81, B8NLY9, C5FBS2, C5FW52, C5FZ57, C5PEI9, D4AIC4, D4ANC3, D4AT39, D4D7C5, D4DE18, D4DGR1, E5A7T3, E5R1B9, O01530, O13340, O60020, O65390, O76856, O93885, P00791, P00792, P00798, P03954, P03955, P04073, P06026, P0CU33, P0DJD7, P0DJD8, P0DJD9, P10602, P11489, P11838

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

24 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance18
Likely benign5
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

885 predictions. Top by Δscore:

VariantEffectΔscore
11:61203621:G:GGdonor_gain1.0000
11:61204102:CACA:Cacceptor_loss1.0000
11:61204103:ACAG:Aacceptor_gain1.0000
11:61204104:CA:Cacceptor_loss1.0000
11:61204105:A:ACacceptor_loss1.0000
11:61204105:A:AGacceptor_gain1.0000
11:61204105:AG:Aacceptor_gain1.0000
11:61204105:AGG:Aacceptor_gain1.0000
11:61204106:G:GTacceptor_gain1.0000
11:61204106:GG:Gacceptor_gain1.0000
11:61204106:GGG:Gacceptor_gain1.0000
11:61204106:GGGT:Gacceptor_gain1.0000
11:61204106:GGGTC:Gacceptor_gain1.0000
11:61204203:A:Tdonor_gain1.0000
11:61204265:TGGAT:Tdonor_gain1.0000
11:61204266:GGAT:Gdonor_gain1.0000
11:61204266:GGATG:Gdonor_gain1.0000
11:61204267:GAT:Gdonor_gain1.0000
11:61204267:GATG:Gdonor_gain1.0000
11:61204268:AT:Adonor_gain1.0000
11:61204269:TGTG:Tdonor_loss1.0000
11:61204270:G:GGdonor_gain1.0000
11:61204270:GTG:Gdonor_loss1.0000
11:61204271:TGAG:Tdonor_loss1.0000
11:61204272:G:GTdonor_loss1.0000
11:61204274:GT:Gdonor_gain1.0000
11:61207492:T:TAacceptor_gain1.0000
11:61207492:TGCA:Tacceptor_loss1.0000
11:61207494:CA:Cacceptor_loss1.0000
11:61207495:A:AGacceptor_gain1.0000

AlphaMissense

2544 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
11:61212626:G:TG364V0.999
11:61206571:A:TD94V0.998
11:61206572:C:AD94E0.998
11:61206572:C:GD94E0.998
11:61209941:G:CW252C0.998
11:61209941:G:TW252C0.998
11:61211146:A:TD277V0.998
11:61211147:C:AD277E0.998
11:61211147:C:GD277E0.998
11:61212616:T:AW361R0.998
11:61212616:T:CW361R0.998
11:61206571:A:CD94A0.997
11:61206574:C:TT95I0.997
11:61206591:T:AW101R0.997
11:61206591:T:CW101R0.997
11:61206611:C:GC107W0.997
11:61207511:T:CF118L0.997
11:61207513:C:AF118L0.997
11:61207513:C:GF118L0.997
11:61208330:A:CS166R0.997
11:61208332:C:AS166R0.997
11:61208332:C:GS166R0.997
11:61208384:G:TG184W0.997
11:61209939:T:AW252R0.997
11:61209939:T:CW252R0.997
11:61211145:G:CD277H0.997
11:61211146:A:CD277A0.997
11:61211146:A:GD277G0.997
11:61211383:T:CF322S0.997
11:61212625:G:CG364R0.997

dbSNP variants (sampled 300 via entrez): RS1002371559 (11:61202961 A>G), RS1002402654 (11:61203785 TA>T,TAA), RS1002738167 (11:61201875 T>C,G), RS1003991848 (11:61201732 A>G), RS1005745299 (11:61202438 T>A,C), RS1006012592 (11:61202878 G>A,C), RS1006838418 (11:61201937 G>C,T), RS1009728804 (11:61205338 A>G), RS1010119132 (11:61201925 G>A), RS1010193774 (11:61203830 G>A), RS1010481596 (11:61202094 G>A), RS1015331217 (11:61201747 T>G), RS1015956890 (11:61207610 G>C), RS1017123900 (11:61202923 G>A,C), RS1019955642 (11:61203886 AT>A)

Disease associations

OMIM: gene MIM:169710 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

10 total (human), top 10 by PubMed support.

ChemicalActions (top 5)PubMed papers
quercitrinincreases expression1
hydroxyhydroquinonedecreases expression1
Grape Seed Proanthocyanidinsincreases expression, affects cotreatment1
Resveratrolaffects cotreatment, decreases expression1
Catechinaffects cotreatment, increases expression1
Copperdecreases expression, affects cotreatment1
Leadaffects expression1
Triclosandecreases expression1
Valproic Acidincreases methylation1
Aflatoxin B1decreases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.