PM20D2

gene
On this page

Also known as bA63L7.3

Summary

PM20D2 (peptidase M20 domain containing 2, HGNC:21408) is a protein-coding gene on chromosome 6q15, encoding Xaa-Arg dipeptidase (Q8IYS1). Catalyzes the peptide bond hydrolysis in dipeptides having basic amino acids lysine, ornithine or arginine at C-terminus.

Enables dipeptidase activity and identical protein binding activity. Acts upstream of or within proteolysis and regulation of protein metabolic process. Located in nucleoplasm.

Source: NCBI Gene 135293 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 83 total
  • Druggable target: yes — 1 molecules with ChEMBL bioactivity
  • MANE Select transcript: NM_001010853

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:21408
Approved symbolPM20D2
Namepeptidase M20 domain containing 2
Location6q15
Locus typegene with protein product
StatusApproved
AliasesbA63L7.3
Ensembl geneENSG00000146281
Ensembl biotypeprotein_coding
OMIM615913
Entrez135293

Gene structure

Transcript identifiers

Ensembl transcripts: 2 — 2 protein_coding

ENST00000275072, ENST00000879768

RefSeq mRNA: 1 — MANE Select: NM_001010853 NM_001010853

CCDS: CCDS34499

Canonical transcript exons

ENST00000275072 — 7 exons

ExonStartEnd
ENSE000009748468914926589149413
ENSE000009748478915304389153185
ENSE000009748488915474889154902
ENSE000010165158914605589146609
ENSE000010165178916178389161890
ENSE000010165188915832589158460
ENSE000013056608916210989165565

Expression profiles

Bgee: expression breadth ubiquitous, 257 present calls, max score 95.44.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 8.6114 / max 225.1369, expressed in 1425 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
688747.92021381
688730.6913382

Top tissues by expression

259 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
tibialis anteriorUBERON:000138595.44gold quality
deltoidUBERON:000147694.51gold quality
epithelial cell of pancreasCL:000008394.39gold quality
oviduct epitheliumUBERON:000480494.20gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451194.18gold quality
quadriceps femorisUBERON:000137793.92gold quality
pigmented layer of retinaUBERON:000178293.87gold quality
vastus lateralisUBERON:000137993.64gold quality
germinal epithelium of ovaryUBERON:000130492.76gold quality
left ventricle myocardiumUBERON:000656692.61gold quality
cardiac muscle of right atriumUBERON:000337992.02gold quality
skeletal muscle tissueUBERON:000113491.60gold quality
thymusUBERON:000237091.23gold quality
mucosa of paranasal sinusUBERON:000503090.84gold quality
biceps brachiiUBERON:000150790.71gold quality
muscle tissueUBERON:000238590.52gold quality
oral cavityUBERON:000016789.98gold quality
skeletal muscle organUBERON:001489289.75gold quality
gastrocnemiusUBERON:000138889.48gold quality
ventricular zoneUBERON:000305389.31gold quality
ileal mucosaUBERON:000033189.27gold quality
muscle of legUBERON:000138388.95gold quality
calcaneal tendonUBERON:000370188.45gold quality
superficial temporal arteryUBERON:000161488.42gold quality
upper arm skinUBERON:000426388.42gold quality
skeletal muscle tissue of biceps brachiiUBERON:000450287.70gold quality
myocardiumUBERON:000234987.63gold quality
tongueUBERON:000172387.61gold quality
trabecular bone tissueUBERON:000248387.53gold quality
descending thoracic aortaUBERON:000234587.41gold quality

Single-cell (SCXA)

Detected in 4 experiment(s), a significant marker in 4.

ExperimentMarker?Max mean expression
E-MTAB-3929yes35.40
E-MTAB-7249yes11.03
E-ANND-3yes4.71
E-CURD-112yes4.13

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

111 targeting PM20D2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-450A-1-3P100.0069.331837
HSA-MIR-4455100.0065.481587
HSA-LET-7A-3P100.0074.033932
HSA-LET-7B-3P100.0074.083913
HSA-LET-7F-1-3P100.0074.023928
HSA-MIR-98-3P100.0074.083907
HSA-MIR-3162-3P100.0065.37363
HSA-MIR-340-5P100.0072.504437
HSA-MIR-200B-3P100.0073.312693
HSA-MIR-200C-3P100.0073.352685
HSA-MIR-429100.0073.442698
HSA-MIR-3924100.0072.092394
HSA-MIR-428299.9975.366408
HSA-MIR-4789-3P99.9970.752484
HSA-MIR-548AW99.9972.573559
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-511-3P99.9968.851467
HSA-MIR-371B-5P99.9975.344759
HSA-MIR-1213699.9872.815713
HSA-MIR-485-3P99.9870.681585
HSA-MIR-539-3P99.9870.741616
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-MIR-477599.9875.006394
HSA-MIR-373-5P99.9875.364753
HSA-MIR-616-5P99.9875.584775
HSA-MIR-548AN99.9770.912817
HSA-MIR-365899.9673.874379
HSA-MIR-548AJ-3P99.9673.385345

Literature-anchored findings (GeneRIF, showing 1)

  • PM20D2 hydrolyzed beta-alanyl-lysine, beta-alanyl-ornithine, gamma-aminobutyryl-lysine, and gamma-aminobutyryl-ornithine as its best substrates. (PMID:24891507)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusPm20d2ENSMUSG00000054659
rattus_norvegicusPm20d2ENSRNOG00000007755

Protein

Protein identifiers

Xaa-Arg dipeptidaseQ8IYS1 (reviewed: Q8IYS1)

Alternative names: Beta-Ala-Lys dipeptidase

All UniProt accessions (1): Q8IYS1

UniProt curated annotations — full annotation on UniProt →

Function. Catalyzes the peptide bond hydrolysis in dipeptides having basic amino acids lysine, ornithine or arginine at C-terminus. Postulated to function in a metabolite repair mechanism by eliminating alternate dipeptide by-products formed during carnosine synthesis.

Similarity. Belongs to the peptidase M20A family.

RefSeq proteins (1): NP_001010853* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR002933Peptidase_M20Family
IPR011650Peptidase_M20_dimerDomain
IPR017144Xaa-Arg_dipeptidaseFamily
IPR036264Bact_exopeptidase_dim_domHomologous_superfamily
IPR052030Peptidase_M20/M20A_hydrolasesFamily

Pfam: PF01546, PF07687

Catalyzed reactions (Rhea), 5 shown:

  • beta-alanyl-L-lysine + H2O = beta-alanine + L-lysine (RHEA:59608)
  • beta-alanyl-L-ornithine + H2O = beta-alanine + L-ornithine (RHEA:59612)
  • N(2)-(4-aminobutanoyl)-L-lysine + H2O = 4-aminobutanoate + L-lysine (RHEA:59620)
  • N(2)-(4-aminobutanoyl)-L-ornithine + H2O = 4-aminobutanoate + L-ornithine (RHEA:59624)
  • N(2)-(4-aminobutanoyl)-L-arginine + H2O = 4-aminobutanoate + L-arginine (RHEA:59628)

UniProt features (35 total): helix 14, strand 11, turn 5, sequence conflict 3, chain 1, sequence variant 1

Structure

Experimental structures (PDB)

4 structures.

PDBMethodResolution (Å)
7YH4X-RAY DIFFRACTION2.03
8ZEIX-RAY DIFFRACTION2.3
8ZELX-RAY DIFFRACTION2.3
8XZCX-RAY DIFFRACTION2.35

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q8IYS1-F193.560.87

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 88 (showing top): chr6q15, GOBP_MODIFIED_AMINO_ACID_CATABOLIC_PROCESS, GOBP_DICARBOXYLIC_ACID_METABOLIC_PROCESS, GOBP_PTERIDINE_CONTAINING_COMPOUND_METABOLIC_PROCESS, GOBP_AMIDE_METABOLIC_PROCESS, GOBP_ORGANIC_ACID_CATABOLIC_PROCESS, GOBP_ORGANIC_ACID_METABOLIC_PROCESS, GOBP_SMALL_MOLECULE_CATABOLIC_PROCESS, TTGGAGA_MIR5155P_MIR519E, DODD_NASOPHARYNGEAL_CARCINOMA_UP, CHARAFE_BREAST_CANCER_LUMINAL_VS_BASAL_DN, JAATINEN_HEMATOPOIETIC_STEM_CELL_UP, GOBP_DICARBOXYLIC_ACID_CATABOLIC_PROCESS, GOBP_VITAMIN_METABOLIC_PROCESS, GOBP_FOLIC_ACID_METABOLIC_PROCESS

GO Biological Process (2): proteolysis (GO:0006508), regulation of protein metabolic process (GO:0051246)

GO Molecular Function (6): carboxypeptidase activity (GO:0004180), dipeptidase activity (GO:0016805), identical protein binding (GO:0042802), protein binding (GO:0005515), peptidase activity (GO:0008233), hydrolase activity (GO:0016787)

GO Cellular Component (1): nucleoplasm (GO:0005654)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
protein metabolic process2
exopeptidase activity2
regulation of macromolecule metabolic process1
regulation of primary metabolic process1
protein binding1
binding1
hydrolase activity1
catalytic activity, acting on a protein1
catalytic activity1
nuclear lumen1
cellular anatomical structure1

Protein interactions and networks

STRING

703 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
PM20D2MAPK1IP1LQ8NDC0520
PM20D2KLHL33A6NCF5515
PM20D2UBE2J1Q9Y385509
PM20D2MDN1Q9NU22486
PM20D2ZKSCAN8Q15776453
PM20D2PGRMC2O15173451
PM20D2CCDC198Q9NVL8447
PM20D2FAAHO00519441
PM20D2QRSL1Q9H0R6436
PM20D2GABRR2P28476429
PM20D2PM20D1Q6GTS8428
PM20D2PPP4R4Q6NUP7425
PM20D2MAST2Q6P0Q8425
PM20D2INTS5Q6P9B9412
PM20D2CNDP1Q96KN2412

IntAct

97 interactions, top by confidence:

ABTypeScore
ANGRNH1psi-mi:“MI:0914”(association)0.750
NIPAL1ESYT2psi-mi:“MI:0914”(association)0.640
PM20D2CNNM3psi-mi:“MI:0915”(physical association)0.560
PM20D2PM20D2psi-mi:“MI:0915”(physical association)0.560
PM20D2DDIT4Lpsi-mi:“MI:0915”(physical association)0.560
PM20D2SAXO1psi-mi:“MI:0915”(physical association)0.560
PM20D2HAPLN2psi-mi:“MI:0915”(physical association)0.560
PM20D2DNAJC11psi-mi:“MI:0915”(physical association)0.560
MEOX1PM20D2psi-mi:“MI:0915”(physical association)0.560
TDO2PM20D2psi-mi:“MI:0915”(physical association)0.560
RELPM20D2psi-mi:“MI:0915”(physical association)0.560
PM20D2NAA80psi-mi:“MI:0915”(physical association)0.560
PM20D2TAPBPLpsi-mi:“MI:0915”(physical association)0.560
PM20D2CERT1psi-mi:“MI:0915”(physical association)0.560
KELPM20D2psi-mi:“MI:0915”(physical association)0.560
RGS12PM20D2psi-mi:“MI:0915”(physical association)0.560
PM20D2EPDR1psi-mi:“MI:0915”(physical association)0.560
DALRD3PM20D2psi-mi:“MI:0915”(physical association)0.560
RP2PM20D2psi-mi:“MI:0915”(physical association)0.560
POU6F2PM20D2psi-mi:“MI:0915”(physical association)0.560
KRT75PM20D2psi-mi:“MI:0915”(physical association)0.560
KRTAP26-1PM20D2psi-mi:“MI:0915”(physical association)0.560
PM20D2RUSC1psi-mi:“MI:0915”(physical association)0.560
PM20D2MYOZ3psi-mi:“MI:0915”(physical association)0.560
PSMA1PM20D2psi-mi:“MI:0915”(physical association)0.560
SIVA1PM20D2psi-mi:“MI:0915”(physical association)0.560
HAO2EIF4G3psi-mi:“MI:0914”(association)0.530
ANGPTL4NMT2psi-mi:“MI:0914”(association)0.530
DEFB1NMT2psi-mi:“MI:0914”(association)0.530

BioGRID (72): PM20D2 (Affinity Capture-MS), PM20D2 (Affinity Capture-MS), PM20D2 (Affinity Capture-MS), PM20D2 (Affinity Capture-MS), PM20D2 (Affinity Capture-MS), PM20D2 (Affinity Capture-MS), PM20D2 (Affinity Capture-MS), PM20D2 (Proximity Label-MS), PM20D2 (Affinity Capture-MS), PM20D2 (Affinity Capture-MS), PM20D2 (Affinity Capture-MS), KIAA1671 (Affinity Capture-MS), PRIM2 (Affinity Capture-MS), TK1 (Affinity Capture-MS), PM20D2 (Affinity Capture-MS)

ESM2 similar proteins: A3KG59, A4IFH5, B9N1F9, D3ZVR9, O04059, O35331, O35621, O46560, P11172, P24298, P37111, Q03154, Q04609, Q15124, Q17QK3, Q2R483, Q501L1, Q5E9T8, Q5I0K3, Q5NAY4, Q5R514, Q5R5C9, Q5RDE7, Q5RDN7, Q5RFB0, Q5RFI8, Q6AY30, Q6AYS7, Q6K2E8, Q6PTT0, Q6Q0N1, Q6ZV70, Q7TSV4, Q7X7L3, Q8BZF8, Q8CG45, Q8CG76, Q8IYS1, Q8K183, Q8N0X4

Diamond homologs: A3KG59, Q0S8V5, Q501L1, Q8IYS1

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

83 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance63
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1538 predictions. Top by Δscore:

VariantEffectΔscore
6:89149259:TTCTA:Tacceptor_loss1.0000
6:89149264:GGTA:Gacceptor_gain1.0000
6:89149409:CATGA:Cdonor_gain1.0000
6:89149410:ATGA:Adonor_gain1.0000
6:89149410:ATGAG:Adonor_loss1.0000
6:89149411:TGA:Tdonor_gain1.0000
6:89149412:GA:Gdonor_gain1.0000
6:89149412:GAG:Gdonor_gain1.0000
6:89149412:GAGTG:Gdonor_loss1.0000
6:89149413:AGT:Adonor_loss1.0000
6:89149414:G:GGdonor_gain1.0000
6:89149414:GT:Gdonor_loss1.0000
6:89149415:T:Gdonor_loss1.0000
6:89149417:AGT:Adonor_loss1.0000
6:89153041:A:AGacceptor_gain1.0000
6:89153041:AGTGT:Aacceptor_gain1.0000
6:89153042:G:GAacceptor_gain1.0000
6:89153042:GT:Gacceptor_gain1.0000
6:89153042:GTGT:Gacceptor_gain1.0000
6:89153042:GTGTG:Gacceptor_gain1.0000
6:89153176:A:Tdonor_gain1.0000
6:89153182:CATGG:Cdonor_loss1.0000
6:89153183:ATGG:Adonor_loss1.0000
6:89153185:GGT:Gdonor_loss1.0000
6:89153186:G:GAdonor_loss1.0000
6:89153186:G:GGdonor_gain1.0000
6:89153187:T:Adonor_loss1.0000
6:89154797:T:TAacceptor_gain1.0000
6:89154903:G:GGdonor_gain1.0000
6:89158312:A:AGacceptor_gain1.0000

AlphaMissense

2813 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:89154817:G:CR276P0.997
6:89149308:A:TK170I0.995
6:89149364:C:GH189D0.995
6:89146482:A:TE113V0.994
6:89153153:G:CR242T0.993
6:89153154:A:CR242S0.993
6:89153154:A:TR242S0.993
6:89154899:C:GC303W0.993
6:89161865:T:AH377Q0.993
6:89161865:T:GH377Q0.993
6:89146403:T:CF87L0.992
6:89146405:C:AF87L0.992
6:89146405:C:GF87L0.992
6:89146525:C:AN127K0.992
6:89146525:C:GN127K0.992
6:89149291:A:CE164D0.992
6:89149291:A:TE164D0.992
6:89149309:A:CK170N0.992
6:89149309:A:TK170N0.992
6:89149366:C:AH189Q0.992
6:89149366:C:GH189Q0.992
6:89153117:C:AA230D0.992
6:89162145:C:AA398D0.992
6:89146473:T:CF110S0.991
6:89146522:C:AH126Q0.991
6:89146522:C:GH126Q0.991
6:89149290:A:TE164V0.991
6:89149293:A:TE165V0.991
6:89153063:G:AG212E0.991
6:89153119:G:CA231P0.991

dbSNP variants (sampled 300 via entrez): RS1000001820 (6:89141424 G>A), RS1000024219 (6:89137736 A>G), RS1000106183 (6:89115385 C>A,T), RS1000108810 (6:89152570 C>A), RS1000178375 (6:89123214 A>G), RS1000180742 (6:89128613 G>A), RS1000194564 (6:89165137 A>G), RS1000253668 (6:89122851 A>G), RS1000335786 (6:89122471 T>C), RS1000345234 (6:89161370 A>G), RS1000403892 (6:89152979 A>G), RS1000507272 (6:89105718 T>C,G), RS1000517360 (6:89121494 A>AG), RS1000613820 (6:89106262 G>A), RS1000648841 (6:89134577 C>T)

Disease associations

OMIM: gene MIM:615913 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST012020_120Serum metabolite levels7.000000e-23
GCST012020_121Serum metabolite levels3.000000e-16

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL5725160 (SINGLE PROTEIN)

Molecules with ChEMBL bioactivity

1 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 1,538 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL1232461MOLIBRESIB21,538

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

1 potent at pChembl≥5 of 1 total, top 1 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
5.00IC501e+04nMMOLIBRESIB

PubChem BioAssay actives

1 with measured affinity, of 6 total; 1 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
2-[(4S)-6-(4-chlorophenyl)-8-methoxy-1-methyl-4H-[1,2,4]triazolo[4,3-a][1,4]benzodiazepin-4-yl]-N-ethylacetamide2178966: Inhibition of PM20D2 (unknown origin) incubated for 1 hr by colloidal coomassie staining based LC-MS/MS analysisic5010.0000uM

CTD chemical–gene interactions

37 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
trichostatin Aaffects cotreatment, decreases expression, affects expression3
Valproic Acidaffects expression, decreases expression, increases methylation3
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxideincreases expression2
Cyclosporinedecreases expression, increases expression2
Cadmium Chloridedecreases expression2
triphenyl phosphateaffects expression1
bisphenol Adecreases expression1
lead acetatedecreases expression1
sodium arsenitedecreases expression1
perfluorooctane sulfonic aciddecreases expression1
monomethylarsonous aciddecreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression1
dorsomorphinaffects cotreatment, decreases expression1
jinfukangdecreases expression1
NSC 689534affects binding, decreases expression1
Bortezomibincreases expression1
Sunitinibdecreases expression1
Leflunomidedecreases expression1
Acetaminophendecreases expression1
Air Pollutantsdecreases expression, increases abundance1
Ethanolincreases expression1
Carbamazepineaffects expression1
Copperaffects binding, decreases expression1
Dichlorodiphenyl Dichloroethyleneincreases expression1
Demecolcinedecreases expression1
Doxorubicindecreases expression1
Enzyme Inhibitorsdecreases activity, increases O-linked glycosylation1
Ethyl Methanesulfonatedecreases expression1
Formaldehydedecreases expression1
Methyl Methanesulfonatedecreases expression1

ChEMBL screening assays

6 unique, capped per target: 6 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5697696BindingInhibition of PM20D2 (unknown origin) assessed as fold change at 10 uM incubated for 1 hr by colloidal coomassie staining based LC-MS/MS analysisInhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia. — Nature

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.