PGBD2

gene
On this page

Summary

PGBD2 (piggyBac transposable element derived 2, HGNC:19399) is a protein-coding gene on chromosome 1q44, encoding PiggyBac transposable element-derived protein 2 (Q6P3X8).

The piggyBac family of proteins, found in diverse animals, are transposases related to the transposase of the canonical piggyBac transposon from the moth, Trichoplusia ni. This family also includes genes in several genomes, including human, that appear to have been derived from the piggyBac transposons. This gene belongs to the subfamily of piggyBac transposable element derived (PGBD) genes. The PGBD proteins appear to be novel, with no obvious relationship to other transposases, or other known protein families. The exact function of this gene is not known. Two transcript variants encoding different isoforms have been found for this gene.

Source: NCBI Gene 267002 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 118 total — 1 pathogenic
  • MANE Select transcript: NM_170725

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:19399
Approved symbolPGBD2
NamepiggyBac transposable element derived 2
Location1q44
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000185220
Ensembl biotypeprotein_coding
Entrez267002

Gene structure

Transcript identifiers

Ensembl transcripts: 4 — 3 protein_coding, 1 protein_coding_CDS_not_defined

ENST00000329291, ENST00000355360, ENST00000462488, ENST00000935434

RefSeq mRNA: 2 — MANE Select: NM_170725 NM_001017434, NM_170725

CCDS: CCDS31128, CCDS31129

Canonical transcript exons

ENST00000329291 — 3 exons

ExonStartEnd
ENSE00001312561248916602248919146
ENSE00001637224248906235248906342
ENSE00003610993248913816248913879

Expression profiles

Bgee: expression breadth ubiquitous, 223 present calls, max score 80.67.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 4.5115 / max 54.2742, expressed in 1628 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
96624.50271626
96630.00893

Top tissues by expression

242 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
granulocyteCL:000009480.67gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047380.49gold quality
left testisUBERON:000453379.08gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099179.07gold quality
testisUBERON:000047378.39gold quality
right testisUBERON:000453478.33gold quality
leukocyteCL:000073878.30gold quality
monocyteCL:000057678.20gold quality
oocyteCL:000002377.92gold quality
mucosa of transverse colonUBERON:000499177.32gold quality
rectumUBERON:000105276.80gold quality
spleenUBERON:000210676.73gold quality
lymph nodeUBERON:000002976.49gold quality
body of pancreasUBERON:000115075.94gold quality
right lobe of liverUBERON:000111475.91gold quality
myocardiumUBERON:000234975.83silver quality
gingival epitheliumUBERON:000194975.80silver quality
mucosa of sigmoid colonUBERON:000499375.50gold quality
lower esophagus mucosaUBERON:003583475.34gold quality
upper arm skinUBERON:000426375.31silver quality
right lobe of thyroid glandUBERON:000111975.13gold quality
gingivaUBERON:000182875.13gold quality
vermiform appendixUBERON:000115474.91gold quality
pancreasUBERON:000126474.86gold quality
germinal epithelium of ovaryUBERON:000130474.86silver quality
islet of LangerhansUBERON:000000674.84gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451174.69silver quality
pancreatic ductal cellCL:000207974.52silver quality
gall bladderUBERON:000211074.45gold quality
small intestine Peyer’s patchUBERON:000345474.44gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no2.91

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

43 targeting PGBD2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3646100.0073.565283
HSA-MIR-5692A100.0074.406850
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-6759-5P99.9966.54785
HSA-MIR-511-3P99.9968.851467
HSA-MIR-25-3P99.9874.601817
HSA-MIR-32-5P99.9875.211964
HSA-MIR-363-3P99.9874.721821
HSA-MIR-367-3P99.9874.831819
HSA-MIR-92A-3P99.9875.211960
HSA-MIR-92B-3P99.9875.251955
HSA-MIR-4789-5P99.9870.762721
HSA-MIR-365899.9673.874379
HSA-MIR-144-3P99.9473.982698
HSA-MIR-335-3P99.9373.364958
HSA-MIR-95-5P99.8972.173973
HSA-MIR-3121-3P99.8271.963630
HSA-MIR-808099.8267.521342
HSA-MIR-451799.7669.191867
HSA-MIR-4666B99.6468.691282
HSA-MIR-510-3P99.5470.062965
HSA-MIR-5004-3P99.5468.271371
HSA-MIR-608399.4768.732393
HSA-MIR-2115-3P99.3169.682026
HSA-MIR-223-5P99.2468.821206
HSA-MIR-4687-5P99.1466.26488

Cross-species orthologs

0 orthologs

Protein

Protein identifiers

PiggyBac transposable element-derived protein 2Q6P3X8 (reviewed: Q6P3X8)

All UniProt accessions (1): Q6P3X8

Isoforms (2)

UniProt IDNamesCanonical?
Q6P3X8-11yes
Q6P3X8-22

RefSeq proteins (2): NP_001017434, NP_733843* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR029526PGBDDomain
IPR052638PiggyBac_TE-derivedFamily

Pfam: PF13843

UniProt features (3 total): chain 1, region of interest 1, splice variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q6P3X8-F183.550.60

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 52 (showing top): chr1q44, GOMF_SEQUENCE_SPECIFIC_DNA_BINDING, CBX5_TARGET_GENES, ID2_TARGET_GENES, KAT2A_TARGET_GENES, NCOA6_TARGET_GENES, SFMBT1_TARGET_GENES, SNIP1_TARGET_GENES, TOP2B_TARGET_GENES, ZNF2_TARGET_GENES, ZNF274_TARGET_GENES, ZNF391_TARGET_GENES, ZNF407_TARGET_GENES, ZNF524_TARGET_GENES, ZNF592_TARGET_GENES

GO Biological Process (0):

GO Molecular Function (1): sequence-specific DNA binding (GO:0043565)

GO Cellular Component (0):

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
DNA binding1

Protein interactions and networks

STRING

344 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
PGBD2ZBED8Q8IZ13617
PGBD2THAP9Q9H5L6555
PGBD2POGKQ9P215524
PGBD2PGBD5Q8N414513
PGBD2ZNF862O60290510
PGBD2BET1LQ9NYM9481
PGBD2ERCC6Q03468480
PGBD2ZNF672Q499Z4473
PGBD2NAIF1Q69YI7453
PGBD2ZNF385DQ9H6B1447
PGBD2HARBI1Q96MB7445
PGBD2GNL1P36915440
PGBD2TP53TG3Q9ULZ0398
PGBD2TRPC4APQ8TEL6386
PGBD2GIN1Q9NXP7384

IntAct

9 interactions, top by confidence:

ABTypeScore
PGBD1SCAND1psi-mi:“MI:0914”(association)0.940
PGBD1ZNF24psi-mi:“MI:0914”(association)0.900
PGBD1ZNF213psi-mi:“MI:0914”(association)0.530
FAXCMETTL15psi-mi:“MI:0914”(association)0.350
FTLSH3PXD2Bpsi-mi:“MI:0914”(association)0.350
FAXCPOLRMTpsi-mi:“MI:0914”(association)0.350
FTLpsi-mi:“MI:0914”(association)0.350

BioGRID (7): PGBD2 (Affinity Capture-MS), PGBD2 (Affinity Capture-MS), PGBD2 (Affinity Capture-MS), PGBD2 (Affinity Capture-MS), PGBD2 (Protein-peptide), TFRC (Cross-Linking-MS (XL-MS)), PGBD2 (Affinity Capture-MS)

ESM2 similar proteins: A0A2P1BRP3, A0A2P1BRQ0, A0ZSK3, A0ZSK4, A4IFA3, A4Z943, A4Z944, A4Z945, B2RD01, B2RRL2, O43422, O60108, O96006, P0CF97, P12258, P34601, P49777, Q09575, Q0VBL1, Q17RP2, Q2T9S8, Q3EBC8, Q49AG3, Q4R6P1, Q4W5G0, Q5SVZ6, Q6EKJ0, Q6NT04, Q6P3X8, Q6R2W3, Q7L775, Q7M3K2, Q7TSV6, Q86UP8, Q8BUZ3, Q8BV66, Q8CBA2, Q8IY51, Q8IZ13, Q8TCP9

Diamond homologs: F8VPZ5, P0DP91, Q03468, Q6P3X8, Q8N328, Q96JS3, A1YEP8, A1YEQ3, A1YEV9, A1YFW2, A1YFW6, A1YG26, A1YG48, A1YG60, A1YGJ4, A2T6E3, A2T6V8, A2T6W2, A2T712, A2T736, A2T7D2, A2T7D7, A2T7F4, A2T7L7, A2T812, A6QNZ0, A6QPT6, B2KFW1, O14709, O14771, O14978, O15535, O43309, O60304, O95125, P10073, P17022, P17028, P17029, P17040

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

118 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance109
Likely benign1
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
442386GRCh37/hg19 1p36.33-q44(chr1:849467-249224684)x3Pathogenic

SpliceAI

916 predictions. Top by Δscore:

VariantEffectΔscore
1:248913813:CAGGT:Cacceptor_loss1.0000
1:248913814:A:AGacceptor_gain1.0000
1:248913814:AGGT:Aacceptor_loss1.0000
1:248913815:G:GGacceptor_gain1.0000
1:248913875:TCCAG:Tdonor_loss1.0000
1:248913876:CCAG:Cdonor_loss1.0000
1:248913877:CAGG:Cdonor_loss1.0000
1:248913878:AGGTA:Adonor_loss1.0000
1:248913879:GGT:Gdonor_loss1.0000
1:248913880:GT:Gdonor_loss1.0000
1:248913881:T:Gdonor_loss1.0000
1:248913815:GGTT:Gacceptor_gain0.9900
1:248916601:GAGAT:Gacceptor_gain0.9900
1:248916999:C:Gdonor_gain0.9900
1:248906490:G:GTdonor_gain0.9800
1:248906517:G:GTdonor_gain0.9800
1:248913814:AG:Aacceptor_gain0.9800
1:248913815:GG:Gacceptor_gain0.9800
1:248913815:GGTTC:Gacceptor_gain0.9800
1:248916600:A:AGacceptor_gain0.9800
1:248916601:G:GGacceptor_gain0.9800
1:248916601:GA:Gacceptor_gain0.9800
1:248913811:TACAG:Tacceptor_gain0.9700
1:248914504:G:GTdonor_gain0.9700
1:248906341:AG:Adonor_loss0.9600
1:248906342:GGTAG:Gdonor_loss0.9600
1:248906343:G:GCdonor_loss0.9600
1:248906344:T:Adonor_loss0.9600
1:248906485:G:GTdonor_gain0.9600
1:248906491:A:Tdonor_gain0.9600

AlphaMissense

3942 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
1:248918304:T:AC574S0.997
1:248918305:G:CC574S0.997
1:248918340:T:CF586L0.997
1:248918342:C:AF586L0.997
1:248918342:C:GF586L0.997
1:248918271:T:AC563S0.996
1:248918271:T:CC563R0.996
1:248918272:G:CC563S0.996
1:248918304:T:CC574R0.996
1:248918337:T:CC585R0.996
1:248918352:C:GH590D0.996
1:248918280:T:AC566S0.995
1:248918281:G:CC566S0.995
1:248918326:T:AV581D0.995
1:248918337:T:AC585S0.995
1:248918338:G:CC585S0.995
1:248918280:T:CC566R0.994
1:248918313:T:AC577S0.994
1:248918314:G:CC577S0.994
1:248918354:C:AH590Q0.994
1:248918354:C:GH590Q0.994
1:248917149:A:CS189R0.993
1:248917151:T:AS189R0.993
1:248917151:T:GS189R0.993
1:248918313:T:CC577R0.993
1:248917185:T:AW201R0.992
1:248917185:T:CW201R0.992
1:248918240:C:AH552Q0.992
1:248918240:C:GH552Q0.992
1:248918305:G:AC574Y0.992

dbSNP variants (sampled 300 via entrez): RS1000051404 (1:248872252 A>G), RS1000056410 (1:248899828 A>G), RS1000145049 (1:248915581 C>A,T), RS1000171504 (1:248885828 G>A), RS1000186437 (1:248920341 C>G,T), RS1000291658 (1:248888973 A>G), RS1000300462 (1:248893121 A>G), RS1000323028 (1:248920526 TTTTC>T), RS1000323416 (1:248887714 G>T), RS1000342186 (1:248929515 G>A), RS1000346542 (1:248874757 A>G), RS1000346544 (1:248906922 G>A), RS1000365927 (1:248924696 GACA>G), RS1000506013 (1:248916556 G>C), RS1000519278 (1:248919113 T>A)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST006462_4Uterine fibroids4.000000e-09
GCST009560_3Decreased low contrast letter acuity in multiple sclerosis2.000000e-06

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0008385visual acuity measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

21 total (human), top 21 by PubMed support.

ChemicalActions (top 5)PubMed papers
3-((6-(2-methoxyphenyl)pyrimidin-4-yl)amino)phenyl)methane sulfonamidedecreases expression1
triphenyl phosphateaffects expression1
tris(2-butoxyethyl) phosphateaffects expression1
sodium arsenitedecreases expression1
ferrous chloridedecreases expression1
di-n-butylphosphoric acidaffects expression1
abrinedecreases expression1
licochalcone Bincreases expression1
Sunitinibdecreases expression1
Air Pollutants, Occupationaldecreases expression1
Benzo(a)pyreneincreases expression1
Ethyl Methanesulfonateincreases expression1
Methyl Methanesulfonateincreases expression1
Nickeldecreases expression1
Niclosamidedecreases expression1
Rotenonedecreases expression1
Smokedecreases expression1
Thimerosalincreases expression1
Urethaneincreases expression1
Cyclosporinedecreases expression1
Copper Sulfatedecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): uterine corpus leiomyoma