PROX2

gene
On this page

Also known as FLJ36749

Summary

PROX2 (prospero homeobox 2, HGNC:26715) is a protein-coding gene on chromosome 14q24.3, encoding Prospero homeobox protein 2 (Q3B8N5). Transcription regulator.

Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific and RNA polymerase II cis-regulatory region sequence-specific DNA binding activity. Predicted to be involved in regulation of transcription by RNA polymerase II. Predicted to be located in chromatin. Predicted to be active in nucleus.

Source: NCBI Gene 283571 — RefSeq curated summary.

At a glance

  • GWAS associations: 8
  • Clinical variants (ClinVar): 57 total
  • MANE Select transcript: NM_001243007

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26715
Approved symbolPROX2
Nameprospero homeobox 2
Location14q24.3
Locus typegene with protein product
StatusApproved
AliasesFLJ36749
Ensembl geneENSG00000119608
Ensembl biotypeprotein_coding
OMIM615094
Entrez283571

Gene structure

Transcript identifiers

Ensembl transcripts: 3 — 3 protein_coding

ENST00000556489, ENST00000619947, ENST00000673765

RefSeq mRNA: 3 — MANE Select: NM_001243007 NM_001080408, NM_001243007, NM_001384314

CCDS: CCDS45136, CCDS73663

Canonical transcript exons

ENST00000556489 — 6 exons

ExonStartEnd
ENSE000008082467485680174856995
ENSE000022597657487110374871237
ENSE000024350057485840774858514
ENSE000024723127485303374855302
ENSE000025136557486253074864008
ENSE000039209547487589574876145

Expression profiles

Bgee: expression breadth ubiquitous, 139 present calls, max score 78.67.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0175 / max 8.2010, expressed in 6 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
1440870.01756

Top tissues by expression

246 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047378.67silver quality
pancreatic ductal cellCL:000207967.54silver quality
stromal cell of endometriumCL:000225565.03gold quality
granulocyteCL:000009463.36gold quality
bone marrow cellCL:000209257.61gold quality
ileal mucosaUBERON:000033157.27silver quality
colonic epitheliumUBERON:000039756.73gold quality
tibialis anteriorUBERON:000138555.70silver quality
right ovaryUBERON:000211855.63gold quality
epithelial cell of pancreasCL:000008355.56gold quality
cortical plateUBERON:000534355.34silver quality
left ovaryUBERON:000211955.04gold quality
ganglionic eminenceUBERON:000402355.04silver quality
cardiac muscle of right atriumUBERON:000337954.34gold quality
left ventricle myocardiumUBERON:000656654.23gold quality
endocervixUBERON:000045854.20gold quality
kidney epitheliumUBERON:000481953.93gold quality
upper arm skinUBERON:000426353.52gold quality
ovaryUBERON:000099252.51gold quality
right uterine tubeUBERON:000130252.33gold quality
right lobe of thyroid glandUBERON:000111952.11gold quality
deltoidUBERON:000147651.67gold quality
body of uterusUBERON:000985351.26gold quality
prefrontal cortexUBERON:000045151.10gold quality
uterine cervixUBERON:000000251.07gold quality
right coronary arteryUBERON:000162550.80gold quality
left lobe of thyroid glandUBERON:000112050.79gold quality
lymph nodeUBERON:000002950.70gold quality
tonsilUBERON:000237250.47gold quality
endometriumUBERON:000129550.34gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes3.26

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

95 targeting PROX2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6867-5P100.0082.213464
HSA-MIR-4283100.0066.422097
HSA-MIR-6740-5P100.0065.64932
HSA-MIR-4262100.0073.263931
HSA-MIR-3667-3P99.9967.171636
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-MIR-493-5P99.9672.472382
HSA-MIR-6721-5P99.9368.922981
HSA-MIR-6783-3P99.8967.922059
HSA-MIR-990299.8969.152250
HSA-MIR-1343-3P99.8966.781815
HSA-MIR-153-5P99.8973.866317
HSA-MIR-5582-3P99.8672.484221
HSA-MIR-5003-3P99.8569.292517
HSA-MIR-548AR-3P99.8571.263889
HSA-MIR-469899.8471.414303
HSA-MIR-5010-3P99.8370.602357
HSA-MIR-548AZ-3P99.8270.563549
HSA-MIR-548BC99.8270.613524
HSA-MIR-548E-3P99.8270.593514
HSA-MIR-548F-3P99.8270.593540
HSA-MIR-205-5P99.8170.051557
HSA-MIR-4659A-3P99.8072.624248
HSA-MIR-4659B-3P99.8072.624248
HSA-MIR-205299.7969.372031
HSA-MIR-3913-5P99.7867.26968
HSA-MIR-129999.7771.242389
HSA-MIR-431999.7669.832586
HSA-MIR-548A-3P99.7670.583524

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_rerioprox2ENSDARG00000041952
mus_musculusProx2ENSMUSG00000042320
rattus_norvegicusProx2ENSRNOG00000005054
drosophila_melanogasterprosFBGN0004595
caenorhabditis_elegansWBGENE00000448

Paralogs (1): PROX1 (ENSG00000117707)

Protein

Protein identifiers

Prospero homeobox protein 2Q3B8N5 (reviewed: Q3B8N5)

Alternative names: Homeobox prospero-like protein PROX2

All UniProt accessions (3): Q3B8N5, A0A0G2JMC2, G3V3G0

UniProt curated annotations — full annotation on UniProt →

Function. Transcription regulator. Does not seem to be essential for embryonic development and postnatal survival.

Subcellular location. Nucleus.

Domain organisation. The Prospero-type homeodomain and the adjacent Prospero domain act as a single structural unit, the Homeo-Prospero domain.

Similarity. Belongs to the Prospero homeodomain family.

Isoforms (2)

UniProt IDNamesCanonical?
Q3B8N5-11yes
Q3B8N5-22

RefSeq proteins (3): NP_001073877, NP_001229936, NP_001371243 (=MANE)

Domains & families (InterPro)

IDNameType
IPR009057Homeodomain-like_sfHomologous_superfamily
IPR023082Homeo_prospero_domDomain
IPR037131Homeo_prospero_dom_sfHomologous_superfamily
IPR039350Prospero_homeodomainFamily

Pfam: PF05044

UniProt features (18 total): region of interest 6, compositionally biased region 4, sequence conflict 3, domain 2, chain 1, splice variant 1, sequence variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q3B8N5-F160.970.32

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 85 (showing top): GSE45365_NK_CELL_VS_CD11B_DC_UP, GOBP_LENS_FIBER_CELL_DIFFERENTIATION, GOBP_HEPATICOBILIARY_SYSTEM_DEVELOPMENT, GOBP_EPITHELIUM_DEVELOPMENT, GOBP_EPITHELIAL_CELL_DEVELOPMENT, GOBP_PANCREAS_DEVELOPMENT, GOBP_ANIMAL_ORGAN_MORPHOGENESIS, chr14q24, GOBP_ENDOTHELIUM_DEVELOPMENT, GOBP_EMBRYO_DEVELOPMENT, GOBP_SENSORY_ORGAN_DEVELOPMENT, GOBP_SENSORY_ORGAN_MORPHOGENESIS, GOBP_LENS_MORPHOGENESIS_IN_CAMERA_TYPE_EYE, GOBP_LYMPH_VESSEL_DEVELOPMENT, GOBP_RETINA_DEVELOPMENT_IN_CAMERA_TYPE_EYE

GO Biological Process (12): regulation of transcription by RNA polymerase II (GO:0006357), central nervous system development (GO:0007417), pancreas development (GO:0031016), tube development (GO:0035295), cell fate commitment (GO:0045165), anatomical structure formation involved in morphogenesis (GO:0048646), embryonic retina morphogenesis in camera-type eye (GO:0060059), lymphatic endothelial cell differentiation (GO:0060836), lens fiber cell morphogenesis (GO:0070309), hepatocyte differentiation (GO:0070365), regulation of DNA-templated transcription (GO:0006355), positive regulation of cell cycle (GO:0045787)

GO Molecular Function (3): RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA binding (GO:0003677)

GO Cellular Component (3): chromatin (GO:0000785), nucleus (GO:0005634), cytoplasm (GO:0005737)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
cellular anatomical structure2
regulation of DNA-templated transcription1
transcription by RNA polymerase II1
nervous system development1
system development1
animal organ development1
multicellular organism development1
anatomical structure development1
cell differentiation1
cellular developmental process1
anatomical structure morphogenesis1
developmental process1
embryonic morphogenesis1
retina morphogenesis in camera-type eye1
lymph vessel development1
endothelial cell differentiation1
cell morphogenesis1
lens morphogenesis in camera-type eye1
lens fiber cell development1
liver development1
epithelial cell differentiation1
DNA-templated transcription1
regulation of gene expression1
regulation of RNA biosynthetic process1
cell cycle1
positive regulation of cellular process1
regulation of cell cycle1
cis-regulatory region sequence-specific DNA binding1
chromatin1
DNA-binding transcription factor activity1
regulation of transcription by RNA polymerase II1
nucleic acid binding1
chromosome1
intracellular membrane-bounded organelle1
intracellular anatomical structure1

Protein interactions and networks

STRING

530 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
PROX2VRTNQ9H8Y1726
PROX2AREL1O15033598
PROX2SYNDIG1LA6NDD5542
PROX2YLPM1P49750521
PROX2RPS6KL1Q9Y6S9518
PROX2CATSPER1Q8NEC5455
PROX2SWI5Q1ZZU3455
PROX2IFT80Q9P2H3447
PROX2NBPF6Q5VWK0447
PROX2CTXN1P60606437
PROX2HEATR4Q86WZ0436
PROX2FLRT2O43155430
PROX2FCF1Q9Y324419
PROX2TMEM236Q5W0B7413
PROX2NISCHQ9Y2I1411

IntAct

0 interactions, top by confidence:

ESM2 similar proteins: A0A1B0GVQ3, A0A1W2PPK0, A0A1W2PPM1, A2A9I7, A6NCI8, A6QQS3, A7XCE8, E9PI22, E9PXT9, O15016, O91083, P09414, P0DMB1, P17923, P18804, P20879, P35965, P49750, Q0P670, Q12857, Q1RMX6, Q32LN6, Q32MG2, Q3B8N5, Q3T016, Q3V0A6, Q4JK59, Q5BI31, Q5T035, Q5ZKH6, Q642A3, Q6AXV6, Q6IMN6, Q6P1W5, Q6PEX7, Q6X4T0, Q80YD3, Q86UF4, Q8BII1, Q8C5V0

Diamond homologs: P29617, P34522, P48437, Q3B8N5, Q8BII1, Q91018, Q92786, Q9U6A1

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

57 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance49
Likely benign2
Benign3

Top pathogenic / likely-pathogenic (0)

SpliceAI

601 predictions. Top by Δscore:

VariantEffectΔscore
14:74855322:C:CTacceptor_gain0.9900
14:74855322:C:Tacceptor_gain0.9900
14:74856992:TGAA:Tacceptor_gain0.9900
14:74856996:C:CCacceptor_gain0.9900
14:74863408:G:Cdonor_gain0.9900
14:74856795:TCTTA:Tdonor_loss0.9800
14:74856796:CTTA:Cdonor_loss0.9800
14:74856797:TTA:Tdonor_loss0.9800
14:74856798:TACCT:Tdonor_loss0.9800
14:74856799:A:Cdonor_loss0.9800
14:74856800:C:CAdonor_loss0.9800
14:74856991:TTGAA:Tacceptor_gain0.9800
14:74863153:T:Adonor_gain0.9800
14:74856794:GTCTT:Gdonor_loss0.9700
14:74856993:GAA:Gacceptor_gain0.9700
14:74856995:ACT:Aacceptor_loss0.9500
14:74856996:C:Aacceptor_loss0.9500
14:74857006:A:Tacceptor_loss0.9500
14:74855312:C:CTacceptor_gain0.9400
14:74855212:T:TAdonor_gain0.9300
14:74856998:G:Cacceptor_gain0.9300
14:74856994:AA:Aacceptor_gain0.9200
14:74856814:C:Adonor_gain0.9100
14:74863363:C:Adonor_gain0.9100
14:74863343:T:TAdonor_gain0.9000
14:74863456:T:TAdonor_gain0.9000
14:74863138:C:Adonor_gain0.8900
14:74857002:A:Tacceptor_loss0.8800
14:74855313:A:Tacceptor_gain0.8700
14:74857001:CAAA:Cacceptor_loss0.8700

AlphaMissense

3870 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
14:74855210:C:AK567N0.997
14:74855210:C:GK567N0.997
14:74855215:A:GW566R0.997
14:74855215:A:TW566R0.997
14:74856939:A:CF490L0.997
14:74856939:A:TF490L0.997
14:74856941:A:GF490L0.997
14:74856954:G:CS485R0.997
14:74856954:G:TS485R0.997
14:74856956:T:GS485R0.997
14:74856962:A:GW483R0.997
14:74856962:A:TW483R0.997
14:74856913:G:TA499D0.996
14:74856947:G:TR488S0.996
14:74856957:G:CF484L0.996
14:74856957:G:TF484L0.996
14:74856959:A:GF484L0.996
14:74855255:G:CF552L0.995
14:74855255:G:TF552L0.995
14:74855257:A:GF552L0.995
14:74856946:C:GR488P0.995
14:74856963:C:AK482N0.995
14:74856963:C:GK482N0.995
14:74858464:A:CF452L0.995
14:74858464:A:TF452L0.995
14:74858466:A:GF452L0.995
14:74855213:C:AW566C0.994
14:74855213:C:GW566C0.994
14:74856965:T:CK482E0.994
14:74858473:T:AK449N0.994

dbSNP variants (sampled 300 via entrez): RS1000042726 (14:74858335 G>A), RS1000114281 (14:74868745 A>G), RS1000174504 (14:74871340 TAAA>T,TAA,TAAAA), RS1000215899 (14:74865172 A>G), RS1000232943 (14:74864301 C>T), RS1000436204 (14:74858145 C>A,G), RS1000490074 (14:74872020 A>T), RS1000510905 (14:74869831 T>G), RS1000544990 (14:74869547 G>A), RS1000600751 (14:74856309 G>A), RS1000610145 (14:74870993 G>A,T), RS1000686958 (14:74858006 C>T), RS1000720491 (14:74856692 C>G,T), RS1000777150 (14:74863363 C>T), RS1000801306 (14:74856376 G>A)

Disease associations

OMIM: gene MIM:615094 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

8 associations (top):

StudyTraitp-value
GCST005237_3Mood instability1.000000e-06
GCST005238_3Mood instability3.000000e-09
GCST005839_15Depression4.000000e-09
GCST006697_23Parental longevity (combined parental attained age, Martingale residuals)2.000000e-08
GCST006701_14Parental longevity (father’s attained age)2.000000e-07
GCST009145_7Total cholesterol levels1.000000e-07
GCST90000514_22Gastroesophageal reflux disease2.000000e-09
GCST90002393_488Monocyte count8.000000e-17

EFO canonical traits (4, from GWAS)

EFO IDTrait name
EFO:0008475mood instability measurement
EFO:0007796parental longevity
EFO:0004574total cholesterol measurement
EFO:0005091monocyte count

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

10 total (human), top 10 by PubMed support.

ChemicalActions (top 5)PubMed papers
aristolochic acid Iincreases expression1
triphenyl phosphateaffects expression1
pirinixic acidaffects binding, decreases expression, increases activity1
bisphenol Aaffects cotreatment, decreases methylation1
Fulvestrantaffects cotreatment, decreases methylation1
Benzo(a)pyreneaffects methylation1
Malathiondecreases expression1
Phthalic Acidsincreases methylation1
Tobacco Smoke Pollutiondecreases expression1
Cyclosporineincreases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): gastroesophageal reflux disease