PROB1

gene
On this page

Summary

PROB1 (proline rich basic protein 1, HGNC:41906) is a protein-coding gene on chromosome 5q31.2, encoding Proline-rich basic protein 1 (E7EW31).

Located in nucleoplasm.

Source: NCBI Gene 389333 — RefSeq curated summary.

At a glance

  • GWAS associations: 8
  • Clinical variants (ClinVar): 113 total
  • MANE Select transcript: NM_001161546

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:41906
Approved symbolPROB1
Nameproline rich basic protein 1
Location5q31.2
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000228672
Ensembl biotypeprotein_coding
Entrez389333

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000434752

RefSeq mRNA: 1 — MANE Select: NM_001161546 NM_001161546

CCDS: CCDS54909

Canonical transcript exons

ENST00000434752 — 1 exons

ExonStartEnd
ENSE00001788179139390592139395104

Expression profiles

Bgee: expression breadth ubiquitous, 130 present calls, max score 98.35.

FANTOM5 (CAGE): breadth broad, TPM avg 0.9639 / max 51.6302, expressed in 387 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
637500.6724299
637510.189783
637480.092029
637490.00984

Top tissues by expression

138 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
quadriceps femorisUBERON:000137798.35silver quality
gastrocnemiusUBERON:000138893.70gold quality
hindlimb stylopod muscleUBERON:000425292.82gold quality
muscle of legUBERON:000138392.17gold quality
skeletal muscle tissueUBERON:000113492.05gold quality
apex of heartUBERON:000209891.91gold quality
thymusUBERON:000237090.15gold quality
heart left ventricleUBERON:000208488.16gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047383.45gold quality
heartUBERON:000094882.20gold quality
muscle tissueUBERON:000238581.85gold quality
right atrium auricular regionUBERON:000663181.17gold quality
left adrenal glandUBERON:000123473.80gold quality
left adrenal gland cortexUBERON:003582573.52gold quality
right adrenal glandUBERON:000123373.17gold quality
adrenal glandUBERON:000236972.55gold quality
right adrenal gland cortexUBERON:003582772.51gold quality
stromal cell of endometriumCL:000225570.57gold quality
lower esophagus muscularis layerUBERON:003583370.53gold quality
lower esophagusUBERON:001347370.43gold quality
right uterine tubeUBERON:000130270.17gold quality
mucosa of transverse colonUBERON:000499169.55gold quality
prostate glandUBERON:000236768.02gold quality
adrenal tissueUBERON:001830367.92gold quality
esophagogastric junction muscularis propriaUBERON:003584167.75gold quality
cerebellar vermisUBERON:000472067.71gold quality
adenohypophysisUBERON:000219666.41gold quality
pituitary glandUBERON:000000766.30gold quality
amygdalaUBERON:000187665.76gold quality
temporal lobeUBERON:000187165.72gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no1.77

Regulation

Is transcription factor: no

Literature-anchored findings (GeneRIF, showing 1)

  • Segregation analysis revealed that variants c.475T>G in SKP1, c.671G>A in PROB1, and c.527G>A in IL17B in the 5q31.1-q35.3 linkage region, and c.850G>A in HKDC1 in the 10q22 locus completely segregated with the phenotype in the studied Keratoconus family (PMID:27703147)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusProb1ENSMUSG00000073600
rattus_norvegicusProb1ENSRNOG00000039596

Protein

Protein identifiers

Proline-rich basic protein 1E7EW31 (reviewed: E7EW31)

All UniProt accessions (1): E7EW31

UniProt curated annotations — full annotation on UniProt →

RefSeq proteins (1): NP_001155018* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR027838DUF4585Domain
IPR052303CEFIPFamily

Pfam: PF15232

UniProt features (25 total): compositionally biased region 16, region of interest 7, chain 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-E7EW31-F144.630.00

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 16 (showing top): chr5q31, FOURATI_BLOOD_TWINRIX_AGE_25_83YO_RESPONDERS_VS_POOR_RESPONDERS_0DY_DN, HARALAMBIEVA_PBMC_M_M_R_II_AGE_11_22YO_VACCINATED_VS_UNVACCINATED_7YR_UP, GENES_CORRELATED_WITH_MYC_DELETION, GSE8835_HEALTHY_VS_CLL_CD8_TCELL_UP, GSE2585_CD80_HIGH_VS_LOW_MTEC_UP, GSE8921_3H_VS_24H_TLR1_2_STIM_MONOCYTE_DN, GSE7831_1H_VS_4H_INFLUENZA_STIM_PDC_DN, GSE19888_ADENOSINE_A3R_INH_VS_INH_PRETREAT_AND_ACT_WITH_TCELL_MEMBRANES_MAST_CELL_DN, GSE24210_TCONV_VS_TREG_UP, GSE24972_WT_VS_IRF8_KO_MARGINAL_ZONE_SPLEEN_BCELL_DN, GSE36078_UNTREATED_VS_AD5_T425A_HEXON_INF_IL1R_KO_MOUSE_LUNG_DC_UP, GSE11961_MARGINAL_ZONE_BCELL_VS_GERMINAL_CENTER_BCELL_DAY7_DN, GSE43863_TH1_VS_TFH_EFFECTOR_CD4_TCELL_UP, GSE46606_IRF4MID_VS_WT_CD40L_IL2_IL5_DAY1_STIMULATED_BCELL_DN

GO Biological Process (0):

GO Molecular Function (0):

GO Cellular Component (1): nucleoplasm (GO:0005654)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
nuclear lumen1
cellular anatomical structure1

Protein interactions and networks

STRING

254 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
PROB1SMIM33A0A1B0GW64571
PROB1SPATA24Q86W54540
PROB1JADE2Q9NQC1469
PROB1PLEKHH3Q7Z736458
PROB1NAGSQ8N159412
PROB1ECSCRQ19T08403
PROB1PHF23Q9BUL5401
PROB1DNAJC18Q9H819395
PROB1DRAM1Q8N682352
PROB1DENND5BQ6ZUT9334
PROB1NTN5Q8WTR8307
PROB1PRDM7Q9NQW5305
PROB1CYYR1Q96J86300
PROB1VWA7Q9Y334297
PROB1FAM171A1Q5VUB5297

IntAct

0 interactions, top by confidence:

BioGRID (2): PROB1 (Affinity Capture-MS), PROB1 (Affinity Capture-RNA)

ESM2 similar proteins: A0A0J9YXV3, A0A172M4N0, A2VE23, A5PL33, C7EMF5, E7EW31, F1NSM7, I3L273, O15027, O48582, O55189, O55196, O97939, P0C671, P0DV77, P14138, Q14D33, Q1XI13, Q28989, Q3B7M4, Q4R729, Q5R7U0, Q5SWP3, Q62840, Q63003, Q6E0U4, Q6H236, Q6NUN9, Q6UXA7, Q7Z2K8, Q86UU5, Q8BM15, Q8K4E0, Q8K4L6, Q8N1P7, Q8N3D4, Q96D09, Q96JG9, Q9BGL9, Q9D7G9

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

113 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance105
Likely benign5
Benign3

Top pathogenic / likely-pathogenic (0)

SpliceAI

35 predictions. Top by Δscore:

VariantEffectΔscore
5:139393028:T:TGacceptor_gain0.6900
5:139393029:T:Aacceptor_gain0.6800
5:139391847:T:TAdonor_gain0.5200
5:139393030:G:GAacceptor_gain0.4400
5:139393031:A:AAacceptor_gain0.4400
5:139393027:CTTGA:Cacceptor_gain0.3900
5:139393032:T:Aacceptor_gain0.3900
5:139393045:G:GTacceptor_gain0.3200
5:139393046:T:TTacceptor_gain0.3200
5:139393022:AAAT:Aacceptor_gain0.3100
5:139390827:C:Aacceptor_gain0.3000
5:139390757:C:CTacceptor_gain0.2800
5:139391816:A:Cdonor_gain0.2800
5:139393181:G:Cdonor_gain0.2700
5:139394197:AG:Adonor_gain0.2700
5:139393023:AAT:Aacceptor_gain0.2600
5:139393026:CCT:Cacceptor_gain0.2600
5:139393028:T:Cacceptor_gain0.2500
5:139393188:TC:Tdonor_gain0.2500
5:139394936:T:TAdonor_gain0.2400
5:139390825:TCCC:Tacceptor_gain0.2300
5:139390826:CCCC:Cacceptor_gain0.2300
5:139393021:AAAAT:Aacceptor_gain0.2300
5:139393024:AT:Aacceptor_gain0.2300
5:139393952:CA:Cdonor_gain0.2300
5:139394948:T:TAdonor_gain0.2300
5:139393038:C:Aacceptor_gain0.2200
5:139393144:AGAG:Adonor_gain0.2200
5:139391815:A:ACdonor_gain0.2100
5:139393025:T:Aacceptor_gain0.2100

AlphaMissense

6376 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
5:139394461:C:AK207N0.992
5:139394461:C:GK207N0.992
5:139392362:T:CD907G0.991
5:139394460:G:TR208S0.987
5:139392408:A:CY892D0.985
5:139394065:A:CS339R0.985
5:139394065:A:TS339R0.985
5:139394067:T:GS339R0.985
5:139394581:C:AW167C0.985
5:139394581:C:GW167C0.985
5:139392434:A:GL883S0.984
5:139393186:C:AK632N0.983
5:139393186:C:GK632N0.983
5:139394737:G:CF115L0.982
5:139394737:G:TF115L0.982
5:139394739:A:GF115L0.982
5:139394059:G:CS341R0.980
5:139394059:G:TS341R0.980
5:139394061:T:GS341R0.980
5:139392345:A:CY913D0.979
5:139392350:C:AG911V0.979
5:139392368:A:GL905P0.979
5:139392364:G:CF906L0.978
5:139392364:G:TF906L0.978
5:139392366:A:GF906L0.978
5:139392428:T:CD885G0.977
5:139394053:G:CF343L0.977
5:139394053:G:TF343L0.977
5:139394055:A:GF343L0.977
5:139394583:A:GW167R0.977

dbSNP variants (sampled 300 via entrez): RS1000037209 (5:139395931 A>G), RS1000257218 (5:139393599 T>G), RS1000348803 (5:139393322 G>A,T), RS1000518419 (5:139394934 C>T), RS1000591429 (5:139394856 G>A,C), RS1000686286 (5:139394636 G>A), RS1001114950 (5:139393638 G>C,T), RS1001231277 (5:139394346 C>T), RS1001568333 (5:139392929 C>T), RS1001682220 (5:139394604 C>T), RS1001898730 (5:139394320 C>G,T), RS1001914949 (5:139394060 C>A,G), RS1002575849 (5:139391629 G>A), RS1003236811 (5:139392040 C>A,G), RS1003698597 (5:139392292 G>C)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

8 associations (top):

StudyTraitp-value
GCST005951_151Body mass index6.000000e-07
GCST010725_68Malaria4.000000e-07
GCST010725_7Malaria2.000000e-06
GCST010796_3063Electrocardiogram morphology (amplitude at temporal datapoints)1.000000e-09
GCST010796_3064Electrocardiogram morphology (amplitude at temporal datapoints)1.000000e-12
GCST010796_3065Electrocardiogram morphology (amplitude at temporal datapoints)7.000000e-14
GCST010796_3066Electrocardiogram morphology (amplitude at temporal datapoints)1.000000e-14
GCST012101_11Hypertrophic cardiomyopathy6.000000e-08

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0004340body mass index
EFO:0004327electrocardiography

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

17 total (human), top 17 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, decreases expression, increases methylation2
aristolochic acid Iincreases expression1
GSK-J4decreases expression1
triphenyl phosphateaffects expression1
di-n-butylphosphoric acidaffects expression1
jinfukangaffects cotreatment, decreases expression1
Sunitinibdecreases expression1
Cisplatinaffects cotreatment, decreases expression1
Doxorubicindecreases expression1
Smokedecreases expression1
Testosteronedecreases expression1
Tobacco Smoke Pollutiondecreases expression1
Triclosandecreases expression1
Valproic Acidincreases methylation1
1-Methyl-4-phenylpyridiniumincreases expression1
Aflatoxin B1decreases expression1
Okadaic Aciddecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.