EPPIN

gene
On this page

Also known as EPPIN1EPPIN2EPPIN3dJ461P17.2WAP7WFDC7CT71

Summary

EPPIN (epididymal peptidase inhibitor, HGNC:15932) is a protein-coding gene on chromosome 20q13.12, encoding Eppin (O95925). Serine protease inhibitor that plays an essential role in male reproduction and fertility.

This gene encodes an epididymal protease inhibitor, which contains both kunitz-type and WAP-type four-disulfide core (WFDC) protease inhibitor consensus sequences. Most WFDC genes are localized to chromosome 20q12-q13 in two clusters: centromeric and telomeric. This gene is a member of the WFDC gene family and belongs to the telomeric cluster. The protein can inhibit human sperm motility and exhibits antimicrobial activity against E. coli, and polymorphisms in this gene are associated with male infertility. Read-through transcription also exists between this gene and the downstream WFDC6 (WAP four-disulfide core domain 6) gene. Two transcript variants encoding different isoforms have been found for this gene.

Source: NCBI Gene 57119 — RefSeq curated summary.

At a glance

  • Clinical variants (ClinVar): 1 total
  • MANE Select transcript: NM_020398

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:15932
Approved symbolEPPIN
Nameepididymal peptidase inhibitor
Location20q13.12
Locus typegene with protein product
StatusApproved
AliasesEPPIN1, EPPIN2, EPPIN3, dJ461P17.2, WAP7, WFDC7, CT71
Ensembl geneENSG00000101448
Ensembl biotypeprotein_coding
OMIM609031
Entrez57119

Gene structure

Transcript identifiers

Ensembl transcripts: 4 — 3 protein_coding, 1 retained_intron

ENST00000336443, ENST00000354280, ENST00000409554, ENST00000496898

RefSeq mRNA: 2 — MANE Select: NM_020398 NM_001302861, NM_020398

CCDS: CCDS13359

Canonical transcript exons

ENST00000354280 — 4 exons

ExonStartEnd
ENSE000012908794554062645542154
ENSE000018853254554726745547401
ENSE000035608634554270045542867
ENSE000036767144554563945545770

Expression profiles

Bgee: expression breadth broad, 96 present calls, max score 96.98.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0115 / max 8.8046, expressed in 3 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
1874520.26675
1874530.02034
1874550.00733
1874560.00422

Top tissues by expression

218 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right testisUBERON:000453496.98gold quality
corpus epididymisUBERON:000435996.22gold quality
left testisUBERON:000453396.04gold quality
buccal mucosa cellCL:000233694.32gold quality
testisUBERON:000047393.50gold quality
adult organismUBERON:000702393.44gold quality
epithelium of bronchusUBERON:000203185.43gold quality
bronchusUBERON:000218584.28gold quality
bronchial epithelial cellCL:000232883.95gold quality
cauda epididymisUBERON:000436082.19gold quality
mucosa of paranasal sinusUBERON:000503081.84gold quality
olfactory segment of nasal mucosaUBERON:000538680.63gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099179.20silver quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047377.91gold quality
right uterine tubeUBERON:000130277.32gold quality
nasal cavity epitheliumUBERON:000538473.79gold quality
caput epididymisUBERON:000435868.97gold quality
nasal cavity mucosaUBERON:000182666.50gold quality
epithelium of nasopharynxUBERON:000195165.77silver quality
male germ cellCL:000001563.17silver quality
spermCL:000001959.78silver quality
oviduct epitheliumUBERON:000480457.78silver quality
upper leg skinUBERON:000426257.36gold quality
fallopian tubeUBERON:000388956.90gold quality
tibialis anteriorUBERON:000138556.77silver quality
ileal mucosaUBERON:000033156.36silver quality
left lobe of thyroid glandUBERON:000112055.16gold quality
right lungUBERON:000216754.88gold quality
cranial nerve IIUBERON:000094154.73gold quality
thyroid glandUBERON:000204653.72gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-GEOD-134144yes487.60
E-ANND-3yes9.08

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): ATRX, ESR1

miRNA regulators (miRDB)

31 targeting EPPIN, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3134100.0066.43777
HSA-MIR-450099.9972.722367
HSA-MIR-453499.9966.581907
HSA-MIR-453199.9969.703181
HSA-LET-7A-5P99.9872.291790
HSA-LET-7B-5P99.9872.311790
HSA-LET-7C-5P99.9872.291790
HSA-LET-7E-5P99.9872.291790
HSA-LET-7F-5P99.9872.561784
HSA-LET-7G-5P99.9872.371784
HSA-LET-7I-5P99.9872.371788
HSA-MIR-98-5P99.9872.331787
HSA-MIR-314899.9775.066478
HSA-LET-7D-5P99.9671.761632
HSA-MIR-445899.9671.641650
HSA-MIR-182599.7268.111089
HSA-MIR-875-3P99.6369.472548
HSA-MIR-671-5P99.5267.111277
HSA-MIR-766-5P99.4767.912225
HSA-MIR-442699.1766.741949
HSA-MIR-6868-5P99.0665.691284
HSA-MIR-4738-3P98.9867.981846
HSA-MIR-10526-3P97.8664.971342
HSA-MIR-431497.5067.301369
HSA-MIR-3192-5P96.9865.761926
HSA-MIR-127096.9466.65931
HSA-MIR-62096.9466.79888
HSA-MIR-1212896.6766.981471
HSA-MIR-1236-5P96.6266.38856
HSA-MIR-5586-3P95.5167.00805

Literature-anchored findings (GeneRIF, showing 14)

  • This antimicrobial peptide permeabilized outer and inner membranes of E. coli, consistent with its ability to induce striking morphological alterations of E. coli membranes. Has role in innate immune system of human epididymis. (PMID:15229136)
  • in seminal plasma and on spermatozoa following ejaculation, Eppin is bound to semenogelin I (PMID:15590901)
  • although both of eppin’s domains are involved in the protein’s antibacterial activity, only the Kunitz domain is required for selective protease inhibition (PMID:18331357)
  • Eppin protects a 15-kDa fragment of semenogelin from hydrolysis by prostate specific antigen (PMID:18645681)
  • Eppin can modulate intracellular calcium concentrations and subsequently affect the calcium ionophore A23187-induced acrosome reaction. (PMID:19801569)
  • Cysteine 239 of rSEMG1 appears to be the critical amino acid for both binding to eppin and inhibiting sperm motility. (PMID:19889947)
  • Genetic variants in the Eppin gene may be associated with semen quality. (PMID:20158997)
  • Different variants in the EPPIN gene are related to idiopathic male infertility and men carrying these variants have a decreased or increased risk of abnormal semen parameters associated with male infertility. (PMID:20488803)
  • Letter: Eppin overexpression in prostate cancer correlates with favorable prognosis. (PMID:21392882)
  • The expression of mouse EPPIN protein was detected in A1-A4 spermatogonia and the expression of human EPPIN was detected in type A spermatogonia as well as in the earliest preleptotene spermatocytes. (PMID:21461566)
  • EPPIN’s semenogelin I binding site: a contraceptive drug target. (PMID:22699487)
  • EPPIN and SEMG1 rapidly co-evolved in primates (PMID:24312623)
  • Eppin and Fibronectin bind to each other in human semen and on human ejaculated spermatozoa. (PMID:24358212)
  • eppin plays a role outside of the reproductive tract; eppin may have a role in the innate immune response in the lung (PMID:28705940)

Cross-species orthologs

16 orthologs

OrganismSymbolGene ID
danio_reriospint1bENSDARG00000012467
danio_reriolrp11ENSDARG00000034076
danio_reriowfikkn1ENSDARG00000101364
danio_reriospint1aENSDARG00000102332
mus_musculusEppinENSMUSG00000017733
rattus_norvegicusEppinENSRNOG00000028908
drosophila_melanogasterCG7565FBGN0035833
caenorhabditis_elegansWBGENE00008304
caenorhabditis_elegansWBGENE00008449
caenorhabditis_elegansWBGENE00009386
caenorhabditis_elegansWBGENE00010792
caenorhabditis_elegansWBGENE00012186
caenorhabditis_elegansWBGENE00012814
caenorhabditis_elegansWBGENE00015355
caenorhabditis_elegansWBGENE00020648
caenorhabditis_elegansWBGENE00021939

Paralogs (13): TFPI (ENSG00000003436), TFPI2 (ENSG00000105825), AMBP (ENSG00000106927), LRP11 (ENSG00000120256), WFIKKN1 (ENSG00000127578), KIAA0319 (ENSG00000137261), KIAA0319L (ENSG00000142687), SPINT4 (ENSG00000149651), WFDC8 (ENSG00000158901), SPINT1 (ENSG00000166145), SPINT2 (ENSG00000167642), WFIKKN2 (ENSG00000173714), WFDC6 (ENSG00000243543)

Protein

Protein identifiers

EppinO95925 (reviewed: O95925)

Alternative names: Cancer/testis antigen 71, Epididymal protease inhibitor, Protease inhibitor WAP7, Serine protease inhibitor-like with Kunitz and WAP domains 1, WAP four-disulfide core domain protein 7

All UniProt accessions (3): O95925, A0A384NYB9, B7ZBA9

UniProt curated annotations — full annotation on UniProt →

Function. Serine protease inhibitor that plays an essential role in male reproduction and fertility. Modulates the hydrolysis of SEMG1 by KLK3/PSA (a serine protease), provides antimicrobial protection for spermatozoa in the ejaculate coagulum, and binds SEMG1 thereby inhibiting sperm motility.

Subunit / interactions. Monomer. Homodimer. Homomultimers. Interacts with SEMG1 (via 164-283 AA). Interacts with LTF. Found in a complex with LTF, CLU, EPPIN and SEMG1.

Subcellular location. Secreted. Cell surface.

Tissue specificity. In testis, expressed and secreted by Sertoli cells, appearing on the surface of testicular and ejaculate spermatozoa. Expressed in the spermatogonia and the earliest preleptotene spermatocytes. In the epididymis, is expressed and secreted by epithelial cells and covers the surface of epididymal spermatozoa and ciliated epithelial cells (at protein level). Expressed specifically in epididymis and testis. Isoform 2 is expressed only in the epididymis. Weak expression is detected in myoid cells as well as spermatogenic cells.

Domain organisation. The BPTI/Kunitz inhibitor domain is required for elastase inhibitory activity. BPTI/Kunitz inhibitor and WAP domains are involved in the protein antibacterial activity.

Miscellaneous. Might be used as a target for male contraception. Lacks a cleavable signal sequence. Based on a readthrough transcript which may produce a EPPIN-WFDC6 fusion protein.

Isoforms (3)

UniProt IDNamesCanonical?
O95925-11yes
O95925-22, Eppin-2
O95925-33

RefSeq proteins (2): NP_001289790, NP_065131* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR002223Kunitz_BPTIDomain
IPR008197WAP_domDomain
IPR020901Prtase_inh_Kunz-CSConserved_site
IPR036645Elafin-like_sfHomologous_superfamily
IPR036880Kunitz_BPTI_sfHomologous_superfamily
IPR051388Serpin_venom_toxinFamily

Pfam: PF00014, PF00095

UniProt features (24 total): disulfide bond 7, mutagenesis site 7, splice variant 2, sequence variant 2, domain 2, region of interest 2, signal peptide 1, chain 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O95925-F189.870.69

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Disulfide bonds (7): 77–127, 86–110, 102–123, 33–61, 40–65, 48–60, 54–69

Mutagenesis-validated functional residues (7):

PositionPhenotype
87loss of effect on klk3 activity.
102reduces the binding to semg1 by 45%.
107reduces the binding to semg1 by 68%.
110does not affect the binding of semg1 or ltf. does not affect the binding of semg1; when associated with a-123 and a-127.
117reduces the binding to semg1 by 68% and to ltf by 73%.
123does not affect the binding of semg1 or ltf. does not affect the binding of semg1; when associated with a-110 and a-127.
127does not affect the binding of semg1 or ltf. does not affect the binding of semg1; when associated with a-110 and a-123.

Function

Pathways and Gene Ontology

Reactome pathways

1 pathways

IDPathway
R-HSA-6803157Antimicrobial peptides

MSigDB gene sets: 116 (showing top): GSE45365_HEALTHY_VS_MCMV_INFECTION_CD8_TCELL_IFNAR_KO_UP, GOBP_NEGATIVE_REGULATION_OF_REPRODUCTIVE_PROCESS, GOBP_NEGATIVE_REGULATION_OF_PROTEOLYSIS, REACTOME_INNATE_IMMUNE_SYSTEM, GOBP_REGULATION_OF_MICROTUBULE_BASED_PROCESS, GOCC_SECRETORY_GRANULE, GOBP_NEGATIVE_REGULATION_OF_HYDROLASE_ACTIVITY, GOCC_CELL_SURFACE, GOBP_MONOATOMIC_CATION_TRANSPORT, FERREIRA_EWINGS_SARCOMA_UNSTABLE_VS_STABLE_DN, GOBP_REGULATION_OF_HYDROLASE_ACTIVITY, GOBP_NEGATIVE_REGULATION_OF_TRANSPORT, GOBP_CALCIUM_ION_IMPORT, SENGUPTA_NASOPHARYNGEAL_CARCINOMA_DN, GOBP_NEGATIVE_REGULATION_OF_CALCIUM_ION_IMPORT

GO Biological Process (4): negative regulation of peptidase activity (GO:0010466), defense response to bacterium (GO:0042742), negative regulation of calcium ion import (GO:0090281), negative regulation of flagellated sperm motility (GO:1901318)

GO Molecular Function (3): serine-type endopeptidase inhibitor activity (GO:0004867), protein binding (GO:0005515), peptidase inhibitor activity (GO:0030414)

GO Cellular Component (7): acrosomal vesicle (GO:0001669), obsolete extracellular space (GO:0005615), cell surface (GO:0009986), protein-containing complex (GO:0032991), sperm plasma membrane (GO:0097524), extracellular region (GO:0005576), cytoplasm (GO:0005737)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
Innate Immune System1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure3
peptidase activity2
negative regulation of proteolysis1
negative regulation of hydrolase activity1
regulation of peptidase activity1
defense response1
response to bacterium1
negative regulation of calcium ion transport1
calcium ion import1
regulation of calcium ion import1
negative regulation of cilium movement1
flagellated sperm motility1
regulation of flagellated sperm motility1
negative regulation of cilium-dependent cell motility1
negative regulation of reproductive process1
serine-type endopeptidase activity1
endopeptidase inhibitor activity1
binding1
enzyme inhibitor activity1
peptidase regulator activity1
secretory granule1
cellular_component1
plasma membrane1
intracellular anatomical structure1

Protein interactions and networks

STRING

864 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
EPPINSEMG1P04279755
EPPINSPINT4Q6UDR6717
EPPINETV4P43268714
EPPINWFDC10AQ9H1F0611
EPPINGATA1P15976548
EPPINDEFB126Q9BYW3541
EPPINCT55Q8WUE5506
EPPINCPXCR1Q8N123479
EPPINKLK3P07288479
EPPINTULP2O00295478
EPPINWFDC3Q8IUB2437
EPPINWFDC9Q8NEX5435
EPPINCRISP2P16562434
EPPINJUNP05412411
EPPINSEMG2Q02383403

IntAct

4 interactions, top by confidence:

ABTypeScore
EPPINUMPSpsi-mi:“MI:0914”(association)0.530
EPPINHSPA5psi-mi:“MI:0914”(association)0.350
EPPINpsi-mi:“MI:0915”(physical association)0.000

BioGRID (10): LTF (Co-fractionation), CLU (Co-fractionation), EPPIN (Affinity Capture-Western), LTF (Far Western), CLU (Far Western), UMPS (Affinity Capture-MS), HSPA5 (Affinity Capture-MS), UMPS (Affinity Capture-MS), ADAM9 (Affinity Capture-MS), EPPIN (Protein-RNA)

ESM2 similar proteins: A4K2P0, A5X2X1, A6MFL2, A7X3V4, A7X3V7, A7X4K1, A8Y7N4, A8Y7N7, A8Y7N8, A8Y7N9, A8Y7P5, B2BS84, B2KTG1, B5KF95, B5KF96, B5KL34, B5KL36, B5L5Q8, B7S4N9, D4A2Z2, E5AJX3, F6ULY1, H6VC05, I2G9B4, O35536, O43278, O43291, O95925, P00974, P00991, P00992, P01173, P01174, P04815, Q08E66, Q29100, Q2ES46, Q3UW55, Q4KUS1, Q6T6S5

Diamond homologs: A0A6P8HC43, A5X2X1, A6MFL4, A7X3V4, B2BS84, B2KTG1, B5G6G6, B5KF95, B5KF96, B5KL36, B5L5M7, B5L5R0, B6ZIW0, B7S4N9, C0HJU6, C0HLB2, C0HMC7, C1IC50, C1IC53, F6ULY1, F8J2F3, H2A0P0, O43291, O54819, O95925, P00974, P00976, P00978, P00993, P00994, P02760, P04365, P04366, P04815, P05067, P08592, P0C8W3, P0DN06, P0DN07, P0DN08

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

1 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance1
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

491 predictions. Top by Δscore:

VariantEffectΔscore
20:45542769:C:CTacceptor_gain1.0000
20:45547263:TTA:Tdonor_loss1.0000
20:45547264:TAC:Tdonor_loss1.0000
20:45547265:A:ACdonor_gain1.0000
20:45547265:ACTGG:Adonor_loss1.0000
20:45547266:C:CGdonor_gain1.0000
20:45547266:C:CTdonor_gain1.0000
20:45547266:CT:Cdonor_gain1.0000
20:45547266:CTG:Cdonor_gain1.0000
20:45547266:CTGG:Cdonor_gain1.0000
20:45547266:CTGGG:Cdonor_gain1.0000
20:45547261:ACTT:Adonor_loss0.9900
20:45547262:CTTA:Cdonor_gain0.9900
20:45541386:G:Cacceptor_gain0.9800
20:45546402:T:TCacceptor_gain0.9800
20:45547296:C:Adonor_gain0.9700
20:45545633:TATTA:Tdonor_loss0.9600
20:45545634:ATTAC:Adonor_loss0.9600
20:45545635:TTACC:Tdonor_loss0.9600
20:45545636:TACCT:Tdonor_loss0.9600
20:45545637:ACC:Adonor_loss0.9600
20:45545637:ACCTT:Adonor_loss0.9600
20:45545638:C:Adonor_loss0.9600
20:45545769:CC:Cacceptor_gain0.9600
20:45545770:CC:Cacceptor_gain0.9600
20:45547261:A:ACdonor_gain0.9600
20:45547262:C:CCdonor_gain0.9600
20:45542814:A:Tacceptor_gain0.9500
20:45545687:T:TAdonor_gain0.9500
20:45545767:CTCC:Cacceptor_gain0.9500

AlphaMissense

889 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:45542776:A:CF105L0.973
20:45542776:A:TF105L0.973
20:45542778:A:GF105L0.973
20:45542740:G:CF117L0.972
20:45542740:G:TF117L0.972
20:45542742:A:GF117L0.972
20:45542777:A:CF105C0.963
20:45542741:A:CF117C0.954
20:45545673:G:CF63L0.951
20:45545673:G:TF63L0.951
20:45545675:A:GF63L0.951
20:45545674:A:CF63C0.930
20:45542762:C:GC110S0.922
20:45542763:A:TC110S0.922
20:45542723:C:GC123S0.905
20:45542724:A:TC123S0.905
20:45542861:C:GC77S0.905
20:45542862:A:TC77S0.905
20:45545656:C:GC69S0.903
20:45545657:A:TC69S0.903
20:45545719:C:GC48S0.902
20:45545720:A:TC48S0.902
20:45542741:A:GF117S0.901
20:45545701:C:GC54S0.901
20:45545702:A:TC54S0.901
20:45542747:T:AN115I0.893
20:45542786:C:GC102S0.889
20:45542787:A:TC102S0.889
20:45542834:C:GC86S0.886
20:45542835:A:TC86S0.886

dbSNP variants (sampled 300 via entrez): RS1000113555 (20:45542069 C>T), RS1000310170 (20:45546404 G>T), RS1000312325 (20:45547619 G>A,T), RS1000429870 (20:45546738 A>G), RS1001416692 (20:45541372 A>G), RS1001947251 (20:45547171 T>A,C), RS1002336740 (20:45546077 G>C), RS1002953006 (20:45543679 T>G), RS1003013862 (20:45548699 T>C), RS1003310675 (20:45545089 GTAT>G), RS1003710659 (20:45542646 G>C), RS1003824274 (20:45546290 G>T), RS1004553631 (20:45544159 G>A), RS1004985522 (20:45545812 A>C,G), RS1005820510 (20:45545582 G>A)

Disease associations

OMIM: gene MIM:609031 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

0 associations (top):

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

11 total (human), top 11 by PubMed support.

ChemicalActions (top 5)PubMed papers
CGP 52608increases reaction, affects binding1
Resveratrolaffects cotreatment, decreases expression1
Air Pollutantsincreases abundance, increases expression1
Benzo(a)pyreneaffects methylation, decreases methylation, increases methylation1
Dimethyl Sulfoxideaffects expression1
Plant Extractsaffects cotreatment, decreases expression1
Smokeincreases abundance, increases expression1
Tobacco Smoke Pollutiondecreases expression1
Valproic Aciddecreases methylation1
Aflatoxin B1increases methylation1
Cadmium Chloridedecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.