SEMG2

gene
On this page

Also known as SGII

Summary

SEMG2 (semenogelin 2, HGNC:10743) is a protein-coding gene on chromosome 20q13.12, encoding Semenogelin-2 (Q02383). Participates in the formation of a gel matrix (sperm coagulum) entrapping the accessory gland secretions and ejaculated spermatozoa.

The secreted protein encoded by this gene is involved in the formation of a gel matrix that encases ejaculated spermatozoa. Proteolysis by the prostate-specific antigen (PSA) breaks down the gel matrix and allows the spermatozoa to move more freely. The encoded protein is found in lesser abundance than a similar semenogelin protein. An antibacterial activity has been found for a antimicrobial peptide isolated from this protein. The genes encoding these two semenogelin proteins are found in a cluster on chromosome 20.

Source: NCBI Gene 6407 — RefSeq curated summary.

At a glance

  • GWAS associations: 3
  • Clinical variants (ClinVar): 93 total
  • MANE Select transcript: NM_003008

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:10743
Approved symbolSEMG2
Namesemenogelin 2
Location20q13.12
Locus typegene with protein product
StatusApproved
AliasesSGII
Ensembl geneENSG00000124157
Ensembl biotypeprotein_coding
OMIM182141
Entrez6407

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000372769

RefSeq mRNA: 1 — MANE Select: NM_003008 NM_003008

CCDS: CCDS13346

Canonical transcript exons

ENST00000372769 — 3 exons

ExonStartEnd
ENSE000008449214522170945223425
ENSE000008449224522429145224458
ENSE000018583114522137345221465

Expression profiles

Bgee: expression breadth broad, 45 present calls, max score 100.00.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 65.8511 / max 60755.1035, expressed in 21 samples.

FANTOM5 promoters (7 alternative TSS)

Promoter IDTPM avgSamples expressed
18482364.425518
1848280.44104
1848220.28477
1848270.22253
1848240.18513
1848250.17253
1848260.11982

Top tissues by expression

261 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
seminal vesicleUBERON:0000998100.00gold quality
spermCL:000001996.69gold quality
male germ cellCL:000001592.63gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047391.32gold quality
paraflocculusUBERON:000535162.99gold quality
frontal poleUBERON:000279562.94gold quality
middle frontal gyrusUBERON:000270262.83gold quality
endometrium epitheliumUBERON:000481162.65gold quality
olfactory segment of nasal mucosaUBERON:000538661.01gold quality
nasal cavity mucosaUBERON:000182660.04gold quality
colonic epitheliumUBERON:000039756.89gold quality
tendon of biceps brachiiUBERON:000818853.87gold quality
prostate glandUBERON:000236753.48gold quality
cerebellar vermisUBERON:000472052.81gold quality
metanephric glomerulusUBERON:000473652.07gold quality
Brodmann (1909) area 10UBERON:001354150.99gold quality
quadriceps femorisUBERON:000137750.44gold quality
vastus lateralisUBERON:000137949.93gold quality
Brodmann (1909) area 46UBERON:000648349.30gold quality
cervix squamous epitheliumUBERON:000692249.20gold quality
hair follicleUBERON:000207349.18gold quality
kidney epitheliumUBERON:000481948.93gold quality
olfactory bulbUBERON:000226448.92gold quality
myocardiumUBERON:000234948.87gold quality
type B pancreatic cellCL:000016948.83gold quality
thymusUBERON:000237048.80gold quality
metanephrosUBERON:000008148.75silver quality
oviduct epitheliumUBERON:000480448.62gold quality
cardiac muscle of right atriumUBERON:000337948.55gold quality
CA1 field of hippocampusUBERON:000388148.50gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

9 targeting SEMG2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-4747-5P100.0067.902681
HSA-MIR-5196-5P100.0067.982761
HSA-MIR-4283100.0066.422097
HSA-MIR-204-5P99.7971.622439
HSA-MIR-211-5P99.7971.652440
HSA-MIR-6832-3P99.5270.441726
HSA-MIR-133A-3P99.2771.531270
HSA-MIR-133B99.2771.531270
HSA-MIR-188-5P97.8967.01756

Literature-anchored findings (GeneRIF, showing 11)

  • SgII transcripts were demonstrated in several tissues, with the strongest signals coming from seminal vesicles, vas deferens, prostate, epididymis and trachea. (PMID:12200457)
  • Seminal plasma motility inhibitor, one of the fragments of Sg, has its inhibitory effect on ejaculated spermatozoa in liquefied semen under physiological conditions. (PMID:14581514)
  • structural changes in the semenogelin 1 and 2 proteins that have arisen since the human-chimpanzee-gorilla split may be responsible for the physiological differences between these species ejaculated semen that correlate with their sociosexual behavior (PMID:14629036)
  • The binding of Zn2+ to SgI and SgII and their involvment in regulating the activity of PSA are reported. (PMID:15563730)
  • SGII is a novel target for protein S-nitrosylation in spermatozoa. (PMID:17683036)
  • semenogelins I and II were directly cleaved by KLK14. Semenogelins were also able to reverse KLK14 inhibition by Zn2+, providing a novel regulatory mechanism for KLK14 activity. (PMID:18482984)
  • antibacterial activity of the semenogelin-derived peptides generated in seminal plasma was strictly zinc-dependent both at neutral and low pH (PMID:18714013)
  • Semenogelins (Sgs) modifies the membrane structure, indirectly inhibiting motility, and provides suggestions for a therapy for male infertility through selection of a functional sperm population using Sgs. (PMID:19089943)
  • These results suggest the involvement of semenogelins in prostate cancer and their prognostic values in predicting cancer progression after radical prostatectomy. (PMID:21557275)
  • Peptides released by physiological cleavage of Semg1 and Semg2 form amyloids that enhance HIV infection. (PMID:22177559)
  • SEMG1/2 augment energy metabolism of tumor cells. (PMID:33311447)

Cross-species orthologs

6 orthologs

OrganismSymbolGene ID
mus_musculusSvs3aENSMUSG00000017003
mus_musculusSemg1ENSMUSG00000040132
mus_musculusSvs3bENSMUSG00000050383
rattus_norvegicusSemg1ENSRNOG00000013776
rattus_norvegicusSvs3bENSRNOG00000036782
rattus_norvegicusSvs3aENSRNOG00000062737

Paralogs (1): SEMG1 (ENSG00000124233)

Protein

Protein identifiers

Semenogelin-2Q02383 (reviewed: Q02383)

Alternative names: Semenogelin II

All UniProt accessions (1): Q02383

UniProt curated annotations — full annotation on UniProt →

Function. Participates in the formation of a gel matrix (sperm coagulum) entrapping the accessory gland secretions and ejaculated spermatozoa.

Subunit / interactions. Interacts with SERPINA5.

Subcellular location. Secreted.

Tissue specificity. Seminal vesicles, and to a much lesser extent, epididymis.

Post-translational modifications. Semenogelin-2 is thought to form both the 71 kDa polypeptide and, in its glycosylated form, the 76 kDa polypeptide.

Similarity. Belongs to the semenogelin family.

RefSeq proteins (1): NP_002999* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR008836SemenogelinFamily

Pfam: PF05474

UniProt features (33 total): compositionally biased region 14, region of interest 7, sequence variant 5, repeat 4, signal peptide 1, chain 1, glycosylation site 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q02383-F130.690.00

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Glycosylation sites (1): 272

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 79 (showing top): GSE45365_HEALTHY_VS_MCMV_INFECTION_CD8_TCELL_IFNAR_KO_UP, GOBP_NEGATIVE_REGULATION_OF_REPRODUCTIVE_PROCESS, GOBP_ANTIMICROBIAL_HUMORAL_RESPONSE, GOBP_REGULATION_OF_MICROTUBULE_BASED_PROCESS, GOCC_SECRETORY_GRANULE, MODULE_151, GOBP_MALE_GAMETE_GENERATION, GOBP_SPERM_CAPACITATION, GOBP_ANATOMICAL_STRUCTURE_MATURATION, GOBP_POSITIVE_REGULATION_OF_CATALYTIC_ACTIVITY, GOBP_REGULATION_OF_HYDROLASE_ACTIVITY, GOBP_POSITIVE_REGULATION_OF_MOLECULAR_FUNCTION, GOBP_DEFENSE_RESPONSE_TO_OTHER_ORGANISM, GOBP_CELL_MATURATION, GOBP_CILIUM_MOVEMENT

GO Biological Process (5): antibacterial humoral response (GO:0019731), sperm capacitation (GO:0048240), coagulation (GO:0050817), positive regulation of serine-type endopeptidase activity (GO:1900005), negative regulation of flagellated sperm motility (GO:1901318)

GO Molecular Function (3): protease binding (GO:0002020), zinc ion binding (GO:0008270), protein binding (GO:0005515)

GO Cellular Component (5): acrosomal vesicle (GO:0001669), obsolete extracellular space (GO:0005615), nucleus (GO:0005634), extracellular exosome (GO:0070062), extracellular region (GO:0005576)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
antimicrobial humoral response1
defense response to bacterium1
developmental process involved in reproduction1
spermatid development1
cellular process involved in reproduction in multicellular organism1
cell maturation1
multicellular organismal process1
serine-type endopeptidase activity1
positive regulation of endopeptidase activity1
regulation of serine-type endopeptidase activity1
negative regulation of cilium movement1
flagellated sperm motility1
regulation of flagellated sperm motility1
negative regulation of cilium-dependent cell motility1
negative regulation of reproductive process1
enzyme binding1
transition metal ion binding1
binding1
secretory granule1
intracellular membrane-bounded organelle1
extracellular vesicle1
cellular anatomical structure1

Protein interactions and networks

STRING

536 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SEMG2WFDC12Q8WWY7871
SEMG2KLK3P07288744
SEMG2PI3P19957735
SEMG2SLPIP03973670
SEMG2ZANQ9Y493667
SEMG2SEMG1P04279640
SEMG2SERPINA5P05154597
SEMG2FN1P02751596
SEMG2KLK2P20151595
SEMG2WFDC8Q8IUA0582
SEMG2OR8U3Q8NH85582
SEMG2LGALS3BPQ08380575
SEMG2TGM4P49221570
SEMG2SPINT4Q6UDR6568
SEMG2WFDC5Q8TCV5568

IntAct

71 interactions, top by confidence:

ABTypeScore
SNTB2CASKpsi-mi:“MI:0914”(association)0.670
CD27TCAF2psi-mi:“MI:0914”(association)0.640
SCGB1D1MANBApsi-mi:“MI:0914”(association)0.640
FTH1A2ML1psi-mi:“MI:0914”(association)0.530
CNGA3C2CD2Lpsi-mi:“MI:0914”(association)0.530
SYT16DUSP14psi-mi:“MI:0914”(association)0.530
HACD1SEMG1psi-mi:“MI:0914”(association)0.530
NUPR1SEMG1psi-mi:“MI:0914”(association)0.530
MSS51SEMG1psi-mi:“MI:0914”(association)0.530
LINC02908SEMG1psi-mi:“MI:0914”(association)0.530
LSM14BSEMG1psi-mi:“MI:0914”(association)0.530
SEMG2VSIG8psi-mi:“MI:0914”(association)0.530
DPEP1ILVBLpsi-mi:“MI:0914”(association)0.530
PIK3R2BCR/ABL fusionpsi-mi:“MI:0914”(association)0.460
ESR2FBLL1psi-mi:“MI:0914”(association)0.460
SEMG2CDK10psi-mi:“MI:0915”(physical association)0.400
NFKB1NFKB1psi-mi:“MI:0914”(association)0.350
POLLSULT1C2psi-mi:“MI:0914”(association)0.350
BSPRYDEAF1psi-mi:“MI:0914”(association)0.350
SLC22A13SEMG1psi-mi:“MI:0914”(association)0.350

BioGRID (96): SEMG2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS), SEMG2 (Proximity Label-MS), SEMG2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS), SMURF2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS), SEMG2 (Affinity Capture-MS)

ESM2 similar proteins: A0A1B0GUY1, A6NJ88, A6QL64, B3KS81, E9Q6E9, O43493, O48582, O77733, P04279, P0C7A4, P0C7A5, P0CV57, P0DKJ7, P10322, P16225, P48997, P48998, Q02383, Q06990, Q08AG5, Q0ZNK1, Q5JPF3, Q5JRC9, Q5SRN2, Q5U7M7, Q5U7M8, Q5U7M9, Q5U7N0, Q5U7N1, Q5U7N3, Q5U7N4, Q5XHX6, Q659K0, Q6AYN3, Q6JHY2, Q6P902, Q6SJ82, Q6X2M3, Q6XPR3, Q80Y39

Diamond homologs: O77733, P04279, P0C7A4, P0C7A5, Q02383, Q5U7M7, Q5U7M8, Q5U7M9, Q5U7N0, Q5U7N1, Q5U7N3, Q5U7N4, Q6X2M3, Q95196, P22006, F2Z472

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

93 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance89
Likely benign4
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

180 predictions. Top by Δscore:

VariantEffectΔscore
20:45221463:AAGG:Adonor_loss1.0000
20:45221466:GTGA:Gdonor_loss1.0000
20:45221467:T:Gdonor_loss1.0000
20:45221466:G:GGdonor_gain0.9900
20:45224285:CCCTA:Cacceptor_loss0.9900
20:45224286:CCTA:Cacceptor_loss0.9900
20:45224287:CTAG:Cacceptor_loss0.9900
20:45224288:TAGG:Tacceptor_loss0.9900
20:45224289:A:Gacceptor_loss0.9900
20:45224290:GGT:Gacceptor_gain0.9800
20:45221464:AG:Adonor_gain0.9600
20:45221465:GG:Gdonor_gain0.9600
20:45224289:A:AGacceptor_gain0.9600
20:45224290:G:GGacceptor_gain0.9600
20:45221463:AAG:Adonor_gain0.9300
20:45221467:T:Adonor_gain0.9300
20:45221468:G:GGdonor_loss0.9300
20:45221462:AAAGG:Adonor_gain0.9200
20:45221465:GGT:Gdonor_gain0.9200
20:45221466:GTG:Gdonor_gain0.9200
20:45221463:AAGGT:Adonor_gain0.9100
20:45221464:AGGTG:Adonor_gain0.9100
20:45221469:AGTGG:Adonor_gain0.9100
20:45221462:AAAG:Adonor_gain0.9000
20:45221468:GAGTG:Gdonor_gain0.8900
20:45221470:G:Cdonor_gain0.8900
20:45221461:AAAAG:Adonor_gain0.8700
20:45224289:AG:Aacceptor_gain0.8600
20:45224290:GG:Gacceptor_gain0.8600
20:45224290:GGTGT:Gacceptor_gain0.8400

AlphaMissense

3896 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
20:45221436:A:TE16V0.943
20:45221421:T:CL11P0.917
20:45222392:T:CF254L0.911
20:45222394:T:AF254L0.911
20:45222394:T:GF254L0.911
20:45221444:G:CA19P0.907
20:45221412:T:AV8D0.903
20:45221437:G:CE16D0.898
20:45221437:G:TE16D0.898
20:45221747:T:CF39L0.890
20:45221749:T:AF39L0.890
20:45221749:T:GF39L0.890
20:45221999:T:CF123L0.887
20:45222001:T:AF123L0.887
20:45222001:T:GF123L0.887
20:45221427:T:CL13P0.876
20:45221447:G:CA20P0.874
20:45221408:T:CF7L0.869
20:45221410:T:AF7L0.869
20:45221410:T:GF7L0.869
20:45221440:G:CK17N0.865
20:45221440:G:TK17N0.865
20:45221436:A:CE16A0.860
20:45221443:A:CQ18H0.849
20:45221443:A:TQ18H0.849
20:45221415:T:GL9R0.837
20:45221421:T:GL11R0.835
20:45221424:T:CL12P0.828
20:45221424:T:GL12R0.826
20:45221819:T:CF63L0.823

dbSNP variants (sampled 300 via entrez): RS1001057447 (20:45224299 C>T), RS1001330971 (20:45224394 T>A,C), RS1001966331 (20:45220979 A>C), RS1002034264 (20:45219707 T>A,C), RS1003112003 (20:45221234 T>C), RS1003298655 (20:45219898 G>C), RS1003395339 (20:45220126 T>A), RS1005400261 (20:45220159 C>A,T), RS1005763576 (20:45220589 T>A,C), RS1005894935 (20:45224800 T>G), RS1006384084 (20:45224727 C>T), RS1007190932 (20:45219423 G>A), RS1007271347 (20:45220896 C>T), RS1009210050 (20:45220618 C>A,T), RS1009262500 (20:45220938 A>G)

Disease associations

OMIM: gene MIM:182141 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

3 associations (top):

StudyTraitp-value
GCST005212_22Asthma3.000000e-06
GCST008103_143Bipolar disorder3.000000e-06
GCST010725_40Malaria1.000000e-06

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

4 total (human), top 4 by PubMed support.

ChemicalActions (top 5)PubMed papers
Zincdecreases reaction, affects binding, affects reaction, decreases activity2
Benzo(a)pyreneaffects methylation, increases methylation1
Tetrachlorodibenzodioxinincreases expression1
Dextran Sulfateaffects binding, affects reaction1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): asthma, bipolar disorder, malaria