LYG2

gene
On this page

Also known as LYGHLYGA2

Summary

LYG2 (lysozyme g2, HGNC:29615) is a protein-coding gene on chromosome 2q11.2, encoding Lysozyme g-like protein 2 (Q86SG7). May act as a potent antibacterial protein that may play a role in the innate immunity.

The protein encoded by this gene contains a SLT domain, a protein domain present in bacterial lytic transglycosylase (SLT) and in eukaryotic lysozymes (GEWL). SLT domain catalyzes the cleavage of the beta-1,4-glycosidic bond between N-acetylmuramic acid (MurNAc) and N-acetyglucosamine (GlcNAc).

Source: NCBI Gene 254773 — RefSeq curated summary.

At a glance

  • GWAS associations: 3
  • Clinical variants (ClinVar): 38 total
  • MANE Select transcript: NM_175735

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:29615
Approved symbolLYG2
Namelysozyme g2
Location2q11.2
Locus typegene with protein product
StatusApproved
AliasesLYGH, LYGA2
Ensembl geneENSG00000185674
Ensembl biotypeprotein_coding
OMIM616547
Entrez254773

Gene structure

Transcript identifiers

Ensembl transcripts: 5 — 5 protein_coding

ENST00000333017, ENST00000409679, ENST00000423306, ENST00000423800, ENST00000924907

RefSeq mRNA: 1 — MANE Select: NM_175735 NM_175735

CCDS: CCDS2042

Canonical transcript exons

ENST00000333017 — 7 exons

ExonStartEnd
ENSE000012936619924526299245458
ENSE000013056709925421899254285
ENSE000013110689924668099246820
ENSE000013119059924399999244137
ENSE000015785659924224699242482
ENSE000017149659925502099255137
ENSE000039286919925560399255666

Expression profiles

Bgee: expression breadth ubiquitous, 152 present calls, max score 89.52.

Top tissues by expression

244 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099189.52gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047378.09gold quality
buccal mucosa cellCL:000233678.00silver quality
tendon of biceps brachiiUBERON:000818863.56gold quality
cerebellar cortexUBERON:000212959.89gold quality
cerebellar hemisphereUBERON:000224559.78gold quality
cerebellumUBERON:000203758.64gold quality
right hemisphere of cerebellumUBERON:001489058.15gold quality
stromal cell of endometriumCL:000225556.71gold quality
tendonUBERON:000004355.42silver quality
monocyteCL:000057654.59gold quality
adenohypophysisUBERON:000219653.91gold quality
skin of abdomenUBERON:000141653.34gold quality
leukocyteCL:000073853.18gold quality
corpus callosumUBERON:000233652.90gold quality
lower lobe of lungUBERON:000894952.63silver quality
nucleus accumbensUBERON:000188251.91gold quality
pituitary glandUBERON:000000751.61gold quality
calcaneal tendonUBERON:000370151.01silver quality
medial globus pallidusUBERON:000247750.71gold quality
zone of skinUBERON:000001450.63gold quality
trabecular bone tissueUBERON:000248350.41gold quality
granulocyteCL:000009450.11silver quality
apex of heartUBERON:000209849.96gold quality
cortex of kidneyUBERON:000122549.87gold quality
right lobe of thyroid glandUBERON:000111949.86gold quality
ganglionic eminenceUBERON:000402349.84gold quality
left testisUBERON:000453349.06gold quality
right testisUBERON:000453448.93gold quality
skin of legUBERON:000151148.91gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes3.85

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

3 targeting LYG2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-433-3P99.9869.371203
HSA-MIR-684499.8270.692423
HSA-MIR-430398.0168.132304

Literature-anchored findings (GeneRIF, showing 2)

  • HLysG2 is a potent antibacterial protein that may play a role in the innate immunity of the human eye. (PMID:21093056)
  • HLysG2 was identified and purified from human seminal plasma and showed a significant killing effect on Micrococcus lysodeikticus, Bacillus subtilis and Staphylococcus aureus. (PMID:28274309)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriolygl1ENSDARG00000056874
danio_reriozgc:162941ENSDARG00000071701
danio_reriolygl2ENSDARG00000099562
mus_musculusLyg2ENSMUSG00000061584
rattus_norvegicusLyg2ENSRNOG00000050591

Paralogs (1): LYG1 (ENSG00000144214)

Protein

Protein identifiers

Lysozyme g-like protein 2Q86SG7 (reviewed: Q86SG7)

All UniProt accessions (3): C9J4J0, C9JBA4, Q86SG7

UniProt curated annotations — full annotation on UniProt →

Function. May act as a potent antibacterial protein that may play a role in the innate immunity.

Subcellular location. Secreted.

Tissue specificity. Strong expression detected in the eye and weak expression in the testis. No expression is observed in any other tissues.

Similarity. Belongs to the glycosyl hydrolase 23 family.

Isoforms (2)

UniProt IDNamesCanonical?
Q86SG7-11yes
Q86SG7-22

RefSeq proteins (1): NP_783862* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR002152Glyco_hydro_23Family
IPR023346Lysozyme-like_dom_sfHomologous_superfamily

Enzyme classification (BRENDA):

  • EC 3.2.1.17 — lysozyme (BRENDA: 110 organisms, 187 substrates, 100 inhibitors, 8 Km, 2 kcat entries)

Substrate kinetics (BRENDA)

3 substrates with measured Km, best-characterized 3. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
4-METHYLUMBELLIFERYL-BETA-D-N,N’,N’’-TRIACETYLCH0.0008–0.0012
P-NITROPHENYL-N,N’,N’’,N’’’,N’’’’-PENTAACETYLCHI8.331
P-NITROPHENYL-GLCNACBETA(1-4)GLCNACBETA(1-4)GLCN0

UniProt features (10 total): sequence conflict 4, disulfide bond 2, signal peptide 1, chain 1, active site 1, splice variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q86SG7-F188.110.79

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 105

Disulfide bonds (2): 39–92, 53–61

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 36 (showing top): GOBP_MACROMOLECULE_CATABOLIC_PROCESS, GOBP_CARBOHYDRATE_DERIVATIVE_CATABOLIC_PROCESS, AREB6_01, TGACCTY_ERR1_Q2, LHX3_01, GOBP_CARBOHYDRATE_DERIVATIVE_METABOLIC_PROCESS, GOBP_AMINOGLYCAN_METABOLIC_PROCESS, GOBP_DEFENSE_RESPONSE_TO_GRAM_POSITIVE_BACTERIUM, GOBP_DEFENSE_RESPONSE_TO_BACTERIUM, GOBP_AMINOGLYCAN_CATABOLIC_PROCESS, TGGAAA_NFAT_Q4_01, GOMF_LYSOZYME_ACTIVITY, GOMF_HYDROLASE_ACTIVITY_ACTING_ON_GLYCOSYL_BONDS, GOBP_RESPONSE_TO_BACTERIUM, chr2q11

GO Biological Process (3): peptidoglycan catabolic process (GO:0009253), defense response to bacterium (GO:0042742), defense response to Gram-positive bacterium (GO:0050830)

GO Molecular Function (4): lysozyme activity (GO:0003796), protein binding (GO:0005515), hydrolase activity (GO:0016787), hydrolase activity, acting on glycosyl bonds (GO:0016798)

GO Cellular Component (1): extracellular region (GO:0005576)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
peptidoglycan metabolic process1
glycosaminoglycan catabolic process1
defense response1
response to bacterium1
defense response to bacterium1
hydrolase activity, hydrolyzing O-glycosyl compounds1
peptidoglycan muralytic activity1
binding1
catalytic activity1
hydrolase activity1
cellular anatomical structure1

Protein interactions and networks

STRING

724 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
LYG2BPIFB2Q8N4F0660
LYG2ZNF696Q9H7X3516
LYG2SERPINB10P48595511
LYG2CLEC2AQ6UVW9490
LYG2IFIT5Q13325485
LYG2KRABD3A5PL33472
LYG2KCNJ12Q14500422
LYG2LYZL2Q7Z4W2411
LYG2TMPRSS5Q9H3S3394
LYG2ANKRD23Q86SG2381
LYG2TTC16Q8NEE8377
LYG2ADAMTSL2Q86TH1372
LYG2KCNJ18B7U540349
LYG2ACOD1A6NK06345
LYG2GATD3P0DPI2342

IntAct

71 interactions, top by confidence:

ABTypeScore
OAZ3AZIN1psi-mi:“MI:0914”(association)0.800
IFT43TULP3psi-mi:“MI:0914”(association)0.790
KIF3AKIF3Cpsi-mi:“MI:0914”(association)0.730
ANXA9PPLpsi-mi:“MI:0914”(association)0.660
RAB11BSH3BP5psi-mi:“MI:0914”(association)0.640
CAPZA2CNOT1psi-mi:“MI:0914”(association)0.640
LYG2TEPSINpsi-mi:“MI:0915”(physical association)0.560
RPN2SMPD2psi-mi:“MI:0914”(association)0.530
CPLX3CIAO1psi-mi:“MI:0914”(association)0.530
LACC1DUSP14psi-mi:“MI:0914”(association)0.530
NSMAFDUSP14psi-mi:“MI:0914”(association)0.530
MRPL38DUSP14psi-mi:“MI:0914”(association)0.530
FBXL4DUSP14psi-mi:“MI:0914”(association)0.530
LYG2COL4A2psi-mi:“MI:0914”(association)0.530
SEMG2VSIG8psi-mi:“MI:0914”(association)0.530
DOLPP1VSIG8psi-mi:“MI:0914”(association)0.530
APIPVSIG8psi-mi:“MI:0914”(association)0.530
LYG2TRAF2psi-mi:“MI:0914”(association)0.530
SIDT2AP3D1psi-mi:“MI:0914”(association)0.530
LYG2H2BC21psi-mi:“MI:0915”(physical association)0.400

BioGRID (87): LYG2 (Affinity Capture-MS), LYG2 (Affinity Capture-MS), LYG2 (Affinity Capture-MS), LYG2 (Affinity Capture-MS), LYG2 (Affinity Capture-MS), LYG2 (Affinity Capture-MS), NID2 (Affinity Capture-MS), SPATA20 (Affinity Capture-MS), LYG2 (Affinity Capture-MS), LYG2 (Affinity Capture-MS), LYG2 (Affinity Capture-MS), VPS33B (Affinity Capture-MS), VIPAS39 (Affinity Capture-MS), LYG2 (Affinity Capture-MS), LYG2 (Affinity Capture-MS)

ESM2 similar proteins: B3A003, C0HLB7, C1IIX1, D9J142, D9J143, O02119, O04138, O22841, O22842, O24603, O76357, O76358, P05105, P16579, P19171, P21226, P27054, P29022, P29023, P29059, P29615, P36907, P37160, P37161, P42820, P48816, P50717, P50718, P52403, P52404, P52405, P80052, P83673, P86383, P86473, Q05538, Q1XG90, Q39799, Q41596, Q54F36

Diamond homologs: B3A0P5, G3XDD8, P00717, P00718, P00719, P27042, P61133, Q3V1I0, Q7LZR3, Q86SG7, Q8JFR1, Q90VZ3, Q90X99, Q9D7Q0, Q8N1E2, O31976, O64046

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

38 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance35
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

478 predictions. Top by Δscore:

VariantEffectΔscore
2:99242479:CCAC:Cacceptor_gain1.0000
2:99242480:CACC:Cacceptor_gain1.0000
2:99244134:CAAG:Cacceptor_gain1.0000
2:99245329:T:Adonor_gain1.0000
2:99246678:A:ACdonor_gain1.0000
2:99246679:C:CCdonor_gain1.0000
2:99246679:CCG:Cdonor_gain1.0000
2:99242480:CAC:Cacceptor_gain0.9900
2:99242483:C:CCacceptor_gain0.9900
2:99242484:T:Aacceptor_loss0.9900
2:99244133:TCAAG:Tacceptor_gain0.9900
2:99244134:CAAGC:Cacceptor_gain0.9900
2:99244138:C:CCacceptor_gain0.9900
2:99245330:C:Adonor_gain0.9900
2:99246673:CACT:Cdonor_loss0.9900
2:99246674:ACTTA:Adonor_loss0.9900
2:99246675:CTT:Cdonor_loss0.9900
2:99246676:TTAC:Tdonor_loss0.9900
2:99246677:T:TGdonor_loss0.9900
2:99246679:CCGCA:Cdonor_gain0.9900
2:99246821:C:CAacceptor_loss0.9900
2:99246821:C:CCacceptor_gain0.9900
2:99242478:ACCAC:Aacceptor_gain0.9800
2:99242479:CCACC:Cacceptor_gain0.9800
2:99242481:AC:Aacceptor_gain0.9800
2:99242482:CC:Cacceptor_gain0.9800
2:99242488:A:ACacceptor_gain0.9800
2:99244135:AAGC:Aacceptor_loss0.9800
2:99244136:AGCT:Aacceptor_loss0.9800
2:99244137:GCTA:Gacceptor_loss0.9800

AlphaMissense

1404 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:99242393:C:GA204P0.987
2:99245325:G:CS106R0.987
2:99245325:G:TS106R0.987
2:99245327:T:GS106R0.987
2:99245274:A:CF123L0.976
2:99245274:A:TF123L0.976
2:99245276:A:GF123L0.976
2:99242395:C:GR203P0.974
2:99242385:G:CF206L0.972
2:99242385:G:TF206L0.972
2:99242387:A:GF206L0.972
2:99245345:C:GA100P0.969
2:99242392:G:TA204D0.968
2:99245434:G:TA70D0.968
2:99245347:G:TA99E0.965
2:99245269:A:GL125S0.964
2:99244030:G:CF163L0.963
2:99244030:G:TF163L0.963
2:99244032:A:GF163L0.963
2:99244085:A:GL145P0.961
2:99245381:C:AG88W0.961
2:99242479:C:TG175D0.960
2:99245336:A:GS103P0.960
2:99245344:G:TA100D0.959
2:99242384:A:CY207D0.956
2:99245272:C:TG124D0.956
2:99244004:A:GL172P0.955
2:99245335:G:AS103F0.955
2:99242367:G:CF212L0.953
2:99242367:G:TF212L0.953

dbSNP variants (sampled 300 via entrez): RS1000115499 (2:99262063 A>G), RS1000371968 (2:99259727 A>G), RS1000424411 (2:99260216 G>A), RS1000715066 (2:99261458 C>G,T), RS1000767528 (2:99261836 G>T), RS1000983730 (2:99248216 G>C,T), RS1001036321 (2:99248639 C>G,T), RS1001131366 (2:99247723 T>G), RS1001184117 (2:99247437 GTTTT>G,GT,GTT,GTTT,GTTTTT,GTTTTTT,GTTTTTTT), RS1001212789 (2:99249353 G>A), RS1001243727 (2:99248943 A>G), RS1001440499 (2:99255164 A>G), RS1001609280 (2:99262891 C>A,T), RS1001639876 (2:99255446 C>A), RS1002168000 (2:99261036 G>T)

Disease associations

OMIM: gene MIM:616547 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

3 associations (top):

StudyTraitp-value
GCST001241_13Bipolar disorder3.000000e-06
GCST90002385_137High light scatter reticulocyte count1.000000e-09
GCST90002386_253High light scatter reticulocyte percentage of red cells7.000000e-10

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0007986reticulocyte count

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

3 total (human), top 3 by PubMed support.

ChemicalActions (top 5)PubMed papers
aristolochic acid Iincreases expression1
Tobacco Smoke Pollutiondecreases expression1
Aflatoxin B1increases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.