LGSN

gene
On this page

Also known as LGS

Summary

LGSN (lengsin, lens protein with glutamine synthetase domain, HGNC:21016) is a protein-coding gene on chromosome 6q12, encoding Lengsin (Q5TDP6). May act as a component of the cytoskeleton or as a chaperone for the reorganization of intermediate filament proteins during terminal differentiation in the lens.

This gene encodes a protein with similarity to the GS I members of the glutamine synthetase superfamily. The encoded protein is referred to as a pseudo-glutamine synthetase because it has no glutamine synthesis activity and may function as a chaperone protein. This protein is localized to the lens and may be associated with cataract disease. Alternative splicing results in multiple transcript variants.

Source: NCBI Gene 51557 — RefSeq curated summary.

At a glance

  • GWAS associations: 5
  • Clinical variants (ClinVar): 101 total
  • MANE Select transcript: NM_016571

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:21016
Approved symbolLGSN
Namelengsin, lens protein with glutamine synthetase domain
Location6q12
Locus typegene with protein product
StatusApproved
AliasesLGS
Ensembl geneENSG00000146166
Ensembl biotypeprotein_coding
OMIM611470
Entrez51557

Gene structure

Transcript identifiers

Ensembl transcripts: 4 — 4 protein_coding

ENST00000370657, ENST00000370658, ENST00000485906, ENST00000622415

RefSeq mRNA: 2 — MANE Select: NM_016571 NM_001143940, NM_016571

CCDS: CCDS4964, CCDS55027

Canonical transcript exons

ENST00000370657 — 4 exons

ExonStartEnd
ENSE000021516546329491363295045
ENSE000021782796327595163281220
ENSE000021878616328558763285753
ENSE000038935506331991463319983

Expression profiles

Bgee: expression breadth broad, 67 present calls, max score 82.01.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0497 / max 89.7447, expressed in 1 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
742000.04971

Top tissues by expression

252 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
buccal mucosa cellCL:000233682.01gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047375.95gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099168.23gold quality
right lobe of liverUBERON:000111465.49gold quality
liverUBERON:000210762.46gold quality
placentaUBERON:000198759.82gold quality
epithelium of bronchusUBERON:000203152.07silver quality
bronchusUBERON:000218551.63silver quality
bronchial epithelial cellCL:000232851.59silver quality
adult mammalian kidneyUBERON:000008250.12gold quality
nephron tubuleUBERON:000123149.89silver quality
colonic epitheliumUBERON:000039749.68gold quality
kidneyUBERON:000211349.40gold quality
Brodmann (1909) area 46UBERON:000648349.30gold quality
blood vessel layerUBERON:000479749.29gold quality
cervix squamous epitheliumUBERON:000692249.20gold quality
hair follicleUBERON:000207349.18gold quality
olfactory bulbUBERON:000226448.92gold quality
choroid plexus epitheliumUBERON:000391148.89gold quality
myocardiumUBERON:000234948.87gold quality
type B pancreatic cellCL:000016948.83gold quality
quadriceps femorisUBERON:000137748.81gold quality
cardiac muscle of right atriumUBERON:000337948.55gold quality
CA1 field of hippocampusUBERON:000388148.50gold quality
vastus lateralisUBERON:000137948.25gold quality
left ventricle myocardiumUBERON:000656648.24gold quality
orbitofrontal cortexUBERON:000416748.20gold quality
upper arm skinUBERON:000426348.06gold quality
cervix epitheliumUBERON:000480148.04gold quality
tongue squamous epitheliumUBERON:000691947.92gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.90
E-GEOD-99795no16.64

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

135 targeting LGSN, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3646100.0073.565283
HSA-MIR-7110-3P100.0073.182486
HSA-MIR-8485100.0077.574731
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-656-3P100.0072.152788
HSA-MIR-200B-3P100.0073.312693
HSA-MIR-200C-3P100.0073.352685
HSA-MIR-429100.0073.442698
HSA-MIR-3924100.0072.092394
HSA-MIR-366299.9973.825684
HSA-MIR-186-5P99.9970.833707
HSA-MIR-10401-5P99.9965.79948
HSA-MIR-428299.9975.366408
HSA-MIR-4715-3P99.9866.03670
HSA-MIR-569699.9872.364487
HSA-MIR-806899.9873.852376
HSA-MIR-314899.9775.066478
HSA-MIR-568899.9673.234504
HSA-MIR-495-3P99.9672.814197
HSA-MIR-302E99.9670.742669
HSA-MIR-590-3P99.9674.346478
HSA-MIR-23A-3P99.9574.243163
HSA-MIR-23B-3P99.9574.243163
HSA-MIR-23C99.9573.923192
HSA-MIR-651-3P99.9473.485177
HSA-MIR-381-3P99.9371.872854
HSA-MIR-338-5P99.9272.342951
HSA-MIR-30099.9271.762856
HSA-MIR-806399.9169.763146
HSA-MIR-367199.9073.043897

Literature-anchored findings (GeneRIF, showing 5)

  • Lengsin is major novel protein of the vertebrate eye lens. (PMID:12107413)
  • A potential chaperone-like role of Lengsin was investigated. (PMID:17010935)
  • Lengsin belongs to the predominantly prokaryotic class 1 branch of glutamine synthetases but has lost enzyme activity (PMID:17161372)
  • Novel spliced form of a lens protein as a novel lung cancer antigen, Lengsin splicing variant 4, is reported. (PMID:19459848)
  • In major depressive disorder genome-wide association study, best male-specific SNP was rs9362774 within LGSN gene (PMID:21621269)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_reriolgsnENSDARG00000007715
mus_musculusLgsnENSMUSG00000050217
rattus_norvegicusLgsnENSRNOG00000012205
caenorhabditis_elegansWBGENE00001602
caenorhabditis_elegansWBGENE00001604
caenorhabditis_elegansWBGENE00001606
caenorhabditis_elegansWBGENE00001607

Paralogs (1): GLUL (ENSG00000135821)

Protein

Protein identifiers

LengsinQ5TDP6 (reviewed: Q5TDP6)

Alternative names: Glutamate-ammonia ligase domain-containing protein 1, Lens glutamine synthase-like

All UniProt accessions (2): Q5TDP6, A0A0A0MTD2

UniProt curated annotations — full annotation on UniProt →

Function. May act as a component of the cytoskeleton or as a chaperone for the reorganization of intermediate filament proteins during terminal differentiation in the lens. Does not seem to have enzymatic activity.

Subunit / interactions. Dodecamer. Interacts with BFSP2 and VIM.

Tissue specificity. Abundantly expressed in lens.

Similarity. Belongs to the glutamine synthetase family.

Isoforms (3)

UniProt IDNamesCanonical?
Q5TDP6-11, LGS_v2yes
Q5TDP6-22, LGS_v1
Q5TDP6-33, LGS_v3

RefSeq proteins (2): NP_001137412, NP_057655* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR008146Gln_synth_cat_domDomain
IPR008147Gln_synt_NDomain
IPR014746Gln_synth/guanido_kin_cat_domHomologous_superfamily
IPR036651Gln_synt_N_sfHomologous_superfamily

Pfam: PF00120

UniProt features (13 total): splice variant 4, sequence variant 3, domain 2, sequence conflict 2, chain 1, region of interest 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q5TDP6-F177.350.53

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 82 (showing top): GOBP_GLUTAMINE_FAMILY_AMINO_ACID_BIOSYNTHETIC_PROCESS, chr6q12, GOBP_ALPHA_AMINO_ACID_METABOLIC_PROCESS, GOBP_AMINO_ACID_BIOSYNTHETIC_PROCESS, GOBP_GLUTAMINE_FAMILY_AMINO_ACID_METABOLIC_PROCESS, CAGCTG_AP4_Q5, GOBP_ORGANIC_ACID_BIOSYNTHETIC_PROCESS, GOBP_GLUTAMINE_METABOLIC_PROCESS, GOBP_SMALL_MOLECULE_BIOSYNTHETIC_PROCESS, GATA6_01, TGACATY_UNKNOWN, GOBP_ORGANIC_ACID_METABOLIC_PROCESS, HELLER_HDAC_TARGETS_SILENCED_BY_METHYLATION_UP, GOMF_LIGASE_ACTIVITY_FORMING_CARBON_NITROGEN_BONDS, TGGAAA_NFAT_Q4_01

GO Biological Process (1): obsolete glutamine biosynthetic process (GO:0006542)

GO Molecular Function (2): catalytic activity (GO:0003824), glutamine synthetase activity (GO:0004356)

GO Cellular Component (3): cytoplasm (GO:0005737), plasma membrane (GO:0005886), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure2
molecular_function1
ammonia ligase activity1
intracellular anatomical structure1
membrane1
cell periphery1

Protein interactions and networks

STRING

2587 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
LGSNGLULP15104940
LGSNCRYBA4P53673586
LGSNBFSP2Q13515580
LGSNBFSP1Q12934570
LGSNKHDRBS2Q5VWX1557
LGSNGRIFINA4D1Z8532
LGSNNOXRED1Q6NXP6529
LGSNCRYZL1O95825519
LGSNSMAP1Q8IYB5507
LGSNCRYBB1P53674500
LGSNCRYGNQ8WXF5491
LGSNCRYBA2P53672490
LGSNLIM2P55344475
LGSNMIPP30301472
LGSNCRYGDP07320464

IntAct

3 interactions, top by confidence:

ABTypeScore
YBX1HNRNPRpsi-mi:“MI:0915”(physical association)0.770
NEK4E2F8psi-mi:“MI:0914”(association)0.350

BioGRID (3): LGSN (Co-fractionation), LGSN (Affinity Capture-MS), LGSN (Cross-Linking-MS (XL-MS))

ESM2 similar proteins: A0JM23, A2RSY6, A6H6X4, A6H8I2, A6NNW6, O60502, O93511, P0CI65, P41320, Q008S8, Q0GA40, Q14AW5, Q1ZZS1, Q2TBM9, Q3U213, Q3UH60, Q496Z9, Q4R3G4, Q4R6C7, Q5R5T0, Q5TDP6, Q692V3, Q6AZT7, Q6K3T2, Q6NVJ5, Q6P2S7, Q6P3V7, Q6P6R7, Q6UXZ4, Q7T2D0, Q7TT51, Q7Z2T5, Q8CIX8, Q8GYY7, Q8TEU7, Q8VIJ5, Q96524, Q96JX3, Q99MV7, Q9BXT8

Diamond homologs: A0R079, I3R176, O08467, O27612, O29313, O58097, O59648, O66514, O87393, P00964, P05457, P07804, P09826, P0A039, P0A040, P0A1P6, P0A1P7, P0A591, P0A9C5, P0A9C6, P0A9C7, P0A9C8, P0C7B6, P10583, P10656, P12425, P13499, P15106, P15124, P19064, P19904, P20805, P21154, P22248, P23794, P25821, P28605, P28786, P31592, P33035

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

101 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance86
Likely benign9
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

1075 predictions. Top by Δscore:

VariantEffectΔscore
6:63285585:A:ACdonor_gain1.0000
6:63285586:C:CCdonor_gain1.0000
6:63294929:AT:Adonor_gain1.0000
6:63294976:T:Adonor_gain1.0000
6:63294979:T:TAdonor_gain1.0000
6:63293852:CCTCT:Cacceptor_gain0.9900
6:63293853:CTCT:Cacceptor_gain0.9900
6:63293853:CTCTC:Cacceptor_gain0.9900
6:63293854:TCTCT:Tacceptor_gain0.9900
6:63293855:CT:Cacceptor_gain0.9900
6:63293857:C:CCacceptor_gain0.9900
6:63294869:G:Adonor_gain0.9900
6:63294945:T:TAdonor_gain0.9900
6:63295056:T:TCacceptor_gain0.9900
6:63295491:CCTTA:Cacceptor_gain0.9900
6:63295493:T:Cacceptor_gain0.9900
6:63295495:A:Cacceptor_gain0.9900
6:63295975:AGTAT:Adonor_gain0.9900
6:63319912:ACCT:Adonor_gain0.9900
6:63319913:CCTC:Cdonor_gain0.9900
6:63319915:T:TAdonor_gain0.9900
6:63285754:C:CCacceptor_gain0.9800
6:63293854:TCT:Tacceptor_gain0.9800
6:63293855:CTC:Cacceptor_gain0.9800
6:63293855:CTCT:Cacceptor_loss0.9800
6:63293856:TCT:Tacceptor_gain0.9800
6:63293857:CT:Cacceptor_loss0.9800
6:63293858:T:Aacceptor_loss0.9800
6:63294930:T:Cdonor_gain0.9800
6:63294980:C:Adonor_gain0.9800

AlphaMissense

3388 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:63285651:C:GR89P0.994
6:63280306:G:CN415K0.993
6:63280306:G:TN415K0.993
6:63280521:A:GW344R0.993
6:63280521:A:TW344R0.993
6:63281041:A:CC170W0.991
6:63285654:A:TV88E0.990
6:63281043:A:GC170R0.989
6:63280271:C:TG427D0.988
6:63280483:G:CS356R0.988
6:63280483:G:TS356R0.988
6:63280485:T:GS356R0.988
6:63285611:C:AK102N0.988
6:63285611:C:GK102N0.988
6:63285643:C:GA92P0.988
6:63285616:A:GS101P0.987
6:63280508:A:GL348P0.986
6:63281042:C:TC170Y0.986
6:63285648:A:GF90S0.986
6:63280997:C:AR185M0.985
6:63281099:G:CP151R0.983
6:63280274:G:TA426D0.982
6:63280278:C:GA425P0.982
6:63281048:A:TV168E0.982
6:63281099:G:TP151Q0.982
6:63280997:C:GR185T0.981
6:63280967:A:GL195P0.980
6:63281054:G:TA166E0.980
6:63285636:T:AD94V0.980
6:63280715:G:TA279D0.979

dbSNP variants (sampled 300 via entrez): RS1000013411 (6:63494193 G>A,T), RS1000023914 (6:63523524 T>C), RS1000028040 (6:63481101 T>A,C), RS1000041265 (6:63392072 G>C), RS1000044776 (6:63414448 T>A,C), RS1000048089 (6:63454534 G>A), RS1000069601 (6:63378208 G>A), RS1000070053 (6:63469385 A>G), RS1000072605 (6:63500096 C>A), RS1000073611 (6:63337097 G>A), RS1000100161 (6:63532841 A>C), RS1000121016 (6:63504462 G>C), RS1000121441 (6:63378515 T>C), RS1000130102 (6:63557357 G>A), RS1000141490 (6:63433567 C>T)

Disease associations

OMIM: gene MIM:611470 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

5 associations (top):

StudyTraitp-value
GCST001762_641Obesity-related traits5.000000e-06
GCST004735_25Epstein-Barr virus copy number in lymphoblastoid cell lines4.000000e-07
GCST009391_76Metabolite levels9.000000e-06
GCST90000050_54Age at first birth3.000000e-08
GCST90014325_1Asthma6.000000e-09

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0009101age at first birth measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

19 total (human), top 19 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyrenedecreases expression, decreases methylation2
2-methyl-4-isothiazolin-3-onedecreases expression1
sulforaphanedecreases expression1
tris(1,3-dichloro-2-propyl)phosphateincreases expression1
dinophysistoxin 1decreases expression1
CGP 52608increases reaction, affects binding1
abrineincreases expression1
licochalcone Bdecreases expression1
Cadmiumdecreases expression1
Hydrogen Peroxideaffects expression1
Quercetindecreases expression1
Silicon Dioxidedecreases expression1
Tobacco Smoke Pollutionincreases expression1
Aflatoxin B1decreases methylation1
Asbestos, Crocidolitedecreases expression1
Okadaic Aciddecreases expression1
Copper Sulfateincreases expression1
Vitamin K 3affects expression1
Particulate Matterdecreases expression1

Cellosaurus cell lines

1 cell lines: 1 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_D7TKUbigene A-549 LGSN KOCancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): Epstein-Barr virus infection