GIN1

gene
On this page

Also known as FLJ20125GIN-1TGIN1

Summary

GIN1 (gypsy retrotransposon integrase 1, HGNC:25959) is a protein-coding gene on chromosome 5q21.1, encoding Gypsy retrotransposon integrase-like protein 1 (Q9NXP7).

Predicted to enable nucleic acid binding activity. Predicted to be involved in DNA integration.

Source: NCBI Gene 54826 — RefSeq curated summary.

At a glance

  • GWAS associations: 8
  • Clinical variants (ClinVar): 80 total
  • MANE Select transcript: NM_017676

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:25959
Approved symbolGIN1
Namegypsy retrotransposon integrase 1
Location5q21.1
Locus typegene with protein product
StatusApproved
AliasesFLJ20125, GIN-1, TGIN1
Ensembl geneENSG00000145723
Ensembl biotypeprotein_coding
Entrez54826

Gene structure

Transcript identifiers

Ensembl transcripts: 15 — 10 protein_coding, 2 protein_coding_CDS_not_defined, 2 nonsense_mediated_decay, 1 retained_intron

ENST00000399004, ENST00000507478, ENST00000508629, ENST00000511400, ENST00000512248, ENST00000513603, ENST00000513747, ENST00000910330, ENST00000910331, ENST00000910332, ENST00000910333, ENST00000910334, ENST00000926085, ENST00000926086, ENST00000926087

RefSeq mRNA: 2 — MANE Select: NM_017676 NM_001317954, NM_017676

CCDS: CCDS43349

Canonical transcript exons

ENST00000399004 — 8 exons

ExonStartEnd
ENSE00000971862103106716103106909
ENSE00001174759103120064103120138
ENSE00002028587103086000103088172
ENSE00003461986103096541103096826
ENSE00003462342103097590103097781
ENSE00003479475103104541103104846
ENSE00003591106103097314103097490
ENSE00003621099103108569103108714

Expression profiles

Bgee: expression breadth ubiquitous, 254 present calls, max score 89.00.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 5.6778 / max 200.9506, expressed in 1579 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
627895.67781579

Top tissues by expression

274 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099189.00gold quality
biceps brachiiUBERON:000150786.58gold quality
diaphragmUBERON:000110386.37silver quality
skeletal muscle tissue of biceps brachiiUBERON:000450285.98gold quality
buccal mucosa cellCL:000233685.73gold quality
heart right ventricleUBERON:000208085.57gold quality
myocardiumUBERON:000234982.25gold quality
deltoidUBERON:000147682.13gold quality
calcaneal tendonUBERON:000370182.09gold quality
secondary oocyteCL:000065581.60gold quality
vastus lateralisUBERON:000137981.30gold quality
hindlimb stylopod muscleUBERON:000425281.30gold quality
ventricular zoneUBERON:000305381.29gold quality
esophagus squamous epitheliumUBERON:000692081.19gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047381.16gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451181.13gold quality
quadriceps femorisUBERON:000137780.99gold quality
adrenal tissueUBERON:001830380.90gold quality
germinal epithelium of ovaryUBERON:000130480.54gold quality
skeletal muscle tissueUBERON:000113480.33gold quality
muscle organUBERON:000163080.23gold quality
visceral pleuraUBERON:000240180.14gold quality
epithelium of nasopharynxUBERON:000195180.12gold quality
muscle tissueUBERON:000238579.87gold quality
muscle of legUBERON:000138379.86gold quality
heart left ventricleUBERON:000208479.82gold quality
parietal pleuraUBERON:000240079.76gold quality
tibiaUBERON:000097979.74gold quality
cardiac ventricleUBERON:000208279.73gold quality
hair follicleUBERON:000207379.64silver quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no4.44
E-MTAB-6678no3.37

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

78 targeting GIN1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-200B-3P100.0073.312693
HSA-MIR-200C-3P100.0073.352685
HSA-MIR-429100.0073.442698
HSA-MIR-477599.9875.006394
HSA-MIR-485-3P99.9870.681585
HSA-MIR-539-3P99.9870.741616
HSA-MIR-6778-3P99.9667.292693
HSA-MIR-495-3P99.9672.814197
HSA-MIR-568899.9673.234504
HSA-MIR-590-3P99.9674.346478
HSA-MIR-365899.9673.874379
HSA-MIR-335-3P99.9373.364958
HSA-MIR-4778-3P99.9370.401818
HSA-MIR-205-3P99.9269.923165
HSA-MIR-652-5P99.9167.49505
HSA-MIR-4753-3P99.9071.033786
HSA-MIR-1211999.8768.351653
HSA-MIR-449299.8768.253611
HSA-MIR-5582-3P99.8672.484221
HSA-MIR-5003-3P99.8569.292517
HSA-MIR-520F-3P99.8271.321216
HSA-MIR-57799.7869.132479
HSA-MIR-202-5P99.7867.65991
HSA-MIR-370-5P99.7866.81706
HSA-MIR-494-3P99.7071.452795
HSA-MIR-128499.6773.561353
HSA-MIR-10393-5P99.6568.011368
HSA-MIR-452-5P99.6569.631762
HSA-MIR-4676-3P99.6569.311733

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusGin1ENSMUSG00000026333
rattus_norvegicusGin1ENSRNOG00000011962

Protein

Protein identifiers

Gypsy retrotransposon integrase-like protein 1Q9NXP7 (reviewed: Q9NXP7)

Alternative names: Ty3/Gypsy integrase 1, Zinc finger H2C2 domain-containing protein

All UniProt accessions (1): Q9NXP7

UniProt curated annotations — full annotation on UniProt →

Tissue specificity. Widely expressed. Also found in tumors originating from parathyroid gland, colon, stomach, bladder, uterus and prostate.

Miscellaneous. May be produced at very low levels due to a premature stop codon in the mRNA, leading to nonsense-mediated mRNA decay.

Isoforms (3)

UniProt IDNamesCanonical?
Q9NXP7-11yes
Q9NXP7-22
Q9NXP7-33

RefSeq proteins (2): NP_001304883, NP_060146* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001584Integrase_cat-coreDomain
IPR012337RNaseH-like_sfHomologous_superfamily
IPR036397RNaseH_sfHomologous_superfamily
IPR041588Integrase_H2C2Domain
IPR050951

Pfam: PF00665, PF17921

UniProt features (11 total): splice variant 4, sequence conflict 2, sequence variant 2, chain 1, domain 1, modified residue 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9NXP7-F170.950.30

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (1): 502

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 104 (showing top): RODRIGUES_THYROID_CARCINOMA_POORLY_DIFFERENTIATED_UP, IVANOVA_HEMATOPOIESIS_LATE_PROGENITOR, GOBP_DNA_INTEGRATION, MODULE_205, GGAANCGGAANY_UNKNOWN, SCHAEFFER_PROSTATE_DEVELOPMENT_6HR_UP, chr5q21, GOBP_DNA_METABOLIC_PROCESS, SIRNA_EIF4GI_UP, SHEN_SMARCA2_TARGETS_UP, GSE5503_MLN_DC_VS_SPLEEN_DC_ACTIVATED_ALLOGENIC_TCELL_DN, GSE5503_PLN_DC_VS_SPLEEN_DC_ACTIVATED_ALLOGENIC_TCELL_DN, ALKBH3_TARGET_GENES, BARX1_TARGET_GENES, DIDO1_TARGET_GENES

GO Biological Process (1): DNA integration (GO:0015074)

GO Molecular Function (1): nucleic acid binding (GO:0003676)

GO Cellular Component (0):

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
DNA metabolic process1
binding1

Protein interactions and networks

STRING

740 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
GIN1KRABD2Q6ZNG9664
GIN1PGBD5Q8N414659
GIN1HARBI1Q96MB7592
GIN1FAM110BQ8TC76549
GIN1MACIRQ96GV9542
GIN1THAP9Q9H5L6531
GIN1BUD31P41223521
GIN1DRC8Q5VUJ9507
GIN1DHX8Q14562501
GIN1UBXN8O00124497
GIN1CGGBP1Q9UFW8492
GIN1SUGP2Q8IX01492
GIN1AKR1B15C9JRZ8481
GIN1PGBD1Q96JS3477
GIN1ZBED8Q8IZ13477

IntAct

4 interactions, top by confidence:

ABTypeScore
GIN1GSTA2psi-mi:“MI:0915”(physical association)0.400
CPSF6GIN1psi-mi:“MI:0915”(physical association)0.370
GIN1psi-mi:“MI:0915”(physical association)0.370

BioGRID (15): GIN1 (Two-hybrid), GIN1 (Affinity Capture-MS), GIN1 (Two-hybrid), GIN1 (Affinity Capture-MS), GIN1 (Affinity Capture-RNA), GSTA2 (Affinity Capture-MS), GIN1 (Affinity Capture-MS), RPS14 (Cross-Linking-MS (XL-MS)), RPS14 (Cross-Linking-MS (XL-MS)), RPS28 (Cross-Linking-MS (XL-MS)), RPS19 (Cross-Linking-MS (XL-MS)), GIN1 (Cross-Linking-MS (XL-MS)), GIN1 (Cross-Linking-MS (XL-MS)), GIN1 (Cross-Linking-MS (XL-MS)), GIN1 (Affinity Capture-MS)

ESM2 similar proteins: A0A0B7P3V8, A4FUB7, A4H7G5, A4HVU6, A6NKG5, B4NSS9, B6EU02, F4I1N8, H2KZW3, O10318, O13329, P04586, P0C2J7, P18475, P34601, P47024, Q01577, Q05101, Q09178, Q09293, Q1L6Q1, Q3S4A7, Q4DCH3, Q4QFY1, Q4R6I1, Q52QI2, Q57X81, Q5RBK0, Q5RDX4, Q62120, Q66H30, Q6R2W3, Q7KLI1, Q7LHG5, Q8CDM1, Q8K259, Q8N157, Q90179, Q91829, Q99315

Diamond homologs: A4FUB7, Q4R6I1, Q5RBK0, Q66H30, Q8K259, Q9NXP7

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

80 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance65
Likely benign6
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

1408 predictions. Top by Δscore:

VariantEffectΔscore
5:103088169:CTTT:Cacceptor_gain1.0000
5:103088170:TTT:Tacceptor_gain1.0000
5:103088173:C:CCacceptor_gain1.0000
5:103097588:A:ACdonor_gain1.0000
5:103097588:A:Cdonor_loss1.0000
5:103097589:C:CCdonor_gain1.0000
5:103106715:CCCA:Cdonor_gain1.0000
5:103106718:A:ACdonor_gain1.0000
5:103106719:C:CCdonor_gain1.0000
5:103108563:TTTTA:Tdonor_loss1.0000
5:103108564:TTTA:Tdonor_loss1.0000
5:103108565:TTACC:Tdonor_loss1.0000
5:103108566:TAC:Tdonor_loss1.0000
5:103108567:A:ATdonor_loss1.0000
5:103108568:CCTT:Cdonor_loss1.0000
5:103088153:T:Cacceptor_gain0.9900
5:103088170:TTTC:Tacceptor_loss0.9900
5:103088171:TT:Tacceptor_gain0.9900
5:103088172:TCTGA:Tacceptor_loss0.9900
5:103088173:C:CAacceptor_loss0.9900
5:103096833:G:GCacceptor_gain0.9900
5:103097453:C:CCacceptor_gain0.9900
5:103097589:CCAA:Cdonor_gain0.9900
5:103097777:TTGAT:Tacceptor_gain0.9900
5:103097778:TGAT:Tacceptor_gain0.9900
5:103097781:TCTGA:Tacceptor_loss0.9900
5:103097782:C:CCacceptor_gain0.9900
5:103097782:C:Tacceptor_loss0.9900
5:103097783:T:Aacceptor_loss0.9900
5:103106707:AAG:Adonor_gain0.9900

AlphaMissense

3479 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
5:103096573:A:GL421P0.999
5:103096657:A:TI393K0.999
5:103106726:A:TV108D0.999
5:103106748:A:GW101R0.999
5:103106748:A:TW101R0.999
5:103108579:A:CF43L0.999
5:103108579:A:TF43L0.999
5:103108581:A:GF43L0.999
5:103108590:C:GA40P0.999
5:103108656:A:CY18D0.999
5:103096573:A:TL421H0.998
5:103096719:T:AR372S0.998
5:103096719:T:GR372S0.998
5:103096726:A:TV370D0.998
5:103106751:A:CY100D0.998
5:103106774:A:GL92P0.998
5:103108580:A:GF43S0.998
5:103108601:A:CI36R0.998
5:103108601:A:TI36K0.998
5:103096624:A:GL404P0.997
5:103096630:G:TA402D0.997
5:103096657:A:CI393R0.997
5:103104837:A:GC115R0.997
5:103106746:C:AW101C0.997
5:103106746:C:GW101C0.997
5:103108589:G:TA40E0.997
5:103108593:C:GA39P0.997
5:103108594:T:AR38S0.997
5:103108594:T:GR38S0.997
5:103108597:T:AR37S0.997

dbSNP variants (sampled 300 via entrez): RS1000354781 (5:103091242 T>C), RS1000366570 (5:103097180 T>C), RS1000574460 (5:103115767 A>C), RS1000707441 (5:103110206 A>G), RS1000861992 (5:103086067 A>G), RS1000918419 (5:103092263 T>C), RS1000983868 (5:103121515 C>G), RS1000999178 (5:103092632 C>G,T), RS1001014865 (5:103121862 A>G), RS1001240864 (5:103113330 T>C), RS1001566269 (5:103113027 G>A), RS1001627608 (5:103089785 C>A,T), RS1001649089 (5:103118712 T>C), RS1001680344 (5:103119089 T>C), RS1001994887 (5:103089517 C>T)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

8 associations (top):

StudyTraitp-value
GCST003129_2Primary biliary cholangitis1.000000e-08
GCST005568_19Rheumatoid arthritis (ACPA-positive)3.000000e-06
GCST005568_33Rheumatoid arthritis (ACPA-positive)9.000000e-08
GCST005569_41Rheumatoid arthritis9.000000e-08
GCST005569_8Rheumatoid arthritis7.000000e-10
GCST007559_21Sleep duration (short sleep)2.000000e-08
GCST010002_34Refractive error2.000000e-10
GCST90000025_2Appendicular lean mass1.000000e-12

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004980appendicular lean mass

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

24 total (human), top 24 by PubMed support.

ChemicalActions (top 5)PubMed papers
Air Pollutantsdecreases expression, increases abundance2
GSK-J4decreases expression1
methylmercuric chloridedecreases expression1
triphenyl phosphateaffects expression1
sodium arsenitedecreases expression1
potassium chromate(VI)affects cotreatment, decreases expression1
resorcinoldecreases expression1
epigallocatechin gallatedecreases expression, affects cotreatment1
di-n-butylphosphoric acidaffects expression1
abrineincreases expression1
jinfukangdecreases expression1
Vorinostatincreases expression1
Cisplatindecreases expression1
Formaldehydedecreases expression1
Smokedecreases expression, increases abundance1
Thimerosalincreases expression1
Tretinoindecreases expression1
Urethanedecreases expression1
Cyclosporineincreases expression1
Aflatoxin M1decreases expression1
Okadaic Aciddecreases expression1
Copper Sulfatedecreases expression1
Lactic Aciddecreases expression1
Particulate Matterincreases abundance, decreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): primary biliary cholangitis