HARBI1

gene
On this page

Also known as FLJ32675

Summary

HARBI1 (harbinger transposase derived 1, HGNC:26522) is a protein-coding gene on chromosome 11p11.2, encoding Putative nuclease HARBI1 (Q96MB7). Transposase-derived protein that may have nuclease activity (Potential).

Predicted to enable metal ion binding activity and nuclease activity. Located in centriolar satellite; cytosol; and plasma membrane.

Source: NCBI Gene 283254 — RefSeq curated summary.

At a glance

  • GWAS associations: 8
  • Clinical variants (ClinVar): 51 total
  • MANE Select transcript: NM_173811

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26522
Approved symbolHARBI1
Nameharbinger transposase derived 1
Location11p11.2
Locus typegene with protein product
StatusApproved
AliasesFLJ32675
Ensembl geneENSG00000180423
Ensembl biotypeprotein_coding
OMIM615086
Entrez283254

Gene structure

Transcript identifiers

Ensembl transcripts: 12 — 12 protein_coding

ENST00000326737, ENST00000529192, ENST00000532281, ENST00000891261, ENST00000891262, ENST00000936938, ENST00000936939, ENST00000936940, ENST00000936941, ENST00000960128, ENST00000960129, ENST00000960130

RefSeq mRNA: 1 — MANE Select: NM_173811 NM_173811

CCDS: CCDS7920

Canonical transcript exons

ENST00000326737 — 3 exons

ExonStartEnd
ENSE000012303774661712446617227
ENSE000012303904661556846616381
ENSE000014156164660286146603909

Expression profiles

Bgee: expression breadth ubiquitous, 171 present calls, max score 84.40.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 4.1454 / max 139.9218, expressed in 1612 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
1195261.97611283
1195251.1443488
1195271.0249358

Top tissues by expression

245 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099184.40gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047384.24gold quality
spermCL:000001982.03silver quality
bone marrow cellCL:000209279.30gold quality
islet of LangerhansUBERON:000000677.96gold quality
left testisUBERON:000453377.74gold quality
right testisUBERON:000453477.74gold quality
stromal cell of endometriumCL:000225577.26gold quality
testisUBERON:000047376.94gold quality
cortical plateUBERON:000534375.69gold quality
monocyteCL:000057675.14gold quality
leukocyteCL:000073875.00gold quality
prefrontal cortexUBERON:000045173.58gold quality
smooth muscle tissueUBERON:000113572.21gold quality
ventricular zoneUBERON:000305372.07gold quality
ganglionic eminenceUBERON:000402371.80gold quality
right adrenal glandUBERON:000123371.76gold quality
right adrenal gland cortexUBERON:003582771.70gold quality
rectumUBERON:000105271.44gold quality
granulocyteCL:000009471.40gold quality
kidney epitheliumUBERON:000481970.73gold quality
gastrocnemiusUBERON:000138870.72gold quality
bone marrowUBERON:000237170.38gold quality
left adrenal glandUBERON:000123470.32gold quality
muscle of legUBERON:000138370.27gold quality
pancreasUBERON:000126469.92gold quality
lymph nodeUBERON:000002969.68gold quality
left adrenal gland cortexUBERON:003582569.48gold quality
mucosa of transverse colonUBERON:000499169.36gold quality
adrenal cortexUBERON:000123569.35gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes3.51

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

12 targeting HARBI1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3134100.0066.43777
HSA-MIR-8485100.0077.574731
HSA-MIR-223-3P99.9970.141140
HSA-MIR-453499.9966.581907
HSA-MIR-808299.9567.271170
HSA-MIR-10395-5P99.8667.35676
HSA-MIR-29B-2-5P99.6768.981726
HSA-MIR-570099.6469.882280
HSA-MIR-432899.5771.064094
HSA-MIR-892A99.5468.161141
HSA-MIR-520E-5P99.2768.901513
HSA-MIR-426496.3564.761480

Literature-anchored findings (GeneRIF, showing 1)

  • The functions of two transposon-derived human proteins: HARBI1, a domesticated transposase-derived protein, and NAIF1, which contains a trihelix motif similar to that described in the Myb-like protein, was investigated. (PMID:18339812)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_rerioharbi1ENSDARG00000036038
mus_musculusHarbi1ENSMUSG00000027243
rattus_norvegicusHarbi1ENSRNOG00000081861
drosophila_melanogasterCG43088FBGN0262534

Protein

Protein identifiers

Putative nuclease HARBI1Q96MB7 (reviewed: Q96MB7)

Alternative names: Harbinger transposase-derived nuclease

All UniProt accessions (3): Q96MB7, E9PK24, E9PQI1

UniProt curated annotations — full annotation on UniProt →

Function. Transposase-derived protein that may have nuclease activity (Potential). Does not have transposase activity.

Subunit / interactions. Interacts with NAIF1.

Subcellular location. Nucleus. Cytoplasm.

Tissue specificity. Detected in brain, eye, nerve tissue, kidney and lung.

Similarity. Belongs to the HARBI1 family.

RefSeq proteins (1): NP_776172* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR026103HARBI1_animalFamily
IPR027806HARBI1_domDomain
IPR045249HARBI1-likeFamily

Pfam: PF13359

UniProt features (6 total): binding site 4, chain 1, domain 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q96MB7-F184.960.64

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (4): 149; 199; 225; 261

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 71 (showing top): GOMF_NUCLEASE_ACTIVITY, IVANOVA_HEMATOPOIESIS_LATE_PROGENITOR, GGGTGGRR_PAX4_03, TGANTCA_AP1_C, MARSON_BOUND_BY_FOXP3_UNSTIMULATED, HSF1_01, ARID5B_TARGET_GENES, BARX1_TARGET_GENES, DIDO1_TARGET_GENES, DMRT1_TARGET_GENES, ELF2_TARGET_GENES, GUCY1B1_TARGET_GENES, KLF7_TARGET_GENES, NFE2L1_TARGET_GENES, RFX7_TARGET_GENES

GO Biological Process (0):

GO Molecular Function (4): nuclease activity (GO:0004518), hydrolase activity (GO:0016787), metal ion binding (GO:0046872), protein binding (GO:0005515)

GO Cellular Component (5): nucleus (GO:0005634), cytosol (GO:0005829), plasma membrane (GO:0005886), centriolar satellite (GO:0034451), cytoplasm (GO:0005737)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure3
catalytic activity, acting on a nucleic acid1
catalytic activity1
cation binding1
binding1
intracellular membrane-bounded organelle1
cytoplasm1
membrane1
cell periphery1
centrosome1
intracellular anatomical structure1

Protein interactions and networks

STRING

306 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
HARBI1NAIF1Q69YI7801
HARBI1PGBD5Q8N414663
HARBI1GIN1Q9NXP7592
HARBI1THAP9Q9H5L6589
HARBI1QRICH1Q2TAL8548
HARBI1TIGD2Q4W5G0517
HARBI1PIGZQ86VD9505
HARBI1PRORPO15091487
HARBI1POGKQ9P215479
HARBI1PGBD2Q6P3X8445
HARBI1SHPKQ9UHJ6433
HARBI1ZBED8Q8IZ13433
HARBI1CENPBD1PB2RD01431
HARBI1SFT2D2O95562430
HARBI1TIGD5Q53EQ6397

IntAct

3 interactions, top by confidence:

ABTypeScore
HARBI1NAIF1psi-mi:“MI:0915”(physical association)0.590

BioGRID (3): NAIF1 (Affinity Capture-MS), NAIF1 (Affinity Capture-MS), HARBI1 (Affinity Capture-MS)

ESM2 similar proteins: A0A0G2K1Q8, B0BN95, B2RRL2, O14423, O43422, O46510, O60108, O75564, O96006, P49777, P55205, P55211, Q01841, Q0VBL1, Q17QR8, Q17RP2, Q3V1F8, Q4W5G0, Q5U538, Q5XH12, Q60976, Q62711, Q66KB7, Q6AZB8, Q6B0B8, Q6IE26, Q6NT04, Q6P7F1, Q7L775, Q7L7V1, Q7TM95, Q8BR93, Q8BUZ3, Q8BZS9, Q8IY51, Q8K3R3, Q8VEH5, Q924H9, Q94K49, Q96DM1

Diamond homologs: B0BN95, Q17QR8, Q5U538, Q6AZB8, Q8BR93, Q96MB7

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

51 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance46
Likely benign2
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

540 predictions. Top by Δscore:

VariantEffectΔscore
11:46603910:C:CAacceptor_loss0.9900
11:46617183:T:TAdonor_gain0.9900
11:46617886:CGCAG:Cdonor_loss0.9900
11:46617888:CAG:Cdonor_loss0.9900
11:46617889:AG:Adonor_loss0.9900
11:46617890:GGT:Gdonor_loss0.9900
11:46617891:G:Adonor_loss0.9900
11:46617892:T:Gdonor_loss0.9900
11:46615840:T:TAdonor_gain0.9800
11:46603882:C:CTacceptor_gain0.9700
11:46617156:TCCC:Tdonor_gain0.9700
11:46603907:CAC:Cacceptor_gain0.9600
11:46617813:G:GTdonor_gain0.9600
11:46603910:C:CCacceptor_gain0.9500
11:46615561:AACTT:Adonor_loss0.9500
11:46615562:ACTTA:Adonor_loss0.9500
11:46615563:CTTAC:Cdonor_loss0.9500
11:46615564:TTA:Tdonor_loss0.9500
11:46615565:TA:Tdonor_loss0.9500
11:46615566:A:AAdonor_loss0.9500
11:46617157:C:Adonor_gain0.9500
11:46615566:AC:Adonor_gain0.9400
11:46615567:CC:Cdonor_gain0.9400
11:46617156:T:TAdonor_gain0.9400
11:46615566:A:ACdonor_gain0.9300
11:46615567:C:CCdonor_gain0.9300
11:46616125:A:Cdonor_gain0.9300
11:46617186:T:TAdonor_gain0.9300
11:46615609:CTA:Cdonor_gain0.9100
11:46603924:C:Tacceptor_gain0.9000

AlphaMissense

2304 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
11:46603767:G:CF271L0.997
11:46603767:G:TF271L0.997
11:46603769:A:GF271L0.997
11:46616001:A:CF79L0.997
11:46616001:A:TF79L0.997
11:46616003:A:GF79L0.997
11:46615950:A:CS96R0.996
11:46615950:A:TS96R0.996
11:46615952:T:GS96R0.996
11:46615927:A:TV104D0.995
11:46615931:A:GC103R0.995
11:46615947:C:AQ97H0.995
11:46615947:C:GQ97H0.995
11:46615983:G:CF85L0.995
11:46615983:G:TF85L0.995
11:46615985:A:GF85L0.995
11:46615933:C:GR102P0.994
11:46616011:G:TA76E0.994
11:46616014:G:TA75E0.994
11:46603695:A:CC295W0.993
11:46615915:G:AT108I0.993
11:46615943:A:GS99P0.993
11:46615951:C:AS96I0.993
11:46615980:C:AQ86H0.993
11:46615980:C:GQ86H0.993
11:46616002:A:GF79S0.993
11:46615721:A:GS173P0.992
11:46615918:A:TV107D0.992
11:46616000:A:CY80D0.992
11:46603690:A:TV297D0.991

dbSNP variants (sampled 300 via entrez): RS1000020603 (11:46614527 G>A), RS1000138243 (11:46608096 T>C), RS1000194244 (11:46617167 C>T), RS1000341024 (11:46607442 GCAT>G), RS1000394734 (11:46607170 GAAAA>G), RS1000620647 (11:46612637 A>C), RS1000734782 (11:46605486 G>A,C,T), RS1000798452 (11:46606068 T>C), RS1000824311 (11:46618213 C>T), RS1000952883 (11:46617861 G>A), RS1001069041 (11:46618206 A>G), RS1001295101 (11:46610716 CA>C), RS1002194106 (11:46603148 A>T), RS1002226534 (11:46603471 G>C), RS1002356829 (11:46604279 C>T)

Disease associations

OMIM: gene MIM:615086 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

8 associations (top):

StudyTraitp-value
GCST000763_2Immunoglobulin A2.000000e-06
GCST004521_122Autism spectrum disorder or schizophrenia3.000000e-13
GCST004521_165Autism spectrum disorder or schizophrenia3.000000e-08
GCST006803_20Schizophrenia3.000000e-13
GCST006947_1Feeling fed-up3.000000e-10
GCST007825_4Alzheimer’s disease or fasting glucose levels (pleiotropy)3.000000e-16
GCST009600_131Anorexia nervosa, attention-deficit/hyperactivity disorder, autism spectrum disorder, bipolar disorder, major depression, obsessive-compulsive disorder, schizophrenia, or Tourette syndrome (pleiotropy)1.000000e-08
GCST90000047_168Age at first sexual intercourse2.000000e-10

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0004747protein measurement
EFO:0009588feeling “fed-up” measurement
EFO:0009749age at first sexual intercourse measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

13 total (human), top 13 by PubMed support.

ChemicalActions (top 5)PubMed papers
3-((6-(2-methoxyphenyl)pyrimidin-4-yl)amino)phenyl)methane sulfonamidedecreases expression1
kojic aciddecreases expression1
avobenzonedecreases expression1
di-n-butylphosphoric acidaffects expression1
abrineincreases expression1
Sunitinibincreases expression1
Benzo(a)pyreneincreases mutagenesis1
Smokedecreases expression1
Thiramdecreases expression1
Urethaneincreases expression1
Cadmium Chloridedecreases expression1
Copper Sulfatedecreases expression1
Lactic Aciddecreases expression1

Cellosaurus cell lines

1 cell lines: 1 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_SQ89HAP1 HARBI1 (-)Cancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.