TREH

gene
On this page

Also known as TRETREAMGC129621

Summary

TREH (trehalase, HGNC:12266) is a protein-coding gene on chromosome 11q23.3, encoding Trehalase (O43280). Intestinal trehalase is probably involved in the hydrolysis of ingested trehalose.

This gene encodes an enzyme that hydrolyses trehalose, a disaccharide formed from two glucose molecules found mainly in fungi, plants, and insects. A partial duplication of this gene is located adjacent to this locus on chromosome 11. Two transcript variants encoding different isoforms have been found for this gene.

Source: NCBI Gene 11181 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): diarrhea-vomiting due to trehalase deficiency (Supportive, GenCC)
  • GWAS associations: 14
  • Clinical variants (ClinVar): 102 total
  • Phenotypes (HPO): 7
  • Druggable target: yes — 1 molecules with ChEMBL bioactivity
  • MANE Select transcript: NM_007180

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:12266
Approved symbolTREH
Nametrehalase
Location11q23.3
Locus typegene with protein product
StatusApproved
AliasesTRE, TREA, MGC129621
Ensembl geneENSG00000118094
Ensembl biotypeprotein_coding
OMIM275360
Entrez11181

Gene structure

Transcript identifiers

Ensembl transcripts: 6 — 3 protein_coding, 2 retained_intron, 1 nonsense_mediated_decay

ENST00000264029, ENST00000397925, ENST00000527558, ENST00000531295, ENST00000613915, ENST00000854539

RefSeq mRNA: 2 — MANE Select: NM_007180 NM_001301065, NM_007180

CCDS: CCDS73401, CCDS73402

Canonical transcript exons

ENST00000264029 — 15 exons

ExonStartEnd
ENSE00000796071118658680118658733
ENSE00000796072118658905118659017
ENSE00000796073118659370118659481
ENSE00000796074118659747118659964
ENSE00000796076118660866118660915
ENSE00000796077118661160118661282
ENSE00001302378118657316118658441
ENSE00003461195118662881118662968
ENSE00003494992118661890118661990
ENSE00003512150118663339118663439
ENSE00003583528118661637118661729
ENSE00003585564118663052118663196
ENSE00003652439118660539118660733
ENSE00003652500118661393118661509
ENSE00003721071118679539118679650

Expression profiles

Bgee: expression breadth ubiquitous, 162 present calls, max score 93.26.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.1609 / max 90.0600, expressed in 22 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
1226130.132719
1226140.02829

Top tissues by expression

292 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
jejunal mucosaUBERON:000039993.26gold quality
duodenumUBERON:000211492.11gold quality
small intestineUBERON:000210881.94gold quality
small intestine Peyer’s patchUBERON:000345481.77gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047379.27gold quality
adult mammalian kidneyUBERON:000008275.44gold quality
ileal mucosaUBERON:000033174.66silver quality
gall bladderUBERON:000211073.13gold quality
right lobe of liverUBERON:000111472.10gold quality
sural nerveUBERON:001548871.77gold quality
stromal cell of endometriumCL:000225571.42gold quality
mucosa of transverse colonUBERON:000499169.94gold quality
jejunumUBERON:000211569.65gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099168.95gold quality
kidneyUBERON:000211368.51gold quality
pancreatic ductal cellCL:000207967.19silver quality
rectumUBERON:000105267.14gold quality
lower esophagus mucosaUBERON:003583466.11gold quality
metanephros cortexUBERON:001053365.78gold quality
body of stomachUBERON:000116165.76gold quality
transverse colonUBERON:000115765.73gold quality
cortex of kidneyUBERON:000122565.59gold quality
body of pancreasUBERON:000115064.16gold quality
liverUBERON:000210763.95gold quality
intestineUBERON:000016063.92gold quality
esophagus mucosaUBERON:000246963.70gold quality
right atrium auricular regionUBERON:000663163.36gold quality
left uterine tubeUBERON:000130362.97gold quality
stomachUBERON:000094562.49gold quality
cardiac atriumUBERON:000208162.20gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes4.36
E-MTAB-6075no96.82

Regulation

Is transcription factor: no

Literature-anchored findings (GeneRIF, showing 6)

  • These results suggest that TREH acts as a stress-response protein in the kidney rather than involved in utilization of exogenous trehalose. (PMID:19126402)
  • we found three protective alleles for glioma patients, including the allele ‘‘T’’ of rs17748 in the TREH gene by recessive model (OR, 0.48; 95% CI, 0.23-1.01; P = 0.05) analysis (PMID:22369735)
  • variants in TREH control trehalase activity, and although one of these variants is also reproducibly associated with T2D, it is likely that the effect of the SNP on risk of T2D occurs by a mechanism different than affecting trehalase activity. (PMID:23468175)
  • results suggest a significant association between the RETL1, TREH, and PHLDB1 genes and GBM development in the Han Chinese population (PMID:26156397)
  • In conclusion, we identified four novel loci (TGFA, PIK3R1, FGFR3 and TREH) and confirmed two loci known to be associated with cartilage thickness.The identified associations were not caused by rare exonic variants. This is the first report linking TGFA to human Osteoarthritis, which may serve as a new target for future therapies. (PMID:27701424)
  • these results indicate that trehalose has anti-apoptotic effects through the suppression of oxidative stress-induced mitochondrial injury and ER stress which is dependent on the promotion of autophagic flux and the induction of selective autophagy (PMID:28981117)

Cross-species orthologs

10 orthologs

OrganismSymbolGene ID
danio_reriotrehENSDARG00000077911
mus_musculusTrehENSMUSG00000032098
rattus_norvegicusTrehENSRNOG00000052010
drosophila_melanogasterTrehFBGN0003748
drosophila_melanogasterCG6262FBGN0034121
caenorhabditis_eleganstre-1WBGENE00006607
caenorhabditis_elegansWBGENE00006608
caenorhabditis_elegansWBGENE00006609
caenorhabditis_elegansWBGENE00006610
caenorhabditis_elegansWBGENE00006611

Protein

Protein identifiers

TrehalaseO43280 (reviewed: O43280)

Alternative names: Alpha,alpha-trehalase, Alpha,alpha-trehalose glucohydrolase

All UniProt accessions (2): O43280, A0A087WTJ4

UniProt curated annotations — full annotation on UniProt →

Function. Intestinal trehalase is probably involved in the hydrolysis of ingested trehalose.

Subunit / interactions. Homodimer; disulfide-linked.

Subcellular location. Cell membrane.

Tissue specificity. Expressed in kidney, liver and small intestine. Also more weakly expressed in pancreas.

Disease relevance. Trehalase deficiency (TREHD) [MIM:612119] An autosomal recessive condition characterized by the inability to digest trehalose, a disaccharide found in mushrooms, products containing baker’s yeast, and dried food. Individuals with trehalase deficiency suffer from abdominal pain, increased rectal flatulence, and diarrhea due to osmotic water flow into the colon. The gene represented in this entry is involved in disease pathogenesis.

Similarity. Belongs to the glycosyl hydrolase 37 family.

Isoforms (2)

UniProt IDNamesCanonical?
O43280-11yes
O43280-22

RefSeq proteins (2): NP_001287994, NP_009111* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001661Glyco_hydro_37Family
IPR0089286-hairpin_glycosidase_sfHomologous_superfamily
IPR0123416hp_glycosidase-like_sfHomologous_superfamily
IPR018232Glyco_hydro_37_CSConserved_site

Pfam: PF01204

Enzyme classification (BRENDA):

  • EC 3.2.1.28 — alpha,alpha-trehalase (BRENDA: 90 organisms, 95 substrates, 318 inhibitors, 114 Km, 29 kcat entries)

Substrate kinetics (BRENDA)

2 substrates with measured Km, best-characterized 2. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
ALPHA,ALPHA-TREHALOSE0.06–72.859
TREHALOSE0.0109–5550

Catalyzed reactions (Rhea), 1 shown:

  • alpha,alpha-trehalose + H2O = alpha-D-glucose + beta-D-glucose (RHEA:32675)

UniProt features (24 total): binding site 7, sequence variant 5, glycosylation site 4, active site 2, signal peptide 1, chain 1, lipid moiety-binding region 1, splice variant 1, propeptide 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O43280-F192.800.90

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (2): 321 (proton donor/acceptor); 514 (proton donor/acceptor)

Ligand- & substrate-binding residues (7): 319; 529; 168; 175–176; 212; 221–223; 286–288

Post-translational modifications (1): 556

Glycosylation sites (4): 78, 239, 261, 369

Function

Pathways and Gene Ontology

Reactome pathways

1 pathways

IDPathway
R-HSA-189085Digestion of dietary carbohydrate

MSigDB gene sets: 71 (showing top): GOBP_OLIGOSACCHARIDE_METABOLIC_PROCESS, REACTOME_DIGESTION_OF_DIETARY_CARBOHYDRATE, GOBP_OLIGOSACCHARIDE_CATABOLIC_PROCESS, GOBP_ANIMAL_ORGAN_MORPHOGENESIS, HNF4_DR1_Q3, GOBP_CARBOHYDRATE_METABOLIC_PROCESS, KEGG_STARCH_AND_SUCROSE_METABOLISM, GOBP_CARBOHYDRATE_CATABOLIC_PROCESS, GOCC_SIDE_OF_MEMBRANE, GOMF_HYDROLASE_ACTIVITY_ACTING_ON_GLYCOSYL_BONDS, YOSHIMURA_MAPK8_TARGETS_UP, MULLIGHAN_MLL_SIGNATURE_1_DN, HNF4ALPHA_Q6, GOMF_HYDROLASE_ACTIVITY_HYDROLYZING_O_GLYCOSYL_COMPOUNDS, CHEN_METABOLIC_SYNDROM_NETWORK

GO Biological Process (4): trehalose metabolic process (GO:0005991), trehalose catabolic process (GO:0005993), animal organ morphogenesis (GO:0009887), carbohydrate metabolic process (GO:0005975)

GO Molecular Function (3): alpha,alpha-trehalase activity (GO:0004555), hydrolase activity (GO:0016787), hydrolase activity, acting on glycosyl bonds (GO:0016798)

GO Cellular Component (4): plasma membrane (GO:0005886), membrane (GO:0016020), extracellular exosome (GO:0070062), side of membrane (GO:0098552)

Reactome top-level categories

Rollup of top-1 pathways:

CategoryPathways
Digestion1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
membrane2
cellular anatomical structure2
disaccharide metabolic process1
trehalose metabolic process1
disaccharide catabolic process1
anatomical structure morphogenesis1
animal organ development1
primary metabolic process1
trehalase activity1
catalytic activity1
hydrolase activity1
cell periphery1
extracellular vesicle1
leaflet of membrane bilayer1

Protein interactions and networks

STRING

1885 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
TREHMGAMO43451779
TREHSIP14410737
TREHMGAM2Q2M2H8734
TREHPABPN1Q86U42712
TREHPGM3O95394592
TREHAMY2AP04746591
TREHUGP2Q16851591
TREHPYGLP06737582
TREHPYGBP11216582
TREHPYGMP11217577
TREHPGGHGQ32M88575
TREHGPIP06744574
TREHAMY1BP04745565
TREHPHLDB1Q86UU1563
TREHNTHL1P78549560

IntAct

2 interactions, top by confidence:

ABTypeScore
TREHGPAA1psi-mi:“MI:0915”(physical association)0.400

BioGRID (2): GPAA1 (Affinity Capture-MS), GPAA1 (Affinity Capture-MS)

ESM2 similar proteins: A0JMP0, A4IG42, A5PJN5, A6QLU7, A6QQ07, F1N2K1, O00462, O18835, O43280, O77695, O95479, O97524, P08236, P10253, P12265, P19813, P70699, P82450, Q3U4H6, Q4FAT7, Q4FZV0, Q5E985, Q5FVF9, Q5R5N6, Q5R7A9, Q5R8R3, Q5RFU0, Q5XHI4, Q641Z7, Q6P6V7, Q6P7A9, Q6QR59, Q6RHW4, Q76HN1, Q865R1, Q8BFW6, Q8BNE1, Q8BP56, Q8C0L6, Q8CFX1

Diamond homologs: A1AAC5, A1AH61, A7ZKW9, A7ZT60, A7ZZD1, A8A5X9, A8AFT6, A8J4S9, A9MLK7, A9MVX4, B0RNH1, B1IU96, B1J0B4, B1LHA4, B1LJ63, B1X7X4, B1XAN8, B2U4I7, B4SUI9, B4SWF1, B4T8D8, B4TXW7, B4TZ02, B5BI56, B5F4F0, B5F9H0, B5FKM4, B5FTN7, B5R2X4, B5R404, B5R904, B5RGS3, B5YVG5, B6I385, B6I9Q8, B7L603, B7LGV7, B7LSZ0, B7LXB1, B7M3D0

SIGNOR signaling

1 interactions.

AEffectBMechanism
CDK1“up-regulates activity”TREHphosphorylation

Disease & clinical

Clinical variants and AI predictions

ClinVar

102 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance74
Likely benign19
Benign5

Top pathogenic / likely-pathogenic (0)

SpliceAI

2780 predictions. Top by Δscore:

VariantEffectΔscore
11:118658734:C:CCacceptor_gain1.0000
11:118659833:C:CAdonor_gain1.0000
11:118661207:T:Adonor_gain1.0000
11:118661208:C:Adonor_gain1.0000
11:118661391:A:ACdonor_gain1.0000
11:118661392:C:CCdonor_gain1.0000
11:118663046:TCATA:Tdonor_loss1.0000
11:118663047:CATAC:Cdonor_loss1.0000
11:118663048:ATAC:Adonor_loss1.0000
11:118663049:TACCT:Tdonor_loss1.0000
11:118663050:A:ATdonor_loss1.0000
11:118663051:C:CTdonor_loss1.0000
11:118663051:CCTGT:Cdonor_gain1.0000
11:118663059:T:Adonor_gain1.0000
11:118663440:C:CCacceptor_gain1.0000
11:118658730:CATA:Cacceptor_gain0.9900
11:118658732:TA:Tacceptor_gain0.9900
11:118659365:CCTA:Cdonor_loss0.9900
11:118659366:CTACC:Cdonor_loss0.9900
11:118659367:TA:Tdonor_loss0.9900
11:118659368:A:Gdonor_loss0.9900
11:118659369:CCT:Cdonor_gain0.9900
11:118659748:T:TAdonor_gain0.9900
11:118660541:AG:Adonor_gain0.9900
11:118660665:A:Tacceptor_gain0.9900
11:118660667:C:CTacceptor_gain0.9900
11:118660860:GCTCA:Gdonor_loss0.9900
11:118660861:CTCAC:Cdonor_loss0.9900
11:118660862:TCA:Tdonor_loss0.9900
11:118660863:CACCT:Cdonor_loss0.9900

AlphaMissense

3785 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
11:118658433:A:CF536L0.998
11:118658433:A:TF536L0.998
11:118658435:A:GF536L0.998
11:118661907:A:CF169L0.998
11:118661907:A:TF169L0.998
11:118661909:A:GF169L0.998
11:118661160:C:AR286M0.997
11:118661160:C:GR286T0.997
11:118658429:A:GW538R0.995
11:118658429:A:TW538R0.995
11:118660896:C:GD293H0.995
11:118660915:C:AR286S0.995
11:118660915:C:GR286S0.995
11:118661483:C:GR215P0.995
11:118659415:C:GD463H0.994
11:118659418:A:GW462R0.994
11:118659418:A:TW462R0.994
11:118660675:G:CF322L0.994
11:118660675:G:TF322L0.994
11:118660677:A:GF322L0.994
11:118660678:G:CD321E0.994
11:118660678:G:TD321E0.994
11:118660679:T:AD321V0.994
11:118662901:A:GW135R0.994
11:118662901:A:TW135R0.994
11:118663373:C:AK52N0.994
11:118663373:C:GK52N0.994
11:118659416:C:AW462C0.993
11:118659416:C:GW462C0.993
11:118660680:C:GD321H0.993

dbSNP variants (sampled 300 via entrez): RS1000269510 (11:118657612 T>C), RS1000600962 (11:118677182 G>A,C), RS1000654618 (11:118669204 G>A), RS1000714997 (11:118676799 C>A,T), RS1000943185 (11:118669725 C>G,T), RS1000975533 (11:118664099 A>C,G), RS1001429600 (11:118665807 A>C), RS1001501944 (11:118677277 C>T), RS1001859552 (11:118664538 C>T), RS1001931708 (11:118670560 T>G), RS1002059662 (11:118677684 C>G), RS1002435417 (11:118664759 G>C,T), RS1002745273 (11:118681420 G>A), RS1003136556 (11:118673592 G>A,T), RS1003190403 (11:118679711 C>T)

Disease associations

OMIM: gene MIM:275360 | disease phenotypes: MIM:612119

GenCC curated gene-disease

DiseaseClassificationInheritance
diarrhea-vomiting due to trehalase deficiencySupportiveAutosomal dominant

Mondo (1): diarrhea-vomiting due to trehalase deficiency (MONDO:0012803)

Orphanet (1): Trehalase deficiency (Orphanet:103909)

HPO phenotypes

7 total (7 of 7 shown, HPO-id order):

HPOTerm
HP:0000007Autosomal recessive inheritance
HP:0002013Vomiting
HP:0002014Diarrhea
HP:0002024Malabsorption
HP:0002027Abdominal pain
HP:0003270Abdominal distention
HP:0012379Abnormal circulating enzyme concentration or activity

GWAS associations

14 associations (top):

StudyTraitp-value
GCST000817_136Height2.000000e-08
GCST001670_2Vitiligo2.000000e-09
GCST002388_19Serum metabolite levels3.000000e-30
GCST003853_6Hip minimal joint space width2.000000e-07
GCST005523_30Celiac disease2.000000e-11
GCST005531_22Multiple sclerosis8.000000e-11
GCST005534_4Systemic sclerosis1.000000e-07
GCST005535_1Diffuse cutaneous systemic sclerosis3.000000e-06
GCST005990_14Non-albumin protein levels2.000000e-08
GCST006612_66LDL cholesterol8.000000e-11
GCST008489_5Celiac disease2.000000e-08
GCST009652_17Serum alkaline phosphatase levels1.000000e-09
GCST010002_199Refractive error3.000000e-34
GCST90011900_61Serum alkaline phosphatase levels7.000000e-39

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0007873cartilage thickness measurement
EFO:0004611low density lipoprotein cholesterol measurement
EFO:0004533alkaline phosphatase measurement

MeSH disease descriptors (1)

DescriptorNameTree numbers
C562603Trehalase Deficiency (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL3087 (SINGLE PROTEIN)

Molecules with ChEMBL bioactivity

1 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 4,739 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL307429DUVOGLUSTAT24,739

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

1 potent at pChembl≥5 of 2 total, top 1 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
6.75Ki180nMSALBOSTATIN

CTD chemical–gene interactions

9 total (human), top 9 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, increases expression2
aristolochic acid Iincreases expression1
sodium arsenitedecreases expression1
CGP 52608affects binding, increases reaction1
Acetaminophendecreases expression1
Arbutindecreases expression1
Hydrocortisonedecreases activity1
Aflatoxin B1increases expression1
Okadaic Aciddecreases expression1

ChEMBL screening assays

4 unique, capped per target: 4 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL812720BindingEvaluated for the inhibition of trehalase from porcine kidneysTotal synthesis of trehalase inhibitor salbostatin — Bioorg Med Chem Lett

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.