NTHL1

gene
On this page

Also known as NTH1OCTS3

Summary

NTHL1 (nth like DNA glycosylase 1, HGNC:8028) is a protein-coding gene on chromosome 16p13.3, encoding Endonuclease III-like protein 1 (P78549). Bifunctional DNA N-glycosylase with associated apurinic/apyrimidinic (AP) lyase function that catalyzes the first step in base excision repair (BER), the primary repair pathway for the repair of oxidative DNA damage.

The protein encoded by this gene is a DNA N-glycosylase of the endonuclease III family. Like a similar protein in E. coli, the encoded protein has DNA glycosylase activity on DNA substrates containing oxidized pyrimidine residues and has apurinic/apyrimidinic lyase activity.

Source: NCBI Gene 4913 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): NTHL1-deficiency tumor predisposition syndrome (Definitive, ClinGen) — +3 more curated relationships
  • GWAS associations: 2
  • Clinical variants (ClinVar): 1,808 total — 114 pathogenic, 33 likely-pathogenic
  • Phenotypes (HPO): 18
  • Druggable target: yes
  • Dosage sensitivity (ClinGen): haploinsufficiency autosomal recessive, triplosensitivity no evidence
  • MANE Select transcript: NM_002528

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:8028
Approved symbolNTHL1
Namenth like DNA glycosylase 1
Location16p13.3
Locus typegene with protein product
StatusApproved
AliasesNTH1, OCTS3
Ensembl geneENSG00000065057
Ensembl biotypeprotein_coding
OMIM602656
Entrez4913

Gene structure

Transcript identifiers

Ensembl transcripts: 26 — 18 protein_coding, 5 retained_intron, 2 nonsense_mediated_decay, 1 protein_coding_CDS_not_defined

ENST00000219066, ENST00000561841, ENST00000561862, ENST00000562120, ENST00000562951, ENST00000565406, ENST00000566380, ENST00000567727, ENST00000568513, ENST00000623977, ENST00000651522, ENST00000651570, ENST00000651583, ENST00000903566, ENST00000903567, ENST00000903568, ENST00000903569, ENST00000925705, ENST00000925706, ENST00000925707, ENST00000925708, ENST00000925709, ENST00000925710, ENST00000925711, ENST00000925713, ENST00000971053

RefSeq mRNA: 3 — MANE Select: NM_002528 NM_001318193, NM_001318194, NM_002528

CCDS: CCDS10457

Canonical transcript exons

ENST00000651570 — 6 exons

ExonStartEnd
ENSE0000066510520461282046366
ENSE0000349385320446302044800
ENSE0000349710820401332040238
ENSE0000355675120398202040047
ENSE0000364333120435672043726
ENSE0000384241920477092047834

Expression profiles

Bgee: expression breadth ubiquitous, 211 present calls, max score 94.90.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 11.8585 / max 119.1715, expressed in 1713 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
15589911.78211713
1558980.07659

Top tissues by expression

269 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right lobe of liverUBERON:000111494.90gold quality
apex of heartUBERON:000209893.17gold quality
mucosa of transverse colonUBERON:000499192.55gold quality
right uterine tubeUBERON:000130289.94gold quality
putamenUBERON:000187489.25gold quality
right hemisphere of cerebellumUBERON:001489089.01gold quality
cerebellar hemisphereUBERON:000224588.86gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099188.85gold quality
cerebellar cortexUBERON:000212988.62gold quality
nucleus accumbensUBERON:000188288.60gold quality
caudate nucleusUBERON:000187388.56gold quality
right frontal lobeUBERON:000281088.19gold quality
right lungUBERON:000216787.85gold quality
Brodmann (1909) area 9UBERON:001354087.46gold quality
liverUBERON:000210787.38gold quality
tibial nerveUBERON:000132386.98gold quality
right ovaryUBERON:000211886.83gold quality
C1 segment of cervical spinal cordUBERON:000646986.70gold quality
ectocervixUBERON:001224986.66gold quality
oocyteCL:000002386.52gold quality
prefrontal cortexUBERON:000045186.49gold quality
cerebellumUBERON:000203786.42gold quality
transverse colonUBERON:000115786.41gold quality
spleenUBERON:000210686.30gold quality
endocervixUBERON:000045886.19gold quality
muscle layer of sigmoid colonUBERON:003580586.19gold quality
amygdalaUBERON:000187686.10gold quality
cingulate cortexUBERON:000302786.02gold quality
right coronary arteryUBERON:000162585.86gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047385.78gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes6.78

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): BRCA1, ELF1, MYC, POU2F1

miRNA regulators (miRDB)

3 targeting NTHL1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-329-3P99.9166.561234
HSA-MIR-362-3P99.9166.381267
HSA-MIR-427298.7668.741810

Functional genomics

ClinGen dosage: haploinsufficiency 30 (autosomal recessive), triplosensitivity 0 (no evidence). ClinGen Gene Dosage Map

Literature-anchored findings (GeneRIF, showing 39)

  • NTH1 is involved in removal of 8-oxoguanine from 8-oxoguanine/guanine mispairs in DNA (PMID:11328882)
  • recombinant hNTH1 lacking 55, 72 or 80 residues from the N terminus had four- to fivefold higher activities than the full-length enzyme (PMID:12144783)
  • substrate selectivity of mammalian NTH1 and the concomitant selective stimulation of activity by APE1 are indicative of selective repair of oxidative damage in different regions of the genome (PMID:12519758)
  • dimerization of hNTH1 involving the N-terminal tail masks the inhibitory effect of this tail and plays a critical role in its catalytic turnover in the cell (PMID:14522981)
  • hNTH1 and hNEIL1 but not hNEIL2 excised the two stereoisomers of thymine glycol (5R-Tg and 5S-Tg), but their isomer specificity was markedly different (PMID:14734554)
  • search for the factors interacting with NTH1 shows GST-NTH1 fusion protein precipitates proliferating cell nuclear antigen (PCNA) and p53 as well as XPG from human cell-free extracts (PMID:15358233)
  • NTH1 is a DNA glycosylase that excise 5-formyluracil, 5-hydroxymethyluracil and Thymine glycol in human cells. (PMID:15533839)
  • hNTH1 plays two roles in the processing of radiation damages: repair of potentially lethal single lesions and generation of lethal double strand breaks at clustered damage sites. (PMID:16111924)
  • Nth1 released 5R,6S 2’-deoxyribonucleoside diastereoisomer (Tg2) much more rapidly than cis 5S,6R-deoxyribonucleoside diastereoisomer (Tg1) regardless of the opposing purine. Neil1 released Thymine glycol non-stereoselectively. (PMID:16446124)
  • The damage specificity of human homologues of Endo III (hNTHl) and Endo VIII (hNEIL1 and hNEIL2) is compared to elucidate the repair role in cells. (PMID:17150535)
  • These observations suggest that access to sterically occluded lesions entails the partial, reversible unwrapping of DNA from the histone octamer, allowing hNTH1 to capture its DNA substrate when it is in an unwound state. (PMID:17923696)
  • YB-1 binds specifically to the auto-inhibitory domain of hNTH1, providing a mechanism by which YB-1 stimulates hNTH1 activity. (PMID:18307537)
  • Downregulation of NTH1 is associated with gastric cancer. (PMID:19414504)
  • we demonstrated that hNEIL1 and hNTH1 cleave Oz sites as efficiently as 5-hydroxyuracil sites. Thus, hNEIL1 and hNTH1 can repair Oz lesions (PMID:22465744)
  • A homozygous loss-of-function germline mutation in the NTHL1 gene predisposes to a new subtype of BER-associated adenomatous polyposis and colorectal cancer. (PMID:25938944)
  • NTH1 polymorphisms may be associated with non-small cell lung cancer pathogenesis. (PMID:26400813)
  • This study extends the description of biallelic mutations in NTHL1 beyond the single c.268C–>T,p.Q90* mutation that was observed previously. (PMID:26559593)
  • We therefore found published evidence to support the association between variants in NTHL1 and RPS20 with CRC. (PMID:27713038)
  • Both Ntg1 and its human homologue, NTHL1, can be SUMO-modified in response to oxidative stress. (PMID:27839712)
  • WT NTHL1 (human) and Nth (E. coli) are remarkably alike with respect to specificity of glycosylase reaction, and although NTHL1 is a much slower enzyme than Nth, the tighter binding of NTHL1 compensates, resulting in similar kcat/Kd values for both enzymes with each of the substrates tested. For NTHL1 Gln287Ala, specificity for substrates positioned opposite G is lost, but not that of substrates positioned opposite A. (PMID:28292631)
  • Data indicate that DNA glycosylases MYH, UNG2, MPG, NTH1, NEIL1, 2 and 3 on nascent DNA. (PMID:28575236)
  • NTHL1 in these extracts was able to excise thymine glycol from both naked DNA and sites in nucleosomes that earlier studies had shown to be sterically accessible. However, the same extracts were able to excise lesions from sterically-occluded sites in nucleosomes only after the addition of Mg2+/ATP. (PMID:28709015)
  • This paper demonstrates that cellular NTH1 protein is induced in response to oxidative stress following hydrogen peroxide treatment of cells and that accumulation of NTH1 on chromatin is exacerbated in the absence of TRIM26. (PMID:29610152)
  • The processing of the lesions was evaluated by purified enzyme cocktails of hNTH1 and hOGG1 as well as with a HeLa cell extract. Interestingly, the yield of double-strand breaks (DSBs) resulting from the processing of the bistranded lesions are appreciably lower when the DNA is treated with the HeLa cell extract compared with the relevant purified enzyme cocktail in both models (PMID:30207305)
  • The results suggest that the NTHL1 variants Q90X, Y130X, R153X, and Q287X, but not R19Q, V179I, V217F, or G286S, were defective in 5OHU repair and the alleles encoding them were considered to be pathogenic for NTHL1-associated polyposis. (PMID:30552997)
  • he lyase activity of hNTHL1, and the 3’ diesterase activity of APE1, which had been seen as relatively dispensable, may have been preserved during evolution to enhance base excision repair (BER) in chromatin (PMID:30649547)
  • NTHL1 is a multi-tumor predisposition gene with a high lifetime risk for extracolonic cancers and a typical mutational signature observed across tumor types, which can assist in the recognition of this syndrome. (PMID:30753826)
  • NTHL1 biallelic mutations seldom cause colorectal cancer, serrated polyposis or a multi-tumor phenotype, in absence of colorectal adenomas. (PMID:31227763)
  • An Excimer Clamp for Measuring Damaged-Base Excision by the DNA Repair Enzyme NTH1. (PMID:32109332)
  • Monoallelic NTHL1 Loss-of-Function Variants and Risk of Polyposis and Colorectal Cancer. (PMID:32860789)
  • Evaluating the role of NTHL1 p.Q90* allele in inherited breast cancer predisposition. (PMID:32949222)
  • NTHL1 in genomic integrity, aging and cancer. (PMID:33087284)
  • Further delineation of the NTHL1 associated syndrome: A report from the French Oncogenetic Consortium. (PMID:33454955)
  • Prevalence and Characterization of Biallelic and Monoallelic NTHL1 and MSH3 Variant Carriers From a Pan-Cancer Patient Population. (PMID:34250384)
  • Caught in motion: human NTHL1 undergoes interdomain rearrangement necessary for catalysis. (PMID:34871433)
  • Human endonuclease III/NTH1: focusing on the [4Fe-4S] cluster and the N-terminal domain. (PMID:36279116)
  • Concurrent Reduced Expression of Contiguous PKD1, TSC2 and NTHL1 Leading to Kidney Diseases and Multiple Diverse Renal Cancers. (PMID:36581342)
  • NTHL1 is a recessive cancer susceptibility gene. (PMID:38036545)
  • Enhanced Sampling for Conformational Changes and Molecular Mechanisms of Human NTHL1. (PMID:38483510)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_rerionthl1ENSDARG00000042881
mus_musculusNthl1ENSMUSG00000041429
rattus_norvegicusNthl1ENSRNOG00000012213
drosophila_melanogasterNthl1FBGN0032907
caenorhabditis_elegansWBGENE00011201

Paralogs (1): MUTYH (ENSG00000132781)

Protein

Protein identifiers

Endonuclease III-like protein 1P78549 (reviewed: P78549)

Alternative names: Bifunctional DNA N-glycosylase/DNA-(apurinic or apyrimidinic site) lyase

All UniProt accessions (7): P78549, A0A494BZZ6, A0A494C106, E5KTI5, H3BPD5, H3BRL9, H3BUV2

UniProt curated annotations — full annotation on UniProt →

Function. Bifunctional DNA N-glycosylase with associated apurinic/apyrimidinic (AP) lyase function that catalyzes the first step in base excision repair (BER), the primary repair pathway for the repair of oxidative DNA damage. The DNA N-glycosylase activity releases the damaged DNA base from DNA by cleaving the N-glycosidic bond, leaving an AP site. The AP-lyase activity cleaves the phosphodiester bond 3’ to the AP site by a beta-elimination. Primarily recognizes and repairs oxidative base damage of pyrimidines. Also has 8-oxo-7,8-dihydroguanine (8-oxoG) DNA glycosylase activity. Acts preferentially on DNA damage opposite guanine residues in DNA. Is able to process lesions in nucleosomes without requiring or inducing nucleosome disruption.

Subunit / interactions. Interacts with YBX1. Interacts with ERCC5/XPG; the interaction stimulates NTHL1 activity and NTHL1 binding to its DNA substrate.

Subcellular location. Nucleus. Mitochondrion.

Tissue specificity. Widely expressed with highest levels in heart and lowest levels in lung and liver.

Post-translational modifications. Ubiquitinated by TRIM26; leading to proteasomal degradation.

Disease relevance. Familial adenomatous polyposis 3 (FAP3) [MIM:616415] A form of familial adenomatous polyposis, a condition characterized by the development of multiple colorectal adenomatous polyps, benign neoplasms derived from glandular epithelium. Some affected individuals may develop colorectal carcinoma. The disease is caused by variants affecting the gene represented in this entry.

Activity regulation. APE1 displaces NTHL1 from the N-glycosylase-generated AP site in DNA, thereby increasing the turnover of the DNA N-glycosylase activity. AP lyase activity is stimulated by YBX1. ERCC5/XPG stimulates NTHL1 activity and NTHL1 binding to its DNA substrate.

Cofactor. Binds 1 [4Fe-4S] cluster. The cluster does not appear to play a role in catalysis, but is probably involved in the proper positioning of the enzyme along the DNA strand.

Similarity. Belongs to the Nth/MutY family.

Isoforms (3)

UniProt IDNamesCanonical?
P78549-22, M+1yes
P78549-11, M-8
P78549-33, M+8

RefSeq proteins (3): NP_001305122, NP_001305123, NP_002519* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000445HhH_motifConserved_site
IPR003265HhH-GPD_domainDomain
IPR003651Endonuclease3_FeS-loop_motifConserved_site
IPR004036Endonuclease-III-like_CS2Conserved_site
IPR011257DNA_glycosylaseHomologous_superfamily
IPR023170HhH_base_excis_CHomologous_superfamily
IPR030841NTH1Family

Pfam: PF00633, PF00730

Enzyme classification (BRENDA):

  • EC 4.2.99.18 — DNA-(apurinic or apyrimidinic site) lyase (BRENDA: 46 organisms, 543 substrates, 374 inhibitors, 166 Km, 138 kcat entries)

Substrate kinetics (BRENDA)

68 substrates with measured Km, best-characterized 15. Km ranges are aggregated across organisms/conditions.

SubstrateKm (mM)Measurements
5’-CTCTCCCTTC-5,6-DIHYDROURACIL-CTCCTTTCCTCT-3'13
DNA CONTAINING AN ABASIC SITE100–4139
5’-GACAAGCGCAG-(5R,6S)-2’-DEOXY-5,6-DIHYDROXYURI8
DNA CONTAINING 5-OH-C/A6
5’-GACAAGCGCAG-(5S,6R)-2’-DEOXY-5,6-DIHYDROXYURI5
DNA CONTAINING 5-OH-C/G5
OLIGOMER WITH G/U PAIR0.0001–0.00135
43-MER OLIGONUCLEOTIDE CONTAINING APURINIC/APYRI4
5’-CTCTCCCTTC-8-OXO-7,8-DIHYDROGUANINE-CTCCTTTCC0.0006–0.00134
AP-DNA0.0001–0.0114
DNA0.0009–0.00174
DNA CONTAINING APURINIC/APYRIMIDINIC SITES4
THF-CONTAINING OLIGONUCLEOTIDE0.0001–0.00024
12-MER OLIGODEOXYRIBONUCLEOTIDE CONTAINING A NAT0.0001–0.00022
18-MER CONTAINING P33-LABELED TETRAHYDROFURAN2

Catalyzed reactions (Rhea), 1 shown:

  • 2’-deoxyribonucleotide-(2’-deoxyribose 5’-phosphate)-2’-deoxyribonucleotide-DNA = a 3’-end 2’-deoxyribonucleotide-(2,3-dehydro-2,3-deoxyribose 5’-phosphate)-DNA + a 5’-end 5’-phospho-2’-deoxyribonucleoside-DNA + H(+) (RHEA:66592)

UniProt features (51 total): helix 16, mutagenesis site 6, sequence variant 5, binding site 4, sequence conflict 4, modified residue 2, splice variant 2, strand 2, compositionally biased region 2, transit peptide 1, chain 1, site 1, domain 1, region of interest 1, turn 1, short sequence motif 1, active site 1

Structure

Experimental structures (PDB)

2 structures.

PDBMethodResolution (Å)
7RDTX-RAY DIFFRACTION2.1
7RDSX-RAY DIFFRACTION2.5

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P78549-F182.870.74

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (2): 231 (important for catalytic activity); 212 (nucleophile; for n-glycosylase activity)

Ligand- & substrate-binding residues (4): 292; 298; 282; 289

Post-translational modifications (2): 63, 65

Mutagenesis-validated functional residues (6):

PositionPhenotype
32–34sorted to the cytoplasm.
34sorted to both nuclei and mitochondria.
34sorted to the cytoplasm.
49sorted to both nuclei and mitochondria.
212inactivates enzyme.
21285-fold reduction in activity. uncouples the glycosylase activity from the lyase activity. shows glycosylase activity wi

Function

Pathways and Gene Ontology

Reactome pathways

5 pathways

IDPathway
R-HSA-110328Recognition and association of DNA glycosylase with site containing an affected pyrimidine
R-HSA-110329Cleavage of the damaged pyrimidine
R-HSA-110357Displacement of DNA glycosylase by APEX1
R-HSA-9630221Defective NTHL1 substrate processing
R-HSA-9630222Defective NTHL1 substrate binding

MSigDB gene sets: 225 (showing top): GOMF_ENDONUCLEASE_ACTIVITY, BUYTAERT_PHOTODYNAMIC_THERAPY_STRESS_DN, GOMF_NUCLEASE_ACTIVITY, ENK_UV_RESPONSE_KERATINOCYTE_UP, GOBP_CARBOHYDRATE_DERIVATIVE_CATABOLIC_PROCESS, GOBP_ORGANOPHOSPHATE_METABOLIC_PROCESS, KAUFFMANN_DNA_REPAIR_GENES, GOBP_CARBOHYDRATE_DERIVATIVE_METABOLIC_PROCESS, GOBP_NUCLEOBASE_CONTAINING_SMALL_MOLECULE_METABOLIC_PROCESS, GOBP_PYRIMIDINE_CONTAINING_COMPOUND_CATABOLIC_PROCESS, GOBP_NUCLEOTIDE_EXCISION_REPAIR, MUELLER_PLURINET, GCM_RING1, GOBP_PYRIMIDINE_NUCLEOTIDE_METABOLIC_PROCESS, GOBP_DNA_MODIFICATION

GO Biological Process (7): base-excision repair, AP site formation (GO:0006285), nucleotide-excision repair (GO:0006289), depyrimidination (GO:0045008), DNA repair (GO:0006281), base-excision repair (GO:0006284), response to stress (GO:0006950), DNA damage response (GO:0006974)

GO Molecular Function (17): oxidized pyrimidine nucleobase lesion DNA N-glycosylase activity (GO:0000703), damaged DNA binding (GO:0003684), double-stranded DNA binding (GO:0003690), DNA-(apurinic or apyrimidinic site) endonuclease activity (GO:0003906), endonuclease activity (GO:0004519), oxidized purine nucleobase lesion DNA N-glycosylase activity (GO:0008534), DNA N-glycosylase activity (GO:0019104), metal ion binding (GO:0046872), 4 iron, 4 sulfur cluster binding (GO:0051539), class I DNA-(apurinic or apyrimidinic site) endonuclease activity (GO:0140078), DNA binding (GO:0003677), catalytic activity (GO:0003824), protein binding (GO:0005515), hydrolase activity (GO:0016787), hydrolase activity, acting on glycosyl bonds (GO:0016798), lyase activity (GO:0016829), iron-sulfur cluster binding (GO:0051536)

GO Cellular Component (3): nucleus (GO:0005634), nucleoplasm (GO:0005654), mitochondrion (GO:0005739)

Reactome top-level categories

Rollup of top-3 pathways:

CategoryPathways
Depyrimidination2
Defective Base Excision Repair Associated with NTHL12
Resolution of Abasic Sites (AP sites)1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
DNA metabolic process2
DNA repair2
oxidized base lesion DNA N-glycosylase activity2
DNA binding2
catalytic activity2
intracellular membrane-bounded organelle2
base-excision repair1
base-excision repair, AP site formation1
DNA modification1
pyrimidine deoxyribonucleotide catabolic process1
DNA damage response1
response to stimulus1
cellular response to stress1
DNA endonuclease activity1
nuclease activity1
hydrolase activity, hydrolyzing N-glycosyl compounds1
catalytic activity, acting on DNA1
cation binding1
iron-sulfur cluster binding1
DNA-(apurinic or apyrimidinic site) endonuclease activity1
carbon-oxygen lyase activity1
nucleic acid binding1
molecular_function1
binding1
hydrolase activity1
metal cluster binding1
nuclear lumen1
cellular anatomical structure1
cytoplasm1

Protein interactions and networks

STRING

1904 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
NTHL1OGG1P78554985
NTHL1NEIL1Q96FI4984
NTHL1NEIL2Q969S2982
NTHL1XRCC1P18887976
NTHL1NEIL3Q8TAT5906
NTHL1YBX1P16990895
NTHL1MUTYHQ9UIF7862
NTHL1APEX1P27695846
NTHL1UNGP13051796
NTHL1TDGQ13569775
NTHL1MPGP29372768
NTHL1POLD1P28340767
NTHL1POLBP06746765
NTHL1FEN1P39748744
NTHL1POLEQ07864744

IntAct

32 interactions, top by confidence:

ABTypeScore
MINDY3UBBpsi-mi:“MI:0914”(association)0.530
RPL27NTHL1psi-mi:“MI:0915”(physical association)0.400
POU4F1NTHL1psi-mi:“MI:0915”(physical association)0.400
Rpl35RPS6psi-mi:“MI:0914”(association)0.350
RPL10RPS6psi-mi:“MI:0914”(association)0.350
Gspt1MRPL27psi-mi:“MI:0914”(association)0.350
CAMKK1psi-mi:“MI:0914”(association)0.350
ESR1ESYT2psi-mi:“MI:0914”(association)0.350
ENGIGKV2-28psi-mi:“MI:0914”(association)0.350
HTRA4PSMD12psi-mi:“MI:0914”(association)0.350
H2BC21SMCHD1psi-mi:“MI:0914”(association)0.350
HMGN5SMCHD1psi-mi:“MI:0914”(association)0.350
NUMA1SHANK3psi-mi:“MI:0914”(association)0.350
ANOS1ZNF724psi-mi:“MI:0914”(association)0.350
PTDSS1IGLL5psi-mi:“MI:0914”(association)0.350
KCNE3PIK3R2psi-mi:“MI:0914”(association)0.350
CYB561D2TNFRSF10Bpsi-mi:“MI:0914”(association)0.350
EMX2LRP4psi-mi:“MI:0914”(association)0.350
IL31RADUSP14psi-mi:“MI:0914”(association)0.350
RINLSERPINB8psi-mi:“MI:0914”(association)0.350
SLC16A8NTHL1psi-mi:“MI:0914”(association)0.350
ATF3C11orf98psi-mi:“MI:0914”(association)0.350
CASP3C11orf98psi-mi:“MI:0914”(association)0.350
NTHL1NUDT1psi-mi:“MI:0915”(physical association)0.000
NTHL1psi-mi:“MI:0915”(physical association)0.000
NTHL1RBL1psi-mi:“MI:0915”(physical association)0.000
NTHL1GAPDHSpsi-mi:“MI:0915”(physical association)0.000
NTHL1MYH7psi-mi:“MI:0915”(physical association)0.000

BioGRID (58): NTHL1 (Affinity Capture-RNA), NTHL1 (Affinity Capture-RNA), NTHL1 (Biochemical Activity), NTHL1 (Affinity Capture-MS), NTHL1 (Affinity Capture-MS), NTHL1 (Affinity Capture-MS), NTHL1 (Affinity Capture-MS), NTHL1 (Affinity Capture-MS), NTHL1 (Affinity Capture-MS), MYH7 (Affinity Capture-MS), NUDT1 (Affinity Capture-MS), MYH4 (Affinity Capture-MS), GAPDHS (Affinity Capture-MS), RNMTL1 (Affinity Capture-MS), ERCC5 (Reconstituted Complex)

ESM2 similar proteins: A0A1D6LAG9, A2RTX5, A6QLY4, A6QNM8, A7M7B9, A9S0B8, A9VB27, B9DFZ0, C1BM18, E2RDZ6, F4JCQ3, G3X8U3, O35980, O70157, O75879, P13051, P36776, P54137, P78549, P97931, Q09907, Q0V9S0, Q13472, Q2KID2, Q4KM92, Q4R380, Q59HJ6, Q5ZKD7, Q640B4, Q7SXA9, Q80UN9, Q80VY9, Q8BLY2, Q8BYM8, Q8CGK3, Q8K582, Q8R5G2, Q8SRB8, Q924S5, Q99JT1

Diamond homologs: A7M7B9, B9DFZ0, C8VC05, O35980, P31378, P39788, P54137, P78549, Q08214, Q09907, Q2KID2, Q4UK93, Q58030, Q8SRB8, Q9SIC4, Q9WYK0, P0AB83, P0AB84, P46303, P57219, P73715, Q89AW4, Q92GH4, O05956, Q05869, Q68W04, P63541, P9WQ10, P9WQ11, O83754, P29588, P44319, Q4J8R2, Q8KA16, Q9CB92, A0R567, O31584, P83847, Q10159, P9WQ08

SIGNOR signaling

3 interactions.

AEffectBMechanism
TRIM26“down-regulates quantity by destabilization”NTHL1ubiquitination
CDK1“up-regulates activity”NTHL1phosphorylation
NTHL1up-regulatesBase-excision_repair

Disease & clinical

Clinical variants and AI predictions

ClinVar

1808 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic114
Likely pathogenic33
Uncertain significance950
Likely benign532
Benign6

Top pathogenic / likely-pathogenic (30)

Variant IDHGVSClassification
1068685NM_002528.7(NTHL1):c.510del (p.Val171fs)Pathogenic
1070434NM_002528.7(NTHL1):c.265dup (p.Val89fs)Pathogenic
1073183NC_000016.9:g.(?2089925)(2114438_?)delPathogenic
1074525NM_002528.7(NTHL1):c.307dup (p.Asp103fs)Pathogenic
1074540NM_002528.7(NTHL1):c.84del (p.Pro30fs)Pathogenic
1075052NM_002528.7(NTHL1):c.160_161del (p.Gln54fs)Pathogenic
1075792NM_002528.7(NTHL1):c.161dup (p.Arg55fs)Pathogenic
1076395NM_002528.7(NTHL1):c.736A>T (p.Lys246Ter)Pathogenic
1415037NM_002528.7(NTHL1):c.418dup (p.Ala140fs)Pathogenic
1425021NC_000016.10:g.2047835G>APathogenic
1435962NM_002528.7(NTHL1):c.449dup (p.Leu151fs)Pathogenic
1441860NM_002528.7(NTHL1):c.237G>A (p.Trp79Ter)Pathogenic
1452364NM_002528.7(NTHL1):c.568C>T (p.Gln190Ter)Pathogenic
1454268NC_000016.9:g.(?2089925)(2104451_?)delPathogenic
1457084NM_002528.7(NTHL1):c.726del (p.Arg242fs)Pathogenic
1457235NM_002528.7(NTHL1):c.172_175del (p.Val58fs)Pathogenic
1484387NM_002528.7(NTHL1):c.675_676del (p.Ser226fs)Pathogenic
1740398NM_002528.7(NTHL1):c.417_436dup (p.Leu146fs)Pathogenic
1741266NM_002528.7(NTHL1):c.428dup (p.Met143fs)Pathogenic
1754295NM_002528.7(NTHL1):c.633dup (p.Lys212fs)Pathogenic
1776794NM_002528.7(NTHL1):c.138del (p.Pro46_Val47insTer)Pathogenic
2020093NM_002528.7(NTHL1):c.664dup (p.Trp222fs)Pathogenic
2020379NM_002528.7(NTHL1):c.666G>A (p.Trp222Ter)Pathogenic
2020818NM_002528.7(NTHL1):c.356del (p.Val119fs)Pathogenic
2025416NM_002528.7(NTHL1):c.226del (p.Val76fs)Pathogenic
2084356NM_002528.7(NTHL1):c.365_371del (p.Tyr122fs)Pathogenic
2120066NM_002528.7(NTHL1):c.252G>A (p.Trp84Ter)Pathogenic
2135297NM_002528.7(NTHL1):c.469del (p.Leu157fs)Pathogenic
2423284NC_000016.9:g.(?2093548)(2113074_?)delPathogenic
2423285NC_000016.9:g.(?2089925)(2105540_?)delPathogenic

SpliceAI

1587 predictions. Top by Δscore:

VariantEffectΔscore
16:2043559:CTACT:Cdonor_loss1.0000
16:2043561:ACT:Adonor_loss1.0000
16:2043562:CTC:Cdonor_loss1.0000
16:2043563:TCACC:Tdonor_loss1.0000
16:2043564:CAC:Cdonor_loss1.0000
16:2043565:A:ACdonor_gain1.0000
16:2043565:ACCA:Adonor_loss1.0000
16:2043566:C:Adonor_loss1.0000
16:2043566:C:CCdonor_gain1.0000
16:2043722:TTGCT:Tacceptor_gain1.0000
16:2043723:TGCT:Tacceptor_gain1.0000
16:2043725:CT:Cacceptor_gain1.0000
16:2043726:TC:Tacceptor_loss1.0000
16:2043727:C:CCacceptor_gain1.0000
16:2043728:T:Cacceptor_loss1.0000
16:2044624:GCTCA:Gdonor_loss1.0000
16:2044625:CTCA:Cdonor_loss1.0000
16:2044626:TCA:Tdonor_loss1.0000
16:2044628:AC:Adonor_gain1.0000
16:2044628:ACCCT:Adonor_gain1.0000
16:2044629:CC:Cdonor_gain1.0000
16:2044629:CCCT:Cdonor_gain1.0000
16:2044629:CCCTC:Cdonor_gain1.0000
16:2044632:T:Adonor_gain1.0000
16:2046362:CGCTT:Cacceptor_gain1.0000
16:2046364:CTT:Cacceptor_gain1.0000
16:2046367:C:CCacceptor_gain1.0000
16:2040045:TCCC:Tacceptor_loss0.9900
16:2040048:C:CAacceptor_loss0.9900
16:2040239:C:CCacceptor_gain0.9900

AlphaMissense

1959 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
16:2043721:C:AK185N0.990
16:2043721:C:GK185N0.990
16:2044636:G:CF181L0.989
16:2044636:G:TF181L0.989
16:2044638:A:GF181L0.989
16:2040197:A:GW251R0.987
16:2040197:A:TW251R0.987
16:2040232:T:AD239V0.985
16:2040231:G:CD239E0.984
16:2040231:G:TD239E0.984
16:2043616:C:AK220N0.982
16:2043616:C:GK220N0.982
16:2043697:G:CS193R0.981
16:2043697:G:TS193R0.981
16:2043699:T:GS193R0.981
16:2046197:C:AR103S0.976
16:2046197:C:GR103S0.976
16:2040225:A:CH241Q0.972
16:2040225:A:TH241Q0.972
16:2040216:T:AR244S0.970
16:2040216:T:GR244S0.970
16:2040227:G:CH241D0.970
16:2043626:A:TV217D0.970
16:2044766:A:TL138H0.969
16:2040195:C:AW251C0.968
16:2040195:C:GW251C0.968
16:2043723:T:CK185E0.968
16:2044759:G:CS140R0.968
16:2044759:G:TS140R0.968
16:2044761:T:GS140R0.968

dbSNP variants (sampled 300 via entrez): RS1000035334 (16:2042100 G>A), RS1000122133 (16:2049036 T>C), RS1000139988 (16:2046913 A>G), RS1000524143 (16:2045925 T>A), RS1000576492 (16:2046148 C>A,G,T), RS1000703810 (16:2041668 C>T), RS1000830374 (16:2045395 C>G,T), RS1001117510 (16:2041416 C>T), RS1001385865 (16:2045415 T>G), RS1001704151 (16:2043289 C>T), RS1002333412 (16:2042154 G>A), RS1002532120 (16:2040538 G>C,T), RS1002990126 (16:2040726 G>A), RS1003268934 (16:2043209 C>T), RS1003422958 (16:2047284 C>T)

Disease associations

OMIM: gene MIM:602656 | disease phenotypes: MIM:616415, MIM:613254, MIM:114500, MIM:175100, MIM:606690, MIM:607341, MIM:173900

GenCC curated gene-disease

DiseaseClassificationInheritance
familial adenomatous polyposis 3DefinitiveAutosomal recessive
NTHL1-deficiency tumor predisposition syndromeDefinitiveAutosomal recessive
breast cancerLimitedAutosomal recessive
meningiomaLimitedAutosomal recessive

ClinGen Gene-Disease Validity (1)

Expert-panel classifications — Definitive > Strong > Moderate > Limited > Disputed > Refuted.

DiseaseClassificationInheritance
NTHL1-deficiency tumor predisposition syndromeDefinitiveAR

Mondo (11): hereditary neoplastic syndrome (MONDO:0015356), familial adenomatous polyposis 3 (MONDO:0014630), tuberous sclerosis 2 (MONDO:0013199), colorectal cancer (MONDO:0005575), familial adenomatous polyposis 1 (MONDO:0021056), NTHL1-deficiency tumor predisposition syndrome (MONDO:0100502), lymphangioleiomyomatosis (MONDO:0011705), isolated focal cortical dysplasia type II (MONDO:0011818), polycystic kidney disease 1 (MONDO:0008263), breast cancer (MONDO:0007254), meningioma (MONDO:0016642)

Orphanet (7): Inherited cancer-predisposing syndrome (Orphanet:140162), Attenuated familial adenomatous polyposis (Orphanet:220460), NTHL1-related polyposis (Orphanet:454840), Tuberous sclerosis complex (Orphanet:805), Isolated focal cortical dysplasia type II (Orphanet:268994), Lymphangioleiomyomatosis (Orphanet:538), NON RARE IN EUROPE: Colorectal cancer (Orphanet:466667)

HPO phenotypes

18 total (18 of 18 shown, HPO-id order):

HPOTerm
HP:0000007Autosomal recessive inheritance
HP:0000138Ovarian cyst
HP:0002671Basal cell carcinoma
HP:0002858Meningioma
HP:0002860Squamous cell carcinoma
HP:0003002Breast carcinoma
HP:0003003Colon cancer
HP:0003581Adult onset
HP:0005227Adenomatous colonic polyposis
HP:0006725Pancreatic adenocarcinoma
HP:0006771Duodenal adenocarcinoma
HP:0008069Neoplasm of the skin
HP:0009725Bladder neoplasm
HP:0012114Endometrial carcinoma
HP:0012125Prostate cancer
HP:0012539Non-Hodgkin lymphoma
HP:0031287Seborrheic keratosis
HP:0100743Neoplasm of the rectum

GWAS associations

2 associations (top):

StudyTraitp-value
GCST000785_36Longevity1.000000e-06
GCST010988_30Adult body size2.000000e-09

MeSH disease descriptors (6)

DescriptorNameTree numbers
D018192LymphangioleiomyomatosisC04.557.375.460.465; C04.557.450.692.500; C15.604.515.562.465; C20.683.515.710.465
D008579MeningiomaC04.557.580.520; C04.557.645.520; C04.588.614.250.580.500; C10.551.240.500.500
D009386Neoplastic Syndromes, HereditaryC04.700; C16.320.700
C537067Focal cortical dysplasia of Taylor (supp.)
C536326Polycystic kidney disease, type 1 (supp.)
C566021Tuberous Sclerosis 2 (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL4523264 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

62 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Aaffects cotreatment, increases methylation, decreases expression, decreases methylation, increases expression (+1 more)4
sodium arseniteaffects cotreatment, increases expression, decreases expression4
Tretinoindecreases expression3
cobaltous chloridedecreases expression2
Acetaminophendecreases expression2
Air Pollutantsaffects cotreatment, increases abundance, increases oxidation, decreases expression2
Benzo(a)pyreneaffects methylation2
Cisplatinaffects cotreatment, increases expression, affects expression, affects reaction2
Methyl Methanesulfonatedecreases expression2
Cyclosporinedecreases expression2
FR900359decreases phosphorylation1
bisphenol Faffects cotreatment, increases expression1
TAK-243increases sumoylation1
pradimicin-IRDdecreases expression1
alpha-pineneaffects cotreatment, increases oxidation, increases abundance1
lead acetateaffects cotreatment, increases expression1
mono-(2-ethylhexyl)phthalateincreases expression1
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
chromous chlorideaffects cotreatment, increases expression1
chromic oxideaffects cotreatment, increases expression1
benzo(e)pyrenedecreases methylation1
coumarinincreases phosphorylation1
methacrylaldehydeincreases oxidation, increases abundance, affects cotreatment1
beta-methylcholineaffects expression1
di-n-butylphosphoric acidaffects expression1
CPG-oligonucleotidedecreases expression1
K 7174decreases expression1
jinfukangaffects cotreatment, increases expression1
riccardin Ddecreases expression1
NSC 689534affects binding, decreases expression1

ChEMBL screening assays

8 unique, capped per target: 7 binding, 1 functional

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL4422183BindingInhibition of NTH1 (unknown origin) assessed as remaining enzyme activity at 50 uM incubated for 40 mins by TAMRA fluorescence-based assaySmall molecule inhibitors of 8-oxoguanine dna glycosylase-1 (ogg1)
CHEMBL5665449FunctionalInhibition of NTHL1 by quantifying inhibition of NTHL1-mediated cleavage and dissociation of quenched duplex DNA oligonucleotides, measured as fluorescence at 594 nm.Enzyme Inhibitor Single Concentration assay results for EUbOPEN Chemogenomics Library

Cellosaurus cell lines

8 cell lines: 8 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_DX48HAP1 NTHL1 (-) XPA (-) 1Cancer cell lineMale
CVCL_DX49HAP1 NTHL1 (-) XPA (-) 2Cancer cell lineMale
CVCL_DX50HAP1 NTHL1 (-) XPA (-) 3Cancer cell lineMale
CVCL_TB25HAP1 NTHL1 (-) 1Cancer cell lineMale
CVCL_TB26HAP1 NTHL1 (-) 2Cancer cell lineMale
CVCL_TB27HAP1 NTHL1 (-) 3Cancer cell lineMale
CVCL_TB28HAP1 NTHL1 (-) 4Cancer cell lineMale
CVCL_TB29HAP1 NTHL1 (-) 5Cancer cell lineMale

Clinical trials (associated diseases)

595 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00014638PHASE4COMPLETEDLetrozole in Treating Postmenopausal Women With Metastatic Breast Cancer
NCT00022386PHASE4COMPLETEDEpoetin Alfa in Treating Chemotherapy-Related Anemia in Women With Stage I, Stage II, or Stage III Breast Cancer
NCT00029224PHASE4COMPLETEDTreatment With Zoledronic Acid in Patients With Breast Cancer, Multiple Myeloma, and Prostate Cancer With Cancer Related Bone Lesions
NCT00030758PHASE4UNKNOWNFilgrastim or Pegfilgrastim in Preventing Neutropenia in Women Receiving Chemotherapy Following Surgery for Breast Cancer
NCT00082277PHASE4COMPLETEDAnastrozole Biphosphonate Study in Postmenopausal Women With Hormone-Receptor-Positive Early Breast Cancer
NCT00087620PHASE4TERMINATEDA Study of Capecitabine In Combination With Docetaxel vs Capecitabine Followed by Docetaxel As First-Line Treatment For Metastatic Breast Cancer
NCT00121836PHASE4COMPLETEDA Study of Xeloda (Capecitabine) in Women With HER2-Negative Metastatic Breast Cancer
NCT00126360PHASE4UNKNOWNSTARS Breast Trial (Study of Anastrozole and Radiotherapy Sequencing Pilot)
NCT00127933PHASE4COMPLETEDXeNA Study - A Study of Xeloda (Capecitabine) in Patients With Invasive Breast Cancer
NCT00128297PHASE4COMPLETEDPamidronate Administration in Breast Cancer Patients With Bone Metastases
NCT00129597PHASE4UNKNOWNEffect of Ketalar to Prevent Postoperative Chronic Pain After Mastectomy
NCT00131170PHASE4COMPLETEDParavertebral Block for Breast Surgery
NCT00156039PHASE4COMPLETEDRandomized Trial of Follow-up Strategies in Breast Cancer
NCT00160901PHASE4COMPLETEDComplementary Therapies for the Reduction of Side Effects During Chemotherapy for Breast Cancer
NCT00171847PHASE4TERMINATEDStudy of the Efficacy and Safety of Letrozole Combined With Trastuzumab in Patients With Metastatic Breast Cancer
NCT00176046PHASE4COMPLETEDMistletoe Extract in Early or Advanced Breast Cancer, A Feasibility Study
NCT00190697PHASE4COMPLETEDA Study of LY353381 (Arzoxifene) for Patients Who Benefitted From This Drug in Other Oncology Trials and Wished to Continue Treatment
NCT00234195PHASE4COMPLETEDWellbutrin XL, Major Depressive Disorder and Breast Cancer
NCT00237133PHASE4COMPLETEDTreatment of Locally Advanced Breast Cancer With Letrozole in Postmenopausal Women
NCT00237224PHASE4COMPLETEDOpen Label Study of Postmenopausal Women With ER and /or PgR Positive Breast Cancer Treated With Letrozole
NCT00241046PHASE4TERMINATEDLetrozole in the Treatment of 1st and 2nd Line Hormone Receptor Positive Breast Cancer: Pre-therapeutic Risk Assessment
NCT00277160PHASE4COMPLETEDA Study of Primary Prophylaxis With Neulasta (Pegfilgrastim) Versus Secondary Prophylaxis After Chemotherapy in Elderly Subjects (>/= 65 Years Old) With Cancer
NCT00323479PHASE4COMPLETEDArthralgia During Anastrozole Therapy for Breast Cancer
NCT00334139PHASE4COMPLETEDEffect of Zoledronic Acid on Bone Metabolism in Patients With Bone Metastasis and Prostate or Breast Cancer
NCT00356148PHASE4COMPLETEDThe Efficacy of Prophylactic Antibiotic Administration During Breast Cancer Surgery in Overweight Patients.
NCT00372476PHASE4COMPLETEDEfficacy and Safety of Imatinib and Vinorelbine in Patients With Advanced Breast Cancer
NCT00413491PHASE4UNKNOWNNational Screening in Denmark With MR Versus Mammography and Ultrasound of Women With BRCA1 or BRCA2 Mutations
NCT00484614PHASE4UNKNOWNStudy the Role of Positron Emission Mammography in Pre-surgical Planning for Breast Cancer
NCT00485953PHASE4COMPLETEDEffect of Bisphosphonate on Bone Loss in Postmenopausal Women With Breast Cancer Initiating Aromatase Inhibitor Therapy
NCT00496678PHASE4COMPLETEDTrial of Patient Navigation-Activation
NCT00531973PHASE4UNKNOWNA Study of Liposomal Doxorubicin in Women With Breast Cancer Exploiting Tissue Doppler Imaging
NCT00537771PHASE4COMPLETEDLiver Safety Under Upfront Arimidex vs Tamoxifen
NCT00544986PHASE4COMPLETEDA Prospective,Open-label Study of Anastrozole in Post-menopausal Women With Hormone Sensitive Advanced Breast Cancer
NCT00613275PHASE4COMPLETEDPatient Navigation in the Safety Net:CONNECTeDD
NCT00638599PHASE4COMPLETEDComparison of Laryngeal Mask Airway (LMA®) and Tracheal Tube in Modified Radical Mastectomy on Breast Cancer
NCT00647075PHASE4UNKNOWNYunzhi as Dietary Supplement in Breast Cancer
NCT00688909PHASE4COMPLETEDRheumatological Evaluation of Anastrozole and Letrozole as Adjuvant Treatment in Post-menopausal Women With Breast Cancer
NCT00699101PHASE4TERMINATEDUsing the Conture® Multi-Lumen Balloon to Deliver Accelerated Partial Breast Brachytherapy
NCT00742222PHASE4COMPLETEDElectronic Xoft Intersociety Brachytherapy Trial: Electronic Brachytherapy (EBT) For Treatment of Early Stage Breast Cancer
NCT00754767PHASE4TERMINATEDL-Carnitine L-Tartrate in Preventing Peripheral Neuropathy Caused By Chemotherapy in Women With Metastatic Breast Cancer