NT5DC1

gene
On this page

Also known as dJ486I3.1MGC24302

Summary

NT5DC1 (5’-nucleotidase domain containing 1, HGNC:21556) is a protein-coding gene on chromosome 6q22.1, encoding 5’-nucleotidase domain-containing protein 1 (Q5TFE4).

While the exact function of the protein encoded by this gene is not known, it belongs to the 5’(3’)-deoxyribonucleotidase family.

Source: NCBI Gene 221294 — RefSeq curated summary.

At a glance

  • GWAS associations: 15
  • Clinical variants (ClinVar): 119 total — 4 pathogenic
  • Druggable target: yes
  • MANE Select transcript: NM_152729

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:21556
Approved symbolNT5DC1
Name5’-nucleotidase domain containing 1
Location6q22.1
Locus typegene with protein product
StatusApproved
AliasesdJ486I3.1, MGC24302
Ensembl geneENSG00000178425
Ensembl biotypeprotein_coding
OMIM621076
Entrez221294

Gene structure

Transcript identifiers

Ensembl transcripts: 8 — 6 protein_coding, 1 retained_intron, 1 nonsense_mediated_decay

ENST00000319550, ENST00000417846, ENST00000419791, ENST00000460749, ENST00000880963, ENST00000880964, ENST00000880965, ENST00000959019

RefSeq mRNA: 1 — MANE Select: NM_152729 NM_152729

CCDS: CCDS5104

Canonical transcript exons

ENST00000319550 — 12 exons

ExonStartEnd
ENSE00001240265116243909116249497
ENSE00001243243116221054116221228
ENSE00001873818116100853116101023
ENSE00002179152116223034116223131
ENSE00002201677116236966116237084
ENSE00002210590116110850116110956
ENSE00002257752116108364116108435
ENSE00002312554116117861116117945
ENSE00002314863116106244116106335
ENSE00003467936116238955116239123
ENSE00003548480116238187116238348
ENSE00003756646116115691116115770

Expression profiles

Bgee: expression breadth ubiquitous, 255 present calls, max score 95.42.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 24.7482 / max 217.0966, expressed in 1809 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
6939524.74821809

Top tissues by expression

256 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
corpus callosumUBERON:000233695.42gold quality
kidney epitheliumUBERON:000481995.31gold quality
inferior vagus X ganglionUBERON:000536393.82gold quality
C1 segment of cervical spinal cordUBERON:000646993.54gold quality
epithelium of nasopharynxUBERON:000195193.49gold quality
spinal cordUBERON:000224093.16gold quality
left ventricle myocardiumUBERON:000656693.15gold quality
right adrenal glandUBERON:000123392.69gold quality
right adrenal gland cortexUBERON:003582792.30gold quality
left adrenal glandUBERON:000123492.22gold quality
heart right ventricleUBERON:000208091.90gold quality
descending thoracic aortaUBERON:000234591.85gold quality
buccal mucosa cellCL:000233691.84gold quality
right coronary arteryUBERON:000162591.70gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099191.69gold quality
adrenal glandUBERON:000236991.62gold quality
subthalamic nucleusUBERON:000190691.53gold quality
left adrenal gland cortexUBERON:003582591.40gold quality
adrenal tissueUBERON:001830391.33gold quality
bronchial epithelial cellCL:000232891.25gold quality
liverUBERON:000210791.25gold quality
adrenal cortexUBERON:000123591.19gold quality
palpebral conjunctivaUBERON:000181291.12gold quality
right lobe of liverUBERON:000111491.08gold quality
popliteal arteryUBERON:000225091.06gold quality
tibial arteryUBERON:000761091.05gold quality
bronchusUBERON:000218590.90gold quality
ileal mucosaUBERON:000033190.89gold quality
deltoidUBERON:000147690.84gold quality
islet of LangerhansUBERON:000000690.81gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

106 targeting NT5DC1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-511-3P99.9968.851467
HSA-MIR-371B-5P99.9975.344759
HSA-MIR-806899.9873.852376
HSA-MIR-373-5P99.9875.364753
HSA-MIR-616-5P99.9875.584775
HSA-MIR-569699.9872.364487
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-MIR-302C-5P99.9772.563642
HSA-MIR-314899.9775.066478
HSA-MIR-590-3P99.9674.346478
HSA-MIR-144-3P99.9473.982698
HSA-MIR-9983-3P99.9471.483631
HSA-MIR-314399.9371.963104
HSA-MIR-6835-3P99.9370.492904
HSA-MIR-129799.9173.413162
HSA-MIR-7-1-3P99.9171.534384
HSA-MIR-7-2-3P99.9171.404394
HSA-MIR-374A-5P99.9071.342923
HSA-MIR-374B-5P99.9069.982734
HSA-MIR-95-5P99.8972.173973
HSA-MIR-153-5P99.8973.866317
HSA-MIR-129-5P99.8870.263273
HSA-MIR-391999.8769.452489
HSA-MIR-5582-3P99.8672.484221
HSA-MIR-132399.8369.892471
HSA-MIR-6875-3P99.8270.262983
HSA-MIR-4668-5P99.7970.583782
HSA-MIR-26A-5P99.7873.522303
HSA-MIR-26B-5P99.7873.512305

Literature-anchored findings (GeneRIF, showing 2)

  • The rs1052443 C allele of NT5DC1 was associated with a protective effect against the deterioration of pulmonary function. (PMID:22736055)
  • Weighted gene co-expression network analysis identifies specific modules and hub genes related to subsyndromal symptomatic depression. (PMID:30489189)

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriont5dc1ENSDARG00000006797
mus_musculusNt5dc1ENSMUSG00000039480
rattus_norvegicusNt5dc1ENSRNOG00000000546
drosophila_melanogasterNt5aFBGN0035204

Paralogs (4): NT5C2 (ENSG00000076685), NT5DC3 (ENSG00000111696), NT5DC4 (ENSG00000144130), NT5DC2 (ENSG00000168268)

Protein

Protein identifiers

5’-nucleotidase domain-containing protein 1Q5TFE4 (reviewed: Q5TFE4)

All UniProt accessions (3): Q5TFE4, H0YDA5, Q5QPD0

UniProt curated annotations — full annotation on UniProt →

Similarity. Belongs to the 5’(3’)-deoxyribonucleotidase family.

Isoforms (2)

UniProt IDNamesCanonical?
Q5TFE4-11yes
Q5TFE4-22

RefSeq proteins (1): NP_689942* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR008380HAD-SF_hydro_IG_5-nuclFamily
IPR023214HAD_sfHomologous_superfamily
IPR036412HAD-like_sfHomologous_superfamily

Pfam: PF05761

UniProt features (10 total): binding site 3, active site 2, chain 1, region of interest 1, compositionally biased region 1, modified residue 1, splice variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q5TFE4-F188.760.78

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (2): 16 (nucleophile); 18 (proton donor)

Ligand- & substrate-binding residues (3): 16; 18; 313

Post-translational modifications (1): 171

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 91 (showing top): GSE18804_SPLEEN_MACROPHAGE_VS_TUMORAL_MACROPHAGE_DN, MOTAMED_RESPONSE_TO_ANDROGEN_DN, HEIDENBLAD_AMPLICON_8Q24_DN, CHANDRAN_METASTASIS_DN, BASAKI_YBX1_TARGETS_DN, VECCHI_GASTRIC_CANCER_EARLY_DN, RIGGI_EWING_SARCOMA_PROGENITOR_UP, CHANDRAN_METASTASIS_TOP50_DN, GOMF_HYDROLASE_ACTIVITY_ACTING_ON_ESTER_BONDS, GOMF_NUCLEOTIDASE_ACTIVITY, GOMF_PHOSPHORIC_ESTER_HYDROLASE_ACTIVITY, chr6q22, SENGUPTA_NASOPHARYNGEAL_CARCINOMA_WITH_LMP1_UP, WHITFIELD_CELL_CYCLE_S, JOHNSTONE_PARVB_TARGETS_3_UP

GO Biological Process (0):

GO Molecular Function (4): 5’-nucleotidase activity (GO:0008253), metal ion binding (GO:0046872), protein binding (GO:0005515), hydrolase activity (GO:0016787)

GO Cellular Component (0):

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
nucleotidase activity1
cation binding1
binding1
catalytic activity1

Protein interactions and networks

STRING

488 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
NT5DC1CALHM4Q5JW98541
NT5DC1TSPYL4Q9UJ04507
NT5DC1CALHM5Q8N5C1500
NT5DC1OR6C65A6NJZ3480
NT5DC1CCDC152Q4G0S7472
NT5DC1SEC23IPQ9Y6Y8451
NT5DC1CTXN2P0C2S0444
NT5DC1MTMR11A4FU01422
NT5DC1NT5DC4Q86YG4412
NT5DC1OR4D1Q15615398
NT5DC1CALHM6Q5R3K3395
NT5DC1ARL6IP1Q15041391
NT5DC1TSPYL1Q9H0U9391
NT5DC1B4GALT4O60513389
NT5DC1NUDT8Q8WV74387

IntAct

16 interactions, top by confidence:

ABTypeScore
NT5DC1HSP90AA4Ppsi-mi:“MI:0915”(physical association)0.560
LRRK2psi-mi:“MI:0914”(association)0.350
NT5DC1GFAPpsi-mi:“MI:0914”(association)0.350
NT5DC1ACACBpsi-mi:“MI:0914”(association)0.350
DDX28UBA6psi-mi:“MI:0914”(association)0.350
DND1UBA6psi-mi:“MI:0914”(association)0.350
MRPL49UBA6psi-mi:“MI:0914”(association)0.350
OSBPL11DNM1Lpsi-mi:“MI:0914”(association)0.350
PEX7UBA6psi-mi:“MI:0914”(association)0.350
repARPC1Bpsi-mi:“MI:2364”(proximity)0.270
CFTRUBA6psi-mi:“MI:2364”(proximity)0.270
NT5DC1bfmbAbpsi-mi:“MI:0915”(physical association)0.000

BioGRID (58): HSP90AA4P (Affinity Capture-MS), GFAP (Affinity Capture-MS), PKLR (Affinity Capture-MS), HBD (Affinity Capture-MS), PKLR (Affinity Capture-MS), GFAP (Affinity Capture-MS), HSP90AA4P (Affinity Capture-MS), NT5DC1 (Affinity Capture-MS), NT5DC1 (Proximity Label-MS), NT5DC1 (Affinity Capture-MS), NT5DC1 (Proximity Label-MS), NT5DC1 (Reconstituted Complex), HSP90AA4P (Affinity Capture-MS), TRABD (Affinity Capture-MS), ACACB (Affinity Capture-MS)

ESM2 similar proteins: A1KXW8, A6QL50, E1BGQ2, H0Y354, O94955, P47224, Q08326, Q0IIH8, Q1JQA1, Q1RMS8, Q1RMZ1, Q2TBU5, Q3T1H6, Q4R372, Q4R528, Q4R9C4, Q5F480, Q5F4A1, Q5I0G3, Q5RCQ0, Q5RFG8, Q5TFE4, Q5TYM5, Q641X7, Q6L9T8, Q6PIP5, Q7L622, Q7Z6J8, Q7ZX59, Q86X60, Q8BFZ8, Q8BKW4, Q8BM85, Q8BX13, Q8CEL2, Q8N5C7, Q8N635, Q8NHU2, Q8TCF1, Q8TCJ0

Diamond homologs: A4IHT9, Q3UHB1, Q54XC1, Q5TFE4, Q6GN91, Q6Q0N3, Q75K12, Q86UY8, Q86YG4, Q9H857, Q2TBU5, Q8C5P5

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

119 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic4
Likely pathogenic0
Uncertain significance89
Likely benign4
Benign2

Top pathogenic / likely-pathogenic (4)

Variant IDHGVSClassification
17465NM_000493.4(COL10A1):c.1792T>G (p.Tyr598Asp)Pathogenic
17471NM_000493.4(COL10A1):c.1858_1859del (p.Pro620fs)Pathogenic
17478NM_000493.4(COL10A1):c.1896C>A (p.Tyr632Ter)Pathogenic
17479NM_000493.4(COL10A1):c.1784G>A (p.Gly595Glu)Pathogenic

SpliceAI

3035 predictions. Top by Δscore:

VariantEffectΔscore
6:116101020:CCCGG:Cdonor_loss1.0000
6:116101021:CCGGT:Cdonor_loss1.0000
6:116101022:CGGT:Cdonor_loss1.0000
6:116101024:G:GCdonor_loss1.0000
6:116101024:G:GGdonor_gain1.0000
6:116101025:T:Gdonor_loss1.0000
6:116106236:A:AGacceptor_gain1.0000
6:116106237:T:Gacceptor_gain1.0000
6:116106240:GCA:Gacceptor_loss1.0000
6:116106241:CAGCT:Cacceptor_loss1.0000
6:116106242:A:AGacceptor_gain1.0000
6:116106243:G:GAacceptor_gain1.0000
6:116106243:GCTC:Gacceptor_gain1.0000
6:116106243:GCTCA:Gacceptor_gain1.0000
6:116106333:CTGGT:Cdonor_loss1.0000
6:116106334:TGG:Tdonor_loss1.0000
6:116106335:GGTA:Gdonor_loss1.0000
6:116106336:G:GAdonor_loss1.0000
6:116106336:G:GGdonor_gain1.0000
6:116106337:TAAGT:Tdonor_loss1.0000
6:116108359:TTCA:Tacceptor_loss1.0000
6:116108360:TCA:Tacceptor_loss1.0000
6:116108361:CAGTT:Cacceptor_loss1.0000
6:116108362:A:AGacceptor_gain1.0000
6:116108362:AG:Aacceptor_loss1.0000
6:116108363:G:GTacceptor_gain1.0000
6:116108363:GTT:Gacceptor_gain1.0000
6:116115685:TTTTA:Tacceptor_loss1.0000
6:116115686:TTTA:Tacceptor_loss1.0000
6:116115687:TTA:Tacceptor_loss1.0000

AlphaMissense

3002 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
6:116223082:G:CK251N0.999
6:116223082:G:TK251N0.999
6:116100977:A:TD16V0.998
6:116100980:T:CL17P0.998
6:116100983:A:TD18V0.998
6:116221180:C:TT219I0.998
6:116237025:G:TG288W0.998
6:116238200:G:TG312V0.998
6:116238203:A:CD313A0.998
6:116238203:A:TD313V0.998
6:116100971:G:AG14E0.997
6:116100978:C:AD16E0.997
6:116100978:C:GD16E0.997
6:116100984:C:AD18E0.997
6:116100984:C:GD18E0.997
6:116115717:T:CF131L0.997
6:116115719:T:AF131L0.997
6:116115719:T:GF131L0.997
6:116221071:T:CF183L0.997
6:116221073:T:AF183L0.997
6:116221073:T:GF183L0.997
6:116221182:A:CS220R0.997
6:116221184:T:AS220R0.997
6:116221184:T:GS220R0.997
6:116223075:C:AA249E0.997
6:116223089:T:CF254L0.997
6:116223091:C:AF254L0.997
6:116223091:C:GF254L0.997
6:116237025:G:AG288R0.997
6:116237025:G:CG288R0.997

dbSNP variants (sampled 300 via entrez): RS1000019405 (6:116228984 T>A), RS1000033516 (6:116155885 G>A), RS1000043820 (6:116216754 ATTTATCTCT>A), RS1000049843 (6:116206990 T>C), RS1000068351 (6:116137384 G>T), RS1000069339 (6:116247598 A>G), RS1000113914 (6:116200180 G>A,T), RS1000149318 (6:116155568 A>G), RS1000158201 (6:116112478 C>G,T), RS1000191000 (6:116103581 G>C), RS1000195704 (6:116225816 G>A), RS1000195940 (6:116189728 A>G), RS1000202309 (6:116174839 A>T), RS1000215489 (6:116178374 G>A), RS1000219478 (6:116129799 G>A,T)

Disease associations

OMIM: gene MIM:621076 | disease phenotypes: MIM:156500

GenCC curated gene-disease

Mondo (1): Schmid metaphyseal chondrodysplasia (MONDO:0007983)

Orphanet (1): Metaphyseal chondrodysplasia, Schmid type (Orphanet:174)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

15 associations (top):

StudyTraitp-value
GCST001713_16Dental caries2.000000e-06
GCST003997_6Myopia2.000000e-13
GCST007643_3Gemcitabine-induced early high-grade neutropenia in pancreatic cancer3.000000e-06
GCST008070_13HDL cholesterol levels6.000000e-06
GCST008078_106LDL cholesterol levels x alcohol consumption (regular vs non-regular drinkers) interaction (2df)1.000000e-13
GCST008078_21LDL cholesterol levels x alcohol consumption (regular vs non-regular drinkers) interaction (2df)8.000000e-12
GCST008079_152LDL cholesterol levels x alcohol consumption (drinkers vs non-drinkers) interaction (2df)3.000000e-15
GCST008079_21LDL cholesterol levels x alcohol consumption (drinkers vs non-drinkers) interaction (2df)5.000000e-13
GCST008086_29LDL cholesterol levels in current drinkers7.000000e-07
GCST008086_71LDL cholesterol levels in current drinkers8.000000e-09
GCST008789_10Adolescent idiopathic scoliosis2.000000e-08
GCST010002_333Refractive error2.000000e-36
GCST011348_7High density lipoprotein cholesterol levels8.000000e-09
GCST90002383_453Hematocrit6.000000e-10
GCST90011900_57Serum alkaline phosphatase levels2.000000e-09

EFO canonical traits (5, from GWAS)

EFO IDTrait name
EFO:0004612high density lipoprotein cholesterol measurement
EFO:0004611low density lipoprotein cholesterol measurement
EFO:0004329alcohol drinking
EFO:0004348hematocrit
EFO:0004533alkaline phosphatase measurement

MeSH disease descriptors (1)

DescriptorNameTree numbers
C537352Metaphyseal chondrodysplasia Schmid type (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL4295852 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

35 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Aciddecreases expression, decreases methylation, affects expression5
trichostatin Aaffects cotreatment, decreases expression3
bisphenol Adecreases expression, increases expression2
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression2
Benzo(a)pyrenedecreases expression2
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxidedecreases expression2
Cyclosporinedecreases expression2
aristolochic acid Idecreases expression, increases expression1
bisphenol Fincreases expression1
dicrotophosdecreases expression1
methylmercuric chloridedecreases expression1
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
sodium arsenitedecreases expression1
nutlin 3increases secretion, affects cotreatment1
bisphenol Bincreases expression1
dorsomorphinaffects cotreatment, decreases expression1
jinfukangaffects cotreatment, increases expression1
LDN 193189affects cotreatment, decreases expression1
Sunitinibdecreases expression1
Air Pollutantsdecreases expression, increases abundance1
Cisplatinincreases expression, affects cotreatment1
Clorgylineincreases expression1
Dactinomycinaffects cotreatment, increases secretion1
Dimethyl Sulfoxideincreases expression1
Dinitrochlorobenzeneaffects binding1
Doxorubicindecreases expression1
Enzyme Inhibitorsdecreases activity, increases O-linked glycosylation1
Ivermectindecreases expression1
Tobacco Smoke Pollutiondecreases expression1
Tretinoindecreases expression1

ChEMBL screening assays

11 unique, capped per target: 11 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL4118796BindingBinding affinity to NT5DC1 in human NCI-H23 cells at 1 uM by mass spectrometry based pull down assayStudies of TAK1-centered polypharmacology with novel covalent TAK1 inhibitors. — Bioorg Med Chem

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.