CTDSP1

gene
On this page

Also known as NLIIFSCP1

Summary

CTDSP1 (CTD small phosphatase 1, HGNC:21614) is a protein-coding gene on chromosome 2q35, encoding Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 (Q9GZU7). Preferentially catalyzes the dephosphorylation of ‘Ser-5’ within the tandem 7 residue repeats in the C-terminal domain (CTD) of the largest RNA polymerase II subunit POLR2A.

This gene encodes a member of the small C-terminal domain phosphatase (SCP) family of nuclear phosphatases. These proteins play a role in transcriptional regulation through specific dephosphorylation of phosphoserine 5 within tandem heptapeptide repeats of the C-terminal domain of RNA polymerase II. The encoded protein plays a role in neuronal gene silencing in non-neuronal cells, and may also inhibit osteoblast differentiation. Alternatively spliced transcript variants encoding multiple isoforms have been observed for this gene.

Source: NCBI Gene 58190 — RefSeq curated summary.

At a glance

  • GWAS associations: 4
  • Clinical variants (ClinVar): 51 total
  • Druggable target: yes — 10 molecules with ChEMBL bioactivity
  • MANE Select transcript: NM_021198

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:21614
Approved symbolCTDSP1
NameCTD small phosphatase 1
Location2q35
Locus typegene with protein product
StatusApproved
AliasesNLIIF, SCP1
Ensembl geneENSG00000144579
Ensembl biotypeprotein_coding
OMIM605323
Entrez58190

Gene structure

Transcript identifiers

Ensembl transcripts: 21 — 14 protein_coding, 3 protein_coding_CDS_not_defined, 3 retained_intron, 1 nonsense_mediated_decay

ENST00000273062, ENST00000428361, ENST00000431127, ENST00000443891, ENST00000452977, ENST00000464255, ENST00000473420, ENST00000482272, ENST00000488627, ENST00000491064, ENST00000492545, ENST00000494067, ENST00000496785, ENST00000497677, ENST00000498160, ENST00000710828, ENST00000710837, ENST00000710838, ENST00000710839, ENST00000885504, ENST00000885505

RefSeq mRNA: 12 — MANE Select: NM_021198 NM_001206878, NM_001400268, NM_001400269, NM_001400270, NM_001400271, NM_001400272, NM_001400273, NM_001400274, NM_001400275, NM_001400276, NM_021198, NM_182642

CCDS: CCDS2416

Canonical transcript exons

ENST00000273062 — 7 exons

ExonStartEnd
ENSE00001038975218399868218400157
ENSE00001930491218404297218405941
ENSE00003585758218403232218403417
ENSE00004013738218402111218402215
ENSE00004013776218402349218402405
ENSE00004013779218401564218401712
ENSE00004013784218403035218403127

Expression profiles

Bgee: expression breadth ubiquitous, 271 present calls, max score 98.74.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 43.6429 / max 281.7028, expressed in 1820 samples.

FANTOM5 promoters (9 alternative TSS)

Promoter IDTPM avgSamples expressed
2534024.21541801
253386.28691669
253414.00971566
253422.86291280
253432.61751246
253351.7874452
253391.3546839
253360.3232171
253370.185395

Top tissues by expression

287 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
granulocyteCL:000009498.74gold quality
mucosa of stomachUBERON:000119998.10gold quality
right coronary arteryUBERON:000162598.10gold quality
body of stomachUBERON:000116197.90gold quality
descending thoracic aortaUBERON:000234597.87gold quality
apex of heartUBERON:000209897.84gold quality
left uterine tubeUBERON:000130397.82gold quality
right ovaryUBERON:000211897.76gold quality
thoracic aortaUBERON:000151597.75gold quality
endocervixUBERON:000045897.74gold quality
popliteal arteryUBERON:000225097.72gold quality
tibial arteryUBERON:000761097.72gold quality
right lobe of thyroid glandUBERON:000111997.71gold quality
ascending aortaUBERON:000149697.70gold quality
aortaUBERON:000094797.67gold quality
esophagogastric junction muscularis propriaUBERON:003584197.67gold quality
right lungUBERON:000216797.65gold quality
metanephros cortexUBERON:001053397.64gold quality
lower esophagus muscularis layerUBERON:003583397.60gold quality
body of uterusUBERON:000985397.59gold quality
lower esophagusUBERON:001347397.59gold quality
left coronary arteryUBERON:000162697.51gold quality
left ovaryUBERON:000211997.49gold quality
left lobe of thyroid glandUBERON:000112097.35gold quality
tibial nerveUBERON:000132397.34gold quality
monocyteCL:000057697.30gold quality
coronary arteryUBERON:000162197.30gold quality
leukocyteCL:000073897.26gold quality
stomachUBERON:000094597.20gold quality
small intestine Peyer’s patchUBERON:000345497.20gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes11.21

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

119 targeting CTDSP1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-6798-5P100.0065.77699
HSA-MIR-6851-5P100.0065.631294
HSA-MIR-3689D100.0066.141181
HSA-MIR-4747-5P100.0067.902681
HSA-MIR-6870-5P99.9968.552115
HSA-MIR-548C-3P99.9974.017587
HSA-MIR-6759-5P99.9966.54785
HSA-MIR-4789-5P99.9870.762721
HSA-MIR-4723-5P99.9768.702034
HSA-MIR-569899.9768.492029
HSA-MIR-7111-5P99.9768.482062
HSA-MIR-6825-5P99.9669.813431
HSA-MIR-493-5P99.9672.472382
HSA-MIR-96-5P99.9572.802140
HSA-MIR-185-3P99.9567.011743
HSA-MIR-6721-5P99.9368.922981
HSA-MIR-1213399.9271.822006
HSA-MIR-1271-5P99.9171.991972
HSA-MIR-6809-3P99.9171.453814
HSA-MIR-4753-3P99.9071.033786
HSA-MIR-124-3P99.8973.743043
HSA-MIR-506-3P99.8973.553057
HSA-MIR-4731-5P99.8967.232537
HSA-MIR-6783-3P99.8967.922059
HSA-MIR-1343-3P99.8966.781815
HSA-MIR-605-3P99.8869.221833
HSA-MIR-449299.8768.253611
HSA-MIR-391999.8769.452489
HSA-MIR-3151-5P99.8663.831069
HSA-MIR-4728-5P99.8569.394718

Literature-anchored findings (GeneRIF, showing 16)

  • Polymorphisms vary significantly between races; Not associated with TB susceptibility in Caucasians. (PMID:11950066)
  • SCP1 may play a role in the regulation of gene expression, possibly by controlling the transition from initiation/capping to processive transcript elongation (PMID:12721286)
  • results show that REST/NRSF recruits SCPs to neuronal genes that contain RE-1 elements, leading to neuronal gene silencing in non-neuronal cells (PMID:15681389)
  • SCP1 acts as a phosphatase for regulatory phosphorylations in the linker region of Smad1 and Smad2. (PMID:17085434)
  • Captured the phosphoryl-aspartate intermediate in the crystal structure of a Scp1D206A mutant soaked with para-nitrophenyl phosphate (pNPP), providing strong evidence for the proposed mechanism. (PMID:20222012)
  • SCP1 phosphatase substrate specificity: CdcA3 could be an enzymatic substrate for SCP1. (PMID:24769477)
  • hSCP1 may be an beta-O-linked N-acetylglucosamine modified protein in vivo. (PMID:25081999)
  • CTDSP1 activity stabilizes REST in stem cells and ERK-dependent phosphorylation combined with Pin1 activity promotes REST degradation in neural progenitors. (PMID:25197063)
  • SCP1 regulates c-Myc stability and functions through dephosphorylating c-Myc Serine-62. (PMID:25893300)
  • SCP1 is the phosphatase that counterregulates the MAPK-mediated phosphorylation of Ser(68)-Twist1. (PMID:26975371)
  • Our results provide a novel mechanistic insight into the activation of Myc by 27-HC via transcriptional repression of PP2A, SCP1 and FBW7 to increase Myc protein stability in breast cancer cells. (PMID:27751849)
  • REST is a bona fide substrate for SCP1 in vivo and that SCP1 phosphatase activity protects REST against degradation. (PMID:30217818)
  • Tumor suppressor properties of the small C-terminal domain phosphatases in non-small cell lung cancer. (PMID:31774910)
  • Deubiquitinase USP29 promotes gastric cancer cell migration by cooperating with phosphatase SCP1 to stabilize Snail protein. (PMID:32973332)
  • GPC5 suppresses lung cancer progression and metastasis via intracellular CTDSP1/AhR/ARNT signaling axis and extracellular exosome secretion. (PMID:34079082)
  • C-terminal domain small phosphatase 1 (CTDSP1) regulates growth factor expression and axonal regeneration in peripheral nerve tissue. (PMID:34262056)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_rerioctdsp1ENSDARG00000040674
mus_musculusCtdsp1ENSMUSG00000026176
rattus_norvegicusCtdsp1ENSRNOG00000015295
drosophila_melanogasterhzgFBGN0036556
caenorhabditis_elegansscpl-1WBGENE00007054

Paralogs (5): TIMM50 (ENSG00000105197), CTDSPL2 (ENSG00000137770), CTDSPL (ENSG00000144677), CTDSP2 (ENSG00000175215), CTDNEP1 (ENSG00000175826)

Protein

Protein identifiers

Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1Q9GZU7 (reviewed: Q9GZU7)

Alternative names: Nuclear LIM interactor-interacting factor 3, Small C-terminal domain phosphatase 1

All UniProt accessions (4): Q9GZU7, A0AA34QVU0, A0AA34QW09, H7C270

UniProt curated annotations — full annotation on UniProt →

Function. Preferentially catalyzes the dephosphorylation of ‘Ser-5’ within the tandem 7 residue repeats in the C-terminal domain (CTD) of the largest RNA polymerase II subunit POLR2A. Negatively regulates RNA polymerase II transcription, possibly by controlling the transition from initiation/capping to processive transcript elongation. Recruited by REST to neuronal genes that contain RE-1 elements, leading to neuronal gene silencing in non-neuronal cells.

Subunit / interactions. Monomer. Interacts with GTF2F1. Interacts with REST.

Subcellular location. Nucleus.

Tissue specificity. Expression is restricted to non-neuronal tissues. Highest expression in skeletal muscle, spleen, lung and placenta.

Activity regulation. Stimulated by GTF2F1. Inhibited by beryllofluoride anions.

Cofactor. Binds 1 Mg(2+) ion per monomer.

Isoforms (3)

UniProt IDNamesCanonical?
Q9GZU7-11yes
Q9GZU7-22
Q9GZU7-33

RefSeq proteins (12): NP_001193807, NP_001387197, NP_001387198, NP_001387199, NP_001387200, NP_001387201, NP_001387202, NP_001387203, NP_001387204, NP_001387205, NP_067021, NP_872580 (=MANE)

Domains & families (InterPro)

IDNameType
IPR004274FCP1_domDomain
IPR011948Dullard_phosphataseDomain
IPR023214HAD_sfHomologous_superfamily
IPR036412HAD-like_sfHomologous_superfamily
IPR040078RNA_Pol_CTD_PhosphataseFamily
IPR050365TIM50Family

Pfam: PF03031

Catalyzed reactions (Rhea), 2 shown:

  • O-phospho-L-seryl-[protein] + H2O = L-seryl-[protein] + phosphate (RHEA:20629)
  • O-phospho-L-threonyl-[protein] + H2O = L-threonyl-[protein] + phosphate (RHEA:47004)

UniProt features (41 total): strand 11, helix 10, binding site 3, site 2, splice variant 2, mutagenesis site 2, compositionally biased region 2, active site 2, chain 1, domain 1, modified residue 1, sequence variant 1, sequence conflict 1, region of interest 1, turn 1

Structure

Experimental structures (PDB)

11 structures.

PDBMethodResolution (Å)
2GHTX-RAY DIFFRACTION1.8
2GHQX-RAY DIFFRACTION2.05
1TA0X-RAY DIFFRACTION2.1
4YH1X-RAY DIFFRACTION2.2
1T9ZX-RAY DIFFRACTION2.3
3L0YX-RAY DIFFRACTION2.3
3L0BX-RAY DIFFRACTION2.35
3PGLX-RAY DIFFRACTION2.35
4YGYX-RAY DIFFRACTION2.36
3L0CX-RAY DIFFRACTION2.45
6DU3X-RAY DIFFRACTION2.58

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9GZU7-F182.130.70

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (4): 152 (transition state stabilizer); 190 (transition state stabilizer); 96 (4-aspartylphosphate intermediate); 98 (proton donor)

Ligand- & substrate-binding residues (3): 96; 98; 207

Post-translational modifications (1): 1

Mutagenesis-validated functional residues (2):

PositionPhenotype
96no effect. completely abolishes phosphatase activity; when associated with n-98.
98completely abolishes phosphatase activity; when associated with e-96.

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 232 (showing top): CREL_01, GOBP_NEGATIVE_REGULATION_OF_CELL_DEVELOPMENT, GOBP_NEGATIVE_REGULATION_OF_NEURON_DIFFERENTIATION, PAX4_01, SP3_Q3, GOBP_CELL_CYCLE_PHASE_TRANSITION, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_DN, AP4_Q6, GOBP_NEUROGENESIS, GOBP_REGULATION_OF_NERVOUS_SYSTEM_DEVELOPMENT, AP2_Q3, GOBP_NEGATIVE_REGULATION_OF_NERVOUS_SYSTEM_DEVELOPMENT, GTGCCTT_MIR506, GOBP_NEGATIVE_REGULATION_OF_MULTICELLULAR_ORGANISMAL_PROCESS, E2F_Q3

GO Biological Process (5): regulation of transcription by RNA polymerase II (GO:0006357), protein dephosphorylation (GO:0006470), negative regulation of neuron differentiation (GO:0045665), negative regulation of neurogenesis (GO:0050768), negative regulation of G1/S transition of mitotic cell cycle (GO:2000134)

GO Molecular Function (7): RNA polymerase II CTD heptapeptide repeat phosphatase activity (GO:0008420), metal ion binding (GO:0046872), phosphoprotein phosphatase activity (GO:0004721), protein serine/threonine phosphatase activity (GO:0004722), protein binding (GO:0005515), hydrolase activity (GO:0016787), phosphatase activity (GO:0016791)

GO Cellular Component (3): nucleus (GO:0005634), nucleoplasm (GO:0005654), extracellular exosome (GO:0070062)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
regulation of DNA-templated transcription1
transcription by RNA polymerase II1
dephosphorylation1
protein modification process1
neuron differentiation1
negative regulation of cell differentiation1
regulation of neuron differentiation1
negative regulation of cell development1
neurogenesis1
regulation of neurogenesis1
negative regulation of nervous system development1
G1/S transition of mitotic cell cycle1
negative regulation of mitotic cell cycle phase transition1
negative regulation of cell cycle G1/S phase transition1
regulation of G1/S transition of mitotic cell cycle1
protein serine/threonine phosphatase activity1
RNA polymerase II CTD heptapeptide repeat modifying activity1
cation binding1
phosphatase activity1
catalytic activity, acting on a protein1
phosphoprotein phosphatase activity1
binding1
catalytic activity1
phosphoric ester hydrolase activity1
intracellular membrane-bounded organelle1
nuclear lumen1
cellular anatomical structure1
extracellular vesicle1

Protein interactions and networks

STRING

1189 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CTDSP1CTDP1Q9Y5B0868
CTDSP1RESTQ13127830
CTDSP1UBLCP1Q8WVY7815
CTDSP1NIF3L1Q9GZT8768
CTDSP1SLC11A1P49279615
CTDSP1GTF2F1P35269564
CTDSP1CDCA3Q99618550
CTDSP1ACTL6BO94805506
CTDSP1PTBP1P26599502
CTDSP1RCOR1Q9UKL0491
CTDSP1ACTL6AO96019488
CTDSP1LDB1Q86U70474
CTDSP1SSU72Q9NP77444
CTDSP1KDM5CP41229442
CTDSP1KCNN4O15554428

IntAct

134 interactions, top by confidence:

ABTypeScore
CTDSP1CDCA3psi-mi:“MI:0915”(physical association)0.940
CDCA3CTDSP1psi-mi:“MI:0915”(physical association)0.940
CTDSP1CDCA3psi-mi:“MI:0914”(association)0.940
CTDSP1MBPpsi-mi:“MI:0915”(physical association)0.740
MBPCTDSP1psi-mi:“MI:0915”(physical association)0.740
MBPCTDSP1psi-mi:“MI:0915”(physical association)0.670
CDCA3CTDSPLpsi-mi:“MI:0914”(association)0.670
CDALIN7Apsi-mi:“MI:0914”(association)0.640
CTDSP1CTDSPLpsi-mi:“MI:0914”(association)0.640
MAPKBP1CTDSP1psi-mi:“MI:0915”(physical association)0.560
ITPKBCTDSP1psi-mi:“MI:0915”(physical association)0.560
FAM110ACTDSP1psi-mi:“MI:0915”(physical association)0.560
ACTN3CTDSP1psi-mi:“MI:0915”(physical association)0.560
CTDSP1WWOXpsi-mi:“MI:0915”(physical association)0.560
TRAPPC6ACTDSP1psi-mi:“MI:0915”(physical association)0.560
CDKN2CCTDSP1psi-mi:“MI:0915”(physical association)0.560
FAAP20CTDSP1psi-mi:“MI:0915”(physical association)0.560
MORN3CTDSP1psi-mi:“MI:0915”(physical association)0.560
ZBTB16CTDSP1psi-mi:“MI:0915”(physical association)0.560
CTDSP1FANCLpsi-mi:“MI:0915”(physical association)0.560
LCKCTDSP1psi-mi:“MI:0915”(physical association)0.560
CTDSP1CSTF2Tpsi-mi:“MI:0915”(physical association)0.560
RAB3IL1CTDSP1psi-mi:“MI:0915”(physical association)0.560
SCNM1CTDSP1psi-mi:“MI:0915”(physical association)0.560

BioGRID (150): CTDSP1 (Two-hybrid), CDCA3 (Two-hybrid), CTDSP2 (Affinity Capture-MS), CTDSPL (Affinity Capture-MS), TUBB8 (Affinity Capture-MS), TUBB1 (Affinity Capture-MS), HEATR6 (Affinity Capture-MS), DSTYK (Affinity Capture-MS), TUBA4A (Affinity Capture-MS), TUBA1A (Affinity Capture-MS), CDCA3 (Affinity Capture-MS), FAM118B (Affinity Capture-MS), CDCA3 (Two-hybrid), REST (Affinity Capture-Western), CDCA3 (Affinity Capture-MS)

ESM2 similar proteins: A4QNX6, M9PFN0, O14595, O15194, O19111, O59697, O80673, P09217, P32019, P53681, P58465, P58466, Q00IB6, Q02956, Q05513, Q05D32, Q08BB5, Q09315, Q09695, Q14BI7, Q1EHT7, Q30DN6, Q3KQB6, Q3MHU3, Q5F3Z7, Q5XIK8, Q5XUN4, Q5ZI74, Q62240, Q66KM5, Q75WV3, Q801R4, Q8BG15, Q8BX07, Q8K337, Q8K4T5, Q8L7M4, Q8NDG6, Q8SV03, Q9BY66

Diamond homologs: A4QNX6, M9PFN0, O13636, O14595, O15194, O59718, O95476, P0CN66, P0CN67, P38757, P58465, P58466, Q02776, Q05D32, Q07800, Q07949, Q08BB5, Q09695, Q1RMV9, Q20432, Q28HW9, Q29I63, Q3B7T6, Q3KQB6, Q3TP92, Q3ZCQ8, Q4I099, Q4PEW9, Q4WI16, Q54GB2, Q59W44, Q5B4P0, Q5F3Z7, Q5RAJ8, Q5S7T7, Q5U395, Q5U3T3, Q5XIK8, Q61C05, Q66KM5

SIGNOR signaling

39 interactions.

AEffectBMechanism
CTDSP1down-regulatesSMAD1dephosphorylation
CTDSP1“form complex”NLI/Lmx1.1/Isl1binding
CTDSP1“down-regulates activity”TWIST1dephosphorylation
CTDSP1“up-regulates activity”SMAD3dephosphorylation
CTDSP1“down-regulates activity”SMAD2dephosphorylation
CTDSP1“down-regulates activity”SMAD1dephosphorylation
CTDSP1“up-regulates activity”PRKDCdephosphorylation
CTDSP1“up-regulates activity”POLR2Adephosphorylation

Disease & clinical

Clinical variants and AI predictions

ClinVar

51 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance37
Likely benign0
Benign2

Top pathogenic / likely-pathogenic (0)

SpliceAI

1153 predictions. Top by Δscore:

VariantEffectΔscore
2:218401711:AGGT:Adonor_loss1.0000
2:218401713:GTGC:Gdonor_loss1.0000
2:218402109:A:AGacceptor_gain1.0000
2:218402110:G:GGacceptor_gain1.0000
2:218402110:GC:Gacceptor_gain1.0000
2:218402110:GCA:Gacceptor_gain1.0000
2:218402211:TCAAG:Tdonor_loss1.0000
2:218402212:CAAGG:Cdonor_loss1.0000
2:218402213:AAG:Adonor_loss1.0000
2:218402214:AGG:Adonor_loss1.0000
2:218402215:GGTG:Gdonor_loss1.0000
2:218402347:A:AGacceptor_gain1.0000
2:218402348:G:GGacceptor_gain1.0000
2:218402348:GCCA:Gacceptor_gain1.0000
2:218402405:GGTGA:Gdonor_loss1.0000
2:218402406:G:Cdonor_loss1.0000
2:218402407:T:Gdonor_loss1.0000
2:218402417:G:GTdonor_gain1.0000
2:218403033:A:ACacceptor_loss1.0000
2:218403128:GT:Gdonor_loss1.0000
2:218403129:T:Gdonor_loss1.0000
2:218403230:A:AGacceptor_gain1.0000
2:218403231:G:GGacceptor_gain1.0000
2:218403231:GTAC:Gacceptor_gain1.0000
2:218403413:ATGCT:Adonor_gain1.0000
2:218403417:TG:Tdonor_loss1.0000
2:218403418:G:GGdonor_gain1.0000
2:218403418:GT:Gdonor_loss1.0000
2:218404292:CCCAG:Cacceptor_loss1.0000
2:218404293:CCAG:Cacceptor_loss1.0000

AlphaMissense

1698 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:218402175:T:AV94D1.000
2:218402178:T:AI95N1.000
2:218402180:G:CD96H1.000
2:218402181:A:CD96A1.000
2:218402181:A:TD96V1.000
2:218402182:C:AD96E1.000
2:218402182:C:GD96E1.000
2:218402184:T:AL97Q1.000
2:218402184:T:CL97P1.000
2:218402186:G:AD98N1.000
2:218402186:G:CD98H1.000
2:218402186:G:TD98Y1.000
2:218402187:A:CD98A1.000
2:218402187:A:GD98G1.000
2:218402187:A:TD98V1.000
2:218402188:C:AD98E1.000
2:218402188:C:GD98E1.000
2:218402190:A:CE99A1.000
2:218402190:A:TE99V1.000
2:218402191:G:CE99D1.000
2:218402191:G:TE99D1.000
2:218402192:A:CT100P1.000
2:218402193:C:TT100I1.000
2:218402196:T:AL101Q1.000
2:218402196:T:CL101P1.000
2:218402199:T:AV102E1.000
2:218402201:C:GH103D1.000
2:218402203:C:AH103Q1.000
2:218402203:C:GH103Q1.000
2:218402204:A:CS104R1.000

dbSNP variants (sampled 300 via entrez): RS1000027061 (2:218404221 G>A), RS1000081277 (2:218401932 C>G), RS1000331553 (2:218395095 G>A,C), RS1000480197 (2:218395644 G>A), RS1000637242 (2:218397792 GTAAC>G,GTAACTAAC), RS1000668580 (2:218397682 G>A), RS1000724276 (2:218396262 G>A), RS1000921583 (2:218405326 T>A,C), RS1000950661 (2:218406262 C>A,G,T), RS1000952394 (2:218405155 AC>A,ACC), RS1001071596 (2:218396158 G>C,T), RS1001577234 (2:218399498 A>G), RS1001637847 (2:218398468 C>T), RS1001753386 (2:218402042 A>G), RS1001828632 (2:218403196 C>T)

Disease associations

OMIM: gene MIM:605323 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

4 associations (top):

StudyTraitp-value
GCST001725_74Inflammatory bowel disease4.000000e-12
GCST004605_61Mean corpuscular hemoglobin concentration4.000000e-12
GCST90002387_60Immature fraction of reticulocytes2.000000e-21
GCST90002391_141Mean corpuscular hemoglobin concentration3.000000e-27

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0004528mean corpuscular hemoglobin concentration

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL1795098 (SINGLE PROTEIN)

Molecules with ChEMBL bioactivity

10 molecules (phase ≥1), by development phase (incl. off-target/promiscuous compounds). Patent mentions across the top 20 by phase: 126,848 (via chembl_molecule»patent_compound — counts attach to the compound, not the gene–compound relationship, so off-target/promiscuous molecules can dominate).

MoleculeNamePhasePatents
CHEMBL1200347ISOPROPAMIDE IODIDE42,616
CHEMBL2146124CEFUROXIME SODIUM456
CHEMBL3561076TRIMETAZIDINE DIHYDROCHLORIDE4280
CHEMBL617CEPHALOTHIN424,927
CHEMBL680CEFACLOR ANHYDROUS434,121
CHEMBL927CEFDINIR419,403
CHEMBL1716983ENISAMIUM IODIDE31
CHEMBL449317HESPERIDIN318,753
CHEMBL1257065STILONIUM IODIDE2751
CHEMBL150KAEMPFEROL125,940

PharmGKB: 1 entry (VIP=true, CPIC=false)

Binding affinities (BindingDB)

927 measured of 947 human assays (1011 total across all organisms); most potent 50 below. Values come from heterogeneous assays and are not directly comparable.

LigandMeasureValue
3-(2-furanylmethyl)-2-propylthiochromeno[2,3-d]pyrimidine-4,5-dioneEC500.0148 nM
6-bromo-2-{[4-(2-furoyl)-1-piperazinyl]carbonyl}pyrazolo[1,5-a]pyrimidineEC502.73 nM
4-[[2-[(4-keto-6-methyl-1H-pyrimidin-2-yl)thio]acetyl]amino]benzoic acid ethyl esterIC5023 nM
5-(tosylmethyl)-2-furoic acidIC5023 nM
3-[2-(hydroximinomethyl)pyrrol-1-yl]benzoic acidIC5023 nM
2-[(4-benzyl-5-methyl-1,2,4-triazol-3-yl)sulfanyl]-N-(furan-2-ylmethylcarbamoyl)acetamideEC5044.7 nM
cid_694792KI60 nM
MLS000122969IC50127 nM
MLS000082005IC50351 nM
MLS000562014IC50408 nM
SMR000199423EC50629 nM
(5E)-5-[(4-methoxy-3-methylphenyl)methylidene]-1-methyl-2-sulfanylidene-1,3-diazinane-4,6-dioneIC50632 nM
N-[6-azanyl-2,4-bis(oxidanylidene)-1-(phenylmethyl)pyrimidin-5-yl]-N-propyl-3-pyrrol-1-yl-benzamideEC50720 nM
(E)-1-(1,3-dithiolan-2-ylidene)-4-(2-furanyl)-3-buten-2-oneEC50781 nM
(E)-N-[2-[(N’‘E)-N’’-[1-(4-ethylphenyl)ethylidene]hydrazino]-2-keto-ethyl]-3-(2-furyl)acrylamideIC50844 nM
SMR000241765IC501020 nM
cid_1728957IC501020 nM
MLS001008494IC501030 nM
MLS000531387IC501050 nM
(E)-3-(3,4-dimethoxyphenyl)-N-[(2-hydroxy-5-nitro-phenyl)thiocarbamoyl]acrylamideIC501090 nM
2-[3-[2-hydroxyethyl(dimethyl)azaniumyl]propanoylamino]-2-methyl-propane-1-sulfonateEC501160 nM
AmBIC501170 nM
2-[2,6-bis(chloranyl)-4-[(Z)-[3-[2-[(4-methylphenyl)amino]-2-oxidanylidene-ethyl]-2,4-bis(oxidanylidene)-1,3-thiazolidin-5-ylidene]methyl]phenoxy]ethanoic acidIC501200 nM
2-{3-[(2-Hydroxy-benzoyl)-hydrazono]-2-oxo-2,3-dihydro-indol-1-yl}-N-o-tolyl-acetamideIC501230 nM
cid_2480569IC501380 nM
[4-[[(3,5-dihydroxybenzoyl)hydrazinylidene]methyl]phenyl] 2-chlorobenzoateIC501400 nM
3-[((2Z)-5-[(acetyloxy)methyl]-3-{[(2,4-dimethoxyphenyl)amino]carbonyl}-8-methyl-2H-pyrano[2,3-c]pyridin-2-ylidene)amino]benzoic acidIC501530 nM
(5Z)-2-azanylidene-3-(4-fluorophenyl)-5-[(4-hydroxyphenyl)methylidene]-1,3-thiazolidin-4-oneIC501550 nM
(5E)-5-[[5-[[(2-anilino-1,2-dioxoethyl)amino]methyl]-2-furanyl]methylidene]-2-methyl-4-oxo-1H-pyrrole-3-carboxylic acid ethyl esterIC501560 nM
2-[(4-benzyl-5-thiophen-2-yl-1,2,4-triazol-3-yl)sulfanylmethyl]pyrimido[2,1-b][1,3]benzothiazol-4-oneIC501560 nM
MLS000673878IC501570 nM
cid_4790991IC501620 nM
3-(3-nitrophenyl)-N’-[(1E)-1-(6-oxo-1-cyclohexa-2,4-dienylidene)ethyl]-1H-pyrazole-5-carbohydrazideIC501700 nM
N-[[(E)-3-phenylprop-2-enylidene]amino]-2-benzo[e]benzofurancarboxamideIC501720 nM
(E)-2-(6-methyl-1H-benzimidazol-2-yl)-3-[5-(3-nitrophenyl)-2-furanyl]-2-propenenitrileIC501750 nM
5-chloro-2-thiophenecarboxylic acid [4-[(E)-2-(1H-benzimidazol-2-yl)-2-cyanoethenyl]phenyl] esterIC501770 nM
N-(3-methylphenyl)-2-[4-[(Z)-2-[5-nitro-2,4-bis(oxidanylidene)-1H-pyrimidin-6-yl]ethenyl]phenoxy]ethanamideIC501780 nM
4-{2-[(5-bromo-3-pyridinyl)carbonyl]carbonohydrazonoyl}phenyl 1,3-benzodioxole-5-carboxylateIC501800 nM
MLS000585164IC501820 nM
5-bromo-2-thiophenecarboxylic acid [4-[(Z)-2-(1H-benzimidazol-2-yl)-2-cyanoethenyl]phenyl] esterIC501820 nM
MLS-0336006.0001IC501830 nM
4-(5,5-dimethyl-3-morpholin-4-ylcyclohex-2-en-1-ylidene)-2,6-dimethylmorpholin-4-ium;iodideIC501830 nM
MLS000773704IC501850 nM
MLS001211762IC501880 nM
N’-(3-chlorobenzoyl)thieno[2,3-b]quinoline-2-carbohydrazideIC501900 nM
(2Z)-5-amino-2-(2-furfurylidene)-7-(2-furyl)-3-keto-7H-thiazolo[3,2-a]pyridine-6,8-dicarbonitrileIC501920 nM
(8S)-7-(3,3-diphenylpropanoyl)-3-[3-(2-methylprop-2-enoylamino)phenyl]-1-oxa-2,7-diazaspiro[4.4]non-2-ene-8-carboxamideEC501940 nM
MLS000548432IC501990 nM
3-[7-[(2-chlorophenyl)methoxy]-4,8-dimethyl-2-oxochromen-3-yl]propanoic acidIC501990 nM
cid_5754238IC502000 nM

ChEMBL bioactivities

779 potent at pChembl≥5 of 1003 total, top 50 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
6.39IC50408nMCHEMBL1464280
6.34IC50453nMCHEMBL1533230
6.27IC50540nMCHEMBL1360585
6.20IC50632nMCHEMBL1477703
6.15IC50711nMCHEMBL1309432
6.13IC50737nMCHEMBL3212539
6.08IC50836nMCHEMBL1325664
6.07IC50844nMCHEMBL1967028
6.05IC50900nMCHEMBL1517799
5.99IC501027nMCHEMBL1461216
5.99IC501025nMCHEMBL1361920
5.99IC501017nMCHEMBL1735048
5.95IC501120nMCHEMBL1324382
5.93IC501169nMCHEMBL1700818
5.91IC501227nMCHEMBL598477
5.91IC501231nMCHEMBL1374955
5.86IC501375nMCHEMBL1557036
5.86IC501396nMCHEMBL1978925
5.84IC501444nMCHEMBL1481493
5.84IC501455nMCHEMBL3856092
5.81IC501544nMCHEMBL3194164
5.81IC501552nMCHEMBL1560629
5.80IC501567nMCHEMBL1538387
5.80IC501595nMCHEMBL1374016
5.79IC501603nMCHEMBL1485960
5.79IC501615nMCHEMBL1322848
5.79IC501614nMCHEMBL1510349
5.77IC501700nMCHEMBL1975756
5.76IC501718nMCHEMBL3197136
5.76IC501719nMCHEMBL1305930
5.76IC501734nMCHEMBL1340117
5.75IC501768nMCHEMBL1528986
5.75IC501800nMCHEMBL3198331
5.75IC501779nMCHEMBL1501349
5.75IC501777nMCHEMBL1575221
5.74IC501828nMCHEMBL1977271
5.74IC501828nMCHEMBL1390354
5.74IC501838nMCHEMBL1469245
5.74IC501818nMCHEMBL1883674
5.74IC501830nMCHEMBL1566816
5.73IC501879nMCHEMBL1302375
5.73IC501847nMCHEMBL3195249
5.73IC501878nMCHEMBL1443718
5.73IC501857nMCHEMBL1612081
5.72IC501896nMCHEMBL1331974
5.72IC501894nMCHEMBL1368108
5.72IC501889nMCHEMBL5483017
5.70IC501988nMCHEMBL1466471
5.70IC502002nMCHEMBL1490134
5.70IC502000nMCHEMBL1342081

PubChem BioAssay actives

3 with measured affinity, of 17 total; 2 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
1,1-dioxo-N-[3-[[4-(trifluoromethoxy)phenyl]sulfonylamino]propyl]-1-benzothiophene-2-carboxamide1824654: Covalent inhibition of wild type human SCP1 assessed as inhibition constant using Ac-EDLphosphoSPPSPPLPK-NH2 peptide as substrate preincubated for 0.5 to 18 hrs followed by substrate addition by malachite green reagent based assayki4.8000uM
1,1-dioxo-N-[2-[2-[[4-(trifluoromethoxy)phenyl]sulfonylamino]ethoxy]ethyl]-1-benzothiophene-2-carboxamide1824652: Inhibition of wild type human SCP1 using pNPP as substrate incubated for 0.5 to 18 hrs followed by substrate addition by absorbance based analysisic5010.0000uM

CTD chemical–gene interactions

56 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arseniteincreases abundance, increases expression, decreases expression, affects cotreatment5
mercuric bromidedecreases expression, affects cotreatment2
Air Pollutantsincreases abundance, increases oxidation, decreases expression, affects cotreatment2
Arsenicincreases abundance, increases expression, affects cotreatment, decreases expression2
Ozoneincreases oxidation, increases abundance, decreases expression, affects cotreatment2
Phenylmercuric Acetateaffects cotreatment, decreases expression2
Tobacco Smoke Pollutiondecreases expression2
Cadmium Chloridedecreases expression, decreases reaction, increases abundance, increases palmitoylation2
p-Chloromercuribenzoic Acidaffects cotreatment, decreases expression2
aristolochic acid Idecreases expression1
bufotalindecreases expression1
methylmercuric chloridedecreases expression1
triphenyl phosphateaffects expression1
alpha-pineneaffects cotreatment, increases oxidation, increases abundance1
propionaldehydedecreases expression1
trichostatin Aaffects expression1
afimoxifenedecreases expression, decreases reaction1
cobaltous chloridedecreases expression1
butyraldehydedecreases expression1
2-bromopalmitateincreases palmitoylation, decreases reaction, increases abundance1
manganese chlorideincreases abundance, affects cotreatment, decreases expression1
ferrous chloridedecreases expression1
methacrylaldehydeaffects cotreatment, increases oxidation, increases abundance1
testosterone-3-carboxymethyloxime-bovine serum albumin conjugateaffects expression1
arsenic disulfideaffects expression1
di-n-butylphosphoric acidaffects expression1
monomethylarsonous aciddecreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression1
dorsomorphinaffects cotreatment, decreases expression1
Sunitinibincreases expression1

ChEMBL screening assays

11 unique, capped per target: 10 binding, 1 functional

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL1794574FunctionalPUBCHEM_BIOASSAY: Dose Response confirmation of uHTS hits for Scp-1 phosphatase using a colorimetric assay. (Class of assay: confirmatory) [Related pubchem assays (depositor defined):AID493091, AID493120]PubChem BioAssay data set
CHEMBL3429233BindingInhibition of SCP1 (unknown origin) using pNPP as substrate at pH 7 at 25 degC by spectrophotometric analysisA potent and selective inhibitor for the UBLCP1 proteasome phosphatase. — Bioorg Med Chem

Cellosaurus cell lines

3 cell lines: 2 cancer cell line, 1 transformed cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_B2VDAbcam HEK293T CTDSP1 KOTransformed cell lineFemale
CVCL_SK00HAP1 CTDSP1 (-) 1Cancer cell lineMale
CVCL_SK01HAP1 CTDSP1 (-) 2Cancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.