NCBP2

gene
On this page

Also known as NIP1CBP20Cbc2

Summary

NCBP2 (nuclear cap binding protein subunit 2, HGNC:7659) is a protein-coding gene on chromosome 3q29, encoding Nuclear cap-binding protein subunit 2 (P52298). Component of the cap-binding complex (CBC), which binds co-transcriptionally to the 5’ cap of pre-mRNAs and is involved in various processes such as pre-mRNA splicing, translation regulation, nonsense-mediated mRNA decay, RNA-mediated gene silencing (RNAi) by microRNAs (miRNAs)…. It is a common-essential gene (DepMap: required in 99.8% of cancer cell lines).

The product of this gene is a component of the nuclear cap-binding protein complex (CBC), which binds to the monomethylated 5’ cap of nascent pre-mRNA in the nucleoplasm. The encoded protein has an RNP domain commonly found in RNA binding proteins, and contains the cap-binding activity. The CBC promotes pre-mRNA splicing, 3’-end processing, RNA nuclear export, and nonsense-mediated mRNA decay. Multiple transcript variants encoding different isoforms have been found for this gene.

Source: NCBI Gene 22916 — RefSeq curated summary.

At a glance

  • GWAS associations: 3
  • Clinical variants (ClinVar): 20 total
  • Druggable target: yes
  • Cancer dependency (DepMap): dependent in 99.8% of screened cell lines (common-essential)
  • MANE Select transcript: NM_007362

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:7659
Approved symbolNCBP2
Namenuclear cap binding protein subunit 2
Location3q29
Locus typegene with protein product
StatusApproved
AliasesNIP1, CBP20, Cbc2
Ensembl geneENSG00000114503
Ensembl biotypeprotein_coding
OMIM605133
Entrez22916

Gene structure

Transcript identifiers

Ensembl transcripts: 13 — 7 protein_coding, 4 retained_intron, 1 nonsense_mediated_decay, 1 protein_coding_CDS_not_defined

ENST00000321256, ENST00000411704, ENST00000422610, ENST00000427641, ENST00000428425, ENST00000447325, ENST00000452404, ENST00000455953, ENST00000463783, ENST00000467803, ENST00000468923, ENST00000479647, ENST00000482976

RefSeq mRNA: 4 — MANE Select: NM_007362 NM_001042540, NM_001308036, NM_001410857, NM_007362

CCDS: CCDS3323, CCDS46986, CCDS77878, CCDS93446

Canonical transcript exons

ENST00000321256 — 4 exons

ExonStartEnd
ENSE00001056985196942426196942528
ENSE00001746919196935406196937082
ENSE00003539773196939251196939432
ENSE00003554682196937510196937648

Expression profiles

Bgee: expression breadth ubiquitous, 294 present calls, max score 98.36.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 56.7747 / max 673.4212, expressed in 1823 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
4639053.34581823
463892.11151267
463880.9871694
463870.3302137

Top tissues by expression

295 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
ganglionic eminenceUBERON:000402398.36gold quality
cortical plateUBERON:000534398.33gold quality
ventricular zoneUBERON:000305398.15gold quality
secondary oocyteCL:000065596.22gold quality
embryoUBERON:000092295.88gold quality
nucleus accumbensUBERON:000188295.46gold quality
stromal cell of endometriumCL:000225595.36gold quality
caudate nucleusUBERON:000187395.22gold quality
rectumUBERON:000105295.14gold quality
islet of LangerhansUBERON:000000695.13gold quality
amygdalaUBERON:000187695.10gold quality
putamenUBERON:000187495.06gold quality
C1 segment of cervical spinal cordUBERON:000646994.71gold quality
right adrenal gland cortexUBERON:003582794.70gold quality
lymph nodeUBERON:000002994.66gold quality
right adrenal glandUBERON:000123394.63gold quality
left adrenal glandUBERON:000123494.60gold quality
endometrium epitheliumUBERON:000481194.54gold quality
gall bladderUBERON:000211094.49gold quality
left adrenal gland cortexUBERON:003582594.38gold quality
spinal cordUBERON:000224094.36gold quality
muscle layer of sigmoid colonUBERON:003580594.22gold quality
adrenal cortexUBERON:000123594.21gold quality
adrenal tissueUBERON:001830394.20gold quality
lower esophagus muscularis layerUBERON:003583394.19gold quality
pericardiumUBERON:000240794.18gold quality
lower esophagusUBERON:001347394.18gold quality
adrenal glandUBERON:000236994.12gold quality
endometriumUBERON:000129594.05gold quality
smooth muscle tissueUBERON:000113593.99gold quality

Single-cell (SCXA)

Detected in 4 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-MTAB-6911no536.99
E-MTAB-7606no379.02
E-MTAB-5061no3.57
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

70 targeting NCBP2, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-4682100.0068.891258
HSA-MIR-1252-5P100.0069.802774
HSA-MIR-3646100.0073.565283
HSA-MIR-4425100.0067.591049
HSA-MIR-186-5P99.9970.833707
HSA-MIR-366299.9973.825684
HSA-MIR-499A-5P99.9870.791323
HSA-MIR-19A-3P99.9875.332762
HSA-MIR-19B-3P99.9875.442754
HSA-MIR-6888-3P99.9765.951170
HSA-MIR-548AN99.9770.912817
HSA-MIR-314899.9775.066478
HSA-MIR-426799.9666.532368
HSA-MIR-6778-3P99.9667.292693
HSA-MIR-545-3P99.9570.742783
HSA-MIR-335-3P99.9373.364958
HSA-MIR-7-1-3P99.9171.534384
HSA-MIR-7-2-3P99.9171.404394
HSA-MIR-430299.8967.941187
HSA-MIR-548AZ-5P99.8369.943230
HSA-MIR-548T-5P99.8369.913220
HSA-MIR-442099.8270.081624
HSA-MIR-181B-2-3P99.8170.061646
HSA-MIR-181B-3P99.8170.061646
HSA-MIR-4639-5P99.8167.371028
HSA-MIR-432099.7565.80793
HSA-MIR-518A-5P99.7069.012209
HSA-MIR-52799.7069.012209
HSA-MIR-128399.6972.423009
HSA-MIR-7161-5P99.6868.921592

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 99.8% of screened cell lines, common-essential.

Literature-anchored findings (GeneRIF, showing 9)

  • crystal structure at 2.1 A resolution of CBP20 and CBP80 bound to an m(7)GpppG cap analogue (PMID:12374755)
  • Structural basis of m7GpppG binding to the nuclear cap-binding protein complex. (PMID:12434151)
  • We show that NELF interacts with the nuclear cap binding complex (CBC), a heterodimeric, multifunctional factor that plays important roles in several mRNA processing steps, and the two factors together participate in the 3’ end processing of histone mRNAs (PMID:17499042)
  • a new MIF4G domain-containing protein, CTIF (CBP80/20-dependent translation initiation factor) that interacts directly with CBP80 and is part of the CBP80/20-dependent translation initiation complex (PMID:19648179)
  • determined how importin-beta binds to the cap-binding complex (CBC) through its CBP20 subunit (PMID:19668212)
  • A novel alternative splice variant of CBP20, termed CBP20S, has an in-frame deletion, leading to the translation of a protein lacking most of the RNA recognition motif (RRM). (PMID:21326824)
  • NCBP2 is not required for cell viability and poly(A) RNA export. (PMID:26382858)
  • Affinity proteomic dissection of the human nuclear cap-binding complex interactome. (PMID:32960270)
  • m7G-related genes-NCBP2 and EIF4E3 determine immune contexture in head and neck squamous cell carcinoma by regulating CCL4/CCL5 expression. (PMID:37067401)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_rerioncbp2ENSDARG00000014898
mus_musculusNcbp2ENSMUSG00000022774
rattus_norvegicusNcbp2ENSRNOG00000001746
drosophila_melanogasterCbp20FBGN0022943
caenorhabditis_elegansncbp-2WBGENE00009141

Paralogs (1): NCBP2L (ENSG00000170935)

Protein

Protein identifiers

Nuclear cap-binding protein subunit 2P52298 (reviewed: P52298)

Alternative names: 20 kDa nuclear cap-binding protein, Cell proliferation-inducing gene 55 protein, NCBP 20 kDa subunit, NCBP-interacting protein 1

All UniProt accessions (5): C9JKF4, C9JQX9, P52298, F6WVI0, F8WE41

UniProt curated annotations — full annotation on UniProt →

Function. Component of the cap-binding complex (CBC), which binds co-transcriptionally to the 5’ cap of pre-mRNAs and is involved in various processes such as pre-mRNA splicing, translation regulation, nonsense-mediated mRNA decay, RNA-mediated gene silencing (RNAi) by microRNAs (miRNAs) and mRNA export. The CBC complex is involved in mRNA export from the nucleus via its interaction with ALYREF/THOC4/ALY, leading to the recruitment of the mRNA export machinery to the 5’ end of mRNA and to mRNA export in a 5’ to 3’ direction through the nuclear pore. The CBC complex is also involved in mediating U snRNA and intronless mRNAs export from the nucleus. The CBC complex is essential for a pioneer round of mRNA translation, before steady state translation when the CBC complex is replaced by cytoplasmic cap-binding protein eIF4E. The pioneer round of mRNA translation mediated by the CBC complex plays a central role in nonsense-mediated mRNA decay (NMD), NMD only taking place in mRNAs bound to the CBC complex, but not on eIF4E-bound mRNAs. The CBC complex enhances NMD in mRNAs containing at least one exon-junction complex (EJC) via its interaction with UPF1, promoting the interaction between UPF1 and UPF2. The CBC complex is also involved in ‘failsafe’ NMD, which is independent of the EJC complex, while it does not participate in Staufen-mediated mRNA decay (SMD). During cell proliferation, the CBC complex is also involved in microRNAs (miRNAs) biogenesis via its interaction with SRRT/ARS2, thereby being required for miRNA-mediated RNA interference. The CBC complex also acts as a negative regulator of PARN, thereby acting as an inhibitor of mRNA deadenylation. In the CBC complex, NCBP2/CBP20 recognizes and binds capped RNAs (m7GpppG-capped RNA) but requires NCBP1/CBP80 to stabilize the movement of its N-terminal loop and lock the CBC into a high affinity cap-binding state with the cap structure. The conventional cap-binding complex with NCBP2 binds both small nuclear RNA (snRNA) and messenger (mRNA) and is involved in their export from the nucleus.

Subunit / interactions. Component of the nuclear cap-binding complex (CBC), a heterodimer composed of NCBP1/CBP80 and NCBP2/CBP20 that interacts with m7GpppG-capped RNA. Found in a U snRNA export complex with PHAX/RNUXA, NCBP1/CBP80, NCBP2/CBP20, RAN, XPO1 and m7G-capped RNA. Interacts with PHAX/RNUXA, EIF4G1, HNRNPF, HNRNPH1 and ALYREF/THOC4/ALY. Interacts with SRRT/ARS2 and KPNA3.

Subcellular location. Nucleus. Cytoplasm.

Similarity. Belongs to the RRM NCBP2 family.

Isoforms (3)

UniProt IDNamesCanonical?
P52298-11yes
P52298-22
P52298-33

RefSeq proteins (4): NP_001036005, NP_001294965, NP_001397786, NP_031388* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000504RRM_domDomain
IPR012677Nucleotide-bd_a/b_plait_sfHomologous_superfamily
IPR027157NCBP2Family
IPR034148NCBP2_RRMDomain
IPR035979RBD_domain_sfHomologous_superfamily

Pfam: PF00076

UniProt features (50 total): mutagenesis site 13, helix 8, strand 8, binding site 5, modified residue 4, turn 4, splice variant 2, initiator methionine 1, chain 1, domain 1, sequence conflict 1, region of interest 1, compositionally biased region 1

Structure

Experimental structures (PDB)

18 structures.

PDBMethodResolution (Å)
1H2VX-RAY DIFFRACTION2
1H6KX-RAY DIFFRACTION2
1H2TX-RAY DIFFRACTION2.1
1N52X-RAY DIFFRACTION2.11
3FEYX-RAY DIFFRACTION2.2
1H2UX-RAY DIFFRACTION2.4
9HFLELECTRON MICROSCOPY2.62
6D0YX-RAY DIFFRACTION2.68
1N54X-RAY DIFFRACTION2.72
5OOBX-RAY DIFFRACTION2.79
5OO6X-RAY DIFFRACTION2.8
8BY6ELECTRON MICROSCOPY3.19
8SRRELECTRON MICROSCOPY3.22
8SUYELECTRON MICROSCOPY3.38
8PMPELECTRON MICROSCOPY3.43
8PNTELECTRON MICROSCOPY3.46
3FEXX-RAY DIFFRACTION3.55
7ABGELECTRON MICROSCOPY7.8

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P52298-F193.820.88

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (5): 20; 43; 112–116; 123–127; 133–134

Post-translational modifications (4): 2, 13, 18, 146

Mutagenesis-validated functional residues (13):

PositionPhenotype
20abolishes mrna cap-binding.
20strongly impairs mrna cap-binding.
25does not affect mrna cap-binding.
43abolishes mrna cap-binding.
43does not affect mrna cap-binding.
46does not affect mrna cap-binding.
83abolishes mrna cap-binding.
85impairs mrna cap-binding.
112does not affect mrna cap-binding.
114does not affect mrna cap-binding.
116abolishes mrna cap-binding.
119does not affect mrna cap-binding.
138does not affect mrna cap-binding.

Function

Pathways and Gene Ontology

Reactome pathways

61 pathways

IDPathway
R-HSA-111367SLBP independent Processing of Histone Pre-mRNAs
R-HSA-112382Formation of RNA Pol II elongation complex
R-HSA-113418Formation of the Early Elongation Complex
R-HSA-159227Transport of the SLBP independent Mature mRNA
R-HSA-159230Transport of the SLBP Dependant Mature mRNA
R-HSA-159231Transport of Mature mRNA Derived from an Intronless Transcript
R-HSA-159236Transport of Mature mRNA derived from an Intron-Containing Transcript
R-HSA-167152Formation of HIV elongation complex in the absence of HIV Tat
R-HSA-167158Formation of the HIV-1 Early Elongation Complex
R-HSA-167200Formation of HIV-1 elongation complex containing HIV-1 Tat
R-HSA-167242Abortive elongation of HIV-1 transcript in the absence of Tat
R-HSA-191859snRNP Assembly
R-HSA-674695RNA Polymerase II Pre-transcription Events
R-HSA-6803529FGFR2 alternative splicing
R-HSA-6807505RNA polymerase II transcribes snRNA genes
R-HSA-72086mRNA Capping
R-HSA-72163mRNA Splicing - Major Pathway
R-HSA-72165mRNA Splicing - Minor Pathway
R-HSA-72187mRNA 3’-end processing
R-HSA-72203Processing of Capped Intron-Containing Pre-mRNA
R-HSA-73856RNA Polymerase II Transcription Termination
R-HSA-77588SLBP Dependent Processing of Replication-Dependent Histone Pre-mRNAs
R-HSA-77595Processing of Intronless Pre-mRNAs
R-HSA-8851708Signaling by FGFR2 IIIa TM
R-HSA-9010553Regulation of expression of SLITs and ROBOs
R-HSA-975956Nonsense Mediated Decay (NMD) independent of the Exon Junction Complex (EJC)
R-HSA-975957Nonsense Mediated Decay (NMD) enhanced by the Exon Junction Complex (EJC)
R-HSA-9770562mRNA Polyadenylation
R-HSA-9918481Dengue Virus-Host Interactions
R-HSA-9930044Nuclear RNA decay

MSigDB gene sets: 305 (showing top): GSE45365_CD8A_DC_VS_CD11B_DC_IFNAR_KO_UP, MORF_MTA1, GOBP_CYTOPLASMIC_TRANSLATION, BORCZUK_MALIGNANT_MESOTHELIOMA_UP, RODRIGUES_THYROID_CARCINOMA_ANAPLASTIC_UP, RODRIGUES_THYROID_CARCINOMA_POORLY_DIFFERENTIATED_UP, PAL_PRMT5_TARGETS_UP, GOBP_POSITIVE_REGULATION_OF_MRNA_PROCESSING, GOBP_ALTERNATIVE_MRNA_SPLICING_VIA_SPLICEOSOME, REACTOME_SIGNALING_BY_FGFR, GOBP_MACROMOLECULE_CATABOLIC_PROCESS, DACOSTA_UV_RESPONSE_VIA_ERCC3_UP, GOBP_TRANSLATIONAL_INITIATION, MORF_HDAC2, MODULE_16

GO Biological Process (23): nuclear-transcribed mRNA catabolic process, nonsense-mediated decay (GO:0000184), alternative mRNA splicing, via spliceosome (GO:0000380), mRNA splicing, via spliceosome (GO:0000398), cap-dependent translational initiation (GO:0002191), mRNA export from nucleus (GO:0006406), snRNA export from nucleus (GO:0006408), regulation of translational initiation (GO:0006446), histone mRNA metabolic process (GO:0008334), RNA splicing (GO:0008380), mRNA metabolic process (GO:0016071), primary miRNA processing (GO:0031053), mRNA 3’-end processing (GO:0031124), positive regulation of mRNA 3’-end processing (GO:0031442), positive regulation of transcription elongation by RNA polymerase II (GO:0032968), regulatory ncRNA-mediated post-transcriptional gene silencing (GO:0035194), miRNA-mediated post-transcriptional gene silencing (GO:0035195), mRNA transcription by RNA polymerase II (GO:0042789), mRNA cis splicing, via spliceosome (GO:0045292), positive regulation of RNA export from nucleus (GO:0046833), mRNA processing (GO:0006397), regulation of translation (GO:0006417), regulatory ncRNA-mediated gene silencing (GO:0031047), mRNA transport (GO:0051028)

GO Molecular Function (8): RNA cap binding (GO:0000339), RNA 7-methylguanosine cap binding (GO:0000340), DNA binding (GO:0003677), RNA binding (GO:0003723), mRNA binding (GO:0003729), snRNA binding (GO:0017069), nucleic acid binding (GO:0003676), protein binding (GO:0005515)

GO Cellular Component (7): nucleus (GO:0005634), nucleoplasm (GO:0005654), cytoplasm (GO:0005737), cytosol (GO:0005829), nuclear cap binding complex (GO:0005846), RNA cap binding complex (GO:0034518), ciliary basal body (GO:0036064)

Reactome top-level categories

Rollup of top-13 pathways:

CategoryPathways
Transport of Mature mRNAs Derived from Intronless Transcripts3
RNA Polymerase II Transcription Elongation2
HIV Transcription Elongation2
RNA Polymerase II Transcription2
Metabolism of RNA2
mRNA Splicing2
Processing of Capped Intronless Pre-mRNA1
Transport of Mature Transcript to Cytoplasm1
Transcription of the HIV genome1
Tat-mediated elongation of the HIV-1 transcript1
Metabolism of non-coding RNA1
Signaling by FGFR21
Processing of Capped Intron-Containing Pre-mRNA1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA export from nucleus3
RNA binding3
cellular anatomical structure3
mRNA splicing, via spliceosome2
mRNA processing2
mRNA metabolic process2
RNA processing2
nucleic acid binding2
binding2
nuclear-transcribed mRNA catabolic process1
RNA splicing, via transesterification reactions with bulged adenosine as nucleophile1
cytoplasmic translational initiation1
cap-dependent translation initiation factor activity1
gene expression1
mRNA transport1
snRNA transport1
translational initiation1
regulation of translation1
RNA metabolic process1
miRNA processing1
RNA 3’-end processing1
mRNA 3’-end processing1
regulation of mRNA 3’-end processing1
positive regulation of mRNA processing1
transcription elongation by RNA polymerase II1
positive regulation of DNA-templated transcription, elongation1
regulation of transcription elongation by RNA polymerase II1
positive regulation of transcription by RNA polymerase II1
post-transcriptional gene silencing1
regulatory ncRNA-mediated gene silencing1
regulatory ncRNA-mediated post-transcriptional gene silencing1
transcription by RNA polymerase II1
mRNA transcription1
positive regulation of nucleobase-containing compound transport1
positive regulation of nucleocytoplasmic transport1
regulation of RNA export from nucleus1
RNA cap binding1
intracellular membrane-bounded organelle1
nuclear lumen1
intracellular anatomical structure1

Protein interactions and networks

STRING

2994 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
NCBP2NCBP1Q09161999
NCBP2SRRTQ9BXP5973
NCBP2CTIFO43310928
NCBP2EIF4G1Q04637907
NCBP2EIF4EP06730899
NCBP2PHAXQ9H814897
NCBP2UPF2Q9HAU5866
NCBP2PABPN1Q86U42859
NCBP2UPF3AQ9H1J1857
NCBP2XPO1O14980757
NCBP2UPF1Q92900750
NCBP2MTREXP42285702
NCBP2SF3B4Q15427700
NCBP2SNRPCP09234694
NCBP2CDC5LQ99459678

IntAct

136 interactions, top by confidence:

ABTypeScore
NCBP2NCBP1psi-mi:“MI:0407”(direct interaction)0.960
NCBP1NCBP2psi-mi:“MI:0407”(direct interaction)0.960
NCBP1NCBP2psi-mi:“MI:0915”(physical association)0.960
NCBP1NCBP2psi-mi:“MI:2364”(proximity)0.960

BioGRID (191): LZTS2 (Two-hybrid), NCBP2 (Affinity Capture-MS), ADPRHL2 (Co-fractionation), NCBP1 (Co-fractionation), NCBP2 (Co-fractionation), SYNCRIP (Co-fractionation), NCBP2 (Affinity Capture-MS), ZBTB48 (Affinity Capture-MS), KPNA2 (Affinity Capture-MS), KPNA3 (Affinity Capture-MS), KPNA4 (Affinity Capture-MS), NCBP1 (Affinity Capture-MS), NUP98 (Affinity Capture-MS), RPL10 (Affinity Capture-MS), STX3 (Affinity Capture-MS)

ESM2 similar proteins: A2SW84, A6PVI3, A8Y1R8, B0W939, B1WC40, B3LYP1, B3P0D7, B4GLK8, B4IBA4, B4JUT1, B4KCD5, B4LZ88, B4M205, B4NB54, B4PL68, B4QV17, B5G279, B7P877, C0H859, C1BY64, C6Y4C0, O94290, P25555, P36629, P40565, P52298, P52299, Q00916, Q05AT9, Q08920, Q177H0, Q1HE01, Q293V6, Q3ZBJ1, Q4IE79, Q54KR9, Q5ZKR5, Q6DES0, Q6FNF9, Q754W7

Diamond homologs: A0A0D1C8Z4, A2SW84, A6PVI3, A8Y1R8, B0W939, B1WC40, B3LYP1, B3P0D7, B4GLK8, B4IBA4, B4JUT1, B4KCD5, B4LZ88, B4M205, B4NB54, B4PL68, B4QV17, B5DGI7, B5G279, B7P877, C0H859, C0HFE5, C1BY64, O35698, O89086, P19684, P27476, P33240, P52298, P52299, P60824, P60825, P60826, P62995, P62996, P62997, P78795, P98179, Q05AT9, Q08920

SIGNOR signaling

1 interactions.

AEffectBMechanism
NCBP2“form complex”“Cap-binding complex”binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 92 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
HIV Transcription Elongation524.3×5e-05
Dengue Virus Genome Translation and Replication523.0×7e-05
mRNA Splicing1320.7×5e-12
mRNA 3’-end processing720.0×3e-06
RNA Polymerase II Transcription Termination619.1×2e-05
Formation of HIV-1 elongation complex containing HIV-1 Tat518.8×1e-04
Tat-mediated elongation of the HIV-1 transcript518.8×1e-04
Formation of HIV elongation complex in the absence of HIV Tat518.0×2e-04

GO biological processes:

GO termPartnersFoldFDR
U2-type prespliceosome assembly538.1×3e-05
spliceosomal complex assembly536.7×3e-05
mRNA export from nucleus828.8×9e-08
nuclear-transcribed mRNA catabolic process, nonsense-mediated decay528.5×8e-05
mRNA splicing, via spliceosome1617.9×3e-13
regulation of alternative mRNA splicing, via spliceosome514.9×1e-03
RNA splicing1111.8×4e-07
mRNA processing87.7×7e-04

Disease & clinical

Clinical variants and AI predictions

ClinVar

20 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance12
Likely benign1
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1147 predictions. Top by Δscore:

VariantEffectΔscore
3:196937552:AAAG:Adonor_gain1.0000
3:196939252:C:CAdonor_gain1.0000
3:196939282:T:TAdonor_gain1.0000
3:196939429:CACC:Cacceptor_gain1.0000
3:196939431:CC:Cacceptor_gain1.0000
3:196939432:CC:Cacceptor_gain1.0000
3:196939433:CTAG:Cacceptor_loss1.0000
3:196937079:GAACC:Gacceptor_loss0.9900
3:196937080:AACCT:Aacceptor_loss0.9900
3:196937081:ACCTT:Aacceptor_loss0.9900
3:196937082:CCTTT:Cacceptor_loss0.9900
3:196937083:C:Aacceptor_loss0.9900
3:196937084:T:Aacceptor_loss0.9900
3:196937547:T:TAdonor_gain0.9900
3:196939196:C:CTdonor_gain0.9900
3:196939205:G:Cdonor_gain0.9900
3:196939211:C:Adonor_gain0.9900
3:196939249:A:ACdonor_gain0.9900
3:196939250:C:CCdonor_gain0.9900
3:196939392:CAG:Cdonor_gain0.9900
3:196939428:TCACC:Tacceptor_gain0.9900
3:196939429:CACCC:Cacceptor_gain0.9900
3:196939433:C:CCacceptor_gain0.9900
3:196939434:T:Aacceptor_loss0.9900
3:196942379:G:Cdonor_gain0.9900
3:196942387:G:Adonor_gain0.9900
3:196942399:C:Adonor_gain0.9900
3:196942419:C:Adonor_gain0.9900
3:196942420:CCTCA:Cdonor_loss0.9900
3:196942421:CTCA:Cdonor_loss0.9900

AlphaMissense

1018 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
3:196937540:C:AR123S1.000
3:196937540:C:GR123S1.000
3:196937541:C:AR123M1.000
3:196937541:C:GR123T1.000
3:196937583:C:GR109P1.000
3:196939256:A:CF85L1.000
3:196939256:A:TF85L1.000
3:196939257:A:GF85S1.000
3:196939258:A:GF85L1.000
3:196939259:A:CC84W1.000
3:196939260:C:TC84Y1.000
3:196939261:A:GC84R1.000
3:196939262:G:CF83L1.000
3:196939262:G:TF83L1.000
3:196939263:A:GF83S1.000
3:196939264:A:GF83L1.000
3:196939266:C:AG82V1.000
3:196939266:C:TG82E1.000
3:196939297:C:GG72R1.000
3:196939371:A:GL47P1.000
3:196939377:C:TG45E1.000
3:196939380:A:TV44D1.000
3:196937517:C:TG131E0.999
3:196937526:C:TG128E0.999
3:196937527:C:GG128R0.999
3:196937527:C:TG128R0.999
3:196937530:G:TR127S0.999
3:196937532:C:TG126D0.999
3:196937533:C:GG126R0.999
3:196937552:A:CF119L0.999

dbSNP variants (sampled 300 via entrez): RS1000108563 (3:196941161 G>A,C,T), RS1000352863 (3:196936214 GA>G), RS1000807537 (3:196940812 T>G), RS1000859702 (3:196941070 T>G), RS1000883245 (3:196935395 C>G), RS1001237265 (3:196941428 C>G), RS1001251725 (3:196935165 C>T), RS1001268638 (3:196941693 C>A,G,T), RS1001383458 (3:196935765 T>G), RS1001881886 (3:196936098 T>C), RS1003253449 (3:196944136 A>G), RS1003487448 (3:196943571 C>A), RS1003562695 (3:196942548 C>A,G), RS1003593663 (3:196942656 G>A), RS1003610362 (3:196943901 C>A,T)

Disease associations

OMIM: gene MIM:605133 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

3 associations (top):

StudyTraitp-value
GCST003542_204Night sleep phenotypes9.000000e-06
GCST008758_42Pre-treatment viral load in HIV-1 infection2.000000e-17
GCST010725_2Malaria8.000000e-06

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0010125viral load

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (2): CHEMBL4665589 (PROTEIN COMPLEX), CHEMBL6066873 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

ChEMBL bioactivities

2 potent at pChembl≥5 of 2 total, top 2 by pChembl (potency: 10 = 0.1 nM, 6 = 1 µM).

pChemblTypeValueUnitMolecule
8.78Kd1.678nMCHEMBL3752910
8.64ED502.293nMCHEMBL3752910

PubChem BioAssay actives

1 with measured affinity, of 2 total; 1 most potent distinct compounds. Largely complementary to BindingDB; screening values are coarse (µM, 4 dp), so sub-nM hits tie at the floor.

CompoundAssayTypeValueUnit
4-methyl-3-[(1-methyl-6-pyridin-3-ylpyrazolo[3,4-d]pyrimidin-4-yl)amino]-N-[3-(trifluoromethyl)phenyl]benzamide2149895: Binding affinity to human NCBP2 incubated for 45 mins by Kinobead based pull down assaykd0.0017uM

CTD chemical–gene interactions

31 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
bisphenol Adecreases expression2
trichostatin Aaffects cotreatment, increases expression2
cobaltous chloridedecreases expression2
aristolochic acid Idecreases expression1
dicrotophosdecreases expression1
triphenyl phosphateaffects expression1
tris(1,3-dichloro-2-propyl)phosphatedecreases expression1
sodium arseniteaffects binding, increases reaction1
bleomycetinincreases expression1
N,N,N’,N’-tetrakis(2-pyridylmethyl)ethylenediamineincreases expression1
tamibarotenedecreases expression1
di-n-butylphosphoric acidaffects expression1
perfluorooctane sulfonic acidincreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, increases expression1
bisphenol Bincreases expression1
dorsomorphinaffects cotreatment, increases expression1
(4-amino-1,4-dihydro-3-(2-pyridyl)-5-thioxo-1,2,4-triazole)copper(II)increases expression1
bisphenol AFincreases expression1
Resveratrolaffects cotreatment, increases expression1
Acetaminophendecreases expression1
Cadmiumincreases abundance, increases expression1
Cisplatinincreases expression1
Diethylstilbestroldecreases expression1
Nickelincreases expression1
Plant Extractsaffects cotreatment, increases expression1
Smokedecreases expression1
Tobacco Smoke Pollutionincreases expression1
Tretinoindecreases expression1
Cyclosporinedecreases expression1
Aflatoxin B1increases methylation1

ChEMBL screening assays

1 unique, capped per target: 1 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL5652937BindingBinding affinity to human NCBP2 incubated for 45 mins by Kinobead based pull down assayNVP-BHG712: Effects of Regioisomers on the Affinity and Selectivity toward the EPHrin Family. — ChemMedChem

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.