KHDC4

gene
On this page

Also known as BLOM7SNORA80EHG

Summary

KHDC4 (KH domain containing 4, pre-mRNA splicing factor, HGNC:29145) is a protein-coding gene on chromosome 1q22, encoding KH homology domain-containing protein 4 (Q7Z7F0). RNA-binding protein involved in pre-mRNA splicing.

Enables RNA binding activity. Involved in mRNA splice site recognition. Located in cytoplasm; nucleoplasm; and spliceosomal complex.

Source: NCBI Gene 22889 — RefSeq curated summary.

At a glance

  • GWAS associations: 16
  • Clinical variants (ClinVar): 103 total
  • MANE Select transcript: NM_014949

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:29145
Approved symbolKHDC4
NameKH domain containing 4, pre-mRNA splicing factor
Location1q22
Locus typegene with protein product
StatusApproved
AliasesBLOM7, SNORA80EHG
Ensembl geneENSG00000132680
Ensembl biotypeprotein_coding
OMIM619370
Entrez22889

Gene structure

Transcript identifiers

Ensembl transcripts: 12 — 7 protein_coding_CDS_not_defined, 5 protein_coding

ENST00000368319, ENST00000368320, ENST00000368321, ENST00000465953, ENST00000466520, ENST00000466713, ENST00000478002, ENST00000482337, ENST00000483237, ENST00000491599, ENST00000916409, ENST00000916410

RefSeq mRNA: 1 — MANE Select: NM_014949 NM_014949

CCDS: CCDS30885

Canonical transcript exons

ENST00000368321 — 14 exons

ExonStartEnd
ENSE00001054158155916625155916737
ENSE00001446866155913045155914320
ENSE00001446867155934336155934413
ENSE00003464365155923627155923687
ENSE00003478287155915873155915964
ENSE00003572139155929296155929375
ENSE00003581371155921375155921628
ENSE00003597421155926676155926839
ENSE00003599469155921861155921918
ENSE00003609530155927104155927156
ENSE00003622417155933633155933849
ENSE00003626984155917499155917672
ENSE00003639961155925632155925843
ENSE00003660321155929712155929840

Expression profiles

Bgee: expression breadth ubiquitous, 293 present calls, max score 96.84.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 24.1341 / max 226.2115, expressed in 1811 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
1500116.43691800
150007.57981734
149970.117432

Top tissues by expression

295 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
tibiaUBERON:000097996.84gold quality
endocervixUBERON:000045896.10gold quality
pylorusUBERON:000116696.09gold quality
corpus epididymisUBERON:000435995.95gold quality
right ovaryUBERON:000211895.86gold quality
cardia of stomachUBERON:000116295.82gold quality
tendon of biceps brachiiUBERON:000818895.81gold quality
left ovaryUBERON:000211995.76gold quality
skin of hipUBERON:000155495.55gold quality
ectocervixUBERON:001224995.41gold quality
rectumUBERON:000105295.40gold quality
minor salivary glandUBERON:000183095.32gold quality
adenohypophysisUBERON:000219695.26gold quality
ovaryUBERON:000099295.12gold quality
pituitary glandUBERON:000000795.11gold quality
fundus of stomachUBERON:000116095.09gold quality
tibial nerveUBERON:000132395.05gold quality
upper leg skinUBERON:000426295.04gold quality
metanephros cortexUBERON:001053394.99gold quality
vaginaUBERON:000099694.95gold quality
body of uterusUBERON:000985394.91gold quality
cerebellar hemisphereUBERON:000224594.80gold quality
cerebellar cortexUBERON:000212994.75gold quality
body of stomachUBERON:000116194.71gold quality
renal medullaUBERON:000036294.67gold quality
right uterine tubeUBERON:000130294.63gold quality
small intestine Peyer’s patchUBERON:000345494.62gold quality
mouth mucosaUBERON:000372994.56gold quality
seminal vesicleUBERON:000099894.53gold quality
right hemisphere of cerebellumUBERON:001489094.53gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.27
E-MTAB-7606no17.63

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

107 targeting KHDC4, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-LET-7A-3P100.0074.033932
HSA-LET-7B-3P100.0074.083913
HSA-LET-7F-1-3P100.0074.023928
HSA-MIR-98-3P100.0074.083907
HSA-MIR-548AW99.9972.573559
HSA-MIR-19A-3P99.9875.332762
HSA-MIR-19B-3P99.9875.442754
HSA-MIR-1213699.9872.815713
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-MIR-569699.9872.364487
HSA-MIR-548AT-5P99.9670.832666
HSA-MIR-551B-5P99.9671.283493
HSA-MIR-1468-3P99.9672.743797
HSA-MIR-1250-3P99.9670.044038
HSA-MIR-590-3P99.9674.346478
HSA-MIR-96-5P99.9572.802140
HSA-MIR-1213399.9271.822006
HSA-MIR-205-3P99.9269.923165
HSA-MIR-1271-5P99.9171.991972
HSA-MIR-3529-3P99.9073.553045
HSA-MIR-374B-5P99.9069.982734
HSA-MIR-374A-5P99.9071.342923
HSA-MIR-449699.8868.892236
HSA-MIR-137-3P99.8774.742401
HSA-MIR-182-5P99.8774.032589
HSA-MIR-612499.8769.783551
HSA-MIR-1211999.8768.351653
HSA-MIR-5582-3P99.8672.484221
HSA-MIR-579-3P99.8671.663628

Literature-anchored findings (GeneRIF, showing 2)

  • Blom7alpha is a novel heterogeneous nuclear ribonucleoprotein K homology domain protein involved in pre-mRNA splicing and interacting with the CDC5L-SNEV(Prp19-Pso4) complex. (PMID:19641227)
  • Results suggest that the RNA binding activity of Blom7alpha might be required for pre-mRNA splicing catalysis. (PMID:23144703)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
danio_reriokhdc4ENSDARG00000035023
mus_musculusKhdc4ENSMUSG00000028060
rattus_norvegicusKhdc4ENSRNOG00000020139

Protein

Protein identifiers

KH homology domain-containing protein 4Q7Z7F0 (reviewed: Q7Z7F0)

Alternative names: Brings lots of money 7, Pre-mRNA splicing factor protein KHDC4

All UniProt accessions (1): Q7Z7F0

UniProt curated annotations — full annotation on UniProt →

Function. RNA-binding protein involved in pre-mRNA splicing. Interacts with the PRP19C/Prp19 complex/NTC/Nineteen complex which is part of the spliceosome. Involved in regulating splice site selection. Binds preferentially RNA with A/C rich sequences and poly-C stretches.

Subunit / interactions. Interacts with PRPF19. Interacts with U2AF65.

Subcellular location. Nucleus. Cytoplasm.

Tissue specificity. Ubiquitous. Expressed at high level in skeletal muscle, kidney, heart, brain and liver.

Domain organisation. The C-terminal part is necessary for the interaction with the PRP19C/Prp19 complex/NTC/Nineteen complex. The KH domains mediate RNA-binding.

Miscellaneous. Interacts with U2AF65.

Similarity. Belongs to the KHDC4 family.

Isoforms (4)

UniProt IDNamesCanonical?
Q7Z7F0-11, Alpha, Blom7alphayes
Q7Z7F0-22, Beta, KIAA0907
Q7Z7F0-33, Gamma
Q7Z7F0-44

RefSeq proteins (1): NP_055764* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR031121RIK/BLOM7Family
IPR036612KH_dom_type_1_sfHomologous_superfamily
IPR047889KHDC4_KH-I_secondDomain
IPR047890KHDC4_KH-I_firstDomain
IPR055256KH_1_KHDC4/BBP-likeDomain
IPR056149PRP5/DDX46/KHDC4_KHDomain

Pfam: PF22675, PF23469

UniProt features (31 total): compositionally biased region 6, splice variant 6, strand 4, helix 4, region of interest 4, domain 2, modified residue 2, turn 2, chain 1

Structure

Experimental structures (PDB)

1 structures.

PDBMethodResolution (Å)
2YQRSOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q7Z7F0-F163.070.28

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (2): 571, 572

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 148 (showing top): GCM_GSPT1, KAAB_FAILED_HEART_ATRIUM_DN, BROWNE_HCMV_INFECTION_8HR_UP, MONNIER_POSTRADIATION_TUMOR_ESCAPE_UP, BROWNE_HCMV_INFECTION_24HR_UP, GOBP_PROTEIN_RNA_COMPLEX_ORGANIZATION, GOBP_RNA_SPLICING, chr1q22, GOBP_MRNA_SPLICE_SITE_RECOGNITION, CAIRO_HEPATOBLASTOMA_UP, CTTTGTA_MIR524, ACEVEDO_LIVER_CANCER_UP, GOBP_RIBONUCLEOPROTEIN_COMPLEX_BIOGENESIS, TTTGCAC_MIR19A_MIR19B, GINESTIER_BREAST_CANCER_ZNF217_AMPLIFIED_DN

GO Biological Process (3): mRNA splice site recognition (GO:0006376), mRNA processing (GO:0006397), RNA splicing (GO:0008380)

GO Molecular Function (2): RNA binding (GO:0003723), protein binding (GO:0005515)

GO Cellular Component (4): nucleus (GO:0005634), nucleoplasm (GO:0005654), cytoplasm (GO:0005737), spliceosomal complex (GO:0005681)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA processing2
cellular anatomical structure2
spliceosomal complex assembly1
protein-RNA complex assembly1
mRNA metabolic process1
nucleic acid binding1
binding1
intracellular membrane-bounded organelle1
nuclear lumen1
intracellular anatomical structure1
nuclear protein-containing complex1
ribonucleoprotein complex1

Protein interactions and networks

STRING

470 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
KHDC4PRPF19Q9UMS4571
KHDC4LUC7LQ9NQ29475
KHDC4FUBP3Q96I24473
KHDC4YAE1Q9NRH1471
KHDC4EFCAB12Q6NXP0462
KHDC4CDC5LQ99459462
KHDC4SF1Q15637456
KHDC4ZNF529Q6P280450
KHDC4LUC7L3O95232445
KHDC4CWC15Q9P013440
KHDC4SMC5Q8IY18429
KHDC4NOVA1P51513404
KHDC4CHERPQ8IWX8387
KHDC4SLC68A1Q14CX5384
KHDC4RABL6Q3YEC7379

IntAct

29 interactions, top by confidence:

ABTypeScore
CLOCKBMAL1psi-mi:“MI:0914”(association)0.880
KHDC4RBM39psi-mi:“MI:0915”(physical association)0.560
RBM39KHDC4psi-mi:“MI:0915”(physical association)0.560
PUF60KHDC4psi-mi:“MI:0915”(physical association)0.560
KHDC4TYMPpsi-mi:“MI:0914”(association)0.530
STARD7TLE5psi-mi:“MI:0914”(association)0.530
U2AF2KHDC4psi-mi:“MI:0915”(physical association)0.520
ZDHHC17KHDC4psi-mi:“MI:0915”(physical association)0.370
KHDC4IGKCpsi-mi:“MI:0914”(association)0.350
STARD7TLE3psi-mi:“MI:0914”(association)0.350
lpqNSOWAHDpsi-mi:“MI:0914”(association)0.350
KHDC4DNAJC5psi-mi:“MI:2364”(proximity)0.270
NOLC1ILVBLpsi-mi:“MI:2364”(proximity)0.270
SF3B4MED19psi-mi:“MI:2364”(proximity)0.270
U2AF2NACApsi-mi:“MI:2364”(proximity)0.270
KHDC4SF1psi-mi:“MI:0915”(physical association)0.000
SF1KHDC4psi-mi:“MI:0915”(physical association)0.000
KHDC4psi-mi:“MI:0915”(physical association)0.000
KHDC4maP3psi-mi:“MI:0915”(physical association)0.000

BioGRID (77): KIAA0907 (Two-hybrid), KIAA0907 (Two-hybrid), KIAA0907 (Affinity Capture-RNA), KIAA0907 (Affinity Capture-RNA), KIAA0907 (Affinity Capture-RNA), KIAA0907 (Two-hybrid), KIAA0907 (Affinity Capture-MS), KIAA0907 (Two-hybrid), KIAA0907 (Co-fractionation), KIAA0907 (Co-fractionation), KIAA0907 (Proximity Label-MS), KIAA0907 (Affinity Capture-MS), SERPINB5 (Affinity Capture-MS), CALML5 (Affinity Capture-MS), IGHG2 (Affinity Capture-MS)

ESM2 similar proteins: A0JM64, A4IHD9, B5DF93, D3ZTQ1, E7F1H9, O70305, P70501, Q12872, Q13625, Q2NLB0, Q2T9I5, Q32SG5, Q3TC46, Q3TCX3, Q3USH5, Q498K9, Q566L7, Q5R413, Q5R8Q4, Q5SFM8, Q5T8P6, Q5ZL54, Q62415, Q66IJ0, Q68FI1, Q6DDU9, Q6GP15, Q6NXI6, Q6NZ18, Q6NZN0, Q6PEH8, Q7Z7F0, Q86TB9, Q8BG81, Q8CG79, Q8CGC4, Q8IZD4, Q8K2F8, Q8ND56, Q8R205

Diamond homologs: A0JM64, Q2NLB0, Q3TCX3, Q5ZL54, Q6NZ18, Q7Z7F0, Q8H0U8, Q9LIA4, Q9SF41, A7ENE0, Q32SG5, Q4TVV3, Q553B1, Q2HAD8, Q75GR5

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 21 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
mRNA Splicing - Major Pathway519.5×2e-04

Disease & clinical

Clinical variants and AI predictions

ClinVar

103 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance74
Likely benign1
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

1507 predictions. Top by Δscore:

VariantEffectΔscore
1:155914321:C:CCacceptor_gain1.0000
1:155915867:ACTC:Adonor_loss1.0000
1:155915868:CTCA:Cdonor_loss1.0000
1:155915869:TCA:Tdonor_loss1.0000
1:155915870:CACC:Cdonor_loss1.0000
1:155915871:A:ATdonor_loss1.0000
1:155915880:C:CAdonor_gain1.0000
1:155915961:CTGC:Cacceptor_gain1.0000
1:155915962:TGC:Tacceptor_gain1.0000
1:155915963:GC:Gacceptor_gain1.0000
1:155915964:CCTAT:Cacceptor_gain1.0000
1:155915965:C:CCacceptor_gain1.0000
1:155915968:T:Cacceptor_gain1.0000
1:155915968:T:TCacceptor_gain1.0000
1:155916620:CTTA:Cdonor_loss1.0000
1:155916621:TTACC:Tdonor_loss1.0000
1:155916622:TA:Tdonor_loss1.0000
1:155916623:ACCT:Adonor_loss1.0000
1:155916624:C:CGdonor_loss1.0000
1:155916624:CCTGT:Cdonor_gain1.0000
1:155916734:CATG:Cacceptor_gain1.0000
1:155916738:C:CCacceptor_gain1.0000
1:155917493:TTTAA:Tdonor_loss1.0000
1:155917494:TTAAC:Tdonor_loss1.0000
1:155917496:AACCT:Adonor_loss1.0000
1:155917497:A:ATdonor_loss1.0000
1:155917497:A:Cdonor_gain1.0000
1:155917498:CCTG:Cdonor_loss1.0000
1:155917671:CT:Cacceptor_gain1.0000
1:155917673:C:CCacceptor_gain1.0000

AlphaMissense

3950 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
1:155921917:A:TV319D1.000
1:155923637:A:GL315P1.000
1:155923637:A:TL315H1.000
1:155923645:A:CC312W1.000
1:155923646:C:TC312Y1.000
1:155923647:A:GC312R1.000
1:155923649:A:GL311P1.000
1:155923658:G:TA308D1.000
1:155923659:C:GA308P1.000
1:155923667:A:GL305P1.000
1:155923685:T:CH299R1.000
1:155925635:A:TI297N1.000
1:155925641:A:TI295N1.000
1:155925654:C:TE291K1.000
1:155925655:A:CF290L1.000
1:155925655:A:TF290L1.000
1:155925657:A:GF290L1.000
1:155925665:C:GR287P1.000
1:155925681:C:TE282K1.000
1:155925701:C:AG275V1.000
1:155925701:C:GG275A1.000
1:155925701:C:TG275D1.000
1:155925702:C:AG275C1.000
1:155925702:C:GG275R1.000
1:155925702:C:TG275S1.000
1:155925704:C:GR274P1.000
1:155925707:A:GL273P1.000
1:155925707:A:TL273Q1.000
1:155925713:A:TV271D1.000
1:155925719:G:TA269D1.000

dbSNP variants (sampled 300 via entrez): RS1000006697 (1:155932886 G>A,C), RS1000124773 (1:155935243 GAC>G), RS1000323098 (1:155928816 G>A), RS1000938679 (1:155919923 C>T), RS1001187392 (1:155933214 T>C), RS1001258993 (1:155927807 A>C,G), RS1001347448 (1:155914560 A>G), RS1001406985 (1:155920244 C>G), RS1001697220 (1:155926295 T>C), RS1002126153 (1:155931626 T>C), RS1002278779 (1:155912743 C>A,G,T), RS1002298863 (1:155912637 AGAACCACTT>A), RS1002531475 (1:155919211 A>C), RS1002788468 (1:155923235 T>C), RS1002810304 (1:155934362 C>A,T)

Disease associations

OMIM: gene MIM:619370 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

16 associations (top):

StudyTraitp-value
GCST004131_70Inflammatory bowel disease6.000000e-08
GCST004132_44Crohn’s disease2.000000e-07
GCST007294_124Body fat distribution (trunk fat ratio)8.000000e-35
GCST007294_3Body fat distribution (trunk fat ratio)6.000000e-21
GCST007294_50Body fat distribution (trunk fat ratio)1.000000e-15
GCST007295_17Body fat distribution (leg fat ratio)3.000000e-13
GCST007295_37Body fat distribution (leg fat ratio)7.000000e-17
GCST007295_72Body fat distribution (leg fat ratio)1.000000e-28
GCST010696_19Cortical thickness (min-P)2.000000e-10
GCST010697_10Cortical surface area (min-P)3.000000e-10
GCST010698_59Subcortical volume (min-P)9.000000e-10
GCST010699_20Brain morphology (min-P)7.000000e-10
GCST010700_5Cortical thickness (MOSTest)8.000000e-17
GCST010701_66Cortical surface area (MOSTest)1.000000e-09
GCST010702_43Subcortical volume (MOSTest)3.000000e-10
GCST010703_253Brain morphology (MOSTest)4.000000e-14

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0004341body fat distribution
EFO:0004346neuroimaging measurement
EFO:0004840cortical thickness

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

36 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects expression, decreases expression2
alpha-pineneaffects cotreatment, increases oxidation, increases abundance1
bisphenol Adecreases expression1
deoxynivalenolincreases expression1
tris(1,3-dichloro-2-propyl)phosphateincreases expression1
sodium arseniteincreases expression1
butyraldehydeincreases expression1
nickel chloridedecreases expression1
zinc chromateincreases abundance, increases expression1
potassium chromate(VI)affects cotreatment, increases expression1
resorcinolincreases expression1
methacrylaldehydeaffects cotreatment, increases oxidation, increases abundance1
epigallocatechin gallateaffects cotreatment, increases expression1
chromium hexavalent ionincreases abundance, increases expression1
perfluorooctane sulfonic aciddecreases expression1
K 7174increases expression1
abrineincreases expression1
(4-amino-1,4-dihydro-3-(2-pyridyl)-5-thioxo-1,2,4-triazole)copper(II)increases expression1
jinfukangaffects cotreatment, decreases expression1
Leflunomidedecreases expression1
Acroleinaffects cotreatment, increases oxidation, increases abundance1
Air Pollutantsaffects cotreatment, increases abundance, increases oxidation1
Benzo(a)pyreneaffects methylation1
Caffeinedecreases phosphorylation1
Cisplatinaffects cotreatment, decreases expression1
Doxorubicindecreases expression1
Estradioldecreases expression1
Ozoneincreases abundance, affects cotreatment, increases oxidation1
Quercetinincreases expression1
Dronabinolincreases expression1

Cellosaurus cell lines

1 cell lines: 1 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_B1V6Abcam HeLa KHDC4 KOCancer cell lineFemale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.