KHNYN

gene
On this page

Summary

KHNYN (KH and NYN domain containing, HGNC:20166) is a protein-coding gene on chromosome 14q12, encoding Protein KHNYN (O15037).

The protein encoded by this gene contains a ribonuclease NYN domain and belongs to the N4BP1 family. The protein is a cofactor for the zinc finger antiviral protein (ZAP protein) which targets viral RNA for degradation and restricts SARS-CoV-2 infection.

Source: NCBI Gene 23351 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 159 total
  • MANE Select transcript: NM_015299

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:20166
Approved symbolKHNYN
NameKH and NYN domain containing
Location14q12
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000100441
Ensembl biotypeprotein_coding
OMIM619579
Entrez23351

Gene structure

Transcript identifiers

Ensembl transcripts: 17 — 16 protein_coding, 1 retained_intron

ENST00000251343, ENST00000553935, ENST00000554268, ENST00000556255, ENST00000556510, ENST00000556842, ENST00000859089, ENST00000859090, ENST00000859091, ENST00000859092, ENST00000859093, ENST00000859094, ENST00000859095, ENST00000859096, ENST00000859097, ENST00000944610, ENST00000944611

RefSeq mRNA: 3 — MANE Select: NM_015299 NM_001290256, NM_001290257, NM_015299

CCDS: CCDS32058

Canonical transcript exons

ENST00000553935 — 8 exons

ExonStartEnd
ENSE000006545002443146324432610
ENSE000006545012443272224432852
ENSE000008894702443071424430931
ENSE000008894722443607224436179
ENSE000015260462442995424430119
ENSE000034782142443293624433032
ENSE000036216272443638824436489
ENSE000036259982443703624441843

Expression profiles

Bgee: expression breadth ubiquitous, 273 present calls, max score 94.61.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 17.8738 / max 356.9954, expressed in 1810 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
13904417.49581809
1390450.3780167

Top tissues by expression

294 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
left ovaryUBERON:000211994.61gold quality
mucosa of stomachUBERON:000119994.28gold quality
right ovaryUBERON:000211894.22gold quality
rectumUBERON:000105292.84gold quality
small intestine Peyer’s patchUBERON:000345492.74gold quality
body of stomachUBERON:000116192.71gold quality
granulocyteCL:000009492.69gold quality
sural nerveUBERON:001548892.65gold quality
adrenal tissueUBERON:001830392.50gold quality
left uterine tubeUBERON:000130392.46gold quality
transverse colonUBERON:000115792.42gold quality
small intestineUBERON:000210892.19gold quality
body of uterusUBERON:000985392.13gold quality
body of pancreasUBERON:000115092.02gold quality
stomachUBERON:000094592.00gold quality
muscle layer of sigmoid colonUBERON:003580591.99gold quality
lower esophagus muscularis layerUBERON:003583391.87gold quality
lower esophagusUBERON:001347391.84gold quality
endocervixUBERON:000045891.80gold quality
esophagogastric junction muscularis propriaUBERON:003584191.80gold quality
endometrium epitheliumUBERON:000481191.74gold quality
ovaryUBERON:000099291.57gold quality
right coronary arteryUBERON:000162591.39gold quality
ectocervixUBERON:001224991.35gold quality
nerveUBERON:000102191.27gold quality
right uterine tubeUBERON:000130291.27gold quality
tibial nerveUBERON:000132391.27gold quality
right lungUBERON:000216791.18gold quality
esophagusUBERON:000104391.17gold quality
vermiform appendixUBERON:000115491.02gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no5.38

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

151 targeting KHNYN, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-4510100.0066.602050
HSA-MIR-6127100.0066.762188
HSA-MIR-6129100.0066.462080
HSA-MIR-6130100.0066.692012
HSA-MIR-6133100.0066.482064
HSA-MIR-4692100.0067.322066
HSA-MIR-7110-3P100.0073.182486
HSA-MIR-450099.9972.722367
HSA-MIR-196A-1-3P99.9972.152772
HSA-MIR-451499.9967.101870
HSA-MIR-3667-3P99.9967.171636
HSA-MIR-118499.9968.191458
HSA-LET-7A-5P99.9872.291790
HSA-LET-7B-5P99.9872.311790
HSA-LET-7C-5P99.9872.291790
HSA-LET-7E-5P99.9872.291790
HSA-LET-7F-5P99.9872.561784
HSA-LET-7G-5P99.9872.371784
HSA-LET-7I-5P99.9872.371788
HSA-MIR-98-5P99.9872.331787
HSA-MIR-314899.9775.066478
HSA-MIR-448799.9664.581252
HSA-LET-7D-5P99.9671.761632
HSA-MIR-445899.9671.641650
HSA-MIR-22-3P99.9368.13917

Literature-anchored findings (GeneRIF, showing 2)

  • KHNYN as a novel cofactor for ZAP to target CpG-containing retroviral RNA for degradation. (PMID:31284899)
  • Crystallization and biochemical studies of the NYN domain of human KHNYN. (PMID:38376822)

Cross-species orthologs

9 orthologs

OrganismSymbolGene ID
mus_musculusKhnynENSMUSG00000047153
rattus_norvegicusKhnynENSRNOG00000020506
drosophila_melanogasterRegnase-1FBGN0038769
drosophila_melanogasterCG42360FBGN0259742
caenorhabditis_elegansWBGENE00013107
caenorhabditis_elegansrege-1WBGENE00016260
caenorhabditis_elegansWBGENE00021287
caenorhabditis_elegansWBGENE00021998
caenorhabditis_elegansWBGENE00022003

Paralogs (6): ZC3H12B (ENSG00000102053), N4BP1 (ENSG00000102921), ZC3H12C (ENSG00000149289), ZC3H12A (ENSG00000163874), ZC3H12D (ENSG00000178199), NYNRIN (ENSG00000205978)

Protein

Protein identifiers

Protein KHNYNO15037 (reviewed: O15037)

Alternative names: KH and NYN domain-containing protein

All UniProt accessions (3): O15037, G3V331, G3V3G3

UniProt curated annotations — full annotation on UniProt →

Similarity. Belongs to the N4BP1 family.

RefSeq proteins (3): NP_001277185, NP_001277186, NP_056114* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR021869RNase_Zc3h12_NYNDomain
IPR036612KH_dom_type_1_sfHomologous_superfamily
IPR051101ZC3H12/N4BP1_RNase_RegFamily
IPR056578UBA_N4BP1_CDomain
IPR056629KH_N4BP1_1stDomain
IPR056630KH_N4BP1_2ndDomain

Pfam: PF11977, PF23050, PF23052, PF23054

UniProt features (33 total): helix 14, strand 6, region of interest 3, compositionally biased region 3, sequence variant 2, modified residue 2, chain 1, domain 1, turn 1

Structure

Experimental structures (PDB)

7 structures.

PDBMethodResolution (Å)
8WZCX-RAY DIFFRACTION1.93
9HTSX-RAY DIFFRACTION2.2
9BGLX-RAY DIFFRACTION2.29
9S2DX-RAY DIFFRACTION2.35
9CS9X-RAY DIFFRACTION2.5
2N5MSOLUTION NMR
2N7KSOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O15037-F169.120.35

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (2): 10, 359

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 159 (showing top): GSE18804_SPLEEN_MACROPHAGE_VS_TUMORAL_MACROPHAGE_UP, MORF_RAGE, AGGAAGC_MIR5163P, GOMF_ENDONUCLEASE_ACTIVITY, GOMF_RNA_NUCLEASE_ACTIVITY, GOBP_REGULATION_OF_MRNA_CATABOLIC_PROCESS, GOMF_NUCLEASE_ACTIVITY, GOBP_MACROMOLECULE_CATABOLIC_PROCESS, chr14q12, GOBP_POST_TRANSCRIPTIONAL_REGULATION_OF_GENE_EXPRESSION, DEURIG_T_CELL_PROLYMPHOCYTIC_LEUKEMIA_DN, GOMF_RNA_ENDONUCLEASE_ACTIVITY, GOBP_REGULATION_OF_CATABOLIC_PROCESS, GOBP_POSITIVE_REGULATION_OF_CATABOLIC_PROCESS, TTGGAGA_MIR5155P_MIR519E

GO Biological Process (1): biological_process (GO:0008150)

GO Molecular Function (5): mRNA binding (GO:0003729), RNA endonuclease activity (GO:0004521), molecular_function (GO:0003674), RNA binding (GO:0003723), protein binding (GO:0005515)

GO Cellular Component (3): nucleus (GO:0005634), cytoplasmic ribonucleoprotein granule (GO:0036464), cellular_component (GO:0005575)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA binding1
endonuclease activity1
RNA nuclease activity1
nucleic acid binding1
binding1
intracellular membrane-bounded organelle1
cytoplasm1
ribonucleoprotein granule1

Protein interactions and networks

STRING

602 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
KHNYNTRIM25Q14258718
KHNYNZC3HAV1Q7Z2W4717
KHNYNRNASELQ05823502
KHNYNUNKLQ9H9P5463
KHNYNADGRA3Q8IWK6449
KHNYNTNFRSF10DQ9UBN6420
KHNYNSDR39U1Q9NRG7413
KHNYNFAM171A1Q5VUB5393
KHNYNOAS3Q9Y6K5389
KHNYNXRN1Q8IZH2389
KHNYNGBP5Q96PP8389
KHNYNPARP12Q9H0J9366
KHNYNCAMKK1Q8N5S9364
KHNYNMARCHF1Q8TCQ1356
KHNYNCERKQ8TCT0353
KHNYNKDM6BO15054353

IntAct

46 interactions, top by confidence:

ABTypeScore
NEDD8KHNYNpsi-mi:“MI:0915”(physical association)0.700
KHNYNNEDD8psi-mi:“MI:0915”(physical association)0.700
KHNYNNEDD8psi-mi:“MI:0407”(direct interaction)0.700
OPG200IKBKBpsi-mi:“MI:0914”(association)0.620
KHNYNpsi-mi:“MI:0915”(physical association)0.580
KHNYNpsi-mi:“MI:0914”(association)0.580
KHNYNRNF216psi-mi:“MI:0915”(physical association)0.560
KHNYNTEPSINpsi-mi:“MI:0915”(physical association)0.560
KHNYNPLOD3psi-mi:“MI:0915”(physical association)0.560
TKTPOTEFpsi-mi:“MI:0914”(association)0.530
ZC3HAV1KHNYNpsi-mi:“MI:0914”(association)0.530
KHNYNENTREP1psi-mi:“MI:0915”(physical association)0.370
Xpo1IFT56psi-mi:“MI:0914”(association)0.350
RBCK1UMAD1psi-mi:“MI:0914”(association)0.350
RBCK1KHNYNpsi-mi:“MI:0914”(association)0.350

BioGRID (42): KHNYN (Affinity Capture-MS), KHNYN (Affinity Capture-MS), KHNYN (Affinity Capture-MS), KHNYN (Affinity Capture-MS), KHNYN (Affinity Capture-MS), KHNYN (Protein-peptide), KHNYN (Protein-peptide), KHNYN (Reconstituted Complex), KHNYN (Reconstituted Complex), NEDD8 (Reconstituted Complex), UBC (Reconstituted Complex), NEDD8 (Affinity Capture-Western), CUL1 (Reconstituted Complex), CUL2 (Reconstituted Complex), NEDD8 (Co-crystal Structure)

ESM2 similar proteins: A0JPN4, A2A288, A2ARK0, A6ND36, A6QQJ8, A7E316, E9Q0S6, E9Q2Z1, O15037, O54714, O54967, O70260, O70405, O75385, O94983, P42335, P48778, Q07912, Q0P4K8, Q17R13, Q1LVK9, Q32PJ7, Q4V8I3, Q5D1E7, Q5D1E8, Q5DTV4, Q5HYM0, Q5JV73, Q5SWY7, Q5SXM2, Q5U2X5, Q5XIS1, Q68CZ2, Q6A037, Q6IRU7, Q6P1H6, Q6S5L8, Q7TP65, Q7TSG2, Q80U38

Diamond homologs: A0JPN4, A2A288, A6QQJ8, O15037, O75113, Q1LVK9, Q5D1E7, Q5D1E8, Q5DTV4, Q5DTZ0, Q5HYM0, Q5ZLE9, Q6A037, Q6DJS0, Q7ZXG4, Q80U38, Q8BIY3, Q95YE2, Q9C0D7, Q9P2P1, A6NKG5, Q8I7P9

SIGNOR signaling

0 interactions.

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 32 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

GO biological processes:

GO termPartnersFoldFDR
positive regulation of canonical NF-kappaB signal transduction513.0×7e-03
proteasome-mediated ubiquitin-dependent protein catabolic process59.3×9e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

159 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance139
Likely benign6
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

2762 predictions. Top by Δscore:

VariantEffectΔscore
14:24428284:AC:Adonor_loss1.0000
14:24428285:C:Gdonor_loss1.0000
14:24428307:CACCT:Cdonor_gain1.0000
14:24430927:CCAAG:Cdonor_loss1.0000
14:24430928:CAAG:Cdonor_loss1.0000
14:24430929:AAGG:Adonor_loss1.0000
14:24430930:AGGT:Adonor_loss1.0000
14:24430931:GGTGA:Gdonor_loss1.0000
14:24430932:G:GAdonor_loss1.0000
14:24430933:T:Gdonor_loss1.0000
14:24432607:TGGT:Tdonor_gain1.0000
14:24432608:GGTG:Gdonor_gain1.0000
14:24432609:GT:Gdonor_gain1.0000
14:24432611:G:GGdonor_gain1.0000
14:24432931:A:AGacceptor_gain1.0000
14:24432934:A:AGacceptor_gain1.0000
14:24432935:G:GGacceptor_gain1.0000
14:24432935:GAGA:Gacceptor_gain1.0000
14:24436048:C:Aacceptor_gain1.0000
14:24436055:T:Aacceptor_gain1.0000
14:24436057:T:TAacceptor_gain1.0000
14:24436058:G:Aacceptor_gain1.0000
14:24436063:T:TAacceptor_gain1.0000
14:24436175:GAACG:Gdonor_gain1.0000
14:24436386:A:AGacceptor_gain1.0000
14:24436387:G:GAacceptor_gain1.0000
14:24436387:GCC:Gacceptor_gain1.0000
14:24436387:GCCT:Gacceptor_gain1.0000
14:24436485:GCCAG:Gdonor_gain1.0000
14:24436486:CCAGG:Cdonor_loss1.0000

AlphaMissense

4393 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
14:24436113:T:AV540D1.000
14:24432589:A:TD443V0.999
14:24432808:T:AV479D0.999
14:24436089:C:AA532D0.999
14:24436116:C:TS541F0.999
14:24436122:A:TD543V0.999
14:24436420:T:CF573S0.999
14:24432583:T:AV441D0.998
14:24432586:T:AI442N0.998
14:24432592:G:AG444D0.998
14:24432594:A:CS445R0.998
14:24432596:C:AS445R0.998
14:24432596:C:GS445R0.998
14:24432780:T:AW470R0.998
14:24432780:T:CW470R0.998
14:24433026:A:TD524V0.998
14:24433029:A:CD525A0.998
14:24433029:A:TD525V0.998
14:24436103:G:TG537W0.998
14:24436104:G:AG537E0.998
14:24436116:C:AS541Y0.998
14:24436122:A:CD543A0.998
14:24436128:T:CF545S0.998
14:24436393:T:CL564P0.998
14:24437173:T:CL642P0.998
14:24432589:A:CD443A0.997
14:24432590:C:AD443E0.997
14:24432590:C:GD443E0.997
14:24432753:G:CG461R0.997
14:24432760:C:AA463D0.997

dbSNP variants (sampled 300 via entrez): RS1000260818 (14:24439000 G>A), RS1000300505 (14:24428239 C>T), RS1000399866 (14:24421735 G>A), RS1000564660 (14:24426607 C>A,T), RS1000598011 (14:24437137 G>A), RS1000999348 (14:24439052 T>C,G), RS1001311315 (14:24421470 T>C,G), RS1001346952 (14:24439385 C>T), RS1001403246 (14:24423128 T>A), RS1001407779 (14:24433178 A>AT), RS1001776218 (14:24439794 A>C,G), RS1001963359 (14:24434045 G>A,T), RS1002078356 (14:24435137 G>T), RS1002114493 (14:24437852 T>C), RS1002558024 (14:24441206 T>A)

Disease associations

OMIM: gene MIM:619579 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST000759_4LDL cholesterol4.000000e-11
GCST90000025_531Appendicular lean mass8.000000e-14

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0004611low density lipoprotein cholesterol measurement
EFO:0004980appendicular lean mass

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

31 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, increases expression3
Valproic Acidaffects expression, decreases expression3
Tetrachlorodibenzodioxinincreases expression2
FR900359increases phosphorylation1
malachite greenaffects response to substance1
alpha-pineneaffects cotreatment, increases oxidation, increases abundance1
bisphenol Aincreases expression1
sodium arseniteincreases expression1
butyraldehydedecreases expression1
manganese chlorideincreases abundance, increases expression1
methacrylaldehydeincreases oxidation, increases abundance, affects cotreatment1
di-n-butylphosphoric acidaffects expression1
abrineincreases expression1
Acroleinincreases abundance, affects cotreatment, increases oxidation1
Air Pollutantsaffects cotreatment, increases abundance, increases oxidation1
Atrazineincreases expression1
Benzeneincreases expression1
Carbamazepineaffects expression1
Manganeseincreases abundance, increases expression1
Mercuric Chlorideincreases expression1
Niclosamideincreases expression1
Ozoneaffects cotreatment, increases oxidation, increases abundance1
Quercetinincreases expression1
Thiramincreases expression1
Tobacco Smoke Pollutionincreases expression1
Tretinoinincreases expression1
Cyclosporineincreases expression1
Aflatoxin B1decreases methylation1
Cadmium Chloridedecreases expression1
tert-Butylhydroperoxideincreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.