Keratoconus

disease
On this page

Also known as isolated keratoconusKCkeratoconus (disease)noninflammatory corneal thining

Summary

Keratoconus (MONDO:0015486) is a disease (an umbrella term covering 11 Mondo subtypes) with 75 cohort genes (78 GWAS associations across 9 studies) and 279 clinical trials. The dominant Reactome pathway is Assembly of collagen fibrils and other multimeric structures (8 cohort genes). Top therapeutic interventions include riboflavin, fluorometholone acetate, and prednisolone acetate.

At a glance

  • Umbrella term: 11 Mondo subtypes
  • Cohort genes: 75
  • GWAS associations: 78
  • ClinVar variants: 40
  • Clinical trials: 279

Clinical features

No curated clinical features (Orphanet) for this disease.

Identifiers

Disease identifiers

FieldValue
Canonical namekeratoconus
Mondo IDMONDO:0015486
MeSHD007640
OMIM148300
Orphanet156071, 2335
DOIDDOID:10126
ICD-10-CMH18.6
ICD-11945228622
NCITC26806
SNOMED CT65636009
UMLSC0022578
MedGen44015
MedDRA10023353
Is cancer (heuristic)no

Also known as: isolated keratoconus · KC · keratoconus · keratoconus (disease) · noninflammatory corneal thining

Data availability: 40 ClinVar variants · 78 GWAS associations (9 studies) · 1 GenCC gene-disease record · 1 HPO phenotype.

Disease family

An umbrella term covering 11 Mondo subtypes.

Classification path: disease › human disease › disease by body system or component › disorder of orbital regioneye disordercorneal disorderkeratoconus

Related subtypes (23): cornea plana, pseudopterygium, corneal deposit, Bowman’s membrane folds or rupture, corneal degeneration, corneal staphyloma, corneal argyrosis, corneal ectasia, keratopathy, keratitis, corneal edema, brittle cornea syndrome, megalocornea, X-linked corneal dermoid, Peters anomaly, pellucid marginal degeneration, corneal dystrophy, sclerocornea, cornea neoplasm, Arnold stickler bourne syndrome, limbal stem cell deficiency, thygeson superficial punctate keratopathy, Terrien marginal degeneration

Subtypes (11): acute hydrops keratoconus, keratoconus, stable condition, keratoconus 1, keratoconus 3, keratoconus 2, keratoconus 4, keratoconus 5, keratoconus 6, keratoconus 8, keratoconus 7, keratoconus 9

Genetics & variants

GWAS landscape

78 GWAS associations across 9 studies. Top hits map to 29 distinct genes (as reported by GWAS).

Top associations by p-value

rsIDp-valueGeneRisk alleleOdds ratio
rs27210516e-35LINC00598 - FOXO1T0.45
rs31185182e-28RXRA - COL5A1A0.27
rs71179211e-26PIDD1, RPLP2T0.27
rs48944141e-26FNDC3BT0.3
rs129120102e-26SMAD3T0.31
rs355238083e-25COL12A1A0.66
rs8404642e-20LOX - ZNF474T0.23
rs111174014e-20LINC02182A0.24
rs13241753e-19LINC01235 - LINC00583T0.24
rs125154003e-18RPL13AP13 - FSTT0.22
rs3752905742e-16RDH8 - C3P1C3.69
rs5446965841e-14ARHGEF37 - U3C3.71
rs126030552e-14KIF1C-AS1, KIF1CC0.23
rs1889936169e-14MRPS35C2.86
rs9623076785e-13RN7SKP6 - PRR20AC4.16
rs1929849221e-12DDX43P2 - VWC2G2.97
rs1446190212e-12LINC02264 - NDST3G3.13
rs172855503e-12KLF5A0.18
rs5741778603e-12SPRY4-AS1G3.41
rs6583524e-12EIF3AT0.18
rs21436834e-12LINC01695T0.18
rs5670101384e-12PSD3C2.8
rs1818568464e-12HSBP1 - MLYCDG4
rs5752730904e-12IRX1 - LINC02063C3.35
rs3731924925e-12TMEM213G3.63
rs1838013416e-12LINC01876T3.73
rs5325476956e-12HS3ST3B1 - RPS18P12G4.04
rs767473457e-12Y_RNA - LINC01684A0.55
rs46467859e-12ALDH3A1T0.2
rs1424930249e-12COL6A1A0.76

Top studies (by case count)

StudyLead authorYearCasesControlsTitle
GCST90013442Hardcastle AJ20212,11624,626A multi-ethnic genome-wide association study implicates collagen matrix integrity and cell differentiation pathways in keratoconus.
GCST90837424Xu L20248536,248Association of Novel Loci With Keratoconus Susceptibility in a Chinese Genome-Wide Association Study.
GCST90477661Verma A2024573450,622Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program.
GCST009677McComish BJ20195220Association of Genetic Variation With Keratoconus.
GCST90480060Verma A2024367121,509Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program.
GCST90481911Verma A2024367121,509Diversity and scale: Genetic architecture of 2068 traits in the VA Million Veteran Program.
GCST001274Burdon KP20113193,462Association of polymorphisms in the hepatocyte growth factor gene promoter with keratoconus.
GCST001261Li X20112223,324A genome-wide association study identifies a potential novel gene locus for keratoconus, one of the commonest causes for corneal transplantation in developed countries.
GCST90435979Zhou W201887397,761Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies.

Variant details and genetic-evidence tiers

Tier distribution (top 50 variants)

TierVariants
Tier 1: coding1
Tier 2: splice/UTR2
Tier 3: regulatory1
Tier 4: intronic/intergenic46

MAF distribution

BucketVariants
common (>=0.05)27
low_freq (0.01-0.05)0
rare (<0.01)21
unknown2

Functional consequences

ConsequenceCount
intron_variant27
intergenic_variant17
non_coding_transcript_exon_variant2
5_prime_UTR_variant1
missense_variant1
3_prime_UTR_variant1
regulatory_region_variant1

Top variants

rsIDChrPosAllelesMAFConsequenceGenep-valueTier
rs27210511340536747C>G,T0.05intron_variantLINC00598 - FOXO16e-35Tier 4: intronic/intergenic
rs31185189134548237G>A0.05intergenic_variantRXRA - COL5A12e-28Tier 4: intronic/intergenic
rs711792111800486C>T0.055_prime_UTR_variantPIDD1, RPLP21e-26Tier 2: splice/UTR
rs48944143172279709C>A,T0.05intron_variantFNDC3B1e-26Tier 4: intronic/intergenic
rs129120101567174805G>T0.05intron_variantSMAD32e-26Tier 4: intronic/intergenic
rs35523808675125255T>A0.05missense_variantCOL12A13e-25Tier 1: coding
rs8404645122081768G>T0.05intergenic_variantLOX - ZNF4742e-20Tier 4: intronic/intergenic
rs111174011688280846G>A,C,T0.05intron_variantLINC021824e-20Tier 4: intronic/intergenic
rs1324175913549156C>A,T0.05intron_variantLINC01235 - LINC005833e-19Tier 4: intronic/intergenic
rs12515400553285349C>T0.05intergenic_variantRPL13AP13 - FST3e-18Tier 4: intronic/intergenic
rs3752905741910028881C>T0intergenic_variantRDH8 - C3P12e-16Tier 4: intronic/intergenic
rs5446965845149656432C>T0intron_variantARHGEF37 - U31e-14Tier 4: intronic/intergenic
rs12603055175027076G>A,C,T0.053_prime_UTR_variantKIF1C-AS1, KIF1C2e-14Tier 2: splice/UTR
rs1889936161227745673C>A,T0.001intron_variantMRPS359e-14Tier 4: intronic/intergenic
rs9623076781357078141C>T0intron_variantRN7SKP6 - PRR20A5e-13Tier 4: intronic/intergenic
rs192984922749453836G>A0.001intergenic_variantDDX43P2 - VWC21e-12Tier 4: intronic/intergenic
rs1446190214117877256G>A0.001intron_variantLINC02264 - NDST32e-12Tier 4: intronic/intergenic
rs172855501373070755A>G0.05intron_variantKLF53e-12Tier 4: intronic/intergenic
rs5741778605142409381G>C0intron_variantSPRY4-AS13e-12Tier 4: intronic/intergenic
rs65835210119044276T>C0.05intron_variantEIF3A4e-12Tier 4: intronic/intergenic
rs21436832128158967T>A,C,G0.05intron_variantLINC016954e-12Tier 4: intronic/intergenic
rs567010138819046420C>A0.001intergenic_variantPSD34e-12Tier 4: intronic/intergenic
rs1818568461683855405G>C0intron_variantHSBP1 - MLYCD4e-12Tier 4: intronic/intergenic
rs57527309053992921C>T0.001intergenic_variantIRX1 - LINC020634e-12Tier 4: intronic/intergenic
rs3731924927138824390G>A0.001intron_variantTMEM2135e-12Tier 4: intronic/intergenic
rs1838013412156109824T>C0intron_variantLINC018766e-12Tier 4: intronic/intergenic
rs5325476951714404852G>A0intron_variantHS3ST3B1 - RPS18P126e-12Tier 4: intronic/intergenic
rs767473452123984911A>C,G0.05non_coding_transcript_exon_variantY_RNA - LINC016847e-12Tier 4: intronic/intergenic
rs46467851719748589C>A,G,T0.05regulatory_region_variantALDH3A19e-12Tier 3: regulatory
rs1424930242145993879G>A,Tintron_variantCOL6A19e-12Tier 4: intronic/intergenic

ClinVar germline variants

40 retrieved; paginated sample, class counts are floors:

31 uncertain significance, 4 likely pathogenic, 3 conflicting classifications of pathogenicity, 1 likely benign, 1 benign/likely benign

ClinVarVariant (HGVS)GeneClassificationReview
2442791NM_000691.5(ALDH3A1):c.703G>A (p.Gly235Arg)ALDH3A1Likely pathogenicno assertion criteria provided
2577488NM_001845.6(COL4A1):c.976G>A (p.Gly326Ser)COL4A1Likely pathogenicno assertion criteria provided
2577487NM_000393.5(COL5A2):c.4364G>A (p.Gly1455Glu)COL5A2Likely pathogenicno assertion criteria provided
50262NM_005430.4(WNT1):c.1063G>T (p.Val355Phe)WNT1Likely pathogeniccriteria provided, single submitter
225320NM_000088.4(COL1A1):c.3766G>A (p.Ala1256Thr)COL1A1Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
459803NM_000302.4(PLOD1):c.109G>A (p.Glu37Lys)PLOD1Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
126928NM_001367624.2(ZNF469):c.6179C>A (p.Ser2060Tyr)ZNF469Conflicting classifications of pathogenicitycriteria provided, conflicting classifications
2498228NR_002144.1:n.472_478delUncertain significanceno assertion criteria provided
2498236NC_000004.12:g.69391612A>GUncertain significanceno assertion criteria provided
2498237NR_027420.1:n.875C>TUncertain significanceno assertion criteria provided
545133NM_001395430.1(PAK6):c.1855C>A (p.Pro619Thr)BUB1B-PAK6Uncertain significanceno assertion criteria provided
545128NM_015719.4(COL5A3):c.4204A>C (p.Lys1402Gln)COL5A3Uncertain significanceno assertion criteria provided
545131NM_001395010.1(DAB2IP):c.2759C>T (p.Pro920Leu)DAB2IPUncertain significanceno assertion criteria provided
2445998NM_001320714.2(DOP1B):c.1227_1228del (p.Gln410fs)DOP1BUncertain significanceno assertion criteria provided
2445997NM_001039753.4(EML6):c.2528G>T (p.Trp843Leu)EML6Uncertain significanceno assertion criteria provided
2445980NM_000875.5(IGF1R):c.1447G>A (p.Gly483Arg)IGF1RUncertain significanceno assertion criteria provided
2498238NM_014218.3(KIR2DL1):c.272C>A (p.Thr91Lys)KIR2DL1Uncertain significanceno assertion criteria provided
2498239NM_013289.4(KIR3DL1):c.337G>A (p.Val113Met)KIR3DL1Uncertain significanceno assertion criteria provided
2498240NM_013289.4(KIR3DL1):c.475G>T (p.Gly159Trp)KIR3DL1Uncertain significanceno assertion criteria provided
545126NM_002336.3(LRP6):c.4822C>T (p.Pro1608Ser)LRP6Uncertain significanceno assertion criteria provided
2498241NM_001304359.2(MUC5AC):c.3601C>T (p.Arg1201Trp)MUC5ACUncertain significanceno assertion criteria provided
2498242NM_001304359.2(MUC5AC):c.5459G>A (p.Arg1820Gln)MUC5ACUncertain significanceno assertion criteria provided
2498243NM_001304359.2(MUC5AC):c.10301C>T (p.Pro3434Leu)MUC5ACUncertain significanceno assertion criteria provided
2498244NM_001355281.2(NANOGP8):c.190G>T (p.Asp64Tyr)NANOGP8Uncertain significanceno assertion criteria provided
2445999NM_015175.3(NBEAL2):c.7878+2T>ANBEAL2Uncertain significanceno assertion criteria provided
2498229NM_001351365.2(NBPF19):c.11302C>G (p.Leu3768Val)NBPF19Uncertain significanceno assertion criteria provided
545132NM_001042492.3(NF1):c.5257G>C (p.Val1753Leu)NF1Uncertain significanceno assertion criteria provided
545130NM_173165.3(NFATC3):c.1922G>A (p.Arg641Gln)NFATC3Uncertain significanceno assertion criteria provided
545129NM_001282426.2(PIK3CG):c.711del (p.Asp238fs)PIK3CGUncertain significanceno assertion criteria provided
545127NM_005605.5(PPP3CC):c.1199T>C (p.Met400Thr)PPP3CCUncertain significanceno assertion criteria provided

Genes & proteins

Mendelian disease overlap and somatic drivers

GenCC: 1 · Orphanet: 57 · OMIM-shared: 0 · Dual-evidence (GWAS+Mendelian): 3

Dual-evidence genes (GWAS + Mendelian — highest-confidence targets)

GeneHGNCEvidence routes
COL1A1COL1A1GWAS, Orphanet
ZNF469ZNF469GWAS, Orphanet
ALDH3A1ALDH3A1GWAS

GenCC gene–disease validity (cohort genes)

the Disease column is the GenCC-asserted condition — a cohort gene’s strongest validity may be for a related predisposition syndrome.

GeneClassificationInheritanceDiseaseRecords
DOCK9LimitedAutosomal dominantkeratoconus

Orphanet rare-disease linkage (cohort genes)

GeneOrphanet IDRare disease
COL1A1Orphanet:1310Caffey disease
COL1A1Orphanet:1899Arthrochalasia Ehlers-Danlos syndrome
COL1A1Orphanet:216796Osteogenesis imperfecta type 1
COL1A1Orphanet:216804Osteogenesis imperfecta type 2
COL1A1Orphanet:216812Osteogenesis imperfecta type 3
COL1A1Orphanet:216820Osteogenesis imperfecta type 4
COL1A1Orphanet:230857Ehlers-Danlos/osteogenesis imperfecta syndrome
COL1A1Orphanet:287Classical Ehlers-Danlos syndrome
COL1A1Orphanet:31112Dermatofibrosarcoma protuberans
COL1A1Orphanet:314029High bone mass osteogenesis imperfecta
ZNF469Orphanet:90354Brittle cornea syndrome
RORAOrphanet:528084Non-specific syndromic intellectual disability
SGCAOrphanet:62Alpha-sarcoglycan-related limb-girdle muscular dystrophy R3
SLC25A15Orphanet:415Hyperornithinemia-hyperammonemia-homocitrullinuria syndrome
TALDO1Orphanet:101028Transaldolase deficiency
TFAP2BOrphanet:46627Char syndrome
TFAP2BOrphanet:466729Familial patent arterial duct
THRBOrphanet:566243Resistance to thyroid hormone due to a mutation in thyroid hormone receptor beta
TRAOrphanet:99861Precursor T-cell acute lymphoblastic leukemia
TRBOrphanet:99861Precursor T-cell acute lymphoblastic leukemia
TSC1Orphanet:210159Adult hepatocellular carcinoma
TSC1Orphanet:269008Isolated focal cortical dysplasia type IIb
TSC1Orphanet:538Lymphangioleiomyomatosis
TSC1Orphanet:805Tuberous sclerosis complex
WNT1Orphanet:216812Osteogenesis imperfecta type 3
WNT1Orphanet:216820Osteogenesis imperfecta type 4
WNT1Orphanet:85193Idiopathic juvenile osteoporosis
C8GOrphanet:169150Immunodeficiency due to a late component of complement deficiency
VANGL1Orphanet:3027Caudal regression syndrome
RAB3GAP1Orphanet:1387Cataract-intellectual disability-hypogonadism syndrome
RAB3GAP1Orphanet:2510Micro syndrome
CHRNEOrphanet:98913Postsynaptic congenital myasthenic syndrome
SLC25A22Orphanet:1934Early infantile developmental and epileptic encephalopathy
SLC25A22Orphanet:293181Epilepsy of infancy with migrating focal seizures
EPS8L2Orphanet:90636Rare autosomal recessive non-syndromic sensorineural deafness type DFNB
COL12A1Orphanet:536516Myopathic Ehlers-Danlos syndrome
COL12A1Orphanet:610Bethlem muscular dystrophy
COL12A1Orphanet:75840Ullrich congenital muscular dystrophy
COL4A1Orphanet:36383COL4A1/2-related familial vascular leukoencephalopathy
COL4A1Orphanet:477749Pontine autosomal dominant microangiopathy with leukoencephalopathy
COL4A1Orphanet:481986Familial schizencephaly
COL4A1Orphanet:73229HANAC syndrome
COL4A1Orphanet:75326Familial isolated retinal arteriolar tortuosity
COL4A1Orphanet:899Walker-Warburg syndrome
COL4A1Orphanet:99810Familial porencephaly
COL5A1Orphanet:287Classical Ehlers-Danlos syndrome
COL5A2Orphanet:287Classical Ehlers-Danlos syndrome
COL6A1Orphanet:610Bethlem muscular dystrophy
COL6A1Orphanet:646113Intermediate collagen VI-related muscular dystrophy
COL6A1Orphanet:75840Ullrich congenital muscular dystrophy

Cohort genes → proteins

75 cohort genes, 75 distinct canonical proteins.

Evidence partition

SubsetGenes
gwas_only55
gwas_and_clinvar3
multi_evidence17

Cohort genes (full)

SymbolHGNCEnsemblUniProtNameEvidence
COL1A1HGNC:2197ENSG00000108821P02452Collagen alpha-1(I) chaingwas,clinvar
ZNF469HGNC:23216ENSG00000225614Q96JG9Zinc finger protein 469gwas,clinvar
ALDH3A1HGNC:405ENSG00000108602P30838Aldehyde dehydrogenase, dimeric NADP-preferringgwas,clinvar
DOCK9HGNC:14132ENSG00000088387Q9BZ29Dedicator of cytokinesis protein 9gencc
RORAHGNC:10258ENSG00000069667P35398Nuclear receptor ROR-alphagwas
RPLP2HGNC:10377ENSG00000177600P05387Large ribosomal subunit protein P2gwas
BHLHE40HGNC:1046ENSG00000134107O14503Class E basic helix-loop-helix protein 40gwas
RXRAHGNC:10477ENSG00000186350P19793Retinoic acid receptor RXR-alphagwas
SGCAHGNC:10805ENSG00000108823Q16586Alpha-sarcoglycangwas
SLC25A15HGNC:10985ENSG00000102743Q9Y619Mitochondrial ornithine transporter 1gwas
SPAG7HGNC:11216ENSG00000091640O75391Sperm-associated antigen 7gwas
TALDO1HGNC:11559ENSG00000177156P37837Transaldolasegwas
TFAP2BHGNC:11743ENSG00000008196Q92481Transcription factor AP-2-betagwas
THRBHGNC:11799ENSG00000151090P10828Thyroid hormone receptor betaclinvar
TRAHGNC:12027P0DSE1M1-specific T cell receptor alpha chainclinvar
TRBHGNC:12155P0DSE2M1-specific T cell receptor beta chainclinvar
TSC1HGNC:12362ENSG00000165699Q92574Hamartinclinvar
WBP4HGNC:12739ENSG00000120688O75554WW domain-binding protein 4gwas
WNT1HGNC:12774ENSG00000125084P04628Proto-oncogene Wnt-1clinvar
ZFP3HGNC:12861ENSG00000180787Q96NJ6Zinc finger protein 3 homologgwas
DOP1BHGNC:1291ENSG00000142197Q9Y3R5Protein DOP1Bclinvar
BANPHGNC:13450ENSG00000172530Q8N9N5Protein BANPgwas
C8GHGNC:1354ENSG00000176919P07360Complement component C8 gamma chaingwas
FBXW5HGNC:13613ENSG00000159069Q969U6F-box/WD repeat-containing protein 5gwas
MRPS14HGNC:14049ENSG00000120333O60783Small ribosomal subunit protein uS14mgwas
UBXN4HGNC:14860ENSG00000144224Q92575UBX domain-containing protein 4gwas
COL5A3HGNC:14864ENSG00000080573P25940Collagen alpha-3(V) chainclinvar
VANGL1HGNC:15512ENSG00000173218Q8TAA9Vang-like protein 1clinvar
SKAP1HGNC:15605ENSG00000141293Q86WV1Src kinase-associated phosphoprotein 1gwas
CCNE2HGNC:1590ENSG00000175305O96020G1/S-specific cyclin-E2gwas
ACTL7BHGNC:162ENSG00000148156Q9Y614Actin-like protein 7Bgwas
STK35HGNC:16254ENSG00000125834Q8TDR2Serine/threonine-protein kinase 35gwas
MAML2HGNC:16259ENSG00000184384Q8IZL2Mastermind-like protein 2gwas
PIDD1HGNC:16491ENSG00000177595C0HMD6PIDD1 alternative open reading frame proteingwas
CD34HGNC:1662ENSG00000174059P28906Hematopoietic progenitor cell antigen CD34gwas
MRPS31HGNC:16632ENSG00000102738Q92665Small ribosomal subunit protein mS31gwas
RAB3GAP1HGNC:17063ENSG00000115839Q15042Rab3 GTPase-activating protein catalytic subunitgwas
DAB2IPHGNC:17294ENSG00000136848Q5VWQ8Disabled homolog 2-interacting proteinclinvar
TP53INP1HGNC:18022ENSG00000164938Q96A56Tumor protein p53-inducible nuclear protein 1gwas
CAMTA2HGNC:18807ENSG00000108509O94983Calmodulin-binding transcription activator 2gwas
CHRNEHGNC:1966ENSG00000108556Q04844Acetylcholine receptor subunit epsilongwas
SLC25A22HGNC:19954ENSG00000177542Q9H936Mitochondrial glutamate carrier 1gwas
TIPE3HGNC:20620ENSG00000183578Q5GJ75Tumor necrosis factor alpha-induced protein 8-like protein 3gwas
EPS8L2HGNC:21296ENSG00000177106Q9H6S3Epidermal growth factor receptor kinase substrate 8-like protein 2gwas
COL12A1HGNC:2188ENSG00000111799Q99715Collagen alpha-1(XII) chaingwas
COL4A1HGNC:2202ENSG00000187498P02462Collagen alpha-1(IV) chainclinvar
COL5A1HGNC:2209ENSG00000130635P20908Collagen alpha-1(V) chaingwas
COL5A2HGNC:2210ENSG00000204262P05997Collagen alpha-2(V) chainclinvar
COL6A1HGNC:2211ENSG00000142156P12109Collagen alpha-1(VI) chaingwas
NANOS1HGNC:23044ENSG00000188613Q8WY41Nanos homolog 1gwas

Cohort function summary

Lead sentence per gene, UniProt-curated.

SymbolProtein nameFunction (lead sentence)
COL1A1Collagen alpha-1(I) chainType I collagen is a member of group I collagen (fibrillar forming collagen).
ZNF469Zinc finger protein 469May be involved in transcriptional regulation.
ALDH3A1Aldehyde dehydrogenase, dimeric NADP-preferringALDHs play a major role in the detoxification of alcohol-derived acetaldehyde.
DOCK9Dedicator of cytokinesis protein 9Guanine nucleotide-exchange factor (GEF) that activates CDC42 by exchanging bound GDP for free GTP.
RORANuclear receptor ROR-alphaNuclear receptor that binds DNA as a monomer to ROR response elements (RORE) containing a single core motif half-site 5’-AGGTCA-3’ preceded by a short A-T-rich sequence.
RPLP2Large ribosomal subunit protein P2Plays an important role in the elongation step of protein synthesis.
BHLHE40Class E basic helix-loop-helix protein 40Transcriptional repressor involved in the regulation of the circadian rhythm by negatively regulating the activity of the clock genes and clock-controlled genes.
RXRARetinoic acid receptor RXR-alphaReceptor for retinoic acid that acts as a transcription factor.
SGCAAlpha-sarcoglycanComponent of the sarcoglycan complex, a subcomplex of the dystrophin-glycoprotein complex which forms a link between the F-actin cytoskeleton and the extracellular matrix.
SLC25A15Mitochondrial ornithine transporter 1Mitochondrial ornithine-citrulline antiporter.
TALDO1TransaldolaseCatalyzes the rate-limiting step of the non-oxidative phase in the pentose phosphate pathway.
TFAP2BTranscription factor AP-2-betaSequence-specific DNA-binding protein that interacts with inducible viral and cellular enhancer elements to regulate transcription of selected genes.
THRBThyroid hormone receptor betaNuclear hormone receptor that can act as a repressor or activator of transcription.
TRAM1-specific T cell receptor alpha chainThe alpha chain of TRAV2701J4201C01/TRBV1901J2S701C02 alpha-beta T cell receptor (TR) clonotype that is specific for HLA-A*02:01-restricted M/matrix protein 1 immunodominant epitope GILGFVFTL of influenza A virus (IAV).
TRBM1-specific T cell receptor beta chainThe beta chain of TRAV2701J4201C01/TRBV1901J2S701C02 alpha-beta T cell receptor (TR) clonotype that is specific for HLA-A*02:01-restricted M/matrix protein 1 immunodominant epitope GILGFVFTL of influenza A virus (IAV).
TSC1HamartinNon-catalytic component of the TSC-TBC complex, a multiprotein complex that acts as a negative regulator of the canonical mTORC1 complex, an evolutionarily conserved central nutrient sensor that stimulates anabolic reactions and macromolec…
WBP4WW domain-binding protein 4Involved in pre-mRNA splicing as a component of the spliceosome.
WNT1Proto-oncogene Wnt-1Ligand for members of the frizzled family of seven transmembrane receptors.
ZFP3Zinc finger protein 3 homologMay be involved in transcriptional regulation.
DOP1BProtein DOP1BMay play a role in regulating membrane trafficking of cargo proteins.
BANPProtein BANPControls V(D)J recombination during T-cell development by repressing T-cell receptor (TCR) beta enhancer function.
C8GComplement component C8 gamma chainComponent of the membrane attack complex (MAC), a multiprotein complex activated by the complement cascade, which inserts into a target cell membrane and forms a pore, leading to target cell membrane rupture and cell lysis.
FBXW5F-box/WD repeat-containing protein 5Substrate recognition component of both SCF (SKP1-CUL1-F-box protein) and DCX (DDB1-CUL4-X-box) E3 ubiquitin-protein ligase complexes.
UBXN4UBX domain-containing protein 4Involved in endoplasmic reticulum-associated protein degradation (ERAD).
COL5A3Collagen alpha-3(V) chainType V collagen is a member of group I collagen (fibrillar forming collagen).
SKAP1Src kinase-associated phosphoprotein 1Positively regulates T-cell receptor signaling by enhancing the MAP kinase pathway.
CCNE2G1/S-specific cyclin-E2Essential for the control of the cell cycle at the late G1 and early S phase.
MAML2Mastermind-like protein 2Acts as a transcriptional coactivator for NOTCH proteins.
CD34Hematopoietic progenitor cell antigen CD34Possible adhesion molecule with a role in early hematopoiesis by mediating the attachment of stem cells to the bone marrow extracellular matrix or directly to stromal cells.
RAB3GAP1Rab3 GTPase-activating protein catalytic subunitCatalytic subunit of the Rab3 GTPase-activating (Rab3GAP) complex composed of RAB3GAP1 and RAB3GAP2, which accelerates the otherwise slow GTP hydrolysis catalyzed by Rab proteins.
DAB2IPDisabled homolog 2-interacting proteinFunctions as a scaffold protein implicated in the regulation of a large spectrum of both general and specialized signaling pathways.
TP53INP1Tumor protein p53-inducible nuclear protein 1Antiproliferative and proapoptotic protein involved in cell stress response which acts as a dual regulator of transcription and autophagy.
CAMTA2Calmodulin-binding transcription activator 2Transcription activator.
CHRNEAcetylcholine receptor subunit epsilonAfter binding acetylcholine, the AChR responds by an extensive change in conformation that affects all subunits and leads to opening of an ion-conducting channel across the plasma membrane.
SLC25A22Mitochondrial glutamate carrier 1Mitochondrial glutamate/H(+) symporter.
TIPE3Tumor necrosis factor alpha-induced protein 8-like protein 3Acts as a lipid transfer protein.
EPS8L2Epidermal growth factor receptor kinase substrate 8-like protein 2Stimulates guanine exchange activity of SOS1.
COL12A1Collagen alpha-1(XII) chainType XII collagen interacts with type I collagen-containing fibrils, the COL1 domain could be associated with the surface of the fibrils, and the COL2 and NC3 domains may be localized in the perifibrillar matrix.
COL4A1Collagen alpha-1(IV) chainType IV collagen is the major structural component of glomerular basement membranes (GBM), forming a ‘chicken-wire’ meshwork together with laminins, proteoglycans and entactin/nidogen.
COL5A1Collagen alpha-1(V) chainType V collagen is a member of group I collagen (fibrillar forming collagen).
COL5A2Collagen alpha-2(V) chainType V collagen is a member of group I collagen (fibrillar forming collagen).
COL6A1Collagen alpha-1(VI) chainCollagen VI acts as a cell-binding protein.
NANOS1Nanos homolog 1May act as a translational repressor which regulates translation of specific mRNAs by forming a complex with PUM2 that associates with the 3’-UTR of mRNA targets.
NANOGP8Homeobox protein NANOGP8May act as a transcription regulator.
CEND1Cell cycle exit and neuronal differentiation protein 1Involved in neuronal differentiation.
CSNK1ECasein kinase I isoform epsilonCasein kinases are operationally defined by their preferential utilization of acidic proteins such as caseins as substrates.
RNF167E3 ubiquitin-protein ligase RNF167E3 ubiquitin-protein ligase that acts as a regulator of the TORC1 signaling pathway.
FNDC3BFibronectin type III domain-containing protein 3BMay be a positive regulator of adipogenesis.
UCMAUnique cartilage matrix-associated proteinMay be involved in the negative control of osteogenic differentiation of osteochondrogenic precursor cells in peripheral zones of fetal cartilage and at the cartilage-bone interface.
AAGABAlpha- and gamma-adaptin-binding protein p34May be involved in endocytic recycling of growth factor receptors such as EGFR.

Protein-family classification

Druggable: 13 · Difficult: 14 · Unknown: 48 · Druggable fraction: 0.17

Family distribution

Cohort families vs a genome-wide background (hypergeometric, BH-FDR; fold = observed/expected). Counts kept; sorted by enrichment, so the catch-all Other/Unknown bucket no longer leads.

FamilyGenesFoldFDR
Nuclear receptor315.4×0.009
Antibody/Immunoglobulin51.9×0.343
Other/Unknown481.1×0.343
Scaffold/PPI61.4×0.599
Transporter11.0×0.975
Transcription factor80.9×0.975
Kinase20.7×0.975
Protease10.5×0.985
Enzyme (other)10.2×0.999

Per-gene assignment

SymbolFamilyDruggable?ECInterPro (top 3)
COL1A1Other/UnknownnoFib_collagen_C, VWF_dom, Collagen
ZNF469Transcription factornoZnf_C2H2_type, Znf_C2H2_sf, ZNF469
ALDH3A1Enzyme (other)yes1.2.1.5Aldehyde_DH_NAD(P), Aldehyde_DH_dom, Ald_DH_CS_CYS
DOCK9Scaffold/PPInoPH_domain, PH-like_dom_sf, ARM-type_fold
RORANuclear receptoryesNucl_hrmn_rcpt_lig-bd, Znf_hrmn_rcpt, Nuclear_hrmn_rcpt
RPLP2Other/UnknownnoRibosomal_P1/P2, P1/P2_N_sf, Ribosomal_P2_euk
BHLHE40Transcription factornoOrange_dom, bHLH_dom, HLH_DNA-bd_sf
RXRANuclear receptoryesRetinoid-X_rcpt/HNF4, Nucl_hrmn_rcpt_lig-bd, Znf_hrmn_rcpt
SGCAOther/UnknownnoCadg, Sarcoglycan_alpha/epsilon, Cadherin-like_sf
SLC25A15Other/UnknownnoMCP_transmembrane, MCP_dom_sf, Mitochondrial_Carrier
SPAG7Other/UnknownnoR3H_dom, SPAG7, R3H_sperm-antigen
TALDO1Other/UnknownnoTAL/FSA, Transaldolase_1, Aldolase_TIM
TFAP2BTranscription factornoTF_AP2, TF_AP2_beta, TF_AP2_C
THRBNuclear receptoryesNucl_hrmn_rcpt_lig-bd, Znf_hrmn_rcpt, Nuclear_hrmn_rcpt
TRAAntibody/ImmunoglobulinyesIg_sub, Ig-like_dom, Ig_V-set
TRBAntibody/ImmunoglobulinyesIg_C1-set, Ig_sub, Ig-like_dom
TSC1Other/UnknownnoHamartin
WBP4Transcription factornoMatrin/U1-C_Znf_C2H2, WW_dom, Matrin/U1-like-C_Znf_C2H2
WNT1Other/UnknownnoWnt, Wnt1, Wnt_CS
ZFP3Transcription factornoZnf_C2H2_type, Znf_C2H2_sf, ZnF_C2H2-type_TF
DOP1BOther/UnknownnoDOP1_N, DOP1, DOP1_C
BANPOther/UnknownnoBEN_domain, BANP
C8GOther/UnknownnoLipocln_cytosolic_FA-bd_dom, A1-microglobln, Calycin
FBXW5Scaffold/PPInoWD40_rpt, F-box_dom, WD40/YVTN_repeat-like_dom_sf
MRPS14Other/UnknownnoRibosomal_uS14
UBXN4Other/UnknownnoUBX_dom, Ubiquitin-like_domsf, Thioredoxin-like_sf
COL5A3Other/UnknownnoFib_collagen_C, Collagen, ConA-like_dom_sf
VANGL1Other/UnknownnoVANGL
SKAP1Scaffold/PPInoSH3_domain, PH_domain, PH-like_dom_sf
CCNE2Other/UnknownnoCyclin_C-dom, Cyclin_N, Cyclin-like_dom
ACTL7BOther/UnknownnoActin, ATPase_NBD
STK35KinaseyesProt_kinase_dom, Ser/Thr_kinase_AS, Kinase-like_dom_sf
MAML2Other/UnknownnoMastermind-like_N, MAML1-3, MAML_N_sf
PIDD1ProteaseyesDeath_dom, ZU5_dom, Leu-rich_rpt
CD34Other/UnknownnoCD34, CD34/Podocalyxin
MRPS31Other/UnknownnoMRP-S31
RAB3GAP1Other/UnknownnoRab3GAP1_conserved, Rab3GAP1_C, Rab3GAP1
DAB2IPScaffold/PPInoC2_dom, PH_domain, RasGAP_dom
TP53INP1Other/UnknownnoTP53INP
CAMTA2Antibody/ImmunoglobulinyesIPT_dom, CG-1_dom, Ig-like_fold
CHRNEOther/UnknownnoNicotinic_acetylcholine_rcpt, Neurotrans-gated_channel_TM, Neur_channel
SLC25A22TransporteryesMCP, MCP_transmembrane, MCP_dom_sf
TIPE3Other/UnknownnoTNFAIP8-like, TNFAIP8_sf
EPS8L2Scaffold/PPInoSH3_domain, PTB/PI_dom, PH-like_dom_sf
COL12A1Antibody/ImmunoglobulinyesVWF_A, FN3_dom, Collagen
COL4A1Other/UnknownnoCollagen_IV_NC, Collagen, CTDL_fold
COL5A1Other/UnknownnoFib_collagen_C, Laminin_G, Collagen
COL5A2Other/UnknownnoFib_collagen_C, VWF_dom, Collagen
COL6A1Other/UnknownnoVWF_A, Collagen, vWFA_dom_sf
NANOS1Transcription factornoNanos/Xcar2, Znf_nanos-typ, Nanos_sf

Expression context

Cohort genes with no expression data: 2.

62 cohort genes are a single-cell marker in ≥1 SCXA experiment.

Breadth distribution (Bgee present_calls)

BucketGenes
narrow (1-5 tissues)0
moderate (6-20)0
broad (>20)73
unknown2

Top tissues across cohort

TissueCohort genes
oocyte8
apex of heart7
secondary oocyte7
left testis6
right hemisphere of cerebellum6
cartilage tissue5
calcaneal tendon5
tendon of biceps brachii5
sperm5
right uterine tube5
male germ line stem cell (sensu Vertebrata) in testis5
skin of hip4
stromal cell of endometrium4
tibia4
lower esophagus mucosa4
primordial germ cell in gonad4
granulocyte4
sural nerve4
cerebellar hemisphere4
periodontal ligament3

Per-gene tissue summary (top 30)

SymbolBgee breadthFANTOM5 breadthSCXATop tissues
COL1A1298ubiquitousmarkerstromal cell of endometrium, skin of hip, periodontal ligament
ZNF469211broadyestibia, upper arm skin, cartilage tissue
ALDH3A1204broadmarkernasal cavity epithelium, trachea, lower esophagus mucosa
DOCK9293ubiquitousmarkercalcaneal tendon, right lung, corpus callosum
RORA284ubiquitousmarkerupper leg skin, lateral nuclear group of thalamus, skin of hip
RPLP2295ubiquitousmarkernipple, primordial germ cell in gonad, skin of hip
BHLHE40302ubiquitousmarkersaphenous vein, vena cava, lower esophagus mucosa
RXRA294ubiquitousmarkerskin of hip, gingival epithelium, pancreatic ductal cell
SGCA190broadmarkerhindlimb stylopod muscle, gastrocnemius, apex of heart
SLC25A15198ubiquitousmarkerliver, right lobe of liver, duodenum
SPAG7295ubiquitousmarkerapex of heart, hindlimb stylopod muscle, gastrocnemius
TALDO1294ubiquitousmarkertrabecular bone tissue, blood, gingival epithelium
TFAP2B128broadmarkercorpus epididymis, cauda epididymis, oocyte
THRB267ubiquitousmarkerBrodmann (1909) area 23, middle temporal gyrus, tibia
TRA
TRB
TSC1297ubiquitousmarkersubstantia nigra pars compacta, gluteal muscle, lateral globus pallidus
WBP4293ubiquitousmarkeroocyte, secondary oocyte, amniotic fluid
WNT173tissue_specificyesgranulocyte, nucleus accumbens, superior frontal gyrus
ZFP3221ubiquitousyescardiac muscle of right atrium, epithelial cell of pancreas, left ventricle myocardium
DOP1B224ubiquitousmarkercortical plate, ganglionic eminence, rectum
BANP277ubiquitousmarkeroocyte, secondary oocyte, blood
C8G164tissue_specificmarkerright lobe of liver, liver, primordial germ cell in gonad
FBXW5220ubiquitousmarkerright testis, left testis, apex of heart
MRPS14287ubiquitousmarkertendon of biceps brachii, endothelial cell, oocyte
UBXN4303ubiquitousmarkersperm, pylorus, cardia of stomach
COL5A3240broadmarkersural nerve, apex of heart, endocervix
VANGL1234ubiquitousmarkerbronchial epithelial cell, corpus epididymis, caput epididymis
SKAP1189broadmarkergranulocyte, right lobe of liver, right uterine tube
CCNE2233ubiquitousmarkerendometrium epithelium, corpus callosum, C1 segment of cervical spinal cord

Protein interactions among cohort

Intra-cohort edges: 35.

Hub genes (top 10 by interactor count)

SymbolInteractor count
TSC15,445
COL1A15,341
CD344,928
ALDH3A13,940
RPLP23,763
TALDO13,738
MRPS143,690
RXRA3,421
COL6A13,049
COL4A12,909

Intra-cohort edges

ABSources
BANPFNDC3Bstring_interaction
BANPZNF469string_interaction
BHLHE40CSNK1Estring_interaction
CD34SPAG7string_interaction
COL12A1COL1A1string_interaction
COL12A1COL5A2string_interaction
COL12A1COL5A3string_interaction
COL12A1COL6A1string_interaction
COL1A1COL5A1intact, string_interaction
COL1A1COL5A2string_interaction
COL1A1COL6A1string_interaction
COL4A1COL5A1string_interaction
COL4A1COL5A2string_interaction
COL4A1COL5A3string_interaction
COL4A1COL6A1string_interaction
COL5A1COL5A2string_interaction
COL5A1COL5A3string_interaction
COL5A1COL6A1intact
COL5A1ZNF469string_interaction
COL5A2COL5A3string_interaction
COL5A2COL6A1string_interaction
COL5A3COL6A1string_interaction
COL5A3TSC1string_interaction
CSNK1ERORAbiogrid_interaction
DOCK9RAB3GAP1string_interaction
DOCK9ZNF469string_interaction
FBXW5SRFBP1biogrid_interaction, intact
FBXW5TIPE3string_interaction
FBXW5TSC1string_interaction
FNDC3BLCN12string_interaction
FNDC3BZNF469string_interaction
LCN12ZNF469string_interaction
MRPS31WBP4string_interaction
RAB3GAP1ZNF469string_interaction
RXRATHRBbiogrid_interaction, intact

Structural data

PDB: 37 · AlphaFold-only: 38 · No structure: 0

Cohort genes with PDB structures (top 30)

SymbolUniProtPDB entries
RXRAP19793110
MRPS14O6078377
MRPS31Q9266576
THRBP1082834
TRBP0DSE223
C8GP0736015
COL1A1P0245214
TRAP0DSE113
CHRNEQ0484413
WBP4O7555411
INTS8Q75QN210
RPLP2P053878
ALDH3A1P308387
AIFM3Q96NN97
TSC1Q925745
DOCK9Q9BZ294
BANPQ8N9N54
VANGL1Q8TAA94
COL4A1P024624
RORAP353983
AAGABQ6PD743
EPS8L2Q9H6S32
CSNK1EP496742
GATD1Q8NB372
SPAG7O753911
TALDO1P378371
TFAP2BQ924811
UBXN4Q925751
SKAP1Q86WV11
CCNE2O960201

AlphaFold-only cohort genes (top 30 by pLDDT)

SymbolUniProtpLDDT
DENND10P1Q6NSW590.55
NAA16Q6N06989.49
ACTL7BQ9Y61488.35
NDUFAF6Q330K287.70
SLC25A15Q9Y61987.43
LCN12Q6JVE586.62
WNT1P0462886.53
FBXW5Q969U684.64
SLC52A1Q9NWF484.13
UCMAQ8WVF282.20
SLC47A2Q86VL882.05
SPATA17Q96L0381.98
SGCAQ1658680.15
RNF167Q9H6Y779.08
SLC25A22Q9H93678.48
FAM76BQ5HYJ376.41
FNDC3BQ53EP075.78
ZFP3Q96NJ675.03
PNPLA2Q96AD572.56
DOP1BQ9Y3R572.36
RAB11FIP4Q86YS372.35
STK35Q8TDR267.52
DAB2IPQ5VWQ865.74
SRFBP1Q8NEF962.20
CD34P2890660.99
CEND1Q8N11160.87
CAMTA2O9498360.82
INCA1Q0VD8660.75
BHLHE40O1450360.60
TP53INP1Q96A5658.45

Function

Pathway analysis

Distinct Reactome pathways touched by cohort: 450. Enrichment computed across 118 evidence-associated genes (82 with Reactome annotation).

Pathways by enrichment

Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 82 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.

PathwayCohort genesFoldFDRSample cohort genes
Assembly of collagen fibrils and other multimeric structures819.6×3e-06COL1A1, COL5A3, COL12A1, COL4A1, COL5A1, COL5A2, COL6A1, LOX
Collagen chain trimerization722.2×4e-06COL1A1, COL5A3, COL12A1, COL4A1, COL5A1, COL5A2, COL6A1
Collagen biosynthesis and modifying enzymes816.6×4e-06COL1A1, COL5A3, COL12A1, COL4A1, COL5A1, COL5A2, COL6A1, PLOD1
Fibronectin matrix formation534.8×3e-05COL1A1, COL5A3, COL4A1, COL5A1, COL5A2
Collagen degradation715.0×3e-05COL1A1, COL5A3, COL12A1, COL4A1, COL5A1, COL5A2, COL6A1
Syndecan interactions525.8×7e-05COL1A1, COL5A3, COL5A1, COL5A2, ITGA2
Non-integrin membrane-ECM interactions713.2×7e-05COL1A1, SGCA, COL5A3, COL4A1, COL5A1, COL5A2, ITGA2
ECM proteoglycans712.8×7e-05COL1A1, COL5A3, COL4A1, COL5A1, COL5A2, COL6A1, ITGA2
MET activates PTK2 signaling523.2×1e-04COL1A1, COL5A3, COL5A1, COL5A2, ITGA2
Integrin cell surface interactions711.5×1e-04COL1A1, COL5A3, COL4A1, COL5A1, COL5A2, COL6A1, ITGA2
Signaling by PDGF515.5×7e-04COL5A3, COL4A1, COL5A1, COL5A2, COL6A1
Attachment of bacteria to epithelial cells424.2×7e-04COL5A3, COL4A1, COL5A1, COL5A2
NCAM1 interactions515.1×7e-04COL5A3, COL4A1, COL5A1, COL5A2, COL6A1
R-HSA-400253416.9×0.003RORA, BHLHE40, RXRA, CSNK1E
Platelet Adhesion to exposed collagen324.6×0.007COL1A1, GP1BA, ITGA2
Crosslinking of collagen fibrils320.9×0.010COL1A1, COL4A1, LOX
Developmental Lineage of Pancreatic Ductal Cells411.1×0.012COL1A1, COL5A3, COL5A1, COL5A2
Enhanced binding of GP1BA variant to VWF multimer:collagen239.8×0.025COL1A1, GP1BA
Defective binding of VWF variant to GPIb:IX:V239.8×0.025COL1A1, GP1BA
SUMO E3 ligases SUMOylate target proteins48.7×0.025RORA, RXRA, TFAP2B, THRB
Disassembly of the destruction complex and recruitment of AXIN to the membrane313.1×0.032WNT1, FZD2, LRP6
SUMOylation48.0×0.032RORA, RXRA, TFAP2B, THRB
Calcineurin activates NFAT230.9×0.034NFATC3, PPP3CA
SUMOylation of intracellular receptors312.3×0.034RORA, RXRA, THRB
CLEC7A (Dectin-1) induces NFAT activation225.3×0.048NFATC3, PPP3CA
GP1b-IX-V activation signalling223.2×0.055COL1A1, GP1BA
RNA Polymerase II Transcription102.8×0.055RORA, RXRA, TFAP2B, THRB, BANP, CCNE2, MAML2, TP53INP1 (+2 more)
Gastrulation39.5×0.061TFAP2B, MAML2, SMAD3
Anchoring fibril formation218.6×0.077COL1A1, COL4A1
TCF dependent signaling in response to WNT45.7×0.077WNT1, CSNK1E, FZD2, LRP6

GO biological processes by enrichment

Over-representation of cohort genes vs the genome-wide background (hypergeometric test, Benjamini-Hochberg FDR; fold = observed/expected over 108 annotated cohort genes). Counts and members are kept as ground-truth; sorted by enrichment.

GO termCohort genesFoldFDRSample cohort genes
collagen fibril organization1020.8×4e-08COL1A1, COL5A3, COL12A1, COL4A1, COL5A1, COL5A2, COL6A1, LOX (+2 more)
rhombomere 4 development2156.0×0.011HOXB1, HOXB2
negative regulation of endodermal cell differentiation2156.0×0.011COL5A1, COL5A2
detection of tumor cell2104.0×0.011TRA, TRB
T cell mediated cytotoxicity directed against tumor cell target2104.0×0.011TRA, TRB
regulation of cell-matrix adhesion336.0×0.011TSC1, NF1, PKHD1
collagen biosynthetic process329.3×0.011COL1A1, COL5A1, PLOD1
negative regulation of fibroblast proliferation418.4×0.011DAB2IP, TP53INP1, INCA1, NF1
skin development416.4×0.011TFAP2B, COL5A3, COL5A1, COL5A2
canonical Wnt signaling pathway68.5×0.011WNT1, COL6A1, CSNK1E, FOXO1, FZD2, LRP6
positive regulation of DNA-templated transcription133.4×0.011COL1A1, RORA, RXRA, TFAP2B, WNT1, SKAP1, TP53INP1, NAA16 (+5 more)
positive regulation of transcription by RNA polymerase II192.6×0.011RORA, RXRA, TFAP2B, THRB, WNT1, SKAP1, MAML2, DAB2IP (+11 more)
positive regulation of thyroid hormone receptor signaling pathway278.0×0.017RXRA, THRB
eye morphogenesis278.0×0.017COL5A1, COL5A2
calcineurin-NFAT signaling cascade323.4×0.017NFATC3, PPP3CA, PPP3CC
negative regulation of calcium ion import across plasma membrane262.4×0.023PPP3CA, PPP3CC
negative regulation of fat cell differentiation411.6×0.023RORA, WNT1, FOXO1, SMAD3
cellular response to amino acid stimulus411.3×0.023COL1A1, COL4A1, COL5A2, COL6A1
regulation of striated muscle tissue development252.0×0.031LOX, SMAD3
transdifferentiation239.0×0.051CD34, SMAD3
regulation of calcium ion transmembrane transport239.0×0.051ATP1B1, PIK3CG
facial nerve structural organization234.7×0.058HOXB1, HOXB2
regulation of megakaryocyte differentiation234.7×0.058EIF6, LOX
negative regulation of cardiac muscle hypertrophy in response to stress234.7×0.058FOXO1, SMAD3
positive regulation of calcium ion import across plasma membrane231.2×0.064PPP3CA, PPP3CC
excitatory postsynaptic potential312.3×0.064RAB3GAP1, GRIN2B, PPP3CA
phosphatidylinositol 3-kinase/protein kinase B signal transduction47.8×0.064COL6A1, IGF1R, NF1, PIK3CG
adaptive immune response64.7×0.064TRA, TRB, TSC1, SKAP1, CD46, PIK3CG
osteoblast differentiation55.6×0.067COL1A1, COL6A1, UCMA, LOX, NF1
calcineurin-mediated signaling228.4×0.070PPP3CA, PPP3CC

Therapeutics

Drugs indicated for this disease

0 approved, 1 in late-stage (phase 3) trials. Disease-direct ChEMBL indications, not inferred from the associated-gene cohort below.

DrugDevelopment status
RiboflavinPhase 3 (in late-stage trials)

Earlier-phase candidates (phase 2, investigational — efficacy not yet established): Hydroxocobalamin, Mitomycin.

Drug target analysis

Approved (phase 4): 8 · Phase ≥3: 8 · Phased (≥1): 9 · Undrugged: 66

Druggability breadth: 47 of 118 evidence-associated genes (40%) have a ChEMBL target (buckets above are over the deeply-mined display cohort).

Genes with an approved drug

The molecule shown is one approved compound that hits the gene — not necessarily a drug of choice or one indicated for this disease.

SymbolExample approved molecule
RORATRETINOIN
RXRABEXAROTENE
THRBAMINOCAPROIC ACID
CCNE2PALBOCICLIB
STK35NERATINIB
CHRNEMECAMYLAMINE
CSNK1EAFATINIB
SLC47A2EPINASTINE

Top cohort targets by molecule count

SymbolMoleculesMax phase
THRB1174
CSNK1E374
SLC47A2284
RXRA234
STK35184
CCNE2124
CHRNE44
RORA24
WNT111
COL1A100

Drugs targeting cohort genes (top 30)

MoleculeMax phaseTargets in cohort
TRETINOIN4RORA, RXRA
BEXAROTENE4RXRA
OXAPROZIN4RXRA
PITAVASTATIN CALCIUM4RXRA
SULINDAC4RXRA
LEVOTHYROXINE4RXRA, THRB
DOMPERIDONE4RXRA, SLC47A2
FLUVASTATIN4RXRA
ALITRETINOIN4RXRA
ROSIGLITAZONE MALEATE4RXRA
PIOGLITAZONE HYDROCHLORIDE4RXRA
MECLOFENAMIC ACID4RXRA
CALCITRIOL4RXRA
AMINOCAPROIC ACID4THRB
INDIGOTINDISULFONATE4THRB
CHLORMADINONE ACETATE4THRB
AMOXAPINE4THRB
IDARUBICIN4THRB
DYCLONINE HYDROCHLORIDE4THRB
ISOSORBIDE4THRB
CLOMIPRAMINE HYDROCHLORIDE4THRB
CHLORMEZANONE4THRB
PHENOXYBENZAMINE HYDROCHLORIDE4THRB
METHYSERGIDE MALEATE4THRB
LIOTHYRONINE SODIUM4THRB
DYCLONINE4THRB
ROSE BENGAL FREE ACID4THRB
INAMRINONE4THRB
MOLSIDOMINE4THRB
METRONIDAZOLE4THRB

Bioactivity and enzyme data

Enzyme cohort genes (≥1 EC): 2.

Cohort genes with ChEMBL bioactivity (full, sorted by assay count)

SymbolAssaysType breakdown
RXRA846Binding:703, Functional:138, ADMET:5
CSNK1E420Binding:416, ADMET:2, Functional:2
THRB169Binding:129, Functional:40
CCNE2159Binding:159
RORA115Binding:111, Functional:3, Unclassified:1
STK3594Binding:94
ALDH3A142Binding:38, ADMET:4
CHRNE28Binding:26, Functional:2
SLC47A217Binding:14, ADMET:3
WNT110Binding:10
COL1A18Binding:8
DAB2IP8Binding:8
INTS86Binding:6
PNPLA25Binding:5
RPLP22Binding:2
TALDO11Binding:1
ZFP31Binding:1
UBXN41Binding:1

Cohort enzymes (BRENDA EC)

SymbolEC numbersNames
ALDH3A11.2.1.5aldehyde dehydrogenase [NAD(P)+]
CSNK1E2.7.11.1non-specific serine/threonine protein kinase

Cohort genes with high screening signal

≥100 ChEMBL assays — a studied-ness signal; see Therapeutics for approved-drug status.

SymbolChEMBL assays
RORA115
RXRA846
THRB169
CCNE2159
CSNK1E420

Pharmacogenomics

Cohort genes with a PharmGKB record: 74; with CPIC/DPWG dosing guidelines: 0.

No cohort gene has a CPIC/DPWG genotype-guided dosing guideline (PharmGKB).

Chemical tractability of cohort targets

30 approved/phased compounds have measured bioactivity against a cohort gene (and aren’t yet in disease-level trials). This is a research / tractability signal, NOT a therapeutic recommendation — a bioactivity row often reflects off-target or screening binding (e.g. promiscuous kinase inhibitors against a cohort kinase), implying no disease mechanism.

CompoundMax phaseCohort target (bioactivity)
TRETINOIN4RORA, RXRA
BEXAROTENE4RXRA
OXAPROZIN4RXRA
PITAVASTATIN CALCIUM4RXRA
SULINDAC4RXRA
LEVOTHYROXINE4RXRA, THRB
DOMPERIDONE4RXRA, SLC47A2
FLUVASTATIN4RXRA
ALITRETINOIN4RXRA
ROSIGLITAZONE MALEATE4RXRA
PIOGLITAZONE HYDROCHLORIDE4RXRA
MECLOFENAMIC ACID4RXRA
CALCITRIOL4RXRA
AMINOCAPROIC ACID4THRB
INDIGOTINDISULFONATE4THRB
CHLORMADINONE ACETATE4THRB
AMOXAPINE4THRB
IDARUBICIN4THRB
DYCLONINE HYDROCHLORIDE4THRB
ISOSORBIDE4THRB
CLOMIPRAMINE HYDROCHLORIDE4THRB
CHLORMEZANONE4THRB
PHENOXYBENZAMINE HYDROCHLORIDE4THRB
METHYSERGIDE MALEATE4THRB
LIOTHYRONINE SODIUM4THRB
DYCLONINE4THRB
ROSE BENGAL FREE ACID4THRB
INAMRINONE4THRB
MOLSIDOMINE4THRB
METRONIDAZOLE4THRB

Druggability pyramid

Cohort genes binned by druggability tier (high → low):

TierDefinitionGenesSymbols
AApproved (phase 4 drug)8RORA, RXRA, THRB, CCNE2, STK35, CHRNE, CSNK1E, SLC47A2
BPhased (≥1) drug, not yet approved1WNT1
CDruggable family + PDB, no drug5ALDH3A1, TRA, TRB, PIDD1, COL12A1
DDruggable family + AlphaFold only, no drug3CAMTA2, SLC25A22, FNDC3B
EDifficult family or no structure, no drug58COL1A1, ZNF469, DOCK9, RPLP2, BHLHE40, SGCA, SLC25A15, SPAG7, TALDO1, TFAP2B (+48 more)

Undrugged target profiles

66 cohort genes are undrugged. Ranked by ‘starting-point quality’ (assay depth + drugged-partner adjacency).

SymbolChEMBL assaysDrugged partners (top 3)
COL1A18
ZNF4690
ALDH3A142
DOCK90
RPLP22
BHLHE400
SGCA0
SLC25A150
SPAG70
TALDO11
TFAP2B0
TRA0
TRB0
TSC10
WBP40
ZFP31
DOP1B0
BANP0
C8G0
FBXW50
MRPS140
UBXN41
COL5A30
VANGL10
SKAP10
ACTL7B0
MAML20
PIDD10
CD340
MRPS310

Clinical trials & evidence

Clinical trials

Clinical trials: 279.

Phase distribution (across all retrieved trials)

PhaseTrials
Not specified197
PHASE330
PHASE220
PHASE2/PHASE311
PHASE1/PHASE28
PHASE46
PHASE15
EARLY_PHASE12

Top trials by phase / activity

NCTPhaseStatusTitle
NCT07474870PHASE4NOT_YET_RECRUITINGOutcomes of CTAK Surgery
NCT01485211PHASE4COMPLETEDCorneal Thickness Changes During Corneal Collagen Cross-linking With Ultraviolet-A Irradiation and Riboflavin
NCT02119039PHASE4COMPLETEDEffect of CACICOL20 on Corneal Epithelial Healing After Cross-linking in Patients With Keratoconus
NCT03245853PHASE4COMPLETEDEpi-On Corneal Crosslinking for Keratoconus
NCT03429569PHASE4UNKNOWNCross-Linking ACcéléré Iontophorèse Confocal kératocONE
NCT04427956PHASE4COMPLETEDCorneal Crosslinking Treatment Study
NCT01604135PHASE3ACTIVE_NOT_RECRUITINGCollagen Crosslinking for Keratoconus - a Randomized Controlled Clinical Trial
NCT03922542PHASE2/PHASE3RECRUITINGComparison of Standard vs. Accelerated Corneal Crosslinking
NCT06100939PHASE3ACTIVE_NOT_RECRUITINGEpithelium-On Corneal Cross-linking in Subjects 8 to 45 Years of Age With Keratoconus
NCT06100952PHASE3ACTIVE_NOT_RECRUITINGEpithelium-On Corneal Cross-linking in Subjects 8 to 45 Years of Age with Keratoconus
NCT06450470PHASE3RECRUITINGUse of a Freeze-dried Amniotic Membrane Post Crosslinking in Subjects With Progressive Keratoconus
NCT06601101PHASE3RECRUITINGEffects of Topical Insulin on Corneal Epithelium Healing After Corneal Crosslinking in Patients With Keratoconus
NCT07124910PHASE3RECRUITINGComparison of Epi-ON Corneal Collagen Crosslinking Performed Using an 18-Minute UVA Exposure vs. a 24-Minute UVA Exposure on Eyes With Ectatic Corneal Diseases
NCT07135167PHASE3RECRUITINGCompassionate Use Study of Epi-ON Corneal Collagen Crosslinking Performed Using UVA Exposure on Eyes With Ectatic Corneal Diseases for Subjects With Down Syndrome
NCT00371202PHASE3UNKNOWNComparison of Penetrating Keratoplasty and Deep Lamellar Keratoplasty With the Big Bubble Technique for Keratoconus
NCT00435799PHASE2/PHASE3UNKNOWNEvaluating Collagen Cross-Linking (CCL) Treatment in Norway
NCT00567671PHASE2/PHASE3COMPLETEDCorneal Collagen Cross-Linking for the Treatment of Progressive Keratoconus and Corneal Ectasia
NCT00626717PHASE2/PHASE3COMPLETEDRiboflavin Mediated Corneal Crosslinking for Stabilizing Progression of Keratoconus
NCT00647699PHASE3COMPLETEDCorneal Collagen Cross-linking for Progressive Keratoconus
NCT00679666PHASE2/PHASE3WITHDRAWNCorneal Crosslinking in Keratoconus and Corneal Ectasia
NCT00815256PHASE3UNKNOWNSafety and Effectiveness of Collagen Cross Linking in Progressive Mild and Moderate Keratoconus
NCT00841386PHASE2/PHASE3UNKNOWNTreatment of Keratoconus Using Collagen Cross-Linking
NCT00887900PHASE3COMPLETEDDeep Anterior Lamellar Keratoplasty (DALK)
NCT01081561PHASE2/PHASE3UNKNOWNCollagen Cross-Linking for Keratoconus/Ectasia With and Without Intacs
NCT01112072PHASE3UNKNOWNCorneal Collagen Crosslinking and Intacs for Keratoconus and Ectasia
NCT01152541PHASE3UNKNOWNCorneal Collagen Crosslinking for Progressive Keratoconus and Ectasia Using Riboflavin/Dextran and Hypotonic Riboflavin
NCT01190306PHASE3TERMINATEDSafety Study of the VEGA UV-A System to Treat Keratoconus
NCT01344187PHASE3COMPLETEDSafety and Efficacy Study of Corneal Collagen Cross-Linking in Eyes With Keratoconus
NCT01459679PHASE3TERMINATEDSafety & Efficacy of Corneal Collagen Cross-Linking in Eyes With Keratoconus or Corneal Ectasia After Refractive Surgery
NCT01464268PHASE3UNKNOWNTransepithelial Corneal Collagen Crosslinking for Keratoconus and Corneal Ectasia
NCT01643226PHASE3COMPLETEDSafety and Efficacy Study of Corneal Collagen Cross-Linking in Eyes With Keratoconus
NCT01672814PHASE3COMPLETEDMicrowave Treatment and Corneal Collagen Crosslinking for Keratoconus
NCT01682993PHASE3TERMINATEDCorneal Cross Linking and Topography Guided Excimer Laser Treatment
NCT01743443PHASE2/PHASE3COMPLETEDCorneal Sensitivity Cross-linking Keratoconus
NCT01972854PHASE3TERMINATEDSafety and Efficacy of Corneal Collagen Cross-Linking in Eyes With Keratoconus
NCT02613780PHASE3UNKNOWNRefractive Treatment of Early Keratoconus
NCT02638376PHASE3UNKNOWNEvaluating the Safety and Efficacy of the KXL System for Corneal Collagen Cross-Linking in Eyes Having Keratoconus
NCT03080077PHASE3UNKNOWNSafety and Effectiveness of Corneal Crosslinking (CXL): Keratoconus and Post-Refractive Ectasia
NCT03187912PHASE3COMPLETEDAccelerated Corneal Cross-linking With Different Riboflavin Solutions
NCT03442751PHASE3COMPLETEDStudy to Evaluate the Safety and Efficacy of Epi-on Corneal Cross-linking in Eyes With Progressive Keratoconus

Drugs tested across these trials (top 30)

MoleculeMax phaseTrials referencing
RIBOFLAVIN433
FLUOROMETHOLONE ACETATE42
PREDNISOLONE ACETATE42
DEXAMETHASONE SODIUM PHOSPHATE41
HEXACHLOROPHENE41
INSULIN HUMAN41
LOTEPREDNOL ETABONATE41
RIBOFLAVIN 5’-PHOSPHATE41
DEXTRAN31
DEXTRAN 121
CHEMBL511565033
CHEMBL5267279033
CHEMBL120063701
CHEMBL474380601