SGF29

gene
On this page

Also known as FLJ32446TDRD29

Summary

SGF29 (SAGA complex associated factor 29, HGNC:25156) is a protein-coding gene on chromosome 16p11.2, encoding SAGA-associated factor 29 (Q96ES7). Chromatin reader component of some histone acetyltransferase (HAT) SAGA-type complexes like the TFTC-HAT, ATAC or STAGA complexes. It is a selective cancer dependency (DepMap: 58.5% of cell lines).

CCDC101 is a subunit of 2 histone acetyltransferase complexes: the ADA2A (TADA2A; MIM 602276)-containing (ATAC) complex and the SPT3 (SUPT3H; MIM 602947)-TAF9 (MIM 600822)-GCN5 (KAT2A; MIM 602301)/PCAF (KAT2B; MIM 602303) acetylase (STAGA) complex. Both of these complexes contain either GCN5 or PCAF, which are paralogous acetyltransferases (Wang et al., 2008 [PubMed 18838386]).

Source: NCBI Gene 112869 — RefSeq curated summary.

At a glance

  • GWAS associations: 25
  • Clinical variants (ClinVar): 35 total
  • Druggable target: yes
  • Cancer dependency (DepMap): dependent in 58.5% of screened cell lines
  • MANE Select transcript: NM_138414

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:25156
Approved symbolSGF29
NameSAGA complex associated factor 29
Location16p11.2
Locus typegene with protein product
StatusApproved
AliasesFLJ32446, TDRD29
Ensembl geneENSG00000176476
Ensembl biotypeprotein_coding
OMIM613374
Entrez112869

Gene structure

Transcript identifiers

Ensembl transcripts: 15 — 10 protein_coding, 4 retained_intron, 1 nonsense_mediated_decay

ENST00000317058, ENST00000564023, ENST00000564682, ENST00000565945, ENST00000567447, ENST00000567564, ENST00000569581, ENST00000898115, ENST00000898116, ENST00000898117, ENST00000898118, ENST00000898119, ENST00000898120, ENST00000898121, ENST00000930812

RefSeq mRNA: 1 — MANE Select: NM_138414 NM_138414

CCDS: CCDS10635

Canonical transcript exons

ENST00000317058 — 10 exons

ExonStartEnd
ENSE000012564642859029628590442
ENSE000026130522855392028554097
ENSE000034843102858491328584988
ENSE000034907272859063128590666
ENSE000035645532859009628590225
ENSE000035745772858105528581144
ENSE000035837492859159028591790
ENSE000036466992859077328590935
ENSE000036506142858910028589164
ENSE000036727832858564828585720

Expression profiles

Bgee: expression breadth ubiquitous, 268 present calls, max score 94.73.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 20.4914 / max 171.4036, expressed in 1808 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
15338213.32771786
1533846.41921637
1533830.7186435
1533850.025911

Top tissues by expression

284 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
hindlimb stylopod muscleUBERON:000425294.73gold quality
right testisUBERON:000453494.53gold quality
left testisUBERON:000453394.47gold quality
testisUBERON:000047392.98gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099192.91gold quality
left ovaryUBERON:000211992.81gold quality
body of uterusUBERON:000985392.71gold quality
gastrocnemiusUBERON:000138892.63gold quality
left uterine tubeUBERON:000130392.44gold quality
muscle of legUBERON:000138392.28gold quality
right ovaryUBERON:000211892.23gold quality
muscle layer of sigmoid colonUBERON:003580592.15gold quality
granulocyteCL:000009491.95gold quality
endocervixUBERON:000045891.49gold quality
muscle organUBERON:000163091.38gold quality
esophagogastric junction muscularis propriaUBERON:003584191.13gold quality
mucosa of stomachUBERON:000119991.12gold quality
body of pancreasUBERON:000115091.09gold quality
triceps brachiiUBERON:000150990.87gold quality
right lobe of thyroid glandUBERON:000111990.67gold quality
lower esophagus muscularis layerUBERON:003583390.62gold quality
ovaryUBERON:000099290.61gold quality
right lobe of liverUBERON:000111490.61gold quality
lower esophagusUBERON:001347390.60gold quality
left lobe of thyroid glandUBERON:000112090.49gold quality
popliteal arteryUBERON:000225090.43gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047390.42gold quality
tibial arteryUBERON:000761090.41gold quality
apex of heartUBERON:000209890.35gold quality
gluteal muscleUBERON:000200090.23gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes6.72
E-GEOD-124858no24.95

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): SRY

Functional genomics

DepMap (CRISPR cell-line fitness): dependent in 58.5% of screened cell lines.

Literature-anchored findings (GeneRIF, showing 3)

  • Results suggest that the H3K4me3-binding protein SGF29 plays a central and dual role in the ER stress response. (PMID:23894581)
  • Loss of function SGF29 mutation is associated with Cryptococcus neoformans hypervirulence. (PMID:29263343)
  • Transcriptional control of leukemogenesis by the chromatin reader SGF29. (PMID:38048593)

Cross-species orthologs

6 orthologs

OrganismSymbolGene ID
danio_reriosgf29ENSDARG00000002339
mus_musculusSgf29ENSMUSG00000030714
rattus_norvegicusSgf29ENSRNOG00000019245
drosophila_melanogasterSgf29FBGN0050390
caenorhabditis_elegansWBGENE00020673
caenorhabditis_elegansWBGENE00021205

Protein

Protein identifiers

SAGA-associated factor 29Q96ES7 (reviewed: Q96ES7)

Alternative names: Coiled-coil domain-containing protein 101, SAGA complex-associated factor 29

All UniProt accessions (3): Q96ES7, H3BN84, H3BPL5

UniProt curated annotations — full annotation on UniProt →

Function. Chromatin reader component of some histone acetyltransferase (HAT) SAGA-type complexes like the TFTC-HAT, ATAC or STAGA complexes. SGF29 specifically recognizes and binds methylated ‘Lys-4’ of histone H3 (H3K4me), with a preference for trimethylated form (H3K4me3). In the SAGA-type complexes, SGF29 is required to recruit complexes to H3K4me. Involved in the response to endoplasmic reticulum (ER) stress by recruiting the SAGA complex to H3K4me, thereby promoting histone H3 acetylation and cell survival. Also binds non-histone proteins that are methylated on Lys residues: specifically recognizes and binds CGAS monomethylated on ‘Lys-506’.

Subunit / interactions. Interacts with dimethylated and trimethylated ‘Lys-4’ of histone H3 (H3K4me2 and H3K4me3), with a preference for the trimethylated form (H3K4me3). Component of some SAGA-type complexes. Component of the ADA2A-containing complex (ATAC), composed of KAT14, KAT2A, TADA2L, TADA3L, ZZ3, MBIP, WDR5, YEATS2, CCDC101 and DR1. Interacts with (methylated) CGAS. Interacts with TADA3L, GCN5L2, SUPT3H and MYC.

Subcellular location. Nucleus.

Domain organisation. The SGF29 C-terminal (also named tudor-like) domain mediates binding to methylated ‘Lys-4’ of histone H3 (H3K4me), with a preference for trimethylated form (H3K4me3).

Similarity. Belongs to the SGF29 family.

RefSeq proteins (1): NP_612423* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR010750SGF29_tudor-like_domDomain
IPR037802SGF29Family
IPR047287Tudor_SGF29_rpt2Domain
IPR047288Tudor_SGF29_rpt1Domain

Pfam: PF07039

UniProt features (45 total): mutagenesis site 18, strand 11, helix 4, region of interest 3, turn 2, site 2, chain 1, domain 1, sequence conflict 1, coiled-coil region 1, modified residue 1

Structure

Experimental structures (PDB)

8 structures.

PDBMethodResolution (Å)
3MEAX-RAY DIFFRACTION1.26
3MEUX-RAY DIFFRACTION1.28
3ME9X-RAY DIFFRACTION1.37
5C0MX-RAY DIFFRACTION1.6
3LX7X-RAY DIFFRACTION1.78
3MEVX-RAY DIFFRACTION1.83
3MEWX-RAY DIFFRACTION1.92
3METX-RAY DIFFRACTION2

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q96ES7-F191.790.83

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (2): 238 (histone h3k4me3 binding); 245 (histone h3k4me3 binding)

Post-translational modifications (1): 288

Mutagenesis-validated functional residues (18):

PositionPhenotype
179does not strongly affect binding to h3k4me.
194abolishes h3k4me3 binding.
196abolishes h3k4me3 binding.
214does not strongly affect binding to h3k4me.
232does not strongly affect binding to h3k4me.
238strongly reduced h3k4me3 binding.
238does not affect binding to h3k4me3.
240slightly reduced h3k4me3 binding.
242almost abolished h3k4me3 binding.
245abolishes h3k4me3 binding.
245reduced h3k4me3 binding.
256does not strongly affect binding to h3k4me.
264strongly reduced binding to h3k4me3.
266strongly reduced binding to h3k4me3.
266does not affect binding to h3k4me3.
266slightly reduced binding to h3k4me3.
282does not strongly affect binding to h3k4me.
175does not strongly affect binding to h3k4me.

Function

Pathways and Gene Ontology

Reactome pathways

7 pathways

IDPathway
R-HSA-3214847HATs acetylate histones
R-HSA-9772755Formation of WDR5-containing histone-modifying complexes
R-HSA-212165Epigenetic regulation of gene expression
R-HSA-3247509Chromatin modifying enzymes
R-HSA-4839726Chromatin organization
R-HSA-74160Gene expression (Transcription)
R-HSA-9917777Epigenetic regulation by WDR5-containing histone modifying complexes

MSigDB gene sets: 153 (showing top): GOBP_RESPONSE_TO_ENDOPLASMIC_RETICULUM_STRESS, GOBP_REGULATION_OF_PROTEIN_DEACETYLATION, GOBP_ESTABLISHMENT_OF_PROTEIN_LOCALIZATION_TO_CHROMOSOME, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_UP, GOBP_MACROMOLECULE_DEACYLATION, AACYNNNNTTCCS_UNKNOWN, GOBP_ESTABLISHMENT_OF_PROTEIN_LOCALIZATION_TO_ORGANELLE, GOBP_REGULATION_OF_DNA_REPAIR, CREIGHTON_ENDOCRINE_THERAPY_RESISTANCE_1, chr16p11, GOBP_REGULATION_OF_CELL_CYCLE, GOBP_REGULATION_OF_RESPONSE_TO_STRESS, GOBP_RNA_SPLICING, GOBP_REGULATION_OF_CELL_DIVISION, GOBP_DNA_DAMAGE_RESPONSE

GO Biological Process (17): negative regulation of transcription by RNA polymerase II (GO:0000122), DNA repair (GO:0006281), regulation of DNA repair (GO:0006282), regulation of DNA-templated transcription (GO:0006355), regulation of transcription by RNA polymerase II (GO:0006357), response to endoplasmic reticulum stress (GO:0034976), regulation of RNA splicing (GO:0043484), negative regulation of DNA repair (GO:0045738), transcription initiation-coupled chromatin remodeling (GO:0045815), positive regulation of DNA-templated transcription (GO:0045893), regulation of embryonic development (GO:0045995), regulation of cell division (GO:0051302), regulation of cell cycle (GO:0051726), establishment of protein localization to chromatin (GO:0071169), chromatin organization (GO:0006325), regulation of nucleobase-containing compound metabolic process (GO:0019219), regulation of macromolecule metabolic process (GO:0060255)

GO Molecular Function (4): histone H3K4me3 reader activity (GO:0140002), histone H3K36me3 reader activity (GO:0140003), methylation-dependent protein binding (GO:0140034), protein binding (GO:0005515)

GO Cellular Component (6): SAGA complex (GO:0000124), nucleoplasm (GO:0005654), SAGA-type complex (GO:0070461), mitotic spindle (GO:0072686), ATAC complex (GO:0140672), nucleus (GO:0005634)

Reactome top-level categories

Rollup of top-5 pathways:

CategoryPathways
Chromatin modifying enzymes1
Epigenetic regulation by WDR5-containing histone modifying complexes1
Gene expression (Transcription)1
Chromatin organization1
Epigenetic regulation of gene expression1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
transcription by RNA polymerase II2
DNA repair2
DNA-templated transcription2
regulation of gene expression2
regulation of DNA-templated transcription2
regulation of primary metabolic process2
regulation of cellular process2
histone H3 reader activity2
SAGA-type complex2
regulation of transcription by RNA polymerase II1
negative regulation of DNA-templated transcription1
DNA metabolic process1
DNA damage response1
regulation of DNA metabolic process1
regulation of cellular response to stress1
regulation of RNA biosynthetic process1
cellular response to stress1
RNA splicing1
regulation of DNA repair1
negative regulation of response to stimulus1
negative regulation of DNA metabolic process1
transcription initiation at RNA polymerase II promoter1
positive regulation of gene expression, epigenetic1
positive regulation of RNA biosynthetic process1
embryo development1
regulation of multicellular organismal development1
cell division1
cell cycle1
establishment of protein localization to chromosome1
cellular component organization1
nucleobase-containing compound metabolic process1
regulation of metabolic process1
macromolecule metabolic process1
modification-dependent protein binding1
binding1
DUBm complex1
peptidase complex1
nuclear lumen1
cellular anatomical structure1
histone acetyltransferase complex1

Protein interactions and networks

STRING

1494 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SGF29KAT2AQ92830998
SGF29TADA3O75528998
SGF29KAT2BQ92831997
SGF29TADA2BQ86TJ2982
SGF29TADA2AO75478982
SGF29TAF9Q16594895
SGF29ATXN7O15265894
SGF29YEATS2Q9ULM3855
SGF29ENY2Q9NPA8829
SGF29TAF5Q15542825
SGF29TAF6P49848819
SGF29SF3B5Q9BWJ5806
SGF29SUPT3HO75486804
SGF29TAF12Q16514792
SGF29TRRAPQ9Y4A5782

IntAct

254 interactions, top by confidence:

ABTypeScore
TADA3SGF29psi-mi:“MI:0915”(physical association)0.940
SGF29TADA3psi-mi:“MI:0915”(physical association)0.940
SPC24NDC80psi-mi:“MI:0914”(association)0.920
SGF29NDC80psi-mi:“MI:0915”(physical association)0.840
TCF4SGF29psi-mi:“MI:0915”(physical association)0.840
SGF29TCF4psi-mi:“MI:0915”(physical association)0.840
NDC80SGF29psi-mi:“MI:0915”(physical association)0.840
SGF29NDC80psi-mi:“MI:0914”(association)0.840
KRT15SGF29psi-mi:“MI:0915”(physical association)0.780
RINT1SGF29psi-mi:“MI:0915”(physical association)0.780
SGF29KRT15psi-mi:“MI:0915”(physical association)0.780
SGF29RINT1psi-mi:“MI:0915”(physical association)0.780
TAF12TAF4psi-mi:“MI:0914”(association)0.760
BECN1SGF29psi-mi:“MI:0915”(physical association)0.740
TRRAPATXN7psi-mi:“MI:0914”(association)0.740
KRT16SGF29psi-mi:“MI:0915”(physical association)0.670
KRT27SGF29psi-mi:“MI:0915”(physical association)0.670

BioGRID (406): CCDC101 (Two-hybrid), CCDC101 (Two-hybrid), CCDC101 (Two-hybrid), CCDC101 (Two-hybrid), CCDC101 (Two-hybrid), CCDC101 (Two-hybrid), CCDC101 (Two-hybrid), CCDC101 (Two-hybrid), CCDC101 (Two-hybrid), CLHC1 (Two-hybrid), CCDC101 (Reconstituted Complex), CCDC101 (Protein-peptide), ZZZ3 (Affinity Capture-MS), DR1 (Affinity Capture-MS), YEATS2 (Affinity Capture-MS)

ESM2 similar proteins: A2AFR3, A2VDY6, O94967, P0C606, P79987, Q03353, Q03354, Q03355, Q14161, Q1L8G7, Q3UX43, Q4R8N2, Q4R8V9, Q50H33, Q58A45, Q5RDU9, Q5RED8, Q5TKA1, Q5VUG0, Q5ZL38, Q641K1, Q68CL5, Q6AXS8, Q6GR30, Q6PAJ1, Q6ZWB6, Q7SXV2, Q7ZXY4, Q8C5G2, Q8C735, Q8CB44, Q8CGF6, Q8IWR0, Q8K2D3, Q8N6S4, Q8QFX1, Q8VDD9, Q8WUU5, Q8WWQ0, Q8WXG6

Diamond homologs: P0C606, Q5ZL38, Q8RXY6, Q96ES7, Q9DA08, Q9W2I4

SIGNOR signaling

1 interactions.

AEffectBMechanism
SGF29“form complex”“SAGA complex”binding

Enriched among interaction partners

Reactome pathways and GO biological processes over-represented among this gene’s 139 IntAct physical interaction partners (hypergeometric vs the genome-wide background, BH-FDR, gene-set size 15–500, ranked by fold). A functional readout of the neighbourhood — distinct from this gene’s own memberships above, and biased toward well-studied / hub proteins, so read it as themes rather than proof.

Reactome pathways:

PathwayPartnersFoldFDR
Formation of WDR5-containing histone-modifying complexes513.7×2e-03
HATs acetylate histones1411.4×9e-09
Deubiquitination810.2×3e-04
Epigenetic regulation by WDR5-containing histone modifying complexes69.6×2e-03
CHD1 and CHD2 subfamily77.8×2e-03
UCH proteinases67.7×6e-03
Chromatin organization97.6×4e-04
Formation of the beta-catenin:TCF transactivating complex67.4×7e-03

GO biological processes:

GO termPartnersFoldFDR
regulation of DNA repair1430.4×1e-14
regulation of RNA splicing1220.7×2e-10
morphogenesis of an epithelium719.0×1e-05
intermediate filament organization815.2×1e-05
RNA polymerase II preinitiation complex assembly715.0×6e-05
regulation of embryonic development513.0×4e-03
positive regulation of transcription initiation by RNA polymerase II612.8×9e-04
epithelial cell differentiation68.3×7e-03

Disease & clinical

Clinical variants and AI predictions

ClinVar

35 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance19
Likely benign1
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

1619 predictions. Top by Δscore:

VariantEffectΔscore
16:28554151:GGC:Gdonor_gain1.0000
16:28581041:C:CAacceptor_gain1.0000
16:28581050:ACCAG:Aacceptor_gain1.0000
16:28581054:G:GTacceptor_loss1.0000
16:28584984:CAAGA:Cdonor_gain1.0000
16:28584985:AAGA:Adonor_gain1.0000
16:28584986:AGA:Adonor_gain1.0000
16:28584986:AGAGT:Adonor_loss1.0000
16:28584987:GA:Gdonor_gain1.0000
16:28584987:GAG:Gdonor_gain1.0000
16:28584988:AGT:Adonor_loss1.0000
16:28584989:G:GGdonor_gain1.0000
16:28584990:TGA:Tdonor_loss1.0000
16:28584991:GAG:Gdonor_loss1.0000
16:28585645:CA:Cacceptor_loss1.0000
16:28585646:A:AGacceptor_gain1.0000
16:28585646:A:Cacceptor_loss1.0000
16:28585647:G:GCacceptor_gain1.0000
16:28585647:GT:Gacceptor_gain1.0000
16:28585647:GTTT:Gacceptor_gain1.0000
16:28585716:TGCAA:Tdonor_gain1.0000
16:28585717:GCAA:Gdonor_gain1.0000
16:28585717:GCAAG:Gdonor_gain1.0000
16:28585721:G:GGdonor_gain1.0000
16:28585722:T:Gdonor_loss1.0000
16:28589091:C:Aacceptor_gain1.0000
16:28589096:TCA:Tacceptor_loss1.0000
16:28589097:CA:Cacceptor_loss1.0000
16:28589098:A:AGacceptor_gain1.0000
16:28589098:A:Gacceptor_loss1.0000

AlphaMissense

1890 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
16:28581119:T:CL17P1.000
16:28584920:G:CR28P1.000
16:28585668:A:GK58E1.000
16:28585670:G:CK58N1.000
16:28585670:G:TK58N1.000
16:28585672:T:CL59P1.000
16:28585681:T:CL62P1.000
16:28585692:G:CA66P1.000
16:28585693:C:AA66D1.000
16:28585704:G:CA70P1.000
16:28590149:C:AR115S1.000
16:28590149:C:TR115C1.000
16:28590153:G:CR116T1.000
16:28590153:G:TR116I1.000
16:28590154:A:CR116S1.000
16:28590154:A:TR116S1.000
16:28590155:G:AG117R1.000
16:28590155:G:CG117R1.000
16:28590155:G:TG117W1.000
16:28590156:G:AG117E1.000
16:28590156:G:TG117V1.000
16:28590162:T:CL119P1.000
16:28590165:T:AM120K1.000
16:28590165:T:CM120T1.000
16:28590165:T:GM120R1.000
16:28590166:G:AM120I1.000
16:28590166:G:CM120I1.000
16:28590166:G:TM120I1.000
16:28590171:T:CL122P1.000
16:28590174:T:AL123Q1.000

dbSNP variants (sampled 300 via entrez): RS1000042689 (16:28558302 G>A), RS1000045131 (16:28554446 C>T), RS1000149731 (16:28561128 A>C), RS1000152063 (16:28588439 C>T), RS1000235224 (16:28571728 A>C,T), RS1000281675 (16:28577670 T>G), RS1000395246 (16:28584702 A>T), RS1000414573 (16:28553420 G>T), RS1000570113 (16:28589433 G>A), RS1000667300 (16:28558565 T>G), RS1000716704 (16:28576508 C>G,T), RS1000866832 (16:28570200 C>G), RS1000867028 (16:28584504 A>G,T), RS1001030780 (16:28553381 A>G), RS1001044318 (16:28563327 C>G,T)

Disease associations

OMIM: gene MIM:613374 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

25 associations (top):

StudyTraitp-value
GCST000531_2Inflammatory bowel disease (early onset)2.000000e-09
GCST002598_62Educational attainment1.000000e-06
GCST004131_83Inflammatory bowel disease2.000000e-12
GCST004132_69Crohn’s disease3.000000e-10
GCST004147_4Chronic obstructive pulmonary disease7.000000e-10
GCST005316_517Intelligence (MTAG)8.000000e-15
GCST006269_1207General cognitive ability4.000000e-08
GCST006269_605General cognitive ability3.000000e-13
GCST007044_23Extremely high intelligence2.000000e-08
GCST007293_116Body fat distribution (arm fat ratio)2.000000e-08
GCST007293_16Body fat distribution (arm fat ratio)4.000000e-09
GCST007293_43Body fat distribution (arm fat ratio)2.000000e-12
GCST007294_71Body fat distribution (trunk fat ratio)2.000000e-12
GCST007294_97Body fat distribution (trunk fat ratio)1.000000e-11
GCST007295_20Body fat distribution (leg fat ratio)3.000000e-06
GCST007295_44Body fat distribution (leg fat ratio)1.000000e-21
GCST007295_79Body fat distribution (leg fat ratio)2.000000e-24
GCST009733_134Urinary metabolite levels in chronic kidney disease3.000000e-31
GCST010002_111Refractive error3.000000e-09
GCST010133_15Lamb consumption3.000000e-08
GCST010703_152Brain morphology (MOSTest)3.000000e-09
GCST012020_152Serum metabolite levels1.000000e-45
GCST012021_77Serum metabolite levels1.000000e-45
GCST012227_381Hip circumference adjusted for BMI1.000000e-08
GCST90002402_456Platelet count3.000000e-12

EFO canonical traits (8, from GWAS)

EFO IDTrait name
EFO:0004784self reported educational attainment
EFO:0004337intelligence
EFO:0004341body fat distribution
EFO:0005116urinary metabolite measurement
EFO:0008111diet measurement
EFO:0004346neuroimaging measurement
EFO:0008039BMI-adjusted hip circumference
EFO:0004309platelet count

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL4523425 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

20 total (human), top 20 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneaffects methylation, increases methylation2
alpha phellandrenedecreases expression1
bisphenol Adecreases methylation1
trichostatin Aaffects expression1
benzo(e)pyrenedecreases methylation1
aflatoxin B2decreases methylation1
K 7174increases expression1
Temozolomideincreases expression1
Coumestrolincreases expression1
Methapyrilenedecreases methylation1
Methyl Methanesulfonateincreases expression1
Seleniumincreases expression1
Smokedecreases expression1
Thiramdecreases expression1
Tunicamycinincreases expression1
Valproic Acidaffects expression1
Cyclosporineincreases expression1
Copper Sulfatedecreases expression1
Lactic Aciddecreases expression1
Acrylamidedecreases expression1

ChEMBL screening assays

1 unique, capped per target: 1 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL4377348BindingBinding affinity to SGF29 (unknown origin) assessed as induction of thermal shifts at 20 uM measured for 25 mins by SYPRO orange dye thermal shift assayDiscovery of a Potent and Selective Fragment-like Inhibitor of Methyllysine Reader Protein Spindlin 1 (SPIN1). — J Med Chem

Cellosaurus cell lines

1 cell lines: 1 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_SH18HAP1 CCDC101 (-)Cancer cell lineMale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.