CCDC38

gene
On this page

Also known as FLJ40089

Summary

CCDC38 (coiled-coil domain containing 38, HGNC:26843) is a protein-coding gene on chromosome 12q23.1, encoding Coiled-coil domain-containing protein 38 (Q502W7). Essential for male fertility.

Predicted to enable ubiquitin-modified histone reader activity. Predicted to be involved in acrosome assembly and sperm flagellum assembly. Located in centrosome.

Source: NCBI Gene 120935 — RefSeq curated summary.

At a glance

  • GWAS associations: 8
  • Clinical variants (ClinVar): 105 total
  • MANE Select transcript: NM_182496

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26843
Approved symbolCCDC38
Namecoiled-coil domain containing 38
Location12q23.1
Locus typegene with protein product
StatusApproved
AliasesFLJ40089
Ensembl geneENSG00000165972
Ensembl biotypeprotein_coding
Entrez120935

Gene structure

Transcript identifiers

Ensembl transcripts: 8 — 3 protein_coding, 2 nonsense_mediated_decay, 2 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000344280, ENST00000546386, ENST00000546803, ENST00000546816, ENST00000546947, ENST00000548771, ENST00000549752, ENST00000549876

RefSeq mRNA: 1 — MANE Select: NM_182496 NM_182496

CCDS: CCDS9056

Canonical transcript exons

ENST00000344280 — 16 exons

ExonStartEnd
ENSE000010985349587225595872460
ENSE000010985439588148595881554
ENSE000010985499587964495879795
ENSE000010985589587821195878346
ENSE000011926709589083295890930
ENSE000011926809589838595898465
ENSE000011926859589856895898731
ENSE000011926899590638795906451
ENSE000011926919591712995917294
ENSE000011926969586704895867189
ENSE000012804769588845895888506
ENSE000013101889589498995895146
ENSE000024038119594243195942614
ENSE000034626849586948095869573
ENSE000034974059591887695918976
ENSE000035732029593647395936523

Expression profiles

Bgee: expression breadth ubiquitous, 157 present calls, max score 89.36.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0811 / max 49.6193, expressed in 11 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
1327560.068810
1327550.01233

Top tissues by expression

241 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
left testisUBERON:000453389.36gold quality
right testisUBERON:000453488.66gold quality
testisUBERON:000047386.45gold quality
spermCL:000001984.88gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099180.79gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047377.56gold quality
right lobe of liverUBERON:000111475.13gold quality
adrenal tissueUBERON:001830374.59gold quality
liverUBERON:000210764.79gold quality
epithelium of nasopharynxUBERON:000195164.64silver quality
sural nerveUBERON:001548862.81silver quality
calcaneal tendonUBERON:000370162.19gold quality
mucosa of paranasal sinusUBERON:000503062.19gold quality
adult organismUBERON:000702361.93gold quality
germinal epithelium of ovaryUBERON:000130459.36gold quality
spleenUBERON:000210658.12gold quality
lower lobe of lungUBERON:000894956.21silver quality
gingival epitheliumUBERON:000194955.29gold quality
cerebellar hemisphereUBERON:000224555.02gold quality
cerebellar cortexUBERON:000212954.98gold quality
islet of LangerhansUBERON:000000654.71gold quality
adrenal glandUBERON:000236954.71gold quality
tendonUBERON:000004354.31gold quality
left adrenal gland cortexUBERON:003582554.30gold quality
right lungUBERON:000216754.25gold quality
cerebellumUBERON:000203754.00gold quality
left adrenal glandUBERON:000123453.96gold quality
bone marrowUBERON:000237153.87gold quality
right adrenal gland cortexUBERON:003582753.43gold quality
smooth muscle tissueUBERON:000113553.42gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.96

Regulation

Is transcription factor: no

Literature-anchored findings (GeneRIF, showing 1)

  • strongest signal of association with smoking resistance for a non-synonymous SNP in CCDC38, a gene which has previously been reported to show association with FEV1/FVC, and we demonstrate moderate expression of CCDC38 in bronchial epithelial cells (PMID:24786987)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusCcdc38ENSMUSG00000036168
rattus_norvegicusCcdc38ENSRNOG00000021892

Paralogs (3): CCDC42 (ENSG00000161973), CFAP100 (ENSG00000163885), CFAP73 (ENSG00000186710)

Protein

Protein identifiers

Coiled-coil domain-containing protein 38Q502W7 (reviewed: Q502W7)

All UniProt accessions (5): Q502W7, F8VRK3, F8VTU7, F8VY05, H0YHI1

UniProt curated annotations — full annotation on UniProt →

Function. Essential for male fertility. Required for sperm flagellum biogenesis. Also required for acrosome biogenesis. Required for the attachment of developing acrosomes to the nucleus during spermiogenesis and may be involved in the transport of fibrous sheath components.

Subunit / interactions. Interacts with CCDC42, CFAP53, IFT88 and ODF2. Interacts with CCDC146. Interacts with TEKT3. Interacts with ubiquitinated histone H2A.

Subcellular location. Cytoplasm. Cytoskeleton. Microtubule organizing center. Centrosome. Perinuclear region. Cell projection. Cilium. Flagellum.

RefSeq proteins (1): NP_872302* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR025252DUF4200Domain
IPR051147CFAP_domain-containingFamily

Pfam: PF13863

UniProt features (13 total): region of interest 3, coiled-coil region 3, compositionally biased region 3, sequence variant 2, chain 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q502W7-F173.840.35

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 86 (showing top): GOBP_VESICLE_ORGANIZATION, DARWICHE_SKIN_TUMOR_PROMOTER_UP, DARWICHE_PAPILLOMA_RISK_LOW_UP, DARWICHE_PAPILLOMA_RISK_HIGH_UP, DARWICHE_SQUAMOUS_CELL_CARCINOMA_UP, TGACCTY_ERR1_Q2, GOBP_MALE_GAMETE_GENERATION, GOCC_MICROTUBULE_ORGANIZING_CENTER, GOBP_SECRETORY_GRANULE_ORGANIZATION, GOBP_CELLULAR_COMPONENT_ASSEMBLY_INVOLVED_IN_MORPHOGENESIS, GOBP_CILIUM_ORGANIZATION, GOBP_CILIUM_MOVEMENT, GOCC_CENTROSOME, GOBP_CILIUM_OR_FLAGELLUM_DEPENDENT_CELL_MOTILITY, GOBP_ACROSOME_ASSEMBLY

GO Biological Process (6): acrosome assembly (GO:0001675), intraciliary anterograde transport (GO:0035720), sperm flagellum assembly (GO:0120316), chromatin organization (GO:0006325), spermatogenesis (GO:0007283), cell differentiation (GO:0030154)

GO Molecular Function (1): ubiquitin-modified histone reader activity (GO:0061649)

GO Cellular Component (10): manchette (GO:0002177), centrosome (GO:0005813), sperm flagellum (GO:0036126), perinuclear region of cytoplasm (GO:0048471), sperm head (GO:0061827), cytoplasm (GO:0005737), cytoskeleton (GO:0005856), cilium (GO:0005929), motile cilium (GO:0031514), cell projection (GO:0042995)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure5
developmental process involved in reproduction3
spermatid development2
cellular component assembly involved in morphogenesis1
cellular process involved in reproduction in multicellular organism1
secretory granule organization1
organelle assembly1
intraciliary transport1
flagellated sperm motility1
motile cilium assembly1
cellular component organization1
male gamete generation1
cellular developmental process1
ubiquitin-modified protein reader activity1
histone reader activity1
microtubule cytoskeleton1
centriole1
microtubule organizing center1
9+2 motile cilium1
cytoplasm1
intracellular anatomical structure1
intracellular membraneless organelle1
intraciliary transport particle1
membrane-bounded organelle1
plasma membrane bounded cell projection1
cilium1

Protein interactions and networks

STRING

380 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CCDC38CCDC42Q96M95791
CCDC38CFAP73A6NFT4632
CCDC38SPATA9Q9BWV2624
CCDC38ARMC2Q8NEN0605
CCDC38CFAP184Q2M329591
CCDC38SPMIP4Q8N865584
CCDC38CFAP96A7E2U8580
CCDC38CCDC116Q8IYX3541
CCDC38LRMDAQ9H2I8512
CCDC38CCDC146Q8IYE0509
CCDC38SNRPFP62306498
CCDC38CFDP1Q9UEE9485
CCDC38ZKSCAN3Q9BRR0480
CCDC38CFAP263Q9H0I3478
CCDC38FAM13AO94988445

IntAct

3 interactions, top by confidence:

ABTypeScore
CCDC38PPIBpsi-mi:“MI:0915”(physical association)0.400
CCDC38GAPDHpsi-mi:“MI:0915”(physical association)0.400

BioGRID (4): CCDC38 (Affinity Capture-MS), CCDC38 (Proximity Label-MS), CCDC38 (Proximity Label-MS), EEA1 (Cross-Linking-MS (XL-MS))

ESM2 similar proteins: A0A1B0GVH7, A0A4X1TZW7, A0A5F8MPE6, E1B836, E1C760, E7EXT2, F7AEX0, O74370, O95391, P21374, P51950, Q20716, Q24276, Q24740, Q28E45, Q2F5J3, Q2KI00, Q3B820, Q3KQD1, Q3TGF2, Q3ZBE5, Q45GW3, Q4R4P2, Q502W7, Q569B9, Q5EAW4, Q5RHY1, Q5U4F3, Q5XI37, Q5XIN9, Q5ZIG2, Q69ZQ2, Q6SP97, Q7L590, Q80ZG5, Q8BHJ9, Q8BRC6, Q8CDN8, Q8NCR3, Q8TC29

Diamond homologs: Q494V2, Q4R8V8, Q502W7, Q80VN0, Q8CDN8, Q95JL6, A8I4E9

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

105 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance84
Likely benign10
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

2703 predictions. Top by Δscore:

VariantEffectΔscore
12:95867185:CCCAA:Cacceptor_gain1.0000
12:95867186:CCAA:Cacceptor_gain1.0000
12:95867186:CCAAC:Cacceptor_gain1.0000
12:95867187:CAA:Cacceptor_gain1.0000
12:95867187:CAAC:Cacceptor_gain1.0000
12:95867188:AA:Aacceptor_gain1.0000
12:95867188:AACTG:Aacceptor_loss1.0000
12:95867189:ACTGA:Aacceptor_loss1.0000
12:95867190:C:CCacceptor_gain1.0000
12:95867190:CTGAA:Cacceptor_loss1.0000
12:95869475:CAAA:Cdonor_loss1.0000
12:95869476:AAACC:Adonor_loss1.0000
12:95869477:AACC:Adonor_loss1.0000
12:95869478:A:Tdonor_loss1.0000
12:95869571:AACC:Aacceptor_loss1.0000
12:95869574:C:CCacceptor_gain1.0000
12:95869574:CTGT:Cacceptor_loss1.0000
12:95869575:T:Aacceptor_loss1.0000
12:95869578:C:CTacceptor_gain1.0000
12:95872238:AT:Adonor_gain1.0000
12:95872253:A:ACdonor_gain1.0000
12:95872254:C:CCdonor_gain1.0000
12:95879642:A:ACdonor_gain1.0000
12:95879643:C:CCdonor_gain1.0000
12:95881495:T:TAdonor_gain1.0000
12:95881555:C:CCacceptor_gain1.0000
12:95888552:G:GCacceptor_gain1.0000
12:95898461:CTGCC:Cacceptor_gain1.0000
12:95898464:CC:Cacceptor_gain1.0000
12:95898465:CC:Cacceptor_gain1.0000

AlphaMissense

3748 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
12:95898578:C:GA175P0.997
12:95898609:G:CF164L0.996
12:95898609:G:TF164L0.996
12:95898611:A:GF164L0.996
12:95898421:A:GL193P0.994
12:95898604:C:GR166P0.992
12:95898607:A:GL165P0.992
12:95898618:A:CF161L0.992
12:95898618:A:TF161L0.992
12:95898620:A:GF161L0.992
12:95879779:A:GF336S0.991
12:95895090:A:GL224P0.991
12:95898439:A:GL187P0.991
12:95898413:C:GA196P0.990
12:95898464:C:GA179P0.990
12:95879725:A:GL354P0.989
12:95879740:A:GL349P0.989
12:95879778:G:CF336L0.989
12:95879778:G:TF336L0.989
12:95879780:A:GF336L0.989
12:95895120:A:GL214P0.989
12:95898461:C:GA180P0.989
12:95879761:A:GL342S0.988
12:95898629:C:GA158P0.987
12:95898640:A:GL154P0.984
12:95898653:C:GA150P0.984
12:95879731:T:GQ352P0.981
12:95895065:C:AW232C0.981
12:95895065:C:GW232C0.981
12:95898574:A:GL176P0.981

dbSNP variants (sampled 300 via entrez): RS1000005429 (12:95877887 C>A), RS1000022635 (12:95906749 A>C), RS1000034142 (12:95868257 C>T), RS1000086146 (12:95925480 C>T), RS1000098398 (12:95880733 G>A), RS1000106458 (12:95935308 A>T), RS1000156490 (12:95884828 C>A,T), RS1000161249 (12:95912577 T>A), RS1000181774 (12:95867471 C>T), RS1000182874 (12:95867720 C>G,T), RS1000291517 (12:95919444 A>T), RS1000299777 (12:95906183 G>A,T), RS1000426560 (12:95885156 G>A), RS1000428974 (12:95918486 T>C), RS1000520646 (12:95869126 A>G)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

8 associations (top):

StudyTraitp-value
GCST001248_6Pulmonary function2.000000e-11
GCST001784_7Pulmonary function (smoking interaction)1.000000e-07
GCST002479_12Lupus nephritis in systemic lupus erythematosus2.000000e-06
GCST004185_54Lung function (FEV1/FVC)5.000000e-08
GCST007637_21Diffusing capacity of carbon monoxide2.000000e-06
GCST009240_270Serum metabolite levels (CMS)7.000000e-18
GCST009242_73Serum metabolite levels2.000000e-13
GCST010002_220Refractive error1.000000e-19

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0003892pulmonary function measurement
EFO:0004713FEV/FVC ratio
EFO:0009369diffusing capacity of the lung for carbon monoxide

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

11 total (human), top 11 by PubMed support.

ChemicalActions (top 5)PubMed papers
Aflatoxin B1decreases methylation, increases methylation2
pirinixic acidaffects binding, decreases expression, increases activity1
terbufosincreases methylation1
sodium arseniteincreases expression1
S-(1,2-dichlorovinyl)cysteineincreases expression, affects cotreatment, affects response to substance1
Fonofosincreases methylation1
Hydrogen Peroxideincreases expression1
Lipopolysaccharidesaffects response to substance, increases expression, affects cotreatment1
Parathionincreases methylation1
Tretinoindecreases expression1
Okadaic Aciddecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): lupus nephritis