SOFU1

gene
On this page

Also known as CTM-1SOF1

Summary

SOFU1 (sperm-oocyte fusion factor 1, HGNC:21750) is a protein-coding gene on chromosome 7q34, encoding Sperm-egg fusion protein LLCFC1 (Q96L11). Sperm protein required for fusion of sperm with the egg membrane during fertilization.

Predicted to be involved in fusion of sperm to egg plasma membrane involved in single fertilization. Predicted to be located in extracellular region.

Source: NCBI Gene 135927 — RefSeq curated summary.

At a glance

  • GWAS associations: 1
  • Clinical variants (ClinVar): 1 total
  • MANE Select transcript: NM_001382496

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:21750
Approved symbolSOFU1
Namesperm-oocyte fusion factor 1
Location7q34
Locus typegene with protein product
StatusApproved
AliasesCTM-1, SOF1
Ensembl geneENSG00000165131
Ensembl biotypeprotein_coding
OMIM618946
Entrez135927

Gene structure

Transcript identifiers

Ensembl transcripts: 2 — 2 protein_coding

ENST00000409607, ENST00000458732

RefSeq mRNA: 1 — MANE Select: NM_001382496 NM_001382496

CCDS: CCDS5876

Canonical transcript exons

ENST00000409607 — 2 exons

ExonStartEnd
ENSE00001584889142939484142939754
ENSE00001589670142940352142940868

Expression profiles

Bgee: expression breadth ubiquitous, 107 present calls, max score 97.26.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0181 / max 14.8858, expressed in 3 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
817440.01493
817430.00322

Top tissues by expression

119 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047397.26gold quality
left testisUBERON:000453394.60gold quality
right testisUBERON:000453494.17gold quality
testisUBERON:000047393.83gold quality
C1 segment of cervical spinal cordUBERON:000646970.71gold quality
bloodUBERON:000017870.63gold quality
corpus callosumUBERON:000233666.69gold quality
substantia nigraUBERON:000203858.36gold quality
spleenUBERON:000210658.25gold quality
bone marrowUBERON:000237157.37gold quality
primary visual cortexUBERON:000243655.95gold quality
Ammon’s hornUBERON:000195453.38gold quality
endocervixUBERON:000045853.14gold quality
tonsilUBERON:000237253.00gold quality
lymph nodeUBERON:000002951.41gold quality
putamenUBERON:000187451.29gold quality
monocyteCL:000057650.92silver quality
leukocyteCL:000073849.59silver quality
ganglionic eminenceUBERON:000402349.09gold quality
skeletal muscle tissueUBERON:000113448.64gold quality
hypothalamusUBERON:000189848.59gold quality
amygdalaUBERON:000187648.48gold quality
right coronary arteryUBERON:000162548.21gold quality
temporal lobeUBERON:000187148.21gold quality
apex of heartUBERON:000209848.04gold quality
muscle tissueUBERON:000238548.02gold quality
bone marrow cellCL:000209247.80gold quality
caudate nucleusUBERON:000187347.03gold quality
Brodmann (1909) area 9UBERON:001354046.43gold quality
descending thoracic aortaUBERON:000234546.28gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no0.18

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

30 targeting SOFU1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-10401-5P99.9965.79948
HSA-MIR-430699.7270.503630
HSA-MIR-7156-5P99.6468.811369
HSA-MIR-182799.6368.573265
HSA-MIR-129099.5969.902079
HSA-MIR-427399.4567.931206
HSA-MIR-185-5P99.3568.602497
HSA-MIR-6731-5P99.2867.422375
HSA-MIR-808599.2867.562362
HSA-MIR-6739-3P99.2268.841843
HSA-MIR-429299.1665.571767
HSA-MIR-6791-5P99.1665.921844
HSA-MIR-607199.1667.771780
HSA-MIR-425499.1165.151315
HSA-MIR-4738-3P98.9867.981846
HSA-MIR-570198.9769.541502
HSA-MIR-6770-5P98.9766.761853
HSA-MIR-887-5P98.8265.901347
HSA-MIR-6728-3P98.6367.631534
HSA-MIR-6878-5P98.4967.912142
HSA-MIR-1233-5P98.1966.711201
HSA-MIR-6778-5P98.1966.591239
HSA-MIR-93-3P98.1566.651309
HSA-MIR-3664-3P97.8567.621452
HSA-MIR-30C-1-3P97.8066.361499
HSA-MIR-30C-2-3P97.8066.451499
HSA-MIR-6788-5P97.8066.411532
HSA-MIR-6747-3P97.7364.841596
HSA-MIR-66597.6065.641781
HSA-MIR-129196.2865.891224

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusLlcfc1ENSMUSG00000029867
rattus_norvegicusLlcfc1ENSRNOG00000025618

Protein

Protein identifiers

Sperm-egg fusion protein LLCFC1Q96L11 (reviewed: Q96L11)

Alternative names: LLLL and CFNLAS motif-containing protein 1, MSSP-binding protein CTM-1, Sperm-oocyte fusion required protein 1

All UniProt accessions (2): A0A2Y9D021, Q96L11

UniProt curated annotations — full annotation on UniProt →

Function. Sperm protein required for fusion of sperm with the egg membrane during fertilization.

Subcellular location. Secreted.

Isoforms (3)

UniProt IDNamesCanonical?
Q96L11-11, CTM-1betayes
Q96L11-22, CTM-1alpha
Q96L11-33

RefSeq proteins (1): NP_001369425* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR031684LLCFC1Family

Pfam: PF15838

UniProt features (7 total): splice variant 3, signal peptide 1, chain 1, region of interest 1, compositionally biased region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q96L11-F160.930.00

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 41 (showing top): GOBP_SINGLE_FERTILIZATION, GOBP_MEMBRANE_FUSION, GOBP_PLASMA_MEMBRANE_ORGANIZATION, GOBP_CELLULAR_PROCESS_INVOLVED_IN_REPRODUCTION_IN_MULTICELLULAR_ORGANISM, TGACATY_UNKNOWN, GOBP_ENDOMEMBRANE_SYSTEM_ORGANIZATION, VDR_Q3, GOBP_PLASMA_MEMBRANE_FUSION, GOBP_MEMBRANE_ORGANIZATION, GOBP_FERTILIZATION, TGGAAA_NFAT_Q4_01, BRUINS_UVC_RESPONSE_EARLY_LATE, ZWANG_TRANSIENTLY_UP_BY_2ND_EGF_PULSE_ONLY, MIR1290, MIR8085

GO Biological Process (2): fusion of sperm to egg plasma membrane involved in single fertilization (GO:0007342), single fertilization (GO:0007338)

GO Molecular Function (0):

GO Cellular Component (1): extracellular region (GO:0005576)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
single fertilization1
cellular process involved in reproduction in multicellular organism1
fertilization1
cellular anatomical structure1

Protein interactions and networks

STRING

136 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SOFU1TMEM95Q3KNT9631
SOFU1SPACA6W5XKT8541
SOFU1SETBP1Q9Y6X0512
SOFU1IZUMO1Q8IYV9465
SOFU1NBEAL1Q6ZS30401
SOFU1SAXO6Q8TC05350
SOFU1HEMK1Q9Y5R4348
SOFU1EPB41L1Q9H4G0324
SOFU1MTURNQ8N3F0308
SOFU1NTSP30990292
SOFU1EEF1AKMT4P0DPD7290
SOFU1TYW2Q53H54288
SOFU1CYRENQ9BWK5288
SOFU1EMC10Q5UCC4256
SOFU1FLOT2Q14254256

IntAct

4 interactions, top by confidence:

ABTypeScore
CFTRLLCFC1psi-mi:“MI:0915”(physical association)0.370
ZBTB48LLCFC1psi-mi:“MI:0915”(physical association)0.370
LLCFC1POTEFpsi-mi:“MI:0914”(association)0.350

BioGRID (119): C7orf34 (Two-hybrid), MAN2A1 (Affinity Capture-MS), MAN2B1 (Affinity Capture-MS), MAN2A2 (Affinity Capture-MS), WNT5A (Affinity Capture-MS), DHFRL1 (Affinity Capture-MS), TMEM131 (Affinity Capture-MS), TOR1B (Affinity Capture-MS), LAMA3 (Affinity Capture-MS), PXDN (Affinity Capture-MS), CNTNAP1 (Affinity Capture-MS), MELK (Affinity Capture-MS), SEMA3C (Affinity Capture-MS), CACNA2D1 (Affinity Capture-MS), EOGT (Affinity Capture-MS)

ESM2 similar proteins: A0A0B4J1N3, A0A1B0GTK4, A0A1B0GTR0, A0JNL8, A2RUT3, A4IFR0, C9JUS6, D3ZKM3, E9PXB6, F2Z3F1, O70899, O71302, O93195, O95411, P03165, P04610, P0C7M3, P12912, P13206, P20976, P20977, P29560, P47939, P47940, P69714, Q02919, Q08648, Q1RN00, Q1WG82, Q5PR19, Q66669, Q67923, Q69027, Q69604, Q6PDA7, Q6UWK7, Q80IU5, Q80IU8, Q8N5N4, Q913A9

Diamond homologs: Q96L11, Q9D9P8

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

1 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance1
Likely benign0
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

228 predictions. Top by Δscore:

VariantEffectΔscore
7:142940344:T:TAacceptor_gain0.9900
7:142940350:A:AGacceptor_gain0.9900
7:142940351:G:GGacceptor_gain0.9900
7:142940351:GACC:Gacceptor_gain0.9900
7:142940351:GACCA:Gacceptor_gain0.9900
7:142940349:CAGAC:Cacceptor_loss0.9800
7:142940350:A:ACacceptor_loss0.9800
7:142940351:G:GTacceptor_loss0.9800
7:142940351:GA:Gacceptor_gain0.9700
7:142940351:GAC:Gacceptor_gain0.9700
7:142939527:GGT:Gdonor_gain0.9600
7:142939750:TGCAG:Tdonor_loss0.9600
7:142939752:CAG:Cdonor_loss0.9600
7:142939754:GG:Gdonor_loss0.9600
7:142939759:G:Cdonor_loss0.9100
7:142939556:G:GTdonor_gain0.9000
7:142939757:A:Cdonor_loss0.8900
7:142939758:GG:Gdonor_loss0.8400
7:142939528:GT:Gdonor_gain0.8300
7:142939529:TT:Tdonor_gain0.8300
7:142939529:T:Gdonor_gain0.8000
7:142939583:G:GTdonor_gain0.7800
7:142940307:AGGC:Aacceptor_gain0.7800
7:142939689:TG:Tdonor_gain0.6900
7:142939755:G:GGdonor_gain0.6800
7:142940320:GGCTC:Gacceptor_gain0.6600
7:142940349:C:Gacceptor_gain0.6500
7:142940350:A:Gacceptor_gain0.6500
7:142940303:G:Tacceptor_gain0.6300
7:142940305:TCAGG:Tacceptor_gain0.6300

AlphaMissense

968 scored. Top likely-pathogenic:

dbSNP variants (sampled 300 via entrez): RS1000200856 (7:142938033 C>T), RS1000484211 (7:142939119 T>C), RS1002936225 (7:142940894 T>A), RS1004598054 (7:142940007 T>G), RS1004665132 (7:142938763 C>A,G,T), RS1005300745 (7:142939132 A>G), RS1006362457 (7:142940319 G>C), RS1007454734 (7:142940738 G>A,C), RS1009748176 (7:142938089 G>C,T), RS1009772252 (7:142938452 C>T), RS1010453097 (7:142941114 T>C), RS1012195980 (7:142940209 G>T), RS1013812845 (7:142938418 C>G), RS1014681130 (7:142940342 C>A,T), RS1015041158 (7:142940518 G>A,C,T)

Disease associations

OMIM: gene MIM:618946 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 associations (top):

StudyTraitp-value
GCST004860_10Alcoholic chronic pancreatitis3.000000e-06

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

6 total (human), top 6 by PubMed support.

ChemicalActions (top 5)PubMed papers
S-(1,2-dichlorovinyl)cysteineincreases expression1
2-palmitoylglycerolincreases expression1
Resveratrolaffects cotreatment, decreases expression1
Benzo(a)pyreneincreases methylation1
Folic Aciddecreases expression1
Plant Extractsdecreases expression, affects cotreatment1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): alcoholic pancreatitis