OCEL1

gene
On this page

Also known as FLJ22709

Summary

OCEL1 (occludin/ELL domain containing 1, HGNC:26221) is a protein-coding gene on chromosome 19p13.11, encoding Occludin/ELL domain-containing protein 1 (Q9H607).

Predicted to be involved in bicellular tight junction assembly. Predicted to be located in membrane. Predicted to be active in apical plasma membrane; bicellular tight junction; and cytoplasmic vesicle.

Source: NCBI Gene 79629 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): Aicardi syndrome (Limited, GenCC)
  • GWAS associations: 1
  • Clinical variants (ClinVar): 78 total
  • MANE Select transcript: NM_024578

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26221
Approved symbolOCEL1
Nameoccludin/ELL domain containing 1
Location19p13.11
Locus typegene with protein product
StatusApproved
AliasesFLJ22709
Ensembl geneENSG00000099330
Ensembl biotypeprotein_coding
Entrez79629

Gene structure

Transcript identifiers

Ensembl transcripts: 18 — 10 protein_coding, 4 retained_intron, 2 nonsense_mediated_decay, 2 protein_coding_CDS_not_defined

ENST00000215061, ENST00000594283, ENST00000595573, ENST00000595769, ENST00000596279, ENST00000597836, ENST00000598068, ENST00000598172, ENST00000599286, ENST00000599588, ENST00000600232, ENST00000600826, ENST00000601529, ENST00000601576, ENST00000602236, ENST00000928402, ENST00000928403, ENST00000928404

RefSeq mRNA: 1 — MANE Select: NM_024578 NM_024578

CCDS: CCDS12351

Canonical transcript exons

ENST00000215061 — 6 exons

ExonStartEnd
ENSE000030314671722622617226316
ENSE000031309381722880317229219
ENSE000034833311722669317226869
ENSE000035050371722784017228005
ENSE000036465681722825617228309
ENSE000036798961722699417227199

Expression profiles

Bgee: expression breadth ubiquitous, 240 present calls, max score 95.89.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 23.5564 / max 166.1558, expressed in 1810 samples.

FANTOM5 promoters (5 alternative TSS)

Promoter IDTPM avgSamples expressed
17446914.35301799
1744717.14431629
1744681.44861048
1744700.3170144
1744720.2935138

Top tissues by expression

276 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
right uterine tubeUBERON:000130295.89gold quality
right lobe of liverUBERON:000111495.20gold quality
mucosa of transverse colonUBERON:000499193.56gold quality
granulocyteCL:000009491.74gold quality
apex of heartUBERON:000209891.70gold quality
right lobe of thyroid glandUBERON:000111991.55gold quality
hindlimb stylopod muscleUBERON:000425290.67gold quality
metanephros cortexUBERON:001053390.66gold quality
right adrenal glandUBERON:000123390.39gold quality
left lobe of thyroid glandUBERON:000112090.36gold quality
right adrenal gland cortexUBERON:003582790.35gold quality
transverse colonUBERON:000115790.29gold quality
olfactory segment of nasal mucosaUBERON:000538690.28gold quality
body of pancreasUBERON:000115089.92gold quality
left adrenal gland cortexUBERON:003582589.84gold quality
body of stomachUBERON:000116189.72gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099189.63gold quality
left adrenal glandUBERON:000123489.42gold quality
small intestine Peyer’s patchUBERON:000345489.25gold quality
adrenal cortexUBERON:000123589.11gold quality
thyroid glandUBERON:000204689.10gold quality
liverUBERON:000210788.82gold quality
endocervixUBERON:000045888.76gold quality
muscle layer of sigmoid colonUBERON:003580588.31gold quality
small intestineUBERON:000210888.24gold quality
adult mammalian kidneyUBERON:000008287.97gold quality
body of uterusUBERON:000985387.83gold quality
prefrontal cortexUBERON:000045187.77gold quality
stromal cell of endometriumCL:000225587.72gold quality
omental fat padUBERON:001041487.66gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.68

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

22 targeting OCEL1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-76599.8468.242442
HSA-MIR-320A-3P99.7769.732107
HSA-MIR-320B99.7769.732107
HSA-MIR-320C99.7769.732107
HSA-MIR-320D99.7769.732107
HSA-MIR-442999.7769.622111
HSA-MIR-6794-5P99.7666.381048
HSA-MIR-11181-3P99.7566.382205
HSA-MIR-4716-3P99.6966.731022
HSA-MIR-4666A-5P99.4169.721887
HSA-MIR-4685-5P99.2565.991563
HSA-MIR-6837-5P99.2565.471632
HSA-MIR-312599.1468.492269
HSA-MIR-425499.1165.151315
HSA-MIR-391698.9968.042155
HSA-MIR-6859-5P98.9968.072049
HSA-MIR-653-3P98.3167.711542
HSA-MIR-7113-5P97.8867.331735
HSA-MIR-66597.6065.641781
HSA-MIR-55897.5067.16977
HSA-MIR-128997.4665.37655
HSA-MIR-324-5P95.6865.20560

Literature-anchored findings (GeneRIF, showing 1)

  • Low OCEL1 expression is associated with poor prognosis in human non-small cell lung cancer. (PMID:32083572)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_reriosi:ch211-13k12.2ENSDARG00000038121
danio_reriozgc:154006ENSDARG00000061797
danio_reriomarveld2lENSDARG00000061800
mus_musculusOcel1ENSMUSG00000002396
rattus_norvegicusOcel1ENSRNOG00000032575
drosophila_melanogasterSu(Tpl)FBGN0014037
caenorhabditis_elegansWBGENE00021281

Paralogs (5): ELL (ENSG00000105656), ELL2 (ENSG00000118985), ELL3 (ENSG00000128886), MARVELD2 (ENSG00000152939), OCLN (ENSG00000197822)

Protein

Protein identifiers

Occludin/ELL domain-containing protein 1Q9H607 (reviewed: Q9H607)

All UniProt accessions (9): Q9H607, M0QXM2, M0QY63, M0QZ32, M0QZ36, M0QZZ7, M0R0F7, M0R0L3, M0R1A4

UniProt curated annotations — full annotation on UniProt →

Similarity. Belongs to the ELL/occludin family.

RefSeq proteins (1): NP_078854* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR010844Occludin_ELLDomain
IPR031176ELL/occludinFamily

Pfam: PF07303

UniProt features (7 total): compositionally biased region 2, sequence variant 2, chain 1, domain 1, region of interest 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9H607-F174.590.42

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 121 (showing top): GOBP_APICAL_JUNCTION_ASSEMBLY, GRAESSMANN_APOPTOSIS_BY_DOXORUBICIN_DN, GRAESSMANN_RESPONSE_TO_MC_AND_DOXORUBICIN_DN, GOBP_CELL_JUNCTION_ORGANIZATION, RICKMAN_TUMOR_DIFFERENTIATED_WELL_VS_POORLY_DN, BYSTRYKH_HEMATOPOIESIS_STEM_CELL_AND_BRAIN_QTL_CIS, COATES_MACROPHAGE_M1_VS_M2_UP, NIKOLSKY_BREAST_CANCER_19P13_AMPLICON, GOCC_APICAL_PLASMA_MEMBRANE, GOBP_CELL_JUNCTION_ASSEMBLY, GOBP_CELL_CELL_JUNCTION_ASSEMBLY, GOCC_CELL_CELL_JUNCTION, RICKMAN_TUMOR_DIFFERENTIATED_MODERATELY_VS_POORLY_UP, ACEVEDO_METHYLATED_IN_LIVER_CANCER_DN, SCHLOSSER_SERUM_RESPONSE_UP

GO Biological Process (1): bicellular tight junction assembly (GO:0070830)

GO Molecular Function (0):

GO Cellular Component (4): bicellular tight junction (GO:0005923), apical plasma membrane (GO:0016324), cytoplasmic vesicle (GO:0031410), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
apical junction assembly1
tight junction assembly1
apical junction complex1
tight junction1
apical part of cell1
plasma membrane region1
cytoplasm1
intracellular vesicle1
cellular anatomical structure1

Protein interactions and networks

STRING

372 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
OCEL1ABHD8Q96I13455
OCEL1TMEM205Q6UW68446
OCEL1TLCD1Q96CP7431
OCEL1CLDND1Q9NY35429
OCEL1DYDC2Q96IM9415
OCEL1A0A2U3TZT1A0A2U3TZT1401
OCEL1RPP25Q9BUL9382
OCEL1LYPD8Q6UX82379
OCEL1RAB13P51153377
OCEL1BBLNQ9BUW7377
OCEL1RAB31Q13636374
OCEL1FAM217BQ9NTX9373
OCEL1DHRS12A0PJE2370
OCEL1YIPF3Q9GZM5370
OCEL1TMT1BQ6UX53367

IntAct

4 interactions, top by confidence:

ABTypeScore
OCEL1psi-mi:“MI:0407”(direct interaction)0.440
SRPK2OCEL1psi-mi:“MI:0217”(phosphorylation reaction)0.440
OCEL1SRPK1psi-mi:“MI:0217”(phosphorylation reaction)0.440

BioGRID (5): OCEL1 (Reconstituted Complex), OCEL1 (Affinity Capture-RNA), APP (Reconstituted Complex), OCEL1 (Biochemical Activity), OCEL1 (Biochemical Activity)

ESM2 similar proteins: A0A1B0GW64, A0A5F4BST2, A0PJX4, A8MVS5, A8MWV9, B0FP48, E5RIL1, E9PGG2, O14836, O60320, O95998, P09564, Q01113, Q01114, Q13477, Q2KI80, Q2T9R2, Q3TS39, Q3UPR0, Q3URD2, Q4V9L6, Q5FVJ4, Q5M869, Q6A044, Q6UWJ8, Q75VT8, Q864V4, Q8BRJ3, Q8BX43, Q8C503, Q8IVY1, Q8K5A9, Q8N112, Q8NC24, Q8NDY8, Q8QZT4, Q8R138, Q969Z4, Q9BUF7, Q9CQM1

Diamond homologs: F1MGG3, O00472, O08856, P55199, Q0IHQ3, Q28793, Q3UKU1, Q3UZP0, Q5XFX8, Q6P6T5, Q80VR2, Q8N4S9, Q91049, Q9H607, Q9HB65, Q9PUN1, Q9VW51, Q8VCR9

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

78 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance48
Likely benign8
Benign7

Top pathogenic / likely-pathogenic (0)

SpliceAI

1120 predictions. Top by Δscore:

VariantEffectΔscore
19:17227159:GGGCA:Gdonor_gain1.0000
19:17226264:G:GTdonor_gain0.9900
19:17227099:GA:Gdonor_gain0.9900
19:17227160:GGCA:Gdonor_gain0.9900
19:17227161:GCA:Gdonor_gain0.9900
19:17227195:GAGCT:Gdonor_gain0.9900
19:17227197:GCT:Gdonor_gain0.9900
19:17227200:G:GGdonor_gain0.9900
19:17228254:A:AGacceptor_gain0.9900
19:17228255:G:GGacceptor_gain0.9900
19:17228255:GAA:Gacceptor_gain0.9900
19:17227839:GTAA:Gacceptor_gain0.9800
19:17228251:TGCA:Tacceptor_loss0.9800
19:17228252:GCAGA:Gacceptor_loss0.9800
19:17228253:CA:Cacceptor_loss0.9800
19:17228254:AG:Aacceptor_loss0.9800
19:17228255:G:GCacceptor_loss0.9800
19:17227834:T:Aacceptor_gain0.9700
19:17227838:A:AGacceptor_gain0.9700
19:17227839:G:GGacceptor_gain0.9700
19:17228254:AGAAG:Aacceptor_gain0.9700
19:17228255:GA:Gacceptor_gain0.9700
19:17228255:GAAGG:Gacceptor_gain0.9700
19:17228307:ATGGT:Adonor_loss0.9700
19:17228310:G:Adonor_loss0.9700
19:17228311:TGA:Tdonor_loss0.9700
19:17228312:G:GTdonor_loss0.9700
19:17228313:A:ACdonor_loss0.9700
19:17228922:C:CGdonor_gain0.9700
19:17227100:A:Gdonor_gain0.9600

AlphaMissense

1702 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
19:17228853:A:CK241N0.991
19:17228853:A:TK241N0.991
19:17227093:T:CF116L0.990
19:17227095:T:AF116L0.990
19:17227095:T:GF116L0.990
19:17227892:T:CF169L0.990
19:17227894:C:AF169L0.990
19:17227894:C:GF169L0.990
19:17228884:T:CF252L0.987
19:17228886:C:AF252L0.987
19:17228886:C:GF252L0.987
19:17227893:T:CF169S0.986
19:17228920:T:CF264L0.985
19:17228922:C:AF264L0.985
19:17228922:C:GF264L0.985
19:17228868:G:CK246N0.981
19:17228868:G:TK246N0.981
19:17227845:A:CY153S0.978
19:17228864:T:AL245H0.978
19:17228876:T:CI249T0.978
19:17228855:T:CL242P0.976
19:17228864:T:CL245P0.976
19:17227893:T:GF169C0.974
19:17228852:A:TK241I0.974
19:17227845:A:GY153C0.972
19:17227844:T:CY153H0.970
19:17227844:T:GY153D0.968
19:17228843:T:CL238P0.968
19:17228860:C:GH244D0.968
19:17228851:A:GK241E0.966

dbSNP variants (sampled 300 via entrez): RS1000020814 (19:17224937 G>A,C), RS1000584507 (19:17226526 G>A), RS1001372831 (19:17227208 C>T), RS1002827638 (19:17226468 C>A,T), RS1003159250 (19:17225442 T>C), RS1003357758 (19:17227440 C>T), RS1003380442 (19:17225103 G>C), RS1004670177 (19:17225967 G>C), RS1004718339 (19:17229564 T>A,G), RS1004986759 (19:17225465 A>C,G), RS1006657553 (19:17228089 T>A), RS1007942314 (19:17226915 A>T), RS1008027427 (19:17228943 C>T), RS1009164022 (19:17224613 G>A), RS1010894701 (19:17224925 A>G)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

DiseaseClassificationInheritance
Aicardi syndromeLimitedAutosomal dominant

Mondo (1): Aicardi syndrome (MONDO:0010568)

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

1 associations (top):

StudyTraitp-value
GCST003542_158Night sleep phenotypes9.000000e-07

MeSH disease descriptors (1)

DescriptorNameTree numbers
D058540Aicardi SyndromeC10.500.034.687; C11.270.019; C16.131.162; C16.131.666.034.687; C16.320.290.019; C16.320.322.030

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

33 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Aflatoxin B1affects expression, decreases expression3
Air Pollutantsdecreases expression, increases abundance, increases expression2
Benzo(a)pyrenedecreases expression, increases methylation2
Cisplatinincreases expression, affects expression, affects cotreatment2
Smokeincreases abundance, increases expression, decreases expression2
Testosteroneaffects cotreatment, increases expression2
Cyclosporinedecreases expression2
aristolochic acid Idecreases expression1
GSK-J4decreases expression1
afuresertibincreases expression1
triphenyl phosphateaffects expression1
pirinixic acidaffects binding, increases activity, increases expression1
arseniteaffects binding, increases reaction1
sodium arsenitedecreases expression1
ICG 001increases expression1
abrinedecreases expression1
jinfukangaffects cotreatment, increases expression1
(+)-JQ1 compoundincreases expression1
4-(4-((5-(4,5-dimethyl-2-nitrophenyl)-2-furanyl)methylene)-4,5-dihydro-3-methyl-5-oxo-1H-pyrazol-1-yl)benzoic acidincreases expression1
Temozolomideincreases expression1
Decitabineaffects expression1
Leflunomidedecreases expression1
Calcitriolincreases expression, affects cotreatment1
Hydrogen Peroxideaffects expression1
Silicon Dioxidedecreases expression1
Thiramdecreases expression1
Valproic Acidaffects expression1
Copper Sulfatedecreases expression1
Lactic Aciddecreases expression1
Acrylamidedecreases expression1

Clinical trials (associated diseases)

2 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00305305Not specifiedUNKNOWNBrain Development Research Program
NCT00697411Not specifiedRECRUITINGStudy of Selected X-Linked Disorders: Aicardi Syndrome
  • Associated diseases: Aicardi syndrome
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): Aicardi syndrome