TM4SF20

gene
On this page

Also known as FLJ22800TCCE518

Summary

TM4SF20 (transmembrane 4 L six family member 20, HGNC:26230) is a protein-coding gene on chromosome 2q36.3, encoding Transmembrane 4 L6 family member 20 (Q53R12). Polytopic transmembrane protein that inhibits regulated intramembrane proteolysis (RIP) of CREB3L1, inhibiting its activation and the induction of collagen synthesis.

The protein encoded by this gene is a member of the four-transmembrane L6 superfamily. Members of this family function in various cellular processes including cell proliferation, motility, and adhesion via their interactions with integrins. In human brain tissue, this gene is expressed at high levels in the parietal lobe, occipital lobe, hippocampus, pons, white matter, corpus callosum, and cerebellum. Knockout of the homologous gene in mouse results in a neurobehavioral phenotype with suggested enhanced motor coordination. A deletion mutation in the human gene is associated with specific language impairment-5.

Source: NCBI Gene 79853 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): specific language impairment 5 (Limited, GenCC)
  • GWAS associations: 2
  • Clinical variants (ClinVar): 102 total — 2 pathogenic
  • Phenotypes (HPO): 7
  • MANE Select transcript: NM_024795

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26230
Approved symbolTM4SF20
Nametransmembrane 4 L six family member 20
Location2q36.3
Locus typegene with protein product
StatusApproved
AliasesFLJ22800, TCCE518
Ensembl geneENSG00000168955
Ensembl biotypeprotein_coding
OMIM615404
Entrez79853

Gene structure

Transcript identifiers

Ensembl transcripts: 2 — 2 protein_coding

ENST00000304568, ENST00000449706

RefSeq mRNA: 1 — MANE Select: NM_024795 NM_024795

CCDS: CCDS2466

Canonical transcript exons

ENST00000304568 — 4 exons

ExonStartEnd
ENSE00001124101227362038227364012
ENSE00001124107227379086227379306
ENSE00001134324227366093227366244
ENSE00001134331227370915227370980

Expression profiles

Bgee: expression breadth broad, 77 present calls, max score 99.03.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 1.5175 / max 691.2163, expressed in 65 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
343451.106015
343460.235431
343470.139933
343480.03627

Top tissues by expression

222 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
jejunal mucosaUBERON:000039999.03gold quality
ileal mucosaUBERON:000033195.73gold quality
ileumUBERON:000211695.63silver quality
duodenumUBERON:000211495.48gold quality
small intestine Peyer’s patchUBERON:000345486.32gold quality
small intestineUBERON:000210886.03gold quality
pancreatic ductal cellCL:000207977.39silver quality
buccal mucosa cellCL:000233676.53gold quality
jejunumUBERON:000211576.32gold quality
rectumUBERON:000105272.81gold quality
mucosa of transverse colonUBERON:000499170.15gold quality
colonic mucosaUBERON:000031765.28gold quality
endothelial cellCL:000011562.36gold quality
intestineUBERON:000016061.62gold quality
amniotic fluidUBERON:000017361.15silver quality
mucosa of sigmoid colonUBERON:000499361.10silver quality
vena cavaUBERON:000408759.59gold quality
oocyteCL:000002359.45gold quality
left testisUBERON:000453358.42gold quality
tendon of biceps brachiiUBERON:000818857.20silver quality
transverse colonUBERON:000115757.09gold quality
testisUBERON:000047356.93gold quality
cerebellar vermisUBERON:000472056.42gold quality
right testisUBERON:000453456.07gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099155.30silver quality
epithelial cell of pancreasCL:000008354.21silver quality
smooth muscle tissueUBERON:000113554.21gold quality
cranial nerve IIUBERON:000094154.03silver quality
islet of LangerhansUBERON:000000653.41gold quality
large intestineUBERON:000005952.98gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-GEOD-125970yes69.37
E-ANND-3yes7.31

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

79 targeting TM4SF20, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-3924100.0072.092394
HSA-MIR-3646100.0073.565283
HSA-MIR-371B-5P99.9975.344759
HSA-MIR-373-5P99.9875.364753
HSA-MIR-616-5P99.9875.584775
HSA-MIR-548AA99.9670.643753
HSA-MIR-548AP-3P99.9670.643753
HSA-MIR-548T-3P99.9670.643753
HSA-MIR-590-3P99.9674.346478
HSA-MIR-4666A-3P99.9671.713434
HSA-MIR-570-3P99.9672.414910
HSA-MIR-96-5P99.9572.802140
HSA-MIR-128-3P99.9571.172484
HSA-MIR-216A-3P99.9571.192505
HSA-MIR-1213399.9271.822006
HSA-MIR-1271-5P99.9171.991972
HSA-MIR-806399.9169.763146
HSA-MIR-3140-3P99.8868.472069
HSA-MIR-3681-3P99.8870.462254
HSA-MIR-477999.8666.501583
HSA-MIR-469899.8471.414303
HSA-MIR-132399.8369.892471
HSA-MIR-4760-5P99.8069.881619
HSA-MIR-548O-3P99.7469.302228

Literature-anchored findings (GeneRIF, showing 4)

  • TM4SF20 ancestral deletion predisposes carriers to a pediatric disorder of early language delay and cerebral white matter hyperintensities. (PMID:23810381)
  • Ceramide inverts the membrane orientation of TMS4SF20, creating a form of TM4SF20 that stimulates the cleavage of CREB3L1. (PMID:27499293)
  • TM4SF20 is the first protein shown to undergo Regulated Alternative Translocation in response to ceramide. In the absence of ceramide, the N terminus of the first transmembrane helix of TM4SF20 is inserted into the ER lumen. This form blocks activation of CREB3L1, a membrane-bound transcription factor. In the presence of ceramide, the membrane topology of TM4SF20 is inverted. The inverted form stimulates CREB3L1 cleavage. (PMID:27499293)
  • Identification of residues critical for topology inversion of the transmembrane protein TM4SF20 through regulated alternative translocation. (PMID:30808712)

Cross-species orthologs

2 orthologs

OrganismSymbolGene ID
mus_musculusTm4sf20ENSMUSG00000026149
rattus_norvegicusTm4sf20ENSRNOG00000015460

Paralogs (5): TM4SF5 (ENSG00000142484), TM4SF19 (ENSG00000145107), TM4SF18 (ENSG00000163762), TM4SF4 (ENSG00000169903), TM4SF1 (ENSG00000169908)

Protein

Protein identifiers

Transmembrane 4 L6 family member 20Q53R12 (reviewed: Q53R12)

All UniProt accessions (2): Q53R12, C9JES4

UniProt curated annotations — full annotation on UniProt →

Function. Polytopic transmembrane protein that inhibits regulated intramembrane proteolysis (RIP) of CREB3L1, inhibiting its activation and the induction of collagen synthesis. In response to ceramide, which alters TM4SF20 membrane topology, stimulates RIP activation of CREB3L1. Ceramide reverses the direction through which transmembrane helices are translocated into the endoplasmic reticulum membrane during translation of TM4SF20, this mechanism is called ‘regulated alternative translocation’ (RAT) and regulates the function of the transmembrane protein.

Subcellular location. Membrane. Endoplasmic reticulum membrane.

Tissue specificity. Expressed in the brain, with high levels in the parietal lobe, hippocampus, pons, white matter and cerebellum.

Post-translational modifications. Glycosylated at Asn-132, Asn-148 and Asn-163 in presence of ceramide which inverts the orientation of TM4SF20 in membranes exposing these residues to the endoplasmic reticulum lumen. Cleaved by signal peptidase at Ser-14 but the peptide does not act as a signal peptide. Cleavage is inhibited by ceramide which inverts the orientation of TM4SF20 in membranes exposing the N-terminus to the cytosol and not to the endoplasmic reticulum lumen.

Disease relevance. Specific language impairment 5 (SLI5) [MIM:615432] A disorder characterized by a delay in early speech acquisition. It is usually associated with cerebral white matter abnormalities on brain MRI. Some individuals may show disorders in communication, consistent with autism spectrum disorder, or global developmental delay, although others ultimately show normal cognitive function. Penetrance is incomplete and expressivity is variable. The disease is caused by variants affecting the gene represented in this entry.

Domain organisation. The first transmembrane helix plays a critical role for the insertion orientation in the endoplasmic reticulum membrane.

Induction. TGFB1 inhibits TM4SF20 expression to activate CREB3L1.

Similarity. Belongs to the L6 tetraspanin family.

RefSeq proteins (1): NP_079071* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR008661L6_membraneFamily

Pfam: PF05805

UniProt features (24 total): mutagenesis site 11, topological domain 5, transmembrane region 4, chain 1, site 1, sequence variant 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q53R12-F173.100.29

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Catalytic / active sites (1): 13–14 (cleavage)

Mutagenesis-validated functional residues (11):

PositionPhenotype
12no effect on cleavage of the first 13 residues.
13no effect on cleavage of the first 13 residues.
14abolishes cleavage of the first 13 residues.
15no effect on cleavage of the first 13 residues.
16no effect on cleavage of the first 13 residues.
22inverts transmembrane topology. induces cleavage of creb3l1.
26inverts transmembrane topology.
80no effect on glycosylation upon ceramide treatment.
132reduces glycosylation upon ceramide treatment. abolishes glycosylation upon ceramide treatment; when associated with q-1
148reduces glycosylation upon ceramide treatment. abolishes glycosylation upon ceramide treatment; when associated with q-1
163reduces glycosylation upon ceramide treatment. abolishes glycosylation upon ceramide treatment; when associated with q-1

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 76 (showing top): GOBP_NEGATIVE_REGULATION_OF_PROTEOLYSIS, GOBP_NEGATIVE_REGULATION_OF_PROTEIN_METABOLIC_PROCESS, HNF1_01, RIGGI_EWING_SARCOMA_PROGENITOR_UP, GOBP_REGULATION_OF_PROTEOLYSIS, GOCC_NUCLEAR_OUTER_MEMBRANE_ENDOPLASMIC_RETICULUM_MEMBRANE_NETWORK, GOBP_PROTEOLYSIS, GOCC_ORGANELLE_SUBCOMPARTMENT, GOBP_REGULATION_OF_PROTEIN_METABOLIC_PROCESS, VECCHI_GASTRIC_CANCER_ADVANCED_VS_EARLY_DN, BRUINS_UVC_RESPONSE_VIA_TP53_GROUP_A, PRC1_BMI_UP.V1_DN, PRC2_SUZ12_UP.V1_DN, NOTCH_DN.V1_DN, NFE2L2.V2

GO Biological Process (1): negative regulation of proteolysis (GO:0045861)

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (5): endoplasmic reticulum membrane (GO:0005789), plasma membrane (GO:0005886), focal adhesion (GO:0005925), membrane (GO:0016020), endoplasmic reticulum (GO:0005783)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
proteolysis1
regulation of proteolysis1
negative regulation of protein metabolic process1
binding1
organelle membrane1
nuclear outer membrane-endoplasmic reticulum membrane network1
endoplasmic reticulum subcompartment1
membrane1
cell periphery1
cell-substrate junction1
cellular anatomical structure1
cytoplasm1
endomembrane system1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

394 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
TM4SF20TM4SF18Q96CE8608
TM4SF20TM4SF19Q96DZ7528
TM4SF20CREB3L1Q96BA8494
TM4SF20TM4SF4P48230455
TM4SF20IGSF23A1L1A6450
TM4SF20TEX261Q6UWH6446
TM4SF20TTC16Q8NEE8444
TM4SF20GPA33Q99795441
TM4SF20PLLPQ9Y342424
TM4SF20ATP2C2O75185419
TM4SF20TRAM1Q15629405
TM4SF20NFXL1Q6ZNB6400
TM4SF20LCN15Q6UWW0396
TM4SF20ZNF232Q9UNY5393
TM4SF20HNRNPKP61978393

IntAct

12 interactions, top by confidence:

ABTypeScore
TM4SF20NINJ2psi-mi:“MI:0915”(physical association)0.560
VAMP3TM4SF20psi-mi:“MI:0915”(physical association)0.560
BBS1TM4SF20psi-mi:“MI:0915”(physical association)0.370
MEP1BTM4SF20psi-mi:“MI:0915”(physical association)0.370
TM4SF20CD74psi-mi:“MI:0914”(association)0.350
TM4SF20AK1psi-mi:“MI:0914”(association)0.350
NINJ2TM4SF20psi-mi:“MI:0915”(physical association)0.000
VAMP3TM4SF20psi-mi:“MI:0915”(physical association)0.000

BioGRID (28): VKORC1L1 (Affinity Capture-MS), MBOAT7 (Affinity Capture-MS), HSDL1 (Affinity Capture-MS), TMEM186 (Affinity Capture-MS), CAB39 (Affinity Capture-MS), TMEM43 (Affinity Capture-MS), CD74 (Affinity Capture-MS), RHOA (Affinity Capture-MS), FAM210B (Affinity Capture-MS), CD74 (Affinity Capture-MS), FAM210B (Affinity Capture-MS), HSDL1 (Affinity Capture-MS), TMEM186 (Affinity Capture-MS), MBOAT7 (Affinity Capture-MS), C10orf35 (Affinity Capture-MS)

ESM2 similar proteins: A0A2R8RY99, A0PK11, A9UL59, B2RVW2, B4L184, B4LC58, B4N5D3, D3ZFW5, O95473, P23290, P35801, P35802, P35803, P36964, P36965, P51674, P56749, P58418, P79826, Q0IIL2, Q0P4G7, Q0VD07, Q11085, Q13491, Q2YDD6, Q53R12, Q5R603, Q5R9K1, Q5R9Q3, Q5R9R3, Q5T9L3, Q5ZLR1, Q6AYR5, Q6CRM6, Q6DID7, Q6P689, Q6UX40, Q754N9, Q7YWX7, Q812E9

Diamond homologs: E9Q9H8, O14894, P30408, P48230, P49111, Q2KIG8, Q3T110, Q53R12, Q5R6Z4, Q5RE43, Q64302, Q91XD3, Q96CE8, Q96DZ7, Q9CQY8, Q9EQL5, Q3T0Z4

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

102 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic2
Likely pathogenic0
Uncertain significance66
Likely benign23
Benign9

Top pathogenic / likely-pathogenic (2)

Variant IDHGVSClassification
243063TM4SF20, 4-KB DELPathogenic
625778GRCh37/hg19 2q36.3(chr2:228230759-228234864)Pathogenic

SpliceAI

419 predictions. Top by Δscore:

VariantEffectΔscore
2:227364008:TGTCA:Tacceptor_gain1.0000
2:227364009:GTCA:Gacceptor_gain1.0000
2:227364010:TCA:Tacceptor_gain1.0000
2:227364011:CA:Cacceptor_gain1.0000
2:227364011:CAC:Cacceptor_gain1.0000
2:227364012:ACTG:Aacceptor_loss1.0000
2:227364013:C:CCacceptor_gain1.0000
2:227364013:CT:Cacceptor_loss1.0000
2:227364014:T:Gacceptor_loss1.0000
2:227370913:A:ACdonor_gain1.0000
2:227370913:ACT:Adonor_gain1.0000
2:227370914:C:CCdonor_gain1.0000
2:227370914:CTC:Cdonor_gain1.0000
2:227370981:C:CCacceptor_gain1.0000
2:227379159:T:TAdonor_gain1.0000
2:227366087:ACTT:Adonor_loss0.9900
2:227366088:CTT:Cdonor_loss0.9900
2:227366089:TTA:Tdonor_loss0.9900
2:227366090:TACCT:Tdonor_loss0.9900
2:227366091:A:ACdonor_gain0.9900
2:227366092:C:CCdonor_gain0.9900
2:227366220:CA:Cacceptor_gain0.9900
2:227366222:C:CCacceptor_gain0.9900
2:227366242:CAT:Cacceptor_gain0.9900
2:227370914:CT:Cdonor_gain0.9900
2:227370977:TGGC:Tacceptor_gain0.9900
2:227370979:GCCT:Gacceptor_loss0.9900
2:227370981:CTGG:Cacceptor_loss0.9900
2:227379199:C:CTdonor_gain0.9900
2:227379200:T:TTdonor_gain0.9900

AlphaMissense

1511 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:227366191:G:CC101W0.990
2:227363790:A:CS208R0.988
2:227363790:A:TS208R0.988
2:227363792:T:GS208R0.988
2:227366160:C:AG112C0.987
2:227363838:A:CF192L0.986
2:227363838:A:TF192L0.986
2:227363840:A:GF192L0.986
2:227363979:C:AW145C0.983
2:227363979:C:GW145C0.983
2:227366204:C:TG97D0.983
2:227366221:A:CS91R0.981
2:227366221:A:TS91R0.981
2:227366223:T:GS91R0.981
2:227379191:A:CN26K0.981
2:227379191:A:TN26K0.981
2:227366193:A:GC101R0.977
2:227366180:G:AS105F0.976
2:227366160:C:GG112R0.975
2:227379093:C:TG59D0.975
2:227363819:C:GG199R0.973
2:227363819:C:TG199R0.973
2:227363830:A:GL195P0.971
2:227366180:G:TS105Y0.971
2:227366192:C:TC101Y0.971
2:227366205:C:GG97R0.970
2:227379109:C:GG54R0.970
2:227379109:C:TG54R0.970
2:227379205:C:GG22R0.969
2:227379205:C:TG22R0.969

dbSNP variants (sampled 300 via entrez): RS1000020956 (2:227371545 G>A,C), RS1000116361 (2:227375674 A>G), RS1000154586 (2:227375527 T>C), RS1000344491 (2:227380843 C>T), RS1000446544 (2:227375226 CA>C,CAA), RS1000594296 (2:227363329 C>A,T), RS1000608415 (2:227362436 TGAG>T), RS1000626821 (2:227370281 G>A), RS1001143758 (2:227369979 A>T), RS1001257645 (2:227379874 A>T), RS1001384289 (2:227364391 C>T), RS1001440617 (2:227374376 C>T), RS1001562537 (2:227368293 G>A), RS1001825587 (2:227361706 T>G), RS1001886225 (2:227375894 C>G,T)

Disease associations

OMIM: gene MIM:615404 | disease phenotypes: MIM:615432

GenCC curated gene-disease

DiseaseClassificationInheritance
specific language impairment 5LimitedAutosomal dominant

Mondo (1): specific language impairment 5 (MONDO:0014184)

Orphanet (0):

HPO phenotypes

7 total (7 of 7 shown, HPO-id order):

HPOTerm
HP:0000006Autosomal dominant inheritance
HP:0000729Autistic behavior
HP:0000750Delayed speech and language development
HP:0001263Global developmental delay
HP:0002463Language impairment
HP:0003829Typified by incomplete penetrance
HP:0030890Hyperintensity of cerebral white matter on MRI

GWAS associations

2 associations (top):

StudyTraitp-value
GCST007465_2Phoneme awareness9.000000e-06
GCST011742_42Triglyceride levels in HIV infection5.000000e-07

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0005301reading and spelling ability
EFO:0004530triglyceride measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

18 total (human), top 18 by PubMed support.

ChemicalActions (top 5)PubMed papers
propionaldehydedecreases expression1
butyraldehydedecreases expression1
S-(1,2-dichlorovinyl)cysteineaffects cotreatment, affects response to substance, increases expression1
CGP 52608affects binding, increases reaction1
abrinedecreases expression1
jinfukangaffects cotreatment, increases expression1
incobotulinumtoxinAincreases expression1
Air Pollutantsdecreases expression1
Benzo(a)pyrenedecreases expression1
Cisplatinaffects cotreatment, increases expression1
Daunorubicinaffects response to substance1
Lipopolysaccharidesaffects response to substance, increases expression, affects cotreatment1
Methotrexatedecreases expression1
Silicon Dioxidedecreases expression1
7,8-Dihydro-7,8-dihydroxybenzo(a)pyrene 9,10-oxideincreases expression1
Asbestos, Crocidolitedecreases expression1
Okadaic Acidaffects expression1
Particulate Matterdecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.