SOX17

gene
On this page

Summary

SOX17 (SRY-box transcription factor 17, HGNC:18122) is a protein-coding gene on chromosome 8q11.23, encoding Transcription factor SOX-17 (Q9H6I2). Acts as a transcription regulator that binds target promoter DNA.

This gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulator after forming a protein complex with other proteins.

Source: NCBI Gene 64321 — RefSeq curated summary.

At a glance

  • Gene–disease (curated): pulmonary arterial hypertension (Definitive, ClinGen) — +2 more curated relationships
  • GWAS associations: 28
  • Clinical variants (ClinVar): 272 total — 6 pathogenic, 5 likely-pathogenic
  • Phenotypes (HPO): 62
  • Druggable target: yes
  • Cancer driver (intOGen): activating (oncogene-like) across 1 cancer types
  • Transcription factor: yes — 51 downstream targets (CollecTRI)
  • MANE Select transcript: NM_022454

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:18122
Approved symbolSOX17
NameSRY-box transcription factor 17
Location8q11.23
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000164736
Ensembl biotypeprotein_coding
OMIM610928
Entrez64321

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000297316

RefSeq mRNA: 1 — MANE Select: NM_022454 NM_022454

CCDS: CCDS6159

Canonical transcript exons

ENST00000297316 — 2 exons

ExonStartEnd
ENSE000010870255445793554458445
ENSE000010870275445905854460892

Expression profiles

Bgee: expression breadth ubiquitous, 190 present calls, max score 91.20.

FANTOM5 (CAGE): breadth broad, TPM avg 10.6306 / max 256.4264, expressed in 587 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
8887010.1345584
888720.3144171
888730.091259
888710.090656

Top tissues by expression

270 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
endothelial cellCL:000011591.20gold quality
omental fat padUBERON:001041490.57gold quality
peritoneumUBERON:000235890.56gold quality
pericardiumUBERON:000240789.64gold quality
endometriumUBERON:000129589.17gold quality
adipose tissue of abdominal regionUBERON:000780889.11gold quality
right uterine tubeUBERON:000130287.76gold quality
vena cavaUBERON:000408786.73gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099185.97gold quality
left uterine tubeUBERON:000130385.46gold quality
tibial nerveUBERON:000132384.70gold quality
subcutaneous adipose tissueUBERON:000219083.33gold quality
hindlimb stylopod muscleUBERON:000425281.52gold quality
uterusUBERON:000099581.49gold quality
endocervixUBERON:000045881.31gold quality
adipose tissueUBERON:000101381.01gold quality
mucosa of stomachUBERON:000119980.91gold quality
apex of heartUBERON:000209880.69gold quality
body of uterusUBERON:000985380.50gold quality
connective tissueUBERON:000238479.82gold quality
heart left ventricleUBERON:000208479.23gold quality
spleenUBERON:000210679.00gold quality
cardiac ventricleUBERON:000208278.95gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047378.57gold quality
gastrocnemiusUBERON:000138878.36gold quality
left coronary arteryUBERON:000162678.08gold quality
gall bladderUBERON:000211078.08gold quality
diaphragmUBERON:000110378.02gold quality
right lungUBERON:000216777.64gold quality
fallopian tubeUBERON:000388977.39gold quality

Single-cell (SCXA)

Detected in 9 experiment(s), a significant marker in 6.

ExperimentMarker?Max mean expression
E-MTAB-7008yes1238.31
E-GEOD-135922yes667.36
E-MTAB-10137yes578.14
E-MTAB-10287yes53.16
E-GEOD-109979yes44.65
E-MTAB-10283no637.18
E-MTAB-8271no448.95
E-MTAB-6524no46.38
E-ANND-3no0.00

Regulation

Is transcription factor: yes

Downstream targets (CollecTRI)

51 targets.

TargetRegulation
ACANActivation
ACHE
ADAM2
ADRA1D
AXIN2
BCL2
CCND1Unknown
CDKN1ARepression
CDKN1CRepression
CDKN2B
CEL
COL2A1Activation
COMP
CRYAB
CSNK1A1Repression
CTNNB1Repression
CXCR4Repression
CYP17A1
DMRT1
ESR1
FGF4
FN1Repression
HNF4AActivation
HP
HSPB2
IBSP
IER3Repression
KDR
LAMA1Unknown
LEF1Unknown

Upstream regulators (CollecTRI, top): EZH2, GATA5, HOXA1, KLF5, NANOG, PITX2, POU5F1, SALL4, SOX17, SP1, SRY, TFAP2A

miRNA regulators (miRDB)

65 targeting SOX17, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-196A-1-3P99.9972.152772
HSA-MIR-548AN99.9770.912817
HSA-MIR-590-3P99.9674.346478
HSA-MIR-548AJ-3P99.9673.385345
HSA-MIR-548X-3P99.9673.385345
HSA-MIR-141-3P99.9472.792421
HSA-MIR-200A-3P99.9472.682420
HSA-MIR-539-5P99.9370.302855
HSA-MIR-329-3P99.9166.561234
HSA-MIR-362-3P99.9166.381267
HSA-MIR-129799.9173.413162
HSA-MIR-95-5P99.8972.173973
HSA-MIR-548D-3P99.8770.674362
HSA-MIR-548BB-3P99.8670.584354
HSA-MIR-548AC99.8470.774351
HSA-MIR-548H-3P99.8470.804349
HSA-MIR-548Z99.8470.804349
HSA-MIR-3180-5P99.8269.122422
HSA-MIR-3913-5P99.7867.26968
HSA-MIR-26A-5P99.7873.522303
HSA-MIR-26B-5P99.7873.512305
HSA-MIR-548AG99.7769.251492
HSA-MIR-1255A99.7468.09744
HSA-MIR-1255B-5P99.7468.16741
HSA-MIR-446599.7172.562096
HSA-MIR-33A-3P99.7070.273362
HSA-MIR-548AI99.6969.241494
HSA-MIR-548BA99.6969.141514
HSA-MIR-570-5P99.6969.241494
HSA-MIR-312299.5066.33821

Literature-anchored findings (GeneRIF, showing 40)

  • SOX17 silencing due to promoter hypermethylation is an early event during tumorigenesis and may contribute to aberrant activation of Wnt signaling in colorectal cancer. (PMID:18413743)
  • SOX17-Chromatin immunoprecipitation identified zinc finger protein 202 (Zfp202) as a direct target of SOX17 during endoderm differentiation of F9 embryonal carcinoma cells. (PMID:18523156)
  • stable endoderm progenitors can be established from human ES cells by constitutive expression of SOX17 producing definitive endoderm progenitors (PMID:18682240)
  • stable expression induces a stable expandable definitive endoderm phenotype in hESCs (PMID:18940723)
  • Sox17 may be a valuable biomarker for the study of breast cancer carcinogenesis and progression. (PMID:19301122)
  • SOX2 and SOX17 expression patterns can distinguish between seminoma and embryonal carcinoma, and this distinction may be diagnostically useful. (PMID:19369635)
  • Sox17 protects benign tumors from malignant progression at an early stage of tumorigenesis, and down-regulation of Sox17 contributes to malignant progression through promotion of Wnt activity. (PMID:19549530)
  • Data reveal a molecular Oct4-, Sox2- and Sox17-mediated mechanism that disrupts the stem cell microenvironment favoring pluripotency to provide an endodermal environment in which cell lineage is determined and commits the cells to a cardiogenic fate. (PMID:19736317)
  • SOX17 negatively regulates canonical WNT/beta-catenin signaling pathway and inhibits human HCC cells growth (PMID:20716954)
  • These data indicate a role of SOX17 in human kidney and urinary tract development and implicate the SOX17-p.Y259N mutation as a causative factor in congenital anomalies of the kidney and the urinary tract . (PMID:20960469)
  • Data show that down-regulated by Sox17 expressing HepG2 cells is a set of genes that are expressed in the developing liver, suggesting that one function of Sox17 is the repression of liver gene expression. (PMID:21305474)
  • SOX17(+)-human embryonic stem cell progeny expressed endodermal markers. (PMID:21362573)
  • Sox17 might be a key transcription factor controlling CD133 expression, and that it might also play a role in the control of gastric tumor progression. (PMID:21457403)
  • Sox17 prominently contributes to gastric cancer progression through regulating proliferation and cell cycle, indicating a novel diagnosis and prognosis biomarker as well as a potential therapeutic target in gastric cancer (PMID:21514720)
  • A stage-specific transduction of SOX17 in the primitive endoderm or mesendoderm promotes directive extraembryonic endoderm or definitive endoderm differentiation by SOX17 transduction, respectively. (PMID:21760905)
  • Downstream targets of Sox17 define signaling pathways and molecular mechanisms in oligodendrocyte progenitor cells that are regulated by Sox17 during cell cycle exit and differentiation in oligodendrocyte development. (PMID:21957254)
  • silencing of Sox17 occurs frequently in early gastric cancer (PMID:22161215)
  • New sequence variations in SOX17 were identified but all correspond to nonpathogenic variants, suggesting that SOX17 is not involved in UHL phenotype (PMID:22348788)
  • SOX17 acts as a Wnt signaling inhibitor. (PMID:22846201)
  • SOX17 promoter region is frequently methylated in esophageal cancer and in a progression tendency during esophageal carcinogenesis. (PMID:22921431)
  • provided further evidence to support the previously reported association of intracranial aneurysm with single nucleotide polymorphism in SOX17 (PMID:22961961)
  • SOX17 was frequently methylated in human PTC. Loss of SOX17 expression was induced by promoter region hypermethylation. SOX17 inhibited thyroid cancer proliferation. Methylation of SOX17 activated the Wnt signaling pathway in human thyroid cancer. (PMID:23044318)
  • after electrostatic interactions attract Sox17 to DNA, Asn73, Ser99, and Trp106 form hydrogen bonds with DNA, Arg70, Lys80, Arg83, His94, and Asn95 on Sox17 undergo conformational changes and form hydrogen bonds with DNA (PMID:23061670)
  • SOX17 promoter is highly methylated in primary breast tumors, in CTCs isolated from patients with breast cancer, and in corresponding cfDNA samples (PMID:23136251)
  • SOX17 plays a key role in priming hemogenic potential in endothelial cells, thereby regulating hematopoietic development from stem cells. (PMID:23169777)
  • findings establish Sox17 as a key regulator of tumor angiogenesis and tumor progression. (PMID:23241958)
  • Overall survival of patients with gastric cancer was found to be significantly associated with SOX17 promoter methylation. (PMID:23403728)
  • recombinant Sox17 mediates modulation of the Wnt pathway through changes in beta-catenin, SFRP1 and Wnt/Frizzled expression. (PMID:23474492)
  • Experimental confirmation of miRNA-mRNA interactions established a critical role of miR-200a in regulating both EMT and definitive endoderm formation, through direct repression of ZEB2 and SOX17, during early stage differentiation. (PMID:23813959)
  • In multiple sclerosis tissue, Sox17 was primarily detected in actively demyelinating lesions and periplaque white matter. (PMID:23918253)
  • Immunofluorescence analysis of human pancreatic tissue arrays revealed the presence of tuft cells in metaplasia and early-stage tumors, along with SOX17 expression, consistent with a biliary phenotype. (PMID:23999170)
  • Low SOX17 expression is associated with esophageal cancer progression. (PMID:24407731)
  • Possible association of SOX-17 and RBBP8 with brain arteriovenous malformations, genes involved in cell cycle progression, deserves further investigation (PMID:25053769)
  • Low Sox17 expression is associated with hepatocellular carcinoma. (PMID:25106407)
  • Hypermethylation of SOX17 promoter may be one of the early events in the development of myelodysplatic syndrome and predicts poor prognosis. (PMID:25291942)
  • Decreased Sox17 expression is correlated with melanoma progression, an unfavorable survival of melanoma patients and is an independent molecular prognostic factor for melanoma. (PMID:25310020)
  • SOX17 is the key regulator of human primordial germ cells-like cells specification, whereas BLIMP1 represses endodermal and other somatic genes during hPGCLC specification. (PMID:25543152)
  • Human intracranial aneurysm samples showed reduced Sox17 expression and impaired endothelial integrity. (PMID:25596186)
  • In this study, oligodendroglioma patients with 1p/19q LOH and Sox17 protein expression had a better prognosis. (PMID:25674225)
  • promoter methylation may play an important role in breast cancer progression and could be used as a prognostic biomarker to identify patients at risk of developing metastasis or recurrence after mastectomy (PMID:25789956)

Cross-species orthologs

5 orthologs

OrganismSymbolGene ID
danio_reriosox17ENSDARG00000101717
mus_musculusSox17ENSMUSG00000025902
rattus_norvegicusSox17ENSRNOG00000027357
drosophila_melanogasterSox14FBGN0005612
drosophila_melanogasterSox21aFBGN0036411

Paralogs (20): SOX8 (ENSG00000005513), SOX30 (ENSG00000039600), SOX10 (ENSG00000100146), SOX6 (ENSG00000110693), SOX4 (ENSG00000124766), SOX21 (ENSG00000125285), SOX9 (ENSG00000125398), SOX15 (ENSG00000129194), SOX5 (ENSG00000134532), SOX3 (ENSG00000134595), SOX13 (ENSG00000143842), SOX14 (ENSG00000168875), SOX7 (ENSG00000171056), SOX11 (ENSG00000176887), SOX12 (ENSG00000177732), CFAP65 (ENSG00000181378), SOX2 (ENSG00000181449), SOX1 (ENSG00000182968), SRY (ENSG00000184895), SOX18 (ENSG00000203883)

Protein

Protein identifiers

Transcription factor SOX-17Q9H6I2 (reviewed: Q9H6I2)

All UniProt accessions (1): Q9H6I2

UniProt curated annotations — full annotation on UniProt →

Function. Acts as a transcription regulator that binds target promoter DNA. Binds to the sequences 5’-AACAAT-‘3 or 5’-AACAAAG-3’. Modulates transcriptional regulation via WNT3A. Inhibits Wnt signaling. Promotes degradation of activated CTNNB1. Plays a key role in the regulation of embryonic development. Required for normal development of the definitive gut endoderm. Required for normal looping of the embryonic heart tube. Plays an important role in embryonic and postnatal vascular development, including development of arteries. Plays an important role in postnatal angiogenesis, where it is functionally redundant with SOX18. Required for the generation and maintenance of fetal hematopoietic stem cells, and for fetal hematopoiesis. Probable transcriptional activator in the premeiotic germ cells.

Subunit / interactions. Interacts with CTNNB1, LEF1 and TCF4.

Subcellular location. Nucleus.

Tissue specificity. Expressed in adult heart, lung, spleen, testis, ovary, placenta, fetal lung, and kidney. In normal gastrointestinal tract, it is preferentially expressed in esophagus, stomach and small intestine than in colon and rectum.

Disease relevance. Vesicoureteral reflux 3 (VUR3) [MIM:613674] A disease belonging to the group of congenital anomalies of the kidney and urinary tract. It is characterized by the reflux of urine from the bladder into the ureters and sometimes into the kidneys, and is a risk factor for urinary tract infections. Primary disease results from a developmental defect of the ureterovesical junction. In combination with intrarenal reflux, the resulting inflammatory reaction may result in renal injury or scarring, also called reflux nephropathy. Extensive renal scarring impairs renal function and may predispose patients to hypertension, proteinuria, renal insufficiency and end-stage renal disease. The disease is caused by variants affecting the gene represented in this entry. Pulmonary hypertension, primary, 7 (PPH7) [MIM:621248] A form of primary pulmonary hypertension, a disease defined by plexiform lesions of proliferating endothelial cells in pulmonary arterioles. The lesions lead to elevated pulmonary arterial pression, right ventricular failure, and death. Primary pulmonary hypertension exhibits incomplete penetrance, sex bias and variable age of onset, both within and between families. PPH7 is an autosomal dominant form. The disease is caused by variants affecting the gene represented in this entry.

Domain organisation. The 9aaTAD motif is a transactivation domain present in a large number of yeast and animal transcription factors.

RefSeq proteins (1): NP_071899* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR009071HMG_box_domDomain
IPR021934Sox_CDomain
IPR033392Sox7/17/18_centralDomain
IPR036910HMG_box_dom_sfHomologous_superfamily
IPR050140SRY-related_HMG-box_TF-likeFamily

Pfam: PF00505, PF12067

UniProt features (41 total): sequence variant 28, region of interest 4, helix 3, compositionally biased region 2, chain 1, domain 1, DNA-binding region 1, short sequence motif 1

Structure

Experimental structures (PDB)

2 structures.

PDBMethodResolution (Å)
4A3NX-RAY DIFFRACTION2.4
2YULSOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9H6I2-F158.950.20

Function

Pathways and Gene Ontology

Reactome pathways

12 pathways

IDPathway
R-HSA-3769402Deactivation of the beta-catenin transactivating complex
R-HSA-9823730Formation of definitive endoderm
R-HSA-9827857Specification of primordial germ cells
R-HSA-9937080Developmental Lineage of Multipotent Pancreatic Progenitor Cells
R-HSA-1266738Developmental Biology
R-HSA-1474165Reproduction
R-HSA-162582Signal Transduction
R-HSA-195721Signaling by WNT
R-HSA-201681TCF dependent signaling in response to WNT
R-HSA-9734767Developmental Cell Lineages
R-HSA-9758941Gastrulation
R-HSA-9925561Developmental Lineage of Pancreatic Acinar Cells

MSigDB gene sets: 253 (showing top): GOBP_MORPHOGENESIS_OF_AN_EPITHELIUM, GOBP_RESPONSE_TO_NITROGEN_COMPOUND, GOBP_HEPATICOBILIARY_SYSTEM_DEVELOPMENT, GOBP_EMBRYO_DEVELOPMENT_ENDING_IN_BIRTH_OR_EGG_HATCHING, GOBP_EPITHELIUM_DEVELOPMENT, GOBP_URETER_DEVELOPMENT, WALLACE_PROSTATE_CANCER_RACE_UP, BENPORATH_ES_WITH_H3K27ME3, GOBP_METANEPHROS_DEVELOPMENT, GOBP_RESPONSE_TO_PEPTIDE, GOBP_FORMATION_OF_PRIMARY_GERM_LAYER, GOBP_NEGATIVE_REGULATION_OF_CELL_GROWTH, GOBP_GROWTH, GOBP_POSITIVE_REGULATION_OF_PROTEIN_CATABOLIC_PROCESS, GOBP_MACROMOLECULE_CATABOLIC_PROCESS

GO Biological Process (56): angiogenesis (GO:0001525), vasculogenesis (GO:0001570), metanephros development (GO:0001656), endoderm formation (GO:0001706), endodermal cell fate specification (GO:0001714), inner cell mass cellular morphogenesis (GO:0001828), heart looping (GO:0001947), cardiogenic plate morphogenesis (GO:0003142), embryonic heart tube morphogenesis (GO:0003143), outflow tract morphogenesis (GO:0003151), regulation of DNA-templated transcription (GO:0006355), regulation of transcription by RNA polymerase II (GO:0006357), spermatogenesis (GO:0007283), endodermal cell fate determination (GO:0007493), heart development (GO:0007507), gene expression (GO:0010467), positive regulation of gene expression (GO:0010628), Wnt signaling pathway (GO:0016055), rostrocaudal neural tube patterning (GO:0021903), signal transduction involved in regulation of gene expression (GO:0023019), negative regulation of cell growth (GO:0030308), protein destabilization (GO:0031648), embryonic heart tube development (GO:0035050), cell migration involved in gastrulation (GO:0042074), response to alkaloid (GO:0043279), positive regulation of protein catabolic process (GO:0045732), positive regulation of DNA-templated transcription (GO:0045893), positive regulation of transcription by RNA polymerase II (GO:0045944), regulation of embryonic development (GO:0045995), embryonic foregut morphogenesis (GO:0048617), stem cell fate specification (GO:0048866), protein stabilization (GO:0050821), endocardium formation (GO:0060214), cardiac cell fate determination (GO:0060913), heart formation (GO:0060914), endocardial cell differentiation (GO:0060956), common bile duct development (GO:0061009), gallbladder development (GO:0061010), endodermal digestive tract morphogenesis (GO:0061031), regulation of stem cell proliferation (GO:0072091)

GO Molecular Function (11): transcription cis-regulatory region binding (GO:0000976), RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA-binding transcription activator activity, RNA polymerase II-specific (GO:0001228), beta-catenin binding (GO:0008013), RNA polymerase II-specific DNA-binding transcription factor binding (GO:0061629), DNA binding (GO:0003677), DNA-binding transcription factor activity (GO:0003700), protein binding (GO:0005515), sequence-specific DNA binding (GO:0043565), sequence-specific double-stranded DNA binding (GO:1990837)

GO Cellular Component (4): chromatin (GO:0000785), nucleus (GO:0005634), nucleoplasm (GO:0005654), transcription regulator complex (GO:0005667)

Reactome top-level categories

Rollup of top-7 pathways:

CategoryPathways
Developmental Cell Lineages of the Exocrine Pancreas2
Developmental Biology2
TCF dependent signaling in response to WNT1
Gastrulation1
Reproduction1
Signal Transduction1
Signaling by WNT1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
heart morphogenesis3
regulation of gene expression3
RNA polymerase II transcription regulatory region sequence-specific DNA binding3
blood vessel morphogenesis2
endodermal cell fate commitment2
embryonic morphogenesis2
anatomical structure morphogenesis2
regulation of DNA-templated transcription2
cellular anatomical structure2
anatomical structure formation involved in morphogenesis1
cell differentiation1
kidney development1
formation of primary germ layer1
endoderm development1
cell fate specification1
cell morphogenesis1
inner cell mass cell differentiation1
embryonic heart tube morphogenesis1
determination of heart left/right asymmetry1
embryonic heart tube development1
embryonic organ morphogenesis1
epithelial tube morphogenesis1
DNA-templated transcription1
regulation of RNA biosynthetic process1
transcription by RNA polymerase II1
developmental process involved in reproduction1
male gamete generation1
cell fate determination1
animal organ development1
circulatory system development1
macromolecule biosynthetic process1
gene expression1
positive regulation of macromolecule biosynthetic process1
cell surface receptor signaling pathway1
anterior/posterior pattern specification1
neural tube patterning1
signal transduction1
transcription regulatory region nucleic acid binding1
sequence-specific double-stranded DNA binding1
cis-regulatory region sequence-specific DNA binding1

Protein interactions and networks

STRING

2682 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SOX17POU5F1P31359984
SOX17FOXA2Q9Y261940
SOX17CTNNB1P35222914
SOX17GATA4P43694875
SOX17NANOGQ9H9S0858
SOX17GATA6P78327846
SOX17MIXL1Q9H2W2839
SOX17PAX6P26367809
SOX17MESP1Q9BRJ9802
SOX17HNF4AP41235798
SOX17CDX2Q99626794
SOX17FOXA1P55317783
SOX17ROBO2Q9HCK4764
SOX17WNT3AP56704750
SOX17EOMESO95936746

IntAct

9 interactions, top by confidence:

ABTypeScore
SOX17CTNNB1psi-mi:“MI:0914”(association)0.500
CTNNB1SOX17psi-mi:“MI:0915”(physical association)0.500
NFIASOX17psi-mi:“MI:0915”(physical association)0.470
NFIBSOX17psi-mi:“MI:0915”(physical association)0.470
NFICSOX17psi-mi:“MI:0915”(physical association)0.400
APBA3SOX17psi-mi:“MI:0915”(physical association)0.370
SOX17EEFSECpsi-mi:“MI:0914”(association)0.350
SOX17BCL9psi-mi:“MI:2364”(proximity)0.270

BioGRID (81): RPL38 (Affinity Capture-MS), HOMER3 (Affinity Capture-MS), EBNA1BP2 (Affinity Capture-MS), EEFSEC (Affinity Capture-MS), KDM6A (Proximity Label-MS), POU2F1 (Proximity Label-MS), HOXA5 (Proximity Label-MS), SMARCA2 (Proximity Label-MS), SALL1 (Proximity Label-MS), TRPS1 (Proximity Label-MS), BCL9 (Proximity Label-MS), NFIB (Proximity Label-MS), ARID1A (Proximity Label-MS), KMT2D (Proximity Label-MS), ZNF609 (Proximity Label-MS)

ESM2 similar proteins: A2A9A2, A4QNP7, A6QQ94, A6YP92, A7M7C7, A7MB54, B5RHS5, M0R6D8, O09029, O35085, P28360, P41225, P43694, P46153, P50548, P78414, P78415, P79772, P81067, P81068, P84550, P84551, Q08369, Q0Q0E4, Q14549, Q14774, Q2I327, Q2MJB4, Q2VL84, Q2VL87, Q2VL88, Q2VWA4, Q3SZJ5, Q4AE28, Q61169, Q61345, Q6YHU8, Q71T09, Q76L87, Q7TQ40

Diamond homologs: A0A0G2JTZ2, A2TED3, A5D8R3, B1H349, B3DLD3, B3DM43, F1M8W4, O42342, O42601, P0C1G9, P35710, P35711, P35712, P35713, P35716, P36389, P36390, P36393, P36394, P36396, P40645, P40646, P40647, P40649, P40650, P40656, P40657, P43680, P47792, P48433, P48435, Q03255, Q03257, Q04891, Q05738, Q06831, Q06945, Q20201, Q23045, Q27949

SIGNOR signaling

6 interactions.

AEffectBMechanism
POU5F1“down-regulates quantity by repression”SOX17“transcriptional regulation”
NANOG“down-regulates quantity by repression”SOX17“transcriptional regulation”
SOX17down-regulatesCTNNB1binding
SOX17up-regulatesGLI2
SOX17“down-regulates activity”CTNNB1
SOX17“form complex”SOX17/POU5F1binding

Disease & clinical

Cancer significance

From intOGen — cancer-driver classification: activating (oncogene-like) across 1 cancer types — UCEC.

Clinical variants and AI predictions

ClinVar

272 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic6
Likely pathogenic5
Uncertain significance176
Likely benign54
Benign16

Top pathogenic / likely-pathogenic (11)

Variant IDHGVSClassification
1527900NM_022454.4(SOX17):c.314C>A (p.Ser105Ter)Pathogenic
18415SOX17, 6-BP INS, NT51Pathogenic
3906893SOX17, TYR137TERPathogenic
3906894Q127*Pathogenic
3906895SOX17, 22-BP DEL, NT499Pathogenic
817453NM_022454.4(SOX17):c.499_520del (p.Leu167fs)Pathogenic
1188434NM_022454.4(SOX17):c.404A>G (p.Tyr135Cys)Likely pathogenic
2584495NM_022454.4(SOX17):c.208C>G (p.Arg70Gly)Likely pathogenic
2683958NM_022454.4(SOX17):c.245A>G (p.Glu82Gly)Likely pathogenic
3900765NM_022454.4(SOX17):c.209G>A (p.Arg70Gln)Likely pathogenic
4530598NM_022454.4(SOX17):c.413G>C (p.Arg138Pro)Likely pathogenic

SpliceAI

174 predictions. Top by Δscore:

VariantEffectΔscore
8:54458443:TGGG:Tdonor_loss1.0000
8:54458444:GG:Gdonor_gain1.0000
8:54458445:GG:Gdonor_gain1.0000
8:54458446:G:GGdonor_gain1.0000
8:54458446:GTGA:Gdonor_loss1.0000
8:54458447:T:Adonor_loss1.0000
8:54459049:A:AGacceptor_gain1.0000
8:54459049:ACT:Aacceptor_gain1.0000
8:54459050:C:Gacceptor_gain1.0000
8:54459051:T:Aacceptor_gain1.0000
8:54459053:T:TAacceptor_gain1.0000
8:54459055:CAGGC:Cacceptor_loss1.0000
8:54459056:A:AGacceptor_gain1.0000
8:54459057:G:GGacceptor_gain1.0000
8:54459057:GGCAA:Gacceptor_gain1.0000
8:54458430:T:Gdonor_gain0.9900
8:54458441:GCTGG:Gdonor_gain0.9900
8:54459049:ACTGT:Aacceptor_gain0.9900
8:54459056:AG:Aacceptor_gain0.9900
8:54459057:GG:Gacceptor_gain0.9900
8:54459057:GGC:Gacceptor_gain0.9900
8:54459057:GGCA:Gacceptor_gain0.9900
8:54458443:TGG:Tdonor_gain0.9700
8:54458444:GGG:Gdonor_gain0.9700
8:54458442:CTGG:Cdonor_gain0.9600
8:54459055:CAGG:Cacceptor_gain0.9500
8:54459056:AGGC:Aacceptor_gain0.9500
8:54458450:G:Cdonor_loss0.9300
8:54459053:TGCA:Tacceptor_gain0.9300
8:54459054:GCAG:Gacceptor_gain0.9300

AlphaMissense

2685 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
8:54458341:T:AI68N1.000
8:54458341:T:CI68T1.000
8:54458341:T:GI68S1.000
8:54458346:C:GR70G1.000
8:54458346:C:TR70W1.000
8:54458347:G:AR70Q1.000
8:54458349:C:AP71T1.000
8:54458349:C:GP71A1.000
8:54458349:C:TP71S1.000
8:54458350:C:AP71Q1.000
8:54458350:C:GP71R1.000
8:54458350:C:TP71L1.000
8:54458353:T:CM72T1.000
8:54458354:G:AM72I1.000
8:54458354:G:CM72I1.000
8:54458354:G:TM72I1.000
8:54458355:A:CN73H1.000
8:54458355:A:GN73D1.000
8:54458355:A:TN73Y1.000
8:54458356:A:CN73T1.000
8:54458356:A:GN73S1.000
8:54458356:A:TN73I1.000
8:54458357:C:AN73K1.000
8:54458357:C:GN73K1.000
8:54458359:C:AA74D1.000
8:54458359:C:TA74V1.000
8:54458361:T:AF75I1.000
8:54458361:T:CF75L1.000
8:54458361:T:GF75V1.000
8:54458362:T:CF75S1.000

dbSNP variants (sampled 300 via entrez): RS1000269730 (8:54456993 T>C), RS1001499056 (8:54456824 A>G), RS1002224545 (8:54456953 A>G), RS1002655500 (8:54458225 C>A,T), RS1002681757 (8:54457182 G>A), RS1004560711 (8:54456091 G>T), RS1004630574 (8:54457513 A>C), RS1005009675 (8:54457437 T>C), RS1005224778 (8:54458669 C>T), RS1005389905 (8:54457668 CAGCCCTCCCAGACGGCTTTTCGAGTCTCCCT>C), RS1005828003 (8:54459934 T>A,C,G), RS1006180345 (8:54457219 T>C,G), RS1006254586 (8:54458456 G>C), RS1006579680 (8:54458792 T>C), RS1007459047 (8:54458001 C>A,G)

Disease associations

OMIM: gene MIM:610928 | disease phenotypes: MIM:613674, MIM:621248

GenCC curated gene-disease

DiseaseClassificationInheritance
vesicoureteral reflux 3DefinitiveAutosomal dominant
pulmonary arterial hypertensionStrongAutosomal dominant
familial vesicoureteral refluxSupportiveAutosomal dominant

ClinGen Gene-Disease Validity (1)

Expert-panel classifications — Definitive > Strong > Moderate > Limited > Disputed > Refuted.

DiseaseClassificationInheritance
pulmonary arterial hypertensionDefinitiveAD

Mondo (6): vesicoureteral reflux 3 (MONDO:0013356), pulmonary arterial hypertension (MONDO:0015924), pulmonary hypertension, primary, 7 (MONDO:0979237), vesicoureteral reflux (MONDO:0006007), chronic kidney disease (MONDO:0005300), familial vesicoureteral reflux (MONDO:0017329)

Orphanet (3): Familial vesicoureteral reflux (Orphanet:289365), Pulmonary arterial hypertension (Orphanet:182090), Idiopathic/heritable pulmonary arterial hypertension (Orphanet:422)

HPO phenotypes

62 total (30 of 62 shown, HPO-id order):

HPOTerm
HP:0000006Autosomal dominant inheritance
HP:0000010Recurrent urinary tract infections
HP:0000072Hydroureter
HP:0000074Ureteropelvic junction obstruction
HP:0000126Hydronephrosis
HP:0000421Epistaxis
HP:0000717Autism
HP:0000821Hypothyroidism
HP:0000961Cyanosis
HP:0001297Stroke
HP:0001508Failure to thrive
HP:0001561Polyhydramnios
HP:0001629Ventricular septal defect
HP:0001631Atrial septal defect
HP:0001635Congestive heart failure
HP:0001642Pulmonic stenosis
HP:0001643Patent ductus arteriosus
HP:0001655Patent foramen ovale
HP:0001667Right ventricular hypertrophy
HP:0001695Cardiac arrest
HP:0001909Leukemia
HP:0002092Pulmonary arterial hypertension
HP:0002105Hemoptysis
HP:0002870Obstructive sleep apnea
HP:0002875Exertional dyspnea
HP:0003493Antinuclear antibody positivity
HP:0003593Infantile onset
HP:0003596Middle age onset
HP:0003621Juvenile onset
HP:0003623Neonatal onset

GWAS associations

28 associations (top):

StudyTraitp-value
GCST000262_2Intracranial aneurysm1.000000e-10
GCST000262_4Intracranial aneurysm2.000000e-09
GCST000646_2Intracranial aneurysm5.000000e-07
GCST000646_3Intracranial aneurysm1.000000e-12
GCST002221_37Cholesterol, total5.000000e-11
GCST002222_9LDL cholesterol4.000000e-11
GCST004233_16LDL cholesterol levels8.000000e-11
GCST004235_41Total cholesterol levels7.000000e-10
GCST004748_134Lung cancer2.000000e-06
GCST004749_24Lung cancer in ever smokers9.000000e-07
GCST007228_2Pulmonary arterial hypertension2.000000e-12
GCST007228_3Pulmonary arterial hypertension5.000000e-15
GCST007445_3Factor VIII levels2.000000e-07
GCST007445_32Factor VIII levels2.000000e-09
GCST008077_14LDL cholesterol levels6.000000e-11
GCST008077_72LDL cholesterol levels1.000000e-08
GCST008078_6LDL cholesterol levels x alcohol consumption (regular vs non-regular drinkers) interaction (2df)1.000000e-18
GCST008078_73LDL cholesterol levels x alcohol consumption (regular vs non-regular drinkers) interaction (2df)1.000000e-17
GCST008079_12LDL cholesterol levels x alcohol consumption (drinkers vs non-drinkers) interaction (2df)3.000000e-20
GCST008079_143LDL cholesterol levels x alcohol consumption (drinkers vs non-drinkers) interaction (2df)8.000000e-20
GCST008086_53LDL cholesterol levels in current drinkers4.000000e-13
GCST008086_86LDL cholesterol levels in current drinkers2.000000e-13
GCST010243_14Apolipoprotein B levels1.000000e-13
GCST010245_113LDL cholesterol levels3.000000e-13
GCST011020_3Intracranial aneurysm1.000000e-14
GCST011021_4Intracranial aneurysm3.000000e-14
GCST011741_56LDL cholesterol levels in HIV infection7.000000e-06
GCST90013407_79Liver enzyme levels (gamma-glutamyl transferase)4.000000e-51

EFO canonical traits (6, from GWAS)

EFO IDTrait name
EFO:0004574total cholesterol measurement
EFO:0004611low density lipoprotein cholesterol measurement
EFO:0004630factor VIII measurement
EFO:0004329alcohol drinking
EFO:0004615apolipoprotein B measurement
EFO:0004532serum gamma-glutamyl transferase measurement

MeSH disease descriptors (3)

DescriptorNameTree numbers
D007676Kidney Failure, ChronicC12.050.351.968.419.780.750.500; C12.200.777.419.780.750.500; C12.950.419.780.750.500; C23.550.291.500.906.500
D000081029Pulmonary Arterial HypertensionC08.381.423.847
D014718Vesico-Ureteral RefluxC12.050.351.968.829.920; C12.200.777.829.920; C12.950.829.920

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL4523455 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

53 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases reaction, increases expression, decreases localization3
Tretinoinaffects expression, decreases localization, affects cotreatment, decreases reaction, increases expression3
Valproic Acidaffects expression, decreases expression, decreases localization3
dorsomorphindecreases localization, affects cotreatment, increases expression2
Diethylstilbestroldecreases localization, increases expression2
Fluorouracildecreases localization, decreases expression2
Aflatoxin B1decreases expression, decreases methylation2
chlorophacinonedecreases localization1
bisphenol Aincreases expression1
arseniteincreases methylation1
butyraldehydeincreases expression1
dinosebdecreases localization1
maleic acidincreases expression1
diniconazoledecreases localization1
columbaminedecreases expression1
flusilazoledecreases localization1
fluazinamdecreases localization1
ziprasidonedecreases localization1
CGP 52608affects binding, increases reaction1
hexaconazoledecreases localization1
vandetanibdecreases localization1
Chir 99021decreases reaction, increases expression, affects cotreatment1
abrineincreases expression1
licochalcone Bincreases expression1
Imatinib Mesylatedecreases localization1
Dasatinibdecreases localization1
Gefitinibdecreases localization1
Sorafenibdecreases localization1
Resveratroldecreases expression1
Decitabinedecreases methylation, increases expression1

ChEMBL screening assays

1 unique, capped per target: 1 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL4421281BindingInhibition of SOX17 (unknown origin) expressed in African green monkey COS7 cells assessed as reduction in SOX17 transcriptional activity atInhibitors of sox18 protein activity for treating angiogenesis- and/or lymphangiogenesis-related diseases

Cellosaurus cell lines

11 cell lines: 7 embryonic stem cell, 4 cancer cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_A6M3SEES3-1V human SOX17, clone1Embryonic stem cellMale
CVCL_A6M4SEES3-1V human SOX17, clone2Embryonic stem cellMale
CVCL_A6M5SEES3-1V human SOX17, clone3Embryonic stem cellMale
CVCL_B2H2Abcam HeLa SOX17 KOCancer cell lineFemale
CVCL_B8PYAbcam HCT 116 SOX17 KOCancer cell lineMale
CVCL_B9SEAbcam A-549 SOX17 KOCancer cell lineMale
CVCL_C3L6MSHRIe002-A-SOX17 control 1Embryonic stem cellMale
CVCL_C3L7MSHRIe002-A-SOX17 control 2Embryonic stem cellMale
CVCL_C3L8MSHRIe002-A-SOX17 overexpressing 1Embryonic stem cellMale
CVCL_C3L9MSHRIe002-A-SOX17 overexpressing 2Embryonic stem cellMale

Clinical trials (associated diseases)

300 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00058929PHASE4COMPLETEDA Transition Study From Flolan® to Remodulin® in Patients With Pulmonary Arterial Hypertension
NCT00303459PHASE4COMPLETEDEffects of the Combination of Bosentan and Sildenafil Versus Sildenafil Monotherapy on Pulmonary Arterial Hypertension (PAH)
NCT00323297PHASE4COMPLETEDAssess the Efficacy and Safety of Sildenafil When Added to Bosentan in the Treatment of Pulmonary Arterial Hypertension
NCT00367770PHASE4COMPLETEDBREATHE 5-OL: Tracleer (Bosentan) in Patients With Pulmonary Arterial Hypertension Related to Eisenmenger Physiology
NCT00403650PHASE4COMPLETEDInhaled Iloprost for Sarcoidosis-associated Pulmonary Hypertension
NCT00430716PHASE4TERMINATEDTo Assess The Efficacy and Safety Of Oral Sildenafil in the Treatment of Pulmonary Arterial Hypertension.
NCT00433329PHASE4COMPLETEDCombination Therapy in Pulmonary Arterial Hypertension
NCT00439946PHASE4TERMINATEDSafety, Efficacy, and Treatment Satisfaction Switching From Flolan to Remodulin Using the Crono Five Ambulatory Pump in Patients With PAH
NCT00483626PHASE4UNKNOWNHemodynamic Response After Six Months of Sildenafil
NCT00494533PHASE4TERMINATEDStudy of Intravenous Remodulin in Patients in India With Pulmonary Arterial Hypertension
NCT00617305PHASE4COMPLETEDStudy of Add-on Ambrisentan Therapy to Background Phosphodiesterase Type-5 Inhibitor (PDE5i) Therapy in Pulmonary Arterial Hypertension (ATHENA-1)
NCT00625079PHASE4WITHDRAWNPulmonary Hypertension Secondary to Idiopathic Pulmonary Fibrosis And Treatment With Sildenafil
NCT00625469PHASE4WITHDRAWNPulmonary Arterial Hypertension Secondary to Idiopathic Pulmonary Fibrosis and Treatment With Bosentan
NCT00705588PHASE4UNKNOWNLong Acting Phosphodiesterase 5 Inhibitors as Add-on Therapy for Patients With Pulmonary Hypertension Treated With Prostanoids.
NCT00741819PHASE4COMPLETEDSafety Evaluation of Inhaled Treprostinil Administration Following Transition From Inhaled Ventavis in Pulmonary Arterial Hypertension (PAH) Subjects
NCT01042158PHASE4COMPLETEDA Clinical Trial of Ambrisentan and Tadalafil in Pulmonary Arterial Hypertension Associated With Systemic Sclerosis
NCT01105091PHASE4COMPLETEDEpoprostenol for Injection in Pulmonary Arterial Hypertension
NCT01105117PHASE4COMPLETEDEpoprostenol for Injection in Pulmonary Arterial Hypertension - Extension of AC-066A401
NCT01268553PHASE4COMPLETEDTransition From Injectable Prostacyclin Medication to Inhaled Prostacyclin Medication
NCT01302444PHASE4TERMINATEDTreprostinil Combined With Tadalafil for Pulmonary Hypertension
NCT01330108PHASE4COMPLETEDSafely Change From Bosentan to Ambrisentan in Pulmonary Hypertension
NCT01433328PHASE4TERMINATEDLidocaine Subcutaneous Infusion for Control of Treprostinil Related Site Pain
NCT01508780PHASE4WITHDRAWNCombined Use of Angiography, Optical Coherence Tomography and Intravascular Ultrasound in Evaluation of Pulmonary Vascular Structure and Function in Patients With Pulmonary Arterial Hypertension Treated With Oral Bosentan
NCT01615627PHASE4WITHDRAWNHypotonic Treprostinil Subcutaneous Infusion for Control of Treprostinil Related Site Pain
NCT01642407PHASE4COMPLETEDSafety And Efficacy Of Sildenafil In Children With Pulmonary Arterial Hypertension
NCT01649739PHASE4UNKNOWNVardenafil as add-on Therapy for Patients With Pulmonary Hypertension Treated With Inhaled Iloprost
NCT02060487PHASE4TERMINATEDEffects of Oral Sildenafil on Mortality in Adults With PAH
NCT02253394PHASE4TERMINATEDThe Combination Ambrisentan Plus Spironolactone in Pulmonary Arterial Hypertension Study
NCT02284737PHASE4TERMINATEDA Study to Investigate the Efficacy of PADN to Improved Functional Capacity and Hemodynamics in Patients With PAH
NCT02310672PHASE4COMPLETEDREPAIR: Right vEntricular Remodeling in Pulmonary ArterIal hypeRtension
NCT02847260PHASE4COMPLETEDSafety and Tolerability of Rapid Dose Titration of Subcutaneous Remodulin® Therapy in PAH Subjects (RAPID)
NCT02882126PHASE4WITHDRAWNAn Open Label Extension Study to Evaluate the Safety of Continued Therapy of Subcutanous Remodulin® in Pulmonary Arterial Hypertension
NCT02885012PHASE4TERMINATEDCrossover Study From Macitentan or Bosentan Over to Ambrisentan
NCT02891850PHASE4COMPLETEDRiociguat rEplacing PDE-5i Therapy evaLuated Against Continued PDE-5i thErapy
NCT02893995PHASE4WITHDRAWNSafety, Tolerability, Pharmacokinetics and Efficacy of Two Different Rates of Subcutanous Remodulin® Dose Titration in Pulmonary Arterial Hypertension
NCT02968901PHASE4TERMINATEDClinical Study Evaluating the Effects of First-line Oral cOmbination theraPy of maciTentan and tadalafIl in Patients With Newly Diagnosed pulMonary Arterial Hypertension (OPTIMA)
NCT03055221PHASE4COMPLETEDTRUST-2: Safety and Efficacy of Intravenous Remodulin® in Patients in India With Pulmonary Arterial Hypertension (PAH)
NCT03078907PHASE4COMPLETEDEffect of Selexipag on Daily Life Physical Activity of Patients With Pulmonary Arterial Hypertension.
NCT03236818PHASE4UNKNOWNGoal Oriented Strategy to Preserve Ejection Fraction Trial
NCT03344159PHASE4COMPLETEDSpironolactone Therapy in Chronic Stable Right HF Trial