UBTFL1

gene
On this page

Summary

UBTFL1 (upstream binding transcription factor like 1, HGNC:14533) is a protein-coding gene on chromosome 11q14.3, encoding Upstream-binding factor 1-like protein 1 (P0CB47). Essential for proliferation of the inner cell mass and trophectodermal cells in peri-implantation development.

Predicted to enable RNA polymerase I core promoter sequence-specific DNA binding activity and RNA polymerase I general transcription initiation factor activity. Predicted to be involved in positive regulation of transcription by RNA polymerase I and transcription by RNA polymerase I. Predicted to act upstream of or within blastocyst growth; embryo implantation; and regulation of gene expression. Predicted to be located in cytoplasm and intracellular membrane-bounded organelle. Predicted to be active in nucleus.

Source: NCBI Gene 642623 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 44 total
  • MANE Select transcript: NM_001143975

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:14533
Approved symbolUBTFL1
Nameupstream binding transcription factor like 1
Location11q14.3
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000255009
Ensembl biotypeprotein_coding
OMIM613696
Entrez642623

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000530464

RefSeq mRNA: 1 — MANE Select: NM_001143975 NM_001143975

CCDS: CCDS44704

Canonical transcript exons

ENST00000530464 — 1 exons

ExonStartEnd
ENSE000021432429008595090087131

Expression profiles

Bgee: expression breadth tissue_specific, 5 present calls, max score 37.20.

Top tissues by expression

127 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
colonic epitheliumUBERON:000039737.20gold quality
ventricular zoneUBERON:000305336.48gold quality
cortical plateUBERON:000534336.47gold quality
bone marrow cellCL:000209236.16gold quality
hindlimb stylopod muscleUBERON:000425235.78silver quality
ganglionic eminenceUBERON:000402335.49gold quality
skeletal muscle tissueUBERON:000113433.38gold quality
bone marrowUBERON:000237131.74gold quality
muscle tissueUBERON:000238531.06gold quality
sural nerveUBERON:001548830.93gold quality
urinary bladderUBERON:000125530.28silver quality
stromal cell of endometriumCL:000225529.87gold quality
lymph nodeUBERON:000002929.42silver quality
prefrontal cortexUBERON:000045129.21gold quality
liverUBERON:000210728.59gold quality
pituitary glandUBERON:000000728.56silver quality
monocyteCL:000057628.29gold quality
leukocyteCL:000073828.23gold quality
duodenumUBERON:000211428.14gold quality
islet of LangerhansUBERON:000000626.55gold quality
bloodUBERON:000017826.44gold quality
vermiform appendixUBERON:000115426.42gold quality
gall bladderUBERON:000211025.98gold quality
olfactory segment of nasal mucosaUBERON:000538625.89gold quality
placentaUBERON:000198725.81gold quality
primary visual cortexUBERON:000243624.61gold quality
frontal cortexUBERON:000187024.26gold quality
pancreasUBERON:000126424.13gold quality
superior frontal gyrusUBERON:000266124.08gold quality
gastrocnemiusUBERON:000138823.71gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no1.64

Regulation

Is transcription factor: no

Literature-anchored findings (GeneRIF, showing 1)

  • Functionally characterizes an homologous mouse gene and product, and compares it to this human product. (PMID:19915186)

Cross-species orthologs

7 orthologs

OrganismSymbolGene ID
danio_rerioubtfENSDARG00000035066
danio_rerioubtflENSDARG00000038780
mus_musculusUbtfl1ENSMUSG00000074502
rattus_norvegicusUbtfl1ENSRNOG00000079909
rattus_norvegicusENSRNOG00000084112
caenorhabditis_eleganshmg-3WBGENE00001973
caenorhabditis_elegansWBGENE00001974

Paralogs (20): HMGB3 (ENSG00000029993), HMG20B (ENSG00000064961), SP100 (ENSG00000067066), SMARCE1 (ENSG00000073584), SP140 (ENSG00000079263), TOX4 (ENSG00000092203), HMGXB4 (ENSG00000100281), TOX3 (ENSG00000103460), TFAM (ENSG00000108064), UBTF (ENSG00000108312), HMGB1P1 (ENSG00000124097), TOX2 (ENSG00000124191), SP110 (ENSG00000135899), HMG20A (ENSG00000140382), SSRP1 (ENSG00000149136), HMGB2 (ENSG00000164104), HMGB4 (ENSG00000176256), SP140L (ENSG00000185404), HMGB1 (ENSG00000189403), TOX (ENSG00000198846)

Protein

Protein identifiers

Upstream-binding factor 1-like protein 1P0CB47 (reviewed: P0CB47)

All UniProt accessions (1): P0CB47

UniProt curated annotations — full annotation on UniProt →

Function. Essential for proliferation of the inner cell mass and trophectodermal cells in peri-implantation development.

Subcellular location. Cytoplasm. Nucleus.

RefSeq proteins (1): NP_001137447* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR009071HMG_box_domDomain
IPR036910HMG_box_dom_sfHomologous_superfamily
IPR051762UBF1Family

Pfam: PF00505

UniProt features (5 total): DNA-binding region 2, chain 1, region of interest 1, compositionally biased region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-P0CB47-F171.730.29

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 42 (showing top): GOBP_EMBRYO_DEVELOPMENT_ENDING_IN_BIRTH_OR_EGG_HATCHING, GOBP_GROWTH, GOBP_IN_UTERO_EMBRYONIC_DEVELOPMENT, GOBP_BLASTOCYST_DEVELOPMENT, GOBP_DNA_TEMPLATED_TRANSCRIPTION_INITIATION, GOBP_MULTI_MULTICELLULAR_ORGANISM_PROCESS, GOBP_EMBRYO_DEVELOPMENT, GOBP_BLASTOCYST_GROWTH, GOBP_PROTEIN_DNA_COMPLEX_ORGANIZATION, GOBP_DEVELOPMENTAL_GROWTH, GOMF_SEQUENCE_SPECIFIC_DNA_BINDING, GOMF_CORE_PROMOTER_SEQUENCE_SPECIFIC_DNA_BINDING, GOBP_RNA_POLYMERASE_I_PREINITIATION_COMPLEX_ASSEMBLY, GOBP_REGULATION_OF_TRANSCRIPTION_BY_RNA_POLYMERASE_I, GOBP_TRANSCRIPTION_BY_RNA_POLYMERASE_I

GO Biological Process (6): blastocyst growth (GO:0001832), transcription by RNA polymerase I (GO:0006360), embryo implantation (GO:0007566), positive regulation of transcription by RNA polymerase I (GO:0045943), RNA polymerase I preinitiation complex assembly (GO:0001188), regulation of gene expression (GO:0010468)

GO Molecular Function (4): RNA polymerase I core promoter sequence-specific DNA binding (GO:0001164), RNA polymerase I general transcription initiation factor activity (GO:0001181), DNA binding (GO:0003677), protein binding (GO:0005515)

GO Cellular Component (2): nucleus (GO:0005634), cytoplasm (GO:0005737)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
transcription by RNA polymerase I2
blastocyst development1
developmental growth1
DNA-templated transcription1
multicellular organism development1
female pregnancy1
reproductive process1
regulation of transcription by RNA polymerase I1
positive regulation of DNA-templated transcription1
transcription initiation at RNA polymerase I promoter1
transcription preinitiation complex assembly1
gene expression1
regulation of macromolecule biosynthetic process1
core promoter sequence-specific DNA binding1
RNA polymerase I transcription regulatory region sequence-specific DNA binding1
RNA polymerase I preinitiation complex assembly1
general transcription initiation factor activity1
nucleic acid binding1
binding1
intracellular membrane-bounded organelle1
intracellular anatomical structure1
cellular anatomical structure1

Protein interactions and networks

STRING

844 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
UBTFL1BDP1A6H8Y1761
UBTFL1ZSCAN4Q8NAM6624
UBTFL1CDX2Q99626617
UBTFL1NANOGQ9H9S0578
UBTFL1POU5F1P31359556
UBTFL1SERBP1Q8NC51500
UBTFL1SP140Q13342492
UBTFL1OR2T2Q6IF00445
UBTFL1NCOR1O75376431
UBTFL1ZNF736B4DX44420
UBTFL1OR4C3Q8NH37400
UBTFL1ZSCAN5CA6NGD5372
UBTFL1MSX2P35548353
UBTFL1OR1E2P47887353
UBTFL1DISP1Q96F81353
UBTFL1IL17RCQ8NAC3353
UBTFL1TCF7P36402353
UBTFL1TACC2O95359353

IntAct

85 interactions, top by confidence:

ABTypeScore
UBTFL1CDR2psi-mi:“MI:0915”(physical association)0.560
UBTFL1NACC1psi-mi:“MI:0915”(physical association)0.560
ZBTB7BUBTFL1psi-mi:“MI:0915”(physical association)0.560
UBTFL1GOLGA2psi-mi:“MI:0915”(physical association)0.560
UBTFL1psi-mi:“MI:0915”(physical association)0.560
UBTFL1RUNDC3Apsi-mi:“MI:0915”(physical association)0.560
UBTFL1MTUS2psi-mi:“MI:0915”(physical association)0.560
UBTFL1STX1Apsi-mi:“MI:0915”(physical association)0.560
UBTFL1LBX1psi-mi:“MI:0915”(physical association)0.560
UBTFL1PUF60psi-mi:“MI:0915”(physical association)0.560
UBTFL1LHX9psi-mi:“MI:0915”(physical association)0.560
UBTFL1LHX3psi-mi:“MI:0915”(physical association)0.560
UBTFL1TFIP11psi-mi:“MI:0915”(physical association)0.560
UBTFL1RABEP1psi-mi:“MI:0915”(physical association)0.560
UBTFL1BEND3psi-mi:“MI:0915”(physical association)0.560
UBTFL1TAX1BP1psi-mi:“MI:0915”(physical association)0.560
UBTFL1LHX2psi-mi:“MI:0915”(physical association)0.560
GOLGA6L9UBTFL1psi-mi:“MI:0915”(physical association)0.560
LDOC1UBTFL1psi-mi:“MI:0915”(physical association)0.560
UBTFL1OR4F3psi-mi:“MI:0915”(physical association)0.560
UBTFL1OR5AS1psi-mi:“MI:0915”(physical association)0.560
UBTFL1DDIT4Lpsi-mi:“MI:0915”(physical association)0.560
UBTFL1C3orf36psi-mi:“MI:0915”(physical association)0.560
UBTFL1ZNF558psi-mi:“MI:0915”(physical association)0.560
UBTFL1PBX4psi-mi:“MI:0915”(physical association)0.560
FSD2UBTFL1psi-mi:“MI:0915”(physical association)0.560
NOTOUBTFL1psi-mi:“MI:0915”(physical association)0.560
UBTFL1CDR2psi-mi:“MI:0915”(physical association)0.000
UBTFL1NACC1psi-mi:“MI:0915”(physical association)0.000

BioGRID (30): UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid), UBTFL1 (Two-hybrid)

ESM2 similar proteins: A2CG63, C0SUW7, F4JQZ3, F8VPQ2, I1HNB2, O49595, O49597, O64702, O70523, O75400, O96028, P0CB47, P0CB48, P17480, P25976, P25977, P25979, P25980, P26585, P29374, P40620, P40630, P51115, P93831, Q02395, Q0WNR6, Q3USZ2, Q42344, Q5BJ56, Q5R7T9, Q5VN06, Q5ZKF4, Q61584, Q6AZF8, Q6DIJ5, Q6INA9, Q8BVE8, Q8LA53, Q8LDF9, Q8LPQ7

Diamond homologs: P0CB47, P0CB48, P17480, P25976, P25977, P25979, P25980, P40626, Q0II87, Q3USZ2, Q5D144, P11873, P40625, Q02486, Q32L68, Q4IQX3, Q5ZKF4, Q7S045, Q9P0W2, Q9Z104, B2RPK0, B3DLD3, C7U331, O04235, O49596, O54879, O60248, O64702, P07156, P07746, P0CY16, P0CY17, P11633, P26585, P33417, P35713, P40619, P40620, P40621, P40622

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

44 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance40
Likely benign4
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

178 predictions. Top by Δscore:

VariantEffectΔscore
11:90087034:T:TAacceptor_gain0.8800
11:90087035:G:Aacceptor_gain0.6300
11:90087034:T:Gacceptor_gain0.6200
11:90087033:AT:Aacceptor_gain0.5900
11:90086661:TC:Tdonor_gain0.5700
11:90086589:A:Tdonor_gain0.5500
11:90087042:GAAGA:Gacceptor_gain0.5400
11:90087041:A:AGacceptor_gain0.5300
11:90087042:G:GGacceptor_gain0.5300
11:90087072:G:GTdonor_gain0.5300
11:90087042:GAA:Gacceptor_gain0.5200
11:90086766:GC:Gdonor_gain0.5100
11:90087033:A:AGacceptor_gain0.5100
11:90087033:ATG:Aacceptor_gain0.5100
11:90086454:G:Tdonor_gain0.4900
11:90086585:CAAGA:Cacceptor_gain0.4900
11:90086586:AAGAA:Aacceptor_gain0.4900
11:90087026:T:Gacceptor_gain0.4900
11:90087021:T:Aacceptor_gain0.4800
11:90087038:A:AGacceptor_gain0.4800
11:90087019:A:AGacceptor_gain0.4700
11:90086587:AGAAG:Adonor_loss0.4600
11:90086589:AAGGT:Adonor_loss0.4600
11:90086590:AG:Adonor_loss0.4600
11:90086591:GGTAA:Gdonor_loss0.4600
11:90086592:G:Adonor_loss0.4600
11:90086593:T:Gdonor_loss0.4600
11:90086867:GG:Gdonor_gain0.4500
11:90086868:GG:Gdonor_gain0.4500
11:90087018:AACT:Adonor_gain0.4500

AlphaMissense

2645 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
11:90086643:T:CF232L0.979
11:90086645:T:AF232L0.979
11:90086645:T:GF232L0.979
11:90086808:T:AW287R0.976
11:90086808:T:CW287R0.976
11:90086277:T:CF110L0.975
11:90086279:C:AF110L0.975
11:90086279:C:GF110L0.975
11:90086721:T:AW258R0.972
11:90086721:T:CW258R0.972
11:90086723:G:CW258C0.969
11:90086723:G:TW258C0.969
11:90086810:G:CW287C0.969
11:90086810:G:TW287C0.969
11:90086091:T:CF48L0.964
11:90086093:T:AF48L0.964
11:90086093:T:GF48L0.964
11:90086747:G:CK266N0.962
11:90086747:G:TK266N0.962
11:90086809:G:CW287S0.962
11:90086698:G:CR250P0.961
11:90086722:G:CW258S0.958
11:90086127:T:AW60R0.956
11:90086127:T:CW60R0.956
11:90086275:G:CR109P0.955
11:90086644:T:CF232S0.955
11:90086787:T:GY280D0.950
11:90086610:C:TP221S0.948
11:90086387:A:CK146N0.947
11:90086387:A:TK146N0.947

dbSNP variants (sampled 300 via entrez): RS1007646169 (11:90084137 C>A,T), RS1008031751 (11:90084719 G>C,T), RS1010899223 (11:90084809 A>C), RS1011347567 (11:90084315 C>T), RS1012745638 (11:90085460 G>T), RS1013254966 (11:90086468 T>C,G), RS1017990284 (11:90084187 C>T), RS1019029543 (11:90086985 G>A,C,T), RS1021003577 (11:90084820 A>C), RS1021538880 (11:90084316 T>C), RS1022766124 (11:90086632 G>C), RS1023261706 (11:90085498 A>C), RS1025909437 (11:90085734 G>T), RS1026004561 (11:90086943 G>C), RS1035492028 (11:90084919 T>C)

Disease associations

OMIM: gene MIM:613696 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST012020_223Serum metabolite levels2.000000e-29
GCST012020_224Serum metabolite levels2.000000e-47

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

1 total (human), top 1 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyrenedecreases methylation1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.