SP140L

gene
On this page

Summary

SP140L (SP140 like nuclear body protein, HGNC:25105) is a protein-coding gene on chromosome 2q37.1, encoding Nuclear body protein SP140-like protein (Q9H930).

Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific. Predicted to be involved in regulation of transcription by RNA polymerase II. Predicted to be active in nucleus.

Source: NCBI Gene 93349 — RefSeq curated summary.

At a glance

  • GWAS associations: 4
  • Clinical variants (ClinVar): 127 total
  • Druggable target: yes
  • MANE Select transcript: NM_138402

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:25105
Approved symbolSP140L
NameSP140 like nuclear body protein
Location2q37.1
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000185404
Ensembl biotypeprotein_coding
OMIM617747
Entrez93349

Gene structure

Transcript identifiers

Ensembl transcripts: 22 — 17 protein_coding, 5 retained_intron

ENST00000243810, ENST00000396563, ENST00000415673, ENST00000444636, ENST00000458341, ENST00000462156, ENST00000466656, ENST00000483728, ENST00000496870, ENST00000497212, ENST00000909865, ENST00000909866, ENST00000909867, ENST00000909868, ENST00000909869, ENST00000909870, ENST00000909871, ENST00000909872, ENST00000909873, ENST00000955822, ENST00000955823, ENST00000955824

RefSeq mRNA: 6 — MANE Select: NM_138402 NM_001308162, NM_001308163, NM_001352892, NM_001352893, NM_001352894, NM_138402

CCDS: CCDS46538, CCDS77536, CCDS77537

Canonical transcript exons

ENST00000415673 — 19 exons

ExonStartEnd
ENSE00000965539230361614230361697
ENSE00001342482230370908230370967
ENSE00001766396230402798230403732
ENSE00002432872230401366230401443
ENSE00002496600230400955230401063
ENSE00002505069230328757230328831
ENSE00002507201230358964230359132
ENSE00003474582230392087230392229
ENSE00003564781230396757230396798
ENSE00003568115230388559230388633
ENSE00003591195230385224230385304
ENSE00003616453230400127230400242
ENSE00003617548230393414230393461
ENSE00003624764230383510230383575
ENSE00003625944230371598230371651
ENSE00003644214230389919230390023
ENSE00003654119230401664230401807
ENSE00003660332230357805230357967
ENSE00003905746230327193230327301

Expression profiles

Bgee: expression breadth ubiquitous, 242 present calls, max score 94.36.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 10.0890 / max 144.4441, expressed in 1518 samples.

FANTOM5 promoters (2 alternative TSS)

Promoter IDTPM avgSamples expressed
258139.49651513
258140.5925303

Top tissues by expression

278 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
granulocyteCL:000009494.36gold quality
spleenUBERON:000210693.76gold quality
small intestine Peyer’s patchUBERON:000345492.36gold quality
lymph nodeUBERON:000002992.24gold quality
vermiform appendixUBERON:000115492.13gold quality
monocyteCL:000057692.09gold quality
mononuclear cellCL:000084291.82gold quality
leukocyteCL:000073891.72gold quality
buccal mucosa cellCL:000233691.47gold quality
rectumUBERON:000105290.77gold quality
bloodUBERON:000017890.06gold quality
gall bladderUBERON:000211090.00gold quality
upper lobe of left lungUBERON:000895289.73gold quality
right lobe of liverUBERON:000111489.64gold quality
small intestineUBERON:000210889.54gold quality
right lungUBERON:000216789.31gold quality
sural nerveUBERON:001548889.11gold quality
right uterine tubeUBERON:000130288.89gold quality
right adrenal glandUBERON:000123388.83gold quality
upper lobe of lungUBERON:000894888.72gold quality
right ovaryUBERON:000211888.63gold quality
right adrenal gland cortexUBERON:003582788.51gold quality
minor salivary glandUBERON:000183088.26gold quality
left ovaryUBERON:000211988.24gold quality
left adrenal glandUBERON:000123488.17gold quality
adrenal tissueUBERON:001830387.83gold quality
tibial nerveUBERON:000132387.52gold quality
tonsilUBERON:000237287.50gold quality
left adrenal gland cortexUBERON:003582587.47gold quality
transverse colonUBERON:000115787.19gold quality

Single-cell (SCXA)

Detected in 4 experiment(s), a significant marker in 2.

ExperimentMarker?Max mean expression
E-MTAB-7316yes34.41
E-ANND-3yes12.71
E-MTAB-6386no363.30
E-GEOD-124858no136.35

Regulation

Is transcription factor: yes

JASPAR motifs

MotifNameFamily
MA2540.1SP140LSp140-Sp100

JASPAR matrix evidence (PMIDs): PMID:39605320

miRNA regulators (miRDB)

44 targeting SP140L, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-5692B100.0071.322622
HSA-MIR-5692C100.0071.322622
HSA-MIR-1277-5P100.0073.955056
HSA-MIR-371B-5P99.9975.344759
HSA-MIR-428299.9975.366408
HSA-MIR-373-5P99.9875.364753
HSA-MIR-616-5P99.9875.584775
HSA-MIR-314899.9775.066478
HSA-MIR-449399.9066.48977
HSA-MIR-6780A-5P99.8866.692776
HSA-MIR-1273H-5P99.7766.322471
HSA-MIR-30B-3P99.7065.762325
HSA-MIR-3689A-3P99.7065.732306
HSA-MIR-3689B-3P99.7065.712311
HSA-MIR-3689C99.7065.712311
HSA-MIR-6779-5P99.7065.762363
HSA-MIR-10394-5P99.6566.831852
HSA-MIR-120599.6566.761826
HSA-MIR-3942-3P99.5769.032854
HSA-MIR-766-5P99.4767.912225
HSA-MIR-569599.4167.481047
HSA-MIR-751599.3168.221795
HSA-MIR-6731-5P99.2867.422375
HSA-MIR-808599.2867.562362
HSA-MIR-569399.2466.671106
HSA-MIR-877-3P99.0968.101637
HSA-MIR-887-5P98.8265.901347
HSA-MIR-7113-3P98.7565.711120
HSA-MIR-797798.6566.182590

Literature-anchored findings (GeneRIF, showing 1)

  • our results show that SP140L is phylogenetically recent member of SP100 proteins and acts as an autoantigen in primary biliary cirrhosis patients. (PMID:26347895)

Cross-species orthologs

9 orthologs

OrganismSymbolGene ID
danio_rerioubtfENSDARG00000035066
danio_rerioubtflENSDARG00000038780
rattus_norvegicusENSRNOG00000082639
drosophila_melanogasterHmgDFBGN0004362
drosophila_melanogasterHmgZFBGN0010228
drosophila_melanogasterSsrpFBGN0010278
drosophila_melanogasterCG4617FBGN0029936
caenorhabditis_eleganshmg-3WBGENE00001973
caenorhabditis_elegansWBGENE00001974

Paralogs (20): HMGB3 (ENSG00000029993), HMG20B (ENSG00000064961), SP100 (ENSG00000067066), SMARCE1 (ENSG00000073584), SP140 (ENSG00000079263), TOX4 (ENSG00000092203), HMGXB4 (ENSG00000100281), TOX3 (ENSG00000103460), TFAM (ENSG00000108064), UBTF (ENSG00000108312), HMGB1P1 (ENSG00000124097), TOX2 (ENSG00000124191), SP110 (ENSG00000135899), HMG20A (ENSG00000140382), SSRP1 (ENSG00000149136), HMGB2 (ENSG00000164104), HMGB4 (ENSG00000176256), HMGB1 (ENSG00000189403), TOX (ENSG00000198846), UBTFL1 (ENSG00000255009)

Protein

Protein identifiers

Nuclear body protein SP140-like proteinQ9H930 (reviewed: Q9H930)

All UniProt accessions (3): Q9H930, H7BYP4, U5Y3L1

UniProt curated annotations — full annotation on UniProt →

Isoforms (4)

UniProt IDNamesCanonical?
Q9H930-44yes
Q9H930-11
Q9H930-22
Q9H930-33

RefSeq proteins (6): NP_001295091, NP_001295092, NP_001339821, NP_001339822, NP_001339823, NP_612411* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000770SAND_domDomain
IPR001487BromodomainDomain
IPR001965Znf_PHDDomain
IPR004865HSR_domDomain
IPR010919SAND-like_dom_sfHomologous_superfamily
IPR011011Znf_FYVE_PHDHomologous_superfamily
IPR013083Znf_RING/FYVE/PHDHomologous_superfamily
IPR019786Zinc_finger_PHD-type_CSConserved_site
IPR019787Znf_PHD-fingerDomain
IPR030411Sp140_BromoDomain
IPR036427Bromodomain-like_sfHomologous_superfamily
IPR043563Sp110/Sp140/Sp140L-likeFamily

Pfam: PF00439, PF00628, PF01342, PF03172

UniProt features (22 total): splice variant 5, compositionally biased region 4, domain 3, sequence variant 3, cross-link 2, chain 1, modified residue 1, sequence conflict 1, zinc finger region 1, region of interest 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9H930-F168.470.31

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Post-translational modifications (3): 180, 169, 292

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 93 (showing top): RODRIGUES_THYROID_CARCINOMA_ANAPLASTIC_UP, MODULE_255, GOZGIT_ESR1_TARGETS_DN, MODULE_317, KOYAMA_SEMA3B_TARGETS_UP, AGCTCCT_MIR28, MODULE_69, GOCC_NUCLEOLUS, RUTELLA_RESPONSE_TO_HGF_UP, RUTELLA_RESPONSE_TO_HGF_VS_CSF2RB_AND_IL4_UP, MODULE_37, DCA_UP.V1_DN, SIRNA_EIF4GI_DN, GOMF_TRANSCRIPTION_REGULATOR_ACTIVITY, MIR3942_3P

GO Biological Process (2): regulation of transcription by RNA polymerase II (GO:0006357), defense response (GO:0006952)

GO Molecular Function (5): DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), DNA binding (GO:0003677), zinc ion binding (GO:0008270), protein binding (GO:0005515), metal ion binding (GO:0046872)

GO Cellular Component (4): nucleus (GO:0005634), nucleolus (GO:0005730), cytoplasm (GO:0005737), nuclear lumen (GO:0031981)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
regulation of DNA-templated transcription1
transcription by RNA polymerase II1
response to stress1
chromatin1
RNA polymerase II transcription regulatory region sequence-specific DNA binding1
DNA-binding transcription factor activity1
regulation of transcription by RNA polymerase II1
nucleic acid binding1
transition metal ion binding1
binding1
cation binding1
intracellular membrane-bounded organelle1
nuclear lumen1
intracellular membraneless organelle1
intracellular anatomical structure1
cellular anatomical structure1
nucleus1
intracellular organelle lumen1

Protein interactions and networks

STRING

548 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
SP140LBRDTQ58F21555
SP140LPMLP29590529
SP140LSUMO1P55856510
SP140LSUMO2P55855501
SP140LATRXP46100469
SP140LBRWD1Q9NSI6438
SP140LBRWD3Q6RI45421
SP140LZMYND8Q9ULU4420
SP140LZMYND11Q15326403
SP140LZNF266Q14584391
SP140LSP140Q13342388
SP140LTAF1LQ8IZX4382
SP140LATAD2BQ9ULI0376
SP140LCECR2Q9BXF3373
SP140LBRPF3Q9ULD4372

IntAct

2 interactions, top by confidence:

ABTypeScore
CHCHD2ZNF593psi-mi:“MI:0914”(association)0.350

BioGRID (12): SP140L (Affinity Capture-MS), SP140L (Affinity Capture-MS), SP140L (Affinity Capture-MS), SP140L (Affinity Capture-MS), SP140L (Affinity Capture-MS), SP140L (Affinity Capture-MS), SP140L (Affinity Capture-MS), SP140L (Affinity Capture-MS), SP140L (Affinity Capture-MS), SP140L (Protein-peptide), SP140L (Reconstituted Complex), APP (Reconstituted Complex)

ESM2 similar proteins: A0A494C1R9, A2AKB4, A2APT9, A6NKD2, A8MT33, B0BN44, E9PGG2, F5GYI3, O19110, O88852, P0CV98, P0CV99, P0CW00, P0CW01, Q01534, Q03386, Q0P5N2, Q12967, Q14684, Q2M329, Q3U3N0, Q5F267, Q5I0E2, Q5R5G8, Q5R866, Q5SYB0, Q5VTJ3, Q60953, Q69ZB3, Q6ZUX3, Q7TQI8, Q80VJ8, Q80VR2, Q86VY4, Q8BSI6, Q8IZJ4, Q8N831, Q8VD63, Q95LS7, Q96FG2

Diamond homologs: A0A7U2QYM2, A2A8L1, A2BIL7, B2RWS6, D3ZD32, D4A7T3, E9Q2Z1, F1QW93, F1R5H6, F4IXE7, F4KBP5, F7DRV9, G5EBZ4, O15016, O16102, O43918, O60885, O74964, O88379, O88491, O96028, O97159, P13709, P25440, P35817, P45481, Q07442, Q08D75, Q09472, Q12830, Q12873, Q13342, Q14839, Q15059, Q22516, Q32S26, Q338B9, Q4R8Y1, Q54UW4, Q58F21

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

127 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance83
Likely benign10
Benign1

Top pathogenic / likely-pathogenic (0)

SpliceAI

3035 predictions. Top by Δscore:

VariantEffectΔscore
2:230357803:A:AGacceptor_gain1.0000
2:230357804:G:GGacceptor_gain1.0000
2:230357804:GGCT:Gacceptor_gain1.0000
2:230357962:TTTG:Tdonor_gain1.0000
2:230357965:GAA:Gdonor_gain1.0000
2:230357965:GAAGT:Gdonor_loss1.0000
2:230357966:AA:Adonor_gain1.0000
2:230357967:AG:Adonor_loss1.0000
2:230357968:G:GGdonor_gain1.0000
2:230357968:GTA:Gdonor_loss1.0000
2:230357969:T:Adonor_loss1.0000
2:230358951:A:AGacceptor_gain1.0000
2:230358952:A:Gacceptor_gain1.0000
2:230358960:AAAG:Aacceptor_gain1.0000
2:230359090:GAA:Gdonor_gain1.0000
2:230359091:A:Tdonor_gain1.0000
2:230371596:A:AGacceptor_gain1.0000
2:230371597:G:GGacceptor_gain1.0000
2:230371649:G:GTdonor_gain1.0000
2:230371649:GAA:Gdonor_gain1.0000
2:230371652:G:GGdonor_gain1.0000
2:230383500:T:TAacceptor_gain1.0000
2:230383508:A:ACacceptor_loss1.0000
2:230385222:A:AGacceptor_gain1.0000
2:230385223:G:GGacceptor_gain1.0000
2:230385302:GGG:Gdonor_gain1.0000
2:230385303:GG:Gdonor_gain1.0000
2:230385303:GGG:Gdonor_gain1.0000
2:230385304:GG:Gdonor_gain1.0000
2:230385304:GGTA:Gdonor_loss1.0000

AlphaMissense

3891 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:230401781:T:CF540L0.993
2:230401783:C:AF540L0.993
2:230401783:C:GF540L0.993
2:230357896:G:CA67P0.992
2:230392179:T:AW353R0.992
2:230392179:T:CW353R0.992
2:230400202:T:CF425L0.992
2:230400204:C:AF425L0.992
2:230400204:C:GF425L0.992
2:230401704:T:CL514S0.992
2:230359103:T:CL137S0.991
2:230392143:T:CF341L0.991
2:230392145:T:AF341L0.991
2:230392145:T:GF341L0.991
2:230400145:T:CC406R0.991
2:230359022:T:CL110P0.990
2:230400145:T:AC406S0.990
2:230400146:G:CC406S0.990
2:230401417:T:CF492L0.990
2:230401419:T:AF492L0.990
2:230401419:T:GF492L0.990
2:230357930:T:CL78P0.989
2:230400154:T:AC409S0.989
2:230400155:G:CC409S0.989
2:230401758:T:CF532S0.989
2:230357918:T:CF74S0.988
2:230400214:T:CC429R0.988
2:230401773:G:CR537P0.988
2:230357933:G:CR79P0.987
2:230402840:T:CF563L0.987

dbSNP variants (sampled 300 via entrez): RS1000015595 (2:230358164 A>G), RS1000089777 (2:230327969 A>G), RS1000141924 (2:230383327 G>A,C,T), RS1000155154 (2:230347452 A>C,G), RS1000191555 (2:230342515 G>A,C,T), RS1000213520 (2:230359890 A>C), RS1000224174 (2:230352753 T>C), RS1000257108 (2:230366966 A>C,G), RS1000297950 (2:230352453 A>C,T), RS1000316278 (2:230366760 G>T), RS1000347909 (2:230332965 C>T), RS1000387707 (2:230346174 T>G), RS1000412422 (2:230387711 T>C,G), RS1000421757 (2:230385007 G>A), RS1000476232 (2:230358478 G>T)

Disease associations

OMIM: gene MIM:617747 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

4 associations (top):

StudyTraitp-value
GCST002135_1Nicotine use5.000000e-06
GCST003542_13Night sleep phenotypes8.000000e-06
GCST005860_3Cholangiocarcinoma in primary sclerosing cholangitis (time to event)1.000000e-06
GCST90002389_8Lymphocyte percentage of white cells7.000000e-09

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0005430nicotine use
EFO:0007993lymphocyte percentage of leukocytes

Drugs & pharmacology

Drug and pharmacology data

Is drug target: yes

ChEMBL targets (1): CHEMBL4105997 (SINGLE PROTEIN)

PharmGKB: 1 entry (VIP=true, CPIC=false)

GtoPdb / IUPHAR curated pharmacology

(IUPHAR/BPS Guide to Pharmacology — expert-curated)

Target class: other protein — Non-enzymatic BRD containing proteins

CTD chemical–gene interactions

38 total (human), top 30 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, increases expression, affects expression5
trichostatin Aaffects cotreatment, increases expression3
sodium arseniteincreases abundance, increases expression2
entinostataffects cotreatment, increases expression2
belinostatincreases expression, affects cotreatment2
Panobinostataffects cotreatment, increases expression2
Acetaminophendecreases expression, increases expression2
Vehicle Emissionsdecreases expression, decreases reaction, increases abundance, increases expression2
Calcitriolincreases expression, affects cotreatment2
Nickelincreases expression2
Particulate Matterincreases abundance, increases expression, decreases expression, decreases reaction2
TAK-243increases sumoylation1
triphenyl phosphateaffects expression1
bisphenol Aaffects binding, increases reaction1
geraniolincreases expression1
di-n-butylphosphoric acidaffects expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideincreases expression, affects cotreatment1
abrineincreases expression1
2-methyl-2H-pyrazole-3-carboxylic acid (2-methyl-4-o-tolylazophenyl)amidedecreases expression, decreases reaction1
dorsomorphinaffects cotreatment, increases expression1
Vorinostataffects cotreatment, increases expression1
Arsenicincreases abundance, increases expression1
Demecolcineincreases expression1
Diethylhexyl Phthalateincreases expression1
Ethyl Methanesulfonateincreases expression1
Formaldehydeincreases expression1
Methyl Methanesulfonateincreases expression1
Potassium Dichromatedecreases expression1
Silicon Dioxideincreases expression1
Smokedecreases expression1

ChEMBL screening assays

4 unique, capped per target: 4 binding

Representative assays (with source publication via chembl_document):

Assay IDTypeDescriptionSource paper
CHEMBL4051569BindingInhibition of SP140L (unknown origin) assessed as change in melting temperature at 10 uM by SYPRO Orange-dye based fluorescence thermal shift assayBenzoisoquinolinediones as Potent and Selective Inhibitors of BRPF2 and TAF1/TAF1L Bromodomains. — J Med Chem

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): cholangiocarcinoma