THSD7B

gene
On this page

Also known as KIAA1679

Summary

THSD7B (thrombospondin type 1 domain containing 7B, HGNC:29348) is a protein-coding gene on chromosome 2q22.1, encoding Thrombospondin type-1 domain-containing protein 7B (Q9C0I4).

Predicted to be involved in actin cytoskeleton organization. Predicted to be located in membrane. Predicted to be active in plasma membrane.

Source: NCBI Gene 80731 — RefSeq curated summary.

At a glance

  • GWAS associations: 26
  • Clinical variants (ClinVar): 267 total
  • MANE Select transcript: NM_001316349

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:29348
Approved symbolTHSD7B
Namethrombospondin type 1 domain containing 7B
Location2q22.1
Locus typegene with protein product
StatusApproved
AliasesKIAA1679
Ensembl geneENSG00000144229
Ensembl biotypeprotein_coding
Entrez80731

Gene structure

Transcript identifiers

Ensembl transcripts: 4 — 1 protein_coding, 1 nonsense_mediated_decay, 1 retained_intron, 1 protein_coding_CDS_not_defined

ENST00000409968, ENST00000472720, ENST00000480352, ENST00000485379

RefSeq mRNA: 1 — MANE Select: NM_001316349 NM_001316349

CCDS: CCDS82515

Canonical transcript exons

ENST00000409968 — 28 exons

ExonStartEnd
ENSE00000964126137056420137057230
ENSE00000995382137563221137563354
ENSE00001071669137450845137451023
ENSE00001586805136765545136765687
ENSE00001593822137657065137657160
ENSE00001610289137616175137616316
ENSE00001622385137170741137170938
ENSE00001641936137618392137618507
ENSE00001644614137656796137656969
ENSE00001655958137115124137115293
ENSE00001673333137411609137411872
ENSE00001673443137231044137231235
ENSE00001695416137572406137572556
ENSE00001724940137272533137272662
ENSE00001727792137275923137276026
ENSE00001736345137620609137620726
ENSE00001763099137242457137242572
ENSE00001763906137663383137663575
ENSE00001771982137655501137655660
ENSE00001781228137232899137233133
ENSE00001787956137642488137642633
ENSE00001798035137405613137405807
ENSE00001798124137667774137667861
ENSE00002303675137094873137095121
ENSE00003419666136882144136882317
ENSE00003558062137659664137659746
ENSE00003613799137160213137160368
ENSE00003841995137676524137677718

Expression profiles

Bgee: expression breadth ubiquitous, 151 present calls, max score 79.65.

FANTOM5 (CAGE): breadth broad, TPM avg 0.6758 / max 186.5793, expressed in 235 samples.

FANTOM5 promoters (4 alternative TSS)

Promoter IDTPM avgSamples expressed
227060.4712187
227080.094035
227050.055621
227070.055019

Top tissues by expression

239 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047379.65gold quality
pigmented layer of retinaUBERON:000178269.62gold quality
cortical plateUBERON:000534369.62gold quality
oviduct epitheliumUBERON:000480469.53gold quality
descending thoracic aortaUBERON:000234566.48gold quality
thoracic aortaUBERON:000151564.82gold quality
ascending aortaUBERON:000149664.62gold quality
tibiaUBERON:000097964.20gold quality
ganglionic eminenceUBERON:000402363.36gold quality
calcaneal tendonUBERON:000370163.25gold quality
visceral pleuraUBERON:000240163.07gold quality
ventricular zoneUBERON:000305363.01gold quality
parietal pleuraUBERON:000240062.56gold quality
adipose tissueUBERON:000101362.08gold quality
buccal mucosa cellCL:000233661.69silver quality
sural nerveUBERON:001548861.49silver quality
adipose tissue of abdominal regionUBERON:000780860.95gold quality
omental fat padUBERON:001041460.77gold quality
peritoneumUBERON:000235860.74gold quality
subcutaneous adipose tissueUBERON:000219059.75gold quality
ponsUBERON:000098859.35silver quality
secondary oocyteCL:000065559.20gold quality
endothelial cellCL:000011559.14gold quality
aortaUBERON:000094758.40gold quality
hindlimb stylopod muscleUBERON:000425256.99gold quality
pancreatic ductal cellCL:000207956.68silver quality
corpus epididymisUBERON:000435956.46silver quality
prefrontal cortexUBERON:000045155.85gold quality
C1 segment of cervical spinal cordUBERON:000646955.60gold quality
tibial nerveUBERON:000132355.46gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes6.18

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

110 targeting THSD7B, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-340-5P100.0072.504437
HSA-MIR-6873-3P100.0071.422626
HSA-MIR-3163100.0077.238605
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-607799.9968.042299
HSA-MIR-520D-5P99.9873.344883
HSA-MIR-524-5P99.9873.434882
HSA-MIR-548AJ-3P99.9673.385345
HSA-MIR-548X-3P99.9673.385345
HSA-MIR-570-3P99.9672.414910
HSA-MIR-4666A-3P99.9671.713434
HSA-LET-7C-3P99.9573.422862
HSA-MIR-391099.9571.132227
HSA-MIR-548J-3P99.9472.614881
HSA-MIR-548AE-3P99.9372.664867
HSA-MIR-548AH-3P99.9372.544872
HSA-MIR-548AM-3P99.9372.544872
HSA-MIR-548AQ-3P99.9372.664867
HSA-MIR-515-5P99.9269.822343
HSA-MIR-519E-5P99.9269.622358
HSA-MIR-6508-5P99.9270.672465
HSA-MIR-338-5P99.9272.342951
HSA-MIR-450B-5P99.9271.483175
HSA-MIR-219A-5P99.9173.36735
HSA-MIR-329-3P99.9166.561234
HSA-MIR-362-3P99.9166.381267
HSA-MIR-548E-5P99.8972.734486
HSA-MIR-124-3P99.8973.743043
HSA-MIR-506-3P99.8973.553057
HSA-MIR-4782-3P99.8873.31735

Literature-anchored findings (GeneRIF, showing 2)

  • Several flanking SNPs of the top hits in the meta-analysis demonstrated borderline associations with alcohol dependence in the family sample for KIAA0040, NRD1 and THSD7B, respectively. (PMID:21703634)
  • THSD7B Mutation Induces Platinum Resistance in Small Cell Lung Cancer Patients. (PMID:35685767)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
danio_reriothsd7baENSDARG00000095022
mus_musculusThsd7bENSMUSG00000042581
rattus_norvegicusThsd7bENSRNOG00000003878

Paralogs (3): THSD7A (ENSG00000005108), SPON2 (ENSG00000159674), SPON1 (ENSG00000262655)

Protein

Protein identifiers

Thrombospondin type-1 domain-containing protein 7BQ9C0I4 (reviewed: Q9C0I4)

All UniProt accessions (2): Q9C0I4, R4GMU2

UniProt curated annotations — full annotation on UniProt →

Subcellular location. Membrane.

RefSeq proteins (1): NP_001303278* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR000884TSP1_rptRepeat
IPR036383TSP1_rpt_sfHomologous_superfamily
IPR044004TSP1_spondin_domDomain
IPR051418Spondin/Thrombospondin_T1Family
IPR056991TSP1_TSH7A-B_CDomain

Pfam: PF00090, PF19028, PF19030, PF23308

UniProt features (63 total): disulfide bond 24, domain 18, glycosylation site 11, sequence conflict 3, topological domain 2, signal peptide 1, chain 1, region of interest 1, compositionally biased region 1, transmembrane region 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9C0I4-F167.230.00

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Disulfide bonds (24): 411–477, 431–481, 442–466, 602–643, 613–617, 655–660, 738–779, 749–753, 789–795, 872–907, 883–887, 921–923, 937–993, 959–997, 970–983, 1001–1038, 1012–1016, 1120–1124, 1248–1286, 1259–1263 …

Glycosylation sites (11): 150, 190, 219, 683, 757, 842, 933, 1186, 1308, 1456, 1524

Function

Pathways and Gene Ontology

Reactome pathways

9 pathways

IDPathway
R-HSA-5083635Defective B3GALTL causes PpS
R-HSA-5173214O-glycosylation of TSR domain-containing proteins
R-HSA-1643685Disease
R-HSA-3781865Diseases of glycosylation
R-HSA-3906995Diseases associated with O-glycosylation of proteins
R-HSA-392499Metabolism of proteins
R-HSA-5173105O-linked glycosylation
R-HSA-5668914Diseases of metabolism
R-HSA-597592Post-translational protein modification

MSigDB gene sets: 82 (showing top): chr2q22, MEISSNER_NPC_HCP_WITH_H3_UNMETHYLATED, MEISSNER_BRAIN_HCP_WITH_H3K4ME3_AND_H3K27ME3, REACTOME_POST_TRANSLATIONAL_PROTEIN_MODIFICATION, MIKKELSEN_MEF_HCP_WITH_H3_UNMETHYLATED, REACTOME_DISEASES_OF_GLYCOSYLATION, REACTOME_DISEASES_ASSOCIATED_WITH_O_GLYCOSYLATION_OF_PROTEINS, REACTOME_O_LINKED_GLYCOSYLATION, REACTOME_O_GLYCOSYLATION_OF_TSR_DOMAIN_CONTAINING_PROTEINS, REACTOME_DISEASES_OF_METABOLISM, ZNF513_TARGET_GENES, MIR548AJ_3P_MIR548X_3P, MIR548AE_3P_MIR548AQ_3P, MIR548AH_3P_MIR548AM_3P, MIR548J_3P

GO Biological Process (1): actin cytoskeleton organization (GO:0030036)

GO Molecular Function (1): protein binding (GO:0005515)

GO Cellular Component (2): plasma membrane (GO:0005886), membrane (GO:0016020)

Reactome top-level categories

Rollup of top-7 pathways:

CategoryPathways
Diseases associated with O-glycosylation of proteins1
O-linked glycosylation1
Diseases of metabolism1
Diseases of glycosylation1
Post-translational protein modification1
Disease1
Metabolism of proteins1

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cytoskeleton organization1
actin filament-based process1
binding1
membrane1
cell periphery1
cellular anatomical structure1

Protein interactions and networks

STRING

680 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
THSD7BZNF595Q8IYB9582
THSD7BSNTG2Q9NY99462
THSD7BPGLYRP3Q96LB9454
THSD7BPGLYRP4Q96LB8453
THSD7BMAP3K19Q56UN5442
THSD7BOR5L1Q8NGL2422
THSD7BKIAA0040Q15053419
THSD7BOR5J2Q8NH18388
THSD7BSCN11AQ9UI33380
THSD7BOR1N2Q8NGR9379
THSD7BFREM3P0C091378
THSD7BNIPA2Q8N8Q9378
THSD7BNLGN3Q9NZ94377
THSD7BZNF473Q8WTR7377
THSD7BCLDN17P56750374

IntAct

76 interactions, top by confidence:

ABTypeScore
THSD7BLYVE1psi-mi:“MI:0915”(physical association)0.560
THSD7BKCNJ6psi-mi:“MI:0915”(physical association)0.560
THSD7BDARS2psi-mi:“MI:0915”(physical association)0.560
THSD7BFKBP7psi-mi:“MI:0915”(physical association)0.560
THSD7BSLC30A8psi-mi:“MI:0915”(physical association)0.560
THSD7BCISD2psi-mi:“MI:0915”(physical association)0.560
THSD7BAQP6psi-mi:“MI:0915”(physical association)0.560
THSD7BFAM209Apsi-mi:“MI:0915”(physical association)0.560
THSD7BPIGPpsi-mi:“MI:0915”(physical association)0.560
THSD7BSPACA1psi-mi:“MI:0915”(physical association)0.560
THSD7BSLC14A1psi-mi:“MI:0915”(physical association)0.560
THSD7BARL13Bpsi-mi:“MI:0915”(physical association)0.560
THSD7BSYT2psi-mi:“MI:0915”(physical association)0.560
THSD7BKIR3DL1psi-mi:“MI:0915”(physical association)0.560
THSD7BMFSD14Bpsi-mi:“MI:0915”(physical association)0.560
THSD7BGPR152psi-mi:“MI:0915”(physical association)0.560
THSD7BTMX2psi-mi:“MI:0915”(physical association)0.560
THSD7BGPR37L1psi-mi:“MI:0915”(physical association)0.560
THSD7BSTOMpsi-mi:“MI:0915”(physical association)0.560
THSD7BPLEKHO1psi-mi:“MI:0915”(physical association)0.560
THSD7BMTIF3psi-mi:“MI:0915”(physical association)0.560
THSD7BGPR101psi-mi:“MI:0915”(physical association)0.560
THSD7BGJA8psi-mi:“MI:0915”(physical association)0.560
THSD7BPEX12psi-mi:“MI:0915”(physical association)0.560

ESM2 similar proteins: A7MBS7, D3YXF5, F1LW30, O89103, P10643, P11680, P27918, P35446, P82987, P90884, Q29RQ1, Q2I0M5, Q2MKA7, Q3UPR9, Q3UTY6, Q4R7Z5, Q5M7L6, Q5RAD0, Q5RBP1, Q5RBP8, Q5UE90, Q64181, Q66PY1, Q69ZU6, Q6NZL8, Q6P4U0, Q6UXX9, Q6ZMP0, Q7T3Q2, Q7TSK7, Q80YN4, Q86TH1, Q8BFU0, Q8BJ73, Q8BLI0, Q8IUX8, Q8IWY4, Q8IX30, Q8N6G6, Q8VCC9

Diamond homologs: A7MBS7, B3EWY9, B3EWZ8, Q1RMU1, Q2MKA7, Q3UPR9, Q5R328, Q5R7Y0, Q69Z28, Q69ZU6, Q6P4U0, Q8BMS2, Q8IVN8, Q8TE57, Q9BUD6, Q9C0I4, Q9UPZ6, Q9WV75, Q9Z132, A8WGB1, B3EWZ3, C0HL12, C5IAW9, D3YXG0, D3ZTD8, F1LW30, G5ECS8, O08721, O08722, O08747, O14514, O15072, O60241, O60242, O95185, O95450, P07996, P10643, P11680, P27590

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

267 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance233
Likely benign13
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

5580 predictions. Top by Δscore:

VariantEffectΔscore
2:136765683:GACGG:Gdonor_gain1.0000
2:136765686:GG:Gdonor_gain1.0000
2:136765687:GG:Gdonor_gain1.0000
2:136836385:G:GTdonor_gain1.0000
2:136847154:G:GTdonor_gain1.0000
2:136882142:A:ACacceptor_loss1.0000
2:136882142:A:AGacceptor_gain1.0000
2:136882142:AG:Aacceptor_gain1.0000
2:136882143:G:GGacceptor_gain1.0000
2:136882143:G:GTacceptor_loss1.0000
2:136882143:GG:Gacceptor_gain1.0000
2:136882143:GGA:Gacceptor_gain1.0000
2:136882143:GGAAT:Gacceptor_gain1.0000
2:137056415:TGTA:Tacceptor_loss1.0000
2:137056416:GTA:Gacceptor_loss1.0000
2:137056417:TAGGT:Tacceptor_loss1.0000
2:137056418:A:ACacceptor_loss1.0000
2:137056419:GGTCC:Gacceptor_gain1.0000
2:137231039:TGTAG:Tacceptor_loss1.0000
2:137231040:GTA:Gacceptor_loss1.0000
2:137231042:A:AGacceptor_gain1.0000
2:137231042:A:ATacceptor_loss1.0000
2:137231043:G:GAacceptor_gain1.0000
2:137231043:GGA:Gacceptor_gain1.0000
2:137231205:TC:Tdonor_gain1.0000
2:137272532:GGA:Gacceptor_gain1.0000
2:137272532:GGAA:Gacceptor_gain1.0000
2:137272623:A:Tdonor_gain1.0000
2:137272667:G:GGdonor_gain1.0000
2:136765688:G:GGdonor_gain0.9900

AlphaMissense

10532 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:137242544:G:CW746C0.999
2:137242544:G:TW746C0.999
2:137620686:G:CW1253C0.998
2:137620686:G:TW1253C0.998
2:137620695:G:CW1256C0.998
2:137620695:G:TW1256C0.998
2:137657090:G:CW1435C0.998
2:137657090:G:TW1435C0.998
2:137657105:G:CW1440C0.998
2:137657105:G:TW1440C0.998
2:137657130:T:AC1449S0.998
2:137657131:G:CC1449S0.998
2:137659705:T:AC1473S0.998
2:137659706:G:CC1473S0.998
2:137663398:T:AC1492S0.998
2:137663399:G:CC1492S0.998
2:137663449:T:AC1509S0.998
2:137663450:G:CC1509S0.998
2:137663451:C:GC1509W0.998
2:137056844:G:CW188C0.997
2:137056844:G:TW188C0.997
2:137094948:G:CW342C0.997
2:137094948:G:TW342C0.997
2:137115148:G:CW408C0.997
2:137115148:G:TW408C0.997
2:137231150:G:CW610C0.997
2:137231150:G:TW610C0.997
2:137242542:T:AW746R0.997
2:137242542:T:CW746R0.997
2:137405743:G:CW877C0.997

dbSNP variants (sampled 300 via entrez): RS1000003698 (2:136852503 A>C), RS1000005092 (2:136836759 G>A,T), RS1000007071 (2:137265416 G>A), RS1000007231 (2:137435684 T>G), RS1000012329 (2:137337720 T>C), RS1000013801 (2:137648947 T>A), RS1000021907 (2:137016831 A>G), RS1000024316 (2:137293600 T>C), RS1000025254 (2:137175583 A>T), RS1000025523 (2:136877831 G>C), RS1000030128 (2:137658216 T>A), RS1000031336 (2:137606413 C>A,T), RS1000034902 (2:137218977 T>C), RS1000037947 (2:137136299 T>C), RS1000049005 (2:137331965 C>A)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (1): Ehlers-Danlos syndrome, classic type (MONDO:0007522)

Orphanet (1): Classical Ehlers-Danlos syndrome (Orphanet:287)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

26 associations (top):

StudyTraitp-value
GCST000745_16Pancreatic cancer7.000000e-06
GCST001521_12Subcutaneous adipose tissue3.000000e-06
GCST001734_9Non-small cell lung cancer7.000000e-06
GCST002208_8Liver enzyme levels (aspartate transaminase)4.000000e-06
GCST002256_1Tooth agenesis (third molar)8.000000e-06
GCST002931_14Aluminium levels4.000000e-06
GCST003255_7Urinary albumin-to-creatinine ratio7.000000e-06
GCST003518_80Daytime sleep phenotypes8.000000e-06
GCST003784_3Multiple system atrophy7.000000e-06
GCST004639_31Prudent dietary pattern7.000000e-06
GCST004639_38Prudent dietary pattern8.000000e-06
GCST004639_39Prudent dietary pattern8.000000e-06
GCST005987_29Albumin-globulin ratio2.000000e-11
GCST005990_43Non-albumin protein levels1.000000e-11
GCST006624_90Systolic blood pressure5.000000e-09
GCST006920_6Regular attendance at a gym or sports club3.000000e-08
GCST007323_101Risk-taking tendency (4-domain principal component model)3.000000e-09
GCST007327_33Smoking status (ever vs never smokers)7.000000e-09
GCST007327_81Smoking status (ever vs never smokers)5.000000e-08
GCST007603_30Smoking initiation2.000000e-08
GCST008152_178Weight8.000000e-06
GCST008810_85Smoking initiation (ever regular vs never regular)5.000000e-09
GCST010248_1Machado-Joseph disease (age at onset)4.000000e-06
GCST010546_7Problematic alcohol use2.000000e-09
GCST011703_52Smoking initiation3.000000e-09
GCST012489_48Heel bone mineral density x serum urate levels interaction6.000000e-09

EFO canonical traits (15, from GWAS)

EFO IDTrait name
EFO:0004736aspartate aminotransferase measurement
EFO:0007778urinary albumin to creatinine ratio
EFO:0007828daytime rest measurement
EFO:0008111diet measurement
EFO:0005128albumin:globulin ratio measurement
EFO:0006335systolic blood pressure
EFO:0009592social interaction measurement
EFO:0008579risk-taking behaviour
EFO:0004318smoking behavior
EFO:0005670smoking initiation
EFO:0004338body weight
EFO:0004847age at onset
EFO:0009458alcohol use disorder measurement
EFO:0004531urate measurement
EFO:0009270heel bone mineral density

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

PharmGKB clinical annotations

1 annotations.

VariantTypeLevelDrugsPhenotypes
rs13382553Toxicity3ethanolAlcohol abuse

PharmGKB variants

1 variants.

VariantGenesLevelScore#Clin annotsDrugs
rs13382553THSD7B31.501ethanol

CTD chemical–gene interactions

21 total (human), top 21 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, increases expression, affects expression7
aristolochic acid Idecreases expression1
bisphenol Aaffects methylation, affects cotreatment, increases methylation1
trichostatin Aincreases expression1
perfluorooctane sulfonic aciddecreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, increases expression1
dorsomorphinaffects cotreatment, increases expression1
bisphenol Saffects methylation1
Resveratrolaffects cotreatment, decreases expression1
Arsenic Trioxidedecreases expression1
Fulvestrantaffects cotreatment, increases methylation1
Acetaminophendecreases expression1
Arsenicaffects methylation1
Benzo(a)pyreneaffects methylation, increases methylation1
Caffeineincreases phosphorylation1
Carbamazepineaffects expression1
Copperaffects cotreatment, decreases expression1
Phthalic Acidsincreases methylation1
Tretinoinincreases expression1
Aflatoxin M1decreases expression1
Asbestos, Crocidolitedecreases methylation1

Clinical trials (associated diseases)

2 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT05429996Not specifiedUNKNOWNUltrastructural Collagen Markers in Ehlers Danlos Syndromes
NCT05434728Not specifiedUNKNOWNCharacterization of Bleeding Disorders in EDS