MFSD1

gene
On this page

Also known as MinervaSLC72A1FLJ14153UG0581B09

Summary

MFSD1 (major facilitator superfamily domain containing 1, HGNC:25874) is a protein-coding gene on chromosome 3q25.32, encoding Lysosomal dipeptide transporter MFSD1 (Q9H3U5). Lysosomal dipeptide uniporter that selectively exports lysine, arginine or histidine-containing dipeptides with a net positive charge from the lysosome lumen into the cytosol.

Enables dipeptide uniporter activity. Involved in dipeptide transmembrane transport from lysosomal lumen to cytosol. Is active in lysosomal membrane.

Source: NCBI Gene 64747 — RefSeq curated summary.

At a glance

  • GWAS associations: 2
  • Clinical variants (ClinVar): 120 total
  • MANE Select transcript: NM_022736

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:25874
Approved symbolMFSD1
Namemajor facilitator superfamily domain containing 1
Location3q25.32
Locus typegene with protein product
StatusApproved
AliasesMinerva, SLC72A1, FLJ14153, UG0581B09
Ensembl geneENSG00000118855
Ensembl biotypeprotein_coding
OMIM619976
Entrez64747

Gene structure

Transcript identifiers

Ensembl transcripts: 25 — 16 protein_coding, 5 retained_intron, 4 nonsense_mediated_decay

ENST00000264266, ENST00000392813, ENST00000415822, ENST00000465235, ENST00000465624, ENST00000465739, ENST00000468409, ENST00000471266, ENST00000471500, ENST00000474670, ENST00000476860, ENST00000477743, ENST00000480292, ENST00000482835, ENST00000484166, ENST00000486568, ENST00000489639, ENST00000491804, ENST00000622669, ENST00000651862, ENST00000893305, ENST00000893306, ENST00000893307, ENST00000945280, ENST00000945281

RefSeq mRNA: 4 — MANE Select: NM_022736 NM_001167903, NM_001289406, NM_001289407, NM_022736

CCDS: CCDS3185, CCDS54666

Canonical transcript exons

ENST00000415822 — 16 exons

ExonStartEnd
ENSE00001821566158802054158802315
ENSE00003493263158824124158824236
ENSE00003507702158809179158809287
ENSE00003509305158820215158820326
ENSE00003525235158821597158821653
ENSE00003545740158821984158822140
ENSE00003554596158819649158819747
ENSE00003564792158823428158823525
ENSE00003608459158826015158826062
ENSE00003633520158807040158807082
ENSE00003635978158827280158827337
ENSE00003637209158813965158814067
ENSE00003660264158805362158805474
ENSE00003694329158807396158807463
ENSE00003788745158804319158804371
ENSE00003892740158828979158829716

Expression profiles

Bgee: expression breadth ubiquitous, 289 present calls, max score 98.82.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 72.0875 / max 841.4899, expressed in 1824 samples.

FANTOM5 promoters (10 alternative TSS)

Promoter IDTPM avgSamples expressed
3951368.51941823
395101.2593605
395180.7739172
395150.6564124
395090.3019123
395170.173873
395140.165167
395160.141260
395190.082035
395110.01444

Top tissues by expression

292 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
monocyteCL:000057698.82gold quality
mononuclear cellCL:000084298.81gold quality
leukocyteCL:000073898.62gold quality
esophagus squamous epitheliumUBERON:000692098.52gold quality
trabecular bone tissueUBERON:000248398.32gold quality
palpebral conjunctivaUBERON:000181298.20gold quality
epithelium of esophagusUBERON:000197698.17gold quality
visceral pleuraUBERON:000240198.06gold quality
pleuraUBERON:000097797.97gold quality
squamous epitheliumUBERON:000691497.92gold quality
epithelium of nasopharynxUBERON:000195197.91gold quality
parietal pleuraUBERON:000240097.90gold quality
nasopharynxUBERON:000172897.89gold quality
gingivaUBERON:000182897.69gold quality
skin of hipUBERON:000155497.65gold quality
parotid glandUBERON:000183197.45gold quality
gingival epitheliumUBERON:000194997.43gold quality
oral cavityUBERON:000016797.41gold quality
tibiaUBERON:000097997.34gold quality
mammalian vulvaUBERON:000099797.33gold quality
upper leg skinUBERON:000426297.18gold quality
germinal epithelium of ovaryUBERON:000130497.13gold quality
calcaneal tendonUBERON:000370197.09gold quality
bone marrowUBERON:000237197.04gold quality
cervix squamous epitheliumUBERON:000692296.94gold quality
bone marrow cellCL:000209296.92gold quality
layer of synovial tissueUBERON:000761696.92gold quality
synovial jointUBERON:000221796.91gold quality
eyeUBERON:000097096.90gold quality
deciduaUBERON:000245096.88gold quality

Single-cell (SCXA)

Detected in 8 experiment(s), a significant marker in 7.

ExperimentMarker?Max mean expression
E-MTAB-6678yes40.35
E-MTAB-9543yes40.26
E-HCAD-13yes20.81
E-CURD-122yes20.01
E-CURD-112yes14.18
E-MTAB-9067yes13.89
E-GEOD-99795no274.49
E-ANND-3no0.00

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

32 targeting MFSD1, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-8485100.0077.574731
HSA-MIR-6499-3P99.9066.381212
HSA-MIR-576-5P99.8470.462582
HSA-MIR-6715A-3P99.8368.051473
HSA-MIR-4694-3P99.7969.532640
HSA-MIR-119799.7067.751027
HSA-MIR-561-3P99.6470.903647
HSA-MIR-392399.5269.21446
HSA-MIR-548G-3P99.4868.672159
HSA-MIR-4797-5P99.3968.011354
HSA-MIR-426399.1869.252236
HSA-MIR-770299.0665.95698
HSA-MIR-4699-5P98.9967.501210
HSA-MIR-143-5P98.9868.87946
HSA-MIR-3124-3P98.8768.952123
HSA-MIR-3074-5P98.8266.561414
HSA-MIR-4716-5P98.8268.571168
HSA-MIR-6715B-3P98.8068.071204
HSA-MIR-3144-3P98.1567.34677
HSA-MIR-188-5P97.8967.01756
HSA-MIR-6765-3P97.8364.591165
HSA-MIR-6893-3P97.7964.911238
HSA-MIR-61897.6267.46861
HSA-MIR-467897.5968.31902
HSA-MIR-6866-3P97.3866.94748
HSA-MIR-370-3P97.0964.921221
HSA-MIR-383-5P96.8667.55820
HSA-MIR-331-5P96.5967.94705
HSA-MIR-3157-3P95.8667.08454
HSA-MIR-1178-5P95.8364.12504

Literature-anchored findings (GeneRIF, showing 2)

  • MFSD1 protein localizes in lysosomes in HeLa cells (PMID:21752829)
  • Orphan lysosomal solute carrier MFSD1 facilitates highly selective dipeptide transport. (PMID:38507452)

Cross-species orthologs

13 orthologs

OrganismSymbolGene ID
danio_reriomfsd1ENSDARG00000101379
mus_musculusMfsd1ENSMUSG00000027775
rattus_norvegicusMfsd1ENSRNOG00000013790
drosophila_melanogasterCG12194FBGN0031636
drosophila_melanogastermrvaFBGN0035763
caenorhabditis_elegansWBGENE00010182
caenorhabditis_elegansWBGENE00010282
caenorhabditis_elegansWBGENE00012222
caenorhabditis_elegansWBGENE00016093
caenorhabditis_elegansWBGENE00016094
caenorhabditis_elegansWBGENE00016095
caenorhabditis_elegansWBGENE00017861
caenorhabditis_elegansWBGENE00021814

Paralogs (6): MFSD10 (ENSG00000109736), MFSD9 (ENSG00000135953), MFSD14B (ENSG00000148110), MFSD14A (ENSG00000156875), MFSD8 (ENSG00000164073), MFSD5 (ENSG00000182544)

Protein

Protein identifiers

Lysosomal dipeptide transporter MFSD1Q9H3U5 (reviewed: Q9H3U5)

Alternative names: Major facilitator superfamily domain-containing protein 1, Smooth muscle cell-associated protein 4

All UniProt accessions (10): Q9H3U5, A0A494C105, C9J5G6, C9JAC3, C9JBA3, C9JCH3, C9JZW5, H7C4F4, H7C593, H7C5U3

UniProt curated annotations — full annotation on UniProt →

Function. Lysosomal dipeptide uniporter that selectively exports lysine, arginine or histidine-containing dipeptides with a net positive charge from the lysosome lumen into the cytosol. Could play a role in a specific type of protein O-glycosylation indirectly regulating macrophages migration and tissue invasion. Also essential for liver homeostasis.

Subunit / interactions. Homodimer. Interacts with lysosomal protein GLMP (via lumenal domain); the interaction starts while both proteins are still in the endoplasmic reticulum and is required for stabilization of MFSD1 in lysosomes but has no direct effect on its targeting to lysosomes or transporter activity.

Subcellular location. Lysosome membrane.

Domain organisation. The dileucine internalization motif is required for lysosomal localization.

Miscellaneous. May be produced at very low levels due to a premature stop codon in the mRNA, leading to nonsense-mediated mRNA decay.

Similarity. Belongs to the major facilitator superfamily.

Isoforms (6)

UniProt IDNamesCanonical?
Q9H3U5-11yes
Q9H3U5-22
Q9H3U5-33
Q9H3U5-44
Q9H3U5-55
Q9H3U5-66

RefSeq proteins (4): NP_001161375, NP_001276335, NP_001276336, NP_073573* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR011701MFSFamily
IPR020846MFS_domDomain
IPR036259MFS_trans_sfHomologous_superfamily
IPR052187MFSD1Family

Pfam: PF07690

Catalyzed reactions (Rhea), 12 shown:

  • L-alpha-aminoacyl-L-arginine(out) = L-alpha-aminoacyl-L-arginine(in) (RHEA:79367)
  • L-arginyl-L-alpha-amino acid(out) = L-arginyl-L-alpha-amino acid(in) (RHEA:79371)
  • L-alpha-aminoacyl-L-histidine(out) = L-alpha-aminoacyl-L-histidine(in) (RHEA:79375)
  • L-histidyl-L-alpha-amino acid(out) = L-histidyl-L-alpha-amino acid(in) (RHEA:79379)
  • L-alpha-aminoacyl-L-lysine(out) = L-alpha-aminoacyl-L-lysine(in) (RHEA:79383)
  • L-lysyl-L-alpha-amino acid(out) = L-lysyl-L-alpha-amino acid(in) (RHEA:79387)
  • L-arginyl-glycine(out) = L-arginyl-glycine(in) (RHEA:79391)
  • L-histidyl-glycine(out) = L-histidyl-glycine(in) (RHEA:79395)
  • L-lysyl-L-alanine(out) = L-lysyl-L-alanine(in) (RHEA:79399)
  • L-lysyl-L-lysine(out) = L-lysyl-L-lysine(in) (RHEA:79403)
  • L-lysyl-glycine(out) = L-lysyl-glycine(in) (RHEA:79407)
  • L-aspartyl-L-lysine(out) = L-aspartyl-L-lysine(in) (RHEA:79411)

UniProt features (27 total): transmembrane region 12, splice variant 6, sequence variant 4, chain 1, region of interest 1, short sequence motif 1, mutagenesis site 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q9H3U5-F188.830.69

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Mutagenesis-validated functional residues (1):

PositionPhenotype
11–12loss of localization to lysosome. localizes to plasma membrane.

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 211 (showing top): GOBP_LYSOSOMAL_TRANSPORT, MODULE_255, GOCC_VACUOLAR_MEMBRANE, MODULE_151, GRAESSMANN_APOPTOSIS_BY_SERUM_DEPRIVATION_UP, MODULE_317, GOBP_VACUOLAR_TRANSPORT, GOBP_PROTEIN_LOCALIZATION_TO_LYSOSOME, GNF2_MCL1, GOBP_PROTEIN_LOCALIZATION_TO_VACUOLE, GOBP_PROTEIN_STABILIZATION, GOBP_REGULATION_OF_PROTEIN_STABILITY, DOUGLAS_BMI1_TARGETS_DN, NAKAMURA_TUMOR_ZONE_PERIPHERAL_VS_CENTRAL_DN, TIEN_INTESTINE_PROBIOTICS_24HR_UP

GO Biological Process (5): protein stabilization (GO:0050821), protein localization to lysosome (GO:0061462), dipeptide transmembrane transport from lysosomal lumen to cytosol (GO:0141204), dipeptide transmembrane transport (GO:0035442), transmembrane transport (GO:0055085)

GO Molecular Function (3): protein homodimerization activity (GO:0042803), dipeptide uniporter activity (GO:0160178), transmembrane transporter activity (GO:0022857)

GO Cellular Component (4): lysosome (GO:0005764), lysosomal membrane (GO:0005765), endomembrane system (GO:0012505), membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
cellular anatomical structure2
regulation of protein stability1
protein localization to vacuole1
peptide transport1
transmembrane transport from lysosomal lumen to cytosol1
oligopeptide transmembrane transport1
dipeptide transport1
transport1
cellular process1
identical protein binding1
protein dimerization activity1
uniporter activity1
dipeptide transmembrane transporter activity1
transporter activity1
transmembrane transport1
lytic vacuole1
lysosome1
lytic vacuole membrane1
vacuole1
plasma membrane1

Protein interactions and networks

STRING

806 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
MFSD1SLC67A2Q8NBP5566
MFSD1SLC49A3Q6UXD7512
MFSD1SLC33A2Q96ES6509
MFSD1MFSD11O43934493
MFSD1UNC93AQ86WB7489
MFSD1SLC75A1Q14728459
MFSD1GFM1Q96RP9448
MFSD1SLC71A1Q96MC6446
MFSD1MFSD8Q8NHS3446
MFSD1MLF1P58340430
MFSD1MFSD6Q6ZSS7420
MFSD1SLC61A1Q6N075407
MFSD1SLC29A3Q9BZD2402
MFSD1OR2F1Q13607399
MFSD1SLC39A11Q8N1S5398
MFSD1M0QYU9M0QYU9398

IntAct

5 interactions, top by confidence:

ABTypeScore
TSPAN31TMEM120Bpsi-mi:“MI:0914”(association)0.350
CACNG1TMEM120Bpsi-mi:“MI:0914”(association)0.350
MFSD1RP2psi-mi:“MI:0914”(association)0.350
MFSD1RAB10psi-mi:“MI:0915”(physical association)0.000

BioGRID (23): MFSD1 (Affinity Capture-RNA), MFSD1 (Affinity Capture-RNA), MFSD1 (Affinity Capture-MS), MFSD1 (Affinity Capture-RNA), MFSD1 (Affinity Capture-MS), MFSD1 (Affinity Capture-MS), MFSD1 (Affinity Capture-MS), ABCA3 (Affinity Capture-MS), ACP2 (Affinity Capture-MS), ANKRD13A (Affinity Capture-MS), APP (Affinity Capture-MS), ATP13A2 (Affinity Capture-MS), ATP5L (Affinity Capture-MS), CC2D1A (Affinity Capture-MS), OSTC (Affinity Capture-MS)

ESM2 similar proteins: A6NFX1, A6QQL0, D2HKB0, D3ZG27, F1NCD6, F1NJ67, O09014, O70496, O82390, P51798, P51799, Q17QZ3, Q1JQC1, Q32LQ6, Q3T9M1, Q3T9X0, Q3TIT8, Q4PKH3, Q504N2, Q58CV5, Q5F3N0, Q5R8G5, Q5RB09, Q5T4D3, Q5ZIT9, Q5ZKS8, Q66H95, Q68F72, Q69YG0, Q6AY78, Q6PDE8, Q7SY29, Q8BFQ6, Q8BG19, Q8GX78, Q8IVW8, Q8IZD6, Q8N697, Q8NCC5, Q8NEB5

Diamond homologs: Q1JQC1, Q32LQ6, Q5R8G5, Q5ZIT9, Q9DC37, Q9H3U5

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

120 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance80
Likely benign4
Benign0

Top pathogenic / likely-pathogenic (0)

SpliceAI

2833 predictions. Top by Δscore:

VariantEffectΔscore
3:158802171:G:GTdonor_gain1.0000
3:158803320:G:GTdonor_gain1.0000
3:158804318:GGCA:Gacceptor_gain1.0000
3:158804368:ACGA:Adonor_gain1.0000
3:158804369:CGA:Cdonor_gain1.0000
3:158804370:GA:Gdonor_gain1.0000
3:158804370:GAG:Gdonor_gain1.0000
3:158804371:AG:Adonor_loss1.0000
3:158804372:G:GGdonor_gain1.0000
3:158804372:GTAA:Gdonor_loss1.0000
3:158804373:T:Adonor_loss1.0000
3:158804374:AAGTT:Adonor_loss1.0000
3:158805472:ACGG:Adonor_loss1.0000
3:158805473:CGGTA:Cdonor_loss1.0000
3:158805475:GTAAG:Gdonor_loss1.0000
3:158807390:TTATA:Tacceptor_loss1.0000
3:158807391:TATA:Tacceptor_loss1.0000
3:158807393:TA:Tacceptor_loss1.0000
3:158807394:A:ATacceptor_loss1.0000
3:158807395:GGTT:Gacceptor_gain1.0000
3:158807461:TGGG:Tdonor_loss1.0000
3:158807462:GG:Gdonor_gain1.0000
3:158807463:GG:Gdonor_gain1.0000
3:158807463:GGTA:Gdonor_loss1.0000
3:158807464:G:Cdonor_loss1.0000
3:158807464:G:GGdonor_gain1.0000
3:158807465:T:Adonor_loss1.0000
3:158809171:A:AGacceptor_gain1.0000
3:158809172:T:Gacceptor_gain1.0000
3:158809174:TTTA:Tacceptor_loss1.0000

AlphaMissense

3031 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
3:158809231:T:CF165L1.000
3:158809233:T:AF165L1.000
3:158809233:T:GF165L1.000
3:158809262:G:AG175E1.000
3:158809273:A:CS179R1.000
3:158809275:C:AS179R1.000
3:158809275:C:GS179R1.000
3:158822025:G:AG321E1.000
3:158823470:T:AW374R1.000
3:158823470:T:CW374R1.000
3:158823522:G:AG391D1.000
3:158824142:T:AN398K1.000
3:158824142:T:GN398K1.000
3:158805413:T:AW90R0.999
3:158805413:T:CW90R0.999
3:158805421:T:AN92K0.999
3:158805421:T:GN92K0.999
3:158805431:T:CC96R0.999
3:158805432:G:AC96Y0.999
3:158805433:T:GC96W0.999
3:158805443:G:CG100R0.999
3:158805444:G:AG100D0.999
3:158807044:G:CG112R0.999
3:158807462:G:TG147W0.999
3:158807463:G:AG147E0.999
3:158809184:G:AG149D0.999
3:158809190:A:TE151V0.999
3:158809202:T:AV155D0.999
3:158809205:C:AA156D0.999
3:158809212:T:AN158K0.999

dbSNP variants (sampled 300 via entrez): RS1000011986 (3:158807217 A>T), RS1000182979 (3:158807998 C>T), RS1000276334 (3:158800096 C>T), RS1000418555 (3:158823839 G>A,C), RS1000524773 (3:158814417 G>A,T), RS1000614192 (3:158803890 T>C), RS1000658374 (3:158820162 G>A), RS1000724663 (3:158827036 C>T), RS1000838022 (3:158827836 G>C,T), RS1000847728 (3:158811286 G>A), RS1000913160 (3:158812353 A>G,T), RS1000922039 (3:158805831 C>A), RS1001028848 (3:158822269 C>G), RS1001238012 (3:158824674 C>T), RS1001260370 (3:158816786 CTAACGTTTAAGTCTT>C)

Disease associations

OMIM: gene MIM:619976 | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

2 associations (top):

StudyTraitp-value
GCST002126_21Periodontitis (CDC/AAP)3.000000e-07
GCST004251_5Paneth cell defects in Crohn’s disease2.000000e-06

EFO canonical traits (1, from GWAS)

EFO IDTrait name
EFO:0007963abnormal paneth cell measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

21 total (human), top 21 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects expression, decreases methylation, affects cotreatment, increases expression9
sodium arsenitedecreases expression, increases expression2
Tretinoinincreases expression2
methylmercuric chloridedecreases expression1
triphenyl phosphateaffects expression1
sodium bichromatedecreases expression1
perfluorooctane sulfonic acidincreases expression1
CGP 52608affects binding, increases reaction1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, increases expression1
dorsomorphinaffects cotreatment, increases expression1
Resveratrolaffects cotreatment, increases expression1
Atrazinedecreases expression1
Benzo(a)pyreneaffects methylation1
Carbamazepineaffects expression1
Doxorubicindecreases expression1
Formaldehydeincreases expression1
Methyl Methanesulfonateincreases expression1
Plant Extractsaffects cotreatment, increases expression1
Cyclosporinedecreases expression1
Aflatoxin B1increases methylation1
Vitamin K 3affects expression1

Cellosaurus cell lines

4 cell lines: 3 cancer cell line, 1 transformed cell line

First 10 cell lines (id-ordered, not curated):

CellosaurusNameCategorySex
CVCL_B1X2Abcam HeLa MFSD1 KOCancer cell lineFemale
CVCL_D4EB1321N1-MFSD1-KO-c12Cancer cell lineMale
CVCL_D4EC1321N1-MFSD1-KO-c5Cancer cell lineMale
CVCL_D9K9Ubigene HEK293 MFSD1 KOTransformed cell lineFemale

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.