MTHFSD

gene
On this page

Also known as FLJ12998

Summary

MTHFSD (methenyltetrahydrofolate synthetase domain containing, HGNC:25778) is a protein-coding gene on chromosome 16q24.1, encoding Methenyltetrahydrofolate synthase domain-containing protein (Q2M296).

Enables RNA binding activity. Predicted to be active in cytoplasm.

Source: NCBI Gene 64779 — RefSeq curated summary.

At a glance

  • GWAS associations: 6
  • Clinical variants (ClinVar): 89 total — 1 pathogenic
  • MANE Select transcript: NM_001159377

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:25778
Approved symbolMTHFSD
Namemethenyltetrahydrofolate synthetase domain containing
Location16q24.1
Locus typegene with protein product
StatusApproved
AliasesFLJ12998
Ensembl geneENSG00000103248
Ensembl biotypeprotein_coding
OMIM616820
Entrez64779

Gene structure

Transcript identifiers

Ensembl transcripts: 21 — 15 protein_coding, 3 nonsense_mediated_decay, 1 protein_coding_CDS_not_defined, 1 retained_intron, 1 TEC

ENST00000360900, ENST00000381214, ENST00000543303, ENST00000546093, ENST00000561522, ENST00000561848, ENST00000561989, ENST00000562096, ENST00000562940, ENST00000562994, ENST00000564364, ENST00000565482, ENST00000566050, ENST00000566469, ENST00000567539, ENST00000568037, ENST00000568798, ENST00000569000, ENST00000625049, ENST00000634347, ENST00000950129

RefSeq mRNA: 5 — MANE Select: NM_001159377 NM_001159377, NM_001159378, NM_001159379, NM_001159380, NM_022764

CCDS: CCDS42211, CCDS54047, CCDS54048, CCDS58490

Canonical transcript exons

ENST00000360900 — 8 exons

ExonStartEnd
ENSE000022352948653018686532481
ENSE000034633338654655986546649
ENSE000034691868655516986555235
ENSE000034901218654846486548577
ENSE000034959668654169786541822
ENSE000036553168654210186542213
ENSE000036559398655203386552146
ENSE000037211848655464586554751

Expression profiles

Bgee: expression breadth ubiquitous, 207 present calls, max score 93.18.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 8.3623 / max 82.6070, expressed in 1774 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
1584468.36231774

Top tissues by expression

264 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
buccal mucosa cellCL:000233693.18gold quality
right uterine tubeUBERON:000130290.46gold quality
minor salivary glandUBERON:000183085.59gold quality
granulocyteCL:000009485.41gold quality
left testisUBERON:000453385.41gold quality
rectumUBERON:000105285.33gold quality
right lobe of thyroid glandUBERON:000111985.27gold quality
right testisUBERON:000453485.21gold quality
gall bladderUBERON:000211084.92gold quality
left lobe of thyroid glandUBERON:000112084.66gold quality
skin of abdomenUBERON:000141684.61gold quality
skin of legUBERON:000151184.49gold quality
body of pancreasUBERON:000115084.35gold quality
stromal cell of endometriumCL:000225584.30gold quality
apex of heartUBERON:000209884.18gold quality
metanephros cortexUBERON:001053384.12gold quality
right adrenal gland cortexUBERON:003582784.12gold quality
right adrenal glandUBERON:000123384.09gold quality
endocervixUBERON:000045884.02gold quality
testisUBERON:000047384.01gold quality
mucosa of stomachUBERON:000119983.99gold quality
thyroid glandUBERON:000204683.89gold quality
left ovaryUBERON:000211983.89gold quality
body of stomachUBERON:000116183.85gold quality
esophagogastric junction muscularis propriaUBERON:003584183.79gold quality
ganglionic eminenceUBERON:000402383.67gold quality
right ovaryUBERON:000211883.63gold quality
monocyteCL:000057683.61gold quality
transverse colonUBERON:000115783.61gold quality
lower esophagus muscularis layerUBERON:003583383.55gold quality

Single-cell (SCXA)

Detected in 2 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes3.29
E-MTAB-6058no42.06

Regulation

Is transcription factor: no

miRNA regulators (miRDB)

72 targeting MTHFSD, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-4747-5P100.0067.902681
HSA-MIR-5196-5P100.0067.982761
HSA-MIR-4533100.0069.482758
HSA-MIR-30A-5P100.0076.313233
HSA-MIR-30B-5P100.0076.293248
HSA-MIR-30C-5P100.0076.293248
HSA-MIR-30D-5P100.0076.323233
HSA-MIR-30E-5P100.0076.323242
HSA-MIR-3120-5P100.0065.56965
HSA-MIR-185-3P99.9567.011743
HSA-MIR-452599.9464.38675
HSA-MIR-5010-5P99.9464.11705
HSA-MIR-539-5P99.9370.302855
HSA-MIR-990299.8969.152250
HSA-MIR-444799.8567.812900
HSA-MIR-4728-5P99.8569.394718
HSA-MIR-548AG99.7769.251492
HSA-MIR-92A-2-5P99.7567.012164
HSA-MIR-471999.7372.103329
HSA-MIR-6752-3P99.7266.711587
HSA-MIR-548M99.7068.871749
HSA-MIR-548AI99.6969.241494
HSA-MIR-548BA99.6969.141514
HSA-MIR-570-5P99.6969.241494
HSA-MIR-10394-5P99.6566.831852
HSA-MIR-120599.6566.761826
HSA-MIR-129099.5969.902079
HSA-MIR-3942-3P99.5769.032854
HSA-MIR-432899.5771.064094

Cross-species orthologs

4 orthologs

OrganismSymbolGene ID
danio_reriomthfsdENSDARG00000062042
mus_musculusMthfsdENSMUSG00000031816
rattus_norvegicusMthfsdENSRNOG00000046443
drosophila_melanogasterlostFBGN0263594

Protein

Protein identifiers

Methenyltetrahydrofolate synthase domain-containing proteinQ2M296 (reviewed: Q2M296)

All UniProt accessions (14): Q2M296, A0A087WX22, F5H0M7, H3BML0, H3BMX6, H3BNE5, H3BP93, H3BPD2, H3BQU0, H3BSB5, H3BSD1, H3BSW8, H3BUU0, H3BVG2

Isoforms (4)

UniProt IDNamesCanonical?
Q2M296-11yes
Q2M296-22
Q2M296-33
Q2M296-44

RefSeq proteins (5): NP_001152849, NP_001152850, NP_001152851, NP_001152852, NP_073601 (=MANE)

Domains & families (InterPro)

IDNameType
IPR000504RRM_domDomain
IPR002698FTHF_cligaseFamily
IPR012677Nucleotide-bd_a/b_plait_sfHomologous_superfamily
IPR024185FTHF_cligase-like_sfHomologous_superfamily
IPR034359MTHFSD_RRMDomain
IPR035979RBD_domain_sfHomologous_superfamily
IPR037171NagB/RpiA_transferase-likeHomologous_superfamily

Pfam: PF00076, PF01812

UniProt features (21 total): strand 6, sequence variant 5, sequence conflict 2, helix 2, splice variant 2, chain 1, domain 1, turn 1, region of interest 1

Structure

Experimental structures (PDB)

1 structures.

PDBMethodResolution (Å)
2E5JSOLUTION NMR

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q2M296-F185.200.67

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 52 (showing top): EBNA1BP2_TARGET_GENES, FEV_TARGET_GENES, GLI3_TARGET_GENES, LMTK3_TARGET_GENES, NFKBIA_TARGET_GENES, ZNF436_TARGET_GENES, ZNF528_TARGET_GENES, ZNF92_TARGET_GENES, MIR3942_3P, MIR4447, MIR5196_5P, MIR4747_5P, MIR1253, MIR4472, MIR1912_3P

GO Biological Process (0):

GO Molecular Function (2): RNA binding (GO:0003723), nucleic acid binding (GO:0003676)

GO Cellular Component (1): cytoplasm (GO:0005737)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
nucleic acid binding1
binding1
intracellular anatomical structure1
cellular anatomical structure1

Protein interactions and networks

STRING

1690 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
MTHFSDFOXL1Q12952633
MTHFSDMTHFSP49914619
MTHFSDFOXF1Q12946550
MTHFSDTMEM200CA6NKL6547
MTHFSDGPAT2Q6NUI2532
MTHFSDSH2D4AQ9H788511
MTHFSDDEF8Q6ZN54505
MTHFSDABLIM3O94929466
MTHFSDMTFMTQ96DP5458
MTHFSDFOXC2Q99958447
MTHFSDSYCP2Q9BX26441
MTHFSDDAGLBQ8NCG7438
MTHFSDPDCD5O14737434
MTHFSDTCF25Q9BQ70424
MTHFSDADAM22Q9P0K1417

IntAct

10 interactions, top by confidence:

ABTypeScore
RPS3ZNF316psi-mi:“MI:0914”(association)0.530
CUL2ANXA2P2psi-mi:“MI:0914”(association)0.350
EFCAB2IQGAP2psi-mi:“MI:0914”(association)0.350
C1orf174POLRMTpsi-mi:“MI:0914”(association)0.350
H1-1POLRMTpsi-mi:“MI:0914”(association)0.350
MACROH2A2ZNF316psi-mi:“MI:0914”(association)0.350
PYM1POLRMTpsi-mi:“MI:0914”(association)0.350
SERBP1ZNF593psi-mi:“MI:0914”(association)0.350
ZNF346ZNF316psi-mi:“MI:0914”(association)0.350

BioGRID (16): MTHFSD (Affinity Capture-RNA), MTHFSD (Affinity Capture-MS), MTHFSD (Affinity Capture-MS), MTHFSD (Affinity Capture-MS), MTHFSD (Affinity Capture-MS), MTHFSD (Protein-RNA), MTHFSD (Cross-Linking-MS (XL-MS)), MTHFSD (Co-fractionation), MTHFSD (Co-fractionation), MTHFSD (Protein-peptide), MTHFSD (Affinity Capture-RNA), MTHFSD (Affinity Capture-Western), MTHFSD (Affinity Capture-Western), MTHFSD (Affinity Capture-Western), MTHFSD (Affinity Capture-MS)

ESM2 similar proteins: A0A8C2MDK8, A7SLX5, A7YY46, D3ZEY4, D3ZX08, E9QAM5, O59713, O95822, P0C7A1, P48760, P52333, P52824, Q002B5, Q07071, Q14397, Q15477, Q2KI24, Q2M296, Q2NKY8, Q3SYT1, Q3T7C9, Q3U1Y4, Q3URQ7, Q4R380, Q52L34, Q567W6, Q568Y2, Q5I0I8, Q5NCQ5, Q5ZHX9, Q6GPQ5, Q6NZR5, Q6P5E8, Q8BUI3, Q8BX80, Q8C9A2, Q8NFF5, Q8NFI3, Q91X44, Q920F5

Diamond homologs: Q0P464, Q2KI24, Q2M296, Q3URQ7, Q52L34, Q9SRE0

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

89 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance74
Likely benign4
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
869495NC_000016.9:g.86243180_87703229delPathogenic

SpliceAI

1947 predictions. Top by Δscore:

VariantEffectΔscore
16:86532482:C:CCacceptor_gain1.0000
16:86532482:CTG:Cacceptor_loss1.0000
16:86541818:ACGAC:Aacceptor_gain1.0000
16:86541819:CGAC:Cacceptor_gain1.0000
16:86541819:CGACC:Cacceptor_gain1.0000
16:86541820:GAC:Gacceptor_gain1.0000
16:86541821:AC:Aacceptor_gain1.0000
16:86541822:CC:Cacceptor_gain1.0000
16:86541822:CCTGG:Cacceptor_loss1.0000
16:86541823:C:CAacceptor_loss1.0000
16:86541823:C:CCacceptor_gain1.0000
16:86541824:T:Aacceptor_loss1.0000
16:86542081:AG:Adonor_gain1.0000
16:86542081:AGC:Adonor_gain1.0000
16:86542081:AGCC:Adonor_gain1.0000
16:86548573:TTGCT:Tacceptor_gain1.0000
16:86548576:CT:Cacceptor_gain1.0000
16:86548578:C:CCacceptor_gain1.0000
16:86548579:T:Cacceptor_gain1.0000
16:86532477:CTGAT:Cacceptor_gain0.9900
16:86532478:TGAT:Tacceptor_gain0.9900
16:86541692:CCCA:Cdonor_loss0.9900
16:86541693:CCACC:Cdonor_loss0.9900
16:86541694:CACCT:Cdonor_loss0.9900
16:86541695:A:Gdonor_loss0.9900
16:86541823:C:Tacceptor_gain0.9900
16:86542050:C:CAdonor_gain0.9900
16:86542082:G:Cdonor_gain0.9900
16:86542096:AGCAC:Adonor_loss0.9900
16:86542097:GCAC:Gdonor_loss0.9900

AlphaMissense

2475 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
16:86548558:A:TV86D0.992
16:86546570:A:TV144D0.991
16:86554648:A:CF40L0.991
16:86554648:A:TF40L0.991
16:86554650:A:GF40L0.991
16:86542197:C:AK153N0.990
16:86542197:C:GK153N0.990
16:86542114:A:TV181D0.989
16:86542206:T:AR150S0.989
16:86542206:T:GR150S0.989
16:86554728:G:TR14S0.985
16:86546573:G:TA143D0.984
16:86546574:C:GA143P0.984
16:86552071:C:GD67H0.984
16:86546580:A:GS141P0.983
16:86554681:A:CF29L0.983
16:86554681:A:TF29L0.983
16:86554683:A:GF29L0.983
16:86554738:T:AK10N0.983
16:86554738:T:GK10N0.983
16:86542207:C:GR150T0.982
16:86542123:A:TV178D0.981
16:86548474:G:TA114D0.981
16:86546632:A:CS123R0.980
16:86546632:A:TS123R0.980
16:86546634:T:GS123R0.980
16:86552069:A:CD67E0.980
16:86552069:A:TD67E0.980
16:86552070:T:AD67V0.980
16:86554658:A:GI37T0.980

dbSNP variants (sampled 300 via entrez): RS1000030343 (16:86551937 C>G,T), RS1000361324 (16:86547308 T>G), RS1000388796 (16:86530345 C>G,T), RS1000417582 (16:86534949 T>C), RS1000566624 (16:86545718 C>A,T), RS1000616385 (16:86555275 C>A,T), RS1000740579 (16:86541766 G>A,C), RS1000761207 (16:86530563 C>T), RS1000874485 (16:86554368 G>A,C), RS1001086885 (16:86542821 C>G,T), RS1001282173 (16:86536878 A>T), RS1001296984 (16:86543098 T>C), RS1001336236 (16:86537046 T>C), RS1001352053 (16:86538146 G>A), RS1001411315 (16:86547765 G>A)

Disease associations

OMIM: gene MIM:616820 | disease phenotypes: MIM:265380

GenCC curated gene-disease

Mondo (1): alveolar capillary dysplasia with misalignment of pulmonary veins (MONDO:0009934)

Orphanet (1): Congenital alveolar capillary dysplasia (Orphanet:210122)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

6 associations (top):

StudyTraitp-value
GCST004025_25Systemic juvenile idiopathic arthritis5.000000e-06
GCST006431_17Plasma parathyroid hormone levels2.000000e-06
GCST007431_55Lung function (FEV1/FVC)7.000000e-09
GCST007432_60FEV14.000000e-10
GCST011365_151Myocardial infarction2.000000e-06
GCST011742_68Triglyceride levels in HIV infection7.000000e-06

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0004713FEV/FVC ratio
EFO:0004314forced expiratory volume
EFO:0004530triglyceride measurement

MeSH disease descriptors (1)

DescriptorNameTree numbers
C536590Alveolar capillary dysplasia (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

21 total (human), top 21 by PubMed support.

ChemicalActions (top 5)PubMed papers
Valproic Acidaffects cotreatment, decreases expression, affects expression4
entinostatdecreases expression, affects cotreatment2
aristolochic acid Iincreases expression1
triphenyl phosphateaffects expression1
ferrous chloridedecreases expression1
di-n-butylphosphoric acidaffects expression1
CGP 52608increases reaction, affects binding1
monomethylarsonous acidincreases expression1
4-(5-benzo(1,3)dioxol-5-yl-4-pyridin-2-yl-1H-imidazol-2-yl)benzamideaffects cotreatment, decreases expression1
dorsomorphinaffects cotreatment, decreases expression1
bisphenol Sincreases methylation1
Sunitinibincreases expression1
Benzo(a)pyreneaffects methylation1
Ethyl Methanesulfonateincreases expression1
Formaldehydeincreases expression1
Leaddecreases expression1
Methyl Methanesulfonateincreases expression1
Ribonucleotidesaffects binding1
Testosteronedecreases expression1
Asbestos, Serpentinedecreases methylation1
Copper Sulfatedecreases expression1

Clinical trials (associated diseases)

2 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00732537PHASE4COMPLETEDInhaled Nitric Oxide by Oxygen Hood in Neonates
NCT04328636PHASE1/PHASE2COMPLETEDNebulized MgSO4 in Persistent Pulmonary Hypertension of Newborn