ARGFX

gene
On this page

Summary

ARGFX (arginine-fifty homeobox, HGNC:30146) is a protein-coding gene on chromosome 3q13.33, encoding Arginine-fifty homeobox (A6NJG6). Transcription factor that acts as activator.

Homeobox genes encode DNA-binding proteins, many of which are thought to be involved in early embryonic development. Homeobox genes encode a DNA-binding domain of 60 to 63 amino acids referred to as the homeodomain. This gene is a member of the ARGFX homeobox gene family.

Source: NCBI Gene 503582 — RefSeq curated summary.

At a glance

  • GWAS associations: 4
  • Clinical variants (ClinVar): 42 total — 1 pathogenic
  • MANE Select transcript: NM_001012659

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:30146
Approved symbolARGFX
Namearginine-fifty homeobox
Location3q13.33
Locus typegene with protein product
StatusApproved
Ensembl geneENSG00000186103
Ensembl biotypeprotein_coding
OMIM611164
Entrez503582

Gene structure

Transcript identifiers

Ensembl transcripts: 2 — 2 protein_coding

ENST00000334384, ENST00000651603

RefSeq mRNA: 1 — MANE Select: NM_001012659 NM_001012659

CCDS: CCDS33834

Canonical transcript exons

ENST00000334384 — 5 exons

ExonStartEnd
ENSE00001340146121584917121585065
ENSE00001498042121576784121576900
ENSE00001498043121570702121570816
ENSE00003845215121567949121568013
ENSE00003850786121586022121590622

Expression profiles

Bgee: expression breadth tissue_specific, 2 present calls, max score 53.79.

FANTOM5 (CAGE): breadth tissue_specific, TPM avg 0.0070 / max 8.2516, expressed in 2 samples.

FANTOM5 promoters (1 alternative TSS)

Promoter IDTPM avgSamples expressed
381940.00702

Top tissues by expression

118 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
bone marrow cellCL:000209253.79gold quality
sural nerveUBERON:001548851.26gold quality
skeletal muscle tissueUBERON:000113451.17gold quality
colonic epitheliumUBERON:000039750.20gold quality
stromal cell of endometriumCL:000225549.22gold quality
muscle tissueUBERON:000238548.81gold quality
bone marrowUBERON:000237147.68gold quality
ventricular zoneUBERON:000305346.77gold quality
cortical plateUBERON:000534346.71gold quality
mucosa of transverse colonUBERON:000499146.48gold quality
ganglionic eminenceUBERON:000402345.49gold quality
bloodUBERON:000017841.31gold quality
duodenumUBERON:000211440.82gold quality
liverUBERON:000210740.65gold quality
granulocyteCL:000009439.42gold quality
calcaneal tendonUBERON:000370139.28gold quality
gastrocnemiusUBERON:000138839.07gold quality
lower esophagus mucosaUBERON:003583438.82gold quality
monocyteCL:000057638.75gold quality
apex of heartUBERON:000209838.56silver quality
primary visual cortexUBERON:000243638.53gold quality
leukocyteCL:000073838.40gold quality
cortex of kidneyUBERON:000122538.13gold quality
smooth muscle tissueUBERON:000113537.87gold quality
superior frontal gyrusUBERON:000266137.80gold quality
adrenal tissueUBERON:001830337.37gold quality
endometriumUBERON:000129537.32gold quality
lymph nodeUBERON:000002937.23gold quality
prefrontal cortexUBERON:000045135.91gold quality
right coronary arteryUBERON:000162535.50gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes5.22

Regulation

Is transcription factor: yes

JASPAR motifs

MotifNameFamily
MA1463.1ARGFXPaired-related HD factors
MA1463.2ARGFXPaired-related HD factors

JASPAR matrix evidence (PMIDs): PMID:18585359

miRNA regulators (miRDB)

179 targeting ARGFX, top 30 by miRDB confidence (max_score; target_count = how many genes the miRNA targets in total — lower means more specific):

miRNAMax scoreAvg scoremiRNA target_count
HSA-MIR-3613-3P100.0076.367965
HSA-MIR-5011-5P100.0083.465820
HSA-MIR-190A-3P100.0080.355520
HSA-MIR-4776-3P100.0068.731340
HSA-MIR-656-3P100.0072.152788
HSA-MIR-3924100.0072.092394
HSA-MIR-4747-5P100.0067.902681
HSA-MIR-5196-5P100.0067.982761
HSA-MIR-3646100.0073.565283
HSA-MIR-4283100.0066.422097
HSA-MIR-450A-1-3P100.0069.331837
HSA-MIR-371B-5P99.9975.344759
HSA-MIR-3667-3P99.9967.171636
HSA-MIR-366299.9973.825684
HSA-MIR-477599.9875.006394
HSA-MIR-373-5P99.9875.364753
HSA-MIR-616-5P99.9875.584775
HSA-MIR-4482-3P99.9872.503147
HSA-MIR-569699.9872.364487
HSA-MIR-314899.9775.066478
HSA-MIR-590-3P99.9674.346478
HSA-MIR-568899.9673.234504
HSA-MIR-495-3P99.9672.814197
HSA-MIR-4725-3P99.9669.532520
HSA-MIR-6780B-5P99.9669.602562
HSA-MIR-211099.9666.681930
HSA-MIR-6778-3P99.9667.292693
HSA-MIR-185-3P99.9567.011743
HSA-MIR-3912-5P99.9566.11925
HSA-MIR-9983-3P99.9471.483631

Cross-species orthologs

0 orthologs

Paralogs (50): ARX (ENSG00000004848), PAX6 (ENSG00000007372), PAX7 (ENSG00000009709), ALX4 (ENSG00000052850), GSC2 (ENSG00000063515), PITX1 (ENSG00000069011), PAX2 (ENSG00000075891), RHOXF1 (ENSG00000101883), CRX (ENSG00000105392), EVX1 (ENSG00000106038), PAX4 (ENSG00000106331), NOBOX (ENSG00000106410), PITX3 (ENSG00000107859), PHOX2B (ENSG00000109132), OTX1 (ENSG00000115507), PRRX1 (ENSG00000116132), VSX2 (ENSG00000119614), ESX1 (ENSG00000123576), PAX8 (ENSG00000125618), PAX1 (ENSG00000125813), RHOXF2 (ENSG00000131721), GSC (ENSG00000133937), RAX (ENSG00000134438), PAX3 (ENSG00000135903), ALX3 (ENSG00000156150), HESX1 (ENSG00000163666), PITX2 (ENSG00000164093), UNCX (ENSG00000164853), PHOX2A (ENSG00000165462), OTX2 (ENSG00000165588), DRGX (ENSG00000165606), PRRX2 (ENSG00000167157), SHOX2 (ENSG00000168779), OTP (ENSG00000171540), RAX2 (ENSG00000173976), EVX2 (ENSG00000174279), PROP1 (ENSG00000175325), ISX (ENSG00000175329), ALX1 (ENSG00000180318), MIXL1 (ENSG00000185155)

Protein

Protein identifiers

Arginine-fifty homeoboxA6NJG6 (reviewed: A6NJG6)

All UniProt accessions (1): A6NJG6

UniProt curated annotations — full annotation on UniProt →

Function. Transcription factor that acts as activator.

Subcellular location. Nucleus.

Tissue specificity. Expressed at low level in testis and undifferentiated embryonic stem cells.

Similarity. Belongs to the paired homeobox family.

RefSeq proteins (1): NP_001012677* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001356HDDomain
IPR009057Homeodomain-like_sfHomologous_superfamily
IPR017970Homeobox_CSConserved_site

Pfam: PF00046

UniProt features (7 total): compositionally biased region 2, chain 1, DNA-binding region 1, region of interest 1, sequence variant 1, sequence conflict 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-A6NJG6-F157.780.18

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 35 (showing top): GOMF_SEQUENCE_SPECIFIC_DNA_BINDING, GOMF_TRANSCRIPTION_REGULATOR_ACTIVITY, MIR616_5P, MIR371B_5P, MIR373_5P, MIR4659A_3P_MIR4659B_3P, MIR548AZ_5P, MIR548T_5P, MIR576_5P, MIR7106_5P, MIR3202, MIR6780A_5P, MIR4743_3P, MIR1273H_5P, MIR138_2_3P

GO Biological Process (2): regulation of transcription by RNA polymerase II (GO:0006357), regulation of DNA-templated transcription (GO:0006355)

GO Molecular Function (4): RNA polymerase II cis-regulatory region sequence-specific DNA binding (GO:0000978), DNA-binding transcription factor activity, RNA polymerase II-specific (GO:0000981), sequence-specific double-stranded DNA binding (GO:1990837), DNA binding (GO:0003677)

GO Cellular Component (2): chromatin (GO:0000785), nucleus (GO:0005634)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
RNA polymerase II transcription regulatory region sequence-specific DNA binding2
regulation of DNA-templated transcription1
transcription by RNA polymerase II1
DNA-templated transcription1
regulation of gene expression1
regulation of RNA biosynthetic process1
cis-regulatory region sequence-specific DNA binding1
chromatin1
DNA-binding transcription factor activity1
regulation of transcription by RNA polymerase II1
double-stranded DNA binding1
sequence-specific DNA binding1
nucleic acid binding1
chromosome1
cellular anatomical structure1
intracellular membrane-bounded organelle1

Protein interactions and networks

STRING

542 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
ARGFXTPRX1Q8N7U7797
ARGFXDUXBA0A1W2PPF3730
ARGFXDUXAA6NLW8693
ARGFXNANOGNBQ7Z5D8623
ARGFXKLF17Q5JT82591
ARGFXVENTXO95231590
ARGFXANHXE9PGG2535
ARGFXKHDC1LQ5JSQ8532
ARGFXDPPA3Q6W0C5510
ARGFXRHOXF2BP0C7M4497
ARGFXPLAC9Q5JTB6460
ARGFXKHDC1Q4VXA5440
ARGFXDPPA5A6NC42434
ARGFXGDF3Q9NR23417
ARGFXDPPA2Q7Z7J5404

IntAct

0 interactions, top by confidence:

BioGRID (8): GNB2 (Proximity Label-MS), GNB2L1 (Proximity Label-MS), EP300 (Proximity Label-MS), NUP43 (Proximity Label-MS), CREBBP (Proximity Label-MS), WDR92 (Proximity Label-MS), MLST8 (Proximity Label-MS), DTL (Proximity Label-MS)

ESM2 similar proteins: A0A1W2PPF3, A0A1W2PPM1, A1A546, A1YGI6, A2T763, A5YC49, A6NFQ7, A6NJG6, D2HQI1, G3X9P6, O42173, O57374, P09632, P0C7M4, P10242, P14837, P17278, P31272, P31538, Q1KKS8, Q28ET4, Q28G02, Q3LTE0, Q3UT54, Q4JM65, Q4KL20, Q5TM83, Q5TM84, Q5W1J6, Q68EH7, Q6NSW7, Q80Z64, Q8IUE1, Q8JH55, Q8JIT7, Q8JJ26, Q8MIB7, Q8MIB8, Q8MIE9, Q91685

Diamond homologs: A0A096LPI5, A6NIU2, A6NJG6, F2Z398, P0DTE4, P51957, Q09FC8, Q5H9K5, Q5T7P6, Q68CZ1, Q6B4Z3, Q6UX73, Q86U02, Q8IV13, Q8N7M2, Q8N9N2, Q8NDZ0, Q8NEM8, Q8TDM0, Q92918, Q96J02, Q96MD7, Q9BUA6, Q9NXG0, A1A546, A1YEY5, A1YFI3, A1YG57, A1YGA2, A2T733, A2T777, A2T7P4, A6NFQ7, A6NJT0, A6NNA5, A6YP92, F1NEA7, G5EC89, O08934, O14813

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

42 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance35
Likely benign5
Benign0

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
4819208Single allelePathogenic

SpliceAI

693 predictions. Top by Δscore:

VariantEffectΔscore
3:121570812:TCCCA:Tdonor_gain1.0000
3:121570813:CCCA:Cdonor_gain1.0000
3:121570814:CCA:Cdonor_gain1.0000
3:121570815:CA:Cdonor_gain1.0000
3:121570816:AG:Adonor_loss1.0000
3:121570817:G:GGdonor_gain1.0000
3:121585060:G:GGdonor_gain1.0000
3:121570820:A:AGdonor_gain0.9900
3:121570821:G:GGdonor_gain0.9900
3:121584907:T:TAacceptor_gain0.9900
3:121584912:T:TAacceptor_gain0.9900
3:121570789:A:AGdonor_gain0.9800
3:121584915:A:AGacceptor_gain0.9800
3:121584916:G:GGacceptor_gain0.9800
3:121586019:CA:Cacceptor_loss0.9700
3:121586020:A:ACacceptor_loss0.9700
3:121584918:A:AGacceptor_gain0.9600
3:121584916:GCA:Gacceptor_gain0.9400
3:121570819:GA:Gdonor_gain0.9200
3:121585050:G:GTdonor_gain0.9200
3:121586014:A:AGacceptor_gain0.9200
3:121586021:GGTTT:Gacceptor_gain0.9200
3:121570818:T:Adonor_loss0.9000
3:121585061:TAAAG:Tdonor_loss0.9000
3:121585062:AAAGG:Adonor_loss0.9000
3:121585064:AGGTC:Adonor_loss0.9000
3:121585065:GGTC:Gdonor_loss0.9000
3:121585067:T:Gdonor_loss0.9000
3:121570819:GAGTA:Gdonor_gain0.8900
3:121570820:AGTAT:Adonor_gain0.8900

AlphaMissense

2066 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
3:121586028:T:CF126L0.994
3:121586030:C:AF126L0.994
3:121586030:C:GF126L0.994
3:121586029:T:CF126S0.988
3:121584949:T:CF85L0.987
3:121584951:C:AF85L0.987
3:121584951:C:GF85L0.987
3:121584985:T:CF97L0.986
3:121584987:T:AF97L0.986
3:121584987:T:GF97L0.986
3:121585000:T:CF102L0.986
3:121585002:C:AF102L0.986
3:121585002:C:GF102L0.986
3:121586038:G:CR129P0.985
3:121586025:T:AW125R0.974
3:121586025:T:CW125R0.974
3:121586029:T:GF126C0.971
3:121584950:T:CF85S0.969
3:121585046:T:CL117P0.969
3:121584986:T:CF97S0.966
3:121586041:G:CR130P0.964
3:121584950:T:GF85C0.960
3:121586027:G:CW125C0.959
3:121586027:G:TW125C0.959
3:121584974:T:CL93P0.958
3:121586028:T:GF126V0.957
3:121586048:A:CK132N0.957
3:121586048:A:TK132N0.957
3:121586043:T:CF131L0.954
3:121586045:C:AF131L0.954

dbSNP variants (sampled 300 via entrez): RS1000122614 (3:121574094 T>C), RS1000223165 (3:121571716 A>C), RS1000389820 (3:121577455 T>C), RS1000413149 (3:121584156 C>T), RS1000502538 (3:121571645 A>G), RS1000920095 (3:121579154 G>A,T), RS1000946194 (3:121566006 T>A,C), RS1001091660 (3:121566872 T>C), RS1001101024 (3:121584001 C>A,G), RS1001103748 (3:121572901 G>A,C), RS1001212735 (3:121570186 A>C), RS1001242385 (3:121569888 T>C), RS1001307144 (3:121572621 A>G), RS1001904956 (3:121569667 G>A), RS1001908842 (3:121582492 A>G)

Disease associations

OMIM: gene MIM:611164 | disease phenotypes:

GenCC curated gene-disease

Mondo (1): partial duplication of the long arm of chromosome 3 (MONDO:0016954)

Orphanet (1): Partial duplication of the long arm of chromosome 3 syndrome (Orphanet:262851)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

4 associations (top):

StudyTraitp-value
GCST003014_5Postprandial triglyceride response to high fat diet meal9.000000e-07
GCST003014_6Postprandial triglyceride response to high fat diet meal4.000000e-07
GCST011741_49LDL cholesterol levels in HIV infection9.000000e-06
GCST90002395_380Mean platelet volume7.000000e-13

EFO canonical traits (3, from GWAS)

EFO IDTrait name
EFO:0007681triglyceride change measurement
EFO:0007684response to high fat food intake
EFO:0004611low density lipoprotein cholesterol measurement

MeSH disease descriptors (1)

DescriptorNameTree numbers
C536813Chromosome 3, trisomy 3q (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

3 total (human), top 3 by PubMed support.

ChemicalActions (top 5)PubMed papers
Benzo(a)pyreneincreases methylation1
Cadmiumdecreases expression1
Niclosamidedecreases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.