CYP20A1

gene
On this page

Also known as CYP-M

Summary

CYP20A1 (cytochrome P450 family 20 subfamily A member 1, HGNC:20576) is a protein-coding gene on chromosome 2q33.2, encoding Cytochrome P450 20A1 (Q6UW02).

This gene encodes a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are monooxygenases that catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids. This protein lacks one amino acid of the conserved heme binding site. It also lacks the conserved I-helix motif AGX(D,E)T, suggesting that its substrate may carry its own oxygen.

Source: NCBI Gene 57404 — RefSeq curated summary.

At a glance

  • GWAS associations: 17
  • Clinical variants (ClinVar): 82 total
  • MANE Select transcript: NM_177538

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:20576
Approved symbolCYP20A1
Namecytochrome P450 family 20 subfamily A member 1
Location2q33.2
Locus typegene with protein product
StatusApproved
AliasesCYP-M
Ensembl geneENSG00000119004
Ensembl biotypeprotein_coding
Entrez57404

Gene structure

Transcript identifiers

Ensembl transcripts: 12 — 6 protein_coding, 3 nonsense_mediated_decay, 3 protein_coding_CDS_not_defined

ENST00000356079, ENST00000428265, ENST00000429815, ENST00000431118, ENST00000443941, ENST00000449301, ENST00000461371, ENST00000463131, ENST00000489360, ENST00000875108, ENST00000875109, ENST00000956048

RefSeq mRNA: 13 — MANE Select: NM_177538 NM_001371695, NM_001371696, NM_001371697, NM_001371698, NM_001371699, NM_001371700, NM_001371701, NM_001371702, NM_001371703, NM_001371704, NM_001371705, NM_001371706, NM_177538

CCDS: CCDS2357, CCDS92931

Canonical transcript exons

ENST00000356079 — 13 exons

ExonStartEnd
ENSE00000803483203246755203246921
ENSE00001076405203245846203245895
ENSE00001554594203296758203306026
ENSE00001905806203239018203239134
ENSE00003493353203280059203280113
ENSE00003539280203251967203252109
ENSE00003573542203296474203296563
ENSE00003575020203278573203278688
ENSE00003661257203266514203266681
ENSE00003668160203272670203272748
ENSE00003671815203292262203292326
ENSE00003675193203289765203289876
ENSE00003785833203285612203285732

Expression profiles

Bgee: expression breadth ubiquitous, 225 present calls, max score 93.36.

FANTOM5 (CAGE): breadth ubiquitous, TPM avg 19.1715 / max 525.3354, expressed in 1816 samples.

FANTOM5 promoters (3 alternative TSS)

Promoter IDTPM avgSamples expressed
247469.54341799
247478.87541771
247480.7528434

Top tissues by expression

271 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
calcaneal tendonUBERON:000370193.36gold quality
primordial germ cell in gonadCL:0000670 ∩ UBERON:000099192.04gold quality
adrenal tissueUBERON:001830391.70gold quality
colonic epitheliumUBERON:000039788.75gold quality
monocyteCL:000057685.80gold quality
islet of LangerhansUBERON:000000685.52gold quality
mononuclear cellCL:000084285.31gold quality
leukocyteCL:000073885.24gold quality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047384.78gold quality
tendonUBERON:000004383.86gold quality
body of pancreasUBERON:000115083.17gold quality
pancreasUBERON:000126483.02gold quality
adrenal glandUBERON:000236982.82gold quality
rectumUBERON:000105282.74gold quality
right adrenal gland cortexUBERON:003582782.55gold quality
right adrenal glandUBERON:000123382.36gold quality
left adrenal glandUBERON:000123482.31gold quality
left adrenal gland cortexUBERON:003582581.95gold quality
popliteal arteryUBERON:000225081.46gold quality
tibial arteryUBERON:000761081.46gold quality
gall bladderUBERON:000211081.42gold quality
adrenal cortexUBERON:000123581.28gold quality
stromal cell of endometriumCL:000225581.06gold quality
aortaUBERON:000094780.87gold quality
left coronary arteryUBERON:000162680.82gold quality
right testisUBERON:000453480.80gold quality
sural nerveUBERON:001548880.62gold quality
thoracic aortaUBERON:000151580.26gold quality
ascending aortaUBERON:000149680.20gold quality
left ovaryUBERON:000211980.16gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 1.

ExperimentMarker?Max mean expression
E-ANND-3yes8.83

Regulation

Is transcription factor: no

Upstream regulators (CollecTRI, top): TCF3

Literature-anchored findings (GeneRIF, showing 2)

  • Multiple Alu Exonization in 3’UTR of a Primate-Specific Isoform of CYP20A1 Creates a Potential miRNA Sponge. (PMID:33434274)
  • The association study between CYP20A1, CYP4F2, CYP2D6 gene polymorphisms and coronary heart disease risk in the Han population in southern China. (PMID:34302632)

Cross-species orthologs

3 orthologs

OrganismSymbolGene ID
danio_reriocyp20a1ENSDARG00000006040
mus_musculusCyp20a1ENSMUSG00000049439
rattus_norvegicusCyp20a1ENSRNOG00000017631

Protein

Protein identifiers

Cytochrome P450 20A1Q6UW02 (reviewed: Q6UW02)

All UniProt accessions (5): Q6UW02, C9J8K7, E9PHG5, F2Z3K6, F8WBE2

UniProt curated annotations — full annotation on UniProt →

Subcellular location. Membrane.

Similarity. Belongs to the cytochrome P450 family.

Isoforms (2)

UniProt IDNamesCanonical?
Q6UW02-11yes
Q6UW02-22

RefSeq proteins (13): NP_001358624, NP_001358625, NP_001358626, NP_001358627, NP_001358628, NP_001358629, NP_001358630, NP_001358631, NP_001358632, NP_001358633, NP_001358634, NP_001358635, NP_803882* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR001128Cyt_P450Family
IPR002401Cyt_P450_E_grp-IFamily
IPR036396Cyt_P450_sfHomologous_superfamily
IPR052666CYP450_20A1-likeFamily

Pfam: PF00067

UniProt features (9 total): sequence conflict 3, sequence variant 2, chain 1, transmembrane region 1, binding site 1, splice variant 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-Q6UW02-F189.000.64

Functional residue map

Curated UniProt residues grouped by drug-discovery relevance — catalytic, ligand-binding, modification, and mutation-validated positions. Source: UniProtKB sequence features.

Ligand- & substrate-binding residues (1): 409 (axial binding residue)

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 135 (showing top): GSE45365_NK_CELL_VS_CD8_TCELL_UP, GOMF_OXIDOREDUCTASE_ACTIVITY_ACTING_ON_PAIRED_DONORS_WITH_INCORPORATION_OR_REDUCTION_OF_MOLECULAR_OXYGEN, KINSEY_TARGETS_OF_EWSR1_FLII_FUSION_DN, CUI_TCF21_TARGETS_2_DN, GOMF_TETRAPYRROLE_BINDING, GOMF_IRON_ION_BINDING, GOMF_MONOOXYGENASE_ACTIVITY, CCACACA_MIR147, KASLER_HDAC7_TARGETS_1_UP, PTEN_DN.V2_DN, GSE13522_CTRL_VS_T_CRUZI_Y_STRAIN_INF_SKIN_IFNG_KO_UP, GSE14415_INDUCED_TREG_VS_FAILED_INDUCED_TREG_UP, GSE14415_ACT_VS_CTRL_NATURAL_TREG_UP, GSE14386_UNTREATED_VS_IFNA_TREATED_ACT_PBMC_MS_PATIENT_DN, GSE13547_2H_VS_12_H_ANTI_IGM_STIM_BCELL_DN

GO Biological Process (0):

GO Molecular Function (7): monooxygenase activity (GO:0004497), iron ion binding (GO:0005506), oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (GO:0016705), heme binding (GO:0020037), protein binding (GO:0005515), oxidoreductase activity (GO:0016491), metal ion binding (GO:0046872)

GO Cellular Component (1): membrane (GO:0016020)

GO top-level categories

Rollup of top GO terms by namespace:

CategoryTerms
oxidoreductase activity2
transition metal ion binding1
tetrapyrrole binding1
binding1
catalytic activity1
cation binding1
cellular anatomical structure1

Protein interactions and networks

STRING

1512 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
CYP20A1ICA1LQ8NDH6544
CYP20A1GK5Q6ZS86542
CYP20A1CYP39A1Q9NYL5532
CYP20A1C17orf75Q9HAS0521
CYP20A1DCAF10Q5QP82514
CYP20A1ZNF429Q86V71497
CYP20A1CYP7A1P22680490
CYP20A1ZNF850A8MQ14461
CYP20A1PPIHO43447453
CYP20A1ZNF714Q96N38449
CYP20A1CYB5D2Q8WUJ1444
CYP20A1CIAO1O76071443
CYP20A1CACNG8Q8WXS5438
CYP20A1FKBP14Q9NWM8437
CYP20A1HSD3B7Q9H2F3437

IntAct

18 interactions, top by confidence:

ABTypeScore
VSIG1TTI1psi-mi:“MI:0914”(association)0.640
CYP20A1PIEZO1psi-mi:“MI:0915”(physical association)0.590
BTNL3FAM171A2psi-mi:“MI:0914”(association)0.530
VSIG1TNPO2psi-mi:“MI:0914”(association)0.530
Rrbp1PIPSLpsi-mi:“MI:0914”(association)0.350
COPEESYT2psi-mi:“MI:0914”(association)0.350
TMED10PGRMC1psi-mi:“MI:0914”(association)0.350
SLC11A2GPR89Apsi-mi:“MI:0914”(association)0.350
SLC16A6RER1psi-mi:“MI:0914”(association)0.350
SLC17A3ADCY3psi-mi:“MI:0914”(association)0.350
SLC44A1UPK3BL1psi-mi:“MI:0914”(association)0.350
SLC7A8SPTLC1psi-mi:“MI:0914”(association)0.350
CFTRUBA6psi-mi:“MI:2364”(proximity)0.270

BioGRID (31): CYP20A1 (Synthetic Growth Defect), CYP20A1 (Affinity Capture-MS), PIEZO1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-RNA), CYP20A1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-MS), PIEZO1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-MS), CYP20A1 (Affinity Capture-MS)

ESM2 similar proteins: A0A1V6PBE7, A0A5B8ND22, A0A9Y1LLN2, A8WGA0, E1BHJ4, F1RE08, G3V7X8, I1GQE7, M1W080, O02766, O23051, O46491, O75881, O88962, O93323, P0DOX0, P0DXH8, P18125, P22680, P46634, P48416, P51542, P79739, Q07973, Q08D50, Q09128, Q09660, Q4G0S4, Q50EK5, Q5E980, Q60991, Q62969, Q63688, Q64441, Q64505, Q6EIG3, Q6JD68, Q6P7D4, Q6UW02, Q7YRB2

Diamond homologs: A0A159AKG3, A0A1Z3GBS4, A0A218NGS0, A8NCK6, B5UAQ8, C8VJR0, E9Q5K4, F1B282, H2DH19, H2DH24, H2KYS3, O04773, O18809, O18963, O18993, O46515, O65012, O70537, O77809, O77810, P04800, P05181, P05182, P05183, P0DO35, P0DO40, P0DO41, P0DO42, P11509, P20853, P24454, P24462, P24470, P33266, P37123, P51538, P79102, Q05555, Q05556, Q16696

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

82 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic0
Likely pathogenic0
Uncertain significance53
Likely benign2
Benign2

Top pathogenic / likely-pathogenic (0)

SpliceAI

2159 predictions. Top by Δscore:

VariantEffectΔscore
2:203245844:A:Gacceptor_gain1.0000
2:203245891:GAAAA:Gdonor_gain1.0000
2:203245894:AA:Adonor_gain1.0000
2:203245896:G:GGdonor_gain1.0000
2:203246922:G:GGdonor_gain1.0000
2:203251965:A:AGacceptor_gain1.0000
2:203251965:AGC:Aacceptor_gain1.0000
2:203251966:G:GAacceptor_gain1.0000
2:203251966:GC:Gacceptor_gain1.0000
2:203251966:GCG:Gacceptor_gain1.0000
2:203252107:AAGG:Adonor_loss1.0000
2:203252108:AGG:Adonor_loss1.0000
2:203252110:G:Cdonor_loss1.0000
2:203252111:T:Adonor_loss1.0000
2:203266512:A:AGacceptor_gain1.0000
2:203266513:G:GCacceptor_gain1.0000
2:203266513:GC:Gacceptor_gain1.0000
2:203266513:GCT:Gacceptor_gain1.0000
2:203266513:GCTT:Gacceptor_gain1.0000
2:203266513:GCTTT:Gacceptor_gain1.0000
2:203266649:G:GTdonor_gain1.0000
2:203266677:GCACA:Gdonor_gain1.0000
2:203266678:CACA:Cdonor_gain1.0000
2:203266679:ACA:Adonor_gain1.0000
2:203266679:ACAGT:Adonor_loss1.0000
2:203266680:CA:Cdonor_gain1.0000
2:203266681:AGT:Adonor_loss1.0000
2:203266682:G:GGdonor_gain1.0000
2:203266682:GTAA:Gdonor_loss1.0000
2:203266683:T:Adonor_loss1.0000

AlphaMissense

3040 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
2:203246870:A:CS80R0.998
2:203246872:T:AS80R0.998
2:203246872:T:GS80R0.998
2:203239085:C:AA8D0.989
2:203246868:T:AV79D0.989
2:203266538:T:AW153R0.989
2:203266538:T:CW153R0.989
2:203246760:G:AG43D0.988
2:203246764:T:AN44K0.988
2:203246764:T:GN44K0.988
2:203289797:G:CR335P0.988
2:203296475:T:CF384L0.983
2:203296477:T:AF384L0.983
2:203296477:T:GF384L0.983
2:203246760:G:TG43V0.981
2:203246805:T:CL58P0.979
2:203292269:T:AV364D0.976
2:203246801:T:CF57L0.968
2:203246803:C:AF57L0.968
2:203246803:C:GF57L0.968
2:203246843:T:CF71L0.967
2:203246845:C:AF71L0.967
2:203246845:C:GF71L0.967
2:203289788:A:TE332V0.967
2:203246759:G:CG43R0.965
2:203239115:C:AA18D0.964
2:203296490:T:CF389L0.962
2:203296492:T:AF389L0.962
2:203296492:T:GF389L0.962
2:203296532:T:CF403L0.959

dbSNP variants (sampled 300 via entrez): RS1000065184 (2:203297639 C>A,G,T), RS1000080031 (2:203274172 A>C), RS1000088013 (2:203281945 T>A), RS1000101987 (2:203261973 C>A,G), RS1000352902 (2:203268987 C>T), RS1000376085 (2:203286184 G>A), RS1000378391 (2:203293428 G>A,T), RS1000399951 (2:203286221 C>T), RS1000447371 (2:203293336 C>T), RS1000493735 (2:203252884 C>T), RS1000499607 (2:203292898 G>A), RS1000597383 (2:203297804 C>A), RS1000608156 (2:203255605 C>G,T), RS1000663071 (2:203239278 G>A,C,T), RS1000684495 (2:203280303 C>G,T)

Disease associations

OMIM: gene `` | disease phenotypes:

GenCC curated gene-disease

Mondo (0):

Orphanet (0):

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

17 associations (top):

StudyTraitp-value
GCST000839_5Height2.000000e-06
GCST007094_143Diastolic blood pressure7.000000e-06
GCST007096_201Pulse pressure3.000000e-09
GCST007097_26Pulse pressure8.000000e-06
GCST008876_22Non-lobar intracerebral hemorrhage (MTAG)1.000000e-06
GCST010101_5White matter hyperintensities4.000000e-13
GCST010102_1White matter integrity (fractional anisotropy)7.000000e-08
GCST010698_18Subcortical volume (min-P)1.000000e-08
GCST010699_69Brain morphology (min-P)9.000000e-20
GCST010700_6Cortical thickness (MOSTest)2.000000e-14
GCST010701_36Cortical surface area (MOSTest)2.000000e-24
GCST010702_146Subcortical volume (MOSTest)2.000000e-13
GCST010703_42Brain morphology (MOSTest)1.000000e-08
GCST010726_18Periventricular white matter hyperintensities7.000000e-08
GCST90000025_865Appendicular lean mass7.000000e-15
GCST90011900_136Serum alkaline phosphatase levels5.000000e-24
GCST90013406_59Liver enzyme levels (alkaline phosphatase)7.000000e-30

EFO canonical traits (9, from GWAS)

EFO IDTrait name
EFO:0006336diastolic blood pressure
EFO:0005763pulse pressure measurement
EFO:0010178non-lobar intracerebral hemorrhage
EFO:0005665white matter hyperintensity measurement
EFO:0004641white matter integrity
EFO:0004346neuroimaging measurement
EFO:0004840cortical thickness
EFO:0004980appendicular lean mass
EFO:0004533alkaline phosphatase measurement

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

GtoPdb / IUPHAR curated pharmacology

(IUPHAR/BPS Guide to Pharmacology — expert-curated)

Target class: enzyme — CYP11, CYP17, CYP19, CYP20 and CYP21 families

CTD chemical–gene interactions

29 total (human), top 29 by PubMed support.

ChemicalActions (top 5)PubMed papers
sodium arseniteincreases abundance, increases expression3
Valproic Acidaffects expression, decreases expression2
Cyclosporineincreases expression2
aristolochic acid Idecreases expression1
bisphenol Aincreases expression1
tris(1,3-dichloro-2-propyl)phosphateincreases expression1
butyraldehydedecreases expression1
manganese chlorideincreases abundance, increases expression1
perfluorooctane sulfonic acidincreases expression1
fipronilaffects cotreatment, decreases expression1
2-palmitoylglycerolincreases expression1
Decitabineaffects expression1
Sunitinibincreases expression1
Acetaminophenincreases expression1
Arsenicincreases abundance, increases expression1
Benzo(a)pyreneincreases methylation1
Cadmiumincreases abundance, increases expression1
Cisplatinaffects expression1
DEETaffects cotreatment, decreases expression1
Diethylstilbestroldecreases expression1
Endosulfanincreases expression1
Gallic Aciddecreases expression1
Manganeseincreases abundance, increases expression1
Methyl Methanesulfonateincreases expression1
Silicon Dioxidedecreases expression1
Smokedecreases expression1
Tretinoindecreases expression1
Asbestos, Crocidolitedecreases expression1
Cadmium Chlorideincreases abundance, increases expression1

Clinical trials (associated diseases)

0 trials via MONDO — disease-level, not drug-specific.

No linked Atlas pages yet — the cross-entity mesh grows as the corpus expands.