C22orf31

gene
On this page

Also known as HS747E2AbK747E2.1

Summary

C22orf31 (chromosome 22 open reading frame 31, HGNC:26931) is a protein-coding gene on chromosome 22q12.1, encoding Uncharacterized protein C22orf31 (O95567).

At a glance

  • GWAS associations: 4
  • Clinical variants (ClinVar): 14 total — 1 pathogenic
  • MANE Select transcript: NM_015370

Identifiers

Gene identifiers

FieldValue
HGNC IDHGNC:26931
Approved symbolC22orf31
Namechromosome 22 open reading frame 31
Location22q12.1
Locus typegene with protein product
StatusApproved
AliasesHS747E2A, bK747E2.1
Ensembl geneENSG00000100249
Ensembl biotypeprotein_coding
Entrez25770

Gene structure

Transcript identifiers

Ensembl transcripts: 1 — 1 protein_coding

ENST00000216071

RefSeq mRNA: 2 — MANE Select: NM_015370 NM_001386866, NM_015370

CCDS: CCDS13848

Canonical transcript exons

ENST00000216071 — 3 exons

ExonStartEnd
ENSE000006519982906041529060843
ENSE000008796992905867229059182
ENSE000010481932906179029061831

Expression profiles

Bgee: expression breadth ubiquitous, 140 present calls, max score 96.01.

Top tissues by expression

270 total, by Bgee expression score (0-100, higher = more expressed):

TissueAnatomy IDExpression scoreQuality
male germ line stem cell (sensu Vertebrata) in testisCL:0000089 ∩ UBERON:000047396.01gold quality
pancreatic ductal cellCL:000207983.30silver quality
spermCL:000001979.36gold quality
male germ cellCL:000001578.12gold quality
right testisUBERON:000453475.63gold quality
left testisUBERON:000453375.01gold quality
testisUBERON:000047373.14gold quality
buccal mucosa cellCL:000233671.51silver quality
triceps brachiiUBERON:000150968.81gold quality
gluteal muscleUBERON:000200067.84gold quality
epithelial cell of pancreasCL:000008367.83silver quality
putamenUBERON:000187466.25gold quality
endometrium epitheliumUBERON:000481165.26gold quality
lateral globus pallidusUBERON:000247664.83silver quality
tendon of biceps brachiiUBERON:000818864.66gold quality
hair follicleUBERON:000207364.23gold quality
caudate nucleusUBERON:000187364.15gold quality
parotid glandUBERON:000183163.11gold quality
deltoidUBERON:000147662.62gold quality
skeletal muscle tissue of rectus abdominisUBERON:000451162.06gold quality
heart right ventricleUBERON:000208061.63gold quality
cranial nerve IIUBERON:000094161.38silver quality
epithelium of nasopharynxUBERON:000195161.07gold quality
biceps brachiiUBERON:000150760.88gold quality
paraflocculusUBERON:000535160.78gold quality
nucleus accumbensUBERON:000188260.63gold quality
frontal poleUBERON:000279560.22gold quality
middle frontal gyrusUBERON:000270260.08gold quality
olfactory bulbUBERON:000226459.91gold quality
pigmented layer of retinaUBERON:000178259.87gold quality

Single-cell (SCXA)

Detected in 1 experiment(s), a significant marker in 0.

ExperimentMarker?Max mean expression
E-ANND-3no1.62

Regulation

Is transcription factor: no

Cross-species orthologs

0 orthologs

Protein

Protein identifiers

Uncharacterized protein C22orf31O95567 (reviewed: O95567)

All UniProt accessions (1): O95567

RefSeq proteins (2): NP_001373795, NP_056185* (*=MANE)

Domains & families (InterPro)

IDNameType
IPR028970DUF4662Family

Pfam: PF15578

UniProt features (8 total): compositionally biased region 3, region of interest 2, sequence variant 2, chain 1

Structure

Experimental structures (PDB)

0 structures.

Predicted structure (AlphaFold)

ModelpLDDTFraction very-high
AF-O95567-F157.020.15

Function

Pathways and Gene Ontology

Reactome pathways

0 pathways

MSigDB gene sets: 52 (showing top): GCANCTGNY_MYOD_Q6, CHX10_01, MODULE_99, TGANTCA_AP1_C, TGACATY_UNKNOWN, GATA1_04, LYF1_01, HAND1E47_01, MODULE_48, MODULE_95, TGGAAA_NFAT_Q4_01, TAATTA_CHX10_01, WGGAATGY_TEF1_Q6, E4BP4_01, YTAAYNGCT_UNKNOWN

GO Biological Process (0):

GO Molecular Function (0):

GO Cellular Component (0):

Protein interactions and networks

STRING

90 interactions, top by confidence (×1000):

Protein AProtein BPartner UniProtScore
C22orf31TEDC1Q86SX3541
C22orf31ESYT3A0FGR9497
C22orf31TMED10P49755477
C22orf31TP53I13Q8NBR0476
C22orf31LONRF3Q496Y0447
C22orf31KRT33BQ14525446
C22orf31KLHL20Q9Y2M5409
C22orf31LPCAT2Q7L5N7397
C22orf31NR1H2P55055385
C22orf31SYVN1Q86TM6378
C22orf31C2orf78A6NCI8377
C22orf31ACTBL2Q562R1369
C22orf31METRNLQ641Q3306
C22orf31TBCELQ5QJ74297
C22orf31KIF17Q9P2E2290

IntAct

4 interactions, top by confidence:

ABTypeScore
C22orf31HDAC2psi-mi:“MI:0914”(association)0.530
ECE1C22orf31psi-mi:“MI:0915”(physical association)0.370
C22orf31HDAC1psi-mi:“MI:0914”(association)0.350

BioGRID (10): HDAC1 (Affinity Capture-MS), LACRT (Affinity Capture-MS), LACRT (Affinity Capture-MS), HDAC2 (Affinity Capture-MS), HDAC1 (Affinity Capture-MS), C22orf31 (Biochemical Activity), HDAC2 (Affinity Capture-MS), LACRT (Affinity Capture-MS), MBD2 (Cross-Linking-MS (XL-MS)), SMC6 (Cross-Linking-MS (XL-MS))

ESM2 similar proteins: A0A1B0GVR7, A0PJW8, A2BYT2, A5FRX9, B8G1X0, C1DKL7, G3UWD5, O42659, O62953, O95567, O98453, P05899, P11690, P11794, P19718, P20920, P27975, P32544, P33482, P39971, P52776, P54446, P69516, P69517, P86209, Q02781, Q0ABH1, Q1X6Y0, Q3AMN4, Q3J8R8, Q3K5Z2, Q3MFB7, Q3ZZM0, Q537H7, Q57P89, Q5I162, Q5PHY6, Q6AY31, Q75003, Q7CQJ0

SIGNOR signaling

0 interactions.

Disease & clinical

Clinical variants and AI predictions

ClinVar

14 variants total. Per-class counts are floors (≥ shown; pagination cap):

ClassificationCount (floor)
Pathogenic1
Likely pathogenic0
Uncertain significance10
Likely benign1
Benign1

Top pathogenic / likely-pathogenic (1)

Variant IDHGVSClassification
832563NC_000022.10:g.(?29083885)(29621477_?)delPathogenic

SpliceAI

528 predictions. Top by Δscore:

VariantEffectΔscore
22:29059181:CT:Cacceptor_gain0.9900
22:29060442:ATGC:Adonor_gain0.9900
22:29059188:C:CTacceptor_gain0.9700
22:29060486:T:TAdonor_gain0.9700
22:29060844:C:CCacceptor_gain0.9700
22:29061133:T:TAdonor_gain0.9700
22:29060427:T:TAdonor_gain0.9600
22:29060438:T:TAdonor_gain0.9600
22:29060659:T:TAdonor_gain0.9600
22:29060842:TG:Tacceptor_gain0.9600
22:29059183:C:CCacceptor_gain0.9500
22:29059189:A:Tacceptor_gain0.9500
22:29060660:C:Adonor_gain0.9500
22:29060839:GGGTG:Gacceptor_gain0.9500
22:29060841:GTG:Gacceptor_gain0.9500
22:29060705:TTGC:Tdonor_gain0.9400
22:29060840:GGTGC:Gacceptor_loss0.9400
22:29060842:TGCTA:Tacceptor_loss0.9400
22:29060843:GC:Gacceptor_loss0.9400
22:29060844:CT:Cacceptor_loss0.9400
22:29060845:T:Aacceptor_loss0.9400
22:29060654:CAA:Cdonor_gain0.9300
22:29060840:GGTG:Gacceptor_gain0.9300
22:29061137:G:Adonor_gain0.9000
22:29060850:T:TCacceptor_gain0.8900
22:29060850:T:Cacceptor_gain0.8600
22:29060988:T:Adonor_gain0.8600
22:29059188:C:Tacceptor_gain0.8500
22:29060416:T:TAdonor_gain0.8500
22:29060460:G:Adonor_gain0.8300

AlphaMissense

1884 scored. Top likely-pathogenic:

VariantProtein changeam_pathogenicity
22:29058883:T:AK244N0.987
22:29058883:T:GK244N0.987
22:29058884:T:AK244I0.984
22:29060663:A:GW62R0.960
22:29060663:A:TW62R0.960
22:29060661:C:AW62C0.956
22:29060661:C:GW62C0.956
22:29058896:C:TG240D0.947
22:29058885:T:CK244E0.940
22:29058896:C:AG240V0.939
22:29058884:T:GK244T0.938
22:29059001:A:GI205T0.937
22:29058966:A:CY217D0.936
22:29058887:A:TI243N0.931
22:29058907:G:CS236R0.930
22:29058907:G:TS236R0.930
22:29058909:T:GS236R0.930
22:29059001:A:TI205N0.924
22:29058978:A:CY213D0.920
22:29059001:A:CI205S0.918
22:29058997:A:CH206Q0.913
22:29058997:A:TH206Q0.913
22:29058897:C:GG240R0.911
22:29058887:A:CI243S0.909
22:29058912:A:GY235H0.904
22:29058999:G:CH206D0.902
22:29058992:A:GL208P0.901
22:29060613:C:AK78N0.895
22:29060613:C:GK78N0.895
22:29058867:C:GA250P0.893

dbSNP variants (sampled 300 via entrez): RS1000069822 (22:29064907 G>A), RS1000934495 (22:29064930 T>C), RS1000983241 (22:29064645 C>A), RS1001102414 (22:29063296 G>A,T), RS1001239381 (22:29072819 G>C), RS1001286126 (22:29070848 G>A), RS1001526736 (22:29059368 A>G), RS1001630944 (22:29060232 A>G), RS1001703971 (22:29066378 G>A), RS1001808727 (22:29065547 C>T), RS1002079857 (22:29059790 A>C,G), RS1002291589 (22:29071256 G>C), RS1002392117 (22:29064582 C>T), RS1002624883 (22:29071546 G>A,C), RS1002950566 (22:29074907 A>G)

Disease associations

OMIM: gene `` | disease phenotypes: MIM:114480

GenCC curated gene-disease

Mondo (1): hereditary breast carcinoma (MONDO:0016419)

Orphanet (1): Hereditary breast cancer (Orphanet:227535)

HPO phenotypes

0 total (0 of 0 shown, HPO-id order):

GWAS associations

4 associations (top):

StudyTraitp-value
GCST002553_2Pancreatic cancer1.000000e-08
GCST008395_14End-stage kidney disease2.000000e-07
GCST010658_22High density lipoprotein cholesterol levels7.000000e-08
GCST010662_18Systolic blood pressure2.000000e-08

EFO canonical traits (2, from GWAS)

EFO IDTrait name
EFO:0004612high density lipoprotein cholesterol measurement
EFO:0006335systolic blood pressure

MeSH disease descriptors (1)

DescriptorNameTree numbers
C562840Breast Cancer, Familial (supp.)

Drugs & pharmacology

Drug and pharmacology data

Is drug target: no

PharmGKB: 1 entry (VIP=true, CPIC=false)

CTD chemical–gene interactions

5 total (human), top 5 by PubMed support.

ChemicalActions (top 5)PubMed papers
butyraldehydedecreases expression1
coumarindecreases phosphorylation1
pentanaldecreases expression1
clothianidindecreases expression1
Benzo(a)pyreneaffects methylation1

Clinical trials (associated diseases)

10 trials via MONDO — disease-level, not drug-specific.

TrialPhaseStatusTitle
NCT00040222Not specifiedCOMPLETEDClinical, Genetic, Behavioral, Laboratory and Epidemiologic Characterization of Individuals and Families at High Risk of Breast/Ovarian Cancer
NCT02557776Not specifiedCOMPLETEDWritten Genetic Counseling and Mutation Analysis of BRCA1 and BRCA2 to Patients With Breast Cancer
NCT03495544Not specifiedUNKNOWNStudy Estimating Association Between Germline Mutations and PD-L1 Expression in Breast Cancer
NCT03959267Not specifiedCOMPLETEDTesting a Culturally Adapted Telephone Genetic Counseling Intervention
NCT04058418Not specifiedCOMPLETEDSpecialist Recommendation on FBC (Familial Breast Cancer) Chemoprevention Prescribing
NCT04125914Not specifiedACTIVE_NOT_RECRUITINGWeight Management and Health Behavior Intervention in Lowering Cancer Risk for BRCA Positive and Lynch Syndrome Families
NCT04169542Not specifiedRECRUITINGImpact of COVID-19 Pandemic on Out-of-Pocket Costs, Lost Wages, and Unemployment in Patients With Breast Cancer Undergoing Breast Surgery
NCT04197856Not specifiedACTIVE_NOT_RECRUITINGDirect Information to At-risk Relatives
NCT07292246Not specifiedRECRUITINGA Prospective CohorT Study of HandX - Assisted ENdoscopic MAstectomy: Feasibility and Safety (ATHENA I Study)
NCT07307664Not specifiedRECRUITINGIncreasing Germline Genetic Testing for Patients With Cancer
  • Disease cohort memberships (association, not causation — diseases whose associated-gene cohort lists this gene; a subset are also under Associated diseases): hereditary breast carcinoma