PTEN Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human PTEN. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human PTEN. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 34 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, biogrid_interaction, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, interpro, mim, orphanet, ortholog, paralog, pdb, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-03-27 — For the latest data, query BioBTree directly via MCP or API.
View API calls (43)
PTEN

Section 1: Gene Identifiers Core Gene Identifiers

DatabaseIdentifierDescription
HGNCHGNC:9588Approved symbol: PTEN
EnsemblENSG00000171862Human gene
NCBI Entrez5728protein-coding gene
OMIM601728Gene/locus MIM number
CCDSCCDS31238Consensus CDS
Genomic Location
AttributeValue
Chromosome10
Cytogenetic Band10q23.31
Start Position87,862,638
End Position87,971,930
Strand+ (forward)
Genome AssemblyGRCh38/hg38
Gene Length109,293 bp

Section 2: Transcript Identifiers Ensembl Transcripts Total Transcript Count: 22

Transcript IDBiotypeStartEndStrand
ENST00000371953protein_coding87,863,62587,971,930+
ENST00000472832protein_coding87,864,32987,961,309+
ENST00000688308protein_coding87,862,63887,966,885+
ENST00000693560protein_coding87,863,43887,971,930+
ENST00000700021protein_coding87,863,79587,966,885+
ENST00000700029protein_coding87,863,62587,966,885+
ENST00000713839protein_coding87,863,63387,966,885+
ENST00000686459nonsense_mediated_decay87,864,32887,966,671+
ENST00000688922nonsense_mediated_decay87,864,32987,966,885+
ENST00000700022nonsense_mediated_decay87,864,07987,966,805+
ENST00000706955nonsense_mediated_decay87,863,62587,971,930+
ENST00000710265nonsense_mediated_decay87,863,62587,971,930+
ENST00000713837nonsense_mediated_decay87,863,62587,971,930+
ENST00000416679retained_intron87,878,69287,880,427+
ENST00000462694retained_intron87,864,46887,894,326+
ENST00000498703retained_intron87,925,52387,933,487+
ENST00000688158retained_intron87,874,91987,971,930+
ENST00000700023retained_intron87,941,12087,965,530+
ENST00000700024retained_intron87,950,23487,966,870+
ENST00000700025retained_intron87,950,85787,962,450+
ENST00000700026retained_intron87,957,58287,962,052+
ENST00000487939protein_coding_CDS_not_defined87,864,44987,867,049+
RefSeq Transcripts (Human)
RefSeq IDTypeStatusMANE Select
NM_000314mRNAREVIEWEDYES (Canonical)
NM_001304717mRNAREVIEWEDNo
NM_001304718mRNAREVIEWEDNo
RefSeq Proteins (Human)
RefSeq IDTypeStatusMANE Select
NP_000305proteinREVIEWEDYES (Canonical)
NP_001291646proteinREVIEWEDNo
NP_001291647proteinREVIEWEDNo
Canonical Transcript Exons (ENST00000371953) Total Exon Count: 9
Exon IDStartEndLength
ENSE0000401172587,863,62587,864,548924 bp
ENSE0000115635187,894,02587,894,10985 bp
ENSE0000115634487,925,51387,925,55745 bp
ENSE0000359561087,931,04687,931,08944 bp
ENSE0000115633087,933,01387,933,251239 bp
ENSE0000115632787,952,11887,952,259142 bp
ENSE0000372533887,957,85387,958,019167 bp
ENSE0000115631587,960,89487,961,118225 bp
ENSE0000145654187,965,28787,971,9306,644 bp

Section 3: Protein Identifiers UniProt Accessions

UniProt IDNameStatusSpeciesLength
P60484PTENReviewed (Canonical)Human403 aa
C0HLV8PTEN upstream ORF MP31ReviewedHuman-
C0HLV9PTEN upstream ORF MP31ReviewedHuman-
O54857PtenReviewedMouse-
P60483PtenReviewedRat-
O08586PtenReviewedMouse-
Q9PUT6PtenReviewedZebrafish-
Q8T9S7PtenReviewedDrosophila-
Protein Properties (P60484)
PropertyValue
Molecular Weight47,166 Da
Sequence Length403 amino acids
EC Numbers3.1.3.16, 3.1.3.48, 3.1.3.67
Protein Domains and Families Total Domain/Family Count: 10
IDNameType
IPR000387Tyr_Pase_domDomain
IPR003595Tyr_Pase_catDomain
IPR014020Tensin_C2-domDomain
IPR016130Tyr_Pase_ASActive Site
IPR017361Bifunc_PIno_P3_Pase/Pase_PTENFamily
IPR029021Prot-tyrosine_phosphatase-likeHomologous Superfamily
IPR029023Tensin_phosphataseDomain
IPR035892C2_domain_sfHomologous Superfamily
IPR045101PTP_PTENDomain
IPR051281Dual-spec_lipid-protein_phosphFamily

Section 4: Structure Identifiers Experimental Structures (PDB) Total PDB Structure Count: 12

PDB IDTitleMethodResolution
1D5RCrystal Structure of the PTEN Tumor SuppressorX-ray2.1 Å
5BZZHuman phosphatase PTEN in reduced stateX-ray2.2 Å
8X3SHuman WDR5 in complex with PTENX-ray1.87 Å
4O1VSPOP/PTEN regulatory hubX-ray2.0 Å
7PC7SNTG1 PDZ with acetylated PTEN motifX-ray2.1 Å
5BUGPTEN oxidized by H2O2X-ray2.4 Å
5BZXPTEN with bisperoxovanadium complexX-ray2.5 Å
7JULNon-phosphorylated PTENX-ray2.53 Å
7JUKPTEN with tetra-phosphorylated tailX-ray3.15 Å
7JVXPTEN structural basis of regulationX-ray3.2 Å
7JTXPTEN multi-site phosphorylationX-ray3.23 Å
2KYLMAST2-PDZ with PTEN C-terminusNMR-
Predicted Structure (AlphaFold)
EntryGlobal pLDDTSequence LengthFraction Very High Confidence
P6048483.763,3230.70 (70%)

Section 5: Cross-Species Orthologs Orthologs from Ensembl Compara

OrganismGene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000013663Ptenprotein_coding
Rat (Rattus norvegicus)ENSRNOG00000020723Ptenprotein_coding
Zebrafish (Danio rerio)ENSDARG00000056623ptenbprotein_coding
Zebrafish (Danio rerio)ENSDARG00000071018ptenaprotein_coding
Fruit fly (D. melanogaster)FBGN0026379Ptenprotein_coding
Mouse Ortholog Details (ENSMUSG00000013663)
PropertyValue
LocationChromosome 19: 32,734,897-32,803,560
Strand+
Gene Length68,664 bp
Human Paralogs
Gene IDSymbolBiotype
ENSG00000079308TNS1protein_coding
ENSG00000136205TNS3protein_coding
ENSG00000132958TPTE2protein_coding
ENSG00000274391TPTEprotein_coding
ENSG00000122986HVCN1protein_coding
ENSG00000169758TMEM266protein_coding

Section 6: Clinical Variants & AI Predictions ClinVar Summary Total Variant Count: 4,015+

ClassificationCount
Pathogenic100+
Likely Pathogenic50+
Pathogenic/Likely Pathogenic30+
Uncertain Significance (VUS)2,000+
Likely Benign100+
Benign50+
Conflicting100+
TOP 50 Pathogenic/Likely Pathogenic Variants
ClinVar IDHGVS NotationTypeClassification
141485c.493-2A>GsplicePathogenic (expert panel)
1182096c.209+4A>GsplicePathogenic (expert panel)
142018c.389G>C (p.Arg130Pro)missensePathogenic (expert panel)
1338262c.373A>G (p.Lys125Glu)missensePathogenic (expert panel)
142261c.314G>A (p.Cys105Tyr)missensePathogenic (expert panel)
142269c.737C>T (p.Pro246Leu)missensePathogenic (expert panel)
142423c.802-2A>TsplicePathogenic (expert panel)
141654c.50_51del (p.Gln17fs)frameshiftPathogenic (expert panel)
1067089c.376G>C (p.Ala126Pro)missensePathogenic
1067224c.523G>A (p.Val175Met)missensePathogenic
1068214c.75G>C (p.Leu25Phe)missensePathogenic
1014805c.801G>C (p.Lys267Asn)missensePathogenic
1176553c.517C>G (p.Arg173Gly)missensePathogenic
1195176c.379G>T (p.Gly127Ter)nonsensePathogenic
1195871c.204C>A (p.Tyr68Ter)nonsensePathogenic
142027c.48T>A (p.Tyr16Ter)nonsensePathogenic
1069915c.184A>T (p.Lys62Ter)nonsensePathogenic
1075939c.250A>T (p.Arg84Ter)nonsensePathogenic
1074078c.304A>T (p.Lys102Ter)nonsensePathogenic
1388819c.465T>G (p.Tyr155Ter)nonsensePathogenic
142636c.493G>T (p.Gly165Ter)nonsensePathogenic
1320461c.601G>T (p.Glu201Ter)nonsensePathogenic
142640c.822G>A (p.Trp274Ter)nonsensePathogenic
1456749c.106G>T (p.Gly36Ter)nonsensePathogenic
1422662c.43A>T (p.Arg15Ter)nonsensePathogenic
1317879c.634+1G>TsplicePathogenic
1393293c.801+2T>CsplicePathogenic
1215488c.44_45del (p.Arg15fs)frameshiftPathogenic
1069314c.502_503del (p.Ile168fs)frameshiftPathogenic
1072117c.672dup (p.Tyr225fs)frameshiftPathogenic
142259c.741dup (p.Pro248fs)frameshiftPathogenic
140898c.772_773del (p.Phe258fs)frameshiftPathogenic
1372311c.641dup (p.Phe215fs)frameshiftPathogenic
1330013c.672_673insC (p.Tyr225fs)frameshiftPathogenic
1415847c.590del (p.Lys197fs)frameshiftPathogenic
1453956c.201dup (p.Tyr68fs)frameshiftPathogenic
1454161c.270del (p.Phe90fs)frameshiftPathogenic
1458563c.511dup (p.Gln171fs)frameshiftPathogenic
1458931c.1007dup (p.Tyr336Ter)frameshiftPathogenic
1379224c.143A>G (p.Asn48Ser)missensePathogenic
1198486c.511C>A (p.Gln171Lys)missensePathogenic
1457954c.3G>A (p.Met1Ile)missensePathogenic
1076438c.-541-5533dellarge delPathogenic
1067457g.(?89685260)(89685324_?)dellarge delPathogenic
1076749g.(?89685260)(89725321_?)dellarge delPathogenic
1069185g.89711945_89711946insAluAlu insertionPathogenic
1043063c.959T>C (p.Leu320Ser)missenseLikely pathogenic (expert)
1066565c.746T>G (p.Val249Gly)missenseLikely pathogenic
1066605c.80-2A>CspliceLikely pathogenic
1067478c.1026G>T (p.Lys342Asn)missenseLikely pathogenic
AI-Based Splice Predictions (SpliceAI) Total Predictions: 3,139 TOP 30 Predicted Splice-Altering Variants (Delta Score ≥0.50)
VariantGeneEffectDelta Score
10:87863244:G:CPTENdonor_gain1.00
10:87863202:AC:APTENdonor_gain0.98
10:87863207:G:APTENdonor_gain0.97
10:87863257:G:APTENdonor_gain0.97
10:87863210:T:TAPTENdonor_gain0.94
10:87863657:G:GTPTENdonor_gain0.92
10:87863243:AG:APTENdonor_gain0.91
10:87863334:C:TPTENdonor_gain0.89
10:87863333:GCAGC:GPTENdonor_gain0.88
10:87863251:AT:APTENdonor_gain0.87
10:87863669:T:GPTENdonor_gain0.87
10:87863283:C:TPTENdonor_gain0.85
10:87863354:A:AGPTENdonor_gain0.84
10:87863337:C:GPTENdonor_gain0.84
10:87863639:G:GTPTENdonor_gain0.78
10:87863197:AGCCT:APTENdonor_loss0.78
10:87863198:GCCTA:GPTENdonor_loss0.78
10:87863199:CCTA:CPTENdonor_loss0.78
10:87863200:CT:CPTENdonor_loss0.78
10:87863201:TA:TPTENdonor_loss0.78
10:87863657:G:TPTENdonor_gain0.78
10:87863243:A:ACPTENdonor_gain0.76
10:87863203:CCCTG:CPTENdonor_gain0.76
10:87863705:GGAGA:GPTENdonor_gain0.75
10:87863253:C:APTENdonor_gain0.75
10:87863396:C:GPTENdonor_gain0.75
10:87863202:A:ACPTENdonor_gain0.73
10:87863203:C:CCPTENdonor_gain0.73
10:87863204:C:GPTENdonor_loss0.73
10:87863347:C:TPTENdonor_gain0.72
AlphaMissense Pathogenicity Predictions Total Predictions: 2,714 TOP 50 Predicted Pathogenic Missense Variants
VariantAA ChangeScoreClass
10:87864506:A:GK13E1.000likely_pathogenic
10:87864507:A:TK13I1.000likely_pathogenic
10:87864508:A:CK13N1.000likely_pathogenic
10:87864513:G:CR15T1.000likely_pathogenic
10:87864513:G:TR15I1.000likely_pathogenic
10:87864514:A:CR15S1.000likely_pathogenic
10:87864495:T:AV9D1.000likely_pathogenic
10:87864506:A:CK13Q0.999likely_pathogenic
10:87864507:A:CK13T0.999likely_pathogenic
10:87864498:G:TS10I0.999likely_pathogenic
10:87864498:G:AS10N0.999likely_pathogenic
10:87864499:C:AS10R0.999likely_pathogenic
10:87864509:A:GR14G0.999likely_pathogenic
10:87864512:A:GR15G0.999likely_pathogenic
10:87864515:T:GY16D0.999likely_pathogenic
10:87864516:A:CY16S0.998likely_pathogenic
10:87864494:G:CV9L0.998likely_pathogenic
10:87864495:T:CV9A0.998likely_pathogenic
10:87864502:A:CR11S0.998likely_pathogenic
10:87864511:G:TR14S0.998likely_pathogenic
10:87864483:T:AI5N0.996likely_pathogenic
10:87864513:G:AR15K0.996likely_pathogenic
10:87864494:G:TV9F0.988likely_pathogenic
10:87864495:T:GV9G0.995likely_pathogenic
10:87864497:A:TS10C0.995likely_pathogenic
10:87864501:G:TR11I0.995likely_pathogenic
10:87864515:T:AY16N0.995likely_pathogenic
10:87864509:A:TR14W0.994likely_pathogenic
10:87864512:A:GR15G0.994likely_pathogenic
10:87864501:G:CR11T0.992likely_pathogenic
10:87864510:G:CR14T0.991likely_pathogenic
10:87864515:T:CY16H0.991likely_pathogenic
10:87864516:A:GY16C0.990likely_pathogenic
10:87864486:A:CK6T0.989likely_pathogenic
10:87864485:A:GK6E0.990likely_pathogenic
10:87864510:G:TR14M0.987likely_pathogenic
10:87864498:G:CS10T0.986likely_pathogenic
10:87864483:T:GI5S0.983likely_pathogenic
10:87864504:A:TN12I0.973likely_pathogenic
10:87864483:T:CI5T0.969likely_pathogenic
10:87864485:A:CK6Q0.959likely_pathogenic
10:87864497:A:GS10G0.954likely_pathogenic
10:87864492:T:AI8N0.936likely_pathogenic
10:87864504:A:CN12T0.822likely_pathogenic
10:87864500:A:GR11G0.816likely_pathogenic
10:87864474:C:TT2I0.845likely_pathogenic
10:87864474:C:AT2K0.778likely_pathogenic
10:87864494:G:AV9I0.760likely_pathogenic
10:87864476:G:CA3P0.750likely_pathogenic
10:87864474:C:GT2R0.706likely_pathogenic

Section 7: Biological Pathways & Gene Ontology Reactome Pathways Total Pathway Count: 12

Pathway IDPathway NameDisease Pathway
R-HSA-199418Negative regulation of the PI3K/AKT networkNo
R-HSA-1660499Synthesis of PIPs at the plasma membraneNo
R-HSA-1855204Synthesis of IP3 and IP4 in the cytosolNo
R-HSA-202424Downstream TCR signalingNo
R-HSA-5628897TP53 Regulates Metabolic GenesNo
R-HSA-5689880Ub-specific processing proteasesNo
R-HSA-5689896Ovarian tumor domain proteasesNo
R-HSA-8943723Regulation of PTEN mRNA translationNo
R-HSA-8948747Regulation of PTEN localizationNo
R-HSA-8948751Regulation of PTEN stability and activityNo
R-HSA-8986944Transcriptional Regulation by MECP2No
R-HSA-5674404PTEN Loss of Function in CancerYes
Gene Ontology Annotations Total GO Terms: 93 Molecular Function (20 terms)
GO IDTerm Name
GO:0016314phosphatidylinositol-3,4,5-trisphosphate 3-phosphatase activity
GO:0004438phosphatidylinositol-3-phosphate phosphatase activity
GO:0004721phosphoprotein phosphatase activity
GO:0004722protein serine/threonine phosphatase activity
GO:0004725protein tyrosine phosphatase activity
GO:0051800phosphatidylinositol-3,4-bisphosphate 3-phosphatase activity
GO:0030351inositol-1,3,4,5,6-pentakisphosphate 3-phosphatase activity
GO:0051717inositol-1,3,4,5-tetrakisphosphate 3-phosphatase activity
GO:0052866phosphatidylinositol phosphate phosphatase activity
GO:0008013beta-catenin binding
GO:0008289lipid binding
GO:0019899enzyme binding
GO:0010997anaphase-promoting complex binding
GO:0030165PDZ domain binding
GO:0042802identical protein binding
GO:0140678molecular function inhibitor activity
GO:1990381ubiquitin-specific protease binding
GO:1990757ubiquitin ligase activator activity
Biological Process (TOP 20 of 58 terms)
GO IDTerm Name
GO:0051898negative regulation of PI3K/AKT signaling
GO:0051896regulation of PI3K/AKT signal transduction
GO:0043491PI3K/protein kinase B signal transduction
GO:0006470protein dephosphorylation
GO:0046856phosphatidylinositol dephosphorylation
GO:0006661phosphatidylinositol biosynthetic process
GO:0006915apoptotic process
GO:0008285negative regulation of cell population proliferation
GO:0008284positive regulation of cell population proliferation
GO:0016477cell migration
GO:0030336negative regulation of cell migration
GO:1902807negative regulation of cell cycle G1/S phase transition
GO:2000134negative regulation of G1/S transition of mitotic cell cycle
GO:0010719negative regulation of EMT
GO:0033137negative regulation of peptidyl-serine phosphorylation
GO:0007417central nervous system development
GO:0007611learning or memory
GO:0007416synapse assembly
GO:0060997dendritic spine morphogenesis
GO:0097107postsynaptic density assembly
Cellular Component (15 terms)
GO IDTerm Name
GO:0005634nucleus
GO:0005654nucleoplasm
GO:0005737cytoplasm
GO:0005829cytosol
GO:0005886plasma membrane
GO:0009898cytoplasmic side of plasma membrane
GO:0016324apical plasma membrane
GO:0016605PML body
GO:0005576extracellular region
GO:0014069postsynaptic density
GO:0043005neuron projection
GO:0043197dendritic spine
GO:0042995cell projection
GO:0035749myelin sheath adaxonal region
GO:0043220Schmidt-Lanterman incisure

Section 8: Protein Interactions & Molecular Networks STRING Protein-Protein Interactions Total Interaction Count: 9,614 TOP 50 Highest-Confidence Interactors

UniProt IDGene SymbolScore
Q86UL8SASH1974
Q15831STK11971
P04637TP53969
Q70Z35NEDD4L969
P31749AKT1968
Q5TCQ9PHLPP2962
P42336PIK3CA949
P27986PIK3R1944
Q9NZM5USP13937
P01116KRAS930
P12830CDH1930
P15056BRAF928
P00533EGFR926
P42345MTOR923
Q13485SMAD4909
P42771CDKN2A908
O14745SLC9A3R1905
P23443RPS6KB1904
Q12959DLG1904
P49841GSK3B900
Q12778FOXO1895
Q13315ATM893
P49815TSC2890
P35222CTNNB1881
Q6P0Q8MAGI2878
P51587BRCA2877
Q03135CAV1872
P01106MYC871
P04626ERBB2864
P01111NRAS861
P10275AR853
Q9BT92GPRC5A853
P24385CCND1851
O15530PDPK1848
P08069IGF1R844
O43524FOXO3841
P21359NF1839
Q00987MDM2838
O75874IDH1836
P46527CDKN1B831
P37231PPARG830
Q969H0FBXW7830
P42338PIK3CB829
P12931SRC828
Q16665HIF1A827
P11802CDK4825
P01308INS823
Q14160SCRIB820
Q6R327RICTOR820
P48736PIK3CG818
BioGRID Interactions Total BioGRID Interactions: 1,319+ Key physical interactors include: WWP1, WWP2, NEDD4, SHARPIN, RNF146, CTNNB1, TNKS, TNKS2 Protein Similarity (ESM2 Embedding) Total Similar Proteins: 24
UniProtTop SimilarityAvg Similarity
O54857 (Mouse)1.0000.958
P60483 (Rat)1.0000.957
O08586 (Mouse)1.0000.957
Q13613 (MTMR3)0.9980.969
Q9Z2C4 (MTMR4)0.9980.971
Q13614 (MTMR4)0.9990.950
O60942 (MTMR3)0.9990.962
Q16667 (MTMR1)0.9970.950
Q9Z2D1 (MTMR1)0.9990.952
Sequence Homology (DIAMOND) Total Homologous Proteins: 24
UniProtTop IdentityBitscore
P60483 (Rat)100.0%828
O54857 (Mouse)99.8%825
O08586 (Mouse)99.8%827
Q6XPS396.9%814
Q99KY496.7%2504
Q9GLM490.0%3165
Q9HBL090.0%3186
Q4R6N090.2%811
Q9PUT6 (Zebrafish)88.9%713
P5618087.6%801

Section 9: Transcription Factor Regulatory Data CollecTRI Regulatory Interactions Total Regulatory Links: 66 Upstream Regulators (TFs that regulate PTEN)

TF GeneRegulationConfidence
TP53RepressionHigh
ARActivationHigh
ATF2ActivationHigh
BMI1Repression-
CTNNB1RepressionLow
DNMT1-High
EGR1ActivationHigh
EZH2UnknownHigh
GATA2UnknownHigh
GATA4Repression-
GLI2RepressionLow
HES1RepressionHigh
HNF4A-High
JUNUnknownHigh
KDM5ARepression-
KDM5CRepression-
MTA1RepressionHigh
MXD1RepressionHigh
MYCUnknownLow
NFATC1-High
NFKB1Activation-
PPARGActivationHigh
RBPJRepressionHigh
RELAActivationHigh
SALL4RepressionHigh
SMAD4ActivationHigh
SNAI1ActivationHigh
SP1UnknownHigh
STAT3ActivationHigh
TET1Activation-
TFAP2AActivationHigh
VDRActivation-
YY1ActivationHigh
Downstream Targets (Genes regulated by PTEN)
Target GeneRegulation
ABTB1Activation
DUSP1Activation
EGR2Activation
HCLS1Activation
NDRG1Activation
NFIL3Activation
PINK1Activation
RAD51-
SH2B2Activation
SIGNOR Signaling Interactions Total SIGNOR Interactions: 102 Key Regulatory Mechanisms on PTEN
RegulatorEffectMechanismDirect
CSNK2A1down-regulates activityphosphorylationYes
CSNK2A2down-regulates activityphosphorylationYes
GSK3Bdown-regulates activityphosphorylationYes
STK11down-regulates activityphosphorylationYes
PLK1down-regulates activityphosphorylationYes
PLK3down-regulates activityphosphorylationYes
INSRdown-regulates activityphosphorylationYes
PINK1down-regulates activityphosphorylationYes
SRCup-regulates activityphosphorylationYes
LCKup-regulates activityphosphorylationYes
FRKup-regulates quantityphosphorylationYes
ROCK1up-regulatesphosphorylationYes
MAST2up-regulatesphosphorylationYes
USP7down-regulates activitydeubiquitinationYes
WWP2down-regulates quantityubiquitinationYes
XIAPdown-regulates quantityubiquitinationYes
MKRN1down-regulates quantityubiquitinationYes
PREX2down-regulates activitybindingYes
Key Downstream Effects of PTEN
TargetEffectMechanism
PIP3down-regulates quantitychemical modification
AKT1down-regulates activity-
PI3Kdown-regulates activity-
PIK3CAdown-regulates activity-
PIK3CBdown-regulates activity-
PTK2 (FAK)down-regulates activitydephosphorylation
SRCdown-regulates activitydephosphorylation
IRS1down-regulates activitydephosphorylation
CREB1down-regulates activitydephosphorylation
STAT5Adown-regulatesdephosphorylation
IRF3up-regulates activitydephosphorylation
CFL1up-regulates activitydephosphorylation
MAPTup-regulates activitydephosphorylation
RAB7Aup-regulates activitydephosphorylation
GLI1down-regulates-
RPS6KB1down-regulates activity-
ARup-regulates activitydephosphorylation

Section 10: Drug & Pharmacology Data ChEMBL Targets

ChEMBL IDNameType
CHEMBL2052032PTENSINGLE PROTEIN
CHEMBL4523606SPOP/PTENPROTEIN-PROTEIN INTERACTION
Compounds Targeting PTEN
ChEMBL IDNameTypeDevelopment Phase
CHEMBL301982CELASTROLSmall molecule0 (Preclinical)
CHEMBL2000194-Small molecule0 (Preclinical)
CHEMBL2057662-Small molecule0 (Preclinical)
PharmGKB Entry
AttributeValue
PharmGKB IDPA33942
SymbolPTEN
VIP GeneYes
CPIC GuidelineNo
Chromosomechr10
Note: PTEN is a Very Important Pharmacogene (VIP) in PharmGKB, indicating its significance in drug response and pharmacogenomics research. However, no CPIC dosing guidelines currently exist.

Section 11: Expression Profiles Bgee Expression Summary

AttributeValue
Expression BreadthUbiquitous
Total Present Calls256
Max Expression Score99.31
Average Expression Score94.10
Gold Quality Conditions256
TOP 30 Tissues by Expression Level
Tissue/Cell TypeEntity IDScoreQuality
SpermCL:000001999.31Gold
Endothelial cellCL:000011599.21Gold
Calcaneal tendonUBERON:000370199.20Gold
Upper arm skinUBERON:000426399.15Gold
Epithelial cell of pancreasCL:000008399.00Gold
Kidney epitheliumUBERON:000481998.91Gold
Middle temporal gyrusUBERON:000277198.68Gold
Parietal pleuraUBERON:000240098.62Gold
Ileal mucosaUBERON:000033198.56Gold
Cardiac muscle of right atriumUBERON:000337998.40Gold
Visceral pleuraUBERON:000240198.29Gold
Gingival epitheliumUBERON:000194998.06Gold
GingivaUBERON:000182898.00Gold
Brodmann area 23UBERON:001355498.00Gold
Pancreatic ductal cellCL:000207998.00Gold
Tibialis anteriorUBERON:000138597.93Gold
Mammary gland epitheliumUBERON:000324497.87Gold
Mammary ductUBERON:000176597.87Gold
BloodUBERON:000017897.86Gold
Dorsal root ganglionUBERON:000004497.84Gold
Germinal epithelium of ovaryUBERON:000130497.84Gold
Trigeminal ganglionUBERON:000167597.76Gold
TibiaUBERON:000097997.71Gold
Stromal cell of endometriumCL:000225597.60Gold
Esophagus squamous epitheliumUBERON:000692097.59Gold
Mammalian vulvaUBERON:000099797.47Gold
Superficial temporal arteryUBERON:000161497.45Gold
PericardiumUBERON:000240797.43Gold
Trabecular bone tissueUBERON:000248397.39Gold
Thoracic mammary glandUBERON:000520097.32Gold
Single-Cell Expression Data (SCXA) Total Single-Cell Datasets: 7
Experiment IDDescriptionSpeciesCells
E-MTAB-11268Human hypertrophied heart atlasHomo sapiens64,898
E-MTAB-11121Retinal organoids from hESCsHomo sapiens22,253
E-GEOD-81608Human islet cells (non-diabetic/T2D)Homo sapiens1,600
E-ENAD-27Human islet cell signaturesHomo sapiens1,145
E-CURD-6Normal human bone marrowHomo sapiens1,024
E-GEOD-124858Bone marrow stromal cell subpopulationsHomo sapiens247
E-GEOD-110499Multiple myeloma bone marrowHomo sapiens173

Section 12: Disease Associations GenCC Disease-Gene Validity Total Disease Associations: 17

DiseaseClassificationMOIEvidence
Cowden syndrome 1 (OMIM:158350)DefinitiveADMultiple submitters
Cowden syndrome 1StrongADGenomics England
Macrocephaly-autism syndrome (OMIM:605309)StrongADMultiple submitters
Glioma susceptibility 2 (OMIM:613028)LimitedADAmbry Genetics
LeiomyosarcomaModerateARGenomics England
Renal cell carcinomaModerateADGenomics England
Bannayan-Riley-Ruvalcaba syndromeSupportiveADOrphanet
Cowden diseaseSupportiveADOrphanet
Proteus-like syndromeSupportiveADOrphanet
Activated PI3K-delta syndromeSupportiveADOrphanet
Lhermitte-Duclos diseaseSupportiveADOrphanet
Orphanet Disease Associations Total Orphanet Diseases: 14
Orphanet IDDisease NameTypeGene Count
109Bannayan-Riley-Ruvalcaba syndromeClinical subtype1
201Cowden syndromeClinical subtype9
210548Macrocephaly-intellectual disability-autism syndromeDisease2
65285Lhermitte-Duclos diseaseClinical subtype1
79076Juvenile polyposis of infancyClinical subtype2
137608Segmental outgrowth-lipomatosis-AVM-epidermal nevus syndromeClinical subtype1
2969Proteus-like syndromeClinical subtype1
145Hereditary breast and/or ovarian cancer syndromeDisease15
494547Squamous cell carcinoma of hypopharynxDisease3
494550Squamous cell carcinoma of larynxDisease3
500464SCC of nasal cavity/paranasal sinusesDisease3
500478SCC of oropharynxDisease3
502363SCC of oral cavityDisease3
502366SCC of lipDisease3
HPO Phenotype Associations Total HPO Terms: 238 TOP 50 Clinical Phenotypes
HPO IDPhenotype
HP:0000256Macrocephaly
HP:0001249Intellectual disability
HP:0000717Autism
HP:0001263Global developmental delay
HP:0000750Delayed speech and language development
HP:0001548Overgrowth
HP:0004481Progressive macrocephaly
HP:0005490Postnatal macrocephaly
HP:0000098Tall stature
HP:0001513Obesity
HP:0000238Hydrocephalus
HP:0001250Seizure
HP:0001252Hypotonia
HP:0002664Neoplasm
HP:0004390Hamartomatous polyposis
HP:0010566Hamartoma
HP:0012032Lipoma
HP:0001012Multiple lipomas
HP:0001031Subcutaneous lipoma
HP:0001028Hemangioma
HP:0001048Cavernous hemangioma
HP:0005306Capillary hemangioma
HP:0000820Abnormality of thyroid gland
HP:0000821Hypothyroidism
HP:0000836Hyperthyroidism
HP:0000853Goiter
HP:0000854Thyroid adenoma
HP:0002890Thyroid carcinoma
HP:0006731Follicular thyroid carcinoma
HP:0003002Breast carcinoma
HP:0010619Fibroadenoma of the breast
HP:0005584Renal cell carcinoma
HP:0002858Meningioma
HP:0002861Melanoma
HP:0010784Uterine neoplasm
HP:0007206Hemimegalencephaly
HP:0001382Joint hypermobility
HP:0000957Cafe-au-lait spot
HP:0007565Multiple cafe-au-lait spots
HP:0000995Melanocytic nevus
HP:0001054Numerous nevi
HP:0010816Epidermal nevus
HP:0009720Adenoma sebaceum
HP:0009721Shagreen patch
HP:0000729Autistic behavior
HP:0007018ADHD
HP:0011098Speech apraxia
HP:0002167Abnormal speech pattern
HP:0002808Kyphosis
HP:0002650Scoliosis
GWAS Associations Total GWAS Associations: 39
Study IDTraitP-value
GCST90002381Eosinophil count2.0e-30
GCST90002382Eosinophil percentage5.0e-28
GCST90002396Mean reticulocyte volume9.0e-28
GCST90002398Neutrophil count7.0e-21
GCST90002407White blood cell count8.0e-20
GCST90002403Red blood cell count3.0e-16
GCST004606Eosinophil count2.0e-15
GCST90002392Mean corpuscular volume1.0e-14
GCST90002390Mean corpuscular hemoglobin2.0e-14
GCST004600Eosinophil percentage2.0e-14
GCST010653TSH levels1.0e-13
GCST90002402Platelet count1.0e-11
GCST90013405ALT levels2.0e-11
GCST010118Type 2 diabetes4.0e-10
GCST005976Basophil count2.0e-10
GCST006899TSH levels2.0e-10
GCST90002389Lymphocyte percentage1.0e-10
GCST007096Pulse pressure2.0e-09
GCST90020025Waist-to-hip ratio (BMI adj)2.0e-09
GCST90020027Waist-hip index2.0e-09
GCST90002400Plateletcrit3.0e-09
GCST90002399Neutrophil percentage6.0e-09
GCST90011898ALT levels6.0e-09
GCST90013420Ambidextrousness2.0e-08
GCST008362Birth weight2.0e-08
GCST004346Psoriasis3.0e-08
GCST009380Type 2 diabetes (BMI adj)5.0e-08
GCST009597Multiple sclerosis1.0e-07

DATA SOURCE SUMMARY

DatabaseRecordsVersion
HGNCGene identifiersCurrent
EnsemblGene/transcript/exonGRCh38
NCBI EntrezGeneCurrent
UniProtProteinCurrent
RefSeqTranscripts/proteinsCurrent
PDBStructuresCurrent
AlphaFoldPredicted structurev2
ClinVarClinical variantsCurrent
SpliceAISplice predictionsv1.3
AlphaMissensePathogenicity predictions2023
ReactomePathwaysCurrent
Gene OntologyFunctional annotationsCurrent
STRINGProtein interactionsv12
BioGRIDPhysical interactionsCurrent
CollecTRITF-target relationsCurrent
SIGNORSignalingCurrent
ChEMBLDrug targetsv33
PharmGKBPharmacogenomicsCurrent
BgeeExpressionv15
SCXASingle-cell expressionCurrent
GenCCGene-disease validityCurrent
OrphanetRare diseasesCurrent
HPOPhenotypesCurrent
GWAS CatalogGWAS associationsCurrent

This reference document was compiled from biobtree integrated biological databases. For the most current data, please query the original databases directly.