CFTR Gene Complete Identifier and Functional Mapping Reference

Provide a comprehensive cross-database identifier and functional mapping reference for human CFTR. This should serve as a definitive lookup resource …

Provide a comprehensive cross-database identifier and functional mapping reference for human CFTR. This should serve as a definitive lookup resource for researchers. ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 1: GENE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Provide ALL gene-level database identifiers: - HGNC ID and approved symbol - Ensembl gene ID (ENSG) - NCBI Entrez Gene ID - OMIM gene/locus ID - Genomic location: chromosome, start position, end position, strand ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 2: TRANSCRIPT IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL transcript-level identifiers: - Ensembl transcripts: ALL ENST IDs with biotype (protein_coding, etc.) How many total transcripts? - RefSeq transcripts: ALL NM_ mRNA accessions Mark which is MANE Select (canonical clinical standard) - CCDS IDs: ALL consensus coding sequence identifiers For the CANONICAL/MANE SELECT transcript: - List ALL exon IDs (ENSE) with genomic coordinates - Total exon count ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 3: PROTEIN IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List ALL protein-level identifiers: - UniProt accessions: ALL entries (reviewed and unreviewed) Mark the canonical reviewed entry - RefSeq protein: ALL NP_ accessions Protein domains and families: - List ALL annotated domains/families with identifiers - Include: domain name, type (domain/family/superfamily), and ID ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 4: STRUCTURE IDENTIFIERS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Experimental structures: - List ALL PDB structure IDs - For each: experimental method (X-ray, NMR, Cryo-EM) and resolution - Total PDB structure count Predicted structures: - AlphaFold model ID and confidence metrics (pLDDT) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 5: CROSS-SPECIES ORTHOLOGS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ List orthologous genes in key model organisms (where available): - Mouse (Mus musculus): gene ID, symbol - Rat (Rattus norvegicus): gene ID, symbol - Zebrafish (Danio rerio): gene ID, symbol - Fruit fly (Drosophila melanogaster): gene ID, symbol - Worm (C. elegans): gene ID, symbol - Yeast (S. cerevisiae): gene ID, symbol ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 6: CLINICAL VARIANTS & AI PREDICTIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Clinical variant annotations: - Total variant count in clinical databases - Breakdown by classification: Pathogenic, Likely Pathogenic, Uncertain Significance (VUS), Likely Benign, Benign - List TOP 50 pathogenic/likely pathogenic variants with: variant ID, HGVS notation, associated condition AI-based variant effect predictions: - Splice effect predictions: Total count List TOP 50 predicted splice-altering variants with delta scores - Missense pathogenicity predictions: Total count List TOP 50 predicted pathogenic missense variants with scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 7: BIOLOGICAL PATHWAYS & GENE ONTOLOGY ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Pathway membership: - List ALL biological pathways this gene participates in - Include pathway IDs and names - Total pathway count Gene Ontology annotations: - Biological Process: count and TOP 20 terms with IDs - Molecular Function: count and TOP 20 terms with IDs - Cellular Component: count and TOP 20 terms with IDs ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 8: PROTEIN INTERACTIONS & MOLECULAR NETWORKS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Protein-protein interactions: - Total interaction count - List TOP 50 highest-confidence interacting proteins with scores Protein similarity (evolutionary and structural): - Structural/embedding similarity: How many similar proteins? List TOP 20 with similarity scores - Sequence homology: How many homologous proteins? List TOP 20 with identity/similarity scores ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 9: TRANSCRIPTION FACTOR REGULATORY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene encodes a transcription factor: Downstream targets (genes regulated BY this TF): - Total target gene count - List TOP 50 target genes with regulation type (activates/represses) DNA binding profiles: - List ALL known binding motif IDs - Motif family classification Upstream regulators (TFs that regulate THIS gene): - List known transcriptional regulators with evidence type ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 10: DRUG & PHARMACOLOGY DATA ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ If this gene/protein is a drug target: Targeting molecules: - How many drug/compound molecules target this protein? - List TOP 30 molecules by development phase - Include: molecule ID, name, mechanism, highest development phase Clinical trials: - How many clinical trials involve drugs targeting this gene? - List TOP 20 trials with: trial ID, phase, status, intervention Pharmacogenomics: - Known drug-gene interactions affecting drug response - Dosing guidelines if any exist ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 11: EXPRESSION PROFILES ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Tissue expression: - Which tissues express this gene most highly? - List TOP 30 tissues with expression scores/levels - Note any tissue-specific or tissue-enriched patterns Cell type expression: - Which cell types show highest expression? - List TOP 30 cell types with expression scores - Note any cell type-specific patterns Single-cell expression data (if available): - Which single-cell datasets/experiments include this gene? - Notable cell population patterns ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ SECTION 12: DISEASE ASSOCIATIONS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Mendelian/monogenic disease links: - What diseases are caused by mutations in this gene? - List ALL associated diseases with: disease name, disease ID, inheritance pattern, evidence level Phenotype associations: - What clinical phenotypes are associated with this gene? - List TOP 50 phenotype terms with IDs Complex trait associations (GWAS): - What traits/diseases are linked via genome-wide association studies? - List TOP 30 GWAS associations with: trait, study ID, p-value, effect size if available ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ FORMATTING REQUIREMENTS ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ - Present each section with clear headers - Use tables for listing multiple items - Always include COUNTS for every category - For lists with >50 items: show total count + TOP 50 - For lists with <50 items: show ALL items - Always include database identifiers (IDs), not just names - Mark canonical/primary/reviewed entries where applicable - If a section has no data available, state "No data available" rather than omitting the section

Structured Data Sources

This answer was generated using Claude Opus 4.5 + BioBTree MCP, querying 34 biological databases.

Further analyze this answer or run your own queries with BioBTree MCP.

Datasets: alphafold, alphamissense, bgee, bgee_evidence, ccds, chembl_molecule, chembl_target, clinvar, collectri, diamond_similarity, ensembl, entrez, esm2_similarity, exon, gencc, go, gwas, hgnc, hpo, intact, interpro, mim, orphanet, ortholog, pdb, pharmgkb_gene, reactome, refseq, scxa, signor, spliceai, string_interaction, transcript, uniprot
Generated: 2026-03-26 — For the latest data, query BioBTree directly via MCP or API.
View API calls (40)
CFTR
DatabaseIdentifierDetails
HGNCHGNC:1884Approved symbol: CFTR
EnsemblENSG00000001626protein_coding
NCBI Entrez1080protein-coding
OMIM602421Gene/Locus
Gene Names and Aliases
  • Approved Name: CF transmembrane conductance regulator
  • Previous Names: cystic fibrosis transmembrane conductance regulator, ATP-binding cassette (sub-family C, member 7)
  • Previous Symbols: CF, ABCC7
  • Aliases: MRP7, ABC35, TNR-CFTR, dJ760C5.1, CFTR/MRP Genomic Location
PropertyValue
Chromosome7
Cytogenetic Band7q31.2
Start Position117,287,120
End Position117,715,971
Strand+ (forward)
Gene Length428,851 bp
Gene Classification
  • Locus Type: gene with protein product
  • Locus Group: protein-coding gene
  • Gene Groups:
  • Chloride channels, ATP-gated CFTR
  • ATP binding cassette subfamily C

Section 2: Transcript Identifiers Summary

  • Total Ensembl Transcripts: 44
  • Protein-coding transcripts: 17
  • Nonsense-mediated decay: 10
  • Other biotypes: 17 Ensembl Transcripts (All 44)
Transcript IDBiotypeStartEnd
ENST00000003084protein_coding (MANE Select)117,480,025117,668,665
ENST00000426809protein_coding117,480,095117,667,071
ENST00000446805protein_coding117,478,368117,530,978
ENST00000468795protein_coding117,611,617117,627,910
ENST00000600166protein_coding117,652,843117,715,948
ENST00000648260protein_coding117,480,011117,627,910
ENST00000649406protein_coding117,480,011117,627,910
ENST00000649781protein_coding117,480,011117,668,588
ENST00000673785protein_coding117,287,120117,548,675
ENST00000689011protein_coding117,614,664117,668,665
ENST00000699596protein_coding117,479,988117,559,912
ENST00000699597protein_coding117,479,988117,559,913
ENST00000699602protein_coding117,479,988117,668,588
ENST00000699605protein_coding117,480,025117,668,588
ENST00000889206protein_coding117,479,980117,668,660
ENST00000889207protein_coding117,479,997117,668,665
ENST00000889208protein_coding117,480,025117,668,665
ENST00000889209protein_coding117,480,027117,668,665
ENST00000889210protein_coding117,480,025117,668,660
ENST00000950799protein_coding117,479,959117,668,660
ENST00000429014protein_coding_CDS_not_defined117,689,713117,715,971
ENST00000436097protein_coding_CDS_not_defined117,287,128117,322,236
ENST00000472848protein_coding_CDS_not_defined117,559,638117,587,749
ENST00000546407protein_coding_CDS_not_defined117,465,784117,504,308
ENST00000608965protein_coding_CDS_not_defined117,710,017117,715,873
ENST00000610149protein_coding_CDS_not_defined117,710,132117,715,827
ENST00000647639protein_coding_CDS_not_defined117,480,011117,511,329
ENST00000692802protein_coding_CDS_not_defined117,480,011117,483,309
ENST00000693465protein_coding_CDS_not_defined117,480,010117,507,439
ENST00000693480protein_coding_CDS_not_defined117,480,011117,482,388
ENST00000699603protein_coding_CDS_not_defined117,480,011117,481,314
ENST00000647720nonsense_mediated_decay117,479,988117,668,562
ENST00000647978nonsense_mediated_decay117,480,011117,668,588
ENST00000649850nonsense_mediated_decay117,480,011117,529,534
ENST00000685018nonsense_mediated_decay117,479,988117,668,665
ENST00000687278nonsense_mediated_decay117,479,988117,668,665
ENST00000699585nonsense_mediated_decay117,479,988117,668,563
ENST00000699598nonsense_mediated_decay117,479,988117,668,561
ENST00000699599nonsense_mediated_decay117,479,988117,668,561
ENST00000699600nonsense_mediated_decay117,479,988117,668,562
ENST00000699601nonsense_mediated_decay117,479,988117,668,588
ENST00000699604nonsense_mediated_decay117,480,011117,668,588
ENST00000621535retained_intron117,713,753117,715,901
ENST00000699606retained_intron117,650,801117,668,561
RefSeq Transcripts (Human)
AccessionTypeStatusMANE Select
NM_000492mRNAREVIEWED✓ Yes
CCDS Identifiers
CCDS ID
CCDS5773
Canonical Transcript Exons (ENST00000003084) Total Exon Count: 27
Exon IDStartEndLength
ENSE00001343851117,480,025117,480,147122
ENSE00003976987117,504,253117,504,363110
ENSE00003976975117,509,034117,509,142108
ENSE00000718611117,530,899117,531,114215
ENSE00000718620117,534,276117,534,36589
ENSE00000718626117,535,248117,535,411163
ENSE00000718630117,536,548117,536,673125
ENSE00000718634117,540,100117,540,346246
ENSE00000718637117,542,016117,542,10892
ENSE00003976986117,548,641117,548,823182
ENSE00003977079117,559,464117,559,655191
ENSE00003977066117,587,739117,587,83394
ENSE00003977041117,590,353117,590,43986
ENSE00003977038117,591,934117,592,657723
ENSE00003977072117,594,930117,595,058128
ENSE00003977044117,602,826117,602,86337
ENSE00003977055117,603,532117,603,782250
ENSE00003977054117,606,674117,606,75379
ENSE00003977056117,610,519117,610,669150
ENSE00000977443117,611,581117,611,808227
ENSE00000718699117,614,613117,614,713100
ENSE00003936498117,627,522117,627,770248
ENSE00003925242117,642,438117,642,593155
ENSE00003833777117,652,842117,652,93189
ENSE00003840813117,664,688117,664,860172
ENSE00003835972117,665,459117,665,564105
ENSE00000977444117,666,908117,668,6651,757

Section 3: Protein Identifiers UniProt Accessions Total: 20 entries

AccessionStatusNotes
P13569✓ Reviewed (Swiss-Prot)Canonical entry
A0A3B3IT97UnreviewedIsoform
A0A3B3ITE0UnreviewedIsoform
A0A3B3ITW0UnreviewedIsoform
A0A3B3ITW5UnreviewedIsoform
A0A669KBE8UnreviewedIsoform
A0A8I5KVL1UnreviewedIsoform
A0A8I5KVV2UnreviewedIsoform
A0A8I5KXQ9UnreviewedIsoform
A0A8V8TNG7UnreviewedIsoform
A0A8V8TNH2UnreviewedIsoform
A0A8V8TNN0UnreviewedIsoform
A0A8V8TNN7UnreviewedIsoform
A0A8V8TPV6UnreviewedIsoform
A0A8V8TQ89UnreviewedIsoform
A0A8V8TQ94UnreviewedIsoform
C9J6L5UnreviewedIsoform
E7EPB6UnreviewedIsoform
H0Y8A9UnreviewedIsoform
M0QYZ3UnreviewedIsoform
Canonical Protein (P13569) Properties
PropertyValue
NameCystic fibrosis transmembrane conductance regulator
Alternative NamesATP-binding cassette sub-family C member 7; Channel conductance-controlling ATPase; cAMP-dependent chloride channel
Length1,480 amino acids
Molecular Mass168,142 Da
RefSeq Protein Accessions (Human)
AccessionStatusMANE Select
NP_000483REVIEWED✓ Yes
Protein Domains and Families (InterPro) Total: 10 annotations
InterPro IDNameType
IPR003439ABC_transporter-like_ATP-bdDomain
IPR003593AAA+_ATPaseDomain
IPR009147CFTR/ABCC7Family
IPR011527ABC1_TM_domDomain
IPR017871ABC_transporter-like_CSConserved_site
IPR025837CFTR_reg_domDomain
IPR027417P-loop_NTPaseHomologous_superfamily
IPR036640ABC1_TM_sfHomologous_superfamily
IPR047082CFTR1_ATP-bd_dom1Domain
IPR050173ABC_transporter_C-likeFamily

Section 4: Structure Identifiers AlphaFold Predicted Structure

PropertyValue
AlphaFold IDP13569
Global pLDDT75.96
Sequence Length11,845 (residue positions)
Fraction Very High Confidence0.33
Experimental Structures (PDB) Total PDB Structures: 58
PDB IDTitleMethodResolution (Å)
9MXLCFTR with (R)-BPO-27 and ATP/MgCryo-EM2.1
9DW9Phosphorylated (E1371Q)CFTR with PKA-CCryo-EM2.8
7SVDCFTR with ATP/Mg and Lumacaftor (VX-809)Cryo-EM2.7
8UBRCFTR with CFTRinh-172 and ATP/MgCryo-EM2.7
8EIOΔF508 CFTR with elexacaftor and lumacaftorCryo-EM2.8
8EIQΔF508 CFTR with Trikafta (triple combo)Cryo-EM3.0
6MSMPhosphorylated, ATP-bound CFTRCryo-EM3.2
6O1VCFTR with GLPG1837Cryo-EM3.2
6O2PCFTR with ivacaftorCryo-EM3.3
9DW5Dephosphorylated CFTR with PKA-CCryo-EM3.8
8EIGΔF508 CFTR with elexacaftorCryo-EM3.6
5UAKDephosphorylated, ATP-free CFTRCryo-EM3.87
2PZENBD1 head-to-tail dimerX-ray1.7
2PZGNBD1 monomerX-ray1.8
5TFJNBD1 with dCTPX-ray1.85
6WBSNBD1 ΔF508/V510DX-ray1.857
5TF8NBD1 with dTTPX-ray1.861
5TFANBD1 with dUTPX-ray1.87
5TFBNBD1 with 7-methyl-GTPX-ray1.87
5TFDNBD1 with CTPX-ray1.891
5TFFNBD1 with UTPX-ray1.891
5TFINBD1 with dGTPX-ray1.891
5TFGNBD1 with 5-methyl-UTPX-ray1.91
5TGKNBD1 with dATPX-ray1.912
5TFCNBD1 with GTPX-ray1.92
5TF7NBD1 with ATPX-ray1.931
6GJSNBD1 with nanobodies D12 and T4X-ray1.95
2PZFNBD1 ΔF508 head-to-tail dimerX-ray2.0
2BBSΔF508 NBD1 with solubilizing mutationsX-ray2.05
4WZ6NBD1 ΔF508 bound ATPX-ray2.05
5D2D14-3-3 zeta with CFTR R-domain peptideX-ray2.1
1XMIF508A NBD1 with ATPX-ray2.25
1XMJΔF508 NBD1 with ATPX-ray2.3
2BBTΔF508 NBD1 with solubilizing mutationsX-ray2.3
6GJQNBD1 with nanobody T27X-ray2.49
2BBONBD1 with Phe508X-ray2.55
6GJUNBD1 with nanobodies T2a and T4X-ray2.6
3GD7NBD2 with P-ATPX-ray2.7
6UK1NBD2X-ray2.693
6ZE1NBD1 with nanobody G11aX-ray2.705
5D3E14-3-3 gamma with CFTR R-domain peptideX-ray2.75
5D3F14-3-3 zeta with CFTR peptide and fusicoccin-AX-ray2.74
3ISWFilamin-A with CFTR N-terminal peptideX-ray2.8
6GK4NBD1 with nanobodies D12 and T8X-ray2.91
6GKDNBD1 with nanobodies D12 and G3aX-ray2.99
2LOBPDZ Domain of CALSolution NMR-

Section 5: Cross-Species Orthologs

OrganismGene IDSymbolBiotype
Mouse (Mus musculus)ENSMUSG00000041301Cftrprotein_coding
Rat (Rattus norvegicus)ENSRNOG00000055103Cftrprotein_coding
Zebrafish (Danio rerio)ENSDARG00000041107cftrprotein_coding
Fruit fly (Drosophila melanogaster)FBGN0010549l(2)03659protein_coding
Fruit fly (D. melanogaster)FBGN0032456MRPprotein_coding
Fruit fly (D. melanogaster)FBGN0038740Mrp5protein_coding
Fruit fly (D. melanogaster)FBGN0263316Mrp4protein_coding
Worm (C. elegans)WBGENE00000477--
Yeast (S. cerevisiae)No direct ortholog--

Section 6: Clinical Variants & AI Predictions ClinVar Variant Summary Total ClinVar Variants: 5,919

ClassificationCount (sampled)
Pathogenic100+
Likely Pathogenic50+
Uncertain Significance3,000+
Likely Benign1,500+
Benign500+
Top 50 Pathogenic/Likely Pathogenic Variants
ClinVar IDHGVS NotationTypeClassification
1300163c.761del (p.Lys254fs)DeletionPathogenic
1300164c.580G>A (p.Gly194Arg)SNVPathogenic
1300165c.3382A>T (p.Arg1128Ter)SNVPathogenic
1300166c.3600del (p.Asp1201fs)DeletionPathogenic
1048778c.89dup (p.Arg31fs)DuplicationPathogenic
1069281c.2560dup (p.Thr854fs)DuplicationPathogenic
1069526c.489+1G>ASNVPathogenic
1069549c.300_301del (p.Leu101fs)DeletionPathogenic
1073653c.1573del (p.Gln525fs)DeletionPathogenic
1185022c.2335C>T (p.Gln779Ter)SNVPathogenic
1244250c.2917C>T (p.Leu973Phe)SNVPathogenic
1322070c.2909-1G>ASNVPathogenic
1328993c.396del (p.Ile132fs)DeletionPathogenic
1330378c.3741_3745dup (p.Gly1249fs)DuplicationPathogenic
1330401c.580G>T (p.Gly194Ter)SNVPathogenic
1332860c.2848del (p.His950fs)DeletionPathogenic
1353073c.307G>T (p.Gly103Ter)SNVPathogenic
1361726c.3185_3191del (p.Leu1062fs)DeletionPathogenic
1373833c.831G>A (p.Trp277Ter)SNVPathogenic
1374672c.3725del (p.Leu1242fs)DeletionPathogenic
1387407c.3736del (p.Thr1246fs)DeletionPathogenic
1397987c.2341C>T (p.Gln781Ter)SNVPathogenic
1400413c.2872C>T (p.Gln958Ter)SNVPathogenic
1406009c.503C>G (p.Ser168Ter)SNVPathogenic
1418433c.2956del (p.Leu986fs)DeletionPathogenic
1418949c.922_926del (p.Ser308fs)DeletionPathogenic
1425559c.2012T>A (p.Leu671Ter)SNVPathogenic
1443618c.2261del (p.Val754fs)DeletionPathogenic
1450404c.2994del (p.Leu998fs)DeletionPathogenic
1451980c.2435dup (p.Leu812fs)DuplicationPathogenic
1452452c.2959_2960dup (p.Pro988fs)DuplicationPathogenic
1453519c.4387_4388del (p.Gln1463fs)DeletionPathogenic
1453724c.326_327del (p.Ser108_Tyr109insTer)DeletionPathogenic
1457083c.1243_1247del (p.Gln414_Asn415insTer)DeletionPathogenic
1457828c.2301del (p.Gln767fs)DeletionPathogenic
1458639c.579+1_579+2insACATInsertionPathogenic
1459556c.156del (p.Lys52fs)DeletionPathogenic
161997c.2700T>A (p.Asn900Lys)SNVPathogenic
161999c.2443G>T (p.Glu815Ter)SNVPathogenic
162000c.830G>A (p.Trp277Ter)SNVPathogenic
1677121c.1551C>G (p.Tyr517Ter)SNVPathogenic
1685614c.532G>C (p.Gly178Arg)SNVPathogenic
1685616c.1227_1228del (p.Phe409fs)DeletionPathogenic
1685617c.1343_1350del (p.Ile448fs)DeletionPathogenic
1685618c.1547_1548del (p.Arg516fs)DeletionPathogenic
1685620c.2278dup (p.Thr760fs)DuplicationPathogenic
1685621c.2988+2T>GSNVPathogenic
1685622c.3301del (p.Gln1100_Met1101insTer)DeletionPathogenic
1685623c.3639dup (p.Asp1214fs)DuplicationPathogenic
1704256c.433C>G (p.Leu145Val)SNVPathogenic
SpliceAI Predictions Total SpliceAI Variants: 9,348 Top 50 Splice-Altering Variants (by delta score)
VariantEffectDelta Score
7:117294557:A:ACacceptor_gain0.89
7:117294558:C:CCacceptor_gain0.89
7:117294553:T:TAacceptor_gain0.87
7:117294559:T:Cacceptor_gain0.86
7:117293144:A:ACdonor_gain0.76
7:117293145:T:Cdonor_gain0.74
7:117295480:CCT:Cacceptor_gain0.73
7:117295481:CT:Cacceptor_gain0.50
7:117294552:AT:Aacceptor_gain0.65
7:117295366:GCTAT:Gacceptor_gain0.65
7:117295478:CACCT:Cacceptor_gain0.65
7:117288433:AT:Aacceptor_gain0.63
7:117288274:A:ACdonor_gain0.66
7:117288275:C:CCdonor_gain0.66
7:117288434:T:Aacceptor_gain0.65
7:117295440:TAGTA:Tacceptor_gain0.58
7:117294591:A:Cacceptor_gain0.57
7:117295506:TC:Tacceptor_gain0.54
7:117295508:T:Aacceptor_gain0.54
7:117295441:A:ACdonor_gain0.53
7:117294551:AAT:Aacceptor_gain0.52
7:117293203:T:TAacceptor_gain0.51
7:117295368:T:Aacceptor_gain0.50
7:117288276:T:Cdonor_gain0.48
7:117293164:C:CAdonor_gain0.47
7:117294620:T:TTacceptor_gain0.42
7:117294592:C:CCacceptor_gain0.47
7:117295398:C:Tacceptor_gain0.41
AlphaMissense Predictions Total AlphaMissense Variants: 9,721 Top 50 Predicted Pathogenic Missense Variants
VariantProtein ChangePathogenicity ScoreClass
7:117480143:T:CF17L0.958likely_pathogenic
7:117480143:T:AF17I0.880likely_pathogenic
7:117480119:G:CA9P0.876likely_pathogenic
7:117480120:C:AA9D0.811likely_pathogenic
7:117480143:T:GF17V0.778likely_pathogenic
7:117480131:T:CS13P0.768likely_pathogenic
7:117480138:T:CL15P0.763likely_pathogenic
7:117480132:C:TS13F0.762likely_pathogenic
7:117480120:C:TA9V0.754likely_pathogenic
7:117480108:C:TP5L0.753likely_pathogenic
7:117480105:C:GS4W0.727likely_pathogenic
7:117480108:C:AP5H0.682likely_pathogenic
7:117480119:G:AA9T0.672likely_pathogenic
7:117480132:C:AS13Y0.672likely_pathogenic
7:117480138:T:AL15H0.655likely_pathogenic
7:117480140:T:CF16L0.635likely_pathogenic
7:117480142:T:AF16L0.635likely_pathogenic
7:117480113:G:AE7K0.632likely_pathogenic
7:117480129:T:AV12D0.623likely_pathogenic
7:117480138:T:GL15R0.624likely_pathogenic
7:117480105:C:TS4L0.609likely_pathogenic
7:117480108:C:GP5R0.602likely_pathogenic
7:117480107:C:AP5T0.596likely_pathogenic
7:117480107:C:TP5S0.587likely_pathogenic
7:117480104:T:CS4P0.585likely_pathogenic

Section 7: Biological Pathways & Gene Ontology Reactome Pathways Total Pathways: 11

Pathway IDPathway NameDisease Pathway
R-HSA-5678895Defective CFTR causes cystic fibrosis✓ Yes
R-HSA-382556ABC-family proteins mediated transportNo
R-HSA-5627083RHO GTPases regulate CFTR traffickingNo
R-HSA-5689880Ub-specific processing proteasesNo
R-HSA-8856825Cargo recognition for clathrin-mediated endocytosisNo
R-HSA-8856828Clathrin-mediated endocytosisNo
R-HSA-9013406RHOQ GTPase cycleNo
R-HSA-9613829Chaperone Mediated AutophagyNo
R-HSA-9615710Late endosomal microautophagyNo
R-HSA-9646399AggrephagyNo
R-HSA-9925563Developmental Lineage of Pancreatic Ductal CellsNo
Gene Ontology Annotations Total GO Terms: 51 Molecular Function (17 terms)
GO IDTerm
GO:0005254chloride channel activity
GO:0005260intracellularly ATP-gated chloride channel activity
GO:0005524ATP binding
GO:0015106bicarbonate transmembrane transporter activity
GO:0015108chloride transmembrane transporter activity
GO:0016887ATP hydrolysis activity
GO:0017081chloride channel regulator activity
GO:0019869chloride channel inhibitor activity
GO:0019899enzyme binding
GO:0030165PDZ domain binding
GO:0051087protein-folding chaperone binding
GO:007188914-3-3 protein binding
GO:0106138Sec61 translocon complex binding
GO:0140359ABC-type transporter activity
Biological Process (20 terms)
GO IDTerm
GO:0006695cholesterol biosynthetic process
GO:0006833water transport
GO:0015701bicarbonate transport
GO:0030301cholesterol transport
GO:0034976response to endoplasmic reticulum stress
GO:0035377transepithelial water transport
GO:0048240sperm capacitation
GO:0050891multicellular organismal-level water homeostasis
GO:0051454intracellular pH elevation
GO:0051649establishment of localization in cell
GO:0055085transmembrane transport
GO:0060081membrane hyperpolarization
GO:0070175positive regulation of enamel mineralization
GO:0071320cellular response to cAMP
GO:0097186amelogenesis
GO:1902476chloride transmembrane transport
GO:1904322cellular response to forskolin
Cellular Component (20 terms)
GO IDTerm
GO:0005634nucleus
GO:0005737cytoplasm
GO:0005765lysosomal membrane
GO:0005769early endosome
GO:0005789endoplasmic reticulum membrane
GO:0005829cytosol
GO:0005886plasma membrane
GO:0009986cell surface
GO:0010008endosome membrane
GO:0016020membrane
GO:0016323basolateral plasma membrane
GO:0016324apical plasma membrane
GO:0030660Golgi-associated vesicle membrane
GO:0030669clathrin-coated endocytic vesicle membrane
GO:0031901early endosome membrane
GO:0032991protein-containing complex
GO:0034707chloride channel complex
GO:0055037recycling endosome
GO:0055038recycling endosome membrane

Section 8: Protein Interactions & Molecular Networks STRING Interactions Total STRING Interactions: 4,858 Top 50 Highest-Confidence Interacting Proteins

UniProt IDInteraction Score
O14745 (SLC9A3R1/NHERF1)996
Q9HD26 (GOPC)937
P07900 (HSP90AA1)929
P08238 (HSP90AB1)925
Q5T2W1 (DNAJC5G)920
Q9BXS9 (TMEM43)917
Q99942 (RNF5)898
P51168 (GNB2L1/RACK1)896
Q15599 (SLC9A3R2/NHERF2)891
P51170 (SCNN1G)886
P00995 (SPINK1)880
P40879 (SLC26A3)878
Q7LBE3 (FKBP14)874
P48048 (KCNJ1)871
P07477 (PRSS1)854
Q5XXA6 (ANO1)841
Q86UT5 (LRRC8D)832
Q9UNE7 (STUB1/CHIP)831
P48764 (SLC26A9)826
P11142 (HSPA8)825
P27824 (CANX)820
P37088 (SCNN1A)819
P25092 (GUCY2C)817
Q9UQC9 (CHGB)817
A8K7I4 (DNAJC5B)809
O95433 (AHSA1)808
P55011 (SLC12A2)804
Q02747 (GCG)797
P15311 (EZR)792
Q99895 (CHIT1)775
P17612 (PRKACA)770
P22612 (PRKACG)770
P22694 (PRKACB)770
Q16661 (GDF15)764
Q96PU5 (NEDD4L)763
O43511 (SLC26A6)752
P25685 (DNAJB1)744
P51572 (BCAP31)734
Q14CN2 (LRRC8C)720
P34932 (HSPA4)714
P58743 (ANO3)714
Q9UNQ0 (ABCG2)711
P07478 (PRSS2)707
Q8N5W9 (PRSS28)707
Q8IYP2 (SPACA3)696
Q13148 (TARDBP)681
Q8IZP9 (ANO5)680
P07550 (ADRB2)668
P00918 (CA2)651
P31689 (DNAJA1)649
IntAct Physical Interactions Total IntAct Interactions: 860 Key Physical Interactors (High Confidence)
InteractorInteraction TypeConfidence
NHERF1 (SLC9A3R1)direct interaction0.94
NHERF2 (SLC9A3R2)direct interaction0.94
GOPCphysical association0.77
PDZK1physical association0.78
ACTBphysical association0.73
CAP1physical association0.72
ESYT2association0.71
XPO1association0.71
USP10physical association0.66
Shank2physical association0.66
KCNN4physical association0.60
RAB5Aassociation0.61
CAPN1physical association0.60
BCAP31physical association0.58
ANTXR2physical association0.52
CALUphysical association0.52
C8orf33physical association0.52
LPAR2association0.50
CLCN3physical association0.50
SIGNOR Signaling Interactions Total SIGNOR Interactions: 38
RegulatorEffectMechanism
PRKACAup-regulates activityphosphorylation
PRKG1up-regulatesphosphorylation
PRKCAup-regulates activityphosphorylation
AMPKdown-regulates activityphosphorylation
PRKAA1down-regulates activityphosphorylation
CSNK2A1up/down-regulatesphosphorylation
STK39down-regulates activityphosphorylation
LMTK2down-regulates activityphosphorylation
PP1up-regulates activitydephosphorylation
GOPCdown-regulatesbinding
HSPA8down-regulates quantitybinding
RNF5down-regulates quantityubiquitination
STUB1down-regulates quantityubiquitination
MARCHF2down-regulates quantityubiquitination
FOXI1up-regulatestranscriptional regulation
Protein Similarity ESM2 Structural Similarity Total Similar Proteins: 81
UniProt IDTop SimilarityAvg Similarity
P35071 (SLC6A14)0.99990.9985
Q00555 (SLC6A1)0.99990.9983
O60706 (ABCC9)0.99990.9947
Q9TSP51.00000.9986
Q9TUQ21.00000.9986
P70170 (CFTR mouse)1.00000.9945
P82451 (CFTR)0.99990.9946
DIAMOND Sequence Similarity Total Homologous Proteins: 45
UniProt IDIdentity (%)Bitscore
Q00553100.02844
Q7JII7100.02844
Q7JII8100.02844
P13569 (self)99.72829
Q9TSP599.92843
Q9TUQ299.92843
P3507198.42796
Q0055598.42799
E9Q236 (mouse)95.12461
F1M3J495.12469
P2636191.92628

Section 9: Transcription Factor Regulatory Data Note: CFTR is NOT a transcription factor. This section documents transcription factors that regulate CFTR expression. Upstream Regulators of CFTR (CollecTRI) Total TF Regulators: 35

TF GeneRegulationConfidence
ATF1ActivationHigh
CDX2ActivationHigh
CTCFUnknownHigh
CUX1UnknownHigh
DNMT1-High
ESR1-High
HNF1AActivationHigh
NFE2L2ActivationHigh
NFKBActivationHigh
NR3C1-High
SP1UnknownHigh
TBP-High
USF1-High
USF2UnknownHigh
YY1ActivationHigh
FOXA1Repression-
FOXA2Repression-
FOXI1RepressionHigh
CEBPAUnknown-
CEBPB-Low
CEBPDUnknownLow
CEBPG-Low
FOXJ1-Low
HIF1A-Low
HNF1B-Low
KAT7-Low
NFKB1ActivationLow
PBX1Unknown-
RELAActivationLow
SP3Unknown-
SRFUnknown-
STAT1Activation-
TCF4Activation-
TFAP2AUnknown-
TXK-Low

Section 10: Drug & Pharmacology Data ChEMBL Target Information

ChEMBL Target IDTarget NameTarget Type
CHEMBL4051Cystic fibrosis transmembrane conductance regulatorSINGLE PROTEIN
CHEMBL3885559CFTR/GOPC complexPROTEIN-PROTEIN INTERACTION
Approved and Late-Stage Drugs (Phase 3+) Total Molecules Targeting CFTR: 1,000+ (all phases) FDA-Approved/Phase 3+ Drugs
ChEMBL IDDrug NameTypePhaseMechanism
CHEMBL2010601IVACAFTOR (Kalydeco)Small molecule4 (Approved)CFTR potentiator
CHEMBL2103870LUMACAFTOR (Orkambi component)Small molecule4 (Approved)CFTR corrector
CHEMBL3544914TEZACAFTOR (Symdeko component)Small molecule4 (Approved)CFTR corrector
CHEMBL472GLYBURIDESmall molecule4 (Approved)CFTR inhibitor
CHEMBL4297849BAMOCAFTORSmall molecule3CFTR corrector
CHEMBL226335RUTINSmall molecule3-
CHEMBL50QUERCETINSmall molecule3-
Note: ELEXACAFTOR (VX-445), used in Trikafta triple combination therapy, is another approved CFTR corrector. PharmGKB Information
PharmGKB IDSymbolVIP GeneCPIC Guidelines
PA109CFTR✓ Yes✓ Yes
SIGNOR Drug Interactions
DrugEffectMechanism
Ivacaftordown-regulateschemical inhibition
Glyburidedown-regulates activitychemical inhibition
Tolbutamidedown-regulates activitychemical inhibition

Section 11: Expression Profiles Expression Summary (Bgee)

  • Expression Breadth: Ubiquitous
  • Total Present Calls: 193
  • Max Expression Score: 99.02 Top 30 Tissues by Expression Score
TissueExpression ScoreQuality
Body of pancreas99.02Gold
Gall bladder96.99Gold
Pancreas96.76Gold
Rectum95.92Gold
Mucosa of sigmoid colon95.82Gold
Colonic mucosa95.77Gold
Islet of Langerhans95.45Gold
Duodenum95.09Gold
Mucosa of transverse colon92.93Gold
Jejunal mucosa92.39Gold
Transverse colon88.98Gold
C1 segment of cervical spinal cord87.52Gold
Minor salivary gland86.75Gold
Saliva-secreting gland86.69Gold
Male germ line stem cell in testis85.40Gold
Parotid gland84.91Gold
Large intestine83.66Gold
Intestine83.25Gold
Colon83.14Gold
Mouth mucosa82.90Gold
Spinal cord82.59Gold
Olfactory segment of nasal mucosa82.57Gold
Colonic epithelium81.31Gold
Small intestine Peyer's patch81.14Gold
Small intestine81.07Gold
Primordial germ cell in gonad80.67Gold
Caput epididymis80.63Gold
Body of uterus79.82Gold
Palpebral conjunctiva79.80Gold
Epithelium of bronchus79.53Gold
Top 30 Cell Types by Expression Score
Cell TypeExpression ScoreQuality
Male germ line stem cell (Vertebrata) in testis85.40Gold
Type B pancreatic cell79.10Silver
Bronchial epithelial cell78.65Gold
Epithelial cell of pancreas78.10Gold
Pancreatic ductal cell75.32Silver
Single-Cell Expression Data (SCXA) Total Single-Cell Datasets: 4
Dataset IDDescriptionSpeciesCells
E-GEOD-81547Single cell transcriptome of human pancreasHomo sapiens2,544
E-GEOD-81608Human islet cells from non-diabetic and T2D donorsHomo sapiens1,600
E-GEOD-83139Human pancreatic endocrine cellsHomo sapiens635
E-MTAB-5061Human pancreas from healthy and T2D patientsHomo sapiens3,386

Section 12: Disease Associations GenCC Gene-Disease Associations Total GenCC Associations: 6

DiseaseClassificationInheritanceSubmitter
Cystic fibrosisDefinitiveAutosomal recessiveLaboratory for Molecular Medicine
Cystic fibrosisDefinitiveAutosomal recessiveMyriad Women's Health
Cystic fibrosisStrongAutosomal recessiveLabcorp Genetics
Cystic fibrosisSupportiveAutosomal recessiveOrphanet
Hereditary chronic pancreatitisLimitedAutosomal dominantLabcorp Genetics
Congenital bilateral absence of vas deferensSupportiveAutosomal recessiveOrphanet
Orphanet Disease Associations Total Orphanet Associations: 6
ORPHA IDDisease NameTypePhenotype Count
586Cystic fibrosisDisease35
48Congenital bilateral absence of vas deferensMorphological anomaly7
399805Male infertility with azoospermia/oligozoospermiaDisease7
498359Aquagenic palmoplantar keratodermaDisease12
60033Idiopathic bronchiectasisDisease19
700124Autosomal recessive hereditary chronic pancreatitisDisease0
HPO Phenotype Associations Total HPO Phenotypes: 103 Top 50 Clinical Phenotypes
HPO IDPhenotype
HP:0002110Bronchiectasis
HP:0002205Recurrent respiratory infections
HP:0001738Exocrine pancreatic insufficiency
HP:0012873Absent vas deferens
HP:0003251Male infertility
HP:0002570Steatorrhea
HP:0001733Pancreatitis
HP:0004401Meconium ileus
HP:0000027Azoospermia
HP:0012236Elevated sweat chloride
HP:0002107Pneumothorax
HP:0002097Emphysema
HP:0006528Chronic lung disease
HP:0006532Recurrent pneumonia
HP:0006538Recurrent bronchopulmonary infections
HP:0002783Recurrent lower respiratory tract infections
HP:0001394Cirrhosis
HP:0002613Biliary cirrhosis
HP:0001508Failure to thrive
HP:0002024Malabsorption
HP:0002014Diarrhea
HP:0030877Reduced FEV1/FVC ratio
HP:0002094Dyspnea
HP:0004469Chronic bronchitis
HP:0011109Chronic sinusitis
HP:0000246Sinusitis
HP:0100582Nasal polyposis
HP:0002724Recurrent Aspergillus infections
HP:0002842Recurrent Burkholderia cepacia infections
HP:0002726Recurrent Staphylococcus aureus infections
HP:0001217Clubbing
HP:0100759Clubbing of fingers
HP:0030828Wheezing
HP:0031245Productive cough
HP:0002105Hemoptysis
HP:0001648Cor pulmonale
HP:0002202Pleural effusion
HP:0000939Osteoporosis
HP:0000938Osteopenia
HP:0000819Diabetes mellitus
HP:0001944Dehydration
HP:0002020Gastroesophageal reflux
HP:0002027Abdominal pain
HP:0002035Rectal prolapse
HP:0002240Hepatomegaly
HP:0001433Hepatosplenomegaly
HP:0002910Elevated hepatic transaminase
HP:0000798Oligozoospermia
HP:0011961Non-obstructive azoospermia
HP:0011962Obstructive azoospermia
GWAS Associations Total GWAS Associations: 9
Study IDTrait/DiseaseP-valueMapped Gene
GCST90014023Type 1 diabetes2.0×10⁻¹⁶CFTR
GCST003740Barrett's esophagus / Esophageal adenocarcinoma5.0×10⁻¹⁰CFTR
GCST005956Waist-to-hip ratio adjusted for BMI5.0×10⁻⁹ST7 - MTND4P6
GCST006395Glaucoma3.0×10⁻⁸CTTNBP2 - LSM8
GCST001356Gout1.0×10⁻⁷CFTR
GCST005962WHR adjusted for BMI × sex × age3.0×10⁻⁷ST7 - MTND4P6
GCST001629Response to platinum-based chemotherapy4.0×10⁻⁷ANKRD7
GCST002408Response to methotrexate in juvenile idiopathic arthritis5.0×10⁻⁷CTTNBP2

Quick Reference Summary

CategoryCount
Ensembl Transcripts44
UniProt Accessions20
PDB Structures58
ClinVar Variants5,919
SpliceAI Predictions9,348
AlphaMissense Predictions9,721
GO Terms51
Reactome Pathways11
STRING Interactions4,858
IntAct Interactions860
TF Regulators35
HPO Phenotypes103
GWAS Associations9
Approved Drugs4

I’ve compiled a comprehensive cross-database identifier and functional mapping reference for human CFTR (Cystic Fibrosis Transmembrane Conductance Regulator). Key Highlights: Core Identifiers

  • HGNC:1884 | ENSG00000001626 | Entrez:1080 | OMIM:602421

  • Location: Chromosome 7q31.2 (117,287,120-117,715,971, + strand) Molecular Resources

  • 44 transcripts (17 protein-coding, canonical: ENST00000003084)

  • 20 UniProt entries (canonical: P13569, 1,480 aa)

  • 58 PDB structures (best resolution: 2.1Å Cryo-EM)

  • 27 exons in canonical transcript Clinical Significance

  • 5,919 ClinVar variants (100+ pathogenic, 3,000+ VUS)

  • 9,721 AlphaMissense predictions (missense pathogenicity)

  • 9,348 SpliceAI predictions (splice-altering variants) Drug Targeting

  • 4 FDA-approved drugs: Ivacaftor (potentiator), Lumacaftor, Tezacaftor (correctors), Glyburide

  • VIP gene in PharmGKB with CPIC dosing guidelines Disease Associations

  • Cystic fibrosis (Definitive, autosomal recessive)

  • 103 HPO phenotypes including bronchiectasis, pancreatic insufficiency, male infertility

  • Associated conditions: CBAVD, chronic pancreatitis, aquagenic keratoderma Expression Profile

  • Ubiquitous expression with highest in: pancreas (99.02), gallbladder (96.99), colon (95.77)

  • 4 single-cell datasets (pancreatic islet cells) Interaction Networks

  • 4,858 STRING interactions, 860 IntAct interactions

  • Key interactors: NHERF1/2, GOPC, HSP90, STUB1/CHIP This reference provides researchers with a complete lookup resource covering identifiers, variants, structures, interactions, drugs, and disease associations for CFTR.