Data sources

Sugi Atlas builds on top of BioBTree, which integrates the 70 primary biomedical databases listed below. Each row shows the source and its citing paper.

Acknowledgments

Every page in Sugi Atlas rests on the sustained work of the curators, researchers, and engineers behind the databases listed below. Their long-term commitment to open, structured, citable biomedical data is what makes Sugi Atlas possible.

Data freshness

Sugi Atlas follows BioBTree's update cycle. BioBTree's pipeline processes the full federation efficiently, so the atlas is refreshed against the latest snapshot at least monthly — and more frequently when a Sugi Atlas improvement triggers a rebuild.

Additional per-source statistics will be added here in the future.

Datasets 70

SourceReference
AlphaFoldVaradi et al., NAR 2022
AlphaMissenseCheng et al., Science 2023
Antibody dbTheraSAbDab + SAbDab + IMGT (Raybould 2020; Dunbar 2014; Giudicelli 2005)
BAOAbeyruwan et al., 2014
BgeeBastian et al., NAR 2021
BindingDBLiu et al., NAR 2025
BioGRIDOughtred et al., 2019
BRENDAHauenstein et al., NAR 2026
CellosaurusBairoch, J Biomol Tech 2018
CellPhoneDBTroule et al., 2025
ChEBIDegtyarenko et al., NAR 2008
chembl_documentZdrazil et al., NAR 2024
CIViCGriffith et al., Nat Genet 2017
CLDiehl et al., 2016
ClinGen Gene-Disease ValidityRehm et al., NEJM 2015
Clinical TrialsZarin et al., 2011
ClinVarLandrum et al., NAR 2014
CollecTRIMüller-Dott et al., 2023
CORUMTsitsiridis et al., NAR 2023
CTDDavis et al., 2025
CZ CELLxGENECZI Cell Science Program, 2025
dbSNPSherry et al., NAR 2001
DepMapTsherniak et al., Cell 2017
DIAMOND Protein SimilarityBuchfink et al., 2021
ECONadendla et al., NAR 2022
EFOMalone et al., 2010
ENCODE cCREMoore et al., Nature 2020
EnsemblDyer et al., NAR 2025
ESM2 Protein SimilarityLin et al., Science 2023
FANTOM5 PromoterNobusada et al., 2025
GenCCDiStefano et al., 2022
GeneRIFBrown et al., NAR 2015
GOAshburner et al., Nat Genet 2000
GtoPdbHarding et al., NAR (IUPHAR/BPS)
GWAS StudySollis et al., NAR 2023
HGNCSeal et al., NAR 2023
hmdbWishart et al., NAR 2022
HPOGargano et al., NAR 2024
IntActdel Toro et al., 2022
InterproBlum et al., NAR 2025
intOGenMartínez-Jiménez et al., Nat Rev Cancer 2020
JASPARRauluseviciute et al., NAR 2024
LIPID MAPSSud et al., NAR 2007
literature_mappingsNCBI PMC PMID/PMCID/DOI mapping (NCBI/NLM)
MeSHLipscomb, 2000
miRDBChen & Wang, NAR 2020
MONDOVasilevsky et al., 2025
MSigDBLiberzon et al., 2015
NCBI/Entrez GeneMaglott et al., NAR 2011
OBAStefancsik et al., Database (Oxford) 2023
OBIBandrowski et al., PLoS ONE 2016
OrphanetRath et al., 2012
PatentPapadatos et al., NAR 2016
PATOGkoutos et al., Brief Bioinform 2018
PharmGKBGong et al., 2021
Protein Data BankArmstrong et al., NAR 2020
PubChem BioActivityKim et al., NAR 2025
ReactomeMilacic et al., NAR 2024
RefSeqO’Leary et al., NAR 2016
RheaBansal et al., 2022
RNAcentralRNAcentral Consortium, NAR 2026
SC Expression AtlasPapatheodorou et al., NAR 2020
SIGNORLo Surdo et al., 2026
SpliceAIJaganathan et al., Cell 2019
STRINGSzklarczyk et al., NAR 2023
SwissLipidsAimo et al., 2015
TaxonomyFederhen, NAR 2012
UBERONMungall et al., 2012
UniprotUniProt Consortium, NAR 2025
XCOSmith et al., RGD / Mamm Genome