A Comprehensive Biomedical Knowledge Base

BioAtlas

490+ million rows integrating genetics, expression, perturbations, drugs, and disease into one harmonized, queryable PostgreSQL database. This is the same core data layer we use internally at VivaMed.

Research Use Only: BioAtlas is not intended for diagnosis, treatment decisions, or clinical care. Always consult qualified healthcare professionals for medical advice.

FREE
CC BY-SA 3.0
490M+
Rows of Data
40+
Sources Integrated
80M
Colocalizations
79
Core Tables

Solving the Biomedical Data Integration Crisis

Modern biomedical research produces incredible datasets, but they're isolated silos:

  • GWAS Catalog has genetics, but no gene expression
  • LINCS has drug perturbations, but no genetic evidence
  • ChEMBL has drugs, but no patient data or genomics
  • Open Targets connects some pieces, but missing perturbation data

The result? Researchers spend months manually linking databases just to ask: "Which drugs target genetically validated disease genes AND have good safety profiles?"

BioAtlas Solves This. One PostgreSQL database. 40+ sources integrated. 490M+ rows. Unified IDs. Harmonized coordinates. Cross-platform normalized.

Complete Coverage: Drugs, Diseases, Pathways, Mechanisms

BioAtlas provides EVERYTHING about drugs, diseases, pathways, and mechanisms. Competitors give you fragments.

Drug Coverage (COMPLETE)

What You NeedBioAtlasClarivateSTRINGOpen Targets
Drug Information10,380 drugsLimitedNoneLimited
Drug-Target24,987 (11 sources)YesNoneYes
Binding Affinities1.35M (pChEMBL unified)LimitedNoneSome
Adverse Events1.45M (FDA FAERS)YesNoneLimited
Drug Perturbations720K LINCS + 1.5M TahoeLimitedNoneNone
Drug Screening8.6M combo + 3.8M PRISMNoneNoneNone
Activity Scores204M TF/pathwayNoneNoneNone

Disease Coverage (COMPLETE)

What You NeedBioAtlasClarivateSTRINGOpen Targets
Disease Ontology30,259 (MONDO)YesNoneYes
Gene-Disease1.14M associationsYesNoneYes
Disease-Phenotype272,246 (HPO)LimitedNoneYes
Colocalization80M causal genesNoneNoneSome

Pathway & Mechanism Coverage (COMPLETE)

What You NeedBioAtlasClarivateSTRINGOpen Targets
Pathway Definitions2,781 (Reactome)YesNoneYes
TF Regulation31,953 edges (DoRothEA)LimitedNoneLimited
Ligand-Receptor20.9M pairsLimitedNoneSome
TF/Pathway Activities204M scoresNoneNoneNone

Integration & Normalization (UNIQUE)

FeatureBioAtlasEveryone Else
All-in-One DatabaseSQL joins across 40+ sourcesSeparate downloads
ID HarmonizationEnsembl↔ChEMBL↔MONDOManual mapping
Colocalization80M precomputedCompute yourself
Local SQL AccessFull accessAPIs/Web only
PriceFREE$100K-$200K/year

The Bottom Line

For DRUGS

Targets (25K), affinities (1.35M), indications (47K), safety (1.45M), perturbations (2.2M), screening (12M) — ALL in one place

For DISEASES

Ontology (30K), genetics (1.14M associations), phenotypes (272K), variants (299K), colocalization (80M) — COMPLETE coverage

For PATHWAYS

Definitions (2.7K), members (137K), footprints (253K), TF regulation (32K), activities (204M) — FULL mechanism knowledge

For INTEGRATION

BioAtlas is the ONLY one that connects all of these in one queryable database with advanced normalization

Competitors have 1-2 of these. BioAtlas has ALL.

Quick Start: Installation Guide

Prerequisites

  • • PostgreSQL 14+ installed
  • • ~30 GB free disk space
  • • ~8 GB RAM minimum

1. Download Files (Total ~26 GB)

# Core Knowledge Graph (14.2 GB)
huggingface-cli download vivamed/Bio-Atlas bioatlas_public_v1.0.dump

# LINCS Activity Scores (5.1 GB)
huggingface-cli download vivamed/Bio-Atlas bio_kg_v1.0.dump

# Colocalization Data (6.5 GB)
huggingface-cli download vivamed/Bio-Atlas coloc_bayesian.dump

2. Load Database

# Create database
createdb bioatlas

# Load Core Tables (~15 mins)
psql -d bioatlas -f bioatlas_public_v1.0.dump

# Load LINCS Activities (~10 mins)
pg_restore -d bioatlas bio_kg_v1.0.dump

# Load Colocalization (~10 mins)
pg_restore -d bioatlas coloc_bayesian.dump

3. Verify Installation

psql -d bioatlas -c "\dt"  -- Should see 79 tables
psql -d bioatlas -c "SELECT COUNT(*) FROM drug;"  -- 10,380
psql -d bioatlas -c "SELECT COUNT(*) FROM l1000_activity;"  -- 202,282,258

Ready to Transform Your Research?

Download BioAtlas today and query across genetics, drugs, diseases, and pathways in seconds — not months.