Basecamp Research Launches Trillion Gene Atlas to Scale AI-Designed Therapeutics
The Atlas will expand known evolutionary genetic diversity by 100x, collecting novel genomic data from over 100 million new species across thousands of sites globally.
Mar. 18, 2026 at 9:25am
Basecamp Research, a frontier AI lab for biological design, has announced the launch of the Trillion Gene Atlas, a scientific initiative to generate and model biological data at the trillion-gene scale. Launched in collaboration with Anthropic, Ultima Genomics and PacBio, and powered by NVIDIA AI infrastructure, the Trillion Gene Atlas aims to expand known evolutionary genetic diversity 100-fold by collecting genomic data from more than 100 million species across thousands of sites worldwide.
Why it matters
As model size and computing power grow rapidly, diverse data has become a critical enabler of progress in AI drug development and real-world benchmarks. The Trillion Gene Atlas extends Basecamp Research's earlier work in building large-scale, high-quality genomic datasets to train AI models that can design new medicines across diseases and treatment types.
The details
The initiative, which is on the scale of the Human Genome Project, was unveiled during the Health Track at SXSW and the NVIDIA GTC conference in San Jose. Working with Anthropic, Ultima Genomics, and PacBio, and using NVIDIA AI infrastructure, Basecamp Research aims to compress more than two decades of biological data gathering and analysis into less than two years.
- The Trillion Gene Atlas was launched on March 18, 2026.
The players
Basecamp Research
A frontier AI lab for biological design that is launching the Trillion Gene Atlas.
Anthropic
A partner in the Trillion Gene Atlas initiative, working to add new capabilities for life sciences and connect its Claude AI to more scientific platforms.
Ultima Genomics
A developer of ultra-high throughput next-generation sequencing (NGS) systems that is providing sequencing technology for the Trillion Gene Atlas.
PacBio
A provider of highly accurate long-read sequencing technology that lets the Trillion Gene Atlas preserve full genomic context and resolve samples at the subspecies and strain level.
NVIDIA
The provider of accelerated computing infrastructure that will power the Trillion Gene Atlas to process vast quantities of genetic data at the petabase scale.
What they’re saying
“Today's biological AI models are trained on a narrow slice of life on Earth. The Trillion Gene Atlas expands the known genetic universe by orders of magnitude beyond what is in public databases. Training models at this scale establishes a new paradigm for programmable therapeutic design.”
— Glen Gowers, Co-founder and CEO of Basecamp Research (SXSW)
“Bigger models alone aren't enough. EDEN showed that performance in biological AI follows much steeper scaling trajectories with higher quality and fully contextualized data. The Trillion Gene Atlas extends that principle 100-fold.”
— Phil Lorenz, CTO of Basecamp Research (SXSW)
“Biology has been fundamentally data-starved when compared to other fields like language or computer vision as researchers have lacked the tools required to generate data at scale. We strongly believe that AI will have an immense impact on our understanding of biology and human health, and the UG200 Series was designed from the ground up to enable the massive datasets required for BioAI to deliver on this promise.”
— Gilad Almogy, Founder and CEO of Ultima Genomics (SXSW)
“PacBio HiFi sequencing delivers highly accurate long reads that preserve full genomic context and enables subspecies and even strain-level resolution in complex samples. HiFi data provides the reliable, information-rich foundation biological AI models need to learn from nature at scale and power initiatives like the Trillion Gene Atlas.”
— Christian Henry, President and CEO of PacBio (SXSW)
What’s next
Through parallelized data processing, automated annotation, and large-scale model training, the partners expect to compress a task that previously would have required more than 20 years of processing time to less than two years. This compression of sequencing, assembly, annotation and model training is intended to expand the performance and scope of biological foundation models across therapeutic development.
The takeaway
The Trillion Gene Atlas represents a major leap forward in the scale and diversity of biological data available to train AI systems for drug discovery and design. By expanding the known genetic universe by 100x, this initiative has the potential to unlock new frontiers in programmable therapeutics and accelerate the development of innovative medicines across a wide range of diseases.




