Cloud for AI/ML & Modern Data Science Header Image

 

Cloud for AI/ML & Modern Data Science

Design Agile, Scalable, Cost-Optimized Cloud Infrastructure

April 2 - 4, 2025 ALL TIMES EDT

Adopting and deploying cloud technologies is a critical necessity for digital transformation. With the vast amounts of data required to drive innovation, enable AI/ML capabilities, and support modern data science, cloud solutions are essential. However, navigating the numerous choices to find the best fit for your organization can be challenging. The Cloud for AI/ML & Modern Data Science track, through insightful case studies and best practices, offers guidance on selecting the ideal cloud or hybrid infrastructure and applications to advance R&D, foster collaboration and innovation, and maintain the flexibility needed to keep pace with the technological advances shaping pharmaceutical R&D.

Wednesday, April 2

8:00 amRegistration Open and Morning Coffee

9:00 amRecommended Pre-Conference Workshops and Symposia*

On Wednesday, April 2, 2025, Cambridge Healthtech Institute is pleased to offer five pre-conference Workshops scheduled across two time slots (9:00 am–12:00 pm and 1:15–4:15 pm) and three Symposia from 9:00 am–4:20 pm. All are designed to be instructional, interactive, and provide in-depth information on a specific topic. They allow for one-on-one interaction and provide a great way to explain more technical aspects that would otherwise not be covered during the main conference tracks that take place Thursday–Friday.

*Separate registration required. See details on the Symposia here and details on the Workshops here.

4:40 pm

Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

4:45 pm PLENARY KEYNOTE INTRODUCTION:Explainable AI in Drug Discovery

Kshitij Kumar, CEO & Founder, CLOVERTEX

4:55 pm PLENARY KEYNOTE PANEL DISCUSSION:

From Bytes to Breakthroughs: Next-Generation AI Driving the Future of Life Sciences and Healthcare

PANEL MODERATOR:

Abbie Celniker, PhD, Partner, Third Rock Ventures LLC

Next-Generation AI has the potential to revolutionize life sciences by delivering unprecedented insights, automation, and efficiency. But what will those industry transformations look like? This keynote panel convenes leaders from biopharma, healthcare, and emerging tech who are applying AI—generative models and beyond—to accelerate drug discovery, diagnostics, and patient care. Panelists will share real-world case studies, discuss overcoming both technical and organizational challenges, and explore how AI is evolving from predictive tools to autonomous, decision-making systems. Look beyond the hype to uncover where AI is making a tangible impact today and where the next frontiers of innovation lie.

PANELISTS:

Tala Fakhouri, PhD, MPH, Associate Director for Data Science and AI Policy, FDA (participating virtually)

Per Greisen, PhD, President, BioMap

Sofia Guerra, Vice President, Bessemer Venture Partners

Subha Madhavan, PhD, Vice President and Head, AI/ML, Quantitative and Digital Sciences, Pfizer Inc.

Sonya Makhni, MD, Medical Director, Mayo Clinic Platform

6:10 pmWelcome Reception in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)

The Bio-IT Kickoff Reception is a reunion—reconnect with friends, explore cutting-edge research, and celebrate innovation! Enjoy poster presentations, networking, and vote for the Best of Show and Poster awards.

7:25 pmClose of Day

Thursday, April 3

7:00 amRegistration and Morning Coffee

8:00 am

Organizer's Remarks

Allison Proffitt, Editorial Director, Bio-IT World and Clinical Research News

8:05 am PLENARY KEYNOTE INTRODUCTION:Build for Now & the Future: 8 Critical Pillars for Your Enterprise AI Strategy 

Jesse Cugliotta, Global Industry GTM Lead, Healthcare & Life Sciences, Snowflake, Inc.

HARNESSING AI FOR DRUG DISCOVERY: FROM INFRASTRUCTURE TO IMPLEMENTATION

8:15 am PLENARY KEYNOTE PRESENTATION:

Data and Computing Infrastructure for the Life Sciences: Best Practices, Observations, and Lessons Learned

Chris Dwan, Independent Consultant, Dwan, LLC

This talk will provide practical, real-world advice based on Dwan's quarter century of experience designing and implementing high-performance computing and large-scale data systems for health care and the life sciences. Topics will include network architectures, cloud vs. "terrestrial" infrastructure, practical data strategies, information security, quality and compliance from R&D to the clinic, differentiated computing platforms, human and organizational factors, and of course AI.

8:45 am PLENARY KEYNOTE PRESENTATION:

Generative AI, Aging Research and Robotics as a Platform for Drug Discovery: From Hype to Clinical Efficacy

Alex Zhavoronkov, PhD, Founder & CEO, Insilico Medicine

9:15 amSession Q&A

9:30 amCoffee Break in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)

Start your morning with coffee, connections, and cutting-edge research! Enjoy poster presentations, network in the Exhibit Hall, vote for awards, and a chance at a fabulous raffle prize!

10:15 amOrganizer's Welcome Remarks

SETTING UP AND SCALING AGILE DATA AND ANALYTICS ECOSYSTEMS

10:20 am

Chairperson's Remarks

Brigitte Ganter, PhD, Founder & Managing Director, enlightenbio LLC

10:25 am

Scaling AI/ML in Biotech: A Survey of Cloud Trends and Innovations

Drew Dresser, Senior Director of AI and Cloud Engineering, Flagship Pioneering

This presentation surveys how Flagship Pioneering’s portfolio companies use cloud technologies to scale AI/ML models. We’ll examine architecture patterns, strategies, and lessons learned, showcasing methods for scaling compute and pipelines. Additionally, the talk will highlight emerging trends in cloud and AI/ML, offering insights into how these innovations are shaping the future of biotech research. Attendees will gain practical strategies for optimizing cloud environments to drive AI/ML growth in biotech.

10:55 am

Cloud Genetics: A Blueprint for Precision Medicines

Gregory Hinkle, PhD, Vice President, Research Informatics, Alnylam Pharmaceuticals, Inc.

This presentation will detail challenges and solutions in implementing large-scale human genetics to support sequence-based precision medicines like RNAi therapeutics. Key topics will include data management, cost effectiveness, and navigating ever more stringent data security standards. The talk will highlight innovative strategies and technologies that Alnylam put in place that paved the way for the development and delivery of novel RNAi therapeutics to patients in need. Human genetics is at the heart of drug discovery at Alnylam. Effective, large scale human genetics is a data management and computational challenge. The loss of trust and the growth of human genetics projects as data.

11:25 am

Flexible Architecture for Machine Learning for Genomics

Yohann Potier, PhD, Senior Director, Data Platform, Tessera Therapeutics, Inc.

Flexible Architecture for Machine Learning for Genomics: This talk will explore the architecture for developing, deploying, and scaling machine learning models in genomics. It will emphasize the importance of creating a flexible infrastructure that can accommodate a range of ML workloads for drug development. Key topics will cover optimizing model training, managing datasets, and efficiently utilizing compute resources to meet different workload demands while ensuring cost-effectiveness and scalability.

11:55 am Powering AI/ML Workloads and Scaling Science with Nextflow

Evan Floden, CEO & Co-Founder, Seqera

The growth in AI adoption, largely driven by computational advancements, is enabling novel and exciting analysis. However, managing diverse computational needs—such as GPU-parallel processing, mixed compute and hardware requirements, and flexible development environments—can be challenging. Nextflow efficiently handles mixed compute workloads efficiently, making it an ideal choice for AI/ML tasks. Seqera enhances this by streamlining resource management and optimizing cloud-based GPU resource scaling. This presentation explores how Nextflow and Seqera offer a flexible, scalable AI workflow solution.

12:25 pm AI/ML on AWS: Building for GxP Validated Environments

Aaron Jeskey, Senior Cloud Architect, Cloud Engineering, Pinnacle Technology Partners, Inc.

As artificial intelligence and machine learning (AI/ML) transform industries, ensuring compliance with Good Automated Manufacturing Practice (GxP) regulations in life sciences is critical. During this talk, “AI/ML on AWS: Building for GxP Validated Environments,” PTP will explore how to design and deploy AI/ML workflows on AWS that meet stringent GxP requirements. We’ll discuss best practices for leveraging AWS services, ensuring data integrity, traceability, and validation, while maintaining innovation velocity. Attendees will gain insights into architecting robust, compliant solutions for regulated industries, enabling them to harness the power of AI/ML responsibly and effectively. Whether you’re a data scientist, developer, or compliance specialist, this session will equip you with the tools to succeed in GxP-regulated environments.

12:40 pm Towards Foundation Models for Process Development

Karthik Sekar, PhD, Staff Data Scientist, Invert, Inc.

What if bioprocess teams need 80% less data to train models?

Hybrid modeling, which combines physical and black-box models, has revolutionized machine learning applications in bioprocessing. It significantly reduces data requirements while maintaining flexibility compared to pure physical models. However, a critical challenge remains: how can organizations effectively transfer knowledge when switching between cell lines or molecules?

This presentation explores our collaboration with a CDMO partner to develop an innovative foundation modeling approach that harnesses historical process development data to accelerate new projects.

Our approach achieves the same performance as conventional hybrid models with 20% of the data. Join us to explore how this breakthrough could reshape the speed and efficiency of bringing new biotherapeutics to market.

12:55 pmSession Break and Transition to Lunch

1:05 pm LUNCHEON PRESENTATION:
Data: Your Secret Weapon for Innovation in Life Sciences

Paul Brake, Executive Director Life Sciences, Healthcare Life Sciences, Oracle Corp.

Sal Marcuz, Master Principal Enterprise Architect, Oracle Corp.

Without data, there would be no generative AI. Discover how you can implement AI services today across the entire data ecosystem of life sciences, from drug discovery & development to clinical trials & pharmacovigilance. Explore:

  • How to unify your disparate data sources while avoiding duplication and redundancies and begin to implement cutting-edge capabilities

  • Learn how to control your complex data stores through improvements in data management, governance and access

  • Avoid moving your data around by leveraging Oracle’s multicloud relationships with Azure, Google and AWS

1:35 pmRefreshment Break in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)

Bio-IT's hall is bigger than ever—one break won’t cut it! Enjoy dessert and coffee after lunch, explore booths and posters, vote for awards, and participate in our raffle for a chance to win a prize!

LEVERAGING CLOUD FOR FASTER, BETTER DATA MANAGEMENT AND ANALYTICS

2:25 pm

Chairperson's Remarks

Fabia Fricke, PhD, Pharma Research, Data & Analytics, Roche

2:30 pm

Transforming Drug Discovery: Leveraging Secure, Federated Learning to Collaboratively Train AI Models on the Proprietary Molecular Data of Multiple Organizations

Mohammed AlQuraishi, PhD, Assistant Professor, Systems Biology, Columbia University

John Karanicolas, PhD, Head of Computational Drug Discovery, AbbVie

Robin Roehm, PhD, CEO & Co-Founder, Apheris

A major challenge in advancing AI algorithms for modern drug discovery is the limited availability of protein and ligand structures needed to train AI models. For the first time, leading biopharmaceutical companies enable collaborative training of AI models that predict the 3D structure of molecular complexes using their proprietary protein structure data. In our talk, we will introduce the AI Structural Biology Consortium and present initial results.

3:00 pmPresentation to be Announced

3:30 pm

Harnessing ML for Small Molecule Property Predictions

Fabia Fricke, PhD, Pharma Research, Data & Analytics, Roche

Drug discovery is a collaborative effort, with AI/ML emerging as a transformative partner. Beyond its role in everyday life, AI accelerates drug discovery by streamlining processes and providing innovative solutions. This talk explores how ML is integrated into Roche's in-house developed product suite, enabling teams to efficiently manage small molecules through profiling cascades with automation features. By harnessing ML, the platform facilitates the skipping of assay requests, optimizing the workflow. Described as "Lab-in-the-Loop," this mechanism enriches Roche's digital ecosystem in Discovery and Early Development. Join us to discover how an iterative ML model is revolutionizing drug discovery.

4:00 pm The Precision Medicine AI Agent Network: Intelligence at-Scale

Tobias Guennel, Senior Vice Parsident, Product & Chief Architect, Data Management & Systems Integration & Innovation, QuartzBio

With QuartzBio’s Precision Medicine AI Agent Network, you can ask questions, get answers, and gain insights across the Precision Medicine Value Chain. Domain-specific AI agents, orchestrated by our Precision Medicine Virtual Assistant, enable informatics, translational, and IT teams to conversationally interact with a connected ecosystem of biomarker, sample, and clinical data. This solution semi-autonomously performs tasks based on expected events and observed trends, for faster insights, analytics, visualizations, and accelerated time-to-market.

4:30 pmBest of Show Awards Reception in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)

Unwind with colleagues at our lively reception! Explore posters, vote for the best, network with exhibitors, enjoy a drink, and try to win a raffle prize. Celebrate Best of Show winners!

5:45 pmClose of Day

Friday, April 4

7:00 amRegistration Open and Morning Coffee

7:00 amQuick Bytes & Networking Breakfast—Lifted Rooftop Restaurant & Bar (Sponsorship Opportunity Available)

Start your morning with ‘Quick Bytes & Networking’! Enjoy a cozy restaurant-style setting, quick bites, and speed networking. Connect, converse, and energize your Bio-IT experience before the plenary keynote!

8:00 am

Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

8:05 am

Innovative Practices Awards: Excellence in Technological Innovation

Allison Proffitt, Editorial Director, Bio-IT World and Clinical Research News

Since 2003, Bio-IT World has hosted an elite awards program with the goal of highlighting outstanding examples of how technology innovations and strategic initiatives are being applied to advance life sciences research. The 2025 Innovative Practices Awards winners represent excellence in innovation in the areas of informatics, pre-competitive collaboration, clinical and health IT, and genomics. Companies driving the winning entries include Genmab, Genedata, NHS England, IQVIA, Pistoia Alliance, Regeneron, and Quris-AI. For more details about the Awards, visit www.bioitworldexpo.com/innovativepractices.

8:20 am PLENARY KEYNOTE PRESENTATION:

The Longitude Prize on ALS: A Groundbreaking Global Prize Harnessing the Power of AI to Drive Treatment for ALS

Tris Dyson, Founder, Challenge Works

Jeffrey D. Rothstein, MD, PhD, Professor, Neurology and Neuroscience; Director, Brain Science Institute, Johns Hopkins University

The Longitude Prize series brings together the brightest minds to solve the world's most challenging innovation problems. The Longitude Prize on ALS, launching in June 2025, will bring together computational biologists, neurodegenerative researchers and AI-driven biotech globally to uncover novel therapeutic targets for ALS. 

ADVANCING DRUG DISCOVERY AND HEALTHCARE THROUGH DATA-DRIVEN INNOVATION: FROM GENOMICS TO THERAPEUTICS

8:35 am PLENARY KEYNOTE INTRODUCTION:Shaping the Next Era of Precision Health with Multiomics and AI-Driven Predictive Insights

Rami Mehio, Vice President, Head of Global Software and Informatics, Illumina, Inc.

8:45 am PLENARY KEYNOTE PRESENTATION:

Scaling Genomic Medicine: Transforming Newborn Screening through Informatics and Innovation

Robert C. Green, MD, MPH, Professor and Director of Genomes2People Research, Mass General Brigham, Broad Institute, Ariadne Labs, and Harvard Medical School

The BabySeq Project has pioneered the integration of genomic sequencing into newborn and childhood screening, uncovering unexpected risk variants and transforming healthcare delivery. This keynote explores the groundbreaking progress in genomic medicine, featuring real-world stories of families impacted by these discoveries. Learn about the informatics challenges and innovative solutions required to scale genomic screening for national and global implementation, reshaping the future of precision medicine.

9:15 am PLENARY KEYNOTE PRESENTATION:

Unlocking the Power of Machine Learning and Data-at-Scale to Deliver with Speed the Best Therapeutic Candidates

Justin M. Scheer, PhD, Vice President In Silico Discovery & Head, Molecular Computational Team, Johnson & Johnson Innovative Medicine

The challenges of high costs, lengthy timelines, and significant attrition have prompted our industry to integrate AI/ML into all aspects of the business. This presentation highlights J&J's strategic investments in AI/ML technologies to enhance the drug discovery processes, including molecule design and optimization. By investing in these technologies with a modality agnostic approach, J&J aims to tackle the hardest targets in drug discovery, ultimately increasing the success rate of delivering better molecules faster.

9:45 amCoffee Break in the Exhibit Hall with Poster Competition Winners Announced (Sponsorship Opportunity Available)

Bio-IT is all about connections! Explore booths, award-winning posters, and network with clients, colleagues, and exhibitors. Grab coffee, build relationships, and stay for a chance to win a raffle prize!

10:30 amOrganizer's Remarks

BENEFITS OF CLOUD IN BIOPHARMA: CASE STUDIES AND BEST PRACTICES

10:35 am

Chairperson's Remarks

Rajarshi Guha, PhD, Senior Director, Data & Computational Sciences, Vertex Pharmaceuticals, Inc.

10:40 am

Using Durable Workflow Technology to Run Image Analysis in the Cloud

Matthew Gerring, MEng, Senior Manager, Computational Sciences, The Jackson Laboratory

In this talk, we will show how image analysis can be taken from research level software to high quality and production ready systems. Attendees will learn how Jackson Laboratory transitioned research-level software into production-ready systems for scalable and durable image analysis. This presentation will feature biological image data, offer insights into new paradigms for cloud-based analysis, and appeal to those interested in scaling analysis of any type using distributed programming and cloud technologies.

11:10 am

One Research Digital—Ecosystems of FAIR Data Lakes Enabling AI-Augmented Analysis, Modeling, and Reporting

Marcin von Grotthuss, PhD, Director, Data Integration and Analytics, Preclinical and Translational Sciences, Takeda Pharmaceutical Co., Ltd.

The collaborative efforts within Takeda, supported by external vendors, led to the development and prototyping of One Research Digital—an ecosystem comprised of FAIR data lakes. During the design phase, we adhered to the FAIR principles and utilized a data lake infrastructure with a flexible data storage schema as the system's cornerstone. These advancements enable us to enhance AI-augmented analysis, modeling, and reporting across preclinical and translational sciences.

11:40 am

Ultra-Large Virtual Screening is Enabled by Orchestrating Cloud-Based Computation

Rajarshi Guha, PhD, Senior Director, Data & Computational Sciences, Vertex Pharmaceuticals, Inc.

Reaction-based virtual libraries have expanded our access to chemical spaces of billions of virtual molecules. Searching such spaces for molecules of interest can be performed in many ways, but they can usually be parallelized at multiple levels. We describe a virtual screening infrastructure that uses Nextflow to parallelize a Genetic Algorithm Virtual Screening (GAVS) coupled to computational chemistry primitives, such as shape matching and docking, using AWS Lambda. Finally, we present some benchmark results highlighting the efficiency and accuracy of the method, along with the performance gains achieved by virtue of cloud-based parallelization.

12:10 pm Supercharge Computational Drug Discovery with AI-Powered Serverless High-Performance Computing (HPC)

Fengbo Ren, CEO, Computer Science & Engineering, Fovus Corp.

Fovus is an AI-powered, serverless high-performance computing (HPC) platform delivering intelligent, scalable, and cost-efficient supercomputing power at the computational scientists' fingertips. Fovus uses AI to optimize HPC strategies and orchestrates cloud logistics, making cloud HPC a no-brainer and ensuring sustained time-cost optimality for computational drug discovery amid quickly evolving cloud infrastructure. By accelerating time-to-insights and optimizing cloud costs, Fovus helps Biotech clients accelerate Design-Make-Test-Analyze (DMTA) cycles and discover more with less. Join this talk to learn how Fovus can supercharge your computational drug discovery with case studies and GROMACS/AlphaFold 3 benchmarking results.

12:25 pm

Powering AI/ML at Scale: Building a Cloud-Native Infrastructure for Biopharma Innovation

Anand Murthy, Director, AI and Data Platform, Moderna

As AI and machine learning transform biopharma R&D, building a scalable, cost-efficient, and compliant cloud infrastructure is essential for accelerating innovation. We have embraced a fully cloud-native approach to power AI/ML workloads, enabling seamless access to data, high-performance compute, and secure collaboration. This session will explore key architectural decisions, trade-offs considered, and best practices for optimizing cloud environments to support AI/ML at scale. Attendees will gain insights into leveraging cloud technologies to drive scientific breakthroughs while maintaining flexibility, security, and cost efficiency.

12:40 pm Harnessing Agentic AI in R&D Cloud Ecosystems: Accelerating Clinical Innovation

Shakthi Kumar, Chief Strategy and Business Officer, EDETEK Inc

Imagine a world where clinical development is faster, smarter, and more efficient. The fusion of agentic AI with R&D cloud ecosystems is making this vision a reality. Join us to explore how this cutting-edge technology is revolutionizing clinical data management and analytics. (Spoiler: It's a game-changer!) Learn about: Transformative power of R&D Cloud Ecosystems in delivering the next-gen digital data pathways. Innovative impact of agentic AI on clinical workflows. Real-world case studies showcasing the benefits of this integration. (Just a preview!)

1:10 pmSession Break and Transition to Lunch

1:20 pmLuncheon Presentation (Sponsorship Opportunity Available) or Enjoy Lunch on Your Own

1:50 pmRefreshment Break in the Exhibit Hall with Last Chance for Poster Viewing (Sponsorship Opportunity Available)

Feeling tired? Recharge during the final Networking Exhibit Hall break! Visit booths, explore posters, connect with peers, and turn in your Game Cards for a chance to win a raffle prize.

TRENDS FROM THE TRENCHES: BRIDGING TRADITIONAL INSIGHTS WITH INNOVATIVE ADVANCEMENTS

2:30 pm

Chairperson's Remarks

Dirk Petersen, Director of Supercomputing Center, Oregon State University

2:35 pm

Trends from the Trenches

Ari E. Berman, PhD, CEO, BioTeam, LLC

Since 2010, “Trends from the Trenches” has been a cornerstone of the Bio-IT program, delivering candid and occasionally blunt assessments of the most impactful and overhyped IT technologies in life sciences. This talk will provide a deep dive into computing, storage, cloud, data science, machine learning, and more, with a focus on supporting data-intensive science. Looking ahead, this talk will share forward-thinking predictions about emerging technologies and trends poised to shape the future of life sciences innovation, offering actionable insights for navigating the next wave of IT evolution.

3:05 pm

In the Trenches with AI Supercomputing: Driving Innovation in Life Sciences and Quantum Simulations

Dirk Petersen, Director of Supercomputing Center, Oregon State University

Launching in 2026, a new AI supercomputer powered by Nvidia’s latest Rubin-generation GPUs will transform research at Oregon State University’s Huang Collaborative Innovation Complex. This mini talk highlights its capabilities, from accelerating protein structure prediction to advancing quantum simulations to something completely new and different. Learn how you can get access to this cutting-edge resource and drive innovation in life sciences and quantum computing simulations and discover opportunities to collaborate.

3:15 pm

Transforming Big Data into Actionable Insights: Leveraging the Sequence Read Archive (SRA) for Life Sciences and Public Health

J. Rodney Brister, PhD, Acting Program Head, Sequence Read Archive, NCBI, NLM, NIH

As the world's largest publicly available repository of raw sequence data, the Sequence Read Archive (SRA) plays a pivotal role in advancing public health and life sciences research. This presentation highlights state-of-the-art tools and strategies for managing and analyzing the SRA’s massive datasets, showcasing its impact on infectious disease surveillance, genomic epidemiology, and precision medicine. Discover how innovative informatics solutions are transforming raw data into actionable insights for global health challenges.

3:30 pm

The Biologist Explores Learning: Insights on LLMs, Deep Learning, and Personal Discoveries

Brian Osborne, PhD, Senior Principal Consultant, BioTeam, LLC

Many biologists who have spent years coding and thinking in terms of bioinformatics - protein and DNA sequence, genomics - are now engaging with machine learning, NLP, and LLMs. In this talk a bioinformaticist will talk about the many lessons learned and twists and turns encountered in these new fields. Topics will include new ways of thinking about computing with CPUs and GPUs, re-representing data, training, iteration and validation, version control and environments, new definitions of “pipeline”,  and coming face-to-face with prediction and statistics.

3:45 pmSession Q&A

4:05 pmClose of Conference







Conference Tracks