Modern Data Platforms and Storage Infrastructure

Architect, Implement, and Manage Data Storage Solutions that Maximize Speed, Performance, and Cost

May 17 - 18, 2023 (All Times EDT)

Is the burden of managing your data growing larger every day? Do you have a scalable and robust data management infrastructure in place to store, process, analyze, transfer, and secure vast quantities of data according to your organization’s policies? Do you know how to balance availability against interoperability? What are the approaches to scalable distributed/federated data analytics? What conversations are you having about speed vs. performance vs. cost? Which vendors should you use? How do you evaluate the strengths and weaknesses of technology solutions? Which data storage options and data types are efficient? Organizations have made tremendous advances in large-scale data management, including storage platforms, integration and migration plans, and governance. The Modern Data Platforms and Storage Infrastructure track will explore these questions and share best practices from these efforts.

Monday, May 15

8:00 am – 6:00 pm Hackathon*

*Separate Complimentary Registration Required, see Hackathon page to submit your project OR register to participate

2:00 pm – 5:00 pm Registration Open – Come Early and Avoid the Lines

Tuesday, May 16

7:00 am Registration Open

8:00 am Recommended Pre-Conference Workshops and Symposia*

On Tuesday, May 16, 2023, Cambridge Healthtech Institute is pleased to offer nine pre-conference workshops scheduled across three time slots (8:00-10:00 am, 10:30 am-12:30 pm, and 1:45-3:45 pm) and two Symposia from 8:25 am-3:45 pm. All are designed to be instructional and interactive and to provide in-depth information on a specific topic. They allow for one-on-one interaction and provide a great way to explain more technical aspects that would otherwise not be covered during the main conference tracks that take place Wednesday-Thursday.

*Separate registration required. For details, see Workshop agendas, FAIR Data Symposium agenda, and Knowledge Graphs Symposium agenda.

8:00 am – 3:45 pm Hackathon*

*Separate Complimentary Registration Required, see Hackathon page to submit your project OR register to participate

3:45 pm Refreshment Break and Transition to Plenary Keynote

PLENARY KEYNOTE PROGRAM

4:00 pm

Plenary Keynote Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

4:05 pm

Innovative Practices Awards

Joseph Cerro, Independent Consultant

Chris Dwan, Independent Consultant, Dwan, LLC

Allison Proffitt, Editorial Director, Bio-IT World

The Innovative Practices Awards recognize and celebrate innovation that advances life sciences research. Bio-IT World is currently accepting entries for the 2023 Innovative Practices Awards, a competition designed to recognize partnerships and projects pushing our industry forward. Winners will be announced in mid-April 2023, recognized during the Tuesday, May 16 Plenary Keynote Program, and scheduled to give a 30-minute podium presentation about their project during the conference. The deadline for entry is March 3, 2023. For more details about the Awards and to submit an application, visit the official Bio-IT World Innovative Practices Awards page: https://www.bio-itworld.com/Award/.

4:20 pm Plenary Keynote Introduction

David Gosalvez, PhD, Executive Director, Strategy & Informatics Portfolio, Revvity Signals

4:30 pm PLENARY KEYNOTE PRESENTATION:

The Promise of Data, Analytics, and Technology: Fueling Scientific and Medical Breakthroughs

Anastasia Christianson, PhD, Vice President, Global Head of AI, ML, Analytics, and Data, Pfizer Inc.

Edward Cox, Head & General Manager, Digital Health & Medicines (DHM), Pfizer Inc.

The 21st century has been referred to as the Century of Biology. With 90% of the world’s 97 zettabytes of data generated in the past 2 years and 30% of today’s data being healthcare related, how are we using data, technology, and advanced analytics (artificial intelligence, machine learning, and deep learning) to advance our understanding of disease and deliver “breakthroughs that change patients' lives”?

5:45 pm Welcome Reception in the Exhibit Hall with Poster Viewing

7:00 pm Close of Day

Wednesday, May 17

7:00 am Registration and Morning Coffee

PLENARY KEYNOTE PROGRAM

8:00 am

Plenary Keynote Organizer's Remarks

Allison Proffitt, Editorial Director, Bio-IT World

8:05 am PLENARY KEYNOTE INTRODUCTION:

Life Science Automation Opportunities – So Many Options, So Little Time

Santanu Sen, Vice President, Healthcare & Life Sciences, Virtusa

The COVID pandemic has demonstrated that therapies and vaccines can be developed in 18 months with a high degree of safety and efficacy. Pioneering work done by the companies involved has shed light on archaic processes that have been in existence for decades with little need for change. In this presentation, we will discuss collaborative efforts, enabling technologies, regulation, and workflow to automate these processes to advance personalized medicine initiatives.

8:15 am PLENARY KEYNOTE PRESENTATION:

Federated Futures: How the Largest Federated Learning Effort in Medicine Will Inform Our Next Steps

Spyridon Bakas, PhD, Assistant Professor, Radiology & Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania

Raymond Y. Huang, MD, PhD, Division Chief, Neuroradiology, Brigham and Women’s Hospital; Associate Professor of Radiology, Harvard Medical School

Jason Martin, Principal Engineer AI Research Science, Security Solutions Lab, Intel Labs

Is a federated learning model sufficient to handle data from 71 institutions and more than 6,000 patients located on six continents? Researchers from Penn Medicine and Intel Labs say yes. An interdisciplinary team created the largest to-date global federated learning effort to develop an accurate and generalizable machine learning model for detecting glioblastoma borders. We will share what we learned about creating and maintaining such a federation, how the software infrastructure evolved over the course of the study, and how this work will empower the future of high-quality, precision clinical care worldwide.

9:30 am Coffee Break in the Exhibit Hall with Poster Viewing

10:15 am Organizer's Welcome Remarks

ARCHITECTING FOR SUCCESS: SOLUTIONS FOR DATA WORKLOADS AND WORKFLOWS

10:20 am

Chairperson's Remarks

Fernanda S. Foertter, MSc, Director of Developer Relations, Voltron Data

10:25 am

Building an End-to-End Solution for Genomics Data with Amazon Omics

Brendan Gallagher, Head, Business Development, Sentieon, Inc.

Mary Olson, Senior Product Manager, Amazon Omics, Amazon Web Services (AWS)

The human genome acts as the biological blueprint of the human body and has the potential to transform how we discover new therapies and treat disease. However, researchers face a common set of challenges in processing genomics data in the cloud, from scaling compute across millions of samples to analyzing trends. Using AWS, healthcare and life science organizations can store, query, analyze, and generate insights from genomics and other biological data to improve human health. In this session, hear how you can use Amazon Omics to support large-scale analysis and collaborative research with scalable workflows, purpose-built data stores, and multi-modal analytics. Attendees will learn how Amazon Omics can enable them to store petabytes of genomics data at low cost, process data efficiently, and analyze population-scale datasets.

10:55 am

Effective Use of AWS ParallelCluster for Life Science Workloads

Adam Kraut, Director, Infrastructure & Cloud Architecture, BioTeam, Inc.

This presentation will cover all of the tips and tricks we use to deploy the stack in various 'omics and CompChem environments.

11:25 am

Hydra: A Distributed Framework to Create Data Federations

Susmit Shannigrahi, Assistant Professor, Computer Science, Tennessee Technological University

Scientific data is geographically distributed and difficult to access and use in workflows because current storage is distributed and siloed. The Hydra project, funded by the National Science Foundation, creates a framework that allows communities to easily build data federations on top of existing solutions. It also automatically replicates data to ensure availability. This talk will discuss 1) how to build federated data storage over existing storage using Hydra and 2) how to bring various data storage mechanisms together so that they can be used easily and transparently in workflows, massively simplifying existing workflows.

11:55 am How a Modern Data, Analytics, and AI Platform Can Transform the Life Sciences Industry

Mike Sanky, Regional Vice President, Healthcare & Life Sciences, Databricks

Life sciences is consuming petabytes of data ranging from medical images to DNA sequences to third-party data. The challenge is how to ingest, organize, and prepare large, diverse data sets for analytics and machine learning at scale to unlock novel patient insights and research. In this session, we will explore how life sciences can leverage a unified data, analytics, and AI platform to manage, process, and analyze data in real time.

12:10 pm Leveraging Hybrid Environments for Drug Discovery

Ben Crist, Senior Sales Engineer Manager, WEKA

12:25 pm Breaking Down Data Silos with a Unified Platform Approach for Life Science Domains

Robert Zeigler, PhD, Vice President, Product Development, L7 Informatics, Inc.

Data silos hinder companies from fully leveraging their data's potential by erecting barriers among different departments and groups. Silos arise due to varying requirements, like the need for agility in research versus stringent regulatory control in manufacturing. To overcome this, a unified platform that accommodates user needs across life science domains enables data utilization and comparison throughout a product's entire life cycle—from research to manufacturing.

12:55 pm Session Break and Transition to Luncheon Presentation

1:05 pm LUNCHEON PRESENTATION: How to Solve Your Scientific Data Problems

Tal Aharon, Data Architect III, TetraScience

Katja Hall, Scientific Business Analyst, TetraScience

1:50 pm Refreshment Break in the Exhibit Hall with Poster Viewing

ARCHITECTING FOR SUCCESS: SOLUTIONS FOR DATA WORKLOADS AND WORKFLOWS

2:35 pm

Chairperson's Remarks

Fernanda S. Foertter, MSc, Director of Developer Relations, Voltron Data

2:40 pm

Integration as a Backbone Enabler to Connect Legacy and On-Prem World to Cloud

Viviana Echeverry, Global IT Head – Digital Integration, Chapter Section Head in Architecture & Emerging Technologies, Roche

This presentation will discuss how integration plays a key role in connecting the legacy and on-prem world to the cloud. Learn how this has been done at Roche using integration platforms on hybrid on-prem and cloud installations to enable new use cases, applications, and data flows.

3:10 pm

TrustSphere: A Novel Data Platform to Empower Citizens to Connect, Manage, and Share Patient-Generated Health Data with Clinicians and Researchers

Tibor Van Rooij, PhD, Director, Research Informatics, BC Children's Hospital Research Institute

TrustSphere (trustsphere.ca), a novel infrastructure for capturing and storing data, including patient-generated health data (PGHD), empowers citizens with the digital means to manage, protect, and share health-relevant data for their clinical care and, upon consent, to use the streaming data from their wearable devices to become active data donors to research. Initially aimed at children with Type 1 diabetes, this technology will expand shortly to other chronic conditions.

3:40 pm

Decarbonizing High-Performance Computing

Andrew S. Grimshaw, PhD, President, Lancium Compute

Renewable generation makes up an increasing percentage of the energy mix. However, renewables face challenges in reaching end-use customers. Lancium is locating HPC data centers in areas with excess renewable energy to help advance the deployment of renewables, with data centers acting as giant batteries and grid stabilizers, soaking up power during times of excess generation and releasing it back when renewable sources are limited.

4:10 pm Powering Your Digital Scientific Ecosystem

Darren Barrington-Light, Senior Manager Product Marketing, Chromatography and Mass Spectrometry Software, Thermo Fisher Scientific

Organizations today recognize the importance of connecting the physical lab with the digital world. The next significant opportunity is to realize the power in connecting the disparate systems and analytical data, from LIMS, CDS, LES, ELN, SDMS, and analytical tools in the lab to production systems such as MES and ERP. Strengthening your digital scientific ecosystem provides access to new levels of success, accelerating research and high-quality results.

4:40 pm Best of Show Awards Reception in the Exhibit Hall with Poster Viewing

6:00 pm Close of Day

Thursday, May 18

7:30 am Registration and Morning Coffee

PLENARY KEYNOTE PROGRAM

8:00 am

Plenary Keynote Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

8:05 am Plenary Keynote Sponsor Introduction (Opportunity Available)

8:15 am PLENARY PANEL DISCUSSION:

Assessing Innovation: How Pharma Makes Tech Investment Decisions

PANEL MODERATOR:

Aaron Mann, CEO, Clinical Research Data Sharing Alliance

This panel session will assemble senior leaders who evaluate new technology adoption. We will hold an interactive discussion to help provide transparency in the evaluation and decision-making process for assessing and investing in new technologies. Themes we will cover include: 1) the process for evaluating, piloting, and scaling new technologies and technology approaches; 2) how an organization evaluates an emerging technology vendor landscape; 3) when and how a formal buying process becomes required; and 4) identifying key stakeholders, decision-makers, and gatekeepers.

PANELISTS:

April Bingham, Executive Director, Global Medical Compliance and Governance Chapter, Roche

Peter Mesenbrink, PhD, Executive Director, Biostatistics, Novartis Pharmaceuticals

Maria Palombini, Global Practice Leader, Healthcare & Life Sciences, IEEE Standards Association

Laszlo Vasko, Senior Director, Clinical Innovation R&D IT, Janssen Pharmaceuticals, Inc.

9:30 am Coffee Break in the Exhibit Hall with Poster Viewing

10:15 am Organizer's Remarks

EVALUATING TECHNOLOGY AND DATA MANAGEMENT APPROACHES

10:20 am

Chairperson's Remarks

John Conway, Chief Visioneer Officer, 20/15 Visioneers

10:25 am

Evaluating Technology Approaches Using a Stakeholder-Centric Assessment Framework

Aaron Mann, CEO, Clinical Research Data Sharing Alliance

Peter Mesenbrink, PhD, Executive Director, Biostatistics, Novartis Pharmaceuticals

With rapid advances in technical capabilities and access to a growing range of technology approaches, the “best” choice for an organization isn’t always clear. This presentation introduces a stakeholder-centric technology assessment framework developed by the Clinical Research Data Sharing Alliance. The framework can be used for data sharing technology approach evaluation and pre-RFP scope development, with design principles organizations can apply to a wide range of evaluation scenarios. 

10:55 am

Managing Enterprise Scientific Data: Lessons Learned from the Frederick National Laboratory DME

Sunita Menon, Bioinformatics and Computational Science (BACS) Directorate, Frederick National Laboratory for Cancer Research

Eric Stahlberg, PhD, Director, Cancer Data Science Initiatives, Frederick National Laboratory for Cancer Research

11:25 am Strategic Uncertainty Management in 2023: Embracing Trends in Risk Modeling and Analysis in Portfolio Scenario Planning

Kate Bernstein, Associate Quantitative Analyst, Captario

Anna Sarkisyan, Senior Analyst, Captario

We are constantly met with new risks, variability, and unmanaged uncertainty. The perpetual use of static modeling does not capture the landscape of events that may occur during your various clinical and commercial lifecycles. Captario SUM captures the uncertainties that are pivotal to optimal decision-making in a dynamic fashion. This enables you to easily answer “what-if” questions revealing innovative and actionable insights throughout the value chain of your projects and portfolios.

11:55 am Enhancing Data Integrity and Audit Trails through Advanced Technology Solutions

Catherine Hall, VP of Data & Quality, Endpoint Clinical

Abhay Kini, PhD, Director of Life Sciences, Egnyte

Clinical trials have transformed digitally, emphasizing the link between audit trails and data integrity. In response to the release of an addendum to ICH E6 (R2), Endpoint Clinical embarked on a journey to streamline the provisioning of audit trail data to investigators and regulators. This presentation will discuss the strategic choices in developing a platform that delivers a GxP-compliant solution with controlled access for our clients.

12:25 pm How Innovations in Connectivity Will Boost Collaboration among Healthcare Professionals

Marc Halbfinger, CEO, PCCW Global | Console Connect

Delivering better patient outcomes is the common thread that binds the healthcare community. But in an age of economic uncertainty, disrupted supply chains and diminishing resources, the need for greater collaboration across the entire ecosystem has never been greater.

12:55 pm Session Break and Transition to Luncheon Presentation

1:05 pm LUNCHEON PRESENTATION: FAIR Data Isn't Always Enough. Digitization Succeeds by Treating Data as Reality, and the Physical Lab as the Copy

Nathan Clark, Founder, Product | Design | Engineering, Ganymede

FAIR data isn’t delivering enough value if it relies on uncertain future AI or meta-analysis uses. We review best practices for wet lab digitization; crisp goals for near-term business automation or statistics are key. Then, physical operations must suit the software layer, reversing the Digital Twin approach that may not be scalable. Data decays quickly over time and, if not immediately “used”, is likely losing utility permanently for future AI or meta-analysis.

1:50 pm Refreshment Break in the Exhibit Hall with Poster Viewing

TRENDS FROM THE TRENCHES

2:35 pm

Chairperson's Remarks

Ari E. Berman, PhD, CEO, BioTeam, Inc.

2:40 pm

Trends from the Trenches

Ari E. Berman, PhD, CEO, BioTeam, Inc.

Adam Kraut, Director, Infrastructure & Cloud Architecture, BioTeam, Inc.

Anna Sowa, PhD, Senior Scientific Consultant, BioTeam, Inc.

Since 2010, the “Trends from the Trenches” presentation has been one of the most popular annual traditions of the Bio-IT Program. The intent of the session is to deliver a candid (and occasionally blunt) assessment of the best, the most worthwhile, and the most overhyped information technologies (IT) for life sciences. The presentation has helped scientists, leadership, and IT professionals understand the basic topics related to computing, storage, data transfer, networks, cloud, data science, and machine learning that are involved in supporting data-intensive science. In 2023, consultants from BioTeam will give an overview of the trending issues in life sciences. A moderated, interactive Q&A discussion with the audience follows. Come prepared with your questions and commentary for this informative and lively session.

4:10 pm Close of Conference
