Track 5: Cloud Computing

Apply Cloud to Advance Research and Facilitate Collaboration

May 3 - 5, 2022 ALL TIMES EDT

Cloud, the preferred platform for delivery of computing services, is utilized for analyzing, modeling, storing, processing, exploring, and sharing dynamic data. Even with cloud’s maturity though, innovations and applications continue to evolve. Thus, enterprises are transitioning from a piece-meal approach to a more holistic end-to-end digital transformation with cloud at its core. However, architects and development teams face a tough job. They make many choices toward the setup of an enterprise’s cloud footprint and these choices may be lasting decisions. Through case studies, the Cloud Computing track explores the rapid growth and progressive achievements of cloud as well as evolving provider and user experiences and challenges.

Tuesday, May 3

7:00 am Registration Open (Plaza Level Lobby)
8:00 am Recommended Pre-Conference Workshops and Symposium*

On Tuesday, May 3, 2022 Cambridge Healthtech Institute is pleased to offer nine pre-conference workshops scheduled across three time slots (8:00-10:00 am, 10:30 am-12:30 pm, and 1:45-3:45 pm) and a Symposium from 8:25 am-3:45 pm. All are designed to be instructional, interactive and provide in-depth information on a specific topic. They allow for one-on-one interaction and provide a great way to explain more technical aspects that would otherwise not be covered during the main conference tracks that take place Wednesday-Thursday.

*Separate registration required. See Workshop page and Symposium page for details.

3:45 pm Session Break and Transition to Plenary Keynote



4:00 pm

Welcome by Conference Organizer

Allison Proffitt, Editorial Director, Bio-IT World
4:05 pm Innovative Practices Award
Mike Tarselli, PhD, Chief Scientific Officer, TetraScience
4:30 pm

Ask What IT Can Do for Bio...and What Bio Can Do for IT

George M. Church, PhD, Robert Winthrop Professor, Genetics, Harvard Medical School

IT for Bio: In May 2021, one haploid human genome (3.055 billion bp) was sequenced completely, but zero diploid. We have 7.7 billion diploid humans yet to be sequenced and correlated with their environments and traits in the Personal Genome Project. Plus, at least one genome from each of over 8.7 million eukaryotic species in the Earth Biogenome project. Plus, monitoring pathogenic and commensal bacteria, allergens, and viruses in the BioWeatherMap. Plus, ancient DNA. We are counting RNA molecules per cell in most (or all) cell types in humans, mice, and many other species throughout development and connectome (with imaging resolution up to 20 nm).   

Bio for IT: Reading and writing DNA has improved exponentially in cost (at least 60 million fold) and is increasingly used for storing non-biological data. The record for editing DNA in vivo is now 24,000 edits per cell and for storing data in vivo is about 1 terabyte per mouse. Enormous chemical and biological 'libraries' can perform 'Natural Computing' for tasks far beyond current von-Neumann silicon and quantum computers. The combination of these – machine learning + megalibraries (ML-ML) is already having commercial impact (e.g. Nabla, Manifold, Dyno, Patch). 

5:45 pm Welcome Reception in the Exhibit Hall with Poster Viewing (Auditorium/Hall C)
7:00 pm Close of Day

Wednesday, May 4

7:00 am Registration Open and Morning Coffee (Plaza Level Lobby)



8:00 am

Welcome by Conference Organizer

Allison Proffitt, Editorial Director, Bio-IT World
Zachary Powers, Chief Information Security Officer, Benchling
8:15 am

Accessing and Securing the Data that Drives Breakthroughs

Allison Proffitt, Editorial Director, Bio-IT World
Rachana Ananthakrishnan, Executive Director, Globus, University of Chicago
Ari E. Berman, PhD, CEO, BioTeam, Inc.
Jonathan C. Silverstein, Chief Research Informatics Officer & Professor, Biomedical Informatics, University of Pittsburgh
Rebecca F. Rosen, PhD, Director, Office of Data Science and Sharing, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health

Life sciences research is generating massive amounts of data that should be accessible to collaborators and colleagues to enable breakthrough discoveries. However, ensuring sensitive data are shared securely in a manner that protects patient privacy and complies with myriad regulations is a daunting task, which often slows the pace of research. Our panel of leading practitioners will share insights on the challenges and best practices of managing protected research data.

9:30 am Coffee Break in the Exhibit Hall with Poster Viewing (Auditorium/Hall C)



10:15 am Organizer's Remarks
10:20 am

Chairperson's Remarks

James Weatherell, Lead Research Computing Architect, Research Computing, Harvard Medical School
10:25 am

The Harvard Medical School – Research Computing Journey to the Cloud

James Weatherell, Lead Research Computing Architect, Research Computing, Harvard Medical School

Over the last two years, HMS Research Computing has been working to define their cloud computing architecture and service offering. Join James Weatherell as he dives into the HMS Research Computing cloud strategy. This presentation aims to highlight key challenges the HMS Research Computing team faced around cloud adoption, along with a look at the cloud architecture and service offerings HMS Research Computing provides their researchers today.

10:55 am

Self Service Analytics for Data Democratization and Innovation in Healthcare

Shahidul Mannan, Head, Data Engineering & Innovation, Data Analytics Organization, Mass General Brigham

In the next era of digitization, organizations are looking for innovation at speed and scale. This means global participation of innovators at all levels in the digital play, where they can bring their ideas, experiment, fail fast, and innovate. This session will cover platform and automation for data democratization and data science-driven innovation in healthcare, collaboration, and productionization through MLOps and DevOps, a key enabler in this innovation journey.

Zachary Powers, Chief Information Security Officer, Benchling

Cloud computing has evolved rapidly over the last twenty years, but common myths, misconceptions, and misinformation continue to influence technology decisions. Understanding modern cloud security, industry trends, and the evolving threat landscape will help dispel these common myths and misconceptions about how trustworthy cloud computing is today. Cloud computing is no longer just about scalability and innovation, it is a key security strategy to address modern threats.

Syed Hassan Abdullah, Data Scientist, Data Sciences, Zifo RnD solutions

We argue, contrary to the prevailing business mantra, that data science is not just an AI/ML magic trick but a more complicated story. In this presentation we emphasise technical cornerstones of this complex story. Essential activities of robust Data Science involve data preparation, management and engineering, model engineering statistics and machine learning, deployment, monitoring and retraining, and data chain of custody. 

George Loftus, Associate Vice President, Network Services, Internet2

The Internet2 Cloud Connect service provides dedicated, secure, and dynamic connections over its recently upgraded 400Gbps national network to four geographically diverse connection points with the major cloud providers. Utilize direct control to provision and manage cloud connections more quickly, simply, and seamlessly with dedicated network connections and extend your data center with software-enabled self-service capabilities to AWS Direct Connect, GCP Partner Interconnect, Microsoft Azure ExpressRoute, and Oracle FastConnect.

12:25 pm Interactive Discussions (Sponsorship Opportunity)

Interactive Discussions are informal, moderated discussions, allowing participants to exchange ideas and experiences and develop future collaborations around a focused topic. Each discussion will be led by a facilitator who keeps the discussion on track and the group engaged. For in-person events, the facilitator will lead from the front of the room while attendees remain seated. For virtual attendees, the format will be in an online networking platform. To get the most out of this format, please come prepared to share examples from your work, be a part of a collective, problem-solving session, and participate in active idea sharing. Please visit the Interactive Discussion page on the conference website for a complete listing of topics and descriptions.

Matthew Rich, Principal, Cloud & Digital Transformation, PwC
Matthew Wilden, Principal, Cyber Risk & Regulatory for Health Services, PwC

74% of business leaders say they are engaged in a cloud strategy, yet over half aren't seeing intended ROI from their investment. Companies are quickly moving to take advantage of the cloud, but how can they ensure they are getting the value they expect? Join PwC for a conversation around how Life Sciences leaders can make a stronger business case for cloud and in turn, drive adoption through business transformation.

12:55 pm Session Break and Transition to Luncheon Presentation
Robert Murphy, Director of Product Marketing, WEKA

With data sets growing at an exponential rate, storing, and analyzing mountains of data requires a modern approach for faster insights. Learn how you can start your research in the cloud or enable sophisticated hybrid workflows to lower costs and accelerate time to discovery for any workload, even ones that you thought couldn’t be run in the cloud.


1:50 pm Refreshment Break in the Exhibit Hall with Poster Viewing (Auditorium/Hall C)


2:35 pm

Chairperson's Remarks

Gerard Randolph, Project Leader, Pharma Research & Early Development Informatics, Roche
2:40 pm

Transforming Patient Experience through Digitalization of Clinical Trials in the Cloud 

Gerard Randolph, Project Leader, Pharma Research & Early Development Informatics, Roche

Learn how a global team of analysts, architects, and SMEs leveraged the cloud, PEGA, and Agile to develop an Integrated Digital Platform and Mobile App to engage participants enrolled in our clinical trials. Our tMate platform supports real-time monitoring, information sharing, communications, and integrates third-party services allowing clinical sites and patients to engage in ways not previously available and only possible in the cloud.

3:10 pm

PrecisionFDA – A Collaborative Platform for Analysis of Biological, Clinical, and Chemical Datasets

Yulia Borodina, Chemist, Office of Data, Analytics, and Research, FDA

PrecisionFDA is a secure, cloud-based, high-performance platform implemented by the FDA for collaboration in the areas of biological, clinical and chemical informatics in order to advance precision medicine. It has expanded to assist the broader community of data scientists for analysis of clinical data and to host data-driven competitions that push the limits of tools and algorithms for large-scale data processing. PrecisionFDA has hosted 31 challenges and app-a-thons on topics including AI/ML development to identify COVID-19 risk factors and predict brain cancer patient outcomes and bioinformatics tool development and benchmarking for genetic variant calling and detection of microbial pathogens. The latest application of this platform provides analysis of chemical data sets, as well as benchmarking computational approaches to cheminformatics, drug discovery, and toxicology.

3:40 pm

A Name-Based Ecosystem for Genomics Data and Workflows

Susmit Shannigrahi, Assistant Professor, Computer Science, Tennessee Tech University

Genomics communities are increasingly relying on cloud services for their computations. However, integrating various workflow components with cloud services can be complex. This talk presents a new name-based ecosystem based on Named Data Networking (NDN) that simplifies workflow integration through the use of content names at the network layer.

Ilyana Rosenberg, Senior Product Manager, Genomics, Microsoft
Roberto Lleras, Senior Scientist, Health Futures, Microsoft Health & Life Sciences

The past decade has seen rapid growth in data across the healthcare and life sciences domains, and the pace of growth is accelerating.  Unlocking the value of this data requires secure and scalable solutions that can only be afforded by the cloud. Come hear Microsoft’s vision for how the cloud will enable genomics-driven biomedical research and how partnerships across the ecosystem will accelerate Precision Medicine R&D. 

4:40 pm Best of Show Awards Reception in the Exhibit Hall with Poster Viewing (Auditorium/Hall C)
6:00 pm Close of Day

Thursday, May 5

7:30 am Registration Open and Morning Coffee (Plaza Level Lobby)



8:00 am

Welcome by Conference Organizer

Allison Proffitt, Editorial Director, Bio-IT World
Nate Raine, Director Data Custodians, Lifebit
8:15 am

Leveraging Large-Scale Human Data to Advance and Accelerate Drug Discovery

Shankar Subramaniam, PhD, Distinguished Professor of Bioengineering; Professor of Chemistry, Biochemistry and Nanotechnology; Adjunct Professor of Cellular & Molecular Medicine, University of California at San Diego

Advances in genomics technologies have led to generation of massive amounts of human data. This has catalyzed new insights into cellular processes in the normal and disease state and facilitated the search for safe and effective medicines. The UK Biobank, All of US and TopMed initiatives are exemplars of this approach. We highlight examples from our lab where meaningful insights have been obtained advancing our understanding of disease biology and its pharmacological application.

9:30 am Coffee Break in the Exhibit Hall with Poster Viewing (Auditorium/Hall C)



10:15 am Organizer's Remarks
Ryan Magruder, Solutions Architect, Sales, Rescale
10:25 am

Modernizing for the New Era of Life Science Innovations

Lita Sands, Head, Life Sciences, Amazon Web Services
Mike Tirozzi, Senior Vice President, Chief Information & Data Officer, Vertex Pharmaceuticals, Inc.
Mike Tarselli, PhD, CSO, TetraScience, Inc.
Anna Berg Åsberg, Global Vice President R&D IT, AstraZeneca
Bill Goodman, Senior Director, Product Management, Digital Science, ThermoFisher

Over the last decade, life sciences organizations have pushed the envelope on what’s possible—from running tens of billions of tests in a single day, to using Alexa-enabled lab equipment to reduce errors, to using AI to develop more targeted clinical trials for precision medicine.  While these innovations touch different parts of the value chain, they each build on a common foundation of cloud modernization. This session will explore the underlying infrastructure behind some of the most innovative breakthroughs in the life sciences industry, and break down how leading life sciences organizations approached their cloud migration journey.  Hear from leaders from AstraZeneca, Vertex Pharmaceuticals, TetraScience, and more to learn how their organizations created an infrastructure for innovation, put technology to work to remove the undifferentiated heavy lifting and focus on what matters most to their organizations, and how the cloud is changing their mission and the type of talent they are attracting. 

Rebecca Carazza, Ph.D, Executive Director, Information Systems, Nimbus Therapeutics
Abhay Kini, Director, Life Sciences, Egnyte

Access to quality data is one of the most important accelerators of growth for emerging biotech.The resulting analysis and insights are enabling a golden age for research and therapy development as modern drug development teams are leveraging technology to unlock scientific data in new ways. Learn how digital workflow automation and the development of NIMBEye, Nimbus’ cloud computing environment has accelerated the deployment of CRO data from days to minutes.

Aniket Deshpande, Senior GTM Specialist, Healthcare and Life Sciences, Amazon Web Services
Joachim de Schrijver, Product Owner, Agilent
Nate Raine, Director, Data Custodians, Lifebit
Eric Dawson, Bioinformatics Scientist - AI, Enterprise Products, NVIDIA

Genome sequencing pipelines and processing large genomic datasets can become cumbersome, limiting genomics adoption for clinical application and the ability for improvement or scale. Turning to transformative cloud-based technologies can help organizations scale their genomic solutions, while optimizing performance and costs. Learn how biopharma, clinical care and health outcomes, and population genomics are assisted by a GPU-accelerated computational genomics application framework, and high-performance, flexible, scalable cloud infrastructure.

12:55 pm Session Break and Transition to Luncheon Presentation
Doug Ricketts, Technical Account Manager, Healthcare Account Management Team, Synology

Even the most secure systems can fall victim to attack or disaster, and when security fails, a complete recovery plan is essential to get your organization back on its feet. Join us to learn about building a robust disaster recovery plan for physical devices, servers, SaaS, and more.

Ryan Magruder, Solutions Architect, Sales, Rescale

Deploying compute intensive workloads on the cloud can be challenging, especially when trying to unify workflows for various cloud providers. Each platform has their own tools for compute, storage, and security which require specific knowledge and skills to build frameworks around. In this session learn how Rescale provides a unified platform for high performance computing built for a multi-cloud environment, as well as how to leverage these tools to accelerate your simulations.

2:05 pm Refreshment Break in the Exhibit Hall with Poster Viewing (Auditorium/Hall C)


2:35 pm

Trends from the Trenches

Chris Dagdigian, Senior Director, BioTeam, Inc.
Matthew Trunnell, Data Commoner
Adam Kraut, Director Infrastructure & Cloud Architecture, BioTeam, Inc.
Anna Sowa, PhD, Senior Scientific Consultant, BioTeam, Inc.
Michelle Bayly, PhD, Senior Scientific Consultant, BioTeam, Inc.

Since 2010, the “Trends from the Trenches” presentation, given by Chris Dagdigian, has been one of the most popular annual traditions on the Bio-IT Program. The intent of the talk is to deliver a candid (and occasionally blunt) assessment of the best, the worthwhile, and the most overhyped information technologies (IT) for life sciences. The presentation has helped scientists, leadership, and IT professionals understand the basic topics related to computing, storage, data transfer, networks, cloud, data science, and machine learning that are involved in supporting data-intensive science. In 2022, Chris will give the “Trends from the Trenches” presentation in its original “state-of-the-state address” followed by guest speakers giving podium talks on relevant topics. An interactive Q&A moderated discussion with the audience follows. Come prepared with your questions and commentary for this informative and lively session. 

4:10 pm Close of Conference

Exhibit Hall and Keynote Pass

Data Platforms and Storage Infrastructure