Data Science and Analytics Technologies

The practice of data science requires the use of data analytics tools and technologies like Python, R, SQL, and Tensorflow and approaches like graph databases and column stores to help data professionals gain extra insights and value from data. The Data Science and Analytics Technologies track will explore popular analytics tools, technologies, languages, and approaches to managing highly complex data that data scientists are using. Most importantly, presentations will explore what problems data science and analytics technologies are solving within the field, specific methods that are being applied, how to assess the value add against cost, and how to set up team structure within the organization. How do you involve the end-user in defining the requirements necessary to make the results of these analytics easily actionable?

Final Agenda

Monday, April 20

9:00 am - 5:00 pm Hackathon*

*Pre-registration required.

Tuesday, April 21

7:30 am Workshop Registration Open and Morning Coffee

8:30 am - 3:30 pm Hackathon*

*Pre-registration required.

8:30 - 11:30 am Recommended Morning Pre-Conference Workshops*

W3. Introduction to Data Visualization for Biomedical Applications

Nils Gehlenborg, PhD, Assistant Professor, Department of Biomedical Informatics, Harvard Medical School

Alexander Lex, PhD, Assistant Professor, SCI Institute, School of Computing, University of Utah

12:30 - 3:30 pm Recommended Afternoon Pre-Conference Workshops*

W10. Data Science Driving Better Informed Decisions

Meghan Raman, Head, R&D Data Lake and Analytics, Bristol-Myers Squibb

*Separate registration required.

2:00 - 6:30 Main Conference Registration Open

4:00 Welcome Remarks

Cindy Crowninshield, RDN, LDN, Executive Event Director, Cambridge Healthtech Institute




4:05 Keynote Introduction

4:15 PLENARY KEYNOTE PRESENTATION: NIH’s Strategic Vision for Data Science

Susan K. Gregurick, PhD, Associate Director, Data Science (ADDS) and Director, Office of Data Science Strategy (ODSS), National Institutes of Health






Rebecca Baker, PhD, Director, HEAL (Helping to End Addiction Long-term) Initiative, Office of the Director, National Institutes of Health






5:00 - 7:00 Welcome Reception in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)

Wednesday, April 22

7:30 am Registration Open and Morning Coffee

8:00 Welcome Remarks

Allison Proffitt, Editorial Director, Bio-IT World




8:05 Keynote Introduction

8:15 Toward Preventive Genomics: Lessons from MedSeq and BabySeq

Robert Green, MD, MPH, Professor of Medicine (Genetics) and Director, G2P Research Program/Preventive Genomics Clinic, Brigham & Women’s Hospital, Broad Institute, and Harvard Medical School




8:45 PANEL DISCUSSION: Game On: How AI, Citizen Science, and Human Computation Are Facilitating the Next Leap Forward

Pietro Michelucci, PhD, Director, Human Computation Institute






Additional Panelists to be Announced

9:45 Coffee Break in the Exhibit Hall with Poster Viewing


10:50 Organizer’s Welcome Remarks

Cambridge Healthtech Institute

10:55 Chairperson’s Remarks

11:00 PANEL DISCUSSION: Creating an Effective Data and Analytics Strategy – Where Do You Start?


Tanya Cashorali, CEO and Founder, TCB Analytics


Helena Deus, PhD, Technology Research Director, Elsevier

Adam Jenkins, Associate Director, Global Data Science, Biogen

Heather Shapiro, PhD, Head of Data Science, Pear Therapeutics

Lauren Young, Senior Scientist, Computational Biology, Beam Therapeutics, Inc.

Data is all over the place and in different formats. What data is actionable? What is the end goal? How do you build out an organization? Are you sure you know what problem you are trying to solve? How do you set up an analytics environment?

12:00 pm Sponsored Presentation (Opportunity Available)

12:30 Session Break

12:40 Luncheon Presentation (Sponsorship Opportunity Available)

1:40 Session Break


1:50 Chairperson’s Remarks

Kees van Bochove, Founder, The Hyve

1:55 FAIR and the (Tr)end of Data Lakes

Kees van Bochove, Founder, The Hyve

2:25 Data Management and Analytics – Innovation and Digitization through Platforms

Meghan Raman, Head, R&D Data Lake and Analytics, Bristol-Myers Squibb

An innovative platform approach to data management and analytics including a foundational data layer, a data catalog and a reproducible analytic environment will provide a digital solution to monetize the data at scale. This talk describes an innovative cutting-edge approach to enterprise data management and analytics, major contributing factors for significant cost savings and monetization of data, digital capabilities to provide competitive differentiation across the pharmaceutical value chain.

2:55 Sponsored Presentation (Opportunity Available)

3:25 Refreshment Break in the Exhibit Hall with Poster Viewing


4:00 Chairperson’s Remarks

Jason Tetrault, Global Head Data Engineering and Emerging Technologies, Takeda

4:05 PANEL DISCUSSION: Making Real Impact with Data Science


Jason Tetrault, Global Head Data Engineering and Emerging Technologies, Takeda


Neda Hassanpour, PhD, Data Scientist, Takeda

Tanya Cashorali, CEO and Founder, TCB Analytics

Additional Panelists to be Announced

5:05 Sponsored Presentation (Opportunity Available)


5:35 Best of Show Awards Reception in the Exhibit Hall with Poster Viewing

6:45 End of Day

Thursday, April 23

7:30 am Registration Open and Morning Coffee

8:00 Organizer’s Remarks

Cindy Crowninshield, RDN, LDN, Executive Event Director, Cambridge Healthtech Institute




8:05 Awards Program Introduction

8:10 Benjamin Franklin Award and Laureate Presentation

J.W. Bizzaro, Managing Director,




Discngine8:35 Bio-IT World Innovative Practices Awards

Allison Proffitt, Editorial Director, Bio-IT World




9:00 AI in Pharma: Where We Are Today and How We Will Succeed in the Future

Natalija Jovanovic, PhD, Chief Digital Officer, Sanofi Pasteur




9:45 Coffee Break in the Exhibit Hall and Poster Competition Winners Announced at 10:00


10:30 Organizer’s Remarks

Cambridge Healthtech Institute

10:35 Chairperson’s Remarks

10:40 Presentation to be Announced

11:10 Takeda’s Progress in Achieving Gxp-Validation of Platypus: Our Cloud-Based Suite of Analytics on Clinical Trial Data

Vijay Challa, Associate Director, Digital Clinical Platform and Technologies, Data Sciences Institute, R&D, Takeda

Takeda’s Platypus project, winner of a 2018 Bio-IT World Best Practices Award, has taken a leap forward by achieving GxP Validation. Platypus has been successfully scaled across Takeda as both a clinical study-level medical review tool and a clinical program-level signal management tool. Platypus sources clinical trial data in a standardized format from CROs, stores and aggregates the data in Takeda’s Data Hub, and visualizes the data using Tableau visualizations. Platypus is Takeda’s first homegrown GxP-validated application on the cloud which required both great adaptation and collaboration. The Platypus validation project is a compelling story that achieved the delicate balance of maintaining the necessary agility while applying the rigor of software validation.

11:40 Sponsored Presentation (Opportunity Available)

12:10 pm Session Break

12:20 Luncheon Presentation (Sponsorship Opportunity Available) or Enjoy Lunch on Your Own

1:20 Dessert Refreshment Break in the Exhibit Hall with Last Chance Poster Viewing



1:55 Chairperson’s Remarks

Kevin Davies, PhD, Executive Editor, The CRISPR Journal, Mary Ann Liebert, Inc.


Chris Dagdigian, Co-Founder and Senior Director, Infrastructure, BioTeam, Inc.

Vivien Bonazzi, PhD, Chief Biomedical Data Scientist, Managing Director, Deloitte

Tim Cutts, PhD, Head, Scientific Computing, Wellcome Trust Sanger Institute

Kjiersten Fagnan, PhD, Chief Informatics Officer, Data Science and Informatics Leader, DOE Joint Genome Institute, Lawrence Berkeley National Laboratory

Matthew Trunnell, Vice President and Chief Data Officer, Fred Hutchinson Cancer Research Center

The “Trends from the Trenches” will celebrate its 10th Anniversary at Bio-IT! Since 2010, the “Trends from the Trenches” presentation, given by Chris Dagdigian, has been one of the most popular annual traditions on the Bio-IT Program. The intent of the talk is to deliver a candid (and occasionally blunt) assessment of the best, the worthwhile, and the most overhyped information technologies (IT) for life sciences. The presentation has helped scientists, leadership, and IT professionals understand the basic topics related to computing, storage, data transfer, networks, and cloud that are involved in supporting data-intensive science. In 2020, Chris will give the “Trends from the Trenches” presentation in its original “state-of-the-state address” followed by guest speakers giving podium talks on relevant topics. An interactive Q&A moderated discussion with the audience follows. Come prepared with your questions and commentary for this informative and lively session.

4:00 Close of Conference

Platinum Sponsors