2018 Archived Content
Track 14: Data Transfer

Time and money spent transferring data are hampering the ability of data-intensive scientific researchers to share data and generate new knowledge. Whether migration to cloud or across campus, flexibility and reliability, as well as speed, security, and scalability, must be considered. Through case studies, Track 14 presents both hardware and software enterprise data management solutions that facilitate high-speed data transfer to enable productivity and foster collaboration.

Tuesday, May 15

7:00 am Workshop Registration Open (Commonwealth Hall) and Morning Coffee (Foyer)

8:0011:30 Recommended Morning Pre-Conference Workshops*

W6. An Intro to Blockchain in Life Sciences

W7. Containerizing Applications and Workflows for Large-Scale Reproducible Science

12:304:00 pm Recommended Afternoon Pre-Conference Workshops*

W12. Bio-IT IOT Workshop: Accurate Data for Good Decisions

* Separate registration required.

2:006:30 Main Conference Registration Open (Commonwealth Hall)

4:00 PLENARY KEYNOTE SESSION (Amphitheater & Harborview 2)

5:007:00 Welcome Reception in the Exhibit Hall with Poster Viewing (Commonwealth Hall)

Wednesday, May 16

7:00 am Registration Open (Commonwealth Hall) and Morning Coffee (Foyer)

8:00 PLENARY KEYNOTE SESSION (Amphitheater & Harborview 2)

9:45 Coffee Break in the Exhibit Hall with Poster Viewing (Commonwealth Hall)

Waterfront 1

10:50 Chairperson’s Remarks

Vaughan Wittorff, PhD, Co-Founder & Chief Commercial Officer, PetaGene Ltd.

11:00 Business and Research Responses to the Changing Legal Environment for Data Management

John M. Conley, JD, PhD, William Rand Kenan, Jr. Professor of Law, University of North Carolina, Chapel Hill; Counsel, Robinson Bradshaw & Hinson

We are in the midst of major legal changes affecting data collection, storage, transfer, and use. For example, in the U.S., a major revision to the Common Rule for human subjects research will take effect in early 2018, the Federal Trade Commission has unexpectedly moved into the regulation of health data, and the federal government has interpreted HIPAA to expand patients' access to raw genomic data. In the European Union, a new General Data Protection Regulation will take effect in 2018, with major implications for both collecting health and research data and transferring it to the U.S. This presentation will review these developments and then discuss how Bio-IT companies and institutions should respond. The most fundamental questions are: who needs to worry, and what should they do?

11:30 A Reusable Cloud-Based Infrastructure for Growing Biotechs

John Keilty, General Manager, Platform Operations, Third Rock Ventures
Karina Chmielewski, Senior Director, Platform Operations, Third Rock Ventures

Pharmaceutical R&D has a data problem. With so many types of data - from experimental, to operational, to clinical, and more - from many different disparate sources, managing data has become a prevalent issue in the industry. The companies hit the hardest are the small, growing biotechs who attempt to rapidly scale innovative science but lack the formal infrastructure to get past these logistical hurdles. This presentation will address these issues and provide a case study on how Third Rock Ventures, a veritable expert on launching biotech startups, is addressing this common problem. By removing the operational bottlenecks involved with data management and storage, Third Rock Ventures enables their portfolio companies to focus on what matters - making a dramatic difference in patients’ lives.

 Internet 212:00 pm Internet2: Leveraging Distributed Resources to Speed Discovery

Dan Taylor, Director, Business Development, Network Services, Internet2

Few Life Sciences organizations take advantage of the vast resources available to R&D organizations for continuous innovation and keeping pace with big data. This session will discuss the infrastructure underlying collaborations that use private, academic, and public resources – including commercial cloud and supercomputing centers storage and processing - to maximize options and speed discovery.


Weka12:15 Storage Systems that Support Tomorrow's Life Science Applications Today

David Hiatt, Director, Product Marketing, Marketing, WekaIO

Research has become increasingly compute intensive. While new tools and analytical processes such as AI and deep learning hold great promise, they stress the supporting IT infrastructure beyond the expectations of system designers. Learn how today's storage systems leverage software to deliver the performance, scale, and cost efficiencies for applications.

12:30 Session Break


12:40 Luncheon Presentation I: Addressing the Big Data Challenges in Genomics and BioImaging

Linda Zhou, Director, Research and Life Sciences Solutions, Western Digital

We will cover the Data challenges in both Genomics and BioImaging, including data growth and scale, the need for both collaboration and security, and the hybrid cloud processing requirements. We will describe best practices for cloud scale storage solutions to address these challenges, with example architectures from real customers in Genomics and BioImaging research.


Panasas1:10 Luncheon Co-Presentation II: Data Storage Benchmark Results for NGS and CryoEM Research

David Sallak, Vice President, Industry Solutions, Panasas, Inc.

Adam Marko, Senior Scientific Consultant, BioTeam, Inc.

Panasas and BioTeam will share benchmark results impacting NGS and CryoEM research. The Benchmarks were performed at the BioTeam Convergence Lab. BWA indexing of the human genome was performed for multiple simultaneous indexes and varying numbers of CPUs. The RELION application was used to perform a 3D classification of a publicly available CryoEM dataset of a human malaria parasite ribosome.

1:40 Session Break


1:50 Chairperson’s Remarks

Eli Dart, Network Engineer, Science Engagement, Energy Sciences Network (ESnet), Lawrence Berkeley National Laboratory

1:55 A Reliable and High-Speed Internet Lane Protocol

Nirmala Shenoy, PhD, Professor, Information Sciences and Technologies, Rochester Institute of Technology

A Reliable and High-Speed Internet Lane (RHSL) protocol was developed based on a novel technique that leverages the structures and connectivity existing between definable network modules. RHSL was demonstrated over the US GENI testbeds for emergency applications. RHSL operations are transparent and independent to the Internet protocol (IP) operations. RHSL bypasses IP and its routing protocols.

2:15 Architecture, Tools, and Platforms - Designing for High-Performance Data Transfer

Eli Dart, Network Engineer, Science Engagement, Energy Sciences Network (ESnet), Lawrence Berkeley National Laboratory

At-scale data transfer in high-performance environments is a challenging capability to deploy. This talk discusses the relationship between architectural frameworks, specific data transfer tools, and data transfer platforms, with a focus on the implications of design and deployment decisions for usability, security, and performance.

2:35 Lab Instrument Data Transfers and File Systems for Processing in the Cloud

Lance Smith, Associate Director, IT, Celgene

Integrate hundreds or even thousands of older lab systems with cloud-based data processing systems without modifying existing systems or lab processes using new cost-effective tools from AWS. This talk discusses methods of transfer and cloud-native file systems available for laboratory use with existing software solutions.

 2:55 Quantum Cryptography

Alicia Sit, Research Scientist, Structured Quantum Optics Group, Advanced Research Complex, University of Ottawa

Quantum Cryptography is on the verge of real-world applications, where perfectly secure information can be distributed among multiple parties. Several quantum cryptographic protocols have been theoretically proposed and independently realized in different experimental conditions. In my talk, I present the recent progress, challenges and development in performing high-dimensional quantum cryptography via quantum key distribution, quantum hacking as well as our recent achievements in simulating quantum computations with structure photons.

3:25 Refreshment Break in the Exhibit Hall with Poster Viewing (Commonwealth Hall)

Cityview 1

4:00 Collaborative Simulation Development Accelerated by Cloud-Based Computing and Software as a Service Model

Howard J. Stamato, MS, Consultant; formerly Associate Director, Drug Product Science and Technology, Bristol-Myers Squibb

Collaborative development of simulations can be enabled by cloud-based computing capability and flexible access to software. Faster development and consumption of models is expected from having a system in which to share the right information in the right context. A system like this has been proposed for a consortium of pharmaceutical companies and is expected to bring advantages and further efficiencies to delivering medicines for patients.

4:30 Lessons and Questions from Multiple Journeys to Cloud Analytics: An Open Discussion

This is an open panel discussion with members from different companies comparing experiences from different journeys. We hope to make this as interactive as possible, as we discuss lessons learned, flexibility needs, things to watch out for, and open questions.

Lovisa Afzelius, PhD, Senior Vice President, Cogen Therapeutics

Sebastien Lefebvre, Senior Director, Data Analytics & Decision Support, Alexion Pharmaceuticals

Valerie Smith, IT Director, Global Head of R&D, Clinical, and Regulatory IT, Alkermes

Jason Tetrault, Global Head, Data Engineering and Emerging Technology, Takeda Pharmaceuticals

Aspera_IBM BW5:00 Overcoming the Challenges of Sharing and Migrating Large Data Sets in Healthcare and Life Sciences

David Mostardi Senior Engineer, IBM Aspera

Healthcare & research are experiencing unprecedented growth in data. Legacy file sharing & cloud migration tools rely on technology that can’t handle such size and volume. Learn through real-world use cases how IBM Aspera accelerates R&D cycles, speeds data workflows, and impacts clinical and research outcomes by leveraging multi-cloud and on premises solutions.

5:30 Best of Show Awards Reception in the Exhibit Hall with Poster Viewing (Commonwealth Hall)


7:0010:00 Bio-IT World After Hours @Lawn on D
 **Conference Registration Required. Please bring your conference badge, wristband, and photo ID for entry.   

Thursday, May 17

7:30 am Registration Open (Commonwealth Hall) and Morning Coffee (Foyer)

8:00 PLENARY KEYNOTE SESSION & AWARDS PROGRAM (Amphitheater & Harborview 2)

9:45 Coffee Break in the Exhibit Hall and Poster Competition Winners Announced (Commonwealth Hall)

Waterfront 2

10:30 Chairperson’s Remarks

Vahan Simonyan, PhD, Lead Scientist & R&D Director, High-Performance Integrated Virtual Environment (HIVE), FDA

10:40 Healthcare Data Exchange Framework: Scalable Economy of Secure Information and Services

Vahan Simonyan, PhD, Lead Scientist & R&D Director, High-Performance Integrated Virtual Environment (HIVE), FDA

This project demonstrates a unique framework that enables digital transformation of healthcare at a scale that was not possible before. Healthcare Data Exchange Framework has a potential to liberate data, empower patient ownership of data and create a free market where data assetization and securitization might serve as incentives for data sharing.

11:10 Healthcare Security Framework

Jim McGinnis, PhD, Assistant Professor, Engineering Technology, The University of Memphis
Healthcare is in a vulnerable position for infiltration or hacking of data. Sensitive patient data, financial data of the entity and insurance information are just some of the data that needs to be protected. The National Institute of Standards and Technology (NIST) has provided a Cybersecurity framework for general purposes. In the paper we will research some of the underlying layers of Cybersecurity that pertain to Healthcare. This research hopes to provide a concise framework for healthcare providers to use as a guideline for incorporating their own cybersecurity and to help in engaging cybersecurity third-party companies for assistance. The five layers of the NIST framework, Identify, Protect, Detect, Respond and Recover, leave healthcare organizations with a large amount of inhouse examinations in order to protect the data of the organization. While the organization must be diligent in the protection of the data, use of outside resources is a must in providing the utmost due diligence for the protection of the patient data, financial/insurance data of the patient and the entity as a whole. This document will attempt to build and expound on the NIST framework to provide additional guidance to healthcare providers.

11:40 A Modern Approach to Data Storage for Next Generation Sequencing & Medical Imaging

Steve Noel, Principal Systems Engineer, Qumulo

File storage is a critical component of the life sciences research workflow. For researchers to be able to do their work, their storage must be able to scale to and handle billions of files efficiently. They must also be able to access their research data from anywhere in the world. Learn how universal-scale file storage allows research organizations to manage massive, globally distributed file sets with ease.

12:10 pm Session Break

12:20 Luncheon Presentation (Sponsorship Opportunity Available) or Enjoy Lunch on Your Own

1:20 Dessert Refreshment Break in the Exhibit Hall with Poster Viewing (Commonwealth Hall)



1:55 Sponsor Introduction

Scott Jeschonek, Director, Cloud Services, Avere Systems

2:054:00 Panel Session: BioTeam Town Hall: 2018 Bio-IT Trends

Chris Dwan, Senior Technologist and Independent Life Sciences Consultant (Moderator)

Ari Berman, PhD, Vice President and General Manager of Consulting Services, BioTeam, Inc.

Tanya Cashorali, Founder, TCB Analytics

Kristen Cleveland, PMP, Director of Operations, BioTeam, Inc.

Chris Dagdigian, Co-Founder and Senior Director, Infrastructure, BioTeam, Inc.

Karl Gutwin, PhD, Senior Scientific Consultant, BioTeam, Inc.

Adam Kraut, Director of Infrastructure and Cloud Architecture, BioTeam, Inc.

Since 2010, the “Trends in the Trenches” presentation, given by Chris Dagdigian, has been one of the most popular annual traditions on the Bio-IT Program. The intent of the talk was to deliver a candid (and occasionally blunt) assessment of the best, the worthwhile, and the most overhyped information technologies (IT) for life sciences. The presentation tried to recap the prior year by discussing what has changed (or not) around infrastructure, storage, computing, and networks. This presentation has helped scientists, leadership, and IT professionals understand the basic topics involved in supporting data intensive science. In 2017, the “Trends in the Trenches” presentation evolved and expanded from 60 minutes to 120 minutes and featured more content, speakers, and interactive discussion. We will continue this format for 2018, featuring short focused podium talks on current trends related to computing, storage/data transfer, networks, cloud, and managing successful IT projects. An interactive Q&A moderated discussion with the audience follows. Come prepared with your questions and commentary for this informative and lively session.

4:00 Conference Adjourns

Exhibit Hall and Keynote Pass

Data Platforms and Storage Infrastructure