Bio IT World Expo 2016  
Bio IT World Expo 2016
2014 Archived Content

IT Infrastructure-Hardware 

Track 1 explores networking infrastructures, integrated computing and storage platforms and solutions, as well as analytics. Biopharma and technology experts will present case studies on how they are defining and refining the next generation of best practices, architecting for scalability and efficiency, accommodating different scientific disciplines and workflows in a common platform, and driving user governance structures.

Final Agenda

Download Brochure | Pre-Conference Workshops 


7:00 am Workshop Registration and Morning Coffee

8:00 - 11:30 Recommended Morning Pre-Conference Workshops*

Aligning Projects with Organization Strategy

12:30 - 4:00 pm Recommended Afternoon Pre-Conference Workshops*

IT & Informatics in Support of Collaboration and Externalization

*Separate Registration Required. Click here for detailed information.

2:00 - 7:00 pm Main Conference Registration

4:00 Event Chairperson's Opening Remarks

Cindy Crowninshield, RD, LDN, Conference Director, Cambridge Healthtech Institute


Click here for detailed information. 

5:00 – 7:00 Welcome Reception in the Exhibit Hall with Poster Viewing




7:00 am Registration Open and Morning Coffee

8:00 Chairperson's Opening Remarks

Phillips Kuhl, Co-Founder and President, Cambridge Healthtech Institute


Click here for detailed information. 

9:00 Benjamin Franklin Award & Laureate Presentation

9:30 Best Practices Awards Program

9:45 Coffee Break in the Exhibit Hall with Poster Viewing

10:50 Chairperson's Remarks

Rick Friedman, Specialist, HPC Research Computing & Big Data, Dell

11:00 Featured Presentation:

HPC Trends in the Trenches 2014

Chris Dagdigian, Founding Partner & Director, Technology, BioTeam, Inc.

In one of the most popular presentations of the Expo, Chris delivers a candid assessment of the best, the worthwhile, and the most overhyped information technologies (IT) for life sciences. The presentation time for this talk has been extended from 30 to 60 minutes to accommodate the amount of information that is shared. Plus, Chris won’t feel pressured to speak so fast!

12:00 pm The Pan-Cancer Analysis of Whole Genomes Project

Lincoln Stein, Ph.D., Director, Informatics and Bio-Computing, Ontario Institute of Cancer Research; Professor, Molecular Genetics, University of Toronto

The Pan-Cancer Analysis of Whole Genomes project aims to understand common and distinguishing patterns of variation among a diverse set of cancer types. We are performing uniform computational processing of the whole genome sequencing data from the tumors and normal control DNA of more than 2000 donors in order to eliminate differences that are due to different ways of analyzing the data.   

12:40 Luncheon Presentation I: Cloud Networking Agility Drives Biotech Innovation & Discovery 

Chris Campbell, Data Center Specialist, Arista Networks, Inc.

BigData-methods for BioTech now require new hybrid-use of map/reduce & SQL techniques to achieve ever more meaningful correlations across more varied forms of data and across more locations where data resides. Arista and best-of-breed technology partners deliver one simple, leaf-spine, open-standard network topology to span and to scale, to support all applications across all the datastores that scientists wish to combine. Driven by automation-enabled Software-Defined Cloud Networking, Arista applies cloud-like agility and scale for lowest OPEX and unprecedented innovation potential. 

1:10 Luncheon Presentation II: IBM’s Comprehensive Reference Architecture for Genomic Medicine

Frank Lee, Ph.D., Senior Certified Solution Architect, Technical Advisor, Life Sciences, IBM Technical & Platform Computing Worldwide
Dr. Frank N. Lee will describe IBM’s open and extensible reference architecture for genomic medicine, including integration of genomic data within a translational platform. The architecture describes a unique, converged platform for high-throughput genomics and analytics through on-premise, cloud and hybrid delivery. It includes a scalable data repository, powerful workload engine and genomics application center for commonly used applications. 

Infrastructure and Platforms for Big Data: Capabilities and Solutions 

1:50 Chairperson's Remarks
Jacob Farmer, CTO, Cambridge Computer 

1:55 How to Bridge the Gap Between Corporate and Research IT: Leveraging the Best of Two Worlds for Bioinformatics

Thomas Schilling, Ph.D., Director, IT Portfolio Management, R&D, Bayer Business Services GmbH

We will discuss the implementation of our Bayer IT-Innovation Lab which consists of in-house cloud solutions linked to HPC clusters in the R&D ecosystem as well to our external clouds. This platform allows for fast evaluation of innovative IT solutions, prototyping and staging into the productive R&D IT landscapes. This lab was established with a long-term interdisciplinary program with skilled and experienced participants from science and IT.

2:25 High-Performance Integrated Virtual Environment (HIVE) Infrastructure for Big-Data Analysis: Applications to Next-Gen Sequencing Informatics

Vahan Simonyan, Ph.D., Lead Scientist - HIVE, CBER, FDA

The High-Performance Integrated Virtual Environment (HIVE) is a distributed cloud-based environment optimized for the secure storage and parallelized analysis of extra-large Next-Generation Sequencing (NGS) data. Learn about the HIVE infrastructure and technical aspects of implementation which facilitate deposition, retrieval, annotation and computation on NGS data, and analysis of outcomes using visual web environments appropriately built in collaboration with research scientists and regulatory personnel.

2:55 Bridging the Worlds of Files, Objects, NAS, and Cloud: A Blazing Fast Crash Course in Object Storage

Jacob Farmer, CTO, Cambridge Computer

This session is the world’s fastest-paced tutorial on object-based storage.  It starts by defining the term “object” and lists various ways object technology manifests itself in data storage.  Topics covered include: object stores, HPC file systems such as Lustre, cloud storage, cloud file systems, NAS accelerators and gateways, erasure codes, and rules-based metadata systems such as IRODS and Starfish. 


3:10 Accelerating Biomedical Research Discovery: The 100G Internet2 Network – Built and Engineered for the Most Demanding Big Data Science Collaborations

Christian Todorov, Director, Network Services Management, Internet2

Genomic & biomedical researchers have been forced to exchange big data via physical drives as advanced network connectivity was previously unavailable or cost prohibitive. Hear how colleagues are improving big data workflows using the 100G Internet2 Network, which provides the highest data transport rates available, along with dynamic cloud and trust applications that are interconnecting research and accelerating discovery.

3:25 Refreshment Break in the Exhibit Hall with Poster Viewing

4:00 Innovative Data Storage Approaches of a Start-Up

Mark Kapel, Director, Information Technologies & Data Management, Technology Platform, Evogene

This presentation will discuss how to improve overall IT productivity while reducing data housing costs by more than 50%. Evogene is a plant genomics company utilizing a proprietary integrated technology infrastructure to enhance seed traits underlying crop productivity. Since its initiation in 2002, the company set to solve one of the biggest bottle necks in the industry – the challenge of integrating and analyzing vast amounts of genomic data. One of the key methods to overcome the problem was implementing 70% lower operational expanses than those of world's main storage providers like EMC or Netapp. This presentation will demonstrate how Evogene's IT group can be driven by the need for innovation and motivated to provide added value and enterprise grade services by focusing on risk management and agile project approaches which are second nature to Israeli start-ups.

4:30 Lessons Learned: Deploying Very Low Cost Cloud Storage Technology in a Traditional Research HPC Environment

Dirk Petersen, Manager, Scientific Computing, Fred Hutchinson Cancer Research Center

When implementing storage chargebacks we wanted to offer researchers an alternative storage solution that would not cost more than AWS Glacier. We also wanted it to be long term durable, self-protecting, easy to manage, store petabytes, survive the loss of an entire datacenter and deliver predictable performance. Learn how to avoid pitfalls and be able to determine if a solution like this makes sense for your organization.

5:00 Co-Presentation: Speeding up Genomic Software on Modern Computer Hardware

Paolo Narvaez, Ph.D., Principal Engineer, Intel Corporation, Data Center Group

Mauricio Carneiro, Ph.D., Broad Institute

In this talk we will review the collaborative work between the Broad Institute and Intel to improve the computational performance of genomic software. We will look at the GATK best practices DNA pipeline and identify areas where computational improvements are being targeted. We will focus on the recent optimizations released with GATK 3.1 and demonstrate how computational kernels can be accelerated with standard Intel hardware. We will show the 5x improvements made to the GATK's variant caller without using any additional cores by taking advantage of new instructions in recent processors. Finally, we will discuss how genomics software can leverage current and future hardware improvements to deliver maximum performance.

5:30 - 6:30 Best of Show Awards Reception in the Exhibit Hall


7:00 am Registration Open

7:00 Breakfast Presentation Panel: Enabling Technology. Leveraging Data. Transforming Personalized Medicine. 


Ketan Paranjape, Global Director, Healthcare and Life Sciences, Intel Corporation

Sanjay Joshi, CTO, Life Sciences, EMC Isilon

Steve Schwartz, Vice President, Business Development & Strategy, 23&Me

Hans Karten, CEO & CTO, GENALICE

Jason Stowe, CEO and Founder, Cycle Computing 

Adam Berrey, CEO, Curoverse

As we arrive at the $1000 genome, we find the fundamental problems have shifted... it is no longer about shrinking the cost of sequencing but the explosive growth of big data: the downstream analytics with rapidly evolving parameters, data sources and formats; the storage, movement and management of massive datasets and workloads; and perhaps most paradoxical of all, the challenge of articulating the results and translating the latest findings directly into improving patient outcomes.  Learn how Intel is working with a broad range of ecosystem partners & industry experts to accelerate scientific discovery, translate results into clinical practice, and achieve the vision of personalized medicine today.

8:00 Chairperson’s Opening Remarks

Kevin Davies, Ph.D., Vice President Business Development & Publisher C&EN, American Chemical Society; Founding Editor, Bio-IT World


Click here for detailed information. 

10:00 Coffee Break in the Exhibit Hall and Poster Competition Winners Announced

File System Optimization 

10:30 Chairperson's Opening Remarks
Zachary Giles, HPC Administrator, Scientific Computing, Icahn School of Medicine at Mount Sinai 

10:35 File System Optimization for Genomics Workload using SSDs and File System Tiering 

Zachary Giles, HPC Administrator, Scientific Computing, Icahn School of Medicine at Mount Sinai

The audience will gain an understanding of current cluster file system technologies used within the genomics community and possibly changes that can be made to make better use of the file system hardware and software to provide better service for their users. A case study will be presented on usage patterns and new concepts for file system optimization such as “hot files” and SSDs.  

11:05 How to Build a Genomics Platform that Addresses Performance, Data Management and Translational Medicine
Janis Landry Lane, Executive, Life Sciences Industry, IBM Technical & Platform Computing Worldwide
Data used in next generation sequencing is doubling every five months, and HPC computing resources are required to analyze and store it. 
Attend this session and learn IBM’s best practices for HPC data management and archiving. We will describe new ways to integrate genomics data into translational platforms, and help researchers manage the analytics required to realize the genomics promise.  

11:35 Converged Infrastructure for Life Science Research and Communities
George Vacek, Ph.D., Global Business Director, Life Sciences, DataDirect Networks
Dr. Vacek will discuss the recent infrastructure adoption and trends among many of the world’s largest big data sites in Life Science. Topics covered will include architectural elements like iRODs, converged parallel file systems, object storage, secure research communities and specific examples of leading research center use cases.

11:50 Resilient, Compliant, High Performance IT Environments through Flexible Design

John Sabey, President, Sabey Data Center Properties

Sabey Data Centers brings 42 years of experience building and operating critical environments for healthcare and life sciences. Our geo-diverse, energy-efficient, highly robust facilities enable our partners to build and maintain compliant systems through end to end control and redundant architecture.

12:15 pm Luncheon Presentations (Sponsorship Opportunities Available) or Lunch on Your Own

1:15 Dessert Refreshment Break in the Exhibit Hall with Poster Viewing

Managing Big Data and Security Strategies 

1:55 Chairperson's Remarks

Toby Bloom, Ph.D., Deputy Scientific Director, Informatics, New York Genome Center

2:00 Data and Intellectual Property Security in a Global Cyber Environment

Russ Brown, Supervisory Special Agent, Boston Division Criminal Cyber Squad, FBI

Data security is a very significant issue faced by businesses in today’s electronically connected environment. Data “owned” or controlled by a business, as well as applications developed by a business, are categorized as Intellectual Property. The security and protection of Intellectual Property is critical to conducting, maintaining and growing a secure business in the current global environment.


2:30 Information Classification: The Key to a Sane Security Strategy

William Telford, Director, R&D IS Security, Sanofi R&D

Understanding your information is key to your security success and enables collaboration. This talk will address the key topics to consider and challenges that must be overcome.

3:00 PANEL DISCUSSION: The Big Data Storage and Security Maze: Balancing Collaboration and Privacy

Moderator: Toby Bloom, Ph.D., Deputy Scientific Director, Informatics, New York Genome Center

Russ Brown, Supervisory Special Agent, Boston Division Criminal Cyber Squad, FBI

Yaniv Erlich, Ph.D., Principal Investigator and Whitehead Fellow, Whitehead Institute for Biomedical Research

Philip Groth, Ph.D., IT Business Partner, CoE Research, Bayer HealthCare Pharmaceuticals

John Sabey, President, Sabey Data Center Properties

William Telford, Director, R&D IS Security, Sanofi R&D

Big data has led to organizations turning to virtual networks for information storage and processing. Thus, unauthorized access to data and implementation of effective governance structures are growing concerns. To balance knowledge sharing and respect for confidentiality, researchers must consider how and where to store and secure data – plus what data and why. This panel gathers representatives from academia, pharma and IT to discuss these issues. Topics include:

  • How secure is your data?
  • When should you favor data sharing and when should you restrict data flow?
  • What data actually needs to be protected in the first place?
  • What security systems and practices are most appropriate for specific research needs?

4:00 Conference Adjourns

*IBM and the IBM logo are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide.

Download Brochure | Pre-Conference Workshops 


View 2016 Photos & Videos  

View 2016 Brochure
View 2016 Brochure
Platinum Sponsors


Cycle Computing logo small

DDN Storage  

Elsevier R&D Solutions


 IBM Logo Illumnia logo  

Intel Logo  

Precision for Medicine


 Seven Bridges Genomics

View All Sponsors

Official Media Partner

Official PR Partner

View All Media Partners

Conference CD

CD iconOrder the 2015 event proceedings - now available on CD

Complimentary Downloads

View white papers, listen to podcasts, and more!

  • Making the World's Knowledge Computable
  • Bioinformatics in the Cloud
  • The Application of Text Analytics to Drug Safety Surveillance

Related Event

 Medical Informatics World Related