Bio IT World Expo 2016  
Bio IT World Expo 2016

Track 1 - April 21 – 23, 2015

IT Infrastructure – Hardware 

Big Data Storage Capabilities and Solutions in the R&D Ecosystem

Track 1 provides concrete case studies of secure internal and external storage solutions and implementation from the IT and scientist perspectives. Learn best practices of transforming a framework of individualized IT islands to a shared but flexible and scalable IT platform. Themes covered include cloud solutions linked to HPC clusters, accommodating different scientific disciplines and workflows in a common platform, and driving user governance structures.

Final Agenda

Download Brochure | Workshops 

Tuesday, April 21

7:00 am Workshop Registration and Morning Coffee

8:00 – 11:30 Recommended Morning Pre-Conference Workshops*

Aligning Projects with Agile Approach

12:30 – 4:00 pm Recommended Afternoon Pre-Conference Workshops*

Converged IT Infrastructure in Life Science

* Separate registration required


2:00 – 6:30 Main Conference Registration



Click here for detailed information. 


5:00 – 7:00 Welcome Reception in the Exhibit Hall with Poster Viewing

Wednesday, April 22

7:00 am Registration Open and Morning Coffee



Click here for detailed information. 


9:00 Benjamin Franklin Awards and Laureate Presentation

9:30 Best Practices Awards Program

Internet 29:45 Coffee Break in the Exhibit Hall with Poster Viewing



10:50 Chairperson’s Opening Remarks

Wanmei Ou, Ph.D., Director, Product Strategy in Translational and Precision Medicine, Health Sciences Global Business Unit, Oracle



Chris Dagdigian, Founding Partner & Director, Technology, BioTeam, Inc.

In one of the most popular presentations of the Expo, Chris delivers a candid assessment of the best, the worthwhile, and the most overhyped information technologies (IT) for life sciences.


12:00 pm Introduction to EVO:RAIL by VMware

Michael McDonough, Senior Director, EVO:RAIL, VMware

VMware EVO:RAIL™ combines compute, networking, and storage resources into a hyper-converged infrastructure appliance to create a simple, easy to deploy, all-in-one solution offered by Qualified EVO:RAIL Partners. EVO:RAIL is a scalable Software-Defined Data Center (SDDC) building block that delivers compute, networking, storage, and management to empower private and hybrid cloud, end-user computing, test/dev, and branch office environments.

12:30 Session Break

IBM12:40 Luncheon Presentation I: Big Data for Genomics -- SCALE, SPEED and SMART

Frank Lee, Ph.D., Lead Architect, Genomics Solution, IBM

Explosive growth of big data is challenging researchers in genomics and life sciences around the world. Learn about some of the latest solutions, architecture and best practice to 1) acquire, store, access data in scale; 2) build a high-throughput computing infrastructure to process large genomic data set; 3) gain insights and knowledge from the data through translational research. Illustrated through real-life projects and case studies, join this session to learn of the latest approaches to tackle big data, the evolving ecosystem, success stories and lessons learned that highlight the potential for collaboration among genomic research communities. Share in a preview of the upcoming IBM genomics turn-key platform currently under development.

Intel Logo1:10 Luncheon Presentation II: Optimizing Genomic Sequence Searches to Next-Generation Intel Architectures

Bhanu Rekepalli, Ph.D., Senior Scientific Consultant & Principal Investigator, BioTeam Inc.

Upcoming bioinformatics, and biomedical, research requires fast processing and analytic tools due to the immense growth of genomic data added to the biological knowledge base with the advent of next generation sequencing technologies. The design of these tools should adhere efficiently to homogeneous and heterogeneous architectures while supporting scalability, accuracy, and reproducibility. The National Center for Biotechnology Information (NCBI) Basic Local Alignment Search Tool (BLAST) for genomics sequence searches is re-designed to scale on hybrid parallel architectures composed of Intel Xeon processors and Intel Xeon Phi coprocessors, denoted here as Highly Scalable Parallel Hybrid BLAST (HSPH- BLAST). Functionality enhancements, such as cross- compilation, dynamic load scheduling, master-worker model, input/output management, and database distribution are discussed. A performance evaluation of HSPH-BLAST demonstrates reduction in execution time, high scalability, and balanced processor utilization. HSPH-BLAST and similar tools integrated into scientific workflows pipelines can allow biologists to easily perform systematic studies resulting in rapid and high-impact scientific discovery.

1:40 Session Break



1:50 Chairperson’s Remarks
Martin Gollery, CEO, Tahoe Informatics 

1:55 Comparisons of Storage Efficiencies through Hadoop

Martin Gollery, CEO, Tahoe Informatics

Hadoop is widely used in ‘Big-Data’ applications, so much so that most modern cluster installations are now installing some version of Hadoop rather than the old style clusters. The focus for many people is the increased speed that is obtained with Hadoop, with scalability that can easily reach into the tens of thousands of nodes. Perhaps less well known are the cost savings that may be achieved with Hadoop storage. This talk will compare and contrast the storage techniques and the costs that are associated with them.

2:25 Rapid Integration of Cancer Genomics Data Using Hadoop and Cloudera’s Impala

Sittichoke Saisanit, Ph.D., Data Scientist, Informatics, Pharmaceutical Research and Early Development Informatics, Roche Innovation Center New York

We explored Cloudera Impala for analysis of cancer genomics data. Without data transformation and reformatting, Impala tables can be created quickly from files on Hadoop file system with a simple command. Such speed and flexibility enable us to interrogate data without spending much time on schema design, index creation, query tuning and data cleaning. Impala can be accessed through Spotfire allowing flexibility of data visualization.
Authors: Sittichoke Saisanit, Zayed Albertyn, Xing Yang, Padmanabha Udupa 

Internet 22:55 Accelerating Biomedical Research Discovery: The 100G Internet2 Network – Built and Engineered for the Most Demanding Big Data Science Collaborations

Christian Todorov, Director, Network Services Management, Internet2

Genomic & biomedical researchers have been forced to exchange big data via physical drives as advanced network connectivity was previously unavailable or cost prohibitive. Hear how colleagues are improving big data workflows using the 100G Internet2 Network, which provides the highest data transport rates available, along with dynamic cloud and trust applications that are interconnecting research and accelerating discovery.

Seagate3:10 Managing Genomic Data at Scale! - Rules Based Intelligent Data Management

Jose L. Alvarez, Principal Engineer, WW Director, Healthcare and Life Sciences, Seagate Cloud and Systems Solutions

The explosion of Genomic data due to new instrument chemistry and more powerful analysis tool sets has created a complex and manual data management problem for high-throughput NGS centers. We will discuss how an intelligent data management solution can address this problem. iRODS (Integrated Rules-Oriented Data System) enables this intelligent data orchestration and can even help with pipeline and workflow automation.

3:25 Refreshment Break in the Exhibit Hall with Poster Viewing



4:00 PANEL PRESENTATION/DISCUSSION: ICTBioMed: International Consortium for Technology in Biomedicine

Moderator: Anil Srivastava, President, Open Health Systems Laboratory

Rolf A. Heckemann M.D., Ph.D., Professor of Medical Imaging and Image Analysis, MedTech West, Sahlgrenska University Hospital 

Amit Saxena, Senior Technical Officer, Bioinformatics Group, Centre for Development of Advanced Computing (C-DAC)

Ken Buetow, Ph.D., Director, Computation & Informatics Core Program, Complex Adaptive Systems, Arizona State University  
Cezary Mazurek, Ph.D., Director, Network Services Department, Poznan Supercomputing and Networking Center (PSNC) 

Prof Asoke K Talukder, Ph.D., Adjunct Professor, Computer Science & Engineering, NIT Warangal; Co-founder & Chief Scientific Officer, InterpretOmics, Bangalore’ Ex DaimlerChrysler Chair Professor, IIIT Bangalore  

Open Health Systems Laboratory has brought together several life sciences supercomputing centers to form the International Consortium for Technology in Biomedicine (ICTBioMed). ICTBioMed members have been working together for almost two years to create a shared global cyberinfrastructure as a seamless and friction-free platform for the researchers worldwide for their collaborative research in consistent with the tenets of team science. ICTBioMed leadership team will present in this session both the shared resources and the research use cases that they have been supporting to validate and further develop the value added cloud services. The panelists will speak to a narrative framework of possible science using, what NSF describes, as International Research Network Connection, pursuing the Big Data to Knowledge goals of NIH.

SGI small logo5:00 Beyond Parallel Filesystems: NVMe Storage for Genomics Workflows

James Reaney, Ph.D., Senior Director, Research Markets, SGI

Network-attached storage. Clustered storage. Distributed parallel filesystem storage. Storage infrastructure for genomics workflows has always been about faster, easier, and especially more scalable storage solutions to keep pace with the data tsunami in next-gen sequencing. SGI and Intel present a new concept in storage architecture for these workflows, one with disruptive potential for the marketplace. Not only faster and very scalable, but drop-dead simple to use too.

EMC25:15 The Expanding Face of Meta Data

Steve Worth, Director of Engineering, EMC

Groups maintaining data repositories at the petabyte-scale are discovering that cataloguing associated metadata is necessary to properly access, recall and analyze data. Capturing and maintaining metadata long term is becoming as critical as the data itself. All the more when you consider the rapid cycling of underlying hardware technologies. We will discuss the evolving nature of metadata along with recent advancements and approaches.

5:30 Best of Show Awards Reception in the Exhibit Hall with Poster Viewing

6:30 Close of Day

Thursday, April 23

7:00 am Registration Open

Intel Logo

7:00 Breakfast Panel Presentation: Enabling Technology. Leveraging Data. Transforming Personalized Medicine

Moderator: Ketan Paranjape, General Manager Life Sciences, Intel Corp.

Panelists: Sanjay Joshi, CTO, Life Sciences, EMC2 Isilon

Johannes Karten, CTO & Founder, Genalice

Walt Gall, Vice President, Healthcare & Strategic Partnerships, Saffron Technology

Pieter van Rooyen, CEO & President, Edico Genome

Shawn Dolley, Health & Life Science Big Data Expert, Cloudera

The $1000 genome is here, and the fundamental problems have shifted... it is no longer about shrinking the cost of sequencing but the explosive growth of big data: the downstream analytics with rapidly evolving parameters, data sources and formats; the storage, movement and management of massive datasets and workloads, and the challenge of articulating the results and translating the latest findings directly into improving patient outcomes. A panel will discuss these issues and more as we work to achieve the vision of personalized medicine.


Click here for detailed information. 


10:00 Coffee Break in the Exhibit Hall and Poster Competition Winners Announced



10:30 Chairperson’s Remarks

Peter Godman, Co-Founder & CEO, Qumulo

DDN Storage10:40 Intelligent Infrastructure Approaches for Emerging Life Sciences Data Management Issues at Scale

George Vacek, Ph.D., Global Business Director, Life Sciences, DataDirect Networks

Dr. Vacek will deliver several in-depth case studies of global leaders HPC applications in Life Science. Case studies will focus on infrastructure approaches to solve the emerging issues of data at scale, including best practices in supporting high performance local workflows, collaborative and research communities, and life sciences clouds and hybrid cloud solutions.

Qumulo10:55 How Next Generation Scale-Out Storage Fuels Breakthroughs in Life Sciences

Peter Godman, Co-Founder & CEO, Qumulo

Technology advances in DNA sequencing and other research data capture instruments are creating data at an unprecedented rate. As storage footprints grow further into petabyte scale, storage teams increasingly struggle to manage the massive amount of data stored. Next-generation scale-out storage provides instant insight into data at scale, abstracts away the underlying infrastructure, and achieves breakthrough price/performance using intelligent software and commodity hardware.

11:10 Infrastructure, Architecture, and Organization: Data Engineering at Scale at the Broad

Chris Dwan, Assistant Director, Research Computing and Data Services, Broad Institute of MIT and Harvard

As the Broad Institute enters its second decade, we are adapting to genomic research at a global scale. Among other things, this requires adopting hybrid cloud technologies, moving to object models for data storage, and embracing federated solutions for identity and authorization. The social and organizational aspects of these transitions are at least as challenging as the technical. This talk describes the interplay between the human and technical aspects of these changes, as well as specific lessons learned along the way.

Dell_2011(1)11:40 Start Small, Collaborate Often, Grow Big – Scaling NGS Compute and Storage Solutions for Personalized Medicine

James Lowey, Vice President, Technology, Translational Genomics Research Institute (TGen)

Scaling an NGS IT solution doesn’t have to be overwhelming. Collaborating with experienced Clinicians, Researchers, Vendors, and Partners, all using best practices - enables incremental success and effective development for high utilization and impactful results.

EMC211:55 Out of the Trenches and Into the Future: Mixing File and Object Storage Architectures

Patrick Combes, Principal Solution Architect, Life Science & HPC, EMC

Managing genomics and biomedical data across file and object storage architectures currently dominate the conversation within research IT groups. We will share insights and best practices to design and implement on premise, public cloud, and hybrid architectures. These architectures mix file and object approaches to achieve an optimal balance between performance, archive, and data governance & protection requirements.

12:10 pm Session Break

Swiftstack12:20 Luncheon Presentation I: Breaking the $1,000 Genome Sequencing Barrier with Object Storage

Brandon Kruse, Senior Systems Engineer, HudsonAlpha Institute for Biotechnology

Joe Arnold, President and Chief Product Officer, SwiftStack

Peyton McNully, Technology Director, HudsonAlpha Institute for Biotechnology

Andrew Crouse, Ph.D., Intellectual Property and Industry Partnership Manager, HudsonAlpha Institute for Biotechnology

The next generation of human genome sequencers are a revolution in rapid disease diagnostics and custom gene therapy. In this talk, the HudsonAlpha Institute for Biotechnology will present how we are using OpenStack Swift as a component in our genomics-as-a-service. The recently purchased Illumnia X Ten’s automatically place the institute in the petabyte-scale. OpenStack Swift enables long-term sustainable storage using commodity storage hardware and open source software. The talk will present our deployment and configuration of OpenStack Swift. Specifically, how OpenStack Swift storage policies are configured so that we can offer varying levels of storage durability and availability.

12:50 Luncheon Presentation II (Sponsorship Opportunity Available) or Lunch on Your Own

1:20 Dessert Refreshment Break in the Exhibit Hall with Poster Viewing



1:55 Chairperson’s Remarks

John M. Conley, J.D., Ph.D., William Rand Kenan, Jr. Professor of Law, University of North Carolina, Chapel Hill; Counsel, Robinson Bradshaw & Hinson



Roselie A. Bright, Sc.D., MS, PMP, Program Manager, Office of Information Management and Technology, Office of Informatics Technology and Innovation, Office of Operations, Office of the Commissioner, U.S. Food And Drug Administration (FDA)

OpenFDA was the first innovation created by Taha Kass-Hout, M.D., MS, upon joining FDA as the first Chief Health Information Officer in March 2013. OpenFDA was launched on June 2, 2014, allowing software developers, researchers and the public to tap into adverse events for drugs and medical devices; recalls, for drugs, devices and foods; and labeling for products on the market.

2:30 Global Developments in Privacy and Data Security Law

John M. Conley, J.D., Ph.D., William Rand Kenan, Jr. Professor of Law, University of North Carolina, Chapel Hill; Counsel, Robinson Bradshaw & Hinson

The international legal climate governing privacy and data security is changing. The European Union is in the midst of a fundamental shift in its approach. The U.S. still lacks a national data law, so the states and individual federal agencies are groping toward a strategy. This presentation focuses on the impact of these ongoing changes on genomics, bioinformatics and health research.

3:00 PANEL DISCUSSION: Achieving Much-Needed Innovation while Hurdling the Barriers of Stringent Regulation

Moderator: John M. Conley, J.D., Ph.D., William Rand Kenan, Jr. Professor of Law, University of North Carolina, Chapel Hill; Counsel, Robinson Bradshaw & Hinson


Roselie A. Bright, Sc.D., MS, PMP, Program Manager, Office of Information Management and Technology, Office of Informatics Technology and Innovation, Office of Operations, Office of the Commissioner, U.S. Food And Drug Administration (FDA)

Dana Caulder, Senior Software Engineer, Bioinformatics and Computational Biology, Genentech

Chris Dwan, Assistant Director, Research Computing and Data Services, Broad Institute of MIT and Harvard

Sanjay Joshi, CTO – Life Sciences, Emerging Technologies Division, EMC

Dave Peterson, Executive Director, Vendor & Third Party Assurance, National IT Compliance, Kaiser Permanente Information Technology

Vas Vasiliadis, Director, Products, Computation Institute, University of Chicago and Argonne National Laboratory

The growth in patient healthcare and life sciences innovations can be attributed to technology enhancements like cloud computing, big data analytics and mobile applications, but may conflict with increasing regulatory compliance demands to ensure protection of healthcare life and quality as well as patient data privacy and security. This panel of esteemed technology solution providers and regulators debates real-world challenges and how regulation must also innovate at technology’s pace.

4:00 Conference Adjourns

Download Brochure | Workshops 

Reg Early


View 2015 Brochure
View 2015 Brochure
View Videos & Photos 
Platinum Sponsors

Cycle Computing logo

DDN Storage  


Illumnia logo  

Intel Logo  


Official Media Partner

Conference CD

CD iconOrder the 2015 event proceedings - now available on CD

Complimentary Downloads

View white papers, listen to podcasts, and more!

  • Making the World's Knowledge Computable
  • Bioinformatics in the Cloud
  • The Application of Text Analytics to Drug Safety Surveillance

Related Event

 Medical Informatics World Related