Bio IT World Expo 2016  

Track 6 - April 5 – 7, 2016

Next-Gen Sequencing Informatics

Advances in Large-Scale Computing

Tremendous advancements have been made to broaden NGS applications from research to the clinic, especially as genomics becomes more integrated with precision medicine initiatives. In spite of this, enormous challenges for NGS still exist, including real-time sequencing, data storage, processing, scaling, quality control management, security and compliance in the cloud, and interpretation. Track 6 presents case studies on these challenges.


Tuesday, April 5

7:00 am Workshop Registration and Morning Coffee

8:00 – 11:30 Recommended Morning Pre-Conference Workshops*: Intelligent Methods Optimization of Algorithms of NGS

12:30 – 4:00 pm Recommended Afternoon Pre-Conference Workshops*: Determining Genome Variation and Clinical Utility

* Separate registration required

2:00 – 6:00 Main Conference Registration



5:00 – 7:00 Welcome Reception in the Exhibit Hall with Poster Viewing

Wednesday, April 6

7:00 am Registration Open and Morning Coffee



9:00 Benjamin Franklin Awards and Laureate Presentation

9:30 Best Practices Awards Program

9:45 Coffee Break in the Exhibit Hall with Poster Viewing


10:50 Chairperson’s Opening Remarks

Hans Cobben, CEO, Bluebee

11:00 Time to Build Personal Genome

Wenming Xiao, Ph.D., Staff Fellow, Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, FDA

Precision medicine is based on interrogating the genetic alterations of an individual, which requires precise and complete characterization of the personal genome. Whole genome sequencing has become cheaper and more affordable, and the challenge of routinely applying it in the precision medicine era largely rests on bioinformatics solutions, particularly for personal genome assembly. This study aims to establish best practices and quality metrics for personal genome assembly and to provide guidance on using personal genomes in clinical applications by investigating the impact of various next-generation sequencing (NGS) parameters, such as coverage, read length, and methods, on assembly quality.

11:30 An Innovative and Globally Distributed Genome Management System

Thomas Thies, Senior Scientist, Data/Information Architecture and Terminology, pREDi, Roche

The huge amount of genomic data that needs to be analyzed in a timely manner by a globally distributed scientific workforce cannot be moved around the globe. Instead, the analysis pipelines are brought to the data. This talk will introduce you to a solution that follows this new paradigm. In addition, it will explain how we are leveraging existing HPC environments, including governance models, which fuel the innovative capacity of our computational scientists.

12:00 pm An Integrated High Performance Analytics Solution for Genomics and Translational Research

Kathy Tzeng, WW Technical Lead, Healthcare and Life Science Solutions, IBM Systems, IBM

Janis Landry-Lane, WW Program Director, Healthcare and Life Science Solutions, IBM Systems, IBM

The rapid advances in sequencing technology are driving the use of genomics information in various domains. Processing raw data from a sequencer and translating it into insights in a timely fashion requires a high performance, scalable analytics solution to integrate genomics information with other data sources. IBM’s approach of building integrated solutions with our customers and partners will be highlighted.

12:30 Session Break

12:40 Luncheon Presentation I: Not Just Noise: Transforming Big Data into Smart Data

Brady Davis, Senior Director, Informatics, Illumina, Inc.

When it comes down to it, big data is only a big deal when you can attach context and meaning to it. Smart data -- that is the right data at the right time to the right person -- can help professionals enhance and inform care decisions. That’s the prize; and while everyone’s got their eyes on it, not everyone knows how to get their hands on it. This session will focus on how Illumina is working to provide solutions that look at data at every stage, from collection and protection to collaboration, storage and analysis.

1:10 Luncheon Presentation II to be Announced (Cray)

1:40 Session Break


1:50 Chairperson’s Remarks

1:55 A Scalable and Adaptive Framework for Next-Generation Sequencing Analysis

Zhiyan Fu, Ph.D., Chief Scientific Computing Officer, Scientific and Research Computing, Genome Institute of Singapore

With the emergence of sequencing as a general-purpose tool for biomedical research and clinical diagnostics, analysis of large sequencing datasets has become a growing challenge, as advances in sequencing technology are quickly outstripping computational resources. Professional bioinformaticians have developed many automated pipelines, which are run from the command line on high-performance computing infrastructure. However, both the in-house infrastructure and the professional manpower are limited and cannot scale as fast as the sequencing data grow. In this project, we developed a framework that can burst workloads to third-party resources, such as public clouds or supercomputing centers, to provide cost-effective cloud-based services.

2:25 High-Throughput NGS Sequencing Using Ion Proton in a Clinical Genetic Testing Lab

Yirong Wang, Associate Director, Production Informatics, Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai

Clustered Ion Protons provide a highly scalable framework for high-throughput sequencing in any genetic testing lab or core sequencing facility while keeping the cost manageable. A highly customized LIMS and an efficient data analysis pipeline also play critical roles in quality control and in report generation and delivery. In an initial pilot study, we were able to sequence and process 6000 samples for a large panel (500+ genes) screening in under 8 weeks.

2:55 Presentation to be Announced

3:10 Genomic Analysis on a Loosely Coupled AWS Platform with Highly Distributed NGS Data Analytics at a Massive Scale

Justin Johnson, Associate Director and Principal Genomics Scientist, Translational Oncology, AstraZeneca

The global NGS team at AstraZeneca implements a robust, flexible, and consumable platform to perform genomic analysis at scale. The Bina solution was tested by processing tens of thousands of TCGA exomes with modern algorithms against the latest reference genome (hg38), in turn demonstrating that the driver mutational landscape of TCGA can be redefined when compared against public domain data.

3:25 Refreshment Break in the Exhibit Hall with Poster Viewing


4:00 Lessons Learned Analyzing Thousands of Samples for Clinical Use Cases Using Amazon Web Services

Ravi Madduri, Fellow, Computation Institute, University of Chicago; Project Manager, Math and Computer Science Division, Argonne National Lab

Globus Genomics is a cloud-based, large-scale genomics analysis service that is used by research consortia and healthcare providers to analyze thousands of raw genomics datasets. In order to deliver analysis results on tight deadlines, we created cost-aware resource scheduling on AWS resources and reusable recipes for setting up the security controls required for compliance. In this talk, we will present some of the use cases and success stories from our work.

4:30 Federated EHR Network for Patient Cohort Discovery

Bhanu Bahl, Director of Informatics, Harvard Catalyst

Patient cohort discovery across multiple healthcare institutions is a challenge. Accruing sufficient numbers of patients for orphan disease clinical trials further compounds the challenge. The Shared Health Research Information Network (‘SHRINE’), Harvard Catalyst’s open source web-based query tool, helps overcome the barriers that arise from variability in the source electronic health record (EHR) systems and returns aggregate numbers of patients across all sites with user-defined characteristics, currently demographics, diagnoses, medications, and selected lab values. By enabling semantic interoperability and consistency of data elements, SHRINE leverages the Informatics for Integrating Biology and the Bedside (‘i2b2’) Hive software, an open source scalable informatics framework. Using a federated search architecture, real-time queries can be performed across collaborating institutions, each with its own locally managed patient datasets.

5:00 Sponsored Presentation (Opportunity Available)

5:30 – 6:30 Best of Show Awards Reception in the Exhibit Hall with Poster Viewing

Thursday, April 7

7:00 am Registration and Morning Coffee



10:00 Coffee Break in the Exhibit Hall and Poster Competition Winners Announced


10:30 Chairperson’s Opening Remarks

10:40 Application of Targeted NGS Sequencing in Personalized Clinical Cancer Therapies

Qichao Zhu, Ph.D., Associate Professor, Genetics & Genomics Sciences, Icahn School of Medicine at Mount Sinai

Our current clinical cancer genome research project is focused on three key components: sequence analysis for patient genetic profiling, biomarker (genetic variation) collection for cancer precision medicine, and a data processing and integration platform for clinical reporting. The goal of the project is to develop a comprehensive platform that can fully support a precision medicine approach to cancer treatment. The approach is based on the established concepts that tumor biomarkers are associated with patient prognosis and tumor response to therapy, and that a patient's genetic profile can be associated with drug metabolism, drug response and toxicity. Personalized tumor genetic profiles, combined with tumor site and other relevant information, are then used to determine optimum individualized therapy options. This presentation concentrates on the following major components of our project: 1) Accurately detecting tumor genetic and molecular variants, in terms of both coverage and precision, by developing new algorithms to improve our variant calling; 2) Matching patients with treatments that are more likely to be effective and cause fewer side effects by collecting, curating and associating biomarkers (genetic and molecular variations) with diseases, drugs and treatment plans; and, 3) Handling cases in a high-throughput manner by developing a web-based pipeline platform for cancer data processing, sequence analysis, data integration and report generation.

11:10 Comparative Analysis of RNA-Seq Techniques to Study Prostate Cancer

Carlos P. Sosa, Ph.D., Biomarker Discovery Group, Mayo Clinic and Adjunct Assistant Professor, Biomedical Informatics and Computational Biology (BICB), University of Minnesota, Rochester, MN

Co-authors: Carlos P. Sosa, Ph.D., Ling Cen, Ph.D., and George Vasmatzis, Ph.D.

11:40 Presentation to be Announced

12:10 pm Session Break

12:20 Luncheon Presentation (Sponsorship Opportunity Available) or Lunch on Your Own

1:20 Dessert Refreshment Break in the Exhibit Hall with Poster Viewing

NGS and Informatics to Advance Precision Care

1:55 Chairperson’s Remarks

2:00 Talk Title to be Announced

Gunaretnam (Guna) Rajagopal, Ph.D., Vice President & Global Head, Computational Sciences, Discovery Sciences, Janssen Research & Development, A Johnson & Johnson Company

2:30 A Clinical Genetics Diagnostic System Incorporating Next-Gen Sequencing and Informatics to Advance Pediatric Precision Care

Marcia Nizzari, MS, CIO, Claritas Genomics

Claritas Genomics serves children affected with complex genetic disorders by providing timely and accurate results, resolving families’ long search for answers. We developed a unique “orthogonal sequencing” approach that simultaneously sequences exomes on both the Illumina NextSeq and the Life Technologies Ion Proton instruments. This talk will cover both the lab approach and the bioinformatics analysis pipelines, key components of Claritas’ enterprise architecture for pediatric precision care.

3:00 Software for Interpretation of Next-Gen Sequencing Data in a Clinical Setting

Neil Miller, Director, Informatics, Center for Pediatric Genomic Medicine, Children’s Mercy, Kansas City

The scale and complexity of next-gen sequencing data present unique informatics challenges, particularly for variant characterization and clinical interpretation. The Center for Pediatric Genomic Medicine at Children's Mercy, Kansas City has developed novel software applications that are specifically designed to enable non-expert clinicians and researchers to make use of targeted NGS in the diagnosis and management of rare disease. The software programs described are the analytical backbone of the clinical and research applications at CMH, including STAT-seq, a program for the ultra-rapid whole genome sequencing of critically ill patients in the neonatal intensive care unit (NICU). Children's Mercy, Kansas City is a leader in the field of applying genomics to clinical care; STAT-seq was named one of Time Magazine's top 10 medical breakthroughs of 2012. The software developed at CMH has been referenced in multiple publications and will soon become available at no cost for research use. Attendees will get an overview of an end-to-end solution for interpretation of next-gen sequencing data that is used extensively in a children's hospital, along with an introduction to software that will shortly become publicly available.

3:30 Talk Title to be Announced

Michael Zody, Ph.D., Research Director of Computational Biology, New York Genome Center

4:00 Conference Adjourns
