Track 1 explores networking infrastructure and compute management solutions to support big data. Topics to be covered include storage, management, security, regulatory, HPC technologies and techniques, and FPGA and GPU cards to reduce the need for large compute clusters.
Final Agenda
Download Brochure | Pre-Conference Workshops
TUESDAY, APRIL 9
7:00 am Workshop Registration and Morning Coffee
8:00 Pre-Conference Workshops*
*Separate Registration Required
2:00 - 7:00 pm Main Conference Registration
4:00 Event Chairperson’s Opening Remarks
Cindy Crowninshield, RD, LDN, Conference Director, Cambridge Healthtech Institute
4:05 Keynote Introduction
Kevin Brode, Senior Director, Health & Life Sciences, Americas Hitachi Data Systems
» 4:15 PLENARY KEYNOTE
Do Network Pharmacologists Need Robot Chemists?
Andrew L. Hopkins, DPhil, FRSC, FSB, Division of Biological Chemistry and Drug Design, College of Life Sciences, University of Dundee
10 Minute Welcome to the Reception!
Mike Nolte, Regional Sales Manager – East, Okta

5:00 Welcome Reception in the Exhibit Hall with Poster Viewing
Drop off a business card at the CHI Sales booth for a chance to win 1 of 2 iPads® or 1 of 2 Kindle Fires®!*
*Apple® is not a sponsor or participant in this program
WEDNESDAY, APRIL 10
7:00 am Registration and Morning Coffee
8:00 Chairperson’s Opening Remarks
Phillips Kuhl, Co-Founder and President, Cambridge Healthtech Institute
8:05 Keynote Introduction
Sanjay Joshi, CTO, Life Sciences, EMC Isilon
» 8:15 PLENARY KEYNOTE
Atul Butte, M.D., Ph.D., Division Chief and Associate Professor, Stanford University School of Medicine; Director, Center for Pediatric Bioinformatics, Lucile Packard Children’s Hospital; Co-founder, Personalis and Numedii
8:55 Benjamin Franklin Award & Laureate Presentation
9:15 Best Practices Award Program
9:45 Coffee Break in the Exhibit Hall with Poster Viewing
10:50 Chairperson’s Remarks
Brian Bissett, Program Manager, Office of Systems, Social Security Administration
11:00 Best Practices for Building and Managing Multi-Tenant Research Clusters
Chris Dagdigian, Founding Partner and Director of Technology, BioTeam, Inc.
A case study exploring the real world implementation of a shared HPC environment for scientific computing. Topics will include architecting for scalability and efficiency, accommodating different scientific disciplines and workflows in a common platform, building consensus amongst competing interests, and the importance of a user driven governance structure.
11:30 Multi-Million Dollar Data Center Program Management
Brian Bissett, Program Manager, Office of Systems, Social Security Administration
This presentation will detail the program management strategy of the Social Security Administration’s Data Center Program, a $407.4 million dollar program, which processes over 140 million transactions daily, and stores nearly 250 million medical records, with an additional 2 million medical records added every week. Learn best practices to utilize when managing similar programs of this size.
12:00 pm Compute for Personalized Medicine
Ketan Paranjape, Global Director, Healthcare and Life Sciences, Intel Corp.
As we arrive at the $1000 genome, we find the fundamental problems have shifted... it is no longer about shrinking the cost of sequencing but the explosive growth of big data: the downstream analytics with rapidly evolving parameters, data sources and formats; the storage, movement and management of massive datasets and workloads; and perhaps most paradoxical of all, the challenge of articulating the results and translating the latest findings directly into improving patient outcomes using newer, smaller form factor mobile devices. Indeed, as we approach the scale of "impossibly small" for both technology and disease management, the complexity of problems grows by orders of magnitude.
12:30 Luncheon Presentation I: Big Data and Scaling Biosciences Research
Chris Bellmare, Director, Arista Networks
The increased use of common database sources with petabytes of stored data is driving new computing cluster architectures - EMR meets genomic and genetic data. Can your network keep up with increased CPU density bursts to 40GB or 100GB? How are you managing the growth of unstructured data? Do you have fast access to petabytes of processed data, Parallelization of the data, storage, and analysis algorithms? Arista is the leader in Life Sciences Data Center design, and attendees will learn about intelligent placement of compute to storage, as well as how to scale to meet the research needs while keeping the performance up and costs down.
1:00 Luncheon Presentation II: Life Science Computing
Etzard Stolte, Ph.D., CTO, HP Life Sciences
Health and Life Sciences continue to evolve as Information Technology provides new breakthroughs to make R&D more effective and accessible. Application Specific Computing, Big Analytics, Meaning Based Computing and Science Clouds are quickly changing how and where science is done. In this talk, we will present trends and examples leading to the advent of "Information Driven Medicine".
1:40 Chairperson’s Remarks
Michael Schulman, Director, Marketing, ScaleMP
1:45 Storing Really Big Data with Fast Access Economically
Robert Cecil, Ph.D., Professional Staff, The Cleveland Clinic Foundation, Imaging Institute, The Cleveland Clinic Foundation
The Cleveland Clinic Foundation hosts and engineers one of the largest and fastest growing medical archives in the world. This archive is optimized for large file sizes and high bandwidth with requirements for permanent data retention. The extension to genomics and utilization in a cloud computing environment are straight forward. Learn about our storage and data retention evolution.
2:15 Building Bridges: Evolving High Performance Computing in the Life Sciences
Carlos P. Sosa, Manager, Chemistry and Life Sciences Segment, Cray, Inc.; Biomedical Informatics and Computational Biology, University of Minnesota Rochester
As the life sciences community enters the age of data intensive science, knowledge discovery rests upon analyzing massive amounts of data. Cray is evolving the role of supercomputing by developing highly integrated computing and storage solutions that provide unique advantages to enable data-driven approaches. Cray is also actively engaged with the life sciences community to develop these next generation applications.
2:30 Accelerating Bioscience and Technology with Titan, the World’s Fastest Supercomputer
Jack C. Wells, Director of Science, Oak Ridge Leadership Computing Facility, Oak Ridge National Laboratory
Modeling and simulation with petascale computing has supercharged the process of innovation and understanding, dramatically accelerating time-to-insight and time-to-discovery. From petascale acceleration of drug discovery, to designing bio-inspired catalysts for renewable energy, to exploring complex disease mechanisms, petascale computing is delivering high impact results that are transforming bioscience and technology. This presentation will focus on early outcomes from Titan, the world’s fastest supercomputer. We will showcase results from early science applications ready to use Titan's GPU-accelerated architecture. Preliminary Early Science results from users running on Titan will be discussed, including, for example, applications in organic photovoltaic materials and liquid-crystal biosensors. Lastly, details about Titan system setup, OLCF resources, and how to apply for time on Titan's 18,688 GPU accelerated nodes will be shared.
3:00 Fun with Metadata: Solving Age-Old Storage Management Problems
Jacob Farmer, CTO, Cambridge Computer
Starfish is commercial grade software that allows users (via GUI) and applications (via API) to associate metadata with files and directories in conventional file systems, tape archives, disk-based object stores, and cloud storage services. Cambridge Computer is partnering with a number of thought-leading research institutions to define and refine the next generation of best practices for storage management. We are solving problems related to data protection, life cycle management across disparate storage devices, chargebacks, curation, and collaboration within and across institutional boundaries. Oh, by the way, we do all of this at scale.
3:15 Refreshment Break in the Exhibit Hall with Poster Viewing
3:45 Looking Behind the Curtain of Your Data Center
Christopher McLean, Director, Design, Markley Group
Key areas to focus on when choosing a location, and key “due diligence” questions inside of a Data Center.
4:00 Life Science "Big Data" Analytics Appliance – Hadoop Simplicity for the Enterprise
Rajiv Garg, Hadoop/Big Data Product Manager, DataDirect Networks
Simplify Hadoop and deploy an analytics infrastructure in hours, not months. Accelerates your time-to-insight by running Hadoop up to 7X faster. Eliminate programming requirements with a built-in ETL engine. Enable Hadoop-based bioinformatics applications like CloudBLAST and Crossbow. Up to 700% higher efficiency when compared to commodity to truly lower your TCO.
4:15 How to Break the Storage Bottleneck: A New Paradigm is Required to Meet Today's Data Performance Demands
Eric Lomascolo, Director, Solutions Marketing, Xyrartex
ClusterStor™ scale-out file system solutions break the storage bottleneck and help you manage petascale capacities for high performance application needs. As life sciences demand higher resolutions of acquisition and analysis to achieve greater accuracy and insight, so too does the need for faster data throughput. Achieving efficiencies in higher data capacities and performance requires a different approach. Find out how ClusterStor’s new scale-out HPC data storage solution delivers.
4:30 Life Sciences on Amazon Web Services
Jafar Shameem, Business Development, Life Sciences, Amazon Web Services
The availability of utility computing is significantly accelerating life sciences in both academia and industry. Join Jafar Shameem to discuss how customers such as Illumina, Unilever, DNA Nexus and Spiral Genomics are removing the constraints of static IT to deliver new secure services to market more quickly and at scale.
4:45 Accelerating Life Sciences Workloads with Hybrid SSD/SATA Scale-Out NAS Storage
Geoffrey Noer, Senior Director, Product Marketing, Panasas
File systems used for life sciences workloads share a number of characteristics that are quite different from what is typical in other industries using scale-out storage. This talk will cover the research Panasas recently conducted with customers to understand how real-world file systems are actually used. As a result, Panasas created ActiveStor 14, a scale-out NAS solution featuring a hybrid SSD/SATA architecture ideally suited for meeting the demanding I/O requirements of NGS and other prominent BioIT applications.
5:15 Best of Show Awards Reception in the Exhibit Hall
6:15 Exhibit Hall Closes
THURSDAY, APRIL 11
7:00 am Breakfast Presentation Panel: Enabling Technology. Leveraging Data. Transforming Medicine
Panelists:
Samuel Aronson, Executive Director, IT, Partners HealthCare Center for Personalized Genetic Medicine
Sanjay Joshi, CTO, Life Sciences, EMC Isilon
Glen Otero, Life Sciences HPC Solution Architect, Dell
Ketan Paranjape, Global Director, Healthcare & Life Sciences, Intel Corp.
Toby Bloom, Director, Informatics, Genomics Platform at Broad Institute
As we arrive at the $1000 genome, we find the fundamental problems have shifted... it is no longer about shrinking the cost of sequencing but the explosive growth of big data: the downstream analytics with rapidly evolving parameters, data sources and formats; the storage, movement and management of massive datasets and workloads; and perhaps most paradoxical of all, the challenge of articulating the results and translating the latest findings directly into improving patient outcomes. Please join Intel and our distinguished panel to discuss how collaborating with a broad range of ecosystem partners to develop innovative solutions to seemingly intractable problems emerging in healthcare and life sciences today is driving us towards the vision of personalized medicine.
» FEATURED PRESENTATION
8:00 Featured Presentation Introduction
Geoffrey Noer, Senior Director, Product Marketing, Panasas
8:10 Trends in the Trenches 2013
Chris Dagdigian, Founding Partner and Director of Technology, BioTeam, Inc.
HPC Trends in the Trenches is one of the most popular presentations of the Expo! This talk will present how common HPC problems in life science informatics have been approached by organizations of varying type and size. We will discuss observed trends in computing, workflows and data movement, along with details on particularly clever solutions observed in production environments around the world.
8:45 Chairperson’s Opening Remarks
Jason Wang, Co-Founder & CTO, Arpeggi, Inc.
8:50 Managing Big Data: The Genome Center Perspective
Moderator: Kevin Davies, Ph.D., Editor-in-Chief, Bio-IT World
Guy Coates, Ph.D., Informatics Systems Group, The Wellcome Trust Sanger Institute
Eric Jones, Manager, Research Computing, Broad Institute
Xing Xu, Ph.D., Director, Cloud Computing Product, BGI Americas Corporation
Alexander (Sasha) Wait Zaranek, Ph.D., Director, Informatics, Personal Genome Project, Harvard Medical School; Scientific Director, Clinical Future, Inc.
Genome centers not only have the challenge of managing petabytes of data but the implied responsibility of sharing their hard-fought solutions and best practices with the multitude of organizations lacking their IT resources. This special session draws together the IT directors of various world-class genomics institutes to discuss their technological and organizational strategies for managing big data.
9:50 Big Data; The Challenges Around the 5 Vs
Kevin Brode, Senior Director, Health & Life Sciences, Americas Hitachi Data Systems
10:20 Coffee Break in the Exhibit Hall and Poster Competition Winners Announced
10:45 Plenary Keynote Panel Chairperson’s Remarks
Kevin Davies, Ph.D., Editor-in-Chief, Bio-IT World
10:50 Plenary Keynote Panel Introduction
Yury Rozenman, Head of BT for Life Sciences, BT Global Services
Niven R. Narain, President & CTO, Berg Pharma
» PLENARY KEYNOTE PANEL
11:05 The Life Sciences CIO Panel
Panelists:
Remy Evard, CIO, Novartis Institutes for BioMedical Research
Martin Leach, Ph.D., Vice President, R&D IT, Biogen Idec
Andrea T. Norris, Director, Center for Information Technology (CIT) and Chief Information Officer, NIH
Gunaretnam (Guna) Rajagopal, Ph.D., VP & CIO - R&D IT, Research, Bioinformatics & External Innovation, Janssen Pharmaceuticals
Cris Ross, Chief Information Officer, Mayo Clinic
Matthew Trunnell, CIO, Broad Institute of MIT and Harvard
12:15 Luncheon in the Exhibit Hall with Poster Viewing
2:00 Closing Featured Panel Session Introduction
Wanmei Ou, Senior Product Strategist, Oracle Health Sciences
2:10 Panel Session: Building the IT Architecture of the New York Genome Center
Moderator: Kevin Davies, Ph.D., Editor-in-Chief, Bio-IT World
Chris Dwan, Acting Senior Vice President, Information Technology and Research Computing, New York Genome Center
Jim Harding, CTO, Sabey Corporation
Sanjay Joshi, CTO, Life Sciences, EMC Isilon Storage Division
Robert B. Darnell, M.D., Ph.D., President & Scientific Director, New York Genome Center
George Gosselin, CTO, Computer Design & Integration LLC
In 2011, a consortium of 11 major academic and medical organizations in and around New York announced the creation of the New York Genome Center (NYGC). Under the direction of Robert B. Darnell, the NYGC aspires to be a world-class genomics and medical research center, and is currently undergoing construction in the heart of Manhattan. NYGC management has the opportunity to design and create a state-of-the-art IT and data management infrastructure to handle, store and share the output from what will rapidly become one of the world’s foremost genome sequencing facilities. This series of talks will describe the thinking that went into the design, creation and construction of the NYGC’s IT infrastructure and entire data management strategy.
4:00 Conference Adjourns
Download Brochure | Pre-Conference Workshops