Track 1 - April 21 – 23, 2015
IT Infrastructure – Hardware
Big Data Storage Capabilities and Solutions in the R&D Ecosystem
Track 1 provides concrete case studies of secure internal and external storage solutions and implementation from the IT and scientist perspectives. Learn best practices of transforming a framework of individualized IT islands to a shared but flexible and scalable IT platform. Themes covered include cloud solutions linked to HPC clusters, accommodating different scientific disciplines and workflows in a common platform, and driving user governance structures.
Download Brochure | Workshops
Tuesday, April 21
7:00 am Workshop Registration and Morning Coffee
8:00 – 11:30 Recommended Morning Pre-Conference Workshops*
Aligning Projects with Agile Approach
12:30 – 4:00 pm Recommended Afternoon Pre-Conference Workshops*
Converged IT Infrastructure in Life Science
* Separate registration required
2:00 – 6:30 Main Conference Registration
5:00 – 7:00 Welcome Reception in the Exhibit Hall with Poster Viewing
Wednesday, April 22
7:00 am Registration Open and Morning Coffee
9:00 Benjamin Franklin Awards and Laureate Presentation
9:30 Best Practices Awards Program
9:45 Coffee Break in the Exhibit Hall with Poster Viewing
10:50 Chairperson’s Opening Remarks
» 11:00 FEATURED PRESENTATION: HPC TRENDS IN THE TRENCHES 2015
Chris Dagdigian, Founding Partner & Director, Technology, BioTeam, Inc.
In one of the most popular presentations of the Expo, Chris delivers a candid assessment of the best, the worthwhile, and the most overhyped information technologies (IT) for life sciences.
12:00 pm Presentation to be Announced
12:30 Session Break
12:40 Luncheon Presentation I: Big Data for Genomics -- SCALE, SPEED and SMART
Frank Lee, Ph.D., Lead Architect, Genomics Solution, IBM
Explosive growth of big data is challenging researchers in genomics and life sciences around the world. Learn about some of the latest solutions, architecture and best practice to 1) acquire, store, access data in scale; 2) build a high-throughput computing infrastructure to process large genomic data set; 3) gain insights and knowledge from the data through translational research. Illustrated through real-life projects and case studies, join this session to learn of the latest approaches to tackle big data, the evolving ecosystem, success stories and lessons learned that highlight the potential for collaboration among genomic research communities. Share in a preview of the upcoming IBM genomics turn-key platform currently under development.
1:10 Luncheon Presentation II: Accelerate Science. Translate Results. Deliver Today. Optimizing Life Sciences Code for Breakthrough Innovations
Paolo Narvaez, Principal Engineer, Intel Data Center Group
1:40 Session Break
1:50 Chairperson’s Remarks
1:55 Comparisons of Storage Efficiencies through Hadoop
Martin Gollery, CEO, Tahoe Informatics
Hadoop is widely used in ‘Big-Data’ applications, so much so that most modern cluster installations are now installing some version of Hadoop rather than the old style clusters. The focus for many people is the increased speed that is obtained with Hadoop, with scalability that can easily reach into the tens of thousands of nodes. Perhaps less well known are the cost savings that may be achieved with Hadoop storage. This talk will compare and contrast the storage techniques and the costs that are associated with them.
2:25 Rapid Integration of Cancer Genomics Data Using Hadoop and Cloudera’s Impala
Sittichoke Saisanit, Ph.D., Data Scientist, Informatics, Pharmaceutical Research and Early Development, Roche Innovation Center New York
We explored Cloudera Impala for analysis of cancer genomics data. Without data transformation and reformatting, Impala tables can be created quickly from files on Hadoop file system with a simple command. Such speed and flexibility enable us to interrogate data without spending much time on schema design, index creation, query tuning and data cleaning. Impala can be accessed through Spotfire allowing flexibility of data visualization.
Authors: Sittichoke Saisanit, Zayed Albertyn, Xing Yang, Padmanabha Udupa
2:55 Accelerating Biomedical Research Discovery: The 100G Internet2 Network – Built and Engineered for the Most Demanding Big Data Science Collaborations
Christian Todorov, Director, Network Services Management, Internet2
Genomic & biomedical researchers have been forced to exchange big data via physical drives as advanced network connectivity was previously unavailable or cost prohibitive. Hear how colleagues are improving big data workflows using the 100G Internet2 Network, which provides the highest data transport rates available, along with dynamic cloud and trust applications that are interconnecting research and accelerating discovery.
3:10 Managing Genomic Data at Scale! - Rules Based Intelligent Data Management
Jose L. Alvarez, Principal Engineer, WW Director, Healthcare and Life Sciences, Seagate Cloud and Systems Solutions
The explosion of Genomic data due to new instrument chemistry and more powerful analysis tool sets has created a complex and manual data management problem for high-throughput NGS centers. We will discuss how an intelligent data management solution can address this problem. iRODS (Integrated Rules-Oriented Data System) enables this intelligent data orchestration and can even help with pipeline and workflow automation.
3:25 Refreshment Break in the Exhibit Hall with Poster Viewing
4:00 PANEL PRESENTATION/DISCUSSION: ICTBioMed: International Consortium for Technology in Biomedicine
Moderator: Anil Srivastava, President, Open Health Systems Laboratory
Panelists to be Announced
Open Health Systems Laboratory has brought together several life sciences supercomputing centers to form the International Consortium for Technology in Biomedicine (ICTBioMed). ICTBioMed members have been working together for almost two years to create a shared global cyberinfrastructure as a seamless and friction-free platform for the researchers worldwide for their collaborative research in consistent with the tenets of team science. ICTBioMed leadership team will present in this session both the shared resources and the research use cases that they have been supporting to validate and further develop the value added cloud services. The panelists will speak to a narrative framework of possible science using, what NSF describes, as International Research Network Connection, pursuing the Big Data to Knowledge goals of NIH.
5:00 Presentation to be Announced
5:15 Sponsored Presentation (Opportunity Available)
5:30 Best of Show Awards Reception in the Exhibit Hall with Poster Viewing
6:30 Close of Day
Thursday, April 23
7:00 am Registration Open
7:00 Breakfast Presentation: Enabling Technology. Leveraging Data. Transforming Personalized Medicine
Ketan Paranjape, General Manager Life Sciences, Intel Corp.
Panelists to be Announced
The $1000 genome is here, and the fundamental problems have shifted... it is no longer about shrinking the cost of sequencing but the explosive growth of big data: the downstream analytics with rapidly evolving parameters, data sources and formats; the storage, movement and management of massive datasets and workloads, and the challenge of articulating the results and translating the latest findings directly into improving patient outcomes. A panel will discuss these issues and more as we work to achieve the vision of personalized medicine.
10:00 Coffee Break in the Exhibit Hall and Poster Competition Winners Announced
10:30 Chairperson’s Remarks
Peter Godman, Co-Founder & CEO, Qumulo
10:40 Intelligent Infrastructure Approaches for Emerging Life Sciences Data Management Issues at Scale
George Vacek, Ph.D., Global Business Director, Life Sciences, DataDirect Networks
Dr. Vacek will deliver several in-depth case studies of global leaders HPC applications in Life Science. Case studies will focus on infrastructure approaches to solve the emerging issues of data at scale, including best practices in supporting high performance local workflows, collaborative and research communities, and life sciences clouds and hybrid cloud solutions.
10:55 How Next Generation Scale-Out Storage Fuels Breakthroughs in Life Sciences
Peter Godman, Co-Founder & CEO, Qumulo
Technology advances in DNA sequencing and other research data capture instruments are creating data at an unprecedented rate. As storage footprints grow further into petabyte scale, storage teams increasingly struggle to manage the massive amount of data stored. Next-generation scale-out storage provides instant insight into data at scale, abstracts away the underlying infrastructure, and achieves breakthrough price/performance using intelligent software and commodity hardware.
11:10 Infrastructure, Architecture, and Organization: Data Engineering at Scale at the Broad
Chris Dwan, Assistant Director, Research Computing and Data Services, Broad Institute of MIT and Harvard
As the Broad Institute enters its second decade, we are adapting to genomic research at a global scale. Among other things, this requires adopting hybrid cloud technologies, moving to object models for data storage, and embracing federated solutions for identity and authorization. The social and organizational aspects of these transitions are at least as challenging as the technical. This talk describes the interplay between the human and technical aspects of these changes, as well as specific lessons learned along the way.
11:40 Start Small, Collaborate Often, Grow Big – Scaling NGS Compute and Storage Solutions for Personalized Medicine
Speaker to be Announced
Scaling an NGS IT solution doesn’t have to be overwhelming. Collaborating with experienced Clinicians, Researchers, Vendors, and Partners, all using best practices - enables incremental success and effective development for high utilization and impactful results.
11:55 Sponsored Presentation (Opportunity Available)
12:10 pm Session Break
12:20 Luncheon Presentation (Sponsorship Opportunity Available) or Lunch on Your Own
1:20 Dessert Refreshment Break in the Exhibit Hall with Poster Viewing
1:55 Chairperson’s Remarks
» 2:00 FEATURED PRESENTATION: OPENFDA: FDA’S MOST INNOVATIVE CLOUD-BASED BIG DATA AND ANALYTICS PLATFORM
Taha A. Kass-Hout, M.D., MS, Chief Health Informatics Officer, CTO and Director, Office of Informatics and Technology, FDA
OpenFDA was the first innovation created by Taha Kass-Hout, M.D., MS, upon joining FDA as the first Chief Health Information Officer in March 2013. OpenFDA was launched on June 2, 2014, allowing software developers, researchers and the public to tap into adverse events for drugs and medical devices; recalls, for drugs, devices and foods; and labeling for products on the market.
2:30 Global Developments in Privacy and Data Security Law
John M. Conley, J.D., Ph.D., William Rand Kenan, Jr. Professor of Law, University of North Carolina, Chapel Hill
The international legal climate governing privacy and data security is changing. The European Union is in the midst of a fundamental shift in its approach. The U.S. still lacks a national data law, so the states and individual federal agencies are groping toward a strategy. This presentation focuses on the impact of these ongoing changes on genomics, bioinformatics and health research.
3:00 PANEL DISCUSSION: Achieving Much-Needed Innovation while Hurdling the Barriers of Stringent Regulation
Moderator: John M. Conley, J.D., Ph.D., William Rand Kenan, Jr. Professor of Law, University of North Carolina, Chapel Hill
Dave Peterson, Executive Director, Vendor & Third Party Assurance, National IT Compliance, Kaiser Permanente Information Technology
Additional Panelists to be Announced
The growth in patient healthcare and life sciences innovations can be attributed to technology enhancements like cloud computing, big data analytics and mobile applications, but may conflict with increasing regulatory compliance demands to ensure protection of healthcare life and quality as well as patient data privacy and security. This panel of esteemed technology solution providers and regulators debates real-world challenges and how regulation must also innovate at technology’s pace.
4:00 Conference Adjourns
Download Brochure | Workshops