
Track 2 - April 21 – 23, 2015

Software Development  

Harnessing Data for Scientific Decision Making

Track 2 explores the technology and tools needed to connect data, applications, people, processes, and partners to ensure available, reliable, and actionable information for scientific decision making. Case studies will show how life science organizations tackle common problems in harnessing data, including analytics, methods and standards, open source, in-house vs. customized commercial platforms, transparency, efficiency, security, and cost-effective solutions.

Final Agenda


Tuesday, April 21


7:00 am Workshop Registration and Morning Coffee

8:00 – 11:30 Recommended Morning Pre-Conference Workshops*

Aligning Projects with Agile Approach

Gamification of Science

12:30 – 4:00 pm Recommended Afternoon Pre-Conference Workshops*

Predictive Analytics

Large Scale NGS Analysis Using Globus Genomics

* Separate registration required

2:00 – 6:30 Main Conference Registration

 

» 4:00 PLENARY SESSION 


5:00 – 7:00 Welcome Reception in the Exhibit Hall with Poster Viewing

 

Wednesday, April 22

7:00 am Registration Open and Morning Coffee

 

» 8:00 PLENARY SESSION  


9:00 Benjamin Franklin Awards and Laureate Presentation

9:30 Best Practices Awards Program

9:45 Coffee Break in the Exhibit Hall with Poster Viewing

 

TRENDS IN THE TRENCHES 2015 

10:50 Chairperson’s Opening Remarks

 

» 11:00 FEATURED PRESENTATION: HPC TRENDS IN THE TRENCHES 2015

Chris Dagdigian, Founding Partner & Director, Technology, BioTeam, Inc.

In one of the most popular presentations of the Expo, Chris delivers a candid assessment of the best, the worthwhile, and the most overhyped information technologies (IT) for life sciences.

12:00 pm Presentation to be Announced

12:30 Session Break

12:40 Luncheon Presentation I: Big Data for Genomics -- SCALE, SPEED and SMART

Frank Lee, Ph.D., Lead Architect, Genomics Solution, IBM

Explosive growth of big data is challenging researchers in genomics and life sciences around the world. Learn about some of the latest solutions, architectures, and best practices to 1) acquire, store, and access data at scale; 2) build a high-throughput computing infrastructure to process large genomic data sets; and 3) gain insights and knowledge from the data through translational research. Through real-life projects and case studies, this session covers the latest approaches to tackling big data, the evolving ecosystem, and success stories and lessons learned that highlight the potential for collaboration among genomic research communities. It also previews the upcoming IBM genomics turn-key platform currently under development.

1:10 Luncheon Presentation II: Accelerate Science. Translate Results. Deliver Today. Optimizing Life Sciences Code for Breakthrough Innovations

Paolo Narvaez, Principal Engineer, Intel Data Center Group

1:40 Session Break

 

USING DATA TO DRIVE DECISIONS 

1:50 Chairperson’s Remarks

1:55 Lies, Damn Lies, and Big Data: How to Best Utilize Data to Drive Decisions

Brian Bissett, Senior Member, Institute of Electrical and Electronics Engineers

Big Data is hailed as the solution to many problems in industry. In many respects this is a fallacy because it only takes a small amount of erroneous data to corrupt the usefulness of a large dataset. While Big Data can be extremely useful in predicting patterns for the masses such as traffic patterns and peak usage hours for a utility, its usefulness begins to diminish in situations where quality is more important than quantity. In addition, the underlying assumption of Big Data that the behavior of the masses is the correct course of action is not always true. The audience will gain an appreciation for how to best utilize data to drive decisions. Common fallacies will be addressed including the notion that Big Data sets are always superior to smaller data sets. The limitations of big data sets, the importance of quality data, effective display of quantitative information, boundary conditions, and the evaluation of quantitative and qualitative factors will all be discussed.

2:25 Data Publication and Discovery Using Globus Research Data Management Software-as-a-Service

Vas Vasiliadis, Director, Products, Computation Institute, University of Chicago and Argonne National Laboratory

Globus is software-as-a-service for research data management, used at dozens of institutions and national facilities for moving, sharing, and publishing big data. Recent additions to Globus include services for data publication and discovery that enable: publication of large research data sets with appropriate policies for all types of institutions and researchers; the ability to publish data directly from your own storage or from cloud storage that you manage, without third party publishers; extensible metadata that describe the specific attributes for your field of research; publication and curation workflows that can be easily tailored to meet institutional requirements; public and restricted collections that give you complete control over who may access your published data; a rich discovery model that allows others to search and use your published data. This presentation will give an overview and demonstration of these services, as well as case studies that illustrate how Globus is increasing researcher productivity and facilitating enhanced collaboration among researchers.

2:55 Sponsored Presentation (Opportunity Available)

3:25 Refreshment Break in the Exhibit Hall with Poster Viewing

4:00 Semantic Integration of Unstructured Safety Study Data: Experiences and Outlook

Alain Nanzer, Ph.D., Global Head Safety & Development Workflows, Pharma Research and Early Development Informatics, Roche Innovation Center Basel

In pharmaceutical R&D, vast amounts of study data are generated, both in house and externally. These data are used to advance drug projects and then end up as reports in document management systems or on file shares. Most often they are lost to further scientific analysis because no structured search or access is possible. Common approaches to loading such data into scientific data warehouses require complex ETL processes to normalize the results, are very labor intensive, and are not well suited to large sets of unstructured legacy data. The presentation will share our experiences implementing a platform that uses semantic integration technologies to provide scientists with search, evaluation, and advanced visualization capabilities for in vivo safety study data. Furthermore, we will show how the platform has been extended to provide fast access to real-time study data, and how it then evolved into a data turntable for external study data and submissions to regulatory authorities.

4:30 DIVOS: A Platform for Effective in vivo Study Knowledge Management at Genentech

Dana Caulder, Senior Software Engineer, Bioinformatics and Computational Biology, Genentech

Preclinical animal models are essential to understanding the fundamental biology of disease and the efficacy, pharmacokinetic, pharmacodynamic, and toxicity profiles of potential therapeutics. The variety of study designs and endpoints across therapeutic areas makes it challenging to develop systems that meet both researchers’ short-term needs and the business’s long-term needs. In the Bioinformatics department within Genentech Research & Early Development, we have developed DIVOS, an in vivo data management platform that not only enables researchers across all therapeutic areas to manage their day-to-day work but also enables reuse and exploration of historical study data by both bench scientists and statisticians. We will present a DIVOS case study that includes examples of the scientific successes enabled by the system and the technical and architectural details that underlie DIVOS’s flexibility and extensibility. Attendees will walk away with an overview of a successful case study in in vivo data management, including 1) how the system was architected to be flexible enough to handle data and work processes across multiple therapeutic areas, 2) success factors in both the project/people and technical/implementation realms, 3) the importance of having a committed and engaged user community, and 4) how we’ve achieved that at both the sponsor and bench scientist level.

5:00 Sponsored Presentation (Opportunity Available)

5:30 Best of Show Awards Reception in the Exhibit Hall with Poster Viewing

6:30 Close of Day

 

Thursday, April 23

7:00 am Registration Open and Morning Coffee

 

» 8:00 PLENARY SESSION PANEL   


10:00 Coffee Break in the Exhibit Hall and Poster Competition Winners Announced

 

INTEGRATING AND IMPLEMENTING DATA PLATFORMS AND WORKFLOWS 

10:30 Chairperson’s Remarks

10:40 A Case Study in Building a Clinical Research Database in a Translational Research Environment

Charlie Quinn, Director, Data Management & Software Development, Benaroya Research Institute

We have developed a database that integrates public and private clinical and experimental data in a translational research environment. We will discuss some of the challenges we encountered in developing the database and the solutions we found. Even though our research is primarily concentrated on autoimmune diseases, the techniques and technologies developed are applicable to all. In addition, we will discuss our new open source spreadsheet wrangling tool, which is instrumental in allowing us to capture, integrate, and manage the world of Excel spreadsheets that live in most research environments.

11:10 Sciencescape - An Innovative Research Discovery Platform that Connects Users to Breaking Research As It Happens, Around the World, and Throughout History

Sam Molyneux, CEO & Co-Founder, Sciencescape

Sciencescape is an online platform that draws on biomedical and life science research from the past 100 years, and adds every paper as it appears - right up to the minute. Using a network-based algorithm called Eigenfactor, Sciencescape takes into account not only how many citations a paper has, but also where those citations come from and why they are important. Sciencescape organizes and delivers the most comprehensive real-time updates of peer-reviewed journals in life sciences based on scientists’ personalized preferences. Through our extensive publisher relationships, we’ve scanned, grouped, tagged, and categorized the full text of the majority of the over 24 million published peer-reviewed biomedical papers. This allows users to connect, organize, and display the scientific literature and stay on the leading edge while broadcasting their ideas and collaborating with their peers. Sciencescape transforms the research process, making it much easier for researchers to be efficient with their time, while also opening new opportunities for discovery.

11:40 Sponsored Presentation (Opportunity Available)

12:10 pm Session Break

12:20 Luncheon Presentation (Sponsorship Opportunity Available) or Lunch on Your Own

1:20 Dessert Refreshment Break in the Exhibit Hall with Poster Viewing

 

MANAGING BIG DATA AND SECURITY STRATEGIES 

1:55 Chairperson’s Remarks

 

» 2:00 FEATURED PRESENTATION: OPENFDA: FDA’S MOST INNOVATIVE CLOUD-BASED BIG DATA AND ANALYTICS PLATFORM

Taha A. Kass-Hout, M.D., MS, Chief Health Informatics Officer, CTO and Director, Office of Informatics and Technology, FDA

OpenFDA was the first innovation created by Taha Kass-Hout, M.D., MS, upon joining FDA as the first Chief Health Informatics Officer in March 2013. OpenFDA was launched on June 2, 2014, allowing software developers, researchers, and the public to tap into adverse events for drugs and medical devices; recalls for drugs, devices, and foods; and labeling for products on the market.

2:30 Global Developments in Privacy and Data Security Law

John M. Conley, J.D., Ph.D., William Rand Kenan, Jr. Professor of Law, University of North Carolina, Chapel Hill

The international legal climate governing privacy and data security is changing. The European Union is in the midst of a fundamental shift in its approach. The U.S. still lacks a national data law, so the states and individual federal agencies are groping toward a strategy. This presentation focuses on the impact of these ongoing changes on genomics, bioinformatics and health research.

3:00 PANEL DISCUSSION: Achieving Much-Needed Innovation while Hurdling the Barriers of Stringent Regulation

Moderator: John M. Conley, J.D., Ph.D., William Rand Kenan, Jr. Professor of Law, University of North Carolina, Chapel Hill

Panelists:

Dave Peterson, Executive Director, Vendor & Third Party Assurance, National IT Compliance, Kaiser Permanente Information Technology

Additional Panelists to be Announced

Growth in patient healthcare and life sciences innovation can be attributed to technology enhancements such as cloud computing, big data analytics, and mobile applications, but these advances may conflict with increasing regulatory compliance demands meant to ensure protection of healthcare life and quality as well as patient data privacy and security. This panel of esteemed technology solution providers and regulators debates real-world challenges and how regulation must also innovate at technology’s pace.

4:00 Conference Adjourns

