BIT Header Desktop  
BIT Header Mobile 
 

Track 8 - April 21 – 23, 2015

Pharmaceutical R&D Informatics 

Collaboration, Data Science and Biologics

With the increased generation of innumerable and varied data sets, pharma and biotech must effectively manage and integrate data from all stages of the pharmaceutical value chain to enable more informed decisions. This is seen in the emerging trend of data science groups. Track 8 explores the transformation of current IT and informatics teams into data science groups and current progress made by such groups in the analysis, integration and visualization of complex data sets, including genomic, imaging, clinical, external/internal collaboration and real world data.

Final Agenda

Download Brochure | Workshops 

Tuesday, April 21

7:00 am Workshop Registration and Morning Coffee


8:00 – 11:30 Recommended Morning Pre-Conference Workshops*

Biologics, Bioassay, and Biospecimen Registration Systems

12:30 – 4:00 pm Recommended Afternoon Pre-Conference Workshops*

Finding Innovation in Collaboration Environments: Documentum, Sharepoint, Veeva, and Tigers, Oh My!

* Separate registration required


2:00 – 6:30 Main Conference Registration


» 4:00 PLENARY SESSION 

Click here for detailed information. 


5:00 – 7:00 Welcome Reception in the Exhibit Hall with Poster Viewing

 

 

Wednesday, April 22

7:00 am Registration Open and Morning Coffee


» 8:00 PLENARY SESSION  

Click here for detailed information. 


9:00 Benjamin Franklin Awards and Laureate Presentation

9:30 Best Practices Awards Program

9:45 Coffee Break in the Exhibit Hall with Poster Viewing


DATA SCIENCE & ANALYTICS STRATEGIES

10:50 Chairperson’s Opening Remarks

Jose L. Alvarez, Principal Engineer, WW Director, Healthcare and Life Sciences, Seagate Cloud and Systems Solutions

11:00 The Evolution of Data Science in Translational Medicine

Anastasia Christianson, Head, Translational R&D IT, Bristol-Myers Squibb

Eric Carleen, Director, Data Integration, Bristol-Myers Squibb

The role of the data scientist continues to evolve while the demand for the skills continues to rise. The ever increasing volume of data generated on any one drug project and the availability of even more relevant data externally necessitate strong data handling and data analytics expertise. Thus the role of the data scientist continues to evolve and the skills it requires continue to grow. This presentation will describe how the role has evolved in one Pharma company and how the collaboration between data scientists and related skills across organizational boundaries has delivered valuable insights to project teams.

11:30 Data Science in Translational Clinical Research

James Cai, Head, Data Science, Roche, Translational Clinical Research Center (TCRC)

The intelligent use of Big Data has transformed many industries. It also presents numerous opportunities for pharmaceutical companies as we collect more genomic Big Data directly from patients. In this talk I will outline a Data Science model that emphasizes mixed-capability teams and impact on science and business decisions. I will discuss how quantitative analytical skills, agile programming, novel technologies and business acumen all contribute to this model. I will illustrate with examples where Data Science was applied to clinical research resulting in new scientific insights and better business decisions.

Linguamatics12:00 pm Text Mining from Bench to Bedside - Where’s the Value?

Jane Reed, Ph.D., Head, Life Science Strategy, Linguamatics

Accessing the right information is critical to bench-to-bedside translational research. Much of the data is locked in textual format, such as scientific literature, clinical trial reports or electronic health records.  This talk will demonstrate how advanced text analytics can provide a powerful solution to the challenges faced by researchers and clinicians, who need to extract the key facts rapidly and accurately to gain actionable insights for decision support.

ACD Labs New12:15 Presentation to be Announced


12:30 Session Break

 

 Thomson Reuters12:40 Luncheon Presentation I

Speaker to be Announced

1:10 Luncheon Presentation II (Sponsorship Opportunity Available)

1:40 Session Break

1:50 Chairperson’s Remarks

Daniel H. Robertson, Ph.D., Senior Director, Research IT, Eli Lilly and Company

1:55 Transforming IT and Informatics at Biogen to Drive Research

Hank Wu, Director, R&D IT, Biogen

Transforming IT and Informatics at Biogen is at the heart of the company’s strategic commitment to use technology, data and analytics to inform the drug discovery process, unlock new insights, improve patient care and drive innovation. This presentation shares work in progress and lessons learned at Biogen.

2:25 PANEL DISCUSSION: Growing a Data Science Team

  • Enabling Innovative Data-driven Approaches at the Intersection of Science, Medicine & Economics
  • Assembly, Creation and Implementation of Data Science Groups for Pharma
  • The Data Scientist - an Essential Component of Big Data Analytics – Difficult to Identify
  • What are Data Sciences, Informatics and Bioinformatics?
  • Should data scientists be centralized or embedded within other product/functional teams?
  • How strong of a coder/programmer should members of a data science team be?
  • How much domain knowledge does a data scientist need to have?

Moderator: Martin Leach, Ph.D., Vice President, R&D IT, Biogen Idec

Panelists:

Rainer Fuchs, CIO, Harvard Medical

Jason Johnson, Ph.D., Associate Vice President, Scientific Informatics, Merck

Jake Klamka, Founder, Insight Data Science Fellows Program

Daniel H. Robertson, Ph.D., Senior Director, Research IT, Eli Lilly and Company

Tom Plasterer, Ph.D., Director, US Cross-Science Lead, AstraZeneca

Sarah Aerni, Ph.D., Principal Data Scientist, Pivotal

2:55 Sponsored Presentation (Opportunity Available)

3:25 Refreshment Break in the Exhibit Hall with Poster Viewing


MODELING & ANALYTICS

4:00 The Construction of a Scientific Modeling Culture and Technology Platform at Merck

Chris L. Waller, Ph.D., Director and Head, Scientific Modeling Platforms, Merck Research Laboratories

Merck Research Laboratories is undergoing a transformation in the way that it prosecutes R&D programs. Through the adoption of a “model-driven” culture, enhanced R&D productivity is anticipated. To support this emerging culture, an ambitious IT program has been initiated to implement a harmonized platform to facilitate cross-domain workflows and decision-making through agile persona driven data and predictive model access.

4:30 Separating the Wheat from the Chaff: Using Proprietary and Public Genomic Information to Identify Biomarkers from Cancer Cell Line Profiling Studies

Yue Webster, Ph.D., Senior Research Scientist, LRL IT Informatics, Eli Lilly and Company

Like most companies, Lilly uses large panels of cancer cell lines to discover genes, transcripts, proteins and/or metabolites which influence response to treatment. The potential for generating false positive findings is significant, and low concordance was highlighted by recent publication (Nature 504, 389–393). The use of co-expression networks and integration across various resources helps identify higher quality relationships. Advanced visualization tools help biologists navigate through thousands of putative relationships.

5:00 Sponsored Presentation (Opportunity Available)

5:30 Best of Show Awards Reception in the Exhibit Hall with Poster Viewing

6:30 Close of Day

 

Thursday, April 23

7:00 am Registration Open and Morning Coffee


» 8:00 PLENARY SESSION PANEL 

Click here for detailed information. 


10:00 Coffee Break in the Exhibit Hall and Poster Competition Winners Announced


MODELING & ANALYTICS

10:30 Chairperson’s Remarks

Speaker to be Announced, GGA Software

10:40 Translational R&D Analytics: Delivering ‘Big Insights’ to Drive Translational Research

Kaushal Desai, Associate Director, Translational R&D Analytics and Decision-Support, Research Informatics & Automation, Bristol-Myers Squibb

The emergence of immunotherapy and a focus on systems approaches has led to an unprecedented surge in translational research opportunities for discovery and development of newer treatment paradigms. Organizations leading the race for scientific breakthroughs in patient treatment have accumulated overwhelming quantities of efficacy, survival, safety and biomarker data from decades of preclinical studies and clinical trials. Translational R&D organizations face the arduous task of mining this data to deliver insights that drive translational research. This session will explore case studies demonstrating how translational R&D analytics can inform patient stratification and trial design in early clinical and translational research. The talk will focus on the journey from a lack of discoverability for disjointed datasets to insights that drive key decisions in translational research. Challenges associated with delivering actionable information at the point of decision-making will be highlighted and opportunities to deliver business value will be outlined using real examples from multiple disease areas.

11:10 Integrated Genomics Platform: Putting Patients and Their Genomes into the Focus of Our Research

Nora Manstein, Ph.D., IT Project Manager, Bayer Business Services GmbH

The fast progress in the generation of genomic data has reached the patient. Especially the advent of next generation sequencing and high resolution microarrays enable accurate descriptions of diseases with a strong genetic component ultimately leading to novel therapeutic approaches. Application of these technologies, however, leads to large amounts of data in need of effective storage and analysis. As now several data types (mutation, expression, microRNAs) become available for each patient, patient-centric views and analyses become mandatory. Consistent data handling and storage is a scientific and technological challenge towards both the research organization and the IT infrastructure. We have established the Integrated Genomics Platform (IGP) as a central tool for genomics research in Cardiology, Oncology and Clinical Sciences. The platform supports advanced data analysis and is intended to simplify discovery processes, e.g. for novel therapeutic targets and genetic biomarkers. In this strategic project, we have overcome known bottlenecks and enabled true translational research by establishing a company-wide mandatory repository and toolbox for storage and analysis of genomics data as well as common standards for data annotation, privacy & security.

Schrodinger11:40 Presentation to be Announced

12:10 pm Session Break

12:20 Luncheon Presentation (Sponsorship Opportunity Available) or Lunch on Your Own

1:20 Dessert Refreshment Break in the Exhibit Hall with Poster Viewing


INTEGRATION & INNOVATE CAPTURE OF INTERNAL & EXTERNAL DATA

1:55 Chairperson’s Remarks

Arturo J. Morales, Ph.D., Vice President, Informatics, Beryllium Corp.

2:00 Best Practices to Thrive Under the BioPharma Big Data Deluge

Tom Plasterer, Ph.D., Director, US Cross-Science Lead, AstraZeneca

Information comes at the BioPharma industry from an abundance of sources, ranging from compounds optimization in early R&D up through opining patients in social media discussing the virtues—or lack thereof—of our products. Knowing how to ingest, harmonize and query this information stream can be a tremendous advantage both internally and to the ecosystem of partners and providers we depend upon. Examples using linked data approaches illustrate how the information stream can be tamed, focusing first on getting data out of containers. Once this is established, terminologies can be applied to derive meaningful answers across otherwise-siloed content. The Open PHACTS project and Bio2RDF projects show how this approach has been used to solve real big data questions for BioPharma.

2:30 Beyond Data Integration – Consumable Expert Knowledge in Chemical Biology

Jeremy L. Jenkins, Ph.D., Senior Investigator II, High Throughput Biology, Developmental & Molecular Pathways, Novartis Institutes for BioMedical Research

Much focus in pharmaceutical data management has been placed on data integration. While metadata standards have greatly facilitated integration of disparate datasets, the result can be too much information to display. An emerging challenge is how to create inferences from knowledge bases that enable automation of expert opinions at large scale. We present a system that creates summary-level assertions based on diverse chemical biology data sources to address the problem of ranking tool compounds for targets, and vice versa, quantifying target confidence for compounds. At first pass the approach required domain-specific scientific understanding to engineer; however, application of machine learning methods improved the inference system, as demonstrated by prospective testing of good and bad tool compounds in a panel of cellular reporter gene assays. Overall this approach provides data-driven opinions about compounds that reflect those of an informed chemical biologist.

3:00 Development and Implementation of a Nonclinical Data Warehouse

Gregory Woo, Principal IS Business Systems Analyst, Research & Development Informatics, Amgen, Inc.

Amgen has implemented an integrated data warehouse for nonclinical toxicology studies, including data from internal systems and at Contract Research Organizations. The goal of the system is to allow scientists to rapidly search, query, and visualize historical toxicology, pathology, and toxicogenomics data. This presentation will discuss the system’s design, key features, challenges and lessons learned.

3:30 The Data Integration Challenge

Mark Davies, Technical Lead, Computational Chemical Biology, European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus

The challenge of data integration is a common issue faced by developers of resources in the life science fields. As the size of external data sources grow and approach ‘big data’ scale, is it still feasible to expect a resource to maintain up-to-date links to external resources. Are there alternative approaches that can alleviate the integration burden for the resource provider? A solution developed by the ChEMBL group, led to the creation of the freely available UniChem resource. The UniChem resource allows users to quickly and dynamical integrate the chemical content from a growing number sources, which currently stands at 25 and contains more than 70 million compound structures. We use the example of the new and open SureChEMBL patent system to demonstrate how UniChem can assist with data integration. We also identity the new challenges we face and how we can embrace other technologies and methodologies, such as Linked Data, to help stay on top of the data integration challenge.

4:00 Conference Adjourns


Download Brochure | Workshops 



View 2015 Brochure
 BIT-Agenda-icon 

Platinum Sponsors
 
Aspera 

 Cycle Computing logo 

 Data Direct Networks 

Elsevier 

 

IBM 

Illumnia logo 

Intel Logo 

OKTA 

Oracle Health Sciences 

Seagate  

 SGI small logo 

Thinkmate 

Thomson Reuters 


View All Sponsors 

 


Official Media Partner
  


View Media Partners 


Conference CD

CD iconOrder the 2014 event proceedings - now available on CD 


Complimentary Downloads

View white papers, listen to podcasts, and more! 

  • Making the World's Knowledge Computable
  • Bioinformatics in the Cloud
  • The Application of Text Analytics to Drug Safety Surveillance
 

Related Events

 HIT Logo for BIT 2015
CLN Co-located Event