KNIME Enterprise server usage and global deployment at NIBR
|
|
|
- Dina Long
- 10 years ago
- Views:
Transcription
1 KNIME Enterprise server usage and global deployment at NIBR Gregory Landrum, Ph.D. NIBR Informatics Novartis Institutes for BioMedical Research, Basel 8 th KNIME Users Group Meeting Berlin, 26 February 2015
2 Novartis Institutes for BioMedical Research (NIBR) A global network of >6,000 scientists, physicians, and business professionals. 2
3 R&D at Novartis Bringing innovative medicines to patients The Drug Development Process Source: 3 documents/document/n_prod_ pdf
4 R&D at Novartis Bringing innovative medicines to patients 4 Source: documents/document/n_prod_ pdf
5 Timelines and economics Paul, S. M., Mytelka, D. S., Dunwiddie, C. T., Persinger, C. C., Munos, B. H., Lindborg, S. R., & Schacht, A. L. (2010). Nature reviews Drug discovery, 9(3),
6 NIBR: Making it work Our model of research connecting the laboratory to the clinic, and pursuing molecular pathways across a landscape of multiple diseases means that we have to be a highly collaborative organization. Every project is made up of cross-functional teams, drawn from pathways scientists, chemists, disease area specialists, informaticians, clinicians and more. We re doing scientific research, not making widgets Lots of collaboration, lots of technology, lots of data 6
7 Lots of data 7 Presentation Title Presenter Name Date Subject Business Use Only
8 Lots of data Shape of the data generated for a project Hit finding 10 6 rows, 1-2 columns Hit-to-lead 10 3 rows, 5-10 columns Lead optimization 10 2 rows, 10 2 columns Clinic 1 rows, 10 4 columns 8
9 The role of NIBR Informatics (NX) Identifying and driving new opportunities to accelerate science with leading-edge computing and informatics solutions. Traditional IT stuff: service desk, hardware support, network, etc. Designing, building, deploying, and supporting tools/systems for: portfolio management; document management; compliance and reporting lab informatics; sample management and logistics; electronic lab notebooks high-performance computing; large-scale data warehousing and mining, machine learning scientific data analysis; visualization; reporting Pushing the frontier: research and exploration Combination of purchased and in-house developed systems, lots of different technologies, lots of integration work 9
10 NIBR and KNIME We believe KNIME can be really useful, so we want to make it available to all of our scientists We re supporting both people who are using KNIME to solve problems in their own labs/groups and people who want to make tools available to others. Need to support exchange of workflows and information across all our sites Need to be integrated into our data and software environment 10
11 Infrastructure Internal node development Enterprise servers + cluster integration Standardized desktop releases for Windows, Linux, Mac Nightly builds for users comfortable on the bleedingleading edge Dev and test servers to support our node development 11
12 NIBR s KNIME servers 12
13 KNIME for NIBR internal distribution Standardized set of nodes and extensions Customized preferences 13
14 KNIME for NIBR make it supportable Allow a reset to the default configuration without requiring a new install. 14
15 In-house node development make it useful Connections to internal data sources and applications Wrappers around in-house developed algorithms Connection to our web service framework for cheminformatics services 15
16 Open-source node development Chemistry nodes based on the RDKit open-source cheminformatics toolkit useable from C++, Python, Java, C# NIBR scientists/developers actively participate Standard cheminformatics tasks + some nice extras Developed both in-house and together with knime.com 16
17 Sponsored node development Modifications to naïve Bayes nodes to support fingerprints Fingerprint naïve Bayes supporting unbalanced datasets Database schema browser Improvements to database connector, readers Ensemble tree classifier New Python integration 17
18 Integration example 1: Descriptor calculation 18
19 Integration example 2: DART Internal web-based tool used by project teams to do querying and reporting from our data warehouse 19
20 Integration example 2: DART Internal web-based tool used by project teams to do querying and reporting from our data warehouse Access to saved queries and views URL contains full state of query/view 20
21 Integration example 2: DART + KNIME 21
22 Integration example 2: DART + KNIME Access to saved queries and views 22
23 Usage snapshot Unique users per month Overall: 240 unique users, mostly scientists Users by site 23 Notes: 1) stats only include KNIME client 2) December data incomplete
24 What are those users doing with KNIME? Querying and reporting from our warehouse virtual chemistry processing usage statistics mining medchem project data processing and analyzing experimental data machine learning triaging high-throughput screening results looking up chemical catalog numbers in other words: a bit of everything 24
25 KNIME Server usage Primarily used to share workflows Increasingly used as a quick and easy deployment platform for small application/services built in KNIME This is mainly driven by the scientists themselves Areas for improvement: Would be nice if it were easier to sync between servers Would be great if the server could do RESTful web services. Still: enabling scientists to share workflows and make (hopefully) simple applications available to each other is great 25
26 Wrapping up KNIME in heavy use to solve many different problems Enterprise server used to exchange workflows globally Web portal provides a way for scientists to deploy tools to each other KNIME is a great platform for us to build upon 26
27 Acknowledgements NIBR Manuel Schwarze (NX) Mark Duffield (NX) David Nick (NX) Marc Litherland (NX) John Davies (CPC) Richard Lewis (GDC) Remy Evard (NX) knime.com 27
What s Cooking in KNIME
What s Cooking in KNIME Thomas Gabriel Copyright 2015 KNIME.com AG Agenda Querying NoSQL Databases Database Improvements & Big Data Copyright 2015 KNIME.com AG 2 Querying NoSQL Databases MongoDB & CouchDB
#jenkinsconf. Jenkins as a Scientific Data and Image Processing Platform. Jenkins User Conference Boston #jenkinsconf
Jenkins as a Scientific Data and Image Processing Platform Ioannis K. Moutsatsos, Ph.D., M.SE. Novartis Institutes for Biomedical Research www.novartis.com June 18, 2014 #jenkinsconf Life Sciences are
Cheminformatics in the Cloud. Michael A. Dippolito DeltaSoft, Inc. 3-June-2009 ChemAxon European User Group Meeting
Cheminformatics in the Cloud Michael A. Dippolito DeltaSoft, Inc. 3-June-2009 ChemAxon European User Group Meeting DeltaSoft Specializing in R&D Informatics since 1996 Based in New Jersey, USA Long term
Professional Education for the Future of Health Informatics. Charles P. Friedman, PhD Schools of Information and Public Health University of Michigan
Professional Education for the Future of Health Informatics Charles P. Friedman, PhD Schools of Information and Public Health University of Michigan 1 Today s Menu What informatics is and isn t Informatics
Cheminformatics and its Role in the Modern Drug Discovery Process
Cheminformatics and its Role in the Modern Drug Discovery Process Novartis Institutes for BioMedical Research Basel, Switzerland With thanks to my colleagues: J. Mühlbacher, B. Rohde, A. Schuffenhauer
Actian Vortex Express 3.0
Actian Vortex Express 3.0 Quick Start Guide AH-3-QS-09 This Documentation is for the end user's informational purposes only and may be subject to change or withdrawal by Actian Corporation ("Actian") at
DBTech Pro Workshop. Knowledge Discovery from Databases (KDD) Including Data Warehousing and Data Mining. Georgios Evangelidis
DBTechNet DBTech Pro Workshop Knowledge Discovery from Databases (KDD) Including Data Warehousing and Data Mining Dimitris A. Dervos [email protected] http://aetos.it.teithe.gr/~dad Georgios Evangelidis
2) Xen Hypervisor 3) UEC
5. Implementation Implementation of the trust model requires first preparing a test bed. It is a cloud computing environment that is required as the first step towards the implementation. Various tools
HTML5 Data Visualization and Manipulation Tool Colorado School of Mines Field Session Summer 2013
HTML5 Data Visualization and Manipulation Tool Colorado School of Mines Field Session Summer 2013 Riley Moses Bri Fidder Jon Lewis Introduction & Product Vision BIMShift is a company that provides all
Talking your Language. E-WorkBook 10 provides a one-platform, single source of truth without adding complexity to research
Talking your Language E-WorkBook 10 provides a one-platform, single source of truth without adding complexity to research Meet E-WorkBook 10 In January 2015 we launched E-WorkBook 10 - the next step in
Job list - Research China
Job list - Research China Job # Job Title Page 010499 Scientist for Mechanical Engineering P2-3 002590 Senior Scientist for Biomedical Engineering P4-5 022831 Research Scientist for MRI P6-7 010501 Scientist
HPC & Visualization. Visualization and High-Performance Computing
HPC & Visualization Visualization and High-Performance Computing Visualization is a critical step in gaining in-depth insight into research problems, empowering understanding that is not possible with
An Easily Accessed Clinical Research Database from your Epic EMR
Loyola University Chicago Health Sciences Division Stritch School of Medicine (SSOM) An Easily Accessed Clinical Research Database from your Epic EMR February 13, 2014 Speakers: Richard H. Kennedy, Ph.D.
Data Mining & Data Stream Mining Open Source Tools
Data Mining & Data Stream Mining Open Source Tools Darshana Parikh, Priyanka Tirkha Student M.Tech, Dept. of CSE, Sri Balaji College Of Engg. & Tech, Jaipur, Rajasthan, India Assistant Professor, Dept.
speed thought Getting the most of CHEMAXON Integration June 2006 of The Power of at the
ETL Data Mining Workflow Engine In Database Analytics Process Knowledge Creation How Soon Can We Deliver? Which Project Is Most Successful? What More Information Do We Need? Where Is The Risk In My Portfolio?
Visualization and Data Analysis with VIDA. Joe Corkery OpenEye Scientific Software
Visualization and Data Analysis with VIDA Joe Corkery OpenEye Scientific Software OpenEye Small software company Efficient large scale 3D computations Tools for managing computed data VIDA Primarily a
Outlines. Business Intelligence. What Is Business Intelligence? Data mining life cycle
Outlines Business Intelligence Lecture 15 Why integrate BI into your smart client application? Integrating Mining into your application Integrating into your application What Is Business Intelligence?
In-Database Analytics
Embedding Analytics in Decision Management Systems In-database analytics offer a powerful tool for embedding advanced analytics in a critical component of IT infrastructure. James Taylor CEO CONTENTS Introducing
SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform
SAS and Oracle: Big Data and Cloud Partnering Innovation Targets the Third Platform David Lawler, Oracle Senior Vice President, Product Management and Strategy Paul Kent, SAS Vice President, Big Data What
How To Learn To Use Big Data
Information Technologies Programs Big Data Specialized Studies Accelerate Your Career extension.uci.edu/bigdata Offered in partnership with University of California, Irvine Extension s professional certificate
Here, There And Everywhere - On Integrating KNIME Workflows Man-Ling Lee
Here, There And Everywhere - On Integrating KNIME Workflows Man-Ling Lee KNIME User Group Meeting 02/13/2014 Tools Used In Drug Discovery 2 Tools Used In Drug Discovery 3 KNIME WebPortal 4 Good entry point
Advanced Big Data Analytics with R and Hadoop
REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional
RELEASE ANNOUNCEMENT Kaseya Network Discovery and Network Monitoring Version 1.0
KASEYA INTERNATIONAL LIMITED RELEASE ANNOUNCEMENT Network Discovery and Network Monitoring Version 1.0 ANNOUNCEMENT DATE: DECEMBER 2010 TARGET AVAILABILITY: DECEMBER 2010 i TABLE OF CONTENTS OVERVIEW...
Machine Learning with MATLAB David Willingham Application Engineer
Machine Learning with MATLAB David Willingham Application Engineer 2014 The MathWorks, Inc. 1 Goals Overview of machine learning Machine learning models & techniques available in MATLAB Streamlining the
Cheminformatics and Pharmacophore Modeling, Together at Last
Application Guide Cheminformatics and Pharmacophore Modeling, Together at Last SciTegic Pipeline Pilot Bridging Accord Database Explorer and Discovery Studio Carl Colburn Shikha Varma-O Brien Introduction
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - [email protected]. CMSC 601 - Presentation
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM Aniket Bochare - [email protected] CMSC 601 - Presentation Date-04/25/2011 AGENDA Introduction and Background Framework Heterogeneous
Nathan Brown. The Application of Consensus Modelling and Genetic Algorithms to Interpretable Discriminant Analysis. nathan.brown@novartis.
Nathan Brown [email protected] The Application of Consensus Modelling and Genetic Algorithms to Interpretable Discriminant Analysis Workshop Chemoinformatics in Europe: Research and Teaching 30
DATA MINING ALPHA MINER
DATA MINING ALPHA MINER AlphaMiner is developed by the E-Business Technology Institute (ETI) of the University of Hong Kong under the support from the Innovation and Technology Fund (ITF) of the Government
Informatics and Knowledge Management at the Novartis Institutes for BioMedical Research (NIBR)
Informatics and Knowledge Management at the Novartis Institutes for BioMedical Research (NIBR) Enable Science in silico & Provide the Right Knowledge to the Right People at the Right Time to enable the
The Data Mining Process
Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data
Do you know how your TSM environment is evolving?
Trend reporting for Tivoli Storage Manager Holger Speh Consulting IT Specialist Do you know how your TSM environment is evolving? Healthy? Well integrated? Data Growth? Accounting? 2 2 Historical Reporting
Anforderungen der Life-Science Industrie an die Hochschulen. Hans Widmer Novartis Institutes for BioMedical Research
Anforderungen der Life-Science Industrie an die Hochschulen Hans Widmer Novartis Institutes for BioMedical Research There s nothing more extraordinary than a normal life 2 What does industry expect from
Machine Learning and Data Mining. Fundamentals, robotics, recognition
Machine Learning and Data Mining Fundamentals, robotics, recognition Machine Learning, Data Mining, Knowledge Discovery in Data Bases Their mutual relations Data Mining, Knowledge Discovery in Databases,
How To Use Data Analysis To Get More Information From A Computer Or Cell Phone To A Computer
Applying Big Data approaches to Competitive Intelligence challenges THOMSON REUTERS IP & SCIENCE PHARMA CI EUROPE CONFERENCE & EXHIBITION TIM MILLER 19 FEBRUARY 2014 BIG DATA, NOT JUST ABOUT VOLUMES Patient
Game Changers for Researchers: Altmetrics, Big Data, Open Access What Might They Change? Kiki Forsythe, M.L.S.
Game Changers for Researchers: Altmetrics, Big Data, Open Access What Might They Change? Kiki Forsythe, M.L.S. Definition of Game Changer A newly introduced element or factor that changes an existing situation
CUSTOMER Presentation of SAP Predictive Analytics
SAP Predictive Analytics 2.0 2015-02-09 CUSTOMER Presentation of SAP Predictive Analytics Content 1 SAP Predictive Analytics Overview....3 2 Deployment Configurations....4 3 SAP Predictive Analytics Desktop
E6893 Big Data Analytics Lecture 2: Big Data Analytics Platforms
E6893 Big Data Analytics Lecture 2: Big Data Analytics Platforms Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science Mgr., Dept. of Network Science and Big Data
IBM Rational Asset Manager
Providing business intelligence for your software assets IBM Rational Asset Manager Highlights A collaborative software development asset management solution, IBM Enabling effective asset management Rational
How To Synchronize With A Cwr Mobile Crm 2011 Data Management System
CWR Mobility Customer Support Program Page 1 of 10 Version [Status] May 2012 Synchronization Best Practices Configuring CWR Mobile CRM for Success Whitepaper Copyright 2009-2011 CWR Mobility B.V. Synchronization
ELECTRONIC MEDICAL RECORDS. Selecting and Utilizing an Electronic Medical Records Solution. A WHITE PAPER by CureMD.
ELECTRONIC MEDICAL RECORDS Selecting and Utilizing an Electronic Medical Records Solution A WHITE PAPER by CureMD CureMD Healthcare 55 Broad Street New York, NY 10004 Overview United States of America
Cray: Enabling Real-Time Discovery in Big Data
Cray: Enabling Real-Time Discovery in Big Data Discovery is the process of gaining valuable insights into the world around us by recognizing previously unknown relationships between occurrences, objects
Make the Most of Big Data to Drive Innovation Through Reseach
White Paper Make the Most of Big Data to Drive Innovation Through Reseach Bob Burwell, NetApp November 2012 WP-7172 Abstract Monumental data growth is a fact of life in research universities. The ability
Advanced In-Database Analytics
Advanced In-Database Analytics Tallinn, Sept. 25th, 2012 Mikko-Pekka Bertling, BDM Greenplum EMEA 1 That sounds complicated? 2 Who can tell me how best to solve this 3 What are the main mathematical functions??
Introduction Predictive Analytics Tools: Weka
Introduction Predictive Analytics Tools: Weka Predictive Analytics Center of Excellence San Diego Supercomputer Center University of California, San Diego Tools Landscape Considerations Scale User Interface
TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials
TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials Pharmaceutical leader deploys TIBCO Spotfire enterprise analytics platform across its drug discovery organization
Prerequisites. Course Outline
MS-55040: Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot Description This three-day instructor-led course will introduce the students to the concepts of data mining,
New Relic & JMeter - Perfect Performance Testing
TUTORIAL New Relic & JMeter - Perfect Performance Testing by David Sale Contents Introduction 3 Demo Application 4 Hooking Into New Relic 4 What Is JMeter? 6 Installation and Usage 6 Analysis In New Relic
Belatrix Software Factory Sample Automated Load/Stress Testing Success Cases
Belatrix Software Factory Sample Automated Load/Stress Testing Success Cases Introduction. In this white paper we will discuss different cases where the Belatrix Quality Assurance team has implemented
The open source enterprise solution pre-configured for the IT Asset Management www.cmdbuild.org
1 The open source enterprise solution pre-configured for the IT Asset Management www.cmdbuild.org Tecnoteca Srl [email protected] www.tecnoteca.com CMDBuild READY2USE 2 CMDBuild READY2USE is a CMDBuild
Healthcare Big Data Exploration in Real-Time
Healthcare Big Data Exploration in Real-Time Muaz A Mian A Project Submitted in partial fulfillment of the requirements for degree of Masters of Science in Computer Science and Systems University of Washington
Primetime for KNIME:
Primetime for KNIME: Towards an Integrated Analysis and Visualization Environment for RNAi Screening Data F. Oliver Gathmann, Ph. D. Director IT, Cenix BioScience Presentation for: KNIME User Group Meeting
AppBoard TM 2.6. System Requirements. Technical Documentation. Version 2.6.0. July 2015
Technical Documentation AppBoard TM 2.6 System Requirements Version 2.6.0 July 2015 Edge Technologies 1881 Campus Commons Drive Suite 101 Reston, VA 20191 T 703.691.7900 F 703.691.4020 1.888.771.EDGE www.edge-technologies.com
Graphical Web based Tool for Generating Query from Star Schema
Graphical Web based Tool for Generating Query from Star Schema Mohammed Anbar a, Ku Ruhana Ku-Mahamud b a College of Arts and Sciences Universiti Utara Malaysia, 0600 Sintok, Kedah, Malaysia Tel: 604-2449604
User's Guide - Beta 1 Draft
IBM Tivoli Composite Application Manager for Microsoft Applications: Microsoft Hyper-V Server Agent vnext User's Guide - Beta 1 Draft SC27-2319-05 IBM Tivoli Composite Application Manager for Microsoft
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
use ready 2 The open source enterprise solution pre-configured for the IT Asset Management www.cmdbuild.org Tecnoteca Srl
1 ready 2 use The open source enterprise solution pre-configured for the IT Asset Management www.cmdbuild.org Tecnoteca Srl [email protected] www.tecnoteca.com CMDBuild ready2use 2 CMDBuild ready2use
Day 1 - Technology Introduction & Digital Asset Management
SharePoint Developers Academy 2010 Course Syllabus Introduction Day 1 - Technology Introduction & Digital Asset Management 1. Kick Start a. Participant Introductions b. Course Overview c. Training Goals
Teaching Computational Thinking using Cloud Computing: By A/P Tan Tin Wee
Teaching Computational Thinking using Cloud Computing: By A/P Tan Tin Wee Technology in Pedagogy, No. 8, April 2012 Written by Kiruthika Ragupathi ([email protected]) Computational thinking is an emerging
KNIME TUTORIAL. Anna Monreale KDD-Lab, University of Pisa Email: [email protected]
KNIME TUTORIAL Anna Monreale KDD-Lab, University of Pisa Email: [email protected] Outline Introduction on KNIME KNIME components Exercise: Market Basket Analysis Exercise: Customer Segmentation Exercise:
Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs
1.1 Introduction Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs For brevity, the Lavastorm Analytics Library (LAL) Predictive and Statistical Analytics Node Pack will be
ADAM 5.5. System Requirements
ADAM 5.5 System Requirements 1 1. Overview The schema below shows an overview of the ADAM components that will be installed and set up. ADAM Server: hosts the ADAM core components. You must install the
LabKey Server: An open source platform for scientific data integration, analysis, and collaboration
LabKey Server: An open source platform for scientific data integration, analysis, and collaboration Mark Igra Partner, LabKey Software [email protected] Presentation Topics Why Scientific Data Integration
MS1b Statistical Data Mining
MS1b Statistical Data Mining Yee Whye Teh Department of Statistics Oxford http://www.stats.ox.ac.uk/~teh/datamining.html Outline Administrivia and Introduction Course Structure Syllabus Introduction to
Creating a universe on Hive with Hortonworks HDP 2.0
Creating a universe on Hive with Hortonworks HDP 2.0 Learn how to create an SAP BusinessObjects Universe on top of Apache Hive 2 using the Hortonworks HDP 2.0 distribution Author(s): Company: Ajay Singh
Information Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli ([email protected])
Hadoop s Advantages for! Machine! Learning and. Predictive! Analytics. Webinar will begin shortly. Presented by Hortonworks & Zementis
Webinar will begin shortly Hadoop s Advantages for Machine Learning and Predictive Analytics Presented by Hortonworks & Zementis September 10, 2014 Copyright 2014 Zementis, Inc. All rights reserved. 2
GEM Network Advantages and Disadvantages for Stand-Alone PC
Possible Configurations Turns your Contacts into a Business Network focussed on you GEM can be configured to run in many different ways. From simple stand-alone PC s or Mac s, through Client Server on
Pentaho Data Mining Last Modified on January 22, 2007
Pentaho Data Mining Copyright 2007 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For the latest information, please visit our web site at www.pentaho.org
Making Good Use of Data at Hand: Government Data Projects. Mark C. Cooke, Ph.D. Tax Management Associates, Inc.
Making Good Use of Data at Hand: Government Data Projects Mark C. Cooke, Ph.D. Tax Tax Management Associates Privately held company serving state and local government Markets across eighteen (18) states
ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat
ESS event: Big Data in Official Statistics Antonino Virgillito, Istat v erbi v is 1 About me Head of Unit Web and BI Technologies, IT Directorate of Istat Project manager and technical coordinator of Web
ON THE IMPLEMENTATION OF ADAPTIVE FLOW MEASUREMENT IN THE SDN-ENABLED NETWORK: A PROTOTYPE
ON THE IMPLEMENTATION OF ADAPTIVE FLOW MEASUREMENT IN THE SDN-ENABLED NETWORK: A PROTOTYPE PANG-WEI TSAI, CHUN-YU HSU, MON-YEN LUO AND CHU-SING YANG NATIONAL CHENG KUNG UNIVERSITY, INSTITUTE OF COMPUTER
Databricks. A Primer
Databricks A Primer Who is Databricks? Databricks vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team who created Apache Spark, a powerful
Implementing a Data Warehouse with Microsoft SQL Server
This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse 2014, implement ETL with SQL Server Integration Services, and
WHITEPAPER. A Technical Perspective on the Talena Data Availability Management Solution
WHITEPAPER A Technical Perspective on the Talena Data Availability Management Solution BIG DATA TECHNOLOGY LANDSCAPE Over the past decade, the emergence of social media, mobile, and cloud technologies
MS-55052: SharePoint 2013 End User Level II
MS-55052: SharePoint 2013 End User Level II Description This 3-day Instructor led course Explore several advanced topics of working with SharePoint 2013 sites. Topics include SharePoint Server site definitions
B 3. Biology-Biotechnology-Bridge Program. 2-year Post-Baccalaureate Program. at MIT and its Biotech Partners
B 3 Biology-Biotechnology-Bridge Program 2-year Post-Baccalaureate Program at MIT and its Biotech Partners https://biology.mit.edu/about/postbac_program MIT provides minority outreach in STEM fields at
A Statistician s View of Big Data
A Statistician s View of Big Data Max Kuhn, Ph.D (Pfizer Global R&D, Groton, CT) Kjell Johnson, Ph.D (Arbor Analytics, Ann Arbor MI) What Does Big Data Mean? The advantages and issues related to Big Data
Citrix XenApp-7.6 Administration Training. Course
Citrix XenApp-7.6 Administration Training Course Course Duration : 20 Working Days Class Duration : 3 hours per day Fast Track: - Course duration 10days (Per day 8 hours) Get Fee Details Module 1: Citrix
Find the signal in the noise
Find the signal in the noise Electronic Health Records: The challenge The adoption of Electronic Health Records (EHRs) in the USA is rapidly increasing, due to the Health Information Technology and Clinical
