Exploiting the Amazon.com People Who Bought Also Bought Algorithm in Reagent Selection. Christian Tyrchan, Niklas Falk and Jonas Boström

1 Exploiting the Amazon.com People Who Bought Also Bought Algorithm in Reagent Selection. Christian Tyrchan, Niklas Falk and Jonas Boström

2 Setting the Scene. The current trend is that drug discovery projects are treated as processes: creativity might be hampered, with little room for serendipity? We need new ways of working; we want creative users who do not feel stuck in processes. Making novel compounds is at the heart of drug design. Thus, the aim of the current work is to enhance discovery by surfacing reagents from deep in the catalog that our chemists wouldn't find on their own, using a novel approach in which similarity is based on users (not structures).

3 Internet Success Stories. New Technologies, New Sciences: Finite State Machines; Item-to-Item Collaborative Filtering (new approaches to improve searches).

4 Recommendation Systems are best known for their use on e-commerce Web sites. A recommendation system attempts to present items that are likely to be of interest to the user. The idea of recommending items at checkout is nothing new.

5 The Harry Potter Shopping Cart. Amazon.com saw the opportunity to personalize impulse buys.

6 The Harry Potter Shopping Cart. The idea of recommending items at checkout is nothing new.

7 Recommendation Systems. Typically, a recommender system compares the user's profile to some reference characteristics and seeks to predict the 'rating' that a user would give to an item they had not yet considered. It should help a customer find and discover new, relevant, and interesting items. There are two main categories, based on how the recommendations are made. Content-based recommendations (the information item): the user is recommended items similar to the ones the user preferred in the past. Collaborative recommendations (the social environment): the user is recommended items that people with similar taste liked in the past.

8 Content-based and Collaborative Systems. Content-based recommendations: only the movies that have a high degree of similarity to the user's preferences would be recommended. Collaborative recommendations start by finding a set of customers whose purchased items overlap the user's purchased items. The algorithm aggregates items from these similar customers, eliminates items the user has already purchased, and recommends the remaining items to the user. This approach focuses on finding similar users and represents a user as an N-dimensional vector of items.
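The collaborative steps described on this slide (find overlapping customers, aggregate their items, drop what the user already has) can be sketched as follows. This is a minimal illustration, not the implementation used by Amazon.com or by the authors; the function name `recommend_user_based`, the `min_overlap` parameter, the overlap-count scoring, and the sample data are all assumptions made for the example.

```python
from collections import Counter

def recommend_user_based(purchases, user, min_overlap=1):
    """Recommend items bought by customers whose purchases overlap the user's."""
    own = purchases[user]
    scores = Counter()
    for other, items in purchases.items():
        if other == user:
            continue
        overlap = len(own & items)
        if overlap < min_overlap:
            continue  # not a similar customer
        for item in items - own:  # eliminate items the user already has
            scores[item] += overlap  # assumed weighting: size of the overlap
    # Sort by score, then by item id so that ties are deterministic.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [item for item, _ in ranked]

# Hypothetical data: user -> set of purchased item ids.
purchases = {
    "alice": {"C001", "C002"},
    "bob": {"C001", "C002", "C003"},
    "carol": {"C002", "C004"},
    "dave": {"C005"},
}
print(recommend_user_based(purchases, "alice"))  # ['C003', 'C004']
```

Note that every request scans all customers, which is the scaling problem the next slide raises.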

9 Recommendations needed to work... from sparse data (often just a few purchases). The system needed to be fast, giving high-quality recommendations in real time. The system needed to scale to massive numbers (huge amounts of data). The algorithm must respond immediately to new information (customer data is volatile). None of the existing methods were good enough. Traditional collaborative filtering does little or no offline computation, and its online computation scales with the number of customers and catalog items; the algorithm is impractical on large data sets. Content-based recommendations bring no news (unless randomization is used).

10 Item-to-Item Collaborative Filtering. Item-to-item collaborative filtering matches each of the user's purchased items to similar items, then combines those similar items into a recommendation list. To determine the most-similar match for a given item, the algorithm builds a similar-items table by finding items that customers tend to purchase together. Amazon.com's item-to-item approach computes the cosine between binary vectors representing the purchases in a user-item matrix. Given two vectors of attributes, A and B, the cosine similarity cos(θ) is represented using a dot product and magnitudes as: similarity = cos(θ) = (A · B) / (‖A‖ ‖B‖). Recommendations are based on the items which are most similar to the query item. Greg Linden et al., "Amazon.com Recommendations: Item-to-Item Collaborative Filtering", IEEE Internet Computing, 2003, 7.
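A similar-items table of the kind described above could be built roughly like this. It is a sketch under stated assumptions: the name `build_similar_items`, the dictionary-of-sets input, and the sample check-outs are hypothetical, and the plain all-against-all loop is chosen for readability rather than speed. For binary vectors the dot product is simply the number of users two items share, and each magnitude is the square root of an item's user count.

```python
import math
from collections import defaultdict

def build_similar_items(checkouts):
    """Return {item: [(other_item, cosine), ...]} sorted by descending cosine."""
    # Invert user -> items into item -> users (the binary item vectors).
    users_by_item = defaultdict(set)
    for user, items in checkouts.items():
        for item in items:
            users_by_item[item].add(user)

    table = {}
    for a, users_a in users_by_item.items():
        scored = []
        for b, users_b in users_by_item.items():
            if a == b:
                continue
            shared = len(users_a & users_b)  # dot product of binary vectors
            if shared == 0:
                continue  # cosine is zero, nothing to store
            cosine = shared / math.sqrt(len(users_a) * len(users_b))
            scored.append((b, cosine))
        scored.sort(key=lambda pair: (-pair[1], pair[0]))
        table[a] = scored
    return table

# Hypothetical check-out data: user -> set of item ids.
checkouts = {
    "user1": {"C001", "C003"},
    "user2": {"C001", "C002", "C003"},
    "user3": {"C002"},
}
similar = build_similar_items(checkouts)
print(similar["C001"])  # [('C003', 1.0), ('C002', 0.5)]
```

Because the table depends only on the check-out history, it can be computed offline and looked up at query time.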

11 Since it works for Amazon.com, why not try it... to help medicinal chemists select reagents from chemical databases, and to enhance discovery by surfacing reagents from deep in the catalog that our chemists wouldn't find on their own.

12 Exploiting the Amazon.com People Who Bought Also Bought Algorithm in Reagent Selection. Not only suggesting new reagents, but also solving problems? For example, suggesting possible bioisosteres. [Scheme: reductive amination.] The final product may be genotoxic; the design idea is to avoid Ames positives. The Ames test is one measure of genetic toxicity. Aromatic amines are often unwanted fragments in drug design (genotoxic). Regulatory view: if carcinogenic in animals, it will be a carcinogen in man.

13 Strategy. Collect data set of chemical reagents. Get check-out information. Generate similarity matrix using cosine similarities. Import matrix into an Oracle database. Display recommendations (ISIS/db query): items (reagents) which are most similar to the query item (reagent).
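The last two steps of the strategy (store the similarity matrix in a database, then query it for the most similar reagents) might look like the sketch below. It uses Python's built-in `sqlite3` as a stand-in for the Oracle database and ISIS/db query named on the slide; the table name `similarity`, its columns, and the sample rows are assumptions for illustration only.

```python
import sqlite3

def store_similarities(conn, pairs):
    """Store (query_item, similar_item, cosine) rows in a lookup table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS similarity ("
        "query_item TEXT, similar_item TEXT, cosine REAL)"
    )
    conn.executemany("INSERT INTO similarity VALUES (?, ?, ?)", pairs)
    conn.commit()

def recommend(conn, query_item, limit=10):
    """Return the stored items most similar to query_item, best first."""
    rows = conn.execute(
        "SELECT similar_item, cosine FROM similarity "
        "WHERE query_item = ? ORDER BY cosine DESC, similar_item LIMIT ?",
        (query_item, limit),
    )
    return rows.fetchall()

# In-memory database with hypothetical similarity values.
conn = sqlite3.connect(":memory:")
store_similarities(
    conn,
    [("C001", "C003", 1.0), ("C001", "C002", 0.5), ("C002", "C001", 0.5)],
)
print(recommend(conn, "C001"))  # [('C003', 1.0), ('C002', 0.5)]
```

Storing only the non-zero pairs keeps the table small when most item pairs share no users.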

14 Reagent Data Set. Extract: reagents in the stockroom (CIMS) checked out during the last 5 years. Filter: amount != 0. Tweak 1: canonical SMILES were generated, counter salts were removed (and the reagents merged), and unique compound IDs were assigned. Grouping (tweak 2): reagents were assigned to 10 functional classes by SMARTS mapping. Data set: the reagents that could be mapped onto the 10 functional classes, and 194 unique chemists. [Figure: number of reagents versus times checked out, with the check-out-only-once bin marked.]

15 Tweak 1: counter-ions. Ca. 5000 entries include a counter-ion. Different salts should give the same results. For example, the reagent 3,3,3-TRIFLUOROPROPYLAMINE exists with and without the hydrochloride salt (3,3,3-TRIFLUOROPROPYLAMINE HYDROCHLORIDE). The salts are removed, and the data are merged for the vectors.
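One simple way to picture this tweak is to drop known counter-ion fragments from dot-separated SMILES and then merge the check-out records under the remaining parent structure. The sketch below assumes the SMILES are already canonical, uses a small hypothetical `COUNTER_IONS` set, and relies on plain string handling rather than a cheminformatics toolkit, so it illustrates the idea rather than the authors' procedure.

```python
from collections import defaultdict

# Hypothetical list of counter-ion fragments, written as SMILES.
COUNTER_IONS = {"Cl", "Br", "I", "[Na+]", "[K+]", "[Cl-]", "[Br-]"}

def strip_counter_ions(smiles):
    """Drop dot-separated fragments that are known counter-ions."""
    fragments = smiles.split(".")
    kept = [f for f in fragments if f not in COUNTER_IONS]
    # If everything was a counter-ion, keep the original string unchanged.
    return ".".join(kept) if kept else smiles

def merge_checkouts(records):
    """Merge (user, smiles) check-out records so that salts share one key."""
    users_by_parent = defaultdict(set)
    for user, smiles in records:
        users_by_parent[strip_counter_ions(smiles)].add(user)
    return dict(users_by_parent)

records = [
    ("user1", "NCCC(F)(F)F"),     # 3,3,3-trifluoropropylamine
    ("user2", "NCCC(F)(F)F.Cl"),  # its hydrochloride salt
]
merged = merge_checkouts(records)
print(merged)  # one key, NCCC(F)(F)F, holding both users
```

After merging, both check-outs contribute to the same item vector, so the free base and its salt give the same recommendations.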

16 Tweak 2: functional classes. A search for amines should only recommend other amines (for example, for a reductive amination). Functional classes (dual functionalities counted twice): primary and secondary amines; acids, acid halides, anhydrides, carbamates, carbonates, esters; aromatic halides; alkyl halides; sulphonyl chlorides; alcohols; aldehydes, ketones; boronic acids, trifluoroborates; isocyanates, isothiocyanates; alpha halide ketones.
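The class restriction can be sketched as a filter applied to an existing ranked hit list. The names `recommend_within_class`, `class_of` and `similar_items`, the class labels, and the scores are hypothetical; in the slides the class assignment is done by SMARTS mapping, which is not reproduced here.

```python
def recommend_within_class(query, similar_items, class_of, top_n=5):
    """Keep only recommendations that share at least one class with the query."""
    query_classes = class_of.get(query, set())
    hits = [
        (item, score)
        for item, score in similar_items.get(query, [])
        if class_of.get(item, set()) & query_classes
    ]
    return hits[:top_n]

# Hypothetical class assignments; a reagent may belong to several classes.
class_of = {
    "C001": {"amine"},
    "C002": {"aldehyde_ketone"},
    "C003": {"amine", "alcohol"},  # dual functionality: counted in both classes
}
# Hypothetical ranked hit list for the query C001.
similar_items = {"C001": [("C002", 0.9), ("C003", 0.7)]}
print(recommend_within_class("C001", similar_items, class_of))  # [('C003', 0.7)]
```

Storing a set of classes per reagent is one way to honour the "dual functionalities counted twice" rule.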

17 Similarities. The data are binary: a user either checked out a reagent (1) or did not (0). [Table: user-by-item matrix with the users Anthony Nicholls, Andrew Grant and Morten Langgard as rows and the items C001, C002 and C003 as columns; 1 = checked out, 0 = not checked out.] The cosine between C001 and C003 is computed from the two item columns. [Figure: frequency of binned Amazon.com similarities, almost all-against-all.] *Roughly 85% of the reagents belong in the zero bin.
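For binary check-out vectors the cosine reduces to counts: the dot product is the number of users who checked out both items, and each magnitude is the square root of the number of users who checked out that item. The worked numbers below use two hypothetical item columns, since the values of the slide's table are not available here; the symbols n_A, n_B and n_AB are introduced only for this illustration.

```latex
\cos\theta = \frac{A \cdot B}{\lVert A \rVert \, \lVert B \rVert}
           = \frac{n_{AB}}{\sqrt{n_A \, n_B}},
\qquad
\text{e.g. } A = (1, 1, 0),\; B = (0, 1, 1):\quad
\cos\theta = \frac{1}{\sqrt{2 \cdot 2}} = 0.5
```

Two items that no user has in common have a cosine of 0, which is consistent with most pairs falling into the zero bin.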

18 Architecture. Oracle and MDL ISIS/Base; not a web-based system. User-by-item matrix (user rows, item columns). Overnight updates are possible.

19 Results. What does the frontend look like? Yet another similarity measure? A dream come true? Possible ways forward. Other info revealed.

20 Frontend, and That Little Bit Extra. Original CIMS and CIMS-Recommend. That little bit extra: available amount and location.

21 Amazon.com vs Other Similarities. Lingos and 3 fingerprints are calculated (ECFP6, FPFP6, MDL Public Keys). The top X hits are compared to the top X Amazon hits. [Table: overlap (%) by MaxHits for ECFP6, FPFP6, Lingo and MDL Public Keys. Tables: Amazon hit list and FP/Lingos hit list, with columns HitNo and MolName.] Results show that Amazon recommendations are, more or less, orthogonal to other searching techniques.
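The top-X comparison can be expressed as a small overlap calculation. The sketch below assumes the overlap is reported relative to X; the function name `overlap_percent`, the reagent ids and the two rankings are hypothetical and do not reproduce the results of the slide.

```python
def overlap_percent(hits_a, hits_b, top_x):
    """Percentage of the top-X hits of list A that also occur in the top-X of list B."""
    if top_x <= 0:
        raise ValueError("top_x must be positive")
    top_a = set(hits_a[:top_x])
    top_b = set(hits_b[:top_x])
    return 100.0 * len(top_a & top_b) / top_x

# Hypothetical rankings for one query reagent.
amazon_hits = ["R01", "R02", "R03", "R04"]
fingerprint_hits = ["R09", "R01", "R08", "R03"]
print(overlap_percent(amazon_hits, fingerprint_hits, top_x=4))  # 50.0
print(overlap_percent(amazon_hits, fingerprint_hits, top_x=1))  # 0.0
```

A low overlap across many queries is what "more or less orthogonal" would look like in numbers.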

22 Amazon.com vs Other Similarities. Top 10 structures selected from the Amazon-like selection and the ECFP4 fingerprint method for two queries. [Figure: Amazon top 10 and ECFP4 top 10 structures.]

23 Exploiting Recommendation Systems in Reagent Selection. Design idea to avoid Ames positives in a reductive amination: search the database for aniline, and get "Chemists who requested aniline also requested: ...", all Ames negatives. The advantage of such a feature is the inherent knowledge transfer. In the dream scenario such a reagent suggestion could solve an existing problem.

24 Medicinal Chemistry Poll. Pre-defined sets? Too diverse recommendations? "Already better! Since I get everything in one go."

25 Most Frequently Checked-Out Reagents. Other information is easily accessible; just ask the right question. [Tables: top 5 amines and top 5 aldehydes, with columns No., Checked-out and Reagent.]
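Asking "which reagents of a class are checked out most often" is a simple frequency count over the check-out records. In this sketch the record layout (reagent id, functional class), the function name `top_checked_out`, and the sample data are assumptions for illustration.

```python
from collections import Counter

def top_checked_out(records, functional_class, n=5):
    """Return the n most frequently checked-out reagents of one functional class."""
    counts = Counter(
        reagent for reagent, cls in records if cls == functional_class
    )
    return counts.most_common(n)

# Hypothetical check-out log: one (reagent id, functional class) pair per check-out.
records = [
    ("C001", "amine"), ("C001", "amine"), ("C003", "amine"),
    ("C002", "aldehyde"), ("C001", "amine"), ("C003", "amine"),
]
print(top_checked_out(records, "amine"))  # [('C001', 3), ('C003', 2)]
```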

26 Summary. Recommendation systems are useful alternatives to search algorithms since they help users to discover items they might not have found by themselves. We presented a novel dynamic similarity measure: personalized information was used to produce reagent recommendations, using Amazon.com's item-to-item collaborative filtering technique. Low threshold for trying: the first prototype was finished within 1-2 weeks (as all infrastructure was in place). Maintaining: data can readily be updated nightly or weekly. In the dream scenario such a [reagent] suggestion could solve an existing problem; we are not there just yet (too little data, need more info). Our recommendations are, more or less, orthogonal to other similarity measures. Positive comments in a small MedChem poll. In the end, what we want is happy, satisfied customers!

27 Acknowledgments. Jens Sadowski for presenting!

28 Exploiting the Amazon.com People Who Bought Also Bought Algorithm in Reagent Selection. Abstract. Amazon.com's "People who bought [this book] also bought [these books]" is a popular feature on numerous web sites nowadays. The use of such a recommender system can be exploited in many areas, also in drug design. In the current work a system to recommend reagents has been developed, using the item-to-item collaborative filtering technique. The goal is to enhance discovery, surfacing reagents from deep in our corporate reagent database; reagents that medicinal chemists might not have found on their own. Another potential advantage of using personalized information is the inherent knowledge transfer. That is, in a dream scenario a reagent recommendation could solve an existing problem. Moreover, this novel similarity measure differs from other similarity measures, as it is based on user-item information and not on descriptions of molecular structures. It will be shown that the recommendations are, more or less, orthogonal to other methods.


May 2015 Robert Gibbon & Jochen Stroobants

May 2015 Robert Gibbon & Jochen Stroobants May 2015 Robert Gibbon & Jochen Stroobants 1 Robert Gibbon Founder at Big Industries Technical solution architect Hands on knowledge of Big Data design, build and operation Hadoop guru Jochen Stroobants

More information

Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices

Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices Proc. of Int. Conf. on Advances in Computer Science, AETACS Efficient Iceberg Query Evaluation for Structured Data using Bitmap Indices Ms.Archana G.Narawade a, Mrs.Vaishali Kolhe b a PG student, D.Y.Patil

More information

De novo design in the cloud from mining big data to clinical candidate

De novo design in the cloud from mining big data to clinical candidate De novo design in the cloud from mining big data to clinical candidate Jérémy Besnard Data Science For Pharma Summit 28 th January 2016 Overview the 3 bullet points Cloud based data platform that can efficiently

More information

The Sierra Clustered Database Engine, the technology at the heart of

The Sierra Clustered Database Engine, the technology at the heart of A New Approach: Clustrix Sierra Database Engine The Sierra Clustered Database Engine, the technology at the heart of the Clustrix solution, is a shared-nothing environment that includes the Sierra Parallel

More information

E-Commerce and the Entrepreneur

E-Commerce and the Entrepreneur E-Commerce and the Entrepreneur The Internet: Changing the Face of Business The most successful companies embrace the Internet as a mechanism for transforming their companies and for changing everything

More information

RANDOM PROJECTIONS FOR SEARCH AND MACHINE LEARNING

RANDOM PROJECTIONS FOR SEARCH AND MACHINE LEARNING = + RANDOM PROJECTIONS FOR SEARCH AND MACHINE LEARNING Stefan Savev Berlin Buzzwords June 2015 KEYWORD-BASED SEARCH Document Data 300 unique words per document 300 000 words in vocabulary Data sparsity:

More information

Fast Trading and Prop Trading

Fast Trading and Prop Trading Fast Trading and Prop Trading B. Biais, F. Declerck, S. Moinas (Toulouse School of Economics) December 11, 2014 Market Microstructure Confronting many viewpoints #3 New market organization, new financial

More information

SPE and HPLC. Dr Iva Chianella Lecturer in Analytical Chemistry Cranfield Health +44 (0) 1234 758322. i.chianella.1998@cranfield.ac.

SPE and HPLC. Dr Iva Chianella Lecturer in Analytical Chemistry Cranfield Health +44 (0) 1234 758322. i.chianella.1998@cranfield.ac. SPE and HPLC Dr Iva Chianella Lecturer in Analytical Chemistry Cranfield Health +44 (0) 1234 758322 i.chianella.1998@cranfield.ac.uk Solid-Phase Extraction- SPE Simple, fast and efficient sample preparation

More information

Business Challenges and Research Directions of Management Analytics in the Big Data Era

Business Challenges and Research Directions of Management Analytics in the Big Data Era Business Challenges and Research Directions of Management Analytics in the Big Data Era Abstract Big data analytics have been embraced as a disruptive technology that will reshape business intelligence,

More information

Search Result Optimization using Annotators

Search Result Optimization using Annotators Search Result Optimization using Annotators Vishal A. Kamble 1, Amit B. Chougule 2 1 Department of Computer Science and Engineering, D Y Patil College of engineering, Kolhapur, Maharashtra, India 2 Professor,

More information

How to create and interpret the predictive analysis of a compound

How to create and interpret the predictive analysis of a compound How to create and interpret the predictive analysis of a compound Platform with suite of tools Predict & understand biological effects of small molecules & compounds Predict targets and metabolites, potential

More information

Big Data Text Mining and Visualization. Anton Heijs

Big Data Text Mining and Visualization. Anton Heijs Copyright 2007 by Treparel Information Solutions BV. This report nor any part of it may be copied, circulated, quoted without prior written approval from Treparel7 Treparel Information Solutions BV Delftechpark

More information

BIG DATA: IT MAY BE BIG BUT IS IT SMART?

BIG DATA: IT MAY BE BIG BUT IS IT SMART? BIG DATA: IT MAY BE BIG BUT IS IT SMART? Turning Big Data into winning strategies A GfK Point-of-view 1 Big Data is complex Typical Big Data characteristics?#! %& Variety (data in many forms) Data in different

More information

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1 Slide 29-1 Chapter 29 Overview of Data Warehousing and OLAP Chapter 29 Outline Purpose of Data Warehousing Introduction, Definitions, and Terminology Comparison with Traditional Databases Characteristics

More information

INFRARED SPECTROSCOPY (IR)

INFRARED SPECTROSCOPY (IR) INFRARED SPECTROSCOPY (IR) Theory and Interpretation of IR spectra ASSIGNED READINGS Introduction to technique 25 (p. 833-834 in lab textbook) Uses of the Infrared Spectrum (p. 847-853) Look over pages

More information

Tableau Server Scalability Explained

Tableau Server Scalability Explained Tableau Server Scalability Explained Author: Neelesh Kamkolkar Tableau Software July 2013 p2 Executive Summary In March 2013, we ran scalability tests to understand the scalability of Tableau 8.0. We wanted

More information

Reactions of Aldehydes and Ketones

Reactions of Aldehydes and Ketones Reactions of Aldehydes and Ketones Structure Deduction using lassification Tests 1 Determination of Structure Determining the structure of an unknown organic compound is an exercise in deductive reasoning.

More information

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 4: LINEAR MODELS FOR CLASSIFICATION Introduction In the previous chapter, we explored a class of regression models having particularly simple analytical

More information

Recommender Systems Seminar Topic : Application Tung Do. 28. Januar 2014 TU Darmstadt Thanh Tung Do 1

Recommender Systems Seminar Topic : Application Tung Do. 28. Januar 2014 TU Darmstadt Thanh Tung Do 1 Recommender Systems Seminar Topic : Application Tung Do 28. Januar 2014 TU Darmstadt Thanh Tung Do 1 Agenda Google news personalization : Scalable Online Collaborative Filtering Algorithm, System Components

More information

Database Software. What Is a Database, and How Does It Work?

Database Software. What Is a Database, and How Does It Work? Database Software Timetables Figure 8.15 Sample of a relational database. A relational database has many parts connected by one element your student number, for example. Files Fields Personal Information

More information

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches

Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches Modelling, Extraction and Description of Intrinsic Cues of High Resolution Satellite Images: Independent Component Analysis based approaches PhD Thesis by Payam Birjandi Director: Prof. Mihai Datcu Problematic

More information

Typical Infrared Absorption Frequencies. Functional Class Range (nm) Intensity Assignment Range (nm) Intensity Assignment

Typical Infrared Absorption Frequencies. Functional Class Range (nm) Intensity Assignment Range (nm) Intensity Assignment Typical Infrared Absorption Frequencies Functional Class Range (nm) Intensity Assignment Range (nm) Intensity Assignment Alkanes 2850-3000 CH 3, CH 2 & CH 2 or 3 bands Alkenes 3020-3100 1630-1680 1900-2000

More information

Chemistry Notes for class 12 Chapter 13 Amines

Chemistry Notes for class 12 Chapter 13 Amines 1 P a g e Chemistry Notes for class 12 Chapter 13 Amines Amines constitute an important class of organic compounds derived by replacing one or more hydrogen atoms ofnh 3 molecule by alkyl/aryl group(s).

More information

A Biologically Inspired Approach to Network Vulnerability Identification

A Biologically Inspired Approach to Network Vulnerability Identification A Biologically Inspired Approach to Network Vulnerability Identification Evolving CNO Strategies for CND Todd Hughes, Aron Rubin, Andrew Cortese,, Harris Zebrowitz Senior Member, Engineering Staff Advanced

More information

ammonium salt (acidic)

ammonium salt (acidic) Chem 360 Jasperse Ch. 19 otes. Amines 1 eactions of Amines 1. eaction as a proton base (Section 19-5 and 19-6) amine base -X (proton acid) a X ammonium salt (acidic) Mechanism: equired (protonation) everse

More information

Detection and mitigation of Web Services Attacks using Markov Model

Detection and mitigation of Web Services Attacks using Markov Model Detection and mitigation of Web Services Attacks using Markov Model Vivek Relan RELAN1@UMBC.EDU Bhushan Sonawane BHUSHAN1@UMBC.EDU Department of Computer Science and Engineering, University of Maryland,

More information

W. Heath Rushing Adsurgo LLC. Harness the Power of Text Analytics: Unstructured Data Analysis for Healthcare. Session H-1 JTCC: October 23, 2015

W. Heath Rushing Adsurgo LLC. Harness the Power of Text Analytics: Unstructured Data Analysis for Healthcare. Session H-1 JTCC: October 23, 2015 W. Heath Rushing Adsurgo LLC Harness the Power of Text Analytics: Unstructured Data Analysis for Healthcare Session H-1 JTCC: October 23, 2015 Outline Demonstration: Recent article on cnn.com Introduction

More information

MAD2: A Scalable High-Throughput Exact Deduplication Approach for Network Backup Services

MAD2: A Scalable High-Throughput Exact Deduplication Approach for Network Backup Services MAD2: A Scalable High-Throughput Exact Deduplication Approach for Network Backup Services Jiansheng Wei, Hong Jiang, Ke Zhou, Dan Feng School of Computer, Huazhong University of Science and Technology,

More information

Resonance Structures Arrow Pushing Practice

Resonance Structures Arrow Pushing Practice Resonance Structures Arrow Pushing Practice The following is a collection of ions and neutral molecules for which several resonance structures can be drawn. For the ions, the charges can be delocalized

More information

Collaborative Filtering. Radek Pelánek

Collaborative Filtering. Radek Pelánek Collaborative Filtering Radek Pelánek 2015 Collaborative Filtering assumption: users with similar taste in past will have similar taste in future requires only matrix of ratings applicable in many domains

More information

TF-IDF. David Kauchak cs160 Fall 2009 adapted from: http://www.stanford.edu/class/cs276/handouts/lecture6-tfidf.ppt

TF-IDF. David Kauchak cs160 Fall 2009 adapted from: http://www.stanford.edu/class/cs276/handouts/lecture6-tfidf.ppt TF-IDF David Kauchak cs160 Fall 2009 adapted from: http://www.stanford.edu/class/cs276/handouts/lecture6-tfidf.ppt Administrative Homework 3 available soon Assignment 2 available soon Popular media article

More information