SQB: Session-based Query Browsing for More Effective Query Reuse
|
|
|
- Juliet Gwendolyn Boyd
- 9 years ago
- Views:
Transcription
1 SQB: Session-based Query Browsing for More Effective Query Reuse Nodira Khoussainova, YongChul Kwon, Wei-Ting Liao, Magdalena Balazinska, Wolfgang Gatterbauer, and Dan Suciu Department of Computer Science and Engineering University of Washington, Seattle, WA, USA {nodira, yongchul, liaowt, magda, gatter, Abstract. Scientists are generating more data than ever before and are increasingly putting that data (or corresponding metadata) inside a relational database accessible through SQL. Composing non-trivial SQL queries, however, is difficult. Just like code reuse, query reuse can facilitate and accelerate the task of composing new queries. A key challenge is how best to facilitate this reuse. In this paper, we introduce Smart Query Browser (SQB), a tool for searching and browsing through a repository of SQL queries. SQB applies the idea of a query session to improve query browsing and reuse. Informally, a query session is a set of queries authored by a single user to achieve a specific information goal. To evaluate SQB, we conduct a user study with 16 users and find that SQB reduces the query composition time by 57% compared with a traditional, unassisted query composition setting. 1 Introduction Scientists today are able to generate data at an unprecedented scale and rate [1]. Many scientific communities are taking this opportunity to create large-scale, high-quality, curated databases and make them publicly available. Some examples include the Sloan Digital Sky Survey (SDSS) [23], the Incorporated Research Institutions for Seismology (IRIS) [11], and the upcoming Large Synoptic Survey Telescope (LSST) [15]. Because these databases are extremely large, it is impractical and often even impossible for individual scientists to download them locally. Instead, these community datasets are typically stored in relational databases and made available over the Web through powerful query interfaces [26, 24]. Formulating non-trivial SQL queries, however, is a significant challenge for scientists because most are not database experts. One approach to ease this problem is to offer form-based interfaces, where a user fills out some fields in a form and the application composes the SQL query to execute against the database. Such interfaces, however, significantly constrain how users can access the data and thus several scientific repositories enable users to write their own SQL queries in addition to using forms [26]. To leverage such a feature, scientists need the ability to write their own SQL queries. Scientists are technically sophisticated users and can easily grasp simple SELECT- FROM-WHERE queries. To overcome the challenge of writing more complex SQL
2 queries, users often rely on examples found in books, in online tutorials, or obtained from collaborators. Queries from database books and SQL tutorials illustrate generic SQL constructs while queries from collaborators show how to query the database of interest. Furthermore, several large databases (e.g., SDSS) offer a small set of sample queries that can serve as example and be modified to fit a user s specific needs. Sample sets of queries and queries from colleagues can be helpful, but they offer only a limited set of examples. A method for giving users access to a larger number of sample queries would be to use a query browser system: a tool that would allow users to browse through all past queries executed over a database of interest. With access to such a query repository, users would have access to a larger number of potentially useful sample queries. Browsing through past queries, however, can be overwhelming. For example, the SDSS s query log contains more than 106 M queries [13]. Additionally, many queries in a query log can be of low-quality as previous users submitted multiple wrong queries before figuring out how to specify the query that they needed. To be effective, a query browser must thus (1) allow users to efficiently locate relevant queries and (2) distinguish between high-quality and low-quality queries. In this paper, we present the Smart Query Browser (SQB) system. SQB supports efficient retrieval of relevant queries using what we call session-based browsing. SQB enables a user to perform a keyword search over a query log. Instead of simply listing all matching queries, it presents the results as a set of query sessions. A query session, as introduced in our previous work [13], is a set of queries written by a user to achieve a single information goal. SQB thus allows users to view each result query in the context of the task that it aimed to complete. With this approach, SQB helps the user to more rapidly identify relevant queries because the user can decide on the relevance of entire sessions. It also helps users see how simple queries evolved into more complex ones. Finally, query sessions enable users to discriminate between high-quality and lowquality queries: queries that appear near the end of a session tend to be of higher quality because the author has spent time to edit and improve the query [13]. In this paper, we present the SQB system and show the results of its evaluation through a user study with 16 participants. The goal of the study is to investigate whether SQB speeds up the query formulation process by better supporting query reuse. We compare SQB to query formulation with no browser (traditional method) and to a strawman query browser that provides keyword search over the query log but no notion of query sessions. We find that, on average, SQB allows users to complete their tasks 2.3 times faster than the traditional method for writing queries, and 1.3 times faster than with a basic query browser without sessions. 2 SQB Overview We present an overview of SQB s design and implementation. In particular, we describe how SQB utilizes query sessions in order to improve the query browsing experience. Query sessions. Intuitively, a query session is a set of queries that a single user wrote to achieve a specific information goal. For example, an astronomer may want to find in the SDSS database all the stars of a certain brightness in the r-band within 2
3 Fig. 1. Screenshot of SQB. User searches for queries written over the Internet Movie Database (IMDB) containing actor and casts. SQB returns several query sessions. e.g., the first session consists of 3 queries, which reference the tables actor, casts, and movie. The keywords 1900 and 2000 are constants that appear in the queries in the session. arc minutes (i.e., 1 30 th of 1 ) of a known star [13]. The user will likely need to write multiple SQL queries before reaching this goal. To define a query session more formally, we first introduce the notion of a query deriving from another query. We say a query Q 2 derives from a query Q 1, if it occurs later in the query log and the user directly formulated query Q 2 by modifying Q 1 while working toward the same information goal. For example, the astronomer may first write a query that finds all stars with a certain brightness in the r-band (Q 1 ) and then add a predicate on distance to the known star (Q 2 ). Usually, a query Q 2 is derived from a single query Q 1, but can also be derived from multiple queries if, for example, the user merges together several queries to create Q 2 (perhaps using one of the earlier queries as a subquery in the other). A query derivation graph is a directed acyclic graph constructed from a query log. Each node represents a logged query, and there is an edge from Q 1 to Q 2 if and only if Q 2 derives from Q 1. A query session is a connected component of the query derivation graph. If constructed correctly, there is exactly one query session per information goal. Extracting query sessions. SQB is layered on top of a relational database management system (DBMS). SQB first uses the underlying DBMS s query logging functionality to accumulate a query log. Next, it executes the session extraction algorithm [13], which constructs the query derivation graph over the query log, and partitions that graph into query sessions. This part is executed offline. Utilizing query sessions. We outline a typical user interaction with the SQB system, to demonstrate the different ways in which SQB leverages query sessions. Figure 1
4 Fig. 2. SQB automatically computes and displays minimal edits between two queries. Problem Avg. # of SQL queries Stdev. # of SQL queries Fig. 3. Average and standard deviation of the number of queries per problem per student. shows a screenshot of SQB used in the context of composing a query over the Internet Movie Database (IMDB), which is the scenario that we use later in the evaluation. To begin, the user types in keywords to search for queries (e.g., actor AND casts in the screenshot in Figure 1). SQB then uses the full text search and ranking features offered by the underlying DBMS to identify relevant queries. For each query returned by the DBMS, SQB displays the corresponding session. If multiple queries belong to the same session, SQB shows that session only once. In order to rank sessions, SQB assigns each session the rank of its highest-ranked query. Within a session, all matching queries are marked with a blue arrow, and SQB brings to focus first the query that appears latest in the session. Because the queries are grouped according to their semantic information goal, the user needs to read through fewer results. Furthermore, SQB allows easy access to the last query in the session, which tends to be the one of highest quality. To assist the user to quickly evaluate whether a session is potentially useful, SQB attaches a brief summary to each session. The summary includes the number of queries, the referenced tables, and the top five keywords in the session. The summaries are presented with the list of sessions, on the left panel of the GUI (Figure 1). Once the user has selected a session to investigate, SQB lists the queries in execution timestamp order, with their execution duration and result size. Viewing a session at a time allows the user to easily see how queries evolved, so that she can go back to a simpler query or forward to a more complex one. To allow the user to more easily compare queries, SQB also computes and displays the minimal edits to transform one query to the other. Figure 2 shows an example. In the following section, we investigate whether session-based browsing is indeed helpful in query formulation. 3 Evaluation The evaluation dataset consists of all SQL queries written by students in an undergraduate database class, offered at the University of Washington in These queries were automatically logged as students worked on nine different problems for an assignment
5 that used the IMDB database [5]. IMDB contains information about movies, actors, directors, and the connections between them. In total, our dataset contains 7294 queries, written by 52 students. Problem numbers give us ground truth for the query sessions. Figure 3 shows the overall distribution of the number of queries per problem and per student. For the user studies, we used a subset of these queries (492 queries). We evaluate the benefits of SQB for query reuse through a user study. Overall, we find that for trivial queries, query browsing is not helpful and can be distracting. However, complex queries greatly benefit from query browsing. Tasks: Participants were asked to write four SQL queries over the IMDB database (see [5] for the schema): the participant was given an English sentence and was asked to translate the sentence into a SQL query. We present the queries below. Participants: SQB is designed to help novice to intermediate-level SQL users. Therefore, our study participants are graduate students who have either taken an introductory database class or learned SQL through their research. We had a total of 16 participants. Each participant used one interface, and worked on all four tasks. Interfaces: We compare four variants of the SQB interface. The first is the Compose interface, where participants have access to SQB s Compose Query tab (see Figure 1) but no browsing capabilities. It represents the current method for writing SQL queries without assistance. In the second, Semantic interface, participants have access to the SQB tool with full capabilities. All queries are grouped into query sessions. The third and fourth interfaces are called the Basic and Syntactic interfaces, respectively. In these modes, the participants have access to the repository of queries. The queries are either not grouped at all (Basic mode), or grouped according to the set of tables that the queries touched (Syntactic mode). The goal of the Syntactic mode is to compare session-based grouping to another form of query grouping. Study Design: Each participant session was approximately one hour. First, the participant was presented with three examples of English sentences alongside their SQL translations. Next, the participant completed two practice tasks on paper. This exercise served two purposes: (1) to check that the participant had enough SQL knowledge, and (2) to verify that the groups had similar SQL competency, on average. The preparation queries were over a different database than the IMDB database. The table below shows the total completion times of the preparation exercises for each group. Group Average Time Min Time Max Time Compose 4 m 8 s 3 m 29 s 5 m 33 s Semantic 4 m 25 s 3 m 40 s 5 m 40 s Basic 4 m 3 m 10 s 4 m 55 s Syntactic 4 m 23 s 3 m 24 s 5 m 50 s We see that the groups are comparable, with participants in the Semantic group being slowest on average. Following the preparation tasks, we gave each participant a tutorial of the interface they would be using and then let them work on the four SQL tasks on their own. Study Results. To evaluate the interfaces, we compare the completion times for the SQL tasks. If a participant took longer than 30 min, or did not complete the task, we round down their time to 30 min. There were three such instances. One in the Compose group for task three, and two in the Syntactic group for task four. Also, since we have
6 Fig. 4. Total completion time per user, grouped by interface. only four users per interface, we include a participant s timing information even if their query is incorrect. There were a total of five mistakes; three in Compose and two in Syntactic. Most were minor mistakes (i.e., missing or mistyped predicate). Figure 4 shows the total time to complete all four tasks for all sixteen participants. There is one stacked bar per interface group, showing the completion time of each participant in the group. For example, it took the second fastest person in the Compose group about forty minutes to complete all tasks. The second fastest person in the Semantic group, and the second fastest in the Basic group both took almost the same time as the fastest person within the respective group. Therefore, that stripe is hardly visible for those groups. Looking at the left two bars of Figure 4, we see that the average time to complete all tasks with the Semantic interface is 2.3 times faster than the traditional interface for writing queries, Compose. This is a reduction from 46.4 minutes down to 20.1 minutes. Compared to the Basic and Syntactic interfaces, we see that SQB offers a 1.3 factor speed up, and a 1.7 factor speed up, respectively. Next, we examine Figure 5, which presents the average completion time per task. Task 1: Find all the Sci-Fi movies that were released in a leap year. As can be seen in Figure 5, this task is the easiest of the four. It requires a simple join, and no GROUP BY clause. Due to its simplicity, the participants in the Compose group do best because query browsing provides more of a distraction than help. This task shows that for trivial queries, which do not have exact matches within the repository, query browsing is not helpful. Task 2: List all directors who directed 200 movies or more, in ascending order of the number of movies they directed. For this task, we begin to see significant benefits of query browsing. For example, the average completion time in the Semantic group is just under four minutes versus the fourteen minutes average for the Compose interface. This task leverages the query repository because there is a query that differs in the constant and ORDER BY clause. From this task, we learn that complex queries with good matches in the repository benefit greatly from query browsing. Task 3: List all the actors who acted in a film before 1950 and also in a film after Again, this task demonstrates that the SQB tool offers great benefits. The most
7 Fig. 5. Mean task completion time per interface, grouped by task. similar query in the repository to this task only differs in constants. As expected, the participants in the Semantic group, on average, completed this task 3.6 times faster than the users in the Compose group. The graph shows that the Semantic interface performed only marginally better than the Basic one. This was due to one outlier in the Semantic group; the slowest participant in that group took more than twice the time of the second slowest participant. If we were to remove the slowest person from the two groups, then the average completion time for the Semantic group would be 1.3 times faster than the Basic group. A significant challenge that participants faced for this task was in realizing that you need two occurrences of the ACTOR and CASTS tables. Even those using the Basic and Syntactic versions of browser faced this problem because many of them chose to pick the query that was simpler (i.e. had fewer tables in the FROM clause. Those using the Semantic version of the tool did not face this problem, because they were able to see that in the remainder of the query session, this simpler (and incorrect) query was later replaced by a slightly longer but correct version. Once again, we thus see that complex queries with good matches in the repository benefit greatly from session-based query browsing. Task 4: Find the actor with the most number of movies. This task was the most difficult, despite the fact that we gave every participant a hint about the TOP keyword (SQL Server s version of the LIMIT keyword). The closest query in the repository found the movie with the most number of actors. Participants were expected to find this query and restructure it heavily. Ten out of twelve users working with a browser found it. One user from the Basic, and one from the Syntactic group did not find the query. For this task, the Semantic group did 1.5, 2.1, and 2.5 times faster than the Basic, Compose, and Syntactic groups, respectively. It beat Compose because the query is a difficult one, and the Semantic group had available a good candidate query whose structure they could reuse. Semantic beat the other two browsing interfaces because in the Semantic interface users more quickly identified that the query they found was a good one. Whereas in the other two interfaces, the users continued to search for more queries even after they found the best one. In the Semantic group, participants used keyword search an average of 2 times for this task. For the Basic, and Syntactic groups, users had an average of 2.5, and 5.25 searches, respectively.
8 We see that for a complex task, even if the best query matches the task only structurally, SQB is still able to help users significantly. It beats the Basic and Syntactic versions because it helps users to more quickly evaluate the quality of a query. 4 Related Work Web-based tools. If we relax the definition of a query, there are several tools that support query sharing on the web. CoScripter [4] is a web-based system for recording, automating and sharing web browser tasks (e.g., checking flight departure times). Yahoo! Pipes [28], Microsoft Popfly [16], Google Mashup Editor [8], and IBM Lotus Mashups [9] enable users to construct mashups of data from different sources on the web, and later share these mashups with the public. Woogle [7] is a tool for locating web services. Given a web service, Woogle finds similar ones based on the services input and output schemas, as well as web services that compose with the specified one. Commercial DBMS. Commercial tools currently provide query-by-example [19, 29], graphical tools for composing queries [2, 3], and query logging aimed at physical tuning [17, 25, 27]. None provide a query browsing tool aimed at helping user easily reuse, and learn from past queries. Scientific workflows. There are several existing scientific workflow management systems. The MyExperiment project [20] envisions community oriented sharing of scientific workflows. The project is more focused on the social aspects of sharing rather than the automatic curation or browsing of activities within a system. Scriptome [22] is a repository of useful Perl scripts for computational biologists. It is well-curated and allows users to customize the scripts in the repository in predefined ways. Unlike Scriptome, SQB aims to minimize the level of human effort to curate such repositories. In a recent VisTrails paper, the authors demonstrate advanced capabilities of the system such as query-by-example and query-by-analogy with visualizations [21]. SQB is a first step toward such advanced capabilities for SQL queries. Making databases usable. Jagadish et al. [12] discuss the pain points of using database systems today, and describe several usability challenges that database researchers should address. Nandi et al. [18] and Li et al. [14] present different techniques for phrase auto-completion on keyword searches over structured databases. SQB also strives to make databases more usable, however, we are specifically focused on providing users with capabilities that allow them to share, reuse, learn from the past queries. To allow users to more easily understand SQL queries, Ioannidis et al. [10] explore how to automatically translate SQL queries to natural language sentences. Danaparamita and Gatterbauer [6] also investigate this problem, but through the translation of the SQL query into a visual representation. Both projects are complementary to our work, and we hope to extend SQB with this capability in future work. 5 Conclusion We presented SQB, a tool for browsing through past SQL queries. The key insight behind SQB s design is the concept of query sessions. We showed that query sessions
9 help speed up query composition by organizing queries in a large repository in a manner that facilitates the identification of relevant, high-quality queries to use as example. References 1. The fourth paradigm: Data-intensive scientific discovery, BaseNow. Database front-end applications. About SQL Query Builder. basenow.com/help/about_sql_query_builder.asp. 3. T. Catarci, M. F. Costabile, S. Levialdi, and C. Batini. Visual query systems for databases: A survey. J. of Visual Languages & Computing, 8(2): , Coscripter IMDB course assignment. cse444/08au/project/project1/project1.html. 6. J. Danaparamita and W. Gatterbauer. Queryviz: helping users understand sql queries and their patterns. In Proc. EDBT, X. Dong, A. Halevy, J. Madhavan, E. Nemes, and J. Zhang. Similarity search for web services. In Proc. VLDB, pages , Google Mashup Editor IBM Lotus Mashups Y. Ioannidis. From databases to natural language: The unusual direction. In Proc. NLDB, pages 12 16, Incorporated Research Institutions for Seismology H. V. Jagadish, A. Chapman, A. Elkiss, M. Jayapandian, Y. Li, A. Nandi, and C. Yu. Making database systems usable. In Proc. SIGMOD, pages 13 24, N. Khoussainova, Y. Kwon, M. Balazinska, and D. Suciu. Snipsuggest: context-aware autocompletion for sql. Proc. VLDB Endow., 4:22 33, October G. Li, J. Fan, H. Wu, J. Wang, and J. Feng. DBease: Making databases user-friendly and easily accessible. In Proc. of the 5th CIDR Conf., Large Synoptic Survey Telescope Microsoft Popfly Microsoft TechNet. SQL Server TechCenter: Configuring the analysis services query log. config_ssas_querylog.mspx. 18. A. Nandi and H. V. Jagadish. Effective phrase prediction. In Proc. VLDB, pages , Reading objects using query by example. Oracle TopLink developer s guide 10g release 3 (10.1.3). main/_html/qrybas002.htm. 20. D. D. Roure, C. Goble, J. Bhagat, D. Cruickshank, A. Goderis, D. Michaelides, and D. Newman. myexperiment: Defining the social virtual research environment. escience, IEEE International Conference on, 0: , C. E. Scheidegger, H. T. Vo, D. Koop, J. Freire, and C. T. Silva. Querying and re-using workflows with vstrails. In Proc. SIGMOD, pages , The Scriptome - Protocols for Manipulating Biological Data. edu/csb/resources/computational/scrptome/. 23. Sloan Digital Sky Survey IRIS SeismiQuery Siebel business analytics server administration guide: Administering the query log. AdministerQuery14.html.
10 26. Sloan Digital Sky Survey/SkyServer. docs/realquery.asp. 27. Teradata Utility Pack Yahoo Pipes M. M. Zloof. Query-by-example: the invocation and definition of tables and forms. In Proc. VLDB, pages 1 24, 1975.
Exploiting Tag Clouds for Database Browsing and Querying
Exploiting Tag Clouds for Database Browsing and Querying Stefania Leone, Matthias Geel, and Moira C. Norrie Institute for Information Systems, ETH Zurich CH-8092 Zurich, Switzerland {leone geel norrie}@inf.ethz.ch
Using Provenance to Improve Workflow Design
Using Provenance to Improve Workflow Design Frederico T. de Oliveira, Leonardo Murta, Claudia Werner, Marta Mattoso COPPE/ Computer Science Department Federal University of Rio de Janeiro (UFRJ) {ftoliveira,
<Insert Picture Here> Oracle SQL Developer 3.0: Overview and New Features
1 Oracle SQL Developer 3.0: Overview and New Features Sue Harper Senior Principal Product Manager The following is intended to outline our general product direction. It is intended
CMS Query Suite. CS4440 Project Proposal. Chris Baker Michael Cook Soumo Gorai
CMS Query Suite CS4440 Project Proposal Chris Baker Michael Cook Soumo Gorai I) Motivation Relational databases are great places to efficiently store large amounts of information. However, information
Semantic Analysis of Business Process Executions
Semantic Analysis of Business Process Executions Fabio Casati, Ming-Chien Shan Software Technology Laboratory HP Laboratories Palo Alto HPL-2001-328 December 17 th, 2001* E-mail: [casati, shan] @hpl.hp.com
COURSE SYLLABUS COURSE TITLE:
1 COURSE SYLLABUS COURSE TITLE: FORMAT: CERTIFICATION EXAMS: 55043AC Microsoft End to End Business Intelligence Boot Camp Instructor-led None This course syllabus should be used to determine whether the
SAP Data Services 4.X. An Enterprise Information management Solution
SAP Data Services 4.X An Enterprise Information management Solution Table of Contents I. SAP Data Services 4.X... 3 Highlights Training Objectives Audience Pre Requisites Keys to Success Certification
ORACLE BUSINESS INTELLIGENCE WORKSHOP
ORACLE BUSINESS INTELLIGENCE WORKSHOP Integration of Oracle BI Publisher with Oracle Business Intelligence Enterprise Edition Purpose This tutorial mainly covers how Oracle BI Publisher is integrated with
HadoopSPARQL : A Hadoop-based Engine for Multiple SPARQL Query Answering
HadoopSPARQL : A Hadoop-based Engine for Multiple SPARQL Query Answering Chang Liu 1 Jun Qu 1 Guilin Qi 2 Haofen Wang 1 Yong Yu 1 1 Shanghai Jiaotong University, China {liuchang,qujun51319, whfcarter,yyu}@apex.sjtu.edu.cn
So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)
Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we
SAP InfiniteInsight Explorer Analytical Data Management v7.0
End User Documentation Document Version: 1.0-2014-11 SAP InfiniteInsight Explorer Analytical Data Management v7.0 User Guide CUSTOMER Table of Contents 1 Welcome to this Guide... 3 1.1 What this Document
Search Result Optimization using Annotators
Search Result Optimization using Annotators Vishal A. Kamble 1, Amit B. Chougule 2 1 Department of Computer Science and Engineering, D Y Patil College of engineering, Kolhapur, Maharashtra, India 2 Professor,
Fast and Easy Delivery of Data Mining Insights to Reporting Systems
Fast and Easy Delivery of Data Mining Insights to Reporting Systems Ruben Pulido, Christoph Sieb [email protected], [email protected] Abstract: During the last decade data mining and predictive
PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.
PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS Project Project Title Area of Abstract No Specialization 1. Software
The Clear Path to Business Intelligence
SAP Solution in Detail SAP Solutions for Small Businesses and Midsize Companies SAP Crystal Solutions The Clear Path to Business Intelligence Table of Contents 3 Quick Facts 4 Optimize Decisions with SAP
Data Mining and Database Systems: Where is the Intersection?
Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: [email protected] 1 Introduction The promise of decision support systems is to exploit enterprise
KnowledgeSEEKER Marketing Edition
KnowledgeSEEKER Marketing Edition Predictive Analytics for Marketing The Easiest to Use Marketing Analytics Tool KnowledgeSEEKER Marketing Edition is a predictive analytics tool designed for marketers
Oracle Warehouse Builder 10g
Oracle Warehouse Builder 10g Architectural White paper February 2004 Table of contents INTRODUCTION... 3 OVERVIEW... 4 THE DESIGN COMPONENT... 4 THE RUNTIME COMPONENT... 5 THE DESIGN ARCHITECTURE... 6
Five Steps to Integrate SalesForce.com with 3 rd -Party Systems and Avoid Most Common Mistakes
Five Steps to Integrate SalesForce.com with 3 rd -Party Systems and Avoid Most Common Mistakes This white paper will help you learn how to integrate your SalesForce.com data with 3 rd -party on-demand,
Database Application Developer Tools Using Static Analysis and Dynamic Profiling
Database Application Developer Tools Using Static Analysis and Dynamic Profiling Surajit Chaudhuri, Vivek Narasayya, Manoj Syamala Microsoft Research {surajitc,viveknar,manojsy}@microsoft.com Abstract
Microsoft Dynamics NAV 2015 What s new?
What s new? RapidStart Upgrade includes several enhancements for upgrading solutions. RapidStart Upgrade - Code New application merge utilities help partners upgrade their solution - application code and
InfiniteInsight 6.5 sp4
End User Documentation Document Version: 1.0 2013-11-19 CUSTOMER InfiniteInsight 6.5 sp4 Toolkit User Guide Table of Contents Table of Contents About this Document 3 Common Steps 4 Selecting a Data Set...
<no narration for this slide>
1 2 The standard narration text is : After completing this lesson, you will be able to: < > SAP Visual Intelligence is our latest innovation
File S1: Supplementary Information of CloudDOE
File S1: Supplementary Information of CloudDOE Table of Contents 1. Prerequisites of CloudDOE... 2 2. An In-depth Discussion of Deploying a Hadoop Cloud... 2 Prerequisites of deployment... 2 Table S1.
Comparative Analysis of Free IT Monitoring Platforms. Review of SolarWinds, CA Technologies, and Nagios IT monitoring platforms
Comparative Analysis of Free IT Monitoring Platforms Review of SolarWinds, CA Technologies, and Nagios IT monitoring platforms The new CA Nimsoft Monitor Snap solution offers users broad access to monitor
MicroStrategy Course Catalog
MicroStrategy Course Catalog 1 microstrategy.com/education 3 MicroStrategy course matrix 4 MicroStrategy 9 8 MicroStrategy 10 table of contents MicroStrategy course matrix MICROSTRATEGY 9 MICROSTRATEGY
WHITE PAPER Get Your Business Intelligence in a "Box": Start Making Better Decisions Faster with the New HP Business Decision Appliance
WHITE PAPER Get Your Business Intelligence in a "Box": Start Making Better Decisions Faster with the New HP Business Decision Appliance Sponsored by: HP and Microsoft Dan Vesset February 2011 Brian McDonough
2. MOTIVATING SCENARIOS 1. INTRODUCTION
Multiple Dimensions of Concern in Software Testing Stanley M. Sutton, Jr. EC Cubed, Inc. 15 River Road, Suite 310 Wilton, Connecticut 06897 [email protected] 1. INTRODUCTION Software testing is an area
Data Integration with Talend Open Studio Robert A. Nisbet, Ph.D.
Data Integration with Talend Open Studio Robert A. Nisbet, Ph.D. Most college courses in statistical analysis and data mining are focus on the mathematical techniques for analyzing data structures, rather
How to Prepare for the Upgrade to Microsoft Dynamics CRM 2013 (On-premises)
How to Prepare for the Upgrade to Microsoft Dynamics CRM 2013 (On-premises) COMPANY: Microsoft Corporation RELEASED: September 2013 VERSION: 1.0 Copyright This document is provided "as-is". Information
Design Considerations for a Visualization and Simulation Tool for CBMS Data
Design Considerations for a Visualization and Simulation Tool for CBMS Data Nelson Marcos 1,*, Gerardo Largoza 2, Briane Paul Samson 3, Johnn Jelvin S. Base 4, Lawrence Patrick C. Calulo 5, Bervyn S. Co
From Databases to Natural Language: The Unusual Direction
From Databases to Natural Language: The Unusual Direction Yannis Ioannidis Dept. of Informatics & Telecommunications, MaDgIK Lab University of Athens, Hellas (Greece) [email protected] http://www.di.uoa.gr/
MOOCdb: Developing Data Standards for MOOC Data Science
MOOCdb: Developing Data Standards for MOOC Data Science Kalyan Veeramachaneni, Franck Dernoncourt, Colin Taylor, Zachary Pardos, and Una-May O Reilly Massachusetts Institute of Technology, USA. {kalyan,francky,colin
Holistic Performance Analysis of J2EE Applications
Holistic Performance Analysis of J2EE Applications By Madhu Tanikella In order to identify and resolve performance problems of enterprise Java Applications and reduce the time-to-market, performance analysis
Tableau and the Enterprise Data Warehouse: The Visual Approach to Business Intelligence
Tableau and the Enterprise Data Warehouse: The Visual Approach to Business Intelligence AUTHOR Dan Jewett Tableau Software DATE 08/01/2009 Organizations focus on acquiring customers, increasing revenues
CHAPTER 1 INTRODUCTION
1 CHAPTER 1 INTRODUCTION Exploration is a process of discovery. In the database exploration process, an analyst executes a sequence of transformations over a collection of data structures to discover useful
Data Warehousing and OLAP Technology for Knowledge Discovery
542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories
Microsoft SQL Server Installation Guide
Microsoft SQL Server Installation Guide Version 3.0 For SQL Server 2014 Developer & 2012 Express October 2014 Copyright 2010 2014 Robert Schudy, Warren Mansur and Jack Polnar Permission granted for any
Setting up SQL Translation Framework OBE for Database 12cR1
Setting up SQL Translation Framework OBE for Database 12cR1 Overview Purpose This tutorial shows you how to use have an environment ready to demo the new Oracle Database 12c feature, SQL Translation Framework,
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 ISSN 2229-5518
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 5 INTELLIGENT MULTIDIMENSIONAL DATABASE INTERFACE Mona Gharib Mohamed Reda Zahraa E. Mohamed Faculty of Science,
International Journal of Engineering Research ISSN: 2348-4039 & Management Technology November-2015 Volume 2, Issue-6
International Journal of Engineering Research ISSN: 2348-4039 & Management Technology Email: [email protected] November-2015 Volume 2, Issue-6 www.ijermt.org Modeling Big Data Characteristics for Discovering
14.1. bs^ir^qfkd=obcib`qflk= Ñçê=emI=rkfuI=~åÇ=léÉåsjp=eçëíë
14.1 bs^ir^qfkd=obcib`qflk= Ñçê=emI=rkfuI=~åÇ=léÉåsjp=eçëíë bî~äì~íáåö=oéñäéåíáçå=ñçê=emi=rkfui=~åç=lééåsjp=eçëíë This guide walks you quickly through key Reflection features. It covers: Getting Connected
METADATA-DRIVEN QLIKVIEW APPLICATIONS AND POWERFUL DATA INTEGRATION WITH QLIKVIEW EXPRESSOR
METADATA-DRIVEN QLIKVIEW APPLICATIONS AND POWERFUL DATA INTEGRATION WITH QLIKVIEW EXPRESSOR A QlikView Technical Brief Document March 2013 qlikview.com Introduction This technical brief highlights a subset
Tableau Metadata Model
Tableau Metadata Model Author: Marc Reuter Senior Director, Strategic Solutions, Tableau Software March 2012 p2 Most Business Intelligence platforms fall into one of two metadata camps: either model the
Management Consulting Systems Integration Managed Services WHITE PAPER DATA DISCOVERY VS ENTERPRISE BUSINESS INTELLIGENCE
Management Consulting Systems Integration Managed Services WHITE PAPER DATA DISCOVERY VS ENTERPRISE BUSINESS INTELLIGENCE INTRODUCTION Over the past several years a new category of Business Intelligence
Introduction. Introduction: Database management system. Introduction: DBS concepts & architecture. Introduction: DBS versus File system
Introduction: management system Introduction s vs. files Basic concepts Brief history of databases Architectures & languages System User / Programmer Application program Software to process queries Software
Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix ABSTRACT INTRODUCTION Data Access
Release 2.1 of SAS Add-In for Microsoft Office Bringing Microsoft PowerPoint into the Mix Jennifer Clegg, SAS Institute Inc., Cary, NC Eric Hill, SAS Institute Inc., Cary, NC ABSTRACT Release 2.1 of SAS
Administrator s Guide
SEO Toolkit 1.3.0 for Sitecore CMS 6.5 Administrator s Guide Rev: 2011-06-07 SEO Toolkit 1.3.0 for Sitecore CMS 6.5 Administrator s Guide How to use the Search Engine Optimization Toolkit to optimize your
A Business Process Services Portal
A Business Process Services Portal IBM Research Report RZ 3782 Cédric Favre 1, Zohar Feldman 3, Beat Gfeller 1, Thomas Gschwind 1, Jana Koehler 1, Jochen M. Küster 1, Oleksandr Maistrenko 1, Alexandru
Microsoft Dynamics NAV
Microsoft Dynamics NAV Maximising value through business insight Business Intelligence White Paper October 2015 CONTENTS Reports were tedious. Earlier it would take days for manual collation. Now all this
SOLUTION BRIEF CA ERwin Modeling. How can I understand, manage and govern complex data assets and improve business agility?
SOLUTION BRIEF CA ERwin Modeling How can I understand, manage and govern complex data assets and improve business agility? SOLUTION BRIEF CA DATABASE MANAGEMENT FOR DB2 FOR z/os DRAFT CA ERwin Modeling
A Data Browsing from Various Sources Driven by the User s Data Models
A Data Browsing from Various Sources Driven by the User s Data Models Guntis Arnicans, Girts Karnitis University of Latvia, Raina blvd. 9, Riga, Latvia {Guntis.Arnicans, Girts.Karnitis}@lu.lv Abstract.
Introduction: Database management system
Introduction Databases vs. files Basic concepts Brief history of databases Architectures & languages Introduction: Database management system User / Programmer Database System Application program Software
Consumption of OData Services of Open Items Analytics Dashboard using SAP Predictive Analysis
Consumption of OData Services of Open Items Analytics Dashboard using SAP Predictive Analysis (Version 1.17) For validation Document version 0.1 7/7/2014 Contents What is SAP Predictive Analytics?... 3
IBM WebSphere ILOG Rules for.net
Automate business decisions and accelerate time-to-market IBM WebSphere ILOG Rules for.net Business rule management for Microsoft.NET and SOA environments Highlights Complete BRMS for.net Integration with
Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens
Scalable End-User Access to Big Data http://www.optique-project.eu/ HELLENIC REPUBLIC National and Kapodistrian University of Athens 1 Optique: Improving the competitiveness of European industry For many
Administrator's Guide
Search Engine Optimization Module Administrator's Guide Installation and configuration advice for administrators and developers Sitecore Corporation Table of Contents Chapter 1 Installation 3 Chapter 2
Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole
Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
Database Optimizing Services
Database Systems Journal vol. I, no. 2/2010 55 Database Optimizing Services Adrian GHENCEA 1, Immo GIEGER 2 1 University Titu Maiorescu Bucharest, Romania 2 Bodenstedt-Wilhelmschule Peine, Deutschland
IBM InfoSphere Optim Test Data Management
IBM InfoSphere Optim Test Data Management Highlights Create referentially intact, right-sized test databases or data warehouses Automate test result comparisons to identify hidden errors and correct defects
Oracle Database 10g Express
Oracle Database 10g Express This tutorial prepares the Oracle Database 10g Express Edition Developer to perform common development and administrative tasks of Oracle Database 10g Express Edition. Objectives
Design of Data Archive in Virtual Test Architecture
Journal of Information Hiding and Multimedia Signal Processing 2014 ISSN 2073-4212 Ubiquitous International Volume 5, Number 1, January 2014 Design of Data Archive in Virtual Test Architecture Lian-Lei
Microsoft Dynamics NAV
Microsoft Dynamics NAV 2015 Microsoft Dynamics NAV Maximising value through business insight Business Intelligence White Paper December 2014 CONTENTS Reports were tedious. Earlier it would take days for
MASTER DATA MANAGEMENT TEST ENABLER
MASTER DATA MANAGEMENT TEST ENABLER Sagar Porov 1, Arupratan Santra 2, Sundaresvaran J 3 Infosys, (India) ABSTRACT All Organization needs to handle important data (customer, employee, product, stores,
PIE. Internal Structure
PIE Internal Structure PIE Composition PIE (Processware Integration Environment) is a set of programs for integration of heterogeneous applications. The final set depends on the purposes of a solution
Cloud Computing: Computing as a Service. Prof. Daivashala Deshmukh Maharashtra Institute of Technology, Aurangabad
Cloud Computing: Computing as a Service Prof. Daivashala Deshmukh Maharashtra Institute of Technology, Aurangabad Abstract: Computing as a utility. is a dream that dates from the beginning from the computer
Data processing goes big
Test report: Integration Big Data Edition Data processing goes big Dr. Götz Güttich Integration is a powerful set of tools to access, transform, move and synchronize data. With more than 450 connectors,
GXP WebView GEOSPATIAL EXPLOITATION PRODUCTS (GXP )
GEOSPATIAL EXPLOITATION PRODUCTS (GXP ) GXP WebView A Web-based, full-resolution image viewing tool for visualization, annotation, measurement, and publishing providing access to remote data from any location.
Modernizing the User Experience with SAP
User Experience Solutions from SAP Modernizing the User Experience with SAP 2014 SAP SE or an SAP affiliate company. All rights reserved. The User Experience Revolution UX Solutions UX Services Benefits
Lightweight Data Integration using the WebComposition Data Grid Service
Lightweight Data Integration using the WebComposition Data Grid Service Ralph Sommermeier 1, Andreas Heil 2, Martin Gaedke 1 1 Chemnitz University of Technology, Faculty of Computer Science, Distributed
Semantic Search in Portals using Ontologies
Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br
SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box)
SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box) SQL Server White Paper Published: January 2012 Applies to: SQL Server 2012 Summary: This paper explains the different ways in which databases
Base One's Rich Client Architecture
Base One's Rich Client Architecture Base One provides a unique approach for developing Internet-enabled applications, combining both efficiency and ease of programming through its "Rich Client" architecture.
Microsoft Dynamics NAV
Microsoft Dynamics NAV Maximising value through business insight Business Intelligence White Paper May 2013 Reports were tedious. Earlier it would take days for manual collation. Now all this is available
Microsoft SQL Server Installation Guide
Microsoft SQL Server Installation Guide Version 2.1 For SQL Server 2012 January 2013 Copyright 2010 2013 Robert Schudy, Warren Mansur and Jack Polnar Permission granted for any use of Boston University
Visualization of Semantic Windows with SciDB Integration
Visualization of Semantic Windows with SciDB Integration Hasan Tuna Icingir Department of Computer Science Brown University Providence, RI 02912 [email protected] February 6, 2013 Abstract Interactive Data
Survey of Canadian and International Data Management Initiatives. By Diego Argáez and Kathleen Shearer
Survey of Canadian and International Data Management Initiatives By Diego Argáez and Kathleen Shearer on behalf of the CARL Data Management Working Group (Working paper) April 28, 2008 Introduction Today,
PBI365: Data Analytics and Reporting with Power BI
POWER BI FOR BUSINESS ANALYSTS AND POWER USERS 3 DAYS PBI365: Data Analytics and Reporting with Power BI AUDIENCE FORMAT COURSE DESCRIPTION Business Analysts, Statisticians and Data Scientists Instructor-led
Apache Web Server Execution Tracing Using Third Eye
Apache Web Server Execution Tracing Using Third Eye Raimondas Lencevicius Alexander Ran Rahav Yairi Nokia Research Center, 5 Wayside Road, Burlington, MA 01803, USA [email protected] [email protected]
PATROL From a Database Administrator s Perspective
PATROL From a Database Administrator s Perspective September 28, 2001 Author: Cindy Bean Senior Software Consultant BMC Software, Inc. 3/4/02 2 Table of Contents Introduction 5 Database Administrator Tasks
Auto-Classification in SharePoint. How BA Insight AutoClassifier Integrates with the SharePoint Managed Metadata Service
How BA Insight AutoClassifier Integrates with the SharePoint Managed Metadata Service BA Insight 2015 Table of Contents Abstract... 3 Findability and the Value of Metadata... 3 Finding Information is Hard...
Business Insight Report Authoring Getting Started Guide
Business Insight Report Authoring Getting Started Guide Version: 6.6 Written by: Product Documentation, R&D Date: February 2011 ImageNow and CaptureNow are registered trademarks of Perceptive Software,
<Insert Picture Here> Move to Oracle Database with Oracle SQL Developer Migrations
Move to Oracle Database with Oracle SQL Developer Migrations The following is intended to outline our general product direction. It is intended for information purposes only, and
Surveying and evaluating tools for managing processes for software intensive systems
Master Thesis in Software Engineering 30 Credits, Advanced Level Surveying and evaluating tools for managing processes for software intensive systems Anuradha Suryadevara IDT Mälardalen University, ABB
Predictive Model for supporting ITIL Business Change Management Processes
Predictive Model for supporting ITIL Business Change Management Processes Jan Suchy 1,*, Milan Suchy 1, 1 GRADIENT ECM, Kosicka 56, 82108 Bratislava, Slovakia {Jan.Suchy, Milan.Suchy}@gradientecm.com Abstract.
Easy configuration of NETCONF devices
Easy configuration of NETCONF devices David Alexa 1 Tomas Cejka 2 FIT, CTU in Prague CESNET, a.l.e. Czech Republic Czech Republic [email protected] [email protected] Abstract. It is necessary for developers
Graphical Web based Tool for Generating Query from Star Schema
Graphical Web based Tool for Generating Query from Star Schema Mohammed Anbar a, Ku Ruhana Ku-Mahamud b a College of Arts and Sciences Universiti Utara Malaysia, 0600 Sintok, Kedah, Malaysia Tel: 604-2449604
