THE SHORTEST PATH ALGORITHM PERFORMANCE COMPARISON IN GRAPH AND RELATIONAL DATABASE ON A TRANSPORTATION NETWORK
|
|
|
- Griffin Holland
- 9 years ago
- Views:
Transcription
1 MARIO MILER, M.Eng. DAMIR MEDAK, Ph.D. DRAŽEN ODOBAŠIĆ, M.Eng. Faculty of Geodesy, University of Zagreb Kačićeva 26, 1 Zagreb, Croatia Information and Communication Technology Preliminary Communication Accepted: Mar. 27, 213 Approved: Oct. 12, 213 THE SHORTEST PATH ALGORITHM PERFORMANCE COMPARISON IN GRAPH AND RELATIONAL DATABASE ON A TRANSPORTATION NETWORK ABSTRACT In the field of geoinformation and transportation science, the shortest path is calculated on graph data mostly found in road and transportation networks. This data is often stored in various database systems. Many applications dealing with transportation network require calculation of the shortest path. The objective of this research is to compare the performance of Dijkstra shortest path calculation in PostgreSQL (with pgrouting) and Neo4j graph database for the purpose of determining if there is any difference regarding the speed of the calculation. Benchmarking was done on commodity hardware using OpenStreetMap road network. The first assumption is that Neo4j graph database would be well suited for the shortest path calculation on transportation networks but this does not come without some cost. Memory proved to be an issue in Neo4j setup when dealing with larger transportation networks. KEY WORDS pgrouting, OpenStreetMap, Dijkstra, benchmark, Neo4j, PostgreSQL 1. INTRODUCTION The simplest question in routing that needs an answer is: What is the most effective way to get from here to there?. To answer this question a simple and complex shortest path algorithms are used. Today, the shortest path analysis is used in everyday life, for example in car or personal navigation systems. The problem of the shortest path calculation has been the topic of research for several decades [1 4] but the influence of database system implementation has been rarely considered. With recent rapid development of database technology, there has always been a question: Will a graph database perform better in the shortest path calculation than a relation database?. Relational databases are often used for storing transportation network data. Many of them provide navigation models which permit managing networks and perform network operations [5]. Many researchers have been working with such systems and there are many examples and applications of databasebased routing systems in the traffic and transportation field. Christopher et al. [6] designed a collaborative route planning system for utility vehicles which relies heavily on route calculation based on underlying transportation network stored in relational databases. Innerebner et al. [7] created a web-based system named ISOGA that uses isochrones to perform geographical reachability analysis over a transportation network stored in a relational database. Zlatanova et al. [5] focused on examining the problem of finding the optimal path for moving objects to avoid moving obstacles using a relational database. Most of the databases used today are based on the relational model [8] which is a theoretical basis for relational database management systems (RDBMSs). RDBMSs today are dominant for storing various kinds of structured data. These systems were primarily designed for storing data used by business applications. Over the years, the need for storing other types of data and large amounts like spatial, hierarchical and graph data has emerged but technology remained the same. Relational databases proved to be very efficient in storing and querying large amounts of data but they were not adequate for analyzing relations among entities. One of these examples is the shortest path routing algorithm. This algorithm requires a large amount of joins statements which are computationally very ex- Promet Traffic&Transportation, Vol. 26, 214, No. 1,
2 pensive and cannot be used in a standard SQL query [9]. Most of the time we are forced to adapt a data model to fit into the RDBMS structure, and we do not change the technology that would be more suited for this type of data, e.g. network data. Stonebraker et al. [1] changed the way of thinking about data and found that there is no one-size-fits-all database solution. Instead, each database problem can be solved with another database solution. This led to a large number of alternative storage systems that are used today. Over the last few years there has been a rapid expansion of new database technologies popularly called NoSQL (Not Only SQL). The primary focus of these technologies is on efficient handling of large amounts of semistructured data used in the web space. Although such databases and similar systems already exist (e.g. object-oriented databases or XML stores) only in the recent years have they become adopted on the market and in the community. Although NoSQL is a broad term for almost any database that is different in some way from the standard RDBMS, there are several classifications of NoSQL databases. Most of them can be classified as: key-value, document, wide-column, object and graph stores [11 14]. 1.1 Graph Database System The database that has been growing in popularity is a graph database or also called Graph Database System (GDB). The most significant reason for this growth is the importance of graph data structure in the fields of social, information and biological sciences as well as in the development of computer hardware. The graph data structure fits naturally for modelling of world objects, entities and relationships between them [15]. According to Angles et al. [16, p1] the graph database models are defined as those in which data structures for the schema and instances are modeled as graphs or generalizations of them, and data manipulation is expressed by graph-oriented operations and type constructors. In other words, a graph database is a database management system that is based on graph theory introduced in 1736 [17]. The graph theory uses nodes for storing entities and edges for relationships among them. Graph databases emphasize the relations among entities rather than entities themselves [18]. Any model can be thought of as a representation of reality. A graph model can be thought of as a collection of objects such as people or places and the relationship between them such as friend or living in. Those objects and relationships form a network or a graph [19]. A graph structure is defined as G = ^V, Eh where V = j1, j2, j3, f, jn is a set of vertices and E is a set of edges E = f1, f2, f3, f, fn. An edge f i! E is defined with triple ^ij,, ~ ih where ij,! V and ~ i is a positive real number. A directed edge is defined as i " j. The graph database systems today are still a nascent technology and in major development. Most of them support directed attributed multi-graph also known as property graph. This allows attaching attributes to every node and relationship on top of graph structure [19]. The property graph is an important prerequisite for calculating the weighted shortest path, e.g. using Dijkstra algorithm. The most popular shortest path implementation on a relational database is pgrouting which is implemented on top of the PostgreSQL database. Graph databases already have the shortest path implementations in their core. Neo4j is one of the graph database systems used for semi-structured and network-oriented data and has been in production since 23. It was developed in Java programming language [2] and free for non-commercial use. Neo4j can be used as an embedded or server database. It has started as an embedded database but recently its main usage has been as a server database version and in the future it will continue to develop in that direction. 1.2 Database benchmarks A database benchmark is a standard set of operations and instructions sent to two or more database systems to evaluate relative and quantitative performance in a controlled experiment [21]. The most common standard RDBMS benchmarks are TCP-(C, H, E) benchmarks [22] but there are also other database benchmarks like Bristlecone or Open Source Development Lab Data Base Test Suite (OSDL-DBTS) [23]. These benchmarks attempt to simulate real-world scenarios mostly in the business domain applications and none of these standard database benchmarks can be used for this research. The business domain data structure is significantly different than the one of the transportation networks. Transportation networks have graph-like data structure. The spatial database is a database that has been developed and optimized for storing, querying and manipulating 2D and 3D spatial objects (points, lines, polygons, etc.) in geometric space. They are mostly used by geographic information systems (GIS) but can be used in other domain applications where space plays an important factor. Spatial database benchmarks are by construction similar to standard database benchmarks but their main focus is on spatial query and spatial join performance. There are only a few spatial database benchmark projects, especially ones detailed as TCP benchmarks used in standard RDBMS. One of the most recent and versatile benchmarks is Jackpine [23]. Jackpine is a vector-based spatial database benchmark based on Bristlcone 76 Promet Traffic&Transportation, Vol. 26, 214, No. 1, 75-82
3 benchmark. It is a flexible benchmark that uses Java Database Connectivity (JDBC) driver implementation and can support almost any database. The author of this benchmark used two approaches, micro and macro. The micro benchmark is used for testing topological relationships, spatial analysis functions and data loading queries. This part of the Jackpine is used to assess database processing and index performance. To our knowledge there are only a handful of graph database benchmarks and guidelines [9, 18, 24]. Dominguez-Sal et al. [18] propose guidelines for a graph oriented benchmark differing from any standard or spatial database benchmark. According to these authors the proposed tests are: traversal (which includes the shortest path calculation relevant for this research), graph analysis, connected components, communities, centrality measures, pattern matching, graph anonymisation and some application domain generic operation. Queries that they propose must represent the workload of the real environment in application domain. They also noted that the shortest path graph analysis and real time analysis of traffic networks are one of the application domains where graph databases could prove some benefit. This paper provides a benchmark designed to measure the performance of a Dijkstra shortest path algorithm in a graph database (Neo4j) and relational database (PostgreSQL/pgRouting). 2. RESEARCH METHODOLOGY The evaluation methodology designed to compare both systems involves an objective benchmark comparison based on system documentation and experience. The following section describes data used for the benchmark and specifies hardware and software used for the system being tested. 2.1 Dataset The graph data used in this benchmark is based on Austria road dataset from the OpenStreetMap (OSM) project [25]. OSM data model uses nodes (a point with coordinates), segments (directed link between two nodes) and ways (ordered list of segments) to represent the transportation network [26]. Any of these objects can contain tags. Tags are key/value pairs denoting the type and properties of the object. Basically, OSM is a form of a sparse directed property graph but not stored as such in a native form. OSM nodes represent nodes of the graph and ways form relation between those nodes. Tags can be used as property but in our case we use length of a way as a property for weight in the shortest path calculation. Raw OSM data are stored in the XML based format and as such cannot be imported into Neo4j or PostgreSQL database. In order to have consistent data in both databases, OSM data were converted by using osm2po tool [27]. The osm2po creates a graph representation of provided OSM dataset for direct import into PostgreSQL database. After importing data into PostgreSQL, the data were exported into acceptable cvs format for an import into Neo4j database. A small Java program was created that batch imported cvs file into Neo4j database with creating appropriate indexes. The data insertion time metric was not measured because of different data storage models. The converted data in PostgreSQL were stored as a table with columns: id, source, target and cost. These columns represent relations between nodes with cost (length of the path) as a weight. In Neo4j database, each edge has a non-negative weight ~ i equal to the length of the path. Both of these two data models form the same graph of around 63, nodes and 75, relations. 2.2 Experimental setting A benchmark is created in such a way that it can be repeated, data used is open and all of the fine tuning is described in the official documentation [28]. For both of the databases the same dataset was used. We used Neo4j enterprise server version and pgrouting 1.5 on top of PostgreSQL 8.4 database. The recommendations found in official documentation of each database were used for both setups. For Neo4j case, we provided Java Virtual Machine (JVM) maximum amount of memory available on a testing computer and applied gcr cache type. Cache is a fast storage mechanism used to temporarily store data for future need. Neo4j grc cache type is a special type of cache which provides means of assigning a specific amount of memory for loaded nodes and relationship for the purpose of fast insert/lookup operations [28]. During all of our experiments, we monitored available RAM and possible swap hits to disk. Swap disk (also called virtual memory) is temporary space used on a hard disk as RAM. It is used in cases when data do not fit into RAM. It is slow, inefficient and should be avoided. Tests were performed with cold and hot setup to determine the time needed for a database to load data into memory. The measured cold time is the time required to finish a given set of queries immediately after flushing caches or after reboot. In reality this is achieved by rebooting the system [29]. The hot test run consists of executing the same set of queries as from the cold run, in the same sequence order. The measured hot time is the time required to finish a given set of queries immediately after performing the cold run without flushing any cache or restarting the database [29]. All of the cold tests were executed with cleared disk cache at Promet Traffic&Transportation, Vol. 26, 214, No. 1,
4 operating system level. This is especially important for PostgreSQL database which relies heavily on operating file system cashing. Hot tests were executed right after the cold test with the same queries to assure the use of cached data. Both of the databases were monitored to assure cache hits in hot tests. The cold test begins with database and disk cache completely empty. Empty cache ensures that memory contains no data and that every new test starts independently from the previous one. At least three hot runs were made to assure time stability and that data was cached in running hot test. The client computer, from which the testing was performed, does not cache requests, so all of the tests (cold and hot) were executed with cache empty on the client computer. A 19 pairs of nodes from the dataset described in the previous section were randomly chosen. These pairs represent locations between which the shortest paths is to be calculated on a given transportation network. The distribution of returned path lengths expressed in the number of nodes is shown in Figure 1. Frequency (N) Number of returned nodes (N) Figure 1 - Distribution of returned path lengths in number of nodes Our tests were executed with one, two, four, eight and sixteen threads to simulate concurrent requests on the server. In the context of performance testing, thread is the number of concurrent connections to the database or application. Each thread executes its queries completely independently of other threads. We have not been able to test more than 16 threads because of hardware limitation. Our experiments were conducted on a computer equipped with Intel Core i5 75 at 2.67GHz and 6 GB of RAM and installed Ubuntu Server 1.4 LTS 64-bit operating system with kernel version server and OpenJDK Server VM. For the testing the Apache JMeter was used on a separate computer in the same isolated gigabit network. 3. RESEARCH RESULTS Based on the official Neo4j documentation [28], JVM gcr cache type is recommended for Neo4j in enterprise environment and as seen in Figure 2, our findings proved to be true with Dijkstra shortest path queries. Memory and cache type are two configuration changes that proved to be the most influential ones on the performance of Neo4j routing calculation among all of the recommendations found in Neo4j documentation [28]. Although specified in the documentation as a recommendation flag in production environment, we found that using UseConcMarkSweepGC or UseSerialGC JVM flag slows down Dijkstra shortest path computation by around 3% with one-thread tests. UseParallelGC flag did not have any noticeable effect on computational performance. These flags are part of the mechanism called garbage collector which handles memory in applications written in Java programming language. PostgreSQL and pgrouting on the other hand, did not require any fine tuning to optimize their performance. All of the presented results do not have swap hits to the disk and maximum available RAM was never achieved in any of our successful tests. All of the test runs proved to have stable time as can be seen in Figure 2. None of the time measurements presented in this paper were averaged in any way. Memory consumption presented in Figure 3 is the peak (maximum) value from both cold and hot runs Time (ms) cold hot1 hot2 5 Neo4j(soft) Neo4j(gcr) Neo4j(weak) Neo4j(strong) PostgreSQL hot3 Figure 2 - Duration of cold and three hot tests for Neo4j(by cache type) and PostgreSQL 78 Promet Traffic&Transportation, Vol. 26, 214, No. 1, 75-82
5 Memory (MB) Number of threads (N) Neo4j PostgreSQL Figure 3 - Peak memory consumption by the number of threads Time (s) Neo4j(cold) Neo4j(hot) PostgreSQL(cold) PostgreSQL(hot) Number of threads (N) Figure 4 summarizes the results of the benchmark for a different number of threads on cold and hot test for Neo4j and PostgreSQL database. As we can see in Figure 4, threads over 16 increase computation time for both of the database systems as well as memory consumption as seen in Figure 3. The maximum number of threads depends on the running hardware. It was noted that hot tests, after the first one, did not change, as presented in Figure 2. This demonstrates that all of the data were loaded into the memory and none were released after the query finished. Figure 5 shows constant time between requests for PostgreSQL calculation in contrast to variable time for Neo4j. This means that it would take the same time to load data for a query between two neighbouring nodes or for a full graph traversal, plus the time needed for calculating the shortest path. 4. DISCUSSION The first assumption is that if some graph data structure is stored in a graph database, then the shortest path calculations must perform better because the data are stored in their native form. We demonstrated that graph database is 3% to 35% faster in calculation of the shortest path in transportation use case but Figure 4 - Time required for cold and hot test to finish for Neo4j and PostgreSQL uses more memory, from 2% to 75%, depending on the number of used threads. The first full graph database performance analysis was presented in paper [9] which implemented queries of HPC Scalable Graph Analysis Benchmark v1. (HPC-SGAB) [24]. HPC-SGAB was designed by several leading researchers from academia and industrial companies. In their paper, the authors tested the performance of four graph database systems on a synthetic generated graph: Neo4j, Jena, HypergraphDB and DEX. The test was composed of four kernels: edge and node insertion performance, measuring time needed to find a set of edges that meet a condition, measuring time spent on building a subgraph and traversal performance over the whole graph. The result analysis of this paper was the basis for graph database of choice for our research. The results showed that DEX and Neo4j were the most efficient GDBs at the time of writing. In contrast to databases, the routing can also be done with specialized in-memory routing engines. Currently, there are several in-memory routing engines available such as osm2po, Open Source Routing Machine (OSRM) or GraphHopper. Comparing the special- Time (ms) Cold start Neo4j PostgreSQL Time (ms) Hot start Neo4j PostgreSQL Request (N) Request (N) Figure 5 - Cold and hot test forfirst 2 requests, one thread Promet Traffic&Transportation, Vol. 26, 214, No. 1,
6 ized in-memory routing engines with full-fledged database system would not yield fair comparison because such routing solutions do not provide any of the database management system abilities which provide substantial performance overhead. PostgreSQL and Neo4j are general purpose database management systems and their main purpose is not routing engine but managing and storing of structured and semi-structured data. Although pgrouting is also in-memory routing engine it is still based on data provided by the underlying database management system. pgrouting does not retrieve graph data structure directly from memory for each shortest path calculation as in-memory routing engines do. It relies heavily on the underlying database for data retrieval and manipulation. Dijkstra algorithm was chosen because it was the only algorithm implemented on both solutions. Because both of the database systems have their source code openly available, it can be seen that they both have implemented very similar Dijkstra algorithms. The obvious difference in implementation is because Neo4j uses graph traversal queries whereas pgrouting uses pure mathematical calculation. Our assumption was that GDB would have more algorithm implementations for the shortest path calculation. Both of the tested systems also provide A* shortest path algorithm, but it cannot be tested because its implementation in Neo4j is only available for the embedded database version. Using the embedded version of the database would provide inconsistent results as it would be directly tied to the application. Another thing that has to be taken into account is that Neo4j Dijkstra implementation is closely tied to the Neo4j traversal speed and is still in major development. On the other hand, pgrouting depends on PostgreSQL only for data retrieval, not for the calculation and in most cases of use does not benefit from database indexes. The developers of Neo4j announced bidirectional traversal algorithm which will greatly increase the speed of calculation. Both database systems have the source code openly available, and it could be easily concluded that both use the same Dijkstra implementation. At the beginning of our testing, we wanted to use default configuration for both of the database systems, but that was not possible in case of Neo4j. Neo4js default configuration proved to be insufficient for any Dijkstra shortest path calculation on any of our datasets because it is optimized for smaller localized traversal queries, which have smaller memory requirements. The shortest path algorithms usually use full graph traversal queries, which heavily depend on the available memory. Our findings are that none of the recommendations for PostgreSQL had any noticeable impact for the pgrouting performance. Another thing that has to be noted is that Neo4j server communicates with clients via REST API over HTTP. This creates small overhead on performance but only to the delivery of the results from the server, not on the calculation of the shortest path. This overhead is minor comparing to the time for the calculation of the shortest path and was ignored in our test case. For every query (concurrent or not) that uses pgrouting function, PostgreSQL database will load the data into the memory, compute the shortest path and release data from memory. For example, if two queries are sent at the same time on the same dataset, every query will take its own data into memory and compute the shortest path, and release it after the computation. There is no memory sharing with pgrouting function and memory consumption will be doubled in that case. The number of possible concurrent queries on PostgreSQL depends only on the size of the data and available memory. Neo4j and pgrouting function have different approach in handling of loading data into the memory. Neo4j only loads data required for the current query and that data is shared between requests, if the data is still in the memory. As seen in the previous section, data used for our test are from the Austria OSM dataset. Tests with larger road networks, e.g. road network from Germany, the Netherlands or Italy OSM dataset led to the lack of memory. Another problem that was encountered during testing is infinite traverse loops on some OSM datasets, e.g. the Czech OSM dataset. This is due to the implementation of Dijkstra algorithm in Neo4j. Any position in the graph may be revisited. This is contrary to the default option for other Neo4j traversal queries, which is that no node in the entire graph may be visited more than once. On some specific graph layouts this creates infinite loops during Dijkstra shortest path calculation. Until the writing of this paper this was still default for the Dijkstra implementation in Neo4j. 5. CONCLUSION The graph database management systems are not routing engines and are not suitable for full graph traversal, which is used in the shortest path calculations. Their primary purpose is local graph traversal based on the property graph in cases of use such as social networks, fraud detection, recommendation engines, resource authorization, or computer network management. Although in most cases Neo4j outperforms pgrouting, the Neo4j greed for memory has to be considered. This is especially important for large transportation networks. If the memory is not an issue, then graph database is the right choice for the shortest path calculation. 8 Promet Traffic&Transportation, Vol. 26, 214, No. 1, 75-82
7 MARIO MILER, dipl. ing. Dr. sc. DAMIR MEDAK DRAŽEN ODOBAŠIĆ, dipl. ing. Geodetski fakultet, Sveučilište u Zagrebu Kačićeva 26, 1 Zagreb, Hrvatska SAŽETAK USPOREDBA UČINKOVITOSTI ALGORITMA NAJKRAĆEG PUTA U GRAF I RELACIJSKOJ BAZI PODATAKA NA PROMETNOJ MREŽI U području geoinformatike i prometnih znanosti, najkraći put izračunava se na graf podatkovnoj strukturi od kojih se uglavnom sastoje cestovne i prometne mreže. Ovi podaci se često pohranjuju u razne sustave baza podataka. Mnoge aplikacije koje koriste prometne mreže zahtijevaju izračun najkraćeg puta. Cilj ovog istraživanja je usporediti učinkovitost Dijkstra algoritma najkraćeg puta u PostgreSQL (koristeći pgrouting) i Neo4j graf baze u svrhu određivanja razlike u brzini izračuna najkraćeg puta. Ispitivanje je izvršeno na prosječnom računalu koristeći OpenStreetMap cestovnu mrežu. Unatoč tome što se Neo4j graf baza smatra pogodnom za izračun najkraćeg puta na prometnim mrežama, činjenica je da i takav način ima cijenu. Dokazano je da u Neo4j bazi, memorija računala može biti problem, posebno kod velikih prometnih mreža. KLJUČNE RIJEČI pgrouting, OpenStreetMap, Dijkstra, testiranje, Neo4j, PostgreSQL REFERENCES [1] Dreyfus, S.E.: An Appraisal of Some Shortest-Path Algorithms. Operations Research May 1;17(3): [2] Golden, B.L.: Shortest Path Algorithms: A Comparison. Massachusetts Institute of Technology, Operations Research Center; 1975 [3] Cherkassky, B.V., Goldberg, A.V., Radzik T.: Shortest paths algorithms: theory and experimental evaluation Jan 23; [4] Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms, Third Edition. BOOK. The MIT Press; 29. p [5] Zlatanova, S., Baharin, S.S.K.: Optimal Navigation of First Responders Using DBMS. 3 rd International Conference on Information Systems for Crisis Response and Management 4 th International Symposium on GeoInformation for Disaster Management. 28; [6] Tuot, C., Obradovic, D., Fichter, F., Dengel, A.: CRUV - A Collaborative Route-Planning System for Utility Vehicles. 21; [7] Innerebner, M., Böhlen, M., Gamper, J.: ISOGA: a system for geographical reachability analysis. Liang SHL, Wang X, Claramunt C, editors. Berlin, Heidelberg: Springer Berlin Heidelberg; 213. pp [8] Codd, E.F.: A relational model of data for large shared data banks MD computing computers in medical practice. ACM; 197;15(3): [9] Dominguez-Sal, D., Urbón-Bayes, P., Giménez-Vañó, A., Gómez-Villamor, S., Martinez-Bazán, N., Larriba- Pey, J.: Survey of graph database performance on the hpc scalable graph analysis benchmark. Web-Age Information Management. Springer; 21;37 48 [1] Stonebraker, M., Cetintemel, U.: One Size Fits All : An Idea Whose Time Has Come and Gone. 21 st International Conference on Data Engineering ICDE5. Ieee; 25;(Icde):2 11. [11] Strauch, C.: NoSQL Databases. Lecture Notes Stuttgart Media. Hochschule der Medien, Stutgart; 21;1 8. [12] Orend, K.: Analysis and Classification of NoSQL Databases and Evaluation of their Ability to Replace an Object-relational Persistence Layer. Architecture. Citeseer; 21. p. 1. [13] Tiwari, S.: Professional NoSQL. 1 edition. Wrox; 211. [14] Sadalage, P.J., Fowler, M.: NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence. 1 edition. Addison-Wesley Professional; 212. [15] Ciglan, M., Averbuch, A., Hluchy, L.: Benchmarking Traversal Operations over Graph Databases. 212 IEEE 28 th International Conference on Data Engineering Workshops. IEEE; 212. p [16] Angles, R., Gutierrez, C.: Survey of graph database models. ACM Computing Surveys. ACM; 28;4(1):1 39. [17] Euler, L.: Solutio problematis ad geometriam situs pertinentis. Commentarii Academiae Scientiarum Imperialis Petropolitanae. Nature Publishing Group; 1736;8: [18] Dominguez-Sal, D., Martinez-Bazan, N., Muntes-Mulero, V., Baleta, P., Larriba-Pay, J.L.: A discussion on the design of graph database benchmarks. 21 Sep 13;25 4. [19] Rodriguez, M.A., Neubauer, P.: Constructions from Dots and Lines. Science. American Society for Information Science and Technology; 21;X(X): [2] NeoTechnology. The Neo database - A Technology Introduction. 26; Available from: org/neo-technology-introduction.pdf [21] Seng, J-L., Yao, S.B., Hevner, A.R.: Requirements-driven database systems benchmark method. Decision Support Systems. 25 Jan 1;38(4): [22] TCP. Transaction Processing Performance Council [Internet]. 211 [cited 211 Sep 21]. Available from: [23] Ray, S., Simion, B., Demke, Brown A. Jackpine: A benchmark to evaluate spatial database performance. Data Engineering (ICDE), 211 IEEE 27th International Conference on. IEEE; 211. p [24] Bader, D.A., Feo, J., Gilbert, J., Kepner, J., Koester, D.: Hpc scalable graph analysis benchmark. Citeseer. Citeseer; 29;(HiPC 25):1 1. [25] Coast, S.: OpenStreetMap project [Internet] Available from: [26] Ramm, F., Topf, J.: Towards a New Data Model for OpenStreetMap [Internet]. 27 p Available from: [27] Moeller, C.: osm2po [Internet] Available from: Promet Traffic&Transportation, Vol. 26, 214, No. 1,
8 [28] NeoTechnology. The Neo4j Manual v1.7.2 [Internet]. NeoTechnology; 212. Available from: neo4j.org/chunked/1.7.2/ [29] Naphtali Rishe, A.V.: A Benchmarking Technique for DBMSs with Advanced Data Models. 82 Promet Traffic&Transportation, Vol. 26, 214, No. 1, 75-82
Benchmarking graph databases on the problem of community detection
Benchmarking graph databases on the problem of community detection Sotirios Beis, Symeon Papadopoulos, and Yiannis Kompatsiaris Information Technologies Institute, CERTH, 57001, Thermi, Greece {sotbeis,papadop,ikom}@iti.gr
InfiniteGraph: The Distributed Graph Database
A Performance and Distributed Performance Benchmark of InfiniteGraph and a Leading Open Source Graph Database Using Synthetic Data Objectivity, Inc. 640 West California Ave. Suite 240 Sunnyvale, CA 94086
Graph Database Proof of Concept Report
Objectivity, Inc. Graph Database Proof of Concept Report Managing The Internet of Things Table of Contents Executive Summary 3 Background 3 Proof of Concept 4 Dataset 4 Process 4 Query Catalog 4 Environment
NoSQL storage and management of geospatial data with emphasis on serving geospatial data using standard geospatial web services
NoSQL storage and management of geospatial data with emphasis on serving geospatial data using standard geospatial web services Pouria Amirian, Adam Winstanley, Anahid Basiri Department of Computer Science,
PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE
PERFORMANCE ANALYSIS OF KERNEL-BASED VIRTUAL MACHINE Sudha M 1, Harish G M 2, Nandan A 3, Usha J 4 1 Department of MCA, R V College of Engineering, Bangalore : 560059, India [email protected] 2 Department
Performance Modeling and Analysis of a Database Server with Write-Heavy Workload
Performance Modeling and Analysis of a Database Server with Write-Heavy Workload Manfred Dellkrantz, Maria Kihl 2, and Anders Robertsson Department of Automatic Control, Lund University 2 Department of
Review of Graph Databases for Big Data Dynamic Entity Scoring
Review of Graph Databases for Big Data Dynamic Entity Scoring M. X. Labute, M. J. Dombroski May 16, 2014 Disclaimer This document was prepared as an account of work sponsored by an agency of the United
Virtuoso and Database Scalability
Virtuoso and Database Scalability By Orri Erling Table of Contents Abstract Metrics Results Transaction Throughput Initializing 40 warehouses Serial Read Test Conditions Analysis Working Set Effect of
Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing. October 29th, 2015
E6893 Big Data Analytics Lecture 8: Spark Streams and Graph Computing (I) Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing
Microblogging Queries on Graph Databases: An Introspection
Microblogging Queries on Graph Databases: An Introspection ABSTRACT Oshini Goonetilleke RMIT University, Australia [email protected] Timos Sellis RMIT University, Australia [email protected]
Data sharing in the Big Data era
www.bsc.es Data sharing in the Big Data era Anna Queralt and Toni Cortes Storage System Research Group Introduction What ignited our research Different data models: persistent vs. non persistent New storage
! E6893 Big Data Analytics Lecture 9:! Linked Big Data Graph Computing (I)
! E6893 Big Data Analytics Lecture 9:! Linked Big Data Graph Computing (I) Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science Mgr., Dept. of Network Science and
Object Oriented Database Management System for Decision Support System.
International Refereed Journal of Engineering and Science (IRJES) ISSN (Online) 2319-183X, (Print) 2319-1821 Volume 3, Issue 6 (June 2014), PP.55-59 Object Oriented Database Management System for Decision
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform
On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform Page 1 of 16 Table of Contents Table of Contents... 2 Introduction... 3 NoSQL Databases... 3 CumuLogic NoSQL Database Service...
Benchmarking Hadoop & HBase on Violin
Technical White Paper Report Technical Report Benchmarking Hadoop & HBase on Violin Harnessing Big Data Analytics at the Speed of Memory Version 1.0 Abstract The purpose of benchmarking is to show advantages
NoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre
NoSQL systems: introduction and data models Riccardo Torlone Università Roma Tre Why NoSQL? In the last thirty years relational databases have been the default choice for serious data storage. An architect
Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores
Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores Composite Software October 2010 TABLE OF CONTENTS INTRODUCTION... 3 BUSINESS AND IT DRIVERS... 4 NOSQL DATA STORES LANDSCAPE...
NoSQL Performance Test In-Memory Performance Comparison of SequoiaDB, Cassandra, and MongoDB
bankmark UG (haftungsbeschränkt) Bahnhofstraße 1 9432 Passau Germany www.bankmark.de [email protected] T +49 851 25 49 49 F +49 851 25 49 499 NoSQL Performance Test In-Memory Performance Comparison of SequoiaDB,
How to Choose Between Hadoop, NoSQL and RDBMS
How to Choose Between Hadoop, NoSQL and RDBMS Keywords: Jean-Pierre Dijcks Oracle Redwood City, CA, USA Big Data, Hadoop, NoSQL Database, Relational Database, SQL, Security, Performance Introduction A
Performance Workload Design
Performance Workload Design The goal of this paper is to show the basic principles involved in designing a workload for performance and scalability testing. We will understand how to achieve these principles
Performance Analysis of Web based Applications on Single and Multi Core Servers
Performance Analysis of Web based Applications on Single and Multi Core Servers Gitika Khare, Diptikant Pathy, Alpana Rajan, Alok Jain, Anil Rawat Raja Ramanna Centre for Advanced Technology Department
How graph databases started the multi-model revolution
How graph databases started the multi-model revolution Luca Garulli Author and CEO @OrientDB QCon Sao Paulo - March 26, 2015 Welcome to Big Data 90% of the data in the world today has been created in the
Delivering Quality in Software Performance and Scalability Testing
Delivering Quality in Software Performance and Scalability Testing Abstract Khun Ban, Robert Scott, Kingsum Chow, and Huijun Yan Software and Services Group, Intel Corporation {khun.ban, robert.l.scott,
Cognos8 Deployment Best Practices for Performance/Scalability. Barnaby Cole Practice Lead, Technical Services
Cognos8 Deployment Best Practices for Performance/Scalability Barnaby Cole Practice Lead, Technical Services Agenda > Cognos 8 Architecture Overview > Cognos 8 Components > Load Balancing > Deployment
Geodatabase Programming with SQL
DevSummit DC February 11, 2015 Washington, DC Geodatabase Programming with SQL Craig Gillgrass Assumptions Basic knowledge of SQL and relational databases Basic knowledge of the Geodatabase We ll hold
Oracle Database 10g: Building GIS Applications Using the Oracle Spatial Network Data Model. An Oracle Technical White Paper May 2005
Oracle Database 10g: Building GIS Applications Using the Oracle Spatial Network Data Model An Oracle Technical White Paper May 2005 Building GIS Applications Using the Oracle Spatial Network Data Model
Scalability Factors of JMeter In Performance Testing Projects
Scalability Factors of JMeter In Performance Testing Projects Title Scalability Factors for JMeter In Performance Testing Projects Conference STEP-IN Conference Performance Testing 2008, PUNE Author(s)
How To Store Data On An Ocora Nosql Database On A Flash Memory Device On A Microsoft Flash Memory 2 (Iomemory)
WHITE PAPER Oracle NoSQL Database and SanDisk Offer Cost-Effective Extreme Performance for Big Data 951 SanDisk Drive, Milpitas, CA 95035 www.sandisk.com Table of Contents Abstract... 3 What Is Big Data?...
This guide specifies the required and supported system elements for the application.
System Requirements Contents System Requirements... 2 Supported Operating Systems and Databases...2 Features with Additional Software Requirements... 2 Hardware Requirements... 4 Database Prerequisites...
MID-TIER DEPLOYMENT KB
MID-TIER DEPLOYMENT KB Author: BMC Software, Inc. Date: 23 Dec 2011 PAGE 1 OF 16 23/12/2011 Table of Contents 1. Overview 3 2. Sizing guidelines 3 3. Virtual Environment Notes 4 4. Physical Environment
Data Store Interface Design and Implementation
WDS'07 Proceedings of Contributed Papers, Part I, 110 115, 2007. ISBN 978-80-7378-023-4 MATFYZPRESS Web Storage Interface J. Tykal Charles University, Faculty of Mathematics and Physics, Prague, Czech
GRAPH DATABASE SYSTEMS. h_da Prof. Dr. Uta Störl Big Data Technologies: Graph Database Systems - SoSe 2016 1
GRAPH DATABASE SYSTEMS h_da Prof. Dr. Uta Störl Big Data Technologies: Graph Database Systems - SoSe 2016 1 Use Case: Route Finding Source: Neo Technology, Inc. h_da Prof. Dr. Uta Störl Big Data Technologies:
How To Improve Performance In A Database
Some issues on Conceptual Modeling and NoSQL/Big Data Tok Wang Ling National University of Singapore 1 Database Models File system - field, record, fixed length record Hierarchical Model (IMS) - fixed
How To Test For Performance And Scalability On A Server With A Multi-Core Computer (For A Large Server)
Scalability Results Select the right hardware configuration for your organization to optimize performance Table of Contents Introduction... 1 Scalability... 2 Definition... 2 CPU and Memory Usage... 2
System Requirements Table of contents
Table of contents 1 Introduction... 2 2 Knoa Agent... 2 2.1 System Requirements...2 2.2 Environment Requirements...4 3 Knoa Server Architecture...4 3.1 Knoa Server Components... 4 3.2 Server Hardware Setup...5
INTRODUCTION TO CASSANDRA
INTRODUCTION TO CASSANDRA This ebook provides a high level overview of Cassandra and describes some of its key strengths and applications. WHAT IS CASSANDRA? Apache Cassandra is a high performance, open
An empirical comparison of graph databases
An empirical comparison of graph databases Salim Jouili Eura Nova R&D 1435 Mont-Saint-Guibert, Belgium Email: [email protected] Valentin Vansteenberghe Universite Catholique de Louvain 1348 Louvain-La-Neuve,
Benchmarking Cassandra on Violin
Technical White Paper Report Technical Report Benchmarking Cassandra on Violin Accelerating Cassandra Performance and Reducing Read Latency With Violin Memory Flash-based Storage Arrays Version 1.0 Abstract
Introduction: Database management system
Introduction Databases vs. files Basic concepts Brief history of databases Architectures & languages Introduction: Database management system User / Programmer Database System Application program Software
Understanding Neo4j Scalability
Understanding Neo4j Scalability David Montag January 2013 Understanding Neo4j Scalability Scalability means different things to different people. Common traits associated include: 1. Redundancy in the
Online Remote Data Backup for iscsi-based Storage Systems
Online Remote Data Backup for iscsi-based Storage Systems Dan Zhou, Li Ou, Xubin (Ben) He Department of Electrical and Computer Engineering Tennessee Technological University Cookeville, TN 38505, USA
QuickDB Yet YetAnother Database Management System?
QuickDB Yet YetAnother Database Management System? Radim Bača, Peter Chovanec, Michal Krátký, and Petr Lukáš Radim Bača, Peter Chovanec, Michal Krátký, and Petr Lukáš Department of Computer Science, FEECS,
Report Paper: MatLab/Database Connectivity
Report Paper: MatLab/Database Connectivity Samuel Moyle March 2003 Experiment Introduction This experiment was run following a visit to the University of Queensland, where a simulation engine has been
ABSTRACT 1. INTRODUCTION. Kamil Bajda-Pawlikowski [email protected]
Kamil Bajda-Pawlikowski [email protected] Querying RDF data stored in DBMS: SPARQL to SQL Conversion Yale University technical report #1409 ABSTRACT This paper discusses the design and implementation
A Practical Approach to Process Streaming Data using Graph Database
A Practical Approach to Process Streaming Data using Graph Database Mukul Sharma Research Scholar Department of Computer Science & Engineering SBCET, Jaipur, Rajasthan, India ABSTRACT In today s information
Enterprise Edition Scalability. ecommerce Framework Built to Scale Reading Time: 10 minutes
Enterprise Edition Scalability ecommerce Framework Built to Scale Reading Time: 10 minutes Broadleaf Commerce Scalability About the Broadleaf Commerce Framework Test Methodology Test Results Test 1: High
Cache Configuration Reference
Sitecore CMS 6.2 Cache Configuration Reference Rev: 2009-11-20 Sitecore CMS 6.2 Cache Configuration Reference Tips and Techniques for Administrators and Developers Table of Contents Chapter 1 Introduction...
How To Test For Elulla
EQUELLA Whitepaper Performance Testing Carl Hoffmann Senior Technical Consultant Contents 1 EQUELLA Performance Testing 3 1.1 Introduction 3 1.2 Overview of performance testing 3 2 Why do performance testing?
Comparing SQL and NOSQL databases
COSC 6397 Big Data Analytics Data Formats (II) HBase Edgar Gabriel Spring 2015 Comparing SQL and NOSQL databases Types Development History Data Storage Model SQL One type (SQL database) with minor variations
Introduction. Introduction: Database management system. Introduction: DBS concepts & architecture. Introduction: DBS versus File system
Introduction: management system Introduction s vs. files Basic concepts Brief history of databases Architectures & languages System User / Programmer Application program Software to process queries Software
A Middleware Strategy to Survive Compute Peak Loads in Cloud
A Middleware Strategy to Survive Compute Peak Loads in Cloud Sasko Ristov Ss. Cyril and Methodius University Faculty of Information Sciences and Computer Engineering Skopje, Macedonia Email: [email protected]
Big Data Analytics. Rasoul Karimi
Big Data Analytics Rasoul Karimi Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany Big Data Analytics Big Data Analytics 1 / 1 Introduction
Building a Scalable News Feed Web Service in Clojure
Building a Scalable News Feed Web Service in Clojure This is a good time to be in software. The Internet has made communications between computers and people extremely affordable, even at scale. Cloud
Tableau Server 7.0 scalability
Tableau Server 7.0 scalability February 2012 p2 Executive summary In January 2012, we performed scalability tests on Tableau Server to help our customers plan for large deployments. We tested three different
The State of Cloud Storage
205 Industry Report A Benchmark Comparison of Speed, Availability and Scalability Executive Summary Both 203 and 204 were record-setting years for adoption of cloud services in the enterprise. More than
Oracle8i Spatial: Experiences with Extensible Databases
Oracle8i Spatial: Experiences with Extensible Databases Siva Ravada and Jayant Sharma Spatial Products Division Oracle Corporation One Oracle Drive Nashua NH-03062 {sravada,jsharma}@us.oracle.com 1 Introduction
Windows Server Performance Monitoring
Spot server problems before they are noticed The system s really slow today! How often have you heard that? Finding the solution isn t so easy. The obvious questions to ask are why is it running slowly
1 File Processing Systems
COMP 378 Database Systems Notes for Chapter 1 of Database System Concepts Introduction A database management system (DBMS) is a collection of data and an integrated set of programs that access that data.
Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage
White Paper Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage A Benchmark Report August 211 Background Objectivity/DB uses a powerful distributed processing architecture to manage
Tuning WebSphere Application Server ND 7.0. Royal Cyber Inc.
Tuning WebSphere Application Server ND 7.0 Royal Cyber Inc. JVM related problems Application server stops responding Server crash Hung process Out of memory condition Performance degradation Check if the
Agility Database Scalability Testing
Agility Database Scalability Testing V1.6 November 11, 2012 Prepared by on behalf of Table of Contents 1 Introduction... 4 1.1 Brief... 4 2 Scope... 5 3 Test Approach... 6 4 Test environment setup... 7
AgencyPortal v5.1 Performance Test Summary Table of Contents
AgencyPortal v5.1 Performance Test Summary Table of Contents 1. Testing Approach 2 2. Server Profiles 3 3. Software Profiles 3 4. Server Benchmark Summary 4 4.1 Account Template 4 4.1.1 Response Time 4
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
MADOCA II Data Logging System Using NoSQL Database for SPring-8
MADOCA II Data Logging System Using NoSQL Database for SPring-8 A.Yamashita and M.Kago SPring-8/JASRI, Japan NoSQL WED3O03 OR: How I Learned to Stop Worrying and Love Cassandra Outline SPring-8 logging
How to Design and Create Your Own Custom Ext Rep
Combinatorial Block Designs 2009-04-15 Outline Project Intro External Representation Design Database System Deployment System Overview Conclusions 1. Since the project is a specific application in Combinatorial
Evaluating HDFS I/O Performance on Virtualized Systems
Evaluating HDFS I/O Performance on Virtualized Systems Xin Tang [email protected] University of Wisconsin-Madison Department of Computer Sciences Abstract Hadoop as a Service (HaaS) has received increasing
How To Write A Database Program
SQL, NoSQL, and Next Generation DBMSs Shahram Ghandeharizadeh Director of the USC Database Lab Outline A brief history of DBMSs. OSs SQL NoSQL 1960/70 1980+ 2000+ Before Computers Database DBMS/Data Store
MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM?
MEASURING WORKLOAD PERFORMANCE IS THE INFRASTRUCTURE A PROBLEM? Ashutosh Shinde Performance Architect [email protected] Validating if the workload generated by the load generating tools is applied
Liferay Portal Performance. Benchmark Study of Liferay Portal Enterprise Edition
Liferay Portal Performance Benchmark Study of Liferay Portal Enterprise Edition Table of Contents Executive Summary... 3 Test Scenarios... 4 Benchmark Configuration and Methodology... 5 Environment Configuration...
MOBILE GEOGRAPHIC INFORMATION SYSTEMS: A CASE STUDY ON MANSOURA UNIVERSITY, EGYPT
MOBILE GEOGRAPHIC INFORMATION SYSTEMS: A CASE STUDY ON MANSOURA UNIVERSITY, EGYPT Asmaa Ahmed Hussein 1, Elkhedr Hassan Eibrahim 2, Aziza Asem 1 1 Faculty of Computer Sciences and information systems,
Performance Comparison of Persistence Frameworks
Performance Comparison of Persistence Frameworks Sabu M. Thampi * Asst. Prof., Department of CSE L.B.S College of Engineering Kasaragod-671542 Kerala, India [email protected] Ashwin A.K S8, Department
11.1 inspectit. 11.1. inspectit
11.1. inspectit Figure 11.1. Overview on the inspectit components [Siegl and Bouillet 2011] 11.1 inspectit The inspectit monitoring tool (website: http://www.inspectit.eu/) has been developed by NovaTec.
An Oracle White Paper July 2011. Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide
Oracle Primavera Contract Management, Business Intelligence Publisher Edition-Sizing Guide An Oracle White Paper July 2011 1 Disclaimer The following is intended to outline our general product direction.
Using Synology SSD Technology to Enhance System Performance Synology Inc.
Using Synology SSD Technology to Enhance System Performance Synology Inc. Synology_SSD_Cache_WP_ 20140512 Table of Contents Chapter 1: Enterprise Challenges and SSD Cache as Solution Enterprise Challenges...
HPC performance applications on Virtual Clusters
Panagiotis Kritikakos EPCC, School of Physics & Astronomy, University of Edinburgh, Scotland - UK [email protected] 4 th IC-SCCE, Athens 7 th July 2010 This work investigates the performance of (Java)
Why NoSQL? Your database options in the new non- relational world. 2015 IBM Cloudant 1
Why NoSQL? Your database options in the new non- relational world 2015 IBM Cloudant 1 Table of Contents New types of apps are generating new types of data... 3 A brief history on NoSQL... 3 NoSQL s roots
Performance brief for IBM WebSphere Application Server 7.0 with VMware ESX 4.0 on HP ProLiant DL380 G6 server
Performance brief for IBM WebSphere Application Server.0 with VMware ESX.0 on HP ProLiant DL0 G server Table of contents Executive summary... WebSphere test configuration... Server information... WebSphere
Achieving Real-Time Business Solutions Using Graph Database Technology and High Performance Networks
WHITE PAPER July 2014 Achieving Real-Time Business Solutions Using Graph Database Technology and High Performance Networks Contents Executive Summary...2 Background...3 InfiniteGraph...3 High Performance
JBoss Data Grid Performance Study Comparing Java HotSpot to Azul Zing
JBoss Data Grid Performance Study Comparing Java HotSpot to Azul Zing January 2014 Legal Notices JBoss, Red Hat and their respective logos are trademarks or registered trademarks of Red Hat, Inc. Azul
APPENDIX 1 USER LEVEL IMPLEMENTATION OF PPATPAN IN LINUX SYSTEM
152 APPENDIX 1 USER LEVEL IMPLEMENTATION OF PPATPAN IN LINUX SYSTEM A1.1 INTRODUCTION PPATPAN is implemented in a test bed with five Linux system arranged in a multihop topology. The system is implemented
Microsoft Windows Server 2003 with Internet Information Services (IIS) 6.0 vs. Linux Competitive Web Server Performance Comparison
April 23 11 Aviation Parkway, Suite 4 Morrisville, NC 2756 919-38-28 Fax 919-38-2899 32 B Lakeside Drive Foster City, CA 9444 65-513-8 Fax 65-513-899 www.veritest.com [email protected] Microsoft Windows
Flauncher and DVMS Deploying and Scheduling Thousands of Virtual Machines on Hundreds of Nodes Distributed Geographically
Flauncher and Deploying and Scheduling Thousands of Virtual Machines on Hundreds of Nodes Distributed Geographically Daniel Balouek, Adrien Lèbre, Flavien Quesnel To cite this version: Daniel Balouek,
Various Load Testing Tools
Various Load Testing Tools Animesh Das May 23, 2014 Animesh Das () Various Load Testing Tools May 23, 2014 1 / 39 Outline 3 Open Source Tools 1 Load Testing 2 Tools available for Load Testing 4 Proprietary
Optimising the Mapnik Rendering Toolchain
Optimising the Mapnik Rendering Toolchain 2.0 Frederik Ramm [email protected] stopwatch CC-BY maedli @ flickr Basic Setup Hetzner dedicated server (US$ 150/mo) Ubuntu Linux Mapnik 2.1 pbf planet file
How To Scale Out Of A Nosql Database
Firebird meets NoSQL (Apache HBase) Case Study Firebird Conference 2011 Luxembourg 25.11.2011 26.11.2011 Thomas Steinmaurer DI +43 7236 3343 896 [email protected] www.scch.at Michael Zwick DI
Comparison of Windows IaaS Environments
Comparison of Windows IaaS Environments Comparison of Amazon Web Services, Expedient, Microsoft, and Rackspace Public Clouds January 5, 215 TABLE OF CONTENTS Executive Summary 2 vcpu Performance Summary
PARALLELS CLOUD SERVER
PARALLELS CLOUD SERVER Performance and Scalability 1 Table of Contents Executive Summary... Error! Bookmark not defined. LAMP Stack Performance Evaluation... Error! Bookmark not defined. Background...
Rackspace Cloud Databases and Container-based Virtualization
Rackspace Cloud Databases and Container-based Virtualization August 2012 J.R. Arredondo @jrarredondo Page 1 of 6 INTRODUCTION When Rackspace set out to build the Cloud Databases product, we asked many
SIGMOD RWE Review Towards Proximity Pattern Mining in Large Graphs
SIGMOD RWE Review Towards Proximity Pattern Mining in Large Graphs Fabian Hueske, TU Berlin June 26, 21 1 Review This document is a review report on the paper Towards Proximity Pattern Mining in Large
Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database
Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica
XTM Web 2.0 Enterprise Architecture Hardware Implementation Guidelines. A.Zydroń 18 April 2009. Page 1 of 12
XTM Web 2.0 Enterprise Architecture Hardware Implementation Guidelines A.Zydroń 18 April 2009 Page 1 of 12 1. Introduction...3 2. XTM Database...4 3. JVM and Tomcat considerations...5 4. XTM Engine...5
Chapter 1 - Web Server Management and Cluster Topology
Objectives At the end of this chapter, participants will be able to understand: Web server management options provided by Network Deployment Clustered Application Servers Cluster creation and management
Talend Real-Time Big Data Sandbox. Big Data Insights Cookbook
Talend Real-Time Big Data Talend Real-Time Big Data Overview of Real-time Big Data Pre-requisites to run Setup & Talend License Talend Real-Time Big Data Big Data Setup & About this cookbook What is the
MEAP Edition Manning Early Access Program Neo4j in Action MEAP version 3
MEAP Edition Manning Early Access Program Neo4j in Action MEAP version 3 Copyright 2012 Manning Publications For more information on this and other Manning titles go to www.manning.com brief contents PART
How To Use Arcgis For Free On A Gdb 2.2.2 (For A Gis Server) For A Small Business
Esri Middle East and Africa User Conference December 10 12 Abu Dhabi, UAE Understanding ArcGIS in Virtualization and Cloud Environments Marwa Mabrouk Powerful GIS capabilities Delivered as Web services
Pivot3 Reference Architecture for VMware View Version 1.03
Pivot3 Reference Architecture for VMware View Version 1.03 January 2012 Table of Contents Test and Document History... 2 Test Goals... 3 Reference Architecture Design... 4 Design Overview... 4 The Pivot3
An Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database
An Oracle White Paper June 2012 High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database Executive Overview... 1 Introduction... 1 Oracle Loader for Hadoop... 2 Oracle Direct
MAGENTO HOSTING Progressive Server Performance Improvements
MAGENTO HOSTING Progressive Server Performance Improvements Simple Helix, LLC 4092 Memorial Parkway Ste 202 Huntsville, AL 35802 [email protected] 1.866.963.0424 www.simplehelix.com 2 Table of Contents
