Mission-Critical Database with Real-Time Search for Big Data February 17, 2012 Slide 1
Overview
- About MarkLogic
- Why MarkLogic
- Case Studies
- Technology and Features
Slide 2
About MarkLogic
- 10 years in business specializing in Big Data / unstructured data
- 50% revenue growth YoY
- 300+ customers, 500+ successful deployments
  - DoD, IC, State, Financial Services, Media
- Offices: Silicon Valley, D.C., New York, Austin, London, Frankfurt, Tokyo
- Public Sector office in McLean, VA with 60 employees
  - Top Secret Facility Clearance
  - TS/SCI/LSPoly cleared personnel
  - Products deployed on NIPR, SIPR, JWICS
Slide 3
300+ Customers, 500+ deployments Government Customers Enterprise Customers Financial Services and Other Customers Slide 4
Why MarkLogic
The Only Mission-Critical Database with Real-Time Search for Big Data
- Extremely scalable database (> PB)
- Ingest any type of data (structured, semi-structured or unstructured)
- Real-time search (< 1 second)
- Durable data access (ACID-compliant)
- Highly secure (DCID 6/3 Protection Level 3)
- Disaster recovery with multi-site data replication
- Very easy to operate and maintain (fully web-enabled)
- Low sustainment cost (< 1 DBA)
- SOA that runs in the Cloud on COTS servers
Slide 5
What Does MarkLogic Do?
- Collect information in centralized or distributed repository
  - Load information as is and throw away nothing
- Process and search information with pinpoint accuracy
  - Extensive full-text, structured, geospatial, and real-time search features
- Exploit and analyze what you have collected
  - Built-in indexes, entity enrichment, alerts
- Disseminate content through multiple contexts
  - Send content to multiple devices and users
Slide 6
Case Studies Slide 7
National Senior Leadership Decision Support Service
Eliminate a recurrence of the 9/11 fog of situational awareness
Discover and fuse large amounts of information residing in multiple sources and in varying formats so that senior leaders have situational awareness based on actionable intelligence.
Slide 8
Rapidly collect, process, exploit and disseminate CELLEX and HUMINT data so that soldiers can quickly identify persons of interest at various human checkpoints.
CURRENTLY INGESTED
Ingest: CELLEX (XLS, XML, HTM); TIR (DOC, DOCX)
CELLEX (Cellebrite, XRY, Athena, CellDek)
Tactical user
Biometrics
TIRs
HUMINT (IIRs, TDs)
FUTURE OF EXPLOITATION
Video & Pictures
CORAL REEF Web Application: User Entry Point, Upload Data, Feedback, Query
CORAL REEF Processing Layer: Clean, Normalize, Analytics, Match, Store, Share
CORAL REEF Content Server: Store, Query, Replicate
Real-time alerting, search (selector, free text, geospatial, temporal, nodal), trend analysis with assortment of various widgets, integrated support for outside databases, exploit media
Strategic User: Capable of searching hundreds of selectors at once, search by nodal analysis, see subordinate-unit user analytics, view trends by larger geographic areas, generate automated RSS feed reports
Data Consumer: GPS, DNI; easy-access web service to download new reports & generated reporting with RSS feed
Slide 9
NCES Enterprise Catalog
Challenges
- Discover and share information stored in the massive collections of documents on the Intelligence Community's classified intranet
Solution
- Metadata catalog that supports Net-centric information discovery
- Supports multiple DDMS schemas
- Dynamic faceted navigation of results
- Web-based UI and Web services
Benefits
- 100x faster ingestion of DDMS information than the existing RDBMS infrastructure
- Able to reach original vision for enterprise catalog project
- Fully leverage DDMS metadata (geospatial, temporal, etc.)
- Reduced application complexity and improved net-centric services
Slide 10
Slide 11
MarkLogic Technology and Features Slide 12
MarkLogic Products Search and discovery Open source intelligence Metadata catalogs Your application Application Services Development and Operations Connectors and Toolkits Slide 13
MarkLogic Server Application Services Development and Operations Connectors and Toolkits Slide 14
MarkLogic Server Unique Architecture
All-in-one: DBMS, Search Engine and Web App
All in one single process: Web Application Server, DBMS, Search Engine
Slide 15
Key Functionality for High Speed Search
- Geospatial: Built-in support for geospatial data
- Alerting: High speed notification of new and updated information
- Faceted navigation: Interactive analysis and navigation
- Entity enrichment: Identify people, places & things
Slide 16
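The faceted navigation item above can be illustrated with a minimal sketch. This is plain Python, not MarkLogic code or its API: the sample documents, the field names (`type`, `country`) and the helper names `search` and `facet_counts` are all hypothetical. It only shows the general idea of counting facet values over a result set and then narrowing by a chosen value.

```python
from collections import Counter

# Hypothetical sample documents; field names are illustrative only.
DOCS = [
    {"id": 1, "type": "report", "country": "US", "text": "bridge inspection report"},
    {"id": 2, "type": "report", "country": "DE", "text": "bridge traffic report"},
    {"id": 3, "type": "memo", "country": "US", "text": "budget memo"},
]


def search(docs, term, filters=None):
    """Return docs whose text contains `term` and that match every facet filter."""
    filters = filters or {}
    return [
        d for d in docs
        if term in d["text"].split()
        and all(d.get(field) == value for field, value in filters.items())
    ]


def facet_counts(results, fields):
    """Count the distinct values of each facet field across the result set."""
    return {f: Counter(d[f] for d in results if f in d) for f in fields}


hits = search(DOCS, "bridge")
facets = facet_counts(hits, ["type", "country"])

# Selecting a facet value narrows the result set.
narrowed = search(DOCS, "bridge", {"country": "US"})
```

A production system would compute these counts from indexes rather than by scanning documents; the scan here just keeps the sketch short.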
MarkLogic - Inside
- Application Server: HTTP, Java/C#, WS*
- Database: XML, Text, Binary
- Indexes: Full Text, XML, Reverse, Scalar, Geospatial
- Content Processing Framework
- Backup, Restore, Failover, Clustering
MarkLogic Server
Slide 17
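The slide lists reverse indexes alongside alerting. One common reading of that pairing is that stored queries are indexed so an incoming document can be matched against them quickly. The sketch below illustrates that general technique in plain Python; it is an assumption about the idea, not a description of MarkLogic internals, and `AlertRegistry`, `register` and `match` are hypothetical names.

```python
from collections import defaultdict


class AlertRegistry:
    """Indexes stored alert rules by term so new documents find matching rules."""

    def __init__(self):
        self._rules = {}                   # rule_id -> set of required terms
        self._by_term = defaultdict(set)   # term -> rule_ids that mention it

    def register(self, rule_id, terms):
        """Store a rule that fires when a document contains all `terms`."""
        required = {t.lower() for t in terms}
        self._rules[rule_id] = required
        for term in required:
            self._by_term[term].add(rule_id)

    def match(self, text):
        """Return the ids of rules whose required terms all occur in `text`."""
        tokens = set(text.lower().split())
        # Only rules sharing at least one term with the document are candidates.
        candidates = set()
        for token in tokens:
            candidates |= self._by_term.get(token, set())
        return sorted(r for r in candidates if self._rules[r] <= tokens)


registry = AlertRegistry()
registry.register("flood-watch", ["flood", "warning"])
registry.register("outage", ["power", "outage"])

fired = registry.match("Flood warning issued for the river valley")
not_fired = registry.match("power restored")
```

The point of the term-to-rule map is that matching cost depends on the document's terms, not on the total number of stored rules.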
Scaling: Shared Nothing Architecture
- Load Balancer
- Evaluator Nodes (four shown)
- Data Nodes (four shown)
- Data Storage (one per Data Node)
Slide 18
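The evaluator-node and data-node layout above can be sketched as a scatter-gather search over partitioned data. This is a simplified illustration in plain Python under stated assumptions: hash-based placement, term-count scoring, and the class names `DataNode` and `Evaluator` are all hypothetical and are not taken from the slide or from MarkLogic.

```python
import hashlib
import heapq


class DataNode:
    """Owns one partition of the documents and searches only its own data."""

    def __init__(self):
        self.docs = {}

    def put(self, doc_id, text):
        self.docs[doc_id] = text

    def search(self, term, limit):
        # Score is the number of times the term occurs in the document.
        scored = [
            (text.lower().split().count(term), doc_id)
            for doc_id, text in self.docs.items()
        ]
        return heapq.nlargest(limit, [hit for hit in scored if hit[0] > 0])


class Evaluator:
    """Routes writes to one data node and fans searches out to all of them."""

    def __init__(self, nodes):
        self.nodes = nodes

    def _owner(self, doc_id):
        # Assumed placement rule: a stable hash of the id picks the node.
        digest = hashlib.sha256(doc_id.encode("utf-8")).digest()
        return self.nodes[int.from_bytes(digest[:8], "big") % len(self.nodes)]

    def insert(self, doc_id, text):
        self._owner(doc_id).put(doc_id, text)

    def search(self, term, limit=10):
        # Scatter the query, then merge the per-node top results.
        partials = [
            hit for node in self.nodes for hit in node.search(term.lower(), limit)
        ]
        return [doc_id for _score, doc_id in heapq.nlargest(limit, partials)]


nodes = [DataNode() for _ in range(4)]
evaluator = Evaluator(nodes)
evaluator.insert("a", "alpha beta")
evaluator.insert("b", "beta beta gamma")
evaluator.insert("c", "gamma")

ranked = evaluator.search("beta")
```

Because each data node holds and searches only its own partition, adding nodes adds both storage and query capacity, which is the usual motivation for a shared-nothing design.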
Application Services Application Services Development and Operations Connectors and Toolkits Slide 19
MarkLogic Application Services
Design, develop, and deploy your information applications quickly and easily
- Application Builder
- Search API
- Library Services API
Application Builder
- Accelerated app dev
- Easy to use browser-based development
- Prototype in minutes without writing code
- Jump start on application creation
- Focus on the user experience
Slide 20
Powerful, High-level APIs
Library Services API
- Check in & check out documents for exclusive edits
- Automatically version content as it is changed
- Rollback to previous versions
- Snapshot versions for search, reuse, and delivery
Search API
- Extensible Google-like search grammar
- Declarative search constraints and facets
- Customizable snippeting
- Fast sorting and pagination
- Type-ahead suggestions based on content, constraints
Slide 21
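The check-out, versioning and rollback behavior listed for the Library Services API can be pictured with a small model. The sketch below is plain Python and does not use or reproduce the actual Library Services API; `ManagedDocument`, `CheckoutError` and the method names are hypothetical, and treating a rollback as a new version is an assumption made here to keep history intact.

```python
class CheckoutError(Exception):
    """Raised when a check-out or check-in conflicts with the current lock."""


class ManagedDocument:
    """Keeps every version of a document and allows one editor at a time."""

    def __init__(self, content):
        self._versions = [content]   # version n is stored at index n - 1
        self._locked_by = None

    @property
    def content(self):
        return self._versions[-1]

    @property
    def version(self):
        return len(self._versions)

    def checkout(self, user):
        """Take the exclusive edit lock, failing if someone else holds it."""
        if self._locked_by is not None and self._locked_by != user:
            raise CheckoutError(f"already checked out by {self._locked_by}")
        self._locked_by = user

    def checkin(self, user, new_content):
        """Save a new version and release the lock; returns the version number."""
        if self._locked_by != user:
            raise CheckoutError(f"{user} does not hold the check-out")
        self._versions.append(new_content)
        self._locked_by = None
        return self.version

    def rollback(self, version):
        """Restore an earlier version by recording it as the newest version."""
        if not 1 <= version <= len(self._versions):
            raise ValueError(f"no such version: {version}")
        self._versions.append(self._versions[version - 1])
        return self.version


doc = ManagedDocument("draft")
doc.checkout("alice")
new_version = doc.checkin("alice", "final")
```

Calling `doc.checkout("bob")` while alice still holds the lock would raise `CheckoutError`, which models the exclusive-edit guarantee.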
Connectors and Toolkits Application Services Development and Operations Connectors and Toolkits Slide 22
Connectors & Toolkits
Connectors enable easy integration with enterprise applications
- Support for industry standard technologies: Java / .NET / XML & XQuery
Connector for SharePoint
- Seamless connectivity to SharePoint
- Add dynamic delivery to SharePoint
- Direct workflow integration
Toolkits for Word, Excel and PowerPoint
- Find information quickly
- Reuse, not re-invent or re-write
- Extend the current infrastructure for a true information infrastructure
SharePoint Copy/Delete
Slide 23
MarkLogic Connector for Hadoop Slide 24
Development and Operations Application Services Development and Operations Connectors and Toolkits Slide 25
Development Slide 26
System Monitoring
- Built-in System Monitoring Console
- Plugins for Nagios (Open Source)
- HP Operations Manager
Slide 27
Contacts
Doug Grover, USAF Account Manager
(703) 408-7540
douglas.grover@marklogic.com
Ken McCaleb, Senior Systems Engineer
ken.mccaleb@marklogic.com
MarkLogic Corporation DC
7950 Jones Branch Drive, Suite 200
McLean, VA 22107
+1 703.854.8500
www.marklogic.com
Slide 28