How RAI's Hyper Media News aggregation system keeps staff on top of the news
|
|
|
- Hilary Townsend
- 10 years ago
- Views:
Transcription
1 How RAI's Hyper Media News aggregation system keeps staff on top of the news 13 th Libre Software Meeting Media, Radio, Television and Professional Graphics Geneva - Switzerland, 10 th July 2012 Maurizio Montagnuolo
2 Agenda Company presentation Motivations Foundations Audiovisual content processing for TV streams analysis Natural Language Processing (NLP) for text analysis Full text search and retrieval Use case implementation The RAI Automatic Newscast Transcription System (ANTS) The RAI Hyper Media News aggregator The RAI Interactive Newsbook Conclusions and future outlook 2
3 The RAI broadcasts - a short history National Radio broadcasts since early 30 (EIAR) 1950: Radio3 was born 2012: 10 Radio channels National TV broadcasts since : Rai2 was born 1977: Color introduced 1979: Rai3 was born with regional transmissions 1990: First analog satellite transmissions 2005: Youtube official channel 2008: DTT introduced (8 channels) 2012: 14 SD + 1 HD DTT channels 3
4 The RAI s archives TV About 450,000 hrs 320,000 hrs of programming 145,000 hrs: News and Sport 175,000 hrs: Programmes (Entertainment, folk and classic music, theatre,...) 130,000 hrs of Fiction 50,000 hrs: Commercial Films 80,000 hrs: Fiction (TV Series, TV films, Soap operas,...) RADIO About 1,000,000 hrs recorded on a wide variety of media PAPER LIBRARY 80,000 scripts in Rome 15,000 scripts in Firenze Evaluation of further RAI archives planned in short time IMAGE LIBRARY 360,000 Photos RAI 950,000 Photos ex-eri, currently RAI TRADE 4
5 The RAI CRIT The Centre for Research and Technological Innovation (CRIT) is responsible for defining, promoting and developing all aspects of research and innovation in the television industry The Centre is active in many EU projects, and collaborates with universities and industries for supporting Master and PhD thesis, as well as developing new standards and services 5
6 Motivations 6
7 Why content analysis tools in the media industry? Digital switch over introduces more channels More content items produced/published Cross media production (web, TV,...) Reuse material in many different ways Improvements in infrastructure (IT) Better content accessibility Recovery of Cultural Heritage Archive digitisation and annotation Budget limitations Archivist/documentalist staff not increasing CUMULATIVE EFFECT: a lot more digital items to be managed by the same staff, and in a quicker way 7
8 Wish list The world of media is moving fast More challenges New requirements Huge data amounts Up to 10K hours of material for a typical regional news archive Even if a lot is non-textual, we deal with it by Tags, annotations, closed captioning, speech transcripts, Open source tools for audiovisual content analysis, indexing and retrieval can be a solution Speed up the search and retrieval process Automatic speech understanding Automatic translation for multilanguage news aggregation Characters extraction and text summarisation Information extraction and knowledge acquisition 8
9 Foundations 9
10 Architecture for multi-modal news management RSSF DTV Programme detection News Story Segmentation, STT, Categorisation Natural language processing Multimodal Services construction MMAS Indexing Natural language understanding, NE extraction MAi inputs S&R (full text, title, channel, category, ) TVi Co-clustering Dossiers generation outputs Internal processing 10
11 Audiovisual content processing for TV streams analysis The RAI ANTS (Automatic Newscast Transcription System) platform provides a set of tools for automated news segmentation, classification, indexing and retrieval Composed of three main modules Programme detection detects the start/end positions of newscasts from the acquired DTV streams News story segmentation performs segmentation of the acquired programmes into elementary news stories Speech to text analysis extracts text and semantics (i.e., categories, named entities) from the speech content of each story 11
12 RAI ANTS architecture 12
13 Natural Language Processing (NLP) for text analysis Natural language refers to the human ability of understanding ordered sequences of written or spoken words (i.e. phrases) Who? What? Where? When? Why? Language processing is the set of algorithms and tools that make machines able to understand and treat natural language Bla bla bla... Bla bla bla... NLP 13
14 NLP is hard! Computers use number sequences to communicate Numbers are simple Numbers are easy to understand Numbers do not lie Humans use word sequences to communicate Words can be unknown E.g. Supercalifragilisticexpialidocious????? Words can have multiple grammars and meanings To port - Port wine - Usb port Porto Port Words are multilingual Port - Porto - Puerto - Luka - Poort - Porten 14
15 NLP pyramid tasks Paragraph Sentence Word / Token Character Text summarisation Discourse analysis Question answering Sentiment analysis Parsing Sentence detection and chunking Co-reference resolution Named entity recognition (NER) Relationship extraction Machine translation Acronyms and abbreviations detection Segmentation Part of speech (POS) tagging Lemmatisation Stemming Word sense disambiguation (WSD) Encoding Case Punctuation Accents Numbers Symbols 15
16 NLP tools Different tools and libraries available Different implementations and programming platforms C, C++, C#, Java, Python, Perl, Ruby,... Different usage licenses GPL, LGPL, MIT, Apache,... Further detail on Wikipedia List of natural language processing toolkits 16
17 OpenNLP functionalities Machine learning toolkit released under the Apache License 2.0 Pre-built models for several languages Danish, German, English, Spanish, Dutch, Portuguese, Swedish Set of training tools for building further language models Sentence detector Name Finder Tokenization Docu ment Categ orizer Part of Speech Tagger Chu nker Parser Corefe rence Resol ution 17
18 Implementation and usage issues Sentence detection does not mean only splitting at punctuation marks E.g. - Ms. - Mrs ,000,000 Sentence tokenisation needs to Separate possessive endings or abbreviated forms from preceding words, e.g. Maurizio s, can t,... Separate punctuation marks, quotations, brackets (...) from words Maurizio lives in Turin (Italy). Maurizio lives in Turin ( Italy ). A word might have multiple pos tags depending on its context. Named entities might be of multiple types 18
19 Full text S&R - Apache Solr Full text enterprise search server based on Lucene XML/HTTP, JSON Interfaces Distributed as Web application (war) Platform independent, HTTP controlling Web administration interface Access, management, testing Easy configuration via XML files Definition of indexes, data types, operations, etc. Index replication and distribution Search results caching 19
20 Solr requirements Software Operating system: Windows, Linux, Mac,... Java Development Kit (JDK) v1.5 or greater Apache Ant (not required for standard installation) Java EE Application Server Jetty, Apache Tomcat, JBoss, Java Database Connectivity (JDBC) for database interaction Hardware requirements depends on the size and complexity of the data RAM affects indexing, optimisation and searching performance The size of the documents (number of documents, fields per document, fields size, ) affects storage requirements Testing performance SolrMeter
21 Solr configuration solr.xml: defines the number of cores (indexes) available schema.xml: defines all of the details about which fields your documents can contain, and how those fields should be dealt with when adding documents to the index, or when querying those fields. solrconfig.xml: is the file where to put most of the parameters for configuring the Solr cores (query handlers, highlighting, faceting, etc)
22 Solr - Data Import handler (DIH) The Data Import Handler is the component for importing data from external sources (e.g. XML archives, databases,..) Read and Index data from xml/(http/file) based on configuration Read data residing in relational databases Build Solr documents by aggregating data from multiple columns and tables according to configuration Update Solr according to DB updates Detect inserts/update deltas (changes) and do delta imports Make it possible to plugin any kind of data source (ftp, scp, ) and any other format (JSON, CSV, )
23 Use case implementation 23
24 General objectives Define and develop methods and systems for automated content analysis, documentation, indexing in the media domain Example target: news Explosive growth of available informative assets Professional, e.g. Newspapers, press agencies, Radio & TV Amateur, e.g. UGC, social networks, personal blogs Heterogeneity of sources, e.g. the Internet, radio, TV, print media, legacy archives,... 24
25 ANTS, HMN & Interactive Newsbook main components 25
26 ANTS, HMN & Interactive Newsbook - core features Fully automated multimodal content-analysis tools for data extraction and mining TV news programmes detection, segmentation and indexing RSS feeds analysis, hierarchical linking and indexing Aggregation of multimodal news items by affinity measure A novel (generalised) measure for assessing affinity Aggregations are contextualised within automatically extracted information Entities (i.e. persons, places and organisations) Temporal span Categorical topics Social networks popularity and audience scores. Integration with external resources Public Internet search engines, Legacy digital libraries Integration of otherwise disconnected resources 26
27 27
28 28
29 29
30 Filter Panels Named Entities 30
31 31
32 Conclusions The media industry is moving fast New markets, new trends, new technologies for endusers mean new challenges and new requirements and a lot more content items Indexing, integrating and accessing multimodal content in an efficient way is a crucial factor It s time to adopt the more mature results of automation systems and open source solutions in our production infrastructures The future is: make data (re) use and metadata cheaper and quicker 32
33 References RAI Centre for Research and Technological Innovation (CRIT) Automatic Newscast Transcription System (ANTS) Hyper Media News (HMN) Interactive Newsbook Apache OpenNLP Homepage (download, license, documentation,...) Models Apache Solr Homepage (download, features, documentation, Wiki, ) AJAX Solr library (demo, documentation, download) 33
34 34
GRAPHICAL USER INTERFACE, ACCESS, SEARCH AND REPORTING
MEDIA MONITORING AND ANALYSIS GRAPHICAL USER INTERFACE, ACCESS, SEARCH AND REPORTING Searchers Reporting Delivery (Player Selection) DATA PROCESSING AND CONTENT REPOSITORY ADMINISTRATION AND MANAGEMENT
RAI ACTIVE NEWS: INTEGRATING KNOWLEDGE IN NEWSROOMS
RAI ACTIVE NEWS: INTEGRATING KNOWLEDGE IN NEWSROOMS M. Montagnuolo and A. Messina RAI Radiotelevisione Italiana, Italy ABSTRACT In the modern digital age methodologies for professional Big Data analytics
Search and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
Software Architecture Document
Software Architecture Document Natural Language Processing Cell Version 1.0 Natural Language Processing Cell Software Architecture Document Version 1.0 1 1. Table of Contents 1. Table of Contents... 2
Automatic Text Analysis Using Drupal
Automatic Text Analysis Using Drupal By Herman Chai Computer Engineering California Polytechnic State University, San Luis Obispo Advised by Dr. Foaad Khosmood June 14, 2013 Abstract Natural language processing
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts
MIRACLE at VideoCLEF 2008: Classification of Multilingual Speech Transcripts Julio Villena-Román 1,3, Sara Lana-Serrano 2,3 1 Universidad Carlos III de Madrid 2 Universidad Politécnica de Madrid 3 DAEDALUS
SIPAC. Signals and Data Identification, Processing, Analysis, and Classification
SIPAC Signals and Data Identification, Processing, Analysis, and Classification Framework for Mass Data Processing with Modules for Data Storage, Production and Configuration SIPAC key features SIPAC is
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast Hassan Sawaf Science Applications International Corporation (SAIC) 7990
Using Apache Solr for Ecommerce Search Applications
Using Apache Solr for Ecommerce Search Applications Rajani Maski Happiest Minds, IT Services SHARING. MINDFUL. INTEGRITY. LEARNING. EXCELLENCE. SOCIAL RESPONSIBILITY. 2 Copyright Information This document
Apache Lucene. Searching the Web and Everything Else. Daniel Naber Mindquarry GmbH ID 380
Apache Lucene Searching the Web and Everything Else Daniel Naber Mindquarry GmbH ID 380 AGENDA 2 > What's a search engine > Lucene Java Features Code example > Solr Features Integration > Nutch Features
Information Retrieval Elasticsearch
Information Retrieval Elasticsearch IR Information retrieval (IR) is the activity of obtaining information resources relevant to an information need from a collection of information resources. Searches
BIRT Document Transform
BIRT Document Transform BIRT Document Transform is the industry leader in enterprise-class, high-volume document transformation. It transforms and repurposes high-volume documents and print streams such
K@ A collaborative platform for knowledge management
White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index
Natural Language to Relational Query by Using Parsing Compiler
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
Search and Data Mining: Techniques. Text Mining Anya Yarygina Boris Novikov
Search and Data Mining: Techniques Text Mining Anya Yarygina Boris Novikov Introduction Generally used to denote any system that analyzes large quantities of natural language text and detects lexical or
Natural Language Processing
Natural Language Processing 2 Open NLP (http://opennlp.apache.org/) Java library for processing natural language text Based on Machine Learning tools maximum entropy, perceptron Includes pre-built models
Distributed Computing and Big Data: Hadoop and MapReduce
Distributed Computing and Big Data: Hadoop and MapReduce Bill Keenan, Director Terry Heinze, Architect Thomson Reuters Research & Development Agenda R&D Overview Hadoop and MapReduce Overview Use Case:
Search and Real-Time Analytics on Big Data
Search and Real-Time Analytics on Big Data Sewook Wee, Ryan Tabora, Jason Rutherglen Accenture & Think Big Analytics Strata New York October, 2012 Big Data: data becomes your core asset. It realizes its
Semantic annotation of requirements for automatic UML class diagram generation
www.ijcsi.org 259 Semantic annotation of requirements for automatic UML class diagram generation Soumaya Amdouni 1, Wahiba Ben Abdessalem Karaa 2 and Sondes Bouabid 3 1 University of tunis High Institute
How To Manage Your Digital Assets On A Computer Or Tablet Device
In This Presentation: What are DAMS? Terms Why use DAMS? DAMS vs. CMS How do DAMS work? Key functions of DAMS DAMS and records management DAMS and DIRKS Examples of DAMS Questions Resources What are DAMS?
Content Management Systems: Drupal Vs Jahia
Content Management Systems: Drupal Vs Jahia Mrudula Talloju Department of Computing and Information Sciences Kansas State University Manhattan, KS 66502. [email protected] Abstract Content Management Systems
Technical Report. The KNIME Text Processing Feature:
Technical Report The KNIME Text Processing Feature: An Introduction Dr. Killian Thiel Dr. Michael Berthold [email protected] [email protected] Copyright 2012 by KNIME.com AG
How to make a good Software Requirement Specification(SRS)
Information Management Software Information Management Software How to make a good Software Requirement Specification(SRS) Click to add text TGMC 2011 Phases Registration SRS Submission Project Submission
Knowledge Discovery from patents using KMX Text Analytics
Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers
Large Scale Text Analysis Using the Map/Reduce
Large Scale Text Analysis Using the Map/Reduce Hierarchy David Buttler This work is performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract
Special Topics in Computer Science
Special Topics in Computer Science NLP in a Nutshell CS492B Spring Semester 2009 Jong C. Park Computer Science Department Korea Advanced Institute of Science and Technology INTRODUCTION Jong C. Park, CS
Æ Æ. PROVYS TVoffice. management reporting. evaluation. planning. integrated workflow and information sharing environment. asset procurement costs
PROVYS TV Office PROVYS TVoffice 2 3 management reporting asset procurement costs asset value management planning acquisition & production programming and promotion content management media management
Shallow Parsing with Apache UIMA
Shallow Parsing with Apache UIMA Graham Wilcock University of Helsinki Finland [email protected] Abstract Apache UIMA (Unstructured Information Management Architecture) is a framework for linguistic
Berlin-Brandenburg Academy of sciences and humanities (BBAW) resources / services
Berlin-Brandenburg Academy of sciences and humanities (BBAW) resources / services speakers: Kai Zimmer and Jörg Didakowski Clarin Workshop WP2 February 2009 BBAW/DWDS The BBAW and its 40 longterm projects
Log management with Logstash and Elasticsearch. Matteo Dessalvi
Log management with Logstash and Elasticsearch Matteo Dessalvi HEPiX 2013 Outline Centralized logging. Logstash: what you can do with it. Logstash + Redis + Elasticsearch. Grok filtering. Elasticsearch
INSPIRE Dashboard. Technical scenario
INSPIRE Dashboard Technical scenario Technical scenarios #1 : GeoNetwork catalogue (include CSW harvester) + custom dashboard #2 : SOLR + Banana dashboard + CSW harvester #3 : EU GeoPortal +? #4 :? + EEA
COPYRIGHT 2011 COPYRIGHT 2012 AXON DIGITAL DESIGN B.V. ALL RIGHTS RESERVED
Subtitle insertion GEP100 - HEP100 Inserting 3Gb/s, HD, subtitles SD embedded and Teletext domain with the Dolby HSI20 E to module PCM decoder with audio shuffler A A application product note COPYRIGHT
Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer
Automated Data Ingestion Bernhard Disselhoff Enterprise Sales Engineer Agenda Pentaho Overview Templated dynamic ETL workflows Pentaho Data Integration (PDI) Use Cases Pentaho Overview Overview What we
UIMA: Unstructured Information Management Architecture for Data Mining Applications and developing an Annotator Component for Sentiment Analysis
UIMA: Unstructured Information Management Architecture for Data Mining Applications and developing an Annotator Component for Sentiment Analysis Jan Hajič, jr. Charles University in Prague Faculty of Mathematics
Content Management System (CMS)
Content Management System (CMS) ASP.NET Web Site User interface to the CMS SQL Server metadata storage, configuration, user management, order history, etc. Windows Service (C#.NET with TCP/IP remote monitoring)
Chapter 7. Using Hadoop Cluster and MapReduce
Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in
A Framework-based Online Question Answering System. Oliver Scheuer, Dan Shen, Dietrich Klakow
A Framework-based Online Question Answering System Oliver Scheuer, Dan Shen, Dietrich Klakow Outline General Structure for Online QA System Problems in General Structure Framework-based Online QA system
Survey Results: Requirements and Use Cases for Linguistic Linked Data
Survey Results: Requirements and Use Cases for Linguistic Linked Data 1 Introduction This survey was conducted by the FP7 Project LIDER (http://www.lider-project.eu/) as input into the W3C Community Group
M3039 MPEG 97/ January 1998
INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND ASSOCIATED AUDIO INFORMATION ISO/IEC JTC1/SC29/WG11 M3039
Things Made Easy: One Click CMS Integration with Solr & Drupal
May 10, 2012 Things Made Easy: One Click CMS Integration with Solr & Drupal Peter M. Wolanin, Ph.D. Momentum Specialist (principal engineer), Acquia, Inc. Drupal contributor drupal.org/user/49851 co-maintainer
W3Perl A free logfile analyzer
W3Perl A free logfile analyzer Features Works on Unix / Windows / Mac View last entries based on Perl scripts Web / FTP / Squid / Email servers Session tracking Others log format can be added easily Detailed
A stream computing approach towards scalable NLP
A stream computing approach towards scalable NLP Xabier Artola, Zuhaitz Beloki, Aitor Soroa IXA group. University of the Basque Country. LREC, Reykjavík 2014 Table of contents 1
The Open Source Knowledge Discovery and Document Analysis Platform
Enabling Agile Intelligence through Open Analytics The Open Source Knowledge Discovery and Document Analysis Platform 17/10/2012 1 Agenda Introduction and Agenda Problem Definition Knowledge Discovery
SOA, case Google. Faculty of technology management 07.12.2009 Information Technology Service Oriented Communications CT30A8901.
Faculty of technology management 07.12.2009 Information Technology Service Oriented Communications CT30A8901 SOA, case Google Written by: Sampo Syrjäläinen, 0337918 Jukka Hilvonen, 0337840 1 Contents 1.
Text Analytics Evaluation Case Study - Amdocs
Text Analytics Evaluation Case Study - Amdocs Tom Reamy Chief Knowledge Architect KAPS Group http://www.kapsgroup.com Text Analytics World October 20 New York Agenda Introduction Text Analytics Basics
European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project
European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project Janet Delve, University of Portsmouth Kuldar Aas, National Archives of Estonia Rainer Schmidt, Austrian Institute
FOTOSTATION. The best media file organizer. It s organized
FotoStation 7.0 FOTOSTATION The best media file organizer Find, process and share your digital assets Search locally and centrally Automate your workflow with actions Available in 12 languages It s organized
Web Analytics Understand your web visitors without web logs or page tags and keep all your data inside your firewall.
Web Analytics Understand your web visitors without web logs or page tags and keep all your data inside your firewall. 5401 Butler Street, Suite 200 Pittsburgh, PA 15201 +1 (412) 408 3167 www.metronomelabs.com
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - [email protected]. CMSC 601 - Presentation
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM Aniket Bochare - [email protected] CMSC 601 - Presentation Date-04/25/2011 AGENDA Introduction and Background Framework Heterogeneous
Avaya Aura Orchestration Designer
Avaya Aura Orchestration Designer Avaya Aura Orchestration Designer is a unified service creation environment for faster, lower cost design and deployment of voice and multimedia applications and agent
Wiley. Automated Data Collection with R. Text Mining. A Practical Guide to Web Scraping and
Automated Data Collection with R A Practical Guide to Web Scraping and Text Mining Simon Munzert Department of Politics and Public Administration, Germany Christian Rubba University ofkonstanz, Department
Flattening Enterprise Knowledge
Flattening Enterprise Knowledge Do you Control Your Content or Does Your Content Control You? 1 Executive Summary: Enterprise Content Management (ECM) is a common buzz term and every IT manager knows it
Digital Asset Management. Content Control for Valuable Media Assets
Digital Asset Management Content Control for Valuable Media Assets Overview Digital asset management is a core infrastructure requirement for media organizations and marketing departments that need to
How To Make Sense Of Data With Altilia
HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to
Using Television and Radio programme for teaching
[Type here] Using Television and Radio programme for teaching BOB Box of Broadcasts This workshop is for anyone who would like to use television and radio recordings in teaching and research, and takes
Building Multilingual Search Index using open source framework
Building Multilingual Search Index using open source framework ABSTRACT Arjun Atreya V 1 Swapnil Chaudhari 1 Pushpak Bhattacharyya 1 Ganesh Ramakrishnan 1 (1) Deptartment of CSE, IIT Bombay {arjun, swapnil,
Hardware, Software and Training Requirements for DMFAS 6
Hardware, Software and Training Requirements for DMFAS 6 DMFAS6/HardwareSoftware/V5 May 2015 2 Hardware, Software and Training Requirements for DMFAS 6 Contents ABOUT THIS DOCUMENT... 4 HARDWARE REQUIREMENTS...
TDAQ Analytics Dashboard
14 October 2010 ATL-DAQ-SLIDE-2010-397 TDAQ Analytics Dashboard A real time analytics web application Outline Messages in the ATLAS TDAQ infrastructure Importance of analysis A dashboard approach Architecture
Joint Research Centre
Joint Research Centre Open Source Monitoring Tools and Applications emm.newsbrief.eu Serving society Stimulating innovation Supporting legislation Open Source Monitoring - Overview EMM Introduction Custom
DataDirect XQuery Technical Overview
DataDirect XQuery Technical Overview Table of Contents 1. Feature Overview... 2 2. Relational Database Support... 3 3. Performance and Scalability for Relational Data... 3 4. XML Input and Output... 4
Why are Organizations Interested?
SAS Text Analytics Mary-Elizabeth ( M-E ) Eddlestone SAS Customer Loyalty [email protected] +1 (607) 256-7929 Why are Organizations Interested? Text Analytics 2009: User Perspectives on Solutions
IBM Content Analytics with Enterprise Search, Version 3.0
IBM Content Analytics with Enterprise Search, Version 3.0 Highlights Enables greater accuracy and control over information with sophisticated natural language processing capabilities to deliver the right
Decision Support System Software Asset Management (SAM)
DecisionSupportSystem SoftwareAssetManagement(SAM) ReleaseNotes Version1.2.3 May,2010 BigFix DSSSAM1.2.3 2009-2010 BigFix, Inc. All rights reserved. BigFix, Fixlet, Relevance Engine, Powered by BigFix
Technical Report. Implementation and Performance Testing of Business Rules Evaluation Systems in a Computing Grid. Brian Fletcher x08872155
Technical Report Implementation and Performance Testing of Business Rules Evaluation Systems in a Computing Grid Brian Fletcher x08872155 Executive Summary 4 Introduction 5 Background 5 Aims 5 Technology
Video, film, and animation are all moving images that are recorded onto videotape,
See also Data Display (Part 3) Document Design (Part 3) Instructions (Part 2) Specifications (Part 2) Visual Communication (Part 3) Video and Animation Video, film, and animation are all moving images
The Knowledge Sharing Infrastructure KSI. Steven Krauwer
The Knowledge Sharing Infrastructure KSI Steven Krauwer 1 Why a KSI? Building or using a complex installation requires specialized skills and expertise. CLARIN is no exception. CLARIN is populated with
Analysis of Data Mining Concepts in Higher Education with Needs to Najran University
590 Analysis of Data Mining Concepts in Higher Education with Needs to Najran University Mohamed Hussain Tawarish 1, Farooqui Waseemuddin 2 Department of Computer Science, Najran Community College. Najran
Manifest for Big Data Pig, Hive & Jaql
Manifest for Big Data Pig, Hive & Jaql Ajay Chotrani, Priyanka Punjabi, Prachi Ratnani, Rupali Hande Final Year Student, Dept. of Computer Engineering, V.E.S.I.T, Mumbai, India Faculty, Computer Engineering,
Mark Bennett. Search and the Virtual Machine
Mark Bennett Search and the Virtual Machine Agenda Intro / Business Drivers What to do with Search + Virtual What Makes Search Fast (or Slow!) Virtual Platforms Test Results Trends / Wrap Up / Q & A Business
Natural Language Processing in the EHR Lifecycle
Insight Driven Health Natural Language Processing in the EHR Lifecycle Cecil O. Lynch, MD, MS [email protected] Health & Public Service Outline Medical Data Landscape Value Proposition of NLP
Best Practices for Structural Metadata Version 1 Yale University Library June 1, 2008
Best Practices for Structural Metadata Version 1 Yale University Library June 1, 2008 Background The Digital Production and Integration Program (DPIP) is sponsoring the development of documentation outlining
Softline VIP Payroll System Requirements v2.9a January 2010
i Softline VIP Payroll System Requirements v2.9a January 2010 Table of Contents Introduction... 1 Assumptions... 1 Standalone Requirements... 2 Note:Peer-to-peer Network Installations... 2 Peer-to-peer
Open Source SOA with Service Component Architecture and Apache Tuscany. Jean-Sebastien Delfino Mario Antollini Raymond Feng
Open Source SOA with Service Component Architecture and Apache Tuscany Jean-Sebastien Delfino Mario Antollini Raymond Feng Learn how to build and deploy Composite Service Applications using Service Component
Abstract 1. INTRODUCTION
A Virtual Database Management System For The Internet Alberto Pan, Lucía Ardao, Manuel Álvarez, Juan Raposo and Ángel Viña University of A Coruña. Spain e-mail: {alberto,lucia,mad,jrs,avc}@gris.des.fi.udc.es
ifinder ENTERPRISE SEARCH
DATA SHEET ifinder ENTERPRISE SEARCH ifinder - the Enterprise Search solution for company-wide information search, information logistics and text mining. CUSTOMER QUOTE IntraFind stands for high quality
MERMIG The advanced collaboration software
What is MERMIG? MERMIG is a powerful, Web based collaboration platform addressing the needs of those who are: involved in working groups, committees, projects or any kind of team work, dealing with multilingual
Developing Microsoft SharePoint Server 2013 Advanced Solutions
Course 20489B: Developing Microsoft SharePoint Server 2013 Advanced Solutions Course Details Course Outline Module 1: Creating Robust and Efficient Apps for SharePoint In this module, you will review key
IBM Proof of Technology Discovering business application services, featuring IBM WebSphere Application Server Network Deployment V8
IBM Proof of Technology Discovering business application services, featuring IBM WebSphere Application Server Network Deployment V8 Proof of Technology Introduction Welcome and Introductions Agenda Service
Outline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging
Outline High Performance Computing (HPC) Towards exascale computing: a brief history Challenges in the exascale era Big Data meets HPC Some facts about Big Data Technologies HPC and Big Data converging
White Paper. How Streaming Data Analytics Enables Real-Time Decisions
White Paper How Streaming Data Analytics Enables Real-Time Decisions Contents Introduction... 1 What Is Streaming Analytics?... 1 How Does SAS Event Stream Processing Work?... 2 Overview...2 Event Stream
Course Scheduling Support System
Course Scheduling Support System Roy Levow, Jawad Khan, and Sam Hsu Department of Computer Science and Engineering, Florida Atlantic University Boca Raton, FL 33431 {levow, jkhan, samh}@fau.edu Abstract
Web Traffic Capture. 5401 Butler Street, Suite 200 Pittsburgh, PA 15201 +1 (412) 408 3167 www.metronomelabs.com
Web Traffic Capture Capture your web traffic, filtered and transformed, ready for your applications without web logs or page tags and keep all your data inside your firewall. 5401 Butler Street, Suite
In: Proceedings of RECPAD 2002-12th Portuguese Conference on Pattern Recognition June 27th- 28th, 2002 Aveiro, Portugal
Paper Title: Generic Framework for Video Analysis Authors: Luís Filipe Tavares INESC Porto [email protected] Luís Teixeira INESC Porto, Universidade Católica Portuguesa [email protected] Luís Corte-Real
