Technical Presentations. Arian Pasquali, FEUP, REACTION Data Collection Platform. David Batista, INESC-ID, Semantic Relations Extraction. REACTION

Transcription:

Agenda
11:30 Welcome + quick progress report and status summary
11:45 Task leaders summarize ongoing activities (10 min each max)
12:30 Break
14:00 Technical Presentations
15:00 Break
16:00 Short Technical Presentations / Demos
18:00 Directions for next meeting/next workshop
18:15 Meeting ends

Technical Presentations
- Arian Pasquali, FEUP - Data Collection Platform
- David Batista, INESC-ID - Semantic Relations Extraction

Short Technical Presentations (5-10 min each)
- Silvio Moreira - RepLab + SemEval
- João Santos - Entity Disambiguation
- Carolina Bento - Co-occurrence networks
- João Oliveira - n-grams, 10 years of Público
- Raquel Albuquerque - Data Journalism at Público
- Francisco Couto - O mundo em Pessoa
- Pedro Saleiro - Filtering at RepLab
- Jorge Teixeira - Timeline
- Gustavo Laboreiro - Data Preparation
- Tiago Cunha - TweeProfiles (PS)
- Tomy Rodrigues - RetweePatterns

The Problem...
- Computational journalism, aka database journalism
- Intensive use of software tools for news research, production and presentation
- What is the impact on the routines of newsrooms?
- What effect will these tools have on the quality of news and the productivity of journalists?

Challenges
1. Automatic content analysis (documents, news, blogs, micro-blogs, comments)
2. Automatic analysis of explicit and implicit social networks
3. Design of rich visualization and interaction interfaces
4. Case-study evaluation of the developed computational journalism methodology in a production setting. Critical analysis of practical impact on newsroom quality, efficiency, and economics.

Partnership
- LASIGE, FCUL >> INESC-ID, IST (Mário J. Silva, Paula Carvalho and Francisco Couto from FCUL)
- LIACC, FEUP (Eugénio de Oliveira, Eduarda M. Rodrigues, Luís Sarmento, Carlos Soares)
- CIMJ, FCH/UNL (António Granado)
- Austin: School of Information and Computer Science at Austin (Luis Francisco-Revilla, Matthew Lease)
- PT Comunicações, SAPO (Benjamim Júnior, Celso Martinho, Luís Sarmento, Pedro Torres)
- Público (Sérgio B. Gomes)

Students
- INESC-ID: David Batista, Silvio Moreira, Diogo Figueiredo, João Ramalho, João Oliveira, João Santos, Carolina Bento, Rui Silva, David Forte
- UP: Matko Bosnjak, Arian Pasquali, Gustavo Laboreiro, Andrija Cajic, Nuno Baldaia, Tiago Cunha, Jorge Moreira, Jorge Teixeira, Luís Rei (SAPO)
- UT Austin: Hohyon Ryu, Steven Fazzio
- UNL: Raquel Albuquerque, Tiago Carvalho

Research tasks
1. Information Mining
2. Information Discovery
3. Web Community Sensing
4. Tracking Information Flow
5. Interaction and Personalization
6. Query and Visualization
7. Computational Newsroom

Research tasks - Leaders
1. Information Mining: Paula Carvalho
2. Information Discovery: Bruno Martins (was Francisco)
3. Web Community Sensing: Carlos Soares (was Eduarda)
4. Tracking Information Flow: Francisco Couto (was Matt)
5. Interaction & Personalization: Mário J. Silva (was Revilla)
6. Query and Visualization: Carlos Soares (Sarmento, Eduarda)
7. Computational Newsroom: António Granado (Mário covers)

Information Mining
Development of robust linguistic resources to process different types and genres of texts:
- Knowledge resources about media personalities: recognizing and resolving references to named entities
- Sentiment lexicons and grammars: detecting the polarity of opinions about relevant personalities
- Annotated corpora: training different text classifiers and evaluating classification procedures
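As an illustration of how a sentiment lexicon can be used to detect opinion polarity, here is a minimal sketch. It is not the project's method: the toy English lexicon, the negator list and the `polarity` function are all assumptions made for the example, and the project's own lexicons and grammars would be far richer.

```python
import re

# Hypothetical toy lexicon (assumption): word -> polarity value.
LEXICON = {"good": 1, "great": 1, "honest": 1, "bad": -1, "corrupt": -1, "weak": -1}
# Hypothetical negators (assumption): they flip the polarity of the next word.
NEGATORS = {"not", "never", "no"}

def polarity(text, lexicon=LEXICON):
    """Sum word polarities, flipping the sign of a word that directly follows a negator."""
    tokens = re.findall(r"\w+", text.lower())
    score = 0
    for i, tok in enumerate(tokens):
        value = lexicon.get(tok, 0)
        if i > 0 and tokens[i - 1] in NEGATORS:
            value = -value
        score += value
    return score
```

A positive score suggests a favourable opinion, a negative score an unfavourable one, and zero means no lexicon word was matched or the matches cancelled out.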

Information Discovery
Relationship extraction techniques to support information discovery in journalists' activities:
- Entity Ranking: finding the relevant entities for a given topic
- Entity Distillation: finding relevant resources for a given entity
- Attribute Selection: finding a list of key aspects to compare and differentiate a given set of entities
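Entity Ranking can be illustrated with a very simple baseline: count, for each known entity, the documents that mention both the entity and the topic. This is only a sketch under stated assumptions. The `rank_entities` function, the substring matching and the pre-supplied entity list are hypothetical and not taken from the project.

```python
from collections import Counter

def rank_entities(documents, topic, known_entities):
    """Rank entities by the number of on-topic documents that mention them."""
    counts = Counter()
    topic = topic.lower()
    for doc in documents:
        text = doc.lower()
        if topic not in text:
            continue  # skip documents that do not mention the topic
        for entity in known_entities:
            if entity.lower() in text:
                counts[entity] += 1
    # Entities never seen with the topic are left out of the ranking.
    return [entity for entity, _ in counts.most_common()]
```

For example, given a few headlines and the topic "budget", the entities mentioned most often alongside that word come first.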

Web Community Sensing
- Modeling the credibility and authority of news sources and opinion makers in social networks
- Identifying influential individuals and experts on a given news topic
- Monitoring the community reaction to news stories and the polarity of opinions

Tracking Information Flow
- Identifying the originating source of new ideas and information
- Understanding the evolutionary development of ideas through their iterative retelling and revision over time and across sources
- Detecting cases and patterns of re-use (e.g. via memes or larger units of similar text) and information flow for source identification and novelty detection
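One common way to detect re-use of text between two sources is to compare their sets of word n-grams (shingles) with the Jaccard coefficient. The sketch below swaps in that generic technique for illustration; the function names and the choice of n=3 are assumptions, and the slides do not state which method the project uses.

```python
import re

def shingles(text, n=3):
    """Return the set of word n-grams (shingles) in the text."""
    tokens = re.findall(r"\w+", text.lower())
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def jaccard(a, b):
    """Jaccard coefficient of two sets; defined as 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)

def reuse_score(doc_a, doc_b, n=3):
    """Score in [0, 1]: the share of n-grams the two documents have in common."""
    return jaccard(shingles(doc_a, n), shingles(doc_b, n))
```

A score close to 1 indicates near-verbatim copying, while a score close to 0 indicates little shared wording. Ordering high-scoring documents by publication time gives a rough first guess at the originating source.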

Interaction and Personalization
Determining which interaction and personalization mechanisms are best suited to:
- Significantly enhance the user experience
- Provide the news site with useful, tacit feedback about its readers' needs
Investigating interactive news interfaces that support both automatic and manual personalization for readers

Query and Visualization
- Development of tools for querying extracted information and visualizing annotated documents and datasets
- Continuous scanning of the social web, news sources and various kinds of data streams
- SAPO already scans and processes many of these streams, in particular the news media

Computational Newsroom
- Environment where the new tools and resources developed in the project, together with other software, will be accessible
- Will use the tools and collect data for case studies to be evaluated
- Observation and structured interviewing of the journalists in contact with the developed tools
- The research will try to contextualize the changing nature of media work

More details
- Started October 1st 2010, 3 years
- http://dmir.inesc-id.pt/reaction/
- 1st milestone: End of Month 6
  Specification
  First toolset prototype (should have demoed it at the 2011 Collaboratory)
- 2nd milestone: End of Month 36
  Demonstrable Computational Newsroom
- Asking for an extension