INF5820 Natural Language Processing - NLP. H2009 Jan Tore Lønning jtl@ifi.uio.no


Semantic Role Labeling INF5830 Lecture 13 Nov 4, 2009

Today
- Some words about semantics
- Thematic/semantic roles
- PropBank & FrameNet
- Role labeling

What is the goal of NLP?
- Applications:
  - (Semantic) search
  - Summarization
  - Translation
  - Man-machine interaction, e.g. GPS
- Semantics
- Grammars and parsing are only a step on the way

Computational semantics
- Choose adequate semantic representations for utterances
- Compute representations from utterances
- Process representations
- Generate sentences from representations

Semantics
- Logic-based: ∃x(flight_departure(x) ∧ from(x, oslo, t1) ∧ to(x, bodø, t2) ∧ tuesday(t1))
- Frame-based:
    FLIGHT:
      DEPARTURE:
        CITY: oslo
        DATE:
          WEEKDAY: tuesday
      ARRIVAL:
        CITY: bodø
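The two styles of representation on this slide can be related mechanically. The following is a minimal sketch, assuming English renderings of the slide's predicate and slot names and a hand-written mapping for this one frame type; the function names `frame_to_atoms` and `render` are hypothetical.

```python
# Frame-based representation: a nested attribute-value structure.
flight_frame = {
    "FLIGHT": {
        "DEPARTURE": {"CITY": "oslo", "DATE": {"WEEKDAY": "tuesday"}},
        "ARRIVAL": {"CITY": "bodø"},
    }
}


def frame_to_atoms(frame):
    """Flatten the flight frame into atomic formulas (predicate, args...)."""
    flight = frame["FLIGHT"]
    departure, arrival = flight["DEPARTURE"], flight["ARRIVAL"]
    atoms = [
        ("flight_departure", "x"),
        ("from", "x", departure["CITY"], "t1"),
        ("to", "x", arrival["CITY"], "t2"),
    ]
    weekday = departure.get("DATE", {}).get("WEEKDAY")
    if weekday:
        # The weekday constrains the departure time t1.
        atoms.append((weekday, "t1"))
    return atoms


def render(atoms, variable="x"):
    """Print the atoms as an existentially quantified conjunction."""
    body = " ∧ ".join(f"{pred}({', '.join(args)})" for pred, *args in atoms)
    return f"∃{variable}({body})"


print(render(frame_to_atoms(flight_frame)))
```

The frame keeps the information grouped by slot, while the logical form spreads it over independent atomic formulas that share variables.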

Alternative representations

More representations

Core
- Married(adam, eve)
- Predicate and arguments
- Logic: core = atomic formulas
- The atomic formulas are unstructured
- In addition, connectives and quantifiers: ∀x((Student(x) ∧ Live_in(x, oslo)) → Happy(x))

In addition to first-order logic
Extended logic:
- Adjectives: small elephant, former president
- Adverbs: ran fast
- Propositions as arguments: believes the earth is flat
- etc.
- Time and change: built a house, was president
- Events
- Co-reference: The foreign minister met the president. He told her ...

Alternative representations
- Classic logic: Married(adam, eve)
- Davidsonian: ∃e married(e, adam, eve)
- Neo-Davidsonian, alternative role levels:
  1. ∃e(married(e) & SUBJ(e, adam) & OBJ(e, eve))
  2. ∃e(married(e) & ARG0(e, adam) & ARG1(e, eve))
  3. ∃e(married(e) & AGENT(e, adam) & THEME(e, eve))
  4. ∃e(married(e) & Marrier(e, adam) & Marriee(e, eve))
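The three notations differ only in how the arguments are attached to the predicate. A minimal sketch that generates each form as a string; the function names and the `ROLE_LEVELS` table are hypothetical, with the role labels taken from the four levels on the slide.

```python
# Role labels for the first and second argument at each level of abstraction.
ROLE_LEVELS = {
    "syntactic": ("SUBJ", "OBJ"),
    "numbered": ("ARG0", "ARG1"),
    "thematic": ("AGENT", "THEME"),
    "verb_specific": ("Marrier", "Marriee"),
}


def classic(pred, args):
    """Classic logic: the predicate applies directly to its arguments."""
    return f"{pred.capitalize()}({', '.join(args)})"


def davidsonian(pred, args):
    """Davidsonian: add an event variable as an extra argument."""
    return f"∃e {pred}(e, {', '.join(args)})"


def neo_davidsonian(pred, args, level):
    """Neo-Davidsonian: one conjunct per argument, linked to the event by a role."""
    roles = ROLE_LEVELS[level]
    conjuncts = [f"{pred}(e)"]
    conjuncts += [f"{role}(e, {arg})" for role, arg in zip(roles, args)]
    return "∃e(" + " & ".join(conjuncts) + ")"


for level in ROLE_LEVELS:
    print(neo_davidsonian("married", ("adam", "eve"), level))
```

Because each argument gets its own conjunct, the neo-Davidsonian form lets the choice of role inventory vary without changing the rest of the representation.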


Thematic/semantic roles
- Fine structure of the core: predicate-argument
- Deep syntax/shallow semantics
- Theta roles for syntactic roles
- Thematic roles for the semantic counterpart

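Thematic roles
- Kari ga Ola en bil ("Kari gave Ola a car"): AGENT BEN THEME
Roles do not correspond to syntactic function:
- Kari ga en bil til Ola ("Kari gave a car to Ola"): AGENT THEME BEN
- En bil ble gitt Ola av Kari ("A car was given to Ola by Kari"): THEME BEN AGENT
- Ola ble gitt en bil av Kari ("Ola was given a car by Kari"): BEN THEME AGENT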
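The point of the four examples can be checked directly: the role assignment is the same in every sentence even though the subject changes. A minimal sketch, in which the syntactic function labels (SUBJ, IOBJ, DOBJ, PP) are an assumption added for illustration and are not on the slide.

```python
# Each analysis lists (syntactic function, filler, thematic role).
ANALYSES = {
    "Kari ga Ola en bil": [
        ("SUBJ", "Kari", "AGENT"), ("IOBJ", "Ola", "BEN"), ("DOBJ", "en bil", "THEME"),
    ],
    "Kari ga en bil til Ola": [
        ("SUBJ", "Kari", "AGENT"), ("DOBJ", "en bil", "THEME"), ("PP", "Ola", "BEN"),
    ],
    "En bil ble gitt Ola av Kari": [
        ("SUBJ", "en bil", "THEME"), ("IOBJ", "Ola", "BEN"), ("PP", "Kari", "AGENT"),
    ],
    "Ola ble gitt en bil av Kari": [
        ("SUBJ", "Ola", "BEN"), ("DOBJ", "en bil", "THEME"), ("PP", "Kari", "AGENT"),
    ],
}


def role_frame(analysis):
    """Keep only the role-to-filler mapping, ignoring syntactic function."""
    return {role: filler for _, filler, role in analysis}


def subject_role(analysis):
    """Return the thematic role carried by the syntactic subject."""
    return next(role for function, _, role in analysis if function == "SUBJ")


frames = [role_frame(analysis) for analysis in ANALYSES.values()]
subject_roles = {subject_role(analysis) for analysis in ANALYSES.values()}

print(frames[0])               # the shared role frame
print(sorted(subject_roles))   # roles that show up in subject position
```

All four sentences yield one role frame, while the subject position is filled by three different roles.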

Common roles

Role examples

Good for what?
- Linguistics:
  - Generalizations: classes of verbs with similar patterns
  - Alternations, e.g. dative shift
  - Hierarchy of roles: relationship to syntactic functions
- NLP:
  - Simple inferences
  - Representations for machine translation

Problems
- Problems:
  - Which roles are there? No agreement
  - How to decide on the particular roles?
- Fixes:
  - Role types are not firm classes but prototypical: more and less clear-cut instances
  - Two levels: proto-roles (proto-agent, proto-patient) and finer roles

Levin's verb classes
- In which construction types can a particular verb occur?
  - Kim broke the window
  - The window broke
  - Glass breaks easily
- Similarly: shatter, smash
- Not: cut
- Verbs with the same patterns are classified together
- Tried to classify (all) English verbs
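Classifying verbs by the constructions they allow amounts to grouping them by their set of patterns. A minimal sketch with a toy table: the construction names are hypothetical labels for the three example sentences, and the entry for "cut" (transitive and middle, but no intransitive "The bread cut") is an assumption added for illustration, since the slide only says that "cut" does not pattern with "break".

```python
from collections import defaultdict

# Toy table: which construction types each verb can occur in.
VERB_PATTERNS = {
    "break": {"transitive", "inchoative", "middle"},
    "shatter": {"transitive", "inchoative", "middle"},
    "smash": {"transitive", "inchoative", "middle"},
    "cut": {"transitive", "middle"},  # assumed entry, see the note above
}


def classify(patterns):
    """Group verbs that occur in exactly the same set of constructions."""
    classes = defaultdict(list)
    for verb, constructions in sorted(patterns.items()):
        classes[frozenset(constructions)].append(verb)
    return dict(classes)


classes = classify(VERB_PATTERNS)
for constructions, verbs in classes.items():
    print(sorted(constructions), verbs)
```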


PropBank
- Shallow semantic annotation of the Penn Treebank
- Focus on semantic roles
- Not: quantifiers, co-reference, etc.

PropBank cont.
- Uses simple roles: Arg0, Arg1, Arg2, etc.
- Relates to Levin's classification
- Roles consistent across a frameset

FrameNet
- Fillmore, Berkeley
- Deeper roles
- Semantic network, hierarchy


Role labeling
1. Finding the constituents that are arguments to a predicate in a sentence
2. Determining their role
- Supervised learning
- PropBank or FrameNet or ...
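The two steps can be sketched as two classifiers applied in sequence. This is a toy illustration, not the method of any particular system: the `MostFrequentLabel` class, the two-element feature tuples (phrase type, position relative to the predicate), and the training examples are all hypothetical stand-ins for a real learner trained on PropBank or FrameNet annotations.

```python
from collections import Counter, defaultdict


class MostFrequentLabel:
    """Toy supervised classifier: predict the label seen most often for a feature tuple."""

    def __init__(self, default):
        self.counts = defaultdict(Counter)
        self.default = default

    def fit(self, examples):
        for features, label in examples:
            self.counts[features][label] += 1
        return self

    def predict(self, features):
        seen = self.counts.get(features)
        return seen.most_common(1)[0][0] if seen else self.default


# Hypothetical training data: (phrase type, position) -> role, "NONE" = not an argument.
TRAIN = [
    (("NP", "before"), "ARG0"),
    (("NP", "before"), "ARG0"),
    (("NP", "after"), "ARG1"),
    (("PP", "after"), "ARG2"),
    (("DT", "before"), "NONE"),
]

# Step 1: argument identification (argument vs. non-argument).
identifier = MostFrequentLabel(default=False).fit(
    [(features, label != "NONE") for features, label in TRAIN]
)
# Step 2: role classification, trained on the true arguments only.
labeler = MostFrequentLabel(default="ARG1").fit(
    [(features, label) for features, label in TRAIN if label != "NONE"]
)


def label_roles(candidates):
    """Keep the candidates judged to be arguments and assign each a role."""
    return {
        text: labeler.predict(features)
        for text, features in candidates
        if identifier.predict(features)
    }


candidates = [
    ("The", ("DT", "before")),
    ("The Examiner", ("NP", "before")),
    ("a special edition", ("NP", "after")),
]
print(label_roles(candidates))
```

Splitting the task this way lets the first classifier discard the many constituents that are not arguments before the finer role decision is made.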

Gildea & Jurafsky, 2000, 2002
Path feature: NP↑S↓VP↓VBD
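The path feature records the route through the parse tree from the candidate constituent up to the lowest common ancestor and down to the predicate. A minimal sketch, assuming a small hand-built tree for a sentence with subject NP, verb, and object NP; the `Node` class and the function names are hypothetical, and a real system would read the tree from a parser or treebank.

```python
class Node:
    """Parse tree node with a parent pointer."""

    def __init__(self, label, children=()):
        self.label = label
        self.children = list(children)
        self.parent = None
        for child in self.children:
            child.parent = self


def ancestors(node):
    """Return the chain from the node itself up to the root."""
    chain = [node]
    while chain[-1].parent is not None:
        chain.append(chain[-1].parent)
    return chain


def path_feature(constituent, predicate):
    """Labels going up from the constituent, then down to the predicate."""
    up = ancestors(constituent)
    down = ancestors(predicate)
    down_ids = {id(node) for node in down}
    # Lowest common ancestor: first node on the upward chain that also dominates the predicate.
    up_index = next(i for i, node in enumerate(up) if id(node) in down_ids)
    lca = up[up_index]
    down_index = next(i for i, node in enumerate(down) if node is lca)
    up_labels = [node.label for node in up[: up_index + 1]]
    down_labels = [node.label for node in reversed(down[:down_index])]
    return "↑".join(up_labels) + "".join("↓" + label for label in down_labels)


# Simplified tree (assumed): [S [NP DT NNP] [VP VBD [NP DT JJ NN]]]
subject_np = Node("NP", [Node("DT"), Node("NNP")])
verb = Node("VBD")
object_np = Node("NP", [Node("DT"), Node("JJ"), Node("NN")])
vp = Node("VP", [verb, object_np])
sentence = Node("S", [subject_np, vp])

print(path_feature(subject_np, verb))
print(path_feature(object_np, verb))
```

The subject and the object get different paths to the same verb, which is what makes the feature informative for telling their roles apart.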

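Features
(Comparison columns on the slide: Jurafsky and Martin; Palmer et al. A single + means the feature is marked for only one of the two; the transcript does not preserve which.)

Feature             Value             Marks
Predicate           issued            +
Phrase type         NP (or NP-SBJ)    + +
Headword            Examiner          + +
Headword POS tag    NNP               +
Path                NP↑S↓VP↓VBD       + +
Voice               active            + +
Position            before            + +
Subcategorization   VP → NP PP        +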
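The feature values listed above can be collected into one record per candidate constituent. A minimal sketch: the example sentence and its tags are assumed (the slide shows only the predicate "issued" and the headword "Examiner"), the function name and dictionary keys are hypothetical, and path, voice, and subcategorization are passed in as given values rather than computed.

```python
def extract_features(tokens, pos_tags, span, head_index, predicate_index,
                     phrase_type, path, voice, subcat):
    """Build the feature record for one candidate constituent.

    span is (start, end) over token indices, with end exclusive.
    """
    start, end = span
    # Position: does the constituent come before or after the predicate?
    position = "before" if end <= predicate_index else "after"
    return {
        "predicate": tokens[predicate_index],
        "phrase_type": phrase_type,
        "headword": tokens[head_index],
        "headword_pos": pos_tags[head_index],
        "path": path,
        "voice": voice,
        "position": position,
        "subcat": subcat,
    }


# Assumed example sentence and part-of-speech tags.
tokens = ["The", "San", "Francisco", "Examiner", "issued",
          "a", "special", "edition", "around", "noon", "yesterday"]
pos_tags = ["DT", "NNP", "NNP", "NNP", "VBD",
            "DT", "JJ", "NN", "IN", "NN", "NN"]

subject_features = extract_features(
    tokens, pos_tags, span=(0, 4), head_index=3, predicate_index=4,
    phrase_type="NP", path="NP↑S↓VP↓VBD", voice="active", subcat="VP → NP PP",
)
object_features = extract_features(
    tokens, pos_tags, span=(5, 8), head_index=7, predicate_index=4,
    phrase_type="NP", path="NP↑VP↓VBD", voice="active", subcat="VP → NP PP",
)
print(subject_features)
```

Records of this kind are what a supervised role classifier would take as input, one per candidate constituent and predicate pair.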

Smoothing

Results (Palmer et al)

Alternative strategies
[Diagram. Node labels: String; Chunking; Role labeling; Role struct.; Tagging; PCFG parsing; Tree; Role labeling; Role struct.; Dependency parsing; Dep. struct.; Deep parsing; Semantic structures; Ranking]