Connectionist Models of Situated Language Processing Lecture 10 (part II) Introduction to Psycholinguistics

Matthew W. Crocker, Pia Knoeferle
Department of Computational Linguistics, Saarland University

Adaptive models of language

Why use SRNs?
Goals of model behavior:
- Incremental interpretation
- Anticipation of role fillers
- Influence of scene events
- Processing without the scene
- Resolution of multiple/conflicting constraints
- Relative importance of the scene
Adaptation to, and seamless integration of, multiple information sources.

Connectionist models
[Figure: network diagram with Input Units, Context Units, Hidden Units, and Output Units]
Purely connectionist models: Simple Recurrent Networks
Learn based on experience
- Supervised: trained to output a meaning representation
- Unsupervised: trained to predict the next word
Good at learning a range of linguistic constraints, and integrating multiple information sources
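The Simple Recurrent Network described above can be sketched in a few lines. This is a minimal illustration, assuming an Elman-style network in which the context units hold a copy of the previous hidden state; the layer sizes, the tanh/sigmoid activations, the random weights, and all names (SimpleRecurrentNetwork, step, one_hot) are assumptions for illustration, not details taken from the lecture. No training is shown.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Small random weight matrix, stored as a list of rows."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Matrix-vector product on plain Python lists."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class SimpleRecurrentNetwork:
    """Elman-style SRN: context units copy the previous hidden state."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        self.w_in = make_matrix(n_hidden, n_in, rng)       # input -> hidden
        self.w_ctx = make_matrix(n_hidden, n_hidden, rng)  # context -> hidden
        self.w_out = make_matrix(n_out, n_hidden, rng)     # hidden -> output
        self.context = [0.0] * n_hidden                    # empty context at sentence start

    def step(self, x):
        """Process one word vector and return the output activations."""
        net = [a + b for a, b in zip(matvec(self.w_in, x),
                                     matvec(self.w_ctx, self.context))]
        hidden = [math.tanh(h) for h in net]
        self.context = hidden  # copied back for the next word
        return [1.0 / (1.0 + math.exp(-o)) for o in matvec(self.w_out, hidden)]


def one_hot(index, size):
    """Localist word encoding (an assumption; the lecture does not specify one)."""
    return [1.0 if i == index else 0.0 for i in range(size)]


# Feed a sentence word by word; an output is available after every word,
# which is what makes incremental interpretation possible.
vocab = ["den", "Hasen", "frisst", "gleich", "der", "Fuchs"]
srn = SimpleRecurrentNetwork(n_in=len(vocab), n_hidden=8, n_out=4)
outputs = [srn.step(one_hot(i, len(vocab))) for i in range(len(vocab))]
```

Because the context is fed back at every step, the same word can produce different outputs depending on what preceded it.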

Network Architecture
Simple Recurrent Network trained with backpropagation through time (BPTT)
Enhanced with encoding of scene
- Entities: (characters)
- Events: (agent-action-patient)

Simulation 1
One network to model four experiments simultaneously
Exp 1a & 1b: Linguistic and stored knowledge
- No event information available
- 32 verbs, 48 nouns
- 96 extra nouns (avoid overfitting)
Exp 3a & 3b: Depicted actions
- Depicted events encoded
- 48 verbs, 72 nouns
- Extra unambiguous sentences

Training
Training data
- Generated from experimental materials as templates
- All combinations of referents
Test data
- Actual experimental sentences are held out
- All test sentences have matching scene
Training regime
- Trained on final interpretation
- Scene provided as context 50% of the time, to reflect experience and adaptation to the availability of the scene

Den Hasen frisst gleich der Fuchs
- Fox-like Agent anticipated based on experience and stereotypicality
- Fox Agent anticipated based on experience and scene
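The training regime above (templates filled with all combinations of referents, experimental sentences held out, scene supplied about half the time) could be sketched roughly as follows. The word lists, the fixed OVS template string, the function name generate_items, and the choice to decide scene availability by an independent coin flip per item are all assumptions for illustration; the lecture does not describe how the 50% split was implemented.

```python
import itertools
import random


def generate_items(patients, verbs, agents, held_out=(), scene_rate=0.5, seed=0):
    """Fill an OVS template with every referent combination.

    Sentences listed in `held_out` (the actual experimental items) are
    skipped so they can be reserved for testing.
    """
    rng = random.Random(seed)
    items = []
    for patient, verb, agent in itertools.product(patients, verbs, agents):
        sentence = f"Den {patient} {verb} gleich der {agent}"
        if sentence in held_out:
            continue
        # Target is the final interpretation of the sentence.
        target = {"agent": agent, "action": verb, "patient": patient}
        # Assumption: the matching scene is supplied with probability scene_rate.
        scene = dict(target) if rng.random() < scene_rate else None
        items.append({"sentence": sentence, "scene": scene, "target": target})
    return items


# Hypothetical mini-lexicon; nouns are given in their already inflected forms.
items = generate_items(
    patients=["Hasen", "Piloten"],
    verbs=["frisst", "bespitzelt"],
    agents=["Fuchs", "Detektiv"],
    held_out={"Den Hasen frisst gleich der Fuchs"},
)
```

An alternative design would present every sentence twice, once with and once without its scene, which gives an exact 50/50 split instead of an expected one.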

die Prinzessin waescht der Pirat
- Princess anticipated because it is the only feminine object depicted
- Processing of "washes" enables recovery of all event information from depicted event
- Processing of nominative article "der" establishes role of Princess as Patient

Simulation 1: Performance
Over 95% accuracy on verb (anticipation) and NP2
- Exp 1 & 2: stored knowledge; no depicted events
- Exp 3 & 4: depicted events; no stored knowledge

A Connectionist Model of Scene & Sentence
Trained to model materials from 5 visual world studies
SRN + Scene
Successfully models the use of:
- experience
- immediate scene
- sentence alone
- priority of the scene
Exhibits anticipatory behaviour
What about modeling attention in the scene?
Mayberry & Crocker, CUNY, 2005. Mayberry, Crocker & Knoeferle, 2005a,b,c.

Coordinated Interplay Account
Utterance
Incremental Interpretation:
- Prosodic, lexical, syntactic, semantic constraints
Forward Inferencing:
- knowledge-driven expectations
- scene-driven by events and affordances
Priority of scene events over stored knowledge
Visual Scene
Referential Context:
- objects in the scene
Scene events:
- depicted actions and participants
Affordances:
- events and relationships supported by the scene
Attention
Referential:
- rapid (180 ms) looks to depicted noun and verb referents
- parallel lexical access
Anticipatory:
- looks to object likely to be mentioned based on scene or knowledge
Priority of anticipatory over lexical identification

Revised Architecture
[Figure: network diagram; labels: "Zauberer bespitzelt Pilot", gate(t-1), "Detektiv verkoestigt Pilot", input layer, context layer, "gleich", hidden layer, "bespitzelt Pilot Zauberer PAT", gate(t)]
- Flatter representation of scenes/events
- Objects and events input directly to the hidden layer
- Attentional mechanism: gating units mask the scene inputs
- Bind arguments and events

Predicting Human Behavior
Model use of knowledge, events and scene priority.
Single corpus covers two experiments
- 16 verbs and 16 nouns
- 7008 unique input sentences
- Scenes feature 3840 unique events, yielding
- ~250,000 joint events obeying above constraints
Network never trained on conflict conditions of Exp 5
50/50: SVO/OVS, Scene/NoScene
Depicted:stereotypical ratio is 4:1
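The attentional mechanism on the Revised Architecture slide above says only that gating units mask the scene inputs. One way such masking might work is sketched below, assuming two depicted events encoded as equal-length vectors and a gate vector with values between 0 and 1 that scales one event while its complement scales the other. The function name gate_scene, the vector encoding, and the complementary use of the gate are illustrative assumptions, not details stated in the lecture.

```python
def gate_scene(event_a, event_b, gate):
    """Mask two scene events with a gate vector.

    Each gate value g in [0, 1] scales event_a by g and event_b by (1 - g),
    so attention shifting toward one event suppresses the other.
    """
    if not (len(event_a) == len(event_b) == len(gate)):
        raise ValueError("events and gate must have the same length")
    masked_a = [g * a for g, a in zip(gate, event_a)]
    masked_b = [(1.0 - g) * b for g, b in zip(gate, event_b)]
    return masked_a, masked_b


# Hypothetical 4-unit event encodings.
event_a = [1.0, 0.0, 1.0, 0.0]  # e.g. "Zauberer bespitzelt Pilot"
event_b = [0.0, 1.0, 0.0, 1.0]  # e.g. "Detektiv verkoestigt Pilot"

# A fully open gate attends to event A and masks event B entirely.
attend_a, suppress_b = gate_scene(event_a, event_b, gate=[1.0] * 4)

# A neutral gate passes both events at half strength.
half_a, half_b = gate_scene(event_a, event_b, gate=[0.5] * 4)
```

In a full model the gate values would be computed by the network at each time step, with gate(t-1) fed back as input; here they are set by hand to show only the masking operation.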

Use of Stored Knowledge
No Competition
Depict: Den Piloten verköstigt gleich der Detektiv
  "The pilot serves soon the detective"
Typical: Den Piloten verzaubert gleich der Zauberer
  "The pilot jinxes soon the magician"

Use of Scene Knowledge
No Competition
Depict: Den Piloten verköstigt gleich der Detektiv
  "The pilot serves soon the detective"
Typical: Den Piloten verzaubert gleich der Zauberer
  "The pilot jinxes soon the magician"

Priority of the Scene over Knowledge
Competition:
Typical: Den Piloten bespitzelt gleich der Detektiv
  "The pilot spies on soon the detective"
Depict: Den Piloten bespitzelt gleich der Zauberer
  "The pilot spies on soon the magician"