Introduction to Natural Language Processing: L100a

Similar documents
Special Topics in Computer Science

CS 6740 / INFO Ad-hoc IR. Graduate-level introduction to technologies for the computational treatment of information in humanlanguage

Module Catalogue for the Bachelor Program in Computational Linguistics at the University of Heidelberg

Automatic Text Analysis Using Drupal

English Grammar Checker

Developing Academic Language Skills to Support Reading and Writing. Kenna Rodgers February, 2015 IVC Series

Introduction. BM1 Advanced Natural Language Processing. Alexander Koller. 17 October 2014

Paraphrasing controlled English texts

Information extraction from online XML-encoded documents

Introduction to formal semantics -

An Overview of Applied Linguistics

SOCIS: Scene of Crime Information System - IGR Review Report

Processing: current projects and research at the IXA Group

INF5820 Natural Language Processing - NLP. H2009 Jan Tore Lønning jtl@ifi.uio.no

Teaching Formal Methods for Computational Linguistics at Uppsala University

Any Town Public Schools Specific School Address, City State ZIP

English Descriptive Grammar

Outline of today s lecture

Empirical Machine Translation and its Evaluation

International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November ISSN

A cloud-based editor for multilingual grammars

Parsing Technology and its role in Legacy Modernization. A Metaware White Paper

NATURAL LANGUAGE QUERY PROCESSING USING PROBABILISTIC CONTEXT FREE GRAMMAR

A Mixed Trigrams Approach for Context Sensitive Spell Checking

Architecture of an Ontology-Based Domain- Specific Natural Language Question Answering System

Natural Language to Relational Query by Using Parsing Compiler

Natural Language Processing. What s this story about?

Semantic analysis of text and speech

Ling 201 Syntax 1. Jirka Hana April 10, 2006

Linguistics 2288B Introductory General Linguistics

Study Plan for Master of Arts in Applied Linguistics

Psychology G4470. Psychology and Neuropsychology of Language. Spring 2013.

Syntactic Theory. Background and Transformational Grammar. Dr. Dan Flickinger & PD Dr. Valia Kordoni

How the Computer Translates. Svetlana Sokolova President and CEO of PROMT, PhD.

NATURAL LANGUAGE TO SQL CONVERSION SYSTEM

Bachelor in Deaf Studies

COURSE SYLLABUS ESU 561 ASPECTS OF THE ENGLISH LANGUAGE. Fall 2014

Towards a RB-SMT Hybrid System for Translating Patent Claims Results and Perspectives

ÄSSA12, No English Translation Available, 30 credits Svenska som andraspråk 1 A, gy, 30 högskolepoäng First Cycle / Grundnivå

MASTER OF PHILOSOPHY IN ENGLISH AND APPLIED LINGUISTICS

M LTO Multilingual On-Line Translation

Latin Syllabus S2 - S7

What is Artificial Intelligence?

Please see current textbook prices at

Overview of MT techniques. Malek Boualem (FT)

Timeline (1) Text Mining Master TKI. Timeline (2) Timeline (3) Overview. What is Text Mining?

Master of Science in Computer Science

Testing Data-Driven Learning Algorithms for PoS Tagging of Icelandic

Extraction of Legal Definitions from a Japanese Statutory Corpus Toward Construction of a Legal Term Ontology

Natural Language Database Interface for the Community Based Monitoring System *

The course is included in the CPD programme for teachers II.

THE BACHELOR S DEGREE IN SPANISH

European Masters Program in Language and Communication Technologies (LCT) Module Handbook for Prospective Students

An Interactive Hypertextual Environment for MT Training

31 Case Studies: Java Natural Language Tools Available on the Web

Course Syllabus My TOEFL ibt Preparation Course Online sessions: M, W, F 15:00-16:30 PST

University of Massachusetts Boston Applied Linguistics Graduate Program. APLING 601 Introduction to Linguistics. Syllabus

Conley, D. T. (2005). College Knowledge: What it Really Takes for Students to Succeed and What We Can Do to Get Them Ready

Compiler I: Syntax Analysis Human Thought

Statistical Machine Translation

Overview of the TACITUS Project

Classification of Natural Language Interfaces to Databases based on the Architectures

n Introduction n Art of programming language design n Programming language spectrum n Why study programming languages? n Overview of compilation

Marie Dupuch, Frédérique Segond, André Bittar, Luca Dini, Lina Soualmia, Stefan Darmoni, Quentin Gicquel, Marie-Hélène Metzger

Specialty Answering Service. All rights reserved.

Course Description (MA Degree)

The Cambridge English Scale explained A guide to converting practice test scores to Cambridge English Scale scores

COURSES IN ENGLISH AND OTHER LANGUAGES AT THE UNIVERSITY OF HUELVA (update: 24 th July 2014)

Introduction. Philipp Koehn. 28 January 2016

2. The NLP-Resources for Automatic Correction

Reading Competencies

Lecture 1. Basic Concepts of Set Theory, Functions and Relations

Application of Natural Language Interface to a Machine Translation Problem

Basic Parsing Algorithms Chart Parsing

A Framework-based Online Question Answering System. Oliver Scheuer, Dan Shen, Dietrich Klakow

Learning Translation Rules from Bilingual English Filipino Corpus

Semantic Analysis of Natural Language Queries Using Domain Ontology for Information Access from Database

Professional Learning Guide

CS4025: Pragmatics. Resolving referring Expressions Interpreting intention in dialogue Conversational Implicature

UNIVERSITY OF PUERTO RICO RIO PIEDRAS CAMPUS COLLEGE OF HUMANITIES DEPARTMENT OF ENGLISH

School of Education. EDST5101: Interventionist and Experimental Design and Analysis. Semester 1

Functional Auditory Performance Indicators (FAPI)

Compilers. Introduction to Compilers. Lecture 1. Spring term. Mick O Donnell: michael.odonnell@uam.es Alfonso Ortega: alfonso.ortega@uam.

Why major in linguistics (and what does a linguist do)?

customer care solutions

The Knowledge Sharing Infrastructure KSI. Steven Krauwer

According to the Argentine writer Jorge Luis Borges, in the Celestial Emporium of Benevolent Knowledge, animals are divided

CELTA. Syllabus and Assessment Guidelines. Fourth Edition. Certificate in Teaching English to Speakers of Other Languages

An Introduction to Language Processing with Perl and Prolog

Open Domain Information Extraction. Günter Neumann, DFKI, 2012

Acquisition Lesson Plan for the Concept, Topic or Skill---Not for the Day

ENGLISH LANGUAGE. A Guide to co-teaching The OCR A and AS level English Language Specifications. A LEVEL Teacher Guide.

: Introduction to Machine Learning Dr. Rita Osadchy

Syntax: Phrases. 1. The phrase

USABILITY OF A FILIPINO LANGUAGE TOOLS WEBSITE

Report Writing: Editing the Writing in the Final Draft

Language Meaning and Use

D2.4: Two trained semantic decoders for the Appointment Scheduling task

Flattening Enterprise Knowledge

CHARTES D'ANGLAIS SOMMAIRE. CHARTE NIVEAU A1 Pages 2-4. CHARTE NIVEAU A2 Pages 5-7. CHARTE NIVEAU B1 Pages CHARTE NIVEAU B2 Pages 11-14

Transcription:

Introduction to Natural Language Processing: L100a Lecture 1: Introduction Ann Copestake Computer Laboratory University of Cambridge October 2012

Outline of today s lecture Some NLP applications NLP and linguistics Course structure Examples of analysis To do list

Some NLP applications Question answering IBM Watson: Jeopardy! Category: US cities. Its largest airport was named for a World War II hero; its second largest, for a World War II battle. Answer: Chicago

Some NLP applications Question answering IBM Watson: Jeopardy! Category: US cities. Its largest airport was named for a World War II hero; its second largest, for a World War II battle. Answer: Chicago

Some NLP applications Analysis of scientific text The objective of this research was to evaluate techniques for the rapid detection of chromosomal alterations occurring in humans exposed to butadiene. e.g., CRAB project, Korhonen.

Some NLP applications Assessing and teaching English Prompt: Ricky stores his toys in the closet. Where are Ricky s toys?. Child s task: create a grammatically correct answer from a set of words provided (organised by part of speech). System s task: check that the answer is grammatically correct and plausible, and, if possible, provide feedback on mistakes. Flickinger: EPGY, Stanford, Language Arts and Writing for Grades 2 6

Some NLP applications Language learning examples Some correct answers: Ricky s toys are in the closet Ricky s toys are in his closet they are in his closet the toys are in Ricky s closet Ricky s toys are in Ricky s closet Some incorrect answers: Ricky s are his toys in his closet in in in in Ricky s in Ricky s Ricky s Ricky s toys are in the garden

Some NLP applications Simplifying text Example from Siddharthan (2011): The original police inquiry, which led to Mulcaire being jailed in 2007, also discovered evidence that he has successfully intercepted voicemail messages belonging to Rebekah Brooks, who was editor of the Sun when Mulcaire was working exclusively for its Sunday stablemate. The original police inquiry led to Mulcaire being jailed in 2007. The police inquiry also discovered evidence that he has successfully intercepted voicemail messages belonging to Rebekah Brooks. Rebekah Brooks was editor of the Sun. This was when Mulcaire was working exclusively for its Sunday stablemate. Advaith Siddharthan. Text Simplification using Typed Dependencies: A Comparision of the Robustness of Different

Some NLP applications Simplifying text Example from Siddharthan (2011): The original police inquiry, which led to Mulcaire being jailed in 2007, also discovered evidence that he has successfully intercepted voicemail messages belonging to Rebekah Brooks, who was editor of the Sun when Mulcaire was working exclusively for its Sunday stablemate. The original police inquiry led to Mulcaire being jailed in 2007. The police inquiry also discovered evidence that he has successfully intercepted voicemail messages belonging to Rebekah Brooks. Rebekah Brooks was editor of the Sun. This was when Mulcaire was working exclusively for its Sunday stablemate. Advaith Siddharthan. Text Simplification using Typed Dependencies: A Comparision of the Robustness of Different

NLP and linguistics Analysis of text Analysis of text is the core of NLP applications. Level and type of analysis attempted depends on application: Depth of analysis? Meaning? Explicit linguistic analysis vs surface-based techniques? Precision? e.g., testing for grammaticality vs processing noisy data. NLP is moving away from pipelined models: syntactic and semantic analysis used along with surface analysis techniques. Also work on language generation and regeneration.

NLP and linguistics NLP and linguistics The computational modelling of human language (NLP/computational linguistics/cl) 1. Morphology the structure of words. 2. Syntax the way words are used to form phrases. 3. Semantics Compositional semantics the construction of meaning based on syntax. Lexical semantics the meaning of individual words. 4. Pragmatics meaning in context.

Course structure Language and Speech courses in the ACS L100: Introduction to NLP (L100a and b) L106: Spoken Language Processing L113: Word meaning and discourse understanding R211: Biomedical informations (half module) R207: Language and Concepts L101: Machine learning for language processing (L106 is prerequisite) L102: Statistical machine translation L107: Syntax and semantics of natural languages (NB: logic required)

Course structure Syllabus for L100 Linguistics for NLP morphology, syntax, semantics and pragmatics [6 lectures, AAC] Finite-State/Markovian Techniques lemmatisation, part-of-speech (PoS) tagging, phrase chunking and named entity recognition (NER) [4 lectures, SC] Parsing grammars, treebanks, representations and evaluation, statistical parse ranking [4 lectures, SC] Interpretation compositional semantics and entailment, pragmatic inference [2 lectures, AAC] Two practicals.

Course structure Resources for L100a On website: slides (on website after lecture). ejb s handouts: assigned reading before each lecture (do exercises!) introductory logic worksheet assessed exercises Possibly useful background: NLP Part II lecture notes. Recommended Book: Jurafsky and Martin (2009) See also suggestions at the end of the handouts.

Course structure Assessment for L100 Four ticked assignments: first due at 10am October 15 (hand in to student admin). Each worth 5%. Practical reports, due next term.

Course structure Plan for L100a 1. Introduction (this lecture). 2. Morphology and lexical categories (this wednesday). 3. Syntax 1 Review of assignment 1. Syntax. 4. Syntax 2 Review of assignment 2. Using context free grammars. 5. Syntax 3 Review of assignment 3. Grammatical relations. 6. Compositional semantics Review of assignment 4. Semantics. 7. Semantics and pragmatics. 8. Topics arising...

Examples of analysis Examples of analysis Chromosomal alterations were detected in humans exposed to butadiene. Morphological analysis and POS assignment Noun phrase bracketing Syntax tree Grammatical relations Logical form

To do list To do Before the next lecture (Wednesday): read sections 1 4 of Introduction to Linguistics handout and do the associated exercises. Start logic worksheet.