English influence on written Standard Italian: A new concept of regional dialect

Similar documents
Vernacular Language Loyalty and Social Network

MISSION STATEMENT 01. The Maccorp Group was founded in Italy in 1990, opening its first bureau de change in Venice in 1991.

The must of Italy in six nights

ANALYSIS OF LAND USE AND MOBILITY SCENARIOS FOR THE REDUCTION OF TRANSPORT ENERGY IN THE URBAN AREA OF CATANIA

OPENTRACK: Simulation of complex nodes

Ranking Analysis. file://c:\programmi\web CEO\Cache\WCSE\{5CD4ADC5-1EEA-4D77-BB59-10F64F4F2CB4}\WCSE_report.htm

Enterprise digital architects

Timetable sita bus from the airport Venice (Tessera) Maro Polo to Padova station and back to the airport.

Grandi Stazioni Retail. A unique investment opportunity in the Italian travel retail sector

Master of Arts Program in Linguistics for Communication Department of Linguistics Faculty of Liberal Arts Thammasat University

Germany/Austria/Hungary/Italy. railjet + DB/ÖBB EuroCity. Erwin Kastberger. Copenhagen, September 13, 2013

Rai Way. The road to travel in the communication world.

C E D A T 8 5. Innovating services and technologies for speech content management

Student Details. Student ID e Date of Birth 10/02/1989 Place of Birth INDIA Nationality Italian. Education. Turin. Berlin

MiSSiON StAtEMENt 02. The Maccorp group was founded in Italy in 1990, opening its first bureau de change in venice in 1991.

209 THE STRUCTURE AND USE OF ENGLISH.

Combining Social Data and Semantic Content Analysis for L Aquila Social Urban Network

Money Laundering in the Real Estate sector:

Gruppo SCAI. Company Profile. System Integration and Business Development ICT solutions and services, outsourcing, cloud computing

How To Make A Sustainable Energy

Best regards President of Italian Dance Sport Federation Christian Zamblera

Banca Popolare Etica The highest interest is for all. Alessandro Celoni Banca popolare Etica Regional Area Manager Northeast Italy

UNICA Rome Meeting 6-7/12/12. Nicola Vittorio Vice Rector for Education University of Rome Tor Vergata

Prof. Elisabetta Cerbai University of Florence, Vice-Chancellor for Research Sapienza University of Rome, NVA Coordinator

LOGISTICAL INFORMATION

Via Provinciale Lipomo Como Italy Tel Fax Skype: sportstours1

STATISTICAL LABORATORY, USING R FOR BASIC STATISTICAL ANALYSIS

MIDO February 28-March 1-2, 2015

Today Eurocrea is organized in three different business units: Eurocrea Consulting; Eurocrea Financial Services; Eurocrea Merchant.

MIDO Fiera Milano 1 st 3 rd March 2014

Delegated to CNR on December 23rd, New synchronous registration system from September 28 th, 2009

Curriculum Vitae et Studiorum

DBA Group Srl Telecommunication & IT Infrastructures Power & Energy

Incoming Partners. Football packages in Italy 2009/2010 Season Milan e Turin

National Awarding Committee (NAC) for EuroPsy in Italy: Overview

Preliminary results of scanner data analysis and their use to estimate Italian inflation

BERLINO, MARCH PADOVA, APRIL 1-2 TORINO, APRIL BOLOGNA, JUNE 4-5 FIRENZE, JULY 8-9 MILANO, OCTOBER NAPOLI, DECEMBER

How to reach. the Research Centre on Sea Technologies

MILANO. 265 (minimum 30 paying) 270 (minimum 30 paying) 245 (minimum 40 paying) 245 (minimum 40 paying) Milano 1 day Expo 2015 Leonardo da Vinci

Exploratory Spatial Data Analysis

An introduction to peering in Italy

Table 1. Daily newspapers: national circulation (2012)

Using NLP and Ontologies for Notary Document Management Systems

Digital performance of Italy

2 OFFICES BOLOGNA MILAN VENICE TURIN ROME GENOA RAVENNA BOLOGNA MILAN VENICE TURIN ROME GENOA RAVENNA

How To Help A Refugee In Italy

ABSOLUTE SECURITY. IVRI Group. Corporate Profile December 2012

New Frontiers. in Glaucoma. Management. Rome, 7 February 2014 Donna Camilla Savelli Hotel

How To Get An Itinerant Masters Degree

The Italian high quality glass design for contemporary public and private spaces

ITALY TOUR & OPERA 1 Roma: 2 days Firenze: 2 days Verona: ARENA OPERA Venice: 2 days Milano: 2 day Groups 25/50pax

Master of Arts in Teaching English to Speakers of Other Languages (MA TESOL)

Yard Group. Mortgage valuations. Title search Property documentation. Technical Due Diligence Construction, as-built surveys Valuations.

AN OVERVIEW OF THE UK OUTBOUND MARKET

Who we are. We are 17,000 men and women who form a cooperative banking group with 1,800 branches.

Portuguese as Foreign Language Level B.2

Doctoral Program in Social Sciences and Statistics

Consolato dell Oman a Milano

CLAUDIO ROSSETTI Curriculum Vitæ. Place of birth: Rome, Italy. Date of birth: April 12,

Logistical Information

Unlocking Value from. Patanjali V, Lead Data Scientist, Tiger Analytics Anand B, Director Analytics Consulting,Tiger Analytics

BRIXEN WORKSHOP & SUMMER SCHOOL ON INTERNATIONAL TRADE AND FINANCE

TRENTINO MICE CONVENTION BUREAU

CURRICULUM VITAE CLARA GRAZIANO

Processes of urban regionalization in Italy: a focus on mobility practices explained through mobile phone data in the Milan urban region

Curriculum Vitae. Department of Economics and Quantitative Methods University of Lecce Ecotekne, via per Monteroni, Lecce

UniCredit Art Collection Italy, Deutsche Bank Collection Germany, Westfalenbank Collection Germany, DZ BANK Art Collection Germany

ACCADEMIA ITALIANA DI LINGUA

Student Details. Student ID e Date of Birth 16/04/1986 Place of Birth Aosta Nationality Italian. Education. Turin (Eng) Berlin (Eng)

OUR HISTORY. ML ARCHITETTURA is the new company created by. firm which was founded by him and boasts over 25 years of experience

Doe wat je niet laten kan: A usage-based analysis of Dutch causative constructions. Natalia Levshina

EMMEGI is an international press agency based in Florence and gathering freelance reporters both from Italy and from abroad.

Programma LLP/Erasmus Intensive Programme finanziati a.a. 2013/2014

Transcription:

English influence on written Standard Italian: A new concept of regional dialect Costanza Asnaghi QLVL Research Unit - KU Leuven

The Survey Goals: determine if Neostandard Written Italian contains regional variation in the use of Anglicisms; map and describe these written dialects of Neostandard Italian Basis: a computational analysis of lexical variation in Italian online newspapers

Standard Italian vs. Italian Dialects Italian dialects may refer to: - regional languages of Italy - regional Italian, varieties of the Italian language e.g. Standard Italian stiamo arrivando (we are arriving) Venetian sémo drio rivàr regional Italian of Venice stémo rivando

source: RAI website Regional languages of Italy

Standard Italian vs. Italian Dialects Italian dialects may refer to: focus of this research - regional languages of Italy - regional Italian, varieties of the Italian language e.g. Standard Italian stiamo arrivando (we are arriving) Venetian sémo drio rivàr regional Italian of Venice stémo rivando

Standard - Neostandard Processes of restandardization at work in different European languages Vernacular varieties are moving up to function in domains that were previously associated with standard varieties (e.g. Auer 2005, 2011; Auer et al. 2005; Berruto 2005; Kristiansen and Coupland 2011).

Neostandard Italian Italian language currently spoken in the entire territory of the country that differs from the language described in grammars (Berruto 1987). Language characteristics differing from the language described in grammars that are widespread not only in spoken but also in written language (official documents and academic prose excluded), e.g. newspaper language (Tavoni 2005).

Neostandard Italian Neostandard Italian includes many lexical terms borrowed from English. Assumption of present research: the distribution of usage for borrowings varies on a regional basis.

Italy - Facts Area: 301,230 km 2 12th

Italy - Facts 5th Population: 59,530,464

Italy - Facts 20 administrative regions

Italy - Facts 110 provinces

Italy - Facts 110 provinces 102 surveyed in this research + 2 Swiss provinces

Italian-speaking areas in Switzerland Canton Ticino Cantone dei Grigioni

Milano Italy - Facts Torino population density per province Genova Firenze Roma Venezia Bologna Bari Napoli Palermo

Methods of Data Collection

Methods of Data Collection Traditional Postal questionnaires e.g. Davis 1948; fieldworker interviews e.g. Kurath 1939-43; telephone interviews e.g. Labov et al 2006; traditional corpus-based techniques e.g. Szmrecsanyi 2008, Grieve et al 2011.

Methods of Data Collection Traditional Postal questionnaires e.g. Davis 1948; fieldworker interviews e.g. Kurath 1939-43; telephone interviews e.g. Labov et al 2006; traditional corpus-based techniques e.g. Szmrecsanyi 2008, Grieve et al 2011. New Site-restricted web searches Grieve et al. 2013 based on a collection of frequencies of lexical alternations in online newspapers A lexical dialect survey requires an extremely large corpus or a large number of interviews lexical words do not appear often

Sociolinguistic Variable The option of saying the same thing in several different ways: that is, the variants are identical in reference or truth value, but opposed in their social and/or stylistic significance. (Labov 1972)

Sociolinguistic Variable The option of saying the same thing in several different ways: that is, the variants are identical in reference or truth value, but opposed in their social and/or stylistic significance. (Labov 1972) underlying concept

Onomasiological background a retail store offering discounted merchandise

Onomasiological background a retail store offering discounted merchandise spaccio aziendale outlet

Onomasiological background a retail store offering discounted merchandise spaccio aziendale outlet a cloud of dirty air from cars, factories, etc., that is usually found in cities

Onomasiological background a retail store offering discounted merchandise spaccio aziendale outlet a cloud of dirty air from cars, factories, etc., that is usually found in cities inquinamento smog

Lexical Alternation Variables 68 word alternations Anglicism vs. indigenous Italian expression

annual/annuale attachment/allegato audience/pubblico badge/distintivo basket/pallacanestro boss/capo brand/marca,marche break/pausa,pause business/affari,affare company/azienda,aziende competitor/concorrente,concorrenti copyright/diritto d autore,diritti d autore cordless,wireless/senza fili corner/angolo coupon/tagliando,tagliandi cult/di culto -luogo di culto,luoghi di culto deficit/disavanzo,disavanzi device,gadget/dispositivo,dispositivi download/scaricare exit poll/sondaggio elettorale,sondaggi elettorali fair play/correttezza fan page,fanpage/pagina degli ammiratori,pagina fan,pagina dei fan fashion/moda,mode fiction,soap opera,telefilm/ sceneggiato,sceneggiati film/pellicola flop/insuccesso,insuccessi flyer/volantino,volantini follower/seguace,seguaci food/cibo,cibi freelance/libero professionista,libera professionista,liberi professionisti gay/omosessuale,omosessuali girlfriend,girlfriends/fidanzata,fidanzate,mia ragazza,tua ragazza, sua ragazza gossip/pettegolezzo,pettegolezzi killer/sicario,sicari,assassino,assassini light/leggero,leggera,leggeri,leggere lobby/ combriccola,combriccole,combutta,combutte,cricca,cr icche love story,love stories/storia d amore,storie d amore made in Italy/fatto in Italia,fatti in Italia,prodotto in Italia,prodotti in Italia,prodotto italiano,prodotti italiani management/gestione,gestioni manager/dirigente,dirigenti mass media/mezzi di comunicazione di massa meeting/ riunione,riunioni,appuntamento,appuntamenti mobbing/molestia sul lavoro,molestie sul lavoro new economy/nuova economia,nuove economie new entry/nuovo entrato,nuova entrata news/ notizia,notizie,telegiornale,telegiornali,radiogiornale,radiogio rnali nickname/soprannome,soprannomi no comment/nessun commento nursery/asilo nido,asili nido ok/via libera outlet/spaccio aziendale,spacci aziendali part time/orario ridotto,lavoro mezza giornata,lavoro a mezza giornata,lavorare mezza giornata,lavora mezza giornata,lavorano mezza giornata,lavori a mezza giornata penalty/rigore poster/manifesto,manifesti screenshot/immagine dello schermo,immagini dello schermo,fotografia dello schermo,fotografie dello schermo,foto dello schermo slogan/motto,motti smart/intelligente,intelligenti smog/inquinamento,inquinamenti snob/ distinto,distinti,distinta,distinte,eccentrico,eccentrici,ecce ntriche,eccentrica,raffinato,raffinati,raffinata,raffinate,sofisticat o,sofisticati,sofisticata,sofisticate,ricercato,ricercati,ricercata,ri cercate social/sociale,sociali speaker/presentatore,presentatori t-shirt,tshirt,t-short/maglietta,magliette team/gruppo,squadra teenager/adolescente,adolescenti test/prova,prove ticket/contributo USA/Stati Uniti zapping/cambiare canale

Newspaper Selection Sample for this survey: modern newspaper register of Italian as published in mainstream newspapers from across Italy 433 Italian newspapers from 104 Italian (and Swiss) provinces Why newspapers? newspapers are plentiful, freely available in machine readable form, written in Neostandard Italian, and annotated for their place of publication.

Data Extraction (1/3) For each variant of a lexical alternation, the number of pages containing that variant in a series of city newspaper websites was counted.

Example

Data Extraction (2/3) A Python script was used to automatically query online search engines and extract the number of hits from the html source code for the results page.

Data Extraction (3/3) After data collection, the alternation was measured quantitatively as a proportion.

17,800 = 0.72505 17,800 + 6,750

Method Evaluation

Method Evaluation Grieve, Asnaghi, Ruette 2013 Site-restricted web searches for data collection in regional dialectology. American Speech 88

Method Evaluation Grieve, Asnaghi, Ruette 2013 Site-restricted web searches for data collection in regional dialectology. American Speech 88 Harvard Dialect Survey Site-Restricted Web Searches Survey

Method Evaluation frosting vs. icing Grieve, Asnaghi, Ruette 2013 Site-restricted web searches for data collection in regional dialectology. American Speech 88 Harvard Dialect Survey Site-Restricted Web Searches Survey

Method Evaluation Grieve, Asnaghi, Ruette 2013 Site-restricted web searches for data collection in regional dialectology. American Speech 88 frosting vs. icing!"#$%&'()(*+,&'(-./"0/"1(2,/34+5(67"0489( Harvard Dialect Survey http://www4.uwm.edu/fll/linguistics/dialect/staticmaps/q_94.gif :45;#1$(<=>(?#&1#&>(@&5/",#( Site-Restricted Web Searches Survey

Method Application Asnaghi 2013 An analysis of regional lexical variation in California English using site-restricted web searches PhD dissertation

Method Application buddy vs. pal Asnaghi 2013 An analysis of regional lexical variation in California English using site-restricted web searches PhD dissertation

Method Evaluation: Cons Variants should be relatively unambiguous - polysemous words - homonymous words - idioms - proper names - unique localisms - others

tunnel vs. galleria

tunnel vs. galleria galleria d arte, polysemous of galleria (tunnel)

tunnel vs. galleria galleria Vittorio Emanuele, localism

Method Evaluation: Pros Regional patterns can be identified despite the noise thanks to the quantity of data and the advanced statistics. Based on these results, this approach is both a valid and efficient method for gathering data on regional lexical variations.

Creek/Stream Proportion Maps Alternation Anglicism is relatively more frequent < 0.989 < 0.859 < 0.73 < 0.6 < 0.47 indigenous Italian expression is relatively more frequent < 0.341

Creek/Stream Proportion Maps Alternation Anglicism is relatively more frequent < 0.989 made in Italy < 0.859 < 0.73 < 0.6 < 0.47 indigenous Italian expression is relatively more frequent < 0.341 fatto in Italia

Creek/Stream Proportion Maps Alternation Anglicism is relatively more frequent < 0.989 made in Italy < 0.859 < 0.73 < 0.6 < 0.47 indigenous Italian expression is relatively more frequent < 0.341 fatto in Italia

Local Spatial Autocorrelation high cluster dispersion low cluster

Dad/Father, Autocorrelated Maps Gi z-scores Anglicism is relatively more frequent > 3.29 > 2.58 > 1.96 > 1.64 > 1.00 < -1.00 < -1.64 < -1.96 < -2.58 < -3.29 indigenous Italian expression is relatively more frequent

Dad/Father, Autocorrelated Maps Gi z-scores Anglicism is relatively more frequent > 3.29 made in Italy > 2.58 > 1.96 > 1.64 > 1.00 < -1.00 < -1.64 < -1.96 < -2.58 fatto in Italia < -3.29 indigenous Italian expression is relatively more frequent

Dad/Father, Autocorrelated Maps Gi z-scores Anglicism is relatively more frequent > 3.29 made in Italy > 2.58 > 1.96 > 1.64 > 1.00 < -1.00 < -1.64 < -1.96 < -2.58 fatto in Italia < -3.29 indigenous Italian expression is relatively more frequent

smog inquinamento

outlet spaccio aziendale

Factor Analysis

Factor Analysis reduces the complexity of the variables by identifying latent factors in the data

Factor Analysis reduces the complexity of the variables by identifying latent factors in the data

Factor Analysis reduces the complexity of the variables by identifying latent factors in the data

Factor Maps 1 Factor Scores > positive 1.50 loading > 1.00 > 0.50 > 0.00 > -0.50 > -1.00 > -1.50 negative -1.50 loading Factor 1 23.5 %

Factor Maps 1 Factor Scores > positive 1.50 loading > 1.00 > 0.50 > 0.00 > -0.50 > -1.00 > -1.50 negative -1.50 loading Factor 1 23.5 %

Factor 1 23.5 %

Factor 1 23.5 %

Factor 2 14.5 %

Factor 3 11.3 %

Factor 4 8.3 %

Factor 5 6.9 %

Factors 1,2,3 49.3 %

Factors 1,2,3 49.3 %

Factors 1,2,3 source: RAI website 49.3 %

Thank you! English influence on written Standard Italian: A new concept of regional dialect Costanza Asnaghi QLVL Research Unit - KU Leuven