Teaching terms: a corpus-based approach to terminology in ESP classes



Similar documents
THE UNIVERSITY OF BIRMINGHAM. English Language & Applied Linguistics SECOND TERM ESSAY

FOREIGN LANGUAGE PROGRAMME PREPARATION FOR THE TOEIC EXAM

ICAME Journal No. 24. Reviews

AntConc: Design and Development of a Freeware Corpus Analysis Toolkit for the Technical Writing Classroom

The Oxford Learner s Dictionary of Academic English

Level 4 Certificate in English for Business

Reading Listening and speaking Writing. Reading Listening and speaking Writing. Grammar in context: present Identifying the relevance of

EFL Learners Synonymous Errors: A Case Study of Glad and Happy

ONLINE ENGLISH LANGUAGE RESOURCES

CHARTES D'ANGLAIS SOMMAIRE. CHARTE NIVEAU A1 Pages 2-4. CHARTE NIVEAU A2 Pages 5-7. CHARTE NIVEAU B1 Pages CHARTE NIVEAU B2 Pages 11-14

The Facilitating Role of L1 in ESL Classes

Teaching Vocabulary to Young Learners (Linse, 2005, pp )

English Descriptive Grammar

Hugh Trappes-Lomax and Gibson Ferguson, eds. 2002: Language in Language Teacher Education. Amsterdam and Philadelphia: John Benjamins. 250 pp.

7. Corpora and language teaching

Investigating the usefulness of lexical phrases in contemporary coursebooks

Register Differences between Prefabs in Native and EFL English

No Evidence. 8.9 f X

Modern foreign languages

THE USE OF SPECIALISED CORPORA: IMPLICATIONS FOR RESEARCH AND PEDAGOGY

The Teacher as Student in ESP Course Design

ELECTRONIC DICTIONARIES AND ESP STUDENTS

Correlation: ELLIS. English language Learning and Instruction System. and the TOEFL. Test Of English as a Foreign Language

FOREIGN LANGUAGE TEACHING AN INTERVIEW WITH NINA SPADA

Beyond single words: the most frequent collocations in spoken English

Completing tasks for online forums: a look through the students eyes

Teaching Math to English Language Learners

Download Check My Words from:

PTE Academic Preparation Course Outline

A Survey of Online Tools Used in English-Thai and Thai-English Translation by Thai Students

CORPUS VS DICTIONARY IN EFL CLASSES

What do you need to know to learn a foreign language?

A Report on my Foreign Language Learning Experience BA English Language and Communication

Language Meaning and Use

Most EFL teachers in Japan find that there are groups of verbs which consistently

CRITICAL AND CREATIVE THINKING RUBRIC GRADUATE PROGRAMS

Class contents and exam requirements Code (20421) English Language, Second language B1 business

UNIVERSIDAD TECNOLOGICA DE PEREIRA LICENCIATURA EN LENGUA INGLESA SEMINARIO DE GRADO RECASTS AND THEIR USE IN SECOND LANGUAGE DEVELOPMENT

SOUTH SEATTLE COMMUNITY COLLEGE (General Education) COURSE OUTLINE Revision: (Don Bissonnette and Kris Lysaker) July 2009

GRAMMAR, SYNTAX, AND ENGLISH LANGUAGE LEARNERS

Using Blogs as eportfolios in ESL/EFL Writing Classes

English Language (first language, first year)

Paterson Public Schools Bilingual /ESL Department 133 Ellison Street Paterson, NJ 07505

Z E S Z Y T Y N A U K O W E WYśSZEJ SZKOŁY PEDAGOGICZNEJ W RZESZOWIE SERIA FILOLOGICZNA ZESZYT 42/2001 STUDIA ANGLICA RESOVIENSIA 2

Differences in linguistic and discourse features of narrative writing performance. Dr. Bilal Genç 1 Dr. Kağan Büyükkarcı 2 Ali Göksu 3

Difficulties that Arab Students Face in Learning English and the Importance of the Writing Skill Acquisition Key Words:

Customizing an English-Korean Machine Translation System for Patent Translation *

The Learning needs for Air Cadets and Air Force Student Nurses in the English subject

UNIVERSITÀ DEGLI STUDI DELL AQUILA CENTRO LINGUISTICO DI ATENEO

Syllabus: a list of items to be covered in a course / a set of headings. Language syllabus: language elements and linguistic or behavioral skills

NEW DEVELOPMENTS IN TEACHING READING COMPREHENSION SKILLS TO EFL LEARNERS

English Language Proficiency Standards: At A Glance February 19, 2014

Latin Syllabus S2 - S7

Exam Information: Graded Examinations in Spoken English (GESE)

John Benjamins Publishing Company

TEC 2014 Hyderabad. Adam Scott. The Interface of Corpus Linguistics and The Lexical Approach for Teaching ESL Beginner Learners Effectively

Teaching Qualifications. TKTModules 1 3. Teaching Knowledge Test. Handbook for Teachers

COURSE SYLLABUS ESU 561 ASPECTS OF THE ENGLISH LANGUAGE. Fall 2014

INVESTIGATING DISCOURSE MARKERS IN PEDAGOGICAL SETTINGS:

COMPILING A DICTIONARY OF COLLOCATION: PROBLEMS AND SOLUTIONS

Promoting Learner Autonomy and Language Awareness Through Blogging

Using a Dictionary for Help with GERUNDS and INFINITIVES

COLLOCATIONS IN ENGLISH

CELTA. Syllabus and Assessment Guidelines. Fourth Edition. Certificate in Teaching English to Speakers of Other Languages

1 Basic concepts. 1.1 What is morphology?

How To Pass A Cesf

BACHELOR IN ECONOMICS FIRST YEAR

Some Reflections on the Making of the Progressive English Collocations Dictionary

and the Common European Framework of Reference

A discourse approach to teaching modal verbs of deduction. Michael Howard, London Metropolitan University. Background

Milk, bread and toothpaste : Adapting Data Mining techniques for the analysis of collocation at varying levels of discourse

French Language and Culture. Curriculum Framework

Minnesota K-12 Academic Standards in Language Arts Curriculum and Assessment Alignment Form Rewards Intermediate Grades 4-6

Corpus for Classrooms: Ideas for Material Design

Advanced Grammar in Use

Italian Language & Culture Courses for Foreigners. ITALY Language Training

ENGLISH FILE Elementary

Domain Classification of Technical Terms Using the Web

A critical look at software tools in corpus linguistics* 1

Course Engagement and Involvement ENG P Presecondary

Listening Student Learning Outcomes

Online Study Support For Business English Courses

Using the BNC to create and develop educational materials and a website for learners of English

GUIDE TO BUSINESS REPORT WRITING

Handbook. Trinity College London Certificate in Teaching English to Speakers of Other Languages (CertTESOL) at INTO University of Exeter

A + dvancer College Readiness Online Alignment to Florida PERT

Master of Arts in Linguistics Syllabus

Teaching Methodology Modules. Teaching Skills Modules

Learning English with CBC Radio Living in Alberta. Understanding Credit Cards

Transcription:

Teaching terms: a corpus-based approach to terminology in ESP classes Maria João Cotter Lisbon School of Accountancy and Administration (ISCAL) (Portugal) Abstract This paper will build up on corpus linguistic theories and methodologies to analyze terminological data, which will later be applied to the actual context of Business English classes. Our aim is to show that simple corpus mining techniques impact greatly on students performance, since they will engage in real-life tasks, based on authentic material. The first part of the paper concentrates on theoretical issues and starts with the assumption that teaching materials very often do not reflect authentic language use. In the second part of this paper, we will deal with practical applications to the didactics of ESP. With a tailored corpus, the teacher will be able to analyze texts linguistically in order to extract the lexical units that will be more relevant to the current or future professional activity of the students. Key words: collocation, corpus linguistics, ESP, terminology. Introduction Given the specificity of ESP, many will argue that readily available materials in the form of textbooks are usually too broad and not easily adaptable to real learner s needs (Harwood, 2005). Building on this assumption and considering that the lexis plays an essential role in ESP courses, we will focus on terminological issues and endeavour to show that creating customised corpora is an easy task that will prove beneficial for both teachers and learners. In the second part of this paper, we use the theoretical groundings explored and apply them to a Business English course which is roughly based on the Technical English course 3 rd year students at Lisbon School of Accountancy and Administration are required to take. The course is structured around the following topics: Accounting, Banking, Taxation, Auditing, Economics and Ethics, which will be the basis for our terminological approach. Corpus linguistics and ESP Corpus linguistics uses real data to assess hypothesis about language allowing us to explore the relevant linguistic structures for the aims we set for our research. «A corpus can be described as a large collection of authentic texts that have been gathered in electronic form according to a specific set of criteria» (Bowker & Pearson, 2002: 9). McEnery and Wilson (1996) argue that foreign language teachers usually produce simplified examples, which will raise difficulties for students when these are confronted with real, more complex language that sometimes they are incapable of processing. Corpus linguistics can thus contribute to rendering learning a foreign language more effective since students will be faced with «real language». 499

[ ] it is probably easier to search through a file of relevant examples for what is required than to think up something that sounds natural. When sounds natural is examined closely, it usually transpires that it is almost impossible to invent an adequate example; attempts made by language teachers, lexicographers and others to represent usage are often embarrassing and never reliable. (Sinclair, 1997: 31) Sinclair s position may sound too radical but given an ESP context the difficulty of producing examples that «seem real» is even greater. Since ESP teachers are rarely experts in the learners field of knowledge and sometimes they are not native speakers of the language, they will be faced with two types of problems: producing examples in a language that is foreign to them and in a field of knowledge that they do not master. However, the ESP teacher plays a fundamental role in teaching students to recognize technical vocabulary and understanding definitions (Chung & Nation, 2003). If in terms of grammatical structures teachers with a good mastery of the language, and in spite of the aforementioned difficulties, are able to formulate statements that exemplify a given construction relying only on their intuition, lexically speaking, and in the case of ESP in particular, the situation is more complex since lexis is one of the most specific areas in ESP. Both learners and teachers will then have to rely on the materials used for the ESP course as the main source of new vocabulary. When using textbooks, vocabulary choices were already done by the authors and do not take into account the specific needs of each group of learners (Bogaards, 1994). By producing additional materials based on authentic texts, however, teachers will greatly improve their learners proficiency in ESP. In order to produce these materials, teachers can resort to custom-made ESP corpora. Building an ESP corpus Several factors are significant when building a corpus. Sinclair (1991) defines some criteria for the design of general language corpora, recommending the inclusion of all LSP material in a separate corpus. But to start with one has to define the purpose of the corpus. In the second part of this paper, we focus our attention on the creation of an ESP corpus that can be used both in class and to prepare classes. Pearson (1998) talks about several relevant factors that need to be taken into account when building special purpose corpora, such as size, text type and origin, authorship, factuality, technicality, audience, intended outcome, setting and topic. In an ESP corpus the selection of the topics depends on the learners field of knowledge. The criteria of authorship and audience are very important, since these will determine the technicality degree of the texts. Not all these aspects are equally significant, and ESP teachers have to make learner-oriented decisions since «[t]he optimal design of a corpus is highly dependent on the purpose for which it is intended to be used» (Kennedy, 1998: 70). 500

Collocations It s not enough to learn lexical units separately, learners also have to know how to combine those units. Therefore, ESP teachers have to draw learners attention to «chunks of language», i.e., collocations. Collocation is the occurrence of two or more words within a short space of each other ( ) Collocations can be dramatic and interesting because unexpected, or they can be important in the lexical structure of the language because of being frequently repeated. (Sinclair, 1991: 170) The easiest way to identify collocations is by checking the frequency of a given group of words in a corpus. Obviously a thorough analysis of the data should be done because that group may occur more often simply because it is composed of very common words (Biber et al., 1998). A large part of our mental lexicon consists of combinations of words that customarily cooccur. The occurrence of one of the words in such a combination can be said to predict the occurrence of the other(s). (Kjellmer, 1991: 112) Native speakers have a natural tendency to combine words in pre-defined chunks that are easily recognizable by their listeners. Kjellmer (1991) classifies those groups of words in relation to their internal degree of cohesion in a continuum from what is totally unknown to fixed expressions. Conceptually restrained, technical vocabulary is formed by collocations that introduce specialised knowledge in ESP. The identification of this special language is made by inferring idiomatic constructions from concordance samples. The aim is to perceive the fixation of long compounds, and to appreciate the value of this lexical restriction in the subjects. (Curado Fuentes, 2001: 118) By identifying and studying collocations learners will form their mental lexicon not only from isolated independent units, but also from pre-combined units, thus consolidating a conceptual system that will allow them to become more proficient at an initial stage of learning. Retrieving terms from corpora Distinguishing terms from common words is not an easy task. Pearson (1998) offers an interesting discussion on the subject and arrives at the conclusion that what constitutes a term depends on the communicative setting. However, we do know that terms label concepts and must be learnt by ESP students. Compiling and exploring a thematic corpus In order to demonstrate the effectiveness of a corpus-based approach to terminology in the ESP class, we compiled a set of articles on a common topic the impact of the Sarbanes-Oxley Act called BEC_audit. This corpus is composed of 32 texts collected from different online sources written over a year. All texts were stored in a.doc format, complete with date, URL address, graphs and illustrations. Subsequently, irrelevant items were removed (everything but the text itself) and a new.txt file was created, i.e. the corpus proper. 501

The BEC_audit corpus was explored with ConcApp, a freeware concordancer available at http://www.edict.com.hk/pub/concapp/ (27.05.2006). Concordancers provide concordances, i.e., alphabetical lists of words in context, which can be searched according to various criteria. We started by using the option Statistics > Unique Words to count the total number of words in the corpus (19,852), as well as the number of unique words (3,445). FIGURE 1. The nine most frequent words in the BEC_audit corpus. The most frequent units in a corpus are usually functional words, such as prepositions, articles and conjunctions. In Figure 1, we can see that the ninth most common word is audit with 178 instances and a usage rate that is higher than any form of the verb to be. A frequency this high confirms that the corpus was adequately compiled for the purpose of studying Auditing terminology. TABLE 1 The ten most common lexical units in the BEC_audit corpus. Position Lexical Unit Instances 9 th audit 178 12 th said 139 21 st auditors 98 22 nd companies 97 24 th accounting 91 26 th firms 83 29 th internal 76 32 nd Sarbanes-Oxley 70 36 th financial 63 38 th clients 61 Ignoring functional words, Table 1 shows the ten most common lexical units in the BEC_audit corpus. Except for the verb said, all other units are part of the terminology 502

of Auditing. Using the tool Concordance > Search, we analysed these units and observed the contexts in which they occur in order to determine whether they are actually terms in this LSP. This type of analysis will also make the observation of collocations easier and it is essential in the case of adjectives such as internal, for example, which isolated is not a term, although intuitively we may think it makes part of collocations that are relevant for the terminology we are dealing with. FIGURE 2. Associated word search. Figure 2 shows the first twenty instances of the term internal where we can see four complex terminological units: internal accounting, internal audit, internal auditing and internal auditors. ConcApp also enables us to test our intuition regarding expressions that have a great probability of co-occurring by using the option Associated Words. We can define the item we intend to search, the other word that co-occurs with it, and the relation between the two items in case it is relevant: the second word might occur on the right or on the left of the first. We tested the expression Big Four, since the lexical unit big occurs 47 times in the corpus and the unit four occurs 46 times and these data might prove important given the scope of the corpus. In effect, we saw that all 38 co-occurrences of both words are capitalized, and are obviously a term (see Figure 3). FIGURE 3. Collocation Big Four in the BEC_audit corpus. 503

The less frequent words in a corpus might also be interesting, particularly those that occur only once, called hapax legomena. In a small newspaper article with around 200 words, 150 may be hapax legomena; but in larger representative corpora the proportion of hapaxes is usually much lower (Kennedy, 1998). Sometimes these are simply mistakes in the original texts that need to be corrected, but they may also be relevant terms in the LSP we are analysing. In the BEC_audit corpus we found some interesting hapaxes that are clearly auditing terms: bribery, leveraging and stakeholders. Using corpora to make exercises In this section we used the corpus BEC_bank composed of 20 articles on the topic of banking downloaded from the Financial Times website. Our aim is to show that a body of readily-available sentences facilitates the ESP teacher s task when the time comes to make both vocabulary and grammar exercises. Exercise 1 provides an example of sentences taken from the BEC_bank corpus that test the acquisition of Banking terms. Exercise 1 a. They are swimming in card and student loan debts. b. Halifax's new account gives cash-back on debit card purchases, up to 100 a year. c. The big four banks have little incentive to speed up the payments system because they make money from it. It is also possible to search for prefixes or suffixes using ConcApp and thus analyse certain grammar categories, such as adverbs of manner ending in ly or verb constructions. Exercise 2 is aimed at testing the usage of Gerunds and Exercise 3 tests prepositions. Exercise 2 a. Many lenders will charge you double the interest rate for (use) a credit card. b. This type of deal would depend on (get) a price from the broker s agent in the country of origin. c. The group will start (look) at whether to speed up cheque clearing in October. Exercise 3 Disputes cash rose 347 the year ending March 31, from 117 the previous 12 months. One of the main advantages of using a corpus to collect material that can be used to make exercises is the fact that teachers will actually use terminology and syntax that are characteristic of the particular LSP they are teaching. 504

These special purpose corpora, however, are not only advantageous for teachers but also for learners. Several authors have shown that it is possible for learners to explore corpora individually or in group. Gavioli (1997) presents a series of activities that are aimed at providing the students with the necessary tools to be able to ask relevant questions and the capacity to interpret the answers given by the machine; Kennedy & Miceli (2001) describe a similar experiment. Final Remarks The usage of customized ESP corpora is extremely useful for the preparation of materials. The possibility of analysing specific terminology ensures that teachers are conveying information that will actually be valuable for learners in their professional life. The methodology we have described in this article is easy to follow and allows ESP teachers to adapt materials to the specific needs of learners (Aston, 1997). The larger the corpus, the easier it will be to make generalizations and say that the selection of the terms was based on the usage criterion which Bogaards (1994) identifies as the most important one. Much is still to be done in this area, such as ascertain whether learners will benefit more from direct or mediated access to corpora, defining strategies for both direct and mediated access, and how to integrate this experiences in the language learning environment (Chambers, 2005). References Aston, G. (1997). «Enriching the learning environment: corpora in ELT» in A. Wichmann et al. (eds.), 51-64. Biber, D., S. Conrad & R. Reppen (1998). Corpus Linguistics. Investigating Language Structure and Use. Cambridge: Cambridge University Press. Bogaards, P. (1994). Le vocabulaire dans l apprentissage des langues étrangères. Paris: Éditions Didier. Bowker, L. & J. Pearson (2002). Working with Specialized Languages. London: Routledge. Chambers, A. (2005). «Integrating corpus consultation in language studies», Language Learning & Technology 9, 2: 11-125. Chung, T. M. & P. Nation. (2003). «Technical vocabulary in specialized texts», Reading in a Foreign Language 15, 2: 103-116. Curado Fuentes, A. (2001). «Lexical behaviour in academic and technical corpora: implications for ESP development», Language Learning & Technology 5, 3: 106-129. Gavioli, L. (1997). «Exploring texts through the concordancer: guiding the learner» in A. Wichmann et al. (eds.), 83-99. Harwood, N. (2005). «What do we want EAP teaching materials for?» in Journal of English for Academic Purposes, 4: 149-161. Kennedy, G. (1998). An Introduction to Corpus Linguistics. Essex: Pearson Education Limited. 505

Kennedy, C., & Miceli, T. (2001). «An evaluation of intermediate students' approaches to corpus investigation», Language Learning & Technology 5, 3: 77-90. Kjellmer, G. (1991). «A mint of phrases» in K. Aijmer & B. Altenberg (eds.) English Corpus Linguistics. Harlow: Longman, 111-127. McEnery, T. & A. Wilson (1996). Corpus Linguistics. Edinburgh: Edinburgh University Press. Pearson, J. (1998). Terms in Context. Amsterdam: John Benjamins. Sinclair, J. (1991). Corpus, Concordance, Collocation. Oxford: Oxford University Press. Sinclair, J. (1997). «Corpus evidence in language description» in A. Wichmann et al. (eds.), 27-39. Wichmann, A., S. Fligelstone, T. McEnery & G. Knowles. (eds.). (1997). Teaching and Language Corpora. Harlow: Longman. 506