National assessment of foreign languages in Sweden



Similar documents
National assessment of foreign languages in Sweden

Package; Teacher Education Campus Roskilde Spring 2017

Programme Plan for Primary Teacher Education, Years 1-7

ETS Automated Scoring and NLP Technologies

Pre-service Performance Assessment Professional Standards for Teachers: See 603 CMR 7.08

TESOL Standards for P-12 ESOL Teacher Education = Unacceptable 2 = Acceptable 3 = Target

Assessment Policy. 1 Introduction. 2 Background

Cartooning and Animation MS. Middle School

All IB teachers are aware that they are also language teachers.

Instruction: Design, Delivery, Assessment Worksheet

Self Assessment Tool for Principals and Vice-Principals

New Jersey Core Curriculum Content Standards for Visual and Performing Arts INTRODUCTION

Comparison of Research Designs Template

Nefertari International Schools IBDP Candidate School Whole School Language Policy

Pre-Requisites EDAM-5001 Early Literacy Guiding Principles and Language

North Carolina TEACHER. evaluation process. Public Schools of North Carolina State Board of Education Department of Public Instruction

READING WITH. Reading with Pennsylvania Reading Specialist Certificate

Foreign Language (FL)

A Guide. to Assessment of Learning Outcomes. for ACEJMC Accreditation

The sophomore Spanish class is team taught by two Spanish teachers during a one hour block. Each

Completed Formal Classroom Observation Form

Languages at key stage : evaluation of the impact of the languages review recommendations: findings from the 2009 survey

Faculty Board for Undergraduate Studies in the Humanities and Social Sciences Translation from Swedish to English S Y L L A B U S

New syllabus for Swedish for Immigrants (sfi)

Standard Two: Knowledge of Mathematics: The teacher shall be knowledgeable about mathematics and mathematics instruction.

Behind the news at Italian schools

LOTE TEACHER COMPETENCIES FOR PROFESSIONAL DEVELOPMENT

Requirements EDAM WORD STUDY K-3: PRINT AWARENESS, LETTER KNOWLEDGE, PHONICS, AND HIGH FREQUENCY WORDS

Reading Specialist (151)

ILLINOIS CERTIFICATION TESTING SYSTEM

Comprehensive Reading Plan K-12 A Supplement to the North Carolina Literacy Plan. North Carolina Department of Public Instruction

WHAT WORKS IN INNOVATION AND EDUCATION IMPROVING TEACHING AND LEARNING FOR ADULTS WITH BASIC SKILL NEEDS THROUGH FORMATIVE ASSESSMENT STUDY OUTLINE

ST. PETER S CHURCH OF ENGLAND (VOLUNTARY AIDED) PRIMARY SCHOOL SOUTH WEALD. Modern Foreign Language Policy

INSTRUCTION AT FSU THE FLORIDA STATE UNIVERSITY OFFICE OF DISTANCE LEARNING. A Guide to Teaching and Learning Practices

COMMUNICATION COMMUNITIES CULTURES COMPARISONS CONNECTIONS. STANDARDS FOR FOREIGN LANGUAGE LEARNING Preparing for the 21st Century

Pedagogical use of multiple choice tests Students create their own tests

MASTER S DEGREE IN EUROPEAN STUDIES

Ribby with Wrea Endowed C.E. Primary School. Modern Foreign Languages Policy

Modern Foreign Languages (MFL)

Philosophizing with children by using Socratic Methods

TEACHERS OF READING.

Program Rating Sheet - Athens State University Athens, Alabama

Basic Assessment Concepts for Teachers and School Administrators.

FORMAL AND INFORMAL FEEDBACK TOOLS TO ENHANCE THE STUDENT LEARNING PROCESS

ICLON. Master Programme on Chinese Language and Culture Education. Claire Smulders, M.A (ICLON) Teacher Educator Chinese Language and Culture

OUTCOMES OF LANGUAGE LEARNING AT THE END OF BASIC EDUCATION IN 2013 TIIVISTELMÄ

Synergies for Better Learning

Principles of Data-Driven Instruction

EXECUTIVE SUMMARY. Nancy C. Rhodes and Ingrid Pufahl. Amount of Language Instruction

COMMUNICATION STRATEGY FOR THE UNIVERSITY OF GOTHENBURG

THE MASTER'S DEGREE IN ENGLISH

Five High Order Thinking Skills

PROGRAM 6 The Role of Assessment in Curriculum Design

curriculum for excellence building the curriculum 2 active learning a guide to developing professional practice

Master of Science in Management

Science teachers pedagogical studies in Finland

Teaching Framework. Competency statements

Tools to Use in Assessment

Making Foreign Languages compulsory at Key Stage 2 Consultation Report: Overview

Using Adult Education Principles to Provide Professional Development for the Foundations of Inclusion

Arizona Association of Student Councils

BEST PRACTICES IN COMMUNITY COLLEGE BUDGETING

English Syllabus for Grades 1-4. Desktop/ Files Returned by Experts August 2008 / English cover, content & introduction Grades 1-4 cv2

Recommended Course Sequence MAJOR LEADING TO PK-4. First Semester. Second Semester. Third Semester. Fourth Semester. 124 Credits

EALTA GUIDELINES FOR GOOD PRACTICE IN LANGUAGE TESTING AND ASSESSMENT

LITERACY: READING LANGUAGE ARTS

Reading Diversity TEACHING TOLERANCE A TOOL FOR SELECTING DIVERSE TEXTS. Extended Edition COMPLEXITY CRITICAL LITERACY DIVERSITY AND REPRESENTATION

Assessment of children s educational achievements in early childhood education

Understanding Types of Assessment Within an RTI Framework

Georgia s Technology Literacy Challenge Fund Grant Application. Bremen City Schools. Section 3: Grant Proposal

Practicum/Internship Handbook. Office of Educational Field Experiences

The University of North Carolina at Pembroke Academic Catalog

Improving the Public School. overview of reform of standards in the Danish public school (primary and lower secondary education)

Language Skills in a Multilingual Society Syed Mohamed Singapore Examinations and Assessment Board

Maths Mastery in Primary Schools

TEXAS RISING STAR WEBINAR SERIES: CURRICULUM AND EARLY LEARNING GUIDELINES RECORDED OCTOBER 29, 2015 NOTES

EA 597 School Planning and Facilities Management (3)

COURSE OF STUDY OUTLINE BUSINESS GRADE 9/10, BUSINESS, OPEN (BTT10/BTT20)

Section 4: Key Informant Interviews

Principal Appraisal Overview

The course is included in the CPD programme for teachers II.

What are some effective standards-based classroom assessment practices?

STANDARDS FOR ENGLISH-AS-A-SECOND LANGUAGE TEACHERS

Advantages and Disadvantages of Various Assessment Methods

Ryburn Valley High School

Linking Singapore s English Language Syllabus 2010 to The Lexile Framework for Reading: STELLAR as an Example

Developing Research & Communication Skills

Deledda International Language Policy

LANG 557 Capstone Paper . Purpose: Format: Content: introduction view of language

FACT SHEET. White Paper on Teacher Education The teacher the role and the education (Report to the Storting No. 11 ( )) Principal elements

Modern Foreign Languages (MFL) Policy 2013

Foundations of the Montessori Method (3 credits)

Transcription:

National assessment of foreign languages in Sweden Gudrun Erickson University of Gothenburg, Sweden Gudrun.Erickson@ped.gu.se The text is based on a summary of a presentation given at the FIPLV Nordic Baltic Region Conference in Tyrifjord, Norway, in June 2004. Published in Sprog og Sprogundervisning 2004:3 (Norway), and Sproglæreren 2004:3 (Denmark). Revisions and additions were made in May 2007 and April 2009. There is a long tradition of assessment at the national level in Sweden. In connection with the present goal- and criterion-referenced system, assessment materials of various kinds, covering different subjects, are offered throughout the school system. Different universities are commissioned by The Swedish National Agency for Education to take responsibility for development and research, e.g. Göteborg University for foreign languages English, French, German and, relatively recently introduced, Spanish. Facts about Sweden The Swedish school system comprises preschool, nine-year compulsory school and three-year upper secondary school, each with its own national curriculum. For compulsory and upper secondary education there are also syllabi for individual subjects. Thus, goals are set nationally, whereas content, materials and methods are to be decided locally. Personal development dialogues between students, teachers and parents/guardians are to be held at least once a term, but formal grades are not awarded until year eight when the pupils are around 14 years old. Teachers are responsible for the evaluation and grading of their own students. There are no formal examinations, but an extensive system of national assessment at different levels is aimed at supporting teachers in their decisions concerning individual students competences in relation to the national goals and grading criteria. Consequently, the national assessment system is advisory, not decisive. Moreover, it needs to be emphasized, that no central marking takes place; teachers take responsibility for the rating of the tests. There is a strong recommendation, but no formal demand, that this be done in collaboration with colleagues. Why national assessment? The main aim of the system of national assessment is to support and advise teachers, and, to a certain extent, students as well, in their decision-making concerning diagnosing, planning and grading. The system also aims at enhancing comparability and equity within the school system. Furthermore, it is used to clarify the view of knowledge and language as expressed in the national curricula and syllabi. Parts of the materials can also be used in national evaluation projects of the school system at large. Thus, there are multiple aims to the system, which obviously is not uncomplicated. There is a fair amount of consensus around the ambition to maintain a system in which assessment is regarded as an integrated part of the didactic process, which, ideally, should be for learning, not just of learning, and of course never against learning. This means that it ought to cover as much as possible of the construct. 1

Students should be offered a variety of tasks, as authentic as possible, and results should be presented in a way that helps each student to gain insights about strengths and weaknesses in his/her individual profile of competence, and to plan, together with the teacher, how learning can be optimized. What is assessed and how? The Swedish national syllabi for foreign languages are to a considerable extent inspired by the Common European Framework of Reference, although the seven stages defined have not been fully linked to the six levels of the Framework. Areas focused on are receptive, productive and interactive skills, but also intercultural communicative competence and what might be referred to as reflective competence, i.e. the ability to analyse and take increasing responsibility for one s own learning, and to evaluate it. Subsystems like vocabulary, grammar and pronunciation are considered important prerequisites but not as goals per se. The national assessment materials are not intended to cover all areas of the syllabi. One reason for this is that they are not to be regarded, or used, as final examinations, but as supplementary, advisory materials. Another reason is that, due to their character, obviously some goals can better be evaluated continuously, during the learning- and teaching process. This applies in particular to the goals aimed at increasing reflective competence, but also to a large extent to those focusing on intercultural, communicative competence. Altogether, at present, full national assessment materials of foreign languages are provided for six different stages of English, three of French and German and two of Spanish. All of them include tasks aimed at testing receptive competence and oral as well as written production and interaction. Models for developing students reflective skills, e.g. self-assessment, are offered for four stages of English and one of French, German and Spanish (the latter also comprising models for peer assessment). Aspects of culture are reflected in the materials, mainly in the choice of texts, and in topics for oral and writing tasks. Additional summative tests for Spanish are underway (publishing to start in 2010). Moreover, there are illustrative materials focusing on partial competencies (oral and written production and interaction) for another two stages one fairly basic for French, German and Spanish, and one for the most advanced stage of English. A typical national test comprises four parts: an oral test, in which pairs or groups of students talk about different subjects, a listening comprehension and a reading comprehension part with a variety of texts and tasks, and a writing test, in which students can usually choose between two different subjects. There are extensive teacher guidelines for all materials. They include test specifications, commented answers and authentic samples of benchmarked oral and written performance, cut-off scores etc. Dvd materials, informing about, and illustrating assessment of oral proficiency in English, French, German and Spanish, have been produced and distributed, free of charge, to all schools offering the courses focused upon. In these programs students take the test, comment on the tasks and assess their own performances. There are also extensive discussions between teachers, analysing and rating the students performances in relation to the national syllabi and grading criteria. All recordings are authentic, i.e., nothing was scripted or rehearsed. Students are informed about the national tests in different ways, in standardized letters, by their teachers, and through extensive sample materials published on the Internet. 2

For French, German and Spanish there is an electronic test bank with different levels of accessibility. For one level of French and German (Spanish to be added) a metaphorical chest of drawers is depicted on the computer screen. The first drawer contains a number of subtests of reading, listening, writing and speaking, with standards set and benchmarks provided, to be used in high-stakes, summative testing. In the second drawer different tasks, previously used in national tests, are offered for teachers to include in their own classroom tests, thus providing stable and thoroughly analysed components, representing skills that teachers may find hard to assess systematically. The first two drawers are firmly locked, i.e., an individual password is needed to be able to open/access them. The third drawer, on the other hand, is open to the public and serves the function of information and practice, if needed. (For further information about the different assessment materials, including samples of tasks, see the project website http://www.ipd.gu.se/enheter/sol/. Information about the Swedish school system can be found at http://www.skolverket.se/sb/d/190.) How materials are developed and standards set The national assessment materials have partly different aims and character, from purely formative and low-stakes, to distinctly summative, compulsory and highstakes. However, they are all based on a set of basic principles, some of which are the following: Making what is most important assessable, not making what is easily measurable the most important; Giving students the chance to show what they actually know and can do, instead of primarily trying to detect/focus on what they do not know/cannot do, e.g. by providing broad, multifaceted, varied, monolingual tests, with to as large an extent as possible progression of difficulty, within and between tasks; Enhancing validity and reliability avoiding bias, for example by developing tests in collaboration with a wide group of stakeholders, and pre-testing all materials in large, randomly selected groups across the country; Detecting and presenting as much as possible of individual students results in profiles strengths as well as weaknesses; Commenting on strengths before weaknesses; when analysing weaknesses, distinguish between errors that [might] disturb and errors that actually destroy communication, i.e. between errors representing different degrees of gravity. Considerations underlying item writing and composition of full materials concern, e.g. content, relevance and level of difficulty in relation to the syllabus in focus, and aspects of time and format as well as of gender and culture. Considerable attention is paid to opinions expressed by students and teachers in connection with piloting and pre-testing of the different tasks. All materials are developed in close cooperation with different categories of experts, among which students, i.e. what might be considered the real stakeholders, should not be forgotten. Contacts with different national and international institutions play an important role. For each material there is a reference group comprising different categories of teachers, teacher trainers and researchers within different fields. Native speakers contribute in various ways to the developmental work. Tasks (items and testlets) are piloted during the initial development stage, revised, and then pre-tested in randomly selected schools throughout the country, normally by around 400 students per task. Anchor items are used to enable 3

comparisons across groups, and across time. During these rounds all students and teachers are asked to comment on the different tasks. A wide range of qualitative as well as quantitative methods are used to analyse the results, i.e. both performance data and opinions. Less well-functioning items and tasks are either removed or adjusted and then pre-tested again, until they are finally considered for inclusion in one of the materials. Standards are set in collaboration with groups of experienced teachers, employing an eclectic approach, i.e. combining different methods for standard setting (as often recommended in the literature), with test-centred as well as student-centred techniques: Teachers take the test, analyse and estimate the items/tasks in relation to the syllabus, and to their experiences of student performances at the level in focus, and then suggest cut-off scores for the different grade levels. Different data from the pretesting rounds are introduced towards the end of each session and play a role in the final decisions made. As for the selection of benchmarked samples of oral and written performances, approximately ten teachers analyse and rate, independently, a large number of authentic samples. The ratings are then analysed, with regard to inter-rater reliability, distribution etc., examples are chosen and comments produced for the teacher guidelines. For the French, German and Spanish materials, based on a common syllabus, a three-phased standard setting model is used. First, standards are set for each test separately, according to the model described above; after that the tests are compared, and standards suggested, by groups comprising teachers who are academically qualified and experienced in teaching two of the languages. In this phase, a list of parameters, produced in collaboration between linguists and psychometricians, is used to make sure that a wide range of relevant aspects are taken into account. Before the final decisions are made about standards and benchmarks, the results from the first two standard setting phases are compared, and data from the different pretesting rounds further considered. Results and reactions Test results are routinely analysed with regards to various aspects of validity and reliability. Matters investigated obviously concern aspects of facility, distribution, internal consistency, and rater agreement. In 2008 a large-scale study of inter-rater agreement and consistency was undertaken, focusing on the final tests of English, Mathematics and Swedish at compulsory school level (results to be published by the National Agency in May 2009). 100 randomly selected, teacher rated tests were independently re-rated by three external raters. The results for English were very positive, with almost total agreement for constructed response items in Listening and Reading comprehension, and correlations between.86 and.93 for Writing (generalizability coefficient.85). Earlier studies of the oral components of the test have indicated roughly the same results as for Writing. Since identical routines are applied in the development of all FL materials, including the teachers scoring guides provided, it can be tentatively assumed that these results are generalizable, at least to some extent, to the other FL testing materials produced within the project as well. Test takers often give valuable suggestions for improvement of tasks, for example concerning content, clarity and perceived level of difficulty, the latter especially useful in sequencing decisions. In general the following has been noted about 4

students attitudes 1 : Students tend to appreciate tasks that are considered authentic, pedagogical, fair and challenging; Fairly irrespective of students level of proficiency, oral tasks and creative writing are often the most appreciated parts of the tests. In addition, it can be mentioned, that basically the same results emerged in a survey of students [and teachers ] views on language testing and assessment, conducted in ten European countries in 2005. A report on this (Erickson & Gustafsson, 2005) can be found on the website of the European Association for Language Testing and Assessment (EALTA), under Resources (http://www.ealta.eu.org/resources.htm). Teachers reactions are generally very positive, both to the principle of national assessment as such, and to the different materials. During the past ten years more than 95 per cent have expressed positive opinions, often concerning the breadth and variation of the tasks, the close connection between the materials and the syllabi, the profiled presentations of results, and the support provided in the guidelines. The criticism which of course also occurs, most often concerns work load broad, qualitative assessment takes time, too much time some teachers seem to feel. The outcomes of the different assessment rounds are analysed and commented on in regular reports published on the Internet. In these both results and reactions are presented and discussed. Furthermore, data from the project are used for research purposes. Examples of areas focused upon are different types of language related issues, various aspects of validation, students perceptions (e.g., test-taker feedback, self-assessment), collaborative processes, and dimensionality issues. Concluding remarks The system of national assessment in Sweden is flexible and dynamic, which means that changes, initiated by different stakeholders and based on thorough development work, are gradually introduced. However, the basic principles as well as the collaborative approach to test development remain the same, since it is felt that this contributes to a continued, positive assessment climate within, and hopefully also outside, the national assessment system. POST SCRIPTUM After a general election in September 2006, when a new government came into power, a number of changes to the Swedish school system have been initiated, e.g., earlier grading, an increased number of grade levels, more national tests, and tests in more subjects. Also, it is proposed that the number of aims of the present assessment system be reduced, with more emphasis placed on comparability/equity, less on individual diagnosis and curriculum implementation. The relationship between test results and final grades is to be clarified, which probably implies a strengthening of the role of the national tests. However, there is nothing indicating a change to an examination system, i.e. where the national test results determine individual students grades. 1 Mainly based on analyses of 15-year-old students comments on tests of English. 5