NWREL Logo Toolkit98

Integrating Assessment With Instruction Readings




Integrating Assessment and Instruction:
Using Assessment as a Tool for Learning

Can There Really Be Assessments That Promote Learning?

Mention the word "assessment" to most students and teachers and they will immediately respond with the concept of "tests"—the kind used to monitor and track student performance. TESTS—the type given to produce grades, select students for special programs, and report to the Board of Education. Wouldn't it be exciting, however, if the word "assessment" was associated not with traditional testing, but with a different process—one that not only provides an opportunity for teachers and students to thoughtfully examine what has been learned, how we know it has been learned, and what needs to happen next to reach a new level of learning, but also is an episode of learning in and of itself?

The following performance assessment in mathematics is an example of such an assessment. It illustrates how assessment and instruction can be integrated as a "tool for learning."

Performance Assessment: Mathematics

Performance assessment is "assessment based on observation and judgment" (Stiggins, 1994); we observe a behavior or examine a work product and judge its quality. This is frequently done in the classroom by teachers, and is a common occurrence in daily life as well—just think of the Olympics, job performance reviews, and the driver's test. These kinds of judgments can be very subjective. The goal is to make these essentially subjective judgments as objective as possible. This is done through the development of performance criteria that clearly define characteristics and indicators of quality student work.

The criteria developed to judge the quality of student work are also critical to making performance assessment a powerful tool for learning because the criteria clearly communicate what is valued in the work—what it takes to be successful. These criteria for quality are teachable to students, allowing them access to the "secrets" of success. (In fact, students can be full partners in the development of the criteria for success.)

Thoughtfully written and teachable criteria support standards-based teaching. When students and teachers begin to share the same vision of success, they quickly develop a vocabulary for discussing this vision. All are working toward the same clear and identifiable standard. This improves student achievement.

& Reference Box

Richard Stiggins, Student Centered Classroom Assessment, 2nd Ed., 1997, Columbus, OH: Macmillan.

To see how this works, we'll use a mathematics assessment example. Let's begin with analyzing samples of student work. Look at the two student solutions in Figure 2.4 and take a moment to jot down your reactions to them. What did you notice? Are there any obvious strengths? Weaknesses? What features did you respond to? Which solution did you think was the best? Why?

Figure 2.4
Mathematics Applications—Grade 4

Most teachers and students examining solutions such as these tend to like and dislike the same sorts of things. Typical comments for the first solution include: vague, may not have understood the problem fully, and had to read a lot into the solution in order to understand what the student did. Typical comments for the second solution include: understood the problem fully, clear solution, communicated well in pictures and words, logical, detailed, organized, insightful, correct answer, explained thinking, used all relevant information, distinguished relevant information from irrelevant information, used data accurately, correct process, on topic, creative, focused, showed process, used pictures correctly, good explanation, and went beyond the minimum. How did your list of comments compare to these?

Û Related Toolkit98 Activities and Chapters

Appendix A—Sampler contains several other rubrics/criteria for judging solutions to math problems. Compare them. Do the same sets of features tend to recur?

Activities 2.1Sorting Student Work, 2.2—How Can We Know They're Learning? involve sorting student work, generating lists of desirable features, and beginning the process of developing performance criteria.

Comparing solutions to mathematics problems helps us all sort through the complexity of features that make solutions successful. Such exercises can help teachers and students explore what makes a solution work. In fact, the same sets of features recur on brainstormed lists from teachers across the country. They form the basis for performance criteria for mathematics applications problems in many places. For example, Sample A.18—Mathematics Problem Solving (in the Sampler in Appendix A) shows a "four-trait" model for assessing such problems. There are four traits because teachers in Oregon felt that their list of comments tended to fall into four major areas: conceptual understanding, procedural knowledge, problem-solving strategies, and communication. Look at the features other teachers tend to place under each trait (Figure 2.5). Do the comments you made on the work in Figure 2.4 fit in the same way?

The four-trait model for analyzing mathematics applications is, thus, basically an attempt to systematically write down what we, as educators, already value so we can be consistent in our vision of success across students, across assignments, and across time. And, equally importantly, so students can share in the vision of success and not have it constantly shifting in unknown ways between teachers or assignments.

Figure 2.5
What Teachers of Mathematics Value

Conceptual Understanding

Distinguished irrelevant information
Used all relevant information
Understood the problem
Creative
Used data accurately

Procedural Knowledge

Used pictures correctly
Correct answer

Problem-Solving Strategies

Logical
Correct process
On-topic
Insightful
Focused

Communication

Detailed
Organized
Good explanation
Communicated well in pictures and Beyond the minimum words
Clear solution
Showed process
Explained thinking

Value of Criteria to Teachers. Have you ever been nervous grading students' open-ended mathematics problems? Those who have rated or scored mathematics during a direct mathematics assessment using criteria such as the four traits unanimously agree that the very process of assessing teaches a great deal about what makes mathematics solutions successful. Raters have to internalize a set of criteria so thoroughly that they can be consistent both with themselves over time and with other raters. During the process of scoring hundreds of papers, raters get to see lots of examples of what good and poor mathematics actually looks like and to analyze it systematically to determine why it is good or poor. These teachers return to the classroom much more confident in commenting on student work and helping students understand what it takes to produce sound, efficient, and understandable solutions to mathematics problems.

Great! As teachers become more consistent in their analysis of student math work, they gain the confidence to share their insights with students. If assessing brings about a shared vocabulary and a deeper sense of what it takes to solve mathematics problems well, then wouldn't students also benefit from the same process? The answer is YES.

Value of Criteria to Students. Students can also learn to analyze sample work and develop a systematic vocabulary for describing quality so they can develop their own sense of what it means to present a good solution. Consider the following quotes from the area of writing assessment and think of how they might also apply to mathematics problem solving, science process skills, group collaboration skills, and other complex goals we have for students.

I want (students) to see evaluation in its best sense—a source to inform teaching and learning. To that end we develop a vocabulary for commenting on the admirable and problematic aspects of writing....The more we examine samples, the richer and more helpful this language of evaluation becomes. (Erikson, 1992)

Winning points may be the final goal of classroom work as it is of the sports endeavor, but the grade, like the final score of the game, never taught anyone how to win again, or why they lost. For the truly successful contenders, playing the game is always about learning the game...however often it seems to be about scoring more wins than losses. (Lucas, 1992)

& Reference Box

G. Hillocks, Jr., 1986. Research on Written Composition. National Council of Teachers of English, (800) 369-6283.

C. Lucas, 1992. "Introduction: Writing Portfolios—Changes and Challenges," in K. Yancey (Ed.). Portfolios in the Writing Classroom, pp. 1-11. National Council of Teachers of Mathematics, (703) 620-9840.

M. Erikson, 1992. "Developing Student Confidence to Evaluate Writing," in Quarterly of the National Writing Project & The Center for the Study of Writing and Literacy, Vol.14, pp. 7-9.

Scales, criteria, and specific questions which students apply to their own or others' writing also have a powerful effect on enhancing quality. Through using the criteria systematically, students appear to internalize them and bring them to bear in generating new material even when they do not have the criteria in front of them. (Hillocks, 1986)

The goal of assessment should not be only to earn grades. The goal of assessment should be to help students know "how to win again." Clear performance criteria have this effect.

This is instructional assessment. Students learn something in the process of assessing and being assessed. They learn about mathematics problem solving and how to apply this knowledge to their own and others' works. They learn what effective mathematics solutions look like and what to do to make their own mathematics skills stronger.

Beyond Mathematics Assessment

The point here is that good performance criteria can be more than just a tool for teachers (or districts or states) to monitor student ability to write, problem solve, be good collaborators, or be critical thinkers. They also help both teachers and students conceptualize and define standards, especially in hard-to-define areas such as critical thinking, problem solving, collaborative learning, and communication. They provide a vocabulary for discussing features of work or behavior that have, in the past, been subtle and ill-defined.

Good performance criteria help teachers answer such questions as: What is expected? What are our standards? What does good performance look like? What do I want to accomplish? What kind of feedback do I give to improve student work next time? They help students to answer similar questions: What is expected? What does good performance look like? How will I know when I'm successful? Why did I win? How can I win again?

There are three essential features to make it work: (1) having a clear picture in our own heads of the criteria for success, (2) making sure the criteria are right—they really do describe the important features of quality, and (3) letting students "in on it."

  1. Having a clear picture in our own heads. Criteria are, in effect, the final and most complete definition of district and teacher goals for students. Notice that it is the performance criteria and not the performance task that have this function. It is the performance criteria that define what we value in student work, define our targets and standards, and are useful in determining when we meet standards.

    In many discussions of integrating assessment and instruction, criteria are left out. There is a lot of talk about the tasks that should be given to students and how one must watch performance on these tasks to monitor where students are. But the question is: "What, exactly, do you watch for?" to determine progress, know when students are ready to move on to the next topic, or determine when mastery has occurred? Criteria tell us what to look for. Overt and public criteria—arrived at through discussion and collaborative learning and accessible to teachers, students, and parents—bring judgmental assessment from the realm of individualistic impressions to the realm of systematic and generalizable documentation of learning. They also result in assessment that promotes learning.

    How does one know when criteria are clear enough?

    • Everyone can list the criteria
    • Everyone gives the same definitions for features listed in the criteria
    • Everyone can identify the same samples of strong and weak student performance
    • Everyone can use the same words for describing why the samples are strong or weak

  2. Making sure the criteria are right. Is handwriting a telling and important feature of math problem solving? Is neatness the most telling and important feature of writing? Do all the sentences in writing have to be complete, or is sentence quality more determined by the flow of the sentences than whether they are complete or sentence fragments? Should setting up the problem correctly be as important as calculating the right answer? A two-minute speech might be desirable, but it that the most telling feature of a quality oral presentation?

    These are the types of decisions that have to be made when developing criteria. The criteria have to describe what everyone really means by quality or they won’t help anybody. If a teacher or student believes that an essential component of quality has been left out or that an unimportant component of quality is prominently in the criteria, there will be no buy-in. If criteria leave out an important feature of quality, like voice in writing, teachers might not teach voice. What they see is what you’ll get, so criteria have to embody the real indicators of a sound performance and leave out that which is not important.

    How does one know when criteria are the right ones?

    • No performance that is weak has to be judged strong using the criteria.
    • No performance that is strong has to be judged weak using the criteria.
    • After repeated use, one can’t think of anything else that needs to be included or left out.
    • The research literature supports the features included.
    • Teachers, students, and parents consistently reproduce the criteria when asked to sort work for quality and describe the reasons for their judgments.

    Û Related Toolkit98 Activities and Chapters

    Not all performance criteria are created equally—they may not all be able to have the instructional effect described in this section. There may be certain design features that make them more or less useful as instructional tools. These features are outlined in Chapter 2 and Activities 3.3—Performance Criteria—Keys to Success and 1.9Going to School.

  3. Letting students in on it—or—students must be involved in assessing. Students learn to the extent that they are the ones doing the judging.

    By encouraging students to engage in self- and peer evaluation teachers empower students to take control of their own learning. When students help determine the criteria for assessment they can make reasonable decisions about the quality of their own work. By engaging in self-assessments, students learn they are ultimately responsible for their own learning. It is in this area that portfolios are so powerful. (Tierney, et al. 1991, p.59)

    & Reference Box

    J. Penick, J., 1989, "Issues-Based Education—A Need and Rationale," in Education and Urban Society, 22(1), pp. 3-8.

    Robert Tierney, M. Carter, & L. Desai, 1991, Portfolio Assessment in the Reading-Writing Classroom, Christopher Gordon, Publishers, Inc., (617) 762-5577.

    To be successful, students must learn how to evaluate rather than how to be evaluated....Evaluation is routine, something to be done at every step of the way as they examine decisions, observations, data, ideas and outcomes....[When] evaluation is a key process, and one over which students have considerable control, they are truly critical thinkers. (Penick, 1989)

    Û Related Toolkit98 Activities and Chapters

    Appendix A—Sampler contains many other examples of performance criteria and rubrics.

    Activities 2.1—Sorting Student Work, 2.2—How Can We Know They're Learning? involve sorting student work, generating lists of desirable features, and beginning the process of developing performance criteria.

    Activity 2.5—How Knowledge of Performance Criteria Affect Performance illustrates the power of rubrics in assisting students to understand the requirements of a task.

    Relationship Between Classroom Performance Assessment and Large-Scale Assessment

    So far, it may seem that we are only discussing classroom performance assessment. But, in reality, the whole enterprise is much more powerful if large-scale and classroom assessment are considered together as a unit. Consider the following points:

    1. What if performance criteria for large-scale assessment (which have to be developed anyway) were to be developed by teachers with classroom uses in mind? In order to develop performance criteria, teachers have to engage in the discussions and do the research necessary to develop a common vision of what it means for students to be successful. The effort that goes into large-scale assessment development, therefore, also serves as staff development and contributes to development of instructional materials.

    2. What if the performance criteria were uniform across teachers, so that regardless of which teacher observes a student performance, the judgment would be the same? Then assessment credibility improves—and students get the same message across all teachers and grades. Large-scale assessment development can take performance assessment out of the realm of individual interpretation and can promote essential dialogue about what, in fact, constitutes our vision of success for students—our standards.

    3. The way we do large-scale assessment sends messages to teachers about what to teach and how to teach it. If large-scale assessment is designed with the instructional end user in mind, there is much more opportunity to influence instruction in desirable and controllable ways.

    Chapter 2 Menu




    This document's URL is:

    Home | Up & Coming | Programs & Projects: Assessment | People | Products & Publications | Topics

    © 2001 Northwest Regional Educational Laboratory


    Email Webmaster
    Tel. 503.275.9500
    NW Lab Home