Toolkit98

There are two parts to a performance assessment—tasks and criteria.
We use the term "tasks" to refer to whatever activity, project, prompt, problem, or assignment is given to the students to do. The quality of the student response to the task is what is assessed using the criteria (scoring guide, rubric).
As needed, brainstorm examples of tasks. Examples of tasks: oral reading, writing a research paper, doing a science lab, group discussion, doing a worksheet on fractions, taking a multiple-choice test. Some assessment tasks are developed specifically to assess student skills; other tasks are those that occur as part on normal on-going instruction. The point is that assessment doesn't have to be conceived of as something that occurs at the end of, and is separate from, instruction.
There are two important quality considerations when designing performance tasks—will it elicit the right performance and will there be anything about the task that might get in the way of any group of students showing what they can do. The first question is covered below in steps 2-5. Steps 6-7 cover the second question.
a. On a third grade writing assessment, the students were given the prompt: "You've heard the story about how the camel got its hump. Think of something in nature and write a story about how it got that way." The intent was to assess imaginative writing by asking students to write an "Aesop's Fable." What was actually obtained were thousands of student essays on why camels have humps. Imaginative writing could not be assessed because the students wrote informational pieces instead.
b. Would a math problem assess problem solving if students had just completed several problems just like it? In this case, the problem might be eliciting recall of information, not problem solving.
c. Two sixth-grade teachers asked students to choose a novel and participate weekly in a book group. They planned to assess student reading comprehension, in part, by means of a final project—an oral presentation. What they got was a good assessment of oral presentation skills, but not a good assessment of reading comprehension skills.
The point here is that if you want to assess critical thinking, for example, you have to have a task that results in students thinking critically. Or, if you want to assess, for example, group process skills, you have to give students a task that enables them to work cooperatively.
Ask the participants to provide examples from their own classrooms of tasks or activities that did not work as intended.
To get this point across, ask participants to think of a successful activity they have used in their classroom with students, and what made it so. Give participants 3-4 minutes to come up with a list of attributes that made their classroom activity successful. Then convene the large group and make a list on an overhead. Common responses at all grade levels are: student choice, active, clear instructions, engaging, relevant, organized, students know the purpose, enough time, several ways to approach the task, proper resources, and accommodates different learning styles.
The interesting thing about the list that teachers generate is that this is the same list used by many authors to illustrate what is meant by "authentic" assessment. For example, see Handout A3.1,H1—Criteria for Performance Tasks. Point this out to participants. Good assessment tasks that will elicit complex, meaty performances from students look a lot like good meaty instructional tasks.
Use examples you have discussed previously; or, if this is a stand-alone activity, a thought-provoking set of tasks are listed in the Activity 3.1—Index in Appendix A—Sampler. This set was selected to represent a range of "meatiness." Therefore, these task types can be though of as "anchors" for the "meatiness" scale. Handout A3.1,H2—Mathematical Applications Test can be used to demonstrate how a multiple-choice test measures up on the "meatiness" scale. Overhead A3.1,O2—Sample Oral Presentation Task can be used to illustrate the weak end of the meatiness scale.
On Handout A3.1,O3, you might highlight the features that tend to make a task "meaty." (Remember that one always has to take into account what one is trying to assess when making the call about "meatiness." For example, if you want to see whether students can write an essay, you must have them write an essay; therefore, choice on ways to respond might not be relevant.)
a. "This requires a lot of writing. What about students who don't write well?"
b. "This requires a lot of reading. What about students who are ESL?"
c. "Will students who have had experience with carnivals do better than students who haven't?"
d. "How will group work affect the judgment of ability of individuals?"
When these comments arise, note that this is what is meant by "sources of bias and distortion" (and other terms like reliability and validity) and keep a running list. Then, at this point, have participants brainstorm additions to the list. You could even categorize the list into major sources of bias and distortion: equity, fairness, and bias; extraneous interference; sampling.
This document's URL is:
Home | Up & Coming | Programs & Projects: Assessment | People | Products & Publications | Topics
© 2001 Northwest Regional Educational Laboratory
Email Webmaster
Tel. 503.275.9500