Test ItemsTest items are the best instrument to measure how well students learned what you intended (learning objectives), as well as how effective instruction was.
|
|
The test scores can be translated into two major instructional findings:
In other words, if few students fail a certain question, these students need remediation but if a great number of students don't perform well, the instruction itself and/or the test item needs to be enhanced.
|
|
The test items that you write must assess the exact performances called for in the learning objectives. The conditions, or givens, in an objective must also be included in the assessment. A good way to determine whether the desired relationship exists between the objective and test items is to answer the following questions:
Example Learning Objective: Non-Appropriate Assessment Item: Appropriate Assessment Item:
|
|
Your test items should clearly indicate the nature of the response that is expected from the student. When your students are unsure about what is expected, their responses may not accurately reflect their knowledge related to an item. Also, always try to keep your test items free of prompts or clues that students can use to determine correct answers.
|
|
The question focus on an important concept, typically a common or potentially catastrophic clinical problem. |
|
The stem must pose a clear question that has a definite answer. The students should be able to read the question and answer the question without reference to the alternatives. |
|
The question should assess application of knowledge, not recall of an isolated fact (i.e. "two-step" thinking.) |
|
All distracters (incorrect options) should be homogenous. The use of options that are plausible, even close, but not the best, forces discrimination. |
|
Questions should not have technical flaws that provide benefit to test-wise examinees, as for example negatively phrased questions and grammatical clues. |
|
When possible, questions should require interpretation of data or information from an image, graph or chart. |
|
Questions should be multiple-choice with at least five choices but if the choices are a list of drugs or structures or enzymes or diseases, the list can be up to ten choices. |
|
Whenever possible, a question should use a patient vignette in the stem of the question. Take a look at some templates from the National Board of Medical Examiners (NBME)'s "Constructing Written Test Questions for the Basic and Clinical Sciences" manual. |
|
Single correct answer should be indicated for all questions. |
|
|
|
Following are some common practices that result in poorly-written test items :
|
|
Unnecessarily complexity may be due to the amount of information included in the item, to poor grammar, or both. |
|
The basis for ordering things should be specified in the assessment item. For example, putting planets in a certain order can be done in several ways: by distance from the sun, by size, or by alphabetical order. |
|
Most things can be described in at least two ways: by their physical features or by their functions. Test items should indicate both what is to be described and what is to be included in the description. |
|
The use of absolutes (e.g., always, never, only, no) should be avoided. Not only absolute statements are usually incorrect, but also some student will know an exception to the keyed answer and get confused or panic. |
|
When multiple-choice items contain alternatives that are obviously incorrect, students have a greater chance of selecting the correct choice. |
|
When number of items to be paired in matching items is the same it increases the chance of students to guess correctly on items not previously learned. |
|
Articles such as "a" and "an", plural word forms, and gender forms may provide clues to correctly answering without having learned the content. |
|
The students should be able to read the question and answer the question without reference to the alternatives -- and their answer should be among the alternatives when they get there... In other words, the stem asks a question that has a definite answer. |
|
Anytime you have a key word that would change the answer if the student misses it, boldface it, CAPITALIZE it, underline it or all of the PREVIOUS. |
|
When best-answer items are used, the qualifier in the stem needs to be emphasized and also you must specify in what way an answer is the "best". |
|
Negatively worded stems should be avoided. |
|
Include as much information in the stem and as little in the options as possible. For example, if the point of an item is to associate a term with its definition, the preferred format would be to present the definition in the stem and several terms as options rather than to present the term in the stem and several definitions as options. |
|
Don't use "all of the above." Recognition of one wrong option eliminates "all of the above," and recognition of two right options identifies it as the answer, even if the other options are completely unknown to the student. All of the above can also be unfair, because it means answer (A) is correct, so student anticipates (A), looks down, sees (A), chooses it, then moves on. Gets it wrong, even though knew (A) was the correct answer -- gets answer right but question wrong. |
|
The use of trickery must be avoided. There is no point in finding out if you can trick the students -- you can.
|
|
Templates can be very helpful when creating questions. The following information about question templates is an excerpt from the National Board of Medical Examiners (NBME)'s "Constructing Written Test Questions for the Basic and Clinical Sciences" manual. You can download the full version of the manual, as a PDF file. |
|
The overall structure of an item can be depicted by an item template. You can typically generate many items using the same template. For example, the following template could be used to generate a series of questions related to gross anatomy: This is a question that could be written using this template:
|
|
Many basic science questions can be presented within the context of a patient vignette. The patient vignettes may include some or all of the following components:
|
|
|
|
Which of the following is abnormal? sites of lesions; |
|
Which of the following findings is most likely? laboratory results; |
|
Which of the following is the most likely cause? underlying mechanisms of the disease; |
|
Which of the following should be administered? drugs; |
|
Which of the following is defective/deficient/nonfunctioning? enzymes; |
|
Copyright © 1996, 1998 National Board of Medical Examiners® (NBME®) |