Language Testing - What Makes a Quality Test
Reliability - the test measures the specific construct consistently and accurately. It discriminates between low scorers and high scorers. You should expect to see similar results from the same test taker with a different version of the test, and across different scorers.
Validity - encapsulates reliability, as well as adequate and appropriate interpretation of scores. Does the test actually measure the specific construct? And is it appropriate to base decision making on evidence of this construct?
Adequate content and skills coverage - The target learning outcomes are covered. The test taker must recognise the correct answer, produce the correct answer or perform in a dynamic context as required.
Practicality - design, administration and scoring are practical.
Impact - the real life consequences are fair. The test will enable decision makers to make good decisions.
Issues/threats to validity
Construct under-representation - the most relevant aspects of the construct are not tested.
Construct irrelevant variance - other factors which are not intended to be measured affect the scores.



















