STATS - test and measures midterm 2

studied byStudied by 8 people
0.0(0)
get a hint
hint

Validity

1 / 56

Tags and Description

57 Terms

1

Validity

Defined as the agreement between a test score or measure and the quality is believed to measure

New cards
2

Standards of psychological testing

Foundation, operations and applications

New cards
3

Face validity

Not an actual category. Measure looks like it has validity but it's based on judgment without systematic evidence

New cards
4

Content related evidence for validity

Evidence that show the test adequately covers the content it is supposed to measure

New cards
5

Construct underrepresentation

Failure to capture important components of a construct

New cards
6

Construct irrelevant variance

Where scores are influenced by factors irrelevant to the construct

New cards
7

Criterion related evidence for validity

Evidence that support the test ability to predict or correlate with external criteria

New cards
8

Construct related evidence for validity

It's evidence that supports the underlying theoretical construct being measured by the test

New cards
9

Predictive evidence

How well a test can predict future outcome like SAT

New cards
10

Effect of restricted range

Most of the data point Falls within a small or limited range of values

New cards
11

Concurrent validity

Evaluating whether a new test or questionnaire to provide results that are consistent with an existing measure

New cards
12

Convergent evidence

Measure correlates well with other tests

New cards
13

Discriminant evidence

A test should have low correlation with the measure of unrelated construct

New cards
14

Criteria - refence list

Have items that are designed to match certain specific instructional objective

New cards
15

Relationship between reliability and validity

We can have reliability without validity but we can't have validity without reliability

New cards
16

Item format

The way in which questions or statements are presented in a test such as true or false multiple choice or polytomous formats

New cards
17

Dichotomous format

A type of format where each item provides two Alternatives true or false, one being correct

New cards
18

Polytomous format

A type of item format where each item has more than two alternatives. this is multiple choice

New cards
19

Distractors

Incorrect choices and multiple choice items that test takers can select

New cards
20

Correction for guessing

A formula used to adjust test scores in multiple choice exams to account for the likelihood of obtaining Answers by random guessing

New cards
21

Omitted responses

Answers left blank or not attempted by test takers which can typically not account in correction for guessing formulas

New cards
22

Random guessing

Selecting answers in multiple choice items without any knowledge of the correct answer which may or may not be advantageous

New cards
23

Speeded tests

Test with time constraints where the correction for guessing formula may only include items attempted, making random guessing and leaving items like have the same expected effect

New cards
24

Elimination method

A strategy where test takers eliminate obviously incorrect Alternatives and multiple choice items increasing their chances of getting the right answer

New cards
25

Likert format

A scale that uses strongly disagree to strongly agree to a particular question

New cards
26

Reverse scoring

Reversing the original scoring used to maintain consistency in a scale construction

New cards
27

Category format

Similar to the Likert method, but with greater numbers of choices

New cards
28

Endpoints

The extreme values or labels of the category scale which should be avoided to minimize potential response bias

New cards
29

Context effect

The phenomenon where ratings on a category format skills may change based on the context or grouping of people

New cards
30

Optimal number of categories

The number of response categories and a format scale varies depending on the level of involvement of respondent, considered sufficient for most waiting tasks

New cards
31

Visual analogue scale

A method where there is a scale that is like a line and and you're supposed to Mark between two endpoints

New cards
32

Confidence intervals

A statistical method used to calculate a range of values that is likely to contain a population parameter

New cards
33

Adjective checklist

A method commonly used and personality measurement where subject received a list of adjectives and indicate how characteristic of them

New cards
34

Q sort

Technique that increases the number of responses but have a subject sort statements into nine piles to describe themselves

New cards
35

Forced choice format

Item formats that require subjects to make choices from given alternative

New cards
36

Checklists

A format that has become less popular in the recent years were subject respond to the list of items

New cards
37

Item writing

The process of creating tests items including selecting appropriate format wording and response choices

New cards
38

All of the above

A response option commonly advised against in item writing as it can be problematic and lead to confusion in multiple choice questions

New cards
39

Item analysis techniques

Methods used to evaluate the effectiveness and quality of test items after they have been administered including measures of reliability difficulty and discrimination

New cards
40

Precise language

The use of clear specific on ambiguous wording and test item to ensure they are accurately assesses the intended trait or knowledge

New cards
41

Subject matter knowledge

A deep understanding of the content and concept being tested in order to create accurate effective test item

New cards
42

Item difficulty

In the context of a test that measures achievement or ability, item difficulty is defined by the number of people who answer the particular item correctly

New cards
43

Optimal difficulty level

Ideal level of difficulty for test items usually halfway between 100% correct responses and the level of success expected by chance

New cards
44

Discriminability

And measure of the value of test items assessing the extent to which individuals who perform well on specific items also perform well on the entire test

New cards
45

Extreme group method

A method to assess an item discriminability by comparing performance of individual who have done well to those who have not done well

New cards
46

Point biserial method

And approach the evaluate discriminability of test items by finding the correlation between performance on a specific item and overall test performance

New cards
47

Item characteristic curve

Represents the relationship between the tests items difficulty and the proportion of examines who answer it correctly

New cards
48

Item discriminability

Is the extent to which high performing individuals on a specific item also perform well on the entire test

New cards
49

Difficulty and discriminability

An items difficulty level is essential and items should ideally have a difficulty between 30% and 70%

New cards
50

Item selection

Final version of the test should consider both difficulty and discriminability

New cards
51

Item response Theory

And you were approved to test construction that considers the probability of getting specific items correct based on the individual's ability level.

New cards
52

Computer adaptive testing

Significant advantage of IRT allowing for personalized assessments

New cards
53

Measurement precision

The choice of this design impact the measurement Precision across various ability levels. computer adaptive testing offers the advantage of maintaining consistent measurement position for defined ability levels

New cards
54
New cards
55
New cards
56
New cards
57
New cards

Explore top notes

note Note
studied byStudied by 10 people
Updated ... ago
4.0 Stars(1)
note Note
studied byStudied by 6 people
Updated ... ago
4.0 Stars(1)
note Note
studied byStudied by 9 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 15 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 6 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 14414 people
Updated ... ago
4.8 Stars(125)
note Note
studied byStudied by 7 people
Updated ... ago
5.0 Stars(1)
note Note
studied byStudied by 150 people
Updated ... ago
5.0 Stars(6)

Explore top flashcards

flashcards Flashcard46 terms
studied byStudied by 11 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard33 terms
studied byStudied by 15 people
Updated ... ago
4.5 Stars(91)
flashcards Flashcard39 terms
studied byStudied by 12 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard31 terms
studied byStudied by 7 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard33 terms
studied byStudied by 2 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard30 terms
studied byStudied by 13 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard40 terms
studied byStudied by 27 people
Updated ... ago
5.0 Stars(1)
flashcards Flashcard33 terms
studied byStudied by 33 people
Updated ... ago
5.0 Stars(2)