A test that intends to measure how many items a test-taker can complete within a certain amount of time is a:
speed test
power test
cognitive ability test
achievement test
A method of assessing a test-taker's performance on a normal day is a/an:
direct behavioral assessment
environmental assessment
maximum performance measurement
typical performance measurement
Which scaling method classifies data into non-ordered, mutually exclusive categories, such as marital status?
ordinal
interval
nominal
ratio
What type of sampling procedure involves individuals being chosen entirely by chance, therefore making it so that each member of the population has an equal chance of being included in the sample?
Cluster sample
Simple random sample
Stratified sample
Normative sample
If every test-taker got a certain test item incorrect, what would the item difficulty index be?
0
1
-1
What method allows the test author(s) to compare test performances from one group to the next?
Criterion-referencing
Cross-validation
Norm-referencing
Item discrimination
The systematic collection and analysis of data is known as:
assessment
a measure
a test
standardization
The four purposes of assessment are screening, diagnosis, , and progress evaluation.
A test that compares a person's test score to a predetermined standard or level of performance is a/an:
norm-referenced test
objective test
subjective test
criterion-referenced test
The main types of cognitive ability tests are intelligence, aptitude affective verbal( aptitude, affective, verbal ), and achievement tests.
An assessment that requires the test-taker to manipulate materials or to select visual stimuli with minimal or no use of language is a:
non-verbal test
performance assessment
Which scaling method has ordered response categories and equal distance between score points, but no absolute zero point?
Ordinal
Ratio
Nominal
Interval
Which scaling level has the highest level of specificity?
An assessment must include historical and data, biopsychosocial information, an interview, and .
A standardized procedure for sampling behavior and describing it with categories or scores, then compared to norms, is a/an:
measure
test
Essays, constructed responses, and open-ended questions are examples of a/an:
A test that predicts a person's capacity to perform some skill or task in the future is a/an:
verbal test
intelligence test
aptitude test
A type of behavioral assessment in which instruments, such as rating scales or checklists, are given to people in a good position to observe the client's typical behavior is a/an:
anecdotal observation
indirect observation
In ordinal data, there is equality in spacing among items.
Which scaling method has an absolute zero point?
What type of sampling procedure involves dividing the entire population into different subgroups, then randomly selecting the final subjects proportionally from the different subgroups?
Representative sample
Which levels of measurement rely on the parameter/normal curve?
Nominal and ordinal
Nominal and ratio
Interval and ratio
Ordinal and interval
When constructing items, the test author(s) should avoid the ceiling effect because it prevents them from measuring low performers.
A negatively discriminating item is one that is:
failing to indicate a relationship between correct response and test performance
answered correctly more often by those who perform poorly on the test
answered correctly more often by those who perform well on the test
With regard to informed consent and assessment, the three conditions that must be met for the client are , , and voluntary-ness.
An advantage of this type of test is that it allows comparability of scores and interpretations across different test-takers:
standardized test
non-standardized test
A test that measures knowledge a person has acquired through instruction or training up to a certain point in his or her academic career is an:
affective test
A type of behavioral observation in which the counselor is physically present in the same environment as the client and collects data on the frequency, duration, and/or magnitude of one or more target behaviors is a/an:
formative evaluation
A requirement for a norm-referenced test is that its makeup must approximate a normal curve.
Interval and ratio data are also known as:
categorical data
continuous data
This type of scale consists of gathering statements, having experts rank them from 1-11, doing calculations, and creating the scale:
expert ranking scale
equal-appearing interval (Thurstone)
Likert scale
Guttman scale
If every test-taker got a certain test item correct, what would the item difficulty index be?
A positively discriminating item is one that is:
This measure of central tendency is sensitive to extreme scores:
Mean
Median
Mode
Ethan scored one standard deviation above the mean. This means he performed better than what percentage of the class?
84%
34.13%
15.75%
2.14%
A test result can stand alone, and can indicate something about a client.
A good way to prevent confirmation bias, which is the tendency to search for, interpret, favor, and recall information in a way that confirms one's preexisting beliefs or hypothesis, is to:
keep to one hypothesis; just being mindful of confirmation bias is preventative
have multiple hypotheses
A test that intends to measure the skill or abilities possessed by a test-taker without the pressure of time limits is a/an:
Kelly enlisted the help of a school counselor to see how her classroom could be altered to better accommodate a particular student's needs. Which kind of assessment would the counselor likely use?
Direct behavioral assessment
Indirect observation
Anecdotal observation
Environmental assessment
Speed tests include items of widely varying difficulty.
The number of incorrect items an examiner must obtain before test administration can be halted is known as the:
ceiling series
basal series
ceiling effect
floor effect
If the mean, median, and mode are the same, then the distribution is:
positively skewed
negatively skewed
normal
This measure of central tendency is the exact midpoint of a distribution:
mode
mean
median
This type of bias in assessment involves test material being more familiar to one group than another:
Content
Internal structure
Predictive
A test that is administered during treatment or instruction, with the intention of informing the author of its effectiveness, is known as a formative evaluation.
In this approach to test development, two disparate groups are taken - one that displays a trait or behavior and one that does not - and administered questions; whichever items most differentiate between the two groups are kept.
Rational scale construction
Expert-ranking scale
Empirical keying
Thurstone scale
If 3/5 of the top scorers answer an item correctly, and 3/5 of the low scorers answer the same item correctly, the item would be:
positively discriminating
negatively discriminating
nondiscriminating
A test's starting point is typically linked to a test-taker's:
sex
age
race
grade
This type of bias involves issues with reliability (i.e., scores may be reliable for one group but not reliable for another):
Internal structure bias
Confirmation
This measure of central tendency is the most frequently occurring score: