Teaching Item Bank — ImplicitifyAI

Research Methods rm-001

Random assignment to conditions primarily protects which type of validity?

External validity
Internal validity
Construct validity
Statistical conclusion validity

Answer: Internal validity

Random assignment equates groups in expectation, ruling out confounds and supporting causal (internal-validity) claims.

Research Methods rm-002

A researcher studies whether income predicts well-being using survey data with no manipulation. This is best described as:

A true experiment
An observational (correlational) design
A randomized controlled trial
A factorial experiment

Answer: An observational (correlational) design

With no manipulation or random assignment, the design is observational and supports association but not strong causal claims.

Research Methods rm-003

Selecting participants because they scored extremely low and retesting them later risks which threat?

Maturation
Regression to the mean
Instrumentation
Demand characteristics

Answer: Regression to the mean

Extreme scores tend to move toward the mean on retest regardless of any intervention.

Research Methods rm-004

The degree to which findings generalize to other people and settings is:

Internal validity
External validity
Face validity
Reliability

Answer: External validity

External validity concerns generalization across populations, settings, and times.

Research Methods rm-005

Differential attrition across conditions is a threat mainly because it can:

Increase statistical power
Make groups non-equivalent by the end of the study
Improve construct validity
Eliminate confounds

Answer: Make groups non-equivalent by the end of the study

If dropout differs by condition, the groups are no longer equivalent, reintroducing confounding.

Statistics st-001

A p-value of .03 means:

There is a 3% chance the null hypothesis is true
Assuming the null is true, data this extreme (or more) occur 3% of the time
The effect is large
The result will replicate 97% of the time

Answer: Assuming the null is true, data this extreme (or more) occur 3% of the time

A p-value is computed assuming the null is true; it is the probability of data at least as extreme as observed.

Statistics st-002

Failing to reject a false null hypothesis is a:

Type I error
Type II error
Sampling error
Measurement error

Answer: Type II error

A Type II error (beta) is a false negative — missing a true effect.

Statistics st-003

Statistical power is defined as:

1 − alpha
1 − beta
The p-value
The effect size

Answer: 1 − beta

Power = 1 − beta, the probability of detecting a true effect of a given size.

Statistics st-004

Which Cohen's d is conventionally considered a 'medium' effect?

0.20
0.50
0.80
1.00

Answer: 0.50

Cohen's conventions: d ≈ .2 small, .5 medium, .8 large.

Statistics st-005

A 95% confidence interval is best interpreted as:

A 95% probability the true value lies in this specific interval
A range of plausible parameter values; 95% of such intervals capture the parameter over repeated sampling
The range containing 95% of the data
The standard error times 95

Answer: A range of plausible parameter values; 95% of such intervals capture the parameter over repeated sampling

The 95% refers to the long-run capture rate of the procedure across repeated samples.

Psychometrics ps-001

Cronbach's alpha is an index of:

Criterion validity
Internal-consistency reliability
Test difficulty
External validity

Answer: Internal-consistency reliability

Alpha summarizes how consistently a set of items measure the same thing.

Psychometrics ps-002

Reliability and validity are related such that:

A valid test must be reliable
A reliable test must be valid
They are unrelated
Validity caps reliability

Answer: A valid test must be reliable

Reliability sets a ceiling on validity; a measure cannot be valid for a purpose if it is not reliable.

Psychometrics ps-003

Convergent and discriminant evidence are forms of:

Content validity
Construct validity
Inter-rater reliability
Face validity

Answer: Construct validity

Convergent (relates to similar constructs) and discriminant (unrelated to different constructs) evidence support construct validity.

Psychometrics ps-004

An item-total correlation near zero suggests the item:

Is too difficult
Does not cohere with the rest of the scale
Has high discrimination
Is perfectly reliable

Answer: Does not cohere with the rest of the scale

Low item-total correlation indicates the item is not measuring the same construct as the scale.

Psychometrics ps-005

A T-score has a mean and standard deviation of:

0 and 1
50 and 10
100 and 15
5 and 2

Answer: 50 and 10

T-scores are standardized to M = 50, SD = 10.

Assessment as-001

A score above a depression screener's cut-off indicates:

A confirmed diagnosis
That a fuller clinical evaluation may be warranted
Treatment is unnecessary
The person is malingering

Answer: That a fuller clinical evaluation may be warranted

Screeners flag the possible need for further assessment; they do not diagnose.

Assessment as-002

Specificity of a screening test refers to:

The proportion of true cases correctly identified
The proportion of non-cases correctly identified
The total accuracy
The base rate of the disorder

Answer: The proportion of non-cases correctly identified

Specificity is the true-negative rate — non-cases correctly screened out.

Assessment as-003

Combining self-report with a performance task is an example of:

Mono-method assessment
Multi-method assessment
Criterion contamination
Norm referencing

Answer: Multi-method assessment

Using different methods reduces shared-method bias and strengthens conclusions on convergence.

Assessment as-004

Reporting a confidence range around a score rather than a single number reflects attention to:

Measurement error
Random assignment
Demand characteristics
Publication bias

Answer: Measurement error

Every observed score contains measurement error; a confidence range communicates that uncertainty.

Assessment as-005

Norm-referenced interpretation is valid only when:

The test is brand new
The examinee resembles the standardization sample
The score is above average
No cut-off is used

Answer: The examinee resembles the standardization sample

Norms generalize only to populations resembling the sample on which they were established.