3rd Year Stats Exam

Question

What is the purpose of performing a linear regression analysis?

Answer 1

To identify potential outliers in the data

Answer 2

To fit the data to a model that defines y as a function of 2 or more variables

Answer 3

To determine the dependence of a dependent variable on a predictor/independent variable

Answer 4

To perform multiple comparisons whilst controlling overall type 1 error rate

Answer 5

To derive robust confidence intervals

Answer 6

The size of the force which the points exert on the line of best fit

Answer 7

The leverage of those data points

Answer 8

The fit and slope of the model

Answer 9

The centre of gravity and pivot point of the data

Answer 10

How well the model fits the data (0 - 1)

Answer 11

The slope coefficient

Answer 12

The distribution of the residuals

Answer 13

The level of multicollinearity in the model

Answer 14

The model can explain 68% of the data and for every unit of independent variable, the dependent variable goes up 12 units

Answer 15

The fit of the model to the data is 0.12% and the influence that the data points have on the model is 0.68%

Answer 16

The data points have an influence of 68% on the model and 12% on the outcome

Answer 17

The model can explain 6.8% of the data and for every unit of independent variable, the dependent variable goes up 0.12 units

Answer 18

Standardised residual >2 is worth checking, if more than 5% of the residuals >2 may indicate that the model is a poor fit

Answer 19

Standardised residual >3 is worth checking, if more than 5% of the residuals >2 may indicate that the model is a poor fit

Answer 20

Standardised residual >2.5 is worth checking, if more than 5% of the residuals >2 may indicate that the model is a poor fit

Answer 21

Standardised residual >3 is worth checking, if more than 10% of the residuals >2 may indicate that the model is a poor fit

Answer 22

influence of data point on predicted values (0 = no influence, 1 = complete influence)

Answer 23

standardised measures of how much each element of the model would change if data point was removed (values >1 = substantial influence)

Answer 24

how susceptible the mean is to being biased by the outliers present in the data

Answer 25

measure of overall influence of each individual data point on the overall model (>1 = concern)

Answer 26

influence of data point on predicted values (0 = no influence, 1 = complete influence)

Answer 27

standardised measures of how much each element of the model would change if data point was removed (values > 1 = substantial influence)

Answer 28

measure of overall influence of each individual data point on the overall model (> 1 = concern)

Answer 29

precisely how large the standardised residuals are

Answer 30

they are standardised measures of how much each element of the model would change if that data point was removed (values > 1 = substantial influence)

Answer 31

they indicate the influence of that data point on predicted values (0 = no influence, 1 = complete influence)

Answer 32

whether or not the standardised residuals are worth checking and if they indicate that the model is a poor fit

Answer 33

they summarise the equation: 2(k+1)/n where k = number of predictors and n = number of data points

Answer 34

if >2(k+1)/n where k = number of predictors (2 for simple linear regression) and n = number of data points

Answer 35

if >2(k+1)/n where k = number of predictors (1 for simple linear regression) and n = number of data points

Answer 36

if >2(k+1)/n where k = number of data points and n = number of predictors (1 for simple linear regression)

Answer 37

if >n(k+1)/2 where k = number of predictors (1 for simple linear regression) and n = number of data points

Answer 38

if >2(n+1)/k where k = number of predictors (1 for simple linear regression) and n = number of data points
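As a worked illustration of the leverage cut-offs listed in the options above, the sketch below evaluates the commonly quoted rule of thumb 2(k+1)/n, taking k as the number of predictors and n as the number of data points. The function name and the example numbers are hypothetical and only meant to show the arithmetic.

```python
def leverage_threshold(k: int, n: int) -> float:
    """Rule-of-thumb leverage cut-off 2(k+1)/n (k predictors, n data points)."""
    if n <= 0:
        raise ValueError("n must be positive")
    return 2 * (k + 1) / n


# Simple linear regression (k = 1) on a hypothetical sample of 20 points.
threshold = leverage_threshold(k=1, n=20)
print(threshold)
```

Under this rule of thumb, a data point whose leverage exceeds the printed value would be one to look at more closely.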

Answer 39

fits the data to a model that defines y as a function of 2 or more variables - determines the effect of an independent variable on the dependent variable taking account of other variables

Answer 40

provides an analysis of variance and determines if an interaction is present in the data

Answer 41

determines the dependence of a dependent variable on a predictor/independent variable and allows outliers to be identified from x, y plot or from standardised residual plot

Answer 42

y = b0 + b1x2 + b2x1

Answer 43

y = b0 + b1x1 + b2x2 +....

Answer 44

y = b0 + b1 + b2x

Answer 45

y = b0 + b1x1 + b2x2

Answer 46

the average variability due to the model divided by the average variability due to the residuals

Answer 47

the unexplained variability divided by the variability due to the model

Answer 48

the signal to noise ratio multiplied by the number of data points

Answer 49

the variance in the model divided by the R-squared value

Answer 50

one slope parameter

Answer 51

two degrees of freedom

Answer 52

one degree of freedom

Answer 53

one R-squared value

Answer 54

standard errors of b coefficients decrease therefore confidence increases

Answer 55

limits F-ratio value and variance inflation factor

Answer 56

coefficients become stable

Answer 57

standard errors of b coefficients increase and therefore confidence decreases

Answer 58

A VIF > 5 or an average VIF > 2 is problematic

Answer 59

A VIF > 10 or an average VIF > 1 is problematic

Answer 60

A VIF > 2 or an average VIF > 1 is problematic

Answer 61

A VIF > 10 or an average VIF > 2 is problematic

Answer 62

< 5 is problematic

Answer 63

< 10 is problematic

Answer 64

< 2 is problematic

Answer 65

< 0.1 is problematic

Answer 66

< 1 is problematic

Answer 67

when predicting y using the multiple regression equation

Answer 68

when you want to look inside the model at the effect of individual predictors

Answer 69

when you want to perform separate correlations for each x variable

Answer 70

when you want to quantify the relationship between an independent and dependent variable

Answer 71

always take a collinear variable out

Answer 72

combine predictors into a single predictor (as long as it makes biological sense)

Answer 73

rely on automatic variable selection

Answer 74

remove all outliers

Answer 75

Cook's distance

Answer 76

variance inflation factor

Answer 77

tolerance factor

Answer 78

dependent variables = quantitative or categorical; predictor variable = qualitative and continuous

Answer 79

dependent variables = qualitative; predictor variable = continuous

Answer 80

dependent variables = qualitative and continuous; predictor variable = quantitative or categorical

Answer 81

dependent variables = continuous or categorical; predictor variable = quantitative or categorical

Answer 82

assess the DFFit and DFBeta values

Answer 83

use the Welch's test

Answer 84

use Gabriel's test

Answer 85

use the Durbin-Watson test

Answer 86

10 times the number of predictors tested

Answer 87

5 times the number of predictors tested

Answer 88

at least 30

Answer 89

at least 40

Answer 90

effect of height + effect of weight = overall effect on SBP

Answer 91

effect of height + overall effect on SBP = effect of weight

Answer 92

effect of height x effect of weight = overall effect on SBP

Answer 93

paired t-test

Answer 94

unpaired t-test

Answer 95

unpaired, two-tailed t-test

Answer 96

paired, one-tailed t-test

Answer 97

unpaired t-test

Answer 98

paired t-test

Answer 99

simple linear regression

Answer 100

multiple regression

Answer 101

analyses how much of the overall variance can be explained by variation between group means compared to the unexplained variation within a group

Answer 102

fits data to a model that defines y as a function of 2 or more variables

Answer 103

performs separate correlations for each x variable

Answer 104

determines the dependence of a dependent variable on a predictor/independent variable

Answer 105

total squares divided by the degrees of freedom

Answer 106

the F-ratio

Answer 107

the difference between each individual data point and the overall mean

Answer 108

error mean squares divided by degrees of freedom

Answer 109

higher the larger the difference of the group means from the overall mean and smaller the amount of random variability

Answer 110

lower the larger the difference of the group means from the overall mean and smaller the amount of random variability

Answer 111

higher the larger the difference of the group means from the overall mean and larger the amount of random variability

Answer 112

higher the smaller the difference of the group means from the overall mean and smaller the amount of random variability

Answer 113

when effect size is large

Answer 114

when the F-ratio is high

Answer 115

when the degrees of freedom are greater than 10

Answer 116

when the group sizes are equal

Answer 117

Games-Howell's

Answer 118

Durbin-Watson's

Answer 119

performing multiple comparisons whilst controlling overall type 2 error rate

Answer 120

performing multiple comparisons whilst controlling overall type 1 error rate

Answer 121

when there is a specific hypothesis to be tested

Answer 122

when group size is not equal

Answer 123

sample sizes are unequal

Answer 124

sample sizes are equal

Answer 125

you require good trade-off between type 1 and type 2 errors

Answer 126

you are interested in comparing all groups vs a single control group

Answer 127

you wish to cut down on the number of comparisons that you make

Answer 128

when you don't need a high level of confidence

Answer 129

when you aren't performing multiple comparisons

Answer 130

when you require a conservative test

Answer 131

when you need a high level of confidence

Answer 132

when sample sizes are equal

Answer 133

when interested in comparing all groups versus a single control group

Answer 134

when sample sizes are equal

Answer 135

when you want to cut down comparisons

Answer 136

when you want a good trade-off between type 1 and type 2 errors

Answer 137

a false -ve

Answer 138

a type 2 error

Answer 139

a false +ve

Answer 140

biased data

Answer 141

Hochberg's GT2

Answer 142

Games-Howell

Answer 143

Hochberg's GT2

Answer 144

Games-Howell

Answer 145

ten times the number of contrasts

Answer 146

more contrasts

Answer 147

one more contrast

Answer 148

two times the number of contrasts

Answer 149

one fewer contrast

Answer 150

enter weights for most of the variables

Answer 151

trust p-values as you aren't inflating the type 1 error rate

Answer 152

ignore the F-ratio value and R-squared value

Answer 153

not worry about doing any corrections for multiple comparisons

Answer 154

Planned contrasts

Answer 155

Orthogonal contrasts

Answer 156

Polynomial contrasts

Answer 157

each level of one factor is tested against at least one level of the other

Answer 158

performs multiple comparisons whilst controlling overall type 1 error rate

Answer 159

divides total variability in the data set into different sources

Answer 160

each level of one factor is tested at each level of the other

Answer 161

independent ANOVA

Answer 162

one-way ANOVA

Answer 163

repeated measures ANOVA

Answer 164

multiple linear regression

Answer 165

there is no unnecessary variability in the data

Answer 166

there is no interaction

Answer 167

the sphericity assumption is met

Answer 168

group sizes are equal

Answer 169

there is a less than 0.5% chance of committing a type 1 error

Answer 170

there is a less than 5% chance of committing a type 2 error

Answer 171

there is a less than 5% chance of committing a type 1 error

Answer 172

there is a less than 0.5% chance of committing a type 2 error

Answer 173

Image:

f2038083-2abf-4c37-83b4-13dcabfdd2e3 (image/png)

Answer 174

Image:

3e4d5d8c-a9ab-4f9e-abc9-b557d619af59 (image/png)

Answer 175

Image:

92d72b11-5b7f-4755-a40d-ff83c6f01bc7 (image/png)

Answer 176

1 - type 1 error rate

Answer 177

1 - type 2 error rate

Answer 178

1 - (type 1 + type 2 error rate)

Answer 179

type 2 error rate - type 1 error rate

Answer 180

increasing effect size. decreasing random variation. decreasing sample size.

Answer 181

increasing effect size. increasing random variation. increasing sample size

Answer 182

decreasing effect size. increasing random variation. increasing sample size

Answer 183

increasing effect size. decreasing random variation. increasing sample size

Answer 184

Welch's correction

Answer 185

Games-Howell test

Answer 186

Sidak correction

Answer 187

standard error of the proportion

Answer 188

(row total + column total)/overall total

Answer 189

(row total - column total)/overall total

Answer 190

(row total x column total)/overall total

Answer 191

df = (rows - 1) x (columns -1)

Answer 192

df = (rows + 1) x (columns +1)

Answer 193

df = (rows - 1) / (columns -1)

Answer 194

df = (rows x 2) + (columns x 2)
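The expected-frequency and degrees-of-freedom formulas in the options above can be checked on a small contingency table. The sketch below assumes the standard chi-square conventions, expected count = (row total x column total)/overall total and df = (rows - 1) x (columns - 1); the function names and the 2 x 2 table are hypothetical.

```python
def expected_counts(table):
    """Expected count per cell: (row total x column total) / overall total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    return [[r * c / total for c in col_totals] for r in row_totals]


def chi_square_df(table):
    """Degrees of freedom: (rows - 1) x (columns - 1)."""
    return (len(table) - 1) * (len(table[0]) - 1)


# Hypothetical 2 x 2 table of observed counts.
observed = [[10, 20], [30, 40]]
expected = expected_counts(observed)
df = chi_square_df(observed)
print(expected, df)
```

For this table the row totals are 30 and 70, the column totals 40 and 60, and the overall total 100, so the first expected count is 30 x 40 / 100.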

Answer 195

The assumption that at least 50% of expected frequency must be more than or equal to 5

Answer 196

Dunnett's test

Answer 197

The assumption that at least 80% of expected frequency must be more than or equal to 5

Answer 198

The same assumptions as multiple linear regression

Answer 199

Image:

c381313b-6b41-4222-9fb1-ff2f06bab05f (image/png)

Answer 200

Image:

7c70109c-216c-42a8-b0cf-2c8ea8d98113 (image/png)

Answer 201

Image:

af436a6c-5311-4a8b-b044-0cb75ac04b0f (image/png)

Answer 202

Image:

046a1efc-e4cc-4bf2-8b89-a68508d03b61 (image/png)

Answer 203

probes where a certain effect is happening

Answer 204

performs an ANOVA to allow you to reject/accept a null hypothesis

Answer 205

analyses the differences between levels of one variable

Answer 206

performs multiple comparisons whilst controlling overall type 2 error rate

Answer 207

Bonferroni

Answer 208

independent

Answer 209

not independent

Answer 210

naturally paired

Answer 211

sorted into even group sizes

Answer 212

“noise” in the relationship between the independent variables and the dependent variable is the same across all values of the independent variables

Answer 213

equality of differences between linked values in each group

Answer 214

well-modeled by a normal distribution and likely for a random variable underlying the data set to be normally distributed

Answer 215

residuals are (roughly) normal and (approximately) independently distributed with a mean of 0 and some constant variance

Answer 216

Welch's correction

Answer 217

Games-Howell test

Answer 218

Gabriel's test

Answer 219

Mauchly's test

Answer 220

Mauchly's test

Answer 221

Welch's correction

Answer 222

Greenhouse-Geisser correction

Answer 223

Gabriel's test

Answer 224

Bonferroni

Answer 225

Durbin-Watson test

Answer 226

Friedman test

Answer 227

Gabriel's test

Answer 228

Hochberg's GT2 test

Answer 229

Non-linear regression

Answer 230

Two-way ANOVA

Answer 231

Independent ANOVA

Answer 232

Mixed model ANOVA

Answer 233

Repeated measures ANOVA

Answer 234

the fit will automatically improve

Answer 235

there is a risk of over-fitting the model

Answer 236

the significance level associated with the F-statistic changes

Answer 237

the R-squared value will increase

Answer 238

to ensure that the interaction between the variables is taken into account

Answer 239

to ensure that the computer has found the global minimum

Answer 240

to ensure that the computer has found the local minimum

Answer 241

to ensure that an accurate scientific relationship is found

Answer 242

add all the standard deviations together and square that value

Answer 243

square the mean from each sample and add those together

Answer 244

square each standard deviation and add them all together

Answer 245

square each standard deviation and add this to the variance

Answer 246

dividing the standard deviations by the degrees of freedom

Answer 247

dividing the sum of squares by the degrees of freedom

Answer 248

multiplying the degrees of freedom by the mean

Answer 249

multiplying the standard deviations by the sum of squares
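To make the mean-squares and F-ratio options concrete, here is a minimal sketch for a one-way layout. It assumes the usual definitions: a mean square is a sum of squares divided by its degrees of freedom, and F is the model (between-group) mean square divided by the residual (within-group) mean square. The function name and the three groups of data are hypothetical.

```python
def one_way_anova_f(groups):
    """Return (ms_model, ms_resid, F) for a list of groups of numbers."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    means = [sum(g) / len(g) for g in groups]

    # Variability of the group means around the grand mean (model).
    ss_model = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Variability of each value around its own group mean (residual).
    ss_resid = sum((v - m) ** 2 for g, m in zip(groups, means) for v in g)

    df_model = len(groups) - 1
    df_resid = len(all_values) - len(groups)

    ms_model = ss_model / df_model  # mean square = SS / df
    ms_resid = ss_resid / df_resid
    return ms_model, ms_resid, ms_model / ms_resid


# Hypothetical data: three groups of three observations each.
ms_model, ms_resid, f_ratio = one_way_anova_f([[1, 2, 3], [2, 3, 4], [5, 6, 7]])
print(ms_model, ms_resid, f_ratio)
```

Here the model sum of squares is 26 on 2 degrees of freedom and the residual sum of squares is 6 on 6 degrees of freedom, so F grows as the group means move apart or as the within-group scatter shrinks.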

Answer 250

the population mean is the height and the sum of squares is the distance from the midline of the curve to the edge

Answer 251

the variance is the height and the population mean is the distance from the midline of the curve to the edge

Answer 252

the population standard deviation is the height and the population mean is the distance from the midline of the curve to the edge

Answer 253

the population mean is the height and the population standard deviation is the distance from the midline of the curve to the edge

Answer 254

(x - mean) /sd

Answer 255

(x - sd)/mean

Answer 256

(mean-x)/sd

Answer 257

(x + mean)/sd
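For reference, the standard z-score expresses a value as its distance from the mean in units of standard deviation, z = (x - mean)/sd. A minimal sketch, with a hypothetical function name and made-up numbers:

```python
def z_score(x: float, mean: float, sd: float) -> float:
    """Distance of x from the mean, in standard deviations."""
    if sd <= 0:
        raise ValueError("sd must be positive")
    return (x - mean) / sd


# Hypothetical example: a value of 130 in a population with mean 100, sd 15.
z = z_score(130, mean=100, sd=15)
print(z)
```

A positive result means the value lies above the mean, a negative one below it.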

Answer 258

approximately 99% of normally-distributed values lie between +- 2 sds from the mean

Answer 259

approximately 95% of normally-distributed values lie between +-2 sds from the mean

Answer 260

approximately 99.9% of normally-distributed values lie between +- 2.6 sds from the mean

Answer 261

approximately 99% of normally-distributed values lie between +- 2.6 sds from the mean

Answer 262

approximately 99.9% of normally-distributed values lie between +- 3 sds from the mean

Answer 263

approximately 95% of normally-distributed values lie between +- 3 sds from the mean

Answer 264

SEM = sd x square root of n and therefore a 95% CI would be +- 1.96 x SEM

Answer 265

SEM = sd/square root of n and therefore a 95% CI would be +- 3 x SEM

Answer 266

SEM = sd x square root of n and therefore a 95% CI would be +- 2.6 x SEM

Answer 267

SEM = sd/square root of n and therefore a 95% CI would be +- 1.96 x SEM
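The standard error of the mean and the interval built from it can be sketched as below. This assumes the usual definitions, SEM = sd/sqrt(n) and a normal-approximation 95% interval of mean +- 1.96 x SEM (for small samples a t multiplier would normally replace 1.96). The function names and numbers are hypothetical.

```python
import math


def sem(sd: float, n: int) -> float:
    """Standard error of the mean: sd divided by the square root of n."""
    return sd / math.sqrt(n)


def ci95(mean: float, sd: float, n: int):
    """Normal-approximation 95% confidence interval for the mean."""
    half_width = 1.96 * sem(sd, n)
    return mean - half_width, mean + half_width


# Hypothetical sample: mean 50, sd 10, n = 25.
standard_error = sem(10, 25)
lower, upper = ci95(50, 10, 25)
print(standard_error, lower, upper)
```

Because n sits under a square root in the denominator, quadrupling the sample size only halves the width of the interval.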

Answer 268

P < 0.05 means that 5% of the results arose by chance if the null hypothesis is true

Answer 269

P < 0.05 means <5% probability of the results arising by chance if the null hypothesis is true

Answer 270

P < 0.05 means <0.05% probability of the results arising by chance if the null hypothesis is true

Answer 271

P < 0.05 means <0.5% probability of the results arising by chance if the null hypothesis is true

Answer 272

type 1 error rate is conventionally set to 5% ( P < 0.05)

Answer 273

type 2 error rate is conventionally set to 5% ( P < 0.05)

Answer 274

type 1 error rate = 1 - power

Answer 275

type 2 error rate = 1 - power

Answer 276

if you accept a statistical power of 80% it will mean that you have a type 2 error rate of 20%

Answer 277

if you accept a statistical power of 80% it will mean that you have a type 1 error rate of 20%

Answer 278

you will increase the chance of making a type 2 error

Answer 279

you will increase the chance of making a type 1 error

Answer 280

you will inflate your p-value

Answer 281

you will decrease your p-value

Answer 282

+- 0.5 is a large effect

Answer 283

+- 0.1 is a small effect

Answer 284

+- 1 is a small effect

Answer 285

it measures how close the data points are to a straight line that best describes the linear relationship

Answer 286

r = +0.1 refers to a perfect straight line with a positive slope

Answer 287

r = -1 refers to a perfect straight line with a negative slope

Answer 288

by minimising the total sum of squares

Answer 289

by minimising the sum of squares of the residuals

Answer 290

by creating an equation which fits the model best

Answer 291

by entering the data into the computer in Hierarchical form

Answer 292

1 - (SS of the residuals/total SS)

Answer 293

1 + (SS of the residuals/total SS)

Answer 294

1 - (total SS/SS of the residuals)

Answer 295

1 + (total SS/SS of the residuals)
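The options above on fitting the line and computing R-squared can be illustrated together. The sketch below assumes ordinary least squares for a single predictor, which chooses the slope and intercept that minimise the residual sum of squares, and then reports R-squared as 1 - (SS of the residuals/total SS). The function name and the five data points are hypothetical.

```python
def fit_line_and_r_squared(xs, ys):
    """Least-squares slope, intercept and R-squared for one predictor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n

    # Closed-form least-squares estimates for simple linear regression.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x

    ss_resid = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_total = sum((y - mean_y) ** 2 for y in ys)
    return slope, intercept, 1 - ss_resid / ss_total


# Hypothetical data.
slope, intercept, r_squared = fit_line_and_r_squared([1, 2, 3, 4, 5], [1, 3, 2, 5, 4])
print(slope, intercept, r_squared)
```

For these points the residual sum of squares is 3.6 against a total of 10, so the fitted line accounts for 64% of the variability in y.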

Answer 296

check for sphericity

Answer 297

correct degrees of freedom

Answer 298

ascertain that residuals are random and normally distributed

Answer 299

minimise the sum of squares of the residuals

Answer 300

the thing that you're comparing the baseline to gets a 1

Answer 301

the thing that you're comparing the baseline to gets a 0.1

Answer 302

1 fewer dummy variables than number of categories

Answer 303

baseline condition gets a value of 0

Answer 304

baseline condition gets a value of 1.5

Answer 305

0.05/number of categories

Answer 306

0.05/number of comparisons

Answer 307

0.05/variance
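The Bonferroni-style options above divide the overall significance level by a count. As a minimal sketch, assuming the standard Bonferroni correction in which the per-comparison threshold is the overall alpha divided by the number of comparisons (the function name and the numbers are hypothetical):

```python
def bonferroni_alpha(alpha: float, n_comparisons: int) -> float:
    """Per-comparison significance threshold under a Bonferroni correction."""
    if n_comparisons < 1:
        raise ValueError("n_comparisons must be at least 1")
    return alpha / n_comparisons


# Hypothetical example: 5 comparisons at an overall alpha of 0.05.
per_test_alpha = bonferroni_alpha(0.05, 5)
print(per_test_alpha)
```

Each individual comparison would then need a p-value below the printed threshold to be declared significant, which keeps the overall type 1 error rate at or below alpha.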

Created by Jessica Whittick more than 7 years ago
