Tutorial on Introduction to biostatistics

Statistical Notes

group missing and non missing and compare dependent variable to find out any significance difference between missing value group and non missing value group
Mean Substitution method - Missing values can be substituted with the mean value of the variable
Missing values can be estimated using regression method instead of just substituting with the mean value
Hot deck imputation – Missing values can be substituted with similar value
Missing values can be estimated based on full information maximum likelihood
Multiple imputation
List wise deletion – cases with missing values can be deleted if the sample is more
Variable deletion – variable with more missing values can be deleted

i. Box plot

ii. Histogram

iii. Q-Q plot

i. Kolmogrov simirnov tes

ii. Shapiro wilk test

iii. Anderson Darling Test

i. Log

ii. Square root

iii. Cube root

iv. Inverse

i. Squaring the variable

ii. Cubing the variable

Apart from the above deleting outliers will also help us to bring data into normality

Structural equation models contain unobserved variables called latent variables or factors and observed variables or indicators
Latent variables Influence observed variables
Structural model represents theoretical relationship among set of latent variables
Measurement model represents latent variables as a linear combination of observed variables
Need examine covariance among observed variables to get less number of latent variables
Multivariate normality assumed – skewness of the data to be checked

Need to check for component with Eigen values greater than 1
Need to check for component in Scree plot with value greater than 1
Need to check the loadings of component on each variable and include variable with high factor loadings for that component
KMO & bartlett’s spherecity test for checking whether correlation and covariance matrix is identity matrix i.e correlation and covariance is 0
Check for determinant of covariance matrix if it is low issue of multicolinearity exists (ex. <0.0001 or 0)