Data Issues
Contents
- 1 Introduction
- 2 Cleaning of time use diary data
- 3 Report on Adapted PPVT-III and 'Who Am I?'
- 4 Imputations to solve missing data problems in Wave 2.5
- 5 Review of main educational program of 4-5 year olds
- 6 Cleaning of income data
- 7 Height differences
- 8 Data issues in Wave 3.5
- 9 Data issues in Wave 4
- 10 Data issues in Wave 5
- 11 Smoking inside the household
- 12 Missing data for Wave 6 items
- 13 Issues with breadwinner questions
- 14 Date of birth corrections
- 15 Minor changes for weight, BMI & and height percentiles and z-scores
- 16 Body fat percentage data corrections
- 17 Wave 4 salary and wages
- 18 Study children allergies (issues with Wave 6 and 7 data)
- 19 After school care issue Wave 7 B cohort
- 20 Who is mother/father issue
- 21 Repeated a year level issue
- 22 Executive functioning - CogState - missing data Wave 7
- 23 Expected/received child support per child
- 24 Reason for change in education institution - SC CAI 6.5 (pc44c3b1):
- 25 Child support - parent living elsewhere PLE 20.8 (pe21p5)
- 26 Informant indicator in LSAC variable naming convention: Approach in Wave 7 and subsequent Waves
- 27 Desired occupation sequencing issue
- 28 Inconsistent placement of SC question
- 29 Difference in health status of household members across waves of LSAC
- 30 Academic Rating Scale score in Wave 7
- 31 Gambling data inconsistencies
- 32 References
- Appendix A: Item-person map
- Appendix B: Principal component analysis
15 Minor changes for weight, BMI & and height percentiles and z-scores
As part of the investigation into the date of birth corrections (outlined in section 14), some inconsistency was discovered in the method for calculating 'SC age in months' across the waves:
- Wave 1 and Wave 2 were calculated using the integer method
- Wave 3 and Wave 4 were calculated using a custom method that correctly derived the number of completed months to determine age in months
- Wave 5 and Wave 6 were calculated using the rounding method.
The calculation method for Waves 1, 2, 5 and 6 has now been changed to match the custom method used for Waves 3 and 4. This method was chosen for all waves as it correctly derives the number of 'true' completed months (using the date of interview as a reference point for age calculation) as opposed to the integer method that simply divides the number of days since the interview by 30.44.
As result of this method change, 'age in months for SC' was recalculated for Waves 1, 2, 5 and 6. A large number of records were altered by +1 or -1 for age in months in these waves. However, only very minor changes were observed, when before and after distributions were compared.