Data Issues
Contents
- 1 Introduction
- 2 Cleaning of time use diary data
- 3 Report on Adapted PPVT-III and 'Who Am I?'
- 4 Imputations to solve missing data problems in Wave 2.5
- 5 Review of main educational program of 4-5 year olds
- 6 Cleaning of income data
- 7 Height differences
- 8 Data issues in Wave 3.5
- 9 Data issues in Wave 4
- 10 Data issues in Wave 5
- 11 Smoking inside the household
- 12 Missing data for Wave 6 items
- 13 Issues with breadwinner questions
- 14 Date of birth corrections
- 15 Minor changes for weight, BMI & and height percentiles and z-scores
- 16 Body fat percentage data corrections
- 17 Wave 4 salary and wages
- 18 Study children allergies (issues with Wave 6 and 7 data)
- 19 After school care issue Wave 7 B cohort
- 20 Who is mother/father issue
- 21 Repeated a year level issue
- 22 Executive functioning - CogState - missing data Wave 7
- 23 Expected/received child support per child
- 24 Reason for change in education institution - SC CAI 6.5 (pc44c3b1):
- 25 Child support - parent living elsewhere PLE 20.8 (pe21p5)
- 26 Informant indicator in LSAC variable naming convention: Approach in Wave 7 and subsequent Waves
- 27 Desired occupation sequencing issue
- 28 Inconsistent placement of SC question
- 29 Difference in health status of household members across waves of LSAC
- 30 Academic Rating Scale score in Wave 7
- 31 Gambling data inconsistencies
- 32 References
- Appendix A: Item-person map
- Appendix B: Principal component analysis
21 Repeated a year level issue
There was an instrument issue in the B cohort Education module for Wave 7. An error in the sequencing specifications meant that the records that had a difference of one year in their 2014 and 2016 grade levels were not asked the questions regarding any repeats of grade/level since last interview. These participants with a difference of only one year in their 2014 and 2016 grade levels were also the most likely to respond in the positive to the repeat year questions.
As a result, the data for which grade/year level was repeated (gpc47a2) and what was the main reason, or other reason, for repeating the grade/level (gpc47a3a, gpc47a3b) have been dropped from the Wave 7 data file. The data for whether a grade has been repeated or not (gpc47a6) has been derived for B cohort by using the grade indicated by the study child in Wave 6 and Wave 7.