Activity focus
- primary vs secondary data
- missing value vs zero
- unit consistency
- real-world understanding of data
Practice set
- Mark each as
primaryorsecondary:- data from a field survey done by your team
- published state health report
- Explain whether the following is valid:
- heights stored as
155 cm,1.62 m,149 cm
- heights stored as
- In a marks sheet, one student has no value entered. Another has
0.- are these the same?
- explain why
- Suggest one better variable name than
xfor attendance.
Answer key
-
- field survey: primary
- health report: secondary
- Not valid without standardization; units are mixed.
- No.
- blank = missing data
0= recorded score/value is zero
- Example:
attendance_percent
Common mistakes
- assuming any published number is automatically reliable
- treating coding convenience as statistical correctness
- ignoring missing values in analysis
