I think Tesla actually used radar data to provide ground truth for this. So they don't even do that to generate ground truth. What happens during labeling if things have the same relative speed? Or in low-light conditions? How do you account for per-part variation in lens and sensor designs, and for how it messes with your predictions?