Machine learning uncovers missing info about ethnicity in population health data: Study