Description: find labelled datasets (phonetically transcribed speech; segmented or not), unify phonetic transcription, (segment speech into phones),
check/train/improve existing models, implement retraining for expanded IPA symbol sets, data collection and analysis of historical and modern population-scale speech samples,