The approach is based on self-supervised models which learn language based on raw speech. The STELA framework also allows the possibility to generate comparable developmental learning curve at the phonetic and lexical level.
More info on the project coming soon.
Layout on how learning simulations (like STELA) and infants compare.
More info on the project coming soon.
ProsAudit has been integrated to the Zero Resource Speech Challenge language modeling track.
EmphAssess focuses on the transfer of emphasis in Speech-to-Speech models.
From there stems another question: what is language similarity? Can models capture it automatically? And what kind of typology will be captured?
I presented a paper at Speech Prosody 2022 where we did a pilot study at capturing language typology using i-vectors. I am also looking at the effect of language similarity in modelling various speech-related cognitive processes (language discrimination and separation, language familiarity effect, language learning...)
ZeroSpeech 2021 is a challenge aimed at Spoken Language Modelling from raw speech. This task consists in learning language models directly from raw audio in an unknown language, without any annotation or text.
For more info, check out the website (the challenge is still open for new submissions!).