In deep learning and machine learning, having a large enough dataset is key to training a system and getting it to produce results.
So what does an ML researcher do when there just isn’t enough publicly accessible data?
Enter the MLCommons Association, a global engineering consortium with the aim of making ML better for everyone.
To help advance ML research, MLCommons recently announced the general availability of two datasets: the People’s Speech Dataset, a 30,000-hour English-language conversational speech dataset, and the Multilingual Spoken Words Corpus, an audio speech dataset with over 340,000 keywords in 50 languages.
On this episode of NVIDIA’s AI Podcast, host Noah Kravitz spoke with David Kanter, founder and executive director of MLCommons, and David Galvez, senior AI developer technology engineer at NVIDIA, about democratizing access to speech technology and how MLCommons is helping advance machine learning research and development for everyone.
You Might Also Like
Take Note: Otter.ai CEO Sam Liang on Bringing Live Captions to a Meeting Near You
Remote work has made us more reliant on virtual conferencing platforms, including Zoom, Skype and Microsoft Teams. Sam Liang, CEO of Otter.ai, explains how his company enhances the virtual meeting experience for all users.
Lilt CEO Spence Green Talks Removing Language Barriers in Business
When large organizations require translation services, there’s no room for the amusing errors often produced by automated apps. Lilt CEO Spence Green aims to correct that using a human-in-the-loop process to achieve fast, accurate and affordable translation.
How Audio Analytic Is Teaching Machines to Listen
From active noise cancellation to digital assistants that are always listening for your commands, audio is perhaps one of the most important but often overlooked aspects of modern technology in our daily lives. Dr. Chris Mitchell, CEO and founder of Audio Analytic, discusses the challenges, and the fun, involved in teaching machines to listen.
Subscribe to the AI Podcast: Now available on Amazon Music
You can now listen to the AI Podcast through Amazon Music.
You can also get the AI Podcast through iTunes, Google Podcasts, Google Play, Castbox, DoggCatcher, Overcast, PlayerFM, Pocket Casts, Podbay, PodBean, PodCruncher, PodKicker, Soundcloud, Spotify, Stitcher and TuneIn.
The post MLCommons’ David Kanter, NVIDIA’s David Galvez on Improving AI with Publicly Accessible Datasets appeared first on NVIDIA Blog.