
Create a text dataset from voice recordings

November 17 / 2 min read
  • Anna Gross, Business Developer

How to use Microsoft’s speech-to-text API to create a dataset, and then train a model with it on the Peltarion platform.

Data scientists are always on the lookout for new datasets, but despite the vast amount of data generated and processed every day, suitable datasets can be surprisingly hard to find. Creating your own dataset is therefore often the most time-efficient way to start exploring a project idea quickly.

Here’s one way to create a text dataset using Microsoft’s speech-to-text API and then use it to train a model on the Peltarion platform:

  1. Follow this link to find Microsoft’s instructions for their speech-to-text APIs. You can choose to work with recorded audio files or by talking into your microphone.
  2. Create your dataset by recording people speaking and attaching a label to each recorded snippet. 
  3. Turn it into a text dataset using the speech-to-text API according to Microsoft’s instructions above.
  4. Follow one of our tutorials to train a text classification model with a BERT natural language processing model. Depending on which languages your recordings are in, use English BERT (our tutorial shows you how to build a model that detects the sentiment of reviews) or multilingual BERT (our tutorial shows you how to predict the genre of a book from an extract).
  5. Deploy the model using our one-click REST API to start using it. 
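
Steps 2 and 3 above can be sketched in a few lines of Python. This is a minimal illustration rather than Microsoft's or Peltarion's official workflow: it assumes the Azure Speech SDK for Python (the `azure-cognitiveservices-speech` package), short WAV recordings with one utterance each, and a CSV with `text` and `label` columns as the output layout. The function names, file names, key, and region are placeholders chosen for this example.

```python
import csv
from typing import Callable, Iterable, Tuple


def transcribe_with_azure(wav_path: str, key: str, region: str,
                          language: str = "en-US") -> str:
    """Transcribe one short WAV file; returns "" if no speech was recognized.

    Assumes the Azure Speech SDK is installed and that `key` and `region`
    belong to your own Speech resource.
    """
    # Imported lazily so the rest of this sketch runs without the SDK.
    import azure.cognitiveservices.speech as speechsdk

    speech_config = speechsdk.SpeechConfig(subscription=key, region=region)
    speech_config.speech_recognition_language = language
    audio_config = speechsdk.audio.AudioConfig(filename=wav_path)
    recognizer = speechsdk.SpeechRecognizer(
        speech_config=speech_config, audio_config=audio_config
    )
    # recognize_once() handles a single utterance, so it suits short snippets.
    result = recognizer.recognize_once()
    if result.reason == speechsdk.ResultReason.RecognizedSpeech:
        return result.text
    return ""


def build_text_dataset(recordings: Iterable[Tuple[str, str]],
                       transcribe: Callable[[str], str],
                       out_path: str) -> int:
    """Write one (text, label) row per recording and return the row count.

    `recordings` holds (wav_path, label) pairs; `transcribe` is any function
    that maps a file path to text. Empty transcripts are skipped.
    """
    rows = 0
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["text", "label"])  # assumed column layout
        for wav_path, label in recordings:
            text = transcribe(wav_path).strip()
            if not text:
                continue
            writer.writerow([text, label])
            rows += 1
    return rows
```

To use it, you could pass a small wrapper such as `lambda p: transcribe_with_azure(p, "YOUR_KEY", "YOUR_REGION")` as the `transcribe` argument, together with your own list of labeled recordings. Keeping the transcription function separate from the CSV writing makes it easy to swap in a different speech-to-text service or to test the pipeline without network access.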

For more about text-based AI models or multilingual BERT, check out our intro series on the topic.

    Anna Gross works with business development at Peltarion, aiming to make deep learning accessible to people from a wide range of industries. Before joining Peltarion, she set up the startup non-profit Project Access. She holds a bachelor’s degree in History from the University of Oxford and has also spent two years at Peking University in China studying Mandarin.
