Spoken Language Recognition on Open-Source Datasets

Follow the full discussion on Reddit.
This was my capstone project at SMU. While researching this topic, we found that most papers in spoken language recognition used private or expensive datasets, limiting reproducibility. We also found that many papers made performance claims without comparing to other recent research. We attempted to address these problems in this capstone project. Using public sources such as Mozilla's Common Voice, we constructed an eight-language dataset and compared three neural network architectures published in the last few years.

Comments

There's unfortunately not much to read here yet...

Discover the Best of Machine Learning.

Ever having issues keeping up with everything that's going on in Machine Learning? That's where we help. We're sending out a weekly digest, highlighting the Best of Machine Learning.

Join over 900 Machine Learning Engineers receiving our weekly digest.

Best of Machine LearningBest of Machine Learning

Discover the best guides, books, papers and news in Machine Learning, once per week.

Twitter