What ML model/algorithms/techniques do you use?
Updated this week
Our speech-to-text model is based primarily on Citrinet from NVIDIA's NeMo toolkit; however, we have modified and improved it to fit our use case.