Everybody is special in how we use language – how we speak, the words we use, etc. In an earlier blog post, we saw how speech recognition systems eliminate this variation by training on speech and language data that cover many accents, age groups, or other variations in speaking style you might think of. This creates very robust systems that work well for (nearly) every speaker; we call this “speaker-independent” speech recognition.
But in some cases, the individuality of the speaker matters and can be leveraged to create even better experiences – like our latest Dragon Individual and Dragon Legal offerings ,that are typically used by one user. This allows us to go beyond speaker-independent speech recognition by adapting to each user in a speaker-dependent way. Dragon does this on several levels:
This latter point deserves more attention. Dragon uses Deep Neural Networks end-to-end both at the level of the language model — capturing the frequency of words and in which combinations they typically occur — and of the acoustic model, deciphering the smallest spoken units, or phonemes of a language.
These models are quite large and before they leave our labs, they have already been trained on lots and lots of data. One of the reasons why Neural Networks have taken off only now and not in the late 20th century when they were invented is that training is quite a computing intensive process. We use significant amounts of GPUs (Graphical Processing Unit) to train our models. GPUs were originally invented for computer graphic applications like video games. Computing images and training Deep Neural Networks have a lot in common as both tasks require the application of relatively simple calculations towards lots of data points at the same time, and this is what GPUs are good at. We use multiple GPUs in parallel in one training session to speed up the training process
But how do we apply this outside of our data centers? Adapting those Deep Neural Networks that make up the acoustic model to the speech coming from the user is similar to training them, and we want to make that happen on the user’s PC, Mac or laptop – and we want it to be fast. It is a demanding task as we need to make sure adaptation works with just a little data and computationally it is a very efficient process.
Packaging this process in a way that allows the individual to run it on their desktop or laptop is the culmination of many years of innovation in speech recognition and machine learning R&D. Enjoy the result of a highly accurate Dragon experience that is fully personalized to you and your voice.
Publish Date: August 16, 2016 5:00 AM
|1.)||Call Center Studio|
Call Center Studio
Call Center Studio is the world’s first call center built on Google and is one of the most secure and stable systems with some of the industry’s best reporting. It is one of the most full-featured enterprise grade systems (with the most calling features, one of the best call distribution, outbound dialing features and integrations—including IVR, AI Speech Recognition, blended inbound/outbound calling and includes Google’s new Dialogflow and Speech API. Call Center Studio is the absolute easiest to use (with a 10 minute setup), and is the price performance leader with lower equipment cost and less setup time.
|2.)||Teckinfo Solutions Pvt. Ltd.|
InterDialog UCCS inbound call center software caters to all incoming customer requests. These incoming requests can come through any channel of customer’s choice e.g. voice, video, email, chat, WhatsApp, facebook etc.. company page or from an integrated website chat. Using InterDailog UCCS call centers can respond to inquiries of the customers and they can also register the complaints of customer as a customer support desk.
Voiptime Contact Center
Our contact center solution allows processing the high volume of client requests from different channels (voice, webchat, email, web callbacks), running massive outbound dialing campaigns, and makes all call center operations visible for management. Voiptime Cloud Contact Center is a professional calling solution for outbound and inbound calls. It’s a plug-and-play software that immediately increases the productivity of your call center department. With the help of our solution you are able to:
- Automate lead prospecting and have 4x more live conversations daily;
- Increase the agent occupancy up to 80-90% with the help of the fastest Predictive dialer;
- Smooth out the peaks of calls by...
|Customer Success: 6 steps to make your customer touchpoints count||September 12, 2019 5:00 AM|
|How Apple Business Chat can help enterprises with their call deflection goals||September 10, 2019 5:00 AM|
|Hear the Call for Help! How Siren Detection Can Make Your Car a Force for Good.||September 9, 2019 5:00 AM|
|Nuance IQ is back for the summer||August 26, 2019 5:00 AM|
|Meet your patients where they are with 24/7/365 answers to their portal questions||August 21, 2019 5:00 AM|
|Contact center transformation pitfalls (and how to avoid them)||August 8, 2019 5:00 AM|
|Robots here, there and everywhere, and now in the office||February 14, 2019 5:00 AM|
|3 ways speech recognition helps in police incident reporting||January 22, 2019 5:00 AM|
|Building trust with a virtual assistant voice||January 22, 2019 5:00 AM|
|Television viewing is no longer a one-way street||November 15, 2018 5:00 AM|