Which speech analytic engine is right for your application? - Dialogic - ContactCenterWorld.com Blog
Confession - last week while entering my dark hotel room, I found myself uttering ‘Alexa, lights on’ – to my disappointment the room remained dark....
Indeed real-time speech analytics and natural language processing are changing human behavior (at least it’s changing my behavior) and we seem to be at the forefront of this paradigm shift, but with so many options, which speech analytic engine best? A simple search would generate an abundance of varying opinionated blogs, how-to's and even some voice assistant battle videos (one of my favorites) but still no definitive unified answer.
Recently the Dialogic applications team looked to leverage real-time speech analytics and natural language processing with our video conferencing solution to create a ‘conferencing valet’. The idea was to integrate the speech analytics service as a passive listening participant and trigger actions based on what it heard – in our case it would trigger visual advertisements in the chat window. We needed a cloud service that could quickly and accurately translate the speech of the conference attendees into text then be able to extract specific intents from the speech for actions. This led us to evaluating several vendor offerings and while in the end we decided to integrate using IBM Watson - the short and anti-climactic answer to which real-time speech analytic engine is the best is…… depends.
Now let me explain before you close out this blog – the reason for the non-decisive answer is because each vendor has both strengths and weaknesses, which should be considered, based on the application use case. For example, sacrificing some accuracy for speed – in our ‘Conference Valet’ application, the attendees utterances would need to be analyzed in short quick bursts requiring a moderate level of accuracy in order to extract the intent. Let’s now flip it – sacrificing speed for accuracy with a ‘Doctor/Patient video consultation’ application where the transcripts are needed for compliance and accuracy is critical.
Beyond speed and accuracy, there are value add-on features - take for example, Mod9’s - cloud-based service called ‘ReMeeting’. They specialize in not only high levels of accuracy but also speaker separation and searchability - powerful features that can help innovate specific applications. Last but not least, the ability to train or tune the speech analytics engine 'out of the box' to better serve the specific application. For instance, a voicemail application with email transcriptions almost alway contain a call back telephone number which should be interpreted as an integer rather than words ('my number is 7169.....' vs 'my number is seven one six nine....')
In the end, the best speech analytic engine will *depend* on the *use case* so be sure to compare the strengths (and weaknesses) against your *application requirements* before making a decision.
//Vince - @vfpuglia
Publish Date: September 12, 2017 5:00 AM
2020 Buyers Guide Cloud Contact Center Solutions
Astute Agent gives your agents everything they need to work cases confidently and efficiently. This modern case management CRM is the preferred choice for Consumer Relations and Customer Care teams who support some of the world’s most prestigious brands.
Astute Agent balances agent efficiency with customer experience. Here’s how:
- Automated email responses
Using natural language processing, Astute Agent reads incoming customer emails and automatically supplies a response to agents to review and send.
- Auto-populated case fields
AI capabilities automatically suggest reason codes, product codes, and other case information, saving agents minutes per case.
- Time-saving case feat...
|9.)||Lieber & Associates|
Technology Acquisition Consulting
L&A provides vendor-independent counsel to select, contract for, design, test, and implement cloud and premise-based systems. The firm has experience with all major vendors and many smaller ones. Lieber & Associates' technology consultants are contact center systems specialists with several decades of experience each.
View more from Dialogic
Recent Blog Posts:
|Scaling in the Cloud – Avoid Flying Too Close to the Sun||December 17, 2019 5:00 AM|
|SD-WAN’s Relationship with UCaaS||December 12, 2019 5:00 AM|
|Hearing and Seeing the Difference in UC Platforms||November 7, 2019 5:00 AM|
|Microservices Architecture – What is it, and why should I care?||October 31, 2019 5:00 AM|
|Panning for “Killer Apps” in the Gold Rush of 5G||February 14, 2019 5:00 AM|
|The Dialogic BUZZ UC Platform Swiss Army Knife||October 24, 2018 5:00 AM|
|DialogicONE - IoT Solutions||October 22, 2018 5:00 AM|
|Dialogic PowerMedia MRF – A Solution You Can Depend On||September 25, 2018 5:00 AM|
|Enabling WebRTC with the Dialogic PowerVille Load Balancer||July 16, 2018 5:00 AM|
|Telecom Meets Digital: The Importance of Establishing Controls||May 24, 2018 5:00 AM|