Multimodal: Voice First Buzzwords Explained

by | Oct 16, 2019 | Buzzwords, Voice Technology

Multimodal: Buzzwords Explained

Welcome to the first instalment of the series Voice First Buzzwords Explained. First up: multimodal.

If you’re reading this soon after we hit publish, you’re right on time. Amazon’s recent event unveiled new and upgraded multimodal devices. But… so what?
 

WHAT DOES MULTIMODAL MEAN?

Multimodal describes a device that can be operated through two or more methods, such as voice and touch.

A REAL-LIFE EXAMPLE OF MULTIMODAL

The most popular example of a multimodal device is the Echo Show. Others include the Google Nest Hub and the Lenovo Smart Display. Most people think multimodal means a screen-based device, like the examples above. However, multimodal also includes our mobile-hosted assistants. Another term often used interchangeably with multimodal is voice-first: smart speakers, for instance, are built to have voice as their primary input method.
 
As Vixen Labs co-founder JP tweeted: “…we often skip over voice on mobile which has been around for way longer. Folks forget how GOOD (yes good!) Siri and Google Assistant have gotten.”

WHAT DOES MULTIMODAL MEAN IN PRACTICE?

A user interacts with a multimodal device by touching the screen, speaking to the assistant behind it, or both.
Feedback is then given visually or audibly, depending on the user input. Certain data suits particular interactions and responses better. For example, it’s easier for a user to understand what a product looks like by seeing it, rather than Alexa describing it. On the other hand, it’s much simpler for a user to issue a voice command than to touch-scroll through long menus with different options or type out answers into an extensive form.

WHAT ARE THE OPPORTUNITIES PRESENTED BY MULTIMODAL?

1. Usability

We learned to talk before we learned to type, but we learned to interact with technology by touching. Devices able to combine these inputs provide us with a much more holistic experience, which utilises more than one of our senses — and, crucially, connects them together.
 
 

2. Brand Identity

So much of a brand is its visual identity. A device able to show the Instagram-driven consumers of today exactly what content they’re consuming (or which products they can buy) helps to elevate this brand awareness.
 
 

HOW WILL YOUR ORGANISATION MAKE THE MOST OF MULTIMODAL?

The power of the multimodal experience hasn’t been missed by Amazon. As Bret Kinsella wrote for voicebot.ai, “Amazon’s biggest weakness today is its mobile strategy… there are few people using the Alexa app while on-the-go.”
This is precisely why the updated smart speaker and new wearable announcements were so exciting: when voice is accessible in different ways, “we can begin to do much more than help with hands-free tasks and accessibility. We can go beyond novelty into true utility.” (JP again.)

Start building your voice strategy today

As with the web, search and mobile, first movers get to learn early, build credibility and take market share when a new interface emerges. Voice assistants and conversational AI are no different. Now is the time to get started.
