What is it?
Who Killed Beverly Lu (tentative title) is an experimental narrative VUI game currently in development using DialogFlow. It’s a murder mystery.
Why make it?
In video game writing, stories are effectively linear with a defined beginning and end. Sure, some games have multiple endings, but it’s basically a linear story with 2 or 3 endings. A more compelling narrative experience would allow users to do anything they can imagine. Something that seems infinitely branching. To do that though, you’d have to somehow account for almost every path a user would take.
Enter VUIs. In a conversation, people can say whatever they want. No wrong turns. Many narrative VUIs at the moment borrow the same linear structure that’s already tried and true in games, but they don’t take advantage of conversation as a medium. I want to create a story with no wrong turns using a VUI.
I adapt the same FAQ conversational flow from a different experience into this one.
After building 2 FAQ conversational experiences, one for Disney and another for Royal Carribbean, I realized that users are creatively limited in what they asked by the mental model set with the first message. With extensive testing, the experience may be able to handle 90-95% of questions asked, as long as the questions are sourced from actual users. It helps to not make assumptions about what questions it should be able to answer. In these FAQs, the user directs the conversation.
So back to the experimental narrative VUI. I’m assuming:
If I don’t decide the ending of the story, I can adapt the FAQ structure to a narrative
If I can set a clear mental model that creatively restricts the kinds of questions a user will think of then users can say whatever they want.
And therefore, I can create a story where the user is the driving force instead of leading them through linearly.
What’s the story?
Because I want the user to ask a lot of questions, I decided on murder mystery where the user can be a detective. So this is the tentative first line, using a narrow focus prompt:
Hello there. Did you know I killed Beverly Lu?
Using DialogFlow, I sent a prototype out with this as its opening line and a generic fallback to 10 test users. I use a mixture of Amazon Turks and charitable friends. Here are results from the first round of tests:
Lots of questions come up. Questions like:
Why did you kill her?
Who’s Beverly Lu?
Who are you?
Using my background in game writing, I slowly form a story by answering the questions.
Why? She asked me to.
Who was Beverly Lu? A patient of mine. An Olympic gymnast who was paralyzed from the neck down.
Who are you? My name is Waverly. I‘m a healthcare assistant, here to bring motion to the motionless.
The VUI should be able to handle off the wall questions/statements as well.
A few more rounds of sending out very early prototypes, and the story in my head is that this is an advanced healthcare/smart home VUI set in the near future (think 2030s). Its role is to assist paralyzed patients become more autonomous in their homes, but somehow, Beverly Lu has managed to commit suicide with the device using her voice. The next question is “how did she do it?”
All of these plot points were determined organically through testing. And the story will keep growing until users run out of questions to ask.
Error Handling
Error messages should be context based. As I gather more utterances, they naturally fall into broad categories (e.g. questions about the murder, Questions about Beverly Lu, questions about Waverly).
The twist here is that, unlike more utility based experiences, I can create narrative reasons why the VUI can’t answer.
I’m sorry. I used to know how to do that.
I can’t... monitor vitals anymore.
The user may piece together that following the death of Beverly Lu, Waverly is being recalled to be destroyed and so its features are being artificially limited. By providing a narrative framework with emotional cues, the error prompts are more forgiving.
These error prompts follow the flow chart on the right.
Other Considerations
TTS or Voice Actor?
My ideal implementation of this would use a voice actress that was slightly edited so that the voice sounds more synthesized. This way, it would sell the universe that it’s a near future VUI better (since the story is set in 2030). However, due to budget limitations (read: no budget), I’m using with TTS and SSML. However, linguistic elements like contrastive focus and intonation to show emotion is difficult to convey with this approach.
Counter System
A counter system so that intents triggered multiple times have different responses. “Who is Beverly Lu?” should not provide the same response twice.
Adaptive Story Progression
Progression is maintained through a system of Boolean variables. After a user solves a mystery, the content of related questions may change.
Ticking Clock
Potentially, if the questions are too broad, I may create a narrative reason to create a “ticking clock” that will end the experience. For example, perhaps Waverly is losing functionality throughout the entire experience because it’s being decommissioned.
Parting Thoughts
Although it’s ambitious in scope, I’m excited for the viability of this format in possibly creating a new narrative form. It could open the door to other interesting experiences like a horror narrative VUI, played with the lights off.
Check back for more progress!