I kept looking for a real-life comparison for voice control and the embarrassment of a failed attempt.
For instance, if you need to throw an empty bottle into a bin, you can walk over and simply drop it in, or you can throw it from a distance to save a few steps and maybe look cool and accurate. When nobody is around, it’s okay to miss once, but if someone is watching, you feel the difference between “I hit the bin on my first try” and “I missed, and the bottle loudly bounced aside”. It’s the same with voice control. Everything is fine when your phone understands you the first time, or when no one is around. As soon as there are witnesses, failing on the first try feels like having to pick up the bottle, walk back, and throw it again. It’s easier to just walk up and drop it in properly. And on the phone, once the first attempt has failed, it’s simpler to just press the buttons.
