Still suffers the same problem that all Voice Recognition seems to suffer; cannot reliably detect that the speaker has finished speaking.
This was almost worse though because it did feel like a rude person just interrupting instead of a dumb computer not being able to pick up normal social cues around when the person they're listening to has finished.
It's even hard to detect when humans stopped talking when talking to human while having high latency especially at the beginning of the call when you testing how big latency it is.
I think they need to implement the statistical bias where the longer a person talks, the less likely they are going to be stopping at any specific part of their speech. Sorta like the rising sun problem[0]
This was almost worse though because it did feel like a rude person just interrupting instead of a dumb computer not being able to pick up normal social cues around when the person they're listening to has finished.