The user needs to pick up the receiver and speak naturally to issue commands such as booking a cab, ordering food, scheduling ...