Motivation of Utterance of Autonomous Mobile Robots
Yuichiro ANZAI
Department of Computer Science, Keio University
3-14-1, Hiyoshi, Kouhoku-ku, Yokohama 223 JAPAN
e-mail: anzai@aa.cs.keio.ac.jp
A speech dialogue is advantageous over other ways of communication
between humans and computers in that it needs no special training.
Much research has been done for speech dialogue systems, however,
those systems are generally not able to work in dynamic environments.
It is principally because they lack consideration of non-verbal
information, and also relation between verbal and non-verbal
information. Actually, human conversation contains, or is affected
by, much non-verbal information that is somehow shared by speakers and
hearers.
``Mental models'', often referred particularly in the field of
cognitive science, may include this aspect, that is, integration of
verbal and non-verbal information that represent the utterance
situation for participants in conversation.
We believe that most of speech dialogue systems for real-world application
must somehow include formulation and implementation of mental models in the
above sense. Thus our goal has been to model and implement a
dynamical construction process of mental models, using the domain
of human-robot speech dialogue communication.
Actually, last year we succeeded in formulating and implementing
a speech dialogue system that integrates
sensory information from robot's sensors as non-verbal information.
This system, called Linta-II, is able to construct
a mental model from the situation by using what we call
the attention mechanism and by fusing sensor data, and
use it to generate appropriate utterances.
One weakness of this system is that no ``motivation mechanism''
is implemented. Also, the system, though it is reactive
to changes of external environments, needs more capacity for
reactiveness to avoid danger.
Our new system is able to motivate utterances by itself, and
also can manage reactiveness, by incorporating a reactive
planning algorithm like Georgeff's PRF (Procedural
Reasoning System).
We suppose that scenes, in which a speech dialogue with a
autonomous mobile robot is embedded, is limited.
Dialogue plans are described as reactive plans that
possess an ability to respond immediately to
changes of linguistic context obtained from speech inputs and changes
of external environments obtained from sensory inputs.
When the situation matches more than one plan, the dialogue
manager mediates these plans, based on some heuristic strategies.
That is, when there is no plan being executed, the most urgent action or plan
for the situation is selected.
In the other cases the dialogue manager mediates matched plans in
plans being executed: if no plan matches to the situation, the action or
plan is selected in the aformentioned way.
It is assumed that plans has priority values with
the range between zero and one. The priority value of the
plan being executed is reset to one when the dialogue of
the plan is selected. When the plan called from another
plan is selected, the priority value of the plan calling the selected plan
is updated in some predetermined fashion.
We designed and implemented this speech dialogue system
that includes the reactive planning subsystem to
realize utterance motivation and more reactiveness.
To accomplish that, we first implemented a voice recognition
board and a task planner on the autonomous mobile robot
developed in our laboratory.
Then, the dialogue planner mentioned above was implemented,
which completed our system for speech dialogue with autonomous
mobile robots. The overall system integrates goal-oriented
dialogue planning and reactive dialogue planning. The
former is in a sense similar to those in speech dialogue
systems proposed in the conventional work on human-computer
interaction. The latter is concerned with external and contextual
changes. This integration made our speech dialogue system
enable to realize human dialogue with autonomous mobile robots, whose
``mental models'' are reconstructed according to
dynamical changes in external environments.
Keywords: dialogue planning, reactive planning, mental model, sensor fusion, autonomous mobile robot