Date of Award:
5-2012
Document Type:
Thesis
Degree Name:
Master of Science (MS)
Department:
Computer Science
Committee Chair(s)
Daniel L. Bryce
Committee
Daniel L. Bryce
Committee
Curtis E. Dyreson
Committee
Stephen J. Allan
Abstract
The engineering of complete planning domain descriptions is often very costly because of human error or lack of domain knowledge. Learning complete domain descriptions is also very challenging because many features are irrelevant to achieving the goals and data may be scarce. Given incomplete knowledge of their actions, agents can ignore the incompleteness, plan around it, ask questions of a domain expert, or learn through trial and error.
Our agent Goalie learns about the preconditions and effects of its incompletely-specified actions by monitoring the environment state. In conjunction with the plan failure explanations generated by its planner DeFault, Goalie diagnoses past and future action failures. DeFault computes failure explanations for each action and state in the plan and counts the number of incomplete domain interpretations wherein failure will occur. The question-asking strategies employed by our extended Goalie agent using these conjunctive normal form-based plan failure explanations are goal-directed and attempt to approach always successful execution while asking the fewest questions possible. In sum, Goalie:
i) interleaves acting, planning, and question-asking;
ii) synthesizes plans that avoid execution failure due to ignorance of the domain model;
iii) uses these plans to identify relevant (goal-directed) questions;
iv) passively learns about the domain model during execution to improve later replanning attempts;
v) and employs various targeted (goal-directed) strategies to ask questions (actively learn).
Our planner DeFault is the first reason about a domain's incompleteness to avoid potential plan failure. We show that DeFault performs best by counting prime implicants (failure diagnoses) rather than propositional models. Further, we show that by reasoning about incompleteness in planning (as opposed to ignoring it), Goalie fails and replans less often, and executes fewer actions. Finally, we show that goal-directed knowledge acquisition - prioritizing questions based on plan failure diagnoses - leads to fewer questions, lower overall planning and replanning time, and higher success rates than approaches that naively ask many questions or learn by trial and error.
Checksum
a56c7231acc7c4f43eb8aa12eb6088ff
Recommended Citation
Weber, Christopher H., "Planning, Acting, and Learning in Incomplete Domains" (2012). All Graduate Theses and Dissertations, Spring 1920 to Summer 2023. 1168.
https://digitalcommons.usu.edu/etd/1168
Included in
Copyright for this work is retained by the student. If you have any questions regarding the inclusion of this work in the Digital Commons, please email us at .
Comments
This work made publicly available electronically on April 12, 2012.