POMDP Coder

Prompts used by the reproduced pomdp_coder pipeline and runtime.

Offline learning
Induces a reusable symbolic task interface before any model program is proposed.
Converts one annotated episode into symbolic transition records aligned with the induced task interface.
Learns the initial-state sampler.
Learns the generative transition model.
Learns the symbolic observation model.
Learns the reward model.
Repairs the initial-state sampler.
Repairs the generative transition model.
Repairs the symbolic observation model.
Repairs the reward model.
Online planning
Translates a natural-language instruction into a machine-checkable goal DSL used by the runtime planner.
Asks a multimodal LLM to judge which candidate observable atoms are visually supported at the current step.
This folder combines offline learning prompts and online planning/runtime prompts for pomdp_coder.