DexForce · skywhite1024 · Jun 13, 2026 · Jun 14, 2026 · Jun 14, 2026 · Jun 14, 2026
diff --git a/MANIFEST.in b/MANIFEST.in
@@ -1,2 +1,3 @@
 include VERSION
 recursive-include configs/ *
+recursive-include embodichain/gen_sim/action_agent_pipeline/generation/templates *.json
diff --git a/docs/source/api_reference/embodichain/embodichain.agents.rst b/docs/source/api_reference/embodichain/embodichain.agents.rst
diff --git a/docs/source/api_reference/embodichain/embodichain.lab.sim.rst b/docs/source/api_reference/embodichain/embodichain.lab.sim.rst
@@ -78,14 +78,6 @@ Shapes
    :show-inheritance:
    :exclude-members: __init__, copy, replace, to_dict, validate
 
-Atomic Actions
---------------
-
-.. automodule:: embodichain.lab.sim.atom_actions
-   :members:
-   :undoc-members:
-   :show-inheritance:
-
 Objects
 -------
 
@@ -133,6 +125,7 @@ Atomic Actions
    :maxdepth: 1
 
    embodichain.lab.sim.atomic_actions
+
 Shared Types
 ------------
 
@@ -147,4 +140,4 @@ Utility
 .. toctree::
    :maxdepth: 1
 
-   embodichain.lab.sim.utility
+   embodichain.lab.sim.utility
diff --git a/docs/source/api_reference/index.rst b/docs/source/api_reference/index.rst
@@ -22,7 +22,6 @@ The following modules are available in the core ``embodichain`` framework:
 .. autosummary::
    :toctree: embodichain
 
-   agents
    data
    data_pipeline
    lab

diff --git a/docs/source/features/generative_sim/agents.md b/docs/source/features/generative_sim/agents.md
@@ -1,175 +1,68 @@
-# EmbodiAgent（aborted）
+# Action Agent Pipeline
 
-EmbodiAgent is a hierarchical multi-agent system that enables robots to perform complex manipulation tasks through closed-loop planning, code generation, and validation. The system combines vision-language models (VLMs) and large language models (LLMs) to translate high-level goals into executable robot actions.
+The action-agent pipeline is the supported agent workflow for generated tabletop
+manipulation tasks. It converts an image or an existing generated gym project
+into a task-specific simulation config, asks the task model for a JSON task
+graph, compiles that graph into atomic-action specs, and executes it through the
+`AtomicActionsAgent-v3` environment.
 
-## Quick Start
+The legacy Python-code generation agent stack has been removed. New demos and
+task generation should use the modules under
+`embodichain.gen_sim.action_agent_pipeline`.
 
-### Prerequisites
-Ensure you have access to Azure OpenAI or a compatible LLM endpoint.
+## End-to-end Pipeline
 
-```bash
-# Set environment variables
-export AZURE_OPENAI_ENDPOINT="https://your-endpoint.openai.azure.com/"
-export AZURE_OPENAI_API_KEY="your-api-key"
-```
-
-### Using Different LLM/VLM APIs
+Run image-to-scene, config generation, and agent execution in one command:
 
-The system uses LangChain's `AzureChatOpenAI` by default. To use different LLM/VLM providers, you can modify the `create_llm` function in `embodichain/agents/hierarchy/llm.py`.
-
-#### Azure OpenAI
 ```bash
-export AZURE_OPENAI_ENDPOINT="https://your-endpoint.openai.azure.com/"
-export AZURE_OPENAI_API_KEY="your-api-key"
-export OPENAI_API_VERSION="2024-10-21"  # Optional, defaults to "2024-10-21"
+python -m embodichain.gen_sim.action_agent_pipeline.cli.run_agent_pipeline \
+    --use-image2scene \
+    --server "http://127.0.0.1:4523" \
+    --image-name "demo1" \
+    --task_description "Pick up the target object and place it in the basket." \
+    --config-output-dir "gym_project/action_agent_pipeline/configs/demo1_text" \
+    --task_name "Demo1_Text" \
+    --target_body_scale 0.8 \
+    --regenerate
 ```
 
-#### OpenAI
-To use OpenAI directly instead of Azure, modify `llm.py`:
-```python
-from langchain_openai import ChatOpenAI
+## Generate Config Only
 
-def create_llm(*, temperature=0.0, model="gpt-4o"):
-    return ChatOpenAI(
-        temperature=temperature,
-        model=model,
-        api_key=os.getenv("OPENAI_API_KEY"),
-    )
-```
+Use an existing gym project to generate the task config and agent config:
 
-Then set:
 ```bash
-export OPENAI_API_KEY="your-api-key"
-```
-
-#### Other Providers
-You can use other LangChain-compatible providers by modifying the `create_llm` function, for example:
-
-**Anthropic Claude:**
-```python
-from langchain_anthropic import ChatAnthropic
-
-def create_llm(*, temperature=0.0, model="claude-3-opus-20240229"):
-    return ChatAnthropic(
-        temperature=temperature,
-        model=model,
-        anthropic_api_key=os.getenv("ANTHROPIC_API_KEY"),
-    )
+python -m embodichain.gen_sim.action_agent_pipeline.cli.generate_action_agent_config \
+    --gym_project "gym_project/environment/image2tabletop/downloads/example_gym_project" \
+    --output_dir "gym_project/action_agent_pipeline/configs/demo_text" \
+    --task_name "Demo_Text" \
+    --task_description "Pick up the target object and place it in the basket." \
+    --target_body_scale 0.8 \
+    --overwrite
 ```
 
-**Google Gemini:**
-```python
-from langchain_google_genai import ChatGoogleGenerativeAI
+## Run Generated Config
 
-def create_llm(*, temperature=0.0, model="gemini-pro"):
-    return ChatGoogleGenerativeAI(
-        temperature=temperature,
-        model=model,
-        google_api_key=os.getenv("GOOGLE_API_KEY"),
-    )
-```
-
-### Run the System
-
-Run the agent system with the following command:
+Run a previously generated config with the action-agent environment:
 
 ```bash
-python embodichain/lab/scripts/run_agent.py \
-    --task_name YourTask \
-    --gym_config configs/gym/your_task/gym_config.yaml \
-    --agent_config configs/gym/agent/your_agent/agent_config.json \
-    --regenerate False
+python -m embodichain.gen_sim.action_agent_pipeline.cli.run_agent \
+    --task_name "Demo_Text" \
+    --gym_config "gym_project/action_agent_pipeline/configs/demo_text/fast_gym_config.json" \
+    --agent_config "gym_project/action_agent_pipeline/configs/demo_text/agent_config.json" \
+    --regenerate
 ```
 
-**Parameters:**
-- `--task_name`: Name identifier for the task
-- `--gym_config`: Path to the gym environment configuration file (``.json``, ``.yaml``, or ``.yml``)
-- `--agent_config`: Path to the agent configuration file (defines prompts and agent behavior)
-- `--regenerate`: If `True`, forces regeneration of plans/code even if cached
-
-## System Architecture
-
-The system operates on a closed-loop control cycle:
-
-- **Observe**: The `TaskAgent` perceives the environment via multi-view camera inputs.
-- **Plan**: It decomposes the goal into natural language steps.
-- **Code**: The `CodeAgent` translates steps into executable Python code using atomic actions.
-- **Execute**: The code runs in the environment; runtime errors are caught immediately.
-- **Validate**: The `ValidationAgent` analyzes the result images, selects the best camera angle, and judges success.
-- **Refine**: If validation fails, feedback is sent back to the agents to regenerate the plan or code.
-
----
-
-## Core Components
-
-### TaskAgent
-*Located in:* `embodichain/agents/hierarchy/task_agent.py`
-
-Responsible for high-level reasoning. It parses visual observations and outputs a structured plan.
-
-* For every step, it generates a specific condition (e.g., "The cup must be held by the gripper") which is used later by the ValidationAgent.
-* Prompt Strategies:
-    * `one_stage_prompt`: Direct VLM-to-Plan generation.
-    * `two_stage_prompt`: Separates visual analysis from planning logic.
-
-### CodeAgent
-*Located in:* `embodichain/agents/hierarchy/code_agent.py`
-
-Translates natural language plans into executable Python code using atomic actions from the action bank.
-
-* Generates Python code that follows strict coding guidelines (no loops, only provided APIs)
-* Executes code in a sandboxed environment with immediate error detection
-* Uses Abstract Syntax Tree (AST) parsing to ensure code safety and correctness
-* Supports few-shot learning through code examples in the configuration
-
-
-### ValidationAgent
-*Located in:* `embodichain/agents/hierarchy/validation_agent.py`
-
-Closes the loop by verifying if the robot actually achieved what it planned.
-
-* Uses a specialized LLM call (`select_best_view_dir`) to analyze images from all cameras and pick the single best angle that proves the action's outcome, ignoring irrelevant views.
-* If an error occurs (runtime or logic), it generates a detailed explanation which is fed back to the `TaskAgent` or `CodeAgent` for the next attempt.
-
----
-
-## Configuration Guide
-
-The `Agent` configuration block controls the context provided to the LLMs. Prompt files are resolved in the following order:
-
-1. **Config directory**: Task-specific prompt files in the same directory as the agent configuration file (e.g., `configs/gym/agent/pour_water_agent/`)
-2. **Default prompts directory**: Reusable prompt templates in `embodichain/agents/prompts/`
-
-| Parameter | Description | Typical Use |
-| :--- | :--- | :--- |
-| `task_prompt` | Task-specific goal description | "Pour water from the red cup to the blue cup." |
-| `basic_background` | Physical rules & constraints | World coordinate system definitions, safety rules. |
-| `atom_actions` | API Documentation | List of available functions (e.g., `drive(action='pick', ...)`). |
-| `code_prompt` | Coding guidelines | "Use provided APIs only. Do not use loops." |
-| `code_example` | Few-shot examples | Previous successful code snippets to guide style. |
-
----
-
-## File Structure
-
-```text
-embodichain/agents/
-├── hierarchy/
-│   ├── agent_base.py          # Abstract base handling prompts & images
-│   ├── task_agent.py          # Plan generation logic
-│   ├── code_agent.py          # Code generation & AST execution engine
-│   ├── validation_agent.py    # Visual analysis & view selection
-│   └── llm.py                 # LLM configuration and instances
-├── mllm/
-│   └── prompt/                # Prompt templates (LangChain)
-└── prompts/                   # Agent prompt templates
-```
+## Runtime Shape
 
----
+- `TaskAgent` produces a deterministic JSON graph.
+- `CompileAgent` caches and validates the graph artifact.
+- `AgenticGenSimEnv` registers `AtomicActionsAgent-v3` and exposes
+  `create_demo_action_list()`.
+- Runtime graph execution calls atomic actions from
+  `embodichain.gen_sim.action_agent_pipeline.runtime`.
 
 ## See Also
 
-- [Online Data Streaming](../online_data.md) — Streaming live simulation data for training
-- [RL Architecture](../../overview/rl/index.rst) — RL training pipeline and algorithms
-- [Atomic Actions Tutorial](../../tutorial/atomic_actions.rst) — Action primitives used by the CodeAgent
+- [SimReady Asset Pipeline](simready_pipeline.md) — Generating simulation-ready assets
+- [Atomic Actions Tutorial](../../tutorial/atomic_actions.rst) — Atomic action primitives
 - [Supported Tasks](../../resources/task/index.rst) — Available task environments
diff --git a/docs/source/features/generative_sim/index.rst b/docs/source/features/generative_sim/index.rst
@@ -6,4 +6,5 @@ Generative Simulation collects EmbodiChain features for generating simulation-re
 .. toctree::
    :maxdepth: 2
 
+   Action Agent Pipeline <agents.md>
    SimReady Asset Pipeline <simready_pipeline.md>
diff --git a/embodichain/agents/__init__.py b/embodichain/agents/__init__.py
@@ -14,7 +14,6 @@
 # limitations under the License.
 # ----------------------------------------------------------------------------
 
-from . import hierarchy
-from . import mllm
+from __future__ import annotations
 
-__all__ = ["hierarchy", "mllm"]
+__all__: list[str] = []
-Original file line number
+Diff line change
@@ Expand Up @@
     .. autosummary::
        :toctree: embodichain
-       agents
        data
        data_pipeline
        lab
@@ Expand Down @@