Google Jules vs Codex CLI: Async Fire-and-Forget vs Interactive Terminal Agent

With Google I/O 2026 kicking off today and the announcement of Project Jitro (Jules V2) alongside OpenAI’s GPT-5.2-Codex release, the two dominant paradigms in agentic coding are now sharply defined ¹ ². Google Jules treats coding tasks as fire-and-forget background jobs that produce pull requests. Codex CLI treats them as interactive terminal sessions where the developer stays in the loop. Both approaches work; neither is universally better. This article breaks down when each wins and how to use them together.

Architectural Divergence

The fundamental difference is execution surface. Jules runs every task in an isolated Google Cloud VM that clones your repository, executes in a sandboxed environment, and destroys itself after opening a pull request ³. Codex CLI runs locally in your terminal inside a platform-native sandbox (Seatbelt on macOS, Bubblewrap/Landlock on Linux) with direct access to your working tree ⁴.

```mermaid
graph LR
    subgraph "Google Jules"
        A[GitHub Issue / Prompt] --> B[Cloud VM Provisioned]
        B --> C[Repo Cloned]
        C --> D[Gemini Reasons + Plans]
        D --> E[Agent Executes]
        E --> F[PR Opened]
        F --> G[VM Destroyed]
    end

    subgraph "Codex CLI"
        H[Developer Prompt] --> I[Local Sandbox]
        I --> J[Model Reasons]
        J --> K[Inline Approval]
        K --> L[Agent Executes]
        L --> M[Changes in Working Tree]
        M --> N[Developer Reviews Locally]
    end
```

This distinction cascades into every design decision:

| Dimension | Google Jules | Codex CLI |
| --- | --- | --- |
| Execution | Remote Cloud VM | Local sandbox |
| Interaction | Asynchronous; review PR later | Synchronous; approve inline |
| Output | Pull request on new branch | Changes in working tree |
| Context | Full repo clone per task | Live working directory |
| Parallelism | Multiple VMs concurrently | One session per terminal ⁵ |
| Network | Full internet in VM | Restricted by sandbox policy ⁴ |

Models and Reasoning

Jules uses Gemini 3.1 Pro on paid tiers and Gemini 3 Flash on the free tier ⁶. With the I/O 2026 announcement of Gemini 4’s 10-million-token context window, Jules gains the ability to ingest entire large codebases in a single pass, though reliable performance reportedly sits at 5-7 million tokens ¹.

Codex CLI defaults to GPT-5.5 for complex tasks, with GPT-5.4, GPT-5.3-Codex, and GPT-5.3-Codex-Spark as alternatives ⁷. GPT-5.2-Codex, released today, brings improved long-horizon session stability, stronger context compaction, and enhanced cybersecurity capabilities ². Codex CLI context windows top out at roughly 200,000 tokens for GPT-5.3-Codex ⁷, making it dependent on context compaction for extended sessions rather than raw window size.

```toml
# Codex CLI: select model in config.toml
model = "gpt-5.5"

# Or switch mid-session with /model
```

Jules model selection is implicit – tied to your Google AI subscription tier rather than per-task configuration ⁶.
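
To make the window sizes above concrete, here is a rough sketch of a fit check. It assumes about four bytes of source text per token, which is a common rule of thumb rather than a measured figure for either model family. The helper names (`estimate_tokens`, `repo_bytes`, `windows_that_fit`) and the file-extension list are hypothetical; the window sizes are the figures quoted in this article.

```python
import os

# Window sizes quoted in this article, in tokens.
WINDOWS = {
    "GPT-5.3-Codex (approx.)": 200_000,
    "Gemini 4 (reported reliable range, lower bound)": 5_000_000,
    "Gemini 4 (announced maximum)": 10_000_000,
}

# Assumption: a rough heuristic; real ratios vary by tokenizer and language.
BYTES_PER_TOKEN = 4


def estimate_tokens(total_bytes, bytes_per_token=BYTES_PER_TOKEN):
    """Estimate a token count from a byte count using a fixed ratio."""
    return total_bytes // bytes_per_token


def repo_bytes(root, extensions=(".py", ".ts", ".go", ".java", ".md")):
    """Sum the size of matching source files under root, skipping .git."""
    total = 0
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune .git in place so os.walk does not descend into it.
        dirnames[:] = [d for d in dirnames if d != ".git"]
        for name in filenames:
            if name.endswith(extensions):
                total += os.path.getsize(os.path.join(dirpath, name))
    return total


def windows_that_fit(tokens, windows=WINDOWS):
    """Return the names of the windows large enough to hold `tokens`."""
    return [name for name, size in windows.items() if tokens <= size]
```

Calling `windows_that_fit(estimate_tokens(repo_bytes(".")))` from a repository root gives a first-order answer to whether a codebase could be ingested whole or would depend on compaction. It ignores prompt overhead and model output, so treat the result as an upper bound on what fits.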

CLI Tooling Compared

Both platforms now have terminal interfaces, but they serve different purposes.

Jules Tools CLI

Jules Tools launched in October 2025 as a lightweight CLI for managing async tasks ⁸. Its primary commands dispatch work to cloud VMs:

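```bash
# Submit a task to Jules (subcommands as shown in the Jules Tools launch post; confirm with `jules --help`)
jules remote new --repo owner/repo --session "Add unit tests for the auth module"

# List sessions to check on task status
jules remote list --session

# Launch the interactive TUI for task management
jules
```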

Jules Tools also offers a TUI with /remote for dashboarding and /new for guided task creation ⁸. Critically, Jules Tools is a remote control – it dispatches and monitors work happening elsewhere.

Codex CLI

Codex CLI is the execution surface itself. The agent runs locally, reads files from your working tree, and applies changes directly:

```bash
# Interactive session
codex

# Non-interactive execution
codex exec "Refactor the payment module to use the new gateway interface"

# Cloud delegation (closest to Jules pattern)
codex cloud exec "Add integration tests for the auth service"
```

The `codex cloud exec` subcommand is the closest analogue to Jules’ model – it delegates work to a remote environment and returns results ⁹. But the default `codex` invocation keeps you in the loop with inline approvals, live diff rendering, and mid-session steering.
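
For scripting, the non-interactive form can be wrapped in a few lines. The sketch below uses only the `codex exec "<prompt>"` invocation shown above and adds no flags. The function names and the timeout value are hypothetical choices, and the wrapper returns `None` when the `codex` binary is not installed.

```python
import shutil
import subprocess


def build_codex_exec_command(prompt):
    """Build the argv for a non-interactive run: codex exec "<prompt>"."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    return ["codex", "exec", prompt]


def run_codex_exec(prompt, cwd=".", timeout_s=1800):
    """Run codex exec in `cwd` and return (exit_code, stdout).

    Returns None when the codex binary is not on PATH, so callers can
    fall back to another workflow. The 30-minute timeout is an arbitrary
    illustrative default, not a documented limit.
    """
    if shutil.which("codex") is None:
        return None
    result = subprocess.run(
        build_codex_exec_command(prompt),
        cwd=cwd,
        capture_output=True,
        text=True,
        timeout=timeout_s,
    )
    return result.returncode, result.stdout
```

Passing the prompt as a single list element avoids shell quoting problems, which matters once prompts are generated by other tools.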

When Jules Wins

Jules excels when:

The task is well-specified and self-contained. Bug fixes with reproduction steps, dependency version bumps, test backfill for existing modules – tasks where the brief is solid and human oversight adds latency rather than value ³.
You need parallelism across repositories. Jules spins up independent VMs per task. Five repositories, five simultaneous tasks, five pull requests by morning ⁶.
The team’s workflow is PR-centric. If every change flows through code review regardless of author, Jules’ PR-first output fits naturally into existing processes ³.
Suggested and Scheduled Tasks apply. Jules’ proactive features – suggesting improvements across up to five repositories and running scheduled maintenance tasks – have no direct Codex CLI equivalent ¹⁰.
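
The parallel fan-out pattern can be sketched as follows. `dispatch_task` is a hypothetical stub, not a real Jules API call; a real version would submit the task through the Jules API or Jules Tools. The stub lets the concurrency logic run as written.

```python
from concurrent.futures import ThreadPoolExecutor


def dispatch_task(repo, prompt):
    """Hypothetical stand-in for submitting one async task.

    A real version would call the Jules API or Jules Tools CLI; this
    stub only returns a record so the fan-out logic can be run as-is.
    """
    return {"repo": repo, "prompt": prompt, "status": "queued"}


def fan_out(tasks, dispatch=dispatch_task, max_workers=5):
    """Submit every (repo, prompt) pair concurrently, preserving input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(dispatch, repo, prompt) for repo, prompt in tasks]
        # Collect in submission order so results line up with `tasks`.
        return [future.result() for future in futures]
```

Threads fit here because each dispatch would spend its time waiting on a remote service rather than computing locally.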

When Codex CLI Wins

Codex CLI excels when:

The task needs interactive exploration. Understanding an unfamiliar codebase, debugging a flaky test, prototyping a UI – tasks where you need to steer, ask follow-up questions, and iterate ⁴.
You need immediate feedback on changes. Codex writes directly to your working tree. Run the tests yourself, check the diff, tweak the prompt, continue the session ⁴.
Configuration and customisation depth matters. AGENTS.md, hooks, skills, plugins, MCP servers, approval modes, subagent definitions, execution policies – Codex CLI offers a deep customisation surface that Jules currently lacks ¹¹.
Security boundaries are non-negotiable. Codex CLI’s sandbox restricts network access, filesystem writes, and command execution with configurable policies ⁴. Jules runs in a cloud VM with broader permissions by default.
You work in regulated or air-gapped environments. Codex CLI runs locally with optional self-hosted model providers ¹². Jules requires Google Cloud connectivity.

Project Jitro: The Goal-Driven Shift

The most significant announcement from I/O 2026 for coding agents is Project Jitro, the next evolution of Jules ¹³. Where current agents (including both Jules and Codex CLI) are task-driven – you tell them what to do – Jitro is goal-driven: you define a measurable outcome and the agent determines the changes needed.

```mermaid
graph TD
    subgraph "Task-Driven (Current)"
        T1["Write tests for auth module"] --> T2[Agent executes specific task]
        T2 --> T3[PR with test files]
    end

    subgraph "Goal-Driven (Jitro)"
        G1["Raise test coverage to 80%"] --> G2[Agent analyses coverage gaps]
        G2 --> G3[Agent prioritises highest-impact modules]
        G3 --> G4[Agent writes tests across codebase]
        G4 --> G5[Agent verifies coverage target met]
        G5 --> G6[PR with all changes]
    end
```

Jitro’s persistent workspace model – where goals, progress, and tool integrations persist across sessions – addresses a genuine gap ¹³. The compounding improvements that never make it into sprint backlogs (accessibility compliance, latency reduction, error rate improvements) are precisely the work best suited to a goal-driven agent that chips away continuously.
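
The goal-driven loop in the diagram (analyse gaps, prioritise, write tests, verify) can be illustrated with a toy model. This is a sketch of the general pattern only; it does not describe how Jitro is implemented. The `write_tests` callable is a hypothetical stand-in for the agent's test-writing step, and coverage is modelled as simple line counts.

```python
def overall_coverage(modules):
    """Line coverage across all modules, as a fraction between 0 and 1."""
    covered = sum(c for c, _ in modules.values())
    total = sum(t for _, t in modules.values())
    return covered / total if total else 1.0


def pursue_coverage_goal(modules, target, write_tests, max_rounds=50):
    """Work toward `target` coverage, most-uncovered module first.

    modules: dict of name -> (covered_lines, total_lines)
    write_tests: callable(name, uncovered) -> newly covered line count
                 (hypothetical stand-in for the agent's test-writing step)
    Returns (final_modules, touched_module_names, goal_met).
    """
    modules = dict(modules)  # work on a copy; leave the caller's dict alone
    touched = []
    for _ in range(max_rounds):
        if overall_coverage(modules) >= target:
            break
        # Prioritise the module with the most uncovered lines.
        name = max(modules, key=lambda m: modules[m][1] - modules[m][0])
        covered, total = modules[name]
        uncovered = total - covered
        if uncovered == 0:
            break  # nothing left to cover anywhere
        gained = min(write_tests(name, uncovered), uncovered)
        if gained <= 0:
            break  # no progress; stop rather than loop forever
        modules[name] = (covered + gained, total)
        touched.append(name)
    return modules, touched, overall_coverage(modules) >= target
```

The difference from a task-driven prompt is the termination condition: the loop ends when the measured metric reaches the target, not when a fixed instruction has been carried out.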

This has no direct Codex CLI equivalent today. The closest patterns are:

Codex Cloud automations with scheduled cadences ¹⁴
Scored improvement loops using `codex exec` with structured output and evaluation harnesses ⁹
Custom skills that encode a goal and verification criteria

But none of these compose into the persistent, metric-tracking workspace Jitro promises.

Using Both Together

The pragmatic approach is to use both. Here is a decision framework:

```mermaid
flowchart TD
    A[New coding task] --> B{Need interactive<br/>exploration?}
    B -->|Yes| C[Codex CLI]
    B -->|No| D{Well-specified<br/>with clear brief?}
    D -->|Yes| E{Needs immediate<br/>local feedback?}
    D -->|No| C
    E -->|Yes| C
    E -->|No| F{Multiple repos<br/>in parallel?}
    F -->|Yes| G[Google Jules]
    F -->|No| H{Team uses<br/>PR-first flow?}
    H -->|Yes| G
    H -->|No| C
```
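
The same decision framework can be written as a small function, which is handy for routing tasks from a tracker or a script. The `Task` fields and the `choose_agent` name are hypothetical labels for the five questions in the flowchart; the branch order follows the diagram.

```python
from dataclasses import dataclass


@dataclass
class Task:
    needs_exploration: bool      # Need interactive exploration?
    well_specified: bool         # Well-specified with a clear brief?
    needs_local_feedback: bool   # Needs immediate local feedback?
    multi_repo_parallel: bool    # Multiple repos in parallel?
    pr_first_team: bool          # Team uses a PR-first flow?


def choose_agent(task):
    """Walk the flowchart top to bottom and return the suggested agent."""
    if task.needs_exploration:
        return "Codex CLI"
    if not task.well_specified:
        return "Codex CLI"
    if task.needs_local_feedback:
        return "Codex CLI"
    if task.multi_repo_parallel:
        return "Google Jules"
    return "Google Jules" if task.pr_first_team else "Codex CLI"
```

For example, a well-specified dependency bump across several repositories routes to Jules, while any task that needs exploration routes to Codex CLI regardless of the other answers.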

A practical dual-agent workflow:

Morning triage with Jules: queue dependency bumps, test backfill, and linter fixes as Jules tasks across your repositories.
Interactive development with Codex CLI: feature work, debugging, and architecture exploration in the terminal.
Afternoon review: review Jules PRs alongside your local Codex changes.

For teams already invested in Codex CLI’s customisation surface (AGENTS.md, hooks, MCP servers, skills), the natural integration point is `codex cloud exec` for async tasks rather than switching to Jules entirely. For teams with lighter configuration needs and a strong PR review culture, Jules provides a lower-friction path to async agent work.

Configuration Surface Comparison

| Feature | Codex CLI | Jules |
| --- | --- | --- |
| Instruction files | AGENTS.md (hierarchical, per-directory) ¹¹ | Repository-level configuration |
| Tool integration | MCP servers, plugins, skills ¹¹ | GitHub integration, Render ¹⁰ |
| Approval modes | Auto, suggest, full-auto, per-tool overrides ⁴ | Plan approval before execution ³ |
| Hooks | Stable lifecycle hooks (PreToolUse, PostToolUse, etc.) ¹⁵ | Not applicable |
| Subagents | Custom agent definitions in TOML ¹¹ | Single-agent execution |
| Non-interactive | `codex exec` with `--output-schema` ⁹ | Jules API and Jules Tools CLI ⁸ |
| Scheduling | Via Codex app automations or cron + `codex exec` ¹⁴ | Native Scheduled Tasks ¹⁰ |
| Model choice | Per-session, per-config, mid-session switching ⁷ | Tied to subscription tier ⁶ |

Pricing Considerations

Codex CLI is included with paid ChatGPT plans (Plus, Pro, Business, Enterprise), with overflow usage billed at API-style token rates ¹⁶. Jules operates across three tiers tied to Google AI subscriptions, with a free tier using Gemini 3 Flash and paid tiers using Gemini 3.1 Pro ⁶.

For teams already paying for ChatGPT Enterprise or Pro, Codex CLI adds no marginal cost within included quotas. For teams on Google Workspace with Google AI subscriptions, Jules is similarly bundled. The economic decision often follows existing platform commitments rather than per-feature comparison.

What to Watch

Three developments will reshape this comparison in the coming months:

Jitro’s general availability. If goal-driven agents deliver on the promise, every task-driven tool – including Codex CLI – will need an answer to “why am I still writing prompts instead of setting targets?” ¹³
Codex CLI’s evolving cloud surface. The `codex cloud exec` pattern with `--attempts` for best-of-N execution is already moving toward async territory ⁹. Further convergence is likely.
Cross-agent portability. As more teams run multiple agents, the ability to share configuration (AGENTS.md is already an open standard adopted by multiple tools ¹¹) and switch between sync and async execution surfaces becomes the real differentiator.
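
The best-of-N idea mentioned in the list above is simple to sketch in generic form. The article does not describe how Codex cloud ranks attempts, so the `generate` and `score` callables below are hypothetical placeholders for "produce one candidate change" and "evaluate it" (for example, by test pass rate).

```python
def best_of_n(generate, score, attempts=3):
    """Run `generate` for each attempt index and keep the top-scoring result.

    generate: callable(attempt_index) -> candidate
    score: callable(candidate) -> number, higher is better
    Ties go to the earliest attempt. Returns (best_candidate, best_score).
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    scored = []
    for index in range(attempts):
        candidate = generate(index)
        scored.append((score(candidate), index, candidate))
    # Highest score wins; on equal scores the lower index wins.
    best_score, _, best = max(scored, key=lambda item: (item[0], -item[1]))
    return best, best_score
```

The scoring function carries most of the value in this pattern: several attempts only help if there is a reliable way to tell a better candidate from a worse one.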

Citations

1. “Google I/O 2026 Developer Briefing,” byteiota.com
2. “Introducing GPT-5.2-Codex,” OpenAI, May 2026, openai.com
3. “Jules: Google’s autonomous AI coding agent,” Google Blog, blog.google
4. “Features – Codex CLI,” OpenAI Developers, developers.openai.com
5. “Codex CLI for Cross-Repository Development,” Codex Blog, May 2026, codex.danielvaughan.com
6. “Google Jules: Gemini Async Coding Agent Guide 2026,” Digital Applied, digitalapplied.com
7. “Models – Codex,” OpenAI Developers, developers.openai.com
8. “Meet Jules Tools: A Command Line Companion for Google’s Async Coding Agent,” Google Developers Blog, developers.googleblog.com
9. “Non-interactive mode – Codex,” OpenAI Developers, developers.openai.com
10. “Jules from Google Labs introduces proactive coding features,” Google Blog, blog.google
11. “Custom instructions with AGENTS.md – Codex,” OpenAI Developers, developers.openai.com
12. “Advanced Configuration – Codex,” OpenAI Developers, developers.openai.com
13. “Google Project Jitro: Jules V2 Moves from Prompts to Goals,” byteiota, byteiota.com
14. “Automations – Codex app,” OpenAI Developers, developers.openai.com
15. “Hooks – Codex,” OpenAI Developers, developers.openai.com
16. “Pricing – Codex,” OpenAI Developers, developers.openai.com