Pi – A minimal terminal coding harness

CGamesPlay · 2026-02-25T03:28:43 1771990123

To me, the most interesting thing about Pi and the "claw" phenomenon is what it means for open source. It's becoming passé to ask for feature requests and even to submit PRs to open source repos. Instead of extensions you install, you download a skill file that tells a coding agent how to add a feature. The software stops being an artifact and starts being a living tool that isn't the same as anyone else's copy. I'm curious to see what tooling will emerge for collaborating with this new paradigm.

throwaway13337 · 2026-02-25T04:29:32 1771993772

I see this happening, too.

We know that a lack of control over their environment makes animals, including humans, depressed.

The software we use has so much of this lack of control. It's their way, their branding, their ads, their app. You're the guest on your own device.

It's no wonder everyone hates technology.

A world with software that is malleable, personal, and cheap - this could do a lot of good. Real ownership.

The nerds could always make a home with their linux desktop. Now everyone can. It'll change the equation.

I'm quite optimistic for this future.

h14h · 2026-02-25T09:07:16 1772010436

I'm presently in the process of building (read: directing claude/codex to build) my own AI agent from the ground up, and it's been an absolute blast.

Building it exactly to my design specs, giving it only the tool calls I need, owning all the data it stores about me for RAG, integrating it to the exact services/pipelines I care about... It's nothing short of invigorating to have this degree of control over something so powerful.

In a couple of days work, I have a discord bot that's about as useful as chatgpt, using open models, running on a VPS I manage, for less than $20/mo (including inference). And I have full control over what capabilities I add to it in the future. Truly wild.

hdjrudni · 2026-02-25T05:07:01 1771996021

That's just because corporations got greedy and made their apps suck.

Strip away the ads, the data harvesting, add back the power features, and we'll be happy again. I'm more willing than ever to pay a one-time fee good software. I've started donating to all the free apps I use on a regular basis.

I don't want to own my own slop. That doesn't help me. Use your AI tools to build out the software if you want, but make sure it does a good job. Don't make me fiddle with indeterministic flavor-of-the-month AI gents.

moring · 2026-02-25T08:17:27 1772007447

> That's just because corporations got greedy and made their apps suck.

It is true for me with Linux. I code for a living and I can't change anything because I can't even build most software -- the usual configure/make/make install runs into tons of compiler errors most of the time.

Loss of control is an issue. I'm curious if AI tools will change that though.

safety1st · 2026-02-25T05:38:10 1771997890

I think there's room for both visions. Big Tech is generating more toxic sludge than ever, and yeah sure this is because they're greedy, but more precisely the root cause is how they lobbied Washington and our elected officials agreed to all kinds of pro-corporate, anti-human legislation. Like destroying our right to repair, like criminalizing "circumvention" measures in devices we own, like insane life-destroying penalties for copyright infringement, like looking the other way when Big Tech broke anti-trust laws, etc.

The Big Tech slop can only be fixed in one way, and actually it's really predictable and will work - we need to fix the laws so that they put the rights and flourishing of human beings first, not the rights and flourishing of Big Tech. We need to fix enforcement because there are so many times that these companies just break the law and they get convicted but they get off with a slap on the wrist. We need to legislate a dismantling of barriers to new entrants in the sectors they dominate. Competition for the consumer dollar is the only thing that can force them to be more honest. They need to see that their customers are leaving for something better, otherwise they'll never improve.

But our elected officials have crafted laws and an enforcement system which make 'something better' impossible (or at least highly uneconomical).

Parallel to this if open source projects can develop software which is easier for the user to change via a PR, they totally should. We can and should have the best of both worlds. We should have the big companies producing better "boxed" software. Plus we should have more flexibility to build, tweak and run whatever we want.

mentalgear · 2026-02-25T09:09:46 1772010586

Very good points, I agree and would add : "Interoperability" is the key to bring back competition and open the ecosystem again.

bergfest · 2026-02-25T06:59:09 1772002749

And then they will take away your right to boot whatever you want. For national security reasons and the children, of course.

peepee1982 · 2026-02-25T07:14:56 1772003696

What you're describing is the expected and correct outcome inside a profit-oriented, capitalist system. So the only way I see out of this situation would be changing policy to a more socialist one, which doesn't seem to be so popular among the tech elite, who often think they deserve their financial status because of the 'value' they provide, without specifying what that value is (or its second-order consequences). Whether that's abusing a monopolistic market position they lucked into, making apps as addictive as possible, or building drones that throw bombs on newborns in hospitals.

bandrami · 2026-02-25T08:37:51 1772008671

> a living tool that isn't the same as anyone else's copy

Yes, which is why this model of development is basically dead-in-the-water in terms of institutional adoption. No large firm or government is going to allow that.

raincole · 2026-02-25T09:47:02 1772012822

Large institutions and governments had been pushing back against open source too until it became obviously inevitable.

bandrami · 2026-02-25T10:05:16 1772013916

It wasn't "inevitable", it took Red Hat and some other key players addressing the concerns the businesses and governments had, which took the better part of a decade. If LLMs as an ecosystem don't implode in the next year or so I imagine you'll start to see some big consultancies starting that same process for them.

axelthegerman · 2026-02-25T03:32:41 1771990361

And how great it will be to troubleshoot any issues because everyone is basically running a distinct piece of software

theshrike79 · 2026-02-25T06:58:23 1772002703

It's like the dude who monkey-patches their car and goes to the dealer to complain why the suspension is stiff.

It's because you put 2by4's in place of the shocks, you absolute muppet. And then they either give them a massive bill to fix it properly or politely show them out.

Same will happen in self-modifying software. Some people are self-aware enough to know that "I made this, it's my problem to fix", some will complain to the maker of the harness they used and will be summarily shown the door.

wrxd · 2026-02-25T06:51:28 1772002288

I don’t want to be the one who has to upgrade this software + vibe coded patches.

It’s going to be very likely that once something is patched is to be considered as diverged and very hard to upgrade

sshine · 2026-02-25T03:39:09 1771990749

... made minutes ago.

krickelkrackel · 2026-02-25T06:24:15 1772000655

So everybody will be using (sometimes slightly, sometimes entirely) different software. Like mutations, these adapt to the specific problems in the situation they were prompted to be programmed.

CuriouslyC · 2026-02-25T04:32:18 1771993938

The skill for feature thing is just horrible, it's wasteful to everyone but the maintainer. It feels like a YOLO people are getting away with because people drank some kool-aid.

theshrike79 · 2026-02-25T06:59:15 1772002755

Think of skills more like Excel macros (or any other software with robust macro support). It doesn't make sense for Microsoft to provide the specific workflow you need, but your own sheet needs it.

rcarmo · 2026-02-24T22:40:28 1771972828

My current fave harness. I've been using it to great effect, since it is self-extensible, and added support for it to https://github.com/rcarmo/vibes because it is so much faster than ACP.

solarkraft · 2026-02-25T09:50:43 1772013043

Can you shed some light on the speed difference of the direct integration vs. ACP?

I’m still looking for a generic agent interaction protocol (to make it worth building around) and thought ACP might be it. But (and this is from a cursory look) it seems that even OpenCode, which does support ACP, doesn’t use it for its own UI. So what’s wrong with it and are there better options to hopefully take its place?

gusmally · 2026-02-25T01:41:24 1771983684

Which ones have you compared it against?

rcarmo · 2026-02-25T07:57:00 1772006220

Literally all of them: https://github.com/rcarmo/agentbox

badlogic · 2026-02-24T23:03:10 1771974190

wow, i love this! was about to build this myself, but this looks exactly what i want.

rcarmo · 2026-02-24T23:09:16 1771974556

The better web UI is now part of https://github.com/rcarmo/piclaw (which is essentially the same, but with more polish and a claw-like memory system). So you can pick if you want TS or Python as the back-end :)

badlogic · 2026-02-24T23:11:28 1771974688

if i ever want a claw, i'd obv. go with this :)

rcarmo · 2026-02-24T23:27:46 1771975666

The claw version’s web UI essentially has better thinking output, more visibility of tool calls, and slightly better SSE streaming. I’ve backported some of it to vibes, but if you want to borrow UI stuff, the better bits are in piclaw. I use both constantly on my phone/desktop.

baby · 2026-02-25T04:22:34 1771993354

Wdym harness? Its a coding agent

furryrain · 2026-02-25T04:52:34 1771995154

I think the thesis of Pi is that there isn't much special about agents.

Model + prompt + function calls.

There are many such wrappers, and they differ largely on UI deployment/integration. Harness feels like a decent term, though "coding harness" feels a bit vague.

mr_o47 · 2026-02-25T09:49:39 1772012979

I recently discovered this via a YouTube video a few days ago

I really like the customization aspect of it and you can build tools on fly and even switch model mid session

There’s another project here called oh my pi has anyone here tried it

reacharavindh · 2026-02-25T08:01:01 1772006461

I began with pi, and have been using oh-my-pi the last two weeks.

https://github.com/can1357/oh-my-pi

More of a batteries included version of pi.

mr_o47 · 2026-02-25T09:51:04 1772013064

How’s your experience so far with oh my pi

self_awareness · 2026-02-25T08:35:23 1772008523

Are you running it in some kind of sandbox? Does it have sandboxing features?

tmustier · 2026-02-25T00:38:41 1771979921

I haven’t met a single person who has tried pi for a few days and not made it their daily driver. Once you taste the freedom of being able to set up your tool exactly how you like, there’s really no going back.

and you can build cool stuff on top of it too!

ck_one · 2026-02-25T01:44:00 1771983840

What self-built capabilities do you like the most that claude code doesn't offer?

tomashubelbauer · 2026-02-25T09:13:37 1772010817

Not the person you replied to, but I'll stress the point that it is not just what you can add that Claude Code doesn't offer, but also what you don't need to add that Claude Code does offer that you don't want.

I dislike many things about Claude Code, but I'll pick subagents as one example. Don't want to use them? Tough luck. (AFAIK, it's been a while since I used CC, maybe it is configurable now or was always and I never discovered that.)

With Pi, I just didn't install an extension for that, which I suspect exists, but I have a choice of never finding out.

sshine · 2026-02-25T03:41:06 1771990866

> I haven’t met a single person who has tried pi for a few days and not made it their daily driver.

Pleased to meet you!

For me, it just didn’t compare in quality with Claude CLI and OpenCode. It didn’t finish the job. Interesting for extending, certainly, but not where my productivity gains lie.

esafak · 2026-02-25T05:13:39 1771996419

People seem to be really enjoying rolling everything themselves these days...

theshrike79 · 2026-02-25T08:52:30 1772009550

I've spent way too long working around the jank and extra features in Other People's Software.

Now I can just make my own that does exactly what I want and need, nothing more or nothing less. It's just for me, it's not a SaaS or a "start-up" I'm the CEO of.

ixsploit · 2026-02-25T06:57:32 1772002652

Because it’s very easy todo nowadays. Why making compromises in your workflow anymore?

chriswarbo · 2026-02-25T01:21:31 1771982491

I've been using pi via the pi-coding-agent Emacs package, which uses its RPC mode to populate a pair of Markdown buffers (one for input, one for chat), which I find much nicer than the awful TUIs used by harnesses like gemini-cli (Emacs works perfectly well as a TUI too!).

The extensibility is really nice. It was easy to get it using my preferred issue tracker; and I've recently overridden the built-in `read` and `write` commands to use Emacs buffers instead. I'd like to override `edit` next, but haven't figured out an approach that would play to the strengths of LLMs (i.e. not matching exact text) and Emacs (maybe using tree-sitter queries for matches?). I also gave it a general-purpose `emacs_eval`, which it has used to browse documentation with EWW.

dnouri · 2026-02-25T02:23:21 1771986201

Nice! I'm curious to hear how you're mapping `read` and `write` to Emacs buffers. Does that mean those commands open those files in Emacs and read and write them there?

Let me also drop a link to the Pi Emacs mode here for anyone who wants to check it out: https://github.com/dnouri/pi-coding-agent -- or use: M-x package-install pi-coding-agent

We've been building some fun integrations in there like having RET on the output of `read`, `write`, `edit` tool calls open the corresponding file and location at point in an Emacs buffer. Parity with Pi's fantastic session and tree browsing is hopefully landing soon, too. Also: Magit :-)

thepasch · 2026-02-25T09:11:24 1772010684

Stop advertising pi, people. It _somehow_ continued to fly somewhat under the radar after that whole OpenClaw nonsense. Don’t make Anthropic’s sic their bloodhounds on them like they did on OpenCode.

raincole · 2026-02-25T09:40:50 1772012450

Interestingly, since OpenClaw, there has been ~one post about Pi every week. But practically no one voted any of them except this one.

tietjens · 2026-02-25T09:21:49 1772011309

pi is an officially accepted harness of either Anthropic or OpenAI. I forgot which.

himata4113 · 2026-02-24T22:47:50 1771973270

Preconfigured PI: https://github.com/can1357/oh-my-pi

amin2 · 2026-02-25T09:00:15 1772010015

This looks great but It feels really risky to add more and more tools to the harness from random repos. Nothing against this repo in particular but I wish we had better security and isolation so I that I knew nothing could go wrong and I could just test a bunch of these every day the same way I can install an app on my phone and feel confident it's not going to steal my data.

virtuallynathan · 2026-02-25T03:51:04 1771991464

Big fan of this fork, been using it for everything for the last couple of weeks.

Went from codex/claude code -> opencode -> pi -> oh-my-pi

jannniii · 2026-02-25T08:39:06 1772008746

It is an awesome fork! Tried to contribute also, but community seems quite close knit.

thepasch · 2026-02-25T09:09:52 1772010592

I feel like this misses the point of pi somewhat. The allure of pi is that it allows you to start from scratch and make it entirely your own; that it’s lightweight and uses only what you need. I go through the list of features in this and I think, okay, cool, but why should I use this over OpenCode if I just want a feature-packed (and honestly -bloated) ready-made harness?

mijoharas · 2026-02-25T00:26:58 1771979218

I'd quite like the web tools from oh-my-pi, but able to be extracted to a normal pi tool or plugin... Maybe I should look into that sometime...

esafak · 2026-02-25T05:14:13 1771996453

Why not OpenCode?

jannniii · 2026-02-25T08:40:08 1772008808

Oh-my-bloat.

I am still an avid user of opencode, my own fork though with async tools etc, but it is cumbersome and tries to do too many things.

tietjens · 2026-02-25T09:35:42 1772012142

very interesting, i tried it at the start but haven't come back. could you expand on what you mean?

buremba · 2026-02-25T06:23:25 1772000605

I spent 3 months adopting Codex and Claude Code SDKs only to realize they're just vendor lock-in and brittle. They're intended to be used as CLI so it's not programmable enough as a library. After digging into OpenClaw codebase, I can safely say that the most of its success comes from the underlying harness, pi agent.

pi plugins support adding hooks at every stage, from tool calls to compaction and let you customize the TUI UI as well. I use it for my multi-tenant Openclaw alternative https://github.com/lobu-ai/lobu

If you're building an agent, please don't use proprietary SDKs from model providers. Just stick to ai-sdk or pi agent.

kzahel · 2026-02-25T10:02:42 1772013762

I left some notes about this. I agree with you directionally but practically/economically you want to let users leverage what they're already paying for.

https://yepanywhere.com/subscription-access-approaches/

Captures the ai-sdk and pi-mono.

In an ideal world we would have a pi-cli-mono or similar, like something that is not as powerful as pi but gives a least common denominator sort of interface to access at least claude/codex.

ACP is also something interesting in this space, though I don't honestly know how that fits into this story.

bjackman · 2026-02-25T07:42:08 1772005328

IIUC to reliably use 3P tools you need to use API billing, right? Based on my limited experimentation this is an order of magnitude more expensive than consumer subscriptions like Claude Pro, do I have that right?

("Limited experimentation" = a few months ago I threw $10 into the Anthropic console and did a bit of vibe coding and found my $10 disappeared within a couple of hours).

If so, that would support your concern, it does kinda sound like they're selling marginal Claude Code / Gemini CLI tokens at a loss. Which definitely smells like an aggressive lockin strategy.

buremba · 2026-02-25T08:00:23 1772006423

Technically you're still using claude CLI with this pattern so it's not 3P app calling Anthropic APIs via your OAuth token. Even if you would use Claude Code SDK, your app is 3P so it's in a gray area.

Anthropic docs is intentionally not clear about how 3P tools are defined, is it calling Claude app or the Anthropic API with the OAuth tokens?

vanillameow · 2026-02-25T07:44:53 1772005493

Unfortunately it's currently very utopian for (I would assume) most devs to use something like this when API cost is so prohibitively expensive compared to e.g. Claude Code. I would love to use a lighter and better harness, but I wouldn't love to quintuple my monthly costs. For now the pricing advantage is just too big for me compared to the inconvenience of using CC.

buremba · 2026-02-25T08:02:15 1772006535

You technically still use CC, it's not via SDK but via CLI programmatically triggered via pi.

vanillameow · 2026-02-25T09:42:59 1772012579

Is this in line with Anthropic ToS? They cracked down hard on Clawdbot and the like from what I gathered. I guess if you are still invoking CC it might be fine, but isn't that gonna lead to weird behavior from basically doubling up on harnesses?

siva7 · 2026-02-25T07:00:44 1772002844

I also wondered for months why it feels so difficult to use Openai or Anthropic SDKs until i came to a similar conclusion.

mccoyb · 2026-02-24T23:26:56 1771975616

Pi has made all the right design choices. Shout out to Mario (and Armin the OG stan) — great taste shows itself.

semiinfinitely · 2026-02-24T23:45:58 1771976758

I do not understand why in the age of ai coding we would implement this in javascript

solarkraft · 2026-02-25T09:55:40 1772013340

It’s one of the most productive languages and ecosystems (IMO top 1 over all).

mccoyb · 2026-02-24T23:51:30 1771977090

It’s straightforward: JavaScript is a dynamic language, which allows code (for instance, code implementing an extension to the harness) to be executed and loaded while the harness is running.

This is quite nice — I do think there’s a version of pi’s design choices which could live in a static harness, but fully covering the same capabilities as pi without a dynamic language would be difficult. (You could imagine specifying a programmable UI, etc — various ways to extend the behavior of the system, and you’d like end up with an interpreter in the harness)

At least, you’d like to have a way to hot reload code (Elixir / Erlang could be interesting)

This is my intuition, at least.

jatari · 2026-02-25T00:30:36 1771979436

Code hotloading isn't a particularly difficult feature to implement in any language.

mccoyb · 2026-02-25T00:38:21 1771979901

Sure, but why implement a novel language with said feature if your concern is a harness ... not on implementing a brand new language with this feature?

jauntywundrkind · 2026-02-25T03:31:37 1771990297

Rust can't even dynamically link!

I'm super on board the rust train right now & super loving it. But no, code hot loading is not common.

Most code in the world is dead code. Most languages are for dead code. It's sad. Stop writing dead code (2022) was no where near the first, is decades and decades late in calling this out, but still a good one. https://jackrusher.com/strange-loop-2022/

jasonjmcghee · 2026-02-25T05:45:30 1771998330

Incredible talk and I agree with all the things and I've worked on this problem a bunch.

But Rust can dynamically link with dylib but I believe it's still unstable.

It can also dynamically load with libloading.

sergiomattei · 2026-02-25T00:55:31 1771980931

I built my own harness on Elixir/Erlang[0]. It's very nice, but I see why TypeScript is a popular choice.

No serialization/JSON-RPC layer between a TS CLI and Elixir server. TS TUI libraries utilities are really nice (I rewrote the Elixir-based CLI prototype as it was slowing me down). Easy to extend with custom tools without having to write them in Elixir, which can be intimidating.

But you're right that Erlang's computing vision lends itself super well to this problem space.

[1]: https://github.com/matteing/opal

raincole · 2026-02-25T09:39:12 1772012352

Thank god it's written in JavaScript. I might have skipped it if it were zig or something.

KeplerBoy · 2026-02-25T08:14:52 1772007292

This confused me about openclaw for quite some time. The whole lobster/crustacean theme is just firmly associated with rust in my head. Guess it's just a claude/claw wordplay.

sean_pedersen · 2026-02-25T01:23:24 1771982604

There is a Rust port: https://github.com/Dicklesworthstone/pi_agent_rust

saberience · 2026-02-25T09:40:30 1772012430

If you look at that code it’s possibly the worst rust code I’ve seen in my life. There are several files with 5000 to 10000 lines of code in a single file.

It looks 100% vibe coded by someone who’s a complete neophyte.

jauntywundrkind · 2026-02-25T03:34:18 1771990458

Fwiw @dicklesworthstone / jeff Emanuel is definitely my favorite dragon rider right now, doing the most with AI, to the most effect.

Their agent mail was great & very early in agent orchestration. Code agent search is amazing & will tell you what's happening in every harness. Their Franktui is a ridiculously good rust tui. They have project after project after project after project and they are all so good.

Didn't know they had a rust Pi. Nice.

saberience · 2026-02-25T09:49:43 1772012983

You should look at the code in that project. It’s terrible, I mean, really, really terrible.

It’s clear it was 100% written by Claude using sub-agents which explains the many classes with 5000 lines of rust in a single file.

It’s a huge buggy mess which doesn’t run on my Mac.

If you’re a rust engineer and want a good laugh, go take a look at the agent.rs, auth.rs, or any of the core components.

thomasfromcdnjs · 2026-02-25T06:43:00 1772001780

I am building an entire GPT model framework from the ground up in Typescript + small amounts of c bindings for gpu stuff. https://github.com/thomasdavis/alpha2 (using claude)

Don't hate me aha and no, there is no reason other than I can

Blackarea · 2026-02-24T23:50:35 1771977035

yes! I just don't understand that as well. Up until some time ago claud code's preferred install was a npm i, wasn't it? Please serious answers for why anyone would use a web language for a terminal app

fragmede · 2026-02-25T04:08:03 1771992483

Because it's what the person writing it's preferred language.

So it can share code with the web app.

Because writing it in javascript is easier than writing it in raw brute forced assembly.

andai · 2026-02-25T04:02:31 1771992151

See also: pz: pi coding-agent in Zig

https://news.ycombinator.com/item?id=47120784

moonlion_eth · 2026-02-25T02:28:48 1771986528

i wrote an agent in zig, it kinda sucks tho. the language is just words

rahimnathwani · 2026-02-24T23:31:34 1771975894

Hugging Face now provides instructions for using local models in Pi:

https://x.com/victormustar/status/2026380984866710002

elyase · 2026-02-25T00:22:15 1771978935

there is also pz a drop-in replacement for pi rewritten in Zig. 1.7MB static binary, 3ms startup, 1.4MB RAM idle. Find more at:

https://github.com/elyase/awesome-personal-ai-assistants?tab...

snthpy · 2026-02-25T06:18:54 1772000334

Cool, thanks for this. What about the extensions though? For me the point about pi is minimal base plus configurable extensions you choose.

_neil · 2026-02-25T02:27:01 1771986421

Direct link to pz for those on mobile: https://github.com/joelreymont/pz

thevinter · 2026-02-25T00:43:03 1771980183

Pi was probably the best ad for Claude Code I ever saw.

After my max sub expired I decided to try Kimi on a more open harness, and it ended up being one of the worst (and eye opening experiences) I had with the agentic world so far.

It was completely alienating and so much 'not for me', that afterwards I went back and immediately renewed my claude sub.

https://www.thevinter.com/blog/bad-vibes-from-pi

raincole · 2026-02-25T09:33:28 1772012008

Technically you're not allowed to use Claude subscription account with Pi (according to Anthropic's policy). So yeah, Pi is the best anti-ad against Anthropic.

mccoyb · 2026-02-25T00:58:42 1771981122

> I would say that the project actively expects you to be downloading them to fill any missing gaps you might have.

Where did you get this perspective from?

> I thought pi and its tools were supposed to be minimal and extensible. So why is a subagent extension bundling six agents I never asked for that I can’t disable or remove?

Why do you think a random subagents extension is under the same philosophy as pi?

Your blog post says little about pi proper, it's essentially concerned with issues you had with the ecosystem of extensions, often made by random people who either do or do not get the philosophy? Why would that be up to pi to enforce?

the_mitsuhiko · 2026-02-25T07:34:19 1772004859

Sharing extensions is very much the philosophy. Using them however is less so.

Pi ships with docs that include extensions and the agent looks there for inspiration if you ask it to build a custom extension.

Looking at what others publish is useful!

a96 · 2026-02-25T08:52:24 1772009544

> As it turns out, the opinions in question are that bash should be enabled by default with no restrictions, that the agent should have access to every file on your machine from the start, and that npm is the only package manager worth supporting.

Yep. This is why I've been going "Hell, no!" and will probably keep doing so.

CGamesPlay · 2026-02-25T03:23:55 1771989835

> if I start the agent in ./folder then anything outside of ./folder should be off limits unless I explicitly allow it, and the same goes for bash where everything not on an allowlist should be blocked by default.

Here's the problem with Claude Code: it acts like it's got security, but it's the equivalent of a "do not walk on grass" sign. There's no technical restrictions at play, and the agent can (maliciously or accidentally) bypass the "restrictions".

That's why Pi doesn't have restrictions by default. The logic is: no matter what agent you are using, you should be using it in a real sandbox (container, VM, whatever).

esafak · 2026-02-25T05:30:58 1771997458

But the agent has to interact with the world; fetch docs, push code, fetch comments, etc. You can't sandbox everything. So you push that configuration to your sandbox, which is a worse UX that the harness just asking you at the right time what you'd like to do.

the_mitsuhiko · 2026-02-25T07:30:01 1772004601

I too would like to know what a good UX looks like here but I have doubts that the permission prompts of Claude are the way to go right now.

Within days people become used to just hitting accept and allowlisting pretty much everything. The agents write length scripts into shell scripts or test runners that themselves can be destructive but they immediately allowlisted.

CGamesPlay · 2026-02-25T07:12:07 1772003527

Well, you are imagining a worse UX, but it doesn't have to be. Pi doesn't include a sandboxing story at all (Claude provides an advisory but not mandatory one), but the sandbox doesn't have to be a simple static list of allowed domains/files. It's totally valid to make the "push code" tool in the sandbox send a trigger to code running outside of the sandbox, which then surfaces an interactive prompt to you as a user. That would give you the interactivity you want and be secure against accidentally or deliberately bypassing the sandbox.

rcarmo · 2026-02-25T07:35:51 1772004951

Paraphrasing The Dude, that’s like, just your opinion, man.

tern · 2026-02-25T06:14:17 1772000057

I had a very similar experience. I have different preferences, but ultimately, my takeaway was that if I want to follow my own version of their philosophy, I should just create my own thing.

In the meantime, the codex/cc defaults are better for me.

NamlchakKhandro · 2026-02-25T02:56:32 1771988192

hypegrift

ge96 · 2026-02-24T22:55:46 1771973746

Is that an official term "coding harness"

Wondering if you wanted a similar interface (though a GUI not just CLI) where it's not for coding what would you call that?

Same idea cycle through models, ask question, drag-drop images, etc...

arcanemachiner · 2026-02-24T23:56:35 1771977395

Yes. It seems to be the term that stands out the most, as terms like "AI coding assistant", "agentic coding framework", etc. are too vague to really differentiate these tools.

"harness" fits pretty nicely IMO. It can be used as a single word, and it's not too semantically overloaded to be useful in this context.

rcarmo · 2026-02-24T23:08:06 1771974486

LLM harness has been in vogue for a year now…

outofpaper · 2026-02-24T23:12:08 1771974728

A harness is a collection of stubs and drivers configured to assist with automation or testing. It's a standard term often used in QA as they've been automating things for ages before Gen Ai came on to the scene.

arcanemachiner · 2026-02-24T23:53:48 1771977228

Yes, it is also a device used to control the movement of work animals, which farmers have been using for ages before QA came on to the scene.

arjie · 2026-02-24T22:31:37 1771972297

Has anyone used an open coding agent in headless mode? I have a system cobbled together with exceptions going to a centralized system where I can then have each one pulled out and `claude -p`'d but I'd rather just integrate an open coding agent into the loop because it's less janky and then I'll have it try to fix the problem and propose a PR for me to review. If anyone else has used pi.dev or opencode or aider in this mode (completely non-interactive until the PR) I'd be curious to hear.

EDIT: Thank you to both responders. I'll just try the two options out then.

chriswarbo · 2026-02-25T00:43:42 1771980222

pi has an RPC mode which just sends/receives JSON lines over stdio (including progress updates, and "UI" things like asking for confirmation, if it's configured for that).

That's how the pi-coding-agent Emacs package interacts with pi; and it's how I write automated tests for my own pi extensions (along with a dummy LLM that emits canned responses).

fred_tandemai · 2026-02-24T23:22:21 1771975341

Been using pi exactly for this and it's working great!

rcarmo · 2026-02-24T22:41:05 1771972865

You probably want to look into pi then - it's extremely extensible.

dosinga · 2026-02-24T22:41:37 1771972897

you can run https://block.github.io/goose/ in headless mode (I work on goose)

evalstate · 2026-02-24T23:19:45 1771975185

fast-agent lets you do this as well (and has a skill in its default skills repo to help with automation/running in container/hf job).

infruset · 2026-02-24T22:46:35 1771973195

Note there is a fork oh-my-pi: https://github.com/can1357/oh-my-pi of https://blog.can.ac/2026/02/12/the-harness-problem/ fame. I use it as a daily driver but I also love pi.

jmorgan · 2026-02-24T22:33:52 1771972432

I've been using Pi day to day recently for simple, smaller tasks. It's a great harness for use with smaller parameter size models given the system prompt is quite a bit shorter vs Claude or Codex (and it uses a nice small set of tools by default).

rpastuszak · 2026-02-24T23:58:21 1771977501

Which models do you use and what for? I'm looking for ideas to play with.

jmorgan · 2026-02-25T06:51:39 1772002299

For local models I've been trying it with GLM-4.7-Flash and the new LFM2 24B model. I'm excited to try it with the new Qwen3.5 models that came out today as well.

mongrelion · 2026-02-24T23:24:22 1771975462

Pi ships with powerful defaults but skips features like sub-agents and plan mode

Does anyone have an idea as to why this would be a feature? don't you want to have a discussion with your agent to iron out the details before moving onto the implementation (build) phase?

In any case, looks cool :)

EDIT 1: Formatting EDIT 2: Thanks everyone for your input. I was not aware of the extensibility model that pi had in mind or that you can also iterate your plan on a PLAN.md file. Very interesting approach. I'll have a look and give it a go.

ramoz · 2026-02-24T23:25:38 1771975538

See my comment in the thread but there is an intuitive extension architecture that makes integrating these type of things feel native.

https://github.com/badlogic/pi-mono/tree/main/packages/codin...

alvivar · 2026-02-24T23:27:05 1771975625

I plan all the time. I just tell Pi to create a Plan.md file, and we iterate on it until we are ready to implement.

jauntywundrkind · 2026-02-25T03:40:16 1771990816

Agreed. I rarely find the guardrails of plan to be necessary; I basically never use it on opencode. I have some custom commands I use to ask for plan making, discussion.

As for subagents, Pi has sessions. And it has a full session tree & forking. This is one of my favorite things, in all harnesses: build the thing with half the context, then keep using that as a checkpoint, doing new work, from that same branch point. It means still having a very usable lengthy context window but having good fundamental project knowledge loaded.

miroljub · 2026-02-24T23:27:27 1771975647

Check https://pi.dev/packages

There are already multiple implementations of everything.

With a powerful and extensible core, you don't need everything prepackaged.

muratsu · 2026-02-24T23:53:45 1771977225

I’m working with a friend to build an ui around Pi to make it more user friendly for people who prefer to work with a gui (ala conductor). You can check out the repo: https://github.com/philipp-spiess/modern

ramoz · 2026-02-24T23:57:37 1771977457

In the same spirit, I also ported a planning UI extension for Pi.

https://plannotator.ai/blog/plannotator-meets-pi/

type4 · 2026-02-25T01:34:17 1771983257

What are people using to cost efficiently use this? I was using a Google Ultra sub which gave enough but that’s gone now.

ChatGPT $20/month is alright but I got locked out for a day after a couple hours. Considering the GitHub pro plus plan.

UncleOxidant · 2026-02-25T04:04:00 1771992240

Run Qwen3-coder-next locally. That's what I'm doing (using LMstudio). It's actually a surprisingly capable model. I've had it working on some LLVM-IR manipulation and microcode generation for a kind of VLIW custom processor. I've been pleasantly surprised that it can handle this (LLVM is not easy) - there are also verilog code that define the processor's behavior that it reads to determine the microcode format and expected processor behavior. When I do hit something that it seems to struggle with I can go over to antigravity and get some free Gemini 3 flash usage.

zirror · 2026-02-25T05:30:16 1771997416

What kind of hardware do you run it on?

kristianpaul · 2026-02-25T04:07:50 1771992470

Same here

lambda · 2026-02-25T02:43:04 1771987384

Qwen3 Coder Next in llama.cpp on my own machine. I'm an AI hater, but I need to experiment with it occasionally, I'm not going to pay someone rent for something they trained on my own GitHub, Stack overflow, and Reddit posts.

beacon294 · 2026-02-25T01:47:12 1771984032

FWIW the lockout probably wasn't related... maybe the content you were working on or your context window management somehow triggered something?

rahimnathwani · 2026-02-25T03:09:59 1771988999

You could try minimax 2.5 via openrouter.

ursuscamp · 2026-02-25T03:21:41 1771989701

MiniMax has an incredibly affordable coding plan for $10/month. It has a rolling five hour limit of 100 prompts. 100 prompts doesn't sound like much, but in typical AI company accounting fashion, 1 prompt is not really 1 prompt. I have yet to come even close to hitting the limit with heavy use.

ianlpaterson · 2026-02-25T04:04:13 1771992253

Coming from OpenClaw, it's pretty amazing how fast pi is, particularly paired with Qwen3 that dropped today. It's a magical time.

jasonjmcghee · 2026-02-25T05:51:03 1771998663

What dropped today? Wasn't Qwen3 Coder Next released beginning of the month?

Qwen3.5 released a couple of days ago but I'm not that RAM rich

breisa · 2026-02-25T06:35:15 1772001315

Alibaba released a whole set of new Qwen 3.5 models including a ~120B and a ~35B MoE.

kristianpaul · 2026-02-25T04:06:28 1771992388

Indeed, it seems to just works with a self hosted Qwen3 coder next.

ramoz · 2026-02-24T23:23:14 1771975394

The way you’re able to extend the harness through extension/hook architecture is really cool.

Eg some form of comprehensive planning/spec workflow is best modeled as an extension vs natively built in. And the extension still ends up feeling “native” in use

20022026 · 2026-02-25T03:36:32 1771990592

Anyone tried pi with 5.3-codex vs codex cli?

rcarmo · 2026-02-25T07:36:33 1772004993

I run it almost exclusively with codex models. Zero issues.

isagawa-co · 2026-02-25T01:13:02 1771981982

Interesting approach to planning via extensions. I took a similar direction with enforcement. A governance loop that hooks into the agent's tool calls and blocks execution until protocol is followed. Every 10 actions (configurable), the agent re-centers. No permission popups, but the agent literally can't skip steps.

Open source: https://github.com/isagawa-co/isagawa-kernel

moonlion_eth · 2026-02-25T02:27:27 1771986447

ive been using pi for about a week as daily driver and so far im happy with it. I really like the modular concept and also that its rather minimal

suralind · 2026-02-24T23:39:21 1771976361

I’ve been testing it for a few days on pretty much clean install (no customizations/extensions) and it’s ok. Not sure if I like it yet.

indigodaddy · 2026-02-25T01:43:37 1771983817

too bad I cannot star this.. https://github.com/badlogic/pi-mono/tree/main/packages/codin...

gtirloni · 2026-02-25T01:43:54 1771983834

What's a coding harness? Claude Code is a "harness" and not a TUI?

jasonjmcghee · 2026-02-25T05:56:38 1771998998

The fact that it's a tui isn't particularly relevant. It could be a gui or cli and provide very similar value.

Nearly all of its value is facilitating your interaction with the LLM, the tools it can use, and how it uses them.

ErikBjare · 2026-02-25T01:48:14 1771984094

If you run Claude Code with `-p --output-format json` it's no longer a TUI, but it's still a harness.

fred_tandemai · 2026-02-24T23:14:22 1771974862

Anyone managed to run pi in a completely sandboxed environment? It can only access the cwd and subdirectories

rcarmo · 2026-02-24T23:40:54 1771976454

I run mine inside https://github.com/rcarmo/agentbox (with https://github.com/rcarmo/webterm)

ac29 · 2026-02-25T01:07:48 1771981668

Yeah I wrote a small landlock wrapper using go-landlock to sandbox pi that works well (not public, similar projects are landrun and nono).

Note that if you sandbox to literally just the working directly, pi itself wont run since pretty much every linux application needs to be able to read from /usr and /etc

fjk · 2026-02-24T23:27:50 1771975670

I’ve been tinkering with Gondolin, a micro-vm agent sandbox.

Here’s an example config: https://github.com/earendil-works/gondolin/blob/main/host/ex...

monkey26 · 2026-02-24T23:42:23 1771976543

I do this with an extension. I run all bash tools with bwrap and ACLs for the write and edit tools. Serves my purposes. Opens up access to other required directories, at least for git and rust.

I think I published it. Check the pi package page.

rglover · 2026-02-25T02:33:59 1771986839

Excited to give this a try, looks really well done.

lukasb · 2026-02-24T23:35:48 1771976148

But I can't use my Codex plan with it, right? I have to use an API key?

mccoyb · 2026-02-24T23:37:08 1771976228

You can use your Codex plan with it. OpenAI endorsed it several weeks ago, as far as I remember. That could change, however, but now seems safe.

ac29 · 2026-02-25T01:15:29 1771982129

You can use your Claude or Gemini plan with it too for now, though Anthropic and Google have made it clear this is a ToS violation.

theshrike79 · 2026-02-25T00:12:22 1771978342

Pi makes GPT-5.3-Codex act about on par with Claude easily.

There's something in the default Codex harness that makes it fight with both arms behind its back, maybe the sandboxing is overly paranoid or something.

With Pi I can one-shot many features faster and more accurately than with Codex-cli.

qazplm17 · 2026-02-25T02:23:56 1771986236

Pi treats you like an adult and shows whatever the fuck LLM is doing rather than actively hiding shit from the user. And just for that, once you tasted the freedom and transparency, there’s no way to go back to CC.

WXLCKNO · 2026-02-25T09:22:58 1772011378

After 2.20.0 of Claude code where they started not showing what files are read / searches are made by default .. I fucking love how easy it was to ditch Claude code for pi.

TZubiri · 2026-02-25T02:36:48 1771987008

I think OpenCode is the same.

They are all open source though so you can just find out whats going on if you want right?

mobrienv · 2026-02-25T02:30:13 1771986613

Another batteries included pi setup. Built a lightweight mobile webui to run it on termux and code on my phone.

https://github.com/mikeyobrien/rho

TacticalCoder · 2026-02-25T03:45:44 1771991144

Naming skills though...

fnord77 · 2026-02-25T03:21:49 1771989709

I mean using the captive agents is much cheaper than supplying your api key to a 3rd party agent.

TZubiri · 2026-02-25T02:35:12 1771986912

Wtf is that example gif?

The prompt shown is

"Who's your daddy and what does he do?"

Is this a joke or tech? Is the author a dev or a clown?

NamlchakKhandro · 2026-02-25T02:54:51 1771988091

No one cares about your opinions.

This coding agent certainly couldn't give a fuck.

enneff · 2026-02-25T03:47:39 1771991259

It’s a quote from the movie Kindergarten Cop.

cyanydeez · 2026-02-24T22:27:07 1771972027

The backing to OpenClaw/MoltBot whatever they're calling themselves. Why is it insecure, well, Pi tells you >No permission popups.

Anyway, even if you give your agent permission, there's no secure way to know whether what they're asking to is what they'll actually do, etc.

chriswarbo · 2026-02-25T00:57:48 1771981068

> Why is it insecure, well, Pi tells you >No permission popups.

Pi supports permission popups, but doesn't use them by default. Their example extensions show how to do it (add an event listener for `tool_call` events; to block the call put `block: true` in its result).

> there's no secure way to know whether what they're asking to is what they'll actually do

What do you mean? `tool_call` event listeners are given the parameters of the tool call; so e.g. a call to the `bash` tool will show the exact command that will execute (unless we block it, of course).

himata4113 · 2026-02-24T22:46:20 1771973180

you want to put agents in a sandbox instead such as bwrap anyways.

cermicelli · 2026-02-24T22:30:10 1771972210

Just how expensive was that domain?

jotaen · 2026-02-24T22:37:45 1771972665

README on Github says “pi.dev domain graciously donated by exe.dev” (though that doesn’t say anything about the original price of course).

schpet · 2026-02-24T22:53:43 1771973623

oh that's kind. i hope they keep the old domain up too though: https://shittycodingagent.ai/