Non-stop manager
The agent works while you're away; decisions wait in a tray, not in the chat.
/ Przemysław Tarkowski
Senior engineer, 11 years shipping real software — a 100M+ download game, solo-built AWS infrastructure. Now aimed at AI agents, with a working system to show, not slides.
TarkOS · open source — coming soon
layer
Claude Code harness
agent OS
audito (agent OS)
01
Decisions (Decision-OS)
02
Memory / capture
03
Non-stop autonomy
/ What the system does
Runs on its own
Give it a goal and it works across sessions while I'm away — inventing the next task, not waiting for prompts.
Decides like me — asks only when it must
It owns the reversible calls and batches the irreversible ones into a decision tray for my sign-off.
Measures its own work
Every change is regression-gated and retrieval is scored — the agent grades itself before I ever see it.
Gets better every session
It captures my corrections into explicit rules, so the same steer is never needed twice.
Note The code lives inside my private work harness (open-sourcing soon). I'm happy to walk you through it live — architecture, decisions, the eval loop.
/ Case study · the system in depth
The problem
An agent left to work alone breaks the same way every time: it makes decisions it can't justify, does irreversible things without asking, and loses the thread the moment a context window fills. The gap between an impressive demo and a system you'd actually leave running is engineering, not prompting.
Orchestration & continuity
One orchestrator spawns and supervises worker agents across parallel workstreams, monitoring by work-signals rather than guesswork. When a worker approaches its context ceiling it relay-hops — closing cleanly and handing a successor the full thread (warm resume + lossless history load) — so long-running work survives across sessions, attended in the day or unattended overnight.
Decision-OS — the trust layer
Every decision is classified: reversible ones the agent makes itself, irreversible ones it batches into a tray for my sign-off (default-deny). An append-only, gap-detecting audit trail records each call with its rationale — tamper-evident, the way a regulated environment needs it. This is what turns 'runs by itself' into 'runs by itself, safely'.
The result — operated, not demoed
/ Where this comes from
The EU research I worked on was literally about autonomous orchestration of cloud/edge resources. Today I orchestrate autonomous agents. Same instinct, new substrate.
02 / Capabilities
The agent works while you're away; decisions wait in a tray, not in the chat.
A queue of decisions with consequences and a recommendation. One click instead of meta-work.
Knowledge from every session lands in the right file with provenance. It's git-blame for knowledge.
03 / Skills layer
Where I'm strong today: the operating layer most agent projects lack.
What I'm building on top of that. Today AI writes the implementation under my architecture; I verify by outcomes.
05 / Connect
If you're curious how all of this works together, or you want to talk about agents, devtools, or applied AI, get in touch. Happy to walk you through it.