shipfeed · AI news, curated daily

§ tools · storyline

Show HN: Statewright – Visual state machines that make AI agents reliable

Statewright launches a visual state machine framework that constrains AI agent tool access and workflow transitions to improve reliability on smaller 13–20B parameter models.

May 12 · primary fetch · 1 source · updated May 12

Agentic problem solving in its current state is very brittle. I fell in love with it, but it creates as many problems as it solves.

I'm Ben Cochran. I spent 20+ years in the trenches of full-stack engineering, DevOps, high-performance computing, and ML, with stints at NVIDIA, AMD, and various other organizations, most recently as a Distinguished Engineer.

For agents to work reliably, you need either massive parameter counts or massive context windows to keep the solution spaces workable. Most people are brute-forcing reliability with bigger models and longer prompts.

What if I made the problem smaller instead of making the model bigger?

I took a different approach: I used smaller models, in the 13-20B parameter range, and set them to solving real SWE-bench problems.

I constrained the tool and solution spaces using formal state machines. Each state in the machine defines which tools the model can access, how many iterations it gets, and which transitions are valid. A planning state gets read-only tools. An implementation state gets edit tools (scoped to prevent mega edits) and write-friendly bash tools. The testing state gets bash, but only for testing commands. The model…
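To make the idea concrete, here is a minimal sketch of a per-state tool gate in Python. It is not Statewright's actual API or schema: the `State` and `Workflow` classes, the tool names (`read_file`, `grep`, `edit_file`, `bash`, `run_tests`), and the iteration limits are all hypothetical, chosen only to mirror the planning, implementation, and testing states described above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class State:
    """One workflow state (hypothetical structure, not Statewright's schema)."""
    name: str
    allowed_tools: frozenset   # tools the model may call while in this state
    max_iterations: int        # tool-call budget for this state
    transitions: frozenset     # names of states reachable from this one


class Workflow:
    """Tracks the current state and rejects anything the state does not allow."""

    def __init__(self, states, start):
        self.states = {s.name: s for s in states}
        self.current = self.states[start]
        self.iterations = 0

    def call_tool(self, tool):
        """Authorize one tool call, or raise if the state forbids it."""
        state = self.current
        if tool not in state.allowed_tools:
            raise PermissionError(f"{tool!r} is not available in state {state.name!r}")
        if self.iterations >= state.max_iterations:
            raise RuntimeError(f"iteration budget exhausted in state {state.name!r}")
        self.iterations += 1

    def transition(self, target):
        """Move to another state only if the current state lists it as valid."""
        if target not in self.current.transitions:
            raise ValueError(f"no transition from {self.current.name!r} to {target!r}")
        self.current = self.states[target]
        self.iterations = 0  # each state starts with a fresh budget


def build_workflow():
    # Tool names and limits below are illustrative assumptions.
    return Workflow(
        [
            # Planning: read-only tools.
            State("planning", frozenset({"read_file", "grep"}), 2,
                  frozenset({"implementation"})),
            # Implementation: edit tools plus write-friendly bash.
            State("implementation", frozenset({"read_file", "edit_file", "bash"}), 5,
                  frozenset({"testing"})),
            # Testing: bash restricted to test commands, modeled here as one tool.
            State("testing", frozenset({"run_tests"}), 3,
                  frozenset({"implementation"})),
        ],
        start="planning",
    )
```

In this sketch, an agent loop would call `call_tool` before dispatching each tool request from the model and `transition` whenever the model asks to move on, so a disallowed edit during planning fails fast with `PermissionError` instead of reaching the filesystem.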

read full article on github.com
§ sources · 1 publication · timeline below
  1. github.com · Show HN: Statewright – Visual state machines that make AI agents reliable · primary