ai-agent
Agentic AI Infrastructure for magnifying HUMAN capabilities.
PAI is a Personalized AI Platform designed to magnify your capabilities.
It's designed for humans most of all, but can be used by teams, companies, or Federations of Planets desiring to be better versions of themselves.
The scale of the entity doesn't matter: It's a system for understanding, articulating, and realizing its principal's goals using a full-featured Agentic AI Platform.
Related contents:
Matchlock secures AI agent workloads with a Linux-based sandbox.
Matchlock is a CLI tool for running AI agents in ephemeral microVMs - with network allowlisting, secret injection via MITM proxy, and VM-level isolation. Your secrets never enter the VM.
Repository automation, running the coding agents you know and love, with strong guardrails in GitHub Actions.
Use GitHub Copilot, Claude by Anthropic or OpenAI Codex for event-triggered, recurring and scheduled jobs to improve, document and analyze your repository.
Easy Linux virtual machine on MacOS to sandbox LLM agents.
Vibe is a quick, zero-configuration way to spin up a Linux virtual machine on Mac to sandbox LLM agents.
Related contents:
Autonomous AI Agents for Infrastructure. Claude Code for infrastructure. Debug, act, and audit everything Fluid does on your infrastructure.
Fluid is a terminal agent that do work on production infrastructure like VMs/K8s cluster/etc. by making sandbox clones of the infrastructure for AI agents to work on, allowing the agents to run commands, test connections, edit files, and then generate Infra-as-code like an Ansible Playbook to be applied on production.
Pi is a minimal terminal coding harness. Adapt pi to your workflows, not the other way around, without having to fork and modify pi internals. Extend it with TypeScript Extensions, Skills, Prompt Templates, and Themes. Put your extensions, skills, prompt templates, and themes in Pi Packages and share them with others via npm or git.
Related contents:
This is a public, sanitized version of my Abiverse repository for people to use and run their own Abes.
Volition (fondly referred to me as the Abiverse) is a self-hosted, multi-agent system designed to run persistent, self-replicating autonomous LLM-based agents ("Abes") inside isolated Linux containers. These are not chatbots, but I have aimed for them to be long-lived system processes with memory, tools, and constrained authority over real machines. These are supposed to be the 'semi-intelligent layer' between you and your homelab.
Volition has been running continuously in my personal infrastructure with multiple agents for more than a month now. However, this public release is new and has not yet been exercised end-to-end by external users. Expect rough edges in: setup and documentation flow, first-run ergonomics, and non-default configurations Core architecture and invariants are stable, but installation paths will be refined over the next few days as this release is tested in the open.
Related contents:
Autonomous multi-agent coding framework that plans, builds, and validates software for you.
Related contents:
Multi-platform SDK for integrating GitHub Copilot Agent into apps and services.
Embed Copilot's agentic workflows in your application—now available in Technical preview as a programmable SDK for Python, TypeScript, Go, and .NET.
The GitHub Copilot SDK exposes the same engine behind Copilot CLI: a production-tested agent runtime you can invoke programmatically. No need to build your own orchestration—you define agent behavior, Copilot handles planning, tool invocation, file edits, and more.
Related contents:
rtfmbro provides always-up-to-date, version-specific package documentation as context for coding agents. An alternative to context7
The common language for platforms, agents and businesses.
UCP defines building blocks for agentic commerce—from discovering and buying to post purchase experiences—allowing the ecosystem to interoperate through one standard, without custom builds.
Related contents:
An open-source alternative to Claude Cowork, powered by OpenCode.
OpenWork is an extensible, open-source “Claude Work” style system for knowledge workers. It’s a native desktop app that runs OpenCode under the hood, but presents it as a clean, guided workflow.
MiroThinker is an open source deep research agent optimized for research and prediction. It achieves a 60.2% Avg@8 score on the challenging GAIA benchmark.
Security, visibility, and authorization for AI agents
Leash wraps AI coding agents in containers and monitors their activity. You define policies in Cedar; Leash enforces them instantly.
Authorize and monitor your AI agents with policy enforcement, sandboxed execution, and real-time observability—ensuring they operate safely within your defined boundaries.
Agent harness framework for building, running, and verifying LLM workflows.
Gambit helps you build reliable LLM workflows by composing small, typed “decks” with clear inputs/outputs and guardrails. Run decks locally, stream traces, and debug with a built-in UI.
The Best Agent Harness. Meet Sisyphus: The Batteries-Included Agent that codes like you.
A simple, open format for giving agents new capabilities and expertise. Agent Skills are folders of instructions, scripts, and resources that agents can discover and use to do things more accurately and efficiently.
Related contents:
Semantic search for agents.
A calm, CLI-native way to semantically grep everything, like code, images, pdfs and more.
Related contents:
Ralph is an autonomous AI agent loop that runs repeatedly until all PRD items are complete.
Ralph is an autonomous AI agent loop that runs Amp repeatedly until all PRD items are complete. Each iteration is a fresh Amp instance with clean context. Memory persists via git history, progress.txt, and prd.json.
Related contents:
Move at Kilo Speed. Build, ship, and iterate faster with the most popular open source coding agent.
Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent. #1 on OpenRouter. 750k+ Kilo Coders. 6.1 trillion tokens/month.
Related contents:
A dead-simple unix tool for lightweight open-source local agents.
Orla is a unix tool for running lightweight open-source agents. It is easy to add to a script, use with pipes, or build things on top of.
Related contents:
AI Agent for Troubleshooting Cloud-Native Environments. Your 24/7 On-Call AI Agent - Solve Alerts Faster with Automatic Correlations, Investigations, and More.
HolmesGPT is an AI agent for investigating problems in your cloud, finding the root cause, and suggesting remediations. It has dozens of built-in integrations for cloud providers, observability tools, and on-call systems.
Related contents:
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods.
A structural code search engine for AI agents. Provides sub-500ms file ranking across massive codebases without embeddings, vector databases, or external dependencies.
Mantic is an infrastructure layer designed to remove unnecessary context retrieval overhead for AI agents. It infers intent from file structure and metadata rather than brute-force reading content, enabling retrieval speeds faster than human reaction time.
A curated catalogue of agentic AI patterns — real‑world tricks, workflows, and mini‑architectures that help autonomous or semi‑autonomous AI agents get useful work done in production.
Related contents:
Select context for coding agents directly from your website.
How? Point at any element and press ⌘C (Mac) or Ctrl+C (Windows/Linux) to copy the file name, React component, and HTML source code.
It makes tools like Cursor, Claude Code, Copilot run up to 3× faster and more accurate.
Multi-agent orchestrator for Claude Code. Track work with convoys; sling to agents.
Related contents:
- Welcome to Gas Town @ Steve Yegge's Medium.
- Gas Town Decoded @ Andrew Lilley Brinker.
- How to think about Gas Town @ Steve Klabnik.
- Agent Psychosis: Are We Going Insane? @ Armin Ronacher's Thoughts and Writings.
- Gas Town’s Agent Patterns, Design Bottlenecks, and Vibecoding at Scale @ Maggie Appleton.
- Move Over Gas Town, Claude Has First-Party Agent Orchestration @ Andrew Lilley Brinker.
Worktrunk is a CLI for Git worktree management, designed for parallel AI agent workflows.
Worktrunk's three core commands make worktrees as easy as branches. Plus, Worktrunk has a bunch of quality-of-life features to simplify working with many parallel changes, including hooks to automate local workflows.
AI Penetration Testing.
PentestAgent is an AI agent framework for black-box security testing, supporting bug bounty, red-team, and penetration testing workflows.
Word GPT Plus is a word add-in which integrates the AI&Agent into Microsoft Word.
Word GPT Plus seamlessly integrates AI and Agent directly into Microsoft Word, allowing you to generate, translate, summarize, and polish text directly within your documents. Enhance your writing workflow without leaving your Word environment.
Browser automation without the drama. Browser automation for AI agents and humans.
Vibium is browser automation infrastructure built for AI agents. A single binary handles browser lifecycle, WebDriver BiDi protocol, and exposes an MCP server — so Claude Code (or any MCP client) can drive a browser with zero setup. Works great for AI agents, test automation, and anything else that needs a browser.
AI Agents on a Private GenAI Stack.
♾️ Helix is a private GenAI stack for building AI agents with declarative pipelines, knowledge (RAG), API bindings, and first-class testing.
Related contents:
Long-term Memory for AI Agents.
Local persistent memory store for LLM applications including claude desktop, github copilot, codex, antigravity, etc.
On December 17, 2025, an AI agent (Claude) ran git checkout -- on multiple files containing hours of uncommitted work from another agent (Codex). This destroyed the work instantly and silently. The files were eventually recovered from a dangling Git object, but this incident revealed a critical gap: AI agents can execute destructive commands without understanding the consequences.
The AGENTS.md file already forbade such commands, but instructions alone don't prevent execution. This hook provides mechanical enforcement - the command is blocked before it can run.
Agent Skills Marketplace - Claude, Codex & ChatGPT Skills.
Related contents:
Kimi CLI is a new CLI agent that can help you with your software development tasks and terminal operations.
Layrr is the visual editor for real code. Design visually, edit any stack, own everything. A browser coding agent interface for selecting elements and sending instructions directly to Claude Code.
Model Context Protocol Server for Swift.
A Swift implementation of the MCP (Model Context Protocol) for JSON-RPC over various transports.
Agent Lightning is the absolute trainer to light up AI agents.
agent-lightning is an open-source framework for training and optimizing AI agents—enabling reinforcement learning (RL), automatic prompt optimization, supervised fine-tuning, and more—without requiring substantial changes to existing agent code. It works with virtually any agent framework (e.g., LangChain, OpenAI Agents SDK, and AutoGen) and provides modular components to collect agent execution data and iteratively improve agent performance via a decoupled RL training loop.
The Active Reliability Layer for AI Agents. Catch failures, teach fixes, and automate reliability.
Steer is an open-source Python library that intercepts agent failures (hallucinations, bad JSON, PII leaks) and allows you to inject fixes via a local dashboard without changing your code.
Related contents:
A local Apple Documentation crawler and MCP server. Written in Swift.
A Swift-based tool to crawl, index, and serve Apple's developer documentation to AI agents via the Model Context Protocol (MCP).
Related contents:
This interactive, certified course will guide you through building and deploying your own AI Agents.
A desktop app for isolated, parallel agentic development.
mux (Coding Agent Multiplexer) is a cross-platform desktop application for AI-assisted development with isolated workspace management.
The toolkit for AI devtools context engineering. Build with codebase mapping, symbol extraction, and many kinds of code search.
kit is a production-ready toolkit for codebase mapping, symbol extraction, code search, and building LLM-powered developer tools, agents, and workflows.
kit shines for getting precise, accurate, and relevant context to LLMs. Use kit to build code reviewers, code generators and graphs, even full-fledged coding assistants: all enriched with the right code context.
A lightweight spec‑driven framework.
OpenSpec aligns humans and AI coding assistants with spec-driven development so you agree on what to build before any code is written. No API keys required.
AI Browser Automation. Automate browser based workflows with AI.
Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows on a large number of websites, replacing brittle or unreliable automation solutions.
Emdash is an orchestration layer for running multiple coding agents in parallel in isolated Git worktrees.
An orchestration layer for running multiple coding agents in parallel, each isolated in its own Git worktree. Run several agent instances concurrently to tackle independent subtasks or experiments.
Emdash lets you develop and test multiple features with multiple agents in parallel. It’s provider-agnostic (we support 10+ CLIs, such as Claude Code and Codex) and runs each agent in its own Git worktree to keep changes clean; when the environment matters, you can run a PR in its own Docker container. Hand off Linear, GitHub, or Jira tickets to an agent, review diffs side-by-side, and keep everything local.
AI Workflows That Agents Build & Run. Open Language for Agents to Build AI Workflows.
The open standard devtool for repeatable AI workflows. Write business logic, not API calls.
Universal AI coding agent manager for 10x engineers.
cmux lets you run Claude Code, Codex CLI, Amp, Gemini CLI, Cursor CLI, Opencode, and other coding agent CLIs in parallel across multiple tasks
The Web Access Layer for AI Agents.
Connect Your Agent to the Web. Powering the Internet of Agents with fast, secure and reliable web access APIs.
Related contents:
The Intelligence Engine. Turn Natural Language Into Action. The Intelligence Layer for AI agents. Connect your models, tools, and data to create agentic apps that can think, act and talk to you.
An all-in-one toolkit to build agentic applications that turn natural language into real-world actions.
Want to build AI-native apps that respond to natural language? Dexto is the intelligence layer that makes it easy to build agentic apps like AI assistants and copilots. Describe your agents, plug in your tools, and watch them respond to plain English.
AI Vision for macOS. Fast Screen Capture & VQA.
Peekaboo is a macOS CLI & optional MCP server that enables AI agents to capture screenshots of applications, or the entire system, with optional visual question answering through local or remote AI models.
Related contents:
MCP server helping models to understand your Vite/Nuxt app better.
Related contents:
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!
MCP (Model Context Protocol) is the emerging standard to define how AI agents communicate with applications. Using FastAPI-MCP, creating a secured MCP server to your application takes only 3 lines of code.
Related contents:
AI-Powered n8n Workflow Automation.
Model Context Protocol server enabling AI assistants to build accurate n8n workflows. Access 525+ n8n nodes with 99% property coverage. Reduce configuration errors and speed up workflow creation.
Related contents:
Grab any element on in your app and give it to Cursor, Claude Code, etc.
By default coding agents cannot access elements on your page. React Grab fixes this - just point and click to provide context!
An autonomous agent for deep financial research.
Dexter is an autonomous financial research agent that thinks, plans, and learns as it works. It performs analysis using task planning, self-reflection, and real-time market data. Think Claude Code, but built specifically for financial research.
Building infrastructure for the Internet of Agents.
The AGNTCY project provides the complete infrastructure stack for agent collaboration—discovery, identity, messaging, and observability that works across any vendor or framework. It is the foundational layer that lets specialized agents find each other, verify capabilities, and work together on complex problems.
The MCP server for Azure DevOps, bringing the power of Azure DevOps directly to your agents.
Related contents: