ai-agent
Open-Source MCP servers
Production-ready and experimental MCP servers that extend AI capabilities through file access, database connections, API integrations, and other contextual services.
MCP server for interacting with Neon Management API and databases.
Containerized versions of hundreds of MCP servers 📡 🧠
While experimenting with Model Context Protocol (MCP) servers, we found that setting them up could be a bit tedious and time-consuming. To simplify the process, we created containerized versions of these servers—making it quick, easy, and secure for anyone to get started.
Your agentic CLI developer.
Sidekick is an agentic CLI-based AI tool inspired by Claude Code, Copilot, Windsurf and Cursor. It's meant to be an open source alternative to these tools, providing a similar experience but with the flexibility of using different LLM providers while keeping the agentic workflow.
A 100-line minimalist LLM framework for Agents, Task Decomposition, RAG, etc.
Agents are AI systems that can perform workflows on users' behalf with a high degree of independence. This guide, designed for product and engineering teams, explores how to build agents with frameworks for identifying promising use cases, clear patterns for designing agent logic and orchestration, and best practices to ensure agents run safely, predictably, and effectively. It provides the foundational knowledge needed to confidently start building agents. Building reliable agents means starting with strong foundations - start small, validate with real users, and grow capabilities over time.
Lightweight coding agent that runs in your terminal.
Codex CLI is built for developers who already live in the terminal and want ChatGPT‑level reasoning plus the power to actually run code, manipulate files, and iterate – all under version control. In short, it’s chat‑driven development that understands and executes your repo.
Principles for building reliable LLM applications.
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
🎭 Stateful Serverless That Runs Anywhere. The easiest way to build stateful, AI agent, collaborative, or local-first applications. Deploy to Rivet, Cloudflare, Bun, Node.js, and more.
ActorCore is a framework for building stateful, scalable, realtime backend applications. Whether you’re building multiplayer games, collaborative apps, AI agent backends, or any stateful service, ActorCore provides the tools and patterns to simplify your architecture.
Security scanner for MCP servers
MCP-Shield scans your installed MCP (Model Context Protocol) servers and detects vulnerabilities like tool poisoning attacks, exfiltration channels and cross-origin escalations.
Claude Code is Anthropic’s new agentic coding tool. A command-line companion that can read, edit, and manage your projects through natural language prompts. It integrates directly into your development environment, removing friction and helping you focus on what matters most: writing great code.
Related contents:
- Poking Around Claude Code @ Han, Not Solo.
- What makes Claude Code so damn good (and how to recreate that magic in your agent)!? @ minusx.
- Claude Code: Best practices for agentic coding @ Engineering at Anthropic.
- Agentic Coding Recommendations @ Armin Ronacher's Thoughts and Writings.
- Quand Claude Code pilote votre terminal... @ Korben :fr:.
- Turning Claude Code Into My Best Design Partner @ Between the Prompts.
- Using Claude Code to modernize a 25-year-old kernel driver @ Dmitry Brant.
- The babysitter problem @ chrisbeckman.dev.
- Claude Skills are awesome, maybe a bigger deal than MCP @ Simon Willison’s Weblog.
Crafting Effective Prompts for Agentic AI Systems: Patterns and Practices.
🧠 Curated collection of system prompts for top AI tools. Perfect for AI agent builders and prompt engineers. Including: ChatGPT, Claude, Perplexity, Manus, Claude-Code, Loveable, v0, Grok, same new, windsurf, notion, and MetaAI.
An open protocol enabling communication and interoperability between opaque agentic applications.
One of the biggest challenges in enterprise AI adoption is getting agents built on different frameworks and vendors to work together. That’s why we created an open Agent2Agent (A2A) protocol, a collaborative way to help agents across different ecosystems communicate with each other.
The first and the best multi-agent framework. Finding the Scaling Law of Agents.
Building Multi-Agent Systems for Task Automation.
🐫 CAMEL is an open-source community dedicated to finding the scaling laws of agents. We believe that studying these agents on a large scale offers valuable insights into their behaviors, capabilities, and potential risks. To facilitate research in this field, we implement and support various types of agents, tasks, prompts, models, and simulated environments.
CAMEL emerges as the earliest LLM-based multi-agent framework, and is now a generic framework to build and use LLM-based agents for real-world task solving.
MCP server for your browser.
Browser MCP is a Model Context Protocol (MCP) server that allows AI applications to control your browser.
If you want to automate actions on a website, like repeatedly fill out a form, you normally can't do it with AI apps like Cursor or Claude because they don't have access to a web browser. With Browser MCP, you can connect AI apps to your browser so they can automate tasks on your behalf.
Instantly create a Remote MCP server for any GitHub project.
GitMCP is a free, open-source service that seamlessly transforms any GitHub project into a remote Model Context Protocol (MCP) endpoint, enabling AI assistants to access and understand the project's documentation effortlessly.
Specialized MCP servers that bring AWS best practices directly to your development workflow.
A Ruby Implementation of the Model Context Protocol.
Connect AI models to your Ruby applications with ease. No complex protocols, no integration headaches, no compatibility issues – just beautiful, expressive Ruby code.
An SDK for working with LLMs and AI Agents from Apache Airflow, based on Pydantic AI.
It allows users to call LLMs and orchestrate agent calls directly within their Airflow pipelines using decorator-based tasks. The SDK leverages the familiar Airflow @task syntax with extensions like @task.llm, @task.llm_branch, and @task.agent.
Transform AI Prototypes into Enterprise-Grade Products.
Langtrace is an Open Source Observability and Evaluations Platform for AI Agents.
PostgreSQL Model Context Protocol (PG-MCP) Server.
A Model Context Protocol (MCP) server for PostgreSQL databases with enhanced capabilities for AI agents.
Playwright Tools for MCP.
A Model Context Protocol (MCP) server that provides browser automation capabilities using Playwright. This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.
Balance agent control with agency. Build resilient language agents as graphs.
Gain control with LangGraph to design agents that reliably handle complex tasks. Build and scale agentic applications with LangGraph Platform.
LangGraph (used by Replit, Uber, LinkedIn, GitLab and more) is a low-level orchestration framework for building controllable agents. While LangChain provides integrations and composable components to streamline LLM application development, the LangGraph library enables agent orchestration, offering customizable architectures, long-term memory, and human-in-the-loop to reliably handle complex tasks.
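To make the "agents as graphs" idea concrete, here is a minimal sketch in plain Python. It is not LangGraph's API: `TinyGraph`, `add_node`, and `run` are hypothetical names invented for illustration. It only shows the general pattern of nodes that transform shared state plus routing functions that pick the next node, with a step limit as a simple guard against runaway cycles.

```python
from typing import Any, Callable, Dict, Optional

State = Dict[str, Any]
Node = Callable[[State], State]            # a node returns an updated copy of the state
Router = Callable[[State], Optional[str]]  # a router names the next node, or None to stop


class TinyGraph:
    """Hypothetical toy graph runner; not the LangGraph library."""

    def __init__(self) -> None:
        self.nodes: Dict[str, Node] = {}
        self.routers: Dict[str, Router] = {}

    def add_node(self, name: str, fn: Node, router: Router) -> None:
        self.nodes[name] = fn
        self.routers[name] = router

    def run(self, start: str, state: State, max_steps: int = 20) -> State:
        current: Optional[str] = start
        steps = 0
        while current is not None:
            if steps >= max_steps:
                raise RuntimeError("step limit reached; possible cycle")
            state = self.nodes[current](state)       # run the node
            current = self.routers[current](state)   # decide where to go next
            steps += 1
        return state


def draft(state: State) -> State:
    # Stand-in for an LLM call that produces a new attempt.
    return {**state, "attempts": state["attempts"] + 1}


def review(state: State) -> State:
    # Stand-in for a check (or a human reviewer) that approves the second attempt.
    return {**state, "approved": state["attempts"] >= 2}


graph = TinyGraph()
graph.add_node("draft", draft, lambda s: "review")
graph.add_node("review", review, lambda s: None if s["approved"] else "draft")
final = graph.run("draft", {"attempts": 0, "approved": False})
```

The cycle between `draft` and `review` is the part a linear chain cannot express; a real framework would add persistence and interruption points on top of this loop.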
This is an MCP server for Claude that gives it terminal control, file system search, and diff-based file editing capabilities.
This server allows the Claude desktop app to execute long-running terminal commands on your computer and manage processes through the Model Context Protocol (MCP). It is built on top of the MCP Filesystem Server to provide additional search-and-replace file editing capabilities.
Fleur is the app store for Claude. The easiest way to discover and install MCPs.
Fleur is a desktop application that serves as an app marketplace for MCPs. It allows you to discover, install, and manage apps that extend the functionality of Claude Desktop and Cursor.
All without having to use a command line. Fleur is made with non-technical users in mind, but is open-source and extensible so developers can make it their own.
MCP server for fetching web page content using a Playwright headless browser.
Coding assistant MCP for Claude Desktop.
Make Claude Desktop a pair programming assistant by installing codemcp. With it, you can directly ask Claude to implement features, fix bugs and do refactors on a codebase on your computer; Claude will directly edit files and run tests. Say goodbye to copying code in and out of Claude's chat window!
Model Context Protocol (MCP) is a new, standardized protocol for managing context between large language models (LLMs) and external systems. In this repository, we provide an installer as well as an MCP Server for Cloudflare's API.
This lets you use Claude Desktop, or any MCP Client, to accomplish things on your Cloudflare account using natural language.
Bringing Agentic AI to cloud native.
An open-source framework for DevOps and platform engineers to run AI agents in Kubernetes, automating complex operations and troubleshooting tasks.
Evolving agents is a production-grade environment for orchestrating, evolving, and managing AI agents.
A production-grade framework for creating, managing, and evolving AI agents with intelligent agent-to-agent communication. The framework enables you to build collaborative agent ecosystems that can semantically understand requirements, evolve based on past experiences, and communicate effectively to solve complex tasks.
An in-depth book and reference on building agentic systems like Claude Code. A deep-dive guide into architecture patterns for building responsive, reliable AI coding agents.
Many people have asked how Claude Code works under the hood. Usually, people see the prompts, but they don't see how it all comes together. This is that book: all of the systems, tools, and commands that go into building one of these.
A practical deep dive and code review into how to build a self-driving coding agent, execution engine, tools and commands. Rather than the prompts and AI engineering, this is the systems and design decisions that go into making agents that are real-time, self-corrective, and useful for productive work.
The Platform for Building Stateful Agents. Build agents with infinite context and human-like memory, that can learn from data and improve with experience. Letta (formerly MemGPT) is a framework for creating LLM services with memory.
👾 Letta is an open source framework for building stateful LLM applications. You can use Letta to build stateful agents with advanced reasoning capabilities and transparent long-term memory. The Letta framework is white box and model-agnostic.
Finding the Scaling Laws of Agents. The first and the best multi-agent framework.
🐫 CAMEL is an open-source community dedicated to finding the scaling laws of agents. We believe that studying these agents on a large scale offers valuable insights into their behaviors, capabilities, and potential risks. To facilitate research in this field, we implement and support various types of agents, tasks, prompts, models, and simulated environments.
The framework enables multi-agent systems to continuously evolve by generating data and interacting with environments. This evolution can be driven by reinforcement learning with verifiable rewards or supervised learning.
Go beyond nascent AI demos. The intelligent AI-native gateway for prompts and agentic apps.
Effortlessly build AI apps that can answer questions and help users get things done. Arch is the AI-native proxy that handles the pesky heavy-lifting so that you can move faster in building agentic apps, prevent harmful outcomes, and rapidly incorporate the latest models.
AI-native (edge and LLM) proxy for agents. Move faster by letting Arch handle the pesky heavy lifting in building agentic apps -- ⚡️ query understanding and routing, seamless integration of prompts with tools, and unified access and observability of LLMs. Built by the contributors of Envoy proxy.
The agents.json Specification is an open specification that formally describes contracts for API and agent interactions, built on top of the OpenAPI standard.
The TypeScript framework for agents & workflows with react-like components. Lightning fast dev loop. Easy to learn. Easy to extend.
Build complex AI applications with React-like components. GenSX is a simple TypeScript framework for building agents and workflows with reusable React-like components. GenSX takes a lot of inspiration from React, but the programming model is very different - it’s a Node.js framework designed for data flow.
OpenAI just published Deep Research, a very nice assistant that can perform deep searches on the web to answer user questions.
However, their agent has a huge downside: it's not open. So we've started a 24-hour rush to replicate and open-source it. Our resulting open-Deep-Research agent took the #1 rank of any open submission on the GAIA leaderboard! ✨
Enable AI to control your browser. Make websites accessible for AI agents.
We make websites accessible for AI agents by extracting all interactive elements, so agents can focus on what makes their beer taste better.
LangGraph is a library for building stateful, multi-actor applications with LLMs, used to create agent and multi-agent workflows.
Build and ship AI products on a single, collaborative platform
Take AI products from early-stage ideas to production-grade features with tooling for experimentation, evaluation, deployment, monitoring, and collaboration.
The open-source visual AI programming environment and TypeScript library.
Rivet, the IDE for creating complex AI agents and prompt chaining, and embedding it in your application.
Enable generative AI applications to automate multistep tasks by seamlessly connecting with company systems, APIs, and data sources.
Your on-machine AI agent, automating engineering tasks seamlessly. An open-source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM.
The Production-Ready Open Source AI Framework.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Explore Agent Recipes
Explore common agent recipes with ready to copy code to improve your LLM applications.
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API. - openai/openai-realtime-agents
LLM Agent and Evaluation Framework for Autonomous Penetration Testing.
We introduce HackSynth, a novel Large Language Model (LLM)-based agent capable of autonomous penetration testing. HackSynth's dual-module architecture includes a Planner and a Summarizer, which enable it to generate commands and process feedback iteratively. To benchmark HackSynth, we propose two new Capture The Flag (CTF)-based benchmark sets utilizing the popular platforms PicoCTF and OverTheWire. These benchmarks include two hundred challenges across diverse domains and difficulties, providing a standardized framework for evaluating LLM-based penetration testing agents.
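The iterative Planner/Summarizer loop described above can be sketched roughly as follows. This is not HackSynth's code: `run_agent`, the stub functions, and the `FLAG{` stop condition are all hypothetical, and the stubs stand in for LLM calls and a sandboxed shell. It only illustrates how command generation and feedback processing alternate.

```python
from typing import Callable


def run_agent(plan: Callable[[str], str],
              execute: Callable[[str], str],
              summarize: Callable[[str, str, str], str],
              max_steps: int = 5) -> str:
    """Alternate planning and summarizing for at most max_steps rounds."""
    summary = ""  # running record of what has been learned so far
    for _ in range(max_steps):
        command = plan(summary)                        # Planner: propose the next command
        output = execute(command)                      # run it (sandboxed in a real setup)
        summary = summarize(summary, command, output)  # Summarizer: fold the feedback in
        if "FLAG{" in output:                          # hypothetical stop condition
            break
    return summary


# Toy stand-ins so the loop can run without a model or a real shell.
def toy_plan(summary: str) -> str:
    return "cat flag.txt" if "flag.txt" in summary else "ls"


def toy_execute(command: str) -> str:
    fake_shell = {"ls": "flag.txt notes.md", "cat flag.txt": "FLAG{demo}"}
    return fake_shell.get(command, "")


def toy_summarize(summary: str, command: str, output: str) -> str:
    return (summary + f"\n$ {command}\n{output}").strip()


result = run_agent(toy_plan, toy_execute, toy_summarize)
```

Keeping the summary as the only state passed back to the planner mirrors the dual-module split: the planner never sees raw history, only what the summarizer chose to retain.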
🤗 smolagents: a barebones library for agents. Agents write Python code to call tools and orchestrate other agents.
A web browser for AI agents & applications. An AI web browsing framework focused on simplicity and extensibility.
Low code LLM Apps Builder. Build LLM Apps Easily.
Open source low-code tool for developers to build customized LLM orchestration flow & AI agents.
AmphiPod is a lightweight, HTTP-based Model Context Protocol (MCP) server implementation written in Go. It simplifies the integration of AI tools by providing an HTTP interface to the MCP specification and executing tools through WebAssembly plugins.
Agent Framework / shim to use Pydantic with LLMs.
PydanticAI is a Python Agent Framework designed to make it less painful to build production grade applications with Generative AI.
🌎💪 BrowserGym, a Gym environment for web task automation.
BrowserGym is meant to provide an open, easy-to-use and extensible framework to accelerate the field of web agent research. It is not meant to be a consumer product. Use with caution!
Open-source framework for building Web Agents. Build and deploy Web Agents in a few lines of code.
Open-source SQL AI Agent. Text2SQL made Easy!
Wren AI is an open-source SQL AI Agent that empowers data, product, and business teams to access insights through AI chat, with a well-designed, intuitive built-in UI and UX that integrates seamlessly with tools like Excel and Google Sheets.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
AI coding engine for complex tasks. Build complex software with AI.
An open source, terminal-based AI coding engine. Designed for real-world tasks that span many files.