llm
The Elegant Desktop Companion for Claude Code.
A powerful GUI app and Toolkit for Claude Code - Create custom agents, manage interactive Claude Code sessions, run secure background agents, and more.
Related contents:
Your Ultimate Virtual Assistant.
Experience the future of AI assistance with advanced customization, flexible model support, and seamless integration. Newelle brings you the power of AI with unmatched flexibility and control.
Related contents:
Towards Natural‑Language Programming for GitHub Action.
Write agentic workflows in natural language markdown, and run them in GitHub Actions.
A Database for Your Mind.
One shared context across all your apps. Your transcripts inform your notes. Your notes guide your AI. Everything connects through a single folder of plain text and SQLite.
Epicenter is an ecosystem of open-source, local-first apps. Our eventual goal is to store all of your data—notes, transcripts, chat histories—in a single folder of plain text and SQLite. Every tool we build shares this memory. It's open, tweakable, and yours. Grep it, open it in Obsidian, host it wherever you like. The choice is yours.
The Easiest Way to Build Desktop Agents. Bytebot is a turn-key container for developers building desktop agents.
Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containerized Linux desktop environment.
Related contents:
RamaLama strives to make working with AI simple, straightforward, and familiar by using OCI containers.
RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.
Related contents:
a Kubernetes-native high-performance distributed LLM inference framework.
llm-d is a Kubernetes-native distributed inference serving stack, providing well-lit paths for anyone to serve large generative AI models at scale, with the fastest time-to-value and competitive performance per dollar for most models across most hardware accelerators.
Related contents:
💫 Industrial-strength Natural Language Processing (NLP) in Python.
spaCy is a library for advanced Natural Language Processing in Python and Cython. It's built on the very latest research, and was designed from day one to be used in real products.
spaCy comes with pretrained pipelines and currently supports tokenization and training for 70+ languages. It features state-of-the-art speed and neural network models for tagging, parsing, named entity recognition, text classification and more, multi-task learning with pretrained transformers like BERT, as well as a production-ready training system and easy model packaging, deployment and workflow management. spaCy is commercial open-source software, released under the MIT license.
Related contents:
The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks. Building upon the dense foundational models of the Qwen3 series, it provides a comprehensive range of text embeddings and reranking models in various sizes (0.6B, 4B, and 8B). This series inherits the exceptional multilingual capabilities, long-text understanding, and reasoning skills of its foundational model. The Qwen3 Embedding series represents significant advancements in multiple text embedding and ranking tasks, including text retrieval, code retrieval, text classification, text clustering, and bitext mining.
Related contents:
Yet Another Memory System. Content addressable storage with excellent search.
Persistent memory for LLMs and applications. Content‑addressed storage with deduplication, compression, semantic search, and full‑text indexing.
Chat UI for Coderunner.
coderunner-ui is a local‑first AI workspace that lets you:
- Chat with local or remote LLMs
- Run generated code inside a fully isolated Apple Container VM
- Browse the web and automate tasks via a built‑in headless browser (Playwright) All without sending your data to the cloud.
Related contents:
Run AI Generated Code Locally. A secure local sandbox to run LLM-generated code using Apple containers.
CodeRunner is an MCP (Model Context Protocol) server that executes AI-generated code in a sandboxed environment on your Mac using Apple's native containers.
Related contents:
Your AI workforce launchpad, in your pocket. Talk to Your AI Agents from Anywhere!
Omnara transforms your AI agents (Claude Code, Cursor, GitHub Copilot, and more) from silent workers into communicative teammates. Get real-time visibility into what your agents are doing, respond to their questions instantly, and guide them to success - all from your phone.
Cut Code Review Time & Bugs in Half. Instantly.
Supercharge your team to ship faster with the most advanced AI code reviews.
Related contents:
Chat with AI to build React apps instantly.
🔥 Clone and recreate any website as a modern React app in seconds.
POML (Prompt Orchestration Markup Language) is a novel markup language designed to bring structure, maintainability, and versatility to advanced prompt engineering for Large Language Models (LLMs). It addresses common challenges in prompt development, such as lack of structure, complex data integration, format sensitivity, and inadequate tooling. POML provides a systematic way to organize prompt components, integrate diverse data types seamlessly, and manage presentation variations, empowering developers to create more sophisticated and reliable LLM applications.
Train your AI self, amplify you, bridge the world.
an open-source prototype where you craft your own AI self—a new AI species that preserves you, delivers your context, and defends your interests.
Related contents:
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI.
Related contents:
Claude Code IDE integration for Emacs.
Claude Code IDE for Emacs provides native integration with Claude Code CLI through the Model Context Protocol (MCP). Unlike simple terminal wrappers, this package creates a bidirectional bridge between Claude and Emacs, enabling Claude to understand and leverage Emacs’ powerful features—from LSP and project management to custom Elisp functions. This transforms Claude into a true Emacs-aware AI assistant that works within your existing workflow and can interact with your entire Emacs ecosystem.
stagewise is the first frontend coding agent for existing production-grade web apps 🪄 -- Lives inside your browser 💻 -- Makes changes in local codebase 🤓 -- Compatible with all kinds of frameworks and setups 💪 .
Renderer for the harmony response format to be used with gpt-oss.
The gpt-oss models were trained on the harmony response format for defining conversation structures, generating reasoning output and structuring function calls. If you are not using gpt-oss directly but through an API or a provider like HuggingFace, Ollama, or vLLM, you will not have to be concerned about this as your inference solution will handle the formatting. If you are building your own inference solution, this guide will walk you through the prompt format. The format is designed to mimic the OpenAI Responses API, so if you have used that API before, this format should hopefully feel familiar to you. gpt-oss should not be used without using the harmony format as it will not work correctly.
Add React UI components to your AI agent.
A React package for building AI-powered applications with generative UI, where users interact through natural language.
LLM API Pricing Data.
Up-to-date pricing information for major LLM APIs including OpenAI, Anthropic, Google, and more. Compare costs across different AI models and find the best value for your use case.
Related contents:
Using an LLM to call tools in a loop is the simplest form of an agent. This architecture, however, can yield agents that are “shallow” and fail to plan and act over longer, more complex tasks. Applications like “Deep Research”, "Manus", and “Claude Code” have gotten around this limitation by implementing a combination of four things: a planning tool, sub agents, access to a file system, and a detailed prompt.
deepagents is a Python package that implements these in a general purpose way so that you can easily create a Deep Agent for your application.
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
LangExtract is a Python library that uses LLMs to extract structured information from unstructured text documents based on user-defined instructions. It processes materials such as clinical notes or reports, identifying and organizing key details while ensuring the extracted data corresponds to the source text.
Deep Research at Your Fingertipsat Your Fingertips
Meet DeerFlow, your personal Deep Research assistant. With powerful tools like search engines, web crawlers, Python and MCP services, it delivers instant insights, comprehensive reports, and even captivating podcasts.
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Understand your codebase.The Code Understanding Tool.
Sourcebot is a self-hosted tool that helps you answer questions about your code.
Build and deploy MCP agents.
Spin-up and aggregate MCP servers through a single endpoint and zero friction.
mcp-use is the easiest way to interact with mcp servers with custom agents. 🌐 MCP-Use is the open source way to connect any LLM to any MCP server and build custom MCP agents that have tool access, without using closed source or application clients.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Related contents:
The glamourous AI coding agent for your favourite terminal 💘
Your new coding bestie, now available in your favourite terminal. Your tools, your code, and your workflows, wired into your LLM of choice.
Related contents:
An open-source AI agent that brings the power of Grok directly into your terminal. A conversational AI CLI tool powered by Grok with intelligent text editor capabilities and tool usage.
Convert email newsletters into RSS feeds.
LetterFeed is a self-hosted application that transforms your email newsletters into RSS feeds.
It periodically scans your email inbox via IMAP for new emails from the senders you've configured. When it finds a new email, it processes it, and adds it as a new entry to the corresponding newsletter's RSS feed.
Enter a YouTube URL to get an AI-generated summary.
⚡ A lightweight, self-hosted YouTube video summarizer with Gemini AI
Rapidly build AI apps in Python.
Create web apps without the complexity of frontend development. Used at Google for rapid AI app development.
Mesop is a Python-based UI framework that allows you to rapidly build web apps like demos and internal apps.
Agents.md Guide for OpenAI Codex
Learn how to use Agents.md files to enhance OpenAI Codex and other AI agents in navigating your codebases, running tests, and adhering to your project's standard practices.
Related contents:
Communicate with an LLM provider using a single interface.
any-llm is a Python library providing a single interface to different llm providers.
Build Agents that Never Hallucinate. Deploy the most accurate RAG in the world in two lines of code.
The most accurate document search and store for building AI apps.
Related contents:
AI-Powered Web Scraping & Data Enrichment. AI-powered web search with instant results and follow-up questions.
🔥 Blazing-fast AI search engine with real-time citations, streaming responses, and live data powered by Firecrawl
Related contents:
Chainlit is an open-source Python package to build production ready Conversational AI.
Related contents:
React Native plugin for Apple Intelligence and Foundation Models.
A React Native plugin to access Apple Intelligence Foundation Models using native on-device LLM APIs. This module lets you check model availability, create sessions, generate structured outputs (JSON), and text using Apple's LLMs, all from React Native.
Orchestrate AI Coding Agents. Kanban board to manage your AI coding agents.
AI coding agents are increasingly writing the world's code and human engineers now spend the majority of their time planning, reviewing, and orchestrating tasks. Vibe Kanban streamlines this process.
A simple prompt playground to test and share your prompt templates.
A tool for testing, debugging, and refining AI prompts. Supports variables, templates to improve prompt quality and speed up development.
AI Powered Email, Built to Save You Time Zero is an AI-native email client that manages your inbox, so you don't have to.
Zero is an open-source AI email solution that gives users the power to self-host their own email app while also integrating external services like Gmail and other email providers. Our goal is to modernize and improve emails through AI agents to truly modernize emails.
Reflect the web through AI.
Automatically rewrites the websites into markdown optimised for viewing in the terminal. Read intro blog post here This is a proof-of-concept, bugs are to be expected but feel free to raise an issue or pull request.
Related contents:
Use AI to transform your messy commit history into clean, logical commits that reviewers will love.
Git Smart Squash uses AI to automatically organize your changes into logical, well-structured commits in seconds.
Related contents:
Generate realistic datasets for demos, learning, and dashboards. Instantly preview data, export as CSV or SQL, and explore with Metabase.
This is a tool for routing Claude Code requests to different models, and you can customize any request.
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
Open source real-time translation app for Android that runs locally.
RTranslator is an (almost) open-source, free, and offline real-time translation app for Android.
Connect to someone who has the app, connect Bluetooth headphones, put the phone in your pocket and you can have a conversation as if the other person spoke your language.
AI-powered Postgres Client. cursor for data ⚡️ - explore your postgresql db in seconds.
Standardized Serverless ML Inference Platform on Kubernetes. Highly scalable and standards based Model Inference Platform on Kubernetes for Trusted AI.
KServe provides a Kubernetes Custom Resource Definition for serving predictive and generative machine learning (ML) models. It aims to solve production model serving use cases by providing high abstraction interfaces for Tensorflow, XGBoost, ScikitLearn, PyTorch, Huggingface Transformer/LLM models using standardized data plane protocols.
A browser automation SDK built for developers and LLMs.
Most existing browser automation tools either require you to write low-level code in a framework like Selenium, Playwright, or Puppeteer, or use high-level agents that can be unpredictable in production. By letting developers choose what to write in code vs. natural language, Stagehand is the natural choice for browser automations in production.
An open source framework for building AI-powered apps with familiar code-centric patterns. Genkit makes it easy to develop, integrate, and test AI features with observability and evaluations. Genkit works with various models and platforms.
Genkit is an open-source framework for building full-stack AI-powered applications, built and used in production by Google's Firebase. It provides SDKs for multiple programming languages with varying levels of stability
open-source LLM infrastructure.
TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.
The AI Shell. AI enabled pair programmer for Claude, GPT, O Series, Grok, Deepseek, Gemini and 300+ models.
A comprehensive coding agent that integrates AI capabilities with your development environment.
Open Source, Distributed, Big Data Enterprise Search Engine.
Datafari is an open source enterprise search solution enriched with AI. It is the perfect product for anyone who needs to search and analyze its corporate data and documents, both within the content and the metadata. Plus, with its genAI modules, it allows to easily leverage mistral, openai, or local LLMs for your company data.
Core building blocks for AI apps. High-quality, accessible, and customizable components for AI interfaces.
Build chat experiences, AI agents, autonomous assistants, and more, quickly and beautifully.
A tool that adds small enhancements to Claude Code.
- Reduced interruptions: Auto-handles permission dialogs based on configurable rules
- Flexible control: Rulesets define which actions to allow automatically
- Tool management: Toolsets configure which tools Claude can use
- Enhanced visibility: System notifications keep you informed
easy secure execution of untrusted user/ai code.
Run untrusted code with VM-level isolation and lightning-fast startup. Built for AI agents, developers, and anyone who needs to execute code safely without compromising on speed or security.