ai
An AI driven recommendation system based on Radarr and Sonarr library information.
Recommendarr is a web application that generates personalized TV show and movie recommendations based on your Sonarr, Radarr, Plex, and Jellyfin libraries using AI.
Free & Open-Source AI Voice Generator.
A powerful, browser-based AI voice generator that lets you create natural-sounding voices without installing anything.
Use it directly in your browser or self-host it for your own applications with OpenAI API compatibility!
WAGMIOS is a self-hosted container management system with AI-powered automation. It enables you to efficiently manage your containers with W.I.L.L.O.W, an AI assistant that optimizes your workflow.
Evolving agents is a production-grade environment for orchestrating, evolving, and managing AI agents.
A production-grade framework for creating, managing, and evolving AI agents with intelligent agent-to-agent communication. The framework enables you to build collaborative agent ecosystems that can semantically understand requirements, evolve based on past experiences, and communicate effectively to solve complex tasks.
An in-depth book and reference on building agentic systems like Claude Code. A deep-dive guide into architecture patterns for building responsive, reliable AI coding agents.
There's been a lot of asking about how Claude Code works under the hood. Usually, people see the prompts, but they don't see how it all comes together. This is that book. All of the systems, tools, and commands that go into building one of these.
A practical deep dive and code review into how to build a self-driving coding agent, execution engine, tools and commands. Rather than the prompts and AI engineering, this is the systems and design decisions that go into making agents that are real-time, self-corrective, and useful for productive work.
Terminal-based AI coding tool that can use any model that supports the OpenAI-style API.
The Platform for Building Stateful Agents. Build agents with infinite context and human-like memory, that can learn from data and improve with experience. Letta (formerly MemGPT) is a framework for creating LLM services with memory.
👾 Letta is an open source framework for building stateful LLM applications. You can use Letta to build stateful agents with advanced reasoning capabilities and transparent long-term memory. The Letta framework is white box and model-agnostic.
Related contents:
Finding the Scaling Laws of Agents. The first and the best multi-agent framework.
🐫 CAMEL is an open-source community dedicated to finding the scaling laws of agents. We believe that studying these agents on a large scale offers valuable insights into their behaviors, capabilities, and potential risks. To facilitate research in this field, we implement and support various types of agents, tasks, prompts, models, and simulated environments.
The framework enables multi-agent systems to continuously evolve by generating data and interacting with environments. This evolution can be driven by reinforcement learning with verifiable rewards or supervised learning.
Ollama Automated Security Intelligence Scanner.
🛡️ An AI-powered security auditing tool that leverages Ollama models to detect and analyze potential security vulnerabilities in your code.
Advanced code security analysis through the power of AI
Related contents:
Opsy - Your AI-Powered SRE Colleague.
Opsy is an intelligent command-line assistant designed for Site Reliability Engineers (SREs), DevOps professionals, and platform engineers. It uses AI to help you navigate operational challenges, troubleshoot issues, and automate routine workflows. Opsy integrates with your existing tools and provides contextual assistance to make your daily operations more efficient.
A command-line Ollama client for scripting.
The Ollama function caller, otherwise known as ofc, is a command-line tool for prompting Ollama models locally on your system. There are other programs out there that do similar things, but they either don't support streaming or don't give me access to important settings, like context length or temperature.
Plauna is a privacy focused service that helps you categorize your e-mails. All of its data is stored only on your computer.
You can parse your e-mails from mbox files or directly by connecting to your IMAP servers. Plauna helps you categorize your e-mails and automates the process so that incoming e-mails are moved to their respective folders on your IMAP server.
Revolutionize Your Development with Windsurf AI IDE
As the first truly intelligent development environment, Codeium Windsurf AI combines powerful AI agents with intuitive co-pilots to create an unparalleled coding experience that adapts to your needs and enhances your productivity.
Related contents:
Al powered voice to text.
Write 3x faster, without lifting a finger.
Related contents:
AI you can trust
Easily summarize emails, docs, articles, and videos across the web — without sacrificing your privacy
Related contents:
curated list of resources for AI Engineering.
Related contents:
The TypeScript framework for agents & workflows with react-like components. Lightning fast dev loop. Easy to learn. Easy to extend.
Build complex AI applications with React-like components. GenSX is a simple typescript framework for building agents and workflows with reusable React-like components. GenSX takes a lot of inspiration from React, but the programming model is very different - it’s a Node.js framework designed for data flow.
Ship Faster with Trae
Trae is an adaptive AI IDE that transforms how you work, collaborating with you to run faster.
Related contents:
Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK! Search with models like Grok 2.0.
A virtual town where AI characters live, chat and socialize. Log in to join the town and the conversation!
A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
Related contents:
Data processing with ML, LLM and Vision LLM.
Sparrow is an innovative open-source solution for efficient data extraction and processing from various documents and images. It seamlessly handles forms, bank statements, invoices, receipts, and other unstructured data sources. Sparrow stands out with its modular architecture, offering independent services and pipelines all optimized for robust performance.
Related contents:
A real-time chat application integrated with N8N workflows for automated responses and processing.
🪄 Create rich visualizations with AI
Data Formulator is an application from Microsoft Research that uses large language models to transform data, expediting the practice of data visualization.
Data Formulator is an AI-powered tool for analysts to iteratively create rich visualizations. Unlike most chat-based AI tools where users need to describe everything in natural language, Data Formulator combines user interface interactions (UI) and natural language (NL) inputs for easier interaction. This blended approach makes it easier for users to describe their chart designs while delegating data transformation to AI.
Your Novel Writing Toolbox. Write, without restriction.
Novelcrafter's got your back, arming you with the tools you need to dream, draft, and deliver your story.
build ml models in natural language and minimal code.
Create machine learning models with minimal code by describing what you want them to do in plain words. You explain the task, and the library builds a model for you, including data generation, feature engineering, training, and packaging.
OpenAI just published Deep Research, a very nice assistant that can perform deep searches on the web to answer user questions.
However, their agent has a huge downside: it's not open. So we've started a 24-hour rush to replicate and open-source it. Our resulting open-Deep-Research agent took the #1 rank of any open submission on the GAIA leaderboard! ✨
Related contents:
For better or for worse, LLMs are here to stay. We all read content that they produce online, most of us interact with LLM chatbots, and many of us use them to produce content of our own.
In a series of five- to ten-minute lessons, we will explain what these machines are, how they work, and how to thrive in a world where they are everywhere.
You will learn when these systems can save you a lot of time and effort. You will learn when they are likely to steer you wrong. And you will discover how to see through the hype to tell the difference. ?
AI by Hand ✍️ Exercises in Excel
The World's Leading AI Platform for Enterprise. The all-in-one platform for private and secure AI.
Cohere brings you cutting-edge multilingual models, advanced retrieval, and an AI workspace tailored for the modern enterprise — all within a single, secure platform.
Related contents:
LLM inference in C/C++.
The main goal of llama.cpp is to enable LLM inference with minimal setup and state-of-the-art performance on a wide range of hardware - locally and in the cloud.
Related contents:
Simple, secure, and reproducible packaging for AI/ML projects.
KitOps is an open source DevOps tool that packages and versions your AI/ML model, datasets, code, and configuration into a reproducible artifact called a ModelKit. ModelKits are built on existing standards, ensuring compatibility with the tools your data scientists and developers already use.
The Cloud Built for AI.
Train, fine-tune and deploy AI models with RunPod.
Globally distributed GPU cloud for your AI workloads. Deploy any GPU workload seamlessly, so you can focus less on infrastructure and more on running ML models.
Rent GPUs.
Vast.ai is the market leader in low-cost cloud GPU rental. Use one simple interface to save 5-6X on GPU compute.
Open Universal Machine Intellingence. E2E Foundation Model Research Platform. Everything you need to build state-of-the-art foundation models, end-to-end.
Oumi is a fully open-source platform that streamlines the entire lifecycle of foundation models - from data preparation and training to evaluation and deployment. Whether you're developing on a laptop, launching large scale experiments on a cluster, or deploying models in production, Oumi provides the tools and workflows you need.
A short introduction to RLHF and post-training focused on language models.
llamafile lets you distribute and run LLMs with a single file.
Our goal is to make open LLMs much more accessible to both developers and end users. We're doing that by combining llama.cpp with Cosmopolitan Libc into one framework that collapses all the complexity of LLMs down to a single-file executable (called a "llamafile") that runs locally on most computers, with no installation.
Your Open-Source Personal Assistant.
Leon is your open-source personal assistant who can live on your server. He does stuff when you ask him to.
A Prompt Manager that focuses on On-Premise and developer experience.
Build and ship AI products on a single, collaborative platform
Take AI products from early-stage ideas to production-grade features with tooling for experimentation, evaluation, deployment, monitoring, and collaboration.
Related contents:
The open-source visual AI programming environment and TypeScript library.
Rivet, the IDE for creating complex AI agents and prompt chaining, and embedding it in your application.
Related contents:
Enable generative AI applications to automate multistep tasks by seamlessly connecting with company systems, APIs, and data sources.
Related contents:
Cerebras Inference The world’s fastest inference -70x faster than GPU clouds,128K context, 16-bit precision.
Cerebras Inference Llama 3.3 70B runs at 2,200 tokens/s and Llama 3.1 405B at 969 tokens/s – over 70x faster than GPU clouds. Get instant responses to code-gen, summarization, and agentic tasks.
Related contents:
The Production-Ready Open Source AI Framework.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🤗 Transformers provides APIs and tools to easily download and train state-of-the-art pretrained models. Using pretrained models can reduce your compute costs, carbon footprint, and save you the time and resources required to train a model from scratch.
Related contents:
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator.
ONNX Runtime inference can enable faster customer experiences and lower costs, supporting models from deep learning frameworks such as PyTorch and TensorFlow/Keras as well as classical machine learning libraries such as scikit-learn, LightGBM, XGBoost, etc.
Related contents:
An app that brings language models directly to your phone.
PocketPal AI is a pocket-sized AI assistant powered by small language models (SLMs) that run directly on your phone. Designed for both iOS and Android, PocketPal AI lets you interact with various SLMs without the need for an internet connection.
Related contents:
Cursor for Designers.
The power of Cursor for your own React website. Onlook lets you visually edit your React website and write your changes back to code in real-time.
The open source Cursor for Designers. Design directly in your live React app and publish your changes to code.
Easy, fast, and cheap LLM serving for everyone.
vLLM is a fast and easy-to-use library for LLM inference and serving.
Originally developed in the Sky Computing Lab at UC Berkeley, vLLM has evloved into a community-driven project with contributions from both academia and industry.
Related contents:
- How to serve LLMs with vLLM and OVHcloud AI Deploy @ OVHcloud.
- Episode 616: From Boston to bootc @ Linux Unplugged.
- What is vLLM @ RedHat.
- Faire tourner un LLM localement sur votre ordinateur @ Quoi de neuf les devs ? :fr:.
- Inside vLLM: Anatomy of a High-Throughput LLM Inference System @ Aleksa Gordić blog.
- vLLM : Maîtriser l'Inference Haute Performance pour les LLM @ DevSecOps :fr:.
15 trillion tokens of the finest data the 🌐 web has to offer.
The 🍷 FineWeb dataset consists of more than 15T tokens of cleaned and deduplicated english web data from CommonCrawl. The data processing pipeline is optimized for LLM performance and ran on the 🏭 datatrove library, our large scale data processing library.
🍷 FineWeb was originally meant to be a fully open replication of 🦅 RefinedWeb, with a release of the full dataset under the ODC-By 1.0 license. However, by carefully adding additional filtering steps, we managed to push the performance of 🍷 FineWeb well above that of the original 🦅 RefinedWeb, and models trained on our dataset also outperform models trained on other commonly used high quality web datasets (like C4, Dolma-v1.6, The Pile, SlimPajama, RedPajam2) on our aggregate group of benchmark tasks.
Related contents:
The Annual Conference on Neural Information Processing Systems.
Related content:
Run AI with an API. Run and fine-tune open-source models. Deploy custom models at scale. All with one line of code.
Thousands of models contributed by our community. All the latest open-source models are on Replicate. They’re not just demos — they all actually work and have production-ready APIs.
AI shouldn’t be locked up inside academic papers and demos. Make it real by pushing it to Replicate.
Related contents:
Pack your codebase into AI-friendly formats.
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, and Gemini.
Discover, download, and run local LLMs.
Related contents:
- #104 Développer des projets IA - introduction @ Double Slash :fr:.
- Faire tourner un LLM localement sur votre ordinateur @ Quoi de neuf les devs ? :fr:.
- Drames et dramas d’août @ Le RDV Tech podcast.
- LM Studio : Faire tourner son IA (LLM) facilement (Chat, Developpement, ...) @ Adrien Linuxtricks' YouTube :fr:.
Open-source document processing platform built for knowledge workers.
Rowfill helps extract, analyze, and process data from complex documents, images, PDFs and more with advanced AI capabilities.
Automate and clean your inbox. Clean Up Your Inbox In Minutes.
Bulk unsubscribe from newsletters, automate your emails with AI, block cold emails, and view your analytics. Open-source.
Open weights LLM for French, English, German, Spanish and Italian.
Related contents: