llm
Open Source LLM Engineering Platform. Traces, evals, prompt management and metrics to debug and improve your LLM application.
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
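A minimal tracing sketch, assuming Langfuse's OpenAI drop-in integration and the LANGFUSE_* keys configured via environment variables as in its docs; the model name is only an example:

```python
# Hypothetical minimal example: the Langfuse OpenAI wrapper records the call,
# latency, and token usage as a trace in the Langfuse UI.
from langfuse.openai import OpenAI  # drop-in replacement for the OpenAI client

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Summarize LLM observability in one line."}],
)
print(response.choices[0].message.content)
```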
Related contents:
Prompt Engineering, Evaluation, and Observability for LLM apps.
Your collaborative, open-source, end-to-end LLM engineering platform. Agenta provides integrated tools for prompt engineering, versioning, evaluation, and observability, all in one place.
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM Observability all in one place.
A Prompt Manager that focuses on On-Premise and developer experience.
Embedditor is the open-source MS Word equivalent for embedding that helps you get the most out of your vector search.
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
LangGraph is a library for building stateful, multi-actor applications with LLMs, used to create agent and multi-agent workflows.
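A minimal sketch of a two-node LangGraph workflow, assuming a recent langgraph release; the state schema and node logic are illustrative placeholders rather than real agent steps:

```python
from typing import TypedDict
from langgraph.graph import StateGraph, START, END

class State(TypedDict):
    question: str
    answer: str

def research(state: State) -> dict:
    # A real node would call an LLM or a tool here.
    return {"answer": f"notes about {state['question']}"}

def summarize(state: State) -> dict:
    return {"answer": state["answer"].upper()}

graph = StateGraph(State)
graph.add_node("research", research)
graph.add_node("summarize", summarize)
graph.add_edge(START, "research")
graph.add_edge("research", "summarize")
graph.add_edge("summarize", END)

app = graph.compile()
print(app.invoke({"question": "stateful agents", "answer": ""}))
```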
Related contents:
Build and ship AI products on a single, collaborative platform
Take AI products from early-stage ideas to production-grade features with tooling for experimentation, evaluation, deployment, monitoring, and collaboration.
Related contents:
The open-source visual AI programming environment and TypeScript library.
Rivet is the IDE for creating complex AI agents and prompt chains, and for embedding them in your application.
Related contents:
Enable generative AI applications to automate multistep tasks by seamlessly connecting with company systems, APIs, and data sources.
Related contents:
Your on-machine AI agent, automating engineering tasks seamlessly. An open-source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM.
Related contents:
An opinionated Laravel package that extends FakerPHP and uses openai-php/laravel to generate fake data.
A Laravel package that extends FakerPHP by adding an AI-powered data generator using OpenAI. This allows you to generate more realistic and context-aware fake data in your Laravel applications.
Related contents:
Cerebras Inference: the world's fastest inference, 70x faster than GPU clouds, 128K context, 16-bit precision.
Cerebras Inference Llama 3.3 70B runs at 2,200 tokens/s and Llama 3.1 405B at 969 tokens/s – over 70x faster than GPU clouds. Get instant responses to code-gen, summarization, and agentic tasks.
Related contents:
Conferences on generative AI.
Related contents:
The Production-Ready Open Source AI Framework.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
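A minimal Haystack 2.x pipeline sketch, assuming an OpenAI API key is available; the template and model name are illustrative only:

```python
from haystack import Pipeline
from haystack.components.builders import PromptBuilder
from haystack.components.generators import OpenAIGenerator

# Build a two-component pipeline: prompt templating -> LLM generation.
prompt_builder = PromptBuilder(template="Answer briefly: {{ question }}")
llm = OpenAIGenerator(model="gpt-4o-mini")

pipe = Pipeline()
pipe.add_component("prompt_builder", prompt_builder)
pipe.add_component("llm", llm)
pipe.connect("prompt_builder.prompt", "llm.prompt")

result = pipe.run({"prompt_builder": {"question": "What is RAG?"}})
print(result["llm"]["replies"][0])
```

A RAG pipeline follows the same pattern, with a retriever component connected ahead of the prompt builder.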
OpenTelemetry-native GenAI and LLM Application Observability.
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers, VectorDBs, Agent Frameworks and GPUs.
Open-source observability for your LLM application, based on OpenTelemetry.
OpenLLMetry is a set of extensions built on top of OpenTelemetry that gives you complete observability over your LLM application. Because it uses OpenTelemetry under the hood, it can be connected to your existing observability solutions - Datadog, Honeycomb, and others.
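A minimal instrumentation sketch, assuming the traceloop-sdk package; the exporter endpoint is left to environment variables as in the project docs, and the workflow body is a placeholder:

```python
from traceloop.sdk import Traceloop
from traceloop.sdk.decorators import workflow

Traceloop.init(app_name="demo-app")  # sets up OpenTelemetry instrumentation

@workflow(name="answer_question")
def answer_question(question: str) -> str:
    # Any supported LLM SDK call made here is auto-instrumented and
    # exported as OpenTelemetry spans to your configured backend.
    return f"(answer to: {question})"

answer_question("What is OpenLLMetry?")
```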
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator.
ONNX Runtime inference can enable faster customer experiences and lower costs, supporting models from deep learning frameworks such as PyTorch and TensorFlow/Keras as well as classical machine learning libraries such as scikit-learn, LightGBM, XGBoost, etc.
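A minimal inference sketch, assuming a local "model.onnx" with a single float32 image-shaped input; the input name and shape depend entirely on the exported model:

```python
import numpy as np
import onnxruntime as ort

# Load the model on CPU; swap in CUDAExecutionProvider for GPU inference.
session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
input_name = session.get_inputs()[0].name

x = np.random.rand(1, 3, 224, 224).astype(np.float32)
outputs = session.run(None, {input_name: x})  # None = return all outputs
print(outputs[0].shape)
```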
Related contents:
An app that brings language models directly to your phone.
PocketPal AI is a pocket-sized AI assistant powered by small language models (SLMs) that run directly on your phone. Designed for both iOS and Android, PocketPal AI lets you interact with various SLMs without the need for an internet connection.
Related contents:
Cursor for Designers.
The power of Cursor for your own React website. Onlook lets you visually edit your React website and write your changes back to code in real-time.
The open source Cursor for Designers. Design directly in your live React app and publish your changes to code.
Orange Intelligence is a powerful, fully customizable productivity tool for macOS. With its elegant floating window interface, you can capture, process, and replace text seamlessly across any application. Whether you're running basic text processing, leveraging the power of large language models (LLMs) like OpenAI or local LLaMA, or creating complex agent systems, Orange Intelligence empowers you to work smarter, faster, and better.
Apple Intelligence is closed, limited, and inflexible. Orange Intelligence brings the power of customization and open source innovation to macOS, making it the perfect productivity tool for developers, researchers, and AI enthusiasts.
Easy, fast, and cheap LLM serving for everyone.
vLLM is a fast and easy-to-use library for LLM inference and serving.
Originally developed in the Sky Computing Lab at UC Berkeley, vLLM has evolved into a community-driven project with contributions from both academia and industry.
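A minimal offline-inference sketch, assuming a GPU and a model you can pull from the Hugging Face Hub; the model id is just an example:

```python
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")
params = SamplingParams(temperature=0.7, max_tokens=128)

# generate() batches prompts and runs them through vLLM's paged-attention engine.
outputs = llm.generate(["Explain paged attention in one sentence."], params)
for out in outputs:
    print(out.outputs[0].text)
```

vLLM also ships an OpenAI-compatible HTTP server for online serving; the Python API above covers the batch/offline case.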
Related contents:
- How to serve LLMs with vLLM and OVHcloud AI Deploy @ OVHcloud.
- Episode 616: From Boston to bootc @ Linux Unplugged.
- What is vLLM @ RedHat.
- Faire tourner un LLM localement sur votre ordinateur @ Quoi de neuf les devs ? :fr:.
- Inside vLLM: Anatomy of a High-Throughput LLM Inference System @ Aleksa Gordić blog.
- vLLM : Maîtriser l'Inference Haute Performance pour les LLM @ DevSecOps :fr:.
Open Repository of Web Crawl Data.
Common Crawl maintains a free, open repository of web crawl data that can be used by anyone.
Related contents:
15 trillion tokens of the finest data the 🌐 web has to offer.
The 🍷 FineWeb dataset consists of more than 15T tokens of cleaned and deduplicated English web data from CommonCrawl. The data processing pipeline is optimized for LLM performance and ran on the 🏭 datatrove library, our large-scale data processing library.
🍷 FineWeb was originally meant to be a fully open replication of 🦅 RefinedWeb, with a release of the full dataset under the ODC-By 1.0 license. However, by carefully adding additional filtering steps, we managed to push the performance of 🍷 FineWeb well above that of the original 🦅 RefinedWeb, and models trained on our dataset also outperform models trained on other commonly used high quality web datasets (like C4, Dolma-v1.6, The Pile, SlimPajama, RedPajama2) on our aggregate group of benchmark tasks.
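A minimal sketch for streaming a few FineWeb documents without downloading the full dataset, assuming the HuggingFaceFW/fineweb dataset id and its sample-10BT subset on the Hugging Face Hub:

```python
from datasets import load_dataset

# Streaming avoids materializing terabytes of data locally.
fw = load_dataset("HuggingFaceFW/fineweb", name="sample-10BT",
                  split="train", streaming=True)

for i, doc in enumerate(fw):
    print(doc["text"][:200])
    if i == 2:
        break
```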
Related contents:
The Annual Conference on Neural Information Processing Systems.
Related contents:
Multi-modal modular data ingestion and retrieval.
DataBridge is an open source library for natural language search and management of multi-modal data. Get started by installing databridge now!
DataBridge is a powerful document processing and retrieval system designed for building intelligent document-based applications. It provides a robust foundation for semantic search, document processing, and AI-powered document interactions.
Run AI with an API. Run and fine-tune open-source models. Deploy custom models at scale. All with one line of code.
Thousands of models contributed by our community. All the latest open-source models are on Replicate. They’re not just demos — they all actually work and have production-ready APIs.
AI shouldn’t be locked up inside academic papers and demos. Make it real by pushing it to Replicate.
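A minimal sketch using the replicate Python client, assuming REPLICATE_API_TOKEN is set; the model id is illustrative:

```python
import replicate

# Run a hosted model with one call; language models typically return
# their output as an iterable of string chunks.
output = replicate.run(
    "meta/meta-llama-3-8b-instruct",
    input={"prompt": "Write a haiku about inference APIs."},
)
print("".join(output))
```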
Related contents:
Pack your codebase into AI-friendly formats.
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like Claude, ChatGPT, and Gemini.
Discover, download, and run local LLMs.
Related contents:
- #104 Développer des projets IA - introduction @ Double Slash :fr:.
- Faire tourner un LLM localement sur votre ordinateur @ Quoi de neuf les devs ? :fr:.
- Drames et dramas d’août @ Le RDV Tech podcast.
- LM Studio : Faire tourner son IA (LLM) facilement (Chat, Developpement, ...) @ Adrien Linuxtricks' YouTube :fr:.
We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrates remarkable performance on reasoning. With RL, DeepSeek-R1-Zero naturally emerges with numerous powerful and interesting reasoning behaviors. However, DeepSeek-R1-Zero encounters challenges such as endless repetition, poor readability, and language mixing. To address these issues and further enhance reasoning performance, we introduce DeepSeek-R1, which incorporates cold-start data before RL.
Related contents:
A fast tool to read text-based files in a repository or directory, chunk them, and serialize them for LLM consumption.
Explore Agent Recipes
Explore common agent recipes with ready-to-copy code to improve your LLM applications.
Related contents:
A simple demonstration of more advanced, agentic patterns built on top of the Realtime API (openai/openai-realtime-agents).
Open-source document processing platform built for knowledge workers.
Rowfill helps extract, analyze, and process data from complex documents, images, PDFs and more with advanced AI capabilities.
structured-logprobs is an open-source Python library that enhances OpenAI's structured outputs by providing detailed information about token log probabilities.
This library is designed to offer valuable insights into the reliability of an LLM's structured outputs. It works with OpenAI's Structured Outputs, a feature that ensures the model consistently generates responses adhering to a supplied JSON Schema. This eliminates concerns about missing required keys or hallucinating invalid values.
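For context, a minimal sketch of the kind of completion structured-logprobs post-processes: an OpenAI Structured Outputs call with log probabilities enabled. The model name and schema are illustrative, and the library's own helper API is not shown here:

```python
from openai import OpenAI
from pydantic import BaseModel

class Invoice(BaseModel):
    vendor: str
    total: float

client = OpenAI()
completion = client.beta.chat.completions.parse(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "ACME Corp invoice, total 41.50 EUR"}],
    response_format=Invoice,   # schema the model must follow
    logprobs=True,             # token log probabilities the library can map onto fields
)
print(completion.choices[0].message.parsed)
```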
Automate and clean your inbox. Clean Up Your Inbox In Minutes.
Bulk unsubscribe from newsletters, automate your emails with AI, block cold emails, and view your analytics. Open-source.
Everything to Markdown.
E2M converts various file types (doc, docx, epub, html, htm, url, pdf, ppt, pptx, mp3, m4a) into Markdown. It’s easy to install, with dedicated parsers and converters, supporting custom configs. E2M offers an all-in-one, flexible, and open-source solution.
Open weights LLM for French, English, German, Spanish and Italian.
Related contents:
Open, high-performing generative LLMs.
The OpenLLM France consortium brings together 17 actors who joined forces following the creation of the OpenLLM France community, which today federates an ecosystem of nearly 200 entities (public research laboratories, potential data providers, specialized technology players, use-case providers...). These actors have been exchanging publicly and transparently since the beginning of summer 2023 on the community's Discord server.
A unified interface for LLMs. Better prices, better uptime, no subscription.
Related contents:
🐢 Open-Source Evaluation & Testing for AI & LLM systems.
The testing framework dedicated to ML models, from tabular to LLMs. Control risks of performance, bias and security issues in AI systems.
Fetch an entire site and save it as a text file (to be used with AI models).
LLM Agent and Evaluation Framework for Autonomous Penetration Testing.
We introduce HackSynth, a novel Large Language Model (LLM)-based agent capable of autonomous penetration testing. HackSynth's dual-module architecture includes a Planner and a Summarizer, which enable it to generate commands and process feedback iteratively. To benchmark HackSynth, we propose two new Capture The Flag (CTF)-based benchmark sets utilizing the popular platforms PicoCTF and OverTheWire. These benchmarks include two hundred challenges across diverse domains and difficulties, providing a standardized framework for evaluating LLM-based penetration testing agents.
The trusted phone number for fact-checking.
A single (free) number to counter disinformation and calm public debate.
MCP is an open protocol that standardizes how applications provide context to LLMs. Think of MCP like a USB-C port for AI applications. Just as USB-C provides a standardized way to connect your devices to various peripherals and accessories, MCP provides a standardized way to connect AI models to different data sources and tools.
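A minimal MCP server sketch using the official Python SDK's FastMCP helper, assuming the mcp package; the tool itself is a hypothetical stub:

```python
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("demo-server")

@mcp.tool()
def lookup_order(order_id: str) -> str:
    """Return the status of an order (stubbed for the example)."""
    return f"Order {order_id}: shipped"

if __name__ == "__main__":
    mcp.run()  # speaks MCP over stdio so any MCP client/host can connect
```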
Related contents:
- Model Context Protocol (MCP) Course @ Hugging Face.
- Introducing the Model Context Protocol @ Anthropic.
- LLM Agent Assisted Coding @ stafford williams.
- Everyone is talking about MCP @ Adam Silverman's X.
- What is Model Context Protocol (MCP)? How it simplifies AI integrations compared to APIs @ Norah Sakal.
- What is MCP by Anthropic?(Model Context Protocol) @ Toward AI.
- A Deep Dive Into MCP and the Future of AI Tooling @ Andreessen Horowitz.
- What is MCP (Model Context Protocol)? @ daily.dev.
- 230. MCP - It's Hot, But Will It Win? @ Hardcore Software by Steven Sinofsky.
- MCP (Model Context Protocol): Simply explained in 5 minutes @ High Growth Engineer.
- The “S” in MCP Stands for Security @ Elena Cross' Medium.
- Everything Wrong with MCP @ Shrivu's Substack.
- MCPs, Gatekeepers, and the Future of AI @ I Am Charlie Graham.
- A Critical Look at MCP @ Raz Blog.
- MCP vs API @ Glama.
- MCP explained without hype or fluff @ nilenso.
- What are MCP Servers? @ The Fly Blog.
- Block's Playbook for Designing MCP Servers @ Block Engineering Blog.
- MCP is eating the world—and it's here to stay @ Stainless.
- MCP: An (Accidentally) Universal Plugin System @ Works on My Machine.
- MCP Vulnerabilities Every Developer Should Know @ Composio.
- Your MCP Doesn’t Need 30 Tools: It Needs Code @ Armin Ronacher's Thoughts and Writings.
- The State of MCP Security @ Pynt.
- MCPs Are Just Other People's Prompts Pointing to Other People's Code @ Daniel Miessler.
- you need to learn MCP RIGHT NOW!! (Model Context Protocol) @ NetworkChuck's YouTube.
- Episode #112: A new direction for AI developer tooling @ Changelog & Friends.
Open-source, self-hosted AI coding assistant. Secure, flexible, and transparent AI coding.
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot.
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.
Use LLMs and LLM Vision (OCR) to handle paperless-ngx - Document Digitalization powered by AI.
paperless-gpt seamlessly pairs with paperless-ngx to generate AI-powered document titles and tags, saving you hours of manual sorting. While other tools may offer AI chat features, paperless-gpt stands out by supercharging OCR with LLMs—ensuring high accuracy, even with tricky scans. If you’re craving next-level text extraction and effortless document organization, this is your solution.
Open-Source, Free, and AI-Powered News in Short.
Your open-source, AI-powered news companion that redefines how you consume news. Get bite-sized summaries from trusted sources worldwide, personalized to your interests. Stay informed without the overwhelm.
An automated document analyzer for Paperless-ngx using the OpenAI API and Ollama (Mistral, Llama, Phi-3, Gemma 2) to automatically analyze and tag your documents.
It features: Automode, Manual Mode, Ollama and OpenAI support, a chat function to query your documents with AI, and a modern, intuitive web interface.
Storyteller is a self-hosted platform for creating and reading ebooks with synced narration. It's made of three components: the API server, the web interface, and the mobile apps. Together, these components allow you to take audiobooks and ebooks that you already own and automatically synchronize them, as well as read or listen to (or both!) the resulting synced books.
Related contents:
Built to keep you in flow state.
The first agentic IDE, and then some. The Windsurf Editor is where the work of developers and AI truly flow together, allowing for a coding experience that feels like literal magic.
Open-Source LLM-Friendly Web Crawler & Scraper.
Crawl4AI delivers blazing-fast, AI-ready web crawling tailored for large language models, AI agents, and data pipelines. Fully open source, flexible, and built for real-time performance, Crawl4AI empowers developers with unmatched speed, precision, and deployment ease.
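A minimal crawl sketch, assuming the crawl4ai package and its AsyncWebCrawler API; the URL is just an example:

```python
import asyncio
from crawl4ai import AsyncWebCrawler

async def main() -> None:
    async with AsyncWebCrawler() as crawler:
        result = await crawler.arun(url="https://example.com")
        print(result.markdown[:500])  # LLM-ready Markdown extracted from the page

asyncio.run(main())
```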
Your AI Second Brain. Ask anything, understand documents, create new content.
Khoj is a personal AI app to extend your capabilities. It smoothly scales up from an on-device personal AI to a cloud-scale enterprise AI.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
Related contents:
Open-Source Data Movement for LLMs. Data integration platform for ELT pipelines from APIs, databases & files to databases, warehouses & lakes.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
AI Code Generator.
An open-source Claude Artifacts alternative: generate small apps with one prompt. Powered by Llama 3 405B & Together.ai.
⚡ Build context-aware reasoning applications ⚡
LangChain is a framework for developing applications powered by large language models (LLMs).
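A minimal sketch of a prompt-to-model chain with LangChain's expression language, assuming langchain-openai is installed and an OpenAI API key is set; the model id is only an example:

```python
from langchain_core.prompts import ChatPromptTemplate
from langchain_openai import ChatOpenAI

prompt = ChatPromptTemplate.from_template("Explain {topic} in one sentence.")
llm = ChatOpenAI(model="gpt-4o-mini")

chain = prompt | llm  # composes runnables left to right
print(chain.invoke({"topic": "context windows"}).content)
```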
Simple frontend for LLMs built in React Native.
ChatterUI is a native mobile frontend for LLMs.
Run LLMs on device or connect to various commercial or open source APIs. ChatterUI aims to provide a mobile-friendly interface with fine-grained control over chat structuring.
Related contents:
A course on neural networks that starts all the way at the basics. The course is a series of YouTube videos where we code and train neural networks together. The Jupyter notebooks we build in the videos are then captured here inside the lectures directory. Every lecture also has a set of exercises included in the video description. (This may grow into something more respectable).
MMAudio generates synchronized audio given video and/or text inputs.
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis.
Related contents: