ai
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Curated papers, articles, and blogs on data science & machine learning in production. ⚙️
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
⚡ Building applications with LLMs through composability ⚡.
LangChain is a framework for developing applications powered by language models.
Data Integration, Data Quality, & Analytics Solutions.
Qlik, now with Talend, delivers a data fabric for modern data architectures and next-gen analytics powered by Qlik Staige™, a suite of AI and machine learning capabilities.
Effortlessly remove background from images directly in the browser with no additional costs and privacy concerns.
Open-source Solution for AI Quality. The testing framework dedicated to ML models, from tabular to LLMs Scan AI models to detect risks of biases, performance issues and errors. In 4 lines of code.
A platform for the machine learning lifecycle.
MLflow is a platform to streamline machine learning development, including tracking experiments, packaging code into reproducible runs, and sharing and deploying models. MLflow offers a set of lightweight APIs that can be used with any existing machine learning application or library (TensorFlow, PyTorch, XGBoost, etc), wherever you currently run ML code (e.g. in notebooks, standalone applications or the cloud). MLflow's current components are:
Doctor Dignity is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private.
Ship AI features in minutes. Pezzo enables you to build, test, monitor and instantly ship AI all in one platform, while constantly optimizing for cost and performance.
🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.
The most powerful vector database for building AI applications. Open-source PostgreSQL database extension for vector data and vector search operations.
Lantern is an open-source PostgreSQL database extension to store vector data, generate embeddings, and handle vector search operations.
one-click face swap
Take a video and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training.
A Multilingual Code Generation Tool.
We introduce CodeGeeX, a large-scale multilingual code generation model with 13 billion parameters, pre-trained on a large code corpus of more than 20 programming languages. As of June 22, 2022, CodeGeeX has been trained on more than 850 billion tokens on a cluster of 1,536 Ascend 910 AI Processors.
txtai is an all-in-one embeddings database for semantic search, LLM orchestration and language model workflows.
automatically tests prompt injection attacks on ChatGPT instances.
Prompt injection is a type of security vulnerability that can be exploited to control the behavior of a ChatGPT instance. By injecting malicious prompts into the system, an attacker can force the ChatGPT instance to do unintended actions.
Automatic Generation of Visualizations and Infographics with LLMs.
LIDA is a library for generating data visualizations and data-faithful infographics. LIDA is grammar agnostic (will work with any programming language and visualization libraries e.g. matplotlib, seaborn, altair, d3 etc) and works with multiple large language model providers (OpenAI, PaLM, Cohere, Huggingface). Details on the components of LIDA are described in the paper here and in this tutorial notebook. See the project page here for updates!.
Defog's SQLCoder is a state-of-the-art LLM for converting natural language questions to SQL queries.
The HackerNoon Library.
THE LEARN REPO orders technology stories by editor determined subject matter and community determined time reading created. It is an open source lever within the HackerNoon Story Classification System.
Your Guide to Communicating with Artificial Intelligence.
Learn how to use ChatGPT and other AI tools to accomplish your goals using our free and open source curriculum, designed for all skill levels!
Pigo is a pure Go face detection, pupil/eyes localization and facial landmark points detection library based on the Pixel Intensity Comparison-based Object detection paper.
Skybox AI uses AI to generate full 360 degree panoramic images. 2 available mode tabs below give you full creative control of your skybox.
Get up and running with large language models, locally. Run Llama 2 and other models on macOS. Customize and create your own.
NASA and IBM have teamed up to create an AI Foundation Model for Earth Observations, using large-scale satellite and remote sensing data, including the Harmonized Landsat and Sentinel-2 (HLS) data. By embracing the principles of open AI and open science, both organizations are actively contributing to the global mission of promoting knowledge sharing and accelerating innovations in addressing critical environmental challenges. With Hugging Face's platform, they simplify geospatial model training and deployment, making it accessible for open science users, startups, and enterprises on multi-cloud AI platforms like watsonx. Additionally, Hugging Face enables easy sharing of the pipelines of the model family, which our team calls Prithvi, within the community, fostering global collaboration and engagement.
The world's simplest facial recognition api for Python and the command line.
Recognize and manipulate faces from Python or from the command line with the world's simplest face recognition library.
Operating LLMs in production.
An open platform for operating large language models (LLMs) in production. Fine-tune, serve, deploy, and monitor any LLMs with ease.
Search 1000s of free seamless HD PBR textures. Create Textures With Poly.
Generate 3D materials with AI in a free online editor, or search our growing community library.
Trace Pixels To Vectors in Full Color, Fully Automatically, Using AI.
Convert your JPEG and PNG bitmaps to SVG vectors quickly and easily. Fully Automatically. Using AI.
Accurate AI Transcriptions in Minutes.
Web service proposing to transcribe video and/or audio content using AI
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
Unlike other APIs, it doesn't rely on proprietary providers such as Google or Azure to perform translations. Instead, its translation engine is powered by the open source Argos Translate library.
open-source geolocation.
Yachay is an open-source Machine Learning community. We have collected decades worth of useful natural language data from traditional media (i.e. New York Times articles), social media (i.e. Twitter & Reddit), messenger channels, tech blogs, GitHub profiles and issues, the dark web, and legal proceedings, as well as the decisions and publications of government regulators and legislators all across the world.
Use Kiota to generate API clients to call any OpenAPI-described API.
Kiota is a command line tool for generating an API client to call any OpenAPI described API you are interested in. The goal is to eliminate the need to take a dependency on a different API SDK for every API that you need to call. Kiota API clients provide a strongly typed experience with all the features you expect from a high quality API SDK, but without having to learn a new library for every HTTP API.
Segment Anything Model (SAM): a new AI model from Meta AI that can "cut out" any object, in any image, with a single click.
SAM is a promptable segmentation system with zero-shot generalization to unfamiliar objects and images, without the need for additional training.
Milvus is an open-source vector database built to power embedding similarity search and AI applications. Milvus makes unstructured data search more accessible, and provides a consistent user experience regardless of the deployment environment.
Stability AI Language Models.
This repository contains Stability AI's ongoing development of the StableLM series of language models and will be continuously updated with new checkpoints. The following provides an overview of all currently available models. More coming soon.
Play and create AI-generated adventures with infinite possibilities. Not sure where to start?
K8sGPT is a tool for scanning your kubernetes clusters, diagnosing and triaging issues in simple english. It has SRE experience codified into it’s analyzers and helps to pull out the most relevant information to enrich it with AI.
🎚️ Open Source Audio Matching and Mastering. Matchering 2.0 is a novel Containerized Web Application and Python Library for audio matching and mastering.
It follows a simple idea - you take TWO audio files and feed them into Matchering. Our algorithm matches both of these tracks and provides you the mastered TARGET track with the same RMS, FR, peak amplitude and stereo width as the REFERENCE track has.
A Jasper alternative open source with ChatGPT.
This project uses ChatGPT API to create almost any text based output for your need - from marketing content to blog post ideas and a lot more. It uses simple template based components to ask ChatGPT for generating results Creating new templates or tasks take about 30 mins. no more, so you can extend it for your needs or wait for new template release :)
The free AI encyclopedia. AI tools, podcasts, prompts, newsletter, and movies.
A browser interface based on Gradio library for Stable Diffusion.
OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots for various applications. The kit includes an instruction-tuned 20 billion parameter language model, a 6 billion parameter moderation model, and an extensible retrieval system for including up-to-date responses from custom repositories. It was trained on the OIG-43M training dataset, which was a collaboration between Together, LAION, and Ontocord.ai. Much more than a model release, this is the beginning of an open source project. We are releasing a set of tools and processes for ongoing improvement with community contributions.
Transcribe and translate any audio file.
Free, fast and accurate transcription of audio files. 100% free to use.
Free AI filter for cleaning up spoken audio. Enhance voice recordings for free.
Speech enhancement makes voice recordings sound as if they were recorded in a professional studio.
Generate, Edit & Filter images using the DALL-E 2 API
dallecli is a command line app designed to provide users with the ability to generate, edit and filter images using the DALL-E 2 API provided by OpenAI.
Read less, understand more. Unleash the power of quick and easy reading - just paste your URL for an instant summary!
Welcome to Jotte, an AI-powered graph-based writing tool that helps you create high-quality, informative content with ease. Jotte uses nodes and varying specificities of summaries to carry relevant information through an extremely long set of text, making it an ideal tool for creating long-form content.
Coqui STT (frogSTT) is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. frogSTT is battle tested in both production and research rocket
PhotoPrism® is an AI-Powered Photos App for the Decentralized Web.
It makes use of the latest technologies to tag and find pictures automatically without getting in your way. You can run it at home, on a private server, or in the cloud.
The Open Source Privacy-Focused Voice Assistant.
Mycroft is the world’s leading open source voice assistant. It is private by default and completely customizable.
fastai is a deep learning library which provides practitioners with high-level components that can quickly and easily provide state-of-the-art results in standard deep learning domains, and provides researchers with low-level components that can be mixed and matched to build new approaches. It aims to do both things without substantial compromises in ease of use, flexibility, or performance. This is possible thanks to a carefully layered architecture, which expresses common underlying patterns of many deep learning and data processing techniques in terms of decoupled abstractions. These abstractions can be expressed concisely and clearly by leveraging the dynamism of the underlying Python language and the flexibility of the PyTorch library. fastai includes:
Qdrant (read: quadrant ) is a vector similarity search engine and vector database. It provides a production-ready service with a convenient API to store, search, and manage points - vectors with an additional payload. Qdrant is tailored to extended filtering support. It makes it useful for all sorts of neural-network or semantic-based matching, faceted search, and other applications.
Related contents:
An AI-powered Personal Identifiable Information (PII) scanner.. Octopii is an open-source AI-powered Personal Identifiable Information (PII) scanner that can look for image assets such as Government IDs, passports, photos and signatures in a directory.