Search: [machine-learning] - Biapy Web Directory

Buzz Captions https://buzzcaptions.com/

Tue May 30 11:47:52 2023

📧email

Offline audio transcription and translation.

Transcribe and translate audio offline on your personal computer. Powered by OpenAI's Whisper.

Buzz Captions @ GitHub.

ImageBind https://github.com/facebookresearch/ImageBind

Wed May 10 07:36:03 2023

📧email

ImageBind One Embedding Space to Bind Them All.

PyTorch implementation and pretrained models for ImageBind. For details, see the paper: ImageBind: One Embedding Space To Bind Them All.

ImageBind learns a joint embedding across six different modalities - images, text, audio, depth, thermal, and IMU data. It enables novel emergent applications ‘out-of-the-box’ including cross-modal retrieval, composing modalities with arithmetic, cross-modal detection and generation.

gpt4all https://github.com/nomic-ai/gpt4all

Thu May 4 14:47:16 2023

📧email

an ecosystem of open-source chatbots trained on a massive collections of clean assistant data including code, stories and dialogue.

Demo, data, and code to train open-source assistant-style large language model based on GPT-J and LLaMa

Bark https://github.com/suno-ai/bark

Tue May 2 14:49:59 2023

📧email

🔊 Text-Prompted Generative Audio Model

Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying. To support the research community, we are providing access to pretrained model checkpoints, which are ready for inference and available for commercial use.

OpenCommit https://github.com/di-sukharev/opencommit

Tue Apr 25 18:58:03 2023

📧email

GPT CLI to auto-generate impressive commits in 1 second 🤯🔫

Easy Diffusion https://github.com/cmdr2/stable-diffusion-ui

Tue Apr 25 11:07:57 2023

📧email

Easiest 1-click way to install and use Stable Diffusion on your computer. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image.

Semaphore https://github.com/everythingishacked/Semaphore

Mon Apr 24 09:17:03 2023

📧email

A full-body keyboard using gestures to type through computer vision.

Semaphore uses OpenCV and MediaPipe's Pose detection to perform real-time detection of body landmarks from video input. From there, relative differences are calculated to determine specific positions and translate those into keys and commands sent via keyboard.

StableLM: https://github.com/stability-AI/stableLM/

Thu Apr 20 07:58:23 2023

📧email

Stability AI Language Models.

This repository contains Stability AI's ongoing development of the StableLM series of language models and will be continuously updated with new checkpoints. The following provides an overview of all currently available models. More coming soon.

AI Dungeon https://play.aidungeon.io/main/home

Mon Apr 17 09:33:08 2023

📧email

Play and create AI-generated adventures with infinite possibilities. Not sure where to start?

AI Food Generator by Lunchbox https://ai.lunchbox.io/

Thu Apr 13 09:18:41 2023

📧email

Every image is uniquely generated by artificial intelligence. Food items that add an image see 70% more orders and 65% higher sales compared to restaurants that do not.

Lama Cleaner https://lama-cleaner-docs.vercel.app/

Wed Apr 12 21:20:34 2023

📧email

Lama Cleaner is a free, open-source and fully self-hostable inpainting tool powered by state-of-the-art AI models. You can use it to remove any unwanted object, defect, people from your pictures or erase and replace anything on your pictures.

Lama Cleaner @ GitHub

k8sgpt https://github.com/k8sgpt-ai

Mon Apr 10 13:41:40 2023

📧email

K8sGPT is a tool for scanning your kubernetes clusters, diagnosing and triaging issues in simple english. It has SRE experience codified into it’s analyzers and helps to pull out the most relevant information to enrich it with AI.

Matchering https://github.com/sergree/matchering

Mon Mar 27 10:26:59 2023

📧email

🎚️ Open Source Audio Matching and Mastering. Matchering 2.0 is a novel Containerized Web Application and Python Library for audio matching and mastering.

It follows a simple idea - you take TWO audio files and feed them into Matchering. Our algorithm matches both of these tracks and provides you the mastered TARGET track with the same RMS, FR, peak amplitude and stereo width as the REFERENCE track has.

Jema.ai https://jema.ai/

Thu Mar 23 09:44:30 2023

📧email

A Jasper alternative open source with ChatGPT.

This project uses ChatGPT API to create almost any text based output for your need - from marketing content to blog post ideas and a lot more. It uses simple template based components to ask ChatGPT for generating results Creating new templates or tasks take about 30 mins. no more, so you can extend it for your needs or wait for new template release :)

Jema.ai @ GitHub

AIcyclopedia https://www.aicyclopedia.com/

Tue Mar 21 17:07:11 2023

📧email

The free AI encyclopedia.
AI tools, podcasts, prompts, newsletter, and movies.

Stable Diffusion web UI https://github.com/AUTOMATIC1111/stable-diffusion-webui

Mon Mar 20 16:41:58 2023

📧email

A browser interface based on Gradio library for Stable Diffusion.

Papers With Code https://paperswithcode.com/

Fri Mar 17 13:37:40 2023

📧email

The latest in Machine Learning

OpenChatKit https://github.com/togethercomputer/OpenChatKit

Thu Mar 16 15:54:29 2023

📧email

OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots for various applications. The kit includes an instruction-tuned 20 billion parameter language model, a 6 billion parameter moderation model, and an extensible retrieval system for including up-to-date responses from custom repositories. It was trained on the OIG-43M training dataset, which was a collaboration between Together, LAION, and Ontocord.ai. Much more than a model release, this is the beginning of an open source project. We are releasing a set of tools and processes for ongoing improvement with community contributions.

Whisper https://openai.com/index/whisper/

Wed Mar 15 08:26:45 2023

📧email

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.

Whisper @ GitHub.

writeout.ai – Transcribe and translate any audio file https://writeout.ai/

Mon Mar 13 10:31:28 2023

📧email

Transcribe and translate any audio file.

Free, fast and accurate transcription of audio files. 100% free to use.

writeout.ai @ GitHub

Links per page

Filters