AI image edition toolbox
Trace Pixels To Vectors in Full Color, Fully Automatically, Using AI.
Convert your JPEG and PNG bitmaps to SVG vectors quickly and easily. Fully Automatically. Using AI.
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
Unlike other APIs, it doesn't rely on proprietary providers such as Google or Azure to perform translations. Instead, its translation engine is powered by the open source Argos Translate library.
open-source geolocation.
Yachay is an open-source Machine Learning community. We have collected decades worth of useful natural language data from traditional media (i.e. New York Times articles), social media (i.e. Twitter & Reddit), messenger channels, tech blogs, GitHub profiles and issues, the dark web, and legal proceedings, as well as the decisions and publications of government regulators and legislators all across the world.
Segment Anything Model (SAM): a new AI model from Meta AI that can "cut out" any object, in any image, with a single click.
SAM is a promptable segmentation system with zero-shot generalization to unfamiliar objects and images, without the need for additional training.
Milvus is an open-source vector database built to power embedding similarity search and AI applications. Milvus makes unstructured data search more accessible, and provides a consistent user experience regardless of the deployment environment.
Convert any music library into a music production sample-library with ML.
Polymath uses machine learning to convert any music library (e.g from Hard-Drive or YouTube) into a music production sample-library. The tool automatically separates songs into stems (beats, bass, etc.), quantizes them to the same tempo and beat-grid (e.g. 120bpm), analyzes musical structure (e.g. verse, chorus, etc.), key (e.g C4, E3, etc.) and other infos (timbre, loudness, etc.), and converts audio to midi. The result is a searchable sample library that streamlines the workflow for music producers, DJs, and ML audio developers.
Falcon LLM is a foundational large language model (LLM) with 40 billion parameters trained on one trillion tokens. TII has now released Falcon LLM – a 40B model.
Offline audio transcription and translation.
Transcribe and translate audio offline on your personal computer. Powered by OpenAI's Whisper.
ImageBind One Embedding Space to Bind Them All.
PyTorch implementation and pretrained models for ImageBind. For details, see the paper: ImageBind: One Embedding Space To Bind Them All.
ImageBind learns a joint embedding across six different modalities - images, text, audio, depth, thermal, and IMU data. It enables novel emergent applications ‘out-of-the-box’ including cross-modal retrieval, composing modalities with arithmetic, cross-modal detection and generation.
🔊 Text-Prompted Generative Audio Model
Bark is a transformer-based text-to-audio model created by Suno. Bark can generate highly realistic, multilingual speech as well as other audio - including music, background noise and simple sound effects. The model can also produce nonverbal communications like laughing, sighing and crying. To support the research community, we are providing access to pretrained model checkpoints, which are ready for inference and available for commercial use.