Ultralytics YOLO11 is a cutting-edge, state-of-the-art (SOTA) model that builds upon the success of previous YOLO versions and introduces new features and improvements to further boost performance and flexibility. YOLO11 is designed to be fast, accurate, and easy to use, making it an excellent choice for a wide range of object detection and tracking, instance segmentation, image classification, and pose estimation tasks.
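A minimal usage sketch with the Ultralytics Python package; the `yolo11n.pt` weights file and the image path are illustrative placeholders:

```python
from ultralytics import YOLO

# Load a pretrained YOLO11 nano detection model (weights are downloaded on first use)
model = YOLO("yolo11n.pt")

# Run inference; each result holds boxes, classes, and confidences
results = model("path/to/image.jpg")

for result in results:
    result.show()                          # display the annotated image
    result.save(filename="prediction.jpg") # write the annotated image to disk
```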
We write your reusable computer vision tools. Whether you need to load your dataset from your hard drive, draw detections on an image or video, or count how many detections are in a zone, you can count on us!
Supervision provides a seamless process for annotating predictions generated by various object detection and segmentation models.
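A short sketch of that annotation flow, assuming an Ultralytics model as the prediction source; the weights file and image path are illustrative:

```python
import cv2
import supervision as sv
from ultralytics import YOLO

image = cv2.imread("path/to/image.jpg")   # illustrative path
model = YOLO("yolo11n.pt")                # illustrative weights
results = model(image)[0]

# Convert the model output into Supervision's common Detections format
detections = sv.Detections.from_ultralytics(results)

# Draw bounding boxes and class labels on a copy of the image
annotated = sv.BoxAnnotator().annotate(scene=image.copy(), detections=detections)
annotated = sv.LabelAnnotator().annotate(scene=annotated, detections=detections)
cv2.imwrite("annotated.jpg", annotated)
```

The same `Detections` object works with Supervision's other tools, such as zone-based counting, so the annotation step stays the same regardless of which detector produced the predictions.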
ImageBind: One Embedding Space to Bind Them All.
PyTorch implementation and pretrained models for ImageBind. For details, see the paper: ImageBind: One Embedding Space To Bind Them All.
ImageBind learns a joint embedding across six different modalities: images, text, audio, depth, thermal, and IMU data. It enables novel emergent applications out of the box, including cross-modal retrieval, composing modalities with arithmetic, and cross-modal detection and generation.
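A condensed sketch of cross-modal retrieval with the pretrained model, following the pattern shown in the repository; the text prompts and file paths are placeholders:

```python
import torch
from imagebind import data
from imagebind.models import imagebind_model
from imagebind.models.imagebind_model import ModalityType

device = "cuda:0" if torch.cuda.is_available() else "cpu"

# Load the pretrained ImageBind (huge) model
model = imagebind_model.imagebind_huge(pretrained=True)
model.eval()
model.to(device)

# Illustrative inputs: matching text, image, and audio samples
inputs = {
    ModalityType.TEXT: data.load_and_transform_text(["A dog.", "A car."], device),
    ModalityType.VISION: data.load_and_transform_vision_data(["dog.jpg", "car.jpg"], device),
    ModalityType.AUDIO: data.load_and_transform_audio_data(["dog.wav", "car.wav"], device),
}

with torch.no_grad():
    embeddings = model(inputs)

# Cross-modal retrieval: similarity between vision and text embeddings
scores = torch.softmax(
    embeddings[ModalityType.VISION] @ embeddings[ModalityType.TEXT].T, dim=-1
)
print(scores)
```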
A full-body keyboard using gestures to type through computer vision.
Semaphore uses OpenCV and MediaPipe's Pose detection to perform real-time detection of body landmarks from video input. From there, relative differences between landmarks are calculated to recognize specific body positions, which are translated into keys and commands sent as keyboard input.
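Semaphore's own gesture-to-key mapping is more involved; the sketch below only illustrates the underlying OpenCV plus MediaPipe Pose pattern, with a hypothetical raised-arm check standing in for the real gesture logic:

```python
import cv2
import mediapipe as mp

mp_pose = mp.solutions.pose
cap = cv2.VideoCapture(0)  # webcam input

with mp_pose.Pose(min_detection_confidence=0.5) as pose:
    while cap.isOpened():
        ok, frame = cap.read()
        if not ok:
            break
        # MediaPipe expects RGB frames; OpenCV captures BGR
        results = pose.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        if results.pose_landmarks:
            lm = results.pose_landmarks.landmark
            wrist = lm[mp_pose.PoseLandmark.LEFT_WRIST]
            shoulder = lm[mp_pose.PoseLandmark.LEFT_SHOULDER]
            # Hypothetical gesture: wrist above shoulder (y grows downward in image coords)
            if wrist.y < shoulder.y:
                print("left arm raised")  # a real mapping would emit a keystroke here
        cv2.imshow("pose", frame)
        if cv2.waitKey(1) & 0xFF == ord("q"):
            break

cap.release()
cv2.destroyAllWindows()
```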
OpenCV (Open Source Computer Vision Library). Open source computer vision and machine learning library.
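For a sense of the API, a few lines cover common tasks such as loading an image and running edge detection; the file paths are illustrative:

```python
import cv2

# Read an image, convert it to grayscale, and detect edges with Canny
image = cv2.imread("path/to/image.jpg")  # returns None if the path is invalid
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
edges = cv2.Canny(gray, threshold1=100, threshold2=200)
cv2.imwrite("edges.jpg", edges)
```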