<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <title>computer-vision</title>
    <link rel="self" type="application/atom+xml" href="https://links.biapy.com/guest/tags/45/feed"/>
    <updated>2026-06-17T07:59:40+00:00</updated>
    <id>https://links.biapy.com/guest/tags/45/feed</id>
            <entry>
            <id>https://links.biapy.com/links/12402</id>
            <title type="text"><![CDATA[Granite 4.0 3B Vision]]></title>
            <link rel="alternate" href="https://huggingface.co/ibm-granite/granite-4.0-3b-vision" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/12402"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Granite-4.0-3B-Vision is a vision-language model (VLM) designed for enterprise-grade document data extraction. It focuses on specialized, complex extraction tasks that ultracompact models often struggle with.

Related contents:

- [Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents  @ Hugging Face](https://huggingface.co/blog/ibm-granite/granite-4-vision).]]>
            </summary>
            <updated>2026-04-03T14:29:59+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/12057</id>
            <title type="text"><![CDATA[Motion]]></title>
            <link rel="alternate" href="https://motion-project.github.io/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/12057"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Motion is a highly configurable program that monitor video signals from many types of cameras and depending upon how they are configured, perform actions when movement is detected. 

- [Motion @ GitHub](https://github.com/Motion-Project/motion).

Related contents:

- [Motion - L&amp;#039;outil Linux pour gérer toutes vos caméras de surveillance @ Korben :fr:](https://korben.info/motion-loutil-linux-pour-gerer-toutes-vos-cameras-de-surveillance.html).]]>
            </summary>
            <updated>2026-03-09T09:32:41+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/12044</id>
            <title type="text"><![CDATA[Label Studio]]></title>
            <link rel="alternate" href="https://labelstud.io/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/12044"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Open Source Data Labeling.

The most flexible data labeling platform to fine-tune LLMs, prepare training data, or evaluate AI systems.
 Label Studio is a multi-type data labeling and annotation tool with standardized output format.
Label Studio is an open source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats. It can be used to prepare raw data or improve existing training data to get more accurate ML models.

- [Label Studio @ GitHub](https://github.com/HumanSignal/label-studio/).]]>
            </summary>
            <updated>2026-03-06T14:56:24+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/11032</id>
            <title type="text"><![CDATA[Skyvern]]></title>
            <link rel="alternate" href="https://www.skyvern.com/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/11032"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[AI Browser Automation.
 Automate browser based workflows with AI.

Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows on a large number of websites, replacing brittle or unreliable automation solutions.

- [Skyvern @ GitHub](https://github.com/Skyvern-AI/skyvern).]]>
            </summary>
            <updated>2025-11-21T12:42:51+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/10909</id>
            <title type="text"><![CDATA[Unblink]]></title>
            <link rel="alternate" href="https://github.com/tri2820/unblink" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/10909"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[VLM app for video analytics.

Unblink is a camera monitoring application that runs AI vision models on your camera streams in real-time.]]>
            </summary>
            <updated>2025-11-07T14:24:04+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/10891</id>
            <title type="text"><![CDATA[🏰 Grayskull]]></title>
            <link rel="alternate" href="https://github.com/zserge/grayskull" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/10891"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[A tiny, dependency-free computer vision library in C for embedded systems, drones, and robotics. 

Grayskull is a minimalist, dependency-free computer vision library designed for microcontrollers and other resource-constrained devices. It focuses on grayscale images and provides modern, practical algorithms that fit in a few kilobytes of code. Single-header design, integer-based operations, pure C99.

Related contents:

- [By the power of grayscale! @ zserge&amp;#039;s blog](https://zserge.com/posts/grayskull/).]]>
            </summary>
            <updated>2025-11-05T14:03:45+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/10848</id>
            <title type="text"><![CDATA[Image Moderator for S3 Bucket]]></title>
            <link rel="alternate" href="https://github.com/lrasata/infra-s3-image-moderator/tree/v1.0.0" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/10848"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[AWS-based automation which scans images stored in an Amazon S3 bucket for inappropriate or unsafe content using Amazon Rekognition.

Related contents:

- [Detect inappropriate images in S3 with AWS Rekognition + Terraform @ Liantsoa R.&amp;#039;s Medium](https://medium.com/@rmliantsoa/detect-inappropriate-images-in-s3-with-aws-rekognition-terraform-b1ddd185c1db).]]>
            </summary>
            <updated>2025-11-03T10:31:56+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/10782</id>
            <title type="text"><![CDATA[Peekaboo]]></title>
            <link rel="alternate" href="https://www.peekaboo.boo/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/10782"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[AI Vision for macOS. Fast Screen Capture &amp;amp; VQA.

Peekaboo is a macOS CLI &amp;amp; optional MCP server that enables AI agents to capture screenshots of applications, or the entire system, with optional visual question answering through local or remote AI models. 

- [Peekaboo @ GitHub](https://github.com/steipete/Peekaboo).

Related contents:

- [Accelerate developer productivity with these 9 open source AI and MCP projects @ GitHub blog](https://github.blog/open-source/accelerate-developer-productivity-with-these-9-open-source-ai-and-mcp-projects/).]]>
            </summary>
            <updated>2025-10-27T14:08:36+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/10264</id>
            <title type="text"><![CDATA[OpenVision 2]]></title>
            <link rel="alternate" href="https://ucsc-vlaa.github.io/OpenVision2/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/10264"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning.

OpenVision 2: A Family of Generative Pretrained Visual Encoders that removes the text encoder and contrastive loss, training with caption-only supervision.

- [OpenVision &amp;amp; OpenVision 2 @ GitHub](https://github.com/UCSC-VLAA/OpenVision).]]>
            </summary>
            <updated>2025-09-18T06:09:18+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/11</id>
            <title type="text"><![CDATA[Enhance Lab :fr:]]></title>
            <link rel="alternate" href="https://enhancelab.fr/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/11"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[AI and inverse problems for a revolution in digital photography.

Related contents:

- [S5E21 - On a reçu le génie français qui révolutionne la vision artificielle
 @ Underscore_ :fr:](https://shows.acast.com/micode-underscore/episodes/on-a-recu-le-genie-francais-qui-revolutionne-la-vision-artif).]]>
            </summary>
            <updated>2025-09-03T10:04:33+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/134</id>
            <title type="text"><![CDATA[SlouchDetector]]></title>
            <link rel="alternate" href="https://slouchdetector.net/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/134"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[get alerted when you slouch.

SlouchDetector uses MediaPipe face detection to learn your ideal sitting posture and reminds you to sit up when you slouch. All processing happens locally in your browser; no video data is sent to any server.

- [SlouchDetector @ GitHub](https://github.com/alexanderkranga/slouchdetector).

Related contents:

- [SlouchDetector - Quand votre webcam vous rappelle de vous tenir droit @ Korben :fr:](https://korben.info/slouchdetector-webcam-vous-rappelle-vous-tenir.html).]]>
            </summary>
            <updated>2025-10-16T12:06:16+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/321</id>
            <title type="text"><![CDATA[Morphik]]></title>
            <link rel="alternate" href="https://www.morphik.ai/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/321"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Build Agents that Never Hallucinate.
Deploy the most accurate RAG in the world in two lines of code.

 The most accurate document search and store for building AI apps.

- [Morphik @ GitHub](https://github.com/morphik-org/morphik-core).

Related contents:

- [Don&amp;#039;t bother parsing: Just use images for RAG @ Morphik](https://www.morphik.ai/blog/stop-parsing-docs).]]>
            </summary>
            <updated>2026-01-13T13:25:14+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/660</id>
            <title type="text"><![CDATA[sports]]></title>
            <link rel="alternate" href="https://github.com/roboflow/sports" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/660"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[computer vision and sports.

In sports, every centimeter and every second matter. That&amp;#039;s why Roboflow decided to use sports as a testing ground to push our object detection, image segmentation, keypoint detection, and foundational models to their limits. This repository contains reusable tools that can be applied in sports and beyond.]]>
            </summary>
            <updated>2025-08-28T17:48:08+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/849</id>
            <title type="text"><![CDATA[Index]]></title>
            <link rel="alternate" href="https://github.com/lmnr-ai/index" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/849"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Open-Source Browser Agent for autonomously performing complex tasks on the web]]>
            </summary>
            <updated>2025-08-28T18:19:25+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/878</id>
            <title type="text"><![CDATA[Viseron]]></title>
            <link rel="alternate" href="https://github.com/roflcoopter/viseron" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/878"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Self-hosted, local only NVR and AI Computer Vision software. With features such as object detection, motion detection, face recognition and more, it gives you the power to keep an eye on your home, office or any other place you want to monitor.]]>
            </summary>
            <updated>2025-08-28T18:24:28+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/2786</id>
            <title type="text"><![CDATA[Ultralytics YOLO]]></title>
            <link rel="alternate" href="https://docs.ultralytics.com/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/2786"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Ultralytics YOLO11 is a cutting-edge, state-of-the-art (SOTA) model that builds upon the success of previous YOLO versions and introduces new features and improvements to further boost performance and flexibility. YOLO11 is designed to be fast, accurate, and easy to use, making it an excellent choice for a wide range of object detection and tracking, instance segmentation, image classification and pose estimation tasks.

- [Ultralytics YOLO @ GitHub](https://github.com/ultralytics/ultralytics).]]>
            </summary>
            <updated>2025-08-28T23:41:24+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/4158</id>
            <title type="text"><![CDATA[Supervision]]></title>
            <link rel="alternate" href="https://supervision.roboflow.com/latest/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/4158"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[We write your reusable computer vision tools. Whether you need to load your dataset from your hard drive, draw detections on an image or video, or count how many detections are in a zone. You can count on us!

Supervision provides a seamless process for annotating predictions generated by various object detection and segmentation models.

- [Supervision @ GitHub](https://github.com/roboflow/supervision).]]>
            </summary>
            <updated>2025-08-29T03:30:08+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/5440</id>
            <title type="text"><![CDATA[ImageBind]]></title>
            <link rel="alternate" href="https://github.com/facebookresearch/ImageBind" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/5440"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[ImageBind One Embedding Space to Bind Them All.

PyTorch implementation and pretrained models for ImageBind. For details, see the paper: ImageBind: One Embedding Space To Bind Them All.

ImageBind learns a joint embedding across six different modalities - images, text, audio, depth, thermal, and IMU data. It enables novel emergent applications ‘out-of-the-box’ including cross-modal retrieval, composing modalities with arithmetic, cross-modal detection and generation.]]>
            </summary>
            <updated>2025-08-29T07:03:57+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/5486</id>
            <title type="text"><![CDATA[Semaphore]]></title>
            <link rel="alternate" href="https://github.com/everythingishacked/Semaphore" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/5486"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[A full-body keyboard using gestures to type through computer vision.

Semaphore uses OpenCV and MediaPipe&amp;#039;s Pose detection to perform real-time detection of body landmarks from video input. From there, relative differences are calculated to determine specific positions and translate those into keys and commands sent via keyboard.]]>
            </summary>
            <updated>2025-08-29T07:12:01+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/5522</id>
            <title type="text"><![CDATA[GVision]]></title>
            <link rel="alternate" href="https://github.com/GONZOsint/gvision" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/5522"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[GVision is a reverse image search app that use Google Cloud Vision API to detect landmarks and web entities from images, helping you gather valuable information quickly and easily.]]>
            </summary>
            <updated>2025-08-29T07:17:03+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/6179</id>
            <title type="text"><![CDATA[YOLOv5]]></title>
            <link rel="alternate" href="https://github.com/ultralytics/yolov5" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/6179"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[YOLOv5 in PyTorch &amp;gt; ONNX &amp;gt; CoreML &amp;gt; TFLite.
YOLOv5 is the world&amp;#039;s most loved vision AI, representing Ultralytics open-source research into future vision AI methods, incorporating lessons learned and best practices evolved over thousands of hours of research and development.]]>
            </summary>
            <updated>2025-08-29T09:06:58+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/6634</id>
            <title type="text"><![CDATA[OpenCV]]></title>
            <link rel="alternate" href="https://opencv.org/" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/6634"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Open Computer Vision. Open source machine learning library for computer vision.

- [OpenCV @ GitHub](https://github.com/opencv/opencv).
- [OpenCV on Wheels @ GitHub](https://github.com/opencv/opencv-python).

Related contents:

- [OpenCV Course - Full Tutorial with Python @ freeCodeCamp.org&amp;#039;s YouTube](https://www.youtube.com/watch?v=oXlwWbU8l2o).]]>
            </summary>
            <updated>2025-08-29T10:22:41+00:00</updated>
        </entry>
            <entry>
            <id>https://links.biapy.com/links/7480</id>
            <title type="text"><![CDATA[Altify]]></title>
            <link rel="alternate" href="https://github.com/ParhamP/altify" />
            <link rel="via" type="application/atom+xml" href="https://links.biapy.com/links/7480"/>
            <author>
                <name><![CDATA[Biapy]]></name>
            </author>
            <summary type="text">
                <![CDATA[Altify automizes the task of inserting alternative text attributes for image tags. Altify uses Microsoft Computer Vision API&amp;#039;s deep learning algorithms to caption images in an HTML file and returns a new HTML file in which alt attributes are filled out with their corresponding captions.]]>
            </summary>
            <updated>2025-08-29T12:43:56+00:00</updated>
        </entry>
    </feed>
