Tag: nvidia - Biapy's Bookmarks

nvidia

LACT

https://github.com/ilya-zlobintsev/LACT

Linux GPU Configuration And Monitoring Tool.

This application allows you to control your AMD, Nvidia or Intel GPU on a Linux system.

foss gaming gpu linux mit-licensed nvidia open-source performance radeon software

Added 2 months ago

nvidia/parakeet-tdt-0.6b-v2 @ Hugging Face

https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2

parakeet-tdt-0.6b-v2 is a 600-million-parameter automatic speech recognition (ASR) model designed for high-quality English transcription, featuring support for punctuation, capitalization, and accurate timestamp prediction.

Related contents:

Transcribe speech 100x faster and 100x cheaper with open models @ Modal.

ai cc-by-4-licensed nvidia speech-recognition

Added 2 months ago

KAI Scheduler

https://github.com/NVIDIA/KAI-Scheduler

KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale

ai apache2-licensed foss kubernetes llm nvidia open-source scheduler

Added 4 months ago

NVIDIA PhysX

https://nvidia-omniverse.github.io/PhysX/

NVIDIA PhysX SDK.

This repository contains source releases of the PhysX, Flow, and Blast SDKs used in NVIDIA Omniverse.

NVIDIA PhysX @ GitHub.

bsd3-licensed development foss game-engine nvidia open-source physics sdk

Added 6 months ago

NVIDIA Dynamo

https://developer.nvidia.com/dynamo

A Datacenter Scale Distributed Inference Serving Framework.

NVIDIA Dynamo is a high-throughput low-latency inference framework designed for serving generative AI and reasoning models in multi-node distributed environments. Dynamo is designed to be inference engine agnostic (supports TRT-LLM, vLLM, SGLang or others) and captures LLM-specific capabilities.

Dynamo @ GitHub.

Related contents:

A closer look at Dynamo, Nvidia's 'operating system' for AI inference @ The register.

ai apache2-licensed distributed foss genai inference llm machine-learning nvidia open-source

Added 7 months ago

GPU Glossary

https://modal.com/gpu-glossary/readme

We wrote this glossary to solve a problem we ran into working with GPUs here at Modal : the documentation is fragmented, making it difficult to connect concepts at different levels of the stack, like Streaming Multiprocessor Architecture , Compute Capability , and nvcc compiler flags .

cuda data-science documentation e-learning gpu machine-learning nvidia

Added 9 months ago

exo

https://github.com/exo-explore/exo

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Forget expensive NVIDIA GPUs, unify your existing devices into one powerful GPU: iPhone, iPad, Android, Mac, Linux, pretty much any device!

Related contents:

ai android cluster foss ios linux llm machine-learning nvidia open-source python

Added 1 year ago

TensorRT SDK

https://developer.nvidia.com/tensorrt

NVIDIA® TensorRT™ is an ecosystem of APIs for high-performance deep learning inference. TensorRT includes an inference runtime and model optimizations that deliver low latency and high throughput for production applications. The TensorRT ecosystem includes TensorRT, TensorRT-LLM, TensorRT Model Optimizer, and TensorRT Cloud.

TensorRT Open Source Software @ GitHub.

data-science development machine-learning nvidia sdk

Added 1 year ago

nvitop

https://github.com/XuehaiPan/nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

command-line gpu nvidia open-source python

Added 2 years ago

Headless Steam Service

https://github.com/Steam-Headless/docker-steam-headless

A Headless Steam Docker image supporting NVIDIA GPU and accessible via Web UI. Play your games in the browser with audio. Connect another device and use it with Steam Remote Play. Easily deploy a Steam Docker instance in seconds.

docker game gaming headless nvidia self-hosted steam web-app

Added 2 years ago

vramfs

https://github.com/Overv/vramfs

Unused RAM is wasted RAM, so why not put some of that VRAM in your graphics card to work?

vramfs is a utility that uses the FUSE library to create a file system in VRAM. The idea is pretty much the same as a ramdisk, except that it uses the video RAM of a discrete graphics card to store files. It is not intented for serious use, but it does actually work fairly well, especially since consumer GPUs with 4GB or more VRAM are now available.

On the developer's system, the continuous read performance is ~2.4 GB/s and write performance 2.0 GB/s, which is about 1/3 of what is achievable with a ramdisk. That is already decent enough for a device not designed for large data transfers to the host, but future development should aim to get closer to the PCI-e bandwidth limits. See the benchmarks section for more info.

cuda nvidia radeon ramfs sysadmin système

Added 10 years ago