NVIDIA Corporation
- 26.4k followers
- 2788 San Tomas Expressway, Santa Clara, CA, 95051
- https://nvidia.com
Repositories
- Model-Optimizer Public
A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
- NeMo-Retriever Public
NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever Library uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
- NeMo-Relay Public
Multi-language agent runtime for execution scope management, lifecycle events, and middleware on tool and LLM calls.
- infra-controller Public
NVIDIA Infra Controller - Hardware Lifecycle Management and multitenant networking