TensorRT

на сайте с 06 января 2024, 09:04
NVIDIA TensorRT-LLM is an open-source library that accelerates and optimizes inference performance of the latest large language models (LLMs) on the NVIDIA AI ...