Search results for
tensorrt-llm
NVIDIA H100 GPUs and TensorRT-LLM Achieve Breakthrough Performance for Mixtral 8x7B
NVIDIA's H100 Tensor Core GPUs and TensorRT-LLM software demonstrate record-breaking performance for the Mixtral 8x7B model, leveraging FP8 precision.
NVIDIA TensorRT-LLM Boosts Hebrew LLM Performance
NVIDIA's TensorRT-LLM and Triton Inference Server optimize performance for Hebrew large language models, overcoming unique linguistic challenges.