fastT5

On the site since December 16, 2022 16:08

fastT5 reduces T5 model size by 3X and increases inference speed by up to 5X. T5 models can be used for several NLP tasks such as summarization, question answering (QA), question generation (QG), translation, text generation, and more. Sequential text generation is inherently slow, and it gets slower as T5 models grow. fastT5 speeds up T5 inference by running the model on onnxruntime, and it shrinks the model by quantizing it.
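
To make the size reduction concrete, here is a minimal sketch of symmetric 8-bit quantization in plain Python. It illustrates the general idea only and is not fastT5's own code: the function names `quantize_int8` and `dequantize` and the sample weights are hypothetical, and the per-tensor symmetric scheme is an assumption chosen for simplicity.

```python
from array import array


def quantize_int8(weights):
    """Map floats to signed 8-bit integers with one shared scale (hypothetical helper)."""
    max_abs = max(abs(w) for w in weights)
    # One scale for the whole tensor; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the int8 range used here.
    values = (max(-127, min(127, round(w / scale))) for w in weights)
    return array("b", values), scale


def dequantize(quantized, scale):
    """Recover approximate float values from the int8 representation."""
    return [v * scale for v in quantized]


# Made-up sample weights, stored once as 32-bit floats and once as int8.
weights = [0.5, -1.0, 0.25, 0.75, -0.125, 1.0, 0.0, -0.5]
fp32 = array("f", weights)
quantized, scale = quantize_int8(weights)
restored = dequantize(quantized, scale)

fp32_bytes = len(fp32) * fp32.itemsize
int8_bytes = len(quantized) * quantized.itemsize
max_error = max(abs(a - b) for a, b in zip(weights, restored))

print(f"float32: {fp32_bytes} bytes, int8: {int8_bytes} bytes")
print(f"largest round-trip error: {max_error:.6f}")
```

Each quantized weight takes one byte instead of four, at the cost of a rounding error of at most half a quantization step. The 3X figure quoted above is for a whole model file, which is smaller than this per-tensor ratio; one plausible reason (an assumption, not stated in the source) is that not every part of the file is quantized.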


