Loading…
Saved in:
Publication Year: | 2024 |
---|---|
Subject Terms: | Computer Science - Artificial Intelligence |
Description: |
The ever-increasing sizes of large language models necessitate distributed solutions for fast inference that exploit multi-dimensional paral
|
Database: | arXiv |