Petals: Collaborative Inference and Fine-tuning of Large Models
[Submitted on 2 Sep 2022 (v1), last revised 2 Mar 2023 (this version, v2)]
Abstract: Many NLP tasks benefit from using large language models (LLMs) that often have more than 100 billion parameters. With the release of BLOOM-176B and OPT-175B, everyone can download pretrained models of this scale. Still, using these models requires…