llm-studio / documentation /docs /tooltips /experiments /_deepspeed-allgather-bucket-size.mdx
qinfeng722's picture
Upload 322 files
5caedb4 verified
raw
history blame contribute delete
182 Bytes
Number of elements allgather at a time. Limits the memory required for the allgather for large model sizes. Smaller values use less GPU memory, but slow down training and validating.