Add pad_head_dim_to_multiple_of field to allow the use of memory efficient attention.
pad_head_dim_to_multiple_of
· Sign up or log in to comment