Open
Description
Attempting to run a persimmon model with the CUDA backend fails an assertion in ggml_cuda_rope: ggml_is_contiguous(src0)
ref #5668 (comment)
Attempting to run a persimmon model with the CUDA backend fails an assertion in ggml_cuda_rope: ggml_is_contiguous(src0)
ref #5668 (comment)