llama-cpp-orin Nix flake to run llama.cpp with CUDA acceleration on the Jetson Orin Nano. To run the default server config: nix run To run llama-server in router mode: nix run .#llama-server-router