Commit d2d3482

refactor: Remove maxNumSequences parameter from MakeDecodingBatchInputOutput
- Removed maxNumSequences parameter from createDecoderBatchInputs and related function calls, streamlining the interface.
- Updated all relevant implementations and tests to reflect the changes in function signatures.

Signed-off-by: Robin Kobus <[email protected]>
1 parent a3d7cd6 commit d2d3482

1 file changed: +1 -1 lines changed

cpp/tensorrt_llm/pybind/batch_manager/algorithms.cpp

Lines changed: 1 addition & 1 deletion
@@ -136,7 +136,7 @@ void tensorrt_llm::pybind::batch_manager::algorithms::initBindings(pybind11::mod
         .def(py::init())
         .def("__call__", &MakeDecodingBatchInputOutput::operator(), py::arg("decoder_input_buffers"),
             py::arg("decoder_state"), py::arg("context_requests"), py::arg("generation_requests"), py::arg("logits"),
-            py::arg("model_config"), py::arg("max_num_sequences"), py::arg("fused_runtime_buffers") = std::nullopt)
+            py::arg("model_config"), py::arg("fused_runtime_buffers") = std::nullopt)
         .def("name", [](MakeDecodingBatchInputOutput const&) { return MakeDecodingBatchInputOutput::name; });
 
     py::class_<LogitsPostProcessor>(m, LogitsPostProcessor::name)
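
For Python callers, the practical effect of this hunk is that `max_num_sequences` is no longer an accepted keyword of `__call__`. The sketch below illustrates that with a plain Python stand-in whose keyword names are copied from the `py::arg(...)` list in the diff. The function `make_decoding_batch_input_output`, the helpers `call_new_style` and `call_old_style`, and all argument values are hypothetical placeholders, not the real TensorRT-LLM objects. A plain Python function rejects an unknown keyword with `TypeError`; pybind11 also reports unmatched arguments as a `TypeError`, though its message text differs.

```python
# Hypothetical stand-in for the bound __call__ after this commit.
# Only the keyword names come from the diff; the body is illustrative.
def make_decoding_batch_input_output(
    decoder_input_buffers,
    decoder_state,
    context_requests,
    generation_requests,
    logits,
    model_config,
    fused_runtime_buffers=None,  # mirrors the std::nullopt default in the binding
):
    # Echo back two arguments so the call can be inspected.
    return {
        "model_config": model_config,
        "fused_runtime_buffers": fused_runtime_buffers,
    }


def call_new_style():
    """Call with the post-commit keyword set (no max_num_sequences)."""
    return make_decoding_batch_input_output(
        decoder_input_buffers="buffers",  # placeholder values throughout
        decoder_state="state",
        context_requests=[],
        generation_requests=[],
        logits=[],
        model_config="config",
    )


def call_old_style():
    """Call with the removed keyword; return the resulting error, if any."""
    try:
        make_decoding_batch_input_output(
            decoder_input_buffers="buffers",
            decoder_state="state",
            context_requests=[],
            generation_requests=[],
            logits=[],
            model_config="config",
            max_num_sequences=8,  # removed by this commit
        )
    except TypeError as exc:
        return exc
    return None
```

Callers that previously passed `max_num_sequences` by keyword would need to drop it. Callers that passed it positionally would need to check that `fused_runtime_buffers` has not shifted into its slot.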
