## Overview
This PR overhauls the support for building accelerators from `conifer` models. There are two main aspects:
### XRT drivers
Since its `v3.1` release, `pynq` is deprecating support for Alveo platforms. I explored XRT's provided Python bindings via `pyxrt`, and made some working implementations. However, `pyxrt` doesn't expose XRT's API for reading and writing registers. This functionality is quite useful for the quality-of-life features of the drivers, since the IPs produced with `conifer` contain some high-level information which needs to be retrieved via registers. For that reason this PR provides a native XRT C++ driver implementation, with Python bindings via `pybind11`. Happily, this also turns out to be marginally faster than `pyxrt`.
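For a flavour of the approach, here is a minimal sketch of binding XRT's native register access into Python with `pybind11`. It is not the actual `conifer-xrt` driver: the class name, module name, kernel name `my_ip`, and constructor arguments are placeholders, and the `experimental/xrt_ip.h` header path can vary between XRT versions.

```cpp
// Sketch only: a tiny XRT driver exposing register reads/writes to Python.
#include <pybind11/pybind11.h>
#include <xrt/xrt_device.h>
#include <experimental/xrt_ip.h>  // header location varies by XRT version

namespace py = pybind11;

class Driver {
public:
  Driver(const std::string& xclbin_path, unsigned device_index)
      : device_(device_index),
        uuid_(device_.load_xclbin(xclbin_path)),
        ip_(device_, uuid_, "my_ip") {}  // "my_ip" is a placeholder kernel name

  // Register access via XRT's native C++ API; this is what pyxrt
  // doesn't expose, motivating the pybind11 binding.
  uint32_t read_register(uint32_t offset) { return ip_.read_register(offset); }
  void write_register(uint32_t offset, uint32_t value) {
    ip_.write_register(offset, value);
  }

private:
  xrt::device device_;
  xrt::uuid uuid_;
  xrt::ip ip_;
};

PYBIND11_MODULE(xrt_driver_sketch, m) {
  py::class_<Driver>(m, "Driver")
      .def(py::init<const std::string&, unsigned>())
      .def("read_register", &Driver::read_register)
      .def("write_register", &Driver::write_register);
}
```

From Python, the bound class would then be used along the lines of `Driver("accelerator.xclbin", 0).read_register(0x10)` to pull the IP's metadata out of its register space.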
These drivers are split into a standalone package within the repo named `conifer-xrt`. This enables installing the package via `pip`, without introducing a dependency on an XRT installation for all of `conifer`. XRT needs to be manually installed by users via AMD/Xilinx, and is only relevant for these accelerator designs.

I also add the nlohmann JSON library as a submodule, since the
`conifer-xrt` package needs to build against it at installation time (see the sketch below for a flavour of what that enables). In a follow-up PR I plan to make use of it for the C++ backend, relating to #87.

This PR solves #99 (in spirit).
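As a rough, hypothetical illustration of the nlohmann dependency — the file name and `n_trees` field below are placeholders, not `conifer`'s actual schema:

```cpp
// Sketch only: parsing a JSON model/config file with nlohmann JSON.
#include <fstream>
#include <iostream>
#include <nlohmann/json.hpp>

int main() {
  std::ifstream f("model.json");                    // placeholder file name
  nlohmann::json model = nlohmann::json::parse(f);  // parse the whole stream
  // Read a (hypothetical) field, with a default if it's absent
  int n_trees = model.value("n_trees", 0);
  std::cout << "trees: " << n_trees << "\n";
  return 0;
}
```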
### load/compute/store
This part relates to the Xilinx HLS backend top-level wrapper, which has been refactored to properly partition the load, compute, and store tasks following AMD/Xilinx guidelines. I've written up the implementation and observations, but the main plot is here:

*(plot)*
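For reference, here is a minimal sketch of the canonical load/compute/store structure from the Vitis HLS methodology — a dummy compute stage and placeholder types, not the actual `conifer` wrapper:

```cpp
// Sketch only: load/compute/store stages connected by streams so that
// the DATAFLOW pragma can run them concurrently.
#include <hls_stream.h>

typedef float T;
const int N = 1024;

static void load(const T* in, hls::stream<T>& s) {
  for (int i = 0; i < N; i++) {
    #pragma HLS pipeline II=1
    s.write(in[i]);
  }
}

static void compute(hls::stream<T>& in, hls::stream<T>& out) {
  for (int i = 0; i < N; i++) {
    #pragma HLS pipeline II=1
    out.write(in.read() * 2);  // stand-in for the tree-ensemble inference
  }
}

static void store(hls::stream<T>& s, T* out) {
  for (int i = 0; i < N; i++) {
    #pragma HLS pipeline II=1
    out[i] = s.read();
  }
}

extern "C" void top(const T* in, T* out) {
  #pragma HLS dataflow
  hls::stream<T> s_in("load_to_compute");
  hls::stream<T> s_out("compute_to_store");
  load(in, s_in);
  compute(s_in, s_out);
  store(s_out, out);
}
```

Splitting the stages this way lets reads from memory, inference, and writes back to memory overlap instead of serialising.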
Some attempts at this approach for the FPU backend only slowed things down, so I plan to come back to that.