InfiniCore-Infer

本项目是基于 InfiniCore 的推理引擎。

使用方式

编译并安装 InfiniCore 。注意根据提示设置好 INFINI_ROOT 环境变量（默认为 $HOME/.infini）。
编译并安装 InfiniCore-Infer

xmake && xmake install

运行模型推理测试

python jiuge.py [--cpu | --nvidia | --cambricon | --ascend | --metax | --moore] <path/to/model_dir> [n_device]

部署模型推理服务

launch_server.py [-h] [--dev {cpu,nvidia,cambricon,ascend,metax,moore}]
                        [--model-path MODEL_PATH] [--ndev NDEV] [--max-batch MAX_BATCH]
                        [--max-tokens MAX_TOKENS]

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
include		include
scripts		scripts
src		src
.clang-format		.clang-format
.gitignore		.gitignore
README.md		README.md
xmake.lua		xmake.lua

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

InfiniCore-Infer

使用方式

About

Uh oh!

Releases

Packages

Contributors 2

Languages

InfiniTensor/InfiniCore-Infer

Folders and files

Latest commit

History

Repository files navigation

InfiniCore-Infer

使用方式

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages