Tensor Cores

A way to use tensor cores is needed. The API should draw on the family of [`VectorXXX` intrinsics in .NET](https://learn.microsoft.com/en-us/dotnet/api/system.numerics.vector-1?view=net-8.0) and/or the [Vulkan Cooperative Matrix](https://registry.khronos.org/vulkan/specs/1.1-extensions/html/vkspec.html#VK_NV_cooperative_matrix) extension proposed by NVIDIA.

Related CUDA (PTX) documentation on warp-level matrix instructions: https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#warp-level-matrix-instructions
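For context, the operation exposed by both the warp-level matrix instructions and the cooperative matrix extension is a tile-wise multiply-accumulate, `D = A * B + C`, typically with low-precision inputs and a wider accumulator. Below is a minimal reference model of that semantic in plain Python. It is only a sketch: the names `to_f16`, `to_f32`, and `tile_mma` are hypothetical and not part of ILGPU, CUDA, or Vulkan, and the sequential accumulation order is an assumption, since real hardware may sum and round differently.

```python
import struct


def to_f16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_f32(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def tile_mma(a, b, c):
    """Hypothetical reference model of D = A * B + C for one tile.

    a is M x K, b is K x N, c is M x N (lists of lists of floats).
    Inputs a and b are rounded to half precision; the accumulator is
    kept in single precision. The left-to-right summation order is an
    assumption made for this sketch.
    """
    m, k, n = len(a), len(b), len(b[0])
    assert all(len(row) == k for row in a), "A must be M x K"
    assert all(len(row) == n for row in b), "B must be K x N"
    assert len(c) == m and all(len(row) == n for row in c), "C must be M x N"

    d = []
    for i in range(m):
        row = []
        for j in range(n):
            acc = to_f32(c[i][j])
            for p in range(k):
                # Half-precision operands, single-precision accumulate.
                acc = to_f32(acc + to_f16(a[i][p]) * to_f16(b[p][j]))
            row.append(acc)
        d.append(row)
    return d
```

A tensor-core API would expose this as a single cooperative operation over a fixed tile shape (for example 16 x 16 x 16) executed by a whole warp or subgroup, rather than as per-thread scalar loops.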

This is also mentioned in https://github.com/m4rs-mt/ILGPU/issues/923, but the latter is more about support for shorter floats in general.

Tensor Cores #996
