Milestone v0.3.0

**What would you like to be added**:

See the whole list: https://github.com/InftyAI/llmaz/milestone/3

### We'll focus on three main things:

- [ ] xPyD serving with heterogeneous devices, we need a new orchestration layer build on top of lws
  - [ ] disaggregate PD serving
  - [ ] aggregate PD serving

- [ ] More advanced routing policies, e.g. based on request profile & GPU type
- [ ] GPU spot instances scaling ready for production env

### Glad to have like:

- [ ] Advanced Pod scaling with dedicated scaler

**Why is this needed**:

**Completion requirements**:

This enhancement requires the following artifacts:

- [ ] Design doc
- [ ] API change
- [ ] Docs update

The artifacts should be linked in subsequent comments.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Milestone v0.3.0 #433

We'll focus on three main things:

Glad to have like:

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Milestone v0.3.0 #433

Description

We'll focus on three main things:

Glad to have like:

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions