Skip to content
This repository was archived by the owner on Jan 9, 2020. It is now read-only.
This repository was archived by the owner on Jan 9, 2020. It is now read-only.

Support various K8S scheduling options for multi-tenancy, prevent deadlocks. #127

Open
@ssuchter

Description

@ssuchter

One of the things we'll want (definitely not in the alpha) to have a good answer to is how multiple tenants on a single Kubernetes cluster will run Spark apps at the same time. Since Big Data apps (including Spark) frequently use a lot of resources (e.g. lots of executor pods) scheduling can become complicated, with the possibility of deadlocks. There was a request that I share this doc that was written for explaining some of the history of the YARN approach to several challenges. We definitely might not choose to solve them the same way, but understanding the issues is important.

Please see the Google Doc here, and comment!

http://tiny.pepperdata.com/yarn-scheduling-choices

I imagine this might turn into some kind of uber-issue in the future...

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions