Skip to content
kube_plugin_be…
Go to file
Code
This branch is 3 commits ahead of valayDave:kube_plugin_beta_test.

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

README.md

Our changes

Instead of actually executing a metaflow spec this version just generates an Argo workflow YAML on stdout which can then in turn orchestrate the whole DAG on top of Argo instead of orchestrating it natively via metaflow runtime.

Limitations

  • build the yaml in a proper tree-based way instead of string manipulations
  • currently not parsing the DAG properly; just sequentializing all steps (not exploiting potential concurrencies)
  • template is currently fixed to a python image; this needs more flexibility in the final version
  • no step placeholders for resource requirements
  • if input paths exceed 32k characters we cut off atm
  • AWS secrets are visible on pod level
  • current implementation does not support fanout via foreach as it's fully static and doesn't happen at runtime; for a proper fanout we would either need a mini orchestration in a pod (then again we would orchestrate without argo) or not try to generate the whole workflow statically or do orchestrate it via several argo workflows running in sequence (whereas workflow B is only generated when results of workflow A are known for a overall flow of ... -> A -> B -> ...)

Metaflow

Metaflow is a human-friendly Python library that helps scientists and engineers build and manage real-life data science projects. Metaflow was originally developed at Netflix to boost productivity of data scientists who work on a wide variety of projects from classical statistics to state-of-the-art deep learning.

For more information, see Metaflow's website.

Getting Started

Getting up and running with Metaflow is easy. Install metaflow from pypi:

pip install metaflow

and access tutorials by typing:

metaflow

Get in Touch

There are several ways to get in touch with us:

Kubernetes Plugin Documentation / Demo

https://github.com/valayDave/metaflow-kube-demo

How to Setup Kuberentes to Work with Metaflow

https://github.com/valayDave/metaflow-on-kubernetes-docs

About

Build and manage real-life data science projects with ease.

Resources

License

Packages

No packages published
You can’t perform that action at this time.