Building a bioinformatics workflow with Flyte
Flyte (https://flyte.org/) is a powerful modern workflow system that was originally developed at Lyft. It is now available as an open source platform for the community. Flyte can run a wide variety of workflow types, including those for machine learning and bioinformatics.
Flyte runs on Kubernetes (https://kubernetes.io/). Flyte can be run in a local environment (on your laptop) or in a remote environment. The remote environment could be on a Docker cluster running on your laptop, on a High Performance Computing (HPC) cluster at your company, or in the cloud on a Kubernetes cluster such as OpenShift (https://developers.redhat.com/products/openshift/).
Flyte defines tasks, which are the fundamental units of computation, implemented as containerized applications. It builds a workflow as a series of tasks that can have dependencies, meaning, for example, that Task B cannot be run until Task A is completed. Flyte entities are broadly...