Viewing a single comment thread. View all comments

amigo213a t1_is9mxyh wrote

I do MLOps on daily basis and have setup something from scratch in my company. The only thing I have to say is that, the most popular ones available out there are not going to help. Take Kubeflow for example, you need to hands on experience with Kubernetes to be able to setup good workflows/pipelines but most of the users who would be working are your Data Scientists/Machine Learning experts who wouldn't be any expert. They hardly build solution that scales as well. So it comes down to the MLOps platform to be able to meet with their weird requirements.

Choosing Kubernetes is a good started point, it lets you scale out, run workloads in isolation and many great things. Either you could setup your own infra in the company or choose one of the managed clusters from AWS/GCP/Azure depending on their pricing. Only good thing about cloud providers is that you don't need to take care of the infra on your side. Like for e.g, if you want to spin-off your own Text-to-Image service then you could easily containerize push the solution onto different region based kubernetes cluster on AWS or other cloud. You can easily get CDN for scaling the serving based on regions easily on AWS.

12

lifesthateasy OP t1_is9o4os wrote

Yes I am leaning towards a kubernetes/kubeflow setup, and I'll definitely be working with architects to get it set up properly. I just want to make sure I have some knowledge about how to approach such a thing, to be able to decide if what we're doing is very wrong or just a little :)

4

jcoffi t1_isb4tja wrote

If you're hiring someone because you don't have the nuanced experience, it doesn't take sense to lean towards anything. Let the expert decide. But be well informed.

2

lifesthateasy OP t1_isbv2bp wrote

Yes, I'm trying to achieve the "well-informed" part here :)

2

Rarc1111 t1_isdw8qg wrote

"They hardly build solution that scales as well. So it comes down to the MLOps platform to be able to meet with their weird requirements."

This.

MLOps is not the bottleneck, go with something as simple as possible, as you will be spending most of your time pretending you are not rewriting their entire code.

4