Submitted by frodo_mavinchotil t3_124xa6b in MachineLearning

I am not comfortable with idea that the codes I write will be logged into their server. Is there any alternate to wandb which can be hosted locally in my machine or in a common server where a team of people can collaborate ?

5

Comments

You must log in or register to comment.

currentscurrents t1_je1cjaq wrote

Wandb can be run locally. There is also tensorboard.

12

MrBrito t1_je2o0sn wrote

WandB is great tool. But I'm not a huge of the licensing for self-hosting it.

The free licence does not allow team collaboration, apparently you have to pay per GB of artifacts above 100GB (even if the data is on private cloud object storage like S3) and is only limited to docker deployments (Kubernetes only on Enterprise edition).

1

Kisscool-citron t1_je2nllg wrote

Wandb only logs what you tell it to log. The data hosted on their server is opt-in, meaning you explicitly log what you want when you use their API.

If you don't trust any third party with your process, the local setup seems pretty straightforward (docker container), info at https://docs.wandb.ai/guides/hosting/basic-setup.

I did use some other experiment tracking software and found wandb to be easier and full of useful features. Granted I didn't try MLFlow, but wandb had almost everything I needed so no point in trying all the options. (I wanted a mix of collaborative Tensorboard+Git+DVC+Pachyderm but on windows)

8

Clicketrie t1_jeapcmv wrote

At least with tools like Comet and W&B you have authentication and you can avoid logging anything you don't want logged. Mlflow has no authentication.

2

GirlScoutCookieGrow t1_jeeyz2u wrote

You don't have to log your code. I don't think there's much sense in being paranoid about it anyways. What do you think will happen, they go thru everyone's code and try to steal ideas? That's silly

1