JealousAd8448 t1_je0evw8 wrote on March 28, 2023 at 2:52 PM

Impressive work! Will check it out and try to contribute to the project 💪

culebra_risa t1_je0g8qt wrote on March 28, 2023 at 3:01 PM

Wow, reducing the finetuning time from 20 hours to 20 minutes is amazing :O

OrionJr t1_je0iequ wrote on March 28, 2023 at 3:15 PM

Can’t seem to install deep speed on windows or wsl

x_ml OP t1_je0mp14 wrote on March 28, 2023 at 3:43 PM

Deepspeed doesn't work on Windows yet but we were able to install in WSL. My colleague installed deepspeed in conda and then installed our package and it seemed to work.

subhash165 t1_je0mvvw wrote on March 28, 2023 at 3:44 PM

I was able to run it with WSL (with miniconda environment)

JealousAd8448 t1_je0mp1z wrote on March 28, 2023 at 3:43 PM

Unfortunately deepspeed is not easy to install on windows. Just use wsl, it did not give any problem to me using conda with python 3.8

sebzim4500 t1_je10iu2 wrote on March 28, 2023 at 5:11 PM

>Lower-precision fine-tuning (like INT8, INT4)

How would this work? Are the weight internally represented as f16 and then rounded stochastically whenever they are used?

MohamedRashad t1_je12hzd wrote on March 28, 2023 at 5:23 PM

Where does the model save after finetuned in the example in the README ?

[R] Build and personalize LLMs on your own data - Take back control with xTuring!

Comments