itsyourboiirow
itsyourboiirow t1_jecqjqd wrote
Reply to comment by Evening_Ad6637 in [D] Training a 65b LLaMA model by Business-Lead2679
Training requires significantly more memory, as it has to keep track of the gradient for every parameter. I would check to see how much memory it takes up on your computer.
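As a rough back-of-the-envelope sketch (my own assumptions, not from the thread: fp16 weights and gradients, Adam optimizer with two fp32 states per parameter, activations ignored):

```python
# Rough memory estimate for full fine-tuning of a 65B-parameter model.
# Assumptions: fp16 weights/grads (2 bytes each), Adam with two fp32
# states (m, v) per parameter; activation memory is NOT included.
params = 65e9

weights_gb   = params * 2 / 1e9          # fp16 weights
grads_gb     = params * 2 / 1e9          # fp16 gradients
optimizer_gb = params * 2 * 4 / 1e9      # two fp32 Adam states

total_gb = weights_gb + grads_gb + optimizer_gb
print(f"~{total_gb:.0f} GB before activations")  # ~780 GB
```

Compare that with inference, which only needs the weights (~130 GB under the same fp16 assumption), and it's clear why training is the hard part.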
itsyourboiirow t1_jecqc1d wrote
Reply to comment by Nhabls in [D] Training a 65b LLaMA model by Business-Lead2679
This is the only downside I've found. Sometimes it's too darn hard to find an instance.
itsyourboiirow t1_je7n7p8 wrote
Others have mentioned it, but do data augmentation (crop, resize, rotate, etc.) and you'll be able to increase the effective size of your dataset and improve results.
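A minimal sketch of what those augmentations look like in plain numpy (image and crop size are made up for illustration; in practice you'd use something like torchvision transforms):

```python
import numpy as np

rng = np.random.default_rng(0)
img = rng.integers(0, 256, size=(32, 32, 3), dtype=np.uint8)  # fake HxWxC image

flipped = img[:, ::-1]            # horizontal flip
rotated = np.rot90(img)           # 90-degree rotation

def random_crop(image, size):
    # pick a random top-left corner and slice out a size x size patch
    h, w = image.shape[:2]
    top = rng.integers(0, h - size + 1)
    left = rng.integers(0, w - size + 1)
    return image[top:top + size, left:left + size]

cropped = random_crop(img, 24)
print(flipped.shape, rotated.shape, cropped.shape)
```

Each transformed copy is a "new" training example from the model's point of view, which is where the dataset-size boost comes from.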
itsyourboiirow OP t1_iy9809b wrote
Reply to comment by DinosParkour in [D] Difference between sparse and dense information retrieval by itsyourboiirow
Thanks for the in-depth response!
itsyourboiirow t1_iy5aa1i wrote
Reply to comment by radarsat1 in [D] What method is state of the art dimensionality reduction by olmec-akeru
Correct. But you don't necessarily have to discard the extra dimensions to do PCA.
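To make that concrete: PCA with all components kept is just an orthogonal rotation onto the principal axes, so nothing is lost. A small numpy sketch (data and sizes are made up):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))

Xc = X - X.mean(axis=0)                     # center the data
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
scores = Xc @ Vt.T                          # project onto ALL principal axes

# Same dimensionality, just rotated; total variance is preserved.
print(scores.shape)  # (100, 5)
```

Dimensionality reduction only happens if you then choose to keep the first k columns of `scores`.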
Submitted by itsyourboiirow t3_z76uel in MachineLearning
itsyourboiirow t1_iskrzq9 wrote
Reply to comment by Mmm36sa in [D] Simple Questions Thread by AutoModerator
You could try PCA and a random forest or a k-nearest neighbors classifier.
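For a flavor of the k-NN half of that suggestion, here's a toy implementation in plain numpy (the data and function name are mine; in practice you'd reach for scikit-learn, possibly with PCA applied first):

```python
import numpy as np

def knn_predict(X_train, y_train, X_test, k=3):
    # Euclidean distance from every test point to every training point
    d = np.linalg.norm(X_test[:, None, :] - X_train[None, :, :], axis=-1)
    nearest = np.argsort(d, axis=1)[:, :k]        # indices of the k closest
    votes = y_train[nearest]                      # their class labels
    # majority vote per test point
    return np.array([np.bincount(v).argmax() for v in votes])

# Two well-separated clusters as toy training data
X = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
              [5.0, 5.0], [5.1, 5.0], [5.0, 5.1]])
y = np.array([0, 0, 0, 1, 1, 1])

preds = knn_predict(X, y, np.array([[0.05, 0.05], [5.05, 5.05]]))
print(preds)  # [0 1]
```

PCA before k-NN helps because distances in a lower-dimensional space are cheaper to compute and less noisy.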
itsyourboiirow t1_iskrfyz wrote
Reply to comment by liljontz in [D] Simple Questions Thread by AutoModerator
If you are doing it to learn and for fun, I would look into a recurrent neural network (RNN) or a long short-term memory (LSTM) model for generation. They're really good at picking up patterns in text. I'm sure it would be able to do it well with enough training data.
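To give a flavor of what an LSTM does under the hood, here is a single forward step in plain numpy (shapes and names are my own, not from any library; real projects would use `torch.nn.LSTM` or similar):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h, c, W, U, b):
    """One LSTM time step. x: input vector, (h, c): previous hidden/cell state."""
    H = h.shape[0]
    z = W @ x + U @ h + b            # all four gates computed at once, shape (4H,)
    i = sigmoid(z[:H])               # input gate
    f = sigmoid(z[H:2*H])            # forget gate
    g = np.tanh(z[2*H:3*H])          # candidate cell update
    o = sigmoid(z[3*H:])             # output gate
    c = f * c + i * g                # new cell state
    h = o * np.tanh(c)               # new hidden state
    return h, c

rng = np.random.default_rng(0)
D, H = 8, 4                          # input and hidden sizes
W = rng.normal(size=(4 * H, D))
U = rng.normal(size=(4 * H, H))
b = np.zeros(4 * H)

h, c = np.zeros(H), np.zeros(H)
for t in range(5):                   # run the cell over a short sequence
    h, c = lstm_step(rng.normal(size=D), h, c, W, U, b)
print(h.shape)  # (4,)
```

The forget gate `f` is what lets the cell state carry information across many characters, which is why LSTMs pick up longer-range text patterns than plain RNNs.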
itsyourboiirow t1_iskqzyh wrote
Reply to comment by whydontigetbetter01 in [D] Simple Questions Thread by AutoModerator
I don't know what Flutter is. But PyTorch has methods that will optimize a model for mobile devices and make it GPU compatible for both iOS and Android.
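A minimal sketch of that PyTorch mobile workflow (the tiny model and filename are made up; the API calls are PyTorch's TorchScript/mobile tooling):

```python
import torch
from torch.utils.mobile_optimizer import optimize_for_mobile

class TinyNet(torch.nn.Module):
    """Placeholder model standing in for whatever you actually trained."""
    def __init__(self):
        super().__init__()
        self.fc = torch.nn.Linear(4, 2)

    def forward(self, x):
        return self.fc(x)

model = TinyNet().eval()
scripted = torch.jit.script(model)             # convert to TorchScript
mobile = optimize_for_mobile(scripted)         # fuse/fold ops for mobile runtimes
mobile._save_for_lite_interpreter("tiny.ptl")  # artifact loadable on iOS/Android
```

The saved `.ptl` file is what the mobile runtime loads on-device; the Python side is only needed for export.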
itsyourboiirow t1_iskqchc wrote
Reply to comment by ABCDofDataScience in [D] Simple Questions Thread by AutoModerator
Yeah, I'm not sure about the details. But I would guess it's so you can use backpropagation and loss functions on your NN.
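Concretely, the point of differentiable operations is that you can compute a loss and push its gradient back to the weights. A minimal numpy sketch of gradient descent on a mean-squared-error loss (all data and names are mine, for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))
true_w = np.array([1.0, -2.0, 0.5])
y = X @ true_w                               # targets from a known linear model

w = np.zeros(3)
lr = 0.1
for _ in range(200):
    pred = X @ w
    loss = np.mean((pred - y) ** 2)          # MSE loss
    grad = 2 * X.T @ (pred - y) / len(y)     # gradient of the loss w.r.t. w
    w -= lr * grad                           # gradient descent step

print(np.round(w, 2))  # recovers roughly [ 1.  -2.   0.5]
```

Autograd frameworks do exactly this, except the `grad` line is derived automatically for arbitrary network architectures.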
itsyourboiirow t1_jefs1oh wrote
Reply to [D] Simple Questions Thread by AutoModerator
Any people/organizations to follow on Twitter for all things machine learning (traditional ML, deep neural networks, LLMs, etc.)?