_triszt t1_issbxym wrote on October 18, 2022 at 10:36 AM

PCA

Sadness24_7 OP t1_issrioz wrote on October 18, 2022 at 1:10 PM

I don't think PCA will help me. I need to reduce the number of features in order to simplify the system I'm working with. The removed features will no longer be acquired, so I can't retrain the model on them in the future. I need to somehow pick 2-10 features out of 38, fine-tune the model on those, and deploy it. Only the selected features will be logged from then on.

thePedrix t1_isstazu wrote on October 18, 2022 at 1:24 PM

Maybe you can do the PCA and then check the loadings?

Sadness24_7 OP t1_isszoev wrote on October 18, 2022 at 2:12 PM

But what am I looking for, though? I've been looking at the loadings matrix for a couple of minutes but can't really figure out the connections. Let's say I want to select 7 features out of 38, so I perform PCA with 7 components and look at the loading matrix (the correlations between the 38 features and the 7 principal components). Do I just take the component that correlates best with the input features and pick the 7 features with the highest correlation with that component?

thePedrix t1_ist0fv6 wrote on October 18, 2022 at 2:17 PM

I can’t be sure that it would work, but I would try this:

-PCA for N components

-Plot the first 2 or 3 principal components, depending on the cumulative explained variance (if 2 are enough, a 2D plot)

-Plot the magnitude of each variable's loadings on those components and see which are the most impactful. Pick the X features you want.

-Train the network with those X features.
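
The steps above could be sketched roughly as follows. This is only an illustration under several assumptions: PCA is done by hand with NumPy's SVD on standardized data, each feature's "magnitude" is taken to be its squared loadings summed over the kept components and weighted by explained variance (one heuristic among several), and the function name `rank_features_by_loadings` and the synthetic 38-column dataset are made up for the example. Note that PCA never looks at the target, so a high score means a feature carries a lot of the input variance, not necessarily that it is predictive.

```python
import numpy as np

def rank_features_by_loadings(X, n_components=2):
    """Rank the columns of X by variance-weighted squared PCA loadings.

    Hypothetical helper for illustration; not taken from the thread.
    """
    X = np.asarray(X, dtype=float)
    # Standardize so features with large units do not dominate the components.
    std = X.std(axis=0)
    std[std == 0] = 1.0  # leave constant columns at zero instead of dividing by 0
    Z = (X - X.mean(axis=0)) / std
    # PCA via SVD: the rows of Vt are the principal axes (unit-length loadings).
    _, s, Vt = np.linalg.svd(Z, full_matrices=False)
    explained = s**2 / np.sum(s**2)  # explained variance ratio per component
    k = min(n_components, Vt.shape[0])
    # Score of feature j = sum_i explained_i * loading_ij^2 over the kept components.
    scores = (explained[:k, None] * Vt[:k] ** 2).sum(axis=0)
    order = np.argsort(scores)[::-1]  # feature indices, most impactful first
    return order, scores, explained[:k]

# Synthetic stand-in for the 38-feature dataset (an assumption, not real data):
# columns 0-2 follow one hidden factor, columns 3-5 another, the rest are noise.
rng = np.random.default_rng(0)
n_samples = 500
X = rng.normal(size=(n_samples, 38))
factor_a = rng.normal(size=(n_samples, 1))
factor_b = rng.normal(size=(n_samples, 1))
X[:, 0:3] = factor_a + 0.1 * rng.normal(size=(n_samples, 3))
X[:, 3:6] = factor_b + 0.1 * rng.normal(size=(n_samples, 3))

order, scores, explained = rank_features_by_loadings(X, n_components=2)
print("cumulative explained variance:", explained.cumsum())
print("candidate features:", sorted(order[:6].tolist()))
```

The top-ranked indices are only candidates. Retraining the network on just those columns and comparing validation error against the full 38-feature model is still the real check.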

thePedrix t1_ist0li1 wrote on October 18, 2022 at 2:19 PM

check if this helps

Sadness24_7 OP t1_ist98vt wrote on October 18, 2022 at 3:18 PM

Oh, this looks promising. I'll give it a try and see what comes up.