Viewing a single comment thread. View all comments

PredictorX1 t1_ixucpv3 wrote

Usually, the assumption is made that the variables are equally "important", so they are standardized. Most often this is done, for each variable, by subtracting the mean then dividing by the standard deviation. Then, the data is clustered, for instance by k-means to discover the clusters. Have you gathered and prepared the data? What kind of clustering algorithms do you have?

1

[deleted] OP t1_ixudbhw wrote

I think I can use k-means as you mentioned for that, but wouldn't the information of time be lost then? As every timestamp would be treated independent to another - while one program (cluster) is used for a period of time not interrupted.

1