Prestigious_Boat_386

Prestigious_Boat_386 t1_ivuhxf7 wrote

Think they want thecombination of that and splitting up when different people talk and assigning what's said to the person saying it.

Which isn't THAT hard when you already can recognice whos who, sometimes you could even just use main pitch & formants and silent segments. It's just quite a niche application.

1

Prestigious_Boat_386 t1_issl12g wrote

Pca for moderate dimension reduction. Straight up throwing away half of the highly correlated dimensions for very high dimension numbers.

Youd reject the worst dimensions until thw size is low enought to use pda then use pda to reduce to a size your network can handle.

2