abriec

abriec t1_j6f2u54 wrote

It was Marco Polo (and his contemporaries) who heard something akin to “yit pun kok” and mapped it to “cipangu”.

“mei guo/mei kok” is likely more related to how “America” was first transliterated into “mei li jian/mei lei keen” rather than for expressing beauty, although that choice is reflected in the written character :)

3

abriec t1_j0b6frn wrote

What is “good” music?

Certainly not the full picture, but imo one of the reasons we don’t see music generation taking off the same way as image/text is it’s more difficult to evaluate and therefore benchmark and iterate.

It faces similar challenges as generative modelling in other modalities, but is arguably more subjective, time-consuming, and require more training if using human evaluation. A layperson can easily tell if an image or text is “good”, it’s more complex for music once it gets above a certain minimum quality threshold.

From a business perspective it’s harder to sell too given the scope of applications (relative to language and vision), as interesting as the problem sounds to us.

Plus, echoing the other comment, I feel it’s reductionistic to flatten music into spectrograms when there are interlaying elements. My intuition is it’ll be better to model dependencies between individual “tracks” as well. I’m sure there’s extensive work on music generation with good results already, just not quite in the spotlight yet.

3