Is there a notion of generalization in unsupervised learning?

Question

I've been learning a little bit about generalization theory, and in particular, the PAC (and PAC-Bayes) approach to thinking about this problem.

So I started to wonder whether there is an analogous notion of "generalization" in unsupervised learning. That is, is there a general framework that captures how "good" an unsupervised learning method is? Reconstruction error plays that role when learning lower-dimensional representations, but what about clustering?

Any ideas?

Cross-posted: https://ai.stackexchange.com/q/20382/1794, https://cstheory.stackexchange.com/q/46660/5038. Please [do not post the same question on multiple sites](https://meta.stackexchange.com/q/64068). — D.W., Apr 19 '20 at 23:08

nbro · Accepted Answer · 2020-04-19T23:31:09.580

In the paper Generalization in Unsupervised Learning (2015), Abou-Moustafa and Schuurmans develop an approach to assess the generalization of an unsupervised learning algorithm $A$ on a given dataset $S$, and to compare the generalization ability of two unsupervised learning algorithms $A_1$ and $A_2$ on the same learning task.

They first provide a more abstract and general definition of an unsupervised learning algorithm and loss function. Then they define the expected risk, empirical risk and generalization gap in a similar way to the case of supervised learning. Finally, they derive an upper bound on $A$'s expected loss.
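To make the analogy with supervised learning concrete, here is a rough sketch of what such definitions typically look like. The notation is my own assumption and not necessarily the paper's: $\mathcal{D}$ is the unknown data distribution, $S = \{x_1, \dots, x_n\}$ is an i.i.d. sample from it, $A_S$ is the output of $A$ when run on $S$, and $\ell$ is a task-specific loss.

```latex
\begin{align*}
  % Expected risk: average loss of the learned output on fresh data
  R(A_S) &= \mathbb{E}_{x \sim \mathcal{D}}\big[\ell(A_S, x)\big] \\
  % Empirical risk: average loss on the sample the algorithm was run on
  \hat{R}_S(A_S) &= \frac{1}{n} \sum_{i=1}^{n} \ell(A_S, x_i) \\
  % Generalization gap: difference between the two
  \Delta(A_S) &= R(A_S) - \hat{R}_S(A_S)
\end{align*}
```

For clustering, one illustrative choice (again an assumption, in the style of $k$-means) is $\ell(A_S, x) = \min_{j} \lVert x - c_j \rVert^2$, where $c_1, \dots, c_k$ are the centers that $A$ returns on $S$. For dimensionality reduction, $\ell$ would be the reconstruction error mentioned in the question.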

Of course, you should read the paper for more details. Specifically, section 2 (page 3) describes their setting in detail.
