What datasets exist for co-clustering?
I am looking for new datasets of documents, from which to extract the matrix terms-documents, to perform co-clustering algorithms.
I am looking forsingle-label datasets only and prefer free access ones.
I already know the following datasets.: CSTR WebKB4 Newsgroups Reuters K1A, K1B, wap (WebACE Project)
Do you know of any others?
You also know of the new co-clustering algorithms created in the last two years? thanks
For datasets, see experiments section of this work.
For new algorithms see:
Vitaladevuni and Basri CVPR 2010
Bagon and Galun arXiv 2011.
Yarkony et al ECCV 2012
Kim et al NIPS 2011
Andres et al ECCV 2012
Just to name a few.