What datasets exist for co-clustering?

I am looking for new datasets of documents, from which to extract the matrix terms-documents, to perform co-clustering algorithms.

I am looking forsingle-label datasets only and prefer free access ones.

I already know the following datasets.: CSTR WebKB4 Newsgroups Reuters K1A, K1B, wap (WebACE Project)

Do you know of any others?

You also know of the new co-clustering algorithms created in the last two years? thanks


For datasets, see experiments section of this work.

For new algorithms see:

Just to name a few.

Need Your Help

Jenkins' GitHub access suddenly stop working

git github jenkins ssh jenkins-plugins

We are using Jenkins version 2.10 on Windows Server 2012 to build a project in a private GitHub repository.