Data Mining

Game Analytics, SDM, papers ...

A lot has happened in one year:

  • I'm now a game data analyst at Game Analytics in Copenhagen. Game Analytics is a middleware solution for web-based game data tracking and analysis, founded in late 2011. I'm one of the co-founders.
  • Several papers were published in 2011; I finally managed to update the publication list.
  • For 2012, we got two papers accepted at SDM (acceptance rate is about 27%).

Most Influential Published Research in Game AI ...

The popular Game AI blog AIGameDev.com nominated our CIG paper on Analyzing the Evolution of Social Groups in World of Warcraft as the most influential published research in 2010. Thanks to Alex for the nomination. Unfortunately we didn't win, maybe next year ;)

Featured Article in Science

Our work on data mining in World of Warcraft is described in a featured article in Science (the article reports on the CIG conference held in Copenhagen a few weeks ago). You can download the article from this site.

You can find more details on our work on WoW in the following papers:

  • Christian Thurau, Kristian Kersting, and Christian Bauckhage. Convex Non-Negative Matrix Factorization in the Wild. ICDM 2009. [PDF]
  • Christian Thurau and Christian Bauckhage. Analyzing the Evolution of Social Groups in World of Warcraft. CIG 2010.

We recently developed another large-scale variant of Archetypal Analysis; see the following paper for details:

  • Christian Thurau, Kristian Kersting, and Christian Bauckhage. Yes We Can – Simplex Volume Maximization for Descriptive Web-Scale Matrix Factorization. CIKM 2010. [PDF]

The code (also for the large-scale archetypal analysis variant) is available in the Python Matrix Factorization Module.

Paper at ACML 2010

Our paper on Hierarchical Convex NMF for Clustering Massive Data got accepted at ACML 2010 (acceptance rate is about 32%). The paper extends last year's ICDM paper on Convex NMF to non-convex data distributions.

PyMF: Python Matrix Factorization Module


I put together some code I wrote earlier this year into a matrix factorization library for Python (http://pymf.googlecode.com). PyMF currently includes the following methods:

  • Non-negative matrix factorization (NMF)
  • Convex non-negative matrix factorization (CNMF)
  • Semi non-negative matrix factorization (SNMF)
  • Archetypal analysis (AA)
  • Simplex volume maximization (SiVM)
  • Convex-hull non-negative matrix factorization (CHNMF)
  • Binary matrix factorization (BNMF)
  • Singular value decomposition (SVD)
  • Principal component analysis (PCA)
  • K-means clustering (Kmeans)
  • CUR decomposition (CUR)
  • Compact matrix decomposition (CMD)

Descriptions of the methods and links to the papers are included in the code. The library is not very well tested and probably needs some beautification ;). Most methods should work fine with data stored in HDF5 tables (see h5py for details).
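For readers new to the first method on the list, here is a minimal NumPy sketch of plain NMF using the standard multiplicative update rules for the squared Frobenius error. It is not PyMF's code and does not use PyMF's API; the function names `nmf` and `frobenius_error`, the parameters, and the toy data are all made up for illustration.

```python
import numpy as np


def nmf(V, k, n_iter=200, eps=1e-9, seed=0):
    """Approximate a non-negative matrix V (m x n) as W @ H with W, H >= 0.

    Illustrative sketch only (not PyMF): multiplicative updates that
    reduce the squared Frobenius error ||V - W H||^2.
    """
    rng = np.random.default_rng(seed)
    m, n = V.shape
    # Random strictly positive initialization keeps the updates well defined.
    W = rng.random((m, k)) + eps
    H = rng.random((k, n)) + eps
    for _ in range(n_iter):
        # Update the coefficients H while holding the basis W fixed.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        # Update the basis W while holding the coefficients H fixed.
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def frobenius_error(V, W, H):
    """Reconstruction error ||V - W H|| in the Frobenius norm."""
    return np.linalg.norm(V - W @ H)


# Toy data: a random non-negative matrix of rank at most 3.
data_rng = np.random.default_rng(42)
V = data_rng.random((20, 3)) @ data_rng.random((3, 30))

W, H = nmf(V, k=3)
err_after_one = frobenius_error(V, *nmf(V, k=3, n_iter=1))
err_after_many = frobenius_error(V, W, H)
```

Because the updates only multiply by non-negative ratios, W and H stay non-negative throughout, and the reconstruction error after many iterations should be lower than after a single one.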

CIKM 2010 paper accepted

Simplex Volume Maximization

Our paper Yes We Can - Simplex Volume Maximization for Descriptive Web-Scale Matrix Factorization got accepted at CIKM 2010 as a short paper. Out of the 945 submissions, 127 (13.4%) were accepted as full papers and 169 (17.9%) as short papers.

CIG 2010 paper accepted + tutorial + special session

Our CIG 2010 (IEEE Conference on Computational Intelligence and Games) paper on Analyzing the Evolution of Social Groups in World of Warcraft got accepted (acceptance rate is 49%).

At CIG, I will give a tutorial on Game Mining – Data Mining in Games. Together with people from KD and VSM, we (Christian Bauckhage, Olana Missura, Thomas Gaertner, and Kristian Kersting) are also organizing a special session on Game Mining.

Paper selected as one of the best at ICDM 2009

Our paper on Convex Non-Negative Matrix Factorization in the Wild has been selected as one of the best papers at the Int. Conf. on Data Mining 2009. We won a possible publication in Knowledge and Information Systems (KAIS), nice :-)

ICDM 2009 paper accepted

Guilds in World of Warcraft

Our ICDM 2009 paper on Convex Non-Negative Matrix Factorization in the Wild got accepted as a regular paper. The acceptance rate for regular papers is 8.9%: of 786 submissions, 140 were selected for presentation, 70 as regular papers and the remaining 70 as short papers.

ICSC 2009 paper accepted

Archetypal Images

Our ICSC 2009 paper on Archetypal Images in Large Photo Collections got accepted. Acceptance rate is 30%.
