Empirical Inference
In this paper we introduce a novel approach for incrementally building aspect models, and use it to dynamically discover underlying themes from document streams. Using the new approach we present an application which we call query-line tracking i.e., we automatically discover and summarize different themes or stories that appear over time, and that relate to a particular query. We present evaluation on news corpora to demonstrate the strength of our method for both query-line tracking, online indexing and clustering.
| Author(s): | Surendran, A. and Sra, S. |
| Links: | |
| Book Title: | PKDD 2006 |
| Journal: | Knowledge Discovery in Databases: PKDD 2006 |
| Pages: | 633-640 |
| Year: | 2006 |
| Month: | September |
| Day: | 0 |
| Editors: | F{\"u}rnkranz, J. , T. Scheffer, M. Spiliopoulou |
| Publisher: | Springer |
| BibTeX Type: | Conference Paper (inproceedings) |
| Address: | Berlin, Germany |
| DOI: | 10.1007/11871637_65 |
| Event Name: | 10th European Conference on Principles and Practice of Knowledge Discovery in Databases |
| Event Place: | Berlin, Germany |
| Digital: | 0 |
| Electronic Archiving: | grant_archive |
| Language: | en |
| Organization: | Max-Planck-Gesellschaft |
| School: | Biologische Kybernetik |
BibTeX
@inproceedings{5220,
title = {Incremental Aspect Models for Mining Document Streams},
journal = {Knowledge Discovery in Databases: PKDD 2006},
booktitle = {PKDD 2006},
abstract = {In this paper we introduce a novel approach for incrementally building aspect models, and use it to dynamically discover underlying themes from document streams. Using the new approach we present an application which we call query-line tracking i.e., we automatically discover and summarize different themes or stories that appear over time, and that relate to a particular query. We present evaluation on news corpora to demonstrate the strength of our method for both query-line tracking, online indexing and clustering.},
pages = {633-640},
editors = {F{\"u}rnkranz, J. , T. Scheffer, M. Spiliopoulou},
publisher = {Springer},
organization = {Max-Planck-Gesellschaft},
school = {Biologische Kybernetik},
address = {Berlin, Germany},
month = sep,
year = {2006},
author = {Surendran, A. and Sra, S.},
doi = {10.1007/11871637_65},
month_numeric = {9}
}