2008-12-04

BrownBag, 4th of Dec. 2008: Dunja Mladenic - Stream Mining

1. Real-Time Information Processing
The presentation addresses real-time information processing on a rather high-level providing basic information on the topic, relating it to research areas and giving illustrative example of four applications.

2. Predicting Category Additions in a Topic Hierarchy
This paper discusses the problem of predicting the structural changes in an ontology. It addresses ontologies that contain instances in addition to concepts. The focus is on an ontology where the instances are textual documents, but the approach presented in this document is general enough to also work with other kinds of instances, as long as a similarity measure can be defined over them. We examine the changes in the Open Directory Project ontology of Web pages over a period of several years and analyze the most common types of structural changes that took place during that time. We then present an approach for predicting one of the more common types of structural changes, namely the addition of a new concept that becomes the subconcept of an existing parent concept and adopts a few instances of this existing parent concept. We describe how this task can be formulated as a machine-learning problem and present an experimental evaluation of this approach that shows promising results of the proposed approach.

Open Directory Project front page, January 2006Image via Wikipedia


0 komentarji: