file.pdf (844.45 kB)

Inferring users' projects from their workstation contents

Download (844.45 kB)
journal contribution
posted on 01.01.2002, 00:00 by Tom M.(Tom Michael) Mitchell
Abstract: "One key to providing intelligent assistance to workstation users is to construct machine-understandable descriptions of the user's ongoing projects, or activities, (e.g., their committee memberships, writing projects, conference organization activities), and indices describing which emails, meetings, and colleagues relate to which activity. This paper presents a program, ActivityExtractor, which examines the user's workstation contents to infer such activity descriptions. In earlier work [Huang, et al., 2004] we described an algorithm which infers activities by clustering the user's emails based on their word distributions. Here we extend this approach in several ways: (1) by incorporating a social network analysis of email senders and recipients, (2) by considering workstation contents beyond email, including contents accessible via Google Desktop Search, and (3) by allowing simple user input in the form of a list of activity/project names. We describe the ActivityExtractor algorithms and report on experiments applying these to several users' workstations."