Introduction

   CiteSeer is a scientific literature digital library and search engine which automatically crawls and indexes scientific documents in the field of computer and information science. It has over 730,000 documents with over 8 million citations. It is primarily hosted at the Penn State University under the guidance of Dr. Lee Giles.

The Next Generation CiteSeer or CiteSeerX initiative aims at enhancing the existing search engine by redesigning the architecture for increased utility and reliability, expanding the breadth and depth of the collection, providing personalized services to the users by making use of their individual search histories, exploting patterns of citations etc. This joint effort between the Penn State University and the University of Arkansas, is primarily funded by NSF

The efforts here at UARK mainly focus on designing and developing new personalization features for CiteSeer. Our approach to personalization is based on conceptual user profiles. Our previous experience with KeyConcept proved that conceptual profiles are an efficient way to represent user interests. The goals are listed below.

Goals

  1. To completely classify the CiteSeer collection and provide a mechanism to classify new documents that are added to the collection.
  2. Develop a conceptual browsing and retrieval module for citeseer.
  3. Design and develop a system for tracking user actions and building conceptual profiles.
  4. Provide personalized browsing interfaces for the users.
  5. Design and develop a Recommender System using the conceptual user profiles.

                      Visit the progress link to learn about our approach for achieving these goals.

                                                 © 2007 University of Arkansas. All rights reserved.