Phoenicia: A Model for World-Wide Web Based Campus Information Systems.

Biological Sciences Division Academic Computing, The University of Chicago.

The popularity of the World-Wide Web has spawned a proliferation of web-based campus information systems, providing enhanced ease of access and publication of a variety of campus information resources. History has shown, however, that more data does not always translate into better information. As the size and complexity of web-based campus information systems continue to increase, a major challenge has become providing a consistent presentation interface and a set of utilities that enable information users and providers to maintain information and transform it into structured knowledge.

Phoenicia is a prototype web-based campus information system developed in the context of the Phoenix Project at the University of Chicago. In its current incarnation, Phoenicia consists of a suite of CGI Perl scripts and supporting structured data-stores, developed around an extensive object-oriented model of campus information data. It provides users with personalized on-line access to a wide variety of information resources, ranging from administrative resources (registrar data, promotional/expository information, event calendars, grant and funding opportunities, current research activities, faculty publications) to class materials (syllabi, bulletin boards, lecture notes, quizzes), to general Internet resources. In addition, it supports distributed information management by enabling users to interactively maintain both the information accessed through Phoenicia's interface and the interface and structure of the Phoenicia environment itself.

In this paper we describe: the design considerations that have guided our development efforts thus far; the general features of the Phoenicia architecture; and our plans for further extensions to the system, in support of teaching, research and clinical care.

1. Introduction & Background

As described by Judy Hallman[1], the benefits provided by a campus-wide information system (CWIS) are manifold: a single "window" into a broad range of information resources; 24-hour access; easier maintenance and timeliness; distributed access capability to larger audiences; ability to present archival data; and cost savings in printing and continuing phone and personnel support.

Most recently, advancements in distributed hypermedia systems have enabled CWISs to become "standardized" to the point where users can now easily and routinely access both local and remote campus information resources. These systems are typically built using a client-server architecture and employ graphical user interfaces to support rich text, graphics, and other media presentations.

1.1 Gopher

A keyword-based utility for searching most gopher-server menu titles in the entire gopher "space" is available. Called Veronica (Very Easy Rodent-Oriented Net-wide Index to Computerized Archives), it has improved the process of managing access to the abundance of information available through distributed Gopher servers. Veronica is only a first step, however, toward the types of tools a user needs to locate and use information in an easy, relevant, and timely manner.

1.2 The World Wide Web

The WWW marries the ability to easily support and deliver multiple data types (rich text, graphics, sound, and animation) with the use of hypermedia as its method of presentation. Like Gopher clients, WWW browsers can access many existing data systems via existing protocols (FTP, NNTP) or via the Web's native protocol, HTTP, and a gateway. Thus, the WWW has become a useful "meta-viewer" for the majority of Internet-based information sources. With this powerful combination of capabilities, the WWW has been adopted by many colleges and universities, and is being aggressively pursued as a "next generation" platform for CWISs.

1.3 The Biological Sciences Division Office of Academic Computing & the Phoenix Project

Two significant development projects have been initiated in support of the over-arching effort we call the Phoenix Project. The first is an effective X-Windows based What-You-See-Is-What-You-Get (WYSIWYG) HTML browser/editor that allows users to easily author and publish information on the WWW, and has been described elsewhere.[2]

The second project has been to develop a robust data model to integrate the wide variety of information contained in the world of Phoenix. This data model, and its subsequent presentation via the WWW, we call Phoenicia.

2. Phoenicia Architecture

Our perspective in developing Phoenicia has been to consider the Web as a presentation medium, rather than as a storage or data model; indeed, over 90% of the HTML served from our site is dynamically generated, rather than stored as HTML flat files. The heart of Phoenicia resides, instead, in the data model we have developed to integrate the wide variety of available campus information resources. This information model, implemented with a suite of server-side scripts and a supporting database, is an object-oriented representation of campus information. It supports conventional academic entities such as departments and lectures, and allows users to interactively create and maintain information relating to them through a Web interface. It further enables users, however, to create their own entities either by 're-modeling' existing ones or by creating them de novo. This approach has enabled us to provide members of the university community with a highly flexible and dynamic information environment, through which they can easily access and maintain instructional, research, and administrative information, as well as refine the structure of the environment itself.

2.1 General Features

Our database server, a Sybase relational database, serves a dual function as both a data store and a complex data index. While much of the data maintained through Phoenicia resides in disparate external data stores, the structure of the Phoenicia environment is primarily stored in the database itself. It maintains all object definitions, as well as small atomic data, such as names, telephone numbers and the like. In its role as an index, it maintains both the relative position or coordinates of objects within the Phoenicia environment, and the addresses of externally stored data.

Database records corresponding to a faculty member's research description, for example, contain data specifying the relationships between the description and other objects within the Phoenicia infrastructure, as well as direct and indirect pointers to the text of the research description itself.
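The dual role described above can be illustrated with a minimal sketch. It is not the Phoenicia schema: the table names, column names, sample values, and file path are all hypothetical, and SQLite stands in for the Sybase server purely so the example is self-contained.

```python
import sqlite3

# Hypothetical schema; sqlite3 is a stand-in for the Sybase server in the text.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE objects (
        id INTEGER PRIMARY KEY,
        class_name TEXT NOT NULL,
        value TEXT,              -- small atomic data kept in the database
        external_address TEXT    -- address of data held in an external store
    );
    CREATE TABLE relations (
        parent_id INTEGER NOT NULL REFERENCES objects(id),
        child_id INTEGER NOT NULL REFERENCES objects(id)
    );
""")

# Illustrative rows: a person, an atomic value, and a pointer to external text.
conn.executemany(
    "INSERT INTO objects (id, class_name, value, external_address) "
    "VALUES (?, ?, ?, ?)",
    [
        (1, "person", "John", None),
        (2, "telephone-number", "555-0100", None),
        (3, "research-description", None, "/nfs/research/john.txt"),
    ],
)
conn.executemany("INSERT INTO relations VALUES (?, ?)", [(1, 2), (1, 3)])


def describe(object_id):
    """Return (class_name, value, external_address) for each related object."""
    rows = conn.execute(
        """SELECT o.class_name, o.value, o.external_address
           FROM relations r JOIN objects o ON o.id = r.child_id
           WHERE r.parent_id = ? ORDER BY o.id""",
        (object_id,),
    )
    return rows.fetchall()


related = describe(1)
```

In this sketch the telephone number is stored in the database, while the research description row carries only the address of the externally stored text.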

2.2 The Information Model

2.2.1 Base objects

[Figure: base objects. Labels: telephone-number, photo-of-student]

2.2.2 Composite objects

[Figure: composite objects. Labels: address-book, addresses, person, telephone-number, photo]

Composite objects differ from base objects in that they do not have values or attributes per se; rather, attributes are assigned to them by association (specified by their class membership), through a cascade of pointers to base objects. This feature gives the data model exceptional flexibility, since objects can thereby share common attributes.
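The pointer cascade can be sketched as follows. The class names `BaseObject` and `CompositeObject`, the `attributes` method, and the sample values are assumptions made for illustration, not Phoenicia's actual implementation.

```python
from dataclasses import dataclass, field


@dataclass
class BaseObject:
    """Hypothetical base object: it holds a value directly."""
    class_name: str
    value: str


@dataclass
class CompositeObject:
    """Hypothetical composite object: no value of its own, only pointers."""
    class_name: str
    members: list = field(default_factory=list)

    def attributes(self):
        """Follow the cascade of pointers and collect (class, value) pairs."""
        found = []
        for member in self.members:
            if isinstance(member, BaseObject):
                found.append((member.class_name, member.value))
            else:
                found.extend(member.attributes())
        return found


phone = BaseObject("telephone-number", "555-0100")
person = CompositeObject("person", [phone])
address_book = CompositeObject("address-book", [person])
directory = CompositeObject("directory", [phone])  # shares the same base object

# Changing the shared base object is visible through every composite.
phone.value = "555-0199"
```

Because both composites point at the same base object, neither holds a copy of the telephone number that could fall out of date.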

[Figure. Labels: robs, address-book, John-CV, John-telephone-number]

2.2.3 Class definitions

[Figure: class definitions. Labels: address-book, home-phone-number, work-phone-number]

2.2.4 Display and Manipulation of Objects

Objects can be explicitly embedded into served Phoenicia documents for display or editing purposes, through the mark-up extension to HTML described below.

[Mark-up syntax not reproduced. Fields: object identifier list, display expression]

The object identifier list specifies one or more object IDs. These are provided as a comma-delimited list of IDs, or as an equivalent SQL statement. The display expression consists of an HTML string with embedded place-holders ($1, $2,...), into which the retrieved object values are inserted according to their order in the object identifier list.
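The place-holder substitution can be sketched in a few lines. The function name `render` and the sample values are hypothetical, and the retrieval of values from the object identifier list is assumed to have already happened; only the insertion step is shown.

```python
import re


def render(display_expression, values):
    """Replace $1, $2, ... with the value at that 1-based position.

    A place-holder with no corresponding value is left untouched.
    """
    def substitute(match):
        index = int(match.group(1)) - 1
        if 0 <= index < len(values):
            return values[index]
        return match.group(0)

    return re.sub(r"\$(\d+)", substitute, display_expression)


# Values are listed in the same order as the object identifier list.
html = render("<b>Name:</b> $1<br><b>Phone:</b> $2", ["John", "555-0100"])
```

Matching the full digit run means that a place-holder such as $10 is read as position ten rather than as $1 followed by a zero.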

[Example: address-book entry. Name: name; Address: address; Phone: phone]

The mark-up can also be used by Phoenicia scripts as a general presentation template. In such cases, the object identifier list is composed only of class-object IDs; the ID of the particular instance of the class for which the template is being invoked is passed to the template by the HTML-generating script.
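Template use can be sketched under the same assumptions. The in-memory `INSTANCES` store, the class IDs, the instance ID, and the sample values below are all hypothetical; in Phoenicia the values would come from the database rather than a dictionary.

```python
import re

# Hypothetical store: instance ID -> {class ID -> value}.
INSTANCES = {
    "john": {"name": "John", "address": "Room 101", "phone": "555-0100"},
    "mary": {"name": "Mary", "address": "Room 202", "phone": "555-0111"},
}

# The template names classes only; no particular instance is mentioned.
TEMPLATE_CLASSES = ["name", "address", "phone"]
TEMPLATE_EXPRESSION = "Name: $1 Address: $2 Phone: $3"


def render_template(instance_id):
    """Fill the class-level template with one instance's values."""
    instance = INSTANCES[instance_id]
    values = [instance.get(class_id, "") for class_id in TEMPLATE_CLASSES]
    return re.sub(
        r"\$(\d+)",
        lambda match: values[int(match.group(1)) - 1],
        TEMPLATE_EXPRESSION,
    )


# The HTML-generating script supplies the instance ID at invocation time.
john_entry = render_template("john")
mary_entry = render_template("mary")
```

One template thus serves every instance of the class, which is the design choice the paragraph above describes.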

2.3 Information Management

2.3.1 Technical considerations

The technical integration of these information sources in Phoenicia is relatively straightforward; wherever possible, live data feeds are used. This includes remote SQL transaction processing with the BSD-MIS Sybase server, as well as file-system sharing using NFS mounted volumes. In some cases, however, we must rely on deferred updates, as in the case of our daily data feed (NFS) from the registrar's mainframe.

2.3.2 Administrative Integration

Along with the provision for technical integration of campus information resources, we are also revising current information management practices to take advantage of Phoenicia's enhanced information environment. In particular, our provisions for distributed access to and maintenance of information resources at the object level make it possible to reconsider common preconceptions governing the creation and maintenance of information resources. The 'document' need no longer constitute the standard information denomination. Instead, Phoenicia supports the direct ownership of data objects, ranging in size and complexity from a simple text field to a sophisticated description of a university division encompassing thousands of distinct information objects. Users are considered to be both consumers and providers of these information objects, and their respective information management duties are assigned to them accordingly.

Fig. 3 illustrates this distinction between the conventional data management model (upper panel), and that which we are implementing through Phoenicia (lower panel). Two administrators -- Persons A and B -- are each charged with the maintenance of a report -- reports A and B -- each containing common information elements. In the conventional scheme, each administrator maintains copies of both their own and their counterpart's reports (maintenance of their own report requires knowledge of the value of their counterpart's data objects), thereby duplicating all shared data. In our scheme, objects are embedded in each of the reports and their maintenance is assigned to their respective owners. Thus, any modification of a red object, belonging to person A, will be dynamically reflected in all presentations of that object, including document B.
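The object-level scheme can be sketched as follows. The class names `DataObject` and `Report`, the ownership check, and the sample values are assumptions made for illustration; the source does not describe how ownership is enforced.

```python
class DataObject:
    """Hypothetical information object with a single owner."""

    def __init__(self, owner, value):
        self.owner = owner
        self.value = value

    def update(self, user, new_value):
        """Allow only the owner to change the value (assumed policy)."""
        if user != self.owner:
            raise PermissionError(f"{user} does not own this object")
        self.value = new_value


class Report:
    """A report embeds references to objects rather than copies of them."""

    def __init__(self, title, objects):
        self.title = title
        self.objects = objects

    def render(self):
        return f"{self.title}: " + ", ".join(obj.value for obj in self.objects)


budget = DataObject("A", "budget=100")  # owned by person A
staff = DataObject("B", "staff=12")     # owned by person B

report_a = Report("Report A", [budget, staff])
report_b = Report("Report B", [staff, budget])

# Person A edits their own object; report B reflects it with no copying.
budget.update("A", "budget=120")
```

Since each report is rendered from the shared objects at display time, there is no duplicated data for the two administrators to reconcile.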

3. Future Directions

It seems likely that provisions for such direct object support will be unavailable through the Web over the short term, and that we will need to pursue the approach we have adopted for the foreseeable future. Our development plans do include, however, the extension of our current scheme to support the distribution of Phoenicia object servers across campus. In this regard we shall be looking to extend our object-specifier 'HTML mark-up' to a generalized form, akin to the syntax of the URL markup in HTML. We are also considering extending the interoperability of Phoenicia at the object level by integrating support for the OpenDoc, Object Linking and Embedding (OLE), and Open Database Connectivity (ODBC) standards into Phoenicia.

[1] Hallman, J. "Campus-Wide Information Systems." University of North Carolina at Chapel Hill, May 19, 1992.

[2] Lavenant, M. G. and Kruper, J. A. "The Phoenix Project: Distributed Hypermedia Authoring" in Proceedings of the First International World-Wide Web Conference, Geneva, 1994.

John Kruper is Director of the Office of Biological Sciences Division Academic Computing (BSDAC) at the University of Chicago. This group was founded two years ago to refashion instruction and training in the Biological Sciences by applying new technologies to the teaching and learning process. With a multidisciplinary team consisting of programmers, curriculum specialists, and media designers, BSDAC seeks to make it possible for physicians, scientists, and students to work and study in a fundamentally new manner by electronically linking the classroom, the research laboratory, the clinical exam room, and other remote locations including the home office.
Dr. Kruper is also a Lecturer in the Biological Sciences Collegiate Division, where he teaches classes in Genetics, Recombinant DNA Technology, Simulation & Modeling in the Biological Sciences, and Molecular BioComputing.
Dr. Kruper received undergraduate degrees in biochemistry and molecular and cell biology from the Pennsylvania State University, a Master's degree in Molecular Virology from the University of Chicago, and a Doctorate of Arts degree in Biology Education from the University of Illinois at Chicago. He also did post-doctoral research with William Wimsatt in the Department of Philosophy at the University of Chicago before assuming his current role as Director of BSD Academic Computing.
Dr. Kruper's research interests include distributed database and hypermedia systems, the use of simulation and model building to support science education, and (with the DNA Learning Center of Cold Spring Harbor Laboratory) characterizing the diffusion of curriculum innovation.

Marc Lavenant has been lead developer on the Phoenix Project at the University of Chicago since its inception two years ago. Prior to joining the project he pursued research on the molecular biology of protein structure and cellular computation. His primary research interest is the design of self-assembling knowledge systems. He is currently responsible for designing and coordinating the development of Phoenicia.

Email correspondence should be addressed to m-lavenant@uchicago.edu