HazDat: Sharing Environmental Hazards Data through the World-Wide Web

Michael Perry, Agency for Toxic Substances and Disease Registry, Atlanta, Georgia, USA
lmp1@atsoaa1.em.cdc.gov http://atsdr1.atsdr.cdc.gov:8080/mikeperry.html

Richard Anderson, Agency for Toxic Substances and Disease Registry, Atlanta, Georgia, USA
rxa5@atsoaa1.em.cdc.gov http://atsdr1.atsdr.cdc.gov:8080/richardanderson.html

Ronald Parker, Agency for Toxic Substances and Disease Registry, Atlanta, Georgia, USA
rzp1@atsoaa1.em.cdc.gov
Abstract:
The Agency for Toxic Substances and Disease Registry (ATSDR), has developed HazDat, a database that contains information on toxic substances released into the environment and information about the effects on human health that might be caused by exposure to these toxic substances. GSQL is a simple forms interface between Mosaic and SQL relational databases that has been used at ATSDR to build a forms-based database access application that allows the user to "drill down" through the information in ATSDR's Internet HazDat database. The forms-based application provides the user with seamless, easy-to-use access to relational database tables, metadata in a data dictionary, HTML hypertext documents, and images.
Keywords:
environment, health, hazardous, toxic, waste, relational, database, SQL, GSQL, WWW, HTML, forms

Introduction

The mission of the Agency for Toxic Substances and Disease Registry [ATSDR], as an agency of the Public Health Service in the U.S. Department of Health and Human Services [DHHS], is to prevent exposure and adverse human health effects and diminished quality of life associated with exposure to hazardous substances from waste sites, unplanned releases, and other sources of pollution present in the environment.

ATSDR is directed by law to perform specific functions concerning the effect on public health of hazardous substances in the environment. These functions include public health assessments of waste sites, health consultations concerning specific hazardous substances, health surveillance and registries, response to emergency releases of hazardous substances, applied research in support of public health assessments, information development and dissemination, and education and training concerning hazardous substances.

In pursuit of its mission over the past 8 years, ATSDR has developed HazDat, a database containing information about the release of hazardous substances into the environment and the effects of those substances on human health. In fact, HazDat is the only database known to us that contains this kind of data in one place. As a result, HazDat is of interest to many people, including public health officials, researchers, and the general public. Because ATSDR has a legislative mandate to develop and disseminate information, we placed a very high priority on making HazDat available to the general public. This goal was achieved in 1994 using the combined technologies of the Internet, the World-Wide Web, Web browsers such as Mosaic, and the GSQL interface to relational databases.

The HazDat Database

HazDat [HAZDAT] is a scientific and administrative database developed to provide access to information on the release of hazardous substances from Superfund sites or emergency events and on the effects of hazardous substances on the health of human populations. The following information is included in HazDat: site characteristics, activities and site events, contaminants identified, contaminant media and maximum concentration levels, impact on population, community health concerns, ATSDR public health threat categorization, ATSDR recommendations, environmental fate of hazardous substances, exposure routes, and physical hazards at the site or event. In addition, HazDat contains substance-specific information such as the ATSDR Priority List of Hazardous Substances, health effects by route and duration of exposure, metabolites, interactions of substances, susceptible populations, and biomarkers of exposure and effect. HazDat also contains data from the U.S. Environmental Protection Agency (EPA) Comprehensive Environmental Response, Compensation, and Liability Information System (CERCLIS) database, including CERCLIS site identification number, site description, latitude and longitude, operable units, and additional site information.

Internet HazDat

In order to make the most-requested information from HazDat available to the public over the Internet, a smaller version of HazDat - Internet HazDat - was developed using a client-server, relational database management system - Sybase SQL Server - running on a small IBM RS/6000 computer connected to the Internet. Designing and implementing the database server in 1993 was the easy part; we then faced the long and difficult job of developing the client application which we would somehow distribute over the Internet to people who would use it to access our database.

In mid-1993, we became aware of the Mosaic [MOSAIC] application developed by the National Center for Supercomputing Applications (NCSA) [NCSA], which was being beta tested by a rapidly growing number of Internet users. Suddenly, the first official version of Mosaic was released, and we were checking the Internet every day to keep up with the rapid developments. Text information was being made available over the World-Wide Web at an incredible rate and in a format that was intuitive to anyone who had ever used a computer. ATSDR disseminates a lot of information as hardcopy text documents, so we began the process of building a server to make this text information available on the Web. The World-Wide Web and Mosaic were such powerful tools, we wondered if it might be possible to make the Internet HazDat database available similar to the way in which WAIS text databases were being made available through Mosaic version 2 - by using Hypertext Markup Language (HTML) forms to accept user input. Apparently, we were not the only ones looking for this kind of functionality.

In February 1994, we found GSQL on the NCSA Web server. GSQL, developed by Jason Ng of the NCSA Software Development Group, was described as "a simple forms interface in Mosaic to SQL databases." We quickly read the GSQL Tutorial [GSQL], which clearly described the procedure for developing an HTML forms interface to a relational database and a few working examples. We found a couple of bugs in our environment which we were able to fix and reported them to Jason Ng. Within 2 weeks we had created three forms for querying our database. Because GSQL converts the data being returned by an SQL database into another HTML form, we were able to convert some of the data being returned by a query into hypertext that could link to a text document or launch another SQL query to the database. This functionality allowed us to build a hypertext "drill-down" scheme into the database queries, enabling the user to get more detailed data and to follow links to related data within the database. The hypertext concept, which makes the World-Wide Web and Mosaic such powerful and intuitive tools, could now be used to guide a user through a relational database as if it was a series of text documents containing just the information requested by the user. In addition, once the response to a query is returned, it may be saved to a local ASCII file and loaded into another application, such as Lotus or Paradox, for further analysis.

Finishing touches were added to the application as we worked on the appearance of the forms and optimized the database for the queries being generated. The data dictionary for Internet HazDat was made available as HTML text documents that can be accessed from many locations within the application. This data dictionary defines all the columns that can be accessed and includes example data from each column. These metadata play an indispensable part in the sharing of any database [WWW94-1]. Also, in what we believe was a first, we included HTML anchors within a Sybase text field to seamlessly link a database record to an external HTML text document. The HTML documents linked to the database are the very popular Public Health Statements - short summary documents taken from the ATSDR Toxicological Profiles that contain information on specific toxic substances often found in and around hazardous waste sites.

Conclusion

There is considerable interest all over the world in the sharing of both environmental and public health data over the World-Wide Web [WWW94-1],[WWW94- 2],[WWW94-3]. By using the rapidly advancing technology of the World-Wide Web, such as Mosaic and GSQL, ATSDR has been able to make available to public health officials, researchers, and the general public important environmental hazards data within the HazDat relational database, along with database metadata in the form of a data dictionary.

Demonstration

Below are World-Wide Web links to two of the database query forms found in the production version of Internet HazDat.
The Site Activity Query Form http://atsdr1.atsdr.cdc.gov:8080/gsql/siteact.script
The Site Activity Query Form was the most complex form created for this application. Many people are interested in information related to a specific hazardous waste site, sites in the same area, sites that are privately owned, etc. The user may specify values for none, one, or more attributes of hazardous waste sites with which ATSDR has been involved. All entries to the site attribute portion of the form serve to limit the query to smaller subsets of sites. If no entries are made, data from all sites in the database, about 2,000 of them, will be returned. The last scrollable select list allows the user to specify additional site data to be included in the database response.

The Site Contaminant Query Form http://atsdr1.atsdr.cdc.gov:8080/gsql/sitecontam.script
Many people are also interested in specific toxic substances found at hazardous waste sites. This fairly simple query answers questions such as "At which sites in your database was dioxin found?"

References:
[ATSDR]
ATSDR's Home Page
http://atsdr1.atsdr.cdc.gov:8080/atsdrhome.html
[DHHS]
U.S. Department of Health and Human Services Resource Locator
http://www.os.dhhs.gov/
[HAZDAT]
HazDat Home Page
http://atsdr1.atsdr.cdc.gov:8080/hazdat.html
[MOSAIC]
The NCSA Mosaic Home Page
http://www.ncsa.uiuc.edu/SDG/Software/Mosaic/NCSAMosaicHome.html
[NCSA]
The National Center for Supercomputing Applications (NCSA) Home Page
http://www.ncsa.uiuc.edu/General/NCSAHome.html
[GSQL]
The GSQL Tutorial
http://www.ncsa.uiuc.edu/SDG/People/jason/pub/gsql/starthere.html
[WWW94-1]
David Crossley: WAIS through the Web - Discovering Environmental Information
http://www.ncsa.uiuc.edu/SDG/IT94/Proceedings/Searching/crossley/paper.html
[WWW94-2]
Frederick Zimmerman and Peter Young: Information for a Changing World
http://www.ncsa.uiuc.edu/SDG/IT94/Proceedings/EarSci/zimmerman/zimmerman.html
[WWW94-3]
Joel D. Register and John L. Gerone: Interfaces for Public Information and Scientific Research
http://www.ncsa.uiuc.edu/SDG/IT94/Proceedings/HCI/register/register.html