Accepted Papers
Regular presentations:
- Boris Chidlovskii, Bruno Roustant, Marc Brette, "Wrapper Induction and Maintenance in Documentum ECI"
- Mikhail Bilenko, Beena Kamath, Raymond J. Mooney, "Adaptive Blocking: Learning to Scale Up Record Linkage"
- Eda Baykan, Sebastian de Castelberg, Monika Henzinger, "A Comparison of Techniques for Sampling Web Pages"
- François Bry, Tim Furche, Benedikt Linse, "Let's Mix It: Versatile Access to Web Data in Xcerpt"
- Shuohao Zhang, Curtis Dyreson, "Polymorphic XML Restructuring"
- Jens Bleiholder, Felix Naumann, "Conflict Handling Strategies in an Integrated Information System"
- Peter Bailey, David Hawking, Alexander Krumpholz, "Toward meaningful test collections for information integration benchmarking"
- Armin Roth, Felix Naumann, Tobias Hubner, Martin Schweigert, "System P: Query Answering in PDMS under Limited Resources"
- Jonathan Bunde-Pedersen, Jakob E. Bardram, "Towards an Activity-Based World Wide Web"
- James Masters, Cynthia Matuszek, Michael Witbrock, "Ontology-Based Integration of Knowledge from Semi-Structured Web Pages"
Short presentations:
- Hiroyuki Sato, Iko Pramudiono, Kyoji Iiduka, and Takahiko Murayama, "Automatic RDF Query Generation from Person Related Heterogeneous Data"
- Aviv Segev, Ilan Shimshoni, "Integrating Computer Vision in Web Based Context Recognition"
- Kurt Englmeier1, Javier Pereira2 , Josiane Mothe, "Choreography of Web Services based on Natural Language Storybooks"
- Livia Predoiu, "Information Integration with Bayesian Description Logic Programs"
Description
The explosive growth of the Web has amassed a huge number of information sources on the Internet with unprecedented potential for accessibility. In particular, in recent years, the Web has been rapidly deepened with the prevalence of databases and enriched with structured (or semi-structured) data online. While there are Web sources relevant to virtually any user's query, the morass of sources presents a formidable hurdle to effectively finding such sources, querying them, and aggregating across sources.
The explosive growth of the Web has amassed a huge number of information sources on the Internet with unprecedented potential for accessibility. In particular, in recent years, the Web has been rapidly deepened with the prevalence of databases and enriched with structured (or semi-structured) data online. While there are Web sources relevant to virtually any user's query, the morass of sources presents a formidable hurdle to effectively finding such sources, querying them, and aggregating across sources.
The purpose of this workshop is to bring together researchers in a variety of areas that are all related to the larger problem of information integration on the Web. We aim to promote the awareness of large scale integration on the Web, discuss research directions and agenda, share experience and insights, and build a joint community across disciplines for data and application benchmarks.
The workshop will discuss research problems for Web-based information integration, with a focus on dynamic and large scale integration. These topics include, but are not limited to:
- Novel integration architectures
- Data and application benchmarks
- Information extraction
- Schema matching
- Wrapper learning and generation
- Information gathering
- View integration
- Source discovery
- Source descriptions and meta-data learning
- Source statistics learning
- Web-based query execution and optimization
- Web service composition
- Record linkage and object consolidation
- Resolving inconsistency across sources
- Data mining for integration
Submission Instructions
We encourage participants to submit a paper (3-6 pages) or position abstract (1 page) using the standard WWW paper formatting. Please submit papers in PDF and send them directly to iiweb-sub@cs.uiuc.edu. If your paper is larger than one megabyte, please place the file on an http site and send a pointer to the file.
Important Dates and Deadlines
Paper submission: March 3, 2006
Acceptance Notification: March 31, 2006 April 7, 2006 (Sorry for the delay)
Camera-ready copy: April 10, 2006 April 15, 2006
Workshop: May 22, 2006
Workshop Organizers
Kevin C. Chang
University of Illinois at Urbana-Champaign
Avigdor Gal
Technion - Israel Institute of Technology
Web Chair: Bin He
University of Illinois at Urbana-Champaign
Program Committee
Karl Aberer, EPFL, Switzerland
Hasan Davulcu, Arizona State University, USA
Anhai Doan, University of Illinois at Urbana-Champaign, USA
David Embley, Brigham Young University, USA
Lee Giles, Pennsylvania State University, USA
Fausto Giunchiglia, University of Trento, Italy
Chun-Nan Hsu, Academia Sinica, Taiwan
Subbarao Kambhampati, Arizona State University, USA
Craig Knoblock, University of Southern California, USA
Nicholas Kushmerick, University College Dublin, Ireland
Chen Li, U.C. Irvine, USA
Bing Liu, University of Illinois at Chicago, USA
Frederick H. Lochovsky, University of Science and Technology Hong Kong, China
Giansalvatore Mecca, Universita della Basilicata, Italy
Felix Naumann, Humboldt University, Germany
Zaiqing Nie, Microsoft Reserach Asia, China
Louiqa Raschid, University of Maryland College Park, USA
Marie-Christine Rousset, INRIA, French
Sunita Sarawagi, IIT Bombay, India
Domenico Ursino, University Mediterranea of Reggio Calabria, Italy
Ji-Rong Wen, Microsoft Reserach Asia, China
Clement Yu, University of Illinois at Chicago, USA