Fakultäten » Wirtschaftswissenschaftliche Fakultät » Informatik, Institut für » Prof. Dr. Klaus Dittrich (verstorben) » Ziegler
| Title / Titel | SIRUP | ||
|---|---|---|---|
| Abstract (PDF, 14 KB) | |||
| Original title / Originaltitel | Semantic Integration Reflecting User-Specific Semantic Perspectives | ||
| Summary / Zusammenfassung | In data integration, autonomy of data sources is usually given higher priority than diversity of information needs of data end-users. However, data receivers strongly differ in their information needs and in their conceptual mental models of their particular application area. In the SIRUP (Semantic Integration Reflecting User-specific semantic Perspectives) approach to semantic data integration, we take into account heterogeneity of data receivers. Our goal is to provide means that data from heterogeneous sources can be integrated in a way that it perfectly fits to a particular user's information needs, emphasizing his or her individual way to perceive a domain of interest. In short, SIRUP aims at personalized semantic integration of alphanumeric data. To achieve this, we propose to use a semantic multidatasource language to declaratively manipulate so-called IConcepts. IConcepts are basic conceptual building blocks to which attribute data that refers to the same real-world concept is linked by data providers. Moreover, we provide explicit, queryable semantics by connecting IConcepts to concepts of ontologies. Based on these foundations, declarative selection and modeling of data to be integrated as well as virtual views are supported, both tailored to user-specific information needs. Additionally, SIRUP end-users are shielded from low-level heterogeneity and technical details of underlying data sources due to the fact that data is pre-integrated on a conceptual level through IConcepts. In the beginning of the project, one major goal was to firm up and present the foundations of the SIRUP approach. Besides this, research was conducted concerning the aspect of ontology heterogeneity. In order to not constrain the set of applicable ontologies for making semantics of data explicit, different ontology languages are supported in SIRUP. However, languages to represent ontologies exist in large numbers so that they are nowadays themselves a source of heterogeneity. To cope with this problem, we developed and implemented SOQA, the SIRUP Ontology Query API, which is an ontology language- and platform-independent API for query access to ontological metadata and data that can be represented in a variety of ontology languages. Based on SOQA, we provide SOQA-QL, an SQL-like query language that supports declarative queries against ontological metadata and data, and the SOQA Browser, a tool to graphically inspect all ontology information that can be accessed through SOQA. Thus, a multitude of ontology languages can now be used for data content explication in SIRUP. In later projects phases, we investigated practical details of conflict resolution in SIRUP. On this foundation, we developed a semantic multidatasource language for data integration in SIRUP and experimented with the SIRUP prototype. Weitere Informationen |
||
| Publications / Publikationen | Patrick Ziegler, Klaus R. Dittrich, Ela Hunt: A Call for Personal Semantic Data Integration. In Workshop on Information Integration Methods, Architectures, and Systems (IIMAS 2008) (in conjunction with ICDE 2008), Cancun, Mexico, 2008.Patrick Ziegler: The SIRUP Approach to Personal Semantic Data Integration, Doctoral Thesis, University of Zurich, 2007.Patrick Ziegler and Klaus R. Dittrich: Data Integration — Problems, Approaches, and Perspectives. In John Krogstie, Andreas L. Opdahl, and Sjaak Brinkkemper, editors, Conceptual Modelling in Information Systems Engineering, pages 39–58. Springer, Berlin, 2007.Patrick Ziegler: Evaluation of SIRUP with the SIRUP Classification of Data Integration Conflicts. Technical Report ifi-2007.07, Department of Informatics, University of Zurich, 2007.Patrick Ziegler: Evaluation of SIRUP with the THALIA Benchmark for Data Integration Systems. Technical Report ifi-2007.08, Department of Informatics, University of Zurich, 2007.Patrick Ziegler, Christoph Kiefer, Christoph Sturm, Klaus R. Dittrich, and Abraham Bernstein: Generic Similarity Detection in Ontologies with the SOQA-SimPack Toolkit. In 2006 ACM SIGMOD International Conference on Management of Data (SIGMOD 2006), pages 751-753, Chicago, USA, June 26-29.Patrick Ziegler, Christoph Kiefer, Christoph Sturm, Klaus R. Dittrich, and Abraham Bernstein: Detecting Similarities in Ontologies with the SOQA-SimPack Toolkit. In Yannis Ioannidis, Marc H. Scholl, Joachim W. Schmidt, Florian Matthes, Mike Hatzopoulos, Klemens Boehm, Alfons Kemper, Torsten Grust, Christian Boehm, editors, 10th International Conference on Extending Database Technology (EDBT 2006), volume 3896 of Lecture Notes in Computer Science, pages 59-76, Munich, Germany, March 26-30. Springer.Patrick Ziegler, Christoph Sturm, and Klaus R. Dittrich: The SIRUP Ontology Query API in Action. In 10th International Conference on Extending Database Technology (EDBT 2006), Munich, Germany, March 26-30.Patrick Ziegler, Christoph Sturm, Klaus R. Dittrich: Unified Querying of Ontology Languages with the SIRUP Ontology Query API. In 11. GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web (BTW 2005), volume P-65 of Lecture Notes in Informatics, pages 325-344, Karlsruhe, Germany, March 2-4, 2005. Gesellschaft für Informatik.Patrick Ziegler and Klaus R. Dittrich. Three Decades of Data Integration - All Problems Solved? In René Jacquart, editor, 18th IFIP World Computer Congress (WCC 2004), volume 12, Building the Information Society, pages 3-12, Toulouse, France, August 22-27, 2004. Kluwer.Patrick Ziegler and Klaus R. Dittrich. User-Specific Semantic Integration of Heterogeneous Data: The SIRUP Approach. In Mokrane Bouzeghoub, Carole Goble, Vipul Kashyap, and Stefano Spaccapietra, editors, First International IFIP Conference on Semantics of a Networked World (ICSNW 2004), volume 3226 of Lecture Notes in Computer Science, pages 44-64, Paris, France, June 17-19, 2004. Springer.Patrick Ziegler. User-Specific Semantic Integration of Heterogeneous Data: What Remains to be Done? Technical Report ifi-2004.01, Department of Informatics, University of Zurich, 2004.Weitere Informationen |
||
| Project leadership and contacts / Projektleitung und Kontakte |
|
||
| Funding source(s) / Unterstützt durch |
Universität Zürich (position pursuing an academic career) |
||
| Duration of Project / Projektdauer | Jan 2002 to Dec 2008 |