Keynote 1 (Monday April 2)
Viewing the Web as a Distributed Knowledge Base
Serge Abiteboul (Professor at Collège de France and Senior researcher
at INRIA Saclay)
Abstract: Information of interest may be found on the Web in a variety of forms, in many systems, and with different access protocols. A typical user may have information on many devices (smartphone, laptop, TV box, etc.), many systems (mailers, blogs, Web sites, etc.), many social networks (Facebook, Picasa, etc.). This same user may have access to more information from family, friends, associations, companies, and organizations. Today, the control and management of the diversity of data and tasks in this setting are beyond the skills of casual users. Facing similar issues, companies see the cost of managing and integrating information skyrocketing.
We are interested here in the management of such data. Our focus is not on harvesting all the data of a particular user or a group of users and then managing it in a centralized manner. Instead, we are concerned with the management of Web data in place in a distributed manner, with a possibly large number of autonomous, heterogeneous systems collaborating to support certain tasks.
Our thesis is that managing the richness and diversity of user-centric data residing on the Web can be tamed using a holistic approach based on a distributed knowledge base. All Web informations are represented as logical facts, and Web data management tasks as logical rules. We discuss Webdamlog, a variant of datalog for distributed data management that we use for this purpose. The automatic reasoning povided by its inference engine, operating over the Web knowledge base, greatly benefits a variety of complex data management tasks that currently require intense work and deep expertise.
This work is part of the Webdam European project, and is joint work with Emilien Antoine (INRIA Saclay, France) and Julia Stoyanovich (University of Pennsylvania,
Bio: Serge Abiteboul, Telecom Paris, PhD computer science, USC Los Angeles, and Thèse d’Etat, University of Paris Sud. He has held professor positions at Stanford and Ecole Polytechnique. He is one of the co-authors of Foundations of Databases, and, recently, of Web Data Management. He co-founded in 2000 a start-up, named Xyleme. He received the 1998 ACM SIGMOD Innovation Award. He has been program chair of a number of conferences including ACM PODS-95, ICALP-94, ICDT-90, ECDL-99 and VLDB-09, ICDE-11, track of WWW-12. He has been awarded in 2008 an ERC Advanced Grant, namely Webdam, on Foundations of Web Data Management. He is a member of the French Academy of Sciences since 2008.
Session Chair: Evaggelia Pitoura
Keynote 2 (Tuesday April 3)
How Different Is Big Data?
Surajit Chaudhuri (Microsoft Corp)
Abstract: One buzzword that has been popular in the last couple of years is Big Data. In simplest terms, Big Data symbolizes the aspiration to build platforms and tools to ingest, store and analyze data that can be voluminous, diverse, and possibly fast changing. In this talk, I will try to reflect on a few of the technical problems presented by the exploration of Big Data. Some of these challenges in data analytics have been addressed by our community in the past in a more traditional relational database context but only with mixed results. I will review these quests and study some of the key lessons learned. At the same time, significant developments such as the emergence of cloud infrastructure and availability of data rich web services hold the potential for transforming our industry. I will discuss the unique opportunities they present for Big Data Analytics.
Bio: Surajit Chaudhuri is a Distinguished Scientist at Microsoft research. His current areas of interest are enterprise data analytics, self-manageability and multi-tenant technology for cloud database services. Surajit is an ACM Fellow, a recipient of the ACM SIGMOD Edgar F. Codd Innovations Award, ACM SIGMOD Contributions Award, and a VLDB 10 year Best Paper Award. Surajit received his Ph.D. from Stanford University and B.Tech from the Indian Institute of Technology, Kharagpur.
Session Chair: Beng Chin Ooi
Keynote 3 (Wednesday April 4)
Accountability and Trust in Cooperative Information Systems
Peter Druschel (Max Planck Institute for Software Systems (MPI-SWS) Kaiserslautern and Saarbrücken, Germany)
Abstract: Cooperation and trust play an increasingly important role in today’s information systems. For instance, peer-to-peer systems like BitTorrent, Sopcast and Skype are powered by resource contributions from participating users; federated systems like the Internet have to respect the interests, policies and laws of participating organizations and countries; in the Cloud, users entrust their data and computation to third-part infrastructure.
In this talk, we consider accountability as a way to facilitate transparency and trust in cooperative systems. We look at practical techniques to account for the integrity of distributed, cooperative computations, and look at some of the difficulties and open problems in accountability.
This talk describes joint work with Paarijaat Aditya, Ioan- nis Avramopoulos, Michael Backes, Andreas Haeberlen, Petr Kuznetsov, Yin Lin, Bruce Maggs, Jennifer Rexford, Rodrigo Rodrigues, Dominique Unruh, Bill Wishon and Mingchen Zhao.
Bio: Peter Druschel is the founding director of the Max Planck Institute for Software Systems (MPI-SWS) in Germany. Previ- ously, he was a Professor of Computer Science and Electrical and Computer Engineering at Rice University in Houston, Texas. He received the Dipl-Ing. (FH) in Data Systems Engi- neering from Fachhochschule Munich, Germany in 1986 and the Ph.D. degree in Computer Science from the University of Arizona in 1994. His research interests include distributed systems and operating systems. He is the recipient of an NSF CAREER Award, Alfred P. Sloan Fellowship and the ACM SIGOPS Mark Weiser Award, and a member of Academia Europaea and the German Academy of Sciences Leopoldina.
Session Chair: Johannes Gehrke