Program

The ICDE program-at-a-glance is also available.

 

Sunday (April 1)

9:00am-5:30pm: Workshops
Studio F: Data-Driven Decision Guidance and Support Systems (DGSS)
Studio B: Self-Managing Database Systems (SMDB)
Studio D: Spatio Temporal data Integration and Retrieval (STIR)
Studio E: Data Engineering Meets the Semantic Web (DESWEB)
5:30pm-8pm (Salon 4567): Conference Reception

 

 

Monday (April 2)

9:00-10:00: Keynote 1 (Salon 4567): Serge Abiteboul — Viewing the Web as a Distributed Knowledge Base
10:00-10:30: Coffee break
10:30-12:00: Sessions 1-4, Seminar 1, Demo Group 1

Session 1 (Studio F): Privacy

Session Chair: Murat Kantarcioglu (UT Dallas)

Privacy in Social Networks: How Risky is Your Social Graph?
Cuneyt Gurcan Akcora (University of Insubria)
Barbara Carminati (University of Insubria)
Elena Ferrari (University of Insubria)

Differentially Private Spatial Decompositions
Graham Cormode (AT&T Labs – Research)
Cecilia Procopiuc (AT&T Labs – Research)
Entong Shen (North Carolina State University)
Divesh Srivastava (AT&T Labs – Research)
Ting Yu (North Carolina State University)

Differentially Private Histogram Publication
Jia Xu (Northeastern University, China)
Zhenjie Zhang (Advanced Digital Sciences Center, Illinois at Singapore Pte.)
Xiaokui Xiao (Nanyang Technological University)
Yin Yang (Advanced Digital Sciences Center, Illinois at Singapore Pte.)
Ge Yu (Northeastern University, China)

Privacy-Preserving and Content-Protecting Location Based Queries
Russell Paulet (Victoria University)
Md. Golam Kaosar (Victoria University)
Xun Yi (Victoria University)
Elisa Bertino (Purdue University)

Session 2 (Studio B): Web 2.0 Applications

Session Chair: Kyuseok Shim (SNU)

GeoFeed: A Location-Aware News Feed
Jie Bao (University of Minnesota at Twin Cities)
Mohamed F. Mokbel (University of Minnesota at Twin Cities)
Chi-Yin Chow (City University of Hong Kong)

Entity Search Strategies for Mashup Applications
Stefan Endrullis (University of Leipzig)
Andreas Thor (University of Leipzig)
Erhard Rahm (University of Leipzig)

CI-Rank: Ranking Keyword Search Results Based on Collective Importance
Xiaohui Yu (York University & Shandong University)
Huxia Shi (York University)

Temporal Analytics on Big Data for Web Advertising
Badrish Chandramouli (Microsoft Research)
Jonathan Goldstein (Microsoft Corporation)
Songyun Duan (IBM T. J. Watson Research Center)

Session 3 (Studio C): Storage Management

Session Chair: Alfons Kemper (TUM)

Lookup Tables: Fine-Grained Partitioning for Distributed Databases
Aubrey L. Tatarowicz (MIT)
Carlo Curino (MIT)
Evan P. C. Jones (MIT)
Sam Madden (MIT)

Temporal Support for Persistent Stored Modules
Richard T. Snodgrass (University of Arizona)
Dengfeng Gao (IBM Silicon Valley Lab)
Rui Zhang (University of Arizona)
Stephen W. Thomas (Queen’s University, Kingston)

Energy Efficient Storage Management Cooperated with Large Data Intensive Applications
Norifumi Nishikawa (The University of Tokyo)
Miyuki Nakano (The University of Tokyo)
Masaru Kitsuregawa (The University of Tokyo)

ISOBAR Preconditioner for Effective and High-throughput Lossless Data Compression
Eric R. Schendel (North Carolina State University)
Ye Jin (North Carolina State University)
Neil Shah (North Carolina State University)
Jackie Chen (Sandia National Laboratory)
C.S. Chang (Princeton Plasma Physics Laboratory)
Seung-Hoe Ku (New York University)
Stephane Ethier (Princeton Plasma Physics Laboratory)
Scott Klasky (Oak Ridge National Laboratory)
Robert Latham (Argonne National Laboratory)
Robert Ross (Argonne National Laboratory)
Nagiza F. Samatova (North Carolina State University & Oak Ridge National Laboratory)

Session 4 (Studio D): Data Streams Processing

Session Chair: Bugra Gedik (IBM)

Physically Independent Stream Merging
Badrish Chandramouli (Microsoft Research)
David Maier (Portland State University)
Jonathan Goldstein (Microsoft Corporation)

On Computing Correlated Aggregates over a Data Stream
Srikanta Tirthapura (Iowa State University)
David P. Woodruff (IBM Almaden Research Center)

Accuracy-Aware Uncertain Stream Databases
Tingjian Ge (University of Kentucky)
Fujun Liu (University of Kentucky)

On Discovery of Traveling Companions from Streaming Trajectories
Lu-An Tang (UIUC)
Yu Zheng (MSRA)
Jing Yuan (MSRA)
Jiawei Han (UIUC)
Alice Leung (BBN)
Chih-Chieh Hung (Yahoo!)
Wen-Chih Peng (NCTU)

 

Seminar 1 (Salon 123): Data Management Issues on the Semantic Web

Oktie Hassanzadeh (University of Toronto & IBM Research)
Anastasios Kementsietsidis (IBM Research)
Yannis Velegrakis (University of Trento)

[slides]

Demo group 1 (Studio E):

SMIX Live – A Self-Managing Index Infrastructure for Dynamic Workloads
Thomas Kissinger (Dresden University of Technology)
Hannes Voigt (Dresden University of Technology)
Wolfgang Lehner (Dresden University of Technology)

Multi-Query Stream Processing on FPGAs
Mohammad Sadoghi (University of Toronto)
Rohan Palaniappan (University of Toronto)
Rija Javed (University of Toronto)
Naif Tarafdar (University of Toronto)
Harsh Singh (University of Toronto)
Hans-Arno Jacobsen (University of Toronto)

EUDEMON: A System for Online Video Frame Copy Detection by Earth Mover Distance
Jia Xu (Northeastern University, China)
Qiushi Bai (Northeastern University, China)
Yu Gu (Northeastern University, China)
Anthony Tung (National University of Singapore)
Guoren Wang (Northeastern University, China)
Ge Yu (Northeastern University, China)
Zhenjie Zhang (Advanced Digital Sciences Center, Illinois at Singapore Pte.)

A Dataset Search Engine for the Research Document Corpus
Meiyu Lu (National University of Singapore)
Srinivas Bangalore (AT&T Research Labs)
Graham Cormode (AT&T Labs – Research)
Marios Hadjieleftheriou (AT&T Labs – Research)
Divesh Srivastava (AT&T Labs – Research)

AskFuzzy: Attractive Visual Fuzzy Query Builder
Keivan Kianmehr (University of Western Ontario)
Negar Koochakzadeh (University of Calgary)
Reda Alhajj (University of Calgary)

F2DB: The Flash-Forward Database System
Ulrike Fischer (Dresden University of Technology)
Frank Rosenthal (Dresden University of Technology)
Wolfgang Lehner (Dresden University of Technology)

Provenance-Based Debugging and Drill-Down in Data-Oriented Workflows
Robert Ikeda (Stanford University)
Junsang Cho (Stanford University)
Charlie Fang (Stanford University)
Semih Salihoglu (Stanford University)
Satoshi Torikai (Stanford University)
Jennifer Widom (Stanford University)

12:00 – 2:00pm (Salon 4567): Business Lunch & Award Ceremony
2:00pm – 3:30pm: Sessions 5-8, Seminar 2, Demo Group 2

Session 5 (Studio F): Graphs

Session Chair: Sameh Elnikety (Microsoft)

Iterative Graph Feature Mining for Graph Indexing
Dayu Yuan (Penn State University)
Prasenjit Mitra (Penn State University)
Huiwen Yu (Penn State University)
C. Lee Giles (Penn State University)

An Efficient Graph Indexing Method
Xiaoli Wang (National University of Singapore)
Xiaofeng Ding (Huazhong University of Science and Technology)
Anthony K.H. Tung (National University of Singapore)
Shanshan Ying (National University of Singapore)
Hai Jin (Huazhong University of Science and Technology)

PRAGUE: Towards Blending Practical Visual Subgraph Query Formulation and Query Processing
Changjiu Jin (Nanyang Technological University)
Sourav S. Bhowmick (Nanyang Technological University)
Byron Choi (Hong Kong Baptist University)
Shuigeng Zhou (Fudan University)

Ego-centric Graph Pattern Census
Walaa Eldin Moustafa (University of Maryland, College Park)
Amol Deshpande (University of Maryland, College Park)
Lise Getoor (University of Maryland, College Park)

Session 6 (Studio B): Uncertain and Probabilistic Databases

Session Chair: Elena Ferrari

Searching Uncertain Data Represented by Non-Axis Parallel Gaussian Mixture Models
Katrin Haegler (University of Munich)
Frank Fiedler (University of Munich)
Christian Boehm (University of Munich)

Aggregate Query Answering on Possibilistic Data with Cardinality Constraints
Graham Cormode (AT&T Labs – Research)
Entong Shen (North Carolina State University)
Divesh Srivastava (AT&T Labs – Research)
Ting Yu (North Carolina State University)

Discovering Threshold-based Frequent Closed Itemsets over Probabilistic Data
Yongxin Tong (Hong Kong University of Science and Technology)
Lei Chen (Hong Kong University of Science and Technology)
Bolin Ding (University of Illinois at Urbana-Champaign)

Ranking Query Results in Probabilistic Databases: Complexity and Efficient Algorithms
Dan Olteanu (University of Oxford)
Hongkai Wen (University of Oxford)

Session 7 (Studio C): Data Integration and Extraction

Session Chair: Daisy Zhe Wang (UFL)

Joint Entity Resolution
Steven Whang (Stanford University)
Hector Garcia-Molina (Stanford University)

A Self-Configuring Schema Matching System
Eric Peukert (SAP Research Dresden)
Julian Eberius (Dresden University of Technology)
Erhard Rahm (University of Leipzig)

Incremental Detection of Inconsistencies in Distributed Data
Wenfei Fan (University of Edinburgh)
Jianzhong Li (Harbin Institute of Technology)
Nan Tang (University of Edinburgh & Qatar Computing Research Institute)
Wenyuan Yu (University of Edinburgh)

Recomputing Materialized Instances after Changes to Mappings and Data
Todd J. Green (University of California, Davis)
Zachary G. Ives (University of Pennsylvania)

Session 8 (Studio D): Spatio-Temporal Data Management

Session Chair: Lei Chen (HKUST)

SWST: A Disk Based Index for Sliding Window Spatio-Temporal Data
Manish Singh (University of Michigan, Ann Arbor)
Qiang Zhu (University of Michigan, Dearborn)
H.V. Jagadish (University of Michigan, Ann Arbor)

Querying Uncertain Spatio-Temporal Data
Tobias Emrich (Ludwig-Maximilians-Universität München)
Hans-Peter Kriegel (Ludwig-Maximilians-Universität München)
Nikos Mamoulis (University of Hong Kong)
Matthias Renz (Ludwig-Maximilians-Universität München)
Andreas Züfle (Ludwig-Maximilians-Universität München)

The Min-dist Location Selection Query
Jianzhong Qi (University of Melbourne)
Rui Zhang (University of Melbourne)
Lars Kulik (University of Melbourne)
Dan Lin (Missouri University of Science and Technology)
Yuan Xue (University of Melbourne)

Bi-level Locality Sensitive Hashing for K-Nearest Neighbor Computation
Jia Pan (UNC Chapel Hill)
Dinesh Manocha (UNC Chapel Hill)

Seminar 2 (Salon 123): Discovering Multiple Clustering Solutions: Grouping Objects in Different Views of the Data

Emmanuel Müller (Karlsruhe Institute of Technology)
Stephan Günnemann (RWTH Aachen University)
Ines Färber (RWTH Aachen University)
Thomas Seidl (RWTH Aachen University)

[slides]

Demo group 2 (Studio E):

M^3: Stream Processing on Main-Memory MapReduce
Ahmed M. Aly (Purdue University)
Asmaa Sallam (Purdue University)
Bala M. Gnanasekaran (Purdue University)
Long-Van Nguyen-Dinh (Purdue University)
Walid G. Aref (Purdue University)
Mourad Ouzzani (Qatar Computing Research Institute)
Arif Ghafoor (Purdue University)

A Deep Embedding of Queries into Ruby
Torsten Grust (University of Tübingen)
Manuel Mayr (University of Tübingen)

Asking the Right Questions in Crowd Data Sourcing
Rubi Boim (Tel-Aviv University)
Ohad Greenshpan (Tel-Aviv University)
Tova Milo (Tel-Aviv University)
Slava Novgorodov (Tel-Aviv University)
Neoklis Polyzotis (University of California, Santa Cruz)
Wang-Chiew Tan (University of California, Santa Cruz)

LotusX: A Position-Aware XML Graphical Search System with Auto-Completion
Chunbin Lin (Renmin University of China)
Jiaheng Lu (Renmin University of China)
Tok Wang Ling (National University of Singapore)
Bogdan Cautis (Télécom ParisTech)

Efficient Top-k Keyword Search in Graphs with Polynomial Delay
Mehdi Kargar (York University)
Aijun An (York University)

TEDAS: a Twitter Based Event Detection and Analysis System
Rui Li (University of Illinois at Urbana-Champaign)
Kin Hou Lei (Brigham Young University)
Ravi Khadiwala (University of Illinois at Urbana-Champaign)
Kevin Chen-Chuan Chang (University of Illinois at Urbana-Champaign)

AutoDict: Automated Dictionary Discovery
Fei Chiang (University of Toronto)
Periklis Andritsos (University of Toronto)
Erkang Zhu (University of Toronto)
Renee Miller (University of Toronto)

3:30-4:00pm: Coffee Break
4:00pm – 5:30pm: Sessions 9-12, Seminar 3, Demo Group 3

Session 9 (Studio F): Query Processing

Session Chair: Walid G. Aref (Purdue)

Learning-based Query Performance Modeling and Prediction
Mert Akdere (Brown University)
Ugur Cetintemel (Brown University)
Matteo Riondato (Brown University)
Eli Upfal (Brown University)
Stanley B. Zdonik (Brown University)

Parametric Plan Caching Using Density-Based Clustering
Gunes Aluc (University of Waterloo)
David E. DeHaan (Sybase, an SAP Company)
Ivan T. Bowman (Sybase, an SAP Company)

Effective and Robust Pruning for Top-Down Join Enumeration Algorithms
Pit Fender (Mannheim University)
Guido Moerkotte (Mannheim University)
Thomas Neumann (Technical University of Munich)
Viktor Leis (Technical University of Munich)

Towards Preference-aware Relational Databases
Anastasios Arvanitis (National Technical University of Athens)
Georgia Koutrika (IBM Almaden Research Center)

Session 10 (Studio B): Location Aware Data Processing

Session Chair: Mohammad Sadoghi (University of Toronto)

A Foundation for Efficient Indoor Distance-Aware Query Processing
Hua Lu (Aalborg University)
Xin Cao (Nanyang Technological University)
Christian S. Jensen (Aarhus University)

LARS: A Location-Aware Recommender System
Justin J. Levandoski (Microsoft Research)
Mohamed Sarwat (University of Minnesota)
Ahmed Eldawy (University of Minnesota)
Mohamed F. Mokbel (University of Minnesota)

Approximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme
Miao Qiao (The Chinese University of Hong Kong)
Hong Cheng (The Chinese University of Hong Kong)
Lijun Chang (The Chinese University of Hong Kong)
Jeffrey Xu Yu (The Chinese University of Hong Kong)

Desks: Direction-Aware Spatial Keyword Search
Guoliang Li (Tsinghua University)
Jianhua Feng (Tsinghua University)
Jing Xu (Tsinghua University)

Session 11 (Studio C): Map-Reduce based Data Processing

Session Chair: Minqi Zhou (ECNU)

Extending Map-Reduce for Efficient Predicate-Based Sampling
Raman Grover (University of California, Irvine)
Michael Carey (University of California, Irvine)

Fuzzy Joins Using MapReduce
Foto Afrati (National Technical University of Athens)
Anish Das Sarma (Google, Inc. – work initiated at Yahoo! Research)
David Menestrina (Google, Inc.)
Aditya Parameswaran (Stanford University)
Jeffrey D. Ullman (Stanford University)

Parallel Top-K Similarity Join Algorithms Using MapReduce
Younghoon Kim (Seoul National University)
Kyuseok Shim (Seoul National University)

Load Balancing in MapReduce Based on Scalable Cardinality Estimates
Benjamin Gufler (Technische Universität München)
Nikolaus Augsten (Free University of Bolzano-Bozen)
Angelika Reiser (Technische Universität München)
Alfons Kemper (Technische Universität München)

Session 12 (Studio D): Social Media

Session Chair: Zack Ives (University of Pennsylvania)

Community Detection with Edge Content in Social Media Networks
Guo-Jun Qi (University of Illinois at Urbana-Champaign)
Charu C. Aggarwal (IBM T. J. Watson Research Center)
Thomas S. Huang (University of Illinois at Urbana-Champaign)

Cross Domain Search by Exploiting Wikipedia
Chen Liu (National University of Singapore)
Sai Wu (National University of Singapore)
Shouxu Jiang (Harbin Institute of Technology)
Anthony K.H. Tung (National University of Singapore)

Provenance-based Indexing Support in Micro-blog Platforms
Junjie Yao (Peking University)
Bin Cui (Peking University)
Zijun Xue (Peking University)
Qingyun Liu (Peking University)

Learning Stochastic Models of Information Flow
Luke Dickens (Imperial College London)
Ian Molloy (IBM T. J. Watson Research Center)
Jorge Lobo (IBM T. J. Watson Research Center)
Pau-Chen Cheng (IBM T. J. Watson Research Center)
Alessandra Russo (Imperial College London)

Seminar 3 (Salon 123): Detecting Clones, Copying and Reuse on the Web

Xin Luna Dong (AT&T Labs–Research)
Divesh Srivastava (AT&T Labs–Research)

[slides]

 

Demo group 3 (Studio E):

Trust & Share: Trusted Information Sharing in Online Social Networks
Barbara Carminati (University of Insubria)
Elena Ferrari (University of Insubria)
Jacopo Girardi (University of Insubria)

Evaluation of Clusterings – Metrics and Visual Support
Elke Achtert (Ludwig-Maximilians-Universität München)
Sascha Goldhofer (Ludwig-Maximilians-Universität München)
Hans-Peter Kriegel (Ludwig-Maximilians-Universität München)
Erich Schubert (Ludwig-Maximilians-Universität München)
Arthur Zimek (Ludwig-Maximilians-Universität München)

Horton: Online Query Execution Engine For Large Distributed Graphs
Mohamed Sarwat (University of Minnesota)
Sameh Elnikety (Microsoft Research)
Yuxiong He (Microsoft Research)
Gabriel Kliot (Microsoft Research)

MXQuery With Hardware Acceleration
Jens Teubner (ETH Zurich)
Peter Fischer (University of Freiburg)

Data^3 – A Kinect Interface for OLAP using Complex Event Processing
Steffen Hirte (Ilmenau University of Technology)
Andreas Seifert (Ilmenau University of Technology)
Stephan Baumann (Ilmenau University of Technology)
Daniel Klan (Ilmenau University of Technology)
Kai-Uwe Sattler (Ilmenau University of Technology)

Analyzing Query Optimization Process: Portraits of Join Enumeration Algorithms
Anisoara Nica (Sybase, an SAP Company)
Ian Charlesworth (University of Waterloo)
Maysum Panju (University of Waterloo)

DPCube: Releasing Differentially Private Data Cubes for Health Information
Yonghui Xiao (Emory University)
James Gardner (Digital Reasoning Systems Inc.)
Li Xiong (Emory University)

Evening-Night: Career Panel

 

 

Tuesday (April 3)

9:00-10:00: Keynote 2 (Salon 4567): Surajit Chaudhuri — How Different Is Big Data?
10:00-10:30: Coffee break
10:30-12:00: Sessions 13-15, Industrial Session 1, Seminar 4, Demo Group 4

Session 13 (Studio F): P2P and Distributed Processing

Session Chair: Guoliang Li (Tsinghua)

BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform
Gang Chen (NetEase.com Inc. & Zhejiang University)
Tianlei Hu (NetEase.com Inc. & Zhejiang University)
Dawei Jiang (National University of Singapore)
Peng Lu (National University of Singapore)
Kian-Lee Tan (National University of Singapore)
Hoang Tam Vo (National University of Singapore)
Sai Wu (BestPeer Pte. Ltd. & National University of Singapore)

Effective Data Density Estimation in Ring-based P2P Networks
Minqi Zhou (East China Normal University)
Heng Tao Shen (The University of Queensland)
Xiaofang Zhou (The University of Queensland)
Weining Qian (East China Normal University)
Aoying Zhou (East China Normal University)

Processing of Rank Joins in Highly Distributed Systems
Christos Doulkeridis (Norwegian University of Science and Technology (NTNU))
Akrivi Vlachou (Norwegian University of Science and Technology (NTNU))
Kjetil Nørvåg (Norwegian University of Science and Technology (NTNU))
Yannis Kotidis (Athens University of Economics and Business (AUEB))
Neoklis Polyzotis (UC Santa Cruz (UCSC))

Load Balancing for MapReduce-based Entity Resolution
Lars Kolb (University of Leipzig)
Andreas Thor (University of Leipzig)
Erhard Rahm (University of Leipzig)

Session 14 (Studio B): XML and RDF Data Management

Session Chair: Dan Olteanu (Oxford)

Mapping XML to a Wide Sparse Table
Liang Jeff Chen (UCSD)
Philip A. Bernstein (Microsoft Corp.)
Peter Carlin (Microsoft Corp.)
Dimitrije Filipovic (Microsoft Corp.)
Michael Rys (Microsoft Corp.)
Nikita Shamgunov (Facebook Inc.)
James F. Terwilliger (Microsoft Corp.)
Milos Todic (Microsoft Corp.)
Sasa Tomasevic (Microsoft Corp.)
Dragan Tomic (Microsoft Corp.)

Querying XML Data: As You Shape It
Curtis E. Dyreson (Utah State University)
Sourav S. Bhowmick (Nanyang Technological University)

Branch Code: A Labeling Scheme for Efficient Query Answering on Trees
Yanghua Xiao (Fudan University)
Ji Hong (Fudan University)
Wanyun Cui (Fudan University)
Zhenying He (Fudan University)
Wei Wang (Fudan University)
Guodong Feng (Fudan University)

Scalable Multi-Query Optimization for SPARQL
Wangchao Le (University of Utah)
Anastasios Kementsietsidis (IBM T. J. Watson Research Center)
Songyun Duan (IBM T. J. Watson Research Center)
Feifei Li (University of Utah)

Session 15 (Studio C): Performance

Session Chair: Eric Lo (Poly U., Hong Kong)

GSLPI: a Cost-based Query Progress Indicator
Jiexing Li (University of Wisconsin-Madison)
Rimma V. Nehme (Microsoft Jim Gray Systems Lab)
Jeffrey Naughton (University of Wisconsin-Madison)

Micro-Specialization in DBMSes
Rui Zhang (University of Arizona)
Richard T. Snodgrass (University of Arizona)
Saumya Debray (University of Arizona)

Towards Multi-Tenant Performance SLOs
Willis Lang (University of Wisconsin-Madison)
Srinath Shankar (Microsoft Jim Gray Systems Lab)
Jignesh M. Patel (University of Wisconsin-Madison)
Ajay Kalhan (Microsoft Corp.)

Multi-Version Concurrency via Timestamp Range Conflict Management
David Lomet (Microsoft Research)
Alan Fekete (University of Sydney)
Rui Wang (Microsoft Research)
Peter Ward (University of Sydney)

Industrial Session 1 (Studio D): Support for Large Scale Data Analytics

Session Chair: Arbee L.P. Chen (National Chengchi University)

Exploiting Common Subexpressions for Cloud Query Processing
Yasin N. Silva (Arizona State University)
Per-Ake Larson (Microsoft Research)
Jingren Zhou (Microsoft Corp.)

Vectorwise: a Vectorized Analytical DBMS
Marcin Zukowski (Actian Netherlands)
Mark van de Wiel (Actian Corp.)
Peter Boncz (CWI)

Scalable and Numerically Stable Descriptive Statistics in SystemML
Yuanyuan Tian (IBM Almaden Research Center)
Shirish Tatikonda (IBM Almaden Research Center)
Berthold Reinwald (IBM Almaden Research Center)

 

Seminar 4 (Salon 123): Mining Knowledge from Data: An Information Network Analysis Approach

Jiawei Han (University of Illinois at Urbana-Champaign)
Yizhou Sun (University of Illinois at Urbana-Champaign)
Xifeng Yan (University of California at Santa Barbara)
Philip S. Yu (University of Illinois at Chicago)

[slides]

 

Demo Group 4 (Studio E)

Nyaya: a System Supporting the Uniform Management of Large Sets of Semantic Data
Roberto De Virgilio (Università Roma Tre)
Giorgio Orsi (University of Oxford)
Letizia Tanca (Politecnico di Milano)
Riccardo Torlone (Università Roma Tre)

R2DB: A System for Querying and Visualizing Weighted RDF Graphs
Songling Liu (Arizona State University)
Juan Cedeno (Arizona State University)
Selcuk Candan (Arizona State University)
Maria Luisa Sapino (University of Turin)
Shengyu Huang (Arizona State University)
Xinsheng Li (Arizona State University)

Project Daytona: Data Analytics as a Cloud Service
Roger Barga (Microsoft)
Jaliya Ekanayake (Microsoft Research)
Wei Lu (Microsoft Research)

Interactive User Feedback in Ontology Matching Using Signature Vector
Isabel Cruz (University of Illinois at Chicago)
Cosmin Stroe (University of Illinois at Chicago)
Matteo Palmonari (University of Milano-Bicocca)

DObjects+: Enabling Privacy-Preserving Data Federation Services
Pawel Jurczyk (Google Inc.)
Li Xiong (Emory University)
Slawomir Goryczka (Emory University)

Dragoon: An Information Accountability System for High-Performance Databases
Kyriacos Pavlou (University of Arizona)
Richard Snodgrass (University of Arizona)

Intuitive Interaction With Encrypted Query Execution in DataStorm
Ken Smith (MITRE)
Ameet Kini (MITRE)
William Wang (MITRE)
Chris Wolf (MITRE)
M. David Allen (MITRE)
Andrew Sillers (MITRE)

12:00-2:00pm: Funders Session and Lunch (Salon 4567) 

Moderator: Dr. Frank Olken

Dr. Le Gruenwald (National Science Foundation)
Dr. Ceren Susut (Dept. of Energy)
Dr. Peter Lyster (National Institutes of Health)


2:00pm-3:30pm: Sessions 16-17, Industrial Session 2, Seminar 5, Panel, Demo Group 1

Session 16 (Studio F): Data Extraction and Quality

Session Chair: Anish Das Sarma

Automatic Extraction of Structured Web Data with Domain Knowledge
Nora Derouiche (Télécom ParisTech – CNRS LTCI)
Bogdan Cautis (Télécom ParisTech – CNRS LTCI)
Talel Abdessalem (Télécom ParisTech – CNRS LTCI)

Discovering Conservation Rules
Lukasz Golab (University of Waterloo)
Howard Karloff (AT&T Labs–Research)
Flip Korn (AT&T Labs–Research)
Barna Saha (AT&T Labs–Research)
Divesh Srivastava (AT&T Labs–Research)

Answering Why-not Questions on Top-k Queries
Zhian He (Hong Kong Polytechnic University)
Eric Lo (Hong Kong Polytechnic University)

An Efficient Trie-based Method for Approximate Entity Extraction with Edit-Distance Constraints
Dong Deng (Tsinghua University)
Guoliang Li (Tsinghua University)
Jianhua Feng (Tsinghua University)

Session 17 (Studio B): Top-K Processing

Session Chair: Tingjian Ge (UKY)

On Top-k Structural Similarity Search
Pei Lee (University of British Columbia)
Laks V.S. Lakshmanan (University of British Columbia)
Jeffrey Xu Yu (Chinese University of Hong Kong)

Relevance Matters: Capitalizing on Less (Top-k Matching in Publish/Subscribe)
Mohammad Sadoghi (University of Toronto)
Hans-Arno Jacobsen (University of Toronto)

Efficiently Monitoring Top-k Pairs over Sliding Windows
Zhitao Shen (UNSW)
Muhammad Aamir Cheema (UNSW)
Xuemin Lin (UNSW & ECNU)
Wenjie Zhang (UNSW)
Haixun Wang (Microsoft Research Asia)

Processing and Notifying Range Top-k Subscriptions
Albert Yu (Duke University)
Pankaj K. Agarwal (Duke University)
Jun Yang (Duke University)

Industrial Session 2 (Studio C): Evolving Platforms for New Applications

Session Chair: Rui Zhang (Melbourne)

Earlybird: Real-Time Search at Twitter
Michael Busch (Twitter)
Krishna Gade (Twitter)
Brian Larson (Twitter)
Patrick Lok (Twitter)
Samuel Luckenbill (Twitter)
Jimmy Lin (Twitter)

Data Infrastructure at LinkedIn
LinkedIn Data Infrastructure Team

The Credit Suisse Meta-data Warehouse
Claudio Jossen (Credit Suisse AG)
Lukas Blunschi (ETH Zurich)
Magdalini Mori (Credit Suisse AG)
Donald Kossmann (ETH Zurich)
Kurt Stockinger (Credit Suisse AG)

Panel (Studio D): The Future of Scientific Data Bases

Moderator: Michael Stonebraker (MIT)
Panelists:
Anastasia Ailamaki (EPFL)
Jeremy Kepner (MIT)
Alex Szalay (Johns Hopkins University)

Seminar 5 (Salon 123): Emerging Graph Queries In Linked Data

Arijit Khan (University of California, Santa Barbara)
Yinghui Wu (University of California, Santa Barbara)
Xifeng Yan (University of California, Santa Barbara)

[slides]

 

Demo group 1 (Studio E)

See “Demo Group 1” listing above

3:30pm-4:00pm: Coffee Break
4:00pm-5:00pm: Poster Session, all papers (Salon 4567)
5:30pm: Departure for conference banquet (bus leaves hotel)

 

 

Wednesday (April 4)

9:00-10:00: Keynote 3 (Salon 4567): Peter Druschel — Accountability and Trust in Cooperative Information Systems
10:00-10:30: Coffee Break
10:30-12:00: Sessions 18-20, Industrial Session 3, Seminar 6, Demo Group 2

Session 18 (Studio F): Similarity

Session Chair: Matthias Renz (LMU)

Efficient Exact Similarity Searches using Multiple Token Orderings
Jongik Kim (Chonbuk National University)
Hongrae Lee (Google Inc.)

Efficient Graph Similarity Joins with Edit Distance Constraints
Xiang Zhao (The University of New South Wales & NICTA)
Chuan Xiao (The University of New South Wales)
Xuemin Lin (The University of New South Wales & East China Normal University)
Wei Wang (The University of New South Wales)

Parameter-Free Determination of Distance Thresholds for Metric Distance Constraints
Shaoxu Song (Tsinghua University)
Lei Chen (The Hong Kong University of Science and Technology)
Hong Cheng (The Chinese University of Hong Kong)

Random Error Reduction in Similarity Search on Time Series: A Statistical Approach
Wush Chi-Hsuan Wu (Academia Sinica)
Mi-Yen Yeh (Academia Sinica)
Jian Pei (Simon Fraser University)

Session 19 (Studio B): Text and Strings

Session Chair: Feifei Li (Utah)

Optimizing Statistical Information Extraction Programs Over Evolving Text
Fei Chen (HP Labs China)
Xixuan Feng (University of Wisconsin-Madison)
Christopher Re (University of Wisconsin-Madison)
Min Wang (HP Labs China)

Approximate String Membership Checking: A Multiple Filter, Optimization-Based Approach
Chong Sun (University of Wisconsin-Madison)
Jeffrey F. Naughton (University of Wisconsin-Madison)
Siddharth Barman (University of Wisconsin-Madison)

On Text Clustering with Side Information
Charu C. Aggarwal (IBM T. J. Watson Research Center)
Yuchen Zhao (University of Illinois at Chicago)
Philip S. Yu (University of Illinois at Chicago)

Fast SLCA and ELCA Computation for XML Keyword Queries based on Set Intersection
Junfeng Zhou (Yanshan University)
Zhifeng Bao (National University of Singapore)
Wei Wang (The University of New South Wales)
Tok Wang Ling (National University of Singapore)
Ziyang Chen (Yanshan University)
Xudong Lin (Yanshan University)
Jingfeng Guo (Yanshan University)

Session 20 (Studio C): Query Processing II

Session Chair: Volker Markl (TUM)

Optimization of Massive Pattern Queries by Dynamic Configuration Morphing
Nikolay Laptev (University of California, Los Angeles)
Carlo Zaniolo (University of California, Los Angeles)

Three-level Processing of Multiple Aggregate Continuous Queries
Shenoda Guirguis (University of Pittsburgh)
Mohamed A. Sharaf (The University of Queensland)
Panos K. Chrysanthis (University of Pittsburgh)
Alexandros Labrinidis (University of Pittsburgh)

Accelerating Range Queries For Brain Simulations
Farhan Tauheed (EPFL)
Laurynas Biveinis (Aalborg University)
Thomas Heinis (EPFL)
Felix Schürmann (EPFL)
Henry Markram (EPFL)
Anastasia Ailamaki (EPFL)

Keyword Query Reformulation on Structured Data
Junjie Yao (Peking University)
Bin Cui (Peking University)
Liansheng Hua (Peking University)
Yuxin Huang (Peking University)

Industrial Session 3 (Studio D): Indexing, Updates and Processing

Session Chair: Adina Crainiceanu (United States Naval Academy)

Efficient Support of XQuery Update Facility in XML Enabled RDBMS
Zhen Hua Liu (Oracle)
Hui Chang (Oracle)
Balasubramanyam Sthanikam (Oracle)

Making Unstructured Data SPARQL Using Semantic Indexing in Oracle Database
Souripriya Das (Oracle)
Seema Sundara (Oracle)
Matthew Perry (Oracle)
Jagannathan Srinivasan (Oracle)
Jayanta Banerjee (Oracle)
Aravind Yalamanchi (Oracle)

A meta-language for MDX queries in eLog Business Solution
Sonia Bergamaschi (University of Modena and Reggio Emilia)
Matteo Interlandi (University of Modena and Reggio Emilia)
Mario Longo (eBilling S.p.A.)
Laura Po (University of Modena and Reggio Emilia)
Maurizio Vincini (University of Modena and Reggio Emilia)

Seminar 6 (Salon 123): Boolean Matrix Decomposition Problem: Theory, Variations and Applications to Data Engineering

Jaideep Vaidya (Rutgers University)

[slides]

 

Demo Group 2 (Studio E):

See “Demo Group 2” listing above

12:00-2:00pm: Lunch (provided by Conference, Salon 4567)
2:00pm-3:30pm: Sessions 21-23, Demo Group 3

Session 21 (Studio F): Data Mining

Session Chair: Anthony Tung (National University of Singapore)

Predicting Approximate Protein-DNA Binding Cores Using Association Rule Mining
Po-Yuen Wong (The Chinese University of Hong Kong)
Tak-Ming Chan (The Chinese University of Hong Kong)
Man-Hon Wong (The Chinese University of Hong Kong)
Kwong-Sak Leung (The Chinese University of Hong Kong)

Upgrading Uncompetitive Products Economically
Hua Lu (Aalborg University)
Christian S. Jensen (Aarhus University)

Attribute-Based Subsequence Matching and Mining
Yu Peng (The Hong Kong University of Science and Technology)
Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology)
Liangliang Ye (The Hong Kong University of Science and Technology)
Philip S. Yu (University of Illinois at Chicago)

Integrating Frequent Pattern Mining from Multiple Data Domains for Classification
Dhaval Patel (National University of Singapore)
Wynne Hsu (National University of Singapore)
Mong Li Lee (National University of Singapore)

Session 22 (Studio B): Scientific Data, Analysis and Visualization

Session Chair: Christopher Re (WISC)

Efficient Versioning for Scientific Array Databases
Adam Seering (MIT CSAIL)
Philippe Cudre-Mauroux (University of Fribourg)
Samuel Madden (MIT CSAIL)
Michael Stonebraker (MIT CSAIL)

Multidimensional Analysis of Atypical Events in Cyber-Physical Data
Lu-An Tang (UIUC)
Xiao Yu (UIUC)
Sangkyum Kim (UIUC)
Jiawei Han (UIUC)
Wen-Chih Peng (National Chiao Tung University)
Yizhou Sun (UIUC)
Hector Gonzalez (Google)
Sebastian Seith (Morning Star)

HiCS: High Contrast Subspaces for Density-Based Outlier Ranking
Fabian Keller (Karlsruhe Institute of Technology)
Emmanuel Müller (Karlsruhe Institute of Technology)
Klemens Böhm (Karlsruhe Institute of Technology)

Extracting, Analyzing and Visualizing Triangle K-Core Motifs within Networks
Yang Zhang (The Ohio State University)
Srinivasan Parthasarathy (The Ohio State University)

Session 23 (Studio D): Similarity Search and Detection

Session Chair: Xuemin Lin (UNSW)

Horizontal Reduction: Instance-Level Dimensionality Reduction for Similarity Search in Large Document Databases
Min Soo Kim (KAIST)
Kyu-Young Whang (KAIST)
Yang-Sae Moon (Kangwon National University)

Adaptive Windows for Duplicate Detection
Uwe Draisbach (Hasso-Plattner-Institute)
Felix Naumann (Hasso-Plattner-Institute)
Sascha Szott (Zuse Institute)
Oliver Wonneberg (R. Lindner GmbH & Co. KG)

Efficient Dual-Resolution Layer Indexing for Top-k Queries
Jongwuk Lee (Pohang University of Science and Technology (POSTECH))
Hyunsouk Cho (Pohang University of Science and Technology (POSTECH))
Seung-won Hwang (Pohang University of Science and Technology (POSTECH))

Evaluating Probabilistic Queries over Uncertain Matching
Reynold Cheng (The University of Hong Kong)
Jian Gong (The University of Hong Kong)
David W. Cheung (The University of Hong Kong)
Jiefeng Cheng (Shenzhen Institute of Advanced Technology)

Demo Group 3 (Studio E)

See “Demo Group 3” listing above

3:30-4:00: Coffee Break
4:00-5:30: Sessions 24-25, Demo Group 4

Session 24 (Studio B): Sensors Network and Trajectory

Session Chair: Flip Korn (AT&T)

Detecting Outliers in Sensor Networks using the Geometric Approach
Sabbas Burdakis (Technical University of Crete)
Antonios Deligiannakis (Technical University of Crete)

Efficient Threshold Monitoring for Distributed Probabilistic Data
Mingwang Tang (University of Utah)
Feifei Li (University of Utah)
Jeff M. Phillips (University of Utah)
Jeffrey Jestes (University of Utah)

Incorporating Duration Information for Trajectory Classification
Dhaval Patel (National University of Singapore)
Chang Sheng (DBS Bank)
Wynne Hsu (National University of Singapore)
Mong Li Lee (National University of Singapore)

Reducing Uncertainty of Low-Sampling-Rate Trajectories
Kai Zheng (The University of Queensland)
Yu Zheng (Microsoft Research Asia)
Xing Xie (Microsoft Research Asia)
Xiaofang Zhou (The University of Queensland)

Session 25 (Studio D): Error Reduction and Data Security

Session Chair: Graham Cormode (AT&T)

Efficient Similarity Search over Encrypted Data
Mehmet Kuzu (The University of Texas at Dallas)
Mohammad Saiful Islam (The University of Texas at Dallas)
Murat Kantarcioglu (The University of Texas at Dallas)

Obfuscating the Topical Intention in Enterprise Text Search
HweeHwa Pang (Singapore Management University)
Xiaokui Xiao (Nanyang Technological University)
Jialie Shen (Singapore Management University)

Correlation Support for Risk Evaluation in Databases
Katrin Eisenreich (SAP Research)
Jochen Adamek (Technische Universität Berlin)
Philipp Rösch (SAP Research)
Volker Markl (Technische Universität Berlin)
Gregor Hackenbroich (SAP Research)

A Game-Theoretic Approach for High-Assurance of Data Trustworthiness in Sensor Networks
Hyo-Sang Lim (Purdue University & Computer and Telecommunications Engineering Division, South Korea)
Gabriel Ghinita (University of Massachusetts at Boston)
Elisa Bertino (Purdue University)
Murat Kantarcioglu (University of Texas at Dallas)

Demo Group 4 (Studio E)

See “Demo Group 4” listing above

 

Thursday (April 5)

9-5:30pm Workshops

Studio B: Data Management in the Cloud (DMC)
Studio D: Graph Data Management: Techniques and Applications (GDM)
Studio F: Secure Data Management on Smartphones and Mobiles (SDMSM)