HILDA 2018: Workshop on Human-In-the-Loop Data Analytics

Introduction

Any data management system must work together with people: their needs determine the system's goals, they provide its input, and they must work effectively with its output. Data management systems work much better when they take account of the cognitive and physiological characteristics of the people involved. Recent technology trends (such as touch screens, motion detection, and voice recognition) are widening the possibilities for users to interact with systems, and many information-provision industries are shifting to personalized processing to better target their services to the users’ wishes. HILDA is a workshop that allows researchers and practitioners to exchange ideas and results on how data management can be done with awareness of the people who form part of the processes. Topics in the spirit of this workshop include, but are not limited to: novel query interfaces, interactive query refinement, data exploration and analysis, data visualization, human-assisted data integration and cleaning, perception-aware data processing, database systems designed for highly interactive use cases, empirical studies of database use, and crowd-powered data infrastructure.

HILDA intends to be a forum where people from varied communities engage with one another's ideas. We are keen to have submissions that present initial ideas and visions, just as much as reports on early results, or reflections on completed projects. The workshop will focus on discussion and interaction, rather than static presentations of what is in the paper.

Submission

Authors are invited to submit either of the following: (1) original, unpublished research papers that are not being considered for publication in any other forum, which will be published in the HILDA proceedings; or (2) abstracts of previous work presented in other venues, ideally outside the database community.

Abstracts will be considered for discussion at the workshop only and will not be included in the HILDA proceedings. Abstract submissions must be marked in the title as (Abstract). Research papers must follow the latest ACM Proceedings format (2017) and should be between four and six pages in length, excluding references. Abstracts should be at most one page long.

Submissions will be handled through EasyChair.

Keynote Speakers

Jock D. Mackinlay, VP Research & Design, Tableau Software: Everyone Needs to Work with Data

Panelists (Pending)

We are excited to host a panel of visualization and database researchers to discuss directions where the two communities can collaborate: visualization and HCI research that can help solve hard database problems, and database research that can benefit hard visualization systems problems.

Polo Chau (Georgia Tech)
Aditya Parameswaran (UIUC)
Surajit Chaudhuri (Microsoft)
Leilani Battle (UMD, College Park)
Carlos Scheidegger (U of Arizona)

Proceedings

All HILDA papers are available in the ACM DL for a one-year period from the conference start date.

Important Dates

Workshop Date: June 10, 2018
(updated) Submissions (Research Papers): March 25 23, 2018, 11:59PM US EDT
Submissions (Abstract-only): April 6, 2018, 11:59PM US EDT
Notification of outcome: April 16, 2018, 11:59PM US EDT (updated from April 13, 2018; before SIGMOD early registration deadline)
Camera-ready due: April 27, 2018 11:59PM US EDT

Program Chairs

Carsten Binnig (TU Darmstadt, co-chair)
Juliana Freire (New York University, co-chair)
Eugene Wu (Columbia University, co-chair)

Program Committee

Aditya Parameswaran, University of Illinois
Adriane Chapman, University of Southampton
Anil Bahuman, Reliance Industries
Anushka Anand, Tableau Software
Arnab Nandi, The Ohio State University
Beth Trushkowsky, Harvey Mudd College
Brian Lim, NUS Singapore
Carl-Christian Kanne, Platfora
Carlos Scheidegger, University of Arizona
Chris Re, Stanford University
Dafna Shahaf, Hebrew University Jerusalem
Danyel Fisher, Microsoft Research
Dana Groff, MongoDB
Dominik Moritz, University of Washington
Enrico Bertini, New York University
Giorgio Caviglia, Trifacta Inc
Guoliang Li, Tsinghua University
Harish Doraiswamy, NYU Data Science Center
Iddo Drori, NYU and Columbia University
James Terwilliger, Microsoft Research
Jessica Hullman, University of Washington
Joseph M. Hellerstein, UC Berkeley
Leilani Battle, University of Maryland, College Park
Martin Kersten, CWI
Olga Papaemmanouil, Brandeis University
Oliver Kennedy, University at Buffalo
Patrick Olivier, Newcastle University
Remco Chang, Tufts University
Rick Cole, Tableau Software
Stratos Idreos, Harvard University
Thibault Sellam, Columbia University
Tim Kraska, Brown University
Tiziana Catarci, Sapienza Università di Roma
Yunyao Li, IBM Research

Steering Committee

Alan Fekete (University of Sydney)
Laura Haas (IBM Research)
Joseph M. Hellerstein (University of California, Berkeley)
Arnab Nandi (The Ohio State University)
Aditya Parameswaran (University of Illinois)

8:00-8:10 Opening

8:10-9:00 Keynote

9:00-10:30 Session 1: Visual Interactive Exploration (Session Chair: Alan Fekete)

Human-in-the-Loop Data Analysis: A Personal Perspective. Anhai Doan (University of Wisconsin-Madison)
ViDeTTe Interactive Notebooks. Costas Zarifis (University of California San Diego)
Towards a Unified Representation of Insight in Human-in-the-Loop Analytics: A User Study. Eser Kandogan (IBM) and Ulrich Engelke (CSIRO)
Evaluating Visual Data Analysis Systems: A Discussion Report. Leilani Battle (University of Washington), Marco Angelini (Sapienza University of Rome), Carsten Binnig (TU Darmstadt), Tiziana Catarci (Sapienza University of Rome), Philipp Eichmann (Brown University), Jean-Daniel Fekete (INRIA), Giuseppe Santucci (Sapienza University of Rome), Michael Sedlmair (Jacobs University Bremen) and Wesley Willett (University of Calgary)
(Vis Abstract) Visual Exploration of Deep Learning Models in Industry. Minsuk Kahng (Georgia Institute of Technology), Pierre Andrews (Facebook), Aditya Kalro (Facebook) and Duen Horng Chau (Georgia Institute of Technology)

10:30-11:00 Coffee Break (Location: Houston Foyer)

11:10-12:30 Session 2: Engines and Languages (Session Chair: Oliver Kennedy)

DIVE: A Mixed-Initiative System Supporting Integrated Data Exploration Workflows. Kevin Hu (Massachusetts Institute of Technology), Diana Orghian (Massachusetts Institute of Technology) and César Hidalgo (Massachusetts Institute of Technology)
Querying Videos Using DNN Generated Labels by Designing DSL driven UIs. Wu Yifan (University of California, Berkeley), Steven Drucker (Microsoft), Matthai Philipose (Microsoft) and Lenin Ravindranath (Microsoft)
Optimally Leveraging Density and Locality for Exploratory Browsing and Sampling. Albert Kim (Massachusetts Institute of Technology), Liqi Xu (University of Illinois Urbana-Champaign), Tarique Siddiqui (University of Illinois Urbana-Champaign), Silu Huang (University of Illinois Urbana-Champaign), Samuel Madden (Massachusetts Institute of Technology) and Aditya Parameswaran (University of Illinois Urbana-Champaign)
Source Selection Languages: A Usability Evaluation. Ixent Galpin (Universidad Jorge Tadeo Lozano), Edward Abel (The University of Manchester) and Norman Paton (The University of Manchester)
Provenance for Interactive Visualizations. Fotis Psallidas (Columbia University) and Eugene Wu (Columbia University)
(Vis Abstract) A Visual Interaction Framework for Dimensionality Reduction Based Data Exploration. Marco Cavallo (IBM) and Cagatay Demiralp (IBM)

12:30-14:00 Lunch (Location: Houston 123)

14:00-15:30 Session 3: Data Curation & Quality (Session Chair: Carlos Scheidegger)

Beaver: Towards a Declarative Schema Mapping. Zhongjun Jin (University of Michigan), Christopher Baik (University of Michigan), Michael Cafarella (University of Michigan) and H. V. Jagadish (University of Michigan)
SchemaDrill: Interactive Semi-Structured Schema Design. William Spoth (University at Buffalo), Ting Xie (University at Buffalo), Oliver Kennedy (University at Buffalo), Dieter Gawlick (Oracle), Ying Yang (Oracle), Zhen Hua-Liu (Oracle) and Beda Hammerschmidt (Oracle)
What Type of a Matcher Are You? - Human and Algorithmic Matchers Coordination. Roee Shraga (Technion - Israel Institute of Technology), Avigdor Gal (Technion - Israel Institute of Technology) and Haggai Roitman (IBM Research - AI)
Draining the Data Swamp: A Similarity-based Approach. Will Brackenbury (University of Chicago), Rui Liu (University of Chicago), Mainack Mondal (University of Chicago), Aaron Elmore (University of Chicago), Blase Ur (University of Chicago), Kyle Chard (University of Chicago) and Michael Franklin (University of Chicago)
(Vis Abstract) Interactive Visual Analytics for Simpson’s Paradox Detection. Chenguang Xu (University of Oklahoma), Sarah Brown (University of California, Berkeley), Christan Grant (University of Oklahoma) and Chris Weaver (University of Oklahoma)

15:30-16:30 Poster Session (Location: Houston 56)

16:30-17:30 Panel (Moderator: Eugene Wu)

17:30-18:00 Closing Remarks

Contact

For questions, email us at hildadbworkshop@gmail.com.

Join us on Twitter.