SIGMOD 2020: Accepted Research Papers
(144 papers, both regular and short, in no particular order)- QUAD: Quadratic-Bound-based Kernel Density VisualizationTsz Nam Chan (The University of Hong Kong), Reynold Cheng (The University of Hong Kong), Man Lung Yiu (The Hong Kong Polytechnic University)
- Densely Connected User Community and Location Cluster Search in Location-Based Social NetworksJunghoon Kim (Nanyang Technological University), Tao Guo (Google), Kaiyu Feng (Nanyang Technological University), Gao Cong (Nanyang Technological University), Arijit Khan (Nanyang Technological University), Farhana Choudhury (University of Melbourne)
- Memory-Aware Framework for Efficient Second-Order Random Walk on Large GraphsYingxia Shao (Beijing Univeristy of Posts and Telecommunications), Shiyue Huang (Peking University), Xupeng Miao (Peking University), Bin Cui (Peking University), Lei Chen (Hong Kong University of Science and Technology)
- BinDex: A Two-Layered Index for Fast and Robust ScansLinwei Li (Fudan University), Kai Zhang (Fudan University), Jiading Guo (Fudan University), Wen He (Fudan University), Zhenying He (Fudan University), Yinan Jing (Fudan University), Weili Han (Fudan University), X. Wang (Fudan University)
- The Solution Distribution of Influence Maximization: A High-level Experimental Study on Three Algorithmic ApproachesNaoto Ohsaka (NEC Corporation)
- SPRINTER: A Fast n-ary Join Query Processing Method for Complex OLAP QueriesYoon-Min Nam (Daegu Gyeongbuk Institute of Science and Technology), Donghyoung Han (Daegu Gyeongbuk Institute of Science and Technology), Min-Soo Kim (Korea Advanced Institute of Science and Technology)
- Approximate Pattern Matching in Massive Graphs with Precision and Recall GuaranteesTashin Reza (University of British Columbia), Matei Ripeanu (University of British Columbia), Geoffrey Sanders (Lawrence Livermore National Laboratory), Roger Pearce (Lawrence Livermore National Laboratory)
- Analysis of Indexing Structures for Immutable DataCong Yue (National University of Singapore), Zhongle Xie (National University of Singapore), Meihui Zhang (Beijing Institute of Technology), Gang Chen (Zhejiang University), Beng Chin Ooi (National University of Singapore), Sheng Wang (Alibaba Group), Xiaokui Xiao (National University of Singapore)
- On the Optimization of Recursive Relational Queries: Application to Graph QueriesLouis Jachiet (LTCI, Télécom Paris), Pierre Genevès (Univ. Grenoble Alpes, CNRS, Inria, Grenoble INP, LIG), Nils Gesbert (Univ. Grenoble Alpes, CNRS, Inria, Grenoble INP, LIG), Nabil Layaida (Univ. Grenoble Alpes, CNRS, Inria, Grenoble INP, LIG)
- SCODED: Statistical Constraint Oriented Data Error DetectionJing Nathan Yan (Cornell University), Oliver Schulte (Simon Fraser University), MoHan Zhang (Simon Fraser University), Jiannan Wang (Simon Fraser University), Reynold Cheng (The University of Hong Kong)
- SAGMA: Secure Aggregation Grouped by Multiple AttributesTimon Hackenjos (FZI Research Center for Information Technology), Florian Hahn (University of Twente), Florian Kerschbaum (University of Waterloo)
- Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning ApproachYaoshu Wang (Shenzhen University), Chuan Xiao (Osaka University & Nagoya University), Jianbin Qin (Shenzhen University), Xin Cao (The University of New South Wales), Yifang Sun (The University of New South Wales), Wei Wang (The University of New South Wales), Makoto Onizuka (Osaka University)
- PrIU: A Provenance-Based Approach for Incrementally Updating Regression ModelsYinjun Wu (University of Pennsylvania), Val Tannen (University of Pennsylvania), Susan Davidson (University of Pennsylvania)
- Learning to Validate the Predictions of Black Box Classifiers on Unseen DataSebastian Schelter (New York University), Tammo Rukat (Amazon Research), Felix Biessmann (Beuth University Berlin)
- Towards Interpretable and Learnable Risk Analysis for Entity ResolutionZhaoqiang Chen (Northwestern Polytechnical University), Qun Chen (Northwestern Polytechnical University), Boyi Hou (Northwestern Polytechnical University), Zhanhuai Li (Northwestern Polytechnical University), Guoliang Li (Tsinghua University)
- Organizing Data Lakes for NavigationFatemeh Nargesian (University of Rochester), Ken Pu (University of Ontario Institute of Technology), Erkang Zhu (Microsoft Research), Bahar Ghadiri Bashardoost (University of Toronto), Renée Miller (Northeastern University)
- Sample Debiasing in the Themis Open World Database SystemLaurel Orr (University of Washington), Magda Balazinska (University of Washington), Dan Suciu (University of Washington)
- Mining Approximate Acyclic Schemes from RelationsBatya Kenig (University of Washington), Pranay Mundra (University of Washington), Guna Prasad (University of Washington), Babak Salimi (University of Washington), Dan Suciu (University of Washington)
- IDEBench: A Benchmark for Interactive Data ExplorationPhilipp Eichmann (Brown University), Emanuel Zgraggen (MIT), Carsten Binnig (TU Darmstadt), Tim Kraska (MIT)
- DB4ML - An In-Memory Database Kernel with Machine Learning SupportMatthias Jasny (TU Darmstadt), Tobias Ziegler (TU Darmstadt), Tim Kraska (MIT), Uwe Roehm (The University of Sydney), Carsten Binnig (TU Darmstadt)
- Parallel Index-based Stream Join on a Multicore CPUAmirhesam Shahvarani (Technische Universität München), Hans-Arno Jacobsen (Technische Universität München)
- Fast Join Project Query Evaluation using Matrix MultiplicationShaleen Deep (University of Wisconsin-Madison), Xiao Hu (Duke University), Paraschos Koutris (University of Wisconsin-Madison)
- Factorized Graph Representations for Semi-Supervised Learning from Sparse DataKrishna Kumar P. (IIT Madras), Paul Langton (Northeastern University), Wolfgang Gatterbauer (Northeastern University)
- Equivalence-Invariant Algebraic Provenance for Hyperplane Update QueriesPierre Bourhis (CNRS, UMR 9189 - CRIStAL), Daniel Deutch (Tel Aviv University), Yuval Moskovitch (Tel Aviv University)
- Learning Multi-Dimensional IndexesVikram Nathan (Massachusetts Institute of Technology), Jialin Ding (Massachusetts Institute of Technology), Mohammad Alizadeh (Massachusetts Institute of Technology), Tim Kraska (Massachusetts Institute of Technology)
- RID: Deduplicating Snapshot ComputationsNikos Tsikoudis (Brandeis University), Liuba Shrira (Brandeis University)
- Web Data Extraction using Hybrid Program Synthesis: A Combination of Top-down and Bottom-up InferenceMohammad Raza (Microsoft Corporation), Sumit Gulwani (Microsoft Corporation)
- In-Memory Subgraph Matching: An In-depth StudyShixuan Sun (Hong Kong University of Science and Technology), Qiong Luo (Hong Kong University of Science and Technology)
- Theoretically-Efficient and Practical Parallel DBSCANYiqiu Wang (Massachusetts Institute of Technology), Yan Gu (University of California, Riverside), Julian Shun (Massachusetts Institute of Technology)
- Order-Preserving Key Compression for In-Memory Search TreesHuanchen Zhang (Carnegie Mellon University), Xiaoxuan Liu (Carnegie Mellon University), David Andersen (Carnegie Mellon University), Michael Kaminsky (BrdgAI), Kimberly Keeton (Hewlett Packard Labs), Andrew Pavlo (Carnegie Mellon University)
- Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our FindingsTarique Siddiqui (Microsoft & University of Illinois at Urbana-Champaign), Alekh Jindal (Microsoft), Shi Qiao (Microsoft), Hiren Patel (Microsoft), Wangchao Le (Microsoft)
- A GPU-friendly Geometric Data Model and Algebra for Spatial QueriesHarish Doraiswamy (New York University), Juliana Freire (New York University)
- Extending Graph Patterns with ConditionsGrace Fan (Brown University), Wenfei Fan (University of Edinburgh, Beihang University & Shenzhen University), Yuanhao Li (University of Edinburgh & Shenzhen University), Ping Lu (Beihang University), Chao Tian (Alibaba Group), Jingren Zhou (Alibaba Group)
- Cleaning Denial Constraint Violations through RelaxationStella Giannakopoulou (EPFL), Manos Karpathiotakis (Facebook), Anastasia Ailamaki (EPFL)
- Maintaining Acyclic Foreign-Key Joins under UpdatesQichen Wang (Hong Kong University of Science and Technology), Ke Yi (Hong Kong University of Science and Technology)
- Truss-based Community Search over Large Directed GraphsQing Liu (Hong Kong Baptist University), Minjun Zhao (Zhejiang University), Xin Huang (Hong Kong Baptist University), Jianliang Xu (Hong Kong Baptist University), Yunjun Gao (Zhejiang University)
- Tree-Encoded BitmapsHarald Lang (Technical University of Munich), Alexander Beischl (Technical University of Munich), Viktor Leis (Friedrich Schiller University Jena), Peter Boncz (Centrum Wiskunde & Informatica), Thomas Neumann (Technical University of Munich), Alfons Kemper (Technical University of Munich)
- DBPal: A Fully Pluggable NL2SQL Training PipelineNathaniel Weir (Johns Hopkins University), Prasetya Utama (Technische Universität Darmstadt), Alex Galakatos (Brown University), Andrew Crotty (Brown University), Amir Ilkhechi (Brown University), Shekar Ramaswamy (Brown University), Rohin Bhushan (Brown University), Nadja Geisler (Technische Universität Darmstadt), Benjamin Hättasch (Technische Universität Darmstadt), Steffen Eger (Technische Universität Darmstadt), Ugur Cetintemel (Brown University), Carsten Binnig (Technische Universität Darmstadt)
- Pensieve: Skewness-Aware Version Switching for Efficient Graph ProcessingTangwei Ying (Huazhong University of Science and Technology), Hanhua Chen (Huazhong University of Science and Technology), Hai Jin (Huazhong University of Science and Technology)
- Black or White: How to Develop an AutoTuner for Memory-based AnalyticsMayuresh Kunjir (Duke University), Shivnath Babu (Unravel Data Systems)
- GOGGLES: Automatic Image Labeling with Affinity CodingNilaksh Das (Georgia Institute of Technology), Sanya Chaba (Georgia Institute of Technology), Renzhi Wu (Georgia Institute of Technology), Sakshi Gandhi (Georgia Institute of Technology), Duen Horng Chau (Georgia Institute of Technology), Xu Chu (Georgia Institute of Technology)
- ChronoCache: Predictive and Adaptive Mid-Tier Query Result CachingBradley Glasbergen (University of Waterloo), Kyle Langendoen (University of Waterloo), Michael Abebe (University of Waterloo), Khuzaima Daudjee (University of Waterloo)
- FalconDB: Blockchain-based Collaborative DatabaseYanqing Peng (University of Utah), Min Du (University of California, Berkeley), Feifei Li (University of Utah), Raymond Cheng (University of California, Berkeley), Dawn Song (University of California, Berkeley)
- A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database AnalyticsAnil Shanbhag (Massachusetts Institute of Technology), Samuel Madden (Massachusetts Institute of Technology), Xiangyao Yu (University of Wisconsin-Madison)
- Starling: A Scalable Query Engine on Cloud FunctionsMatthew Perron (Massachusetts Institute of Technology), Raul Castro Fernandez (University of Chicago), David DeWitt (Massachusetts Institute of Technology), Samuel Madden (Massachusetts Institute of Technology)
- Crypt: Crypto-Assisted Differential Privacy on Untrusted ServersAmrita Roy Chowdhury (University of Wisconsin-Madison), Chenghong Wang (Duke University), Xi He (University of Waterloo), Ashwin Machanavajjhala (Duke University), Somesh Jha (University of Wisconsin-Madison)
- A Comprehensive Benchmark Framework for Active Learning Methods in Entity MatchingVenkata Vamsikrishna Meduri (Arizona State University), Lucian Popa (IBM Research, Almaden), Prithviraj Sen (IBM Research, Almaden), Mohamed Sarwat (Arizona State University)
- Timely Reporting of Heavy Hitters using External MemoryPrashant Pandey (Carnegie Mellon University), Shikha Singh (Wellesley College), Michael Bender (Stony Brook University), Jonathan Berry (Sandia National Laboratories), Martín Farach-Colton (Rutgers University), Rob Johnson (VMware Research), Thomas Kroeger (Sandia National Laboratories), Cynthia Phillips (Sandia National Laboratories)
- Realistic Re-evaluation of Knowledge Graph Completion Methods: An Experimental StudyFarahnaz Akrami (University of Texas at Arlington), Mohammed Samiul Saeef (University of Texas at Arlington), Qingheng Zhang (Nanjing University), Wei Hu (Nanjing University), Chengkai Li (University of Texas at Arlington)
- Improving Approximate Nearest Neighbor Search through Learned Adaptive Early TerminationConglong Li (Carnegie Mellon University), Minjia Zhang (Microsoft AI and Research), David Andersen (Carnegie Mellon University), Yuxiong He (Microsoft AI and Research)
- Continuously Adaptive Similarity SearchHuayi Zhang (Worcester Polytechnic Institute), Lei Cao (Massachusetts Institute of Technology), Yizhou Yan (Worcester Polytechnic Institute), Samuel Madden (Massachusetts Institute of Technology), Elke Rundensteiner (Worcester Polytechnic Institute)
- Facilitating SQL Query Composition and AnalysisZainab Zolaktaf (University of British Columbia), Mostafa Milani (University of British Columbia), Rachel Pottinger (University of British Columbia)
- MIRIS: Fast Object Track Queries in VideoFavyen Bastani (Massachusetts Institute of Technology), Songtao He (Massachusetts Institute of Technology), Arjun Balasingam (Massachusetts Institute of Technology), Karthik Gopalakrishnan (Massachusetts Institute of Technology), Mohammad Alizadeh (Massachusetts Institute of Technology), Hari Balakrishnan (Massachusetts Institute of Technology), Michael Cafarella (Massachusetts Institute of Technology), Tim Kraska (Massachusetts Institute of Technology), Sam Madden (Massachusetts Institute of Technology)
- A Transactional Perspective on Execute-order-validate BlockchainsPingcheng Ruan (National University of Singapore), Dumitrel Loghin (National University of Singapore), Quang-Trung Ta (National University of Singapore), Meihui Zhang (Beijing Institute of Technology), Gang Chen (Zhejiang University), Beng Chin Ooi (National University of Singapore)
- Causality-Guided Adaptive Interventional DebuggingAnna Fariha (University of Massachusetts Amherst), Suman Nath (Microsoft Research), Alexandra Meliou (University of Massachusetts Amherst)
- Duoquest: A Dual-Specification System for Expressive SQL QueriesChristopher Baik (University of Michigan – Ann Arbor), Zhongjun Jin (University of Michigan – Ann Arbor), Michael Cafarella (University of Michigan – Ann Arbor), H. Jagadish (University of Michigan – Ann Arbor)
- SPARQL Rewriting: Towards Desired ResultsXun Jian (The Hong Kong University of Science and Technology), Yue Wang (Shenzhen Institute of Computing Sciences, Shenzhen University), Xiayu Lei (The Hong Kong University of Science and Technology), Libin Zheng (The Hong Kong University of Science and Technology), Lei Chen (The Hong Kong University of Science and Technology)
- Complaint-driven Training Data Debugging for Query 2.0Weiyuan Wu (Simon Fraser University), Lampros Flokas (Columbia University), Eugene Wu (Columbia University), Jiannan Wang (Simon Fraser University)
- Efficient Algorithms for Densest Subgraph Discovery on Large Directed GraphsChenhao Ma (The University of Hong Kong), Yixiang Fang (University of New South Wales), Reynold Cheng (The University of Hong Kong), Laks Lakshmanan (The University of British Columbia), Wenjie Zhang (University of New South Wales), Xuemin Lin (University of New South Wales)
- Cheetah: Accelerating Database Queries with Switch PruningMuhammad Tirmazi (Harvard University), Ran Ben Basat (Harvard University), Jiaqi Gao (Harvard University), Minlan Yu (Harvard University)
- GPU-Accelerated Subgraph Enumeration on Partitioned GraphsWentian Guo (National University of Singapore), Yuchen Li (Singapore Management University), Mo Sha (National University of Singapore), Bingsheng He (National University of Singapore), Xiaokui Xiao (National University of Singapore), Kian-Lee Tan (National University of Singapore)
- Estimating Numerical Distributions under Local Differential PrivacyZitao Li (Purdue University), Tianhao Wang (Purdue University), Milan Lopuhaä-Zwakenberg (Eindhoven University of Technology), Ninghui Li (Purdue University), Boris Škorić (Eindhoven University of Technology)
- Architecting a Query Compiler for Spatial WorkloadsRuby Tahboub (Purdue University), Tiark Rompf (Purdue University)
- G-CARE: A Framework for Performance Benchmarking of Cardinality Estimation Techniques for Subgraph MatchingYeonsu Park (POSTECH), Seongyun Ko (POSTECH), Sourav Bhowmick (NTU), Kyoungmin Kim (POSTECH), Kijae Hong (POSTECH), Wook-Shin Han (POSTECH)
- LISA: A Learned Index Structure for Spatial DataPengfei Li (Zhejiang University), Hua Lu (Roskilde University), Qian Zheng (Nanyang Technological University), Long Yang (Zhejiang University), Gang Pan (Zhejiang University)
- Learning a Partitioning Advisor for Cloud DatabasesBenjamin Hilprecht (TU Darmstadt), Carsten Binnig (TU Darmstadt), Uwe Röhm (The University of Sydney)
- Pump Up the Volume: Processing Large Data on GPUs with Fast InterconnectsClemens Lutz (DFKI GmbH), Sebastian Breß (TU Berlin), Steffen Zeuch (DFKI GmbH), Tilmann Rabl (HPI, University of Potsdam), Volker Markl (DFKI GmbH, TU Berlin)
- Reliable Data Distillation on Graph Convolutional NetworkWentao Zhang (Peking University; National Engineering Laboratory for Big Data Analysis and Applications), Xupeng Miao (Peking University), Yingxia Shao (Beijing University of Posts and Telecommunications, BUPT), Jiawei Jiang (ETH Zurich), Lei Chen (Hong Kong University of Science and Technology), Olivier Ruas (Peking University), Bin Cui (Peking University; National Engineering Laboratory for Big Data Analysis and Applications)
- Functional-Style SQL UDFs With a Capital 'F'Christian Duta (University of Tübingen), Torsten Grust (University of Tübingen)
- SpeakQL: Towards Speech-driven Multimodal Querying of Structured DataVraj Shah (University of California, San Diego), Side Li (University of California, San Diego), Arun Kumar (University of California, San Diego), Lawrence Saul (University of California, San Diego)
- Learning Over Dirty Data Without CleaningJose Picado (Oregon State University), John Davis (Oregon State University), Arash Termehchy (Oregon State University), Ga Young Lee (Oregon State University)
- Locality-Sensitive Hashing Scheme based on Longest Circular Co-SubstringYifan Lei (National University of Singapore), Qiang Huang (National University of Singapore), Mohan Kankanhalli (National University of Singapore), Anthony Tung (National University of Singapore)
- Vista: Optimized System for Declarative Feature Transfer from Deep CNNs at ScaleSupun Nakandala (University of California, San Diego), Arun Kumar (University of California, San Diego)
- Transactional Causal Consistency for Serverless ComputingChenggang Wu (University of California, Berkeley), Vikram Sreekanti (University of California, Berkeley), Joseph Hellerstein (University of California, Berkeley)
- ALEX: An Updatable Adaptive Learned IndexJialin Ding (Massachusetts Institute of Technology), Umar Farooq Minhas (Microsoft Research), Jia Yu (Arizona State University & Microsoft Research), Chi Wang (Microsoft Research), Jaeyoung Do (Microsoft Research), Yinan Li (Microsoft Research), Hantian Zhang (Georgia Institute of Technology & Microsoft Research), Badrish Chandramouli (Microsoft Research), Johannes Gehrke (Microsoft), Donald Kossmann (Microsoft Research), David Lomet (Microsoft Research), Tim Kraska (Massachusetts Institute of Technology)
- Automating Incremental and Asynchronous Evaluation for Recursive Aggregate Data ProcessingQiange Wang (Northeastern University), Yanfeng Zhang (Northeastern University), Hao Wang (Ohio State University), Liang Geng (Northeastern University), Rubao Lee (Ohio State University), Xiaodong Zhang (Ohio State University), Ge Yu (Northeastern University)
- Prompt: Dynamic Data-Partitioning for Distributed Micro-batch Stream Processing SystemsAhmed Abdelhamid (Purdue University), Ahmed Mahmood (Purdue University), Anas Daghistani (Purdue University), Walid Aref (Purdue University)
- Long-lived Transactions Made Less HarmfulJongbin Kim (Hanyang University), Hyunsoo Cho (Hanyang University), Kihwang Kim (Hanyang University), Jaeseon Yu (Hanyang University), Sooyong Kang (Hanyang University), Hyungsoo Jung (Hanyang University)
- Optimizing Machine Learning Workloads in Collaborative EnvironmentsBehrouz Derakhshan (DFKI GmbH), Alireza Rezaei Mahdiraji (DFKI GmbH), Ziawasch Abedjan (TU Berlin), Tilmann Rabl (Hasso Plattner Institute & University of Potsdam), Volker Markl (DFKI GmbH & TU Berlin)
- Rethinking Logging, Checkpoints, and Recovery for High-Performance Storage EnginesMichael Haubenschild (Tableau Software), Caetano Sauer (Tableau Software), Thomas Neumann (Technische Universität München), Viktor Leis (Friedrich-Schiller-Universität Jena)
- Automatically Generating Data Exploration Sessions Using Deep Reinforcement LearningOri Bar El (Tel Aviv University), Tova Milo (Tel Aviv University), Amit Somech (Tel Aviv University)
- Efficient Join Synopsis Maintenance for Data WarehouseZhuoyue Zhao (University of Utah), Feifei Li (University of Utah), Yuxi Liu (University of Utah)
- Architecture-Intact Oracle for Fastest Path and Time Queries on Dynamic Spatial NetworksVictor Junqiu Wei (Noah's Ark Lab, Huawei Technologies), Raymond Chi-Wing Wong (The Hong Kong University of Science and Technology), Cheng Long (Nanyang Technological University)
- Recommending Deployment Strategies for Collaborative TasksDong Wei (New Jersey Institute of Technology), Senjuti Basu Roy (New Jersey Institute of Technology), Sihem Amer-Yahia (CNRS, Univ. Grenoble Alpes)
- TRACER: A Framework for Facilitating Accurate and Interpretable Analytics for High Stakes ApplicationsKaiping Zheng (National University of Singapore), Shaofeng Cai (National University of Singapore), Horng Ruey Chua (National University Health System), Wei Wang (National University of Singapore), Kee Yuan Ngiam (National University Health System), Beng Chin Ooi (National University of Singapore)
- On Multiple Semantics for Declarative Database RepairsAmir Gilad (Tel Aviv University), Daniel Deutch (Tel Aviv University), Sudeepa Roy (Duke University)
- ShapeSearch: A Flexible and Efficient System for Shape-based Exploration of TrendlinesTarique Siddiqui (University of Illinois, Urbana Champaign (UIUC)), Paul Luh (University of Illinois, Urbana Champaign (UIUC)), Zesheng Wang (University of Illinois, Urbana Champaign (UIUC)), Karrie Karahalios (University of Illinois, Urbana Champaign (UIUC)), Aditya Parameswaran (UC Berkeley)
- Rhino: Efficient Management of Very Large Distributed State for Stream Processing EnginesBonaventura Del Monte (Technische Universität Berlin & DFKI GmbH), Steffen Zeuch (Technische Universität Berlin & DFKI GmbH), Tilmann Rabl (Hasso Plattner Institute, University of Potsdam), Volker Markl (Technische Universität Berlin & DFKI GmbH)
- Chiller: Contention-centric Transaction Execution and Data Partitioning for Modern NetworksErfan Zamanian (Brown University), Julian Shun (Massachusetts Institute of Technology), Carsten Binnig (TU Darmstadt), Tim Kraska (Massachusetts Institute of Technology)
- Robust Performance of Main Memory Data Structures by ConfigurationTiemo Bang (Technical University of Darmstadt; SAP SE), Ismail Oukid (Snowflake Inc.), Norman May (SAP SE), Ilia Petrov (Reutlingen University), Carsten Binnig (Technical University of Darmstadt)
- Finding Related Tables in Data Lakes for Interactive Data ScienceYi Zhang (University of Pennsylvania), Zachary Ives (University of Pennsylvania)
- QuickSel: Quick Selectivity Learning with Mixture ModelsYongjoo Park (University of Illinois at Urbana-Champaign), Shucheng Zhong (University of Michigan – Ann Arbor), Barzan Mozafari (University of Michigan – Ann Arbor)
- MONSOON: Multi-Step Optimization and Execution of Queries with Partially Obscured PredicatesSourav Sikdar (Rice University), Chris Jermaine (Rice University)
- External Merge Sort for Top-K QueriesYannis Chronis (University of Wisconsin-Madison), Thanh Do (Google Inc), Goetz Graefe (Google Inc), Keith Peters (Google Inc)
- Marviq: Quality-Aware Geospatial Visualization of Range-Selection Queries Using MaterializationLiming Dong (Tsinghua University), Qiushi Bai (University of California Irvine), Taewoo Kim (University of California Irvine), Taiji Chen (University of California Irvine), Weidong Liu (Tsinghua University), Chen Li (University of California Irvine)
- Rosetta: A Robust Space-Time Optimized Range Filter for Key-Value StoresSiqiang Luo (Harvard University), Subarna Chatterjee (Harvard University), Rafael Ketsetsidis (Harvard University), Niv Dayan (Harvard University), Wilson Qin (Harvard University), Stratos Idreos (Harvard University)
- CHASSIS: Conformity Meets Online Information DiffusionHui Li (Nanyang Technological University), Hui Li (Xidian University), Sourav Bhowmick (Nanyang Technological University)
- Database Benchmarking for Supporting Real-Time Interactive Querying of Large DataLeilani Battle (University of Maryland), Philipp Eichmann (Brown University), Marco Angelini (University of Rome “La Sapienza”), Tiziana Catarci (University of Rome “La Sapienza”), Giuseppe Santucci (University of Rome “La Sapienza”), Yukun Zheng (University of Maryland), Carsten Binnig (Technical University of Darmstadt), Jean-Daniel Fekete (Inria, Univ. Paris-Saclay, CNRS), Dominik Moritz (University of Washington)
- Regular Path Query Evaluation on Streaming GraphsAnil Pacaci (University of Waterloo), Angela Bonifati (Lyon 1University), Tamer M. Özsu (University of Waterloo)
- Exact Single-Source SimRank Computation on Large GraphsHanzhi Wang (Renmin University of China), Zhewei Wei (Renmin University of China), Ye Yuan (Beijing Institute of Technology), Xiaoyong Du (Renmin University of China), Ji-Rong Wen (Renmin University of China)
- DeepSqueeze: Deep Semantic Compression for Tabular DataAmir Ilkhechi (Brown University), Andrew Crotty (Brown University), Alex Galakatos (Brown University), Yicong Mao (Brown University), Grace Fan (Brown University), Xiran Shi (Brown University), Ugur Cetintemel (Brown University)
- Distributed Processing of k Shortest Path Queries over Dynamic Road NetworksZiqiang Yu (Yantai University), Xiaohui Yu (York University), Nick Koudas (University of Toronto), Yang Liu (Wilfrid Laurier University), Yifan Li (York University; Key Laboratory of Urban Land Resources Monitoring and Simulation, MNR), Yueting Chen (York University), Dingyu Yang (Alibaba Group)
- Benchmarking Spreadsheet SystemsSajjadur Rahman (University of Illinois at Urbana-Champaign), Kelly Mack (University of Washington), Mangesh Bendre (VISA Research), Ruilin Zhang (University of Southern California), Karrie Karahalios (University of Illinois at Urbana-Champaign), Aditya Parameswaran (University of California, Berkeley)
- Aggify: Lifting the Curse of Cursor Loops using Custom AggregatesSurabhi Gupta (Microsoft Research India), Sanket Purandare (Harvard University), Karthik Ramachandra (Microsoft Research India)
- Hub Labeling for Shortest Path CountingYikai Zhang (Chinese University of Hong Kong), Jeffrey Xu Yu (Chinese University of Hong Kong)
- Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science NotebooksCong Yan (University of Washington), Yeye He (Microsoft Research)
- Grizzly: Efficient Stream Processing Through Adaptive Query CompilationPhilipp Grulich (Technische Universität Berlin), Breß Sebastian (Technische Universität Berlin), Steffen Zeuch (Technische Universitat Berlin & DFKI GmbH), Jonas Traub (Technische Universität Berlin), Janis von Bleichert (Technische Universität Berlin), Zongxiong Chen (DFKI GmbH), Tilmann Rabl (HPI, University of Potsdam), Volker Markl (Technische Universitat Berlin & DFKI GmbH)
- Influence Maximization Revisited: Efficient Reverse Reachable Set Generation with Bound TightenedQintian Guo (The Chinese University of Hong Kong), Sibo Wang (The Chinese University of Hong Kong), Zhewei Wei (Renmin University of China), Ming Chen (Renmin University of China)
- Deep Learning Models for Selectivity Estimation of Multi-Attribute QueriesShohedul Hasan (University of Texas at Arlington), Saravanan Thirumuruganathan (QCRI, HBKU), Jees Augustine (University of Texas at Arlington), Nick Koudas (University of Toronto), Gautam Das (University of Texas at Arlington)
- Creating Embeddings of Heterogeneous Relational Datasets for Data Integration TasksRiccardo Cappuzzo (EURECOM), Paolo Papotti (EURECOM), Saravanan Thirumuruganathan (QCRI, HBKU)
- ZeroER: Entity Resolution using Zero Labeled ExamplesRenzhi Wu (Georgia Institute of Technology), Sanya Chaba (Georgia Institute of Technology), Saurabh Sawlani (Georgia Institute of Technology), Xu Chu (Georgia Institute of Technology), Saravanan Thirumuruganathan (QCRI, HBKU)
- Global Reinforcement of Social Networks: The Anchored Coreness ProblemQingyuan Linghu (University of New South Wales), Fan Zhang (Guangzhou University), Xuemin Lin (University of New South Wales), Wenjie Zhang (University of New South Wales), Ying Zhang (University of Technology Sydney)
- Application Driven Graph PartitioningWenfei Fan (University of Edinburgh, Beihang University & Shenzhen University), Ruochun Jin (University of Edinburgh), Muyang Liu (University of Edinburgh), Ping Lu (BDBC, Beihang University), Xiaojian Luo (Alibaba Group), Ruiqi Xu (University of Edinburgh), Qiang Yin (Alibaba Group), Wenyuan Yu (Alibaba Group), Jingren Zhou (Alibaba Group)
- Progressive Top-K Nearest Neighbors Search in Large Road NetworksDian Ouyang (University of Sydney), Dong Wen (University of Technology Sydney), Lu Qin (University of Technology Sydney), Lijun Chang (University of Sydney), Ying Zhang (University of Technology Sydney), Xuemin Lin (University of New South Wales)
- A Relational Matrix Algebra and its Implementation in a Column StoreOksana Dolmatova (University of Zürich), Nikolaus Augsten (University of Salzburg), Michael Böhlen (University of Zürich)
- Scaling Up Distance Labeling on Graphs with Core-Periphery PropertiesWentao Li (CAI, FEIT, University of Technology Sydney), Miao Qiao (University of Auckland), Lu Qin (CAI, FEIT, University of Technology Sydney), Ying Zhang (CAI, FEIT, University of Technology Sydney), Lijun Chang (University of Sydney), Xuemin Lin (University of New South Wales)
- A Statistical Perspective on Discovering Functional Dependencies in Noisy DataYunjia Zhang (University of Wisconsin-Madison), Zhihan Guo (University of Wisconsin-Madison), Theodoros Rekatsinas (University of Wisconsin-Madison)
- Adaptive HTAP through Elastic Resource SchedulingAunn Raza (Ecole Polytechnique Fédérale de Lausanne), Periklis Chrysogelos (Ecole Polytechnique Fédérale de Lausanne), Angelos Christos Anadiotis (Ecole Polytechnique), Anastasia Ailamaki (Ecole Polytechnique Fédérale de Lausanne)
- Querying Shared Data with Security HeterogeneityYang Cao (University of Edinburgh), Wenfei Fan (University of Edinburgh, Shenzhen University, & Beihang University), Yanghao Wang (University of Edinburgh), Ke Yi (Hong Kong University of Scienc)
- Near-Optimal Distributed Band-Joins through Recursive PartitioningRundong Li (Google), Wolfgang Gatterbauer (Northeastern University), Mirek Riedewald (Northeastern University)
- Data Series Progressive Similarity Search with Probabilistic Quality GuaranteesAnna Gogolou (Université Paris-Saclay, Inria, CNRS, LRI & LIPADE, University of Paris), Theophanis Tsandilas (Université Paris-Saclay, Inria, CNRS, LRI), Karima Echihabi (IRDA, Rabat IT Center, ENSIAS, Mohammed V University), Anastasia Bezerianos (Université Paris-Saclay, CNRS, Inria, LRI), Themis Palpanas (LIPADE, University of Paris & French University Institute (IUF))
- The Case for a Learned Sorting AlgorithmAni Kristo (Brown University), Kapil Vaidya (Massachusetts Institute of Technology), Ugur Çetintemel (Brown University), Sanchit Misra (Intel Labs), Tim Kraska (Massachusetts Institute of Technology)
- LightSaber: Efficient Window Aggregation on Multi-core ProcessorsGeorgios Theodorakis (Imprerial College London), Alexandros Koliousis (Graphcore Research), Peter Pietzuch (Imprerial College London), Holger Pirk (Imprerial College London)
- SQLCheck: Automated Detection and Diagnosis of SQL Anti-PatternsPrashanth Dintyala (Georgia Institute of Technology), Arpit Narechania (Georgia Institute of Technology), Joy Arulraj (Georgia Institute of Technology)
- Minimization of Classifier Construction Cost for Search QueriesShay Gershtein (Tel Aviv University), Tova Milo (Tel Aviv University), Gefen Morami (Tel Aviv University), Slava Novgorodov (eBay Research)
- Fast and Reliable Missing Data Contingency Analysis with Predicate-ConstraintsXi Liang (University of Chicago), Zechao Shang (University of Chicago), Sanjay Krishnan (University of Chicago), Aaron Elmore (University of Chicago), Michael Franklin (University of Chicago)
- Thrifty Query Execution via IncrementabilityDixin Tang (University of Chicago), Zechao Shang (University of Chicago), Aaron Elmore (University of Chicago), Sanjay Krishnan (University of Chicago), Michael Franklin (University of Chicago)
- Lethe: A Tunable Delete-Aware LSM EngineSubhadeep Sarkar (Boston University), Tarikul Papon (Boston University), Dimitris Staratzis (Boston University), Manos Athanassoulis (Boston University)
- Lambada: Interactive Data Analytics on Cold Data Using Serverless Cloud InfrastructureIngo Müller (ETH Zürich), Renato Marroquín (ETH Zürich), Gustavo Alonso (ETH Zürich)
- Causal Relational LearningBabak Salimi (University of Washington), Harsh Parikh (Duke University), Moe Kayali (University of Washington), Lise Getoor (University of California, Santa Cruz), Sudeepa Roy (Duke University), Dan Suciu (University of Washington)
- Debunking Four Long-Standing Misconceptions of Time-Series Distance MeasuresJohn Paparrizos (University of Chicago), Chunwei Liu (University of Chicago), Aaron Elmore (University of Chicago), Michael Franklin (University of Chicago)
- SLIM: Scalable Linkage of Mobility DataFuat Basik (Amazon Web Services), Hakan Ferhatosmanoglu (University of Warwick), Bugra Gedik (Bilkent University)
- Computing Local Sensitivities of Counting Queries with JoinsYuchao Tao (Duke University), Xi He (University of Waterloo), Ashwin Machanavajjhala (Duke University), Sudeepa Roy (Duke University)
- BugDoc: Algorithms to Debug Computational ProcessesRaoni Lourenço (New York University), Juliana Freire (New York University), Dennis Shasha (New York University)
- Handling Highly Contended OLTP Workloads Using Fast Dynamic PartitioningGuna Prasaad (University of Washington), Alvin Cheung (University of California, Berkeley), Dan Suciu (University of Washington)
- Stochastic Package Queries in Probabilistic DatabasesMatteo Brucato (University of Massachusetts Amherst), Nishant Yadav (University of Massachusetts Amherst), Azza Abouzied (New York University Abu Dhabi), Peter Haas (University of Massachusetts Amherst), Alexandra Meliou (University of Massachusetts Amherst)
- A Method for Optimizing Opaque Filter QueriesWenjia He (University of Michigan, Ann Arbor), Michael Anderson (University of Michigan, Ann Arbor), Maxwell Strome (University of Michigan, Ann Arbor), Michael Cafarella (University of Michigan, Ann Arbor)
- QueryVis: Logic-based Diagrams help Users Understand Complicated SQL Queries FasterAristotelis Leventidis (Northeastern University), Jiahui Zhang (Northeastern University), Cody Dunne (Northeastern University), Wolfgang Gatterbauer (Northeastern University), H.V. Jagadish (University of Michigan), Mirek Riedewald (Northeastern University)
- Discovery Algorithms for Embedded Functional DependenciesZiheng Wei (The University of Auckland), Sven Hartmann (Clausthal University of Technology), Sebastian Link (The University of Auckland)
- Active Learning for ML Enhanced Database SystemsLin Ma (Carnegie Mellon University), Bailu Ding (Microsoft Research), Sudipto Das (Amazon Web Services), Adith Swaminathan (Microsoft Research)
- Bitvector-aware Query Optimization for Decision Support QueriesBailu Ding (Microsoft Research), Surajit Chaudhuri (Microsoft Research), Vivek Narasayya (Microsoft Research)
- Qd-tree: Learning Data Layouts for Big Data AnalyticsZongheng Yang (University of California, Berkeley), Badrish Chandramouli (Microsoft Research), Chi Wang (Microsoft Research), Johannes Gehrke (Microsoft), Yinan Li (Microsoft Research), Umar Farooq Minhas (Microsoft Research), Per-Åke Larson (Microsoft Research), Donald Kossmann (Microsoft Research), Rajeev Acharya (Microsoft)
- Effective Travel Time Estimation: When Historical Trajectories over Road Networks MatterHaitao Yuan (Tsinghua University), Guoliang Li (Tsinghua University), Zhifeng Bao (RMIT University), Ling Feng (Tsinghua University)
- Human-in-the-loop Outlier DetectionChengliang Chai (Tsinghua University), Lei Cao (CSAIL, MIT), Guoliang Li (Tsinghua University), Jian Li (Tsinghua University), Yuyu Luo(Tsinghua University), Samuel Madden (CSAIL, MIT)