VLDB 2025: Workshop Schedule

Workshop details last updated on 4 Sept 2025.

Find Workshop Session:

Day 1 (Mon, Sept 1): PHD  ADMS  AIDB  LLM+Graph  QDB  TPCTC 
Day 5 (Fri, Sept 5): TaDA  LLM+Spatial  DaSH  GuideAI  DataAI  DEC  LS-NSL  LSGDA  CDMS 


PHD Workshop 1
Room: Albert (2F)

8:30-8:40 Welcome and Opening Remarks

Workshop Chairs


8:40-9:40 Keynote: Adventure and Beauty in Data Systems Research

Gerhard Weikum








PHD Workshop 3
Room: Albert (2F)

13:00-14:00 Panel Discussion

Eugene Wu, Tiziana Catarci, Georgia Koutrika, Aditya Parameswaran


14:00-14:12 Data-Driven Decisions at Scale: Managing Risk, Diversity, and Provenance

Riddho Haque


14:12-14:24 Towards Data-Metadata Flexibility in Property Graph Data Management

Sepehr Sadoughi


14:24-14:36 Running Functions on Pooled Data Without Leakage

Christopher Zhu


14:36-14:42 Personal Data Sovereignty through Federated Access and Policy Control

Vijon Baraku





PHD Workshop 4
Room: Albert (2F)

15:30-15:42 Assessing the Fault Tolerance of Data-Centric Applications

Maria Ramos


15:42-15:54 Fast or Accurate? Rethinking Time Series Anomaly Detection

Emmanouil Sylligardos


15:54-16:06 Modeling and Operationalizing Data Ecosystems

Soo-Yon Kim


16:06-16:18 Optimizing Data Systems for LLM Workloads

Kerem Akillioglu


16:18-16:30 Enumeration-Based Dynamic Query Processing for Linear Algebra Workloads in Data Science

Thomas Munoz Serrano


16:30-16:30 Closing Remarks

Workshop Chairs





ADMS Workshop 1
Room: Moore (4F)

8:35-9:00 High Throughput GPU-Accelerated FSST String Compression

Tim Anema (Delft University of Technology); Joost Hoozemans (Voltron Data); Zaid Al-Ars (Delft University of Technology); H. Peter Hofstee (IBM)


9:00-9:30 GPU-Accelerated Stochastic Gradient Descent for Scalable Operator Placement in Geo-Distributed Streaming Systems

Tristan Joel Terhaag (Technische Universität Berlin); Xenofon Chatziliadis (Technische Universität Berlin); Eleni Tzirita Zacharatou (Hasso Plattner Institute, University of Potsdam); Volker Markl (Technische Universität Berlin)


9:30-9:55 A Hot Take on the Intel Analytics Accelerator for Database Management Systems

Christos Laspias (Carnegie Mellon University); Andrew Pavlo (Carnegie Mellon University); Jignesh Patel (Carnegie Mellon University)





ADMS Workshop 2
Room: Moore (4F)

10:30-10:55 A Data Aggregation Visualization System supported by Processing-in-Memory

Junyoung Kim (Columbia University); Madhulika Balakumar (Columbia University); Kenneth Ross (Columbia University)


11:00-12:00 Keynote: Using what we know about our data: compact metadata and amortisation to exploit locality and sparsity

Paul H J Kelly (Imperial College London)





ADMS Workshop 3
Room: Moore (4F)

13:30-13:55 Demystifying CXL Memory Bandwidth Expansion for Analytical Workloads

Georgiy Lebedev (EPFL); Hamish Nicholson (EPFL); Musa Ünal (EPFL); Sanidhya Kashyap (EPFL); Anastasia Ailamaki (EPFL)


14:00-15:00 Keynote: On the role of storage and hardware acceleration in modern data management systems

Vincent Hsu (IBM Storage); Haris Pozidis (IBM Research)





ADMS Workshop 4
Room: Moore (4F)

15:30-15:55 CXL-Bench: Benchmarking Shared CXL Memory Access

Marcel Weisgut (Hasso Plattner Institute, University of Potsdam); Daniel Ritter (SAP); Florian Schmeller (Hasso Plattner Institute, University of Potsdam); Pınar Tözün (IT University of Copenhagen); Tilmann Rabl (Hasso Plattner Institute, University of Potsdam)


16:00-16:25 RISC-V Meets RDBMS: An Experimental Study of Database Performance on an Open Instruction Set Architecture

Yizhe Zhang (University of New South Wales); Zhengyi Yang (University of New South Wales); Bocheng Han (University of New South Wales); Haoran Ning (Macquarie University); Xin Cao (University of New South Wales); John Shepherd (University of New South Wales); Guanfeng Liu (Macquarie University)


16:30-16:55 Micro-architectural Exploration of the Relational Memory Engine (RME) in RISC-V and FireSim

Cole Strickler (University of Kansas); Ju Hyoung Mun (Brandeis University); Connor Sullivan (University of Kansas); Denis Hoornaert (Technical University of Munich); Renato Mancuso (Boston University); Manos Athanassoulis (Boston University); Heechul Yun (University of Kansas)





AIDB Workshop 1
Room: Westminster (4F)

10:15-10:30 Opening remarks & best paper announcement

Thaleia Doudali; Subru Krishnan


10:30-11:15 Keynote: LLM-powered Data Tooling: the Next Frontier

Aditya Parameswaran (UC Berkeley)


11:15-11:30 Research Challenges in Relational Database Management Systems for LLM Queries

Kerem Akillioglu; Anurag Chakraborty; Sairaj Voruganti; M. Tamer Özsu


11:30-11:45 Learning What Matters: Automated Feature Selection for Learned Cost Model in Parallel Stream Processing

Pratyush Agnihotri; Carsten Binnig; Manisha Luthra


11:45-12:00 Instance-Optimized String Fingerprints

Mihail Stoian; Johannes Thürauf; Andreas Zimmerer; Alexander van Renen; Andreas Kipf





AIDB Workshop 2
Room: Westminster (4F)

13:30-14:15 Industry Keynote: Building Autonomous Data Services on Azure

Yuanyuan Tian (Microsoft)


14:15-14:30 JOB-Complex: A Challenging Benchmark for Traditional & Learned Query Optimization

Johannes Wehrstein; Timo Eckmann; Roman Heinrich; Carsten Binnig


14:30-14:45 Bootstrapping Learned Cost Models with Synthetic SQL Queries

Michael Nidd; Christoph Miksovic; Thomas Gschwind; Francesco Fusco; Andrea Giovannini; Ioana Giurgiu


14:45-15:00 Exploring Wavelet Trees as Space-Efficient Physical-to-Sorted Mapping for Learned Indexes

Anwesha Saha; Aneesh Raman; Ryan Marcus; Manos Athanassoulis





AIDB Workshop 3
Room: Westminster (4F)

15:30-15:45 Learning to Accelerate: Tuning Data Transfer Parameters

Benedikt Didrich; Haralampos Gavriilidis; Vasilis Gkolemis; Matthias Boehm; Volker Markl


15:45-16:00 AutoDebugger: Efficient Root Cause Analysis for Anomaly Jobs

Fathelrahman Ali; Yiwen Zhu; Lie Jiang; Zhen Li; Manting Li; Kun Huang; Lijing Lin; Long Tian; Xiaolei Liu; Subru Krishnan


16:00-16:15 Inferring Missing Data Lineage Links from Schema Metadata Using Transformer-Based Models

Maciej Brzeski; Adam Roman


16:15-16:30 TailorSQL: A NL2SQL System Tailored for Your Query Workload

Kapil Vaidya; Jialin Ding; Sebastian Kosak; David Kernert; Chuan Lei; Xiao Qin; Abhinav Tripathy; Ramesh Balan; Balakrishnan Narayanaswamy; Tim Kraska


16:30-16:45 MageSQL: Enhancing In-context Learning for Text-to-SQL Applications with LLMs

Chen Shen; Jin Wang; Sajjadur Rahman; Eser Kandogan


16:45-17:00 Grounding LLMs for Database Exploration: Intent Scoping and Paraphrasing for Robust NL2SQL

Catalina Dragusin; Katsiaryna Mirylenka; Christoph Miksovic; Michael Glass; Nahuel Defosse; Paolo Scotton; Thomas Gschwind





LLM + Graph Workshop 1
Room: St James (4F)

8:30-8:35 Opening remarks

Yixiang Fang; Arijit Khan; Tianxing Wu; Da Yan


8:35-9:20 Keynote: Exploring the Duality Between Large Language Models and Database Systems

M. Tamer Özsu (University of Waterloo)


9:20-9:50 Industry Talk: Applications and Challenges of GraphRAG and Graph Foundation Models

Cheng Chen (ByteDance)


9:50-10:02 LLM-assisted Construction of the United States Legislative Graph

Francesco Cambria; Andrea Colombo


10:02-10:14 Scalable Graph-based Retrieval-Augmented Generation via Locality-Sensitive Hashing

Fangyuan Zhang, Zhengjun Huang, Yingli Zhou, Qingtian Guo, Wensheng Luo, Xiaofang Zhou





LLM + Graph Workshop 2
Room: St James (4F)

10:30-11:15 Keynote: Towards Graph Foundation Models with Riemannian Geometry

Philip S. Yu (University of Illinois, Chicago)


11:15-11:45 Industry Talk: Retrieval and Reasoning with LLMs on Neo4j: Progress and Challenges

Brian Shi (Neo4j)


11:45-11:57 LLM-Hype: A Targeted Evaluation Framework for Hypernym-Hyponym Identification in Large Language Models

Qiu Ji, Pengfei Zhu, Haolei Zhu, Yang Sheng, Guilin Qi, Lianlong Wu, Kang Xu, Yuan Meng


11:57-12:09 Graph-Enhanced Large Language Models for Spatial Search [Vision]

Nicole Schneider, Kent O'Sullivan, Hanan Samet





LLM + Graph Workshop 3
Room: St James (4F)

13:30-14:15 Keynote: Reasoning over Property Graphs: Leveraging Large Language Models for Automated Data Consistency

Angela Bonifati (Lyon 1 University)


14:15-14:45 Industry Talk: Chat2Graph: A Graph Native Agentic System

Heng Lin (Ant Group)


14:45-14:57 xpSHACL: Explainable SHACL Validation using Retrieval-Augmented Generation and Large Language Models

Gustavo Publio, Jose Emilio Labra Gayo


14:57-15:09 Automatic Prompt Optimization for Knowledge Graph Construction: Insights from an Empirical Study

Nandana Mihindukulasooriya, Niharika DSouza, Faisal Chowdhury, Horst Samulowitz





LLM + Graph Workshop 4
Room: St James (4F)

15:30-15:42 Towards the Next Generation of Agent Systems: From RAG to Agentic AI [Vision]

Yingli Zhou, Shu Wan


15:42-16:45 Panel

Panelists: TBA





QDB Workshop 1
Room: Abbey (4F)

8:30-10:00 Keynote: Model Lakes

Renée Miller





QDB Workshop 2
Room: Abbey (4F)

10:30-10:22 Out in the Wild: Investigating the Impact of Imperfect Data on a Tabular Foundation Model

Papastergios, Vasileios; Gounaris, Anastasios


10:22-10:44 Exploring Privacy-Preserving Record Linkage: A Holistic Framework for Dataset Generation and Detailed Result Analysis

Rohde, Florens; Christen, Victor; Rahm, Erhard


10:44-11:06 Dynamic Knowledge Graph-based Measurement of Data Quality

Schrott, Johannes; Meindl, Rainer; Lettner, Christian; Hammer, Stefan; Leitner, Magdalena


11:06-11:28 Evolving Gracefully: Building Robust and Self-Adaptive Data Cleaning Pipelines for Schema Evolution and Uncertainty

Kramer, Kevin; Restat, Valerie; Störl, Uta





QDB Workshop 3
Room: Abbey (4F)

13:30-14:30 Keynote: From XAI to XEE through Influence and Provenance, and optimising models for fairness when data drifts over time: some work in progress on connecting data and models to ensure quality and trust in both.

Paolo Missier


14:30-15:30 Poster Session





QDB Workshop 4
Room: Abbey (4F)

15:30-15:52 Label Flipping For Group Fairness

Thandri, Shashank; Pradhan, Romila


15:52-16:22 PBE Meets LLM: When Few Examples Aren’t Few-Shot Enough

Zhang, Shuning; Park, Yongjoo


16:22-16:44 Towards an SLM-based Auditing of Relational Schemas and Data Quality for Practical Data Governance

de Medeiros, Antony


16:45-17:00 Closing

Workshop Chairs





TPCTC Workshop 1
Room: Rutherford (4F)

08:45-09:00 Opening Remarks

Raghu Nambiar


09:00-09:30 A Benchmark for Databases with Varying Value Lengths

Danushka Liyanage; Shubham Pandey; Joshua Goldstein; Michael Cahill; Akon Dey; Alan Fekete; Uwe Röhm


09:30-10:00 Benchmarking Role-Based Access Control in Data Management Systems

Mads Cornelius Hansen; Pınar Tözün; Martin Hentschel





TPCTC Workshop 2
Room: Rutherford (4F)

10:30-11:00 CH2++: New HOAP for Benchmarking JSON Data Analytics

Michael Carey; Vijay Sarathy; Daniel Nagy; Bo-Chun Wang; Keshav Murthy; Murali Krishna; Peeyush Gupta; Till Westmann


11:00-11:30 ScaleBench_AI: Flexible LLM Inference Benchmarking Across Architectures and Environments

Karthik Krishna; Sarthak Dwivedi; Divya Singh


11:30-12:00 Tectonic: Bridging Synthetic and Real-World Workloads for Key-Value Benchmarking

Alexander Ott; Shubham Kaushik; Boao Chen; Subhadeep Sarkar





TPCTC Workshop 3
Room: Rutherford (4F)

13:30-14:00 Panel: Benchmarking Considerations for Agentic AI Systems

Ajay Dholakia; David Ellison; Miro Hodak; Debojyoti Dutta; Rajiv Ranjan


14:00-14:30 DataGenX: Generating Synthetic Relational Data from Annotated SQL Schemas

Ahmad Ghazal; Hanumath Maduri; Sunny Bains; Kenny Chan


14:30-15:00 DiStash: A Disaggregated Multi-Stash Transactional Key-Value Store

Shahram Ghandeharizadeh; Yiming Gao; Hieu Nguyen; Jun Li





TPCTC Workshop 4
Room: Rutherford (4F)

15:30-15:50 Invited Talk: TBA

Speaker: TBA


15:50-16:20 Delivering MLPerf Submissions: Journey to Leadership Performance

Miro Hodak; Meena Arunachalam


16:20-16:50 Tabular Data Augmentation for Database Scalability Testing – A Case Study with Medical Insurance Claims Analytics Workloads

Taro Fujimoto; Shinji Fujiwara; Jumpei Sato; Yuto Hayamizu; Kazuo Goda


16:50-17:20 Benchmarking Distilled Language Models: Performance and Efficiency in Resource-Constrained Settings

Sachin Gopal Wani; Eric Page; Ajay Dholakia; David Ellison


17:20-17:40 Invited Talk: TBA

Speaker: TBA


17:40-17:45 Closing Remarks

Meikel Poess





TaDA Workshop 1
Room: Gielgud (2F)

8:45-9:00 Introduction

Workshop Chairs


9:00-9:30 Keynote: TBA

Kavitha Srinivas


9:30-9:45 A Vision for SQL-Based Relational Deep Learning

Fahim Shahriar Khan; Ashraf Aboulnaga


9:45-10:00 From Features to Structure: Task-Aware Graph Construction for Relational and Tabular Learning with GNNs

Tamara Cucumides; Floris Geerts





TaDA Workshop 2
Room: Gielgud (2F)

10:00-11:00 Relationship Detection on Tabular Data Using Statistical Analysis and Large Language Models

Panagiotis Koletsis; Christos Panagiotopoulos; Georgios Papadopoulos; Vasilis Efthymiou


10:00-11:00 Improving Column Type Annotation Using Large Language Models

Amir Babamahmoudi; Davood Rafiei; Mario Nascimento


10:00-11:00 Query Plan Generation for Table Question Answering

Ivan Poddubny; Nikita Dorodnykh


10:00-11:00 Table Header Recognition Based on Large Language Models

Ilya I. Okhotin; Nikita Dorodnykh


10:00-11:00 TOPJoin: A Context-Aware Multi-Criteria Approach for Joinable Column Search

Harsha Kokel; Aamod Kathiwada; Tejaswini Pedapanti; Haritha Ananthakrishnan; Oktie Hassanzadeh; Horst Samulowitz; Kavitha Srinivas


10:00-11:00 Evaluating SQL Selection/Projection over Table Embeddings

Mariam Mellouli; Paolo Papotti


10:00-11:00 Optimizing Source Selection for Tuple-Value Discovery

Ahmad Fares; Georgia Troullinou; Silviu Maniu; Sihem Amer-Yahia


10:00-11:00 Universal Embeddings of Tabular Data

Astrid Franz; Frederik Hoppe; Marianne Michaelis; Udo Göbel


10:00-11:00 SemForest: Semantic-Aware Ontology Generation with Foundation Models

Guohui Guan; Sachin Konan; Larry Rudolph; Chang Ge


11:00-11:15 StructText: A Synthetic Table-to-Text Approach for Benchmark Generation with Multi-Dimensional Evaluation

Satyananda Kashyap; Sola Shirai; Nandana Mihindukulasooriya; Horst Samulowitz


11:15-11:30 Towards Fine-Grained Extraction of Scientific Claims from Heterogeneous Tables Using Large Language Models

Daniele Bertillo; Laks V.S. Lakshmanan; Paolo Merialdo; Divesh Srivastava


11:30-12:00 Keynote: TBA

Paolo Papotti





LLM + Spatial Workshop 1
Room: Gielgud (2F)

13:30-13:35 Introduction

Jianqiu Xu


13:35-14:40 Keynote: Spatial Data Systems in the LLM Era: 1+1=3? System Requirements and Research Opportunities

Walid G. Aref


14:40-15:00 NALMOBench: Towards Benchmarking Natural Language Interfaces for Moving Objects Databases

Xieyang Wang; Weijia Yi; Mengyi Liu; Chenchen Zong





LLM + Spatial Workshop 2
Room: Gielgud (2F)

15:30-16:20 Keynote: Natural Language Maps: Generative AI for Spatial Data Generation, Querying, and Visualization

Ahmed Eldawy


16:20-17:00 Keynote: Geospatial entity representation: a step towards city foundation models.

Gao Cong





DaSH Workshop 1
Room: St James (4F)

8:30-8:45 Workshop Introduction

DaSH Organizer


8:45-9:30 Keynote: Building Data-Intensive Systems that Care

Sihem Amer-Yahia (CNRS, Univ. Grenoble Alpes)


9:30-9:55 Invited Talk: Unleashing Data Science: It's Time to Fix the Data Preparation Problem

El Kindi Rezig (Univ. of Utah)





DaSH Workshop 2
Room: St James (4F)

10:30-11:15 Keynote: Building AI-Driven Data Catalogs: A Great Playground for Human-in-the-Loop Research

AnHai Doan (Univ. of Wisconsin)


11:15-11:40 Invited Talk: Sign2Vis: Automated Data Visualization from Sign Language

Yao Wan (Huazhong University of Science and Technology)


11:40-12:05 Invited Talk: “How do you even know that stuff?” Barriers to expertise sharing among spreadsheet users

Nancy Xia (University College London)





DaSH Workshop 3
Room: St James (4F)

13:30-13:48 Reducing Human Effort in Evaluating Small and Medium Language Models as Students and as Teachers

Oleh Prostakov; Viacheslav Hodlevskyi; Nassim Bouarour; Adam Sanchez-Ayte; Noha Ibrahim; Sihem Amer-Yahia


13:48-14:06 Human + AI: Large-scale Data Curation for Multilingual Guardrails

Harshit Rajgarhia; Abhishek Mukherji; Fen Yik; Dominika Borek; Nicole Warren; Prithiviraj Pradeep


14:06-14:24 DeepGit: Promoting Exploration and Discovery of Research Software with Human-Curated Graphs

Yilin Xia; Shin-Rong Tsai; Matthew Turk


14:24-14:42 HierTOD: A Task-Oriented Dialogue System Driven by Hierarchical Goals

Lingbo Mo; Shun Jiang; Akash V Maharaj; J. Bernard Hishamunda; Yunyao Li


14:42-15:00 Adobe Summit Concierge Evaluation with Human-in-the-loop

Yiru Chen; Sally Fang; Sai Sree Harsha; Dan Luo; Vaishnavi Muppala; Fei Wu; Shun Jiang; Kun Qian; Yunyao Li





DaSH Workshop 4
Room: St James (4F)

15:30-16:15 Keynote: SQL and Large Language Model: A Marriage Made in Heaven?

Paolo Papotti


16:15-17:00 Panel

Panelists: Sihem Amer-Yahia, El Kindi Rezig, AnHai Doan, Paolo Papotti





GuideAI Workshop 1
Room: Westminster (4F)

8:30-9:30 Keynote: TBA

Themis Palpanas


9:30-9:45 Model Slicing for Responsible AI

Parke Godfrey; Lukasz Golab; Divesh Srivastava; Jarek Szlichta


9:45-10:00 ExperimentLens: Interactive Visual Analytics and Explainability for ML Experiment Management

Stavros Maroulis; Vassilis Stamatopoulos; Panagiotis Gidarakos; Konstantinos Tsopelas; Nikolas Masouras; Konstantinos Kozanis; Nikolas Theologitis; George Papastefanatos; Giorgos Giannopoulos; Erik Nilsson





GuideAI Workshop 2
Room: Westminster (4F)

10:30-11:15 Keynote: TBA

Sudeepa Roy


11:15-11:30 LightUL: An Efficient Recommendation Unlearning Framework

Wentao Ning; Haorui He; Reynold Cheng; Nur Al Hasan Haldar; Ben Kao; Nan Huo; Bo Tang; Yupeng Li


11:30-11:45 Towards Identifying Intent of Data Errors

Mohamed Ahmed Abdelmaksoud Mohamed; Konrad Rieck; Ziawasch Abedjan


11:45-12:00 DBMS-LLM Integration Strategies in Industrial and Business Applications: Current Status and Future Challenges

Zhengtong Yan; Gongsheng Yuan; Qingsong Guo; Jiaheng Lu






DataAI Workshop 1
Room: Westminster (4F)

13:30-13:35 Opening Remarks: Welcome and Introduction to DATAI Workshop 2025

Workshop Chairs


13:35-14:10 Keynote: Data-centric Responsible AI from General ML to LLMs

Steven Euijong Whang


14:10-14:45 Keynote: Navigating Disruption: The Impact of AI Technologies on Data Integration Research

Ziawasch Abedjan


14:45-14:55 SQL-ML: A SQL-Centric Framework for Building Efficient Feature Store

Ahmad Ghazal; Hanumath Maduri; Pekka Kostamaa


14:55-15:05 A Low Latency Cache for Cloud RDBMS

Guohai Zhang; Xin Tang; Qingchen Chang; Huanchen Zhang; Kai Hwang; Yuesen Li; Runhuai Huang; Teng Wang; Wusheng Zhang; Ming Zhang; Qingchun Chen; Xiaodong Hou; Qian Wang


15:05-15:15 The Case for Intent-Based Query Rewriting

Gianna Lisa Nicolai; Patrick Hansert; Sebastian Michel


15:15-15:25 Lightweight Pipelines: Good Enough is Sometimes Better

Camilla Sancricca; Cinzia Cappiello





DataAI Workshop 2
Room: Westminster (4F)

15:35-16:05 Databases as AI Runtimes

Rihan Hai


16:05-16:35 Invited Talk: AI-Driven Data Typing: Toward Semantic and Functional Understanding of Relational Data

Chang Ge


16:35-16:45 CleanAgent: Automating Data Standardization with LLM-based Agents

Danrui Qi; Zhengjie Miao; Jiannan Wang


16:45-16:55 SoAgent: A Real-world Data Empowered Agent Pool to Facilitate LLM-Driven Generative Social Simulation

Na Ta; Kaiyu Li; Yushu Zhou; Yuhan Liu


16:55-17:05 DeepSearch: LLM-powered Data Acquisition for Machine Learning

Kaiyu Li; Zhongxin Hu; Yuxin Gao; Yuyang Wu


17:05-17:15 Detecting and Cleaning Errors in Personal Contact Information with Large Language Models

Anna-Christina Glock; Christine Dominka-Kiss; Philipp Korom; Lisa Ehrlinger





DEC Workshop 1
Room: Rutherford (4F)

8:30-8:45 Welcome and Introduction

Workshop Chairs


8:45-9:40 Invited Talk: Building a new Data Provider based on AI

Paul Groth


9:40-10:00 MINiDM: Multi-Issue Negotiation in Decentralised Data Marketplaces

Soulmaz Gheisari; Jaime Osvaldo Salas; Semih Yumusak; George Konstantinidis





DEC Workshop 2
Room: Rutherford (4F)

10:30-10:50 An Interpretable Market-based Data Price Prediction Tool

Santiago Andrés Azcoitia; Alicia Cabrero Jiménez


10:50-11:10 UxV-DPN: Utility-vs-Value Data Pricing and Negotiation Mechanism in Machine Learning Data Marketplace

Hajar Baghcheband; Carlos Soares; Luis Paulo Reis


11:10-11:30 Mixture-of-Experts based Model Market

Yizhou Ma; Xikun Jiang; Wenbo Wu; Zhuoqin Yang; Luis-Daniel Ibáñez


11:30-11:50 LLMDap: LLM-based Data Profiling and Sharing

Shanshan Jiang; Sondre Sørbø; Phil Tinn; Shang Ferheng Karim; Dumitru Roman


11:50-12:00 Closure

Workshop Chairs





LS-NSL Workshop 1
Room: Rutherford (4F)

13:30-14:15 Keynote: On Retrieving & Reasoning LLMs: Myths, Merits, and How to Move Forward

Dan Roth


14:15-14:30 Graph Consistency Rule Mining with LLMs: an Exploratory Study

Hoa Le Thi; Angela Bonifati; Andrea Mauri


14:30-14:45 Modular Neuro-Symbolic Knowledge Graph Completion

Abelardo Carlos Martinez Lorenzo; Alexander Perfilyev; Volker Markl; Martha Clokie; Thomas Sicheritz-Pontén; Zoi Kaoudi


14:45-15:00 Certain and Approximately Certain Models for Statistical Learning

Cheng Zhen; Nischal Aryal; Arash Termehchy; Amandeep Singh Chabada





LS-NSL Workshop 2
Room: Rutherford (4F)

15:30-16:15 Keynote: Decoding the Interaction of Symbolic and Parametric Knowledge

Jeff Pan


16:15-16:30 ASP Scaffolds for Robust Reasoning and Decoding

Pravana Madhyastha


16:30-16:45 Representation Invariance of GNNs: Going Beyond Isomorphism

Jasmin Mousavi; Bishwash Kc; Arash Termehchy


16:45-17:00 Constraint-aware Learning of Probabilistic Sequential Models for Multi-Label Classification

Mykhailo Buleshnyi; Anna Polova; Zsolt Zombori; Michael Benedikt





LSGDA Workshop 1
Room: Abbey (4F)

08:30-08:40 Opening Remarks

Workshop Chairs


08:40-09:20 Keynote: Exploring the Connections Between Social Network Analysis and Graph Analytics

Tamer Özsu (University of Waterloo)


09:20-10:00 Keynote: Improving Transportation in Road Networks Using Big Vehicle Trajectory Data

Christian S. Jensen (Aalborg University)





LSGDA Workshop 2
Room: Abbey (4F)

10:30-10:45 Top-r Influential Community Search in Bipartite Graphs

Yanxin Zhang; Zhengyu Hua; Long Yuan


10:45-11:00 Hyracks Unchained: Efficient Recursion for Navigational Queries in Apache AsterixDB

Glenn Galvizo; Michael Carey


11:00-11:15 GAL: Topology-Aware Serialization for Graph Traversals

Zeynep Korkmaz; Tamer Özsu; Khuzaima Daudjee


11:15-11:30 To What Extent Does Quality Matter? The Impact of Graph Data Quality on GNN Model Performance

Jana Vatter; Maurice L. Rochau; Ruben Mayer; Hans-Arno Jacobsen


11:30-11:45 EnGraph: Ensemble-Based Augmentation for Graph Anomaly Detection

Andrew Shields; Robert Sheehy; Pat Doody


11:45-12:00 Improving the Accessibility of Port Operations in Supply Chain Management Using Graph Data Analysis

Mert Ayas; Frank Laarmann; Leif Meier; Katja Zeume





LSGDA Workshop 3
Room: Abbey (4F)

13:30-14:20 Industry Talk: BG3: A Cost Effective and I/O Efficient Graph Database in ByteDance

Chen Cheng


14:20-15:00 Keynote: Parallel Graph Structural Analytics: Systems and Visions for Next-Generation Graph Data Science

Da Yan (University of Alabama at Birmingham)





LSGDA Workshop 4
Room: Abbey (4F)

15:30-15:45 Semantic Embedding for Enterprise Clustering: A Systematic and Scalable Approach Using Sentence Transformers

Yigong Xiao; Xianzhi Lei; Kecheng Wang; Changan Zhou; Niannian Huang


15:45-16:00 Shape-Aware, Scale-Agnostic Representation of Dynamic DAGs

Jennifer Neumann; Peter M. Fischer


16:00-16:15 Growing Up HAL: Historic and Property Graph Queries

Muhammad Khan; Ioana Manolescu; Angelos-Christos Anadiotis


16:15-16:30 Efficient Betweenness Maximization in Temporal Networks

Xijuan Liu; Kejia Xu; Lele Zhang; Haiyang Hu; Ying Zhang


16:30-16:45 Single-Source Regular Path Querying in Terms of Linear Algebra

Semyon Grigorev; Georgiy Belyanin; Rodion Suvorov


16:45-17:00 Closing Remarks & Awards

Workshop Chairs





CDMS Workshop 1
Room: Moore (4F)

8:30-8:40 Opening Remarks

Workshop Chairs


8:40-9:25 Keynote: A Historical Perspective on Extensible and Composable Data Systems

C. Mohan


9:25-9:35 A Learned Cost Model-based Cross-engine Optimizer for SQL Workloads

András Strausz; Niels Pardon; Ioana Giurgiu


9:35-9:45 Rethinking Pluggable Federated Query Optimization: From Laptops to Data Warehouses

Victor Giannakouris; Immanuel Trummer


9:45-9:55 Eudoxia: a FaaS scheduling simulator for the composable lakehouse

Tapan Srivastava; Jacopo Tagliabue; Ciro Greco





CDMS Workshop 2
Room: Moore (4F)

10:30-12:00 Panel: The Role of Composability in Building HTAP Systems

C. Mohan (Moderator)





CDMS Workshop 3
Room: Moore (4F)

13:30-14:15 Keynote: Speedrunning a lakehouse: a composable FaaS over object storage

Jacopo Tagliabue


14:15-15:00 Keynote: Theseus a Composable distributed execution runtime: Performance across GPUs, Networks, and Storage

Felipe Aramburu





CDMS Workshop 4
Room: Moore (4F)

15:30-15:35 Composability and Interoperability for Federated Data Systems

Haralampos Gavriilidis; Leonhard Rose; Joel Ziegler; Jonathan Gerloff; Benedikt Didrich; Midhun Kaippillil Venugopalan; Kaustubh Beedkar; Matthias Boehm; Volker Markl


15:35-15:40 Composing XGBoost UDFs with Arrow Flight

Hussain Sultan


15:40-15:45 Building IBM watsonx.data from Composable Parts

Aditi Pandit


15:45-16:30 Keynote: What PostgreSQL Extensibility Can Teach Us About Composable Data Management Systems

Abigale Kim


16:30-16:35 The Deconstructed Warehouse: An Ephemeral Query Engine Design for Apache Iceberg

Ryan Curtin; Jacopo Tagliabue


16:35-16:40 DAG lakehouse planning with an ephemeral and embedded graph database

Luca Bigon; Jacopo Tagliabue; Semih Salihoğlu


16:40-16:45 GranPipe: Composable Hierarchical Pipelines for Near-Data Processing

Johannes Pietrzyk; Wolfgang Lehner; Dirk Habich; Philippe Bonnet


16:45-16:50 LanceDB-Embracing Composability in the Storage Layer

Weston Pace; Chang She; Lei Xu; Will Jones; Rob Meng; Yang Cen


16:50-17:00 Closing Remarks

Workshop Chairs