VLDB 2025: Workshop Schedule
Workshop details last updated on 4 Sept 2025.
Find Workshop Session:
| Day 1 (Mon, Sept 1): | PHD ADMS AIDB LLM+Graph QDB TPCTC |
| Day 5 (Fri, Sept 5): | TaDA LLM+Spatial DaSH GuideAI DataAI DEC LS-NSL LSGDA CDMS |
PHD Workshop 1
Room: Albert (2F)
Welcome and Opening Remarks
Workshop Chairs
Keynote: Adventure and Beauty in Data Systems Research
Gerhard Weikum
PHD Workshop 2
Room: Albert (2F)
PDFContext-Aware Recommender Systems: Challenges in Personalization and Fairness
Anna Dalla Vecchia
PDFLarge Language Models as Control Planes for Industrial-Scale Web Data Extraction
Felipe Marineli
PDFAlgorithm Support in a Graph Database, Done Right
Daan de Graaf
PDFSwellDB: GenAI-Native Query Processing via On-the-Fly Table Generation
Victor Giannakouris
PDFEntropy-Based Anomaly Detection in Evolving Graph Streams
Satoshi Kayano
PDFToward Interpretable Methods for Time Series Analytics
Félix Chavelli
PHD Workshop 3
Room: Albert (2F)
Panel Discussion
Eugene Wu, Tiziana Catarci, Georgia Koutrika, Aditya Parameswaran
PDFData-Driven Decisions at Scale: Managing Risk, Diversity, and Provenance
Riddho Haque
PDFTowards Data-Metadata Flexibility in Property Graph Data Management
Sepehr Sadoughi
PDFRunning Functions on Pooled Data Without Leakage
Christopher Zhu
PDFPersonal Data Sovereignty through Federated Access and Policy Control
Vijon Baraku
PHD Workshop 4
Room: Albert (2F)
PDFAssessing the Fault Tolerance of Data-Centric Applications
Maria Ramos
PDFFast or Accurate? Rethinking Time Series Anomaly Detection
Emmanouil Sylligardos
PDFModeling and Operationalizing Data Ecosystems
Soo-Yon Kim
PDFOptimizing Data Systems for LLM Workloads
Kerem Akillioglu
PDFEnumeration-Based Dynamic Query Processing for Linear Algebra Workloads in Data Science
Thomas Munoz Serrano
Closing Remarks
Workshop Chairs
ADMS Workshop 1
Room: Moore (4F)
PDFHigh Throughput GPU-Accelerated FSST String Compression
Tim Anema (Delft University of Technology); Joost Hoozemans (Voltron Data); Zaid Al-Ars (Delft University of Technology); H. Peter Hofstee (IBM)
PDFGPU-Accelerated Stochastic Gradient Descent for Scalable Operator Placement in Geo-Distributed Streaming Systems
Tristan Joel Terhaag (Technische Universität Berlin); Xenofon Chatziliadis (Technische Universität Berlin); Eleni Tzirita Zacharatou (Hasso Plattner Institute, University of Potsdam); Volker Markl (Technische Universität Berlin)
PDFA Hot Take on the Intel Analytics Accelerator for Database Management Systems
Christos Laspias (Carnegie Mellon University); Andrew Pavlo (Carnegie Mellon University); Jignesh Patel (Carnegie Mellon University)
ADMS Workshop 2
Room: Moore (4F)
PDFA Data Aggregation Visualization System supported by Processing-in-Memory
Junyoung Kim (Columbia University); Madhulika Balakumar (Columbia University); Kenneth Ross (Columbia University)
Keynote: Using what we know about our data: compact metadata and amortisation to exploit locality and sparsity
Paul H J Kelly (Imperial College London)
ADMS Workshop 3
Room: Moore (4F)
PDFDemystifying CXL Memory Bandwidth Expansion for Analytical Workloads
Georgiy Lebedev (EPFL); Hamish Nicholson (EPFL); Musa Ünal (EPFL); Sanidhya Kashyap (EPFL); Anastasia Ailamaki (EPFL)
Keynote: On the role of storage and hardware acceleration in modern data management systems
Vincent Hsu (IBM Storage); Haris Pozidis (IBM Research)
ADMS Workshop 4
Room: Moore (4F)
PDFCXL-Bench: Benchmarking Shared CXL Memory Access
Marcel Weisgut (Hasso Plattner Institute, University of Potsdam); Daniel Ritter (SAP); Florian Schmeller (Hasso Plattner Institute, University of Potsdam); Pınar Tözün (IT University of Copenhagen); Tilmann Rabl (Hasso Plattner Institute, University of Potsdam)
PDFRISC-V Meets RDBMS: An Experimental Study of Database Performance on an Open Instruction Set Architecture
Yizhe Zhang (University of New South Wales); Zhengyi Yang (University of New South Wales); Bocheng Han (University of New South Wales); Haoran Ning (Macquarie University); Xin Cao (University of New South Wales); John Shepherd (University of New South Wales); Guanfeng Liu (Macquarie University)
PDFMicro-architectural Exploration of the Relational Memory Engine (RME) in RISC-V and FireSim
Cole Strickler (University of Kansas); Ju Hyoung Mun (Brandeis University); Connor Sullivan (University of Kansas); Denis Hoornaert (Technical University of Munich); Renato Mancuso (Boston University); Manos Athanassoulis (Boston University); Heechul Yun (University of Kansas)
AIDB Workshop 1
Room: Westminster (4F)
Opening remarks & best paper announcement
Thaleia Doudali; Subru Krishnan
Keynote: LLM-powered Data Tooling: the Next Frontier
Aditya Parameswaran (UC Berkeley)
PDFResearch Challenges in Relational Database Management Systems for LLM Queries
Kerem Akillioglu; Anurag Chakraborty; Sairaj Voruganti; M. Tamer Özsu
PDFLearning What Matters: Automated Feature Selection for Learned Cost Model in Parallel Stream Processing
Pratyush Agnihotri; Carsten Binnig; Manisha Luthra
PDFInstance-Optimized String Fingerprints
Mihail Stoian; Johannes Thürauf; Andreas Zimmerer; Alexander van Renen; Andreas Kipf
AIDB Workshop 2
Room: Westminster (4F)
Industry Keynote: Building Autonomous Data Services on Azure
Yuanyuan Tian (Microsoft)
PDFJOB-Complex: A Challenging Benchmark for Traditional & Learned Query Optimization
Johannes Wehrstein; Timo Eckmann; Roman Heinrich; Carsten Binnig
PDFBootstrapping Learned Cost Models with Synthetic SQL Queries
Michael Nidd; Christoph Miksovic; Thomas Gschwind; Francesco Fusco; Andrea Giovannini; Ioana Giurgiu
PDFExploring Wavelet Trees as Space-Efficient Physical-to-Sorted Mapping for Learned Indexes
Anwesha Saha; Aneesh Raman; Ryan Marcus; Manos Athanassoulis
AIDB Workshop 3
Room: Westminster (4F)
PDFLearning to Accelerate: Tuning Data Transfer Parameters
Benedikt Didrich; Haralampos Gavriilidis; Vasilis Gkolemis; Matthias Boehm; Volker Markl
PDFAutoDebugger: Efficient Root Cause Analysis for Anomaly Jobs
Fathelrahman Ali; Yiwen Zhu; Lie Jiang; Zhen Li; Manting Li; Kun Huang; Lijing Lin; Long Tian; Xiaolei Liu; Subru Krishnan
PDFInferring Missing Data Lineage Links from Schema Metadata Using Transformer-Based Models
Maciej Brzeski; Adam Roman
PDFTailorSQL: A NL2SQL System Tailored for Your Query Workload
Kapil Vaidya; Jialin Ding; Sebastian Kosak; David Kernert; Chuan Lei; Xiao Qin; Abhinav Tripathy; Ramesh Balan; Balakrishnan Narayanaswamy; Tim Kraska
PDFMageSQL: Enhancing In-context Learning for Text-to-SQL Applications with LLMs
Chen Shen; Jin Wang; Sajjadur Rahman; Eser Kandogan
PDFGrounding LLMs for Database Exploration: Intent Scoping and Paraphrasing for Robust NL2SQL
Catalina Dragusin; Katsiaryna Mirylenka; Christoph Miksovic; Michael Glass; Nahuel Defosse; Paolo Scotton; Thomas Gschwind
LLM + Graph Workshop 1
Room: St James (4F)
Opening remarks
Yixiang Fang; Arijit Khan; Tianxing Wu; Da Yan
Keynote: Exploring the Duality Between Large Language Models and Database Systems
M. Tamer Özsu (University of Waterloo)
Industry Talk: Applications and Challenges of GraphRAG and Graph Foundation Models
Cheng Chen (ByteDance)
PDFLLM-assisted Construction of the United States Legislative Graph
Francesco Cambria; Andrea Colombo
PDFScalable Graph-based Retrieval-Augmented Generation via Locality-Sensitive Hashing
Fangyuan Zhang, Zhengjun Huang, Yingli Zhou, Qingtian Guo, Wensheng Luo, Xiaofang Zhou
LLM + Graph Workshop 2
Room: St James (4F)
Keynote: Towards Graph Foundation Models with Riemannian Geometry
Philip S. Yu (University of Illinois, Chicago)
Industry Talk: Retrieval and Reasoning with LLMs on Neo4j: Progress and Challenges
Brian Shi (Neo4j)
PDFLLM-Hype: A Targeted Evaluation Framework for Hypernym-Hyponym Identification in Large Language Models
Qiu Ji, Pengfei Zhu, Haolei Zhu, Yang Sheng, Guilin Qi, Lianlong Wu, Kang Xu, Yuan Meng
PDFGraph-Enhanced Large Language Models for Spatial Search [Vision]
Nicole Schneider, Kent O'Sullivan, Hanan Samet
LLM + Graph Workshop 3
Room: St James (4F)
Keynote: Reasoning over Property Graphs: Leveraging Large Language Models for Automated Data Consistency
Angela Bonifati (Lyon 1 University)
Industry Talk: Chat2Graph: A Graph Native Agentic System
Heng Lin (Ant Group)
PDFxpSHACL: Explainable SHACL Validation using Retrieval-Augmented Generation and Large Language Models
Gustavo Publio, Jose Emilio Labra Gayo
PDFAutomatic Prompt Optimization for Knowledge Graph Construction: Insights from an Empirical Study
Nandana Mihindukulasooriya, Niharika DSouza, Faisal Chowdhury, Horst Samulowitz
LLM + Graph Workshop 4
Room: St James (4F)
PDFTowards the Next Generation of Agent Systems: From RAG to Agentic AI [Vision]
Yingli Zhou, Shu Wan
Panel
Panelists: TBA
QDB Workshop 2
Room: Abbey (4F)
PDFOut in the Wild: Investigating the Impact of Imperfect Data on a Tabular Foundation Model
Papastergios, Vasileios; Gounaris, Anastasios
PDFExploring Privacy-Preserving Record Linkage: A Holistic Framework for Dataset Generation and Detailed Result Analysis
Rohde, Florens; Christen, Victor; Rahm, Erhard
PDFDynamic Knowledge Graph-based Measurement of Data Quality
Schrott, Johannes; Meindl, Rainer; Lettner, Christian; Hammer, Stefan; Leitner, Magdalena
PDFEvolving Gracefully: Building Robust and Self-Adaptive Data Cleaning Pipelines for Schema Evolution and Uncertainty
Kramer, Kevin; Restat, Valerie; Störl, Uta
QDB Workshop 3
Room: Abbey (4F)
Keynote: From XAI to XEE through Influence and Provenance, and optimising models for fairness when data drifts over time: some work in progress on connecting data and models to ensure quality and trust in both.
Paolo Missier
Poster Session
QDB Workshop 4
Room: Abbey (4F)
PDFLabel Flipping For Group Fairness
Thandri, Shashank; Pradhan, Romila
PDFPBE Meets LLM: When Few Examples Aren’t Few-Shot Enough
Zhang, Shuning; Park, Yongjoo
PDFTowards an SLM-based Auditing of Relational Schemas and Data Quality for Practical Data Governance
de Medeiros, Antony
Closing
Workshop Chairs
TPCTC Workshop 1
Room: Rutherford (4F)
Opening Remarks
Raghu Nambiar
A Benchmark for Databases with Varying Value Lengths
Danushka Liyanage; Shubham Pandey; Joshua Goldstein; Michael Cahill; Akon Dey; Alan Fekete; Uwe Röhm
Benchmarking Role-Based Access Control in Data Management Systems
Mads Cornelius Hansen; Pınar Tözün; Martin Hentschel
TPCTC Workshop 2
Room: Rutherford (4F)
CH2++: New HOAP for Benchmarking JSON Data Analytics
Michael Carey; Vijay Sarathy; Daniel Nagy; Bo-Chun Wang; Keshav Murthy; Murali Krishna; Peeyush Gupta; Till Westmann
ScaleBench_AI: Flexible LLM Inference Benchmarking Across Architectures and Environments
Karthik Krishna; Sarthak Dwivedi; Divya Singh
Tectonic: Bridging Synthetic and Real-World Workloads for Key-Value Benchmarking
Alexander Ott; Shubham Kaushik; Boao Chen; Subhadeep Sarkar
TPCTC Workshop 3
Room: Rutherford (4F)
Panel: Benchmarking Considerations for Agentic AI Systems
Ajay Dholakia; David Ellison; Miro Hodak; Debojyoti Dutta; Rajiv Ranjan
DataGenX: Generating Synthetic Relational Data from Annotated SQL Schemas
Ahmad Ghazal; Hanumath Maduri; Sunny Bains; Kenny Chan
DiStash: A Disaggregated Multi-Stash Transactional Key-Value Store
Shahram Ghandeharizadeh; Yiming Gao; Hieu Nguyen; Jun Li
TPCTC Workshop 4
Room: Rutherford (4F)
Invited Talk: TBA
Speaker: TBA
Delivering MLPerf Submissions: Journey to Leadership Performance
Miro Hodak; Meena Arunachalam
Tabular Data Augmentation for Database Scalability Testing – A Case Study with Medical Insurance Claims Analytics Workloads
Taro Fujimoto; Shinji Fujiwara; Jumpei Sato; Yuto Hayamizu; Kazuo Goda
Benchmarking Distilled Language Models: Performance and Efficiency in Resource-Constrained Settings
Sachin Gopal Wani; Eric Page; Ajay Dholakia; David Ellison
Invited Talk: TBA
Speaker: TBA
Closing Remarks
Meikel Poess
TaDA Workshop 1
Room: Gielgud (2F)
Introduction
Workshop Chairs
Keynote: TBA
Kavitha Srinivas
PDFA Vision for SQL-Based Relational Deep Learning
Fahim Shahriar Khan; Ashraf Aboulnaga
PDFFrom Features to Structure: Task-Aware Graph Construction for Relational and Tabular Learning with GNNs
Tamara Cucumides; Floris Geerts
TaDA Workshop 2
Room: Gielgud (2F)
PDFRelationship Detection on Tabular Data Using Statistical Analysis and Large Language Models
Panagiotis Koletsis; Christos Panagiotopoulos; Georgios Papadopoulos; Vasilis Efthymiou
PDFImproving Column Type Annotation Using Large Language Models
Amir Babamahmoudi; Davood Rafiei; Mario Nascimento
PDFQuery Plan Generation for Table Question Answering
Ivan Poddubny; Nikita Dorodnykh
PDFTable Header Recognition Based on Large Language Models
Ilya I. Okhotin; Nikita Dorodnykh
PDFTOPJoin: A Context-Aware Multi-Criteria Approach for Joinable Column Search
Harsha Kokel; Aamod Kathiwada; Tejaswini Pedapanti; Haritha Ananthakrishnan; Oktie Hassanzadeh; Horst Samulowitz; Kavitha Srinivas
PDFEvaluating SQL Selection/Projection over Table Embeddings
Mariam Mellouli; Paolo Papotti
PDFOptimizing Source Selection for Tuple-Value Discovery
Ahmad Fares; Georgia Troullinou; Silviu Maniu; Sihem Amer-Yahia
PDFUniversal Embeddings of Tabular Data
Astrid Franz; Frederik Hoppe; Marianne Michaelis; Udo Göbel
PDFSemForest: Semantic-Aware Ontology Generation with Foundation Models
Guohui Guan; Sachin Konan; Larry Rudolph; Chang Ge
PDFStructText: A Synthetic Table-to-Text Approach for Benchmark Generation with Multi-Dimensional Evaluation
Satyananda Kashyap; Sola Shirai; Nandana Mihindukulasooriya; Horst Samulowitz
PDFTowards Fine-Grained Extraction of Scientific Claims from Heterogeneous Tables Using Large Language Models
Daniele Bertillo; Laks V.S. Lakshmanan; Paolo Merialdo; Divesh Srivastava
Keynote: TBA
Paolo Papotti
LLM + Spatial Workshop 1
Room: Gielgud (2F)
Introduction
Jianqiu Xu
Keynote: Spatial Data Systems in the LLM Era: 1+1=3? System Requirements and Research Opportunities
Walid G. Aref
PDFNALMOBench: Towards Benchmarking Natural Language Interfaces for Moving Objects Databases
Xieyang Wang; Weijia Yi; Mengyi Liu; Chenchen Zong
LLM + Spatial Workshop 2
Room: Gielgud (2F)
Keynote: Natural Language Maps: Generative AI for Spatial Data Generation, Querying, and Visualization
Ahmed Eldawy
Keynote: Geospatial entity representation: a step towards city foundation models.
Gao Cong
DaSH Workshop 1
Room: St James (4F)
Workshop Introduction
DaSH Organizer
Keynote: Building Data-Intensive Systems that Care
Sihem Amer-Yahia (CNRS, Univ. Grenoble Alpes)
Invited Talk: Unleashing Data Science: It's Time to Fix the Data Preparation Problem
El Kindi Rezig (Univ. of Utah)
DaSH Workshop 2
Room: St James (4F)
Keynote: Building AI-Driven Data Catalogs: A Great Playground for Human-in-the-Loop Research
AnHai Doan (Univ. of Wisconsin)
Invited Talk: Sign2Vis: Automated Data Visualization from Sign Language
Yao Wan (Huazhong University of Science and Technology)
Invited Talk: “How do you even know that stuff?” Barriers to expertise sharing among spreadsheet users
Nancy Xia (University College London)
DaSH Workshop 3
Room: St James (4F)
PDFReducing Human Effort in Evaluating Small and Medium Language Models as Students and as Teachers
Oleh Prostakov; Viacheslav Hodlevskyi; Nassim Bouarour; Adam Sanchez-Ayte; Noha Ibrahim; Sihem Amer-Yahia
PDFHuman + AI: Large-scale Data Curation for Multilingual Guardrails
Harshit Rajgarhia; Abhishek Mukherji; Fen Yik; Dominika Borek; Nicole Warren; Prithiviraj Pradeep
PDFDeepGit: Promoting Exploration and Discovery of Research Software with Human-Curated Graphs
Yilin Xia; Shin-Rong Tsai; Matthew Turk
PDFHierTOD: A Task-Oriented Dialogue System Driven by Hierarchical Goals
Lingbo Mo; Shun Jiang; Akash V Maharaj; J. Bernard Hishamunda; Yunyao Li
PDFAdobe Summit Concierge Evaluation with Human-in-the-loop
Yiru Chen; Sally Fang; Sai Sree Harsha; Dan Luo; Vaishnavi Muppala; Fei Wu; Shun Jiang; Kun Qian; Yunyao Li
DaSH Workshop 4
Room: St James (4F)
Keynote: SQL and Large Language Model: A Marriage Made in Heaven?
Paolo Papotti
Panel
Panelists: Sihem Amer-Yahia, El Kindi Rezig, AnHai Doan, Paolo Papotti
GuideAI Workshop 1
Room: Westminster (4F)
Keynote: TBA
Themis Palpanas
PDFModel Slicing for Responsible AI
Parke Godfrey; Lukasz Golab; Divesh Srivastava; Jarek Szlichta
PDFExperimentLens: Interactive Visual Analytics and Explainability for ML Experiment Management
Stavros Maroulis; Vassilis Stamatopoulos; Panagiotis Gidarakos; Konstantinos Tsopelas; Nikolas Masouras; Konstantinos Kozanis; Nikolas Theologitis; George Papastefanatos; Giorgos Giannopoulos; Erik Nilsson
GuideAI Workshop 2
Room: Westminster (4F)
Keynote: TBA
Sudeepa Roy
PDFLightUL: An Efficient Recommendation Unlearning Framework
Wentao Ning; Haorui He; Reynold Cheng; Nur Al Hasan Haldar; Ben Kao; Nan Huo; Bo Tang; Yupeng Li
PDFTowards Identifying Intent of Data Errors
Mohamed Ahmed Abdelmaksoud Mohamed; Konrad Rieck; Ziawasch Abedjan
PDFDBMS-LLM Integration Strategies in Industrial and Business Applications: Current Status and Future Challenges
Zhengtong Yan; Gongsheng Yuan; Qingsong Guo; Jiaheng Lu
DataAI Workshop 1
Room: Westminster (4F)
Opening Remarks: Welcome and Introduction to DATAI Workshop 2025
Workshop Chairs
Keynote: Data-centric Responsible AI from General ML to LLMs
Steven Euijong Whang
Keynote: Navigating Disruption: The Impact of AI Technologies on Data Integration Research
Ziawasch Abedjan
PDFSQL-ML: A SQL-Centric Framework for Building Efficient Feature Store
Ahmad Ghazal; Hanumath Maduri; Pekka Kostamaa
PDFA Low Latency Cache for Cloud RDBMS
Guohai Zhang; Xin Tang; Qingchen Chang; Huanchen Zhang; Kai Hwang; Yuesen Li; Runhuai Huang; Teng Wang; Wusheng Zhang; Ming Zhang; Qingchun Chen; Xiaodong Hou; Qian Wang
PDFThe Case for Intent-Based Query Rewriting
Gianna Lisa Nicolai; Patrick Hansert; Sebastian Michel
PDFLightweight Pipelines: Good Enough is Sometimes Better
Camilla Sancricca; Cinzia Cappiello
DataAI Workshop 2
Room: Westminster (4F)
Databases as AI Runtimes
Rihan Hai
Invited Talk: AI-Driven Data Typing: Toward Semantic and Functional Understanding of Relational Data
Chang Ge
PDFCleanAgent: Automating Data Standardization with LLM-based Agents
Danrui Qi; Zhengjie Miao; Jiannan Wang
PDFSoAgent: A Real-world Data Empowered Agent Pool to Facilitate LLM-Driven Generative Social Simulation
Na Ta; Kaiyu Li; Yushu Zhou; Yuhan Liu
PDFDeepSearch: LLM-powered Data Acquisition for Machine Learning
Kaiyu Li; Zhongxin Hu; Yuxin Gao; Yuyang Wu
PDFDetecting and Cleaning Errors in Personal Contact Information with Large Language Models
Anna-Christina Glock; Christine Dominka-Kiss; Philipp Korom; Lisa Ehrlinger
DEC Workshop 1
Room: Rutherford (4F)
Welcome and Introduction
Workshop Chairs
Invited Talk: Building a new Data Provider based on AI
Paul Groth
PDFMINiDM: Multi-Issue Negotiation in Decentralised Data Marketplaces
Soulmaz Gheisari; Jaime Osvaldo Salas; Semih Yumusak; George Konstantinidis
DEC Workshop 2
Room: Rutherford (4F)
PDFAn Interpretable Market-based Data Price Prediction Tool
Santiago Andrés Azcoitia; Alicia Cabrero Jiménez
PDFUxV-DPN: Utility-vs-Value Data Pricing and Negotiation Mechanism in Machine Learning Data Marketplace
Hajar Baghcheband; Carlos Soares; Luis Paulo Reis
PDFMixture-of-Experts based Model Market
Yizhou Ma; Xikun Jiang; Wenbo Wu; Zhuoqin Yang; Luis-Daniel Ibáñez
PDFLLMDap: LLM-based Data Profiling and Sharing
Shanshan Jiang; Sondre Sørbø; Phil Tinn; Shang Ferheng Karim; Dumitru Roman
Closure
Workshop Chairs
LS-NSL Workshop 1
Room: Rutherford (4F)
Keynote: On Retrieving & Reasoning LLMs: Myths, Merits, and How to Move Forward
Dan Roth
PDFGraph Consistency Rule Mining with LLMs: an Exploratory Study
Hoa Le Thi; Angela Bonifati; Andrea Mauri
PDFModular Neuro-Symbolic Knowledge Graph Completion
Abelardo Carlos Martinez Lorenzo; Alexander Perfilyev; Volker Markl; Martha Clokie; Thomas Sicheritz-Pontén; Zoi Kaoudi
Certain and Approximately Certain Models for Statistical Learning
Cheng Zhen; Nischal Aryal; Arash Termehchy; Amandeep Singh Chabada
LS-NSL Workshop 2
Room: Rutherford (4F)
Keynote: Decoding the Interaction of Symbolic and Parametric Knowledge
Jeff Pan
ASP Scaffolds for Robust Reasoning and Decoding
Pravana Madhyastha
Representation Invariance of GNNs: Going Beyond Isomorphism
Jasmin Mousavi; Bishwash Kc; Arash Termehchy
PDFConstraint-aware Learning of Probabilistic Sequential Models for Multi-Label Classification
Mykhailo Buleshnyi; Anna Polova; Zsolt Zombori; Michael Benedikt
LSGDA Workshop 1
Room: Abbey (4F)
Opening Remarks
Workshop Chairs
Keynote: Exploring the Connections Between Social Network Analysis and Graph Analytics
Tamer Özsu (University of Waterloo)
Keynote: Improving Transportation in Road Networks Using Big Vehicle Trajectory Data
Christian S. Jensen (Aalborg University)
LSGDA Workshop 2
Room: Abbey (4F)
PDFTop‑r Influential Community Search in Bipartite Graphs
Yanxin Zhang; Zhengyu Hua; Long Yuan
PDFHyracks Unchained: Efficient Recursion for Navigational Queries in Apache AsterixDB
Glenn Galvizo; Michael Carey
PDFGAL: Topology-Aware Serialization for Graph Traversals
Zeynep Korkmaz; Tamer Özsu; Khuzaima Daudjee
PDFTo What Extent Does Quality Matter? The Impact of Graph Data Quality on GNN Model Performance
Jana Vatter; Maurice L. Rochau; Ruben Mayer; Hans-Arno Jacobsen
PDFEnGraph: Ensemble-Based Augmentation for Graph Anomaly Detection
Andrew Shields; Robert Sheehy; Pat Doody
PDFImproving the Accessibility of Port Operations in Supply Chain Management Using Graph Data Analysis
Mert Ayas; Frank Laarmann; Leif Meier; Katja Zeume
LSGDA Workshop 3
Room: Abbey (4F)
Industry Talk: BG3: A Cost Effective and I/O Efficient Graph Database in ByteDance
Chen Cheng
Keynote: Parallel Graph Structural Analytics: Systems and Visions for Next-Generation Graph Data Science
Da Yan (University of Alabama at Birmingham)
LSGDA Workshop 4
Room: Abbey (4F)
PDFSemantic Embedding for Enterprise Clustering: A Systematic and Scalable Approach Using Sentence Transformers
Yigong Xiao; Xianzhi Lei; Kecheng Wang; Changan Zhou; Niannian Huang
PDFShape-Aware, Scale-Agnostic Representation of Dynamic DAGs
Jennifer Neumann; Peter M. Fischer
PDFGrowing Up HAL: Historic and Property Graph Queries
Muhammad Khan; Ioana Manolescu; Angelos-Christos Anadiotis
PDFEfficient Betweenness Maximization in Temporal Networks
Xijuan Liu; Kejia Xu; Lele Zhang; Haiyang Hu; Ying Zhang
PDFSingle-Source Regular Path Querying in Terms of Linear Algebra
Semyon Grigorev; Georgiy Belyanin; Rodion Suvorov
Closing Remarks & Awards
Workshop Chairs
CDMS Workshop 1
Room: Moore (4F)
Opening Remarks
Workshop Chairs
Keynote: A Historical Perspective on Extensible and Composable Data Systems
C. Mohan
PDFA Learned Cost Model-based Cross-engine Optimizer for SQL Workloads
András Strausz; Niels Pardon; Ioana Giurgiu
PDFRethinking Pluggable Federated Query Optimization: From Laptops to Data Warehouses
Victor Giannakouris; Immanuel Trummer
PDFEudoxia: a FaaS scheduling simulator for the composable lakehouse
Tapan Srivastava; Jacopo Tagliabue; Ciro Greco
CDMS Workshop 2
Room: Moore (4F)
Panel: The Role of Composability in Building HTAP Systems
C. Mohan (Moderator)
CDMS Workshop 3
Room: Moore (4F)
Keynote: Speedrunning a lakehouse: a composable FaaS over object storage
Jacopo Tagliabue
Keynote: Theseus a Composable distributed execution runtime: Performance across GPUs, Networks, and Storage
Felipe Aramburu
CDMS Workshop 4
Room: Moore (4F)
PDFComposability and Interoperability for Federated Data Systems
Haralampos Gavriilidis; Leonhard Rose; Joel Ziegler; Jonathan Gerloff; Benedikt Didrich; Midhun Kaippillil Venugopalan; Kaustubh Beedkar; Matthias Boehm; Volker Markl
PDFComposing XGBoost UDFs with Arrow Flight
Hussain Sultan
PDFBuilding IBM watsonx.data from Composable Parts
Aditi Pandit
Keynote: What PostgreSQL Extensibility Can Teach Us About Composable Data Management Systems
Abigale Kim
PDFThe Deconstructed Warehouse: An Ephemeral Query Engine Design for Apache Iceberg
Ryan Curtin; Jacopo Tagliabue
PDFDAG lakehouse planning with an ephemeral and embedded graph database
Luca Bigon; Jacopo Tagliabue; Semih Salihoğlu
PDFGranPipe: Composable Hierarchical Pipelines for Near-Data Processing
Johannes Pietrzyk; Wolfgang Lehner; Dirk Habich; Philippe Bonnet
PDFLanceDB-Embracing Composability in the Storage Layer
Weston Pace; Chang She; Lei Xu; Will Jones; Rob Meng; Yang Cen
Closing Remarks
Workshop Chairs
