VLDB 2025: Accepted Industrial Papers

Title Authors
Ursa: A Lakehouse-Native Data Streaming Engine for Kafka Sijie Guo (StreamNative);Matteo Merli (StreamNative);Hang Chen (StreamNative);Neng Lu (StreamNative);Penghui Li (StreamNative)
Delta Sharing: An Open Protocol for Cross-Platform Data Sharing Krishna Puttaswamy (Databricks);Abhijit Chakankar (Databricks);Tao Tao (Databricks);Zaheera Valani (Databricks);Ramesh Chandra (Databricks);William Chau (Databricks);Mengxi Chen (Databricks);Akram Chetibi (Databricks);Tianyi Huang (Databricks);Jonathan Keller (Databricks);Celia Kung (Databricks);Andy Liu (Databricks);Charlene Lyu (Databricks);Samarth Shetty (Databricks);Xiaotong Sun (Databricks);Steve Weis (Databricks);Lin Zhou (Databricks);Ryan Zhu (Databricks);Reynold Xin (Databricks);Matei Zaharia (Databricks)
Scribe: How Meta transports terabytes per second in real time Manos Karpathiotakis (Facebook);Vlassios Rizopoulos (Facebook);Artem Gelun (Facebook);Tiziano Carotti (Facebook);Hazem Nada (Facebook);Basri Kahveci (Facebook);Yuri Dolgov (Facebook)
Disaggregated State Management in Apache Flink® 2.0 Yuan Mei (Alibaba Cloud);Zhaoqian Lan (Alibaba Cloud);Lei Huang (Boston University);Yanfei Lei (Alibaba Cloud);Han Yin (Alibaba Cloud);Rui Xia (Alibaba Cloud);Kaitian Hu (Alibaba Cloud);Paris Carbone (KTH Royal Institute of Technology);Vasiliki Kalavri (Boston University);Feng Wang (Alibaba Cloud)
Cloudy With a Chance of JSON Murtadha Al Hubail (Couchbase, Inc.);Ali Alsuliman (Couchbase, Inc.);Wail Alkowaileet (Saudi National Center for AI (NCAI));Michael Blow (Couchbase, Inc.);Michael Carey (UC Irvine);Savyasach Enukonda (Couchbase, Inc.);Peeyush Gupta (Couchbase, Inc.);Santosh Hegde (Couchbase, Inc.);Kamini Jagtiani (Couchbase, Inc.);Abhishek Jindal (Couchbase, Inc.);Nawazish Kahn (Couchbase, Inc.);Mehnaz Mahin (UC Riverside);Ian Maxon (Couchbase, Inc.);M Muralikrishna (Couchbase, Inc.);Keshav Murthy (Couchbase, Inc);Daniel Nagy (Couchbase, Inc.);Preetham Poluparthi (Couchbase, Inc.);Ankit Prabhu (Couchbase, Inc.);Ritik Raj (Couchbase, Inc.);Vijay Sarathy (Couchbase, Inc.);Shahrzad Shirazi (UC Riverside);Utsav Singh (Couchbase, Inc.);Hussain Towaileb (Couchbase, Inc.);Ayush Tripathi (Couchbase, Inc.);Janhavi Tripurwar (Couchbase, Inc.);Bo-Chun Wang (Couchbase, Inc.);Till Westmann (Couchbase, Inc.)
Streaming View: An Efficient Data Processing Engine for Modern Real-time Data Warehouse of Alibaba Cloud Fangyuan Zhang (The Chinese University of Hong Kong);Mengqi Wu (Alibaba Cloud);Chunlei Xu (Alibaba Cloud);Yunong Bao (Alibaba Cloud);Jiyu Qiao (Alibaba Cloud);Yingli Zhou (Alibaba Cloud);Hua Fan (Alibaba Cloud);Caihua Yin (Alibaba Cloud);Wenchao Zhou (Alibaba Cloud);Feifei Li (Alibaba Cloud)
veDB-HTAP: a Highly Integrated, Efficient and Adaptive HTAP System Jianjun Chen (ByteDance Inc);Li Zhang (ByteDance Inc);Yu Xie (ByteDance Inc);Wei Ding (ByteDance Inc);Lixun Cao (ByteDance Inc);Ye Liu (ByteDance Inc);Yonghua Ding (ByteDance Inc);Fangshi Li (ByteDance Inc);Ke Wu (ByteDance Inc);Haibo Xiu (Duke University);Kui Wei (ByteDance Inc);Le Cai (ByteDance Inc);Rui Chang (ByteDance Inc);Yuxiang Chen (ByteDance Inc);Yuanjin Lin (ByteDance Inc);Shangyu Luo (ByteDance Inc);Jianfeng Qian (ByteDance Inc);Xu Wang (ByteDance Inc);Zikang Wang (ByteDance Inc);Jian Zhang (ByteDance Inc);Mingyi Zhang (ByteDance Inc);Shicai Zeng (ByteDance Inc);Jason Sun (ByteDance Inc);Lei Zhang (ByteDance Inc);Rui Shi (ByteDance Inc);Pengwei Zhao (ByteDance Inc)
Freely Moving Between the OLTP and OLAP Worlds: Hermes – an High-Performance OLAP Accelerator for MySQL Tim Gubner (Huawei);Rune Humborstad (Huawei);Manyi Lu (Huawei)
Workload Insights From the Snowflake Data Cloud: What Do Production Analytic Queries Really Look Like? Jan Vincent Szlang (Snowflake);Sebastian Breß (Snowflake);Sebastian Cattes (Snowflake);Jonathan Dees (Snowflake);Florian Funke (Snowflake);Max Heimel (Snowflake);Michel Oleynik (Snowflake);Ismail Oukid (Snowflake);Tobias Maltenberger (Google)
AnalyticDB-PG: A Cloud-native High-performance Data Warehouse in Alibaba Cloud Fangyuan Zhang (The Chinese University of Hong Kong);Caihua Yin (Alibaba Cloud);Hua Fan (Alibaba Cloud Computing);Fenghua Fang (Alibaba Cloud);Yineng Chen (Alibaba Cloud);Xuqi Wang (Alibaba Cloud);Mengqi Wu (Alibaba Cloud);Bing Chen (Alibaba Cloud);Tianbo Jin (Alibaba Cloud);Sibo Wang (The Chinese University of Hong Kong);Wenchao Zhou (Alibaba Cloud);Feifei Li (Alibaba Cloud)
Unlocking the Power of CI/CD for Data Pipelines in Distributed Data Warehouses Hongtao Yang (Google);Zhichen Xu (Google);Sergey Yudin (Google);Andrew Davidson (Google)
Towards Principled, Practical Document Database Design Michael Carey (UC Irvine);Wail Alkowaileet (Saudi National Center for AI (NCAI));Nick DiGeronimo (UC Irvine);Peeyush Gupta (Couchbase);Sachin Smotra (Dataworkz);Till Westmann (Couchbase)
Towards Automated Cross-domain Exploratory Data Analysis through Large Language Models Jun-Peng Zhu (East China Normal University);Boyan Niu (PingCAP);Peng Cai (East China Normal University);Zheming Ni (PingCAP);Jianwei Wan (PingCAP);Kai Xu (PingCAP);Jiajun Huang (PingCAP);Shengbo Ma (PingCAP);Bing Wang (PingCAP);Xuan Zhou (East China Normal University);Guanglei Bao (PingCAP);Donghui Zhang (PingCAP);Liu Tang (PingCAP);Qi Liu (PingCAP)
GaussDB-Vector: A Large-Scale Persistent Real-Time Vector Database for LLM Applications Ji Sun (Tsinghua University);Guoliang Li (Tsinghua University);James Pan (Tsinghua University);Jiang Wang (Huawei);Yongqing Xie (Huawei);Ruicheng Liu (Huawei);Wen Nie (Huawei)
GalaxyWeaver: Autonomous Table-to-Graph Conversion and Schema Optimization with Large Language Models Bing Tong (The Hong Kong University of Science and Technology (Guangzhou));Yan Zhou (Zhejiang CreateLink Technology);Chen Zhang (Zhejiang CreateLink Technology);Jianheng Tang (The Hong Kong University of Science and Technology);Jia Li (The Hong Kong University of Science and Technology);Lei Chen (The Hong Kong University of Science and Technology)
VSAG: An Optimized Search Framework for Graph-based Approximate Nearest Neighbor Search Xiaoyao Zhong (Ant Group);Haotian Li (Ant Group);Jiabao Jin (Ant Group);Mingyu Yang (Ant Group);Deming Chu (Ant Group);Xiangyu Wang (Ant Group);Zhitao Shen (Ant Group);Wei Jia (Ant Group);George Gu (Intel);Yi Xie (Intel);Xuemin Lin (Shanghai Jiaotong University);Heng Tao Shen (Tongji University);Jingkuan Song (Tongji University);Peng Cheng (Tongji University)
Cost-Effective, Low Latency Vector Search with Azure Cosmos DB Nitish Upreti (Microsoft);Harsha Simhadri (Microsoft);Hari Sundar (Microsoft);Krishnan Sundaram (Microsoft);Samer Boshra (Microsoft);Bala Perumalswamy (Microsoft);Shivam Atri (Microsoft);Martin Chisholm (Microsoft);Revti Singh (Microsoft);Greg Yang (Microsoft);Tamara Hass (Microsoft);Nitesh Dudhey (Microsoft);Subramanyam Pattipaka (Microsoft);Mark Hildebrand (Microsoft);Magdalen Manohar (Microsoft);Jack Moffitt (Microsoft);Haiyang Xu (Microsoft);Naren Datha (Microsoft);Suryansh Gupta (Microsoft);Ravi Krishnaswamy (Microsoft);Prashant Gupta (Microsoft);Abhishek Sahu (Microsoft);Hemeswari Varada (Microsoft);Sudhanshu Barthwal (Microsoft);Ritika Mor (Microsoft);James Codella (Microsoft);Shaun Cooper (Microsoft);Kevin Pilch (Microsoft);Simon Moreno (Microsoft);Aayush Kataria (Microsoft);Santosh Kulkarni (Microsoft);Neil Deshpande (Microsoft);Amar Sagare (Microsoft);Dinesh Billa (Microsoft);Zishan Fu (Microsoft);Vipul Vishal (Microsoft)
LEADRE: Multi-Faceted Knowledge Enhanced LLM Empowered Display Advertisement Recommender System Fengxin Li (Renmin University of China);Yi Li (Tencent Inc.);Yue Liu (Tencent Inc.);Chao Zhou (Tencent Inc.);Yuan Wang (Tencent Inc.);Xiaoxiang Deng (Tencent Inc.);Wei Xue (Tencent Inc.);Dapeng Liu (Tencent Inc.);Lei Xiao (Tencent Inc.);Haijie Gu (Tencent Inc.);Jie Jiang (Tencent Inc.);Hongyan Liu (Tsinghua University);Biao Qin (Renmin University of China);Jun He (Renmin University of China)
SiriusBI: A Comprehensive LLM-powered Solution for Data Analytics in Business Intelligence Jie Jiang (Department of Data Platform, TEG, Tencent Inc.);Haining Xie (Department of Data Platform, TEG, Tencent Inc.);Siqi Shen (Center of Machine Learning Research, Peking University);Yu Shen (Department of Data Platform, TEG, Tencent Inc.);Zihan Zhang (Department of Data Platform, TEG, Tencent Inc.);Meng Lei (Department of Data Platform, TEG, Tencent Inc.);Yifeng Zheng (Department of Data Platform, TEG, Tencent Inc.);Yang Li (Department of Data Platform, TEG, Tencent Inc.);Chunyou Li ( Department of Data Platform, TEG, Tencent Inc.);Danqing Huang (Department of Data Platform, TEG, Tencent Inc.);Yinjun Wu (School of Computer Science, Peking University);Wentao Zhang (Center of Machine Learning Research, Peking University);Xiaofeng Yang (Department of Data Platform, TEG, Tencent Inc.);Bin Cui (Department of Data Platform, TEG, Tencent Inc.);Peng Chen (Department of Data Platform, TEG, Tencent Inc.)
SagaLLM: Context Management, Validation, and Transaction Guarantees for Multi-Agent LLM Planning Edward Chang (Stanford University;HTC DeepQ);Longling Geng (Stanford University)
Magnus: A Holistic Approach to Data Management for Large-Scale Machine Learning Workloads Jun Song (ByteDance Inc.);Jingyi Ding (Zhejiang University);Irshad Kandy (ByteDance Inc.);Yanghao Lin (ByteDance Inc.);Zhongjia Wei (ByteDance Inc.);Zilong Zhou (ByteDance Inc.);Zhiwei Peng (ByteDance Inc.);Jixi Shan (ByteDance Inc.);Hongyue Mao (ByteDance Inc.);Xiuqi Huang (Zhejiang University);Xun Song (ByteDance Inc.);Cheng Chen (ByteDance Inc.);Yanjia Li (ByteDance Inc.);Tianhao Yang (ByteDance Inc.);Wei Jia (ByteDance Inc.);Xiaohong Dong (ByteDance Inc.);Kang Lei (ByteDance Inc.);Rui Shi (ByteDance Inc.);Pengwei Zhao (ByteDance Inc.);Wei Chen (Zhejiang University)
DECK: Experiences on Delta Checkpointing for Industrial Recommendation Systems Xin Gao (Meta);Sibasish Acharya (Meta);Sihui Han (Meta);Yongxiong Ren (Meta);Yanli Zhao (Meta);Liang Luo (Meta);Chucheng Wang (Meta);Pradeep Fernando (Meta);Saurabh Mishra (Meta);Siqi Yan (Meta);Yicong Du (Meta);Elzbieta Krepska (Meta);Intaik Park (Meta);Min Ni (Meta);Qunshu Zhang (Meta);Shen Li (Meta)
R-Bot: An LLM-based Query Rewrite System Zhaoyan Sun (Tsinghua University);Xuanhe Zhou (Shanghai Jiao Tong University);Guoliang Li (Tsinghua University);Xiang Yu (Huawei Company);Jianhua Feng (Tsinghua University);Yong Zhang (Tsinghua University)
TuskFlow: An Efficient Graph Database for Long-Running Transactions Georgios Theodorakis (Neo4j);Hugo Firth (Neo4j);James Clarkson (Neo4j);Natacha Crooks (UC Berkeley);Jim Webber (Neo4j)
MD-MVCC: Multi-version Concurrency Control for Schema Changes in Azure SQL Database Panagiotis Antonopoulos (Microsoft);Mansi Chauhan (Microsoft);Shailender Dabas (Microsoft);Rajat Jain (Microsoft);Darshan Kattera (Microsoft);Wonseok Kim (Microsoft);Hanuma Kodavalla (Microsoft);Nikolas Ogg (Microsoft);Prashanth Purnananda (Microsoft);Rahul Ranjan (Microsoft);Alex Swanson (Microsoft);Divyesh Tikmani (Microsoft)
From FASTER to F2: Evolving Concurrent Key-Value Store Designs for Large Skewed Workloads Konstantinos Kanellis (University of Wisconsin-Madison);Badrish Chandramouli (Microsoft Research);Ted Hart (Microsoft Research);Shivaram Venkataraman (University of Wisconsin-Madison)
FDBKeeper: Enabling Scalable Coordination Services for Metadata Management using Distributed Key-Value Databases Jun-Peng Zhu (East China Normal University & PingCAP);Lingfeng Zhang (East China Normal University);Peng Cai (East China Normal University);Xuan Zhou (East China Normal University);Peisen Zhao (East China Normal University);Xue Wang (Moqi Inc);Linpeng Tang (Moqi Inc)
Design and Modular Verification of Distributed Transactions in MongoDB William Schultz (MongoDB);Murat Demirbas (MongoDB)
[Industry] From Scale-Up to Scale-Out: PolarDB’s Journey to Achieving 2 Billion tpmC Xinjun Yang (Alibaba Cloud Computing );Feifei Li (Alibaba Cloud Computing );Yingqiang Zhang (Alibaba Cloud Computing);Hao Chen (Alibaba Cloud Computing);Qingda Hu (Alibaba Cloud Computing );Panfeng Zhou (Alibaba Cloud Computing );Qiang Zhang (Alibaba Cloud Computing );Shuai Li (Alibaba Cloud Computing );Zongzhi Chen (Alibaba Cloud Computing );Zheyu Miao (Alibaba Cloud Computing );Rongbiao Xie (Alibaba Cloud Computing );Chuan Sun (Alibaba Cloud Computing );Zetao Wei (Alibaba Cloud Computing );Jing Fang (Alibaba Cloud Computing );Xingxuan Zhou (Alibaba Cloud Computing );Xiaofei Wu (Alibaba Cloud Computing )
The HANA Native Query Engine for Lakehouse Systems Daniel Ritter (SAP);Mihnea Andrei (SAP);Sukhyeun Cho (SAP);Maik Goergens (SAP);Taehyung Lee (SAP);Norman May (SAP);Amit Pathak (SAP);Paul Willems (SAP)
Automatic Indexing in Oracle Sunil Chakkappen (Oracle);Shreya Kunjibettu (Oracle);Daniel McGreer (Oracle);Masoomeh Kishi (Oracle);Hong Su (Oracle);Mohamed Ziauddin (Oracle);Mohamed Zait (Databricks);Zhan Li (Meta);Yuying Zhang (Google)
GRewriter: Practical Query Rewriting with Automatic Rule Set Expansion in GaussDB Zhe Jiang (Shanghai Jiao Tong University);Zhaoguo Wang (Shanghai Jiao Tong University);Haoning Lan (Shanghai Jiao Tong University);Chuzhe Tang (Shanghai Jiao Tong University);Haoran Ding (Shanghai Jiao Tong University);Lefeng Wang (Shanghai Jiao Tong University);Songyun Zou (Shanghai Jiao Tong University);Zhuoran Wei (Shanghai Jiao Tong University);Yongcun Liu (Huawei Technologies Co.);Xiang Yu (Huawei Technologies Co.);Yang Ren (Huawei Technologies Co.);Guoliang Li (Tsinghua University);Haibo Chen (Shanghai Jiao Tong University)
ScaleCache: Scalable and Production-grade Buffer Management for Disk-based Database Systems Mingyu Liu (Huawei);Junbin Kang (Huawei);Kai Wang (Huawei);Lu Zhang (Huawei);Haibo Chen (Huawei);Xiuchang Li (Huawei);Tianhong Ding (Huawei)
SQL:Trek Automated Index Design at Airbnb Sam Lightstone (airbnb);Ping Wang (airbnb)
Grouping, subsumption, and disjunctive join optimisations in Oracle Rafi Ahmed (Oracle);Krishna Kantikiran Pasupuleti (Oracle);Sriram Tirupattur (Oracle);Lei Sheng (Oracle);Hong Su (Oracle);Mohamed Ziauddin (Oracle)