I am an Assistant Professor in the IIIS (Yao Class) at Tsinghua University. My research interest is in database management systems. I have particular interests in indexing/filtering data structures, data compression, and cloud databases. I received my Ph.D. degree from the Computer Science Department at Carnegie Mellon University. Before joining Tsinghua, I worked at Snowflake as a Postdoctoral Research Fellow.
[CV]
UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis
Yulong Hui, Yao Lu, and Huanchen Zhang
Proceedings of the Thirty-Eighth Annual Conference on Neural Information Processing
(NeurIPS'24), December 2024.
CODE
Cloud-Native Databases: A Survey
Haowen Dong, Chao Zhang, Guoliang Li, and Huanchen Zhang
IEEE Transactions on Knowledge and Data Engineering
(TKDE'24), 36(12), pp. 7772-7791.
Blitzcrank: Fast Semantic Compression for In-memory Online Transaction Processing
Yiming Qiao, Yihan Gao, and Huanchen Zhang
Proceedings of the VLDB Endowment
(VLDB'24), 17(10), pp. 2528-2540.
CODE
An Empirical Evaluation of Columnar Storage Formats
Xinyu Zeng, Yulong Hui, Jiahong Shen, Andrew Pavlo, Wes McKinney, and Huanchen Zhang
Proceedings of the VLDB Endowment
(VLDB'24), 17(2), pp. 148-161.
CODE
NULLS!: Revisiting Null Representation in Modern Columnar Formats
Xinyu Zeng, Ruijun Meng, Andrew Pavlo, Wes McKinney, and Huanchen Zhang
Proceedings of the 20th International Workshop on Data Management on New Hardware
(DaMoN'24), June 2024.
CODE
Making In-Memory Learned Indexes Efficient on Disk
Jiaoyi Zhang, Kai Su, and Huanchen Zhang
Proceedings of the ACM on Management of Data
(SIGMOD'24), 2(3): Article 151, 26 pages.
CODE
GRF: A Global Range Filter for LSM-Trees with Shape Encoding
Hengrui Wang, Te Guo, Junzhao Yang, and Huanchen Zhang
Proceedings of the ACM on Management of Data
(SIGMOD'24), 2(3): Article 141, 27 pages.
PimPam: Efficient Graph Pattern Matching on Real Processing-in-Memory Hardware
Shuangyu Cai, Boyu Tian, Huanchen Zhang, Mingyu Gao
Proceedings of the ACM on Management of Data
(SIGMOD'24), 2(3): Article 161, 25 pages.
LeCo: Lightweight Compression via Learning Serial Correlations
Yihao Liu, Xinyu Zeng, and Huanchen Zhang
Proceedings of the ACM on Management of Data
(SIGMOD'24), 2(1): Article 65, 28 pages.
CODE
SALI: A Scalable Adaptive Learned Index Framework based on Probability Models
Jiake Ge, Huanchen Zhang, Boyu Shi, Yuanhui Luo, Yunda Guo, Yunpeng Chai, Yuxing Chen, and Anqun Pan
Proceedings of the ACM on Management of Data
(SIGMOD'24), 1(4): Article 258, 25 pages.
Beyond Bloom: A Tutorial on Future Feature-Rich Filters
Prashant Pandey, Martín Farach-Colton, Niv Dayan, and Huanchen Zhang
Proceedings of the ACM on Management of Data
(SIGMOD'24) Tutorial, June 2024.
Cost-Intelligent Data Analytics in the Cloud
Huanchen Zhang, Yihao Liu, Jiaqi Yan
Proceedings of the 2024 Conference on
Innovative Data Systems Research (CIDR'24), January 2024.
SLIDES
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
Zhiyu Mei, Wei Fu, Guangju Wang, Huanchen Zhang, and Yi Wu
Proceedings of the Twelfth International Conference on Learning Representations
(ICLR'24), May 2024.
CODE
AdaCom: Adaptive Compression For Databases
Leon Windheuser, Christoph Anneser, Huanchen Zhang, Thomas Neumann, and Alfons Kemper
Proceedings of the 27th International Conference on Extending Database Technology
(EDBT'24), March 2024.
G-Learned Index: Enabling Efficient Learned Index on GPU
Jiesong Liu, Feng Zhang, Lv Lu, Chang Qi, Xiaoguang Guo, Dong Deng, Guoliang Li, Huanchen Zhang, Jidong Zhai,
Hechen Zhang, Yuxing Chen, Anqun Pan, Xiaoyong Du
IEEE Transactions on Parallel and Distributed Systems
(TPDS'24), 35(6), pp. 795-812.
Compressed Data Direct Computing for Databases
Weitao Wan, Feng Zhang Zhang, Chenyang Zhang, Mingde Zhang, Jidong Zhai, Chai, Huanchen Zhang, Wei Lu,
Yuxing Chen, Haixiang Li, Anqun Pan, and Xiaoyong Du
IEEE Transactions on Knowledge and Data Engineering
(TKDE'23), 36(5), pp. 1902-1918.
Efficient Query Re-optimization with Judicious Subquery Selections
Junyi Zhao, Huanchen Zhang, and Yihan Gao
Proceedings of the ACM on Management of Data
(SIGMOD'23), 1(2), pp. 1-26.
CODE
When Tree Meets Hash: Reducing Random Reads for Index Structures on Persistent Memories
Ke Wang, Guanqun Yang, YiWei Li, Huanchen Zhang, and Mingyu Gao
Proceedings of the ACM on Management of Data
(SIGMOD'23), 1(1), pp. 1-26.
CODE
CompressGraph: Efficient Parallel Graph Analytics with Rule-Based Compression
Zheng Chen, Feng Zhang, JiaWei Guan, Jidong Zhai, Xipeng Shen, Huanchen Zhang, Wentong Shu, and Xiaoyong Du
Proceedings of the ACM on Management of Data
(SIGMOD'23), 1(1), pp. 1-31.
Blink-hash: An Adaptive Hybrid Index for In-Memory Time-Series Databases
Hokeun Cha, Xiangpeng Hao, Tianzheng Wang, Huanchen Zhang, Aditya Akella, and Xiangyao Yu
Proceedings of the VLDB Endowment
(VLDB'23), 16(6), pp. 1235-1248.
CODE
REncoder: A Space-Time Efficient Range Filter with Local Encoder
Wang, Ziwei, Zheng Zhong, Jiarui Guo, Yuhan Wu, Haoyu Li, Tong Yang, Yaofeng Tu, Huanchen Zhang, and Bin Cui
Proceedings of the 39th IEEE International Conference on Data Engineering
(ICDE'23).
CODE
Adaptive Hybrid Indexes
Christoph Anneser, Andreas Kipf, Huanchen Zhang, Thomas Neumann, and Alfons Kemper
Proceedings of the 2022 International Conference on Management of Data
(SIGMOD'22), June 2022, pp. 1626-1639.
Proteus: A Self-Designing Range Filter
Eric R. Knorr, Baptiste Lemaire, Andrew Lim, Siqiang Luo, Huanchen Zhang, Stratos Idreos, Michael Mitzenmacher
Proceedings of the 2022 International Conference on Management of Data
(SIGMOD'22), June 2022, pp. 1670-1684.
CODE
Succinct Range Filters
Huanchen Zhang, Hyeontaek Lim, Viktor Leis,
David G. Andersen, Michael Kaminsky, Kimberly Keeton, and Andrew Pavlo
Communications of the ACM (CACM). 4 (April 2021): 166-173.
Everything is a Transaction: Unifying Logical Concurrency Control and Physical Data Structure Maintenance in
Database Management Systems
Ling Zhang, Matthew Butrovich, Tianyu Li, Yash Nannapanei, Andrew Pavlo, John Rollinson, Huanchen Zhang
Proceedings of the Conference on Innovative Data Systems Research (CIDR'21), Jan. 2021.
Memory-Efficient Search Trees for Database Management Systems
2021 SIGMOD Jim Gray Dissertation Award
Ph.D. Dissertation, February 2020
SLIDES
VIDEO
Order-Preserving
Key Compression for In-Memory Search Trees
Huanchen Zhang, Lily Liu, David G. Andersen, Michael Kaminsky, Kimberly Keeton, and Andrew Pavlo
Proceedings of the 2020 International Conference
on Management of Data (SIGMOD'20), June 2020.
ARXIV
CODE
SLIDES
VIDEO
Succinct Range Filters
Huanchen Zhang, Hyeontaek Lim, Viktor Leis,
David G. Andersen, Michael Kaminsky, Kimberly Keeton, and Andrew Pavlo
ACM Transactions on Database Systems (TODS). 45.2 (2020): 1-31.
Succinct Range Filters
Huanchen Zhang, Hyeontaek Lim, Viktor Leis,
David G. Andersen, Michael Kaminsky, Kimberly Keeton, and Andrew Pavlo
ACM SIGMOD Record. 48.1 (2019): 78-85.
SuRF: Practical Range Query Filtering with Fast Succinct Tries
Best Paper Award
Huanchen Zhang, Hyeontaek Lim, Viktor Leis,
David G. Andersen, Michael Kaminsky, Kimberly Keeton, and Andrew Pavlo
Proceedings of the 2018 International Conference
on Management of Data (SIGMOD'18), June 2018, pp. 323–336.
CODE
DEMO
SLIDES
VIDEO
Building a Bw-Tree Takes More Than Just Buzz Words.
Ziqi Wang, Andrew Pavlo, Hyeontaek Lim, Viktor Leis,
Huanchen Zhang, Michael Kaminsky, and David G. Andersen
Proceedings of the 2018 International Conference on Management of Data
(SIGMOD'18), June 2018, pp. 473-488.
CODE
Reducing the Storage Overhead of Main-Memory OLTP Databases with Hybrid Indexes
Huanchen Zhang, David G. Andersen, Andrew Pavlo,
Michael Kaminsky, Lin Ma, and Rui Shen
Proceedings of the 2016 International Conference on
Management of Data (SIGMOD'16), June 2016, pp. 1567–1581.
SLIDES