Xiaodong Zhang's representative publications

Publications of Xiaodong Zhang

Influential Papers

Books

Data Management: Interactions with Computer Architecture and Systems, Cambridge University Press, 2024.

Selected Publications

"RayDB: Building Databases with Ray Tracing Cores" , Proceedings of the VLDB Endowment, Vol. 19, No. 1, (VLDB 2026), Boston, MA, USA, August 31st - September 4th, 2026.

"A stable marriage requires a shared residence with low contention and mutual complementarity" , Proceedings of 34th International Conference on Parallel Architectures and Compilation Techniques, (PACT 2025), Irvine, California, USA, November 3-6, 2025.

"X-Blossom: massive parallelization of graph maximum matching" , Proceedings of the VLDB Endowment, Vol. 18, No. 10, (VLDB 2025), London, United Kingdom, September 1-5, 2025.

"A case study for ray tracing cores: performance insights with breadth-first search and triangle counting in graphs" , Proceedings of the ACM on Measurement and Analysis of Computing Systems (SIGMETRICS 2025), Stoney Brooks, New York, USA, June 9-13, 2025.

"LibRTS: a spatial indexing library by ray tracing" , Proceedings of 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP 2025), Las Vegas, USA, March 2-5, 2025.

``RR-Compound: RDMA-fused gRPC for low latency, high throughput, and easy interface", IEEE Transactions on Parallel and Distributed Systems, Vol. 35, No. 8, August 2024, pp. 1488-1505.

``X-TED: massive parallelization of tree edit distance", Proceedings of the VLDB Endowment, Vol 17, Issue 7, pp 1683-1696. (VLDB 2024), Guangzhou, China, August 26-30, 2024.

``High-performance spatial data analytics: systematic R&D for scale-out and scale-up solutions from the past to now", Proceedings of the VLDB Endowment, Vol 17, Issue 7, pp 4507-4519. (VLDB 2024), Guangzhou, China, August 26-30, 2024.

``RTScan: efficient scan with ray tracing cores", Proceedings of the VLDB Endowment, Vol 17, Issue 6, pp 1460-1472. (VLDB 2024), Guangzhou, China, August 26-30, 2024.

``RayJoin: fast and precise spatial join", Proceedings of 38th ACM International Conference on Supercomputing (ICS 2024), Kyoto, Japan, June 4-7, 2024.

``UltraPrecise: a GPU-based framework for arbitrary-precision arithmetic in database systems", Proceedings of 40th IEEE International Conference on Data Engineering (ICDE 2024), Utrecht, Netherlands, May 13-17, 2024.

``Maze: a cost-efficient video deduplication system at Web-scale", Proceedings of 2022 ACM Conference on Multimedia (ACM MM 2022), Lisboa, Portugal, October 10-14, 2022.

``NeutronStar: distributed GNN training with hybrid dependency management", Proceedings of 2022 ACM SIGMOD Conference on Management of Data (SIGMOD 2022), Philadelphia, PA, USA, June 12-17, 2022.

``The art of balance: a RateupDB experience of building a CPU/GPU hybrid database product", Proceedings of the VLDB Endowment (VLDB 2021), Copenhagen, Denmark, August 16-20, 2021.

``Mixer: efficiently understanding and retrieving visual content at web-scale", Proceedings of the VLDB Endowment (VLDB 2021), Copenhagen, Denmark, August 16-20, 2021.

``DB-Spinner: making a case for iterative processing in databases", Proceedings of 37th International Conference on Data Engineering (ICDE 2021), Chania, Crete, Greece, April 19-22, 2021.

``NestGPU: Nested query processing on GPU", Proceedings of 37th International Conference on Data Engineering (ICDE 2021), Chania, Crete, Greece, April 19-22, 2021.

``Automating incremental and asynchronous evaluation for recursive aggregate data processing", Proceedings of 2020 ACM SIGMOD Conference on Management of Data (SIGMOD 2020), Portland, OR, USA, June 14-19, 2020.

``Catfish: adaptive RDMA-enabled R-tree for low latency and high throughput", Proceedings of 39th IEEE International Conference on Distributed Computing Systems (ICDCS 2019), Dallas, Texas, July 7-9, 2019.

``HYPHA: a framework based on separation of parallelism to accelerate persistent homology matrix reduction", Proceedings of 33rd ACM International Conference on Supercomputing (ICS 2019), Phoenix, Arizona, June 26-28, 2019.

``DirectLoad: a fast web-scale index system across large regional centers", Proceedings of 35th International Conference on Data Engineering (ICDE 2019), Macau, China, April 8-11, 2019.

"SEP-Graph: finding shortest execution paths for graph processing under a hybrid framework on GPU" , Proceedings of 24th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP 2019), Washington DC, USA, February 16-20, 2019.

``A low-cost disk solution enabling LSM-tree to achieve high performance for mixed read/write workloads", ACM Transactions on Storage, Vol. 14, Issue 2, April 2018.

"Software-defined Software: a perspective of machine learning-based software production" , Proceedings of 2018 IEEE 38th International Conference on Distributed Computing, (ICDCS'18), Vienna, Austria, July 2-5, 2018, pp. 1270-1275 (a vision paper).

"SQLoop: high performance iterative processing in data management" , Proceedings of 2018 IEEE 38th International Conference on Distributed Computing, (ICDCS'18), Vienna, Austria, July 2-5, 2018, pp. 1039-1051.

``Software support inside and outside solid-state devices for high performance and high efficiency", Proceedings of the IEEE, Vol. 105, Issue 9, September 2017, pp. 1650-1665.

``Feisu: fast query execution over heterogeneous data sources on large-scale clusters", Proceedings of 33rd International Conference on Data Engineering (ICDE'17), San Diego, California, USA, April 19-22, 2017.

``Spark-GPU: an accelerated in-memory data processing engine on clusters", Proceedings of 2016 IEEE International Conference on Big Data, (IEEE BigData 2016), Washington DC, USA, December 5-8, 2016.

``BCC: reducing false aborts in optimistic concurrency control with low cost for in-memory databases", Proceedings of the VLDB Endowment, Vol. 9, No. 6, February 2016. (presented in the 42nd International Conference on Very Large Data Bases (VLDB 2016), New Delhi, India, September 5-9, 2016).

``Mega-KV: a case for GPUs to maximize the throughput of in-memory key-value stores", Proceedings of the VLDB Endowment, Vol. 8, Issue 11, July 2015. (presented in the 41st International Conference on Very Large Data Bases (VLDB 2015), Hawaii, USA, August 31 - September 4, 2015).

``Concurrent analytical query processing with GPUs", Proceedings of the VLDB Endowment, Vol. 7, Issue 11, July 2014. (presented in the 40th International Conference on Very Large Data Bases (VLDB 2014), Hangzhou, China, September 1-5, 2014).

``Understanding insights into the basic structure and essential issues of table placement methods in clusters", Proceedings of the VLDB Endowment, Vol. 6, Issue 14, September 2013, pp. 1750-1761. (presented in the 40th International Conference on Very Large Data Bases (VLDB 2014), Hangzhou, China, September 1-5, 2014).

``Major technical advancements in Apache Hive", Proceedings of 2014 ACM SIGMOD Conference on Management of Data (SIGMOD 2014), Snowbird, Utah, June 22-27, 2014.

``GDM: device memory management for GPGPU computing", Proceedings of 2014 ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems (SIGMETRICS 2014), Austin, Texas, June 16-20, 2014.

``UNIK: unsupervised social network spam detection", Proceedings of 22nd ACM International Conference on Information and Knowledge Management (CIKM 2013), San Francisco, October 27 - November 1, 2013.

``S-CAVE: effective SSD caching to improve virtual machine storage performance", Proceedings of 22nd International Conference on Parallel Architectures and Compilation Techniques (PACT 2013), Edinburgh, Scotland, September 7-11, 2013.

``Hadoop-GIS: a high performance spatial data warehousing system over MapReduce", Proceedings of the VLDB Endowment, Vol. 6, Issue 11, August 2013, pp. 1009-1020. (presented in the 39th International Conference on Very Large Data Bases (VLDB 2013), Riva del Garda, Trento, Italy, August 26-30, 2013).

The Hadoop-GIS project has initiated the development of a new spatial data analytical ecosystem

see detailed documents.

The 2024 VLDB Endowment Test of Time Award

``The Yin and Yang of processing data warehousing queries on GPU devices", Proceedings of the VLDB Endowment, Vol. 6, Issue 10, August 2013, pp. 817-828. (presented in the 39th International Conference on Very Large Data Bases (VLDB 2013), Riva del Garda, Trento, Italy, August 26-30, 2013).

``LDPC-in-SSD: making advanced error correction codes work effectively in solid state drives", Proceedings of 11th USENIX Conference on File and Storage Technologies (FAST'13), San Jose, California, February 12-15, 2013.

``Accelerating pathology image data cross-comparison on CPU-GPU hybrid systems", Proceedings of the VLDB Endowment, Vol. 5, Issue 11, July 2012, pp. 1543-1554. (presented in the 38th International Conference on Very Large Data Bases (VLDB 2012), Istanbul, Turkey, August 27-31, 2012).

The PixelBox algorithm in this paper has been adopted in The Geometric Performance Primitive Library

see detailed documents.

``hStorage-DB: heterogeneity-aware data management to exploit full capacity of hybrid storage systems", Proceedings of the VLDB Endowment, Vol. 5, Issue 10, June 2012, pp. 1076-1087. (presented in the 38th International Conference on Very Large Data Bases (VLDB 2012), Istanbul, Turkey, August 27-31, 2012).

``Spam behavior analysis and detection in user generated content on social networks", Proceedings of 32nd International Conference on Distributed Computing Systems (ICDCS 2012), Macau, China, June 18-21, 2012.

``BWS: Balanced Work Stealing for time-sharing multicores", Proceedings of ACM EuroSys Conference (EuroSys 2012), Bern, Switzerland, April 10-13, 2012.

``DOT: a matrix model for analyzing, optimizing and deploying software for big data analytics in distributed systems", Proceedings of 2nd ACM Symposium on Cloud Computing (SOCC 2011), Cascais, Portugal, October 27-28, 2011.

``YSmart: Yet another SQL-to-MapReduce Translator", Proceedings of 31st International Conference on Distributed Computing Systems (ICDCS 2011), Minneapolis, Minnesota, June 20-24, 2011. Best Paper Award.

YSmart has been merged into big data warehousing production systems

see detailed documents.

``Hystor: making the best use of solid state drives in high performance storage systems", Proceedings of 25th ACM International Conference on Supercomputing (ICS 2011), Tucson, Arizona, May 31 - June 4, 2011. Best Paper Award.

Hystor has had an impact on commercial hybrid storage products, including Apple's Fusion Drive

see detailed documents.

``SRM-Buffer: An OS Buffer Management Technique to Prevent Last Level Caches from Thrashing in multicores", Proceedings of ACM EuroSys Conference (EuroSys 2011), Salzburg, Austria, April 10-13, 2011.

``RCFile: a fast and space-efficient data placement structure in MapReduce-based warehouse systems", Proceedings of International Conference on Data Engineering (ICDE'11), Hannover, Germany, April 11-16, 2011.

RCFile has been adopted in big data warehouse production systems

see detailed documents.

``CAFTL: a content-aware flash translation layer enhancing the lifespan of flash memory based solid state drives", Proceedings of 9th USENIX Conference on File and Storage Technologies (FAST'11), San Jose, California, February 15-17, 2011.

``Essential roles of exploiting internal parallelism of flash memory based solid state drives in high-speed data processing", Proceedings of 17th International Symposium on High Performance Computer Architecture (HPCA-17), San Antonio, Texas, February 12-16, 2011.

``ULCC: a user-level facility for optimizing shared cache performance on multicores", Proceedings of 16th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP 2011), San Antonio, Texas, February 12-16, 2011.

``Splitter: a proxy-based approach for post-migration testing of Web applications", Proceedings of ACM EuroSys 2010, Paris, France, April 13-16, 2010.

``TopBT: a topology-aware and infrastructure-independent BitTorrent client", Proceedings of INFOCOM'10, San Diego, California, March 15-19, 2010.

``CUBS: coordinated upload bandwidth sharing in residential networks", Proceedings of 17th International Conference on Network Protocols (ICNP 2009), Princeton, NJ, October 13-16, 2009.

``Enabling software management for multicore caches with a lightweight hardware support", Proceedings of 22nd ACM/IEEE Annual Conference on Supercomputing (SC09), Portland, Oregon, November 14-20, 2009.

``Soft-OLP: improving hardware cache performance through software-controlled object-level partitioning", Proceedings of 18th International Conference on Parallel Architectures and Compilation Techniques (PACT 2009), Raleigh, North Carolina, September 12-16, 2009.

``MCC-DB: minimizing cache conflicts in multi-core processors for databases", Proceedings of the VLDB Endowment, Vol. 2, Issue 1, pp. 373-384, August 2009. (presented in the 35th International Conference on Very Large Data Bases (VLDB 2009), Lyon, France, August 24-28, 2009).

``Analyzing patterns of user content generation in online social networks", Proceedings of 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD-2009), Paris, France, June 28 - July 1, 2009.

``Understanding intrinsic characteristics and system implications of flash memory based solid state drives", Proceedings of 2009 ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems (SIGMETRICS/Performance 2009), Seattle, WA, June 15-19, 2009.

``BP-Wrapper: a system framework making any replacement algorithms (almost) lock contention free", Proceedings of 25th International Conference on Data Engineering (ICDE'09), Shanghai, China, March 29 - April 4, 2009.

``The stretched exponential distribution of Internet media access patterns", Proceedings of 27th ACM Symposium on Principles of Distributed Computing (PODC 2008), Toronto, Canada, August 18-21, 2008.

``Caching for Bursts (C-Burst): let hard disks sleep well and work energetically", Proceedings of the 13th ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED'08), Bangalore, India, August 11-13, 2008.

``Gaining insights into multicore cache partitioning: bridging the gap between simulation and real systems", Proceedings of the 14th International Symposium on High Performance Computer Architecture (HPCA-14), Salt Lake City, Utah, February 16-20, 2008.

The OS-based cache partitioning method in this paper has been used in the Linux kernel for production systems

see detailed documents.

``PSM-Throttling: minimizing energy consumption for bulk data communications in WLANs", Proceedings of the 15th International Conference on Network Protocols (ICNP'07), Beijing, China, October 16-19, 2007.

``SProxy: a caching infrastructure for Internet streaming", IEEE Transactions on Multimedia, Vol. 9, Issue 5, 2007, pp. 1062-1072.

``DiskSeen: exploiting disk layout and access history to enhance I/O prefetch", Proceedings of 2007 USENIX Annual Technical Conference (USENIX'07), Santa Clara, June 17-22, 2007.

``Delving into Internet streaming media delivery: a quality and resource utilization perspective", Proceedings of ACM SIGCOMM Internet Measurement Conference (IMC'06), Rio de Janeiro, Brazil, October 25-27, 2006.

``SmartSaver: turning flash drive into a disk energy saver for mobile computers", Proceedings of the 11th ACM International Symposium on Low Power Electronics and Design (ISLPED'06), Tegernsee, Germany, October 4-6, 2006.

``ASAP: an AS-Aware Peer-relay protocol for high quality VoIP", Proceedings of the 26th International Conference on Distributed Computing Systems (ICDCS'06), Lisbon, Portugal, July 4-7, 2006.

``Exploiting idle communication power to improve wireless network performance and energy efficiency", Proceedings of INFOCOM'06, Barcelona, Spain, April 23-29, 2006.
The extended version has been published in IEEE Journal on Selected Areas in Communications, Vol. 25, No. 2, 2007.

``Fast proxy delivery of multiple streaming sessions in shared running buffers", IEEE Transactions on Multimedia, Vol. 7, No. 6, December, 2005, pp 1157-1169.

``DULO: an effective buffer cache management scheme to exploit both temporal and spatial localities", Proceedings of the 4th USENIX Conference on File and Storage Technologies (FAST'05), San Francisco, CA, December 14-16, 2005.

``Measurement, analysis, and modeling of BitTorrent-like systems", Proceedings of ACM SIGCOMM Internet Measurement Conference (IMC'05), Berkeley, CA, October 19-21, 2005.
The extended version has been published in IEEE Journal on Selected Areas in Communications, Vol. 25, No. 1, 2007.

``Coordinated data prefetching for Web contents", Computer Communications, Vol. 28, Issue 17, 2005, pp. 1947-1958.

``Look-ahead architecture adaptation to reduce processor power consumption", IEEE Micro, Vol. 25, No. 4, 2005, pp. 10-19.

``Segment-based proxy caching for Internet streaming media delivery", IEEE Multimedia, Vol. 12, No. 3, July/September, 2005.

``Analysis of multimedia workloads with implications for Internet streaming", Proceedings of the 14th International World Wide Web Conference, (WWW'05), Chiba, Japan, May 10-14, 2005.

``CLOCK-Pro: an effective improvement of the CLOCK replacement", Proceedings of USENIX Annual Technical Conference (USENIX'05), Anaheim, CA, April 10-15, 2005, pp. 323-336.

CLOCK-Pro has been adopted in OS kernels and other data processing systems

see detailed documents.

``Token-ordered LRU: an effective page replacement policy and its implementation in Linux systems", Performance Evaluation, Vol. 60, Issue 1-4, 2005, pp. 5-29.

The token algorithm is a part of the Linux kernel

see detailed documents.

``Design and optimization of large size and low overhead off-chip caches", IEEE Transactions on Computers, Vol. 53, No. 7, 2004, pp. 843-855.

``ULC: A file block placement and replacement protocol to effectively exploit hierarchical locality in multi-level buffer caches", Proceedings of the 24th International Conference on Distributed Computing Systems (ICDCS'04), Tokyo, Japan, March 23-26, 2004.
The extended version has been published in IEEE Transactions on Computers, Vol. 56, No. 1, 2007.

``Designs of high quality streaming proxy systems", Proceedings of IEEE INFOCOM'04, Hong Kong, March 7-11, 2004.

``Low cost and reliable mutual anonymity protocols in peer-to-peer networks", IEEE Transactions on Parallel and Distributed Systems, Vol. 14, No. 9, 2003, pp. 829-840.

``A popularity-based prediction model for Web prefetching", IEEE Computer, Vol. 36, No. 3, March 2003, pp. 59-66.

``Access-mode predictions for low-power cache design", IEEE Micro, Vol. 22, No. 2, March/April, 2002, pp. 58-71.

``LIRS: an efficient low inter-reference recency set replacement to improve buffer cache performance", Proceedings of the 2002 ACM SIGMETRICS Conference on Measurement and Modeling of Computer Systems (SIGMETRICS'02), Marina Del Rey, California, June 15-19, 2002.
The extended version has been published in IEEE Transactions on Computers, Vol. 54, No. 8, 2005.

The LIRS algorithm has been adopted in major database and software systems

see detailed documents.

``Fine-grain priority scheduling on multi-channel memory systems", Proceedings of the 8th International Symposium on High Performance Computer Architecture, (HPCA-8), Cambridge, Massachusetts, February 2-6, 2002, pp. 107-116.

``Fast bit-reversals on uniprocessors and shared-memory multiprocessors", SIAM Journal on Scientific Computing, Vol. 22, No. 6, 2001, pp. 2113-2134.

``A permutation-based page interleaving scheme to reduce row-buffer conflicts and exploit data locality", Proceedings of the 33rd Annual International Symposium on Microarchitecture, (Micro-33), Monterey, California, December 10-13, 2000. pp. 32-41.

The permutation technique has been widely adopted in different commercial processors.

see detailed documents.

2020 ACM Microarchitecture Test of Time Award.

``Memory hierarchy considerations for cost-effective cluster computing", IEEE Transactions on Computers, Vol. 49, No. 9, 2000, pp. 915-933.

``Improving memory performance of sorting algorithms", ACM Journal on Experimental Algorithmics, Vol. 5, No. 3, 2000, pp. 1-22.

``Cacheminer: a runtime approach to exploit locality on SMP", IEEE Transactions on Parallel and Distributed Systems, Vol. 11, No. 4, 2000, pp. 357-374.

``Two fast and high-associativity cache schemes", IEEE Micro, Vol. 17, No. 5, 1997, pp. 40-49.

``Coordinating parallel processes on networks of workstations", Journal of Parallel and Distributed Computing, Vol. 46, No. 2, 1997, pp. 125-135.

``Adaptively scheduling parallel loops in distributed shared memory systems", IEEE Transactions on Parallel and Distributed Systems, Vol. 8, No. 1, 1997, pp. 70-81.

``Semi-empirical multiprocessor performance predictions", Journal of Parallel and Distributed Computing, Vol. 39, No. 1, 1996, pp. 14-28.

``Evaluating and designing software mutual exclusion algorithms on shared-memory multiprocessors", IEEE Parallel & Distributed Technology, Spring Issue, 1996, pp. 25-42.

``Multiprocessor scalability predictions through detailed program execution analysis", Proceedings of the 9th ACM International Conference on Supercomputing (ICS'95), Barcelona, Spain, July 1995. Best Paper Award.

``Triangular decomposition methods for solving reducible nonlinear systems of equations", SIAM Journal on Optimization, Vol. 4, No. 2, 1994.

``Latency metric: an experimental method for measuring and evaluating parallel program and architecture scalability", Journal of Parallel and Distributed Computing, Vol. 22, No. 3, 1994.

``Parallel methods for solving nonlinear block bordered systems of equations", SIAM Journal on Scientific and Statistical Computing, Vol. 13, No. 4, 1992.

``System effects of interprocessor communication latency in multicomputers", IEEE Micro, Vol. 11, No. 2, 1991.