Resources

NOTE: Neither the IOTTA TWG nor SNIA vouches for the accuracy or reliability of any of the traces or other information provided below. Please contact us regarding any broken or inaccurate links.

Tools and Documentation

Microsoft Event Tracing (1) (2)

Storage Research List (Computer Storage Systems Research Discussion Forum)

Traces and Snapshots Public Archive

Re-Animator tracing and replay tool


Storage Conferences

This is a non-exhaustive list of conferences relating to storage and data management. Note that some of these websites do not have stable domains, so please contact us if a link is broken.

EuroSys
The European Conference on Computer Systems (EuroSys). Organized by EuroSys, the European Chapter of SIGOPS, and sponsored by ACM SIGOPS. Hosted annually in mid-spring. This conference does not have a stable URL, so we have linked to a Google search.

FAST
The USENIX File and Storage Technologies (FAST) conference. Hosted annually in February.

HotStorage
The USENIX Workshop on Hot Topics in Storage and File Systems. Hosted every summer.

ICDCS
The IEEE International Conference on Distributed Computing Systems (ICDCS). This conference does not have a stable URL, so we have linked to a Google search.

ICS
The ACM International Conference on Supercomputing (ICS). Hosted every summer.

MSST
The International Conference on Massive Storage Systems and Technology (MSST). Hosted every summer at the Santa Clara University School of Engineering in Santa Clara, CA.

NAS
The IEEE International Conference on Networking, Architecture, and Storage. Hosted annually.

NVMSA
The IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA). Hosted annually in the late summer. This conference does not have a stable URL, so we have linked to a Google search.

OSDI
The USENIX Symposium on Operating Systems Design and Implementation (OSDI). Hosted annually.

SIGMETRICS
The ACM Special Interest Group for the computer systems performance evaluation community. Its conference is hosted annually in June.

SIGOPS
The ACM Special Interest Group in Operating Systems. Hosts a number of conferences annually.

SoCC
The ACM Symposium on Cloud Computing (SoCC). Hosted annually. This conference does not have a stable URL, so we have linked to a Google search.

SOSP
The ACM Symposium on Operating Systems Principles (SOSP). Hosted annually.

Supercomputing
The International Conference for High Performance Computing, Networking, Storage, and Analysis. Hosted annually in late fall.

SYSTOR
The ACM International Systems and Storage Conference (SYSTOR). Hosted annually in Haifa, Israel.

VLDB
The Very Large Data Bases (VLDB) Conference. Hosted annually in late August.


Storage Research Centers

Carnegie Mellon University
Parallel Data Lab (PDL)

San Diego Supercomputer Center (SDSC)

University of Minnesota
Center for Research in Intelligent Storage

Storage Performance Council (SPC)


Papers and Publications

Papers Relating to Traces

[Harter11] Tyler Harter, Chris Dragga, Michael Vaughn, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau.
A File is Not a File: Understanding the I/O Behavior of Apple Desktop Applications.
Department of Computer Sciences, University of Wisconsin, Madison. 2011.

[Ellard03b] Daniel Ellard, Margo Seltzer.
NFS Tricks and Benchmarking Traps.
Proceedings of the FREENIX Technical Conference, San Antonio, Texas. June, 2003.

[Ellard03a] Daniel Ellard, Jonathan Ledlie, Pia Malkani, Margo Seltzer.
Passive NFS Tracing of Email and Research Workloads.
Proceedings of the Second Annual USENIX File and Storage Technologies Conference, pp. 203-216, San Francisco, CA. March, 2003.

[Roselli00] Drew Roselli, Jacob R. Lorch, Thomas E. Anderson.
A Comparison of File System Workloads.
Proceedings of the 2000 USENIX Technical Conference, pp. 44 - 54. San Diego, CA. June, 2000.

[Vogels99] Werner Vogels.
File system usage in Windows NT 4.0.
Proceedings of the 17th Symposium on Operating System Principles, pp. 93 - 109. Kiawah Island Resort, SC. December, 1999.

[Douceur99] John R. Douceur, William J. Bolosky.
A Large-Scale Study of File-System Contents.
Proceedings of SIGMETRICS '99, pp. 59 - 70. Atlanta, GA. May, 1999.

[Kuenning97] Geoffrey H. Kuenning and Gerald J. Popek.
Automated Hoarding for Mobile Computers.
Proceedings of the 16th ACM Symposium on Operating Systems Principles, St. Malo, France, October 5-8, 1997.

[Uysal97] Mustafa Uysal, Anurag Acharya, Joel Saltz.
Requirements of I/O Systems for Parallel Machines: An Application-driven Study.
Technical Report, CS-TR-3802, University of Maryland, College Park. May 1997.

[Mummert96] L. Mummert, M. Satyanarayanan.
Long Term Distributed File Reference Tracing: Implementation and Experience.
Software - Practice and Experience, Vol. 26, No. 6, pp. 705 - 736. June, 1996.

[Blackwell95] Trevor Blackwell, Jeffrey Harris, Margo Seltzer.
Heuristic Cleaning Algorithms in Log-Structured File Systems.
Proceedings of the 1995 USENIX Technical Conference, pp. 277 - 288. New Orleans, LA. January, 1995.

[Griffioen94] Jim Griffioen, Randy Appleton.
Reducing File System Latency using a Predictive Approach.
Proceedings of the Summer 1994 USENIX Technical Conference, pp. 197 - 207. Boston, MA. June, 1994.

[Chiang93] Chi-ming Chiang, Matt W. Mutka.
Characteristics of User File Usage Patterns.
Journal of Systems and Software, Vol. 23, No. 3, pp. 257 - 268. December, 1993.

[Ruemmler93] Chris Ruemmler, John Wilkes.
UNIX Disk Access Patterns.
Proceedings of the Winter 1993 USENIX Technical Conference, pp. 405 - 420. San Diego, CA. January, 1993.

[Ramakrishnan92] K.K. Ramakrishnan, Prabuddha Biswas, Ramakrishna Karedla.
Analysis of File I/O Traces in Commercial Computing Environments.
Proceedings of SIGMETRICS '92, pp. 78 - 90. Newport, RI. June, 1992.

[Roselli98] Drew Roselli, Thomas E. Anderson.
Characteristics of File System Workloads.
University of California Berkeley Computer Science Division Technical Report UCB//CSD-98-1029. 1998.

[Shirriff92] Ken Shirriff, John K. Ousterhout.
A Trace-Driven Analysis of Name and Attribute Caching in a Distributed System.
Proceedings of the Winter 1992 USENIX Technical Conference, pp. 315 - 332. San Francisco, CA. January, 1992.

[Miller91] Ethan L. Miller, Randy H. Katz.
Input/Output Behavior of Supercomputing Applications.
Proceedings of the 1991 Conference on Supercomputing, pp. 567 - 576. Albuquerque, NM. November, 1991.

[Baker91] M. Baker, J. Hartman, M. Kupfer, K. Shirriff, and J. Ousterhout.
Measurements of a Distributed File System.
Proceedings of the 13th ACM Symposium on Operating Systems Principles, pp. 198 - 212. October 1991.

[Bozman91] G.P. Bozman, H.H. Ghannad, E.D. Weinberger.
A trace-driven study of CMS file references.
IBM Journal of Research and Development, Vol. 35, No. 5/6, pp. 815 - 828. September/November, 1991.

[Bennet91] J. Michael Bennet, Michael A. Bauer, David Kinchlea.
Characteristics of Files in NFS Environments.
Proceedings of the 1991 ACM Symposium on Small Systems, pp. 33 - 40. 1991.

[Biswas90] P. Biswas, K.K. Ramakrishnan.
File Access Characterization of VAX/VMS Environments.
Proceedings of the 10th International Conference on Distributed Computing Systems, pp. 227 - 234. Paris, France. May, 1990.

[Floyd86] Rick Floyd.
Short-Term File Reference Patterns in a UNIX Environment.
University of Rochester Computer Science Technical Report #177. March, 1986.

[Ousterhout85] J. Ousterhout, H. Costa, D. Harrison, J. Kunze, M. Kupfer, J. Thompson.
A Trace-Driven Analysis of the UNIX 4.2BSD File System.
Proceedings of the 10th Symposium on Operating System Principles, pp. 15 - 24. Orcas Island, WA. December, 1985.

[Satyanarayanan81] M. Satyanarayanan.
A Study of File Sizes and Functional Lifetimes.
Proceedings of the 8th Symposium on Operating System Principles, pp. 96 - 108. Pacific Grove, CA. December, 1981.

[Smith81] A. J. Smith.
Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms.
IEEE Transactions on Software Engineering, Vol SE-7, No. 4, pp. 403 - 417. July, 1981.

Publications That Cite iotta.snia.org

The following publications cite iotta.snia.org as a source of trace data used in their research. They are organized in reverse chronological order. This list attempts to be comprehensive but is not complete; feel free to contact us to suggest additional entries.


[Wang et al., 2026]
Guohui Wang, Zhengang Chen, Zhiping Shi, Yong Guan, and Tianyu Wang. Region-based collaborative caching with joint latency and lifetime optimization for hybrid SMR-flash storage. IEEE Transactions on Computers, 75(1):219–232, January 2026.
[Antunes et al., 2025]
Mário Antunes, Tyler Estro, Pranav Bhandari, Anshul Gandhi, Geoff Kuenning, Yifei Liu, Carl Waldspurger, Avani Wildani, and Erez Zadok. Kneeliverse: A universal knee-detection library for performance curves. SoftwareX, 30:102161, 2025. (doi:https://doi.org/10.1016/j.softx.2025.102161)
[Cheng et al., 2025]
Wen Cheng, Shengxia Tu, Yi Liu, Lingfang Zeng, Yang Wang, and André Brinkmann. FlashFox: a secret-sharing approach to securing data deletion for flash-based SSD. The Computer Journal, 68:734–h48, 2025.
[Jacob, 2025]
Ashwin Jacob. RIOCZ: Kernel-level ransomware detection and zone quarantine for ZNS SSDs. Master's thesis, Iowa State University, 2025.
[Lee, 2025a]
Hyun-Seob Lee. Analysis of the impact of buffer management policies on performance in batch processing write buffers based on block mapping. Journal of Internet of Things and Convergence, 11(5):91–96, 2025.
[Lee, 2025b]
Hyun-Seob Lee. Data mining-based data access address analysis for data management efficiency. Journal of Internet of Things and Convergence, 11(2):105–110, 2025.
[Lee, 2025c]
Hyun-Seob Lee. Data mining based data pattern analysis and memory pinning scheme to improve response time on small mobile devices. Journal of Internet of Things and Convergence, 11(2):151–156, 2025.
[Lee, 2025d]
Hyun-Seob Lee. A design of enhanced block mapping method based on batch processing. Journal of Internet of Things and Convergence, 11(4):71–76, 2025.
[Berend et al., 2025]
Daniel Berend, Shlomi Dolev, Sweta Kumari, Dhruv Mishra, Marina Kogan-Sadetsky, and Archit Somani. DynamicAdaptiveClimb: Adaptive cache replacement with dynamic resizing. ArXiv, arXiv:2511.21235, November 2025.
[Godavari et al., 2025]
Amdewar Godavari, Chapram Sudhakar, and T. Ramesh. File aware distributed deduplication system in cloud environment. Journal of Supercomputing, 81(16), November 2025.
[Han et al., 2025]
Daegyu Han, Jaeyoon Nam, Hokeun Cha, Changdae Kim, Kwangwon Koh, Taehoon Kim, Sang-Hoon Kim, and Beomseok Nam. Disaggregated memory for file-backed pages. ACM Transactions on Storage, November 2025.
[Sajal et al., 2025]
Sultan Mahmud Sajal, Md Salman Estyak, Rubaba Hasan, Timothy Zhu, Bhuvan Urgaonkar, and Siddhartha Sen. TraceScaler: A framework for scaling load in real-world traces for system evaluation. ACM Transactions on Computer Systems, 43(4):1–31, November 2025.
[Su et al., 2025]
Liangkuan Su, Mingwei Lin, Bo Mao, Jianpeng Zhang, and Zeshui Xu. HaParallel: Hit ratio-aware parallel aggressive eviction cache management algorithm for SSDs. ACM Transactions on Storage, November 2025.
[Zhou and Feng, 2025]
Hai Zhou and Dan Feng. Make updates faster: A fast multi-stripe updates framework in erasure-coded storage clusters. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pages 2251–2265, New York, NY, USA, November 2025. ACM. (doi:10.1145/3712285.3759806)
[He et al., 2025]
Jingkai He, Yunpeng Dong, Dong Du, Mo Zou, Zhitai Yu, Yuxin Ren, Ning Jia, Yubin Xia, and Haibo Chen. How to copy memory? Coordinated asynchronous copy as a first-class OS service. In Proceedings of the 31st ACM Symposium on Operating Systems Principles, pages 1062–1081, Seoul, Republic of Korea, October 2025.
[Li et al., 2025]
Sizhe Li, Nicolas Christianson, and Tongxin Li. Prediction-specific design of learning-augmented algorithms. ArXiv, arXiv:2510.14887, October 2025.
[Ma et al., 2025]
Ruiying Ma, Chieh-Jan Mike Liang, Yanjie Gao, and Francis Y. Yan. Algorithm generation via creative ideation. ArXiv, arXiv:2510.03851, October 2025.
[Wu et al., 2025]
Jiaojiao Wu, Fan Yang, Zhibing Sha, Li Cai, Zhigang Cai, Balazs Gerofi, Yuanquan Shi, and Jianwei Liao. Cache partition management for improving fairness and I/O responsiveness in NVMe SSDs. IEEE Transactions on Parallel and Distributed Systems, October 2025.
[Xu et al., 2025]
Jingwei Xu, Junbin Kang, Mingkai Dong, Mingyu Liu, Lu Zhang, Shaohong Guo, Ziyan Qiu, Mingzhen You, Ziyi Tian, Anqi Yu, Tianhong Ding, Xinwei Hu, and Haibo Chen. FalconFS: Distributed file system for large-scale deep learning pipeline. ArXiv, arXiv:2507.10367, October 2025.
[Yao et al., 2025]
Zilu Yao, Yinjin Fu, and Nong Xiao. RABBIT: Managing hierarchical memory with intelligent tiering aware deduplication. In Proceedings of the 16th International Conference on Internetware, pages 220–232. ACM, October 2025.
[Zhang et al., 2025]
Cheng Zhang, Peixuan Li, Jinkang Lu, Zhu Yuan, and Ping Xie. A write leveling scheme for synchronizing garbage collection in RAID-enabled SSDs. Cluster Computing, 28(16), October 2025.
[Zhou et al., 2025]
Yanbo Zhou, Erci Xu, Anisa Su, Jim Harris, Adam Manzanares, and Steven Swanson. Sleeping with one eye open: Fast, sustainable storage with sandman. In Proceedings of the 31st ACM Symposium on Operating Systems Principles, pages 496–511, Seoul, Republic of Korea, October 2025.
[Bringoltz et al., 2025]
Barak Bringoltz, Elisha Halperin, Ran Feraru, Evgeny Blaichman, and Amit Berman. CLEAR: Command level annotated dataset for ransomware detection. In Proceedings of the 39th Conference on Neural Information Processing Systems (NeurIPS), September 2025.
[Kuba and Yadgar, 2025]
Shalev Kuba and Gala Yadgar. Let it slide: Online deduplicated data migration. In Proceedings of the 18th ACM International Systems and Storage Conference (SYSTOR), pages 136–150, September 2025.
[Liu et al., 2025]
Yachun Liu, Dan Feng, Jianxi Chen, Jing Hu, Zhouxuan Peng, and Jinlei Hu. ZNSFQ: An efficient and high-performance fair queue scheduling scheme for ZNS SSDs. ACM Transactions on Architecture and Code Optimization, September 2025.
[Shen et al., 2025]
Mingyao Shen, Suyash Mahar, Heewoo Kim, Joseph Izraelevitz, and Steven Swanson. AutoSSD: CXL-enhanced autonomous SSDs for low tail latency. In Proceedings of the 34th International Symposium on High Performance Parallel and Distributed Computing, September 2025.
[Wang et al., 2025]
Lin Wang, Yuchong Hu, Shilong Mao, Mingqi Li, Ziling Duan, Yue Huang, Leihua Qin, Dan Feng, Zehui Chen, and Ruliang Dong. Revisiting fragmentation for deduplication in clustered primary storage systems. In IEEE International Conference on Cluster Computing (CLUSTER), pages 1–12. IEEE, September 2025.
[Wei et al., 2025]
Zheng Wei, Jing Xing, Yida Gu, Wenjing Huang, Dong Dai, Guangming Tan, and Dingwen Tao. TSUE: A two-stage data update method for an erasure coded cluster file system. In Proceedings of the 34th International Symposium on High Performance Parallel and Distributed Computing. IEEE, September 2025. (doi:https://doi.org/10.1145/3731545.3731577)
[Yi et al., 2025]
Shushu Yi, Yuda An, Li Peng, Xiurui Pan, Qiao Li, Jieming Yin, Guangyan Zhang, Wenfei Wu, Diyu Zhou, Zhenlin Wang, Xiaolin Wang, Yingwei Luo, Ke Zhou, and Jie Zhang. XBOF: A cost-efficient CXL JBOF with inter-SSD compute resource sharing. ArXiv, arXiv:2509.10251, September 2025.
[Lucas Filho et al., 2025]
Edson Ramiro Lucas Filho, Andreas Efstathiou, Lun Yang, Kebo Fu, Jianqiang Shen, and Herodotos Herodotou. DITIS: An end-to-end system-level simulator and optimizer for distributed tiered storage. SN Computer Science, 6(6):1–28, August 2025.
[Wang et al., 2025]
Li Wang, Shi Qiu, Jianqin Yan, Zhirong Shen, Qingbo Wu, Xin Yao, Meiling Wang, Renhai Chen, and Yiming Zhang. A tale of two paths: Optimizing paravirtualized storage I/O with eBPF. ACM Transactions on Storage, August 2025.
[Zhang, 2025]
Xinxin Zhang. Feedback-based caching replacement method using a multi-strategy combination. In Proceedings of the 5th International Conference on Computer Science and Blockchain (CCSB), pages 349–355, Shenzhen, China, August 2025. IEEE.
[Zhang et al., 2025]
Bo Zhang, Helei Cui, Xingliang Yuan, Zhiwen Yu, and Bin Gyou. USENIX Security ’25 artifact appendix: V-ORAM: A versatile and adaptive ORAM framework with service transformation for dynamic workloads. In Artifact Appendices to the Proceedings of the 34th USENIX Security Symposium, pages 689–691, August 2025.
[Gefen et al., 2025]
Itamar Gefen, Aviad Zuck, Daniel Bransky, Moshik Hershcovitch, Danny Harnik, and Dan Tsafrir. Why paying for storage beats free networking in cloud bursting. In Proceedings of the 17th ACM Workshop on Hot Topics in Storage and File Systems, pages 31–37. ACM, July 2025.
[Haas et al., 2025]
Gabriel Haas, Bohyun Lee, Philippe Bonnet, and Viktor Leis. SSD-iq: Uncovering the hidden side of SSD performance. Proceedings of the VLDB Endowment, 18(11):4295–4308, July 2025.
[Ke et al., 2025]
Zhaokang Ke, Jim Diehl, Ya-Shu Chen, and David HC Du. Emerald tiers: Focusing on SSD+ MAID through a green lens. In Proceedings of the 17th ACM Workshop on Hot Topics in Storage and File Systems, pages 61–68. ACM, July 2025.
[Kishani et al., 2025]
Mostafa Kishani, Sina Ahmadi, Saba Ahmadian, Reza Salkhordeh, Zdenek Becvar, Onur Mutlu, André Brinkmann, and Hossein Asadi. ELICA: Efficient and load balanced I/O cache architecture for hyperconverged infrastructures. IEEE Transactions on Parallel and Distributed Systems, July 2025.
[Wang et al., 2025]
Wenjie Wang, Bo Peng, Jianguo Yao, and Haibing Guan. ReHSS: Optimizing latency for cloud hybrid storage systems using in-network placement. In Proceedings of the 33rd IEEE/ACM International Symposium on Quality of Service (IWQoS), pages 1–6, July 2025.
[Lange et al., 2025]
Tomer Lange, Joseph (Seffi) Naor, and Gala Yadgar. Optimal SSD management with predictions. Proceedings of the ACM on Measurement and Analysis of Computer Systems, 9(2), June 2025.
[Peng et al., 2025]
Li Peng, Wenbo Wu, Shushu Yi, Xianzhang Chen, Chenxi Wang, Shengwen Liang, Zhe Wang, Nong Xiao, Qiao Li, Mingzhe Zhang, and Jie Zhang. XHarvest: Rethinking high-performance and cost-efficient SSD architecture with CXL-driven harvesting. In Proceedings of the 52nd International Symposium on Computer Architecture, pages 434–449, June 2025.
[Mahni et al., 2025]
Hocine Mahni, Stéphane Rubini, Sébastien Gougeaud, Philippe Deniel, and Jalil Boukhobza. Multicriteria file-level placement policy for HPC storage. In Proceedings of the 40th ACM Symposium on Applied Computing, pages 1399–1406, May 2025. (doi:https://doi.org/10.1145/3672608.3707969)
[Nie et al., 2025]
Shiqiang Nie, Jie Niu, Chaoyun Yang, Peng Zhang, Qiong Yang, Dong Wang, and Weiguo Wu. Exploiting data duplication to reduce data migration in garbage collection inside SSD. Electronics, 14(9), May 2025.
[Wei et al., 2025]
Bing Wei, Yubin Li, Yi Wu, Ming Zhong, and Ning Luo. Dynamic metadata prefetching and data placement algorithms for high-performance wide-area applications. Computers, Materials & Continua, 84(3), May 2025.
[Zhang et al., 2025]
Jie Zhang, Shushu Yi, Xiurui Pan, Yiming Xu, Qiao Li, Qiang Li, Chenxi Wang, Bo Mao, and Myoungsoo Jung. Enhancing the performance of next-generation SSD arrays: A holistic approach. ACM Transactions on Storage, May 2025.
[Cohen, 2025]
Itamar Cohen. Bandwidth efficient cache selection and cache-content advertisement. IEEE Transactions on Network and Service Management, April 2025.
[Lucas Filho et al., 2025]
Edson Ramiro Lucas Filho, George Savva, Lun Yang, Kebo Fu, Jianqiang Shen, and Herodotos Herodotou. Employing streaming machine learning for modeling workload patterns in multi-tiered data storage systems. Future Internet, 17(4), April 2025.
[Talluri et al., 2025]
Sacheendra Talluri, Dante Niewenhuis, Xiaoyu Chu, Jakob Kyselica, Mehmet Cetin, Alexander Balgavy, and Alexandru Iosup. Cloud uptime archive: Open-access availability data of web, cloud, and gaming services. ArXiv, arXiv:2504.09476, April 2025.
[Cai et al., 2025]
Yuhui Cai, Shiyao Lin, Zhirong Shen, Jiahui Yang, and Jiwu Shu. ChameleonEC: Exploiting tunability of erasure coding for low-interference repair. In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA), pages 15–28, March 2025.
[Jiang et al., 2025]
Tianyang Jiang, Guangyan Zhang, Xiaojian Liao, and Yuqi Zhou. Zebra: Efficient redundant array of zoned namespace SSDs enabled by zone random write area (ZRWA). In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA), pages 594–607. IEEE, March 2025.
[Kurniawan et al., 2025]
Daniar H. Kurniawan, Rani Ayu Putri, Peiran Qin, Kahfi S. Zulkifli, Ray AO Sinurat, Janki Bhimani, Sandeep Madireddy, Achmad Imam Kistijantoro, and Haryadi S. Gunawi. Heimdall: Optimizing storage I/O admission with extensive machine learning pipeline. In Proceedings of the 20th ACM European Conference on Computer Systems, pages 1109–1125. ACM, March 2025.
