Resources

NOTE: Neither the IOTTA TWG nor SNIA vouches for the accuracy or reliability of any of the traces or other information provided below. Please contact us regarding any broken or inaccurate links.


Tools and Documentation

Microsoft Event Tracing (1) (2)

Stony Brook University DataSeries Documentation Wiki

Storage Research List (Computer Storage Systems Research Discussion Forum)

Traces and Snapshots Public Archive

Re-Animator tracing and replay tool


Storage Conferences

This is a non-exhaustive list of conferences relating to storage and data management. Note that some of these websites do not have stable domains, so please contact us if a link is broken.

ATC
The USENIX Annual Technical Conference (ATC). Hosted every summer.

EuroSys
The European Conference on Computer Systems (EuroSys). Organized by the European Chapter of SIGOPS and sponsored by ACM SIGOPS. Hosted annually in mid-spring. This conference does not have a stable URL, so we have linked to a Google search.

FAST
The USENIX File and Storage Technologies (FAST) conference. Hosted annually in February.

HotStorage
The USENIX Workshop on Hot Topics in Storage and File Systems. Hosted every summer directly before ATC.

ICDCS
The IEEE International Conference on Distributed Computing Systems (ICDCS). This conference does not have a stable URL, so we have linked to a Google search.

ICS
The ACM International Conference on Supercomputing (ICS). Hosted every summer.

MSST
The International Conference on Massive Storage Systems and Technology (MSST). Hosted every summer at the Santa Clara University School of Engineering in Santa Clara, CA.

NAS
The IEEE International Conference on Networking, Architecture, and Storage. Hosted annually.

NVMSA
The IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA). Hosted annually in the late summer. This conference does not have a stable URL, so we have linked to a Google search.

OSDI
The USENIX Symposium on Operating Systems Design and Implementation (OSDI). Hosted annually.

SIGMETRICS
The conference of the ACM Special Interest Group for the computer systems performance evaluation community (SIGMETRICS). Hosted annually in June.

SIGOPS
The ACM Special Interest Group in Operating Systems. Hosts a number of conferences annually.

SoCC
The ACM Symposium on Cloud Computing (SoCC). Hosted annually. This conference does not have a stable URL, so we have linked to a Google search.

SOSP
The ACM Symposium on Operating Systems Principles (SOSP). Hosted annually.

Supercomputing
The International Conference for High Performance Computing, Networking, Storage, and Analysis. Hosted annually in late fall.

SYSTOR
The ACM International Systems and Storage Conference (SYSTOR). Hosted annually in Haifa, Israel.

VLDB
The Very Large Data Bases (VLDB) Conference. Hosted annually in late August.


Storage Research Centers

Carnegie Mellon University
Parallel Data Lab (PDL)

San Diego Supercomputer Center (SDSC)

University of Minnesota
Digital Technology Center (DTC)
Intelligent Storage Consortium (DISC)

Storage Performance Council (SPC)


Papers and Publications

Papers Relating to Traces

[Harter11] Tyler Harter, Chris Dragga, Michael Vaughn, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau.
A File is Not a File: Understanding the I/O Behavior of Apple Desktop Applications.
Department of Computer Sciences, University of Wisconsin, Madison. 2011.

[Ellard03b] Daniel Ellard, Margo Seltzer.
NFS Tricks and Benchmarking Traps.
Proceedings of the FREENIX Technical Conference, San Antonio, Texas. June, 2003.

[Ellard03a] Daniel Ellard, Jonathan Ledlie, Pia Malkani, Margo Seltzer.
Passive NFS Tracing of Email and Research Workloads.
Proceedings of the Second Annual USENIX File and Storage Technologies Conference, pp. 203-216, San Francisco, CA. March, 2003.

[Roselli00] Drew Roselli, Jacob R. Lorch, Thomas E. Anderson.
A Comparison of File System Workloads.
Proceedings of the 2000 USENIX Technical Conference, pp. 44 - 54. San Diego, CA. June, 2000.

[Vogels99] Werner Vogels.
File system usage in Windows NT 4.0.
Proceedings of the 17th Symposium on Operating System Principles, pp. 93 - 109. Kiawah Island Resort, SC. December, 1999.

[Douceur99] John R. Douceur, William J. Bolosky.
A Large-Scale Study of File-System Contents.
Proceedings of SIGMETRICS '99, pp. 59 - 70. Atlanta, GA. May, 1999.

[Kuenning97] Geoffrey H. Kuenning and Gerald J. Popek.
Automated Hoarding for Mobile Computers.
Proceedings of the 16th ACM Symposium on Operating Systems Principles, St. Malo, France, October 5-8, 1997.

[Uysal97] Mustafa Uysal, Anurag Acharya, Joel Saltz.
Requirements of I/O Systems for Parallel Machines: An Application-driven Study.
Technical Report, CS-TR-3802, University of Maryland, College Park. May 1997.

[Mummert96] L. Mummert, M. Satyanarayanan.
Long Term Distributed File Reference Tracing: Implementation and Experience.
Software - Practice and Experience, Vol. 26, No. 6, pp. 705 - 736. June, 1996.

[Blackwell95] Trevor Blackwell, Jeffrey Harris, Margo Seltzer.
Heuristic Cleaning Algorithms in Log-Structured File Systems.
Proceedings of the 1995 USENIX Technical Conference, pp. 277 - 288. New Orleans, LA. January, 1995.

[Griffioen94] Jim Griffioen, Randy Appleton.
Reducing File System Latency using a Predictive Approach.
Proceedings of the Summer 1994 USENIX Technical Conference, pp. 197 - 207. Boston, MA. June, 1994.

[Chiang93] Chi-ming Chiang, Matt W. Mutka.
Characteristics of User File Usage Patterns.
Systems and Software, Vol. 23, No. 3, pp. 257 - 268. December, 1993.

[Ruemmler93] Chris Ruemmler, John Wilkes.
UNIX Disk Access Patterns.
Proceedings of the Winter 1993 USENIX Technical Conference, pp. 405 - 420. San Diego, CA. January, 1993.

[Ramakrishnan92] K.K. Ramakrishnan, Prabuddha Biswas, Ramakrishna Karedla.
Analysis of File I/O Traces in Commercial Computing Environments.
Proceedings of SIGMETRICS '92, pp. 78 - 90. Newport, RI. June, 1992.

[Roselli98] Drew Roselli, Thomas E. Anderson.
Characteristics of File System Workloads.
University of California Berkeley Computer Science Division Technical Report UCB//CSD-98-1029. 1998.

[Shirriff92] Ken Shirriff, John K. Ousterhout.
A Trace-Driven Analysis of Name and Attribute Caching in a Distributed System.
Proceedings of the Winter 1992 USENIX Technical Conference, pp. 315 - 332. San Francisco, CA. January, 1992.

[Miller91] Ethan L. Miller, Randy H. Katz.
Input/Output Behavior of Supercomputing Applications.
Proceedings of the 1991 Conference on Supercomputing, pp. 567 - 576. Albuquerque, NM. November, 1991.

[Baker91] M. Baker, J. Hartman, M. Kupfer, K. Shirriff, and J. Ousterhout.
Measurements of a Distributed File System.
Proceedings of the 13th ACM Symposium on Operating Systems Principles, pp. 198 - 212. October 1991.

[Bozman91] G.P. Bozman, H.H. Ghannad, E.D. Weinberger.
A trace-driven study of CMS file references.
IBM Journal of Research and Development, Vol. 35, No. 5/6, pp. 815 - 828. September/November, 1991.

[Bennet91] J. Michael Bennet, Michael A. Bauer, David Kinchlea.
Characteristics of Files in NFS Environments.
Proceedings of the 1991 ACM Symposium on Small Systems, pp. 33 - 40. 1991.

[Biswas90] P. Biswas, K.K. Ramakrishnan.
File Access Characterization of VAX/VMS Environments.
Proceedings of the 10th International Conference on Distributed Computing Systems, pp. 227 - 234. Paris, France. May, 1990.

[Floyd86] Rick Floyd.
Short-Term File Reference Patterns in a UNIX Environment.
University of Rochester Computer Science Technical Report #177. March, 1986.

[Ousterhout85] J. Ousterhout, H. Costa, D. Harrison, J. Kunze, M. Kupfer, J. Thompson.
A Trace-Driven Analysis of the UNIX 4.2BSD File System.
Proceedings of the 10th Symposium on Operating System Principles, pp. 15 - 24. Orcas Island, WA. December, 1985.

[Satyanarayanan81] M. Satyanarayanan.
A Study of File Sizes and Functional Lifetimes.
Proceedings of the 8th Symposium on Operating System Principles, pp. 96 - 108. Pacific Grove, CA. December, 1981.

[Smith81] A. J. Smith.
Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms.
IEEE Transactions on Software Engineering, Vol SE-7, No. 4, pp. 403 - 417. July, 1981.

Publications That Cite iotta.snia.org

The following publications cite iotta.snia.org as a source of trace data used in their research. They are organized in reverse chronological order. This list attempts to be comprehensive but is not complete; feel free to contact us to suggest additional entries.


[Lee, 2023]
Hyun-Seob Lee. A design of hot data classifier and management scheme for an efficient resource management. Journal of Internet of Things and Convergence, 9(5):11–16, October 2023. (doi:10.20465/KIOTS.2023.9.5.011)
[Luo, 2023]
Hui-Tang Luo. Rethinking Bε tree indexing structure over NVM with the support of multi-write modes. Master's thesis, National Central University, Taiwan, 2023.
[Song and Hu, 2023]
Shaopu Song and Junhao Hu. Skystore: Unified storage across clouds. Project paper, University of California, Berkeley, 2023.
[Wu et al., 2023]
Chin-Hsien Wu, Liang-Ting Chen, Ren-Jhen Hsu, and Jian-Yu Dai. A state-aware method for flows with fairness on NVMe SSDs with load balance. IEEE Transactions on Cloud Computing, 11(3):304–305, 2023. (doi:10.1109/TCC.2023.3253864)
[Lv et al., 2023]
Yina Lv, Liang Shi, Yunpeng Song, and Chun Jason Xue. Access characteristic guided partition for NAND flash-based high-density SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 42(12):4643–4656, December 2023. (doi:10.1109/TCAD.2023.3282175)
[Pang et al., 2023]
Shujie Pang, Yuhui Deng, Genxiong Zhang, Yi Zhou, Xiao Qin, Zhaorui Wu, and Jie Li. PcGC: A parity-check garbage collection for boosting 3-D NAND flash performance. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 42(12):4364–4377, December 2023. (doi:10.1109/TCAD.2023.3281517)
[Wang et al., 2023]
Tse-Yuan Wang, Che-Wei Tsao, Yuan-Hao Chang, and Tei-Wei Kuo. Retention-aware read acceleration strategy for LDPC-based NAND flash memory. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 42(12):4597–4605, December 2023. (doi:10.1109/TCAD.2023.3289328)
[Gu et al., 2023]
Yibin Gu, Hua Wang, Man Luo, Tang Jingyu, and Ke Zhou. Offline and online algorithms for cache allocation with Monte Carlo tree search and a learned model. In Proceedings of the 41st IEEE International Conference on Computer Design, pages 126–133, Washington, DC, November 2023. IEEE. (doi:10.1109/ICCD58817.2023.00028)
[Wang et al., 2023]
Wei-Chen Wang, Chien-Chung Ho, Yung-Chun Li, Liang-Chi Chen, and Yu-Ming Chang. Reaping both latency and reliability benefits with elaborate sanitization design for 3D TLC NAND flash. IEEE Transactions on Computers, 72(11):3029–3041, November 2023. (doi:10.1109/TC.2023.3272280)
[Yu et al., 2023]
Xiaolei Yu, Jing He, Bo Zhang, Xianliang Wang, Qianhui Li, Qi Wang, Zongliang Huo, and Tianchun Ye. Interleaved LDPC decoding scheme improves 3-D TLC NAND flash memory system performance. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 42(11):4191–4204, November 2023. (doi:10.1109/TCAD.2023.3266363)
[Ait-Oucheggou et al., 2023]
Lydia Ait-Oucheggou, Stéphane Rubini, Abdella Battou, and Jalil Boukhobza. Investigating multi-tier and QoS-aware caching based on ARC. In Proceedings of the 31st IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, pages 1–4, Stony Brook, NY, October 2023. IEEE. (doi:10.1109/MASCOTS59514.2023.10387601)
[Bor et al., 2023]
Julianna Bor, Giuliano Casale, William Knottenbelt, Evgenia Smirni, and Andreas Stathopoulos. Fitting with matrix exponential mixtures generated by discrete probabilistic scaling. ACM Performance Evaluation Review, 51(2):15–17, October 2023. (doi:10.1145/3626570.3626577)
[Du et al., 2023]
Yajuan Du, Yuan Gao, Siyi Huang, and Qiao Li. LDPC level prediction toward read performance of high-density flash memories. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 42(10):3264–3274, October 2023. (doi:10.1109/TCAD.2023.3238845)
[Liu et al., 2023]
Yachun Liu, Dan Feng, Jianxi Chen, and Chao Guo. D-IOCost: Dynamic cost-aware fair queueing for better I/O proportionality and performance. In Weizhi Meng, Rongxing Lu, Geyong Min, and Jaideep Vaidya, editors, Proceedings of the 22nd ACM International Conference on Algorithms and Architectures for Parallel Processing, pages 373–391, Tianjin, China, October 2023. Springer Nature Switzerland.
[Yang et al., 2023a]
Juncheng Yang, Yazhuo Zhang, Ziyue Qiu, Yao Yue, and Rashmi Vinayak. FIFO queues are all you need for cache eviction. In Proceedings of the 29th ACM Symposium on Operating Systems Principles, pages 130–149, Koblenz, Germany, October 2023. ACM. (doi:10.1145/3600006.3613147)
[Yang et al., 2023b]
Junyao Yang, Yuchen Wang, and Zhenlin Wang. An empirical analysis on memcached's replacement policies. In Proceedings of the 9th International Symposium on Memory Systems, pages 1–10, Washington, DC, October 2023. ACM. (doi:10.1145/3631882.3631883)
[Zou et al., 2023]
Qiang Zou, Yifeng Zhu, Jianxi Chen, Yuhui Deng, and Xiao Qin. Characterization of I/O behaviors in cloud storage workloads. IEEE Transactions on Computers, 72(10):2726–2739, October 2023. (doi:10.1109/TC.2023.3263726)
[Chang et al., 2023]
Jung-Hsiu Chang, Tzu-Yu Chang, Yi-Chao Shih, and Tseng-Yi Chen. LaDy: Enabling locality-aware deduplication technology on shingled magnetic recording drives. ACM Transactions on Embedded Computing Systems, 22(5s):1–25, September 2023. (doi:10.1145/3607921)
[Lien et al., 2023]
Yi-Han Lien, Yen-Ting Chen, Yuan-Hao Chang, Yu-Pei Liang, and Wei-Kuan Shih. FSIMR: File-system-aware data management for interlaced magnetic recording. ACM Transactions on Embedded Computing Systems, 22(5s):1–18, September 2023. (doi:10.1145/3607922)
[Son and Kim, 2023]
Ikjoon Son and Jin-Soo Kim. Efficient read disturb management schemes in resource-constrained flash memory controller. In Proceedings of the 12th IEEE Non-Volatile Memory Systems and Applications Symposium, Niigata, Japan, September 2023. IEEE. (doi:10.1109/NVMSA58981.2023.00022)
[Hou and Chang, 2023]
Jia-Xin Hou and Li-Pin Chang. Improving read performance for LDPC-based SSDs with adaptive bit labeling on Vth states. In Proceedings of the 29th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA), pages 77–84, Niigata, Japan, August 2023. IEEE. (doi:10.1109/RTCSA58653.2023.00018)
[Wu and Wu, 2023]
Ming-Yan Wu and Chin-Hsien Wu. A multi-stream-aware DRAM allocation strategy inside solid-state drives (SSDs). In Proceedings of the 2023 International Conference on Research in Adaptive and Convergent Systems (RACS), pages 1–6, Gdansk, Poland, August 2023. ACM. (doi:10.1145/3599957.3606209)
[Zhang et al., 2023]
Chi Zhang, Song Liu, Fangxing Yu, Menghan Li, Wei Tang, Fei Liu, and Weiguo Wu. Balloon: An elastic data management strategy for interlaced magnetic recording. Applied Sciences, 13(17), August 2023. (doi:10.3390/app13179767)
[Chen et al., 2023]
Kuan-Yu Chen, Chin-Hsien Wu, and Cheng-Tze Lee. Short-term and long-term idle time detectors for reducing long-tail latency in solid-state drives. In Proceedings of the 6th IEEE International Symposium on Computer, Consumer and Control, pages 143–146, Taichung, Taiwan, July 2023. IEEE. (doi:10.1109/IS3C57901.2023.00046)
[Lange et al., 2023]
Tomer Lange, Joseph (Seffi) Naor, and Gala Yadgar. Offline and online algorithms for SSD management. Communications of the ACM, 66(7):129–137, July 2023. (doi:10.1145/3596205)
[Wang et al., 2023]
Tuo Wang, Peixuan Li, Ping Xie, and Xiaofei Wang. SSD multi-level parallel garbage collection. In Proceedings of the 4th IEEE International Symposium on Parallel and Distributed Processing, pages 301–306, Guangzhou, China, July 2023. IEEE. (doi:10.1109/ISPDS58840.2023.10235643)
[Wei et al., 2023]
Qian Wei, Yi Li, Zhiping Jia, Mengying Zhao, Zhaoyan Shen, and Bingzhe Li. Reinforcement learning-assisted management for convertible SSDs. In Proceedings of the 60th ACM/EDAC/IEEE Design Automation Conference, pages 1–6, San Francisco, CA, July 2023. ACM. (doi:10.1109/DAC56929.2023.10247929)
[Zhou et al., 2023]
Yang Zhou, Fang Wang, Zhan Shi, Dan Feng, and Yu Du. Fair will go on: A collaboration-aware fairness scheme for NVMe SSD in cloud storage system. In Proceedings of the 60th ACM/IEEE Design Automation Conference, pages 1–6, San Francisco, CA, July 2023. IEEE. (doi:10.1109/DAC56929.2023.10247718)
[Lin and Chen, 2023]
Ting-Yu Lin and Tseng-Yi Chen. HSMR-RAID: Enabling a low overhead RAID-5 system over a host-managed shingled magnetic recording disk array. In Proceedings of the 38th ACM Symposium on Applied Computing, pages 294–296. ACM, June 2023. (doi:10.1145/3555776.3577820)
[Wu, 2023]
Kun Wu. Optimizing consensus protocols with machine learning models: A cache-based approach. Master's thesis, KTH Royal Institute of Technology, School of Electrical Engineering and Computer Science (EECS), June 2023.
[Yang et al., 2023a]
Juncheng Yang, Ziyue Qiu, Yazhuo Zhang, Yao Yue, and K. V. Rashmi. FIFO can be better than LRU: the power of lazy promotion and quick demotion. In Proceedings of the 19th Workshop on Hot Topics in Operating Systems, pages 70–79. IEEE, June 2023. (doi:10.1145/3593856.3595887)
[Yang et al., 2023b]
Yiyuan Yang, Rongshang Li, Qiquan Shi, Xijun Li, Gang Hu, Xing Li, and Mingxuan Yuan. SGDP: A stream-graph neural network based data prefetcher. In Proceedings of the International Joint Conference on Neural Networks (IJCNN), pages 1–8, Gold Coast, Australia, June 2023. IEEE. (doi:10.1109/IJCNN54540.2023.10191927)
[Du et al., 2023]
Yajuan Du, Siyi Huang, Yao Zhou, and Qiao Li. Towards LDPC read performance of 3D flash memories with layer-induced error characteristics. ACM Transactions on Design Automation of Electronic Systems, 28(3):1–25, May 2023. (doi:10.1145/3585075)
[Jia et al., 2023]
Danlin Jia, Yiming Xie, Li Wang, Xiaoqian Zhang, Allen Yang, Xuebin Yao, Mahsa Bayati, Pradeep Subedi, Bo Sheng, and Ningfang Mi. SRC: Mitigate I/O throughput degradation in network congestion control of disaggregated storage systems. In Proceedings of the 37th IEEE International Symposium on Parallel & Distributed Processing, pages 268–278, St. Petersburg, FL, May 2023. IEEE. (doi:10.1109/IPDPS54959.2023.00035)
[Pang et al., 2023]
Lu Pang, Anis Alazzawe, Madhurima Ray, Krishna Kant, and Jeremy Swift. Adaptive intelligent tiering for modern storage systems. Performance Evaluation, 160, May 2023. (doi:10.1016/j.peva.2023.102332)
[Qiu et al., 2023]
Ziyue Qiu, Juncheng Yang, Juncheng Zhang, Cheng Li, Xiaosong Ma, Qi Chen, Mao Yang, and Yinlong Xu. FrozenHot cache: Rethinking cache management for modern hardware. In Proceedings of the 18th ACM European Conference on Computer Systems, pages 557–573, Rome, Italy, May 2023. ACM. (doi:10.1145/3552326.3587446)
[Han et al., 2023]
Sangwoo Han, Minjung Cho, Gi Lee, and Eui-Young Chung. Page type-aware data migration technique for read disturb management of NAND flash memory. IEEE Transactions on Very Large Scale Integration Systems, 31(4):591–595, April 2023. (doi:10.1109/TVLSI.2023.3240172)
[Li et al., 2023]
Tian Li, Zhiming Ding, Jian Miao, Xinjie Lv, Xueyu Gao, Fulin Wang, and Xiangbin Wan. LOACR: A cache replacement method based on loop assist. In Proceedings of the 4th International Conference on Spatial Data and Intelligence (SpatialDI), pages 239–255, Nanchang, China, April 2023. Springer-Verlag. (doi:10.1007/978-3-031-32910-4_17)
