Resources

NOTE: Neither the IOTTA TWG nor SNIA vouch for the accuracy or reliability of any of the traces or other information provided below. Please contact us regarding any broken or inaccurate links.

Jump To:


Tools and Documentation

Microsoft Event Tracing (1) (2)

Stonybrook University Dataseries Documentation Wiki

Storage Research List (Computer Storage Systems Research Discussion Forum)

Traces and Snapshots Public Archive

Re-Animator tracing and replay tool


Storage Conferences

This is a non-exhaustive list of conferences relating to storage and data management. Note that some of these websites do not have stable domains, so please contact us if a link is broken.

ATC
The USENIX Annual Technical Conference (ATC). Hosted every summer.

EuroSys
EuroSys is organized by EuroSys, the European Chapter of SIGOPS, sponsored by ACM SIGOPS. Hosted annually in mid-spring. This conference does not have a stable URL, so we have linked to a Google search.

FAST
The USENIX File and Storage Technologies (FAST) conference. Hosted annually in February.

HotStorage
The USENIX Workshop on Hot Topics in Storage and File Systems. Hosted every summer directly before ATC.

ICDCS
The IEEE International Conference on Distributed Computing Systems (ICDCS). This conference does not have a stable URL so we have linked to a Google search.

ICS
The ACM International Conference on Supercomputing (ICS). Hosted every summer.

MSST
The International Conference on Massive Storage Systems and Technology (MSST). Hosted every summer at the Santa Clara University School of Engineering in Santa Clara, CA.

NAS
The IEEE International Conference on Networking, Architecture, and Storage. Hosted annually.

NVMSA
The IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA). Hosted annually in the late summer. This conference does not have a stable URL, so we have linked to a Google search.

OSDI
The USENIX Symposium on Operating Systems Design and Implementation (OSDI). Hosted annually.

SIGMETRICS
The ACM Special Interest Group for the computer systems performance evaluation community. Hosted annually in June.

SIGOPS
The ACM Special Interest Group in Operating Systems. Hosts a number of conferences annually.

SoCC
The ACM Symposium on Cloud Computing (SoCC). Hosted annually. This conference does not have a stable URL, so we have linked to a Google search.

SOSP
The ACM Symposium on Operating Systems Principles (SOSP). Hosted annually.

Supercomputing
The International Conference for High Performance Computing, Networking, Storage, and Analysis. Hosted annually in late fall.

SYSTOR
The ACM International Systems and Storage Conference (SYSTOR). Hosted annually in Haifa, Israel.

VLDB
The Very Large Data Bases (VLDB) Conference. Hosted annually in late August.


Storage Research Centers

Carnegie Mellon University
Parallel Data Lab (PDL)

San Diego Supercomputer Center (SDSC)

University of Minnesota
Digital Technology Center (DTC)
Intelligent Storage Consortium (DISC)

Storage Performance Council (SPC)


Papers and Publications

Papers Relating to Traces

[Harter11] Tyler Harter, Chris Dragga, Michael Vaughn, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau.
A File is Not a File: Understanding the I/O Behavior of Apple Desktop Applications.
Department of Computer Sciences, University of Wisconsin, Madison. 2011.

[Ellard03b] Daniel Ellard, Margo Seltzer.
NFS Tricks and Benchmarking Traps.
Proceedings of the FREENIX Technical Conference, San Antonio, Texas. June, 2003.

[Ellard03a] Daniel Ellard, Jonathan Ledlie, Pia Malkani, Margo Seltzer.
Passive NFS Tracing of Email and Research Workloads.
Proceedings of the Second Annual USENIX File and Storage Technologies Conference, pp. 203-216, San Francisco, CA. March, 2003.

[Roselli00] Drew Roselli, Jacob R. Lorch, Thomas E. Anderson.
A Comparison of File System Workloads.
Proceedings of the 2000 USENIX Technical Conference, pp. 44 - 54. San Diego, CA. June, 2000.

[Vogels99] Werner Vogels.
File system usage in Windows NT 4.0.
Proceedings of the 17th Symposium on Operating System Principles, pp. 93 - 109. Kiawah Island Resort, SC. December, 1999.

[Douceur99] John R. Douceur, William J. Bolosky.
A Large-Scale Study of File-System Contents.
Proceedings of SIGMETRICS '99, pp. 59 - 70. Atlanta, GA. May, 1999.

[Kuenning97] Geoffrey H. Kuenning and Gerald J. Popek.
Automated Hoarding for Mobile Computers.
Proceedings of the 16th ACM Symposium on Operating Systems Principles, St. Malo, France, October 5-8, 1997.

[Uysal97] Mustafa Uysal, Anurag Acharya, Joel Saltz.
Requirements of I/O Systems for Parallel Machines: An Application-driven Study.
Technical Report, CS-TR-3802, University of Maryland, College Park. May 1997.

[Mummert96] L. Mummert, M. Satyanarayanan.
Long Term Distributed File Reference Tracing: Implementation and Experience.
Software - Practice and Experience, Vol. 26, No. 6, pp. 705 - 736. June, 1996.

[Blackwell95] Trevor Blackwell, Jeffrey Harris, Margo Seltzer.
Heuristic Cleaning Algorithms in Log-Structured File Systems.
Proceedings of the 1995 USENIX Technical Conference, pp. 277 - 288. New Orleans, LA. January, 1995.

[Griffioen94] Jim Griffioen, Randy Appleton.
Reducing File System Latency using a Predictive Approach.
Proceedings of the Summer 1994 USENIX Technical Conference, pp. 197 - 207. Boston, MA. June, 1994.

[Chiang93] Chi-ming Chiang, Matt W. Mutka.
Characteristics of User File Usage Patterns.
Systems and Software, Vol. 23, No. 3, pp. 257 - 268. December, 1993.

[Ruemmler93] Chris Ruemmler, John Wilkes.
UNIX Disk Access Patterns.
Proceedings of the Winter 1993 USENIX Technical Conference, pp. 405 - 420. San Diego, CA. January, 1993.

[Ramakrishnan92] K.K. Ramakrishnan, Prabuddha Biswas, Ramakrishna Karedla.
Analysis of File I/O Traces in Commercial Computing Environments.
Proceedings of SIGMETRICS '92, pp. 78 - 90. Newport, RI. June, 1992.

[Roselli98] Drew Roselli, Thomas E. Anderson.
Characteristics of File System Workloads.
University of California Berkeley Computer Science Division Technical Report UCB//CSD-98-1029. 1992.

[Shirriff92] Ken Shirriff, John K. Ousterhout.
A Trace-Driven Analysis of Name and Attribute Caching in a Distributed System.
Proceedings of the Winter 1992 USENIX Technical Conference, pp. 315 - 332. San Francisco, CA. January, 1992.

[Miller91] Ethan L. Miller, Randy H. Katz.
Input/Output Behavior of Supercomputing Applications.
Proceedings of the 1991 Conference on Supercomputing, pp. 567 - 576. Albuquerque, NM. November, 1991.

[Baker91] M. Baker, J. Hartman, M. Kupfer, K. Shirriff, and J. Ousterhout.
Measurements of a Distributed File System.
Proceedings of the 13th ACM Symposium of Operating Systems Principles, pp. 198 - 212. October 1991.

[Bozman91] G.P. Bozman, H.H. Ghannad, E.D. Weinberger.
A trace-driven study of CMS file references.
IBM Journal of Research and Development, Vol. 35, No. 5/6, pp. 815 - 828. September/November, 1991.

[Bennet91] J. Michael Bennet, Michael A. Bauer, David Kinchlea.
Characteristics of Files in NFS Environments.
Proceedings of the 1991 ACM Symposium on Small Systems, pp. 33 - 40. 1991.

[Biswas90] P. Biswas, K.K. Ramakrishnan.
File Access Characterization of VAX/VMS Environments.
Proceedings of the 10th International Conference on Distributed Computing Systems, pp. 227 - 234. Paris, France. May, 1990.

[Floyd86] Rick Floyd.
Short-Term File Reference Patterns in a UNIX Environment.
University of Rochester Computer Science Technical Report #177. March, 1986.

[Ousterhout85] J. Ousterhout, H. Costa, D. Harrison, J. Kunze, M. Kupfer, J. Thompson.
A Trace-Driven Analysis of the UNIX 4.2BSD File System.
Proceedings of the 10th Symposium on Operating System Principles, pp. 15 - 24. Orcas Island, WA. December, 1985.

[Satyanarayanan81] M. Satyanarayanan.
A Study of File Sizes and Functional Lifetimes.
Proceedings of the 8th Symposium on Operating System Principles, pp. 96 - 108. Pacific Grove, CA. December, 1981.

[Smith81] A. J. Smith.
Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms.
IEEE Transactions on Software Engineering, Vol SE-7, No. 4, pp. 403 - 417. July, 1981.

Publications That Cite iotta.snia.org

The following publications cite iotta.snia.org as a source of trace data used in their research. They are organized in reverse chronological order. This list attempts to be comprehensive but is not complete; feel free to contact us to suggest additional entries.


[Russo et al., 2021]
Gabriele Russo Russo, Valeria Cardellini, Giuliano Casale, and Francesco Lo Presti. MEAD: Model-based vertical auto-scaling for data stream processing. In Proceedings of the 21st IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), pages 314–323, 2021. (doi:10.1109/CCGrid51090.2021.00041)
[Shen et al., 2021]
Zhaoyan Shen, Lei Han, Chenlin Ma, Zhiping Jia, Tao Li, and Zili Shao. Leveraging the interplay of RAID and SSD for lifetime optimization of flash-based SSD RAID. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 40(7):1395–1408, 2021. (doi:10.1109/TCAD.2020.3020495)
[Tzouros and Kalogeraki, 2021]
Giannis Tzouros and Vana Kalogeraki. Preserving data availability in edge computing systems with diagonally interleaved coding. In Proceedings of the 24th Pan-Hellenic Conference on Informatics, PCI '20, page 87–90. ACM, 2021. (doi:10.1145/3437120.3437281)
[Wang et al., 2021]
Yi Wang, Jiangfan Huang, Jing Chen, and Rui Mao. PVSensing: A process-variation-aware space allocation strategy for 3d NAND flash memory. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 41(5):1302–1315, 2021.
[Wei et al., 2021]
Bing Wei, Jigang Wu, Xiaosong Su, Qiang Huang, and Yujun Liu. Adaptive updates for erasure-coded storage systems based on data delta and logging. In International Conference on Parallel and Distributed Computing, Applications and Technologies, pages 187–197. Springer, 2021.
[Wu et al., 2021]
Chin-Hsien Wu, I-Hung Li, and Jian-Jia Chen. A supervised-learning-based garbage collection in solid-state drives (SSDs). IT Professional, 23(6):39–45, 2021.
[Yang, 2021]
Junyao Yang. Efficient Modeling of Random Sampling-Based LRU Cache. PhD thesis, Michigan Technological University, 2021.
[Zhou et al., 2021]
Hai Zhou, Dan Feng, and Yuchong Hu. Multi-level forwarding and scheduling repair technique in heterogeneous network for erasure-coded clusters. In Proceedings of the 50th International Conference on Parallel Processing, pages 1–11, 2021.
[심석보, 2021]
심석보. PCRAM controller 의 hardware prefetcher 를 위한 data buffer 최적화. PhD thesis, 서울대학교 대학원, 2021.
[Guo et al., 2021]
Hanchen Guo, Zhehan Lin, Yunfei Gu, Chentao Wu, Li Jiang, Jie Li, Guangtao Xue, and Minyi Guo. Lazy-WL: A wear-aware load balanced data redistribution method for efficient SSD array scaling. In Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER), pages 157–168, Melbourne, Australia, September 2021. IEEE. (doi:10.1109/Cluster48925.2021.00030)
[Mounika and Setha, 2021]
K. Mounika and Prof. G. Setha. Utilization of data de duplication to enhance the performance of storage system in the cloud. Indo-American Journal of Pharma and Bio Sciences, 9(3):19–23, September 2021.
[Sha et al., 2021]
Zhibing Sha, Jun Li, Lihao Song, Jiewen Tang, Min Huang, Zhigang Cai, Lianju Qian, Jianwei Liao, and Zhiming Liu. Low I/O intensity-aware partial GC scheduling to reduce long-tail latency in SSDs. ACM Transactions on Architecture and Code Optimization, 18(4), August 2021. (doi:10.1145/3460433)
[Yin et al., 2021]
Lujia Yin, Li Wang, Yiming Zhang, and Yuxing Peng. MapperX: Adaptive metadata maintenance for fast crash recovery of DM-cache based hybrid storage devices. In Proceedings of the USENIX Annual Technical Conference, pages 705–713. USENIX Association, July 2021.
[Kim et al., 2021]
Joonsung Kim, Kanghyun Choi, Wonsik Lee, and Jangwoo Kim. Performance modeling and practical use cases for black-box SSDs. ACM Transactions on Storage, 17(2), June 2021. (doi:10.1145/3440022)
[Lee, 2021]
Myungsub Lee. A block classification method with monitor and restriction in NAND flash memory. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(5):209–215, May 2021. (doi:10.17762/turcomat.v12i5.877)
[Sajal et al., 2021]
Sultan Mahmud Sajal, Rubaba Hasan, Timothy Zhu, Bhuvan Urgaonkar, and Siddhartha Sen. TraceSplitter: A new paradigm for downscaling traces. In Proceedings of the 16th ACM European Conference on Computer Systems, Virtual, April 2021. ACM. (doi:10.1145/3447786.3456262)
[Steven G. Lyons, 2021]
Jr. Steven G. Lyons. A Study of Non-datapath Cache Replacement Algorithms. PhD thesis, Florida International University, March 2021.
[Jiang et al., 2021]
Tianyang Jiang, Guangyan Zhang, Zican Huang, Xiaosong Ma, Junyu Wei, Zhiyue Li, and Weimin Zheng. FusionRAID: Achieving consistent low latency for commodity SSD arrays. In Proceedings of the 19th USENIX Conference on File and Storage Technologies, pages 355–370, Virtual, February 2021. USENIX Association.
[Liu et al., 2021]
Zhang Liu, Hee Won Lee, Yu Xiang, Dirk Grunwald, and Sangtae Ha. eMRC: Efficient miss ratio approximation for multi-tier caching. In Proceedings of the 19th USENIX Conference on File and Storage Technologies, pages 293–306. USENIX Association, February 2021.
[Rodriguez et al., 2021]
Liana V. Rodriguez, Farzana Yusuf, Steven Lyons, Eysler Paz, Raju Rangaswami, Jason Liu, Ming Zhao, and Giri Narasimhan. Learning cache replacement with CACHEUS. In Proceedings of the 19th USENIX Conference on File and Storage Technologies, pages 341–354, Virtual, February 2021. USENIX Association.
[Yadgar et al., 2021]
Gala Yadgar, MOSHE Gabel, Shehbaz Jaffer, and Bianca Schroeder. SSD-based workload characteristics and their performance implications. ACM Transactions on Storage, 17(1), January 2021. (doi:10.1145/3423137)
[Ganfure et al., 2020]
Gaddisa Olani Ganfure, Chun-Feng Wu, Yuan-Hao Chang, and Wei-Kuan Shih. DeepPrefetcher: A deep learning framework for data prefetching in flash storage devices. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 39(11):3311–3322, 2020. (doi:10.1109/TCAD.2020.3012173)
[Gupta et al., 2020]
Jit Gupta, Krishna Kant, and Ayman Abouelwafa. FussyCache: A caching mechanism for emerging storage hierarchies. In Proceedings of the IEEE International Conference on Cloud Computing Technology and Science, pages 74–81. IEEE, 2020. (doi:10.1109/CloudCom49646.2020.00010)
[Hadizadeh et al., 2020]
Mostafa Hadizadeh, Reza Salkhordeh, and Hossein Asadi. Improving hybrid multi-level I/O cache lifetime using conservative cache management. Journal of Soft Computing and Information Technology, 9(2):165–177, 2020.
[Kakuta et al., 2020]
Yoshiki Kakuta, Reika Kinoshita, Hiroshi Kinoshita, Chihiro Matsui, and Ken Takeuchi. Real-time error monitoring system considering endurance and data-retention characteristics of TaOx ReRAM storage with workloads at data centers. In International Symposium on VLSI Design, Automation and Test, pages 1–4. IEEE, 2020. (doi:10.1109/VLSI-DAT49148.2020.9196379)
[Li et al., 2020]
Jun Li, Zhibing Sha, Zhigang Cai, François Trahay, and Jianwei Liao. Patch-based data management for dual-copy buffers in RAID-enabled SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 39(11):3956–3967, 2020. (doi:10.1109/TCAD.2020.3012252)
[Ma et al., 2020]
Chenlin Ma, Yi Wang, Zhaoyan Shen, and Zili Shao. KFR: Optimal cache management with K-framed reclamation for drive-managed SMR disks. In Proceedings of the 57th ACM/IEEE Design Automation Conference, pages 1–6, 2020. (doi:10.1109/DAC18072.2020.9218636)
[Mativenga et al., 2020]
Ronnie Mativenga, Prince Hamandawana, Tae-Sun Chung, and Jongik Kim. FTRM: A cache-based fault tolerant recovery mechanism for multi-channel flash devices. Electronics, 9(10), 2020. (doi:10.3390/electronics9101581)
[Miura et al., 2020]
Taisei Miura, Kenichi Kourai, and Saneyasu Yamaguchi. Cache replacement based on LSTM in the second cache in virtualized environment. In Proceedings of the 8th IEEE International Symposium on Computing and Networking Workshops (CANDARW), pages 421–424, Naha, Japan, 2020. (doi:10.1109/CANDARW51189.2020.00086)
[Oe, 2020]
Kazuichi Oe. Consideration for effectively handling parallel workloads on public cloud system, 2020.
[Shu et al., 2020]
Jiwu Shu, Fei Li, Siyang Li, and Youyou Lu. Towards unaligned writes optimization in cloud storage with high-performance SSDs. IEEE Transactions on Parallel and Distributed Systems, 2020. (doi:10.1109/TPDS.2020.3006655)
[Wang et al., 2020]
Shunzhuo Wang, You Zhou, Jiaona Zhou, Fei Wu, and Changsheng Xie. An efficient data migration scheme to optimize garbage collection in SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2020. (doi:10.1109/TCAD.2020.3001262)
[Wu et al., 2020]
Suzhen Wu, Jindong Zhou, Weidong Zhu, Hong Jiang, Zhijie Huang, Zhirong Shen, and Bo Mao. EaD: a collision-free and high performance deduplication scheme for flash storage systems. In Proceedings of the 38th IEEE International Conference on Computer Design, pages 155–162. IEEE, 2020. (doi:10.1109/ICCD50377.2020.00039)
[Yong and Chang, 2020]
Kong-Kiat Yong and Li-Pin Chang. Error diluting: Exploiting 3-D NAND flash process variation for efficient read on LDPC-based SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 39(11):3467–3478, 2020. (doi:10.1109/TCAD.2020.3012646)
[Zeng, 2020]
Xiangwei Zeng. Improve performance of flash-based SSDs through multi-subpage merge and page-level temperature recognition. International Journal of Science, 7(4):242–246, 2020.
[Zhu et al., 2020]
Guangyu Zhu, Jaehyun Han, and Yongseok Son. A preliminary study: Towards parallel garbage collection for NAND flash-based SSDs. IEEE Access, 8:223574–223587, 2020. (doi:10.1109/ACCESS.2020.3043123)
[Hao et al., 2020]
Mingzhe Hao, Levent Toksoz, Nanqinqin Li, Edward Edberg Halim, Henry Hoffmann, and Haryadi S. Gunawi. LinnOS: Predictability on unpredictable flash storage with a light neural network. In Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, pages 173–190, Virtual, November 2020. USENIX Association.
[Chen et al., 2020]
Si Chen, Jianqiao Liu, and Avani Wildani. CENSUS: Counting interleaved workloads on shared storage. In Proceedings of the 36th IEEE Symposium on Mass Storage Systems and Technologies, Virtual, October 2020.
[Fareed et al., 2020]
Imran Fareed, Mincheol Kang, Wonyoung Lee, , and Soontae Kim. PAPA: Partial page-aware page allocation in TLC flash SSD for performance enhancement. In Proceedings of the IEEE Symposium on Mass Storage Systems and Technologies, Virtual, October 2020. IEEE.
[Lin et al., 2020]
Yu-Chen Lin, Tse-Yuan Wang, Che-Wei Tsao, Yuan-Hao Chang, Jian-Jia Chen, Xue Liu, and Tei-Wei Kuo. Overheating-avoidance remapping scheme for reliability enhancement of 3D PCM storage systems. In Proceedings of the International Conference on Research in Adaptive and Convergent Systems (RACS), pages 239––246, Gwangju, Republic of Korea, October 2020. ACM. (doi:10.1145/3400286.3418248)
[Chakraborttii and Litz, 2020]
Chandranil Chakraborttii and Heiner Litz. Learning I/O access patterns to improve prefetching in SSDs. In Proceedings of the 2020 European Conference on Machine Learning, Ghent, Belgium, September 2020.
[Chang et al., 2020]
Chan Jung Chang, Jerry Chou, Yu-Ching Chou, and I-Hsin Chung. ECS2: A fast erasure coding library for GPU-accelerated storage systems with parallel & direct IO. In IEEE International Conference on Cluster Computing (CLUSTER), pages 349–358, Kobe, Japan, September 2020. IEEE.
[Li et al., 2020]
Jun Li, Bowen Huang, Zhibing Sha, Zhigang Cai, Jianwei Liao, Balazs Gerofi, and Yutaka Ishikawa. Mitigating negative impacts of read disturb in SSDs. ACM Transactions on Design Automation of Electronic Systems, 25, September 2020. (doi:10.1145/3410332)
[Atre et al., 2020]
Nirav Atre, Justine Sherry, Weina Wang, and Daniel S. Berger. Caching with delayed hits. In Proceedings of the 2020 ACM SIGCOMM, August 2020. (doi:10.1145/3387514.3405883)
[Jung et al., 2020]
Myoungsoo Jung, Wonil Choi, Miryeong Kwon, Shekhar Srikantaiah, Joonhyuk Yoo, and Mahmut Taylan Kandemir. Design of a host interface logic for GC-free SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 39(8):1674–1687, August 2020. (doi:10.1109/TCAD.2019.2919035)
[Baek and Park, 2020]
Sung Hoon Baek and Ki-Woong Park. Compatible byte-addressable direct I/O for peripheral memory devices in Linux. Information Systems, 91, July 2020. (doi:10.1016/j.is.2019.101488)
[Carra and Neglia, 2020]
Damiano Carra and Giovanni Neglia. Efficient miss ratio curve computation for heterogeneous content popularity. In Proceedings of the 2020 USENIX Annual Technical Conference, pages 741–751, July 2020.
[Eytan et al., 2020]
Ohad Eytan, Danny Harnik, Effi Ofer, Roy Friedman, and Ronen Kat. It's time to revisit LRU vs. FIFO. In Proceedings of the 12th USENIX Workshop on Hot Topics in Storage and File Systems. USENIX Association, July 2020.
[Jung, 2020]
Myoungsoo Jung. OpenExpress: Fully hardware automated open research framework for future fast NVMe devices. In Proceedings of the 2020 USENIX Annual Technical Conference, pages 649–656, July 2020.
[Oe and Nanri, 2020]
Kazuichi Oe and Takeshi Nanri. Non-volatile memory driver to drastically reduce input-output response time and maintain Linux device-mapper framework. International Journal of Networking and Computing, 10(2):127–143, July 2020.

Displaying items 151–200 of 770 in total
Showing citations per page


Member Links