Resources

NOTE: Neither the IOTTA TWG nor SNIA vouch for the accuracy or reliability of any of the traces or other information provided below. Please contact us regarding any broken or inaccurate links.

Tools and Documentation

Microsoft Event Tracing (1) (2)

Stony Brook University DataSeries Documentation Wiki

Storage Research List (Computer Storage Systems Research Discussion Forum)

Traces and Snapshots Public Archive

Re-Animator tracing and replay tool


Storage Conferences

This is a non-exhaustive list of conferences relating to storage and data management. Note that some of these websites do not have stable domains, so please contact us if a link is broken.

ATC
The USENIX Annual Technical Conference (ATC). Hosted every summer.

EuroSys
The European Conference on Computer Systems (EuroSys). Organized by the EuroSys association, the European Chapter of ACM SIGOPS, and sponsored by ACM SIGOPS. Hosted annually in mid-spring. This conference does not have a stable URL, so we have linked to a Google search.

FAST
The USENIX File and Storage Technologies (FAST) conference. Hosted annually in February.

HotStorage
The USENIX Workshop on Hot Topics in Storage and File Systems. Hosted every summer directly before ATC.

ICDCS
The IEEE International Conference on Distributed Computing Systems (ICDCS). This conference does not have a stable URL, so we have linked to a Google search.

ICS
The ACM International Conference on Supercomputing (ICS). Hosted every summer.

MSST
The International Conference on Massive Storage Systems and Technology (MSST). Hosted every summer at the Santa Clara University School of Engineering in Santa Clara, CA.

NAS
The IEEE International Conference on Networking, Architecture, and Storage. Hosted annually.

NVMSA
The IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA). Hosted annually in the late summer. This conference does not have a stable URL, so we have linked to a Google search.

OSDI
The USENIX Symposium on Operating Systems Design and Implementation (OSDI). Hosted annually.

SIGMETRICS
The conference of ACM SIGMETRICS, the ACM Special Interest Group for the computer systems performance evaluation community. Hosted annually in June.

SIGOPS
The ACM Special Interest Group on Operating Systems. Hosts a number of conferences annually.

SoCC
The ACM Symposium on Cloud Computing (SoCC). Hosted annually. This conference does not have a stable URL, so we have linked to a Google search.

SOSP
The ACM Symposium on Operating Systems Principles (SOSP). Hosted annually.

Supercomputing
The International Conference for High Performance Computing, Networking, Storage, and Analysis. Hosted annually in late fall.

SYSTOR
The ACM International Systems and Storage Conference (SYSTOR). Hosted annually in Haifa, Israel.

VLDB
The Very Large Data Bases (VLDB) Conference. Hosted annually in late August.


Storage Research Centers

Carnegie Mellon University
Parallel Data Lab (PDL)

San Diego Supercomputer Center (SDSC)

University of Minnesota
Digital Technology Center (DTC)
DTC Intelligent Storage Consortium (DISC)

Storage Performance Council (SPC)


Papers and Publications

Papers Relating to Traces

[Harter11] Tyler Harter, Chris Dragga, Michael Vaughn, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau.
A File is Not a File: Understanding the I/O Behavior of Apple Desktop Applications.
Department of Computer Sciences, University of Wisconsin, Madison. 2011.

[Ellard03b] Daniel Ellard, Margo Seltzer.
NFS Tricks and Benchmarking Traps.
Proceedings of the FREENIX Technical Conference, San Antonio, Texas. June, 2003.

[Ellard03a] Daniel Ellard, Jonathan Ledlie, Pia Malkani, Margo Seltzer.
Passive NFS Tracing of Email and Research Workloads.
Proceedings of the Second Annual USENIX File and Storage Technologies Conference, pp. 203-216, San Francisco, CA. March, 2003.

[Roselli00] Drew Roselli, Jacob R. Lorch, Thomas E. Anderson.
A Comparison of File System Workloads.
Proceedings of the 2000 USENIX Technical Conference, pp. 44 - 54. San Diego, CA. June, 2000.

[Vogels99] Werner Vogels.
File system usage in Windows NT 4.0.
Proceedings of the 17th Symposium on Operating System Principles, pp. 93 - 109. Kiawah Island Resort, SC. December, 1999.

[Douceur99] John R. Douceur, William J. Bolosky.
A Large-Scale Study of File-System Contents.
Proceedings of SIGMETRICS '99, pp. 59 - 70. Atlanta, GA. May, 1999.

[Kuenning97] Geoffrey H. Kuenning and Gerald J. Popek.
Automated Hoarding for Mobile Computers.
Proceedings of the 16th ACM Symposium on Operating Systems Principles, St. Malo, France, October 5-8, 1997.

[Uysal97] Mustafa Uysal, Anurag Acharya, Joel Saltz.
Requirements of I/O Systems for Parallel Machines: An Application-driven Study.
Technical Report, CS-TR-3802, University of Maryland, College Park. May 1997.

[Mummert96] L. Mummert, M. Satyanarayanan.
Long Term Distributed File Reference Tracing: Implementation and Experience.
Software - Practice and Experience, Vol. 26, No. 6, pp. 705 - 736. June, 1996.

[Blackwell95] Trevor Blackwell, Jeffrey Harris, Margo Seltzer.
Heuristic Cleaning Algorithms in Log-Structured File Systems.
Proceedings of the 1995 USENIX Technical Conference, pp. 277 - 288. New Orleans, LA. January, 1995.

[Griffioen94] Jim Griffioen, Randy Appleton.
Reducing File System Latency using a Predictive Approach.
Proceedings of the Summer 1994 USENIX Technical Conference, pp. 197 - 207. Boston, MA. June, 1994.

[Chiang93] Chi-ming Chiang, Matt W. Mutka.
Characteristics of User File Usage Patterns.
Journal of Systems and Software, Vol. 23, No. 3, pp. 257 - 268. December, 1993.

[Ruemmler93] Chris Ruemmler, John Wilkes.
UNIX Disk Access Patterns.
Proceedings of the Winter 1993 USENIX Technical Conference, pp. 405 - 420. San Diego, CA. January, 1993.

[Ramakrishnan92] K.K. Ramakrishnan, Prabuddha Biswas, Ramakrishna Karedla.
Analysis of File I/O Traces in Commercial Computing Environments.
Proceedings of SIGMETRICS '92, pp. 78 - 90. Newport, RI. June, 1992.

[Roselli98] Drew Roselli, Thomas E. Anderson.
Characteristics of File System Workloads.
University of California Berkeley Computer Science Division Technical Report UCB//CSD-98-1029. 1998.

[Shirriff92] Ken Shirriff, John K. Ousterhout.
A Trace-Driven Analysis of Name and Attribute Caching in a Distributed System.
Proceedings of the Winter 1992 USENIX Technical Conference, pp. 315 - 332. San Francisco, CA. January, 1992.

[Miller91] Ethan L. Miller, Randy H. Katz.
Input/Output Behavior of Supercomputing Applications.
Proceedings of the 1991 Conference on Supercomputing, pp. 567 - 576. Albuquerque, NM. November, 1991.

[Baker91] M. Baker, J. Hartman, M. Kupfer, K. Shirriff, and J. Ousterhout.
Measurements of a Distributed File System.
Proceedings of the 13th ACM Symposium on Operating Systems Principles, pp. 198 - 212. October 1991.

[Bozman91] G.P. Bozman, H.H. Ghannad, E.D. Weinberger.
A trace-driven study of CMS file references.
IBM Journal of Research and Development, Vol. 35, No. 5/6, pp. 815 - 828. September/November, 1991.

[Bennet91] J. Michael Bennet, Michael A. Bauer, David Kinchlea.
Characteristics of Files in NFS Environments.
Proceedings of the 1991 ACM Symposium on Small Systems, pp. 33 - 40. 1991.

[Biswas90] P. Biswas, K.K. Ramakrishnan.
File Access Characterization of VAX/VMS Environments.
Proceedings of the 10th International Conference on Distributed Computing Systems, pp. 227 - 234. Paris, France. May, 1990.

[Floyd86] Rick Floyd.
Short-Term File Reference Patterns in a UNIX Environment.
University of Rochester Computer Science Technical Report #177. March, 1986.

[Ousterhout85] J. Ousterhout, H. Costa, D. Harrison, J. Kunze, M. Kupfer, J. Thompson.
A Trace-Driven Analysis of the UNIX 4.2BSD File System.
Proceedings of the 10th Symposium on Operating System Principles, pp. 15 - 24. Orcas Island, WA. December, 1985.

[Satyanarayanan81] M. Satyanarayanan.
A Study of File Sizes and Functional Lifetimes.
Proceedings of the 8th Symposium on Operating System Principles, pp. 96 - 108. Pacific Grove, CA. December, 1981.

[Smith81] A. J. Smith.
Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms.
IEEE Transactions on Software Engineering, Vol SE-7, No. 4, pp. 403 - 417. July, 1981.

Publications That Cite iotta.snia.org

The following publications cite iotta.snia.org as a source of trace data used in their research. They are organized in reverse chronological order. This list attempts to be comprehensive but is not complete; feel free to contact us to suggest additional entries.


[Cui et al., 2019]
Yan Cui, Karthik Prasanna, and Andres Rangel. Kernel policy optimization for computing workloads. United States Patent 2016/10228973, March 12 2019.
[Chen et al., 2019]
Jing Chen, Yi Wang, Amelie Chi Zhou, Rui Mao, and Tao Li. PATCH: Process-variation-resilient space allocation for open-channel SSD with 3D flash. In 2019 Design, Automation and Test in Europe Conference (DATE), Florence, Italy, March 2019. IEEE. (doi:10.23919/DATE.2019.8715197)
[Huang et al., 2019]
Xunsong Huang, Chentao Wu, and Jie Li. OPS: An optimized partial stripe write scheme to improve performance of XOR-based disk arrays tolerating triple disk failures. In Proceedings of the 3rd International Conference on High Performance Compilation, Computing and Communications, pages 139–148, Xi'an, China, March 2019. ACM. (doi:10.1145/3318265.3318274)
[Li et al., 2019]
Huiba Li, Yiming Zhang, Dongsheng Li, Zhiming Zhang, Shengyun Liu, Peng Huang, Zheng Qin, Kai Chen, and Yongqiang Xiong. URSA: Hybrid block storage for cloud-scale virtual disks. In Proceedings of the 14th ACM European Conference on Computer Systems, pages 1–17, Dresden, Germany, March 2019. (doi:10.1145/3302424.3303967)
[Wang et al., 2019]
Xiaohao Wang, Yifan Yuan, You Zhou, Chance C. Coats, and Jian Huang. Project Almanac: A time-traveling solid-state drive. In Proceedings of the 14th ACM European Conference on Computer Systems, pages 1–16, Dresden, Germany, March 2019. (doi:10.1145/3302424.3303983)
[Ajdari et al., 2019]
Mohammadamin Ajdari, Pyeongsu Park, Joonsung Kim, Dongup Kwon, and Jangwoo Kim. CIDR: A cost-effective in-line data reduction system for terabit-per-second scale SSD arrays. In Proceedings of the 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA), pages 28–41, Washington, DC, February 2019. (doi:10.1109/HPCA.2019.00025)
[Dai et al., 2019]
Dong Dai, Forrest Sheng Bao, Jiang Zhou, Xuanhua Shi, and Yong Chen. Vectorizing disk blocks for efficient storage system via deep learning. Parallel Computing, 82:75–90, February 2019.
[Harnik et al., 2019]
Danny Harnik, Moshik Hershcovitch, Yosef Shatsky, Amir Epstein, and Ronen Kat. Sketching volume capacities in deduplicated storage. In Proceedings of the 17th USENIX Conference on File and Storage Technologies, Boston, MA, February 2019. USENIX Association.
[Liao et al., 2019]
Zhuofan Liao, Ruiming Zhang, Shiming He, Daojian Zeng, Jin Wang, and Hye-Jin Kim. Deep learning-based data storage for low latency in data center networks. IEEE Access, 7:26411–26417, February 2019. (doi:10.1109/ACCESS.2019.2901742)
[Di et al., 2019]
Yejia Di, Liang Shi, Congming Gao, Qiao Li, Chun Jason Xue, and Kaijie Wu. Minimizing retention induced refresh through exploiting process variation of flash memory. IEEE Transactions on Computers, 68(1):83–98, January 2019. (doi:10.1109/TC.2018.2858771)
[Li et al., 2019]
Jian-Geng Li, Guan-Yu Chen, Hsung-Pin Chang, and Da-Wei Chang. SSKIP: Lifetime aware page skipping for multi-level cell flash-based solid-state drives. In 2019 International Conference on Electronics, Information, and Communication, Auckland, New Zealand, January 2019. IEEE. (doi:10.23919/ELINFOCOM.2019.8706493)
[Matsui and Takeuchi, 2019]
Chihiro Matsui and Ken Takeuchi. Design of heterogeneously-integrated memory system with storage class memories and NAND flash memories. In Proceedings of the 24th Asia and South Pacific Design Automation Conference, Tokyo, Japan, January 2019. ACM. (doi:10.1145/3287624.3287754)
[Nayak and Patgiri, 2019]
Sabuzima Nayak and Ripon Patgiri. Dr. Hadoop: In search of a needle in a haystack. In 15th International Conference on Distributed Computing and Internet Technology, Odisha, India, January 2019. Springer International Publishing. (doi:10.1007/978-3-030-05366-6_8)
[Sun et al., 2019]
Hui Sun, Jianzhong Huang, Xiao Qin, and Changsheng Xie. DLSpace: Optimizing SSD lifetime via an efficient distributed log space allocation. ACM Transactions on Embedded Computing Systems, 17(92):92:1–92:33, January 2019. (doi:10.1145/3284749)
[Yang et al., 2019]
Ming-Chang Yang, Yuan-Hao Chang, Fenggang Wu, Tei-Wei Kuo, and David H.C. Du. On improving the write responsiveness for host-aware SMR drives. IEEE Transactions on Computers, 68(1):111–124, January 2019. (doi:10.1109/TC.2018.2845383)
[Zhou et al., 2019]
You Zhou, Fei Wu, Zhonghai Lu, Xubin He, Ping Huang, and Changsheng Xie. SCORE: A novel scheme to efficiently cache overlong ECCs in NAND flash memory. ACM Transactions on Architecture and Code Optimization, 15(60):60:1–60:25, January 2019. (doi:10.1145/3291052)
[Amvrosiadis et al., 2018]
George Amvrosiadis, Michael Kuchnik, Jun Woo Park, Chuck Cranor, Gregory R. Ganger, Elisabeth Moore, and Nathan Debardeleben. The Atlas cluster trace repository. ;login:, 43(4):29–35, 2018.
[Arafat et al., 2018]
Hassan Arafat, Ryohei Shimizu, and Koh Johguchi. Hierarchical hybrid solid state drive. In Proceedings of the TENCON, Jeju, Korea, 2018. IEEE. (doi:10.1109/TENCON.2018.8650261)
[Chung, 2018]
Wei-Sheng Chung. Proof of violation with adaptive Huffman coding hash tree for cloud services. Master's thesis, National Central University, 2018.
[Liang et al., 2018]
Jie Liang, Yongkun Li, Hao Chen, and Yinlong Xu. Boosting performance of SSD with chip-level RAID by deferring garbage collection. IEICE Electronics Express, 15(11), 2018. (doi:10.1587/elex.15.20180407)
[Nagel et al., 2018]
Lars Nagel, Tim Süß, Kevin Kremer, M. Umar Hameed, Lingfang Zeng, and André Brinkmann. Time-efficient garbage collection in SSDs. CoRR, 2018.
[Seo et al., 2018]
Seok-Bin Seo, Wanil Kim, and Se Jin Kwon. Efficient page collection scheme for QLC NAND flash memory using cache. International Journal of Advanced Computer Science and Applications, 9(11):458–461, 2018.
[Zhou et al., 2018]
Ke Zhou, Yu Zhang, Ping Huang, Hua Wang, Yongguang Ji, Bin Cheng, and Ying Liu. LEA: A lazy eviction algorithm for SSD cache in cloud block storage. In Proceedings of the 36th IEEE International Conference on Computer Design, Orlando, FL, 2018. IEEE. (doi:10.1109/ICCD.2018.00091)
[Hu et al., 2018]
Cheng Hu, Yuhui Deng, Geyong Min, Ping Huang, and Xiao Qin. QoS promotion in energy-efficient datacenters through peak load scheduling. IEEE Transactions on Cloud Computing, December 2018. (doi:10.1109/TCC.2018.2886187)
[Mei et al., 2018]
Linjun Mei, Dan Feng, Lingfang Zeng, Jianxi Chen, and Jingning Liu. A high-performance and high reliability RAIS5 storage architecture with adaptive stripe. In Algorithms and Architectures for Parallel Processing, Guangzhou, China, December 2018. Springer International Publishing.
[Oe et al., 2018]
Kazuichi Oe, Mitsuru Sato, and Takeshi Nanri. ATSMF: Automated tiered storage with fast memory and slow flash storage to improve response time with concentrated input-output (IO) workloads. IEICE Transactions on Information and Systems, E101-D(12):2889–2901, December 2018. (doi:10.1587/transinf.2018PAP0005)
[Xie et al., 2018]
Wei Xie, Yong Chen, and Philip C. Roth. Exploiting internal parallelism for address translation in solid-state drives. ACM Transactions on Storage, 14(32):32:1–32:30, December 2018. (doi:10.1145/3239564)
[Garrett, 2018]
Tyler Garrett. Enabling intra-plane parallel block erase in NAND flash to alleviate the impact of garbage collection. Master's thesis, University of Pittsburgh, Pittsburgh, PA, November 2018.
[Lin et al., 2018]
Ping-Hsien Lin, Yu-Ming Chang, Yung-Chun Li, Wei-Chen Wang, Chien-Chung Ho, and Yuan-Hao Chang. Achieving fast sanitization with zero live data copy for MLC flash memory. In Proceedings of the 2018 IEEE/ACM International Conference on Computer-Aided Design, pages 1–8, San Diego, CA, November 2018. (doi:10.1145/3240765.3240773)
[Xu, 2018]
Jun Xu. Block Trace Analysis and Storage System Optimization, chapter Trace Collection, pages 89–99. Apress, Berkeley, CA, November 2018. (doi:10.1007/978-1-4842-3928-5_3)
[Choi et al., 2018]
Wonil Choi, Myoungsoo Jung, and Mahmut Kandemir. Invalid data-aware coding to enhance the read performance of high-density flash memories. In Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, pages 482–493, Fukuoka, Japan, October 2018. (doi:10.1109/MICRO.2018.00046)
[Hwang and Kwak, 2018]
Sang-Ho Hwang and Jong Wook Kwak. RbWL: Recency-based static wear leveling for lifetime extension and overhead reduction in NAND flash memory systems. IEICE Transactions on Information and Systems, E101-D(10):2518–2522, October 2018. (doi:10.1587/transinf.2018EDL8076)
[Kim et al., 2018]
Joonsung Kim, Pyeongsu Park, Jaehyung Ahn, Jihun Kim, Jong Kim, and Jangwoo Kim. SSDcheck: Timely and accurate prediction of irregular behaviors in black-box SSDs. In Proceedings of the 51st Annual IEEE/ACM International Symposium on Microarchitecture, pages 455–468, Fukuoka, Japan, October 2018. (doi:10.1109/MICRO.2018.00044)
[Kinoshita et al., 2018]
Reika Kinoshita, Chihiro Matsui, Shinpei Matsuda, Yutaka Adachi, and Ken Takeuchi. Maximizing performance/cost figure of merit of storage-type SCM based SSD by adding small capacity of memory-type SCM. In Proceedings of the 2018 Non-Volatile Memory Technology Symposium, Sendai, Japan, October 2018. (doi:10.1109/NVMTS.2018.8603117)
[Li et al., 2018]
Daiping Li, Xiaoyang Qu, Jiguang Wan, Jun Wang, Yang Xia, Xiaozhao Zhuang, and Changsheng Xie. Workload scheduling for massive storage systems with arbitrary renewable supply. IEEE Transactions on Parallel and Distributed Systems, 29(10):2373–2387, October 2018. (doi:10.1109/TPDS.2018.2820070)
[Liu et al., 2018]
Caiyin Liu, Min Lv, Yubiao Pan, Hao Chen, Yongkun Li, Cheng Li, and Yinlong Xu. LCR: Load-aware cache replacement algorithm for flash-based SSDs. In Proceedings of the 2018 IEEE International Conference on Networking, Architecture, and Storage, Chongqing, China, October 2018. (doi:10.1109/NAS.2018.8515727)
[Lv et al., 2018]
Hao Lv, You Zhou, Fei Wu, Weijun Xiao, Xubin He, Zhonghai Lu, and Changsheng Xie. Exploiting minipage-level mapping to improve write efficiency of NAND flash. In Proceedings of the 2018 IEEE International Conference on Networking, Architecture, and Storage, Chongqing, China, October 2018. IEEE. (doi:10.1109/NAS.2018.8515728)
[Paik et al., 2018]
Joon-Young Paik, Joong-Hyun Choi, Rize Jin, Jianming Wang, and Eun-Sun Cho. A storage-level detection mechanism against crypto-ransomware. In Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, pages 2258–2260, Toronto, Canada, October 2018. (doi:10.1145/3243734.3278491)
[Suzuki et al., 2018]
Atsuya Suzuki, Chihiro Matsui, and Ken Takeuchi. Periodic data eviction algorithm of SCM/NAND flash hybrid SSD with SCM retention time constraint capabilities at extremely high temperature. In Non-Volatile Memory Technology Symposium 2018, Sendai, Japan, October 2018. ACM. (doi:10.1109/NVMTS.2018.8603108)
[Wu et al., 2018]
Fei Wu, Zuo Lu, You Zhou, Xubin He, Zhihu Tan, and Changsheng Xie. OSPADA: One-shot programming aware data allocation policy to improve 3D NAND flash read performance. In Proceedings of the 36th IEEE International Conference on Computer Design, Orlando, FL, October 2018. IEEE. (doi:10.1109/ICCD.2018.00018)
