Resources

NOTE: Neither the IOTTA TWG nor SNIA vouch for the accuracy or reliability of any of the traces or other information provided below. Please contact us regarding any broken or inaccurate links.

Jump To:


Tools and Documentation

Microsoft Event Tracing (1) (2)

Stonybrook University Dataseries Documentation Wiki

Storage Research List (Computer Storage Systems Research Discussion Forum)

Traces and Snapshots Public Archive

Re-Animator tracing and replay tool


Storage Conferences

This is a non-exhaustive list of conferences relating to storage and data management. Note that some of these websites do not have stable domains, so please contact us if a link is broken.

ATC
The USENIX Annual Technical Conference (ATC). Hosted every summer.

EuroSys
EuroSys is organized by EuroSys, the European Chapter of SIGOPS, sponsored by ACM SIGOPS. Hosted annually in mid-spring. This conference does not have a stable URL, so we have linked to a Google search.

FAST
The USENIX File and Storage Technologies (FAST) conference. Hosted annually in February.

HotStorage
The USENIX Workshop on Hot Topics in Storage and File Systems. Hosted every summer directly before ATC.

ICDCS
The IEEE International Conference on Distributed Computing Systems (ICDCS). This conference does not have a stable URL so we have linked to a Google search.

ICS
The ACM International Conference on Supercomputing (ICS). Hosted every summer.

MSST
The International Conference on Massive Storage Systems and Technology (MSST). Hosted every summer at the Santa Clara University School of Engineering in Santa Clara, CA.

NAS
The IEEE International Conference on Networking, Architecture, and Storage. Hosted annually.

NVMSA
The IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA). Hosted annually in the late summer. This conference does not have a stable URL, so we have linked to a Google search.

OSDI
The USENIX Symposium on Operating Systems Design and Implementation (OSDI). Hosted annually.

SIGMETRICS
The ACM Special Interest Group for the computer systems performance evaluation community. Hosted annually in June.

SIGOPS
The ACM Special Interest Group in Operating Systems. Hosts a number of conferences annually.

SoCC
The ACM Symposium on Cloud Computing (SoCC). Hosted annually. This conference does not have a stable URL, so we have linked to a Google search.

SOSP
The ACM Symposium on Operating Systems Principles (SOSP). Hosted annually.

Supercomputing
The International Conference for High Performance Computing, Networking, Storage, and Analysis. Hosted annually in late fall.

SYSTOR
The ACM International Systems and Storage Conference (SYSTOR). Hosted annually in Haifa, Israel.

VLDB
The Very Large Data Bases (VLDB) Conference. Hosted annually in late August.


Storage Research Centers

Carnegie Mellon University
Parallel Data Lab (PDL)

San Diego Supercomputer Center (SDSC)

University of Minnesota
Digital Technology Center (DTC)
Intelligent Storage Consortium (DISC)

Storage Performance Council (SPC)


Papers and Publications

Papers Relating to Traces

[Harter11] Tyler Harter, Chris Dragga, Michael Vaughn, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau.
A File is Not a File: Understanding the I/O Behavior of Apple Desktop Applications.
Department of Computer Sciences, University of Wisconsin, Madison. 2011.

[Ellard03b] Daniel Ellard, Margo Seltzer.
NFS Tricks and Benchmarking Traps.
Proceedings of the FREENIX Technical Conference, San Antonio, Texas. June, 2003.

[Ellard03a] Daniel Ellard, Jonathan Ledlie, Pia Malkani, Margo Seltzer.
Passive NFS Tracing of Email and Research Workloads.
Proceedings of the Second Annual USENIX File and Storage Technologies Conference, pp. 203-216, San Francisco, CA. March, 2003.

[Roselli00] Drew Roselli, Jacob R. Lorch, Thomas E. Anderson.
A Comparison of File System Workloads.
Proceedings of the 2000 USENIX Technical Conference, pp. 44 - 54. San Diego, CA. June, 2000.

[Vogels99] Werner Vogels.
File system usage in Windows NT 4.0.
Proceedings of the 17th Symposium on Operating System Principles, pp. 93 - 109. Kiawah Island Resort, SC. December, 1999.

[Douceur99] John R. Douceur, William J. Bolosky.
A Large-Scale Study of File-System Contents.
Proceedings of SIGMETRICS '99, pp. 59 - 70. Atlanta, GA. May, 1999.

[Kuenning97] Geoffrey H. Kuenning and Gerald J. Popek.
Automated Hoarding for Mobile Computers.
Proceedings of the 16th ACM Symposium on Operating Systems Principles, St. Malo, France, October 5-8, 1997.

[Uysal97] Mustafa Uysal, Anurag Acharya, Joel Saltz.
Requirements of I/O Systems for Parallel Machines: An Application-driven Study.
Technical Report, CS-TR-3802, University of Maryland, College Park. May 1997.

[Mummert96] L. Mummert, M. Satyanarayanan.
Long Term Distributed File Reference Tracing: Implementation and Experience.
Software - Practice and Experience, Vol. 26, No. 6, pp. 705 - 736. June, 1996.

[Blackwell95] Trevor Blackwell, Jeffrey Harris, Margo Seltzer.
Heuristic Cleaning Algorithms in Log-Structured File Systems.
Proceedings of the 1995 USENIX Technical Conference, pp. 277 - 288. New Orleans, LA. January, 1995.

[Griffioen94] Jim Griffioen, Randy Appleton.
Reducing File System Latency using a Predictive Approach.
Proceedings of the Summer 1994 USENIX Technical Conference, pp. 197 - 207. Boston, MA. June, 1994.

[Chiang93] Chi-ming Chiang, Matt W. Mutka.
Characteristics of User File Usage Patterns.
Systems and Software, Vol. 23, No. 3, pp. 257 - 268. December, 1993.

[Ruemmler93] Chris Ruemmler, John Wilkes.
UNIX Disk Access Patterns.
Proceedings of the Winter 1993 USENIX Technical Conference, pp. 405 - 420. San Diego, CA. January, 1993.

[Ramakrishnan92] K.K. Ramakrishnan, Prabuddha Biswas, Ramakrishna Karedla.
Analysis of File I/O Traces in Commercial Computing Environments.
Proceedings of SIGMETRICS '92, pp. 78 - 90. Newport, RI. June, 1992.

[Roselli98] Drew Roselli, Thomas E. Anderson.
Characteristics of File System Workloads.
University of California Berkeley Computer Science Division Technical Report UCB//CSD-98-1029. 1992.

[Shirriff92] Ken Shirriff, John K. Ousterhout.
A Trace-Driven Analysis of Name and Attribute Caching in a Distributed System.
Proceedings of the Winter 1992 USENIX Technical Conference, pp. 315 - 332. San Francisco, CA. January, 1992.

[Miller91] Ethan L. Miller, Randy H. Katz.
Input/Output Behavior of Supercomputing Applications.
Proceedings of the 1991 Conference on Supercomputing, pp. 567 - 576. Albuquerque, NM. November, 1991.

[Baker91] M. Baker, J. Hartman, M. Kupfer, K. Shirriff, and J. Ousterhout.
Measurements of a Distributed File System.
Proceedings of the 13th ACM Symposium of Operating Systems Principles, pp. 198 - 212. October 1991.

[Bozman91] G.P. Bozman, H.H. Ghannad, E.D. Weinberger.
A trace-driven study of CMS file references.
IBM Journal of Research and Development, Vol. 35, No. 5/6, pp. 815 - 828. September/November, 1991.

[Bennet91] J. Michael Bennet, Michael A. Bauer, David Kinchlea.
Characteristics of Files in NFS Environments.
Proceedings of the 1991 ACM Symposium on Small Systems, pp. 33 - 40. 1991.

[Biswas90] P. Biswas, K.K. Ramakrishnan.
File Access Characterization of VAX/VMS Environments.
Proceedings of the 10th International Conference on Distributed Computing Systems, pp. 227 - 234. Paris, France. May, 1990.

[Floyd86] Rick Floyd.
Short-Term File Reference Patterns in a UNIX Environment.
University of Rochester Computer Science Technical Report #177. March, 1986.

[Ousterhout85] J. Ousterhout, H. Costa, D. Harrison, J. Kunze, M. Kupfer, J. Thompson.
A Trace-Driven Analysis of the UNIX 4.2BSD File System.
Proceedings of the 10th Symposium on Operating System Principles, pp. 15 - 24. Orcas Island, WA. December, 1985.

[Satyanarayanan81] M. Satyanarayanan.
A Study of File Sizes and Functional Lifetimes.
Proceedings of the 8th Symposium on Operating System Principles, pp. 96 - 108. Pacific Grove, CA. December, 1981.

[Smith81] A. J. Smith.
Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms.
IEEE Transactions on Software Engineering, Vol SE-7, No. 4, pp. 403 - 417. July, 1981.

Publications That Cite iotta.snia.org

The following publications cite iotta.snia.org as a source of trace data used in their research. They are organized in reverse chronological order. This list attempts to be comprehensive but is not complete; feel free to contact us to suggest additional entries.


[Liu et al., 2016]
Jian Liu, Yunpeng Chai, Chang Yan, and Xin Wang. A delayed container organization approach to improve restored speed for deduplication systems. IEEE Transactions on Parallel and Distributed Systems, 27(9):2477–2491, September 2016. (doi:10.1109/TPDS.2015.2509060)
[Oe et al., 2016]
Kazuichi Oe, Takeshi Nanri, and Koji Okamura. Analysis of storage workloads of input-output access locality and designing of hybrid storage system. In Proceedings of the 5th IEEE Congress on Advanced Applied Informatics (IIAI-AAI), pages 971–974, Kumamoto, Japan, September 2016. IEEE. (doi:10.1109/IIAI-AAI.2016.54)
[Rush and Altiparmak, 2016]
Everett Neil Rush and Nihat Altiparmak. Exploiting replication for energy efficiency of heterogeneous storage systems. In Proceedings of the 24th IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, pages 79–84, London, UK, September 2016. (doi:10.1109/MASCOTS.2016.70)
[Waldspurger et al., 2016]
Carl A. Waldspurger, Irfan Ahmad, Alexander Garthwaite, and Nohhyun Park. System and method for efficient cache utility curve construction and cache allocation. United States Patent 9,418,020, August 16 2016.
[Dai et al., 2016]
Dong Dai, Forrest Sheng Bao, Jiang Zhou, and Yong Chen. Block2Vec: A deep learning strategy on mining block correlations in storage systems. In Proceedings of the 2016 45th International Conference on Parallel Processing Workshops, pages 230–239, Philadelphia, PA, August 2016. (doi:10.1109/ICPPW.2016.43)
[Huo et al., 2016]
Zhisheng Huo, Limin Xiao, Qiaoling Zhong, Shupan Li, Ang Li, Li Ruan, Shouxin Wang, and Lihong Fu. MBFS: A parallel metadata search method based on Bloomfilters using MapReduce for large-scale file systems. Journal of Supercomputing, 72(8):3006–3032, August 2016. (doi:10.1007/s11227-015-1464-2)
[Luo et al., 2016]
Dan Luo, Ting Yao, Xiaoyang Qu, Jiguang Wan, and Changsheng Xie. DVS: Dynamic variable-width striping RAID for shingled write disks. In Proceedings of the 2016 IEEE International Conference on Networking, Architecture, and Storage, pages 1–10, Long Beach, CA, August 2016. (doi:10.1109/NAS.2016.7549398)
[Wu and Chen, 2016]
Chin-Hsien Wu and Syuan-An Chen. JOM: A joint operation mechanism for NAND flash memory. ACM Transactions on Embedded Computing Systems, 15(4), August 2016. (doi:10.1145/2915916)
[Wu et al., 2016]
Suzhen Wu, Bo Mao, Xiaolan Chen, and Hong Jiang. LDM: Log disk mirroring with improved performance and reliability for SSD-based disk arrays. ACM Transactions on Storage, 12(4), August 2016. (doi:10.1145/2892639)
[Xie et al., 2016]
Wei Xie, Yong Chen, and Philip C. Roth. Parallel-DFTL: A flash translation layer that exploits internal parallelism in solid state drives. In IEEE International Conference on Networking, Architecture, and Storage, Long Beach, CA, August 2016. (doi:10.1109/NAS.2016.7549413)
[Yang and Zhu, 2016]
Yue Yang and Jianwen Zhu. Write skew and Zipf distribution: Evidence and implications. ACM Transactions on Storage, 12, August 2016. (doi:10.1145/2908557)
[Yeo et al., 2016]
Dong Bin Yeo, Joon-Young Paik, and Tae-Sun Chung. Request-size aware flash translation layer based on page-level mapping. In Proceedings of the 2016 IEEE Computational Science and Engineering, pages 68–71, Paris, France, August 2016. (doi:10.1109/CSE-EUC-DCABES.2016.163)
[Ha et al., 2016]
Keonsoo Ha, Jaeyong Jeong, and Jihong Kim. An integrated approach for managing read disturbs in high-density NAND flash memory. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 35(7):1079–1091, July 2016. (doi:10.1109/TCAD.2015.2504868)
[Bai et al., 2016]
Sen Bai, Xin Bai, and Xiangjiu Che. Window-LRFU: A cache replacement policy subsumes the LRU and window-LFU policies. Concurrency and Computation: Practice and Experience, 28(9):2670–2684, June 2016. (doi:10.1002/cpe.3730)
[Basak et al., 2016]
Jayanta Basak, Kushal Wadhwani, and Kaladhar Voruganti. Storage workload identification. ACM Transactions on Storage, 12(3), June 2016. (doi:10.1145/2818716)
[Mao et al., 2016]
Bo Mao, Hong Jiang, Suzhen Wu, and Lei Tian. Leveraging data deduplication to improve the performance of primary storage systems in the cloud. IEEE Transactions on Computers, 65(6):1775–1788, June 2016. (doi:10.1109/TC.2015.2455979)
[Ni et al., 2016]
Yuanjiang Ni, Ji Jiang, Dejun Jiang, Xiaosong Ma, Jin Xiong, and Yuangang Wang. S-RAC: SSD friendly caching for data center workloads. In Proceedings of the 9th ACM International Systems and Storage Conference (SYSTOR), pages 1–12, Haifa, Israel, June 2016. (doi:10.1145/2928275.2928284)
[Tanakamaru et al., 2016]
Shuhei Tanakamaru, Shogo Hosaka, Koh Johguchi, Hirofumi Takishita, and Ken Takeuchi. Understanding the relation between the performance and reliability of NAND flash/SCM hybrid solid-state drive. IEEE Transactions on Very Large Scale Integration Systems, 24(6):2208–2219, June 2016. (doi:10.1109/TVLSI.2015.2496976)
[Tarihi et al., 2016]
Mojtaba Tarihi, Hossein Asadi, Alireza Haghdoost, Mohammad Arjomand, and Hamid Sarbazi-Azad. A hybrid non-volatile cache design for solid-state drives using comprehensive I/O characterization. IEEE Transactions on Computers, 65(6):1678–1691, June 2016. (doi:10.1109/TC.2015.2455978)
[Tavakkol et al., 2016]
Arash Tavakkol, Pooyan Mehrvarzy, Mohammad Arjomand, and Hamid Sarbazi-Azad. Performance evaluation of dynamic page allocation strategies in SSDs. ACM Transactions on Modeling and Performance Evaluation of Computing Systems, 1(2), June 2016. (doi:10.1145/2829974)
[Wang et al., 2016]
Wei Wang, Tao Xie, and Abhinav Sharma. SWANS: An interdisk wear-leveling strategy for RAID-0 structured SSD arrays. ACM Transactions on Storage, 12(10):10:1–10:21, June 2016. (doi:10.1145/2756555)
[Wu et al., 2016]
Suzhen Wu, Yanping Lin, Bo Mao, and Hong Jiang. GCaR: Garbage collection aware cache management with improved performance for flash-based SSDs. In Proceedings of the 2016 ACM International Conference on Supercomputing, Istanbul, Turkey, June 2016. ACM. (doi:10.1145/2925426.2926263)
[Altiparmak and Tosun, 2016]
Nihat Altiparmak and Ali Şaman Tosun. Multithreaded maximum flow based optimal replica selection algorithm for heterogeneous storage architectures. IEEE Transactions on Computers, 65(5):1543–1557, May 2016. (doi:10.1109/TC.2015.2451620)
[Chang et al., 2016a]
Y. M. Chang, P. H. Lin, Y. J. Lin, T. C. Kuo, Y. H. Chang, Y. C. Li, H. P. Li, and K. C. Wang. An efficient sudden-power-off-recovery design with guaranteed booting time for solid state drives. In Proceedings of the 8th IEEE International Memory Workshop, pages 1–4, Paris, France, May 2016. IEEE. (doi:10.1109/IMW.2016.7493565)
[Chang et al., 2016b]
Yu-Ming Chang, Yuan-Hao Chang, Tei-Wei Kuo, Yung-Chun Li, and Hsiang-Pang Li. Disturbance relaxation for 3D flash memory. IEEE Transactions on Computers, 65(5):1467–1483, May 2016. (doi:10.1109/TC.2015.2451660)
[Patana-anake et al., 2016]
T. Patana-anake, V. Martin, N. Sandler, Cheng Wu, and H. S. Gunawi. Manylogs: Improved CMR/SMR disk bandwidth and faster durability with scattered logs. In Proceedings of the 32nd IEEE Symposium on Mass Storage Systems and Technologies, pages 1–16, Santa Clara, CA, May 2016. IEEE. (doi:10.1109/MSST.2016.7897075)
[Qu et al., 2016]
Xiaoyang Qu, Jiguang Wan, Jun Wang, Liqiong Liu, Dan Luo, and Changsheng Xie. GreenMatch: Renewable-aware workload scheduling for massive storage systems. In Proceedings of the 2016 IEEE International Symposium on Parallel & Distributed Processing, pages 403–412, Chicago, IL, May 2016. (doi:10.1109/IPDPS.2016.24)
[Yamaga et al., 2016]
Yusuke Yamaga, Chihiro Matsui, Shogo Hachiya, and Ken Takeuchi. Application optimized adaptive ECC with advanced LDPCs to resolve trade-off among reliability, performance, and cost of solid-state drives. In Proceedings of the 8th IEEE International Memory Workshop (IMW), Paris, France, May 2016. (doi:10.1109/IMW.2016.7493568)
[Jung, 2016]
Myoungsoo Jung. Exploring design challenges in getting solid state drives closer to CPU. IEEE Transactions on Computers, 65(4):1103–1115, April 2016. (doi:10.1109/TC.2014.2366772)
[Luo et al., 2016]
Dan Luo, Jiguang Wan, Yifeng Zhu, Nannan Zhao, Feng Li, and Changsheng Xie. Design and implementation of a hybrid shingled write disk system. IEEE Transactions on Parallel and Distributed Systems, 27(4):1017–1029, April 2016. (doi:10.1109/TPDS.2015.2425402)
[Ou et al., 2016]
Jiaxin Ou, Jiwu Shu, and Youyou Lu. A high performance file system for non-volatile main memory. In Proceedings of the 11th ACM European Conference on Computer Systems, New York, NY, USA, April 2016. ACM. (doi:10.1145/2901318.2901324)
[Shahidi et al., 2016]
Narges Shahidi, Mohammad Arjomand, Anand Sivasubramaniam, Mahmut T. Kandemir, and Chita Das. Storage consolidation: Not always a panacea, but can we ease the pain? In Proceedings of the 2016 IEEE International Symposium on Performance Analysis of Systems and Software, pages 159–160, Uppsala, Sweden, April 2016. (doi:10.1109/ISPASS.2016.7482090)
[Yao et al., 2016]
Jie Yao, Hong Jiang, Qiang Cao, Lei Tian, and Changsheng Xie. Elastic-RAID: A new architecture for improved availability of parity-based RAIDs by elastic mirroring. IEEE Transactions on Parallel and Distributed Systems, 27(4):1044–1056, April 2016. (doi:10.1109/TPDS.2015.2432808)
[Clavijo, 2016]
Dulcardo A. Arteaga Clavijo. Flash Caching for Cloud Computing Systems. PhD thesis, Florida International University, Miami, Florida, March 2016.
[Di et al., 2016]
Yejia Di, Liang Shi, Kaijie Wu, and Chun Jason Xue. Exploiting process variation for retention induced refresh minimization on flash memory. In Proceedings of the 2016 Design, Automation and Test in Europe Conference (DATE), pages 391–396, Dresden, Germany, March 2016.
[Arteaga et al., 2016]
Dulcardo Arteaga, Jorge Cabrera, Jing Xu, Swaminathan Sundararaman, and Ming Zhao. CloudCache: On-demand flash cache management for cloud computing. In Proceedings of the 14th USENIX Conference on File and Storage Technologies, pages 355–369, Santa Clara, CA, February 2016.
[Han et al., 2016]
Se Jun Han, Dong Hyun Kang, and Young Ik Eom. Dual write buffer algorithm for improving performance and lifetime of SSDs. Journal of KIISE, 43:177–185, February 2016.
[Huang and Chang, 2016]
Sheng-Min Huang and Li-Pin Chang. Exploiting page correlations for write buffering in page-mapping multichannel SSDs. ACM Transactions on Embedded Computing Systems, 15(1), February 2016. (doi:10.1145/2815622)
[Jung et al., 2016]
Myoungsoo Jung, Wonil Choi, Shuwen Gao, Ellis Herbert Wilson III, David Donofrio, John Shalf, and Mahmut Taylan Kandemir. NANDFlashSim: High-fidelity, microarchitecture-aware NAND flash memory simulation. ACM Transactions on Storage, 12(2), February 2016. (doi:10.1145/2700310)
[Kim et al., 2016]
Jin-Young Kim, Sang-Hoon Park, Hyeokjun Seo, Ki-Whan Song, Sungroh Yoon, and Eui-Young Chung. NAND flash memory with multiple page sizes for high-performance storage devices. IEEE Transactions on Very Large Scale Integration Systems, 24(2):764–768, February 2016. (doi:10.1109/TVLSI.2015.2409055)

Displaying items 481–520 of 770 in total
Showing citations per page


Member Links