Resources

NOTE: Neither the IOTTA TWG nor SNIA vouch for the accuracy or reliability of any of the traces or other information provided below. Please contact us regarding any broken or inaccurate links.

Jump To:


Tools and Documentation

Microsoft Event Tracing (1) (2)

Stonybrook University Dataseries Documentation Wiki

Storage Research List (Computer Storage Systems Research Discussion Forum)

Traces and Snapshots Public Archive

Re-Animator tracing and replay tool


Storage Conferences

This is a non-exhaustive list of conferences relating to storage and data management. Note that some of these websites do not have stable domains, so please contact us if a link is broken.

ATC
The USENIX Annual Technical Conference (ATC). Hosted every summer.

EuroSys
EuroSys is organized by EuroSys, the European Chapter of SIGOPS, sponsored by ACM SIGOPS. Hosted annually in mid-spring. This conference does not have a stable URL, so we have linked to a Google search.

FAST
The USENIX File and Storage Technologies (FAST) conference. Hosted annually in February.

HotStorage
The USENIX Workshop on Hot Topics in Storage and File Systems. Hosted every summer directly before ATC.

ICDCS
The IEEE International Conference on Distributed Computing Systems (ICDCS). This conference does not have a stable URL so we have linked to a Google search.

ICS
The ACM International Conference on Supercomputing (ICS). Hosted every summer.

MSST
The International Conference on Massive Storage Systems and Technology (MSST). Hosted every summer at the Santa Clara University School of Engineering in Santa Clara, CA.

NAS
The IEEE International Conference on Networking, Architecture, and Storage. Hosted annually.

NVMSA
The IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA). Hosted annually in the late summer. This conference does not have a stable URL, so we have linked to a Google search.

OSDI
The USENIX Symposium on Operating Systems Design and Implementation (OSDI). Hosted annually.

SIGMETRICS
The ACM Special Interest Group for the computer systems performance evaluation community. Hosted annually in June.

SIGOPS
The ACM Special Interest Group in Operating Systems. Hosts a number of conferences annually.

SoCC
The ACM Symposium on Cloud Computing (SoCC). Hosted annually. This conference does not have a stable URL, so we have linked to a Google search.

SOSP
The ACM Symposium on Operating Systems Principles (SOSP). Hosted annually.

Supercomputing
The International Conference for High Performance Computing, Networking, Storage, and Analysis. Hosted annually in late fall.

SYSTOR
The ACM International Systems and Storage Conference (SYSTOR). Hosted annually in Haifa, Israel.

VLDB
The Very Large Data Bases (VLDB) Conference. Hosted annually in late August.


Storage Research Centers

Carnegie Mellon University
Parallel Data Lab (PDL)

San Diego Supercomputer Center (SDSC)

University of Minnesota
Digital Technology Center (DTC)
Intelligent Storage Consortium (DISC)

Storage Performance Council (SPC)


Papers and Publications

Papers Relating to Traces

[Harter11] Tyler Harter, Chris Dragga, Michael Vaughn, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau.
A File is Not a File: Understanding the I/O Behavior of Apple Desktop Applications.
Department of Computer Sciences, University of Wisconsin, Madison. 2011.

[Ellard03b] Daniel Ellard, Margo Seltzer.
NFS Tricks and Benchmarking Traps.
Proceedings of the FREENIX Technical Conference, San Antonio, Texas. June, 2003.

[Ellard03a] Daniel Ellard, Jonathan Ledlie, Pia Malkani, Margo Seltzer.
Passive NFS Tracing of Email and Research Workloads.
Proceedings of the Second Annual USENIX File and Storage Technologies Conference, pp. 203-216, San Francisco, CA. March, 2003.

[Roselli00] Drew Roselli, Jacob R. Lorch, Thomas E. Anderson.
A Comparison of File System Workloads.
Proceedings of the 2000 USENIX Technical Conference, pp. 44 - 54. San Diego, CA. June, 2000.

[Vogels99] Werner Vogels.
File system usage in Windows NT 4.0.
Proceedings of the 17th Symposium on Operating System Principles, pp. 93 - 109. Kiawah Island Resort, SC. December, 1999.

[Douceur99] John R. Douceur, William J. Bolosky.
A Large-Scale Study of File-System Contents.
Proceedings of SIGMETRICS '99, pp. 59 - 70. Atlanta, GA. May, 1999.

[Kuenning97] Geoffrey H. Kuenning and Gerald J. Popek.
Automated Hoarding for Mobile Computers.
Proceedings of the 16th ACM Symposium on Operating Systems Principles, St. Malo, France, October 5-8, 1997.

[Uysal97] Mustafa Uysal, Anurag Acharya, Joel Saltz.
Requirements of I/O Systems for Parallel Machines: An Application-driven Study.
Technical Report, CS-TR-3802, University of Maryland, College Park. May 1997.

[Mummert96] L. Mummert, M. Satyanarayanan.
Long Term Distributed File Reference Tracing: Implementation and Experience.
Software - Practice and Experience, Vol. 26, No. 6, pp. 705 - 736. June, 1996.

[Blackwell95] Trevor Blackwell, Jeffrey Harris, Margo Seltzer.
Heuristic Cleaning Algorithms in Log-Structured File Systems.
Proceedings of the 1995 USENIX Technical Conference, pp. 277 - 288. New Orleans, LA. January, 1995.

[Griffioen94] Jim Griffioen, Randy Appleton.
Reducing File System Latency using a Predictive Approach.
Proceedings of the Summer 1994 USENIX Technical Conference, pp. 197 - 207. Boston, MA. June, 1994.

[Chiang93] Chi-ming Chiang, Matt W. Mutka.
Characteristics of User File Usage Patterns.
Systems and Software, Vol. 23, No. 3, pp. 257 - 268. December, 1993.

[Ruemmler93] Chris Ruemmler, John Wilkes.
UNIX Disk Access Patterns.
Proceedings of the Winter 1993 USENIX Technical Conference, pp. 405 - 420. San Diego, CA. January, 1993.

[Ramakrishnan92] K.K. Ramakrishnan, Prabuddha Biswas, Ramakrishna Karedla.
Analysis of File I/O Traces in Commercial Computing Environments.
Proceedings of SIGMETRICS '92, pp. 78 - 90. Newport, RI. June, 1992.

[Roselli98] Drew Roselli, Thomas E. Anderson.
Characteristics of File System Workloads.
University of California Berkeley Computer Science Division Technical Report UCB//CSD-98-1029. 1992.

[Shirriff92] Ken Shirriff, John K. Ousterhout.
A Trace-Driven Analysis of Name and Attribute Caching in a Distributed System.
Proceedings of the Winter 1992 USENIX Technical Conference, pp. 315 - 332. San Francisco, CA. January, 1992.

[Miller91] Ethan L. Miller, Randy H. Katz.
Input/Output Behavior of Supercomputing Applications.
Proceedings of the 1991 Conference on Supercomputing, pp. 567 - 576. Albuquerque, NM. November, 1991.

[Baker91] M. Baker, J. Hartman, M. Kupfer, K. Shirriff, and J. Ousterhout.
Measurements of a Distributed File System.
Proceedings of the 13th ACM Symposium of Operating Systems Principles, pp. 198 - 212. October 1991.

[Bozman91] G.P. Bozman, H.H. Ghannad, E.D. Weinberger.
A trace-driven study of CMS file references.
IBM Journal of Research and Development, Vol. 35, No. 5/6, pp. 815 - 828. September/November, 1991.

[Bennet91] J. Michael Bennet, Michael A. Bauer, David Kinchlea.
Characteristics of Files in NFS Environments.
Proceedings of the 1991 ACM Symposium on Small Systems, pp. 33 - 40. 1991.

[Biswas90] P. Biswas, K.K. Ramakrishnan.
File Access Characterization of VAX/VMS Environments.
Proceedings of the 10th International Conference on Distributed Computing Systems, pp. 227 - 234. Paris, France. May, 1990.

[Floyd86] Rick Floyd.
Short-Term File Reference Patterns in a UNIX Environment.
University of Rochester Computer Science Technical Report #177. March, 1986.

[Ousterhout85] J. Ousterhout, H. Costa, D. Harrison, J. Kunze, M. Kupfer, J. Thompson.
A Trace-Driven Analysis of the UNIX 4.2BSD File System.
Proceedings of the 10th Symposium on Operating System Principles, pp. 15 - 24. Orcas Island, WA. December, 1985.

[Satyanarayanan81] M. Satyanarayanan.
A Study of File Sizes and Functional Lifetimes.
Proceedings of the 8th Symposium on Operating System Principles, pp. 96 - 108. Pacific Grove, CA. December, 1981.

[Smith81] A. J. Smith.
Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms.
IEEE Transactions on Software Engineering, Vol SE-7, No. 4, pp. 403 - 417. July, 1981.

Publications That Cite iotta.snia.org

The following publications cite iotta.snia.org as a source of trace data used in their research. They are organized in reverse chronological order. This list attempts to be comprehensive but is not complete; feel free to contact us to suggest additional entries.


[Chen et al., 2013]
Zhiguang Chen, Nong Xiao, and Fang Liu. An SSD-based accelerator for directory parsing in storage systems containing massive files. Peer-to-Peer Networking and Applications, 6(4):397–408, December 2013.
[Qiu et al., 2013]
Wenwei Qiu, Xiang Chen, Nong Xiao, Fang Liu, and Zhiguang Chen. A new exploration to build flash-based storage systems by co-designing file system and FTL. In Proceedings of the 16th IEEE International Conference on Computational Science and Engineering, Sydney, Australia, December 2013. IEEE. (doi:10.1109/CSE.2013.138)
[Wang et al., 2013]
Ronghui Wang, Ting Cao, Ou Yang, Nong Xiao, and Minxuan Zhang. A study of background cleaning and data allocation for multi-channel SSDs. In Proceedings of the 2nd IEEE International Symposium on Instrumentation & Measurement, Sensor Network and Automation (IMSNA), Toronto, Ontario, Canada, December 2013. IEEE.
[Zhao and Wang, 2013]
Xiao-Yong Zhao and Lei Wang. An activity-based replica placement method of energy-conservation. In Proceedings of the 2013 International Conference on Fuzzy Theory and its Applications, Taipei, Taiwan, December 2013. (doi:10.1109/iFuzzy.2013.6825481)
[Chang et al., 2013]
Yu-Ming Chang, Yuan-Hao Chang, Tei-Wei Kuo, Hsiang-Pang Li, and Yung-Chun Li. A disturb-alleviation scheme for 3D flash memory. In Proceedings of the 2013 IEEE/ACM International Conference on Computer-Aided Design, San Jose, CA, November 2013. IEEE. (doi:10.1109/ICCAD.2013.6691152)
[Katsuno, 2013]
Ian Katsuno. SD storage array: Development and characterization of a many-device storage architecture. Master's thesis, University of Toronto, November 2013.
[Pan et al., 2013]
Wen Pan, Feng Liu, Tao Xie, Yanyan Gao, Yiming Ouyang, and Tian Chen. SPD-RAID4: Splitting parity disk for RAID4 structured parallel SSD arrays. In Proceedings of the IEEE International Conference on High Performance Computing and Communications and IEEE International Conference on Embedded and Ubiquitous Computing, Zhangjiajie, China, November 2013. IEEE. (doi:10.1109/HPCC.and.EUC.2013.12)
[yi Sung, 2013]
Hung yi Sung. Exploiting multi-controller parallelism for solid-state drives. Master's thesis, National Taiwan University of Science and Technology, Taipei, Taiwan, November 2013.
[Emers, 2013]
Joseph Emers. Workload Traces Analysis and Replay in Large Scale Distributed Systems. PhD thesis, Université de Grenoble, Grenoble, France, October 2013.
[Hayashi and Kamoda, 2013]
Shinichi Hayashi and Norihisa Kamoda. Data location optimization method for improvement performance for tiered storage. IEEJ Transactions on Electronics, Information and Systems, 133(10):1989–1997, October 2013. (doi:10.1541/ieejeiss.133.1989)
[Al Assaf et al., 2013]
Maen M. Al Assaf, Xunfei Jiang, Mohamed Riduan Abid, and Xiao Qin. Eco-Storage: A hybrid storage system with energy-efficient informed prefetching. Journal of Signal Processing Systems, 73(3):165–180, September 2013.
[Hachiya et al., 2013]
Shogo Hachiya, Koh Johguchi, Kousuke Miyaji, and Ken Takeuchi. TLC/MLC NAND flash mix-and-match design with exchangeable storage array. In 2013 International Conference on Solid State Devices and Materials, Fukuoka, Japan, September 2013. (doi:10.7567/SSDM.2013.H-3-3)
[Luo et al., 2013]
Tian Luo, Siyuan Ma, Rubao Lee, Xiaodong Zhang, Deng Liu, and Li Zhou. S-CAVE: Effective SSD caching to improve virtual machine storage performance. In Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques (PACT), Edinburgh, UK, September 2013. (doi:10.1109/PACT.2013.6618808)
[Wildani, 2013]
Avani Wildani. The Promise of Data Grouping in Large Scale Storage Systems. PhD thesis, University of California, Santa Cruz, September 2013.
[Chang et al., 2013]
Yuan-Hao Chang, Ming-Chang Yang, Tei-Wei Kuo, and Ren-Hung Hwang. A reliability enhancement design under the flash translation layer for MLC-based flash-memory storage systems. ACM Transactions on Embedded Computing Systems, 13(1), August 2013. (doi:10.1145/2512467)
[Guo et al., 2013]
Xufeng Guo, Jianfeng Tan, and Yuping Wang. PAB: Parallelism-aware buffer management scheme for nand-based SSDs. In Proceedings of the 21st IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, San Francisco, CA, August 2013. (doi:10.1109/MASCOTS.2013.18)
[Sun et al., 2013]
Chao Sun, Kousuke Miyaji, Koh Johguchi, and Ken Takeuchi. SCM capacity and NAND over-provisioning requirements for SCM/NAND flash hybrid enterprise SSD. In Proceedings of the 5th IEEE International Memory Workshop (IMW), Monterey, CA, August 2013. (doi:10.1109/IMW.2013.6582099)
[Thereska et al., 2013]
Eno Thereska, Dinan Gunawardena, James W. Scott, and Richard Harper. Distributed file system. United States Patent 9,384,199, July 5 2013.
[Altiparmak and Tosun, 2013]
Nihat Altiparmak and Ali Şaman Tosun. Generalized optimal response time retrieval of replicated data from storage arrays. ACM Transactions on Storage, 9(2), July 2013. (doi:10.1145/2491472.2491474)
[Ha et al., 2013]
Keonsoo Ha, Jaeyong Jeong, and Jihong Kim. A read-disturb management technique for high-density NAND flash memory. In Proceedings of the 4th Asia-Pacific Workshop on Systems, Singapore, July 2013. (doi:10.1145/2500727.2500743)
[Venkataraman et al., 2013]
Kalyana Sundaram Venkataraman, Tong Zhang, Wenzhe Zhao, Hongbin Sun, and Nanning Zheng. Scheduling algorithms for handling updates in shingled magnetic recording. In Proceedings of the 8th IEEE International Conference on Networking, Architecture, and Storage, pages 205–214, Xi'an, Shaanxi, China, July 2013. IEEE.
[Ge et al., 2013]
Xiongzi Ge, Dan Feng, and David H.C. Du. DiscPOP: Power-aware buffer management for disk accesses. Sustainable Computing: Informatics and Systems, 3(2):58–69, June 2013. (doi:10.1016/j.suscom.2012.03.003)
[He et al., 2013]
Wanhui He, Nong Xiao, Fang Liu, Zhiguang Chen, and Yinjin Fu. DL-Dedupe: Dual-Level Deduplication Scheme for Flash-Based SSDs, volume 7901 of Lecture Notes in Computer Science, pages 4–15. Springer-Verlag, Berlin, Germany, June 2013.
[Hu et al., 2013]
Yang Hu, Hong Jiang, Dan Feng, Lei Tian, Hao Luo, and Chao Ren. Exploring and exploiting the multilevel parallelism inside SSDs for improved performance and endurance. IEEE Transactions on Computers, 62(6):1141–1155, June 2013. (doi:10.1109/TC.2012.60)
[Sansottera et al., 2013]
Andrea Sansottera, Giuliano Casale, and Paolo Cremonesi. Fitting second-order acyclic marked Markovian arrival processes. In 2013 43rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks, pages 1–12, Budapest, Hungary, June 2013. (doi:10.1109/DSN.2013.6575347)
[Wei et al., 2013]
Qingsong Wei, Lingfang Zeng, Jianxi Chen, and Cheng Chen. A popularity-aware buffer management to improve buffer hit ratio and write sequentiality for solid-state drive. IEEE Transactions on Magnetics, 49(6):2786–2793, June 2013. (doi:10.1109/TMAG.2013.2249579)
[Yang et al., 2013a]
Jingpei Yang, Ned Plasson, Greg Gillis, Nisha Talagala, Swaminathan Sundararaman, and Robert Wood. HEC: Improving endurance of high performance flash-based cache devices. In Proceedings of the 6th ACM International Systems and Storage Conference (SYSTOR), Haifa, Israel, June 2013. (doi:10.1145/2485732.2485743)
[Yang et al., 2013b]
Ming-Chang Yang, Yuan-Hao Chang, Che-Wei Tsao, and Po-Chun Huang. New ERA: New efficient reliability-aware wear leveling for endurance enhancement of flash storage devices. In Proceedings of the 50th ACM/EDAC/IEEE Design Automation Conference, Austin, TX, June 2013. IEEE.
[Abdurrab et al., 2013]
Abdul R. Abdurrab, Tao Xie, and Wei Wang. DLOOP: A flash translation layer exploiting plane-level parallelism. In Proceedings of the 27th IEEE International Symposium on Parallel & Distributed Processing, pages 908–918, Boston, MA, May 2013. IEEE.
[Prabhakar et al., 2013]
Ramya Prabhakar, Mahmut Kandemir, and Myoungsoo Jung. Disk-cache and parallelism aware I/O scheduling to improve storage system performance. In Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Proceessing, Boston, MA, May 2013. (doi:10.1109/IPDPS.2013.59)
[Prada et al., 2013]
Laura Prada, Alejandro Calderón, Javier Garcia, J. Daniel, and Jesús Carretero. A novel black-box simulation model methodology for predicting performance and energy consumption in commodity storage devices. Simulation Modelling Practice and Theory, 34:48–63, May 2013. (doi:10.1016/j.simpat.2013.01.006)
[Qin et al., 2013]
Yi Qin, Dan Feng, Wei Tong, Jingning Liu, Yang Hu, and Zhiming Zhu. Per-file secure deletion combining with enhanced reliability for SSDs. In Proceedings of the 8th International Conference on Grid and Pervasive Computing (GPC), volume 7861 of Lecture Notes in Computer Science, pages 509–516, Seoul, South Korea, May 2013. Springer-Verlag. (doi:10.1007/978-3-642-38027-3_54)
[Talwadker and Voruganti, 2013]
Rukma Talwadker and Kaladhar Voruganti. Paragone: What's next in block I/O trace modeling. In Proceedings of the 29th IEEE Symposium on Mass Storage Systems and Technologies, Long Beach, CA, May 2013. (doi:10.1109/MSST.2013.6558436)
[Zhang et al., 2013]
Yupu Zhang, Daniel S. Myers, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau. Zettabyte reliability with flexible end-to-end data integrity. In Proceedings of the 29th IEEE Symposium on Mass Storage Systems and Technologies, Long Beach, CA, May 2013. (doi:10.1109/MSST.2013.6558423)
[Budilovsky, 2013]
Evgeny Budilovsky. Kernel based mechanisms for high performance I/O. Master's thesis, Tel Aviv University, April 2013.
[Khanafer et al., 2013]
Ali Khanafer, Murali Kodialam, and Krishna P. N. Puttaswamy. The constrained ski-rental problem and its application to online cloud cost optimization. In Proceedings of the IEEE International Conference on Computer Communications, Turin, Italy, April 2013. IEEE. (doi:10.1109/INFCOM.2013.6566944)
[Park et al., 2013]
Dongchul Park, Young Jin Nam, Biplob Debnath, David H. C. Du, Youngkyun Kim, and Youngchul Kim. An on-line hot data identification for flash-based storage using sampling mechanism. ACM SIGAPP Applied Computing Review, 13(1):51–64, March 2013. (doi:10.1145/2460136.2460141)
[Thereska et al., 2013]
Eno Thereska, Austin Donnelly, and Dushyanth Naraynanan. Reducing power consumption of distributed storage systems. United States Patent 8,370,672, February 5 2013.
[Lu et al., 2013]
Youyou Lu, Jiwu Shu, and Weimin Zheng. Extending the lifetime of flash-based storage through reducing write amplification from file systems. In Proceedings of the 11th USENIX Conference on File and Storage Technologies, pages 257–270, San Jose, CA, February 2013. USENIX Association.
[Shin et al., 2013]
Ji-Yong Shin, Mahesh Balakrishnan, Tudor Marian, and Hakim Weatherspoon. Gecko: Contention-oblivious disk arrays for cloud storage. In Proceedings of the 11th USENIX Conference on File and Storage Technologies, San Jose, CA, February 2013.
[Chen et al., 2013]
Zhi-Guang Chen, Nong Xiao, Fang Liu, and Yi-Mo Du. Reorder write sequence by hetero-buffer to extend SSD's lifespan. Journal of Computer Science and Technoogy, 28(1):14–27, January 2013.
[Brook et al., 2012]
Tony Brook, Mason Cabot, Frank Hady, and Matthew Shopsin. Measuring and improving single-user NAS performance. Dongseo University Fall 2012 Course, 2012.
[Chao and Fang, 2012]
Liu Chao and Hu Fang. Research on reliability design of data storage for embedded system. Physics Procedia, 25:1405–1408, 2012. (doi:10.1016/j.phpro.2012.03.253)
[Dai et al., 2012]
Chengjun Dai, Guiquan Liu, Lei Zhang, and Enhong Chen. Storage device performance prediction with hybrid regression models. In Proceedings of the 13th International Conference on Parallel and Distributed Computing, Applications and Technologies, Beijing, China, December 2012.
[Fu et al., 2012]
Yinjin Fu, Hong Jiang, and Nong Xiao. A scalable inline cluster deduplication framework for big data protection. In Proceedings of the 13th ACM/IFIP/Usenix International Middleware Conference, Montreal, Quebec, Canada, December 2012. USENIX Association.
[Jung et al., 2012]
Myoungsoo Jung, Ramya Prabhakar, and Mahmut Taylan Kandemir. Taking garbage collection overheads off the critical path in SSDs. In Proceedings of the 13th ACM/IFIP/Usenix International Middleware Conference, volume 7662 of Lecture Notes in Computer Science, pages 164–186, Montreal, Quebec, Canada, December 2012. Springer-Verlag. (doi:10.1007/978-3-642-35170-9_9)
[Xu et al., 2012]
Zhiyong Xu, Ruixuan Li, and Cheng-Zhong Xu. CAST: A page-level FTL with compact address mapping and parallel data blocks. In Proceedings of the 29th IEEE International Performance, Computing, and Communications Conference, Austin, TX, December 2012. (doi:10.1109/PCCC.2012.6407747)
[Wang et al., 2012]
Mingbang Wang, Youguang Zhang, and Wang Kang. ZFTL: A zone-based flash translation layer with a two-tier selective caching mechanism. In Proceedings of the 14th IEEE International Conference on Communication Technology, Chengdu, China, November 2012. (doi:10.1109/ICCT.2012.6511426)
[Arumugam et al., 2012]
Rajesh Vellore Arumugam, Chuan Heng Foh, Haixiang Shi, and Kyawt Kyawt Khaing. HCache: A hybrid cache management scheme with flash and next generation NVRAM. In Proceedings of the Asia-Pacific Magnetic Recording Conference (APMRC), Singapore, October 2012.
[yu Liu, 2012]
Shih yu Liu. An efficient data management method for SSD-based storage systems. Master's thesis, National Taiwan University of Science and Technology, Taipei, Taiwan, October 2012.

Displaying items 651–700 of 770 in total
Showing citations per page


Member Links