Resources

NOTE: Neither the IOTTA TWG nor SNIA vouch for the accuracy or reliability of any of the traces or other information provided below. Please contact us regarding any broken or inaccurate links.

Jump To:


Tools and Documentation

Microsoft Event Tracing (1) (2)

Stonybrook University Dataseries Documentation Wiki

Storage Research List (Computer Storage Systems Research Discussion Forum)

Traces and Snapshots Public Archive

Re-Animator tracing and replay tool


Storage Conferences

This is a non-exhaustive list of conferences relating to storage and data management. Note that some of these websites do not have stable domains, so please contact us if a link is broken.

ATC
The USENIX Annual Technical Conference (ATC). Hosted every summer.

EuroSys
EuroSys is organized by EuroSys, the European Chapter of SIGOPS, sponsored by ACM SIGOPS. Hosted annually in mid-spring. This conference does not have a stable URL, so we have linked to a Google search.

FAST
The USENIX File and Storage Technologies (FAST) conference. Hosted annually in February.

HotStorage
The USENIX Workshop on Hot Topics in Storage and File Systems. Hosted every summer directly before ATC.

ICDCS
The IEEE International Conference on Distributed Computing Systems (ICDCS). This conference does not have a stable URL so we have linked to a Google search.

ICS
The ACM International Conference on Supercomputing (ICS). Hosted every summer.

MSST
The International Conference on Massive Storage Systems and Technology (MSST). Hosted every summer at the Santa Clara University School of Engineering in Santa Clara, CA.

NAS
The IEEE International Conference on Networking, Architecture, and Storage. Hosted annually.

NVMSA
The IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA). Hosted annually in the late summer. This conference does not have a stable URL, so we have linked to a Google search.

OSDI
The USENIX Symposium on Operating Systems Design and Implementation (OSDI). Hosted annually.

SIGMETRICS
The ACM Special Interest Group for the computer systems performance evaluation community. Hosted annually in June.

SIGOPS
The ACM Special Interest Group in Operating Systems. Hosts a number of conferences annually.

SoCC
The ACM Symposium on Cloud Computing (SoCC). Hosted annually. This conference does not have a stable URL, so we have linked to a Google search.

SOSP
The ACM Symposium on Operating Systems Principles (SOSP). Hosted annually.

Supercomputing
The International Conference for High Performance Computing, Networking, Storage, and Analysis. Hosted annually in late fall.

SYSTOR
The ACM International Systems and Storage Conference (SYSTOR). Hosted annually in Haifa, Israel.

VLDB
The Very Large Data Bases (VLDB) Conference. Hosted annually in late August.


Storage Research Centers

Carnegie Mellon University
Parallel Data Lab (PDL)

San Diego Supercomputer Center (SDSC)

University of Minnesota
Digital Technology Center (DTC)
Intelligent Storage Consortium (DISC)

Storage Performance Council (SPC)


Papers and Publications

Papers Relating to Traces

[Harter11] Tyler Harter, Chris Dragga, Michael Vaughn, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau.
A File is Not a File: Understanding the I/O Behavior of Apple Desktop Applications.
Department of Computer Sciences, University of Wisconsin, Madison. 2011.

[Ellard03b] Daniel Ellard, Margo Seltzer.
NFS Tricks and Benchmarking Traps.
Proceedings of the FREENIX Technical Conference, San Antonio, Texas. June, 2003.

[Ellard03a] Daniel Ellard, Jonathan Ledlie, Pia Malkani, Margo Seltzer.
Passive NFS Tracing of Email and Research Workloads.
Proceedings of the Second Annual USENIX File and Storage Technologies Conference, pp. 203-216, San Francisco, CA. March, 2003.

[Roselli00] Drew Roselli, Jacob R. Lorch, Thomas E. Anderson.
A Comparison of File System Workloads.
Proceedings of the 2000 USENIX Technical Conference, pp. 44 - 54. San Diego, CA. June, 2000.

[Vogels99] Werner Vogels.
File system usage in Windows NT 4.0.
Proceedings of the 17th Symposium on Operating System Principles, pp. 93 - 109. Kiawah Island Resort, SC. December, 1999.

[Douceur99] John R. Douceur, William J. Bolosky.
A Large-Scale Study of File-System Contents.
Proceedings of SIGMETRICS '99, pp. 59 - 70. Atlanta, GA. May, 1999.

[Kuenning97] Geoffrey H. Kuenning and Gerald J. Popek.
Automated Hoarding for Mobile Computers.
Proceedings of the 16th ACM Symposium on Operating Systems Principles, St. Malo, France, October 5-8, 1997.

[Uysal97] Mustafa Uysal, Anurag Acharya, Joel Saltz.
Requirements of I/O Systems for Parallel Machines: An Application-driven Study.
Technical Report, CS-TR-3802, University of Maryland, College Park. May 1997.

[Mummert96] L. Mummert, M. Satyanarayanan.
Long Term Distributed File Reference Tracing: Implementation and Experience.
Software - Practice and Experience, Vol. 26, No. 6, pp. 705 - 736. June, 1996.

[Blackwell95] Trevor Blackwell, Jeffrey Harris, Margo Seltzer.
Heuristic Cleaning Algorithms in Log-Structured File Systems.
Proceedings of the 1995 USENIX Technical Conference, pp. 277 - 288. New Orleans, LA. January, 1995.

[Griffioen94] Jim Griffioen, Randy Appleton.
Reducing File System Latency using a Predictive Approach.
Proceedings of the Summer 1994 USENIX Technical Conference, pp. 197 - 207. Boston, MA. June, 1994.

[Chiang93] Chi-ming Chiang, Matt W. Mutka.
Characteristics of User File Usage Patterns.
Systems and Software, Vol. 23, No. 3, pp. 257 - 268. December, 1993.

[Ruemmler93] Chris Ruemmler, John Wilkes.
UNIX Disk Access Patterns.
Proceedings of the Winter 1993 USENIX Technical Conference, pp. 405 - 420. San Diego, CA. January, 1993.

[Ramakrishnan92] K.K. Ramakrishnan, Prabuddha Biswas, Ramakrishna Karedla.
Analysis of File I/O Traces in Commercial Computing Environments.
Proceedings of SIGMETRICS '92, pp. 78 - 90. Newport, RI. June, 1992.

[Roselli98] Drew Roselli, Thomas E. Anderson.
Characteristics of File System Workloads.
University of California Berkeley Computer Science Division Technical Report UCB//CSD-98-1029. 1992.

[Shirriff92] Ken Shirriff, John K. Ousterhout.
A Trace-Driven Analysis of Name and Attribute Caching in a Distributed System.
Proceedings of the Winter 1992 USENIX Technical Conference, pp. 315 - 332. San Francisco, CA. January, 1992.

[Miller91] Ethan L. Miller, Randy H. Katz.
Input/Output Behavior of Supercomputing Applications.
Proceedings of the 1991 Conference on Supercomputing, pp. 567 - 576. Albuquerque, NM. November, 1991.

[Baker91] M. Baker, J. Hartman, M. Kupfer, K. Shirriff, and J. Ousterhout.
Measurements of a Distributed File System.
Proceedings of the 13th ACM Symposium of Operating Systems Principles, pp. 198 - 212. October 1991.

[Bozman91] G.P. Bozman, H.H. Ghannad, E.D. Weinberger.
A trace-driven study of CMS file references.
IBM Journal of Research and Development, Vol. 35, No. 5/6, pp. 815 - 828. September/November, 1991.

[Bennet91] J. Michael Bennet, Michael A. Bauer, David Kinchlea.
Characteristics of Files in NFS Environments.
Proceedings of the 1991 ACM Symposium on Small Systems, pp. 33 - 40. 1991.

[Biswas90] P. Biswas, K.K. Ramakrishnan.
File Access Characterization of VAX/VMS Environments.
Proceedings of the 10th International Conference on Distributed Computing Systems, pp. 227 - 234. Paris, France. May, 1990.

[Floyd86] Rick Floyd.
Short-Term File Reference Patterns in a UNIX Environment.
University of Rochester Computer Science Technical Report #177. March, 1986.

[Ousterhout85] J. Ousterhout, H. Costa, D. Harrison, J. Kunze, M. Kupfer, J. Thompson.
A Trace-Driven Analysis of the UNIX 4.2BSD File System.
Proceedings of the 10th Symposium on Operating System Principles, pp. 15 - 24. Orcas Island, WA. December, 1985.

[Satyanarayanan81] M. Satyanarayanan.
A Study of File Sizes and Functional Lifetimes.
Proceedings of the 8th Symposium on Operating System Principles, pp. 96 - 108. Pacific Grove, CA. December, 1981.

[Smith81] A. J. Smith.
Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms.
IEEE Transactions on Software Engineering, Vol SE-7, No. 4, pp. 403 - 417. July, 1981.

Publications That Cite iotta.snia.org

The following publications cite iotta.snia.org as a source of trace data used in their research. They are organized in reverse chronological order. This list attempts to be comprehensive but is not complete; feel free to contact us to suggest additional entries.


[Jaffer et al., 2022]
Shehbaz Jaffer, Kaveh Mahdaviani, and Bianca Schroeder. Improving the reliability of next generation SSDs using WOM-v codes. In Proceedings of the 20th USENIX Conference on File and Storage Technologies, pages 117–132, Santa Clara, CA, February 2022. USENIX Association.
[Ahmadian et al., 2021]
Saba Ahmadian, Reza Salkhordeh, Onur Mutlu, and Hossein Asadi. ETICA: Efficient two-level I/O caching architecture for virtualized platforms. IEEE Transactions on Parallel and Distributed Systems, 32(10):2415–2433, 2021. (doi:10.1109/TPDS.2021.3066308)
[Ai et al., 2021]
Liang Ai, Yuhui Deng, Yi Zhou, and Hao Feng. RUE: A caching method for identifying and managing hot data by leveraging resource utilization efficiency. Software—Practice and Experience, 51(11):2252–2273, 2021. (doi:https://doi.org/10.1002/spe.2963)
[Cai et al., 2021]
Zhigang Cai, Lihao Song, and Xiaoning Peng. Efficient caching on parity chunks in RAID-enabled SSDs. IEICE Electronics Express, 18(7):20210061–20210061, 2021. (doi:10.1587/elex.18.20210061)
[Chakraborttii and Litz, 2021a]
Chandranil Chakraborttii and Heiner Litz. Learning I/O access patterns to improve prefetching in SSDs. In Yuxiao Dong, Dunja Mladenić, and Craig Saunders, editors, Proceedings of the Machine Learning and Knowledge Discovery in Databases: Applied Data Science Track, pages 427–443, Bilbao, Spain, 2021. Springer International Publishing.
[Chakraborttii and Litz, 2021b]
Chandranil Chakraborttii and Heiner Litz. Reducing write amplification in flash by death-time prediction of logical block addresses. In Proceedings of the 14th ACM International Systems and Storage Conference (SYSTOR), SYSTOR '21, Haifa, Israel, 2021. ACM. (doi:10.1145/3456727.3463784)
[Chen et al., 2021]
Ping-Xiang Chen, Shuo-Han Chen, Yuan-Hao Chang, Yu-Pei Liang, and Wei-Kuan Shih. Facilitating the efficiency of secure file data and metadata deletion on SMR-based Ext4 file system. In Proceedings of the 26thAsia and South Pacific Design Automation Conference, ASPDAC '21, page 728–733, New York, 2021. ACM. (doi:10.1145/3394885.3431517)
[Du and Li, 2021]
Xiaoming Du and Cong Li. SHARC: Improving adaptive replacement cache with shadow recency cache management. In Proceedings of the 22nd International Middleware Conference, pages 119–131, 2021.
[Fareed et al., 2021]
Imran Fareed, Mincheol Kang, Wonyoung Lee, and Soontae Kim. Update frequency-directed subpage management for mitigating garbage collection and DRAM overheads. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 40(12):2467–2480, 2021.
[Gao et al., 2021]
Yuanning Gao, Xiaofeng Gao, Ruisi Zhang, and Guihai Chen. An end-to-end learning-based metadata management approach for distributed file systems. IEEE Transactions on Computers, 71(5):1021–1034, 2021.
[Jia et al., 2021]
Danlin Jia, Tengpeng Li, Xiaoqian Zhang, Li Wang, Mahsa Bayati, Ron Lee, Bo Sheng, and Ningfang Mi. SNIS: Storage-network iterative simulation for disaggregated storage systems. In Proceedings of the 2021 IEEE International Performance, Computing, and Communications Conference, pages 1–6. IEEE, 2021.
[Kachmar, 2021]
Maher Amine Kachmar. Active Resource Partitioning and Planning for Storage Systems Using Time Series Forecasting and Machine Learning Techniques. PhD thesis, Northeastern University, 2021.
[Kachmar and Kaeli, 2021]
Maher Kachmar and David Kaeli. CALC: A content-aware learning cache for storage systems. In Proceedings of the 2021 IEEE International Conference on Networking, Architecture, and Storage, pages 1–8. IEEE, 2021.
[Kim, 2021]
Jung-Hoon Kim. An FTL-aware host system alleviating severe long latency of NAND flash-based storage. In Proceedings of the 27th IEEE International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA), pages 189–194. IEEE, 2021.
[Kurniawan et al., 2021]
Daniar H. Kurniawan, Levent Toksoz, Mingzhe Hao, Anirudh Badam, Tim Emami, Sandeep Madireddy, Robert B. Ross, Henry Hoffmann, and Haryadi S. Gunawi. IONET: Towards an open machine learning training ground for I/O performance prediction. Technical report, University of Chicago, 2021.
[Lange et al., 2021]
Tomer Lange, Joseph Naor, and Gala Yadgar. Offline and online algorithms for SSD management. Proceedings of the ACM Conference on Measurement and Analysis of Computing Systems, 5(3):1–28, 2021.
[Li, 2021]
Pingguo Li. A hotness-aware write buffer management scheme for the lifetime extension of flash-based solid state drives. In International Multiconference of Engineers and Scientists, Hong Kong, 2021.
[Li et al., 2021a]
Huaicheng Li, Martin L Putra, Ronald Shi, Xing Lin, Gregory R. Ganger, and Haryadi S. Gunawi. lODA: A host/device co-design for strong predictability contract on modern flash storage. In Proceedings of the 28th ACM Symposium on Operating Systems Principles, pages 263–279, 2021.
[Li et al., 2021b]
Jun Li, Minjun Li, Zhigang Cai, François Trahay, Mohamed Wahib, Balazs Gerofi, Zhiming Liu, Min Huang, and Jianwei Liao. Intra-page cache update in SLC-mode with partial programming in high density SSDs. In Proceedings of the 50th International Conference on Parallel Processing, pages 1–10, 2021.
[Liu et al., 2021]
Wenguo Liu, Hui Li, and Lingfang Zeng. STAR: A zone translation scheme to improve the performance of host-aware SMR. In Proceedings of the 6th IEEE International Conference on Big Data Analytics (ICBDA), pages 267–271. IEEE, 2021. (doi:10.1109/ICBDA51983.2021.9402952)
[Lu et al., 2021]
Mengting Lu, Fang Wang, Zongwei Li, and Wenpeng He. EDC: An elastic data cache to optimizing the I/O performance in deduplicated SSDs. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2021.
[Ma et al., 2021]
Chenlin Ma, Zhuokai Zhou, Lei Han, Zhaoyan Shen, Yi Wang, Renhai Chen, and Zili Shao. Rebirth-FTL: Lifetime optimization via approximate storage for NAND flash memory. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2021.
[Maruf et al., 2021]
Adnan Maruf, Zhengyu Yang, Bridget Davis, Daniel Kim, Jeffrey Wong, Matthew Durand, and Janki Bhimani. Understanding flash-based storage I/O behavior of games. In Proceedings of the 14th IEEE International Conference on Cloud Computing (CLOUD), pages 521–526. IEEE, 2021.
[Miranda et al., 2021]
Mariana Miranda, Tânia Esteves, Bernardo Portela, and João Paulo. S2Dedup: SGX-enabled secure deduplication. In Proceedings of the 14th ACM International Systems and Storage Conference (SYSTOR), Haifa, Israel, 2021. ACM. (doi:10.1145/3456727.3463773)
[Nachman et al., 2021]
Aviv Nachman, Sarai Sheinvald, Ariel Kolikant, and Gala Yadgar. GoSeed: Optimal seeding plan for deduplicated storage. ACM Transactions on Storage, 17(3):1–28, 2021.
[Oe and Nanri, 2021]
Kazuichi Oe and Takeshi Nanri. Proposal and evaluation of IO concentration-aware mechanisms to improve efficiency of hybrid storage systems. IEICE Transactions on Information and Systems, 104(12):2109–2120, 2021.
[Pan et al., 2021]
Cheng Pan, Xiaolin Wang, Yingwei Luo, and Zhenlin Wang. Penalty- and locality-aware memory allocation in redis using enhanced AET. ACM Transactions on Storage, 17(2):1–45, 2021.
[Phyu and Sinha, 2021]
Myat Pwint Phyu and GR Sinha. Efficient data deduplication scheme for scale-out distributed storage. In Data Deduplication Approaches, pages 153–182. Elsevier Science Publishing Co., Inc., 2021.
[Rodriguez et al., 2021]
Liana V. Rodriguez, Alexis Gonzalez, Pratik Poudel, Raju Rangaswami, and Jason Liu. Unifying the data center caching layer: Feasible? profitable? In Proceedings of the 13th ACM Workshop on Hot Topics in Storage and File Systems, pages 50––57, Virtual, 2021. ACM. (doi:10.1145/3465332.3470884)
[Roy et al., 2021]
Tanaya Roy, Jit Gupta, Krishna Kant, Amitangshu Pal, Dave Minturn, and Arash Tavakkol. PLMC: A predictable tail latency mode coordinator for shared nvme SSD with multiple hosts. In Proceedings of the 2021 IEEE International Conference on Networking, Architecture, and Storage, pages 1–6. IEEE, 2021.
[Russo et al., 2021]
Gabriele Russo Russo, Valeria Cardellini, Giuliano Casale, and Francesco Lo Presti. MEAD: Model-based vertical auto-scaling for data stream processing. In Proceedings of the 21st IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), pages 314–323, 2021. (doi:10.1109/CCGrid51090.2021.00041)
[Shen et al., 2021]
Zhaoyan Shen, Lei Han, Chenlin Ma, Zhiping Jia, Tao Li, and Zili Shao. Leveraging the interplay of RAID and SSD for lifetime optimization of flash-based SSD RAID. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 40(7):1395–1408, 2021. (doi:10.1109/TCAD.2020.3020495)
[Tzouros and Kalogeraki, 2021]
Giannis Tzouros and Vana Kalogeraki. Preserving data availability in edge computing systems with diagonally interleaved coding. In Proceedings of the 24th Pan-Hellenic Conference on Informatics, PCI '20, page 87–90. ACM, 2021. (doi:10.1145/3437120.3437281)
[Wang et al., 2021]
Yi Wang, Jiangfan Huang, Jing Chen, and Rui Mao. PVSensing: A process-variation-aware space allocation strategy for 3d NAND flash memory. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 41(5):1302–1315, 2021.
[Wei et al., 2021]
Bing Wei, Jigang Wu, Xiaosong Su, Qiang Huang, and Yujun Liu. Adaptive updates for erasure-coded storage systems based on data delta and logging. In International Conference on Parallel and Distributed Computing, Applications and Technologies, pages 187–197. Springer, 2021.
[Wu et al., 2021]
Chin-Hsien Wu, I-Hung Li, and Jian-Jia Chen. A supervised-learning-based garbage collection in solid-state drives (SSDs). IT Professional, 23(6):39–45, 2021.
[Yang, 2021]
Junyao Yang. Efficient Modeling of Random Sampling-Based LRU Cache. PhD thesis, Michigan Technological University, 2021.
[Zhou et al., 2021]
Hai Zhou, Dan Feng, and Yuchong Hu. Multi-level forwarding and scheduling repair technique in heterogeneous network for erasure-coded clusters. In Proceedings of the 50th International Conference on Parallel Processing, pages 1–11, 2021.
[심석보, 2021]
심석보. PCRAM controller 의 hardware prefetcher 를 위한 data buffer 최적화. PhD thesis, 서울대학교 대학원, 2021.
[Guo et al., 2021]
Hanchen Guo, Zhehan Lin, Yunfei Gu, Chentao Wu, Li Jiang, Jie Li, Guangtao Xue, and Minyi Guo. Lazy-WL: A wear-aware load balanced data redistribution method for efficient SSD array scaling. In Proceedings of the IEEE International Conference on Cluster Computing (CLUSTER), pages 157–168, Melbourne, Australia, September 2021. IEEE. (doi:10.1109/Cluster48925.2021.00030)

Displaying items 121–160 of 770 in total
Showing citations per page


Member Links