Static Snapshots
Static Snapshots are traces taken statically of a file system rather than of system calls.
The following traces are free to download under the terms of the
SNIA Trace Data Files Download License.
Please note that cookies must be enabled within your browser in order to
download traces.
For questions about downloading using a shell script,
see Using Shell Scripts,
and for more information about downloading using a Windows
batch script, see
Using Batch Scripts.
Trace Name | Details | Related Tools | Year Recorded | Timespan | Record Count | File Size | Actions |
---|---|---|---|---|---|---|---|
Docker Registry Traces |
Traces obtained by collecting long-span production-level traces from five datacenters in IBM Cloud container registry service during 2017. The traces and trace player tool are further described in the README.
|
The traces can be replayed using the Docker Registry Trace Player tool. | 2017 | 3 months | 40.9 Million | 1.55 GB |
|
FSL-Dedup Traces |
Traces related to the paper
Generating realistic datasets for deduplication analysis by Vasily Tarasov, Amar Mudrankit, Will Bulk, Philip Shilane, Geoff Kuenning, Erez Zadok, appearing in
Proceedings of the 2012 USENIX conference on Annual Technical Conference
(USENIX ATC '12).
|
The traces can be replayed using the FS-Hasher tool. | 2011-2016 | almost 5 years | 708 Billion | 5.14 TB |
|
View Historical Traces
WARNING: These traces are over 10 years old! They should not be used for modern research!
The following traces are free to download under the terms of the
SNIA Trace Data Files Download License.
Please note that cookies must be enabled within your browser in order to
download traces.
For questions about downloading using a shell script,
see Using Shell Scripts,
and for more information about downloading using a Windows
batch script, see
Using Batch Scripts.
Trace Name | Details | Actions | |||||
---|---|---|---|---|---|---|---|
UBC-Dedup |
Traces collected for the paper "A Study of Practical Deduplication" by Dutch T. Meyer and William J. Bolosky of The University of British Columbia and Microsoft Research
|
2009 | 4 months | 3.4 TB |
|
||
Multimedia file sizes |
File-size data collected for the paper "A Study of Irregularities in File-Size Distributions".
|
This trace has no related tools yet. | 2001 | 21 days | 17.1 MB |
|
|
Microsoft Longitudinal Study |
Because of the size of the traces, they have been repackaged by IOTTA into 22 zip files. Each zip files contains information from the original snapshot, directory, and file info files, repackaged by user. More information on how the files were repackaged can be found in the readme-iotta.txt file found in each zip file.
|
2000-2004 | over 4 years | 5 Billion | 91.1 GB |
|
|
Microsoft 1998 Static Study |
Static analysis of 10,568 file systems on 4801 workstations at Microsoft. This data formed the basis for the paper "A Large-Scale Study of File-System Contents" by John R. Douceur and William J. Bolosky, published in ACM SIGMETRICS 1999.
|
1998 | 8 days | 153 Million | 3.73 GB |
|
|
Plan 9 Traces |
This is a time series set of "snapshots" of the contents of the Plan 9 file servers at Bell Labs. One snapshot per day for over ten years. The snapshots were taken on two different machines: bootes and emelie. See the Venti paper for more information.
|
1990-2001 | about 11 years | 0 Bytes |
|