Resources

NOTE: Neither the IOTTA TWG nor SNIA vouches for the accuracy or reliability of any of the traces or other information provided below. Please contact us regarding any broken or inaccurate links.


Tools and Documentation

Microsoft Event Tracing (1) (2)

Stony Brook University DataSeries Documentation Wiki

Storage Research List (Computer Storage Systems Research Discussion Forum)

Traces and Snapshots Public Archive

Re-Animator tracing and replay tool


Storage Conferences

This is a non-exhaustive list of conferences relating to storage and data management. Note that some of these websites do not have stable domains, so please contact us if a link is broken.

ATC
The USENIX Annual Technical Conference (ATC). Hosted every summer.

EuroSys
The European Conference on Computer Systems (EuroSys), organized by the European Chapter of SIGOPS and sponsored by ACM SIGOPS. Hosted annually in mid-spring. This conference does not have a stable URL, so we have linked to a Google search.

FAST
The USENIX File and Storage Technologies (FAST) conference. Hosted annually in February.

HotStorage
The USENIX Workshop on Hot Topics in Storage and File Systems. Hosted every summer directly before ATC.

ICDCS
The IEEE International Conference on Distributed Computing Systems (ICDCS). This conference does not have a stable URL, so we have linked to a Google search.

ICS
The ACM International Conference on Supercomputing (ICS). Hosted every summer.

MSST
The International Conference on Massive Storage Systems and Technology (MSST). Hosted every summer at the Santa Clara University School of Engineering in Santa Clara, CA.

NAS
The IEEE International Conference on Networking, Architecture, and Storage. Hosted annually.

NVMSA
The IEEE Non-Volatile Memory Systems and Applications Symposium (NVMSA). Hosted annually in the late summer. This conference does not have a stable URL, so we have linked to a Google search.

OSDI
The USENIX Symposium on Operating Systems Design and Implementation (OSDI). Hosted annually.

SIGMETRICS
The conference of ACM SIGMETRICS, the Special Interest Group for the computer systems performance evaluation community. Hosted annually in June.

SIGOPS
The ACM Special Interest Group in Operating Systems. Hosts a number of conferences annually.

SoCC
The ACM Symposium on Cloud Computing (SoCC). Hosted annually. This conference does not have a stable URL, so we have linked to a Google search.

SOSP
The ACM Symposium on Operating Systems Principles (SOSP). Hosted annually.

Supercomputing
The International Conference for High Performance Computing, Networking, Storage, and Analysis. Hosted annually in late fall.

SYSTOR
The ACM International Systems and Storage Conference (SYSTOR). Hosted annually in Haifa, Israel.

VLDB
The Very Large Data Bases (VLDB) Conference. Hosted annually in late August.


Storage Research Centers

Carnegie Mellon University
Parallel Data Lab (PDL)

San Diego Supercomputer Center (SDSC)

University of Minnesota
Digital Technology Center (DTC)
Intelligent Storage Consortium (DISC)

Storage Performance Council (SPC)


Papers and Publications

Papers Relating to Traces

[Harter11] Tyler Harter, Chris Dragga, Michael Vaughn, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau.
A File is Not a File: Understanding the I/O Behavior of Apple Desktop Applications.
Department of Computer Sciences, University of Wisconsin, Madison. 2011.

[Ellard03b] Daniel Ellard, Margo Seltzer.
NFS Tricks and Benchmarking Traps.
Proceedings of the FREENIX Technical Conference, San Antonio, Texas. June, 2003.

[Ellard03a] Daniel Ellard, Jonathan Ledlie, Pia Malkani, Margo Seltzer.
Passive NFS Tracing of Email and Research Workloads.
Proceedings of the Second Annual USENIX File and Storage Technologies Conference, pp. 203-216, San Francisco, CA. March, 2003.

[Roselli00] Drew Roselli, Jacob R. Lorch, Thomas E. Anderson.
A Comparison of File System Workloads.
Proceedings of the 2000 USENIX Technical Conference, pp. 44 - 54. San Diego, CA. June, 2000.

[Vogels99] Werner Vogels.
File system usage in Windows NT 4.0.
Proceedings of the 17th Symposium on Operating System Principles, pp. 93 - 109. Kiawah Island Resort, SC. December, 1999.

[Douceur99] John R. Douceur, William J. Bolosky.
A Large-Scale Study of File-System Contents.
Proceedings of SIGMETRICS '99, pp. 59 - 70. Atlanta, GA. May, 1999.

[Kuenning97] Geoffrey H. Kuenning and Gerald J. Popek.
Automated Hoarding for Mobile Computers.
Proceedings of the 16th ACM Symposium on Operating Systems Principles, St. Malo, France, October 5-8, 1997.

[Uysal97] Mustafa Uysal, Anurag Acharya, Joel Saltz.
Requirements of I/O Systems for Parallel Machines: An Application-driven Study.
Technical Report, CS-TR-3802, University of Maryland, College Park. May 1997.

[Mummert96] L. Mummert, M. Satyanarayanan.
Long Term Distributed File Reference Tracing: Implementation and Experience.
Software - Practice and Experience, Vol. 26, No. 6, pp. 705 - 736. June, 1996.

[Blackwell95] Trevor Blackwell, Jeffrey Harris, Margo Seltzer.
Heuristic Cleaning Algorithms in Log-Structured File Systems.
Proceedings of the 1995 USENIX Technical Conference, pp. 277 - 288. New Orleans, LA. January, 1995.

[Griffioen94] Jim Griffioen, Randy Appleton.
Reducing File System Latency using a Predictive Approach.
Proceedings of the Summer 1994 USENIX Technical Conference, pp. 197 - 207. Boston, MA. June, 1994.

[Chiang93] Chi-ming Chiang, Matt W. Mutka.
Characteristics of User File Usage Patterns.
Journal of Systems and Software, Vol. 23, No. 3, pp. 257 - 268. December, 1993.

[Ruemmler93] Chris Ruemmler, John Wilkes.
UNIX Disk Access Patterns.
Proceedings of the Winter 1993 USENIX Technical Conference, pp. 405 - 420. San Diego, CA. January, 1993.

[Ramakrishnan92] K.K. Ramakrishnan, Prabuddha Biswas, Ramakrishna Karedla.
Analysis of File I/O Traces in Commercial Computing Environments.
Proceedings of SIGMETRICS '92, pp. 78 - 90. Newport, RI. June, 1992.

[Roselli98] Drew Roselli, Thomas E. Anderson.
Characteristics of File System Workloads.
University of California Berkeley Computer Science Division Technical Report UCB//CSD-98-1029. 1998.

[Shirriff92] Ken Shirriff, John K. Ousterhout.
A Trace-Driven Analysis of Name and Attribute Caching in a Distributed System.
Proceedings of the Winter 1992 USENIX Technical Conference, pp. 315 - 332. San Francisco, CA. January, 1992.

[Miller91] Ethan L. Miller, Randy H. Katz.
Input/Output Behavior of Supercomputing Applications.
Proceedings of the 1991 Conference on Supercomputing, pp. 567 - 576. Albuquerque, NM. November, 1991.

[Baker91] M. Baker, J. Hartman, M. Kupfer, K. Shirriff, and J. Ousterhout.
Measurements of a Distributed File System.
Proceedings of the 13th ACM Symposium on Operating Systems Principles, pp. 198 - 212. October 1991.

[Bozman91] G.P. Bozman, H.H. Ghannad, E.D. Weinberger.
A trace-driven study of CMS file references.
IBM Journal of Research and Development, Vol. 35, No. 5/6, pp. 815 - 828. September/November, 1991.

[Bennet91] J. Michael Bennet, Michael A. Bauer, David Kinchlea.
Characteristics of Files in NFS Environments.
Proceedings of the 1991 ACM Symposium on Small Systems, pp. 33 - 40. 1991.

[Biswas90] P. Biswas, K.K. Ramakrishnan.
File Access Characterization of VAX/VMS Environments.
Proceedings of the 10th International Conference on Distributed Computing Systems, pp. 227 - 234. Paris, France. May, 1990.

[Floyd86] Rick Floyd.
Short-Term File Reference Patterns in a UNIX Environment.
University of Rochester Computer Science Technical Report #177. March, 1986.

[Ousterhout85] J. Ousterhout, H. Costa, D. Harrison, J. Kunze, M. Kupfer, J. Thompson.
A Trace-Driven Analysis of the UNIX 4.2BSD File System.
Proceedings of the 10th Symposium on Operating System Principles, pp. 15 - 24. Orcas Island, WA. December, 1985.

[Satyanarayanan81] M. Satyanarayanan.
A Study of File Sizes and Functional Lifetimes.
Proceedings of the 8th Symposium on Operating System Principles, pp. 96 - 108. Pacific Grove, CA. December, 1981.

[Smith81] A. J. Smith.
Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms.
IEEE Transactions on Software Engineering, Vol. SE-7, No. 4, pp. 403 - 417. July, 1981.

Publications That Cite iotta.snia.org

The following publications cite iotta.snia.org as a source of trace data used in their research. They are organized in reverse chronological order. This list attempts to be comprehensive but is not complete; feel free to contact us to suggest additional entries.


[Kachmar and Kaeli, 2020]
Maher Kachmar and David R. Kaeli. A smart background scheduler for storage systems. ArXiv, abs/2006.01402, June 2020.
[Kolosov et al., 2020]
Oleg Kolosov, Gala Yadgar, Sumit Maheshwari, and Emina Soljanin. Benchmarking in the dark: On the absence of comprehensive edge datasets. In Proceedings of the 3rd USENIX Workshop on Hot Topics in Edge Computing, Santa Clara, CA, June 2020. USENIX Association.
[Wajahat et al., 2020]
Muhammad Wajahat, Aditya Yele, Tyler Estro, Anshul Gandhi, and Erez Zadok. Analyzing the distribution fit for storage workload and internet traffic traces. Performance Evaluation, 139, June 2020. (doi:10.1016/j.peva.2020.102121)
[Zhang et al., 2020]
Baoquan Zhang, Ming-Hong Yang, Xuchao Xie, and David H.C. Du. Idler: I/O workload controlling for better responsiveness on host-aware shingled magnetic recording drives. IEEE Transactions on Computers, 69(6):777–788, June 2020. (doi:10.1109/TC.2020.2966194)
[Akgun et al., 2020]
Ibrahim Umit Akgun, Geoff Kuenning, and Erez Zadok. Re-Animator: Versatile high-fidelity system-call tracing and replaying. In Proceedings of the 13th ACM International Systems and Storage Conference (SYSTOR), pages 61–74, Haifa, Israel, May 2020.
[Hine et al., 2020]
Yusaku Hine, Reika Kinoshita, Yoshiki Kakuta, and Ken Takeuchi. Data allocation algorithm based on write and read frequency for double asymmetric-latency SCM SSD. In 2020 IEEE International Memory Workshop (IMW), pages 1–4, Dresden, Germany, May 2020. (doi:10.1109/IMW48823.2020.9108151)
[Liu et al., 2020]
Weihua Liu, Fei Wu, Meng Zhang, Chengmo Yang, Zhonghai Lu, Jiguang Wan, and Changsheng Xie. DEPS: Exploiting a dynamic error prechecking scheme to improve the read performance of SSD. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, May 2020. (doi:10.1109/TCAD.2020.2994266)
[Phyu and Thein, 2020]
Myat Pwint Phyu and Thandar Thein. Capacity optimized deduplication for big unstructured data in scale-out distributed storage system. Technical report, University of Information Technology Myanmar, May 2020.
[Ye et al., 2020]
Liuqing Ye, Dan Feng, Yuchong Hu, Xueliang Wei, and Yuzhuo Zhang. STC: Sub-packetization tunable codes for fast recovery. Journal of Systems Architecture, 105, May 2020. (doi:10.1016/j.sysarc.2019.101699)
[Zhou et al., 2020]
Ke Zhou, Yu Zhang, Ping Huang, Hua Wang, Yongguang Ji, Bin Cheng, and Ying Liu. Efficient SSD cache for cloud block storage via leveraging block reuse distances. IEEE Transactions on Parallel and Distributed Systems, 31(11):2496–2509, May 2020. (doi:10.1109/TPDS.2020.2994075)
