2007 edition of EECE
571r: Data-Intensive Computing Systems
|
|
Topic |
Extra papers |
W1 |
01/09 |
Introduction
to the class, goals, and structure [ppt]. |
|
01/11 |
Introduction
(cont.) Performance enhancement techniques: replication, caching, striping, prefetching, Data
consistency. [ppt, ppt-1] |
[1]
Constructing
Collaborative Desktop Storage Caches
for Large Scientific Datasets, S. Vazhkudai et al. ACM Transaction on
Storage (TOS), 2006. [pdf] |
|
W2 |
01/16 |
Distributed
file systems. (Matei) [1]
Scale and Performance in a Distributed File System, J. H. Howard et al., ACM Transactions on Computer
Systems Feb. 1988, Vol. 6 (1). [pfd] [2]
The Google File System, Ghemawat et al., SOSP
2003 [pdf] NFS
summary slides [ppt] |
xFS
Project: Serverless Network File Service |
01/18 |
Data
replication (Elizeu) [1]
Efficient Replica Maintenance for Distributed Storage Systems, Byung-Gon Chun et al., NSDI’06. [pdf] [2] Drafting Behind
Akamai (Travelocity-Based Detouring), Ao-Jan Su et al.. SIGCOMM’06. [pdf] |
|
|
W3 |
01/23 |
Storage
management (Discussion leader: Armin) [1]
An
end-to-end approach to globally scalable programmable networking, Beck et
al., ACM SIGCOMM Workshop on Future
Directions in Network Architecture, 2003. [pdf] [2]
Storage Resource Managers: Middleware Components for
Grid Storage, A. Shoshani, A. Sim, J. Gu, 19th IEEE Symposium on Mass Storage Systems, 2002
(MSS '02). [pdf] [slides] |
[1]
Implementation Tradeoffs in Storage Allocation for
Grid Computing, D. Thain, Technical
Report [pdf]. [2]
An End-to-End Approach to Globally Scalable Network
Storage, M. Beck et al., ACM
SIGCOMM 2002 [pdf] |
01/25 |
Semantics:
(Discussion leader: Caleb) [1]
Semantically-Smart
Disk Systems, Muthian
Sivathanu et al., FAST’03 [pdf] [2]
Providing
Tunable Consistency for a Parallel File Store, Murali Vilayannur,
Partho Nath, and Anand Sivasubramaniam, FAST’05
[pdf] |
|
|
W4 |
01/30 |
Papers
on data stream analysis and data-mining (no links, emailed) |
[1]
Medusa:
Distributed stream processing [link] [2]
Multi-site
cooperative data stream analysis, ACM SIGOPS Operating Systems Review, 40(3),
31 – 37, 2006. [pdf] |
02/01 |
[1]
Project discussion. (Discussion
leader: Elizeu) [2]
Design and Evaluation of a Continuous Consistency Model
for Replicated Services, Haifeng Yu and Amin Vahdat, OSDI’00
[pdf]. |
[1]
The TACT project: Tunable Availability
and Consistency Tradeoffs |
|
W5 |
02/06 |
Availability:
(Discussion leader: Armin) [1]
TotalRecall: System Support for Automated Availability
Management, R. Bhagwan,
K. Tati, Y. Cheng, et al., NSDI , 2004. [pdf] [2]
Exploiting Availability Prediction in Distributed Systems, James W. Mickens, Brian D. Noble, NSDI
2006. [pdf] |
|
02/08 |
Check-pointing
for parallel applications (Discussion leader: Caleb) [1]
Adaptive incremental checkpointing for
massively parallel systems, Saurabh Agarwal, Rahul Garg, Meeta
S. Gupta, Jose E. Moreira 18th International
Conference on Supercomputing, Malo, France, 2004 [pdf] [2]
Blocking vs. Non-Blocking Coordinated Checkpointing
for Large-Scale Fault Tolerant MPI, Thomas Herault, Pierre Lemarinier,
Franck Cappello, SC2006, |
|
|
W6 |
02/13 |
DHTs. Slides: [ppt] |
DHT
systems: Chord, Pastry, Tapestry,
Bamboo, Kelips,
Symphony, SkipNet |
02/15 |
Midterm
project discussion. |
|
|
W7 |
02/27 |
OpenDHT: A Public DHT Service and Its Uses. Sean Rhea, Brighten Godfrey, Brad Karp, John Kubiatowicz, Sylvia Ratnasamy,
Scott Shenker, Ion Stoica,
and Harlan Yu. SIGCOMM 2005,
August 2005. [ppt], [pdf]) A Case Study in Building Layered DHT Applications, Yatin Chawathe, Sriram Ramabhadran, Sylvia Ratnasamy,
Anthony LaMarca, Scott Shenker,
Joseph Hellerstein, SIGCOMM’05 |
|
03/01 |
Disk
failures (Discussion leader:
Samer) [1]
Disk Failures in the Real World: What Does an MTTF of 1,000,000 Hours
Mean to You?, Bianca Schroeder and Garth A. Gibson
(FAST 07) [pdf] [2]
Failure Trends in a Large Disk Drive Population, Eduardo Pinheiro, Wolf-Dietrich Weber, and Luiz André Barroso (FAST’07) [pdf] |
|
|
W8 |
03/06 |
Performance Evaluation / Evaluating Distributed Systems: [1]
Fallacies in Evaluating Decentralized Systems [2]
Using PlanetLab for Network Research, Myths,
Realities and Best Practices. [pdf]. [3]
Should computer scientists experiment more? W.F. Tichy, IEEE Computer, May 1998 (32-40). [pdf] |
The Many Faces of Systems Research - and How to
Evaluate Them, Aaron B. Brown, Anupam Chanda, Rik Farrow, Alexandra Fedorova,
Petros Maniatis, and
Michael L. Scott, HotOS’05 [pdf] |
03/08 |
No
class. |
|
|
W9 |
03/13 |
System
characterization (I) (Discussion leader: Caleb) [1]
Passive NFS Tracing of Email and Research Workloads, Daniel Ellard, Jonathan Ledlie,
Pia Malkani, Margo
Seltzer, FAST 2003. [pdf] [2]
Feasibility of a Serverless Distributed File System Deployed on an
Existing Set of Desktop PCs,
William J. Bolosky, John R. Douceur, David Ely, and
Marvin Theimer, SIGMETRICS 2000 [pdf] |
[1]
An Empirical Study of a Highly Available File System, B. D. Noble and M. Satyanarayanan.
ACM SIGMETRICS’94. [2]
A Five-Year Study of File-System Metadata, Nitin Agrawal,
William J. Bolosky, John R. Douceur, and Jacob R. Lorch, FAST’07. [pdf] [3]
Farsite
project |
03/15 |
DHTs (cont) [ppt] [1]
File System Benchmarking:
Fallacies and Pitfalls, Nikolai Joukov,
Avishay Traeger, Charles
P. Wright, and Erez Zadok,
Technical Report FSL-05-04b [pdf][ppt]. |
[1]
The Impact of DHT Routing Geometry on Resilience and
Proximity, K.P. Gummadi,
et al. SIGCOMM 2003, Karlsruhe, Germany, August 2003. [pdf] [2]
Designing a DHT for Low Latency and High Throughput, Frank Dabek et al. NSDI 2004. [pdf] |
|
W10 |
03/20 |
System
characterization (II) (Discussion leader: Elizeu) [1]
Web Caching and Zipf-like Distributions: Evidence and Implications, Lee Breslau, Pei Cao, Li Fan, Graham Phillips, Scott Shenker, INFOCOM 1999: 126-134. [pdf] [2]
Small-World
File-Sharing Communities, Adriana
Iamnitchi, Matei Ripeanu, Ian Foster, Infocom 2004,
Hong Kong, March 2004. [ps] |
[4] A
Comparison of File System Workloads, Drew Roselli, Jacob
R. Lorch, Thomas E. Anderson, USENIX’00 [pdf] [5] File system
usage in Windows NT 4.0, Werner Voegels, SOSP’99 [pdf] [6]
Filecules in
High-Energy Physics: Characteristics and Impact on Resource Management, Adriana
Iamnitchi, Shyamala Doraimani,
Gabriele Garzoglio, HPDC 2006, Paris, June 2006. [pdf] [7] Interest-Aware
Information Dissemination in Small-World Communities, Adriana
Iamnitchi, Ian Foster, HPDC 2005, Raleigh, NC, July 2005 [pdf]. |
03/22 |
Network
coding and data reliability. [pdf]. [1]
High
Availability in DHTs: Erasure Coding vs.
Replication, Rodrigo Rodrigues and Barbara Liskov,
IPTPS'05, [PostScript
| PDF]
[PowerPoint
Slides] |
[2]
Erasure Coding
vs. Replication: A Quantitative Comparison, Hakim Weatherspoon
and John Kubiatowicz. IPTPS 2002 [pdf]
[3]
Assessing the
performance of Erasure Codes in the Wide Area, Rebecca L. Collins and James
S. Plank, DSN-2005 [pdf] |
|
W11 |
03/27 |
Network
attached storage / Specialized IO (Discussion leader: Armin) [1]
Network Attached Storage
Architecture. Garth A. Gibson and Rodney Van
Meter. Communications of the ACM, November 2000, Vol.43, No.11. [pdf] [2]
Making the Most out of Direct Access Network-Attached Storage, Kostas Magoutis, Salimah
Addetia, Alexandra Fedorova,
Margo I. Seltzer, FAST'03, [pdf] [3]
Structure and Performance of the Direct Access File System, Kostas Magoutis, Salimah
Addetia, Alexandra Fedorova,
Margo I. Seltzer, Jeffrey S. Chase, Andrew J. Gallatin, Richard Kisley, Rajiv G. Wickremesinghe,
Eran Gabber, USENIX’02
[pdf] |
[1]
Application
Performance on the Direct Access File System, Alexandra Fedorova,
Margo Seltzer, Kostas Magoutis, and Salimah Addetia, WOSP'04. [2]
A Performance
Comparison of NFS and iSCSI for IP-Networked
Storage, Peter Radkov, Li Yin, Pawan
Goyal and Prasenjit Sarkar, Prashant Shenoy, FAST’04 [pdf] [3]
End-System
Optimizations for High-Speed TCP,
Jeff Chase, Andrew Gallatin, and Ken Yocum, IEEE
Communications, 39 (4), 2001. [Postscript],
[PDF] |
03/29 |
Journaling,
Log structured FS [1]
File System Logging versus
Clustering: A Performance Comparison. M.
Seltzer, Smith, K., Balakrishnan, H., Chang, J., McMains, S., Padmanabhan, V.,
USENIX’95. [pdf] [2]
Journaling Versus Soft Updates:
Asynchronous Meta-data Protection in File Systems, Margo I. Seltzer, Gregory R. Granger, M. Kirk McKusick,
Keith A. Smith, Craig A. N. Soules, Christopher A.
Stein, USENIX’00 [pdf] |
[3]
The Design
and Implementation of a Log-Structured File System, Mendel Rosenblum and
John K. Ousterhout, SOSP’91 [pdf] [4]
Soft
updates: a solution to the metadata update problem in file systems, Gregory R. Ganger,
Marshall Kirk McKusick, Yale N. Patt,
Craig A. N. Soules, ACM Transactions on Computer
Systems (TOCS), 18(2), 2000 [pdf] [5]
Whitepaper: Red
Hat's New Journaling File System: ext3,
Michael K. Johnson [link] |
|
W12 |
04/03 |
Security / Trust [1]
Strong Accountability for
Network Storage, Aydan
R. Yumerefendi and Jeffrey S. Chase, FAST’07 [pdf] [2]
Samsara: Honor
Among Thieves in Peer-to-Peer Storage,
Landon Cox, Brian Noble, SOSP’03, [pdf] |
[3]
Data
Staging on Untrusted Surrogates, Jason Flinn; Shafeeq Sinnamohideen, Niraj Tolia, and M. Satyanaryanan, FAST’03 [4]
Plutus: Scalable Secure File Sharing on Untrusted Storage, Mahesh Kallahalla, Erik Riedel,; Ram Swaminathan, Qian Wang, and
Kevin Fu, FAST’03 [5]
Strong Security for Network-Attached
Storage, Ethan Miller and Darrell Long, William Freeman, Benjamin
Reed, FAST |
04/05 |
Data archival / Backups: [1]
A Cooperative Internet Backup Scheme, Mark Lillibridge, Sameh Elnikety, Andrew Birrell, Mike Burrows, and Michael Isard,
USENIX’03 [pdf] |
|
|
W13 |
04/16 |
Massive multiplayer online games [1]
Applicability of Group Communication
for Increased Scalability in MMOGs, Knut-Helge Vik et
al. NetGames 2006 [pdf] [2] The
Effects of Loss and Latency on User Performance in Unreal Tournament 2003,
Tom Beigbeder, Rory Coughlan,
Corey Lusher, John Plunkett, Emmanuel Agu, Mark Claypool, NetGames
2004 [pdf] |
|
04/18 |
No
class. Project presentations TBA during the exam period. |
|
Other
papers:
More
systems
§
Pond: The OceanStore Prototype, Sean Rhea, Patrick Eaton, Dennis Geels, Hakim
Weatherspoon, Ben Zhao, and John Kubiatowicz,
FAST’03
§
GPFS: A Shared-Disk File System for Large Computing Clusters,
Frank Schmuck and Roger Haskin, FAST’01
§
Metadata
Efficiency in Versioning File Systems, Craig A. N. Soules, Garth R.
Goodson, John D. Strunk, and Gregory R. Ganger,
§
yFS: A Journaling File System Design for Handling Large
Data Sets with Reduced Seeking, Zhihui Zhang and Kanad Ghose,
§
The Direct Access File System ,
Matt DeBergalis, Peter Corbett, Steve Kleiman, Arthur Lent, Dave Noveck,
Tom Talpey, and Mark Wittle
§ Bayou
§
Ivy: A
Read/Write Peer-to-peer File System. Athicha
Muthitacharoen, Robert Morris, Thomer
Gil, and Benjie Chen. 5th OSDI.
§
Venti, a new approach to archival storage,
§
Safety,
Visibility, and Performance in a Wide-Area File System, Minkyong
Kim, Landon P. Cox, and Brian D. Noble, http://www.sagecertification.org/events/fast02/full_papers/kim/kim.pdf
§ P. F. Corbett and D. G. Feitelson.
The Vesta Parallel File System. In High Performance
Mass Storage and Parallel I/O: Technologies and Applications. IEEE Computer
Society Press and Wiley, 2001.
Trust
/ Security
§
Data
Staging on Untrusted Surrogates, Jason Flinn; Shafeeq Sinnamohideen, Niraj Tolia, and M. Satyanaryanan, FAST’03
§
Plutus: Scalable Secure File Sharing on Untrusted
Storage, Mahesh Kallahalla,
Erik Riedel,; Ram Swaminathan, Qian
Wang, and Kevin Fu, FAST’03
§
Strong Security
for Network-Attached Storage, Ethan
Miller and Darrell Long, William Freeman, Benjamin Reed, FAST
Miscellaneous
§ D. Anderson, J. Chase, and A. Vahdat.
Interposed Request Routing for Scalable Network Storage. Transactions on
Computer Systems (TOCS), 20(1), February 2002.
§ A. C. Arpaci-Dusseau and R.
H. Arpaci-Dusseau. Information and Control in
Gray-Box Systems. In Proceedings of the 18th ACM Symposium on Operating
Systems Principles (SOSP ’01), pages 43–56,
§
Beyond
Bloom Filters: From Approximate Membership Checks to Approximate State Machines,
F. Bonomi, M. Mitzenmacher,
R. Panigraphy, S. Singh, G. Varghese, SIGCOMM’06