主题:【分享】Paper Summary

浏览0 回复2 电梯直达
皎然
结帖率:
100%
关注:0 |粉丝:0
新手级: 新兵
1. What is the problem the authors are trying to solve?

  Existing storage systems do not scale well to petabytes of data and
  terabytes/second throughput.


2. What other approaches or solutions existed at the time that this work was done?

  Lots of other file systems existed. NFS is a standard for distributed
  file systems. Lustre is a distributed object-based file system, as
  is the Panasas file system.


3. What was wrong with the other approaches or solutions?

  All have limitations that prevent them from scaling to the desired level.
  Block-based file systems have problems dealing with the large number of
  blocks in such a system. Other object-based file systems fail to take full
  advantage of the object-based paradigm and still maintain object lists.
为您推荐
您可能想找: 气相色谱仪(GC) 询底价
专属顾问快速对接
立即提交
皎然
结帖率:
100%
关注:0 |粉丝:0
新手级: 新兵
4. What is the authors' approach or solution?

  The authors solution includes:
  - Object-based storage devices
  - A globally known mapping function for locating file data
    (instead of object lists)
  - A scalable metadata manager that dynamically redelegates authority
    for directory subtrees based on load
  - A distributed autonomous system for managing the object stores

5. Why is it better than the other approaches or solutions?

  It scales to petabytes, provides nearly linear performance improvements
  as storage devices are added, degrades gracefully as storage devices are
  removed, and provides very high performance.

6. How did they test their solution?

  They ran parts of the storage system and observed their performance
  under various workloads. Data performance was tested on a single object
  store and on several object stores. Metadata performance was tested on
  a large cluster.

皎然
结帖率:
100%
关注:0 |粉丝:0
新手级: 新兵
7. How does it perform?

  Performance is very good. The system appears to achieve its goals,
  although scalability could be improved in certain scenarios where a lot
  of sharing occurs.

8. Why is this work important?

  This work is important because storage systems continue to grow in size
  and data is becoming increasingly important.

3+ comments/questions

  * Why didn't they directly compare the performance of their system against
    that of any other storage systems?

  * What happens if you scale to exabytes? Will the system still work? What
    factors will limit its ability to scale further?

  * How much of the improvement is due to CRUSH, and how much to the design
    of the other parts of the system? Why didn't they do any tests to isolate
    the benefits of the individual design decisions?
猜你喜欢最新推荐热门推荐更多推荐
品牌合作伙伴