论文标题

研究完全分散的Web3系统的工作量:IPFS

Studying the workload of a fully decentralized Web3 system: IPFS

论文作者

Costa, Pedro Ákos, Leitão, João, Psaras, Yannis

论文摘要

Web3旨在创建一个分散的平台,该平台与支持当今互联网的现代云基础架构具有竞争力。但是,Web3仍然有限,仅支持内容创建和共享,分散融资以及分散沟通的应用程序。这主要是由于支持Web3的技术:区块链,IPFS和LIBP2P,尽管提供了开发Web3应用程序的好的工具,但在设计和性能方面仍然受到限制。这激发了更好地理解这些技术的需求,以实现可以将Web3推向其全部潜力的新颖优化。不幸的是,了解完全分散的大规模分布式系统的当前行为是一项艰巨的任务,因为没有集中权威对系统操作有充分的了解。 为此,在本文中,我们表征了IPF的工作量,这是Web3的关键推动器。为了实现这一目标,我们收集了从用户执行的访问到北美最受欢迎的IPFS网关之一的痕迹。通过对这些痕迹的精细分析,我们收集了向系统的请求数量,并找到了所请求的内容的提供者。有了这些数据,我们既表征了请求和提供的内容的普及以及它们的地理位置(通过将IP地址与MaxMind数据库匹配)。我们的结果表明,IPF中的大多数请求仅针对几个不同的内容,该内容由系统中的大部分同行提供。此外,我们的分析还表明,大多数请求是由位于北美和欧洲的系统中两个最大的提供商提供的。有了这些见解,我们得出结论,当前的IPFS体系结构是最佳的,并为未来提出了研究议程。

Web3 aims at creating a decentralized platform that is competitive with modern cloud infrastructures that support today's Internet. However, Web3 is still limited, supporting only applications in the domains of content creation and sharing, decentralized financing, and decentralized communication. This is mainly due to the technologies supporting Web3: blockchain, IPFS, and libp2p, that although provide a good collection of tools to develop Web3 applications, are still limited in terms of design and performance. This motivates the need to better understand these technologies as to enable novel optimizations that can push Web3 to its full potential. Unfortunately, understanding the current behavior of a fully decentralized large-scale distributed system is a difficult task, as there is no centralized authority that has full knowledge of the system operation. To this end, in this paper we characterize the workload of IPFS, a key enabler of Web3. To achieve this, we have collected traces from accesses performed by users to one of the most popular IPFS gateways located in North America for a period of two weeks. Through the fine analysis of these traces, we gathered the amount of requests to the system, and found the providers of the requested content. With this data, we characterize both the popularity of requested and provided content, as well as their geo-location (by matching IP address with the MaxMind database). Our results show that most of the requests in IPFS are only to a few different content, that is provided by large portion of peers in the system. Furthermore, our analysis also shows that most requests are provided by the two largest portions of providers in the system, located in North America and Europe. With these insights, we conclude that the current IPFS architecture is sub-optimal and propose a research agenda for the future.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源