Paper title

The TerraByte Client: providing access to terabytes of plant data

Authors

Beck, Michael A.; Bidinosti, Christopher P.; Henry, Christopher J.; Ajmani, Manisha

Abstract

In this paper we demonstrate the TerraByte Client, software for downloading user-defined plant datasets from a data portal hosted at Compute Canada. To that end, the client offers two key functionalities: (1) It gives the user an overview of what data is available and a quick way to visually check samples of that data. For this, the client receives the results of database queries and displays the number of images that fulfill the search criteria. Furthermore, a sample can be downloaded within seconds to confirm that the data suits the user's needs. (2) The user can then download the specified data to their own drive. This data is prepared into chunks server-side and sent to the user's end system, where it is automatically extracted into individual files. The first chunks of data are available for inspection after a brief waiting period of a minute or less, depending on available bandwidth and type of data. The TerraByte Client has a full graphical user interface for ease of use and uses end-to-end encryption. The user interface is built on top of a low-level client. This architecture, combined with the client program being open source, makes it possible for users to develop their own user interface or use the client's functionality directly. An example of direct usage would be downloading specific data on demand within a larger application, such as one that trains machine learning models.
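
The chunk-wise delivery described in the abstract can be illustrated with a minimal Python sketch. It is not the TerraByte Client's actual code or API. It assumes, purely for illustration, that each chunk is a tar archive; the abstract does not state the real chunk format. The names `make_chunk` and `extract_chunks` are hypothetical. The point it shows is that files from the first chunk can be inspected before later chunks have been processed.

```python
import io
import tarfile
from pathlib import Path
from typing import Dict, Iterable, Iterator


def make_chunk(files: Dict[str, bytes]) -> bytes:
    """Pack files into one in-memory tar archive (hypothetical stand-in for a server-side chunk)."""
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w") as archive:
        for name, payload in files.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)
            archive.addfile(info, io.BytesIO(payload))
    return buffer.getvalue()


def extract_chunks(chunks: Iterable[bytes], dest: Path) -> Iterator[Path]:
    """Unpack each chunk as it arrives and yield the path of every extracted file."""
    dest.mkdir(parents=True, exist_ok=True)
    for chunk in chunks:
        with tarfile.open(fileobj=io.BytesIO(chunk), mode="r") as archive:
            for member in archive.getmembers():
                if not member.isfile():
                    continue
                # Keep only the base name so an archive cannot write outside dest.
                target = dest / Path(member.name).name
                source = archive.extractfile(member)
                target.write_bytes(source.read())
                yield target
```

Because `extract_chunks` is a generator, a caller such as a training loop can start consuming the yielded paths while later chunks are still in transit.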
