FAQ
Which cloud object stores does Fusion support?
Fusion supports AWS S3, Azure Blob, and Google Cloud Storage. Fusion can also be used with local storage solutions that support the AWS S3 API.
How does Fusion work?
Fusion implements a FUSE driver that mounts the cloud storage bucket in the job execution context as a POSIX file system. This allows the job script to read and write data files in cloud object storage as if they were local files.
Why is Fusion faster than other FUSE drivers?
Fusion is not a general purpose file system. It has been designed to optimize the data transfer of bioinformatics pipelines by taking advantage of the Nextflow data model.
Why do I need Wave containers to use Fusion?
Fusion is designed to work at the job execution level. This means it must run in a containerized job execution context.
Downloading and installing Fusion manually would require you to rebuild all the containers used by your data pipeline to include the Fusion client each time a new version of the client is released. You would also need to maintain a custom mirror or existing container collections, such as BioContainers.
Wave enables you to add the Fusion client to your pipeline containers at deploy time, without the need to rebuild them or maintain a separate container image collection.
Can Fusion mount more than one bucket in the job's file system?
Yes. Any access to cloud object storage is automatically detected by Fusion and the corresponding buckets are mounted on demand.