Revisit Amazon Web Services re:Invent 2024’s biggest moments and watch keynotes and innovation talks on demand
General
Q: What is Amazon FSx for OpenZFS?
A: Amazon FSx for OpenZFS is a fully managed file storage service that lets you launch, run, and scale fully managed file systems built on the open-source OpenZFS file system. FSx for OpenZFS makes it easy to migrate your on-premises file servers without changing your applications or how you manage data, and to build new high-performance, data-intensive applications on the cloud.
Q: What can I do with Amazon FSx for OpenZFS?
A: Amazon FSx for OpenZFS provides fully managed, highly reliable, and easy-to-use file storage that you can access over the industry-standard Network File System (NFS) protocol from virtually any client or workstation running in Amazon Web Services services or on premises. It gives you a rich set of capabilities for managing and working with your data, such as snapshots, data cloning, and compression. Powered by Amazon Graviton processors, along with the latest Amazon Web Services disk and networking technologies, FSx for OpenZFS delivers over 1 million IOPS, with latencies as low as a few hundred microseconds for your high-performance workloads.
Use FSx for OpenZFS to migrate your on-premises data stored in ZFS or other Linux-based file servers to Amazon Web Services. By providing the same data management capabilities and performance as on-premises NFS or other Linux-based file servers, FSx for OpenZFS lets you move your data without having to modify your existing code or how you manage your data.
If you’re building a new application on Amazon Web Services, FSx for OpenZFS provides rich capabilities to make it easier to develop, test, and run cloud-native applications. And with support for throughput scaling, FSx for OpenZFS makes it easy to scale your file system along with growing application demands and datasets.
Q: What use cases does Amazon FSx for OpenZFS support?
A: Amazon FSx for OpenZFS is designed to support a wide range of Linux and Windows workloads, including big data and analytics, code and artifact repositories, DevOps solutions, web content management, front-end electronic design automation (EDA), genomics research, and media processing. With FSx for OpenZFS, you can deliver the low latency and performance needed for the most demanding applications and workloads.
Q: How does FSx for OpenZFS work with other Amazon Web Services services?
A: Amazon FSx for OpenZFS is accessible from Amazon Elastic Compute Cloud (EC2), Amazon Elastic Container Service (ECS), Amazon Elastic Kubernetes Service (EKS), and Amazon WorkSpaces. You can secure and protect your data with Amazon Virtual Private Cloud (VPC) and Amazon Key Management Service (Amazon KMS). You can monitor storage and performance metrics using Amazon CloudWatch, monitor and secure API calls using Amazon CloudTrail and Amazon Identity and Access Management (IAM), and manage file systems using Amazon CloudFormation.
Q: How do I get started with Amazon FSx for OpenZFS?
A: To get started, create an Amazon FSx for OpenZFS file system using the Amazon Web Services Management Console, Amazon Command Line Interface (Amazon CLI), or Amazon FSx API. While creating a file system, you specify the deployment type (Single-AZ or Multi-AZ), storage capacity, throughput capacity, and optionally specify the level of disk IOPS. Within minutes, your file system is available, and you can access it from Linux or Windows clients with the NFS protocol (v3, v4, v4.1, v4. 2) FSx for OpenZFS presents data to your users and applications as a local directory or drive, and it provides concurrent access for up to thousands of clients.
Q: What is an Amazon FSx file system?
A: A file system is the primary resource in Amazon FSx. You specify the SSD storage capacity, throughput capacity, and IOPS capacity for your file system, and choose an Amazon VPC in which your file system is created.
Q: What operating systems and Amazon Web Services services can I access my file system from?
A: You can access your data from Linux or Windows clients running on Amazon EC2, Amazon ECS, Amazon EKS and Amazon WorkSpaces.
Q: How do I access data on my Amazon FSx for OpenZFS file system?
A: From a Linux instance, you can access your file data using the standard Linux mount command and the domain name system (DNS) name associated with the volume. From a Windows instance, use Windows Powershell to map a drive letter (for example, Z:) to a file share in your file system with the NFS client.
Q: What protocols does Amazon FSx for OpenZFS support?
A: Amazon FSx for OpenZFS provides access to shared file storage over the latest versions of the NFS protocol (v3, v4, v4.1, v4.2).
Q: How do I migrate my data from on premises into Amazon FSx for OpenZFS?
A: You can access your Amazon FSx for OpenZFS file system from your on-premises environment via Amazon Direct Connect, or use standard copy tools, such as rsync or Robocopy, to copy data from on premises to FSx for OpenZFS. You can also use Amazon Snowball to migrate your data to Amazon Web Services services without using Direct Connect.
Q: When should I use Amazon FSx for OpenZFS compared to other Amazon FSx file system types?
A: Refer to Choosing an Amazon FSx file system page for more details on the different Amazon FSx file storage offerings.
Q: What Regions is Amazon FSx for OpenZFS available in?
A: Refer to the Regional Products and Services page for details of Amazon FSx for OpenZFS service availability by Region.
Q: When should I use a Multi-AZ file system vs. a Single-AZ file system?
A: We recommend Multi-AZ file systems for most production workloads given the high availability and durability model it provides. You should use a Multi-AZ file system for workloads that require high availability to ensure continued availability across events such as hardware failures, file system maintenance, and AZ disruption.
Use Single-AZ file systems for the most latency-sensitive applications (e.g., certain EDA and HPC workloads) or as a cost-effective solution for workloads that don’t need high availability or a multi-AZ resilience model: production workloads that have replication and failover built in to the application layer, analytics workloads that can rehydrate or regenerate the data, development and testing workloads, and some production workloads that have relaxed availability and Recovery Point Objective (RPO) needs.
Scale and performance
Q: What performance does Amazon FSx for OpenZFS deliver?
A: FSx for OpenZFS provides fully managed SSD storage with consistent, latencies as low as a few hundred microseconds. Leveraging caching capabilities embedded in OpenZFS, FSx for OpenZFS file systems deliver even higher performance and lower latencies for frequently accessed data.
FSx for OpenZFS file systems can deliver over 1 million IOPS and 21 GB/s throughput when accessing data cached on the file server, and to hundreds of thousands of IOPS and 10 GB/s throughput when accessing data in SSD storage. With multiple parallel connections per client (via NFS n-connect), you can drive up to the maximum throughput and IOPS levels for your file system from a single client.
Q: How do I configure the performance of my file system?
A: When creating a file system, you specify its throughput capacity. Each gigabyte of storage includes three disk IOPS, and you can provision additional disk IOPS as needed. You can scale up or down throughput capacity at any time using the Amazon Web Services Management Console, Amazon CLI, or the Amazon FSx API to meet your changing application needs. Visit the File system performance page for more information.
Q: How much data can I store on Amazon FSx for OpenZFS?
A: You can provision up to a maximum of 512 TiBs on a single file system. You can easily mount multiple file systems concurrently to provide PB-scale storage for your applications.
Q: How do I monitor my file system activity?
A: By default, volumes can grow up to the available capacity on your file system. You have the option to set a quota for a volume to limit how large the volume can grow. You also have the option to set a reservation for a volume to set aside a minimum amount of storage capacity for that volume. Without a reservation, volumes are thin-provisioned, meaning that they consume storage capacity only for the data stored in them. You can change these settings at any time.
Storage administration and management
Q: What is a volume?
A: In addition to representing data in a traditional file-directory structure, FSx for OpenZFS supports the use of volumes within a file system to further organize your data. Each file system contains one or more volumes, which are isolated data containers for directories and files. You can create and manage individual volumes using the Amazon Web Services Management Console, Amazon CLI, or the Amazon FSx API, and clients access volumes by mounting them from Linux or Windows as a network location. Individual volumes can be configured independently, meaning you can set up features such as thin provisioning or compression at the volume level.
Q: How do my volumes contribute to the storage capacity usage in my file system?
A: By default, volumes can grow up to the available capacity on your file system. You have the option to set a quota for a volume to limit how large the volume can grow. You have the option to set a reservation for a volume to set aside a minimum amount of storage capacity for that volume. Without a reservation, volumes are thin-provisioned, meaning that they consume storage capacity only for the data stored in them. You can change these settings at any time.
Q: What capabilities does Amazon FSx for OpenZFS provide for working with data?
A: FSx for OpenZFS offers a rich set of ZFS-powered capabilities for working with data, including point-in-time snapshots, in-place data cloning, and on-demand data replication across file systems. With FSx for OpenZFS, you can create snapshots of your volumes at any time, making it easy to retain, manage, and restore historical versions of your databases or applications. Data cloning enables makes it easy for you to test multiple features and changes in parallel without interrupting your existing users or applications, and without needing to duplicate your data. On-demand data replication across file systems provides a simple and resilient way to synchronize data for disaster recovery, read replicas, and test and development workflows, without the extra work of establishing connectivity between your file systems or detecting and handling network interruptions.
As an example, if you are running a database workload and would like to test a database operation prior to applying it in production environment, you can test the operation by creating a clone of your database, running the operation against the clone volume to validate that it works as expected. Once you are done with your test, you can copy your clone volume to a full-copy volume to retain the results for long-term or production use.
Q: What storage efficiency features does FSx for OpenZFS support?
A: FSx for OpenZFS supports a rich set of storage efficiency features including Zstandard and LZ4 compression to reduce the storage capacity your data consumes, volume reservations and volume quotas to control the amount of space used by individual volumes, and user quotas to control the usage of individual end-users. These features help you reduce or manage the storage footprint in your file system and backup storage.
Q: How does data compression impact file system performance?
A: Enabling compression may reduce file system performance for write-heavy workloads as data is compressed as it is being written to disk. However, for read-heavy workloads, compression can significantly improve the overall throughput performance of your file system as it reduces the amount of data sent between these disks and the storage server. See the Amazon FSx for OpenZFS Performance documentation for more details.
Q: How do I manage a file system? How do I work with ZFS capabilities?
A: Amazon FSx is a fully managed service, so all of the file storage infrastructure is managed for you. When you use Amazon FSx, you avoid the complexity of deploying and maintaining complex file system infrastructure. FSx for OpenZFS offers all of the power of ZFS, without the complexity often found with managing ZFS file systems
To create, view, and delete file systems, volumes, and backups, you can use the Amazon Web Services Management Console, the Amazon CLI, and the Amazon FSx API. We also offer support for advanced ZFS capabilities natively in the FSx console and API, meaning that you don’t need to be an expert in ZFS in order to take advantage of features like ZFS snapshots and cloning.
Availability and durability
Q: What does FSx for OpenZFS do to ensure high availability and durability?
A: With Multi-AZ file systems, Amazon FSx automatically replicates your data both within and across Amazon Web Services Availability Zones (AZs) to ensure high durability and deploys a high-availability (HA) pair of file servers across two AZs with automatic failover and failback to ensure high availability. Additionally, Amazon FSx continuously monitors for hardware failures, and automatically replaces infrastructure components in the event of a failure. File systems automatically fail over and back as needed (typically within 60 seconds), and clients automatically fail over and back with the file system.
With Single-AZ file systems, Amazon FSx automatically replicates your data within an Availability Zone (AZ) to protect it from component failure, and resumes availability across failure events by continuously monitoring for hardware failures and automatically replacing infrastructure components in the event of a failure.
Amazon FSx also takes highly durable backups (stored in S3) of your file system daily, and allows you to take additional backups at any point.
Q: Does Amazon FSx offer a Service Level Agreement (SLA)?
A: Yes. The Amazon FSx SLA provides for a service credit if a customer's monthly uptime percentage is below our service commitment in any billing cycle.
Q: How can I protect and back up my data in FSx for OpenZFS?
A: To protect against end users or applications accidentally deleting or unintentionally modifying files in your volumes, you can take snapshots (point-in-time images of your volumes, stored within your file system) to allow end users or application administrators to easily undo file changes and compare file versions. End users and application administrators can view snapshots in the “.zfs/snapshot” directory at the root of a volume, and can access individual files and folders in those snapshots. You can manage your FSx for OpenZFS snapshots using the Amazon Web Services Management Console, Amazon CLI, or Amazon FSx API.
FSx for OpenZFS also offers file system backups that are designed to support archival, data retention, and compliance needs. Backups are a secondary, offline, highly durable copy of a file system stored in Amazon S3. Backups are point-in-time consistent (representing a consistent view of your entire file data set as it looked at a given time), incremental (you pay only for the changes since the last backup for a given backup), easy to manage (via the Amazon Web Services Management Console, Amazon CLI, or Amazon FSx API), and quick to create and restore (with an RTO of 5-10 minutes).
Q: What capabilities does FSx for OpenZFS provide for disaster recovery and business continuity?
A: Daily automatic backups are enabled by default, and you can update the retention policy at any time with just a few clicks in the Amazon FSx Console. FSx for OpenZFS also provides the ability to copy file system backups to other Amazon Web Services Regions to meet cross-Region disaster recovery or business continuity objectives.
Q: How does a Multi-AZ file system work?
A: When you create a Multi-AZ file system, Amazon FSx automatically provisions two individual file servers: a “preferred” server in your preferred AZ, and a “standby” server in your standby AZ, each with its own set of storage volumes. Only one of these file servers is actively serving data at any time (usually the preferred file server), but both maintain persistent and identical copies of your data.
Amazon FSx continuously monitors the health of your file servers to quickly detect any issues. If your preferred file server becomes unavailable for any reason (e.g., due to an isolated hardware failure, a planned maintenance event, or an AZ-wide interruption), the service will automatically fail over to the standby file server so you can resume file system operations without any loss of availability. Amazon FSx will map the file system’s network address to the newly active standby file server, so your Linux- and Windows-based applications can continue accessing data – uninterrupted – from the same network location. Your Multi-AZ file system will also automatically fail back once the preferred file server is restored to a healthy state. These failover and failback events are transparent to your users and applications: your file system will continue to serve NFS traffic from the same network address, allowing NFS clients to continue accessing your data without manual intervention.
Q: What events would cause Amazon FSx to initiate a failover on a Multi-AZ file system?
A: Amazon FSx automatically performs a failover on a Multi-AZ file system in the event of a loss of availability to the active file server for any reason, including a hardware failure, a loss of connectivity, or an AZ-wide outage. Amazon FSx will also temporarily fail over to the standby file server during planned maintenance during a maintenance window or if you change your file system’s throughput capacity.
Security and accessibility
Q: Does FSx for OpenZFS encrypt my data?
A: Yes. FSx for OpenZFS automatically encrypts data at rest and in transit. Your file system data and backups are always encrypted at rest using keys you manage through Amazon KMS. You can use service-owned keys or your own customer-managed keys. Amazon FSx automatically encrypts data in transit when it is accessed from Amazon EC2 instances that support encryption in transit (supported Amazon EC2 client instances).
Q: What access control capabilities does FSx for OpenZFS provide?
A: You configure your file system to be associated with an Amazon VPC and VPC Security Groups, and only Amazon EC2 instances within that VPC (or a peered VPC) and that match the Security Group rules can access your file system. You control who can administer your file systems, volumes, and backups (for example, create, update, and delete) using Amazon IAM. Additionally, you control which users and groups have access to which files and directories on your volumes using POSIX permissions.
Q: Can I access my FSx for OpenZFS file system from another VPC? Can I access data from on premises?
A: Yes. You can access file systems from another VPC (including a VPC in another Region) using Amazon Transit Gateway or VPC Peering, and you can access file systems from on premises using Amazon Direct Connect.
Q: How do I monitor and secure my file system?
A: You can monitor and secure API calls using Amazon CloudTrail and Amazon IAM and detect and flag suspicious API usage patterns using Amazon GuardDuty.
Pricing and billing
Q: How will I be billed for my use of FSx for OpenZFS?
A: You are billed for file systems based on storage capacity provisioned (per GB-month), throughput provisioned (per MBps-month), and optional additional SSD IOPS provisioned (per IOPS-month). You are also billed for backups based on storage used (per GB-month), and for data transferred (per GB) across Availability Zones, across Amazon Web Services Regions, and to your on-premises environment. While pricing is quoted on a monthly basis, your usage is prorated by the second and you are billed for your average usage over a month.
Q: Do your prices include taxes?
A: Except as otherwise noted, our prices are exclusive of applicable taxes and duties, including VAT and applicable sales tax.