Hammerspace leverages smart metadata handling for AI/ML workloads
Date of creation: March 12, 2024, 2:39 p.m. From SITE: https://www.computerweekly.com Original page link
Original page content Software-defined storage maker Hammerspace claims its Hyperscale NAS functionality will offer a global file system that’s built for artificial intelligence/machine learning (AI/ML) workloads and for the high demands of GPU-driven processing. It offers performance usually only provided by dedicated high performance computing (HPC) storage products, but for data resident in any on-site or cloud location. That’s a win for customers that want to retain data of many different types, potentially located in multiple datacentres or clouds, but which may form training datasets for analytics workloads. Hammerspace essentially allows customers to view, access and manage data wherever it is held and whatever storage it is held on. It’s a core element of Hammerspace’s technology stack that it parses out the metadata from the file at an earlier stage than competitors. In other words, the Linux kernel that Hammerspace is built on separates out the metadata at the client and before it is written to storage. This lightens the load on storage, but also means metadata is offloaded when transmitted for processing such as in AI/ML workloads and GPU farms. It has been leveraged to provide the core offer of Hammerspace’s Hyperscale NAS. Hammerspace is currently only for file data and competes with scale-out NAS makers such as NetApp, Isilon and Qumulo. With Hyperscale NAS, it attempts to provide storage targeted at HPC and AI/ML workloads. “Historically, HPC file system products, like DDN, have been a difficult sell into enterprises that already run file system products,” said Molly Presley, senior vice-president of marketing at Hammerspace. “We’ve been able to take metadata out of the data path and create a parallel file system that removes this overhead. Traditional file system products don’t do that, and it’s not good in an HPC environment. “In the AI world, most organisations don’t know which models they will want to use. So, what we offer gives flexibility, with data that resides in datacentres or in the cloud or is unstructured and that they then decide they want to access for AI training.” Presley cited one customer at the US Los Alamos National Laboratory, where, she said, the organisation runs several different file systems and had struggled with being able to distribute data to collaborators. Another customer cited by Presley runs Isilon scale-out NAS and had hit bottlenecks in how many GPUs it could feed, so the 32-node NAS system had only been able to supply a 300-node render processing farm. Using Hyperscale NAS – because it took metadata out of the I/O path – the customer was able to double the number of rendering nodes to 600. Hammerspace is among a group of products that aim to provide global file access and collaboration with access to the latest version of files from any location. Competitors include Ctera, Nasuni, Panzura and Peer Software. Hyperscale NAS is available now for all Hammerspace customers at no additional cost. Hammerspace licencing is based on the total amount of data under management. In May last year, Hammerspace acquired Rozo Systems for its RozoFS, and in particular its advanced erasure coding capabilities that allow for sharding of files across multiple locations. Read more on global file systems Global file systems: Hybrid cloud and follow-the-sun access. We look at global distributed file systems that put enterprise data under a single file access namespace so that enterprises and branch offices can get to data from anywhere. File, block and object: Storage fundamentals in the cloud era. We look at the three basic ways that storage accesses data – via file, block and object – as well as the ways in which the rise of the cloud and distributed systems have brought changes to them. | Software-defined storage maker separates metadata from files to provide view-from-anywhere file system visibility. It has now leveraged that for AI/ML workloads in Hyperscale NAS
Date of avatar: March 31, 2024, 6:48 p.m.
Tags: metadata management, software-defined storage, hammerspace, real-world impact, processing efficiency, hyperscale nas, market position, ai/ml workloads, global file system, competition, gpu-driven processing
Content: # Part 1: Introduction ## Hammerspace Introduces Hyperscale NAS for AI/ML Workloads and GPU-driven Processing Software-defined storage provider, Hammerspace, is revolutionizing the storage industry with its Hyperscale NAS functionality. Designed specifically for artificial intelligence/machine learning (AI/ML) workloads and GPU-driven processing, Hyperscale NAS offers a global file system that delivers unparalleled performance. Unlike traditional high performance computing (HPC) storage products, Hyperscale NAS can handle data resident in any on-site or cloud location. This is a game-changer for customers who need to retain and manage diverse types of data, which may be spread across multiple data centers or clouds and used as training datasets for analytics workloads. # Part 2: The Power of Hammerspace's Technology ## Enhanced Metadata Management and Processing Efficiency At the core of Hammerspace's technology stack is its innovative approach to metadata management. Unlike its competitors, Hammerspace separates out metadata from the file at the client level, before it is written to storage. This not only lightens the load on storage, but also enables efficient offloading and processing of metadata for AI/ML workloads and GPU farms. By leveraging this unique capability, Hammerspace has created Hyperscale NAS, which offers unparalleled performance for HPC and AI/ML workloads. # Part 3: Success Stories and Competition ## Real-world Impact and Market Position Hammerspace has already made a significant impact in the market, providing file data storage that competes with scale-out NAS vendors like NetApp, Isilon, and Qumulo. Molly Presley, Senior Vice President of Marketing at Hammerspace, highlighted the benefits of Hammerspace's metadata separation approach, citing real-world examples. One customer, the US Los Alamos National Laboratory, struggled with data distribution to collaborators until they adopted Hammerspace's solution. Another customer, who previously used Isilon scale-out NAS, experienced bottlenecks in their GPU feeding capacity. By implementing Hyperscale NAS, they were able to double the number of rendering nodes, significantly improving performance. Hammerspace faces competition from other providers aiming to enable global file access and collaboration, such as Ctera, Nasuni, Panzura, and Peer Software. However, the unique capabilities and performance of Hyperscale NAS set Hammerspace apart. This innovative solution is now available to all Hammerspace customers at no additional cost, with licensing based on the amount of data under management. As the storage industry continues to evolve, Hammerspace remains at the forefront, delivering cutting-edge solutions for the most demanding AI/ML workloads.
Date of avatar: March 12, 2024, 2:47 p.m.
Tags: machine learning, artificial intelligence, software-defined storage, high-performance computing, data management, file data, gpu-driven processing, storage solutions., metadata handling, rendering nodes, hyperscale nas, parallel file system, scale-out nas, hpc file system, data distribution, ai training
Content: Introducing Hyperscale NAS, the latest innovation from software-defined storage provider, Hammerspace. This groundbreaking functionality offers a global file system that is specifically designed to meet the demands of artificial intelligence/machine learning (AI/ML) workloads and GPU-driven processing. With performance capabilities typically only found in dedicated high-performance computing (HPC) storage products, Hyperscale NAS is able to handle data residing in any on-site or cloud location. This is a game-changer for customers who need to retain various types of data, potentially stored in multiple data centers or clouds, and utilize it as training datasets for analytics workloads. What sets Hammerspace apart is its ability to view, access, and manage data regardless of where it is stored or what type of storage is being used. At the core of Hammerspace's technology stack is its unique approach to metadata. Unlike its competitors, Hammerspace separates out the metadata at the client level before it is written to storage. This not only reduces the burden on storage resources but also allows for offloading of metadata during processing, such as in AI/ML workloads and GPU farms. This innovative metadata handling forms the foundation of Hammerspace's Hyperscale NAS offering. While Hammerspace currently focuses on file data, it competes with scale-out NAS providers like NetApp, Isilon, and Qumulo. However, with Hyperscale NAS, Hammerspace aims to cater specifically to HPC and AI/ML workloads. Molly Presley, senior vice-president of marketing at Hammerspace, explains that traditional HPC file system products have struggled to penetrate enterprise environments that already have existing file system solutions. Hammerspace's approach, on the other hand, eliminates overhead by removing metadata from the data path and creating a parallel file system. This flexibility is crucial in the AI world, where organizations often don't know which models they will need to use. Hammerspace allows them to access and utilize data that resides in data centers, the cloud, or is unstructured, enabling AI training as needed. Hammerspace has already seen success with customers such as the US Los Alamos National Laboratory, which had difficulties distributing data to collaborators using different file systems. Another customer, who previously faced bottlenecks with their Isilon scale-out NAS, was able to double the number of rendering nodes from 300 to 600 using Hyperscale NAS. Hammerspace is part of a group of products that aim
Date of avatar: March 12, 2024, 2:46 p.m.
Tags: cloud location, diverse data types, multiple data centers, software-defined storage, gpu-driven processing, high-performance storage, on-site, scale-out nas providers, hammerspace, ai/ml workloads, hyperscale nas, global file system
Content: Hammerspace, a software-defined storage company, claims that its Hyperscale NAS functionality provides a global file system designed for AI/ML workloads and GPU-driven processing. It offers high-performance storage for data located in any on-site or cloud location, making it ideal for customers managing diverse data types across multiple data centers or clouds. Hammerspace separates metadata from files at an early stage, reducing storage load and enabling efficient processing for AI/ML workloads. The company aims to compete with scale-out NAS providers like NetApp, Isilon, and Qumulo, targeting HPC and AI/ML workloads. Hyperscale NAS is available at no additional cost to Hammerspace customers.