ZeroGPU Beta Release

We are thrilled to announce the beta release of ZeroGPU, a groundbreaking initiative by Hugging Face designed to democratize access to GPU resources for the open-source AI community. Congratulations to @victormustar and the team for making this vision a reality!

In recent months, the open-source AI community has seen unprecedented growth. Major players such as Meta, Apple, NVIDIA, ByteDance, Snowflake, Databricks, Microsoft, and Google have released open models and datasets on Hugging Face. The Hugging Face Hub now hosts over 1 million models, collectively downloaded more than a billion times, and many of them are beginning to outperform proprietary APIs, showcasing the strength and innovation of the open-source community.

This vibrant ecosystem is not driven by big tech alone. Academic labs, startups, and independent hobbyists are equally pivotal. For instance, over 35,000 variations of Meta's Llama models have been shared on Hugging Face in the past year, including more than 7,000 based on Llama 3, ranging from quantized and merged versions to specialized applications in fields like biology and Mandarin language processing. More than 4 million AI developers currently build on Hugging Face.

Despite this progress, the open-source community often lacks the resources available to big tech, particularly GPU access for training and demonstrating models. This disparity is one reason why ChatGPT remains the most widely used AI application today. To close this gap, Hugging Face is launching ZeroGPU: shared infrastructure that lets indie and academic AI developers run demos on Spaces without bearing the cost of GPUs themselves. Spaces has become the most popular platform for building AI demos, with over 300,000 demos created so far on CPUs or paid GPUs and thousands more added daily. With ZeroGPU, Hugging Face is committing $10 million in free GPUs to support the continued growth and innovation of the AI ecosystem.

Technically, ZeroGPU builds on Hugging Face's experience hosting and serving over 100 petabytes of data monthly. The system runs on NVIDIA A100 GPUs and lets Spaces use multiple GPUs efficiently by allocating and releasing them dynamically as demos need them, rather than dedicating a GPU to each demo. This improves resource efficiency and reduces energy consumption, while still providing robust performance and scalability for the community. A sketch of what this looks like from a developer's perspective is shown below.

With the launch of ZeroGPU, Hugging Face is empowering the open-source AI community, ensuring that a lack of GPU resources no longer hinders innovation and development.
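For developers, a ZeroGPU demo is meant to look almost identical to a regular GPU Space: the model is loaded once, and a GPU is borrowed from the shared pool only while a request is being processed. The snippet below is a minimal, illustrative sketch of that pattern, assuming the `spaces` helper package and its `GPU` decorator, Gradio for the interface, and a diffusers pipeline as the example workload; none of these specifics (including the model ID and duration value) are prescribed by the announcement itself.

```python
# Minimal sketch of a ZeroGPU-style Space (assumptions: `spaces` helper package,
# Gradio, diffusers; the model and duration below are only illustrative).
import gradio as gr
import spaces
import torch
from diffusers import DiffusionPipeline

# Load weights once at startup; no GPU is held while the Space sits idle.
pipe = DiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
)
pipe.to("cuda")  # on ZeroGPU, the device is only attached during decorated calls

@spaces.GPU(duration=60)  # request a GPU slice for up to ~60 seconds per call
def generate(prompt: str):
    # A GPU is allocated when this function starts and released when it returns.
    return pipe(prompt).images[0]

gr.Interface(fn=generate, inputs="text", outputs="image").launch()
```

The important point is that the decorated function only holds a GPU while it is executing, which is what allows many Spaces to share the same pool of A100s instead of each demo keeping its own idle hardware.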