Replicate AI
Cloud API to run, fine-tune, and deploy open-source machine learning models.
Replicate is a cloud API platform for running open-source machine learning models. Users can run and fine-tune existing models, and deploy custom models at scale, with a single line of code. Replicate hosts thousands of community-contributed models and offers production-ready APIs for AI tasks such as image generation, video generation, image restoration, captioning, speech generation, music generation, and text generation.
Users can run pre-existing models with a single line of code, fine-tune models with their own data, or deploy custom models using Cog. The platform automatically scales to handle demand, and users only pay for the compute they use.
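As a rough illustration of what running a hosted model through the API involves, the sketch below builds (but does not send) an HTTP request that would start a prediction. It assumes Replicate's HTTP API accepts a POST to `/v1/models/{owner}/{name}/predictions` with a bearer token and a JSON `input` object; the owner, model name, and `prompt` field are placeholders, and the helper `build_prediction_request` is hypothetical, not part of any Replicate library.

```python
import json
import os
import urllib.request

# Assumed base URL of the Replicate HTTP API.
API_BASE = "https://api.replicate.com/v1"


def build_prediction_request(owner, name, model_input, token):
    """Build, without sending, a POST request that would start a prediction.

    `owner` and `name` identify a hosted model; `model_input` is the
    model-specific input dictionary; `token` is an API token.
    """
    url = f"{API_BASE}/models/{owner}/{name}/predictions"
    body = json.dumps({"input": model_input}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # Placeholder model and input; the token is read from the environment.
    token = os.environ.get("REPLICATE_API_TOKEN", "")
    req = build_prediction_request(
        "example-owner", "example-model", {"prompt": "a red panda"}, token
    )
    print(req.get_method(), req.full_url)
    # Passing `req` to urllib.request.urlopen would perform the actual call.
```

Keeping request construction separate from sending makes the payload easy to inspect before any compute is billed. In practice an official client library would likely wrap these steps in a single call.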
Replicate AI is a good choice if you want access to thousands of open-source AI models without the hassle of setting up infrastructure. It offers simple APIs for running, fine-tuning, and deploying models at scale, which suits developers and businesses that need flexibility and power.
The most intelligent Claude model and the first hybrid reasoning model on the market (claude-3-7-sonnet-20250219)
Faster, better FLUX Pro. Text-to-image model with excellent image quality, prompt adherence, and output diversity.
A 12 billion parameter rectified flow transformer capable of generating images from text descriptions
The fastest image generation model tailored for local development and personal use
A reasoning model trained with reinforcement learning, on par with OpenAI o1
State of the art video generation model. Veo 2 can faithfully follow simple and complex instructions, and convincingly simulates real-world physics as well as a wide range of visual styles.
The highest quality Ideogram v3 model. v3 creates images with stunning realism, creative designs, and consistent styles
Recraft V3 (code-named red_panda) is a text-to-image model with the ability to generate long texts, and images in a wide list of styles. As of today, it is SOTA in image generation, proven by the Text-to-Image Benchmark by Artificial Analysis
Accelerated inference for Wan 2.1 14B image to video, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
Accelerated inference for Wan 2.1 14B image to video with high resolution, a comprehensive and open suite of video foundation models that pushes the boundaries of video generation.
cpu
gpu-a100-large
gpu-a100-large-2x
gpu-a100-large-4x
gpu-a100-large-8x
gpu-h100
gpu-l40s
gpu-l40s-2x
gpu-l40s-4x
gpu-l40s-8x
gpu-t4
gpu-h100-2x
gpu-h100-4x
gpu-h100-8x
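The hardware identifiers above appear to follow a regular pattern: a `cpu` or `gpu` prefix, an optional accelerator family, and an optional `-Nx` suffix. The sketch below parses them under that assumption. Reading `-Nx` as "N accelerators" is a guess from the names alone, and `Hardware` and `parse_sku` are hypothetical helpers, not part of any Replicate tooling.

```python
import re
from typing import NamedTuple


class Hardware(NamedTuple):
    kind: str       # "cpu" or "gpu"
    family: str     # e.g. "a100-large" or "h100"; empty for "cpu"
    gpu_count: int  # assumed meaning of the "-Nx" suffix; 0 for "cpu"


# prefix, optional family, optional "-Nx" multiplier
_SKU = re.compile(r"^(cpu|gpu)(?:-(.+?))?(?:-(\d+)x)?$")


def parse_sku(sku):
    """Split an identifier such as "gpu-a100-large-2x" into its parts."""
    match = _SKU.match(sku)
    if match is None:
        raise ValueError(f"unrecognized hardware identifier: {sku!r}")
    kind, family, count = match.groups()
    if kind == "cpu":
        return Hardware("cpu", "", 0)
    # No suffix is treated as a single accelerator.
    return Hardware(kind, family or "", int(count) if count else 1)


if __name__ == "__main__":
    for sku in ["cpu", "gpu-t4", "gpu-a100-large-2x", "gpu-h100-8x"]:
        print(sku, parse_sku(sku))
```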