Find models using semantic search
Browse available models
Get details about a specific model
Browse model collections
Get details about a specific collection
Run a model with your inputs
Check a prediction's status
Stop a running prediction
See your recent predictions
Open an image in your browser
Clean up cached images
Check cache usage
The Replicate API MCP provides a seamless integration with Replicate's model hosting platform, allowing you to search, run, and manage AI models directly from your AI assistant. Access thousands of open-source models including text-to-image generators, language models, and specialized AI tools through a simple interface. With this MCP, you can search for models using semantic search, create and manage predictions, and view generated images directly in your browser. The server handles all the complexity of the Replicate API, providing a streamlined experience for running state-of-the-art AI models without leaving your assistant.
You can install the Replicate API MCP in several ways:
Install the server globally using npm:
npm install -g mcp-replicate
Alternatively, you can run it directly with npx:
npx mcp-replicate
If you prefer to install from source:
git clone https://github.com/deepfates/mcp-replicate
cd mcp-replicate
npm install
npm run build
npm start
The server requires a Replicate API token to function. You can obtain one by:
your_token_here
with your actual Replicate API token:{
"mcpServers": {
"replicate": {
"command": "mcp-replicate",
"env": {
"REPLICATE_API_TOKEN": "your_token_here"
}
}
}
}
If you're using another MCP client, you can set the token as an environment variable:
export REPLICATE_API_TOKEN=your_token_here
Once configured, you'll see a 🔨 hammer icon in the bottom right corner of new chat windows in Claude Desktop, indicating that the tools are available.
You can ask your assistant to:
If the tools aren't showing up:
If you can see the tools but they're not functioning: