Replicate allows users to run AI models via a cloud API without managing infrastructure.
Tool to get authenticated account information. Use when you need to retrieve details about the account associated with the API token.
Tool to cancel a prediction that is still running. Use when you need to stop an in-progress prediction to free up resources or halt execution.
Tool to get a specific collection of models by its slug. Use when you need detailed information about a collection and its models.
Tool to list all collections of models. Use when you need to retrieve available model collections. Collections are curated groupings of related models.
Tool to create a file by uploading content. Use when you need to upload and store a file for later reference.
Tool to create a prediction for a Replicate Deployment. IMPORTANT: This action ONLY works with Replicate Deployments (persistent instances you create and manage), NOT public models. Deployments are created via REPLICATE_DEPLOYMENTS_CREATE. To run public models (e.g., 'meta/llama-2-70b-chat', 'stability-ai/sdxl'), use REPLICATE_MODELS_PREDICTIONS_CREATE instead. Use 'wait_for' to wait until the prediction completes.