Replicate allows users to run AI models via a cloud API without managing infrastructure.
Tool to get authenticated account information. Use when you need to retrieve details about the account associated with the API token.
Tool to get a specific collection of models by its slug. Use when you need detailed information about a collection and its models.
Tool to list all collections of models. Use when you need to retrieve available model collections. Collections are curated groupings of related models.
Tool to create a file by uploading content. Use when you need to upload and store a file for later reference.
Tool to create a prediction for a given deployment. Use when you need to run model inference with specified inputs. Use 'wait_for' to wait until the prediction completes.
Tool to create a new deployment with specified model, version, hardware, and scaling parameters. Use when you need to deploy a model for production use with auto-scaling.